Fusion of multi-scale attention for aerial images small-target detection model based on PARE-YOLO

Abstract In view of the complex environments and varying object scales in drone-captured imagery, a novel PARE-YOLO algorithm based on YOLOv8 for small object detection is proposed. This model enhances feature extraction and fusion across multiple scales through a restructured neck network. Addition...

Full description

Saved in:
Bibliographic Details
Main Authors: Huiying Zhang, Pan Xiao, Feifan Yao, Qinghua Zhang, Yifei Gong
Format: Article
Language:English
Published: Nature Portfolio 2025-02-01
Series:Scientific Reports
Subjects:
Online Access:https://doi.org/10.1038/s41598-025-88857-w
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1823862306512044032
author Huiying Zhang
Pan Xiao
Feifan Yao
Qinghua Zhang
Yifei Gong
author_facet Huiying Zhang
Pan Xiao
Feifan Yao
Qinghua Zhang
Yifei Gong
author_sort Huiying Zhang
collection DOAJ
description Abstract In view of the complex environments and varying object scales in drone-captured imagery, a novel PARE-YOLO algorithm based on YOLOv8 for small object detection is proposed. This model enhances feature extraction and fusion across multiple scales through a restructured neck network. Additionally, it incorporates a lightweight detection head that is optimized for small objects, thereby significantly improving detection performance in cluttered and intricate backgrounds. To further enhance the extraction of small object features, the conventional C2f is replaced with a novel architecture. Moreover, the EMA-GIoU loss function is proposed to mitigate class imbalance and enhance robustness, particularly in scenarios characterized by skewed class distributions. Evaluation on the VisDrone2019 dataset indicates that PARE-YOLO achieves a 5.9% improvement in mean Average Precision (mAP) at a threshold of 0.5, compared to the original YOLOv8 model. In addition, the PARE-YOLO model exhibits significant robustness, achieving a mean Average Precision (mAP) at a threshold of 0.5 values on the HIT-UAV dataset that are 0.8%, 0.5%, and 1.2% higher than those of YOLOv8, YOLOv10, and RT-DETR. These results underscore the effectiveness of PARE-YOLO in addressing the challenges inherent in aerial scenarios. The code will be available online (https://github.com/Sunnyxiao69/PARE-YOLO).
format Article
id doaj-art-6b0a0545b4664891bb7630097bd30a12
institution Kabale University
issn 2045-2322
language English
publishDate 2025-02-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj-art-6b0a0545b4664891bb7630097bd30a122025-02-09T12:32:35ZengNature PortfolioScientific Reports2045-23222025-02-0115111710.1038/s41598-025-88857-wFusion of multi-scale attention for aerial images small-target detection model based on PARE-YOLOHuiying Zhang0Pan Xiao1Feifan Yao2Qinghua Zhang3Yifei Gong4College of Information and Control Engineering, Jilin Institute of Chemical Technology, JilinCollege of Information and Control Engineering, Jilin Institute of Chemical Technology, JilinCollege of Information and Control Engineering, Jilin Institute of Chemical Technology, JilinCollege of Information and Control Engineering, Jilin Institute of Chemical Technology, JilinCollege of Information and Control Engineering, Jilin Institute of Chemical Technology, JilinAbstract In view of the complex environments and varying object scales in drone-captured imagery, a novel PARE-YOLO algorithm based on YOLOv8 for small object detection is proposed. This model enhances feature extraction and fusion across multiple scales through a restructured neck network. Additionally, it incorporates a lightweight detection head that is optimized for small objects, thereby significantly improving detection performance in cluttered and intricate backgrounds. To further enhance the extraction of small object features, the conventional C2f is replaced with a novel architecture. Moreover, the EMA-GIoU loss function is proposed to mitigate class imbalance and enhance robustness, particularly in scenarios characterized by skewed class distributions. Evaluation on the VisDrone2019 dataset indicates that PARE-YOLO achieves a 5.9% improvement in mean Average Precision (mAP) at a threshold of 0.5, compared to the original YOLOv8 model. In addition, the PARE-YOLO model exhibits significant robustness, achieving a mean Average Precision (mAP) at a threshold of 0.5 values on the HIT-UAV dataset that are 0.8%, 0.5%, and 1.2% higher than those of YOLOv8, YOLOv10, and RT-DETR. These results underscore the effectiveness of PARE-YOLO in addressing the challenges inherent in aerial scenarios. The code will be available online (https://github.com/Sunnyxiao69/PARE-YOLO).https://doi.org/10.1038/s41598-025-88857-wLightweight detection headAerial small object detectionYOLOEMA-GIoU
spellingShingle Huiying Zhang
Pan Xiao
Feifan Yao
Qinghua Zhang
Yifei Gong
Fusion of multi-scale attention for aerial images small-target detection model based on PARE-YOLO
Scientific Reports
Lightweight detection head
Aerial small object detection
YOLO
EMA-GIoU
title Fusion of multi-scale attention for aerial images small-target detection model based on PARE-YOLO
title_full Fusion of multi-scale attention for aerial images small-target detection model based on PARE-YOLO
title_fullStr Fusion of multi-scale attention for aerial images small-target detection model based on PARE-YOLO
title_full_unstemmed Fusion of multi-scale attention for aerial images small-target detection model based on PARE-YOLO
title_short Fusion of multi-scale attention for aerial images small-target detection model based on PARE-YOLO
title_sort fusion of multi scale attention for aerial images small target detection model based on pare yolo
topic Lightweight detection head
Aerial small object detection
YOLO
EMA-GIoU
url https://doi.org/10.1038/s41598-025-88857-w
work_keys_str_mv AT huiyingzhang fusionofmultiscaleattentionforaerialimagessmalltargetdetectionmodelbasedonpareyolo
AT panxiao fusionofmultiscaleattentionforaerialimagessmalltargetdetectionmodelbasedonpareyolo
AT feifanyao fusionofmultiscaleattentionforaerialimagessmalltargetdetectionmodelbasedonpareyolo
AT qinghuazhang fusionofmultiscaleattentionforaerialimagessmalltargetdetectionmodelbasedonpareyolo
AT yifeigong fusionofmultiscaleattentionforaerialimagessmalltargetdetectionmodelbasedonpareyolo