VRU-YOLO: A Small Object Detection Algorithm for Vulnerable Road Users in Complex Scenes
Accurate detection of vulnerable road users (VRUs) is critical for enhancing traffic safety and advancing autonomous driving systems. However, due to their small size and unpredictable movements, existing detection methods struggle to provide stable and accurate results under real-time conditions. T...
Saved in:
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2025-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/10854459/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832575561356017664 |
---|---|
author | Yunxiang Liu Yuqing Shi |
author_facet | Yunxiang Liu Yuqing Shi |
author_sort | Yunxiang Liu |
collection | DOAJ |
description | Accurate detection of vulnerable road users (VRUs) is critical for enhancing traffic safety and advancing autonomous driving systems. However, due to their small size and unpredictable movements, existing detection methods struggle to provide stable and accurate results under real-time conditions. To overcome these challenges, this paper proposes an improved VRU detection algorithm based on YOLOv8, named VRU-YOLO. First, we redesign the neck structure and construct a Detail Enhancement Feature Pyramid Network (DEFPN) to enhance the extraction and fusion capabilities of small target features. Second, the YOLOv8 network’s Spatial Pyramid Pooling Fast (SPPF) module is replaced with a novel Feature Pyramid Convolution Fast (FPCF) module based on dilated convolution, effectively mitigating feature loss in small target processing. Additionally, a lightweight Optimized Shared Detection Head (OSDH-Head) is introduced, reducing computational complexity while improving detection efficiency. Finally, to alleviate the deficiencies of traditional loss functions in shape matching and computational efficiency, we propose the Wise-Powerful Intersection over Union (WPIoU) loss function, which further optimizes the regression of target bounding boxes. Experimental results on a custom-built multi-source VRU dataset show that the proposed model enhances precision, recall, mAP50, and mAP50:95 by 1.3%, 3.4%, 3.3%, and 1.8%, respectively, in comparison to the baseline model. Moreover, in a generalization test conducted on the remote sensing small target dataset VisDrone2019, the VRU-YOLO model achieved an mAP50 of 31%. This study demonstrates that the improved model offers more efficient performance in small object detection scenarios, making it well-suited for VRU detection in complex road environments. |
format | Article |
id | doaj-art-00eefacd5915419a89f8ad9f57ab1119 |
institution | Kabale University |
issn | 2169-3536 |
language | English |
publishDate | 2025-01-01 |
publisher | IEEE |
record_format | Article |
series | IEEE Access |
spelling | doaj-art-00eefacd5915419a89f8ad9f57ab11192025-01-31T23:04:27ZengIEEEIEEE Access2169-35362025-01-0113199962001510.1109/ACCESS.2025.353432110854459VRU-YOLO: A Small Object Detection Algorithm for Vulnerable Road Users in Complex ScenesYunxiang Liu0Yuqing Shi1https://orcid.org/0009-0003-3865-9388School of Computer Science and Information Engineering, Shanghai Institute of Technology, Shanghai, ChinaSchool of Computer Science and Information Engineering, Shanghai Institute of Technology, Shanghai, ChinaAccurate detection of vulnerable road users (VRUs) is critical for enhancing traffic safety and advancing autonomous driving systems. However, due to their small size and unpredictable movements, existing detection methods struggle to provide stable and accurate results under real-time conditions. To overcome these challenges, this paper proposes an improved VRU detection algorithm based on YOLOv8, named VRU-YOLO. First, we redesign the neck structure and construct a Detail Enhancement Feature Pyramid Network (DEFPN) to enhance the extraction and fusion capabilities of small target features. Second, the YOLOv8 network’s Spatial Pyramid Pooling Fast (SPPF) module is replaced with a novel Feature Pyramid Convolution Fast (FPCF) module based on dilated convolution, effectively mitigating feature loss in small target processing. Additionally, a lightweight Optimized Shared Detection Head (OSDH-Head) is introduced, reducing computational complexity while improving detection efficiency. Finally, to alleviate the deficiencies of traditional loss functions in shape matching and computational efficiency, we propose the Wise-Powerful Intersection over Union (WPIoU) loss function, which further optimizes the regression of target bounding boxes. Experimental results on a custom-built multi-source VRU dataset show that the proposed model enhances precision, recall, mAP50, and mAP50:95 by 1.3%, 3.4%, 3.3%, and 1.8%, respectively, in comparison to the baseline model. Moreover, in a generalization test conducted on the remote sensing small target dataset VisDrone2019, the VRU-YOLO model achieved an mAP50 of 31%. This study demonstrates that the improved model offers more efficient performance in small object detection scenarios, making it well-suited for VRU detection in complex road environments.https://ieeexplore.ieee.org/document/10854459/Vulnerable road userssmall object detectionfeature fusionshared convolutionYOLOv8 |
spellingShingle | Yunxiang Liu Yuqing Shi VRU-YOLO: A Small Object Detection Algorithm for Vulnerable Road Users in Complex Scenes IEEE Access Vulnerable road users small object detection feature fusion shared convolution YOLOv8 |
title | VRU-YOLO: A Small Object Detection Algorithm for Vulnerable Road Users in Complex Scenes |
title_full | VRU-YOLO: A Small Object Detection Algorithm for Vulnerable Road Users in Complex Scenes |
title_fullStr | VRU-YOLO: A Small Object Detection Algorithm for Vulnerable Road Users in Complex Scenes |
title_full_unstemmed | VRU-YOLO: A Small Object Detection Algorithm for Vulnerable Road Users in Complex Scenes |
title_short | VRU-YOLO: A Small Object Detection Algorithm for Vulnerable Road Users in Complex Scenes |
title_sort | vru yolo a small object detection algorithm for vulnerable road users in complex scenes |
topic | Vulnerable road users small object detection feature fusion shared convolution YOLOv8 |
url | https://ieeexplore.ieee.org/document/10854459/ |
work_keys_str_mv | AT yunxiangliu vruyoloasmallobjectdetectionalgorithmforvulnerableroadusersincomplexscenes AT yuqingshi vruyoloasmallobjectdetectionalgorithmforvulnerableroadusersincomplexscenes |