Application of semi-supervised object detection technology incorporating attention mechanism in rail transit

The operation and maintenance of rail transit systems require high-accuracy object detection and tracking technology to ensure the safety of passengers and the normal operation of traffic systems. In recent years, with the emergence of Vision in Transformer (VIT), large models incorporating attentio...

Full description

Saved in:
Bibliographic Details
Main Authors: WANG Junyu, CHEN Zhe, SUN Junyong
Format: Article
Language:zho
Published: Editorial Department of Electric Drive for Locomotives 2025-01-01
Series:机车电传动
Subjects:
Online Access:http://edl.csrzic.com/thesisDetails#10.13890/j.issn.1000-128X.2025.01.104
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The operation and maintenance of rail transit systems require high-accuracy object detection and tracking technology to ensure the safety of passengers and the normal operation of traffic systems. In recent years, with the emergence of Vision in Transformer (VIT), large models incorporating attention mechanisms have garnered widespread application in the field of object detection. However, attention mechanisms require substantial demands of data in the process of model training to achieve high precision and robustness. Moreover, the process of image data processing is often accompanied by significant consumption of manpower and material resources. This paper proposes a semi-supervised object detection strategy that incorporates an attention mechanism, to reduce the cost of data processing and improve the robustness of the models. Detectors such as Grounding DINO and YOLO-World are employed as the backbone of the algorithm, and attention mechanisms such as CBAM, CoTAttention, and SEAttention are applied at the algorithm’s HEAD layer. The study results show that, with only 10% of the data processed, a mean Average Precision (mAP) of 0.70±0.04 is achieved on the dataset,corresponding to a mAP gain of 5.14% compared to traditional semi-supervised object detection techniques. The study findings provide a theoretical reference for future research on object detection large models in the field of rail transit.
ISSN:1000-128X