LGC-YOLO: Local-Global Feature Extraction and Coordination Network With Contextual Interaction for Remote Sensing Object Detection
Object detection in high-resolution remote sensing image (HRRSI) faces great challenges of large-scale variations in object size, densely distributed small objects, and complex background interferences. To address these challenges, we propose an innovative single-stage local-global feature extractio...
Saved in:
| Main Authors: | , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
IEEE
2025-01-01
|
| Series: | IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing |
| Subjects: | |
| Online Access: | https://ieeexplore.ieee.org/document/11018430/ |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1849424671102992384 |
|---|---|
| author | Qinggang Wu Yang Li Junru Yin Xiaotian You |
| author_facet | Qinggang Wu Yang Li Junru Yin Xiaotian You |
| author_sort | Qinggang Wu |
| collection | DOAJ |
| description | Object detection in high-resolution remote sensing image (HRRSI) faces great challenges of large-scale variations in object size, densely distributed small objects, and complex background interferences. To address these challenges, we propose an innovative single-stage local-global feature extraction and coordination network (LGC-YOLO) to improve the detection accuracy of objects in HRRSIs. LGC-YOLO mainly comprises three modules of local-global spatial feature extraction (LGSFE), gradient optimized spatial information interaction (GOSII), and edge-semantic feature coordination fusion (ESFCF), which synergistically improves the feature extraction and object detection capabilities of LGC-YOLO. First, LGSFE captures local and global features of dense objects through receptive-field attention convolution and global pooling in a multibranch structure, which effectively alleviates the misalignment between the extracted features of objects and their intrinsic characteristics, thereby providing more accurate and abundant features for subsequent object detection. Second, GOSII is designed to dynamically adjust the weights of each feature channel through combining SRU blocks and the SimAM attention mechanism, which are further optimized and embedded into C2f to enhance the representation ability of contextual features. GOSII captures crucial features from complex backgrounds and improves information transmission. Finally, ESFCF integrates the edge and semantic information within shallow feature maps to address the issue of inaccurate localization for small objects, and further improves object detection accuracy by compensating for the loss of edge details in feature extraction. Extensive experiments on three commonly used remote sensing datasets of NWPU VHR-10, VisDrone 2019, and DOTA demonstrate the superiority of our method in object classification and localization compared to other state-of-the-art methods. |
| format | Article |
| id | doaj-art-3f3c824db7144810bb6b2853c05257dc |
| institution | Kabale University |
| issn | 1939-1404 2151-1535 |
| language | English |
| publishDate | 2025-01-01 |
| publisher | IEEE |
| record_format | Article |
| series | IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing |
| spelling | doaj-art-3f3c824db7144810bb6b2853c05257dc2025-08-20T03:30:03ZengIEEEIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing1939-14042151-15352025-01-0118153761539310.1109/JSTARS.2025.357523911018430LGC-YOLO: Local-Global Feature Extraction and Coordination Network With Contextual Interaction for Remote Sensing Object DetectionQinggang Wu0https://orcid.org/0009-0008-6789-8472Yang Li1Junru Yin2Xiaotian You3College of Computer Science and Technology, Zhengzhou University of Light Industry, Zhengzhou, ChinaCollege of Computer Science and Technology, Zhengzhou University of Light Industry, Zhengzhou, ChinaCollege of Computer Science and Technology, Zhengzhou University of Light Industry, Zhengzhou, ChinaCollege of Computer Science and Technology, Zhengzhou University of Light Industry, Zhengzhou, ChinaObject detection in high-resolution remote sensing image (HRRSI) faces great challenges of large-scale variations in object size, densely distributed small objects, and complex background interferences. To address these challenges, we propose an innovative single-stage local-global feature extraction and coordination network (LGC-YOLO) to improve the detection accuracy of objects in HRRSIs. LGC-YOLO mainly comprises three modules of local-global spatial feature extraction (LGSFE), gradient optimized spatial information interaction (GOSII), and edge-semantic feature coordination fusion (ESFCF), which synergistically improves the feature extraction and object detection capabilities of LGC-YOLO. First, LGSFE captures local and global features of dense objects through receptive-field attention convolution and global pooling in a multibranch structure, which effectively alleviates the misalignment between the extracted features of objects and their intrinsic characteristics, thereby providing more accurate and abundant features for subsequent object detection. Second, GOSII is designed to dynamically adjust the weights of each feature channel through combining SRU blocks and the SimAM attention mechanism, which are further optimized and embedded into C2f to enhance the representation ability of contextual features. GOSII captures crucial features from complex backgrounds and improves information transmission. Finally, ESFCF integrates the edge and semantic information within shallow feature maps to address the issue of inaccurate localization for small objects, and further improves object detection accuracy by compensating for the loss of edge details in feature extraction. Extensive experiments on three commonly used remote sensing datasets of NWPU VHR-10, VisDrone 2019, and DOTA demonstrate the superiority of our method in object classification and localization compared to other state-of-the-art methods.https://ieeexplore.ieee.org/document/11018430/Attention mechanismedge featuresremote sensing object detection (RSOD)small objectYOLO |
| spellingShingle | Qinggang Wu Yang Li Junru Yin Xiaotian You LGC-YOLO: Local-Global Feature Extraction and Coordination Network With Contextual Interaction for Remote Sensing Object Detection IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing Attention mechanism edge features remote sensing object detection (RSOD) small object YOLO |
| title | LGC-YOLO: Local-Global Feature Extraction and Coordination Network With Contextual Interaction for Remote Sensing Object Detection |
| title_full | LGC-YOLO: Local-Global Feature Extraction and Coordination Network With Contextual Interaction for Remote Sensing Object Detection |
| title_fullStr | LGC-YOLO: Local-Global Feature Extraction and Coordination Network With Contextual Interaction for Remote Sensing Object Detection |
| title_full_unstemmed | LGC-YOLO: Local-Global Feature Extraction and Coordination Network With Contextual Interaction for Remote Sensing Object Detection |
| title_short | LGC-YOLO: Local-Global Feature Extraction and Coordination Network With Contextual Interaction for Remote Sensing Object Detection |
| title_sort | lgc yolo local global feature extraction and coordination network with contextual interaction for remote sensing object detection |
| topic | Attention mechanism edge features remote sensing object detection (RSOD) small object YOLO |
| url | https://ieeexplore.ieee.org/document/11018430/ |
| work_keys_str_mv | AT qinggangwu lgcyololocalglobalfeatureextractionandcoordinationnetworkwithcontextualinteractionforremotesensingobjectdetection AT yangli lgcyololocalglobalfeatureextractionandcoordinationnetworkwithcontextualinteractionforremotesensingobjectdetection AT junruyin lgcyololocalglobalfeatureextractionandcoordinationnetworkwithcontextualinteractionforremotesensingobjectdetection AT xiaotianyou lgcyololocalglobalfeatureextractionandcoordinationnetworkwithcontextualinteractionforremotesensingobjectdetection |