QEDetr: DETR with Query Enhancement for Fine-Grained Object Detection

Fine-grained object detection aims to accurately localize the object bounding box while identifying the specific model of the object, which is more challenging than conventional remote sensing object detection. Transformer-based object detector (DETR) can capture remote inter-feature dependencies by...

Full description

Saved in:
Bibliographic Details
Main Authors: Chenguang Dong, Shan Jiang, Haijiang Sun, Jiang Li, Zhenglei Yu, Jiasong Wang, Jiacheng Wang
Format: Article
Language:English
Published: MDPI AG 2025-03-01
Series:Remote Sensing
Subjects:
Online Access:https://www.mdpi.com/2072-4292/17/5/893
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850052786600804352
author Chenguang Dong
Shan Jiang
Haijiang Sun
Jiang Li
Zhenglei Yu
Jiasong Wang
Jiacheng Wang
author_facet Chenguang Dong
Shan Jiang
Haijiang Sun
Jiang Li
Zhenglei Yu
Jiasong Wang
Jiacheng Wang
author_sort Chenguang Dong
collection DOAJ
description Fine-grained object detection aims to accurately localize the object bounding box while identifying the specific model of the object, which is more challenging than conventional remote sensing object detection. Transformer-based object detector (DETR) can capture remote inter-feature dependencies by using attention, which is suitable for fine-grained object detection tasks. However, most existing DETR-like object detectors are not specifically optimized for remote sensing detection tasks. Therefore, we propose an oriented fine-grained object detection method based on transformers. First, we combine denoising training and angle coding to propose a baseline DETR-like object detector for oriented object detection. Next, we propose a new attention mechanism for extracting finer-grained features by constraining the angle of sampling points during the attentional process, ensuring that the sampling points are more evenly distributed across the object features. Then, we propose a multiscale fusion method based on bilinear pooling to obtain the enhanced query and initialize a more accurate object bounding box. Finally, we combine the localization accuracy of each query with its classification accuracy and propose a new classification loss to further enhance the high-quality queries. Evaluation results on the FAIR1M dataset show that our method achieves an average accuracy of 48.5856 mAP and the highest accuracy of 49.7352 mAP in object detection, outperforming other methods.
format Article
id doaj-art-8541cd7b37be49d5aa0621d054f3c16a
institution DOAJ
issn 2072-4292
language English
publishDate 2025-03-01
publisher MDPI AG
record_format Article
series Remote Sensing
spelling doaj-art-8541cd7b37be49d5aa0621d054f3c16a2025-08-20T02:52:42ZengMDPI AGRemote Sensing2072-42922025-03-0117589310.3390/rs17050893QEDetr: DETR with Query Enhancement for Fine-Grained Object DetectionChenguang Dong0Shan Jiang1Haijiang Sun2Jiang Li3Zhenglei Yu4Jiasong Wang5Jiacheng Wang6Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun 130033, ChinaChangchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun 130033, ChinaChangchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun 130033, ChinaChangchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun 130033, ChinaCollege of Biological and Agricultural Engineering, Jilin University, Changchun 130022, ChinaChangchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun 130033, ChinaChangchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun 130033, ChinaFine-grained object detection aims to accurately localize the object bounding box while identifying the specific model of the object, which is more challenging than conventional remote sensing object detection. Transformer-based object detector (DETR) can capture remote inter-feature dependencies by using attention, which is suitable for fine-grained object detection tasks. However, most existing DETR-like object detectors are not specifically optimized for remote sensing detection tasks. Therefore, we propose an oriented fine-grained object detection method based on transformers. First, we combine denoising training and angle coding to propose a baseline DETR-like object detector for oriented object detection. Next, we propose a new attention mechanism for extracting finer-grained features by constraining the angle of sampling points during the attentional process, ensuring that the sampling points are more evenly distributed across the object features. Then, we propose a multiscale fusion method based on bilinear pooling to obtain the enhanced query and initialize a more accurate object bounding box. Finally, we combine the localization accuracy of each query with its classification accuracy and propose a new classification loss to further enhance the high-quality queries. Evaluation results on the FAIR1M dataset show that our method achieves an average accuracy of 48.5856 mAP and the highest accuracy of 49.7352 mAP in object detection, outperforming other methods.https://www.mdpi.com/2072-4292/17/5/893remote sensingfine-grained object detectionDETR
spellingShingle Chenguang Dong
Shan Jiang
Haijiang Sun
Jiang Li
Zhenglei Yu
Jiasong Wang
Jiacheng Wang
QEDetr: DETR with Query Enhancement for Fine-Grained Object Detection
Remote Sensing
remote sensing
fine-grained object detection
DETR
title QEDetr: DETR with Query Enhancement for Fine-Grained Object Detection
title_full QEDetr: DETR with Query Enhancement for Fine-Grained Object Detection
title_fullStr QEDetr: DETR with Query Enhancement for Fine-Grained Object Detection
title_full_unstemmed QEDetr: DETR with Query Enhancement for Fine-Grained Object Detection
title_short QEDetr: DETR with Query Enhancement for Fine-Grained Object Detection
title_sort qedetr detr with query enhancement for fine grained object detection
topic remote sensing
fine-grained object detection
DETR
url https://www.mdpi.com/2072-4292/17/5/893
work_keys_str_mv AT chenguangdong qedetrdetrwithqueryenhancementforfinegrainedobjectdetection
AT shanjiang qedetrdetrwithqueryenhancementforfinegrainedobjectdetection
AT haijiangsun qedetrdetrwithqueryenhancementforfinegrainedobjectdetection
AT jiangli qedetrdetrwithqueryenhancementforfinegrainedobjectdetection
AT zhengleiyu qedetrdetrwithqueryenhancementforfinegrainedobjectdetection
AT jiasongwang qedetrdetrwithqueryenhancementforfinegrainedobjectdetection
AT jiachengwang qedetrdetrwithqueryenhancementforfinegrainedobjectdetection