A Novel Transformer-Based Object Detection Method With Geometric and Object Co-Occurrence Prior Knowledge for Remote Sensing Images

Artificial target detection is a key technology for Earth observation applications in remote sensing images, including environmental monitoring, urban planning, intelligence reconnaissance, and land mapping. Recently, transformer-based object detectors achieved great success in computer vision throu...

Full description

Saved in:
Bibliographic Details
Main Authors: Nan Mo, Ruixi Zhu
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10804220/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841557080662605824
author Nan Mo
Ruixi Zhu
author_facet Nan Mo
Ruixi Zhu
author_sort Nan Mo
collection DOAJ
description Artificial target detection is a key technology for Earth observation applications in remote sensing images, including environmental monitoring, urban planning, intelligence reconnaissance, and land mapping. Recently, transformer-based object detectors achieved great success in computer vision through the attention mechanism. However, these detectors lack prior information, which may limit their multiscale, multishape, and densely distributed target detection capabilities. To address these problems, we propose a novel transformer-based object detection method with geometric and object co-occurrence prior knowledge for remote sensing images, which makes improvements based on the deformable detection transformer (DETR). First, we introduce dynamic anchor object queries with multipatterns to detect objects with multiscale and dense distribution. Second, we propose a novel distance with geometrical invariance to measure the position deviation of multishape objects. Last, we design a graph convolutional reference module with co-occurrence prior knowledge to improve the inferential ability of the detector. Experimental results confirm that the proposed method outperforms most of the state-of-the-art methods with mean average precisions (mAP) of 70.2% in the DIOR, 91.0% in the HRRSD datasets, and 91.4% in the NWPU VHR-10 dataset, respectively. The mAP values are more than 5% higher compared with those obtained by deformable DETR in above three public datasets.
format Article
id doaj-art-435eb945850844b2818d9ebf716f5f93
institution Kabale University
issn 1939-1404
2151-1535
language English
publishDate 2025-01-01
publisher IEEE
record_format Article
series IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
spelling doaj-art-435eb945850844b2818d9ebf716f5f932025-01-07T00:00:30ZengIEEEIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing1939-14042151-15352025-01-01182383240010.1109/JSTARS.2024.351875310804220A Novel Transformer-Based Object Detection Method With Geometric and Object Co-Occurrence Prior Knowledge for Remote Sensing ImagesNan Mo0https://orcid.org/0000-0001-6918-4416Ruixi Zhu1https://orcid.org/0000-0002-5006-0840School of Geomatics Science and Technology, Nanjing Tech University, Nanjing, ChinaDepartment of Research, Nanjing Research Institute of Electronic Technology, Nanjing, ChinaArtificial target detection is a key technology for Earth observation applications in remote sensing images, including environmental monitoring, urban planning, intelligence reconnaissance, and land mapping. Recently, transformer-based object detectors achieved great success in computer vision through the attention mechanism. However, these detectors lack prior information, which may limit their multiscale, multishape, and densely distributed target detection capabilities. To address these problems, we propose a novel transformer-based object detection method with geometric and object co-occurrence prior knowledge for remote sensing images, which makes improvements based on the deformable detection transformer (DETR). First, we introduce dynamic anchor object queries with multipatterns to detect objects with multiscale and dense distribution. Second, we propose a novel distance with geometrical invariance to measure the position deviation of multishape objects. Last, we design a graph convolutional reference module with co-occurrence prior knowledge to improve the inferential ability of the detector. Experimental results confirm that the proposed method outperforms most of the state-of-the-art methods with mean average precisions (mAP) of 70.2% in the DIOR, 91.0% in the HRRSD datasets, and 91.4% in the NWPU VHR-10 dataset, respectively. The mAP values are more than 5% higher compared with those obtained by deformable DETR in above three public datasets.https://ieeexplore.ieee.org/document/10804220/Co-occurrence matrixdynamic anchor object querygeometrical invariance distance (GID)transformer-based detector
spellingShingle Nan Mo
Ruixi Zhu
A Novel Transformer-Based Object Detection Method With Geometric and Object Co-Occurrence Prior Knowledge for Remote Sensing Images
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Co-occurrence matrix
dynamic anchor object query
geometrical invariance distance (GID)
transformer-based detector
title A Novel Transformer-Based Object Detection Method With Geometric and Object Co-Occurrence Prior Knowledge for Remote Sensing Images
title_full A Novel Transformer-Based Object Detection Method With Geometric and Object Co-Occurrence Prior Knowledge for Remote Sensing Images
title_fullStr A Novel Transformer-Based Object Detection Method With Geometric and Object Co-Occurrence Prior Knowledge for Remote Sensing Images
title_full_unstemmed A Novel Transformer-Based Object Detection Method With Geometric and Object Co-Occurrence Prior Knowledge for Remote Sensing Images
title_short A Novel Transformer-Based Object Detection Method With Geometric and Object Co-Occurrence Prior Knowledge for Remote Sensing Images
title_sort novel transformer based object detection method with geometric and object co occurrence prior knowledge for remote sensing images
topic Co-occurrence matrix
dynamic anchor object query
geometrical invariance distance (GID)
transformer-based detector
url https://ieeexplore.ieee.org/document/10804220/
work_keys_str_mv AT nanmo anoveltransformerbasedobjectdetectionmethodwithgeometricandobjectcooccurrencepriorknowledgeforremotesensingimages
AT ruixizhu anoveltransformerbasedobjectdetectionmethodwithgeometricandobjectcooccurrencepriorknowledgeforremotesensingimages
AT nanmo noveltransformerbasedobjectdetectionmethodwithgeometricandobjectcooccurrencepriorknowledgeforremotesensingimages
AT ruixizhu noveltransformerbasedobjectdetectionmethodwithgeometricandobjectcooccurrencepriorknowledgeforremotesensingimages