A Rotation Target Detection Network Based on Multi-Kernel Interaction and Hierarchical Expansion
Remote sensing targets typically exhibit characteristics of gradual scale changes and diverse orientations. Most existing remote sensing detectors adapt to these differences by adding multi-level structures for feature fusion. However, this approach leads to incomplete coverage of the overall target...
Saved in:
| Main Authors: | , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
MDPI AG
2025-08-01
|
| Series: | Applied Sciences |
| Subjects: | |
| Online Access: | https://www.mdpi.com/2076-3417/15/15/8727 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Summary: | Remote sensing targets typically exhibit characteristics of gradual scale changes and diverse orientations. Most existing remote sensing detectors adapt to these differences by adding multi-level structures for feature fusion. However, this approach leads to incomplete coverage of the overall target by the extracted local features, resulting in the loss of critical directional information and an increase in computational complexity which affect the detector’s performance. To address this issue, this paper proposes a Rotation Target Detection Network based on Multi-kernel Interaction and Hierarchical Expansion (MIHE-Net) as a systematic solution. Specifically, we first refine scale modeling through the Multi-kernel Context Interaction (MCI) module and Hierarchical Expansion Attention (HEA) mechanism, achieving sufficient extraction of local features and global information for targets of different scales. Additionally, the Midpoint Offset Loss Function is employed to mitigate the impact of gradual scale changes on target direction perception, enabling precise regression for targets across various scales. We conducted comparative experiments on three commonly used remote sensing target datasets (DOTA, HRSC2016, and UCAS-AOD), with mean average precision (mAP) as the core evaluation metric. The mAP values of the method in this paper on the three datasets reached 81.72%, 92.43%, and 91.86% respectively, which were 0.65%, 1.93%, and 1.87% higher than those of the optimal method, significantly outperforming existing one-stage and two-stage detectors. Through multi-scale feature interaction and direction-aware optimization, MIHE-Net effectively addresses the challenges posed by scale gradation and direction diversity in remote sensing target detection, providing an efficient and feasible solution for high-precision remote sensing target detection. |
|---|---|
| ISSN: | 2076-3417 |