A Rotation Target Detection Network Based on Multi-Kernel Interaction and Hierarchical Expansion

Remote sensing targets typically exhibit characteristics of gradual scale changes and diverse orientations. Most existing remote sensing detectors adapt to these differences by adding multi-level structures for feature fusion. However, this approach leads to incomplete coverage of the overall target...

Full description

Saved in:
Bibliographic Details
Main Authors: Qi Wang, Guanghu Xu, Donglin Jing
Format: Article
Language:English
Published: MDPI AG 2025-08-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/15/15/8727
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Remote sensing targets typically exhibit characteristics of gradual scale changes and diverse orientations. Most existing remote sensing detectors adapt to these differences by adding multi-level structures for feature fusion. However, this approach leads to incomplete coverage of the overall target by the extracted local features, resulting in the loss of critical directional information and an increase in computational complexity which affect the detector’s performance. To address this issue, this paper proposes a Rotation Target Detection Network based on Multi-kernel Interaction and Hierarchical Expansion (MIHE-Net) as a systematic solution. Specifically, we first refine scale modeling through the Multi-kernel Context Interaction (MCI) module and Hierarchical Expansion Attention (HEA) mechanism, achieving sufficient extraction of local features and global information for targets of different scales. Additionally, the Midpoint Offset Loss Function is employed to mitigate the impact of gradual scale changes on target direction perception, enabling precise regression for targets across various scales. We conducted comparative experiments on three commonly used remote sensing target datasets (DOTA, HRSC2016, and UCAS-AOD), with mean average precision (mAP) as the core evaluation metric. The mAP values of the method in this paper on the three datasets reached 81.72%, 92.43%, and 91.86% respectively, which were 0.65%, 1.93%, and 1.87% higher than those of the optimal method, significantly outperforming existing one-stage and two-stage detectors. Through multi-scale feature interaction and direction-aware optimization, MIHE-Net effectively addresses the challenges posed by scale gradation and direction diversity in remote sensing target detection, providing an efficient and feasible solution for high-precision remote sensing target detection.
ISSN:2076-3417