YOLOv9-GDV: A Power Pylon Detection Model for Remote Sensing Images

Under the background of continuous breakthroughs in the spatial resolution of satellite remote sensing technology, high-resolution remote sensing images have become a frontier data source for intelligent inspection research of power infrastructure. To address existing issues in remote sensing image...

Full description

Saved in:
Bibliographic Details
Main Authors: Ke Zhang, Ningxuan Zhang, Chaojun Shi, Qiaochu Lu, Xian Zheng, Yujie Cao, Xiaoyun Zhang, Jiyuan Yang
Format: Article
Language:English
Published: MDPI AG 2025-06-01
Series:Remote Sensing
Subjects:
Online Access:https://www.mdpi.com/2072-4292/17/13/2229
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Under the background of continuous breakthroughs in the spatial resolution of satellite remote sensing technology, high-resolution remote sensing images have become a frontier data source for intelligent inspection research of power infrastructure. To address existing issues in remote sensing image application algorithms such as difficulties in power target feature extraction, low detection accuracy, and false positives/missed detections, this paper proposes the YOLOv9-GDV power tower detection algorithm specifically for power tower detection in high-resolution satellite remote sensing images. Firstly, under high-resolution imaging conditions where transmission tower features are prominent, a Global Pyramid Attention (GPA) mechanism is proposed. This mechanism enhances global representation capabilities, enabling the model to better understand object–background relationships and effectively integrate multi-scale spatial information, thereby improving detection accuracy and robustness. Secondly, a Diverse Branch Block (DBB) is embedded in the feature extraction–fusion module, which enriches the feature space by enhancing the representation capability of single-convolution operations, thereby improving model feature extraction performance without increasing inference time costs. Finally, the Variable Minimum Point Distance Intersection over Union (VMPDIoU) loss is proposed to optimize the model’s loss function. This method employs variable input parameters to directly calculate key point distances between predicted and ground-truth boxes, more accurately reflecting positional differences between detection results and reference targets, thus effectively improving the model’s mean Average Precision (mAP). On the Satellite Remote Sensing Power Tower Dataset (SRSPTD), the YOLOv9-GDV algorithm achieves an mAP of 80.2%, representing a 4.7% improvement over the baseline algorithm. On the multi-scene high-resolution power transmission tower dataset (GFTD), the algorithm obtains an mAP of 94.6%, showing a 2.3% improvement over the original model. The significant mAP improvements on both datasets validate the effectiveness and feasibility of the proposed method.
ISSN:2072-4292