Method for building segmentation and extraction from high-resolution remote sensing images based on improved YOLOv5ds.

To address challenges in remote sensing images, such as the abundance of buildings, difficulty in contour extraction, and slow update speeds, a high-resolution remote sensing image building segmentation and extraction method based on the YOLOv5ds network structure was proposed using Gaofen-2 images....

Full description

Saved in:
Bibliographic Details
Main Authors: Fangzhe Chang, Tianyue Ma, Dantong Wang, Shoujie Zhu, Dengping Li, Shuntian Feng, Xiaoyong Fan
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2025-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0317106
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849328796039118848
author Fangzhe Chang
Tianyue Ma
Dantong Wang
Shoujie Zhu
Dengping Li
Shuntian Feng
Xiaoyong Fan
author_facet Fangzhe Chang
Tianyue Ma
Dantong Wang
Shoujie Zhu
Dengping Li
Shuntian Feng
Xiaoyong Fan
author_sort Fangzhe Chang
collection DOAJ
description To address challenges in remote sensing images, such as the abundance of buildings, difficulty in contour extraction, and slow update speeds, a high-resolution remote sensing image building segmentation and extraction method based on the YOLOv5ds network structure was proposed using Gaofen-2 images. This method, named YOLOv5ds-RC, comprises three primary components: target detection, semantic segmentation, and edge optimization. In the semantic segmentation module, an upsampling and multiple convolutional layers branch out from the second feature fusion layer of the Feature Pyramid Networks (FPN), producing a category mapping image that matches the original image size. For edge optimization, a Raster compression module is incorporated at the end of the segmentation network to refine the segmentation contours. This approach enables effective segmentation of Gaofen-2 images, achieving detailed results at the individual building scale across urban areas and facilitating rapid contour optimization and extraction. Experimental results indicate that YOLOv5ds-RC achieves an accuracy of 0.8849, a recall of 0.63904, an average precision (AP) at 0.5 of 0.75863, and a mean average precision (mAP) from 0.5 to 0.95 of 0.47388. These metrics significantly surpass those of the original YOLOv5ds, which recorded values of 0.81483 for accuracy, 0.51332 for recall, 0.63552 for AP at 0.5, and 0.34922 for mAP. The algorithm effectively corrects target displacement deviations in non-orthogonal images and achieves more objective and accurate contour extraction, meeting the requirements for rapid extraction. Due to these features, YOLOv5ds-RC can further enhance fully automated rapid extraction and historical change analysis in land use change monitoring.
format Article
id doaj-art-4c3d492befb24960a588f36cb223488c
institution Kabale University
issn 1932-6203
language English
publishDate 2025-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj-art-4c3d492befb24960a588f36cb223488c2025-08-20T03:47:28ZengPublic Library of Science (PLoS)PLoS ONE1932-62032025-01-01203e031710610.1371/journal.pone.0317106Method for building segmentation and extraction from high-resolution remote sensing images based on improved YOLOv5ds.Fangzhe ChangTianyue MaDantong WangShoujie ZhuDengping LiShuntian FengXiaoyong FanTo address challenges in remote sensing images, such as the abundance of buildings, difficulty in contour extraction, and slow update speeds, a high-resolution remote sensing image building segmentation and extraction method based on the YOLOv5ds network structure was proposed using Gaofen-2 images. This method, named YOLOv5ds-RC, comprises three primary components: target detection, semantic segmentation, and edge optimization. In the semantic segmentation module, an upsampling and multiple convolutional layers branch out from the second feature fusion layer of the Feature Pyramid Networks (FPN), producing a category mapping image that matches the original image size. For edge optimization, a Raster compression module is incorporated at the end of the segmentation network to refine the segmentation contours. This approach enables effective segmentation of Gaofen-2 images, achieving detailed results at the individual building scale across urban areas and facilitating rapid contour optimization and extraction. Experimental results indicate that YOLOv5ds-RC achieves an accuracy of 0.8849, a recall of 0.63904, an average precision (AP) at 0.5 of 0.75863, and a mean average precision (mAP) from 0.5 to 0.95 of 0.47388. These metrics significantly surpass those of the original YOLOv5ds, which recorded values of 0.81483 for accuracy, 0.51332 for recall, 0.63552 for AP at 0.5, and 0.34922 for mAP. The algorithm effectively corrects target displacement deviations in non-orthogonal images and achieves more objective and accurate contour extraction, meeting the requirements for rapid extraction. Due to these features, YOLOv5ds-RC can further enhance fully automated rapid extraction and historical change analysis in land use change monitoring.https://doi.org/10.1371/journal.pone.0317106
spellingShingle Fangzhe Chang
Tianyue Ma
Dantong Wang
Shoujie Zhu
Dengping Li
Shuntian Feng
Xiaoyong Fan
Method for building segmentation and extraction from high-resolution remote sensing images based on improved YOLOv5ds.
PLoS ONE
title Method for building segmentation and extraction from high-resolution remote sensing images based on improved YOLOv5ds.
title_full Method for building segmentation and extraction from high-resolution remote sensing images based on improved YOLOv5ds.
title_fullStr Method for building segmentation and extraction from high-resolution remote sensing images based on improved YOLOv5ds.
title_full_unstemmed Method for building segmentation and extraction from high-resolution remote sensing images based on improved YOLOv5ds.
title_short Method for building segmentation and extraction from high-resolution remote sensing images based on improved YOLOv5ds.
title_sort method for building segmentation and extraction from high resolution remote sensing images based on improved yolov5ds
url https://doi.org/10.1371/journal.pone.0317106
work_keys_str_mv AT fangzhechang methodforbuildingsegmentationandextractionfromhighresolutionremotesensingimagesbasedonimprovedyolov5ds
AT tianyuema methodforbuildingsegmentationandextractionfromhighresolutionremotesensingimagesbasedonimprovedyolov5ds
AT dantongwang methodforbuildingsegmentationandextractionfromhighresolutionremotesensingimagesbasedonimprovedyolov5ds
AT shoujiezhu methodforbuildingsegmentationandextractionfromhighresolutionremotesensingimagesbasedonimprovedyolov5ds
AT dengpingli methodforbuildingsegmentationandextractionfromhighresolutionremotesensingimagesbasedonimprovedyolov5ds
AT shuntianfeng methodforbuildingsegmentationandextractionfromhighresolutionremotesensingimagesbasedonimprovedyolov5ds
AT xiaoyongfan methodforbuildingsegmentationandextractionfromhighresolutionremotesensingimagesbasedonimprovedyolov5ds