Global road extraction using a pseudo-label guided framework: from benchmark dataset to cross-region semi-supervised learning

Recent advancements in satellite remote sensing technology and computer vision have enabled rapid extraction of road networks from massive, Very High-Resolution (VHR) satellite imagery. However, current road extraction methods face the following limitations: 1) Insufficient availability of accurate...

Full description

Saved in:
Bibliographic Details
Main Authors: Xiaoyan Lu, Yanfei Zhong, Zhuo Zheng, JunJue Wang, Dingyuan Chen, Yu Su
Format: Article
Language:English
Published: Taylor & Francis Group 2025-05-01
Series:Geo-spatial Information Science
Subjects:
Online Access:https://www.tandfonline.com/doi/10.1080/10095020.2024.2362760
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Recent advancements in satellite remote sensing technology and computer vision have enabled rapid extraction of road networks from massive, Very High-Resolution (VHR) satellite imagery. However, current road extraction methods face the following limitations: 1) Insufficient availability of accurate and diverse training datasets for global-scale road extraction; 2) Costly and time-consuming manual labeling of millions of road samples; and 3) Limited generalization ability of deep learning models across diverse global contexts, resulting in better performance for regions well-represented in the training dataset, but worse performance when faced with domain gaps. To address these challenges, a semi-supervised framework was developed in this study, which includes a global-scale benchmark dataset – termed GlobalRoadSet (GRSet) – and a pseudo-label guided semi-supervised road extraction network – termed GlobalRoadNetSF (GRNetSF). The GRSet dataset was constructed using high-resolution satellite imagery and open-source crowdsourced OpenStreetMap (OSM) data. It comprises 47,210 samples collected from 121 capital cities across six populated continents. The GRNetSF trains the network by generating pseudo-labels for unlabeled images, combined with a few labeled samples from the target region. To enhance the quality of the pseudo-labels, strong data augmentation perturbation and auxiliary feature perturbation techniques are employed to ensure model prediction consistency. The proposed GRNetSF_GRSet framework was implemented in over 30 cities worldwide, where most of the Intersection-over-Union (IoU) values increased by more than 10%. This outcome confirms its strong generalization ability.
ISSN:1009-5020
1993-5153