CUGUV: A Benchmark Dataset for Promoting Large-Scale Urban Village Mapping with Deep Learning Models

Abstract Delineating the extent of urban villages (UVs) is crucial for effective urban planning and management, as well as for providing targeted policy and financial support. Unlike field surveys, the interpretation of satellite imagery provides an efficient, near real-time, and objective means of...

Full description

Saved in:
Bibliographic Details
Main Authors: Ziyi Wang, Qiao Sun, Xiao Zhang, Zekun Hu, Jiaoqi Chen, Cheng Zhong, Hui Li
Format: Article
Language:English
Published: Nature Portfolio 2025-03-01
Series:Scientific Data
Online Access:https://doi.org/10.1038/s41597-025-04701-w
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849761597598203904
author Ziyi Wang
Qiao Sun
Xiao Zhang
Zekun Hu
Jiaoqi Chen
Cheng Zhong
Hui Li
author_facet Ziyi Wang
Qiao Sun
Xiao Zhang
Zekun Hu
Jiaoqi Chen
Cheng Zhong
Hui Li
author_sort Ziyi Wang
collection DOAJ
description Abstract Delineating the extent of urban villages (UVs) is crucial for effective urban planning and management, as well as for providing targeted policy and financial support. Unlike field surveys, the interpretation of satellite imagery provides an efficient, near real-time, and objective means of mapping UV. However, current research efforts predominantly concentrate on individual cities, resulting in a scarcity of interpretable UV maps for numerous other cities. This gap in availability not only hinders public awareness of the distribution and evolution of UV but also limits the reliability and transferability of models due to the insufficient number and diversity of samples. To address this issue, we developed CUGUV, a benchmark dataset that includes a diverse collection of thousands of UV samples, carefully curated from 15 major cities across various geographical regions in China. The dataset can be accessed through this link: https://doi.org/10.6084/m9.figshare.26198093 . This dataset can serve as a foundation for evaluating and improving the robustness and transferability of models. Subsequently, we present an innovative framework that effectively integrates and learns from multiple data sources to better address the cross-city UV mapping task. Tests show that the proposed models achieve over 92% in overall accuracy, precision, and F1-scores, outperforming state-of-the-art models. This highlights the effectiveness of both the proposed dataset and model. This presented dataset and model bolsters our capability to better understand and accurately model these complex and diverse phenomena, ultimately leading to a notable improvement in the performance of large-scale UV mapping.
format Article
id doaj-art-328d9a5bc34745329598a71fda5057da
institution DOAJ
issn 2052-4463
language English
publishDate 2025-03-01
publisher Nature Portfolio
record_format Article
series Scientific Data
spelling doaj-art-328d9a5bc34745329598a71fda5057da2025-08-20T03:05:57ZengNature PortfolioScientific Data2052-44632025-03-0112111510.1038/s41597-025-04701-wCUGUV: A Benchmark Dataset for Promoting Large-Scale Urban Village Mapping with Deep Learning ModelsZiyi Wang0Qiao Sun1Xiao Zhang2Zekun Hu3Jiaoqi Chen4Cheng Zhong5Hui Li6School of Earth Sciences, China University of GeosciencesSchool of Electronic Engineering, Naval University of EngineeringCCCC Second Highway Consultants Co.Ltd.State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan UniversityBadong National Observation and Research Station of Geohazards, China University of GeosciencesBadong National Observation and Research Station of Geohazards, China University of GeosciencesSchool of Earth Sciences, China University of GeosciencesAbstract Delineating the extent of urban villages (UVs) is crucial for effective urban planning and management, as well as for providing targeted policy and financial support. Unlike field surveys, the interpretation of satellite imagery provides an efficient, near real-time, and objective means of mapping UV. However, current research efforts predominantly concentrate on individual cities, resulting in a scarcity of interpretable UV maps for numerous other cities. This gap in availability not only hinders public awareness of the distribution and evolution of UV but also limits the reliability and transferability of models due to the insufficient number and diversity of samples. To address this issue, we developed CUGUV, a benchmark dataset that includes a diverse collection of thousands of UV samples, carefully curated from 15 major cities across various geographical regions in China. The dataset can be accessed through this link: https://doi.org/10.6084/m9.figshare.26198093 . This dataset can serve as a foundation for evaluating and improving the robustness and transferability of models. Subsequently, we present an innovative framework that effectively integrates and learns from multiple data sources to better address the cross-city UV mapping task. Tests show that the proposed models achieve over 92% in overall accuracy, precision, and F1-scores, outperforming state-of-the-art models. This highlights the effectiveness of both the proposed dataset and model. This presented dataset and model bolsters our capability to better understand and accurately model these complex and diverse phenomena, ultimately leading to a notable improvement in the performance of large-scale UV mapping.https://doi.org/10.1038/s41597-025-04701-w
spellingShingle Ziyi Wang
Qiao Sun
Xiao Zhang
Zekun Hu
Jiaoqi Chen
Cheng Zhong
Hui Li
CUGUV: A Benchmark Dataset for Promoting Large-Scale Urban Village Mapping with Deep Learning Models
Scientific Data
title CUGUV: A Benchmark Dataset for Promoting Large-Scale Urban Village Mapping with Deep Learning Models
title_full CUGUV: A Benchmark Dataset for Promoting Large-Scale Urban Village Mapping with Deep Learning Models
title_fullStr CUGUV: A Benchmark Dataset for Promoting Large-Scale Urban Village Mapping with Deep Learning Models
title_full_unstemmed CUGUV: A Benchmark Dataset for Promoting Large-Scale Urban Village Mapping with Deep Learning Models
title_short CUGUV: A Benchmark Dataset for Promoting Large-Scale Urban Village Mapping with Deep Learning Models
title_sort cuguv a benchmark dataset for promoting large scale urban village mapping with deep learning models
url https://doi.org/10.1038/s41597-025-04701-w
work_keys_str_mv AT ziyiwang cuguvabenchmarkdatasetforpromotinglargescaleurbanvillagemappingwithdeeplearningmodels
AT qiaosun cuguvabenchmarkdatasetforpromotinglargescaleurbanvillagemappingwithdeeplearningmodels
AT xiaozhang cuguvabenchmarkdatasetforpromotinglargescaleurbanvillagemappingwithdeeplearningmodels
AT zekunhu cuguvabenchmarkdatasetforpromotinglargescaleurbanvillagemappingwithdeeplearningmodels
AT jiaoqichen cuguvabenchmarkdatasetforpromotinglargescaleurbanvillagemappingwithdeeplearningmodels
AT chengzhong cuguvabenchmarkdatasetforpromotinglargescaleurbanvillagemappingwithdeeplearningmodels
AT huili cuguvabenchmarkdatasetforpromotinglargescaleurbanvillagemappingwithdeeplearningmodels