Machine learning-based identification of key factors and spatial heterogeneity analysis of urban flooding: a case study of the central urban area of Ordos

Abstract With global climate change and accelerating urbanization, urban flood is becoming more frequent worldwide. Understanding the urban vulnerability is crucial for making decisions on urban flood control. This study uses urban flood susceptibility (UFS) as an indicator, and comprehensively appl...

Full description

Saved in:
Bibliographic Details
Main Authors: Yu Qin, Yingdong Yu, Jiahong Liu, Ruifen Liu
Format: Article
Language:English
Published: Nature Portfolio 2025-07-01
Series:Scientific Reports
Subjects:
Online Access:https://doi.org/10.1038/s41598-025-08162-4
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849332939940167680
author Yu Qin
Yingdong Yu
Jiahong Liu
Ruifen Liu
author_facet Yu Qin
Yingdong Yu
Jiahong Liu
Ruifen Liu
author_sort Yu Qin
collection DOAJ
description Abstract With global climate change and accelerating urbanization, urban flood is becoming more frequent worldwide. Understanding the urban vulnerability is crucial for making decisions on urban flood control. This study uses urban flood susceptibility (UFS) as an indicator, and comprehensively applies three machine learning models, XGBoost, CatBoost and LightGBM, in the Kangyi area of Ordos City. Combined with the Shapley Additive explanations method, the driving mechanism and spatial heterogeneity of flood susceptibility was explored in the study area. The results show: (1) Model performance comparison: All three models have high accuracy, with XGBoost performing well in overall classification (OA = 0.96) and CatBoost performing well in distinguishing flood/non-flood samples (AUC = 0.85). (2) Multi-model adaptability assessment: The proposed “model-factor-space” framework highlights the sensitivity of XGBoost to urbanization indicators, the ability of CatBoost to capture natural geographical elements, and the efficiency of LightGBM in analyzing terrain thresholds. (3) Dynamic thresholds and synergies: Impervious surface density (ISD) is the most critical factor, and when ISD > 0.2, the risk of flooding will continue to increase by 60%. Comprehensive analysis with spatial heterogeneity shows that high-risk areas are mainly affected by ISD, road density (> 10,000 m/km2) and low altitude (< 1300m) in urban built-up areas, while low-to-medium risk areas are sensitive to vegetation coverage (> 7,000) and Dis2Water bodies (> 1,500m). (4) Hierarchical governance strategy: A three-level spatial governance strategy is proposed: in the core area, priority is given to ISD control (< 0.2) and pipe network upgrades; in the transitional area, slope interception and ecological restoration are combined; and in the potential risk area, a multi-scale monitoring and early warning system is established for multi-scale monitoring.
format Article
id doaj-art-a4ba71cf9e2c4da0876237264807506e
institution Kabale University
issn 2045-2322
language English
publishDate 2025-07-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj-art-a4ba71cf9e2c4da0876237264807506e2025-08-20T03:46:03ZengNature PortfolioScientific Reports2045-23222025-07-0115112010.1038/s41598-025-08162-4Machine learning-based identification of key factors and spatial heterogeneity analysis of urban flooding: a case study of the central urban area of OrdosYu Qin0Yingdong Yu1Jiahong Liu2Ruifen Liu3Hubei University of TechnologyState Key Laboratory of Simulation and Regulation of Water Cycle in River Basin, China Institute of Water Resources and Hydropower ResearchState Key Laboratory of Simulation and Regulation of Water Cycle in River Basin, China Institute of Water Resources and Hydropower ResearchHubei University of TechnologyAbstract With global climate change and accelerating urbanization, urban flood is becoming more frequent worldwide. Understanding the urban vulnerability is crucial for making decisions on urban flood control. This study uses urban flood susceptibility (UFS) as an indicator, and comprehensively applies three machine learning models, XGBoost, CatBoost and LightGBM, in the Kangyi area of Ordos City. Combined with the Shapley Additive explanations method, the driving mechanism and spatial heterogeneity of flood susceptibility was explored in the study area. The results show: (1) Model performance comparison: All three models have high accuracy, with XGBoost performing well in overall classification (OA = 0.96) and CatBoost performing well in distinguishing flood/non-flood samples (AUC = 0.85). (2) Multi-model adaptability assessment: The proposed “model-factor-space” framework highlights the sensitivity of XGBoost to urbanization indicators, the ability of CatBoost to capture natural geographical elements, and the efficiency of LightGBM in analyzing terrain thresholds. (3) Dynamic thresholds and synergies: Impervious surface density (ISD) is the most critical factor, and when ISD > 0.2, the risk of flooding will continue to increase by 60%. Comprehensive analysis with spatial heterogeneity shows that high-risk areas are mainly affected by ISD, road density (> 10,000 m/km2) and low altitude (< 1300m) in urban built-up areas, while low-to-medium risk areas are sensitive to vegetation coverage (> 7,000) and Dis2Water bodies (> 1,500m). (4) Hierarchical governance strategy: A three-level spatial governance strategy is proposed: in the core area, priority is given to ISD control (< 0.2) and pipe network upgrades; in the transitional area, slope interception and ecological restoration are combined; and in the potential risk area, a multi-scale monitoring and early warning system is established for multi-scale monitoring.https://doi.org/10.1038/s41598-025-08162-4Urban flooding susceptibilityMulti-model collaborationSHAP interpretation methodDynamic thresholdsSpatial heterogeneity
spellingShingle Yu Qin
Yingdong Yu
Jiahong Liu
Ruifen Liu
Machine learning-based identification of key factors and spatial heterogeneity analysis of urban flooding: a case study of the central urban area of Ordos
Scientific Reports
Urban flooding susceptibility
Multi-model collaboration
SHAP interpretation method
Dynamic thresholds
Spatial heterogeneity
title Machine learning-based identification of key factors and spatial heterogeneity analysis of urban flooding: a case study of the central urban area of Ordos
title_full Machine learning-based identification of key factors and spatial heterogeneity analysis of urban flooding: a case study of the central urban area of Ordos
title_fullStr Machine learning-based identification of key factors and spatial heterogeneity analysis of urban flooding: a case study of the central urban area of Ordos
title_full_unstemmed Machine learning-based identification of key factors and spatial heterogeneity analysis of urban flooding: a case study of the central urban area of Ordos
title_short Machine learning-based identification of key factors and spatial heterogeneity analysis of urban flooding: a case study of the central urban area of Ordos
title_sort machine learning based identification of key factors and spatial heterogeneity analysis of urban flooding a case study of the central urban area of ordos
topic Urban flooding susceptibility
Multi-model collaboration
SHAP interpretation method
Dynamic thresholds
Spatial heterogeneity
url https://doi.org/10.1038/s41598-025-08162-4
work_keys_str_mv AT yuqin machinelearningbasedidentificationofkeyfactorsandspatialheterogeneityanalysisofurbanfloodingacasestudyofthecentralurbanareaofordos
AT yingdongyu machinelearningbasedidentificationofkeyfactorsandspatialheterogeneityanalysisofurbanfloodingacasestudyofthecentralurbanareaofordos
AT jiahongliu machinelearningbasedidentificationofkeyfactorsandspatialheterogeneityanalysisofurbanfloodingacasestudyofthecentralurbanareaofordos
AT ruifenliu machinelearningbasedidentificationofkeyfactorsandspatialheterogeneityanalysisofurbanfloodingacasestudyofthecentralurbanareaofordos