Long-Term (2015–2024) Daily PM<sub>2.5</sub> Estimation in China by Using XGBoost Combining Empirical Orthogonal Function Decomposition

Fine particulate matter (PM<sub>2.5</sub>) has garnered significant scientific and public health concern owing to its capacity for deep penetration into the human respiratory system, presenting significant health risks. Despite the implementation of strict environmental policies in China...

Full description

Saved in:
Bibliographic Details
Main Authors: Jiacheng Jiang, Jiaxin Dong, Yu Ding, Wenjia Ni, Jie Yang, Siwei Li
Format: Article
Language:English
Published: MDPI AG 2025-05-01
Series:Remote Sensing
Subjects:
Online Access:https://www.mdpi.com/2072-4292/17/9/1632
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Fine particulate matter (PM<sub>2.5</sub>) has garnered significant scientific and public health concern owing to its capacity for deep penetration into the human respiratory system, presenting significant health risks. Despite the implementation of strict environmental policies in China over the past decade to reduce PM<sub>2.5</sub> levels, long-term public health concerns remain a serious issue. Our study aims to provide a high-quality, seamless daily PM<sub>2.5</sub> dataset for China covering the years 2015 to 2024. A two-step PM<sub>2.5</sub> estimation model is established based on a machine learning algorithm and a spatio-temporal decomposition method. First, we utilize the machine learning algorithm XGBoost (EXtreme Gradient Boosting) to address gaps in the daily MAIAC (Multi-Angle Implementation of Atmospheric Correction) AOD (Aerosol Optical Depth), with R<sup>2</sup>/RMSE (coefficient of determination/Root Mean Square Error) of 0.67/0.2678 compared to AERONET (Aerosol Robotic Network) AOD. Then, a novel approach by integrating XGBoost with EOF (Empirical Orthogonal Function) decomposition is introduced for PM<sub>2.5</sub> estimation. The integration of EOF allows for the incorporation of entire meteorological field information into the PM<sub>2.5</sub> estimation model, significantly enhancing its accuracy: spatial CV (cross-validation)-R<sup>2</sup> improved from 0.8340 to 0.8935, and spatial CV-RMSE reduced from 13.8177 to 11.0668. Leveraging the newly produced dataset, we analyze the spatio-temporal variations of PM<sub>2.5</sub> across China with EOF decomposition, particularly noting that PM<sub>2.5</sub> levels in the eastern anthropogenic intensive regions continuously declined from 2015 to 2020, and fluctuated steadily during 2020–2024. This research underscores the critical need for sustained and effective air quality management strategies in China.
ISSN:2072-4292