Global air quality index prediction using integrated spatial observation data and geographics machine learning

Air pollution can occur in the whole world, with each region having its unique driving factors that contribute to human's health. However, effective mitigation of air pollution is often hindered by the uneven distribution of air quality monitoring stations, which tend to be concentrated in pote...

Full description

Saved in:
Bibliographic Details
Main Authors: Tania Septi Anggraini, Hitoshi Irie, Anjar Dimara Sakti, Ketut Wikantika
Format: Article
Language:English
Published: Elsevier 2025-06-01
Series:Science of Remote Sensing
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2666017225000033
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850116403165659136
author Tania Septi Anggraini
Hitoshi Irie
Anjar Dimara Sakti
Ketut Wikantika
author_facet Tania Septi Anggraini
Hitoshi Irie
Anjar Dimara Sakti
Ketut Wikantika
author_sort Tania Septi Anggraini
collection DOAJ
description Air pollution can occur in the whole world, with each region having its unique driving factors that contribute to human's health. However, effective mitigation of air pollution is often hindered by the uneven distribution of air quality monitoring stations, which tend to be concentrated in potential hotspots like major cities. This study aims to detect and improve the accuracy of the Global Air Quality Index from Remote Sensing (AQI-RS) by integrating AQI from ground-based stations with driving factors such as meteorological, environmental, sources of air pollution, and air pollution magnitude from satellite observation parameters as independent variables using Geographics Machine Learning (GML). This study utilizes 425 air pollution stations and the driving factors data globally from 2013 to 2024. The GML considers geographical characteristics in the analysis by calculating the optimal bandwidth area in its algorithm. The study employs nine scenarios to identify which parameters significantly contribute to the model and determine the best parameter combinations. In determining the best scenario, this study considers the R2 value, Root Mean Square Error (RMSE), and uncertainty in each of the scenarios. This study produced an AQI-RS model with an average R2, RMSE, and uncertainty in the best scenario of 0.89, 5.58, and 5.69 (AQI unit), respectively. The results indicate that GML significantly improves the accuracy of global AQI-RS over previous studies. By considering geographical characteristics using GML, this research is expected to gain an accurate prediction of AQI globally especially in regions without ground-based air pollution stations for the worldwide mitigation.
format Article
id doaj-art-cbbd43ebeaa84124a412b30153ed7c30
institution OA Journals
issn 2666-0172
language English
publishDate 2025-06-01
publisher Elsevier
record_format Article
series Science of Remote Sensing
spelling doaj-art-cbbd43ebeaa84124a412b30153ed7c302025-08-20T02:36:19ZengElsevierScience of Remote Sensing2666-01722025-06-011110019710.1016/j.srs.2025.100197Global air quality index prediction using integrated spatial observation data and geographics machine learningTania Septi Anggraini0Hitoshi Irie1Anjar Dimara Sakti2Ketut Wikantika3Doctoral Program in Geodesy and Geomatics Engineering, Faculty of Earth Science and Technology, Institut Teknologi Bandung, Bandung, 40132, Indonesia; Center for Environmental Remote Sensing, Chiba University, 1-33 Yayoicho, Inage-Ku, Chiba, 263-8522, Japan; Geographics Information Sciences, Faculty of Social Science Education, Universitas Pendidikan Indonesia, Bandung, 40154, Indonesia; Center for Remote Sensing, Institut Teknologi Bandung, Bandung, 40132, IndonesiaCenter for Environmental Remote Sensing, Chiba University, 1-33 Yayoicho, Inage-Ku, Chiba, 263-8522, JapanCenter for Remote Sensing, Institut Teknologi Bandung, Bandung, 40132, Indonesia; Geographic Information Sciences and Technology Research Group, Faculty of Earth Sciences and Technology, Institut Teknologi Bandung, Bandung, 40132, Indonesia; Corresponding author. Center for Remote Sensing, Institut Teknologi Bandung, Bandung, 40132, Indonesia.Center for Remote Sensing, Institut Teknologi Bandung, Bandung, 40132, Indonesia; Geographic Information Sciences and Technology Research Group, Faculty of Earth Sciences and Technology, Institut Teknologi Bandung, Bandung, 40132, IndonesiaAir pollution can occur in the whole world, with each region having its unique driving factors that contribute to human's health. However, effective mitigation of air pollution is often hindered by the uneven distribution of air quality monitoring stations, which tend to be concentrated in potential hotspots like major cities. This study aims to detect and improve the accuracy of the Global Air Quality Index from Remote Sensing (AQI-RS) by integrating AQI from ground-based stations with driving factors such as meteorological, environmental, sources of air pollution, and air pollution magnitude from satellite observation parameters as independent variables using Geographics Machine Learning (GML). This study utilizes 425 air pollution stations and the driving factors data globally from 2013 to 2024. The GML considers geographical characteristics in the analysis by calculating the optimal bandwidth area in its algorithm. The study employs nine scenarios to identify which parameters significantly contribute to the model and determine the best parameter combinations. In determining the best scenario, this study considers the R2 value, Root Mean Square Error (RMSE), and uncertainty in each of the scenarios. This study produced an AQI-RS model with an average R2, RMSE, and uncertainty in the best scenario of 0.89, 5.58, and 5.69 (AQI unit), respectively. The results indicate that GML significantly improves the accuracy of global AQI-RS over previous studies. By considering geographical characteristics using GML, this research is expected to gain an accurate prediction of AQI globally especially in regions without ground-based air pollution stations for the worldwide mitigation.http://www.sciencedirect.com/science/article/pii/S2666017225000033DisasterAir quality indexDriving factorsGeographics machine learning
spellingShingle Tania Septi Anggraini
Hitoshi Irie
Anjar Dimara Sakti
Ketut Wikantika
Global air quality index prediction using integrated spatial observation data and geographics machine learning
Science of Remote Sensing
Disaster
Air quality index
Driving factors
Geographics machine learning
title Global air quality index prediction using integrated spatial observation data and geographics machine learning
title_full Global air quality index prediction using integrated spatial observation data and geographics machine learning
title_fullStr Global air quality index prediction using integrated spatial observation data and geographics machine learning
title_full_unstemmed Global air quality index prediction using integrated spatial observation data and geographics machine learning
title_short Global air quality index prediction using integrated spatial observation data and geographics machine learning
title_sort global air quality index prediction using integrated spatial observation data and geographics machine learning
topic Disaster
Air quality index
Driving factors
Geographics machine learning
url http://www.sciencedirect.com/science/article/pii/S2666017225000033
work_keys_str_mv AT taniaseptianggraini globalairqualityindexpredictionusingintegratedspatialobservationdataandgeographicsmachinelearning
AT hitoshiirie globalairqualityindexpredictionusingintegratedspatialobservationdataandgeographicsmachinelearning
AT anjardimarasakti globalairqualityindexpredictionusingintegratedspatialobservationdataandgeographicsmachinelearning
AT ketutwikantika globalairqualityindexpredictionusingintegratedspatialobservationdataandgeographicsmachinelearning