Estimation of the Visibility in Seoul, South Korea, Based on Particulate Matter and Weather Data, Using Machine-learning Algorithm

Abstract Visibility is an important indicator of air quality and of any consequent meteorological and climate change. Therefore, visibility in Seoul, which is the most polluted city in South Korea, was estimated using machine learning (ML) algorithms based on meteorological (temperature, relative hu...

Full description

Saved in:
Bibliographic Details
Main Authors: Bu-Yo Kim, Joo Wan Cha, Ki-Ho Chang, Chulkyu Lee
Format: Article
Language:English
Published: Springer 2022-08-01
Series:Aerosol and Air Quality Research
Subjects:
Online Access:https://doi.org/10.4209/aaqr.220125
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1823862884614012928
author Bu-Yo Kim
Joo Wan Cha
Ki-Ho Chang
Chulkyu Lee
author_facet Bu-Yo Kim
Joo Wan Cha
Ki-Ho Chang
Chulkyu Lee
author_sort Bu-Yo Kim
collection DOAJ
description Abstract Visibility is an important indicator of air quality and of any consequent meteorological and climate change. Therefore, visibility in Seoul, which is the most polluted city in South Korea, was estimated using machine learning (ML) algorithms based on meteorological (temperature, relative humidity, and precipitation) and particulate matter (PM10 and PM2.5) data acquired from an automatic weather station, and the estimated visibility was compared with the observed visibility. Meteorological data, observed at 1-h intervals between 2018 and 2020, were used. Through learning and validation of each ML algorithm, the extreme gradient boosting (XGB) algorithm was found to be most suitable for visibility estimations (bias = 0 km, root mean square error (RMSE) = 0.08 km, and r = 1 for training data set). Among the meteorological and particulate matter data used for learning the XGB algorithm, the relative importance of PM2.5 and relative humidity variables were high (51% and 19%, respectively), whereas precipitation and wind speed had the low relative importance (approximately 1%). The estimation accuracy for the test dataset was good (bias = −0.11 km, RMSE = 2.08 km, and r = 0.94); the estimation accuracy was higher in the dry season (bias = −0.06 km, RMSE = 1.79 km, and r = 0.96) than in the rainy season (bias = −0.17 km, RMSE = 2.34 km, and r = 0.91). The results of this study indicated a higher correlation than the results of previous visibility estimation studies. The proposed method promotes accurate estimation of visibility in areas with poor visibility, and thus, it can be used to assess public health in areas with poor air quality.
format Article
id doaj-art-4653ca1752cd493b92387bfec05eae35
institution Kabale University
issn 1680-8584
2071-1409
language English
publishDate 2022-08-01
publisher Springer
record_format Article
series Aerosol and Air Quality Research
spelling doaj-art-4653ca1752cd493b92387bfec05eae352025-02-09T12:18:04ZengSpringerAerosol and Air Quality Research1680-85842071-14092022-08-01221011510.4209/aaqr.220125Estimation of the Visibility in Seoul, South Korea, Based on Particulate Matter and Weather Data, Using Machine-learning AlgorithmBu-Yo Kim0Joo Wan Cha1Ki-Ho Chang2Chulkyu Lee3Research Applications Department, National Institute of Meteorological SciencesResearch Applications Department, National Institute of Meteorological SciencesResearch Applications Department, National Institute of Meteorological SciencesObservation Research Department, National Institute of Meteorological SciencesAbstract Visibility is an important indicator of air quality and of any consequent meteorological and climate change. Therefore, visibility in Seoul, which is the most polluted city in South Korea, was estimated using machine learning (ML) algorithms based on meteorological (temperature, relative humidity, and precipitation) and particulate matter (PM10 and PM2.5) data acquired from an automatic weather station, and the estimated visibility was compared with the observed visibility. Meteorological data, observed at 1-h intervals between 2018 and 2020, were used. Through learning and validation of each ML algorithm, the extreme gradient boosting (XGB) algorithm was found to be most suitable for visibility estimations (bias = 0 km, root mean square error (RMSE) = 0.08 km, and r = 1 for training data set). Among the meteorological and particulate matter data used for learning the XGB algorithm, the relative importance of PM2.5 and relative humidity variables were high (51% and 19%, respectively), whereas precipitation and wind speed had the low relative importance (approximately 1%). The estimation accuracy for the test dataset was good (bias = −0.11 km, RMSE = 2.08 km, and r = 0.94); the estimation accuracy was higher in the dry season (bias = −0.06 km, RMSE = 1.79 km, and r = 0.96) than in the rainy season (bias = −0.17 km, RMSE = 2.34 km, and r = 0.91). The results of this study indicated a higher correlation than the results of previous visibility estimation studies. The proposed method promotes accurate estimation of visibility in areas with poor visibility, and thus, it can be used to assess public health in areas with poor air quality.https://doi.org/10.4209/aaqr.220125SeoulMeteorological dataPM10PM2.5Visibility estimationMachine learning
spellingShingle Bu-Yo Kim
Joo Wan Cha
Ki-Ho Chang
Chulkyu Lee
Estimation of the Visibility in Seoul, South Korea, Based on Particulate Matter and Weather Data, Using Machine-learning Algorithm
Aerosol and Air Quality Research
Seoul
Meteorological data
PM10
PM2.5
Visibility estimation
Machine learning
title Estimation of the Visibility in Seoul, South Korea, Based on Particulate Matter and Weather Data, Using Machine-learning Algorithm
title_full Estimation of the Visibility in Seoul, South Korea, Based on Particulate Matter and Weather Data, Using Machine-learning Algorithm
title_fullStr Estimation of the Visibility in Seoul, South Korea, Based on Particulate Matter and Weather Data, Using Machine-learning Algorithm
title_full_unstemmed Estimation of the Visibility in Seoul, South Korea, Based on Particulate Matter and Weather Data, Using Machine-learning Algorithm
title_short Estimation of the Visibility in Seoul, South Korea, Based on Particulate Matter and Weather Data, Using Machine-learning Algorithm
title_sort estimation of the visibility in seoul south korea based on particulate matter and weather data using machine learning algorithm
topic Seoul
Meteorological data
PM10
PM2.5
Visibility estimation
Machine learning
url https://doi.org/10.4209/aaqr.220125
work_keys_str_mv AT buyokim estimationofthevisibilityinseoulsouthkoreabasedonparticulatematterandweatherdatausingmachinelearningalgorithm
AT joowancha estimationofthevisibilityinseoulsouthkoreabasedonparticulatematterandweatherdatausingmachinelearningalgorithm
AT kihochang estimationofthevisibilityinseoulsouthkoreabasedonparticulatematterandweatherdatausingmachinelearningalgorithm
AT chulkyulee estimationofthevisibilityinseoulsouthkoreabasedonparticulatematterandweatherdatausingmachinelearningalgorithm