Estimation of the Visibility in Seoul, South Korea, Based on Particulate Matter and Weather Data, Using Machine-learning Algorithm
Abstract Visibility is an important indicator of air quality and of any consequent meteorological and climate change. Therefore, visibility in Seoul, which is the most polluted city in South Korea, was estimated using machine learning (ML) algorithms based on meteorological (temperature, relative hu...
Saved in:
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Springer
2022-08-01
|
Series: | Aerosol and Air Quality Research |
Subjects: | |
Online Access: | https://doi.org/10.4209/aaqr.220125 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1823862884614012928 |
---|---|
author | Bu-Yo Kim Joo Wan Cha Ki-Ho Chang Chulkyu Lee |
author_facet | Bu-Yo Kim Joo Wan Cha Ki-Ho Chang Chulkyu Lee |
author_sort | Bu-Yo Kim |
collection | DOAJ |
description | Abstract Visibility is an important indicator of air quality and of any consequent meteorological and climate change. Therefore, visibility in Seoul, which is the most polluted city in South Korea, was estimated using machine learning (ML) algorithms based on meteorological (temperature, relative humidity, and precipitation) and particulate matter (PM10 and PM2.5) data acquired from an automatic weather station, and the estimated visibility was compared with the observed visibility. Meteorological data, observed at 1-h intervals between 2018 and 2020, were used. Through learning and validation of each ML algorithm, the extreme gradient boosting (XGB) algorithm was found to be most suitable for visibility estimations (bias = 0 km, root mean square error (RMSE) = 0.08 km, and r = 1 for training data set). Among the meteorological and particulate matter data used for learning the XGB algorithm, the relative importance of PM2.5 and relative humidity variables were high (51% and 19%, respectively), whereas precipitation and wind speed had the low relative importance (approximately 1%). The estimation accuracy for the test dataset was good (bias = −0.11 km, RMSE = 2.08 km, and r = 0.94); the estimation accuracy was higher in the dry season (bias = −0.06 km, RMSE = 1.79 km, and r = 0.96) than in the rainy season (bias = −0.17 km, RMSE = 2.34 km, and r = 0.91). The results of this study indicated a higher correlation than the results of previous visibility estimation studies. The proposed method promotes accurate estimation of visibility in areas with poor visibility, and thus, it can be used to assess public health in areas with poor air quality. |
format | Article |
id | doaj-art-4653ca1752cd493b92387bfec05eae35 |
institution | Kabale University |
issn | 1680-8584 2071-1409 |
language | English |
publishDate | 2022-08-01 |
publisher | Springer |
record_format | Article |
series | Aerosol and Air Quality Research |
spelling | doaj-art-4653ca1752cd493b92387bfec05eae352025-02-09T12:18:04ZengSpringerAerosol and Air Quality Research1680-85842071-14092022-08-01221011510.4209/aaqr.220125Estimation of the Visibility in Seoul, South Korea, Based on Particulate Matter and Weather Data, Using Machine-learning AlgorithmBu-Yo Kim0Joo Wan Cha1Ki-Ho Chang2Chulkyu Lee3Research Applications Department, National Institute of Meteorological SciencesResearch Applications Department, National Institute of Meteorological SciencesResearch Applications Department, National Institute of Meteorological SciencesObservation Research Department, National Institute of Meteorological SciencesAbstract Visibility is an important indicator of air quality and of any consequent meteorological and climate change. Therefore, visibility in Seoul, which is the most polluted city in South Korea, was estimated using machine learning (ML) algorithms based on meteorological (temperature, relative humidity, and precipitation) and particulate matter (PM10 and PM2.5) data acquired from an automatic weather station, and the estimated visibility was compared with the observed visibility. Meteorological data, observed at 1-h intervals between 2018 and 2020, were used. Through learning and validation of each ML algorithm, the extreme gradient boosting (XGB) algorithm was found to be most suitable for visibility estimations (bias = 0 km, root mean square error (RMSE) = 0.08 km, and r = 1 for training data set). Among the meteorological and particulate matter data used for learning the XGB algorithm, the relative importance of PM2.5 and relative humidity variables were high (51% and 19%, respectively), whereas precipitation and wind speed had the low relative importance (approximately 1%). The estimation accuracy for the test dataset was good (bias = −0.11 km, RMSE = 2.08 km, and r = 0.94); the estimation accuracy was higher in the dry season (bias = −0.06 km, RMSE = 1.79 km, and r = 0.96) than in the rainy season (bias = −0.17 km, RMSE = 2.34 km, and r = 0.91). The results of this study indicated a higher correlation than the results of previous visibility estimation studies. The proposed method promotes accurate estimation of visibility in areas with poor visibility, and thus, it can be used to assess public health in areas with poor air quality.https://doi.org/10.4209/aaqr.220125SeoulMeteorological dataPM10PM2.5Visibility estimationMachine learning |
spellingShingle | Bu-Yo Kim Joo Wan Cha Ki-Ho Chang Chulkyu Lee Estimation of the Visibility in Seoul, South Korea, Based on Particulate Matter and Weather Data, Using Machine-learning Algorithm Aerosol and Air Quality Research Seoul Meteorological data PM10 PM2.5 Visibility estimation Machine learning |
title | Estimation of the Visibility in Seoul, South Korea, Based on Particulate Matter and Weather Data, Using Machine-learning Algorithm |
title_full | Estimation of the Visibility in Seoul, South Korea, Based on Particulate Matter and Weather Data, Using Machine-learning Algorithm |
title_fullStr | Estimation of the Visibility in Seoul, South Korea, Based on Particulate Matter and Weather Data, Using Machine-learning Algorithm |
title_full_unstemmed | Estimation of the Visibility in Seoul, South Korea, Based on Particulate Matter and Weather Data, Using Machine-learning Algorithm |
title_short | Estimation of the Visibility in Seoul, South Korea, Based on Particulate Matter and Weather Data, Using Machine-learning Algorithm |
title_sort | estimation of the visibility in seoul south korea based on particulate matter and weather data using machine learning algorithm |
topic | Seoul Meteorological data PM10 PM2.5 Visibility estimation Machine learning |
url | https://doi.org/10.4209/aaqr.220125 |
work_keys_str_mv | AT buyokim estimationofthevisibilityinseoulsouthkoreabasedonparticulatematterandweatherdatausingmachinelearningalgorithm AT joowancha estimationofthevisibilityinseoulsouthkoreabasedonparticulatematterandweatherdatausingmachinelearningalgorithm AT kihochang estimationofthevisibilityinseoulsouthkoreabasedonparticulatematterandweatherdatausingmachinelearningalgorithm AT chulkyulee estimationofthevisibilityinseoulsouthkoreabasedonparticulatematterandweatherdatausingmachinelearningalgorithm |