AdjcorT-RBFNN for Air Quality Classification: Mitigating Multicollinearity with Real and Simulated Data

Air pollution levels have remained a significant issue worldwide despite advancements in technology, primarily due to rapid industrialization and urbanization. Among the various pollutants, PM2.5 significantly impacts air quality, posing health risks suc...

Full description

Saved in:
Bibliographic Details
Main Authors: Siti Khadijah Arafin, Nor Azura Md Ghani, Marshima Mohd Rosli, Nurain Ibrahim
Format: Article
Language:English
Published: Mahidol University 2025-05-01
Series:Environment and Natural Resources Journal
Subjects:
Online Access:https://ph02.tci-thaijo.org/index.php/ennrj/article/view/257350/172065
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849718749370777600
author Siti Khadijah Arafin
Nor Azura Md Ghani
Marshima Mohd Rosli
Nurain Ibrahim
author_facet Siti Khadijah Arafin
Nor Azura Md Ghani
Marshima Mohd Rosli
Nurain Ibrahim
author_sort Siti Khadijah Arafin
collection DOAJ
description Air pollution levels have remained a significant issue worldwide despite advancements in technology, primarily due to rapid industrialization and urbanization. Among the various pollutants, PM2.5 significantly impacts air quality, posing health risks such as respiratory and cardiovascular diseases. Accurate prediction of PM2.5 levels is essential for effective air quality management. However, multicollinearity in air quality data can hindermodel performance.To address this issue, this study introduces theAdjcorT-RBFNN, a two-stage feature selection method, to classify air quality in Klang, Selangor. The AdjcorT-RBFNN model selects the optimal combination of 9 feature combinations from 10 variables and outperforms the RBFNN model, which uses all 10 variables. With 7 hidden nodes and a learning rate of 0.01 for both models, AdjcorT-RBFNN achieves higher accuracy (0.62), sensitivity (0.64), specificity (0.60), precision (0.60), F1 score (0.62), and AUROC (0.62), confirming its effectiveness in classification tasks. The optimal features for predicting air quality in Klang are identified as PM2.5, PM10, relative humidity, SO2, wind direction, O3, CO, ambient temperature, and NO2. Monte Carlo simulations validate the model’s effectiveness, showing that AdjcorT-RBFNN consistently outperforms RBFNN, especially with strong negative correlations (ρ=-0.8) and larger sample sizes (N=150 and 200) further enhance classification accuracy. Compared to RBFNN, AdjcorT-RBFNN enhances class discrimination and reduces false positives, improving its reliability in detecting true classifications. These findings highlight the importance of feature selection in improving model performance, particularly in datasets with multicollinearity. Researchers, and health organizations can leverage AdjcorT-RBFNN for more accurate air quality predictions, supporting informed pollution control strategies.
format Article
id doaj-art-8bf4899201a34c76af0ba167108c8cd5
institution DOAJ
issn 1686-5456
2408-2384
language English
publishDate 2025-05-01
publisher Mahidol University
record_format Article
series Environment and Natural Resources Journal
spelling doaj-art-8bf4899201a34c76af0ba167108c8cd52025-08-20T03:12:19ZengMahidol UniversityEnvironment and Natural Resources Journal1686-54562408-23842025-05-0123324225510.32526/ennrj/23/20250006AdjcorT-RBFNN for Air Quality Classification: Mitigating Multicollinearity with Real and Simulated DataSiti Khadijah Arafin0Nor Azura Md Ghani1 Marshima Mohd Rosli2 Nurain Ibrahim3School of Mathematical Sciences, College of Computing, Informatics and Mathematics, Universiti Teknologi MARA, Shah Alam, Selangor, Malaysia School of Mathematical Sciences, College of Computing, Informatics and Mathematics, Universiti Teknologi MARA, Shah Alam, Selangor, Malaysia School of Computing Sciences, College of Computing, Informatics and Mathematics, Universiti Teknologi MARA, Shah Alam, Selangor, MalaysiaSchool of Mathematical Sciences, College of Computing, Informatics and Mathematics, Universiti Teknologi MARA, Shah Alam, Selangor, Malaysia Air pollution levels have remained a significant issue worldwide despite advancements in technology, primarily due to rapid industrialization and urbanization. Among the various pollutants, PM2.5 significantly impacts air quality, posing health risks such as respiratory and cardiovascular diseases. Accurate prediction of PM2.5 levels is essential for effective air quality management. However, multicollinearity in air quality data can hindermodel performance.To address this issue, this study introduces theAdjcorT-RBFNN, a two-stage feature selection method, to classify air quality in Klang, Selangor. The AdjcorT-RBFNN model selects the optimal combination of 9 feature combinations from 10 variables and outperforms the RBFNN model, which uses all 10 variables. With 7 hidden nodes and a learning rate of 0.01 for both models, AdjcorT-RBFNN achieves higher accuracy (0.62), sensitivity (0.64), specificity (0.60), precision (0.60), F1 score (0.62), and AUROC (0.62), confirming its effectiveness in classification tasks. The optimal features for predicting air quality in Klang are identified as PM2.5, PM10, relative humidity, SO2, wind direction, O3, CO, ambient temperature, and NO2. Monte Carlo simulations validate the model’s effectiveness, showing that AdjcorT-RBFNN consistently outperforms RBFNN, especially with strong negative correlations (ρ=-0.8) and larger sample sizes (N=150 and 200) further enhance classification accuracy. Compared to RBFNN, AdjcorT-RBFNN enhances class discrimination and reduces false positives, improving its reliability in detecting true classifications. These findings highlight the importance of feature selection in improving model performance, particularly in datasets with multicollinearity. Researchers, and health organizations can leverage AdjcorT-RBFNN for more accurate air quality predictions, supporting informed pollution control strategies.https://ph02.tci-thaijo.org/index.php/ennrj/article/view/257350/172065adjcortadjcort-rbfnnair pollutionair quality predictionparticulate matter pm2.rbfnn
spellingShingle Siti Khadijah Arafin
Nor Azura Md Ghani
Marshima Mohd Rosli
Nurain Ibrahim
AdjcorT-RBFNN for Air Quality Classification: Mitigating Multicollinearity with Real and Simulated Data
Environment and Natural Resources Journal
adjcort
adjcort-rbfnn
air pollution
air quality prediction
particulate matter pm2.
rbfnn
title AdjcorT-RBFNN for Air Quality Classification: Mitigating Multicollinearity with Real and Simulated Data
title_full AdjcorT-RBFNN for Air Quality Classification: Mitigating Multicollinearity with Real and Simulated Data
title_fullStr AdjcorT-RBFNN for Air Quality Classification: Mitigating Multicollinearity with Real and Simulated Data
title_full_unstemmed AdjcorT-RBFNN for Air Quality Classification: Mitigating Multicollinearity with Real and Simulated Data
title_short AdjcorT-RBFNN for Air Quality Classification: Mitigating Multicollinearity with Real and Simulated Data
title_sort adjcort rbfnn for air quality classification mitigating multicollinearity with real and simulated data
topic adjcort
adjcort-rbfnn
air pollution
air quality prediction
particulate matter pm2.
rbfnn
url https://ph02.tci-thaijo.org/index.php/ennrj/article/view/257350/172065
work_keys_str_mv AT sitikhadijaharafin adjcortrbfnnforairqualityclassificationmitigatingmulticollinearitywithrealandsimulateddata
AT norazuramdghani adjcortrbfnnforairqualityclassificationmitigatingmulticollinearitywithrealandsimulateddata
AT marshimamohdrosli adjcortrbfnnforairqualityclassificationmitigatingmulticollinearitywithrealandsimulateddata
AT nurainibrahim adjcortrbfnnforairqualityclassificationmitigatingmulticollinearitywithrealandsimulateddata