AdjcorT-RBFNN for Air Quality Classification: Mitigating Multicollinearity with Real and Simulated Data
Air pollution levels have remained a significant issue worldwide despite advancements in technology, primarily due to rapid industrialization and urbanization. Among the various pollutants, PM2.5 significantly impacts air quality, posing health risks suc...
Saved in:
| Main Authors: | , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Mahidol University
2025-05-01
|
| Series: | Environment and Natural Resources Journal |
| Subjects: | |
| Online Access: | https://ph02.tci-thaijo.org/index.php/ennrj/article/view/257350/172065 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1849718749370777600 |
|---|---|
| author | Siti Khadijah Arafin Nor Azura Md Ghani Marshima Mohd Rosli Nurain Ibrahim |
| author_facet | Siti Khadijah Arafin Nor Azura Md Ghani Marshima Mohd Rosli Nurain Ibrahim |
| author_sort | Siti Khadijah Arafin |
| collection | DOAJ |
| description | Air pollution levels have remained a significant issue worldwide despite advancements in technology, primarily due to rapid industrialization and urbanization. Among the various pollutants, PM2.5 significantly impacts air quality, posing health risks such as respiratory and cardiovascular diseases. Accurate prediction of PM2.5 levels is essential for effective air quality management. However, multicollinearity in air quality data can hindermodel performance.To address this issue, this study introduces theAdjcorT-RBFNN, a two-stage feature selection method, to classify air quality in Klang, Selangor. The AdjcorT-RBFNN model selects the optimal combination of 9 feature combinations from 10 variables and outperforms the RBFNN model, which uses all 10 variables. With 7 hidden nodes and a learning rate of 0.01 for both models, AdjcorT-RBFNN achieves higher accuracy (0.62), sensitivity (0.64), specificity (0.60), precision (0.60), F1 score (0.62), and AUROC (0.62), confirming its effectiveness in classification tasks. The optimal features for predicting air quality in Klang are identified as PM2.5, PM10, relative humidity, SO2, wind direction, O3, CO, ambient temperature, and NO2. Monte Carlo simulations validate the model’s effectiveness, showing that AdjcorT-RBFNN consistently outperforms RBFNN, especially with strong negative correlations (ρ=-0.8) and larger sample sizes (N=150 and 200) further enhance classification accuracy. Compared to RBFNN, AdjcorT-RBFNN enhances class discrimination and reduces false positives, improving its reliability in detecting true classifications. These findings highlight the importance of feature selection in improving model performance, particularly in datasets with multicollinearity. Researchers, and health organizations can leverage AdjcorT-RBFNN for more accurate air quality predictions, supporting informed pollution control strategies. |
| format | Article |
| id | doaj-art-8bf4899201a34c76af0ba167108c8cd5 |
| institution | DOAJ |
| issn | 1686-5456 2408-2384 |
| language | English |
| publishDate | 2025-05-01 |
| publisher | Mahidol University |
| record_format | Article |
| series | Environment and Natural Resources Journal |
| spelling | doaj-art-8bf4899201a34c76af0ba167108c8cd52025-08-20T03:12:19ZengMahidol UniversityEnvironment and Natural Resources Journal1686-54562408-23842025-05-0123324225510.32526/ennrj/23/20250006AdjcorT-RBFNN for Air Quality Classification: Mitigating Multicollinearity with Real and Simulated DataSiti Khadijah Arafin0Nor Azura Md Ghani1 Marshima Mohd Rosli2 Nurain Ibrahim3School of Mathematical Sciences, College of Computing, Informatics and Mathematics, Universiti Teknologi MARA, Shah Alam, Selangor, Malaysia School of Mathematical Sciences, College of Computing, Informatics and Mathematics, Universiti Teknologi MARA, Shah Alam, Selangor, Malaysia School of Computing Sciences, College of Computing, Informatics and Mathematics, Universiti Teknologi MARA, Shah Alam, Selangor, MalaysiaSchool of Mathematical Sciences, College of Computing, Informatics and Mathematics, Universiti Teknologi MARA, Shah Alam, Selangor, Malaysia Air pollution levels have remained a significant issue worldwide despite advancements in technology, primarily due to rapid industrialization and urbanization. Among the various pollutants, PM2.5 significantly impacts air quality, posing health risks such as respiratory and cardiovascular diseases. Accurate prediction of PM2.5 levels is essential for effective air quality management. However, multicollinearity in air quality data can hindermodel performance.To address this issue, this study introduces theAdjcorT-RBFNN, a two-stage feature selection method, to classify air quality in Klang, Selangor. The AdjcorT-RBFNN model selects the optimal combination of 9 feature combinations from 10 variables and outperforms the RBFNN model, which uses all 10 variables. With 7 hidden nodes and a learning rate of 0.01 for both models, AdjcorT-RBFNN achieves higher accuracy (0.62), sensitivity (0.64), specificity (0.60), precision (0.60), F1 score (0.62), and AUROC (0.62), confirming its effectiveness in classification tasks. The optimal features for predicting air quality in Klang are identified as PM2.5, PM10, relative humidity, SO2, wind direction, O3, CO, ambient temperature, and NO2. Monte Carlo simulations validate the model’s effectiveness, showing that AdjcorT-RBFNN consistently outperforms RBFNN, especially with strong negative correlations (ρ=-0.8) and larger sample sizes (N=150 and 200) further enhance classification accuracy. Compared to RBFNN, AdjcorT-RBFNN enhances class discrimination and reduces false positives, improving its reliability in detecting true classifications. These findings highlight the importance of feature selection in improving model performance, particularly in datasets with multicollinearity. Researchers, and health organizations can leverage AdjcorT-RBFNN for more accurate air quality predictions, supporting informed pollution control strategies.https://ph02.tci-thaijo.org/index.php/ennrj/article/view/257350/172065adjcortadjcort-rbfnnair pollutionair quality predictionparticulate matter pm2.rbfnn |
| spellingShingle | Siti Khadijah Arafin Nor Azura Md Ghani Marshima Mohd Rosli Nurain Ibrahim AdjcorT-RBFNN for Air Quality Classification: Mitigating Multicollinearity with Real and Simulated Data Environment and Natural Resources Journal adjcort adjcort-rbfnn air pollution air quality prediction particulate matter pm2. rbfnn |
| title | AdjcorT-RBFNN for Air Quality Classification: Mitigating Multicollinearity with Real and Simulated Data |
| title_full | AdjcorT-RBFNN for Air Quality Classification: Mitigating Multicollinearity with Real and Simulated Data |
| title_fullStr | AdjcorT-RBFNN for Air Quality Classification: Mitigating Multicollinearity with Real and Simulated Data |
| title_full_unstemmed | AdjcorT-RBFNN for Air Quality Classification: Mitigating Multicollinearity with Real and Simulated Data |
| title_short | AdjcorT-RBFNN for Air Quality Classification: Mitigating Multicollinearity with Real and Simulated Data |
| title_sort | adjcort rbfnn for air quality classification mitigating multicollinearity with real and simulated data |
| topic | adjcort adjcort-rbfnn air pollution air quality prediction particulate matter pm2. rbfnn |
| url | https://ph02.tci-thaijo.org/index.php/ennrj/article/view/257350/172065 |
| work_keys_str_mv | AT sitikhadijaharafin adjcortrbfnnforairqualityclassificationmitigatingmulticollinearitywithrealandsimulateddata AT norazuramdghani adjcortrbfnnforairqualityclassificationmitigatingmulticollinearitywithrealandsimulateddata AT marshimamohdrosli adjcortrbfnnforairqualityclassificationmitigatingmulticollinearitywithrealandsimulateddata AT nurainibrahim adjcortrbfnnforairqualityclassificationmitigatingmulticollinearitywithrealandsimulateddata |