A comprehensive evaluation of deep learning approaches for ground-level ozone prediction across different regions
Recently, the issue of near-surface ozone pollution has become a growing concern. To effectively manage and control ozone pollution, emerging deep learning (DL) techniques have been applied for future ozone concentration trend prediction, generating promising outcomes. However, existing studies empl...
Saved in:
| Main Authors: | , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Elsevier
2025-05-01
|
| Series: | Ecological Informatics |
| Subjects: | |
| Online Access: | http://www.sciencedirect.com/science/article/pii/S1574954125000330 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Summary: | Recently, the issue of near-surface ozone pollution has become a growing concern. To effectively manage and control ozone pollution, emerging deep learning (DL) techniques have been applied for future ozone concentration trend prediction, generating promising outcomes. However, existing studies employ various DL models and rely on diverse datasets to predict ozone concentrations. This leads to a lack of comprehensive evaluations of how the architecture and depth of different DL models influence the predictive accuracy of ozone concentration trends when assessed using a unified dataset. This lack of uniformity in evaluations creates a gap in our understanding of the influence of different neural network architectures and depths on ozone concentration predictions. In this work, we aim to address this research gap by conducting a systematic performance evaluation that benchmarks six prominent DL architectures, each with varying depths, to evaluate their effectiveness for predicting ozone concentrations across diverse geographical regions. Our findings indicate that the best-performing DL model in the nationwide prediction task is the one-layer bidirectional long short-term memory (Bi-LSTM) model, which achieves an R2 of 0.66, an RMSE of 15.32μg⋅m−3, and an MAE of 11.51μg⋅m−3. In contrast, the poorest-performing model in the same prediction task is the one-block transformer-based model, with an R2 of 0.57, an RMSE of 17.34μg⋅m−3, and an MAE of 13.3μg⋅m−3. Furthermore, fully connected networks (FCNs) demonstrate robust and efficient predictive performance across both nationwide and regional prediction tasks. Notably, our study reveals that no single DL model consistently performs well across all prediction tasks, emphasizing the need for tailored approaches that cater to the specific attributes of each region. Additionally, we observe that DL models with more than two hidden layers frequently suffer from overfitting. Particularly for the Bi-LSTM architecture, as the number of hidden layers increases from 1 to 7, we observe a 12% reduction in R2 performance. Our analysis also identifies the most influential meteorological factors among the top-performing DL models, offering insights for feature selection and optimization in model development. This research contributes to a deeper understanding of the design and selection of appropriate DL architectures for predicting the concentrations of ozone and other air pollutants. |
|---|---|
| ISSN: | 1574-9541 |