Forecasting dengue in Bangladesh using meteorological variables with a novel feature selection approach
Abstract Dengue, a mosquito-borne viral disease, continues to pose severe risks to public health and economic stability in tropical and subtropical regions, particularly in developing nations like Bangladesh. The necessity for advanced forecasting mechanisms has never been more critical to enhance t...
Saved in:
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
Nature Portfolio
2024-12-01
|
Series: | Scientific Reports |
Subjects: | |
Online Access: | https://doi.org/10.1038/s41598-024-83770-0 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1841559458609627136 |
---|---|
author | Mahadee Al Mobin |
author_facet | Mahadee Al Mobin |
author_sort | Mahadee Al Mobin |
collection | DOAJ |
description | Abstract Dengue, a mosquito-borne viral disease, continues to pose severe risks to public health and economic stability in tropical and subtropical regions, particularly in developing nations like Bangladesh. The necessity for advanced forecasting mechanisms has never been more critical to enhance the effectiveness of vector control strategies and resource allocations. This study formulates a dynamic data pipeline to forecast dengue incidence based on 13 meteorological variables using a suite of state-of-the-art machine learning models and custom features engineering, achieving an accuracy of 84.02%, marking a substantial improvement over existing studies. A novel wrapper feature selection algorithm employing a custom objective function is proposed in this study, which significantly improves model accuracy by 12.63% and reduces the mean absolute percentage error by 70.82%. The custom objective function’s output can be transformed to quantify the contribution of each variable to the target variable’s variability, providing deeper insights into the workings of black box models. The study concludes that relative humidity is redundant in predicting dengue infection, while meteorological factors exhibit more significant short-term impacts compared to long-term and immediate impacts. Sunshine (hours) emerges as the meteorological factor with the most immediate impact, whereas precipitation is the most impactful predictor over both short-term (8-month lag) and long-term (26–30-month lag) periods. Forecasts for 2024 using the best-performing model predict a rise in dengue cases starting in May, peaking at 24,000 cases per month by August and persisting at high levels through October before declining to half by year-end. These findings offer critical insights into temporal climate effects on dengue transmission, aiding the development of effective forecasting systems. |
format | Article |
id | doaj-art-aaaef53405f443cc81d25acb996aa99c |
institution | Kabale University |
issn | 2045-2322 |
language | English |
publishDate | 2024-12-01 |
publisher | Nature Portfolio |
record_format | Article |
series | Scientific Reports |
spelling | doaj-art-aaaef53405f443cc81d25acb996aa99c2025-01-05T12:29:29ZengNature PortfolioScientific Reports2045-23222024-12-0114112410.1038/s41598-024-83770-0Forecasting dengue in Bangladesh using meteorological variables with a novel feature selection approachMahadee Al Mobin0Bangladesh Institute of Governance and ManagementAbstract Dengue, a mosquito-borne viral disease, continues to pose severe risks to public health and economic stability in tropical and subtropical regions, particularly in developing nations like Bangladesh. The necessity for advanced forecasting mechanisms has never been more critical to enhance the effectiveness of vector control strategies and resource allocations. This study formulates a dynamic data pipeline to forecast dengue incidence based on 13 meteorological variables using a suite of state-of-the-art machine learning models and custom features engineering, achieving an accuracy of 84.02%, marking a substantial improvement over existing studies. A novel wrapper feature selection algorithm employing a custom objective function is proposed in this study, which significantly improves model accuracy by 12.63% and reduces the mean absolute percentage error by 70.82%. The custom objective function’s output can be transformed to quantify the contribution of each variable to the target variable’s variability, providing deeper insights into the workings of black box models. The study concludes that relative humidity is redundant in predicting dengue infection, while meteorological factors exhibit more significant short-term impacts compared to long-term and immediate impacts. Sunshine (hours) emerges as the meteorological factor with the most immediate impact, whereas precipitation is the most impactful predictor over both short-term (8-month lag) and long-term (26–30-month lag) periods. Forecasts for 2024 using the best-performing model predict a rise in dengue cases starting in May, peaking at 24,000 cases per month by August and persisting at high levels through October before declining to half by year-end. These findings offer critical insights into temporal climate effects on dengue transmission, aiding the development of effective forecasting systems.https://doi.org/10.1038/s41598-024-83770-0Machine learningDengueTime series analysisTropical diseaseFeature selection |
spellingShingle | Mahadee Al Mobin Forecasting dengue in Bangladesh using meteorological variables with a novel feature selection approach Scientific Reports Machine learning Dengue Time series analysis Tropical disease Feature selection |
title | Forecasting dengue in Bangladesh using meteorological variables with a novel feature selection approach |
title_full | Forecasting dengue in Bangladesh using meteorological variables with a novel feature selection approach |
title_fullStr | Forecasting dengue in Bangladesh using meteorological variables with a novel feature selection approach |
title_full_unstemmed | Forecasting dengue in Bangladesh using meteorological variables with a novel feature selection approach |
title_short | Forecasting dengue in Bangladesh using meteorological variables with a novel feature selection approach |
title_sort | forecasting dengue in bangladesh using meteorological variables with a novel feature selection approach |
topic | Machine learning Dengue Time series analysis Tropical disease Feature selection |
url | https://doi.org/10.1038/s41598-024-83770-0 |
work_keys_str_mv | AT mahadeealmobin forecastingdengueinbangladeshusingmeteorologicalvariableswithanovelfeatureselectionapproach |