Forecasting dengue in Bangladesh using meteorological variables with a novel feature selection approach

Abstract Dengue, a mosquito-borne viral disease, continues to pose severe risks to public health and economic stability in tropical and subtropical regions, particularly in developing nations like Bangladesh. The necessity for advanced forecasting mechanisms has never been more critical to enhance t...

Full description

Saved in:
Bibliographic Details
Main Author: Mahadee Al Mobin
Format: Article
Language:English
Published: Nature Portfolio 2024-12-01
Series:Scientific Reports
Subjects:
Online Access:https://doi.org/10.1038/s41598-024-83770-0
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841559458609627136
author Mahadee Al Mobin
author_facet Mahadee Al Mobin
author_sort Mahadee Al Mobin
collection DOAJ
description Abstract Dengue, a mosquito-borne viral disease, continues to pose severe risks to public health and economic stability in tropical and subtropical regions, particularly in developing nations like Bangladesh. The necessity for advanced forecasting mechanisms has never been more critical to enhance the effectiveness of vector control strategies and resource allocations. This study formulates a dynamic data pipeline to forecast dengue incidence based on 13 meteorological variables using a suite of state-of-the-art machine learning models and custom features engineering, achieving an accuracy of 84.02%, marking a substantial improvement over existing studies. A novel wrapper feature selection algorithm employing a custom objective function is proposed in this study, which significantly improves model accuracy by 12.63% and reduces the mean absolute percentage error by 70.82%. The custom objective function’s output can be transformed to quantify the contribution of each variable to the target variable’s variability, providing deeper insights into the workings of black box models. The study concludes that relative humidity is redundant in predicting dengue infection, while meteorological factors exhibit more significant short-term impacts compared to long-term and immediate impacts. Sunshine (hours) emerges as the meteorological factor with the most immediate impact, whereas precipitation is the most impactful predictor over both short-term (8-month lag) and long-term (26–30-month lag) periods. Forecasts for 2024 using the best-performing model predict a rise in dengue cases starting in May, peaking at 24,000 cases per month by August and persisting at high levels through October before declining to half by year-end. These findings offer critical insights into temporal climate effects on dengue transmission, aiding the development of effective forecasting systems.
format Article
id doaj-art-aaaef53405f443cc81d25acb996aa99c
institution Kabale University
issn 2045-2322
language English
publishDate 2024-12-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj-art-aaaef53405f443cc81d25acb996aa99c2025-01-05T12:29:29ZengNature PortfolioScientific Reports2045-23222024-12-0114112410.1038/s41598-024-83770-0Forecasting dengue in Bangladesh using meteorological variables with a novel feature selection approachMahadee Al Mobin0Bangladesh Institute of Governance and ManagementAbstract Dengue, a mosquito-borne viral disease, continues to pose severe risks to public health and economic stability in tropical and subtropical regions, particularly in developing nations like Bangladesh. The necessity for advanced forecasting mechanisms has never been more critical to enhance the effectiveness of vector control strategies and resource allocations. This study formulates a dynamic data pipeline to forecast dengue incidence based on 13 meteorological variables using a suite of state-of-the-art machine learning models and custom features engineering, achieving an accuracy of 84.02%, marking a substantial improvement over existing studies. A novel wrapper feature selection algorithm employing a custom objective function is proposed in this study, which significantly improves model accuracy by 12.63% and reduces the mean absolute percentage error by 70.82%. The custom objective function’s output can be transformed to quantify the contribution of each variable to the target variable’s variability, providing deeper insights into the workings of black box models. The study concludes that relative humidity is redundant in predicting dengue infection, while meteorological factors exhibit more significant short-term impacts compared to long-term and immediate impacts. Sunshine (hours) emerges as the meteorological factor with the most immediate impact, whereas precipitation is the most impactful predictor over both short-term (8-month lag) and long-term (26–30-month lag) periods. Forecasts for 2024 using the best-performing model predict a rise in dengue cases starting in May, peaking at 24,000 cases per month by August and persisting at high levels through October before declining to half by year-end. These findings offer critical insights into temporal climate effects on dengue transmission, aiding the development of effective forecasting systems.https://doi.org/10.1038/s41598-024-83770-0Machine learningDengueTime series analysisTropical diseaseFeature selection
spellingShingle Mahadee Al Mobin
Forecasting dengue in Bangladesh using meteorological variables with a novel feature selection approach
Scientific Reports
Machine learning
Dengue
Time series analysis
Tropical disease
Feature selection
title Forecasting dengue in Bangladesh using meteorological variables with a novel feature selection approach
title_full Forecasting dengue in Bangladesh using meteorological variables with a novel feature selection approach
title_fullStr Forecasting dengue in Bangladesh using meteorological variables with a novel feature selection approach
title_full_unstemmed Forecasting dengue in Bangladesh using meteorological variables with a novel feature selection approach
title_short Forecasting dengue in Bangladesh using meteorological variables with a novel feature selection approach
title_sort forecasting dengue in bangladesh using meteorological variables with a novel feature selection approach
topic Machine learning
Dengue
Time series analysis
Tropical disease
Feature selection
url https://doi.org/10.1038/s41598-024-83770-0
work_keys_str_mv AT mahadeealmobin forecastingdengueinbangladeshusingmeteorologicalvariableswithanovelfeatureselectionapproach