Enhancing Dengue Outbreak Prediction in Bangladesh: A Weighted Average Ensemble Machine Learning Approach
DOI:
https://doi.org/10.56532/mjsat.v6i2.601Keywords:
Dengue Outbreak Prediction, Ensemble Learning, Machine Learning Models, Public Health Informatics, Dengue TransmissionAbstract
Dengue fever is one of the most urgent public health threats in Bangladesh, particularly in cities like Chattogram, where rapid urban growth, poor waste management, and changing climate conditions create a fertile ground for outbreaks. However, most existing dengue prediction models lack contextual adaptation, often relying on single classifiers that fail to capture localized socio-environmental factors, limiting their predictive reliability in the Bangladeshi context. To strengthen early prediction and response, this study introduces a Weighted Average Ensemble Learning (WAEL) model that combines the strengths of Support Vector Machine (SVM), Random Forest (RF), and AdaBoost classifiers. Using a dataset of 199 records and nine attributes collected from a Figshare survey on dengue awareness and prevention, extensive preprocessing steps such as feature selection, cleaning, and imputation were applied to ensure high data quality. Six baseline machine learning models, including Decision Tree, Naïve Bayes, k-Nearest Neighbor, SVM, RF, and AdaBoost, were evaluated, with RF and AdaBoost emerging as the strongest individual performers, each achieving 92.5% accuracy and F1-scores above 0.95. The proposed WAEL approach surpassed all individual models, achieving 93% accuracy, 94% precision, 98.5% recall, and an F1-score of 0.955. These findings demonstrate the advantage of ensemble methods in producing more reliable and context-aware predictions by harnessing the complementary strengths of multiple classifiers. Beyond technical performance, the study offers valuable insights for policymakers and health authorities to identify high-risk areas, vulnerable populations, and key factors driving dengue transmission, ultimately providing a data-driven framework for targeted prevention and early intervention in Bangladesh.
References
S. Bhatt, P. W. Gething, O. J. Brady, J. P. Messina, A. W. Farlow, C. L. Moyes, et al., “The global distribution and burden of dengue,” Nature, vol. 496, no. 7446, pp. 504–507, Apr. 2013, doi: https://doi.org/10.1038/nature12060.
S. Yesmin, S. Sarmin, A. M. Ahammad, M. A. Rafi, and M. J. Hasan, “Epidemiological investigation of the 2019 dengue outbreak in Dhaka, Bangladesh,” Journal of Tropical Medicine, vol. 2023, pp. 1–7, 2023, doi: https://doi.org/10.1155/2023/8898453.
M. A. Rob, M. Hossain, M. A. Sattar, I. U. Ahmed, A. F. M. N. Chowdhury, H. M. H. Mehedi, et al., “Circulating dengue virus serotypes, demographics, and epidemiology in the 2023 dengue outbreak in Chittagong, Bangladesh,” Eur. J. Microbiol. Immunol., vol. 14, no. 3, pp. 272–279, Aug. 22, 2024, doi: https://doi.org/10.1556/1886.2024.00069.
M. S. Hossain, A. A. Noman, S. M. A. Mamun, A. Al Mosabbir, et al., “Twenty-two years of dengue outbreaks in Bangladesh: epidemiology, clinical spectrum, serotypes, and future disease risks,” Trop. Med. Health, vol. 51, art. no. 37, Jul. 11, 2023, doi: https://doi.org/10.1186/s41182-023-00528-6.
J. Clarke, A. Lim, P. Gupte, D. M. Pigott, W. G. Van Panhuis, and O. J. Brady, “A global dataset of publicly available dengue case count data,” Scientific Data, vol. 11, no. 1, 2024, doi: https://doi.org/10.1038/s41597-024-03120-7.
M. M. Khan, M. A. H. Miah, M. K. Alam, M. A. Islam, M. A. Rahman, R. I. I. Noor, et al., “Clinico-epidemiological profiling of dengue patients in a non-endemic region of Bangladesh,” Trans. R. Soc. Trop. Med. Hyg., vol. 119, no. 1, pp. 58–64, 2024, doi: https://doi.org/10.1093/trstmh/trae074.
M. N. Hasan et al., “The 2023 fatal dengue outbreak in Bangladesh highlights a paradigm shift of geographical distribution of cases,” Epidemiology and Infection, vol. 153, 2025, doi: https://doi.org/10.1017/S0950268824001791.
K. M. Y. Arafat, A. Hossain, M. Ikfat, M. A. Amin, K. Tanvir, D. Gomes, and M. Rahman, "FRF-HHO: Early ovarian cancer prediction using explainable fuzzy random forest optimized by Harris Hawks algorithm," Advances in Biomarker Sciences and Technology, vol. 8, 2026, doi: https://doi.org/10.1016/j.abst.2026.01.003.
C. Cortes and V. Vapnik, “Support vector networks,” Machine Learning, vol. 20, no. 3, pp. 273–297, 1995, doi: https://doi.org/10.1007/BF00994018.
P. Chanprasopchai, I. M. Tang, and P. Pongsumpun, “SIR model for dengue disease with effect of dengue vaccination,” Computational and Mathematical Methods in Medicine, vol. 2018, pp. 1–14, 2018, doi: https://doi.org/10.1155/2018/9861572.
M. S. D. P. Nayak and K. A. Narayan, “Forecasting dengue fever incidence using ARIMA analysis,” International Journal of Collaborative Research on Internal Medicine & Public Health, vol. 11, no. 6, pp. 924–932, 2019. [Online]. Available: https://www.iomcworld.org/articles/forecasting-dengue-fever-incidence-using-arima-analysis.pdf.
S. G. Kakarla et al., “Lag effect of climatic variables on dengue burden in India,” Epidemiology and Infection, vol. 147, 2019, doi: https://doi.org/10.1017/S0950268819000608.
S. Mala and M. Jat, “Geographic information system based spatio temporal dengue fever cluster analysis and mapping,” The Egyptian Journal of Remote Sensing and Space Science, vol. 22, no. 8, pp. 297–304, 2019, doi: https://doi.org/10.1016/j.ejrs.2019.08.002.
T. S. Ho et al., “Comparing machine learning with case control models to identify confirmed dengue cases,” PLoS Neglected Tropical Diseases, vol. 14, no. 11, p. e0008843, 2020, doi: https://doi.org/10.1371/journal.pntd.0008843.
E. Munarsih and I. Saluza, “Comparison of exponential smoothing method and autoregressive integrated moving average (ARIMA) method in predicting dengue fever cases in the city of Palembang,” Journal of Physics: Conference Series, vol. 1521, p. 032100, 2020, doi: https://doi.org/10.1088/1742-6596/1521/3/032100.
W. Hoyos, J. Aguilar, and M. Toro, “Dengue models based on machine learning techniques: A systematic literature review,” Artificial Intelligence in Medicine, vol. 119, p. 102157, 2021, doi: https://doi.org/10.1016/j.artmed.2021.102157.
M. Yusuff, “The role of GIS in mapping dengue hotspots,” unpublished, 2023.
Sarker and Karim, “Predictive models for dengue outbreak of Dhaka, Bangladesh: Generalized additive models with B spline regression,” Jahangirnagar University Journal of Statistical Studies, vol. 37, pp. 101–115, 2023.
M. C. Ramírez Soto, J. V. B. Machuca, D. H. Stalder, D. Champin, and M. G. Martínez Fernández, “SIR SI model with a Gaussian transmission rate: Understanding the dynamics of dengue outbreaks in Lima, Peru,” PLoS ONE, vol. 18, no. 4, p. e0284263, 2023, doi: https://doi.org/10.1371/journal.pone.0284263.
K. Tanvir, M. Rahman, and D. Gomes, “A hybrid optimization and Tree-Based learning framework for dengue diagnosis using hematological data,” SSRN Electronic Journal, Jan. 2025, doi: https://doi.org/10.2139/ssrn.5687196.
M. J. Hasan et al., “Clinical and epidemiological characteristics of the dengue outbreak of 2024: a multicenter observation from Bangladesh,” Tropical Medicine and Health, vol. 53, p. 45, 2025, doi: https://doi.org/10.1186/s41182-025-00691-y.
S. Rahman, F. Mehejabin, and R. Rashid, “Knowledge and prevention practice against dengue vectors among dengue patients and general people in Chattogram, Bangladesh,” F1000Research, vol. 11, p. 146, 2022, doi: https://doi.org/10.12688/f1000research.108731.1.
G. Guo, H. Wang, Y. Bell, Davidand Bi, and K. Greer, KNN Model Based Approach in Classification. Berlin, Heidelberg: Springer, 2003.
M. Rahman, M. Hasan, M. M. Billah, and R. J. Sajuti, “Grading system prediction of educational performance analysis using data mining approach,” Malaysian Journal of Science and Advanced Technology, vol. 2, no. 4, pp. 204–211, 2022, doi: https://doi.org/10.56532/mjsat.v2i4.96.
A. A. Chowdhury, A. Das, S. K. Saha, M. Rahman, and K. T. Hasan, “Sentiment analysis of COVID 19 vaccination from survey responses in Bangladesh,” Research Square, unpublished, 2021.
M. Rahman, M. Hasan, M. M. Billah, and R. J. Sajuti, “Political fake news detection from different news source on social media using machine learning techniques,” AIUB Journal of Science and Engineering (AJSE), 2022, doi: https://doi.org/10.53799/ajse.v21i2.383.
P. Favaro and A. Vedaldi, AdaBoost. Boston, MA, USA: Springer US, 2014.
M. A. I. Neloy, N. Nahar, M. S. Hossain, and K. Andersson, “A weighted average ensemble technique to predict heart disease,” pp. 17–29, 2022.
N. Dey, M. Srinivas, and R. B. V. Subramanyam, “A novel contactless middle finger knuckle based person identification using ensemble learning,” pp. 981–986, 2023.
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Mahfujur Rahman, Noboranjan Dey, Nazmus Sakib, Rukaiya Jahan Sajuti, Mehedi Hasan, Abdullah Hel Azmain, Rahul Biswas, Dipta Justin Gomes, Kazi Tanvir, Mirza Asif Mahmud

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
