A Geostatistical and Machine Learning Approach to Modelling the Effects of Water Stress, Climate Change, and Sanitation Infrastructure on the Spatial Distribution of Infectious Diseases
Abstract
Infectious disease is becoming an increasingly important environmental issue, particularly in rapidly urbanising LMICs that lack the infrastructure needed to manage these challenges. It is important to map their spatial and temporal interactions for prevention and adaptation. We discuss novel advances in geostatistics and machine learning (ML) designed to map the spatial structure of these factors that influence the spatial pattern of infectious diseases; this includes core geostatistical methods such as kriging, variogram modelling, spatial regression, and autocorrelation analysis, as well as ML models such as random forests, convolutional neural networks (CNNs), and long short-term memory (LSTM) networks. The advantage of multimethod geostatistical-ML approaches over monolithic ones is that they achieve greater accuracy, interpretability, and uncertainty management. Case studies from 2020–2025 demonstrate that remote sensing, hydrologic and infrastructure data can be used to augment cholera, malaria and dengue models. Some of the challenges are data quality assurance, data interpretability, scalability, and privacy issues related to health data. Future priorities should focus on explainable AI, federated learning, and climate-health digital twins, which will help create resilient, secure, and future-proof models applicable globally. Finally, the integration of geostatistics and machine learning is a promising interdisciplinary approach to disease prediction and bolstering population resilience in a rapidly evolving world.
Keywords
Full Text:
PDFReferences
1. WHO. (2023). World Health Statistics 2023. Retrieved from https://www.who.int/publications/b/69040
2. Bain, W., Yang, H., Shah, F. A., Suber, T., Drohan, C., Al-Yousif, N., DeSensi, R. S., Bensen, N., Schaefer, C., Rosborough, B. R., Somasundaram, A., Workman, C. J., Lampenfeld, C., Cillo, A. R., Cardello, C., Shan, F., Bruno, T. C., Vignali, D. a. A., Ray, P., & Kitsios, G. D. (2021). COVID-19 versus Non–COVID-19 Acute Respiratory Distress Syndrome: Comparison of Demographics, Physiologic Parameters, Inflammatory Biomarkers, and Clinical Outcomes. Annals of the American Thoracic Society, 18(7), 1202–1210. doi: 10.1513/annalsats.202008-1026oc
3. Hutton, G., & Chase, C. (2017). Water Supply, Sanitation, and Hygiene. In The World Bank eBooks (pp. 171–198). doi: 10.1596/978-1-4648-0522-6_ch9
4. Ahmed, S. H., Shaikh, T. G., Waseem, S., Zahid, M., Ahmed, K. A. H. M., Ullah, I., & Hasibuzzaman, M. A. (2024). Water-related diseases following flooding in South Asian countries – a healthcare crisis. European Journal of Clinical and Experimental Medicine, 22(1), 232–242. doi: 10.15584/ejcem.2024.1.29
5. Semenza, J. C., Rocklöv, J., & Ebi, K. L. (2022). Climate Change and Cascading Risks from Infectious Disease. Infectious Diseases and Therapy, 11(4), 1371–1390. doi: 10.1007/s40121-022-00647-3
6. Ebi, K. L., & Hess, J. J. (2020). Health risks due to climate change: Inequity in causes and consequences. Health Affairs, 39(12), 2056–2062. doi: 10.1377/hlthaff.2020.01125
7. Chaudhry, D. (2023). Climate Change and Health of the Urban Poor: The role of environmental justice. The Journal of Climate Change and Health, 15, 100277. doi: 10.1016/j.joclim.2023.100277
8. Diggle, P. J. & Giorgi, E. (2019). Model-based Geostatistics for Global Public Health Methods and Applications. CRC Press.
9. Rezaei, M. (2025). Artificial intelligence in knowledge management: Identifying and addressing the key implementation challenges. Technological Forecasting and Social Change, 217, 124183. doi: 10.1016/j.techfore.2025.124183
10. Hussain, S. S. A., Bedi, S., Yadav, C. P., Mohanty, A. K., Mahatme, K., Tyagi, S., Krishnan, N. M. A., Kota, S. H., & Sharma, A. (2025). Hybrid models combining trend and seasonality components with machine learning algorithms provide accurate forecasting of malaria incidence. PLOS Global Public Health, 5(10), e0004500. doi: 10.1371/journal.pgph.0004500
11. Campbell, A. M., Racault, M., Goult, S., & Laurenson, A. (2020). Cholera Risk: A machine learning approach applied to essential climate variables. International Journal of Environmental Research and Public Health, 17(24), 9378. doi: 10.3390/ijerph17249378
12. Tewara, M. A., Yunxia, L., Lin, W., Barong, B. H., Mbah-Fongkimeh, P. N., Zhaolei, Z., Xinhui, L., Miao, Z., Liu, X., & Xue, F. (2020). Geographically weighted regression modelling of the spatial association between malaria cases and environmental factors in Cameroon. Research Square. doi: 10.21203/rs.2.13021/v2
13. Zhou, G., He, X., Yang, K., Li, L., Guo, H., Wang, G., Li, J., Chen, Y., & Yang, Y. (2023). Effects of temperature and relative humidity on behaviour and physiological indices in goats. Small Ruminant Research, 229, 107126. doi: 10.1016/j.smallrumres.2023.107126
14. Bhatt, S., Weiss, D. J., Cameron, E., Bisanzio, D., Mappin, B., Dalrymple, U., Battle, K. E., Moyes, C. L., Henry, A., Eckhoff, P. A., Wenger, E. A., Briët, O., Penny, M. A., Smith, T. A., Bennett, A., Yukich, J., Eisele, T. P., Griffin, J. T., Fergus, C. A., & Gething, P. W. (2015). The effect of malaria control on Plasmodium falciparum in Africa between 2000 and 2015. Nature, 526(7572), 207–211. doi: 10.1038/nature15535
15. Bisanzio, D., Bosa, H. K., Bakamutumaho, B., Nasimiyu, C., Atwine, D., Kyabayinze, D., Olaro, C., Breiman, R. F., Njenga, M. K., Mwebesa, H., Aceng, J. R., & Reithinger, R. (2025). Modelling case burden and duration of the Sudan Ebola virus disease outbreak in Uganda, 2022. Emerging Infectious Diseases, 31(9), 1829–1832. doi: 10.3201/eid3109.241545
16. Adewumi, I. O. (2025). AI for Cholera Outbreak Prediction, Real-Time Tracking, and Low-Resource Diagnostics using Federated and Privacy-Preserving Machine Learning. Research Square. doi: 10.21203/rs.3.rs-7441133/v1
17. Liu, Q., Chi, S., Dmytruk, K., Dmytruk, O., & Tan, S. (2022). Coronaviral infection and interferon response: The Virus-Host Arms Race and COVID-19. Viruses, 14(7), 1349. doi: 10.3390/v14071349
18. Manaf, M., Ali, Z., & Scholz, M. (2026). Integrating random forest-based regression kriging for analysing the spatial variability of rainfall in arid and semi-arid regions. Scientific Reports, 16(1), 5298. doi: 10.1038/s41598-026-36074-4
19. Cressie, N. (2021). A few statistical principles for data science. Australian & New Zealand Journal of Statistics, 63(1), 182–200. doi: 10.1111/anzs.12324
20. Brunsdon, C., Fotheringham, A., & Charlton, M. (2002). Geographically weighted summary statistics — a framework for localised exploratory data analysis. Computers Environment and Urban Systems, 26(6), 501–524. doi: 10.1016/s0198-9715(01)00009-6
21. Giorgi, E., Diggle, P. J., Snow, R. W., & Noor, A. M. (2018). Geostatistical Methods for Disease Mapping and Visualisation Using Data from Spatio‐temporally Referenced Prevalence Surveys. International Statistical Review, 86(3), 571–597. doi: 10.1111/insr.12268
22. Chen, Z., Chong, K. C., Wong, M. C., Boon, S. S., Huang, J., Wang, M. H., Ng, R. W., Lai, C. K., & Chan, P. K. (2021). A global analysis of replacement of genetic variants of SARS-CoV-2 in association with containment capacity and changes in disease severity. Clinical Microbiology and Infection, 27(5), 750–757. doi: 10.1016/j.cmi.2021.01.018
23. Chutia, D., Borah, K., Singh, L., Sarmah, D., & Singha, L. (2026). Updates on dengue virus infection: Epidemiology, molecular pathogenesis, and clinical strategies. Vacunas, 27(3), 500652. doi: 10.1016/j.vacun.2026.500652
24. Agboka, K. M., Abdel-Rahman, E. M., Salifu, D., Kanji, B., Ndjomatchoua, F. T., Guimapi, R. A., Ekesi, S., & Tobias, L. (2025). Towards combining self-organising maps (SOM) and convolutional neural networks (CNN) to improve model accuracy: Application to phenotypic resistance in malaria vectors. MethodsX, 14, 103198. doi: 10.1016/j.mex.2025.103198
25. De Oliveira, A. F., Da Costa Leite, I., & Valente, J. G. (2015). Global burden of diarrheal disease attributable to the water supply and sanitation system in the State of Minas Gerais, Brazil: 2005. Ciência & Saúde Coletiva, 20(4), 1027–1036. doi: 10.1590/1413-81232015204.00372014
Article Metrics
Metrics powered by PLOS ALM
Refbacks
- There are currently no refbacks.
Copyright (c) 2026 Jumoke Ifeoluwa Ogunremi, Philena Amie Sombo

This work is licensed under a Creative Commons Attribution 4.0 International License.



