Abstract
One in 25 deaths worldwide is related to liver disease, and often with multiple hepatosplenic conditions. Yet, little is understood of the risk factors for hepatosplenic multimorbidity, especially in the context of chronic infections. We present a novel Bayesian multitask learning framework to jointly model 45 hepatosplenic conditions assessed using point-of-care B-mode ultrasound for 3155 individuals aged 5-91 years within the SchistoTrack cohort across rural Uganda, where chronic intestinal schistosomiasis is endemic. We identify distinct and shared biomedical, socioeconomic, and spatial risk factors for individual conditions and hepatosplenic multimorbidity, and introduce methods for measuring condition dependencies as risk factors. Notably, for gastro-oesophageal varices, we discover key risk factors of older age, lower haemoglobin concentration, and schistosomal periportal fibrosis. Our findings provide a compendium of risk factors to inform surveillance, triage, and follow-up, while our model enables improved prediction of hepatosplenic multimorbidity, and if validated on other anatomical systems, general multimorbidity.
Similar content being viewed by others
Data availability
The raw data are protected and are not available due to data privacy laws. The metadata generated in this study are provided in the Supplementary Information.
Code availability
Synthetically generated data is provided based on random sampling of the covariates and conditions to allow the running of the code. The model implementation code is shared as supplementary material and can be found in Supplementary Code 1.
References
Langenberg, C., Hingorani, A. D. & Whitty, C. J. Biological and functional multimorbidity–from mechanisms to management. Nat. Med. 29, 1649–1657 (2023).
Barnett, K. et al. Epidemiology of multimorbidity and implications for health care, research, and medical education: a cross-sectional study. Lancet 380, 37–43 (2012).
Guthrie, B., Payne, K., Alderson, P., McMurdo, M. E. & Mercer, S. W. Adapting clinical guidelines to take account of multimorbidity. BMJ 345, e6341 (2012).
Skou, S. T. et al. Multimorbidity. Nat. Rev. Dis. Primers 8, 48 (2022).
MacMahon, S. et al. Multimorbidity: A Priority for Global Health Research Vol. 127 (Academy of Medical Sciences: London, UK, 2018).
Haug, N. et al. High-risk multimorbidity patterns on the road to cardiovascular mortality. BMC. Med. 18, 44 (2020).
Tai, X. Y. et al. Cardiometabolic multimorbidity, genetic risk, and dementia: a prospective cohort study. Lancet Healthy Longev. 3, e428–e436 (2022).
Souza, D. L. et al. Trends of multimorbidity in 15 European countries: a population-based study in community-dwelling adults aged 50 and over. BMC Public Health 21, 76 (2021).
Anjorin, S. et al. Epidemiology of periportal fibrosis and relevance of current Schistosoma mansoni infection within the context of repeated mass drug administration in rural Uganda: a population-based, cross-sectional study. Lancet Microbe 5, 100944 (2024).
Zhi, Y.-C. et al. Clinically validated graphical approaches identify hepatosplenic multimorbidity in individuals at risk of schistosomiasis. R. Soc. Open Sci. 12, 242256 (2025).
Opio, C. K. et al. Profiling lifetime episodes of upper gastrointestinal bleeding among patients from rural sub-Saharan Africa where Schistosoma mansoni is endemic. Pan. Afr. Med. J. 24, 296 (2016).
Wiegand, R. E. et al. Associations between infection intensity categories and morbidity prevalence in school-age children are much stronger for Schistosoma haematobium than for S. mansoni. PLoS Negl. Trop. Dis. 15, e0009444 (2021).
World Health Organization. WHO Guideline on Control and Elimination of Human Schistosomiasis (WHO, accessed 23 January 2026); https://www.who.int/publications/i/item/9789240041608
Ewuzie, A. et al. Association of current Schistosoma mansoni, Schistosoma japonicum, and Schistosoma mekongi infection status and intensity with periportal fibrosis: a systematic review and meta-analysis. Lancet Glob. Health 13, e69–e80 (2025).
Langenberg, M. C. et al. A controlled human Schistosoma mansoni infection model to advance novel drugs, vaccines and diagnostics. Nat. Med. 26, 326–332 (2020).
Lee, S. I. et al. Epidemiology of pre-existing multimorbidity in pregnant women in the uk in 2018: a cross-sectional study. Lancet 398, S7 (2021).
Sewpaul, R., Mbewu, A. D., Fagbamigbe, A. F., Kandala, N.-B. & Reddy, S. P. Prevalence of multimorbidity of cardiometabolic conditions and associated risk factors in a population-based sample of South Africans: a cross-sectional study. Public Health Pract. 2, 100193 (2021).
Caruana, R. Multitask learning. Mach. Learn. 28, 41–75 (1997).
Collobert, R. et al. Natural language processing (almost) from scratch. J. Mach. Learn. Res. 12, 2493–2537 (2011).
Zhang, Q., Zhang, G., Lu, J. & Wu, D. A framework of hybrid recommender system for personalized clinical prescription. In 2015 10Th International Conference on Intelligent Systems and Knowledge Engineering (ISKE) 189–195 (IEEE, 2015).
World Health Organization. Ultrasound in Schistosomiasis: A Practical Guide to the Standard Use of Ultrasonography for Assessment of Schistosomiasis-Related Morbidity (WHO, accessed 23 January 2026); https://www.who.int/publications/i/item/TDR-STR-SCH-00.1
Puthur, C., Nabatte, B., Tinkitina, B., Kabatereine, N. B. & Chami, G. F. Design and maintenance of community-based cohorts in sub-Saharan Africa: a longitudinal evaluation of participant attrition in schistotrack. BMJ Public Health 3, e002151 (2025).
Chowdhury, S. R., Das, D. C., Sunna, T. C., Beyene, J. & Hossain, A. Global and regional prevalence of multimorbidity in the adult population in community settings: a systematic review and meta-analysis. EClinicalMedicine 57, 101860 (2023).
Jiang, Y. et al. Ukb-mdrmf: a multi-disease risk and multimorbidity framework based on UK Biobank data. Nat. Commun. 16, 3767 (2025).
Lv, Y. et al. Hypersplenism: history and current status. Exp. Ther. Med. 12, 2377–2382 (2016).
Chami, G. F. et al. Influence of Schistosoma mansoni and hookworm infection intensities on anaemia in Ugandan villages. PLoS Negl. Trop. Dis. 9, e0004193 (2015).
Topley, J. M., Rogers, D., Stevens, M. & Serjeant, G. R. Acute splenic sequestration and hypersplenism in the first five years in homozygous sickle cell disease. Arch. Dis. Child. 56, 765–769 (1981).
Reitzug, F. et al. Current water contact and Schistosoma mansoni infection have distinct determinants: a data-driven population-based study in rural Uganda. Nat. Commun. 15, 9530 (2024).
World Health Organization. Guideline on Haemoglobin Cutoffs to Define Anaemia in Individuals and Populations (WHO, accessed 23 January 2026); https://www.who.int/publications/i/item/9789240088542
McConnell, M. J. & Iwakiri, Y. Portal hypertension in alcohol-associated hepatitis. Curr. Hepatol. Rep. 22, 67–73 (2023).
Reitzug, F., Ledien, J. & Chami, G. F. Associations of water contact frequency, duration, and activities with schistosome infection risk: a systematic review and meta-analysis. PLoS Negl. Trop. Dis. 17, e0011377 (2023).
Watermeyer, G. et al. Inflammatory bowel disease in sub-saharan africa: epidemiology, risk factors, and challenges in diagnosis. Lancet Gastroenterol. Hepatol. 7, 952–961 (2022).
Watermeyer, G. et al. Challenges in the management of inflammatory bowel disease in sub-saharan africa. Lancet Gastroenterol. Hepatol. 7, 962–972 (2022).
del Olmo, J. A. et al. Predictors of morbidity and mortality after the first episode of upper gastrointestinal bleeding in liver cirrhosis. J. Hepatol. 32, 19–24 (2000).
Bosch, J. & García-Pagán, J. C. Complications of cirrhosis. I. Portal hypertension. J. Hepatol. 32, 141–156 (2000).
Berzigotti, A. Non-invasive evaluation of portal hypertension using ultrasound elastography. J. Hepatol. 67, 399–411 (2017).
Rumack, C. M. & Levine, D. Diagnostic Ultrasound E-book (Elsevier Health Sciences, 2023).
Goncalves, A. et al. Bayesian multitask learning regression for heterogeneous patient cohorts. J. Biomed. Inform. 100, 100059 (2019).
Cheng, Y., Xia, Y. & Wang, X. Bayesian multitask learning for medicine recommendation based on online patient reviews. Bioinformatics 39, btad491 (2023).
Seitz, H. K. et al. Alcoholic liver disease. Nat. Rev. Dis. Primers 4, 16 (2018).
Nuffield Department of Population Health, University of Oxford. Schistotrack: A Prospective Multimorbidity Cohort (WHO, accessed 23 January 2026); https://www.bdi.ox.ac.uk/research/schistotrack
Peña-Rosas, J. P. & World Health Organization. Haemoglobin Concentrations for the Diagnosis of Anaemia and Assessment of Severity (WHO, accessed 23 January 2026); https://www.who.int/publications/i/item/WHO-NMH-NHD-MNM-11.1
World Health Organization. Stepwise Approach to NCD Risk Factor Surveillance (Steps) (WHO, accessed 23 January 2026); https://www.who.int/teams/noncommunicable-diseases/surveillance/systems-tools/steps
Lyne, B. et al. Co-occurrence of alcohol use and Schistosoma mansoni infection: prevalence, patterns, and risk factors in rural Uganda. Preprint at medRxiv https://doi.org/10.1101/2025.10.14.25337972 (2025).
Fu, S. et al. Multi-task deep learning network to predict future macrovascular invasion in hepatocellular carcinoma. EClinicalMedicine 42, 101201 (2021).
Silver, D. L., Poirier, R. & Currie, D. Inductive transfer with context-sensitive neural networks. Mach. Learn. 73, 313–336 (2008).
Igiraneza, A. B. et al. Learning patterns of HIV-1 resistance to broadly neutralizing antibodies with reduced subtype bias using multi-task learning. PLoS Comput. Biol. 20, e1012618 (2024).
Wang, X. et al. Multitask joint strategies of self-supervised representation learning on biomedical networks for drug discovery. Nat. Mach. Intell. 5, 445–456 (2023).
Xue, Y., Liao, X., Carin, L. & Krishnapuram, B. Multi-task learning for classification with Dirichlet process priors. J. Mach. Learn. Res. 8, 35–63 (2007).
White, B. S. et al. Bayesian multi-source regression and monocyte-associated gene expression predict bcl-2 inhibitor resistance in acute myeloid leukemia. NPJ Precis. Oncol. 5, 71 (2021).
Yuan, H., Paskov, I., Paskov, H., González, A. J. & Leslie, C. S. Multitask learning improves prediction of cancer drug sensitivity. Sci. Rep. 6, 31619 (2016).
Gelman, A., Carlin, J. B., Stern, H. S. & Rubin, D. B. Bayesian Data Analysis (Chapman and Hall/CRC, 1995).
Chawla, N. V., Bowyer, K. W., Hall, L. O. & Kegelmeyer, W. P. Smote: synthetic minority over-sampling technique. J. Artif. Intell. Res. 16, 321–357 (2002).
Friedman, J., Hastie, T. & Tibshirani, R. Sparse inverse covariance estimation with the graphical lasso. Biostatistics 9, 432–441 (2008).
Borchani, H., Varando, G., Bielza, C. & Larranaga, P. A survey on multi-output regression. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 5, 216–233 (2015).
Shuman, D. I., Narang, S. K., Frossard, P., Ortega, A. & Vandergheynst, P. The emerging field of signal processing on graphs: extending high-dimensional data analysis to networks and other irregular domains. IEEE Signal Process. Mag. 30, 83–98 (2013).
Ortega, A., Frossard, P., Kovačević, J., Moura, J. M. & Vandergheynst, P. Graph signal processing: overview, challenges, and applications. Proc. IEEE. 106, 808–828 (2018).
Griffin, J. E. & Brown, P. J. Hierarchical shrinkage priors for regression models. Bayesian Anal. 12, 135–159 (2017).
Van de Schoot, R. et al. Bayesian statistics and modelling. Nat. Rev. Methods Primers 1, 1 (2021).
Loaiza-Ganem, G. & Cunningham, J. P. The continuous Bernoulli: fixing a pervasive error in variational autoencoders. Adv. Neural Inf. Process. Syst. 32, 13287–13297 (2019).
Korkmaz, M. Ç, Leiva, V. & Martin-Barreiro, C. The continuous Bernoulli distribution: mathematical characterization, fractile regression, computational simulations, and applications. Fract. Fract. 7, 386 (2023).
Neal, R. M. et al. MCMC using Hamiltonian dynamics. In Handbook of Markov Chain Monte Carlo (Chapman and Hall/CRC, 2011).
Hoffman, M. D., Gelman, A. et al. The no-u-turn sampler: adaptively setting path lengths in Hamiltonian Monte Carlo. J. Mach. Learn. Res. 15, 1593–1623 (2014).
Bingham, E. et al. Pyro: Deep universal probabilistic programming. J. Mach. Learn. Res. 20, 1–6 (2019).
Dai, B., Ding, S. & Wahba, G. Multivariate Bernoulli distribution. Bernoulli 19, 1465–1483 (2012).
Chib, S. & Greenberg, E. Analysis of multivariate probit models. Biometrika 85, 347–361 (1998).
Storey, J. D. A direct approach to false discovery rates. J. R. Stat. Soc. B Stat. Methodol. 64, 479–498 (2002).
Storey, J. D. & Tibshirani, R. Statistical significance for genomewide studies. Proc. Natl. Acad. Sci. USA. 100, 9440–9445 (2003).
Hespanhol, L. C., Vallio, C. S., da Cunha Menezes Costa, L. & Saragiotto, B. T. Understanding and interpreting confidence and credible intervals around effect estimates. Braz. J. Phys. Ther. 23, 290–301 (2019).
Acknowledgements
We are thankful for the involvement of our study participants and the SchistoTrack teams, especially the surveyors, nurses, sonographers, and laboratory technicians. We also like to thank the Uganda Ministry of Health, local district leaders, focal health workers, and village health teams. Special thanks also to the Oxford team for the fieldwork, data wrangling, everyday discussions, and feedback. This research was funded in whole, or in part, by the UKRI EPSRC [EP/X021793/1]. For the purpose of Open Access, the author has applied a CC BY public copyright licence to any Author Accepted Manuscript version arising from this submission. NDPH Pump Priming Fund, John Fell Fund, Robertson Foundation, UKRI EPSRC (EP/X021793/1) grants were awarded to G.F.C.
Author information
Authors and Affiliations
Contributions
Conceptualisation: G.F.C. and Y.C.Z. Data curation: Y.C.Z., V.A., J.B.O., B.N., C.K.O., N.B.K. and G.F.C. Formal analysis: Y.C.Z. Investigation, methodology, visualisation: Y.C.Z. Writing - original draft: Y.C.Z. and G.F.C. Validation: Y.C.Z. and G.F.C. Writing - review and editing: Y.C.Z., V.A., J.B.O., B.N., C.K.O., N.B.K. and G.F.C. Funding acquisition and supervision: G.F.C. Resources: G.F.C.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks Andres Aldana, Matthias C Reichert and the other anonymous reviewer(s) for their contribution to the peer review of this work. A peer review file is available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Zhi, YC., Anguajibi, V., Oryema, J.B. et al. Bayesian machine learning enables discovery of risk factors for hepatosplenic multimorbidity related to schistosomiasis. Nat Commun (2026). https://doi.org/10.1038/s41467-026-69528-4
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-026-69528-4


