Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Advertisement

Nature Communications
  • View all journals
  • Search
  • My Account Login
  • Content Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • RSS feed
  1. nature
  2. nature communications
  3. articles
  4. article
Bayesian machine learning enables discovery of risk factors for hepatosplenic multimorbidity related to schistosomiasis
Download PDF
Download PDF
  • Article
  • Open access
  • Published: 03 March 2026

Bayesian machine learning enables discovery of risk factors for hepatosplenic multimorbidity related to schistosomiasis

  • Yin-Cong Zhi1,
  • Victor Anguajibi2,
  • John B. Oryema3,
  • Betty Nabatte4,
  • Christopher K. Opio5,
  • Narcis B. Kabatereine3 &
  • …
  • Goylette F. Chami  ORCID: orcid.org/0000-0002-4653-08461 

Nature Communications , Article number:  (2026) Cite this article

  • 3040 Accesses

  • 15 Altmetric

  • Metrics details

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

  • Gastrointestinal diseases
  • Machine learning
  • Parasitic infection
  • Risk factors

Abstract

One in 25 deaths worldwide is related to liver disease, and often with multiple hepatosplenic conditions. Yet, little is understood of the risk factors for hepatosplenic multimorbidity, especially in the context of chronic infections. We present a novel Bayesian multitask learning framework to jointly model 45 hepatosplenic conditions assessed using point-of-care B-mode ultrasound for 3155 individuals aged 5-91 years within the SchistoTrack cohort across rural Uganda, where chronic intestinal schistosomiasis is endemic. We identify distinct and shared biomedical, socioeconomic, and spatial risk factors for individual conditions and hepatosplenic multimorbidity, and introduce methods for measuring condition dependencies as risk factors. Notably, for gastro-oesophageal varices, we discover key risk factors of older age, lower haemoglobin concentration, and schistosomal periportal fibrosis. Our findings provide a compendium of risk factors to inform surveillance, triage, and follow-up, while our model enables improved prediction of hepatosplenic multimorbidity, and if validated on other anatomical systems, general multimorbidity.

Similar content being viewed by others

Using blood routine indicators to establish a machine learning model for predicting liver fibrosis in patients with Schistosoma japonicum

Article Open access 20 May 2024

The interaction of Schistosoma mansoni infection with diabetes mellitus and obesity in mice

Article Open access 09 June 2023

Deep phenotyping of patient lived experience in functional bowel disorders using machine learning

Article Open access 09 October 2025

Data availability

The raw data are protected and are not available due to data privacy laws. The metadata generated in this study are provided in the Supplementary Information.

Code availability

Synthetically generated data is provided based on random sampling of the covariates and conditions to allow the running of the code. The model implementation code is shared as supplementary material and can be found in Supplementary Code 1.

References

  1. Langenberg, C., Hingorani, A. D. & Whitty, C. J. Biological and functional multimorbidity–from mechanisms to management. Nat. Med. 29, 1649–1657 (2023).

    Google Scholar 

  2. Barnett, K. et al. Epidemiology of multimorbidity and implications for health care, research, and medical education: a cross-sectional study. Lancet 380, 37–43 (2012).

    Google Scholar 

  3. Guthrie, B., Payne, K., Alderson, P., McMurdo, M. E. & Mercer, S. W. Adapting clinical guidelines to take account of multimorbidity. BMJ 345, e6341 (2012).

    Google Scholar 

  4. Skou, S. T. et al. Multimorbidity. Nat. Rev. Dis. Primers 8, 48 (2022).

    Google Scholar 

  5. MacMahon, S. et al. Multimorbidity: A Priority for Global Health Research Vol. 127 (Academy of Medical Sciences: London, UK, 2018).

  6. Haug, N. et al. High-risk multimorbidity patterns on the road to cardiovascular mortality. BMC. Med. 18, 44 (2020).

    Google Scholar 

  7. Tai, X. Y. et al. Cardiometabolic multimorbidity, genetic risk, and dementia: a prospective cohort study. Lancet Healthy Longev. 3, e428–e436 (2022).

    Google Scholar 

  8. Souza, D. L. et al. Trends of multimorbidity in 15 European countries: a population-based study in community-dwelling adults aged 50 and over. BMC Public Health 21, 76 (2021).

    Google Scholar 

  9. Anjorin, S. et al. Epidemiology of periportal fibrosis and relevance of current Schistosoma mansoni infection within the context of repeated mass drug administration in rural Uganda: a population-based, cross-sectional study. Lancet Microbe 5, 100944 (2024).

    Google Scholar 

  10. Zhi, Y.-C. et al. Clinically validated graphical approaches identify hepatosplenic multimorbidity in individuals at risk of schistosomiasis. R. Soc. Open Sci. 12, 242256 (2025).

    Google Scholar 

  11. Opio, C. K. et al. Profiling lifetime episodes of upper gastrointestinal bleeding among patients from rural sub-Saharan Africa where Schistosoma mansoni is endemic. Pan. Afr. Med. J. 24, 296 (2016).

    Google Scholar 

  12. Wiegand, R. E. et al. Associations between infection intensity categories and morbidity prevalence in school-age children are much stronger for Schistosoma haematobium than for S. mansoni. PLoS Negl. Trop. Dis. 15, e0009444 (2021).

    Google Scholar 

  13. World Health Organization. WHO Guideline on Control and Elimination of Human Schistosomiasis (WHO, accessed 23 January 2026); https://www.who.int/publications/i/item/9789240041608

  14. Ewuzie, A. et al. Association of current Schistosoma mansoni, Schistosoma japonicum, and Schistosoma mekongi infection status and intensity with periportal fibrosis: a systematic review and meta-analysis. Lancet Glob. Health 13, e69–e80 (2025).

    Google Scholar 

  15. Langenberg, M. C. et al. A controlled human Schistosoma mansoni infection model to advance novel drugs, vaccines and diagnostics. Nat. Med. 26, 326–332 (2020).

    Google Scholar 

  16. Lee, S. I. et al. Epidemiology of pre-existing multimorbidity in pregnant women in the uk in 2018: a cross-sectional study. Lancet 398, S7 (2021).

    Google Scholar 

  17. Sewpaul, R., Mbewu, A. D., Fagbamigbe, A. F., Kandala, N.-B. & Reddy, S. P. Prevalence of multimorbidity of cardiometabolic conditions and associated risk factors in a population-based sample of South Africans: a cross-sectional study. Public Health Pract. 2, 100193 (2021).

    Google Scholar 

  18. Caruana, R. Multitask learning. Mach. Learn. 28, 41–75 (1997).

    Google Scholar 

  19. Collobert, R. et al. Natural language processing (almost) from scratch. J. Mach. Learn. Res. 12, 2493–2537 (2011).

    Google Scholar 

  20. Zhang, Q., Zhang, G., Lu, J. & Wu, D. A framework of hybrid recommender system for personalized clinical prescription. In 2015 10Th International Conference on Intelligent Systems and Knowledge Engineering (ISKE) 189–195 (IEEE, 2015).

  21. World Health Organization. Ultrasound in Schistosomiasis: A Practical Guide to the Standard Use of Ultrasonography for Assessment of Schistosomiasis-Related Morbidity (WHO, accessed 23 January 2026); https://www.who.int/publications/i/item/TDR-STR-SCH-00.1

  22. Puthur, C., Nabatte, B., Tinkitina, B., Kabatereine, N. B. & Chami, G. F. Design and maintenance of community-based cohorts in sub-Saharan Africa: a longitudinal evaluation of participant attrition in schistotrack. BMJ Public Health 3, e002151 (2025).

    Google Scholar 

  23. Chowdhury, S. R., Das, D. C., Sunna, T. C., Beyene, J. & Hossain, A. Global and regional prevalence of multimorbidity in the adult population in community settings: a systematic review and meta-analysis. EClinicalMedicine 57, 101860 (2023).

    Google Scholar 

  24. Jiang, Y. et al. Ukb-mdrmf: a multi-disease risk and multimorbidity framework based on UK Biobank data. Nat. Commun. 16, 3767 (2025).

    Google Scholar 

  25. Lv, Y. et al. Hypersplenism: history and current status. Exp. Ther. Med. 12, 2377–2382 (2016).

    Google Scholar 

  26. Chami, G. F. et al. Influence of Schistosoma mansoni and hookworm infection intensities on anaemia in Ugandan villages. PLoS Negl. Trop. Dis. 9, e0004193 (2015).

    Google Scholar 

  27. Topley, J. M., Rogers, D., Stevens, M. & Serjeant, G. R. Acute splenic sequestration and hypersplenism in the first five years in homozygous sickle cell disease. Arch. Dis. Child. 56, 765–769 (1981).

    Google Scholar 

  28. Reitzug, F. et al. Current water contact and Schistosoma mansoni infection have distinct determinants: a data-driven population-based study in rural Uganda. Nat. Commun. 15, 9530 (2024).

    Google Scholar 

  29. World Health Organization. Guideline on Haemoglobin Cutoffs to Define Anaemia in Individuals and Populations (WHO, accessed 23 January 2026); https://www.who.int/publications/i/item/9789240088542

  30. McConnell, M. J. & Iwakiri, Y. Portal hypertension in alcohol-associated hepatitis. Curr. Hepatol. Rep. 22, 67–73 (2023).

    Google Scholar 

  31. Reitzug, F., Ledien, J. & Chami, G. F. Associations of water contact frequency, duration, and activities with schistosome infection risk: a systematic review and meta-analysis. PLoS Negl. Trop. Dis. 17, e0011377 (2023).

    Google Scholar 

  32. Watermeyer, G. et al. Inflammatory bowel disease in sub-saharan africa: epidemiology, risk factors, and challenges in diagnosis. Lancet Gastroenterol. Hepatol. 7, 952–961 (2022).

    Google Scholar 

  33. Watermeyer, G. et al. Challenges in the management of inflammatory bowel disease in sub-saharan africa. Lancet Gastroenterol. Hepatol. 7, 962–972 (2022).

    Google Scholar 

  34. del Olmo, J. A. et al. Predictors of morbidity and mortality after the first episode of upper gastrointestinal bleeding in liver cirrhosis. J. Hepatol. 32, 19–24 (2000).

    Google Scholar 

  35. Bosch, J. & García-Pagán, J. C. Complications of cirrhosis. I. Portal hypertension. J. Hepatol. 32, 141–156 (2000).

    Google Scholar 

  36. Berzigotti, A. Non-invasive evaluation of portal hypertension using ultrasound elastography. J. Hepatol. 67, 399–411 (2017).

    Google Scholar 

  37. Rumack, C. M. & Levine, D. Diagnostic Ultrasound E-book (Elsevier Health Sciences, 2023).

  38. Goncalves, A. et al. Bayesian multitask learning regression for heterogeneous patient cohorts. J. Biomed. Inform. 100, 100059 (2019).

    Google Scholar 

  39. Cheng, Y., Xia, Y. & Wang, X. Bayesian multitask learning for medicine recommendation based on online patient reviews. Bioinformatics 39, btad491 (2023).

    Google Scholar 

  40. Seitz, H. K. et al. Alcoholic liver disease. Nat. Rev. Dis. Primers 4, 16 (2018).

    Google Scholar 

  41. Nuffield Department of Population Health, University of Oxford. Schistotrack: A Prospective Multimorbidity Cohort (WHO, accessed 23 January 2026); https://www.bdi.ox.ac.uk/research/schistotrack

  42. Peña-Rosas, J. P. & World Health Organization. Haemoglobin Concentrations for the Diagnosis of Anaemia and Assessment of Severity (WHO, accessed 23 January 2026); https://www.who.int/publications/i/item/WHO-NMH-NHD-MNM-11.1

  43. World Health Organization. Stepwise Approach to NCD Risk Factor Surveillance (Steps) (WHO, accessed 23 January 2026); https://www.who.int/teams/noncommunicable-diseases/surveillance/systems-tools/steps

  44. Lyne, B. et al. Co-occurrence of alcohol use and Schistosoma mansoni infection: prevalence, patterns, and risk factors in rural Uganda. Preprint at medRxiv https://doi.org/10.1101/2025.10.14.25337972 (2025).

  45. Fu, S. et al. Multi-task deep learning network to predict future macrovascular invasion in hepatocellular carcinoma. EClinicalMedicine 42, 101201 (2021).

    Google Scholar 

  46. Silver, D. L., Poirier, R. & Currie, D. Inductive transfer with context-sensitive neural networks. Mach. Learn. 73, 313–336 (2008).

    Google Scholar 

  47. Igiraneza, A. B. et al. Learning patterns of HIV-1 resistance to broadly neutralizing antibodies with reduced subtype bias using multi-task learning. PLoS Comput. Biol. 20, e1012618 (2024).

    Google Scholar 

  48. Wang, X. et al. Multitask joint strategies of self-supervised representation learning on biomedical networks for drug discovery. Nat. Mach. Intell. 5, 445–456 (2023).

    Google Scholar 

  49. Xue, Y., Liao, X., Carin, L. & Krishnapuram, B. Multi-task learning for classification with Dirichlet process priors. J. Mach. Learn. Res. 8, 35–63 (2007).

    Google Scholar 

  50. White, B. S. et al. Bayesian multi-source regression and monocyte-associated gene expression predict bcl-2 inhibitor resistance in acute myeloid leukemia. NPJ Precis. Oncol. 5, 71 (2021).

    Google Scholar 

  51. Yuan, H., Paskov, I., Paskov, H., González, A. J. & Leslie, C. S. Multitask learning improves prediction of cancer drug sensitivity. Sci. Rep. 6, 31619 (2016).

    Google Scholar 

  52. Gelman, A., Carlin, J. B., Stern, H. S. & Rubin, D. B. Bayesian Data Analysis (Chapman and Hall/CRC, 1995).

  53. Chawla, N. V., Bowyer, K. W., Hall, L. O. & Kegelmeyer, W. P. Smote: synthetic minority over-sampling technique. J. Artif. Intell. Res. 16, 321–357 (2002).

    Google Scholar 

  54. Friedman, J., Hastie, T. & Tibshirani, R. Sparse inverse covariance estimation with the graphical lasso. Biostatistics 9, 432–441 (2008).

    Google Scholar 

  55. Borchani, H., Varando, G., Bielza, C. & Larranaga, P. A survey on multi-output regression. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 5, 216–233 (2015).

    Google Scholar 

  56. Shuman, D. I., Narang, S. K., Frossard, P., Ortega, A. & Vandergheynst, P. The emerging field of signal processing on graphs: extending high-dimensional data analysis to networks and other irregular domains. IEEE Signal Process. Mag. 30, 83–98 (2013).

    Google Scholar 

  57. Ortega, A., Frossard, P., Kovačević, J., Moura, J. M. & Vandergheynst, P. Graph signal processing: overview, challenges, and applications. Proc. IEEE. 106, 808–828 (2018).

    Google Scholar 

  58. Griffin, J. E. & Brown, P. J. Hierarchical shrinkage priors for regression models. Bayesian Anal. 12, 135–159 (2017).

    Google Scholar 

  59. Van de Schoot, R. et al. Bayesian statistics and modelling. Nat. Rev. Methods Primers 1, 1 (2021).

    Google Scholar 

  60. Loaiza-Ganem, G. & Cunningham, J. P. The continuous Bernoulli: fixing a pervasive error in variational autoencoders. Adv. Neural Inf. Process. Syst. 32, 13287–13297 (2019).

    Google Scholar 

  61. Korkmaz, M. Ç, Leiva, V. & Martin-Barreiro, C. The continuous Bernoulli distribution: mathematical characterization, fractile regression, computational simulations, and applications. Fract. Fract. 7, 386 (2023).

    Google Scholar 

  62. Neal, R. M. et al. MCMC using Hamiltonian dynamics. In Handbook of Markov Chain Monte Carlo (Chapman and Hall/CRC, 2011).

  63. Hoffman, M. D., Gelman, A. et al. The no-u-turn sampler: adaptively setting path lengths in Hamiltonian Monte Carlo. J. Mach. Learn. Res. 15, 1593–1623 (2014).

    Google Scholar 

  64. Bingham, E. et al. Pyro: Deep universal probabilistic programming. J. Mach. Learn. Res. 20, 1–6 (2019).

    Google Scholar 

  65. Dai, B., Ding, S. & Wahba, G. Multivariate Bernoulli distribution. Bernoulli 19, 1465–1483 (2012).

    Google Scholar 

  66. Chib, S. & Greenberg, E. Analysis of multivariate probit models. Biometrika 85, 347–361 (1998).

    Google Scholar 

  67. Storey, J. D. A direct approach to false discovery rates. J. R. Stat. Soc. B Stat. Methodol. 64, 479–498 (2002).

    Google Scholar 

  68. Storey, J. D. & Tibshirani, R. Statistical significance for genomewide studies. Proc. Natl. Acad. Sci. USA. 100, 9440–9445 (2003).

    Google Scholar 

  69. Hespanhol, L. C., Vallio, C. S., da Cunha Menezes Costa, L. & Saragiotto, B. T. Understanding and interpreting confidence and credible intervals around effect estimates. Braz. J. Phys. Ther. 23, 290–301 (2019).

    Google Scholar 

Download references

Acknowledgements

We are thankful for the involvement of our study participants and the SchistoTrack teams, especially the surveyors, nurses, sonographers, and laboratory technicians. We also like to thank the Uganda Ministry of Health, local district leaders, focal health workers, and village health teams. Special thanks also to the Oxford team for the fieldwork, data wrangling, everyday discussions, and feedback. This research was funded in whole, or in part, by the UKRI EPSRC [EP/X021793/1]. For the purpose of Open Access, the author has applied a CC BY public copyright licence to any Author Accepted Manuscript version arising from this submission. NDPH Pump Priming Fund, John Fell Fund, Robertson Foundation, UKRI EPSRC (EP/X021793/1) grants were awarded to G.F.C.

Author information

Authors and Affiliations

  1. Big Data Institute, Nuffield Department of Population Health, University of Oxford, Oxford, UK

    Yin-Cong Zhi & Goylette F. Chami

  2. Uganda Institute of Allied Health Sciences, Kampala, Uganda

    Victor Anguajibi

  3. Pakwach Local District Government, Uganda Ministry of Health, Pakwach Town, Uganda

    John B. Oryema & Narcis B. Kabatereine

  4. Division of Vector-Borne and Neglected Tropical Diseases Control, Uganda Ministry of Health, Kampala, Uganda

    Betty Nabatte

  5. Aga Khan University Hospital, Nairobi, Kenya

    Christopher K. Opio

Authors
  1. Yin-Cong Zhi
    View author publications

    Search author on:PubMed Google Scholar

  2. Victor Anguajibi
    View author publications

    Search author on:PubMed Google Scholar

  3. John B. Oryema
    View author publications

    Search author on:PubMed Google Scholar

  4. Betty Nabatte
    View author publications

    Search author on:PubMed Google Scholar

  5. Christopher K. Opio
    View author publications

    Search author on:PubMed Google Scholar

  6. Narcis B. Kabatereine
    View author publications

    Search author on:PubMed Google Scholar

  7. Goylette F. Chami
    View author publications

    Search author on:PubMed Google Scholar

Contributions

Conceptualisation: G.F.C. and Y.C.Z. Data curation: Y.C.Z., V.A., J.B.O., B.N., C.K.O., N.B.K. and G.F.C. Formal analysis: Y.C.Z. Investigation, methodology, visualisation: Y.C.Z. Writing - original draft: Y.C.Z. and G.F.C. Validation: Y.C.Z. and G.F.C. Writing - review and editing: Y.C.Z., V.A., J.B.O., B.N., C.K.O., N.B.K. and G.F.C. Funding acquisition and supervision: G.F.C. Resources: G.F.C.

Corresponding author

Correspondence to Goylette F. Chami.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Andres Aldana, Matthias C Reichert and the other anonymous reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information (download PDF )

Peer Review file (download PDF )

Description of Additional Supplementary Files (download PDF )

Supplementary Code 1 (download ZIP )

Reporting Summary (download PDF )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhi, YC., Anguajibi, V., Oryema, J.B. et al. Bayesian machine learning enables discovery of risk factors for hepatosplenic multimorbidity related to schistosomiasis. Nat Commun (2026). https://doi.org/10.1038/s41467-026-69528-4

Download citation

  • Received: 23 September 2025

  • Accepted: 03 February 2026

  • Published: 03 March 2026

  • DOI: https://doi.org/10.1038/s41467-026-69528-4

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Download PDF

Associated content

Collection

Digital Medicine for Infectious Diseases

Focus

Microbiology and infectious diseases

Advertisement

Explore content

  • Research articles
  • Reviews & Analysis
  • News & Comment
  • Videos
  • Collections
  • Subjects
  • Follow us on Facebook
  • Follow us on X
  • Sign up for alerts
  • RSS feed

About the journal

  • Aims & Scope
  • Editors
  • Journal Information
  • Open Access Fees and Funding
  • Calls for Papers
  • Editorial Values Statement
  • Journal Metrics
  • Editors' Highlights
  • Contact
  • Editorial policies
  • Top Articles

Publish with us

  • For authors
  • For Reviewers
  • Language editing services
  • Open access funding
  • Submit manuscript

Search

Advanced search

Quick links

  • Explore articles by subject
  • Find a job
  • Guide to authors
  • Editorial policies

Nature Communications (Nat Commun)

ISSN 2041-1723 (online)

nature.com footer links

About Nature Portfolio

  • About us
  • Press releases
  • Press office
  • Contact us

Discover content

  • Journals A-Z
  • Articles by subject
  • protocols.io
  • Nature Index

Publishing policies

  • Nature portfolio policies
  • Open access

Author & Researcher services

  • Reprints & permissions
  • Research data
  • Language editing
  • Scientific editing
  • Nature Masterclasses
  • Research Solutions

Libraries & institutions

  • Librarian service & tools
  • Librarian portal
  • Open research
  • Recommend to library

Advertising & partnerships

  • Advertising
  • Partnerships & Services
  • Media kits
  • Branded content

Professional development

  • Nature Awards
  • Nature Careers
  • Nature Conferences

Regional websites

  • Nature Africa
  • Nature China
  • Nature India
  • Nature Japan
  • Nature Middle East
  • Privacy Policy
  • Use of cookies
  • Legal notice
  • Accessibility statement
  • Terms & Conditions
  • Your US state privacy rights
Springer Nature

© 2026 Springer Nature Limited

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing