Machine learning predicts sepsis deterioration trajectories

Zhang, Rui; Long, Fang; Zhao, Zhanqi; Wu, Jingyi; Tan, Ruoming; Xu, Wen; Li, Lei; Long, Yun; Qu, Hongping

doi:10.1038/s41746-026-02565-x

Download PDF

Article
Open access
Published: 26 March 2026

Machine learning predicts sepsis deterioration trajectories

Rui Zhang¹^na1,
Fang Long²^na1,
Zhanqi Zhao^3,4^na1,
Jingyi Wu¹,
Ruoming Tan¹,
Wen Xu¹,
Lei Li¹,
Yun Long⁴ &
…
Hongping Qu¹

npj Digital Medicine , Article number: (2026) Cite this article

1825 Accesses
Metrics details

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

Abstract

Sepsis has heterogeneous clinical trajectories, but conventional severity scores offer only static risk estimates. Timely, dynamic prediction could enable personalized intervention. In this multicenter retrospective study of 47,936 ICU patients meeting Sepsis-3 criteria from one institutional and two public datasets (MIMIC-III, eICU; sensitivity in MIMIC-IV), group-based trajectory modeling identified latent recovery patterns. An ensemble machine-learning model incorporating dynamic physiological variability was trained, temporally validated, and externally tested; clinical impact was assessed following implementation. Three trajectories emerged: rapid recovery (41.5%), slow recovery (36.4%), and clinical deterioration (22.1%). In the final binary classification task, AUROC was 0.92 (development), 0.89 (internal), 0.84 (MIMIC-III) and 0.77 (eICU); median warning time before deterioration was 17.6 h (Overall pooled across all cohorts). Reduced heart rate variability (SD < 10 bpm) predicted mortality (adjusted HR 2.17). Implementation reduced ICU stay by 1.8 days, machanical ventilation by 2.3 days, and 28-day mortality by 5.7%. This externally validated trajectory-based model offers accurate, early risk stratification for sepsis, supporting proactive, individualized critical care.

Development of a machine learning-based prediction model for sepsis-associated delirium in the intensive care unit

Article Open access 04 August 2023

Explainable machine learning for predicting ICU mortality in myocardial infarction patients using pseudo-dynamic data

Article Open access 31 July 2025

Machine learning analysis of s-EASIX for predicting 30-day mortality in sepsis patients from MIMIC-IV

Article Open access 26 February 2026

Data availability

The data used in this study are available from the MIMIC-III and eICU Collaborative Research Database, subject to completion of the required data use agreements and credentialing. Access can be obtained via PhysioNet (https://physionet.org/) for MIMIC-III and via the eICU Collaborative Research Database program for eICU. The authors are not permitted to publicly share the underlying patient-level data. Institutional data are available from corresponding authors upon reasonable request.

Code availability

Analysis code to reproduce the experiments is available at https://github.com/ccmzhangrui/sepsis-trajectory-python-data.

References

Rudd, K. E. et al. Global, regional, and national sepsis incidence and mortality, 1990-2017: analysis for the Global Burden of Disease Study. Lancet. 395, 200–211 (2020).
Google Scholar
Prescott, H. C. & Angus, D. C. Enhancing recovery from sepsis: a review. JAMA. 319, 62–75 (2018).
Google Scholar
Iwashyna, T. J., Ely, E. W., Smith, D. M. & Langa, K. M. Long-term cognitive impairment and functional disability among survivors of severe sepsis. JAMA. 304, 1787–1794 (2010).
Google Scholar
Kennedy, M., Joyce, N., Howell, M. D., Lawrence Motley, J. & Shapiro, N. I. Identifying infected emergency department patients admitted to the hospital ward at risk of clinical deterioration and intensive care unit transfer. Acad. Emerg. Med. 17, 1080–1085 (2010).
Google Scholar
Vincent, J. L. & Moreno, R. Clinical review: scoring systems in the critically ill. Crit Care 14, 207 (2010).
Google Scholar
Raith, E. P. et al. Prognostic accuracy of the SOFA score, SIRS criteria, and qSOFA score for in-hospital mortality among adults with suspected infection admitted to the intensive care unit. JAMA 317, 290–300 (2017).
Google Scholar
Ferreira, F. L., Bota, D. P., Bross, A., Mélot, C. & Vincent, J. L. Serial evaluation of the SOFA score to predict outcome in critically ill patients. JAMA. 286, 1754–1758 (2001).
Google Scholar
Zimmerman, J. E., Kramer, A. A., McNair, D. S. & Malila, F. M. A. cuteP. hysiology and Chronic Health Evaluation (APACHE) IV: hospital mortality assessment for today’s critically ill patients. Crit. Care Med. 34, 1297–1310 (2006).
Google Scholar
Fleuren, L. M. et al. Machine learning for the prediction of sepsis: a systematic review and meta-analysis of diagnostic test accuracy. Intensive Care Med. 46, 383–400 (2020).
Google Scholar
Nemati, S. et al. An interpretable machine learning model for accurate prediction of sepsis in the ICU. Crit. Care Med. 46, 547–553 (2018).
Google Scholar
Shashikumar, S. P. et al. Early sepsis detection in critical care patients using multiscale blood pressure and heart rate dynamics. J. Electrocardiol. 50, 739–743 (2017).
Google Scholar
Henry, K. E., Hager, D. N., Pronovost, P. J. & Saria, S. A targeted real-time early warning score (TREWScore) for septic shock. Sci. Transl. Med. 7, 299ra122 (2015).
Google Scholar
Desautels, T. et al. Prediction of sepsis in the intensive care unit with minimal electronic health record data: a machine learning approach. JMIR Med. Inform. 4, e28 (2016).
Google Scholar
Futoma, J. et al. An improved multi-output Gaussian process RNN with real-time validation for early sepsis detection. Proc. Mach. Learn. Healthcare 68, 243–254 (2017).
Google Scholar
Wong, A. et al. External validation of a widely implemented proprietary sepsis prediction model in hospitalized patients. JAMA Intern. Med. 181, 1065–1070 (2021).
Google Scholar
Seymour, C. W. et al. Derivation, validation, and potential treatment implications of novel clinical phenotypes for sepsis. JAMA 321, 2003–2017 (2019).
Google Scholar
Scicluna, B. P. et al. Classification of patients with sepsis according to blood genomic endotype: a prospective cohort study. Lancet Respir. Med. 5, 816–826 (2017).
Google Scholar
Moorman, J. R. et al. Mortality reduction by heart rate characteristic monitoring in very low birth weight neonates: a randomized trial. J. Pediatr. 159, 900–906 (2011).
Google Scholar
Scheffer, M. et al. Early-warning signals for critical transitions. Nature. 461, 53–59 (2009).
Google Scholar
Ahmad, S., Tejuja, A., Newman, K. D., Zarychanski, R. & Seely, A. J. Clinical review: a review and analysis of heart rate variability and the diagnosis and prognosis of infection. Crit. Care. 13, 232 (2009).
Google Scholar
Buchman, T. G. The community of the self. Nature. 420, 246–251 (2002).
Google Scholar
Seymour, C. W. et al. Time to treatment and mortality during mandated emergency care for sepsis. N Engl. J. Med. 376, 2235–2244 (2017).
Google Scholar
Yealy, D. M. et al. A randomized trial of protocol-based care for early septic shock. N Engl. J. Med. 370, 1683–1693 (2014).
Google Scholar
Giannini, H. M. et al. A machine learning algorithm to predict severe sepsis and septic shock: development, implementation, and impact on clinical practice. Crit. Care Med. 47, 1485–1492 (2019).
Google Scholar
Shimabukuro, D. W., Barton, C. W., Feldman, M. D., Mataraso, S. J. & Das, R. Effect of a machine learning-based severe sepsis prediction algorithm on patient survival and hospital length of stay: a randomised clinical trial. BMJ Open Respir. Res. 4, e000234 (2017).
Google Scholar
Futoma, J., Hariharan, S. & Heller, K. Learning to detect sepsis with a multitask Gaussian process RNN classifier. Proc. 34th Int. Conf. Mach. Learn. 70, 1174–1182 (2017).
Google Scholar
Mao, Q. et al. Multicentre validation of a sepsis prediction algorithm using only vital sign data in the emergency department, general ward and ICU. BMJ Open 8, e017833 (2018).
Google Scholar
Reyna, M. A. et al. Early prediction of sepsis from clinical data: the PhysioNet/Computing in Cardiology Challenge 2019. Crit. Care Med. 48, 210–217 (2020).
Google Scholar
Singer, M. et al. The third international consensus definitions for sepsis and septic shock (Sepsis-3). JAMA. 315, 801–810 (2016).
Google Scholar
Seymour, C. W. et al. Assessment of clinical criteria for sepsis: for the Third International Consensus Definitions for Sepsis and Septic Shock (Sepsis-3). JAMA 315, 762–774 (2016).
Google Scholar
Rhee, C. et al. Incidence and trends of sepsis in US hospitals using clinical vs claims data, 2009-2014. JAMA 318, 1241–1249 (2017).
Google Scholar
von Elm, E. et al. The strengthening the reporting of observational studies in epidemiology (STROBE) statement: guidelines for reporting observational studies. Ann Intern Med 147, 573–577 (2007).
Google Scholar
Collins, G. S., Reitsma, J. B., Altman, D. G. & Moons, K. G. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement. BMJ. 350, g7594 (2015).
Google Scholar
Tibshirani, R. Regression shrinkage and selection via the lasso. J R Stat. Soc. Ser, B Stat Methodol. 58, 267–288 (1996).
Google Scholar
Breiman, L. Random forests. Mach. Learn. 45, 5–32 (2001).
Google Scholar
Charlson, M. E., Pompei, P., Ales, K. L. & MacKenzie, C. R. A new method of classifying prognostic comorbidity in longitudinal studies: development and validation. J. Chronic Dis. 40, 373–383 (1987).
Google Scholar
Vincent, J. L. et al. The SOFA (Sepsis-related Organ Failure Assessment) score to describe organ dysfunction/failure. Intens. Care Med. 22, 707–710 (1996).
Google Scholar
Delahanty, R. J., Alvarez, J., Flynn, L. M., Sherwin, R. L. & Jones, S. S. Development and evaluation of a machine learning model for the early identification of patients at risk for sepsis. Ann. Emerg. Med. 73, 334–344 (2019).
Google Scholar
Futoma, J. et al. An improved multi-output Gaussian process RNN with real-time validation for early sepsis detection. Proc. Mach. Learn. Healthcare. 68, 243–254 (2017).
Google Scholar
Pincus, S. M. Approximate entropy as a measure of system complexity. Proc. Natl. Acad. Sci. USA 88, 2297–2301 (1991).
Google Scholar
Van Buuren, S. & Groothuis-Oudshoorn, K. mice: multivariate imputation by chained equations in R. J. Stat. Softw. 45, 1–67 (2011).
Google Scholar
Umscheid, C. A. et al. Development, implementation, and impact of an automated early warning and response system for sepsis. J. Hosp. Med. 10, 26–31 (2015).
Google Scholar
Austin, P. C. An introduction to propensity score methods for reducing the effects of confounding in observational studies. Multivariate Behav. Res. 46, 399–424 (2011).
Google Scholar
Harrell, F. E. Jr, Lee, K. L. & Mark, D. B. Multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors. Stat. Med 15, 361–387 (1996).
Google Scholar
Altman, D. G. & Royston, P. The cost of dichotomising continuous variables. BMJ. 332, 1080 (2006).
Google Scholar

Download references

Acknowledgements

We thank all the healthcare workers involved in the diagnosis and treatment of patients in the study. Special acknowledgment goes to the critical care teams at Ruijin Hospital for their dedication to patient care and data collection. We also express our gratitude to the teams responsible for the creation and maintenance of the MIMIC-III and eICU databases, whose work made this research possible. We thank Longxiang Su and Fang Wang for their valuable support and assistance during the study.The research funded by Shanghai Science and Technology Commission (23Y11900100), Science and Technology Commission of Shanghai Municipality (24ZR1447000), National Natural Science Foundation of China(82470088) and National High-Level Hospital Clinical Research Funding (2022-PUMCH-D-005, 2022-PUMCH-B-115).

Author information

These authors contributed equally: Rui Zhang, Fang Long, Zhanqi Zhao.

Authors and Affiliations

Department of Critical Care Medicine, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
Rui Zhang, Jingyi Wu, Ruoming Tan, Wen Xu, Lei Li & Hongping Qu
Department of Critical Care Medicine, Zhuzhou Lukou District People’s Hospital, Zhuzhou, China
Fang Long
School of Biomedical Engineering, Guangzhou Medical University, Guangzhou, China
Zhanqi Zhao
Department of Critical Care Medicine, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Chinese Academy of Medical Science and Peking Union Medical College, Beijing, China
Zhanqi Zhao & Yun Long

Authors

Rui Zhang
View author publications
Search author on:PubMed Google Scholar
Fang Long
View author publications
Search author on:PubMed Google Scholar
Zhanqi Zhao
View author publications
Search author on:PubMed Google Scholar
Jingyi Wu
View author publications
Search author on:PubMed Google Scholar
Ruoming Tan
View author publications
Search author on:PubMed Google Scholar
Wen Xu
View author publications
Search author on:PubMed Google Scholar
Lei Li
View author publications
Search author on:PubMed Google Scholar
Yun Long
View author publications
Search author on:PubMed Google Scholar
Hongping Qu
View author publications
Search author on:PubMed Google Scholar

Contributions

R.Z., Z.Z, and F.L. conceived the study. R.Z., F.L., Z.Z., and R.T. performed data curation. R.Z. and F.L. developed methodology. W.X., L.L., J.W., Y.L., and H.Q. supervised the project. All authors reviewed the manuscript and approved the final version.

Corresponding authors

Correspondence to Rui Zhang, Jingyi Wu, Ruoming Tan, Wen Xu, Lei Li, Yun Long or Hongping Qu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information (download PDF )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Zhang, R., Long, F., Zhao, Z. et al. Machine learning predicts sepsis deterioration trajectories. npj Digit. Med. (2026). https://doi.org/10.1038/s41746-026-02565-x

Download citation

Received: 21 May 2025
Accepted: 09 March 2026
Published: 26 March 2026
DOI: https://doi.org/10.1038/s41746-026-02565-x