A multimodal embedding model for sepsis data representation

Liu, Tuo; Li, Yonglin; Chen, Hongyi; Li, Naiqing; Zhang, Yan; Huang, Xuanqi; Wang, Jin; Chen, Rui; Zeng, Yuping; Liu, Yuntao; Zheng, Danwen; Wu, Darong; Wang, Changdong; Yu, Tao; Xi, Xiaotu; Zhang, Zhongde

doi:10.1038/s41746-026-02446-3

Download PDF

Article
Open access
Published: 23 February 2026

A multimodal embedding model for sepsis data representation

Tuo Liu¹^na1,
Yonglin Li^2,3,4^na1,
Hongyi Chen¹,
Naiqing Li¹,
Yan Zhang^5,6,
Xuanqi Huang¹,
Jin Wang^2,3,4,
Rui Chen^3,4,
Yuping Zeng⁷,
Yuntao Liu^3,8,
Danwen Zheng^3,4,
Darong Wu^3,8,
Changdong Wang¹,
Tao Yu⁵,
Xiaotu Xi^3,4 &
…
Zhongde Zhang^3,8

npj Digital Medicine , Article number: (2026) Cite this article

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

Abstract

Sepsis research has long been constrained by limited labeled data and models designed for specific tasks that primarily rely on tabular inputs, overlooking the valuable insights contained in clinical text. To address these limitations, we propose the Sepsis Data Representation Model (SepsisDRM), an embedding model that jointly processes tabular and textual data to capture comprehensive patient representations. Trained on a dataset comprising 19,526 sepsis patients, SepsisDRM demonstrates strong generalization across diverse sepsis-related tasks without task-specific tuning. It effectively stratifies patients into four clinically interpretable phenotypes and achieves robust performance in predicting 28-day outcomes, with AUC scores of 0.92, 0.94, and 0.78 on retrospective, prospective, and external datasets, respectively. As the first embedding model developed specifically for sepsis, SepsisDRM establishes a novel paradigm for sepsis research and offers a promising approach for studies in other fields that involve the integration of both tabular and textual data.

Data availability

GDHCM dataset used to train SepsisDRM, and GDHCM retrospective dataset, GDHCM prospective dataset, SYSMH external validation dataset used to test SepsisDRM, are not publicly available due to its potentially identifiable nature.

Code availability

To ensure long-term accessibility and facilitate reproducibility, the complete source code of SepsisDRM, along with the synthetically generated toy datasets and pre-trained model weights, has been archived on Zenodo with the persistent identifier https://doi.org/10.5281/zenodo.17828465. The repository includes detailed documentation and environment configuration files.

References

Evans, L. et al. Surviving sepsis campaign: International guidelines for management of sepsis and septic shock 2021. Intensiv. Care Med. 47, 1181–1247 (2021).
Google Scholar
Levy, M. M. et al. 2001 SCCM/ESICM/ACCP/ATS/SIS International Sepsis Definitions Conference. Crit. Care Med. 31, 1250 (2003).
Google Scholar
Rudd, K. E. et al. Global, regional, and national sepsis incidence and mortality, 1990–2017: analysis for the Global Burden of Disease Study. Lancet (Lond., Engl.) 395, 200–211 (2020).
Google Scholar
Vincent, J.-L. et al. Sepsis in European intensive care units: results of the SOAP study. Crit. Care Med. 34, 344–353 (2006).
Google Scholar
Leligdowicz, A. et al. Association between source of infection and hospital mortality in patients who have septic shock. Am. J. Respir. Crit. Care Med. 189, 1204–1213 (2014).
Google Scholar
Antonucci, E. et al. Myocardial depression in sepsis: from pathogenesis to clinical manifestations and treatment. J. Crit. Care 29, 500–511 (2014).
Google Scholar
Seymour, C. W. et al. Derivation, validation, and potential treatment implications of novel clinical phenotypes for sepsis. JAMA 321, 2003–2017 (2019).
Google Scholar
Zhang, Z. et al. Exploring disease axes as an alternative to distinct clusters for characterizing sepsis heterogeneity. Intensiv. Care Med. 49, 1349–1359 (2023).
Google Scholar
Guo, F. et al. Clinical applications of machine learning in the survival prediction and classification of sepsis: coagulation and heparin usage matter. J. Transl. Med. 20, 265 (2022).
Google Scholar
Yan, F. et al. Association between the stress hyperglycemia ratio and 28-day all-cause mortality in critically ill patients with sepsis: a retrospective cohort study and predictive model establishment based on machine learning. Cardiovasc. Diabetol. 23, 163 (2024).
Google Scholar
Ibarra-Estrada, M. et al. Early adjunctive methylene blue in patients with septic shock: a randomized controlled trial. Crit. Care 27, 110 (2023).
Google Scholar
Gabarre, P. et al. Albumin versus saline infusion for sepsis-related peripheral tissue hypoperfusion: a proof-of-concept prospective study. Crit. Care 28, 43 (2024).
Google Scholar
Moor, M. et al. Foundation models for generalist medical artificial intelligence. Nature 616, 259–265 (2023).
Google Scholar
Xu, H. et al. A whole-slide foundation model for digital pathology from real-world data. Nature 630, 181–188 (2024).
Google Scholar
Pai, S. et al. Foundation model for cancer imaging biomarkers. Nat. Mach. Intell. 6, 354–367 (2024).
Google Scholar
Lu, M. Y. et al. A visual-language foundation model for computational pathology. Nat. Med. 30, 863–874 (2024).
Google Scholar
Christensen, M., Vukadinovic, M. & Yuan, N. Vision-language foundation model for echocardiogram interpretation. Nat. Med. 30, 1481–1488 (2024).
Google Scholar
Desautels, T. et al. Prediction of early unplanned intensive care unit readmission in a UK tertiary care hospital: a cross-sectional machine learning approach. BMJ Open 7, e017199 (2017).
Google Scholar
Shashikumar, S. P., Shah, A. J., Li, Q., Clifford, G. D. & Nemati, S. A deep learning approach to monitoring and detecting atrial fibrillation using wearable technology. In 2017 IEEE EMBS International Conference on Biomedical & Health Informatics (BHI), 141–144 (IEEE, 2017).
Huang, K., Altosaar, J. & Ranganath, R. Clinicalbert: Modeling clinical notes and predicting hospital readmission. arXiv preprint arXiv:1904.05342 (2019).
Alsentzer, E. et al. Publicly available clinical bert embeddings. In Proceedings of the 2nd Clinical Natural Language Processing Workshop, 72–78 (2019).
Cheerla, A. & Gevaert, O. Deep learning with multimodal representation for pancancer prognosis prediction. Bioinformatics 35, i446–i454 (2019).
Google Scholar
Kline, A. et al. Multimodal machine learning in precision health: a scoping review. NPJ Digit. Med. 5, 171 (2022).
Google Scholar
Fleuren, L. M. et al. Machine learning for the prediction of sepsis: a systematic review and meta-analysis of diagnostic test accuracy. Intensiv. Care Med. 46, 383–400 (2020).
Google Scholar
Zuin, G. et al. Prediction of SARS-CoV-2 positivity from million-scale complete blood counts using machine learning. Commun. Med. 2, 72 (2022).
Google Scholar
Liu, Y. et al. Roberta: a robustly optimized BERT pretraining approach. arXiv:1907.11692 (2019).
Song, Y. et al. Xuebijing injection versus placebo for critically ill patients with severe community-acquired pneumonia: a randomized controlled trial. Crit. Care Med. 47, e735–e743 (2019).
Google Scholar
Li, C. et al. The current evidence for the treatment of sepsis with xuebijing injection: bioactive constituents, findings of clinical studies and potential mechanisms. J. Ethnopharmacol. 265, 113301 (2021).
Google Scholar
Sinha, P. et al. Identifying molecular phenotypes in sepsis: an analysis of two prospective observational cohorts and secondary analysis of two randomised controlled trials. Lancet Respir. Med. 11, 965–974 (2023).
Google Scholar
G, E. et al. Sepsis-induced endothelial dysfunction drives acute-on-chronic liver failure through angiopoietin-2-HGF-C/EBPβ pathway. Hepatology (Baltimore, MD) 78, (2023) https://pubmed.ncbi.nlm.nih.gov/36943063/.
Cheng, C. et al. Pharmacologically significant constituents collectively responsible for anti-sepsis action of XueBiJing, a Chinese herb-based intravenous formulation. Acta Pharmacol. Sin. 45, 1077–1092 (2024).
Google Scholar
Rey, C. et al. Procalcitonin and c-reactive protein as markers of systemic inflammatory response syndrome severity in critically ill children. Intensiv. Care Med. 33, 477–484 (2007).
Google Scholar
Pierrakos, C. & Vincent, J.-L. Sepsis biomarkers: a review. Crit. Care 14, R15 (2010).
Google Scholar
Cawley, G. C. & Talbot, N. L. On over-fitting in model selection and subsequent selection bias in performance evaluation. J. Mach. Learn. Res. 11, 2079–2107 (2010).
Google Scholar
Kenward, M. G. & Carpenter, J. Multiple imputation: current perspectives. Stat. Methods Med. Res. 16, 199–218 (2007).
Google Scholar
MacQueen, J. Some methods for classification and analysis of multivariate observations. In Proc. of the Fifth Berkeley Symposium on Mathematical Statistics and Probability Vol. 1 (eds Le Cam, L. M. & Neyman, J.), 281–297 (University of California Press, 1967).
Ward Jr, J. H. Hierarchical grouping to optimize an objective function. J. Am. Stat. Assoc. 58, 236–244 (1963).
Google Scholar
Ng, A. Y., Jordan, M. I. & Weiss, Y. On spectral clustering: analysis and an algorithm. Adv. Neural Inf. Process. Syst. (NeurIPS) 14, 849–856 (2002).
Google Scholar
Reynolds, D. A. Gaussian mixture models. In Encyclopedia of Biometrics (eds Li, S. Z. & Jain, A.), 659–663 (Springer US, 2009).
Ester, M., Kriegel, H.-P., Sander, J. & Xu, X. A density-based algorithm for discovering clusters in large spatial databases with noise. In Proc. of the Second International Conference on Knowledge Discovery and Data Mining (KDD) (eds Simoudis, E., Han, J. & Fayyad, U. M.), 226–231 (AAAI Press, 1996).
Rousseeuw, P. J. Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J. Comput. Appl. Math. 20, 53–65 (1987).
Google Scholar
Caliński, T. & Harabasz, J. A dendrite method for cluster analysis. Commun. Stat.-theory Methods 3, 1–27 (1974).
Google Scholar
Davies, D. L. & Bouldin, D. W. A cluster separation measure. IEEE Trans. Pattern Anal. Mach. Intell. PAMI-1, 224–227 (1979).
Google Scholar
Hosmer Jr, D. W., Lemeshow, S. & Sturdivant, R. X. Applied Logistic Regression (John Wiley & Sons, 2013).
Gorishniy, Y., Rubachev, I., Khrulkov, V. & Babenko, A. Revisiting deep learning models for tabular data. Adv. Neural Inf. Process. Syst. 34, 18932–18943 (2021).
Google Scholar
Salton, G. & Buckley, C. Term-weighting approaches in automatic text retrieval. In Information Processing & Management Vol. 24, 513–523 (Elsevier, 1988).
Devlin, J., Chang, M.-W., Lee, K. & Toutanova, K. Bert: Pre-training of deep bidirectional transformers for language understanding. In Proc. of NAACL-HLT, 4171–4186 (2019).
Beltagy, I., Peters, M. E. & Cohan, A. Longformer: the long-document transformer. arXiv preprint arXiv:2004.05150 (2020).
Zaheer, M. et al. Big Bird: transformers for longer sequences. In Advances in Neural Information Processing Systems Vol. 33 (eds Larochelle, H. et al.), 17283–17297 (Curran Associates, Inc., 2020).
Ngiam, J. et al. Multimodal deep learning. In Proc. of the 28th International Conference on Machine Learning (ICML) (eds Getoor, L. & Scheffer, T.) 689–696 (International Machine Learning Society, 2011).
Baltrušaitis, T., Ahuja, C. & Morency, L.-P. Multimodal machine learning: a survey and taxonomy. IEEE Trans. Pattern Anal. Mach. Intell. 41, 423–443 (2019).
Google Scholar
Tsai, Y.-H. H. et al. Multimodal transformer for unaligned multimodal language sequences. In Proc. of ACL (eds Korhonen, A., Traum, D. & Márton, G.) 6558–6569 (Association for Computational Linguistics, 2019).
Lu, J., Batra, D., Parikh, D. & Lee, S. Vilbert: Pretraining task-agnostic visiolinguistic representations for vision-and-language tasks. In Advances in Neural Information Processing Systems Vol. 32 (eds Wallach, H. et al.) (Curran Associates, Inc., 2019).
Alberti, C. et al. Fusion of detected objects in text for visual question answering. In Proc. of the 2019 Conference on Empirical Methods in Natural Language Processing (EMNLP-IJCNLP) (eds Inui, K. et al.) 2131–2140 (Association for Computational Linguistics, 2019).
Loshchilov, I. & Hutter, F. Decoupled weight decay regularization. In International Conference on Learning Representations (ICLR) (eds Bach, F. & Blei, D.) (OpenReview.net, 2019).
Lin, T.-Y., Goyal, P., Girshick, R., He, K. & Dollár, P. Focal loss for dense object detection. In Proc. of the IEEE International Conference on Computer Vision (ICCV) (eds Venice, G. et al.) 2980–2988 (IEEE Computer Society, 2017).
Gao, T., Yao, X. & Chen, D. Simcse: Simple contrastive learning of sentence embeddings. In Proc. of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP) (eds Moens, M., Huang, X., Specia, L. & Yih, S. W.) 6894–6910 (Association for Computational Linguistics, 2021).
Lilliefors, H. W. On the Kolmogorov–Smirnov test for normality with mean and variance unknown. J. Am. Stat. Assoc. 62, 399–402 (1967).
Google Scholar
Fisher, R. A. Statistical methods for research workers. In Breakthroughs in Statistics, Springer Series in Statistics (eds Kotz, S. & Johnson, N. L.) 66–70 (Springer, 1992).
Benjamini, Y. & Hochberg, Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc. Ser. B (Methodological) 57, 289–300 (1995).
Google Scholar
Kruskal, W. H. & Wallis, W. A. Use of ranks in one-criterion variance analysis. J. Am. Stat. Assoc. 47, 583–621 (1952).
Google Scholar
Dunn, O. J. Multiple comparisons using rank sums. Technometrics 6, 241–252 (1964).
Google Scholar
Paszke, A. et al. PyTorch: An imperative style, high-performance deep learning library. In Advances in Neural Information Processing Systems, Vol. 32: Annual Conference on Neural Information Processing Systems 2019, NeurIPS 2019, December 8–14, 2019, Vancouver, BC, Canada 8024–8035 (2019).

Download references

Acknowledgements

This work was supported by the National Key Research and Development Program of China (2024YFA1011900), Science and Technology Program of Guangzhou, China (2024A03J1188), Guangdong Provincial Key Laboratory of Research on Emergency in TCM (2023B1212060062), National Natural Science Foundation of China (82374392), the Incubation Program for the Science and Technology Development of Chinese Medicine Guangdong Laboratory (HQL2024PZ022), National Major Projects for Science and Technology Development (2025ZD01903002), and Guangdong Healthcare Talent Development Project (0720240226). The authors sincerely thank all clinicians and data management staff involved in this study for their valuable assistance. The funder had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Author information

These authors contributed equally: Tuo Liu, Yonglin Li.

Authors and Affiliations

School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China
Tuo Liu, Hongyi Chen, Naiqing Li, Xuanqi Huang & Changdong Wang
Guangzhou University of Chinese Medicine, Guangzhou, China
Yonglin Li & Jin Wang
The Second Affiliated Hospital of Guangzhou University of Chinese Medicine (Guangdong Provincial Hospital of Chinese Medicine), Guangzhou, China
Yonglin Li, Jin Wang, Rui Chen, Yuntao Liu, Danwen Zheng, Darong Wu, Xiaotu Xi & Zhongde Zhang
Guangdong Provincial Key Laboratory of Research on Emergency in TCM, Guangzhou, China
Yonglin Li, Jin Wang, Rui Chen, Danwen Zheng & Xiaotu Xi
Department of Emergency Medicine, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Guangzhou, China
Yan Zhang & Tao Yu
Institute of Cardiopulmonary Cerebral Resuscitation, Sun Yat-Sen University, Guangzhou, China
Yan Zhang
Information Management Office, The Second Affiliated Hospital of Guangzhou University of Chinese Medicine (Guangdong Provincial Hospital of Chinese Medicine), Guangzhou, China
Yuping Zeng
State Key Laboratory of Traditional Chinese Medicine Syndrome, Guangzhou, China
Yuntao Liu, Darong Wu & Zhongde Zhang

Authors

Tuo Liu
View author publications
Search author on:PubMed Google Scholar
Yonglin Li
View author publications
Search author on:PubMed Google Scholar
Hongyi Chen
View author publications
Search author on:PubMed Google Scholar
Naiqing Li
View author publications
Search author on:PubMed Google Scholar
Yan Zhang
View author publications
Search author on:PubMed Google Scholar
Xuanqi Huang
View author publications
Search author on:PubMed Google Scholar
Jin Wang
View author publications
Search author on:PubMed Google Scholar
Rui Chen
View author publications
Search author on:PubMed Google Scholar
Yuping Zeng
View author publications
Search author on:PubMed Google Scholar
Yuntao Liu
View author publications
Search author on:PubMed Google Scholar
Danwen Zheng
View author publications
Search author on:PubMed Google Scholar
Darong Wu
View author publications
Search author on:PubMed Google Scholar
Changdong Wang
View author publications
Search author on:PubMed Google Scholar
Tao Yu
View author publications
Search author on:PubMed Google Scholar
Xiaotu Xi
View author publications
Search author on:PubMed Google Scholar
Zhongde Zhang
View author publications
Search author on:PubMed Google Scholar

Contributions

T.L.: study conceptualization and design, construction of model, technical implementation, data analysis, statistical analysis, manuscript drafting; Y.L.: study conceptualization and design, data preparation, resources, statistical analysis, manuscript drafting; H.C.: study conceptualization and design, construction of model, data analysis; N.L.: study conceptualization and design, data preparation, statistical analysis; Y.Z., X.H.: data analysis, statistical analysis; J.W., R.C., Y.Z., and Y.L.: resources; D.Z.: data analysis; D.W.: resources; C.W.: study conceptualization and design, construction of model, manuscript drafting; T.Y. and X.X.: study conceptualization and design, data preparation, resources; Z.Z.: study conceptualization and design, data preparation, resources, manuscript drafting. All authors reviewed and approved the final manuscript.

Corresponding authors

Correspondence to Changdong Wang, Tao Yu, Xiaotu Xi or Zhongde Zhang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Liu, T., Li, Y., Chen, H. et al. A multimodal embedding model for sepsis data representation. npj Digit. Med. (2026). https://doi.org/10.1038/s41746-026-02446-3

Download citation

Received: 30 June 2025
Accepted: 07 February 2026
Published: 23 February 2026
DOI: https://doi.org/10.1038/s41746-026-02446-3

Subjects

Abstract

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Supplementary information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links