Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Advertisement

Scientific Reports
  • View all journals
  • Search
  • My Account Login
  • Content Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • RSS feed
  1. nature
  2. scientific reports
  3. articles
  4. article
An interpretable machine learning model predicts frailty risk in middle-aged and older adults with gastrointestinal disease: a longitudinal study
Download PDF
Download PDF
  • Article
  • Open access
  • Published: 28 April 2026

An interpretable machine learning model predicts frailty risk in middle-aged and older adults with gastrointestinal disease: a longitudinal study

  • Yin Chen  ORCID: orcid.org/0000-0002-9712-33271 &
  • Mingyu Chen2 

Scientific Reports (2026) Cite this article

  • 825 Accesses

  • Metrics details

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

  • Computational biology and bioinformatics
  • Diseases
  • Health care
  • Medical research
  • Risk factors

Abstract

Frailty is a significant health concern in the aging global population, particularly among middle-aged and older adults with gastrointestinal disease (GID). Early detection of individuals at increased risk is critical for implementing timely preventive and therapeutic interventions. This study aimed to develop and validate an interpretable machine learning (ML) model to assess frailty risk in this population. To overcome the “black box” nature of conventional ML models, we integrated Shapley Additive exPlanations (SHAP), which helps identify key predictors of frailty and improve the interpretability of the model’s decision-making process. This study analyzed data from the 2013-2015 survey waves of the China Health and Retirement Longitudinal Study (CHARLS). To identify the most predictive variables for frailty, we employed a dual-method approach combining the Boruta algorithm and Least Absolute Shrinkage and Selection Operator (LASSO) regression. We applied ten different ML algorithms to develop prediction models: Logistic Regression (LR), Support Vector Machine (SVM), Random Forest (RF), Gradient Boosting Classifier (GBC), Light Gradient Boosting Machine (LightGBM), K-Nearest Neighbors (KNN), Decision Tree (DT), Multilayer Perceptron (MLP), Naive Bayes (NB), and Adaptive Boosting (AdaBoost). The area under the receiver operating characteristic (ROC) curve (AUC) served as the primary performance metric. Additional metrics, including sensitivity, specificity, precision, and the F1-score, were used to comprehensively evaluate model accuracy. Calibration curves were generated to assess the consistency between predicted probabilities and observed risk, and the Brier score was used as a quantitative measure of calibration accuracy. Decision curve analysis (DCA) was performed to evaluate the clinical net benefit of each model. To understand the impact of individual predictors on the output, the SHAP method was used to provide transparent insights into each feature’s contribution to the estimated frailty risk. A total of 1,404 participants met the eligibility criteria for this study, of whom 444 (31.62%) were classified as frail. Using the Boruta algorithm and LASSO regression, we identified 10 key predictors of frailty. Among all the ML models tested, the LR model showed the best overall performance, achieving an AUC of 0.759 (95% CI: 0.711–0.806). Shapley Additive exPlanations (SHAP) analysis further revealed the top five predictors of frailty in this population: depression, grip strength, education level, the total number of chronic diseases, and self-rated health. This study introduces an interpretable ML model that effectively detects frailty risk among middle-aged and older adults with GID. The model demonstrates strong predictive accuracy and transparency, supporting its potential as a clinical decision-support tool pending further external validation and real-world deployment. Such proactive measures could improve patient care and promote better long-term health outcomes in this population.

Similar content being viewed by others

Enhancing the convenience of frailty index assessment for elderly Chinese people with machine learning methods

Article Open access 05 October 2024

Development of a web platform for predicting fall risk in cardiovascular patients using machine learning

Article Open access 02 April 2026

Explainable machine learning for long-term cardiovascular disease risk prediction in Chinese middle-aged and older adults: a 9-year longitudinal cohort study with web-based risk calculator

Article Open access 25 March 2026

Acknowledgements

The authors thank Jie Liu, PhD (Chinese PLA General Hospital), Qilin Yang (The Second Affiliated Hospital of Guangzhou Medical University), and Haibo Li, PhD (Capital Institute of Pediatrics) for their valuable contributions to statistical analysis, manuscript review, and critical feedback. The authors also sincerely thank CHARLS, its participants, and staff for supporting this study.

Author information

Authors and Affiliations

  1. Department of General Surgery, The Affiliated Xuancheng Hospital of Wannan Medical University (Xuancheng People’s Hospital), Xuancheng, 242000, P. R. China

    Yin Chen

  2. Department of Internal Medicine, Guangwai Hospital (Guangwai Geriatric Hospital) of Xicheng District, No. 2A, Sanyili, Xicheng District, Beijing, 100053, P. R. China

    Mingyu Chen

Authors
  1. Yin Chen
    View author publications

    Search author on:PubMed Google Scholar

  2. Mingyu Chen
    View author publications

    Search author on:PubMed Google Scholar

Corresponding author

Correspondence to Mingyu Chen.

Ethics declarations

Competing interests

The authors declare no competing interests.

Ethics

The CHARLS study protocol complies with the Declaration of Helsinki and received approval from the Institutional Review Board of Peking University (Approval No. IRB00001052-11015). All participants provided written informed consent prior to enrollment.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information 1. (download DOCX )

Supplementary Information 2. (download TIF )

Supplementary Information 3. (download DOCX )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Chen, Y., Chen, M. An interpretable machine learning model predicts frailty risk in middle-aged and older adults with gastrointestinal disease: a longitudinal study. Sci Rep (2026). https://doi.org/10.1038/s41598-026-50348-x

Download citation

  • Received: 29 November 2025

  • Accepted: 21 April 2026

  • Published: 28 April 2026

  • DOI: https://doi.org/10.1038/s41598-026-50348-x

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Keywords

  • Frailty
  • Machine learning
  • CHARLS
  • Shapley additive explanation
  • Gastrointestinal disease
Download PDF

Advertisement

Explore content

  • Research articles
  • News & Comment
  • Collections
  • Subjects
  • Follow us on Facebook
  • Follow us on X
  • Sign up for alerts
  • RSS feed

About the journal

  • About Scientific Reports
  • Contact
  • Journal policies
  • Guide to referees
  • Calls for Papers
  • Editor's Choice
  • Journal highlights
  • Open Access Fees and Funding

Publish with us

  • For authors
  • Language editing services
  • Open access funding
  • Submit manuscript

Search

Advanced search

Quick links

  • Explore articles by subject
  • Find a job
  • Guide to authors
  • Editorial policies

Scientific Reports (Sci Rep)

ISSN 2045-2322 (online)

nature.com footer links

About Nature Portfolio

  • About us
  • Press releases
  • Press office
  • Contact us

Discover content

  • Journals A-Z
  • Articles by subject
  • protocols.io
  • Nature Index

Publishing policies

  • Nature portfolio policies
  • Open access

Author & Researcher services

  • Reprints & permissions
  • Research data
  • Language editing
  • Scientific editing
  • Nature Masterclasses
  • Research Solutions

Libraries & institutions

  • Librarian service & tools
  • Librarian portal
  • Open research
  • Recommend to library

Advertising & partnerships

  • Advertising
  • Partnerships & Services
  • Media kits
  • Branded content

Professional development

  • Nature Awards
  • Nature Careers
  • Nature Conferences

Regional websites

  • Nature Africa
  • Nature China
  • Nature India
  • Nature Japan
  • Nature Middle East
  • Privacy Policy
  • Use of cookies
  • Legal notice
  • Accessibility statement
  • Terms & Conditions
  • Your US state privacy rights
Springer Nature

© 2026 Springer Nature Limited

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing