Abstract
The aim of this study was to develop and validate a machine learning-based fall risk prediction model for middle-aged and elderly individuals with cardiovascular disease. Data were sourced from the China Health and Retirement Longitudinal Study. Key predictive variables were identified using Least Absolute Shrinkage and Selection Operator regression. Six machine learning algorithms were employed to construct predictive models. Model performance was assessed through both internal and external validation, with SHapley Additive exPlanation values used to interpret the optimal model. A total of 1784 participants were analyzed, of which 434 (24.3%) experienced falls during a two-year follow-up period. Nine predictive factors were incorporated into the model, with the Light Gradient Boosting Machine model demonstrating the best performance, achieving areas under the receiver operating characteristic curve of 0.839 (95% CI 0.804–0.874) for internal validation and 0.816 (95% CI 0.798–0.833) for external validation. This model provides a scientific basis for personalized fall prevention strategies aimed at reducing fall incidence and enhancing the quality of life for patients with cardiovascular disease. Finally, we formed a web platform based on the best model to predict the probability of falls in cardiovascular patients.
Data availability
All data are available from the CHARLS portal site (https://charls.pku.edu.cn/).
Abbreviations
- CVD:
-
Cardiovascular disease
- CHARLS:
-
China Health and Retirement Longitudinal Survey
- WC:
-
Waist circumference
- BMI:
-
Body mass index
- SBP:
-
Systolic blood pressure
- DBP:
-
Diastolic blood pressure
- CES-D:
-
Center for epidemiologic studies depression scale
- ADL:
-
Activities of daily living
- SMOTE:
-
Synthetic minority over-sampling technique
- LASSO:
-
Least absolute shrinkage and selection operator
- SHAP:
-
SHapley additive exPlanations
- ROC:
-
Receiver operating characteristic
- AUC:
-
Area under the receiver operating characteristic curve
- PPV:
-
Positive predictive value
- NPV:
-
Negative predictive value
- RF:
-
Random forest
- LightGBM:
-
Light gradient boosting machine
- LR:
-
Logistic regression
- XGBoost:
-
Extreme gradient boosting
- SVM:
-
Support vector machine
- DNN:
-
Deep neural network
- CKD:
-
Chronic kidney disease
References
World Health Organization. Falls. Geneva: World Health Organization; 2021. https://www.who.int/news-room/fact-sheets/detail/falls.
Lamb SE, Jorstad-Stein EC, Hauer K, Becker C, Prevention of Falls Network E, Outcomes Consensus G. 2005 Development of a common outcome data set for fall injury prevention trials: the Prevention of Falls Network Europe consensus. J Am Geriatr Soc. 53:1618–1622. https://doi.org/10.1111/j.1532-5415.2005.53455.x
Bergen, G., Stevens, M. R. & Burns, E. R. Falls and fall injuries among adults aged >/= 65 years: United States, 2014. MMWR Morb Mortal Wkly Rep. 65, 993–998. https://doi.org/10.15585/mmwr.mm6537a2 (2016).
Lewis, S. R. et al. Population-based interventions for preventing falls and fall-related injuries in older people. Cochrane Database Syst Rev. 1, CD013789. https://doi.org/10.1002/14651858.CD013789.pub2 (2024).
Florence, C. S. et al. Medical costs of fatal and nonfatal falls in older adults. J. Am. Geriatr. Soc. 66, 693–698. https://doi.org/10.1111/jgs.15304 (2018).
Timmis, A. et al. European society of cardiology: Cardiovascular disease statistics 2021. Eur. Heart J. 43, 716–799. https://doi.org/10.1093/eurheartj/ehab892 (2022).
Chong, B. et al. Global burden of cardiovascular diseases: Projections from 2025 to 2050. Eur J Prevent Cardiol https://doi.org/10.1093/eurjpc/zwae281 (2024).
Juraschek, S. P. et al. Subclinical cardiovascular disease and fall risk in older adults: Results from the atherosclerosis risk in communities study. J. Am. Geriatr. Soc. 67, 1795–1802. https://doi.org/10.1111/jgs.16041 (2019).
Dorfman, J. D. et al. Risks factors for significant injury after geriatric falls. Int J Aging Res https://doi.org/10.28933/ijoar-2018-12-2305 (2019).
Kaur, I., Doja, M. N. & Ahmad, T. Data mining and machine learning in cancer survival research: An overview and future recommendations. J. Biomed. Inform. 128, 104026. https://doi.org/10.1016/j.jbi.2022.104026 (2022).
Zuo, W. & Yang, X. Network-based predictive models for artificial intelligence: An interpretable application of machine learning techniques in the assessment of depression in stroke patients. BMC Geriatr. https://doi.org/10.1186/s12877-025-05837-5 (2025).
Bellantuono, L., Huang, A. A. & Huang, S. Y. Increasing transparency in machine learning through bootstrap simulation and shapely additive explanations. PLoS ONE https://doi.org/10.1371/journal.pone.0281922 (2023).
Chen, H., Yang, F., Duan, Y., Yang, L. & Li, J. A novel higher performance nomogram based on explainable machine learning for predicting mortality risk in stroke patients within 30 days based on clinical features on the first day ICU admission. BMC Med. Inform. Decis. Mak. 24, 161. https://doi.org/10.1186/s12911-024-02547-7 (2024).
Zhao, Y., Hu, Y., Smith, J. P., Strauss, J. & Yang, G. Cohort profile: The China health and retirement longitudinal study (CHARLS). Int. J. Epidemiol. 43, 61–68. https://doi.org/10.1093/ije/dys203 (2014).
Ma, X., He, J., Hu, Q., Wang, W. & Qiao, H. Association between cooking with solid fuels and depressive symptoms among middle-aged and older adults in China: The mediating effect of the residential environment. Ecotoxicol. Environ. Saf. 284, 116886. https://doi.org/10.1016/j.ecoenv.2024.116886 (2024).
Chai, S. et al. The relationship between handgrip strength and cognitive function among older adults in China: Functional limitation plays a mediating role. J. Affect. Disord. 347, 144–149. https://doi.org/10.1016/j.jad.2023.11.056 (2024).
Martin-Morales, A. et al. Predicting cardiovascular disease mortality: Leveraging machine learning for comprehensive assessment of health and nutrition variables. Nutrients https://doi.org/10.3390/nu15183937 (2023).
Chen, X., Lin, S., Zheng, Y., He, L. & Fang, Y. Long-term trajectories of depressive symptoms and machine learning techniques for fall prediction in older adults: Evidence from the China Health and retirement longitudinal study (CHARLS). Arch. Gerontol. Geriatr. 111, 105012. https://doi.org/10.1016/j.archger.2023.105012 (2023).
Han, S., Jung, H., Yoo, P. D., Provetti, A. & Cali, A. NOTE: Non-parametric oversampling technique for explainable credit scoring. Sci. Rep. 14, 26070. https://doi.org/10.1038/s41598-024-78055-5 (2024).
Zhu, H. et al. Machine learning model for cardiovascular disease prediction in patients with chronic kidney disease. Front Endocrinol (Lausanne) 15, 1390729. https://doi.org/10.3389/fendo.2024.1390729 (2024).
Olson, R. S., Cava, W., Mustahsan, Z., Varik, A. & Moore, J. H. Data-driven advice for applying machine learning to bioinformatics problems. Pac. Symp. Biocomput. 23, 192–203 (2018).
Christodoulou, E. et al. A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models. J. Clin. Epidemiol. 110, 12–22. https://doi.org/10.1016/j.jclinepi.2019.02.004 (2019).
Brereton, R. G. & Lloyd, G. R. Support vector machines for classification and regression. Analyst 135, 230–267. https://doi.org/10.1039/b918972f (2010).
Becker, T., Rousseau, A. J., Geubbelmans, M., Burzykowski, T. & Valkenborg, D. Decision trees and random forests. Am. J. Orthod. Dentofacial Orthop. 164, 894–897. https://doi.org/10.1016/j.ajodo.2023.09.011 (2023).
Yan, J. et al. LightGBM: Accelerated genomically designed crop breeding through ensemble learning. Genome Biol. 22, 271. https://doi.org/10.1186/s13059-021-02492-y (2021).
Panadero, I. et al. Regressions on quantum neural networks at maximal expressivity. Sci. Rep. 14, 31669. https://doi.org/10.1038/s41598-024-81436-5 (2024).
Li, X. et al. Development of an interpretable machine learning model associated with heavy metals’ exposure to identify coronary heart disease among US adults via SHAP: Findings of the US NHANES from 2003 to 2018. Chemosphere 311, 137039. https://doi.org/10.1016/j.chemosphere.2022.137039 (2023).
Bourke, R. et al. Cardiovascular disorders and falls among older adults: A systematic review and meta-analysis. J Gerontol A Biol Sci Med Sci https://doi.org/10.1093/gerona/glad221 (2024).
O’Halloran, A. M. et al. Cardiovascular disease and the risk of incident falls and mortality among adults aged >/= 65 years presenting to the emergency department: A cohort study from national registry data in Denmark. BMC Geriatr 24, 93. https://doi.org/10.1186/s12877-023-04618-2 (2024).
Lindberg, D. S. et al. Identification of important factors in an inpatient fall risk prediction model to improve the quality of care using EHR and electronic administrative data: A machine-learning approach. Int. J. Med. Inform. 143, 104272. https://doi.org/10.1016/j.ijmedinf.2020.104272 (2020).
Partridge, L., Deelen, J. & Slagboom, P. E. Facing up to the global challenges of ageing. Nature 561, 45–56. https://doi.org/10.1038/s41586-018-0457-8 (2018).
Zhou, Z., Wang, D., Sun, J., Zhu, M. & Teng, L. A machine learning-based prediction model for the probability of fall risk among Chinese community-dwelling older adults. Comput. Inform. Nurs. 42, 913–921. https://doi.org/10.1097/CIN.0000000000001202 (2024).
Lo, Y. et al. Using machine learning on home health care assessments to predict fall risk. Stud. Health Technol. Inform. 264, 684–688. https://doi.org/10.3233/SHTI190310 (2019).
Sansom, K. et al. Efficacy of obstructive sleep apnea treatment in reducing fall risk in older adults: Study protocol for a clinical trial. Sleep Med 124, 695–702. https://doi.org/10.1016/j.sleep.2024.11.003 (2024).
Chang, Y. C., Chang, M. C., Chang, Y. J. & Chen, M. D. Understanding factors relevant to poor sleep and coping methods in people with schizophrenia. BMC Psychiatry 21, 373. https://doi.org/10.1186/s12888-021-03384-y (2021).
Serrano-Checa, R. et al. Sleep quality, anxiety, and depression are associated with fall risk factors in older women. Int. J. Environ. Res. Public Health https://doi.org/10.3390/ijerph17114043 (2020).
Bohannon, R. W., Hull, D. & Palmeri, D. Muscle strength impairments and gait performance deficits in kidney transplantation candidates. Am. J. Kidney Dis. 24, 480–485. https://doi.org/10.1016/s0272-6386(12)80905-x (1994).
Huang, J. F. et al. Association between bone mineral density and severity of chronic kidney disease. Int J Endocrinol 2020, 8852690. https://doi.org/10.1155/2020/8852690 (2020).
Lin, P. et al. Development and validation of prediction model for fall accidents among chronic kidney disease in the community. Front Public Health 12, 1381754. https://doi.org/10.3389/fpubh.2024.1381754 (2024).
Bugdayci, D. et al. Frequency and predictors for falls in the ambulatory patients with rheumatoid arthritis: A longitudinal prospective study. Rheumatol. Int. 33, 2523–2527. https://doi.org/10.1007/s00296-013-2757-5 (2013).
Yao, S. & Chen, X. W. Association of pain with falls and fractures among middle-aged Korean community-dwelling adults. J. Gerontol. A. Biol. Sci. Med. Sci. https://doi.org/10.1093/gerona/glae241 (2024).
Seydi, M. et al. The effect of pain on gait in older people: A systematic review and meta-analysis. J Pain https://doi.org/10.1016/j.jpain.2024.104758 (2024).
Pitcher, M. H., Von Korff, M., Bushnell, M. C. & Porter, L. Prevalence and profile of high-impact chronic pain in the United States. J. Pain 20, 146–160. https://doi.org/10.1016/j.jpain.2018.07.006 (2019).
Stenhagen, M., Ekstrom, H., Nordell, E. & Elmstahl, S. Accidental falls, health-related quality of life and life satisfaction: A prospective study of the general elderly population. Arch. Gerontol. Geriatr. 58, 95–100. https://doi.org/10.1016/j.archger.2013.07.006 (2014).
Hajek, A. & König, H.-H. Falls and subjective well-being. Results of the population-based German ageing survey. Arch Gerontol Geriatr 72, 181–186. https://doi.org/10.1016/j.archger.2017.06.010 (2017).
Nebes, R. D. et al. Decreased working memory and processing speed mediate cognitive impairment in geriatric depression. Psychol. Med. 30, 679–691. https://doi.org/10.1017/s0033291799001968 (2000).
Carriere, I. et al. Patterns of selective serotonin reuptake inhibitor use and risk of falls and fractures in community-dwelling elderly people: The three-city cohort. Osteoporos. Int. 27, 3187–3195. https://doi.org/10.1007/s00198-016-3667-7 (2016).
Castelli, L. et al. A new assessment tool for risk of falling and telerehabilitation in neurological diseases: A randomized controlled ancillary study. Appl Sci (Basel) https://doi.org/10.3390/app152011247 (2025).
Beck Jepsen, D. et al. Predicting falls in older adults: An umbrella review of instruments assessing gait, balance, and functional mobility. BMC Geriatr. 22, 615. https://doi.org/10.1186/s12877-022-03271-5 (2022).
Yang, A. et al. A novel electronic-health-record based, machine-learning model to predict 1-year risk of fall hospitalisation in older adults: A Hong Kong territory-wide cohort and modelling study. Age Ageing https://doi.org/10.1093/ageing/afaf285 (2025).
Sun, L. et al. Construction and validation of a prediction model for fall risk in hospitalized older adults with osteoporosis. Front. Public Health 12, 1526660. https://doi.org/10.3389/fpubh.2024.1526660 (2024).
Acknowledgements
This article is primarily supported by the China Health and Retirement Longitudinal Study. We would also like to thank the participants for their valuable contributions and the research teams involved in data collection and management.
Funding
This research was supported by the S&T Program of Hebei (20377787D); Hebei Provincial Department of Science and Technology—Central Government Guided Local Science and Technology Development Fund Project (246Z7738G); Hebei Provincial Government-Funded Clinical Medicine Outstanding Talent Training Project (ZF2025068); Hebei Provincial Health Commission Research Project (20240495).
Author information
Authors and Affiliations
Contributions
C.H.W. and H.H.W. primarily contributed to revisions of the manuscript and funding acquisition. J.Y.D., X.Y.Y. and Z.Q.Z. was mainly involved in manuscript writing, and data processing. J.Y.S. primarily contributed to literature retrieval and data processing. All authors agreed on the final version of the manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Ethics approval and consent to participate
The CHARLS was approved by the Institutional Review Board of Peking University (IRB00001052-11015) and all participants gave written informed consent.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Dong, J., Yang, X., Zhang, Z. et al. Development of a web platform for predicting fall risk in cardiovascular patients using machine learning. Sci Rep (2026). https://doi.org/10.1038/s41598-026-43482-z
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-026-43482-z