Abstract
To develop an optimal predictive model for hyperkalemia in patients with chronic kidney disease (CKD). Clinical data of CKD patients were collected from Shandong Provincial Hospital Affiliated to Shandong First Medical University between January 2017 and December 2023, including 343 hyperkalemia cases and 713 cases with normal potassium levels. The data were divided into training and test sets at a 7:3 ratio. Important features were screened via univariate analysis, collinearity diagnosis, and LASSO regression, identifying 20 feature variables for hyperkalemia. Five machine learning models were established: logistic regression (LR), decision tree (DT), gradient boosting machine (GBM), support vector machine (SVM), and K-nearest neighbors (KNN). Models were compared using the area under the curve (AUC), Brier score, calibration curve, decision curve analysis (DCA), and overfitting control. The LR model was identified as the optimal model, showing excellent performance in predicting hyperkalemia. The AUCs were 0.899 (training set) and 0.868 (test set), with corresponding F1 scores of 0.819 and 0.725. Calibration and DCA curves demonstrated high predictive accuracy and clinical benefit. Additionally, the nomogram based on the LR model could assist in clinical decision-making. Our findings suggest that the LR model is the optimal predictive model for the risk of hyperkalemia in CKD patients.
Acknowledgements
ZL, WH and RW designed the study. WH and BL performed the data extraction, analyzed and interpreted the data and drafted the manuscript. JL, LG and MJ assisted in clinical data collation. YJ, JS and SC assisted in analyzing the data. ZZ, XL and XH interpreted the data. ZL and RW revised the manuscript. All authors read and approved the final manuscript.
Funding
This work was funded by National Natural Science Foundation of China (No. 82370721), Shandong Provincial Natural Science Foundation (No. ZR2022LSW020, ZR2023LZY014, ZR2024MH330), and Taishan Scholars Program (No. tstp20240854).
Author information
Authors and Affiliations
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic Supplementary Material
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Han, W., Liu, B., Li, J. et al. Development and validation of a machine learning-based risk prediction model for hyperkalemia in patients with chronic kidney disease. Sci Rep (2026). https://doi.org/10.1038/s41598-026-53273-1
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-026-53273-1