Abstract
We aimed to construct and validate interpretable models for predicting mortality risk using machine learning (ML) methods to identify the risk factors associated with mortality in patients with diabetic neuropathy (DN). We selected patients from the US-based critical care database (Medical Information Mart for Intensive Care (MIMIC-IV)). Independent risk factors for in-hospital death were screened using Least Absolute Shrinkage and Selection Operator (LASSO). Subsequently, we constructed mortality risk prediction models utilizing random forest (RF), extreme gradient boosting (XGBoost), support vector machine (SVM), and logistic regression (LR). Finally, we comprehensively assessed and interpreted the best-performing ML model using SHapley Additive exPlanations (SHAP) analysis. The final study enrolled 1,313 patients with DN, randomly split into training (1,050, 80%) and validation sets (263, 20%). The RF model demonstrated superior performance, with an area under the curve (AUC) of 0.780 for the validation set. According to the feature importance result, red blood cell distribution width (RDW)_mean was identified as the most influential feature. This study demonstrated the potential of leveraging ML as a viable approach for predicting mortality risk in patients with DN. An interpretable ML model exhibited strong performance that could support clinical decision-making and enhance patient prognosis to some extent.
Introduction
Diabetic foot ulcers (DFUs) are one of the most serious complications of diabetes. It affects approximately 18.6 million people worldwide each year and is associated with increased rates of amputation and death1. Studies have shown a high mortality rate after major lower limb amputations due to diabetes and peripheral vascular disease. The 5-year mortality rate was very high among patients with any amputation (major and minor combined), ranging from 53% to 100%2. In light of the significant challenges posed by DFUs, including high rates of disability and mortality, as well as substantial treatment costs, it is crucial to proactively prevent the occurrence of DFUs. Existing studies have shown that diabetic neuropathy (DN) is closely related to DFUs3. DN is the most common complication of diabetes mellitus. It is characterized by a glove and stocking-like presentation of distal sensory or motor function loss4. In most cases, sensory neuropathy is accompanied by reduced perception of temperature and the sensation of pain. Consequently, the risk for trauma is significantly higher5,6,7,8. Due to the missing pain symptomatology, serious ulcerations are underestimated by both patients and doctors9,10. Therefore, DN emerges as a significant factor contributing to the development of DFUs, and prioritizing DN prevention can yield substantial benefits in terms of reducing amputation rates and patient mortality.
Plenty of studies have demonstrated that various factors, including age, duration of diabetes, HbA1c, fasting blood glucose (FBG), High triglyceride (HTG), total cholesterol (TC), hypertension, urine protein concentration, Diabetic retinopathy (DR), Diabetic kidney disease (DKD), and Cardiovascular diseases (CVD), can be used to predict the risk of DN11,12,13,14,15. However, most of these studies have focused on risk factors for the occurrence of DN, and few have looked at risk factors for death in patients with DN. Therefore, exploring the risk factors for death in patients with DN and establishing a prediction model for mortality risk can lead to early intervention and treatment of patients with DN to mitigate their poor prognosis and thus reduce their mortality.
Existing research shows that artificial intelligence methods can develop effective diagnostic and predictive tools to identify various diseases16. In recent years, machine learning (ML) algorithms have been found to have excellent predictive performance17. Unfortunately, due to the “black box” nature of ML algorithms, the lack of interpretability has been a major barrier to the implementation of ML models in healthcare18. Therefore, in this study, we not only built a model using ML algorithms to predict DN mortality risk factors and validated its performance, but also used SHapley Additive exPlanations (SHAP) analysis to comprehensively evaluate and interpret the ML model19.
Methods
Database
The present study is a secondary analysis of the existing data extracted from the Medical Information Mart for Intensive Care (MIMIC-IV), which covers the period from 2008 to 2019. MIMIC-IV is the result of a collaboration between Beth Israel Deaconess Medical Center (BIDMC) and Massachusetts Institute of Technology (MIT). Data collected at BIDMC as part of routine clinical care is deidentified, transformed, and made available to researchers who have completed training in human research and signed a data use agreement20. A researcher who has completed the Collaborative Institutional Training Initiative examination (Certification number 61042076 for author Huang) can access this database. Consent for data acquisition in the MIMIC project was approved by the institutional review boards of the Massachusetts Institute of Technology and Beth Israel Deaconess Medical Center. Therefore, patient informed consent and ethical approval were exempted from the present study.
Study population
We first obtained raw data using Structured Query Language (version 14.0) with Navicat Premium software (version 15.0.12). Structured Query Language was used to extract patient data, including sociodemographic characteristics, vital signs, laboratory parameters, complications, and microbiological information21. Patients in the database who met the following criteria were selected for the present study: [1] met the diagnostic criteria for DN. ICD-9 (24960, 24961, 25060, 25061, 25062 and 25063) and ICD-10 (E1040, E1042, E1043, E1140, E1141, E1142, E1143 and E1149) codes were used to identify patients with DN in the MIMIC-IV database; [2] first ICU admission at first hospitalization; [3] hospital length of stay > 48 h; [4] age of > 18 years; [5] data of complete blood counts were not missing. Data from these patients were used as the training cohort for model establishment. The data extraction procedure is illustrated in Fig. 1.
Data extraction
We extracted a total of 56 variables. Six of the variables are combination inflammatory indicators, which are calculated by extracting existing complete blood counts and performing a formula. The predictors included: (a) demographic information: age and gender; (b) comorbidities: myocardial infarct, congestive heart failure, chronic pulmonary disease, renal disease, hypertension, obesity; (c) the first-day laboratory results in the ICU: the mean level of glucose, the lowest and highest levels of anion gap, the lowest and highest levels of bicarbonate, the lowest and highest levels of creatinine, the lowest and highest levels of blood urea nitrogen (BUN), hemoglobin, platelets, white blood cells (WBC), red blood cells (RBC), red blood cell distribution width (RDW), lymphocytes, neutrophils, monocytes, the lowest and highest levels of calcium, the lowest and highest levels of chloride, the lowest and highest levels of sodium, the lowest and highest levels of potassium, Maximum and minimum values of partial thromboplastin time (PTT), Maximum and minimum values of prothrombin time (PT), and Maximum and minimum values of international normalized ratio (INR); (d) the first-day vital signs in the ICU: the mean levels of heart rate, mean blood pressure (Meanbp), the mean levels of respiratory rate, and the mean levels of temperature; (e)organ scores: simplified acute physiology score Ⅱ(SAPS-Ⅱ), acute physiology score Ⅲ (APS-Ⅲ), logistic organ dysfunction system (LODS), oxford acute severity of illness score (OASIS), Glasgow coma scale (GCS), and charlson comorbidity index (CCI); (f)combination inflammatory indicators: neutrophil–lymphocyte ratio (NLR), platelet–lymphocyte ratio (PLR), lymphocyte-to-monocyte ratio (LMR), platelet–neutrophil ratio (PNR), platelet–WBC ratio (PWR), and systemic immune inflammation index (SII).
Statistical analysis
The data were analyzed using SPSS 27.0 software. Normality testing was conducted using the Shapiro-Wilks test. Continuous variables with normal distributions are presented as mean (SD, standard deviation) and compared using independent samples t-tests. Non-normally distributed continuous variables are expressed as median (interquartile range) and compared using the Kruskal–Wallis test. Categorical variables are described as percentages and compared using the chi-square test. Variables with more than 20% missing values were excluded from further analysis, while those with 20% or fewer missing values were imputed. Missing data in continuous variables were imputed using the mean, while the counting data were imputed using mode. We used the Replace Missing Values function in SPSS 27.0 software to handle the missing data. Patients were classified into “survival” and “non-survival” groups based on their survival status. The results of variable comparisons are displayed in Table 1.
Model development and validation
In the present study, ML algorithms were used to construct a model for predicting the risk factors of death in DN. The LASSO method was used for model selection to reduce model complexity and the risk of overfitting and to optimize model training speed. LASSO is based on regression analysis methods, which select the most useful predictive features from the original data set by constructing a penalty function for the variable. Nonzero coefficient predictors of the coefficients were selected in the LASSO regression model to construct the subsequent DN death risk prediction model22,23,24. In this study, to address the issue of data imbalance between the groups of survival and non-survival, we used a stratified sampling method to randomly divide the overall dataset into two groups, allocating 80% to the training cohort (n = 1050) and 20% to the internal validation cohort (n = 263). Then, RF, XGBoost, SVM, and LR were used to establish a mortality risk prediction model. To minimize overfitting and identify the optimization hyperparameters, 10-fold cross-validation was also performed25. In addition, we presented various parameters associated with model performance in this study, including the area under the receiver operating characteristic curve (AUC), accuracy, precision, recall, and F1 score. After comparing the performance of four different ML models, the best-performing model is selected as the final model. Subsequently, we comprehensively evaluated and interpreted the ML models using SHAP analysis. The programming written using R software (package version 4.3.2) was used to conduct data analysis.
Results
Patients’ characteristics
In total, 1,313 patients diagnosed with DN were eventually enrolled in our study. Patients were categorized into “survival” (n = 812) and “non-survival” (n = 501) groups, according to their survival status. Table 1 shows the overall baseline characteristics, vital signs, and laboratory parameters of the two groups. In univariate analysis, age at admission; congestive heart failure; chronic pulmonary disease; renal disease; hypertension; obesity; glucose_mean; aniongap_min; aniongap_max; creatinine_min; creatinine_max; BUN_min; BUN_max; hemoglobin_mean; WBC_mean; RBC_mean; RDW_mean; lymphocytes_mean; neutrophils_mean; monocytes_mean; chloride_min; chloride_max; PTT_min; PTT_max; PT_min; PT_max; INR_min; INR_max; meanbp_mean; resprate_mean; SAPS-Ⅱ; APS-Ⅲ; LODS; OASIS; GCS; CCI; NLR; PLR; LMR; PNR; PWR and SII were considered significant between the groups (Table 1).
Features selected in models
This study used LASSO to select features from the data of the training cohort (Fig. 2). Nonzero coefficient predictors of the coefficients were chosen in the LASSO regression model to construct the subsequent DN death risk prediction model. Figure 2.A shows that the “a” line represents the minimum optimal harmonic coefficient of Mean Squared Error (MSE) (λ = 0.01055914) and a total of 21 variables are screened while the “b line represents the optimal harmonic coefficient within 1 SE of MSE (λ = 0.02687268) and a total of 12 variables are screened. Then, we use the boxplot to compare the model constructed with the two λ values. From the median of the boxplots of the two models, lambda. min is closer to the actual outcome and slightly more accurate than lambda.1se. However, it is advisable to reduce the number of feature variables in the model to mitigate overfitting. The screened variables corresponding to lambda.1se are ultimately selected. The 12 variables are: age, obesity, aniongap_min, RDW_mean, neutrophils_mean, chloride_max, PTT_min, PT_min, APS-Ⅲ, CCI, LMR, PWR.
Screening of death predictors in DN patients based on LASSO regression. A: The relationship between log(λ) and MSE after tenfold cross-verification. The “a” line represents the minimum optimal harmonic coefficient of MSE (λ = 0.01055914) and a total of 21 variables are screened. The “b” line represents the optimal harmonic coefficient within 1 SE of MSE (λ = 0.02687268) and a total of 12 variables are screened. B: The relationship between log(λ) and LASSO regression coefficient.
Model comparison
In the model development and validation stage, we first determined the optimal hyperparameters of the RF model: mtry = 2, ntree = 9, max_depth = 10, min_samples_split = 30, min_samples_leaf = 10, max_features=’ sqrt ‘, random_state = 42. The optimal hyperparameters of the SVM model were gamma = 0.1 and cost = 1. Additionally, the optimal hyperparameters of the XGBoost model: num_boost_round = 50, max_depth = 1, eta = 0.3, gamma = 0, colsample_bytree = 0.6, min_child_weight = 1, and subsample = 1. The final models were trained using optimized hyperparameters. The four ML models (RF, SVM, XGBoost and LR) demonstrated good discriminative power with AUCs (95%CI) of 0.999(0.998, 1.000), 0.762 (0.735, 0.788), 0.837 (0.813, 0.861), 0.792 (0.765, 0.819) in the training cohort, and 0.780 (0.723, 0.837), 0.679 (0.625, 0.734), 0.732 (0.672, 0.793), 0.773(0.718, 0.829) in the internal validation cohort, respectively.
Receiver operating characteristic (ROC) curve comparisons of the four models in the training cohort and the internal validation cohort are shown in Fig. 3. Four ML-based models have been developed and validated, and the RF model performed the strongest in predicting mortality risk of DN with the highest AUC value (0.999) in the training cohort (Fig. 3.A). At the same time, the RF model also performed best in the internal validation queue, with the highest AUC value (0.780) (Fig. 3.B). Results of the accuracy, precision, recall, and F1-score of the four models are shown in Fig. 4. Upon internal validation, the RF model demonstrated superior performance to the other three models, achieving accuracy, precision, recall, and F1 scores of 0.721, 0.735, 0.858, and 0.792 respectively (Fig. 4.B). Therefore, the RF model was selected as the final model. Furthermore, decision curve analysis (DCA) (Fig. 5.A) and clinical impact curve (CIC) (Fig. 5.B) were also performed to further demonstrate the performance of the RF model. Clinical intervention guided by the RF model provided a greater net benefit in the validation cohort when the threshold probability was 0.1–0.7 (Fig. 5.A). The CIC analysis demonstrated the clinical efficacy of this predictive model. When the threshold probability was greater than 60% of the prediction score probability value, the prediction model identified a high-risk population for DN death that closely matched the actual population, confirming the robust clinical effectiveness of the prediction model (Fig. 5.B).
Model interpretation
To reveal the distribution of the effect of each input acting on RF, SHAP analysis was utilized. The significance of each feature was assessed using the Shapley value, which compared the model’s prediction with and without the feature utilizing the “shapviz” package (Fig. 6.A and B). A feature importance analysis was conducted to interpret the significance of the variables, revealing that the top 5 contributing variables were RDW_mean, neutrophils_mean, CCI, chloride_max, and PT_min in descending order. Thus, the RDW_mean was found to be the most important variable in predicting the mortality risk in patients with DN. Among the important variables, neutrophils_mean, CCI, chloride_max, and PT_min are included (Fig. 6.A). The updated SHAP analysis revealed notable changes in feature importance rankings compared to the original model. While RDW_mean and neutrophils_mean maintained their top positions, consistent with their established roles in inflammation and oxidative stress, chloride_max and PT_min emerged as new top-five predictors, replacing APS III and age. The persistence of RDW_mean as the most important feature reinforces its clinical relevance in critical illness, while the elevated importance of chloride_max reflects electrolyte imbalance and PT_min indicates coagulation dysfunction, suggesting these variables may provide additional prognostic value. The consistency of CCI’s high ranking across both analyses supports its well-documented role in mortality prediction. In each feature important row, the attributions of all patients to the outcome were drawn with dots of different colors, where the yellow dots represent the high-risk value and the purple dots represent the low-risk value. Elevated levels of RDW and neutrophils, higher scores on the CCI, lower serum chloride levels, and increased PT were all associated with an increased predicted risk of mortality (Fig. 6.B). The SHAP explanatory force plots in Fig. 6.C-E were utilized to analyze three cases within the validation group (#156, #994, and #1144). The patient in case 156 had a survival outcome; case 994 resulted in hospital death, and the patient in case 1144 experienced non-hospital death within one year. The Shapley value of each variable is depicted by an arrow, where the yellow arrow signifies a positive impact, i.e., the feature enhances the predicted value; whereas the purple arrow denotes a negative impact, indicating that the feature diminishes the predicted value. The force plots also display the primary variables and their corresponding values. In the three cases in Fig. 6.C-E, the contribution values of the variables are different, indicating that the influential variables on the prediction vary from case to case.
Comparing the different ML models’ AUC in the training (A) and validation (B) cohorts ML, machine learning; AUC, area under the receiver operating characteristic curve; RF, Random Forest; SVM, Support vector Machine; XGBoost, Extreme Gradient Boosting; LR, Logistic regression. Note. Figure 3 reflects updated AUC values for the four models.
The comparison of performance in the four models in training (A) and validation (B) cohorts RF, Random Forest; SVM, Support vector Machine; XGBoost, Extreme Gradient Boosting; LR, Logistic regression. Note. Figure 4 presents the updated performance metrics, including accuracy, precision, recall, and F1 scores, across the four evaluated models.
DCA (A) and CIC (B) of the RF model DCA, decision curve analysis; CIC, clinical impact curve; RF, Random Forest. Note. Figure 5 presents the updated DCA and CIC for the RF model.
The model’s interpretation. A: The importance ranking of the 12 variables according to the mean (|SHAP value|). B: The importance ranking of the 12 risk factors with stability and interpretation using the optimal model. In the right figure, the higher SHAP value of a feature is given, the higher risk of death the patient would have, while it is the opposite in the left one. The yellow part in the feature value represents a higher value. C: SHAP value force plot of case 156 was utilized for the individual prediction of characteristic variables. D: SHAP value force plot of case 994 was utilized for the individual prediction of characteristic variables. E: SHAP value force plot of case 1144 was utilized for the individual prediction of characteristic variables. Note. Figure 6 presents the revised feature importance ranking and corresponding SHAP explanatory force plots.
Discussion
This study has made several contributions. First, we introduced the RF algorithm, one of the state-of-the-art ML methods for developing prediction models25,26. The RF algorithm is particularly well-suited for analyzing large-scale datasets and demonstrates superior performance in generating high-accuracy classification predictions. These capabilities make it a valuable tool for supporting personalized medical decision-making, particularly in applications such as disease risk assessment27. To optimize model performance, hyperparameter tuning was conducted using the “caret” package. In this study, comparative analyses with conventional risk prediction models, including traditional LR, revealed that the RF model achieved significantly enhanced predictive performance across multiple key metrics. Specifically, the RF algorithm exhibited notable improvements in AUC, accuracy, precision, recall, and F1-score, underscoring its robustness and superiority over classical approaches. Furthermore, DCA and CIC were conducted to further illustrate the performance of the RF model. Finally, ML is a good predictive tool and usually has high accuracy. However, it has always been regarded as a “black box,” indicating poor interpretability28. In our study, we conducted SHAP analysis to interpret one of our best-performing models and identify several predictors most strongly associated with risk factors for death in patients with DN.
Initially, RDW was ranked as the most important variable among the 12 predictors based on the mean |SHAP value|. Previous studies have indicated that the development and progression of DN are associated with oxidative stress and inflammation4,29,30,31. Additionally, elevated RDW has been linked to higher rates of amputation and mortality in DN patients32,33. Although the exact mechanisms underlying the relationship between high RDW and poor prognosis in DN are not fully understood, inflammation and oxidative stress are likely contributing factors. Inflammation can disrupt bone marrow function and iron metabolism, which inhibits red blood cell production, shortens erythrocyte half-life, and promotes anisocytosis. These changes ultimately lead to an increased RDW34. Similarly, oxidative stress directly damages erythrocytes, reducing their survival and further elevating RDW levels35,36. High oxidative stress may lower RBC survival and promote the release of immature erythrocytes of varying sizes into the bloodstream, leading to anisocytosis37,38. RDW elevation occurs in response to both inflammatory stimuli and oxidative stress—both of which are associated with worse outcomes in DN. Therefore, increased RDW is closely related to an unfavorable prognosis in DN, underscoring its value as a predictive marker for patient outcomes.
Subsequently, our results indicate that inflammatory markers such as neutrophils, LMR, and PWR are associated with mortality risk in DN, meriting further study. Previous retrospective studies have established a strong link between inflammation and diabetic neuropathies39,40. Diabetes and neurodegeneration involve inflammatory pathways such as NF-κB and MAPK41. Activation of these pathways promotes the release of pro-inflammatory cytokines like IL-1, IL-6, and TNF-α, which contribute to apoptosis and nerve damage. These substances further amplify inflammation through positive feedback loops with necrotic tissue, initiating a cascade response42. Therefore, the inflammatory response is closely associated with the prognosis of DN. Meanwhile, Tuna Demirdal et al.43, as well as YunXu et al.44, demonstrated that some combinations of inflammatory markers, such as PLR, NLR, and LMR, are associated with the risk of amputation in patients with diabetic foot ulcers. In summary, inflammatory markers play a significant role in predicting mortality risk in patients with DN.
Additionally, the CCI is a tool specifically designed to predict long-term mortality. The clinometric sensitivity of the CCI has been demonstrated in a variety of medical conditions, with stepwise increases in the CCI associated with stepwise increases in mortality45. The APS-Ⅲ can predict vital status at hospital discharge with the use of data recorded at ICU admission46. It always shows excellent mortality prediction performance. In this study, CCI and APS-Ⅲ were also important characteristic variables in predicting the risk of death in patients with DN. Age is also an essential factor that cannot be ignored. The older the age, the higher the risk of death in DN patients.
Furthermore, low serum chloride levels are associated with higher mortality risk in patients with DN. Serum chloride plays a vital role in maintaining acid-base balance and plasma electroneutrality47,48. A study by Kai Zhang et al.49 found that lower serum chloride concentrations were negatively correlated with in-hospital mortality among diabetics. Additionally, a cross-sectional analysis by Bhagwant G. Pawar et al.50 showed that reduced chloride levels in type 2 diabetes patients were strongly associated with poor glycemic control and longer disease duration. Mechanistically, low serum chloride may affect muscle cell volume and pH regulation by activating with-no-lysine (WNK) kinases51,52 and enhancing Na–K–2Cl and Na–Cl co-transporter activity53. Such dysregulation of cardiomyocyte pH can lead to arrhythmias and impaired myocardial contractility54. Therefore, decreased serum chloride may contribute to poor prognosis in diabetic neuropathy patients.
Moreover, our study suggests that specific coagulation parameters are associated with mortality risk in DN patients. Abnormal coagulation function has been implicated in the pathological progression of DN, with a hypercoagulable state contributing to diabetic vascular complications55. For instance, elevated plasma D-dimer levels have been linked to impaired peripheral nerve conduction and a higher prevalence of DPN in type 2 diabetes, indicating its potential as a prognostic marker in DN56. However, the relationship between PT and PTT and the risk of mortality in patients with DN has seldom been reported. This study effectively supplemented this point, proving that coagulation function indicators such as PT and PTT can be used as predictors of death risk in patients with DN, laying a foundation for further exploration of the mechanism related to coagulation function indicators and death risk in patients with DN.
Ultimately, obesity is a crucial and pertinent factor in our predictive model. Some studies have shown that57,58 obesity is an important risk factor for early diabetic neuropathy. Simultaneously, research has indicated that the absence of blood-brain (nerve) barrier protection renders the peripheral nervous system (PNS) susceptible to dysfunction induced by obesity, such as loss of peripheral sensory neurons and small intraepidermal nerve fibers, distribution of sensory polyneuropathy in a glove-sock pattern, and decreased motor and sensory nerve function59. The mechanism may involve obesity’s effects on anabolic and catabolic processes mediated through the autonomic nervous system (ANS), a component of the PNS60. Obesity is thus instrumental in the development and progression of diabetic neuropathy. Our study further supports that obesity contributes to mortality in patients with diabetic neuropathy. These results highlight the importance of weight management in clinical care to improve prognosis in this population.
This study has several important limitations that must be acknowledged. First, we observed a substantial performance gap (ΔAUC = 0.219) between the training set (AUC 0.999, 95% CI 0.998–1.000) and the validation set (AUC 0.780, 95% CI 0.723–0.837), indicating significant overfitting in the RF model. This discrepancy suggests that while the model achieves near-perfect discrimination in the training data, its generalizability to unseen data is suboptimal. The underlying reasons may include: (1) the limited dataset size (n = 1,313), which restricts the model’s ability to learn robust patterns; (2) potential feature selection bias, where certain predictors may be spuriously correlated with outcomes in the training cohort but not in broader populations; and (3) data heterogeneity, as the MIMIC-IV ICU dataset may not fully represent real-world variability in patient characteristics. Clinically, the observed 21.9% decline in AUC (from 0.999 to 0.780) raises concerns about the model’s reliability in new ICU cohorts. If deployed without further refinement, the model could produce overly optimistic risk predictions, potentially leading to misinterpretations in clinical decision-making. To address these limitations, we propose the following steps: (1) External validation: We will seek authorization to assess the model’s generalizability by utilizing the eICU Collaborative Research Database, which contains diverse ICU patient data collected from multiple hospitals across the United States. (2) Multi-center data collaboration: We plan to collect additional institutional data and establish partnerships with other hospitals to expand the external validation cohort. (3) Incorporation of alternative models (e.g., gradient boosting, neural networks) in follow-up studies. Second, missing data were present in this study, leading to the exclusion of variables with more than 20% missing values. However, we addressed these limitations by employing interpolation methods using SPSS 27.0 software. Finally, it is crucial to note that this is a retrospective study based on a public database and thus carries inherent limitations.
Conclusion
In conclusion, machine learning methods are reliable tools for predicting mortality risk in patients with DN. We utilized data from the MIMIC-IV database to develop a machine learning-based model for predicting mortality risk in DN patients. The results indicated that RDW_mean, neutrophils_mean, CCI, chloride_max, and PT_min were the most important predictors of mortality risk in DN. The interpretable model demonstrated robust performance, showing potential to assist in clinical decision-making and improve prognostic outcomes. However, it should be noted that the applicability of the proposed random forest model remains to be further validated in external cohorts, and results should be interpreted with caution until such validation is performed.
Data availability
MIMIC-IV data were obtained from https://mimic.mit.edu. Observed data of patients with DN are available from the corresponding authors upon reasonable request.
Abbreviations
- ANS:
-
Autonomic Nervous System
- APS-Ⅲ:
-
Acute Physiology Score Ⅲ
- AUC:
-
Area Under the Curve
- BIDMC:
-
Beth Israel Deaconess Medical Center
- BUN:
-
Blood Urea Nitrogen
- CCI:
-
Charlson Comorbidity Index
- CIC:
-
Clinical Impact Curve
- CVD:
-
Cardiovascular Diseases
- DCA:
-
Decision Curve Analysis
- DFUs:
-
Diabetic Foot Ulcers
- DKD:
-
Diabetic Kidney Disease
- DN:
-
Diabetic Neuropathy
- DR:
-
Diabetic Retinopathy
- FBG:
-
Fasting Blood Glucose
- GCS:
-
Glasgow Coma Scale
- HTG:
-
High Triglyceride
- ICD:
-
International Classification of Diseases
- ICU:
-
Intensive Care Unit
- IL-1:
-
Interleukin-1
- IL-6:
-
Interleukin-6
- INR:
-
International Normalized Ratio
- LASSO:
-
Least Absolute Shrinkage and Selection Operator
- LMR:
-
Lymphocyte-to-Monocyte Ratio
- LODS:
-
Logistic Organ Dysfunction System
- LR:
-
Logistic Regression
- MAPK:
-
Mitogen-Activated Protein Kinase
- Meanbp:
-
Mean Blood Pressure
- MIMIC-IV:
-
Medical Information Mart for Intensive Care IV
- MIT:
-
Massachusetts Institute of Technology
- ML:
-
Machine Learning
- MSE:
-
Mean Squared Error
- NF-κB:
-
Nuclear Factor Kappa B
- NLR:
-
Neutrophil-Lymphocyte Ratio
- OASIS:
-
Oxford Acute Severity of Illness Score
- PLR:
-
Platelet-Lymphocyte Ratio
- PNR:
-
Platelet-Neutrophil Ratio
- PNS:
-
Peripheral Nervous System
- PT:
-
Prothrombin Time
- PTT:
-
Partial Thromboplastin Time
- PWR:
-
Platelet-White Blood Cells Ratio
- RBC:
-
Red Blood Cells
- RDW:
-
Red Blood Cell Distribution Width
- RF:
-
Random Forest
- ROC:
-
Receiver Operating Characteristic
- SAPS-Ⅱ:
-
Simplified Acute Physiology Score Ⅱ
- SD:
-
Standard Deviation
- SHAP:
-
SHapley Additive exPlanations
- SII:
-
Systemic Immune Inflammation Index
- SVM:
-
Support Vector Machine
- TC:
-
Total Cholesterol
- TNF-α :
-
Tumor Necrosis Factor-alpha
- WBC:
-
White Blood Cells
- WNK:
-
With-No-Lysine
- XGBoost:
-
Extreme Gradient Boosting
References
Armstrong, D. G., Tan, T. W., Boulton, A. J. M. & Bus, S. A. Diabetic foot ulcers: A review. JAMA 330 (1), 62–75. https://doi.org/10.1001/jama.2023.10578 (2023). PMID: 37395769; PMCID: PMC10723802.
Thorud, J. C., Plemmons, B., Buckley, C. J., Shibuya, N. & Jupiter, D. C. Mortality after nontraumatic major amputation among patients with diabetes and peripheral vascular disease: A systematic review. J. Foot Ankle Surg. 55 (3), 591–599. https://doi.org/10.1053/j.jfas.2016.01.012 (2016 May-Jun). Epub 2016 Feb 19. PMID: 26898398.
Volmer-Thole, M. & Lobmann, R. Neuropathy and diabetic foot syndrome. Int. J. Mol. Sci. 17 (6), 917. https://doi.org/10.3390/ijms17060917 (2016). PMID: 27294922; PMCID: PMC4926450.
Feldman, E. L. et al. Diabetic neuropathy. Nat. Rev. Dis. Primers. 5 (1), 42. https://doi.org/10.1038/s41572-019-0097-9 (2019). PMID: 31197183; PMCID: PMC7096070.
Boulton, A. J., Kirsner, R. S. & Vileikyte, L. Clinical practice. Neuropathic diabetic foot ulcers. N Engl J Med. ;351(1):48–55. (2004). https://doi.org/10.1056/NEJMcp032966. PMID: 15229307.
Kamenov, Z. A. & Traykov, L. D. Diabetic somatic neuropathy. Adv Exp Med Biol. ;771:155 – 75. (2012). https://doi.org/10.1007/978-1-4614-5441-0_14. PMID: 23393678.
Peltier, A., Goutman, S. A. & Callaghan, B. C. Painful diabetic neuropathy. BMJ. ;348:g1799. (2014). https://doi.org/10.1136/bmj.g1799. Erratum in: BMJ. 2014;348:g3440. PMID: 24803311.
Russell, J. W., Zilliox, L. A. & Diabetic neuropathies. Continuum (Minneap Minn). ;20(5 Peripheral Nervous System Disorders):1226-40. (2014). https://doi.org/10.1212/01.CON.0000455884.29545.d2. PMID: 25299279; PMCID: PMC4208099.
Boulton, A. J. What you can’t feel can hurt you. J Vasc Surg. ;52(3 Suppl):28S-30S. (2010). https://doi.org/10.1016/j.jvs.2010.06.005. PMID: 20804930.
Lobmann, R. Therapie des diabetischer Fusssyndroms Das sind die fünf häufigsten Fehler [Diabetic foot: how to avoid therapeutic mistakes]. MMW Fortschr Med. ;155(18):63 – 6. German. (2013). https://doi.org/10.1007/s15006-013-2284-z. PMID: 24288927.
Liu, X. et al. Development and validation of a risk prediction model for early diabetic peripheral neuropathy based on a systematic review and meta-analysis. Front. Public. Health. 11, 1128069. https://doi.org/10.3389/fpubh.2023.1128069 (2023). PMID: 36908480; PMCID: PMC9992641.
Gelaw, N. B. et al. Development and validation of risk prediction model for diabetic neuropathy among diabetes mellitus patients at selected referral hospitals, in Amhara regional state Northwest Ethiopia, 2005–2021. PLoS One. 18 (8), e0276472. https://doi.org/10.1371/journal.pone.0276472 (2023). PMID: 37643198; PMCID: PMC10465000.
Wu, B., Niu, Z. & Hu, F. Study on risk factors of peripheral neuropathy in type 2 diabetes mellitus and establishment of prediction model. Diabetes Metab. J. 45 (4), 526–538. https://doi.org/10.4093/dmj.2020.0100 (2021). Epub 2021 Jul 30. PMID: 34352988; PMCID: PMC8369209.
Lian, X. et al. Study on risk factors of diabetic peripheral neuropathy and establishment of a prediction model by machine learning. BMC Med. Inf. Decis. Mak. 23 (1), 146. https://doi.org/10.1186/s12911-023-02232-1 (2023). PMID: 37533059; PMCID: PMC10394817.
Braffett, B. H. et al. Risk factors for diabetic peripheral neuropathy and cardiovascular autonomic neuropathy in the diabetes control and complications Trial/Epidemiology of diabetes interventions and complications (DCCT/EDIC) study. Diabetes 69 (5), 1000–1010. https://doi.org/10.2337/db19-1046 (2020). Epub 2020 Feb 12. PMID: 32051148; PMCID: PMC7171957.
Choi, R. Y., Coyner, A. S., Kalpathy-Cramer, J., Chiang, M. F. & Campbell, J. P. Introduction to machine learning, neural Networks, and deep learning. Transl Vis. Sci. Technol. 9 (2), 14. https://doi.org/10.1167/tvst.9.2.14 (2020). PMID: 32704420; PMCID: PMC7347027.
Beam, A. L. & Kohane, I. S. Big Data and Machine Learning in Health Care. JAMA. ;319(13):1317–1318. (2018). https://doi.org/10.1001/jama.2017.18391. PMID: 29532063.
Cabitza, F., Rasoini, R. & Gensini, G. F. Unintended Consequences of Machine Learning in Medicine. JAMA. ;318(6):517–518. (2017). https://doi.org/10.1001/jama.2017.7797. PMID: 28727867.
Lundberg, S. & Lee, S. I. A Unified Approach to Interpreting Model Predictions. (2017). https://doi.org/10.48550/arXiv.1705.07874
Johnson, A. E. W. et al. MIMIC-IV, a freely accessible electronic health record dataset. Sci Data. ;10(1):1. doi: 10.1038/s41597-022-01899-x. Erratum in: Sci Data. 2023;10(1):31. Erratum in: Sci Data. 2023;10(1):219. PMID: 36596836; PMCID: PMC9810617. (2023).
Yang, J. et al. Brief introduction of medical database and data mining technology in big data era. J. Evid. Based Med. 13 (1), 57–69. https://doi.org/10.1111/jebm.12373 (2020). Epub 2020 Feb 22. PMID: 32086994; PMCID: PMC7065247.
Van Calster, B. & Wynants, L. Machine Learning in Medicine. N Engl J Med. ;380(26):2588. (2019). https://doi.org/10.1056/NEJMc1906060. PMID: 31242379.
McEligot, A. J., Poynor, V., Sharma, R. & Panangadan, A. Logistic LASSO regression for dietary intakes and breast cancer. Nutrients 12 (9), 2652. https://doi.org/10.3390/nu12092652 (2020). PMID: 32878103; PMCID: PMC7551912.
Ou, Q. et al. LASSO-based machine learning algorithm to predict the incidence of diabetes in different stages. Aging Male. ;26(1):2205510. doi: 10.1080/13685538.2023.2205510. PMID: 37156752. (2023).
Fan, T. et al. Predicting the risk factors of diabetic ketoacidosis-associated acute kidney injury: A machine learning approach using XGBoost. Front. Public. Health. 11, 1087297. https://doi.org/10.3389/fpubh.2023.1087297 (2023). PMID: 37089510; PMCID: PMC10117643.
Breiman, L. Random forests. Mach. Learn. 45 (1), 5–32 (2001).
Dong, B., Liu, X. & Yu, S. Utilizing machine learning algorithms to identify biomarkers associated with diabetic nephropathy: A review. Med. (Baltim). 103 (8), e37235. https://doi.org/10.1097/MD.0000000000037235 (2024). PMID: 38394492; PMCID: PMC11309603.
Bayramli, I. et al. Temporally informed random forests for suicide risk prediction. J. Am. Med. Inf. Assoc. 29 (1), 62–71. https://doi.org/10.1093/jamia/ocab225 (2021). PMID: 34725687; PMCID: PMC8714280.
Baum, P., Toyka, K. V., Blüher, M., Kosacka, J. & Nowicki, M. Inflammatory mechanisms in the pathophysiology of diabetic peripheral neuropathy (DN)-New aspects. Int. J. Mol. Sci. 22 (19), 10835. https://doi.org/10.3390/ijms221910835 (2021). PMID: 34639176; PMCID: PMC8509236.
Galiero, R. et al. Peripheral neuropathy in diabetes mellitus: pathogenetic mechanisms and diagnostic options. Int. J. Mol. Sci. 24 (4), 3554. https://doi.org/10.3390/ijms24043554 (2023). PMID: 36834971; PMCID: PMC9967934.
Feldman, E. L., Nave, K. A., Jensen, T. S. & Bennett, D. L. H. New horizons in diabetic neuropathy: Mechanisms, Bioenergetics, and pain. Neuron 93 (6), 1296–1313. https://doi.org/10.1016/j.neuron.2017.02.005 (2017). PMID: 28334605; PMCID: PMC5400015.
Hong, J. et al. Impact of red cell distribution width and red cell distribution width/albumin ratio on all-cause mortality in patients with type 2 diabetes and foot ulcers: a retrospective cohort study. Cardiovasc. Diabetol. 21 (1), 91. https://doi.org/10.1186/s12933-022-01534-4 (2022). PMID: 35658957; PMCID: PMC9166463.
Arıcan, G., Kahraman, H. Ç., Özmeriç, A., İltar, S. & Alemdaroğlu, K. B. Monitoring the prognosis of diabetic foot ulcers: predictive value of Neutrophil-to-Lymphocyte ratio and red blood cell distribution width. Int. J. Low Extrem Wounds. 19 (4), 369–376 (2020). Epub 2020 Feb 10. PMID: 32037920.
Weiss, G. & Goodnough, L. T. Anemia of chronic disease. N Engl J Med. ;352(10):1011-23. (2005). https://doi.org/10.1056/NEJMra041809. PMID: 15758012.
Tsuboi, S. et al. Impact of red blood cell distribution width on long-term mortality in diabetic patients after percutaneous coronary intervention. Circ J. ;77(2):456 – 61. (2013). https://doi.org/10.1253/circj.cj-12-0730. Epub 2012 Oct 17. PMID: 23075764.
Friedman, J. S. et al. SOD2-deficiency anemia: protein oxidation and altered protein expression reveal targets of damage, stress response, and antioxidant responsiveness. Blood 104 (8), 2565–2573. https://doi.org/10.1182/blood-2003-11-3858 (2004). Epub 2004 Jun 17. PMID: 15205258.
Kiefer, C. R. & Snyder, L. M. Oxidation and erythrocyte senescence. Curr Opin Hematol. ;7(2):113-6. (2000). https://doi.org/10.1097/00062752-200003000-00007. PMID: 10698298.
Vayá, A. et al. Red blood cell distribution width and erythrocyte deformability in patients with acute myocardial infarction. Clin Hemorheol Microcirc. ;59(2):107 – 14. (2015). https://doi.org/10.3233/CH-131751. PMID: 23752170.
Cheng, Y. et al. How inflammation dictates diabetic peripheral neuropathy: an enlightening review. CNS Neurosci. Ther. 30 (4), e14477. https://doi.org/10.1111/cns.14477 (2024). Epub 2023 Oct 5. PMID: 37795833; PMCID: PMC11017439.
Stino, A. M., Rumora, A. E., Kim, B. & Feldman, E. L. Evolving concepts on the role of dyslipidemia, bioenergetics, and inflammation in the pathogenesis and treatment of diabetic peripheral neuropathy. J. Peripher Nerv. Syst. 25 (2), 76–84. https://doi.org/10.1111/jns.12387 (2020). PMID: 32412144; PMCID: PMC7375363.
Yeung, Y. T., Aziz, F., Guerrero-Castilla, A. & Arguelles, S. Signaling Pathways in Inflammation and Anti-inflammatory Therapies. Curr Pharm Des. ;24(14):1449–1484. (2018). https://doi.org/10.2174/1381612824666180327165604. PMID: 29589535.
Sandireddy, R., Yerra, V. G., Areti, A., Komirishetty, P. & Kumar, A. Neuroinflammation and oxidative stress in diabetic neuropathy: futuristic strategies based on these targets. Int. J. Endocrinol. 2014, 674987. https://doi.org/10.1155/2014/674987 (2014). Epub 2014 Apr 30. PMID: 24883061; PMCID: PMC4021687.
Demirdal, T. & Sen, P. The significance of neutrophil-lymphocyte ratio, platelet-lymphocyte ratio and lymphocyte-monocyte ratio in predicting peripheral arterial disease, peripheral neuropathy, osteomyelitis and amputation in diabetic foot infection. Diabetes Res Clin Pract. ;144:118–125. doi: 10.1016/j.diabres.2018.08.009. Epub 2018 Sep 1. PMID: 30176260. (2018).
Xu, Y. et al. The impact of inflammatory biomarkers on amputation rates in patients with diabetic foot ulcers. Int. Wound J. 21 (4), e14827. https://doi.org/10.1111/iwj.14827 (2024). PMID: 38522433; PMCID: PMC10961172.
Charlson, M. E., Carrozzino, D., Guidi, J. & Patierno, C. Charlson comorbidity index: A critical review of clinimetric properties. Psychother. Psychosom. 91 (1), 8–35. https://doi.org/10.1159/000521288 (2022). Epub 2022 Jan 6. PMID: 34991091.
Moreno, R. P. et al. SAPS 3 Investigators. SAPS 3–From evaluation of the patient to evaluation of the intensive care unit. Part 2: Development of a prognostic model for hospital mortality at ICU admission. Intensive Care Med. ;31(10):1345-55. (2005). https://doi.org/10.1007/s00134-005-2763-5. Epub 2005 Aug 17. Erratum in: Intensive Care Med. 2006;32(5):796. PMID: 16132892; PMCID: PMC1315315.
Berend, K., van Hulsteijn, L. H. & Gans, R. O. Chloride: the queen of electrolytes? Eur. J. Intern. Med. 23 (3), 203–211. https://doi.org/10.1016/j.ejim.2011.11.013 (2012). Epub 2011 Dec 21. PMID: 22385875.
Yunos, N. M., Bellomo, R., Story, D. & Kellum, J. Bench-to-bedside review: Chloride in critical illness. Crit Care. ;14(4):226. doi: 10.1186/cc9052. Epub 2010 Jul 8. PMID: 20663180; PMCID: PMC2945073. (2010).
Zhang, K. et al. Association between serum chloride and in-hospital mortality in congestive heart failure with diabetes: data from the MIMIC-IV database. J. Diabetes Metab. Disord. 23 (1), 859–870. https://doi.org/10.1007/s40200-023-01362-7 (2023). PMID: 38932886; PMCID: PMC11196478.
Pawar, B. G. et al. Correlation of serum electrolyte imbalances with diabetic duration and medication use: A Cross-Sectional comparative study. Cureus 16 (9), e70065. https://doi.org/10.7759/cureus.70065 (2024). PMID: 39449943; PMCID: PMC11501493.
Hjort, R. et al. Physical Activity, genetic Susceptibility, and the risk of latent autoimmune diabetes in adults and type 2 diabetes. J. Clin. Endocrinol. Metab. 105 (11), e4112–e4123. https://doi.org/10.1210/clinem/dgaa549 (2020). PMID: 32835373; PMCID: PMC7947966.
Piala, A. T. et al. Chloride sensing by WNK1 involves Inhibition of autophosphorylation. Sci. Signal. 7 (324), ra41. https://doi.org/10.1126/scisignal.2005050 (2014). PMID: 24803536; PMCID: PMC4123527.
Yang, C. L., Zhu, X. & Ellison, D. H. The thiazide-sensitive Na-Cl cotransporter is regulated by a WNK kinase signaling complex. J. Clin. Invest. 117 (11), 3403–3411. https://doi.org/10.1172/JCI32033 (2007). PMID: 17975670; PMCID: PMC2045602.
Orchard, C. H. & Cingolani, H. E. Acidosis and arrhythmias in cardiac muscle. Cardiovasc Res. ;28(9):1312-9. (1994). https://doi.org/10.1093/cvr/28.9.1312. PMID: 7954638.
Domingueti, C. P., Dusse, L. M., de Carvalho Md, L. P., Gomes, K. B. & Fernandes, A. P. Diabetes mellitus: the linkage between oxidative stress, inflammation, hypercoagulability and vascular complications. J. Diabetes Complications 2016 May-Jun ;30(4):738–745. https://doi.org/10.1016/j.jdiacomp.2015.12.018. Epub 2015 Dec 18. PMID: 26781070. (2016).
Zhuang, L. et al. Increased plasma D-dimer levels May be a promising indicator for diabetic peripheral neuropathy in type 2 diabetes. Front. Endocrinol. (Lausanne). 13, 930271. https://doi.org/10.3389/fendo.2022.930271 (2022). PMID: 36082076; PMCID: PMC9445160.
Smith, A. G., Singleton, J. R. & J Diabetes Complications. Obesity and hyperlipidemia are risk factors for early diabetic neuropathy. ;27(5):436–442. https://doi.org/10.1016/j.jdiacomp.2013.04.003. (2013). Sep-Oct Epub 2013 Jun 2. PMID: 23731827; PMCID: PMC3766404.
Fentie, D. et al. Diabetic peripheral neuropathy and their associations with overweight/obesity and impaired blood glucose among diabetic patients in Eastern ethiopia: Institutional-based study. SAGE Open. Med. 11, 20503121231220784 (2023). PMID: 38148763; PMCID: PMC10750549.
O’Brien, P. D., Hinder, L. M., Callaghan, B. C. & Feldman, E. L. Neurological consequences of obesity. Lancet Neurol. 16 (6), 465–477. https://doi.org/10.1016/S1474-4422(17)30084-4 (2017). PMID: 28504110; PMCID: PMC5657398.
Lambert, E. et al. Sympathetic nervous system activity is associated with obesity-induced subclinical organ damage in young adults. Hypertension 56 (3), 351–358. https://doi.org/10.1161/HYPERTENSIONAHA.110.155663 (2010). Epub 2010 Jul 12. PMID: 20625075.
Acknowledgements
We are thankful to the participants of this research.
Funding
This research was supported by multiple grants:
1) The Guangdong Province Famous TCM Doctor (Wang He) Studio Construction Project (supported research personnel and expertise);
2) National Natural Science Foundation of China (82104910) (funded study design and methodology development);
3) Guangzhou University of Chinese Medicine University-Hospital Joint Fund (GZYSE2024G09) (supported data analysis and manuscript preparation);
4) Guangdong Province TCM Key Hospital Construction Project (provided infrastructure and technical support).
Author information
Authors and Affiliations
Contributions
YH, YL, SW, YZ, and PJ were responsible for the preparation of materials, data collection, and analysis, as well as drafting the initial version of the manuscript. WM, HW, and JB critically revised the manuscript. All authors commented on previous versions of the manuscript.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
The research involving human participants was approved by the Massachusetts Institute of Technology (Cambridge, MA) and Beth Israel Deaconess Medical Center (Boston, MA). Informed consent for participation was not required for this study by national legislation and institutional requirements. A researcher who has completed the Collaborative Institutional Training Initiative examination (Certification number 61042076 for author Huang) can access this database. Therefore, patient informed consent and ethical approval were exempted from the present study.
Competing interests
The authors declare no competing interests.
Plagiarism screening report
The plagiarism screening report has been included in the Supplementary Materials for full transparency.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Huang, Y., Mo, W., Liu, Y. et al. Machine learning models predict mortality risk in diabetic neuropathy patients using MIMIC-IV data. Sci Rep 15, 38702 (2025). https://doi.org/10.1038/s41598-025-22363-x
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41598-025-22363-x






