Abstract
Following complete mesocolic excision (CME), heart failure (HF) emerges as a significant complication, exerting substantial impacts on both short-term and long-term patient prognoses. The primary objective of our investigation was to develop a machine learning model capable of discerning preoperative and intraoperative high-risk factors, facilitating the prediction of HF occurrence subsequent to CME. A cohort comprising 1158 patients diagnosed with colon cancer was enrolled in our study, encompassing 172 individuals who developed postoperative HF. We compiled 37 feature variables, spanning patient demographic traits, foundational medical histories, preoperative examination characteristics, surgery types, and intraoperative details. Four distinct machine learning algorithms—extreme gradient boosting (XGBoost), random forest (RF), support vector machine (SVM), and k-nearest neighbor algorithm (KNN)—were employed to construct the model. The k-fold cross-validation method, ROC curve, calibration curve, decision curve analysis (DCA), and external validation were employed for comprehensive model evaluation. The XGBoost algorithm exhibited superior performance compared to the other three prediction models. Specifically, within the training set of the internal validation set, the XGBoost algorithm demonstrated an AUC value of 0.990 (0.983–0.996), accuracy of 0.929 (0.923–0.935), sensitivity of 0.983 (0.976–0.990), specificity of 0.918 (0.910–0.927), and F1 value of 0.799 (0.784–0.814). In the validation set of the internal validation set, the XGBoost algorithm recorded an AUC value of 0.941 (0.890–0.991), accuracy of 0.897 (0.882–0.911), sensitivity of 0.898 (0.860–0.937), specificity of 0.875 (0.845–0.905), and F1 value of 0.711 (0.656–0.766). The AUC value for the external validation set was 0.93, indicating robust extrapolative capabilities of the XGBoost prediction model. The HF prediction model post-CME, derived from the XGBoost machine learning algorithm in this study, attests to its elevated predictive accuracy and clinical utility.
Introduction
Colon cancer, a gastrointestinal malignancy characterized by a bleak prognosis, exhibits an escalating incidence attributed to shifts in human lifestyle and dietary patterns. According to a 2019 epidemiological survey1, colon cancer ranks third globally in malignant tumor incidence, following lung and breast cancer. Radical surgery remains the primary treatment for colon cancer, with complete mesocolic excision (CME), initially proposed by Hohenberger2,3, proving effective in diminishing morbidity and mortality rates. CME underscores the importance of thoroughly excising the tumor and colonic mesentery, clearing surrounding lymph nodes, and entirely obstructing the main vessels of the colon, embodying the tenets of radical colon cancer surgery.
Despite the comprehensive exploration of CME’s benefits, clinicians have observed that certain patients experience postoperative ischemia-reperfusion injury and hemodynamic changes, elevating the risk of postoperative heart failure (HF). The incidence of these complications is anticipated to rise with the increasing age of the population undergoing the procedure. HF, the ultimate stage of cardiovascular disease, poses a significant global public health challenge due to its associated high morbidity and mortality4. Recognizing the pivotal role of imaging in HF diagnosis, a worldwide multicenter prospective study named VISION emphasized the significance of elevated postoperative troponin in predicting myocardial injury and HF after non-cardiac surgery5. Nevertheless, subjecting all patients to such tests would impose a considerable burden on healthcare systems globally. Consequently, it is imperative to pinpoint risk factors for postoperative HF development in patients.
Artificial intelligence (AI) is rapidly advancing in the medical realm6,7,8, with machine learning, a prominent AI branch, lauded for its stable model construction and precise prediction capabilities. Embraced by clinicians, machine learning is employed in clinical prediction and various domains9,10,11. In this investigation, we scrutinized clinical data from colon cancer patients, utilizing machine learning algorithms to formulate a predictive model for postoperative HF. By doing so, it holds the promise of curbing healthcare costs and aiding clinicians in devising timely and accurate individualized treatment plans for their patients.
Materials and methods
Study subjects
This study leveraged data from the clinical databases of Wuxi People’s Hospital, affiliated with Nanjing Medical University.
The inclusion criteria were defined as follows: (a) patients who underwent open complete mesocolic excision (CME) or laparoscopic-assisted CME; (b) the surgical team comprised senior surgeons proficient in independently performing CME; (c) patients received a confirmed postoperative pathological diagnosis of colon cancer; and (d) patients received a confirmed postoperative diagnosis of heart failure (HF).
Exclusion criteria were outlined as follows: (A) patients with other malignant tumors; (B) patients with distant metastasis of colon cancer confirmed through pathological examination or imaging; (C) patients with a preoperative diagnosis of acute HF, atrial fibrillation, or other severe cardiac dysfunction diseases; (D) patients with a history of open-heart surgery; and (E) patients with missing cases, incomplete clinical data, or absent follow-up.
During the postoperative period, the two surgeons conducted regular assessments, including history taking, electrocardiogram (ECG), and echocardiography, every other month. All participants in the study underwent a follow-up period lasting a minimum of 5 years post-surgery. The study adhered to the principles of the Declaration of Helsinki and received approval from the Ethics Committee of Wuxi People’s Hospital, with approval number KY22085.
Study design and data collection
A comprehensive set of 37 variables was collected, encompassing preoperative variables (within 24 h of the surgery day), intraoperative variables, and postoperative variables. The preoperative variables included patient demographic characteristics (gender, age, smoking history, alcohol history, and body mass index), basic clinical characteristics (American Society of Anesthesiologists score, nutrition risk screening 2002 score, history of surgery, duration of disease, adjuvant chemotherapy history, and adjuvant radiotherapy history), basic medical history (anemia, tachycardia, chronic kidney disease, diabetes, hypertension, chronic obstructive pulmonary disease, and hyperlipidemia), laboratory tests (albumin, carcinoembryonic antigen, carbohydrate antigen 19 − 9, procalcitonin, C-reactive protein, serum amyloid A, and neutrophil to lymphocyte ratio), and tumor characteristics (T-stage, N-stage, tumor size, and number of tumors). Intraoperative variables covered the surgical approach, type of surgery, duration of surgery, intraoperative bleeding, whether it was an emergency procedure, intraoperative tachycardia, and intraoperative blood transfusion. The study’s outcome variable focused on postoperative heart failure (HF).
Diagnosis of HF and definition of associated factors
The diagnosis and evaluation of heart failure (HF) necessitate a comprehensive approach encompassing patient history, physical examination, laboratory investigations, cardiac imaging, and functional assessments. Initially, the likelihood of heart failure is gauged through clinical manifestations such as recent weight gain, jugular venous distension, and peripheral edema. Elevated jugular venous pressure and alterations in the location of the apical impulse are also hallmark indicators in the diagnostic process. Furthermore, in patients with suspected postoperative heart failure, an electrocardiogram (ECG) is performed to assess rhythm, heart rate, QRS morphology and duration, as well as to identify pathological changes, including atrial fibrillation or left ventricular hypertrophy. Chest imaging is conducted to identify pulmonary conditions or other potential etiologies of dyspnea, providing critical insights into pulmonary congestion, pulmonary edema, and cardiomegaly. However, normal findings on chest imaging do not definitively exclude the diagnosis of heart failure. Plasma levels of B-type natriuretic peptide (BNP) or N-terminal proBNP (NT-proBNP) are also measured to evaluate the severity and prognosis of the condition. Ultimately, cardiac ultrasound, regarded as the diagnostic gold standard, is employed to assess both the structure and function of the heart, with left ventricular ejection fraction (LVEF) quantified using the Simpson biplane method.
In this study, the diagnosis of HF was grounded in a unified, comprehensive evaluation encompassing patient history, physical examination, laboratory investigations, cardiac imaging, and functional assessments. In instances where the diagnosis remained uncertain, clinical manifestations such as recent weight gain, jugular venous distension, and peripheral edema were prioritized during the initial evaluation. All patients suspected of postoperative heart failure underwent echocardiography, with LVEF measured using the Simpson biplane method. Patients with an LVEF of less than 50% were classified as having heart failure with reduced ejection fraction (HFrEF), while those exhibiting a normal LVEF (≥ 50%) were diagnosed with heart failure with preserved ejection fraction (HFpEF), provided other clinical symptoms were present. In cases where echocardiographic data were unavailable or inconclusive, the diagnosis was augmented with clinical evaluations, including ECG findings and plasma levels of BNP or NT-proBNP.
Development and evaluation of predictive models for machine learning algorithms
The construction and assessment of clinical prediction models were carried out using SPSS software and R software, involving the following steps:
-
(a).
Data Preprocessing:
-
- Colon cancer patients from January 2010 to December 2015 comprised the model establishment set, while patients from January 2016 to January 2018 formed the external validation set.
-
- The model establishment set was randomly divided into a training set (70%) and a test set (30%).
-
(b).
Univariate and Multivariate Regression Analyses:
-
- Univariate and multivariate regression analyses were performed on the data from the model establishment set.
-
- Categorical variables underwent chi-square tests, t-tests were used for continuous variables following a normal distribution, and the rank sum test was applied for non-normally distributed continuous variables.
-
- Logistic regression analysis was conducted on significant variables from the univariate analysis to identify independent influences on postoperative HF.
-
- Four models (XGBoost, RF, SVM, and KNN) were used to rank the importance of each independent influence, with the top ten variables selected.
-
(c).
Machine Learning Algorithm Prediction Models:
-
- The selected feature variables were integrated into four machine learning algorithm prediction models (SVM, RF, XGBoost, and KNN).
-
- Models were evaluated for discrimination, calibration, and clinical usefulness. The best model was chosen based on these evaluations.
-
- ROC curves were generated to determine the area under the curve (AUC) value, calibration curves assessed agreement between predicted and actual results, and decision curve analysis (DCA) gauged patient benefit post-intervention.
-
(d).
External Validation:
-
- The best model underwent external validation using an external test set.
-
- ROC curves and calibration curves were plotted to assess generalizability and predictive efficiency.
-
(e).
Model Interpretation:
-
- SHAP analysis (Shapley value) was employed to ascertain the contribution of each feature to the prediction.
-
- SHAP summary plots, ranking the importance of risk factors, and SHAP force plots, analyzing and interpreting individual sample prediction results, were constructed based on Shapley values.
Code availability
This work is supported by the Extreme Smart Analysis platform (https://www.xsmartanalysis.com/). In addition, to ensure that versions of custom code, software, or algorithms described in the publication are maintained, we will release them as uploads to GitHub (https://github.com/SongyunZhao123/zsy.git).
Results
Basic clinical information of the patients
The study comprised a total of 1158 patients, with 172 (14.85%) experiencing postoperative heart failure (HF) (Fig. 1; Table 1). The original dataset is provided in Supplementary Table.
Flow diagram of patients included in the study.
Analysis of risk factors for postoperative HF
Results from univariate and multivariate analyses identified independent influencing factors for postoperative HF, including age, body mass index (BMI), smoking history, hyperlipidemia, hypertension, diabetes, preoperative tachycardia, chronic obstructive pulmonary disease (COPD), N-stage, blood transfusion, intraoperative tachycardia, C-reactive protein (CRP) level, and neutrophil to lymphocyte ratio (NLR) (P < 0.05) (Table 2). Machine learning models (XGBoost, RF, SVM, and KNN) highlighted risk factors, such as advanced age, preoperative and intraoperative tachycardia, BMI ≥ 25 kg/m2, smoking history, COPD history, hypertension history, NLR ≥ 3, and CRP level ≥ 10 mg/l (Fig. 2A-D).
The variable influence factor ranking plots of the four models. (A) Variable importance ranking diagram of the XGBoost model. (B) Variable importance ranking diagram of the RF model. (C) Variable importance ranking diagram of the SVM model. (D) Variable importance ranking diagram of the KNN model.
Model Building and evaluation
The ROC curve results show that XGBoost has an AUC value as high as 0.990 in the training set; the AUC value in the validation set is 0.941, which is the best performance among the four models (Table 3). The calibration curve results show that the calibration curves of the four models are similar to the ideal curves, and the models have high consistency between the predicted and actual results. The DCA curves showed that all four models achieved a net clinical benefit relative to either the full treatment or no treatment plan (Fig. 3A-D). The k-fold cross-validation method was used to compare the generalization ability of the four models. A sample of 254 cases (30.09%) was taken as the test set in the internal validation set, and the remaining samples were used as the training set for 10-fold cross-validation, AUC = 0.9252 ± 0.0464 in the validation set of XGBoost, AUC = 0.9272 in the test set, accuracy = 0.8819 (Fig. 4A-C); AUC = 0.9071 ± 0.0533 in the validation set of RF, AUC = 0.9227 in the test set, accuracy = 0.8780; AUC = 0.9188 ± 0.0601 in the validation set of SVM, AUC = 0.9218 in the test set, accuracy = 0.8543; AUC = 0.8587 ± 0.1024 in the validation set of KNN, AUC = 0.9032 in the test set, accuracy = 0.9173. After a comprehensive comparison, the XGBoost algorithm was chosen to construct the model in this study.
Evaluation of the four models for predicting heart failure. (A) ROC curves for the training set of the four models. (B) ROC curves for the validation set of the four models. (C) Calibration plots of the four models. The 45° dotted line on each graph represents the perfect match between the observed (y-axis) and predicted (x-axis) complication probabilities. A closer distance between two curves indicates greater accuracy. (D) DCA curves of the four models. The intersection of the red curve and the All curve is the starting point, and the intersection of the red curve and the None curve is the node within which the corresponding patients can benefit.
Internal validation of the XGBoost model. (A) ROC curve of the XGBoost model for the training set. (B) ROC curve of the XGBoost model for the validation set. (C) ROC curve of the XGBoost model for the test set. (D) External validation of the XGBoost model.
Model external validation
The ROC curve for the external validation set yielded an AUC value of 0.93, indicating high accuracy in disease prediction (Fig. 4D).
Model explanation
The SHAP summary plot result shows that the risk factors for HF after CME were ranked as advanced age, history of COPD, history of smoking, intraoperative tachycardia, history of hypertension, CRP level ≥ 10 mg/l, NLR ≥ 3, preoperative tachycardia, and BMI ≥ 25 kg/m2 (Fig. 5). The SHAP force plots show the predictive analysis of the study model for four HF patients. The model predicted a 0.004 probability of HF in patient I, and the characteristics that increased its probability were NLR ≥ 3, BMI ≥ 25 kg/m2, and preoperative tachycardia; the model predicted a 0.012 probability of HF in patient II, and the characteristics that increased its probability were NLR ≥ 3, BMI ≥ 25 kg/m2, history of smoking, and intraoperative tachycardia; the model predicted a probability of HF in patient III of 0.958, and the characteristics that increased its probability were NLR ≥ 3, CRP level ≥ 10 mg/l, history of COPD, history of hypertension, preoperative tachycardia, and intraoperative tachycardia; the model predicted a probability of HF in patient IV of 0.497, and the characteristics that increased its probability were age ≥ 65, CRP level ≥ 10 mg/l, history of hypertension, preoperative tachycardia, and intraoperative tachycardia (Fig. 6A-D).
SHAP summary plot. Risk factors are arranged along the y-axis based on their importance, which is given by the mean of their absolute Shapley values. The higher the risk factor is positioned in the plot, the more important it is for the model.
SHAP force plot. The contributing variables are arranged in the horizontal line, sorted by the absolute value of their impact. Blue represents features that have a negative effect on disease prediction, with a decrease in SHAP values; red represents features that have a positive effect on disease prediction, with an increase in SHAP values. (A) Predictive Analysis of Patient (I) (B) Predictive Analysis of Patient (II) (C) Predictive Analysis of Patient (III) (D) Predictive Analysis of Patient IV.
Discussion
Although HF is not the sole complication after procedures like CME, its potential occurrence remains a crucial concern. All major surgeries, including CME, elevate patients’ risk of various complications, HF included. Identifying specific risk factors for HF in this context can facilitate enhanced patient monitoring and management during the critical postoperative phase12. Although clinicians have various methods to identify major complications like postoperative HF after total mesocolonectomy, several key factors justify the use of novel machine learning methods to explore HF. Machine learning algorithms possess the capability to analyze extensive and complex datasets, revealing subtle patterns and risk factors that might elude traditional statistical methods. This can result in more accurate predictions and personalized risk assessments. By precisely predicting the risk of HF, healthcare providers can implement targeted interventions and preventive measures, thereby enhancing patient outcomes, reducing mortality rates, and improving overall quality of care.
This study assesses the predictive models for heart failure (HF) risk constructed by four distinct machine learning algorithms. Notably, the XGBoost algorithm demonstrates exceptional accuracy and stands out for its efficiency, flexibility, and universal adaptability13,14. The AUC values of XGBoost in the training and validation sets, 0.990 and 0.941 respectively, underscore its exceptional discriminative power, significantly surpassing other models such as RF, SVM, and KNN. In 10-fold cross-validation, XGBoost achieved an AUC of 0.9252 ± 0.0464, reflecting its strong generalization capability. Although KNN exhibited slightly higher accuracy in the test set (0.9173), XGBoost demonstrated more stable and consistent performance, with superior AUC values and overall evaluation metrics. In contrast, other algorithms exhibited certain limitations. While Random Forest (RF) is proficient at managing high-dimensional data and mitigating overfitting, its AUC (0.9071 ± 0.0533) and accuracy (0.8780) were marginally lower than XGBoost’s. Moreover, RF is more sensitive to hyperparameter tuning, necessitating intricate adjustments to achieve optimal performance15. Support Vector Machine (SVM) achieved an AUC of 0.9188 ± 0.0601, indicating strong predictive capability. However, its computational complexity is notably high, particularly when handling large-scale datasets, leading to significantly increased training times. Additionally, SVM is more sensitive to noisy data, and its performance diminishes compared to XGBoost when addressing data imbalances. Meanwhile, although KNN demonstrated high accuracy (0.9173) in this study, its AUC of 0.8587 ± 0.1024 reveals a relatively weak discriminative ability. KNN is prone to the “curse of dimensionality” when handling high-dimensional data, with substantial computational overhead, especially in the presence of numerous feature variables. This renders its efficiency and effectiveness inferior to that of XGBoost, particularly when managing large-scale datasets16. Moreover, XGBoost’s calibration curve aligns closely with the ideal curve, indicating a high degree of agreement between predicted probabilities and actual outcomes, thereby enhancing the model’s reliability in clinical applications. XGBoost is also highly effective at managing high-dimensional data, utilizing its built-in feature importance ranking function to prioritize variables that most significantly influence the prediction of postoperative heart failure. This capability proves particularly advantageous in complex datasets with up to 37 feature variables. Additionally, XGBoost incorporates L1 and L2 regularization, which mitigates overfitting, allowing the model to maintain robust performance across both the training and validation phases.
Several studies17,18 have validated the efficacy of machine learning algorithm applications in clinical diagnosis and prognosis. Furthermore, machine learning techniques excel in accurately predicting adverse outcomes in disease progression when compared to traditional diagnostic methods. In the context of this study, machine learning algorithms played a pivotal role in developing the predictive model. The resulting model facilitates patient avoidance of unnecessary tests, alleviates financial burdens on patients’ families, and reduces the side effects associated with diagnostic instruments. Most importantly, the model aids clinical decision-makers in accurately identifying high-risk patients, enabling timely intervention and improving patient prognosis. The SHAP analysis results from this study identify advanced age, preoperative and intraoperative tachycardia, BMI ≥ 25 kg/m2, a history of smoking, a history of chronic obstructive pulmonary disease (COPD), a history of hypertension, neutrophil-to-lymphocyte ratio (NLR) ≥ 3, and C-reactive protein (CRP) level ≥ 10 mg/l as significant risk factors for postoperative HF following colorectal cancer surgery. We will enhance our postoperative care for these patients by closely monitoring cardiac ultrasound, electrocardiogram, and serum levels of brain natriuretic peptide (BNP) or N-terminal pro-brain natriuretic peptide (NT-proBNP) to promptly detect any abnormal cardiac function. Additionally, we can initiate or adjust cardiac medications such as beta-blockers, ACE inhibitors, and diuretics for these patients to improve their short-term prognosis and survival.
As age increases, patients face a heightened likelihood of developing organic lesions in the heart. In older patients, the heart muscle undergoes various degrees of hypertrophy, fibrosis, and other pathological changes, significantly impacting the heart’s pumping function. Major procedures such as colorectal cancer surgery induce changes in the sympathetic and endocrine systems in older patients, increasing the volume and pressure load on the heart and elevating the risk of cardiovascular diseases, including HF. This susceptibility is also observed in patients with preoperative and intraoperative tachycardia. The study results emphasize that both these patient groups are at a higher risk of postoperative HF. The authors posit that surgery, as a specific type of trauma, exacerbates existing complications in patients. Those with tachycardia face a potential risk of HF, and patients with atypical clinical symptoms may develop acute HF in response to surgical triggers. Moreover, the procedure may lead to increased pressure in the patient’s left atrium, significantly heightening the risk of postoperative HF. Philippe et al. have similarly demonstrated a strong association between patients’ heart rate and postoperative HF, indicating a 10% increase in the risk of postoperative HF for every 10 beats/min increase in heart rate19,20,21. This further underscores the utility of a rapid heart rate as a novel risk factor for predicting postoperative HF.
The current investigation utilizes four distinct samples to elucidate the predictive capacity of the model concerning postoperative heart failure (HF) in patients. For instance, in the disease prediction analysis of samples I and II, elevated Body Mass Index (BMI) emerged as a notable risk factor. Such patients are predisposed to comorbid chronic conditions like hypertension and diabetes due to obesity and an unhealthy lifestyle22,23,24, significantly heightening the risk of HF25,26. Furthermore, in obese patients undergoing colorectal cancer surgery, whether via open or laparoscopic-assisted procedures, achieving local exposure of the surgical field becomes more challenging. Additionally, individuals with high BMI tend to possess more perivascular adipose tissue, increasing the likelihood of injury. Collectively, these factors contribute to heightened procedural complexity, necessitate advanced surgical techniques, and elevate the risk of intraoperative bleeding, thereby increasing the hemodynamic burden on the patient. Zhou’s work also substantiates that high BMI independently serves as a risk factor for HF, demonstrated through an obesity-induced animal model27. In predictive analyses of samples II and III, a history of smoking and chronic obstructive pulmonary disease (COPD) played pivotal roles in postoperative HF development. Long-term smoking and chronic inflammation elevate the cardiopulmonary burden, potentially leading to pulmonary stasis, compensatory myocardial hypertrophy, and compromised cardiac function. Importantly, these patients may exhibit altered airway structures due to chronic inflammation, reduced protective defenses, an increased likelihood of pulmonary infection, and heightened susceptibility to postoperative HF28. This underscores the importance of clinicians providing active health education, encouraging smoking cessation two to three weeks before surgery, promoting appropriate exercise to enhance cardiopulmonary function, and closely monitoring pulmonary signs and laboratory parameters. Prophylactic use of antimicrobial drugs may also mitigate the risk of pulmonary infections.
A 20-year follow-up investigation highlighted hypertension as an independent risk factor for heart failure (HF)29. Correspondingly, a recent study reported a twofold higher risk of HF in hypertensive patients compared to those with normal blood pressure30. Our study aligns with these findings, underscoring a heightened risk of postoperative HF in hypertensive individuals. Hypertensive patients exhibit increased cardiac ejection resistance and oxygen consumption, contributing to pathological outcomes like cardiomyocyte hypertrophy and interstitial fibrosis, thereby elevating HF risk. Additionally, elevated blood pressure activates the renin-angiotensin-aldosterone system (RAAS) and sympathetic nervous system, causing excessive contraction of the systolic left ventricular wall, resulting in myocardial hypertrophy and ventricular wall thickening. Giovanni’s experimental study, involving 2078 participants, further corroborates the strong association between left ventricular wall hypertrophy and the development of heart failure with preserved ejection fraction (HFpEF) and heart failure with reduced ejection fraction (HFrEF)31. Consequently, we assert that a history of hypertension serves as a crucial high-risk factor for predicting postoperative HF. Clinicians are urged to prioritize hypertensive patients preoperatively, emphasizing active health education and lifestyle modifications, such as smoking cessation, reduced salt and sugar intake, and appropriate exercise32. Clinical decision-makers can also consider employing thiazide-like diuretics, angiotensin-converting enzyme inhibitors, and angiotensin II receptor blockers for high-risk patients to mitigate blood pressure levels and enhance myocardial remodeling33.
Recently, some researchers have sought to detect heart failure using imaging methods that could replace echocardiography, positing that these alternatives have the potential to reduce global healthcare expenditures. However, this approach has been associated with an elevated misdiagnosis rate and an increased risk of patient mortality. Despite the successful construction of several heart failure (HF)-related prediction models in recent years, these studies have tended to focus primarily on the general clinical characteristics of patients34,35. The development of postoperative HF is intricately influenced by neurological and endocrine systems, where alterations in these systems can lead to the overexpression of proinflammatory cytokines and signaling molecules. In light of this, the present study incorporated relevant inflammatory indicators for discussion, revealing a higher incidence of HF in patients with elevated postoperative Neutrophil-to-Lymphocyte Ratio (NLR) and C-reactive protein (CRP). A meta-analysis of 14 studies further corroborated a strong association between patients’ inflammatory markers and HF development36. Similarly, Christian employed CRP to predict patients’ cardiovascular function, offering precise individualized treatment and thereby supporting the findings of the current study37,38,39.
The present study undertook a rigorous and comprehensive evaluation of the model, assessing its discrimination, calibration, and clinical utility. This analysis holds the potential to establish a paradigm for forecasting postoperative complications in complex abdominal surgeries exhibiting similar characteristics. However, several notable limitations warrant consideration. Although the study incorporated a wide array of variables, the primary emphasis was placed on demographics, preoperative evaluations, surgical type, and intraoperative data. Key postoperative factors related to heart failure, such as postoperative care, long-term medication adherence, and the patient’s mental health, were regrettably not incorporated, thus potentially restricting the model’s capacity to predict heart failure in a more holistic manner. Additionally, certain variables may be susceptible to measurement inaccuracies or inconsistencies (e.g., subjective interpretation of preoperative test results), introducing the possibility of bias into the analysis. Notably, the potential cardiotoxicity of certain chemotherapy agents, particularly anthracyclines, which are frequently employed in colon cancer treatment, was not addressed. Future investigations should prioritize exploring this concern in greater depth. While the study predominantly focused on patients with HFrEF and HFpEF, ensuring a broad representation of heart failure subtypes within the postoperative cohort, we acknowledge that different HF types may carry distinct prognostic implications, which constitutes a limitation of the current analysis. Future research would benefit from a more nuanced examination of specific HF subtypes. Furthermore, while the machine learning algorithm demonstrated enhanced accuracy, its model was more intricate and less interpretable. The entire computational and decision-making process of the model operates as a black box, lacking the intuitive clarity of logistic regression models40,41,42. The data utilized in this study were derived from a single-center cohort of colon cancer patients, which may not fully reflect patient populations in other regions or healthcare institutions. Notably, variations in the incidence of postoperative heart failure across different populations may result in the model performing less optimally in external cohorts than in the current study. Disparities in surgical techniques, standards of care, and the pathological characteristics of patients across hospitals may influence both the incidence and the risk factors for heart failure, thereby limiting the generalizability of the model. Multicenter prospective studies should be incorporated into future research to further enhance the reliability and robustness of the results. To enable clinical application of the model, various patient datasets can be input for external validation, providing an estimation of the probability of postoperative complications and assessing the model’s accuracy. However, the success of the model’s implementation relies on the precision and consistency of data input. Discrepancies in data formats and quality across hospitals, or even within different departments of the same institution at varying times, pose a significant challenge to its widespread adoption. To address this, hospital information systems must standardize data collection protocols, ensuring seamless integration of preoperative, intraoperative, and postoperative data to facilitate real-time automated predictions. Moving forward, we aim to incorporate the model into a clinical decision support system (CDSS) that will assist physicians in making critical preoperative and intraoperative decisions. This CDSS will feature a streamlined user interface and integrate other essential information, such as surgical risk scores and postoperative care plans, offering comprehensive guidance to clinicians.
Conclusion
This study employed the XGBoost machine learning algorithm to construct a predictive model for assessing the risk of heart failure (HF) following colorectal cancer surgery (CME). The model exhibited commendable accuracy and clinical utility, offering valuable support for timely diagnosis by surgeons. The findings underscored that HF remains a significant concern for CME patients post-surgery, with strong associations identified with advanced age, preoperative and intraoperative tachycardia, BMI ≥ 25 kg/m2, a history of smoking, a history of chronic obstructive pulmonary disease (COPD), a history of hypertension, Neutrophil-to-Lymphocyte Ratio (NLR) ≥ 3, and C-reactive protein (CRP) levels ≥ 10 mg/l.
Data availability
The original data presented in the study are included in the article/Supplementary Tables 1, and further inquiries can be directed to the corresponding author (2021122190@stu.njmu.edu.cn). This work is supported by the Extreme Smart Analysis platform (https://www.xsmartanalysis.com/). In addition, to ensure that versions of custom code, software, or algorithms described in the publication are maintained, we will release them as uploads to GitHub (https://github.com/SongyunZhao123/zsy.git).
References
Mattiuzzi, C., Sanchis-Gomar, F. & Lippi, G. Concise update on colorectal Cancer epidemiology. Ann. Transl Med. 7 (21), 609. https://doi.org/10.21037/atm.2019.07.91 (2019).
Hohenberger, W., Weber, K., Matzel, K., Papadopoulos, T. & Merkel, S. Standardized Surgery for Colonic Cancer: Complete Mesocolic Excision and Central Ligation–Technical Notes and Outcome. Colorectal Dis 11(4):354 – 64; discussion 64 – 5. Epub 20091105. (2009). https://doi.org/10.1111/j.1463-1318.2008.01735.x
Jayne, D. G. et al. Five-Year Follow-up of the medical research Council clasicc trial of laparoscopically assisted versus open surgery for colorectal Cancer. Br. J. Surg. 97 (11), 1638–1645. https://doi.org/10.1002/bjs.7160 (2010).
McAdams-DeMarco, M. A. et al. Frailty as a novel predictor of mortality and hospitalization in individuals of all ages undergoing Hemodialysis. J. Am. Geriatr. Soc. 61 (6), 896–901. https://doi.org/10.1111/jgs.12266 (2013). Epub 20130527.
Devereaux, P. J. et al. Association of postoperative High-Sensitivity troponin levels with myocardial injury and 30-Day mortality among patients undergoing noncardiac surgery. Jama 317 (16), 1642–1651. https://doi.org/10.1001/jama.2017.4360 (2017).
Jackson, S. L. et al. National burden of heart failure events in the united states, 2006 to 2014. Circ. Heart Fail. 11 (12), e004873. https://doi.org/10.1161/circheartfailure.117.004873 (2018).
Croner, R. S., Ptok, H., Merkel, S. & Hohenberger, W. Implementing complete mesocolic excision for Colon Cancer - Mission completed? Innov. Surg. Sci. 3 (1), 17–29. https://doi.org/10.1515/iss-2017-0042 (2018). Epub 20180210.
Li, W. et al. A tool for predicting overall survival in patients with ewing sarcoma: A multicenter retrospective study. BMC Cancer. 22 (1), 914. https://doi.org/10.1186/s12885-022-09796-7 (2022). Epub 20220823.
Li, W. et al. Risk analysis of pulmonary metastasis of chondrosarcoma by Establishing and validating a new clinical prediction model: A clinical study based on seer database. BMC Musculoskelet. Disord. 22 (1), 529. https://doi.org/10.1186/s12891-021-04414-2 (2021). Epub 20210609.
Cypko, M. A., Stoehr, M., Oeltze-Jafra, S., Dietz, A. & Lemke, H. U. A guide for constructing bayesian network graphs of Cancer treatment decisions. Stud. Health Technol. Inf. 245, 1355 (2017).
Li, W. et al. Development of a Machine Learning-Based Predictive Model for Lung Metastasis in Patients with Ewing Sarcoma. Front Med (Lausanne) 9:807382. Epub 20220401. (2022). https://doi.org/10.3389/fmed.2022.807382
Zong, X. et al. Phenylacetylglutamine as a risk factor and prognostic Indicator of heart failure. ESC Heart Fail. 9 (4), 2645–2653. https://doi.org/10.1002/ehf2.13989 (2022). Epub 20220527.
Wang, L., Wang, X., Chen, A., Jin, X. & Che, H. Prediction of Type 2 Diabetes Risk and Its Effect Evaluation Based on the Xgboost Model. Healthcare (Basel) 8(3). Epub 20200731. (2020). https://doi.org/10.3390/healthcare8030247
Li, W. et al. A Machine Learning-Based Predictive Model for Predicting Lymph Node Metastasis in Patients with Ewing’s Sarcoma. Front Med (Lausanne) 9:832108. Epub 20220406. (2022). https://doi.org/10.3389/fmed.2022.832108
Wang, H. et al. Prediction of the risk of C5 palsy after posterior laminectomy and fusion with cervical myelopathy using a support vector machine: an analysis of 184 consecutive patients. J. Orthop. Surg. Res. 16 (1), 332. https://doi.org/10.1186/s13018-021-02476-5 (2021). Epub 20210521.
Hao, P. Y., Chiang, J. H. & Chen, Y. D. Possibilistic classification by support vector networks. Neural Netw. 149, 40–56. https://doi.org/10.1016/j.neunet.2022.02.007 (2022). Epub 20220212.
Li, W. et al. Establishment and validation of a nomogram and web calculator for the risk of new vertebral compression fractures and cement leakage after percutaneous vertebroplasty in patients with osteoporotic vertebral compression fractures. Eur. Spine J. 31 (5), 1108–1121. https://doi.org/10.1007/s00586-021-07064-z (2022). Epub 20211125.
Li, W. et al. An External-Validated Prediction Model to Predict Lung Metastasis among Osteosarcoma: A Multicenter Analysis Based on Machine Learning. Comput Intell Neurosci 2022:2220527. Epub 20220506. (2022). https://doi.org/10.1155/2022/2220527
Steg, P. G. et al. Determinants and prognostic impact of heart failure complicating acute coronary syndromes: observations from the global registry of acute coronary events (Grace). Circulation 109 (4), 494–499. https://doi.org/10.1161/01.Cir.0000109691.16944.Da (2004). Epub 20040126.
Lewis, E. F. et al. Predictors of late development of heart failure in stable survivors of myocardial infarction: the care study. J. Am. Coll. Cardiol. 42 (8), 1446–1453. https://doi.org/10.1016/s0735-1097(03)01057-x (2003).
O’Connor, C. M. et al. Clinical characteristics and Long-Term outcome of patients in whom congestive heart failure develops after thrombolytic therapy for acute myocardial infarction: development of a predictive model. Am. Heart J. 133 (6), 663–673. https://doi.org/10.1016/s0002-8703(97)70168-6 (1997).
Stamler, J. Epidemiologic findings on body mass and blood pressure in adults. Ann. Epidemiol. 1 (4), 347–362. https://doi.org/10.1016/1047-2797(91)90045-e (1991).
Chan, J. M. et al. Fat distribution, and weight gain as risk factors for clinical diabetes in men. Diabetes Care. 17 (9), 961–969. https://doi.org/10.2337/diacare.17.9.961 (1994).
Colditz, G. A., Willett, W. C., Rotnitzky, A. & Manson, J. E. Weight gain as a risk factor for clinical diabetes mellitus in women. Ann. Intern. Med. 122 (7), 481–486. https://doi.org/10.7326/0003-4819-122-7-199504010-00001 (1995).
Kannel, W. B. & McGee, D. L. Diabetes and glucose tolerance as risk factors for cardiovascular disease: the Framingham study. Diabetes Care. 2 (2), 120–126. https://doi.org/10.2337/diacare.2.2.120 (1979).
Cho, E. et al. A prospective study of obesity and risk of coronary heart disease among diabetic women. Diabetes Care. 25 (7), 1142–1148. https://doi.org/10.2337/diacare.25.7.1142 (2002).
Zhou, Y. T. et al. Lipotoxic heart disease in obese rats: implications for human obesity. Proc. Natl. Acad. Sci. U S A. 97 (4), 1784–1789. https://doi.org/10.1073/pnas.97.4.1784 (2000).
Fuse, K. et al. Marker chromosome is a strong poor prognosis factor after allogeneic Hsct for Adverse-Risk aml patients. Eur. J. Haematol. 105 (5), 616–625. https://doi.org/10.1111/ejh.13495 (2020). Epub 20200811.
Levy, D., Larson, M. G., Vasan, R. S., Kannel, W. B. & Ho, K. K. The progression from hypertension to congestive heart failure. Jama 275 (20), 1557–1562 (1996).
Vasan, R. S. & Levy, D. The role of hypertension in the pathogenesis of heart failure. A clinical mechanistic overview. Arch. Intern. Med. 156 (16), 1789–1796 (1996).
de Simone, G., Gottdiener, J. S., Chinali, M. & Maurer, M. S. Left ventricular mass predicts heart failure not related to previous myocardial infarction: the cardiovascular health study. Eur. Heart J. 29 (6), 741–747. https://doi.org/10.1093/eurheartj/ehm605 (2008). Epub 20080119.
Georgiopoulou, V. V., Kalogeropoulos, A. P. & Butler, J. Heart failure in hypertension: prevention and treatment. Drugs 72 (10), 1373–1398. https://doi.org/10.2165/11631100-000000000-00000 (2012).
Sciarretta, S., Palano, F., Tocci, G., Baldini, R. & Volpe, M. Antihypertensive treatment and development of heart failure in hypertension: A bayesian network Meta-Analysis of studies in patients with hypertension and high cardiovascular risk. Arch. Intern. Med. 171 (5), 384–394. https://doi.org/10.1001/archinternmed.2010.427 (2011). Epub 20101108.
Zheng, L., Smith, N. J., Teng, B. Q., Szabo, A. & Joyce, D. L. Predictive model for heart failure readmission using nationwide readmissions database. Mayo Clin. Proc. Innov. Qual. Outcomes. 6 (3), 228–238. https://doi.org/10.1016/j.mayocpiqo.2022.04.002 (2022). Epub 20220517.
de Oliveira Correia, E. T., Mechanick, J. I., Dos Santos Barbetta, L. M., Jorge, A. J. L. & Mesquita, E. T. Cardiometabolic-Based Chronic Disease: Adiposity and Dysglycemia Drivers of Heart Failure. Heart Fail Rev Epub 20220404. (2022). https://doi.org/10.1007/s10741-022-10233-x
Zhang, S. et al. Predictive value of neutrophil to lymphocyte ratio in patients with acute St segment elevation myocardial infarction after percutaneous coronary intervention: A Meta-Analysis. BMC Cardiovasc. Disord. 18 (1), 75. https://doi.org/10.1186/s12872-018-0812-6 (2018). Epub 20180502.
Al Aseri, Z. A., Habib, S. S. & Marzouk, A. Predictive value of high sensitivity C-Reactive protein on progression to heart failure occurring after the first myocardial infarction. Vasc Health Risk Manag. 15, 221–227. https://doi.org/10.2147/vhrm.S198452 (2019). Epub 20190715.
Xu, M., Yan, L., Xu, J., Yang, X. & Jiang, T. Predictors and prognosis for incident in-Hospital heart failure in patients with preserved ejection fraction after first acute myocardial infarction: an observational study. Med. (Baltim). 97 (24), e11093. https://doi.org/10.1097/md.0000000000011093 (2018).
Stumpf, C. et al. C-Reactive protein levels predict systolic heart failure and outcome in patients with first St-Elevation myocardial infarction treated with coronary angioplasty. Arch. Med. Sci. 13 (5), 1086–1093. https://doi.org/10.5114/aoms.2017.69327 (2017). Epub 20170803.
Wang, H. et al. Analysis of risk factors for perioperative hidden blood loss in unilateral biportal endoscopic spine surgery: A retrospective multicenter study. J. Orthop. Surg. Res. 16 (1), 559. https://doi.org/10.1186/s13018-021-02698-7 (2021). Epub 20210915.
Li, W. et al. Machine learning applications for the prediction of bone cement leakage in percutaneous vertebroplasty. Front. Public. Health. 9, 812023. https://doi.org/10.3389/fpubh.2021.812023 (2021). Epub 20211210.
Li, W. et al. Machine Learning-Based Prediction of Lymph Node Metastasis among Osteosarcoma Patients. Front Oncol 12:797103. Epub 20220420. (2022). https://doi.org/10.3389/fonc.2022.797103
Acknowledgements
We thank all the participants involved in this study.
Funding
This work was supported by Wuxi Taihu Lake Talent Plan, Supports for Leading Talents in Medical and Health Profession(2020THRC-DJ-SNW).
Author information
Authors and Affiliations
Contributions
Yuan Liu conceived the study. Yu Zhang, Yuan Liu and Yuankun Liu drafted the manuscript. Pengpeng Zhang, Jiaheng Xie, Yi Xie and Ning Zhao performed the literature search and collected the data. Chao Cheng and Songyun Zhao helped with the final revision of this manuscript. All authors contributed to the article and approved the submitted version.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Ethical approval and consent to participate
This study was conducted in compliance with the principles outlined in the Declaration of Helsinki and received approval from the Ethics Committee of Wuxi People’s Hospital, with approval number KY22085. Due to the retrospective nature of the study, the Ethics Committee of Wuxi People’s Hospita waived the need of obtaining informed consent’ in the manuscript. Before analysis, all confidential patient information was removed from the complete dataset to ensure data privacy and anonymity.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Liu, Y., Liu, Y., Zhang, Y. et al. Using machine learning algorithms to predict risk factors of heart failure after complete mesocolic excision in colorectal cancer patients. Sci Rep 15, 25441 (2025). https://doi.org/10.1038/s41598-025-11726-z
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41598-025-11726-z





