Abstract
Endovascular aortoiliac revascularization is a common treatment option for peripheral artery disease that carries non-negligible risks. Outcome prediction tools may support clinical decision-making but remain limited. We developed machine learning algorithms that predict 30-day post-procedural outcomes. The National Surgical Quality Improvement Program targeted vascular database was used to identify patients who underwent endovascular aortoiliac revascularization between 2011–2021. Input features included 37 pre-operative demographic/clinical variables. The primary outcome was 30-day post-procedural major adverse limb event (MALE) or death. Data were split into training (70%) and test (30%) sets. Using 10-fold cross-validation, 6 machine learning models were trained using pre-operative features. Overall, 6601 patients were included, and 30-day MALE/death occurred in 470 (7.1%) individuals. The best-performing model was XGBoost, achieving an AUROC (95% CI) of 0.94 (0.93–0.95). In comparison, logistic regression had an AUROC (95% CI) of 0.74 (0.73–0.76). The XGBoost model accurately predicted 30-day post-procedural outcomes, performing better than logistic regression.
Similar content being viewed by others
Introduction
Peripheral artery disease (PAD) impacts more than 200 million people globally, leading to reduced quality of life, mounting health care costs, amputation, and death1,2,3,4. A distinct subset of PAD is aortoiliac occlusive disease (AIOD), which involves atherosclerosis of the infrarenal aorta and iliac arteries, affecting approximately 14% of the population in general5. Historically, advanced AIOD was treated with open revascularization6. Over the last few decades, endovascular intervention has become an increasingly common less invasive alternative7. Nevertheless, endovascular revascularization of AIOD carries a significant risk of adverse events, with a systematic review of 19 studies demonstrating mortality and complication rates up to 6.7% and 45%, respectively8. Consequently, careful assessment of perioperative risk is recommended by the Global Vascular Guidelines when patients are considered for revascularization9.
There are currently no standardized tools to support prediction of complications following endovascular revascularization for AIOD. The Vascular Quality Initiative (VQI) Cardiac Risk Index (CRI) is limited to open revascularization10. Other tools such as the American College of Surgeons (ACS) National Surgical Quality Improvement Program (NSQIP) online surgical risk calculator11 rely on modeling techniques that require manual input of clinical variables, making them challenging to use in busy medical environments12. Therefore, there is an important need to develop better and more practical risk prediction tools for patients undergoing endovascular aortoiliac revascularization.
Machine learning (ML) is a rapidly evolving technology that enables computers to learn from large amounts of data and make accurate predictions13. Using advanced analytics, ML can learn non-linear, complex relationships between user inputs, such as patient characteristics, and outputs, such as clinical outcomes13. The advantage of newer ML techniques over traditional statistical methods is that they can better model non-linear relationships between covariates and outcomes14, which is common in health care data15. Previously, we built a ML algorithm that can accurately predict 30-day complications following open revascularization for AIOD, which achieved better performance compared to logistic regression16. To complement this model and provide further guidance for the management of patients with advanced AIOD, NSQIP data were used to train ML algorithms that can predict 30-day outcomes following endovascular revascularization for AIOD.
Results
Patients and events
Overall, 7102 patients underwent endovascular revascularization for AIOD in the targeted NSQIP vascular database from 2011 to 2021. The following exclusions were made: intervention performed for aortoiliac aneurysm (n = 162), acute limb ischemia (n = 7), or dissection (n = 17), unreported symptom status (n = 132) or procedure type (n = 58), and concurrent major amputation (n = 7) or surgical bypass (n = 118). Overall, 6601 patients were included. The primary outcome of 30-day major adverse limb event (MALE) or death occurred in 470 (7.1%) individuals. The 30-day secondary endpoints occurred in this distribution: major vascular reintervention (n = 242 [3.7%]), untreated loss of patency (n = 63 [1.0%]), major amputation (n = 113 [1.7%]), death (n = 130 [2.0%]), major adverse cardiovascular event (MACE, n = 314 [4.8%]; composite of myocardial infarction (n = 205), stroke (n = 22), and death (n = 130)), wound complication (n = 236 [3.6%]), bleeding requiring transfusion or secondary procedure (n = 374 [5.7%]), other morbidity (n = 337 [5.1%]; composite of pneumonia (n = 75), unplanned reintubation (n = 67), pulmonary embolism (n = 7), failure to wean from ventilator (n = 47), acute kidney injury (n = 72), urinary tract infection (n = 42), cardiac arrest (n = 40), deep vein thrombosis (n = 30), sepsis (n = 51), septic shock (n = 42), Clostridium difficile infection (n = 14)), non-home discharge (n = 430 [6.5%]), and unplanned readmission (n = 507 [7.7%]). A flowchart summarizing patient selection and outcomes is reported in Fig. 1.
Pre-operative characteristics
In comparison to patients who did not develop 30-day MALE or death, individuals with a primary outcome were older with a greater proportion being Black, residing in nursing homes or other facilities, or transferred from another hospital. A higher percentage of individuals with 30-day MALE or death had congestive heart failure (CHF), insulin-dependent diabetes, end stage renal disease (ESRD) requiring dialysis, and ≥1 high-risk physiologic factor, and were partially or totally dependent functionally. Despite having more cardiovascular comorbidities, a lower proportion of patients with an event received statins or antiplatelets. A greater proportion of patients with 30-day MALE or death had an ankle brachial index (ABI) ≤ 0.39 or no recorded ABI and pedal pulses that were non-palpable, as well as a prior surgical bypass involving the segment being treated. Individuals with a primary outcome were more likely to present with chronic limb threatening ischemia (CLTI), undergo more distal revascularization of the aortoiliac segment (i.e., external iliac rather than common iliac intervention), receive urgent or emergent intervention, and have an American Society of Anesthesiologists (ASA) classification ≥4. Patients who underwent intervention by vascular surgeons were less likely to develop a primary outcome compared to other specialists (Table 1).
Model performance
Six different ML models were trained and subsequently assessed on testing data for predicting 30-day MALE or death after endovascular revascularization for AIOD. Extreme Gradient Boosting (XGBoost) achieved the top performance with an area under the receiver operating characteristic curve [AUROC] (95% CI) of 0.94 (0.93–0.95) in comparison to random forest [0.92 (0.91–0.93)], radial basis function support vector machine [0.90 (0.89–0.91)], Naïve Bayes [0.85 (0.84–0.87)], multilayer perceptron artificial neural network [0.76 (0.74–0.77)], and logistic regression [0.74 (0.73–0.76)]. XGBoost attained the following secondary performance metrics: accuracy 0.86 (95% CI 0.85–0.88), specificity 0.87, sensitivity 0.86, negative predictive value 0.86, and positive predictive value 0.87 (Table 2).
XGBoost obtained the following AUROC’s (95% CI) for predicting 30-day secondary endpoints: major vascular reintervention [0.86 (0.85–0.87)], untreated loss of patency [0.95 (0.94–0.96)], major amputation [0.97 (0.86–0.98)], death [0.97 (0.96–0.98)], MACE [0.90 (0.89–0.91)], bleeding requiring transfusion or secondary procedure [0.95 (0.94–0.96)], wound complication [0.90 (0.89–0.91)], other morbidity [0.88 (0.87–0.89)], non-home discharge [0.97 (0.96–0.98)], and unplanned readmission [0.88 (0.87–0.89)] (Table 3). Logistic regression had worse performance compared to XGBoost across all primary and secondary outcomes, with AUROC’s ranging from 0.70–0.78.
Figure 2 illustrates the ROC curve for XGBoost in predicting 30-day MALE or death. The model was well-calibrated, achieving a Brier score of 0.08, which demonstrates excellent agreement between predicted/observed event probabilities (Fig. 3). For XGBoost, the most important predictors of 30-day MALE or death were: (1) CLTI, (2) functional status, (3) ≥ 1 high-risk physiologic factor, (4) pre-operative dialysis, (5) CHF, (6) transferred from another hospital, (7) urgency, (8) diabetes, (9) primary procedure (more distal revascularization of the aortoiliac segment), and (10) absence of pre-operative statin (Fig. 4). Subgroup analysis of feature importance based on symptom status demonstrated that eight of the ten most important features were identical for CLTI patients and individuals who were asymptomatic or claudicants, and the two most important features were functional status and ≥1 high-risk physiologic factor for both subgroups (Supplementary Fig. 1).
Subgroup analysis
Model performance was excellent across subgroups based on sex, age, race, ethnicity, procedure type, symptom status, urgency, and concurrent infrainguinal endovascular revascularization with a range of AUROC values from 0.92 to 0.95 and no significant differences were found between majority/minority subgroups (Supplementary Figs. 2–9).
Discussion
Using targeted NSQIP vascular data from 2011 to 2021, which comprised 6601 patients who received endovascular revascularization for AIOD, we built ML algorithms that pre-operatively predict 30-day MALE or death with excellent performance (AUROC 0.94). Our models also predicted 30-day untreated loss of patency, major vascular reintervention, major amputation, death, MACE, wound complication, bleeding, other morbidity, non-home discharge, and unplanned readmission with AUROC’s ranging from 0.86 to 0.97. There were three other notable findings. First, individuals who suffer adverse events following endovascular aortoiliac revascularization are a high-risk cohort with predictive factors pre-operatively. They have more comorbidities and poorer functional status, with a greater proportion having high-risk anatomic and physiologic features. Furthermore, a greater proportion of patients with adverse events had CLTI, underwent more distal revascularization, and required urgent/emergent intervention. Despite these differences, they were less likely to receive optimal medical therapy including antiplatelets and statins. This represents an important opportunity to improve medical management of patients with PAD. Second, we evaluated six different ML models and XGBoost achieved the best predictive performance. Our algorithm was well-calibrated and remained robust across demographic/clinical subgroups. Third, we reported the most important predictors of 30-day MALE/death in our algorithms. These variables can help clinicians understand patient characteristics that lead to specific risk predictions, thereby supporting patient/procedure selection and pre-operative optimization.
Risk prediction tools for patients undergoing endovascular aortoiliac revascularization are limited. Bertges et al.10 used multivariable logistic regression to develop the VQI CRI for predicting myocardial infarction after carotid endarterectomy, lower extremity bypass, and aortic aneurysm repair, achieving an overall AUROC of 0.7510. Notably, the model did not include endovascular aortoiliac revascularization, and therefore, our work fills an important gap in this tool10. Applying ML to a contemporary NSQIP cohort, we achieved an AUROC of 0.94 for predicting MALE or death at 30 days following endovascular revascularization for AIOD. This demonstrates the benefit of leveraging advanced data analytics to build procedure-specific risk prediction models using contemporary datasets. Our model provides accurate risk predictions for a patient population that has often not been included in existing tools10 and complements our previously described ML algorithm for predicting outcomes at 30 days after open revascularization for AIOD16.
Bonde and colleagues (2021) built ML models on a sample of patients in the NSQIP database undergoing more than 2900 different procedures to predict post-operative complications, attaining AUROC’s between 0.85 and 0.8817. Since individuals with PAD are a high-risk population with many vascular comorbidities, the ability for general risk prediction tools to perform well on this cohort could be limited18. By building ML models tailored to individuals receiving endovascular revascularization for AIOD, we attained AUROC’s above 0.90. Our algorithms can also predict clinically relevant limb-related outcomes such as major vascular reintervention and major amputation, which are of importance to interventionalists and vascular surgeons. Consequently, developing ML algorithms specific to endovascular aortoiliac revascularization can increase performance and applicability.
Our findings can be interpreted in several ways. First, individuals who suffer from adverse events after endovascular revascularization for AIOD are a high-risk group with many cardiovascular risk factors19. Multiple societal guidelines recommend statins and antiplatelets for all PAD patients9,20,21,22, yet individuals who had complications in our study were less likely to be taking these medications. The BEST-CLI trial corroborates the fact that the rates of best medical therapy are suboptimal in PAD patients23. Consequently, there remain critical opportunities to improve outcomes for PAD patients by accurately determining their perioperative risk and optimizing them medically before intervention. In our ML algorithm, we found that clinical presentation with CLTI was the most important predictor of 30-day MALE or death, which corroborates existing literature given that these patients generally have more extensive lesions and severe ischemia, increasing the risk of limb loss and adverse events24. Poor pre-operative functional status was also a strong predictive feature for worse outcomes, as it correlates well with the overall health status of the patient and their ability to tolerate physiologically demanding procedures25. Other important predictors included high-risk physiological factors, pre-operative dialysis, pre-operative CHF, among others, highlighting that the overall clinical status of the patient and their pre-existing conditions are important predictors of their post-operative outcomes26. Second, the ML models achieved better performance than existing tools for various reasons. In comparison to logistic regression, newer ML technology can better learn non-linear and complex relationships between covariates and outcomes27,28. This is particularly important in health care data because many demographic/clinical factors can influence patient outcomes29. XGBoost was the top-performing algorithm, which has important advantages over other ML approaches including relatively fewer issues with overfitting in addition to quicker computing while retaining precision30,31,32. Additionally, XGBoost performs well on structured data, which could be the reason it outperformed more complicated algorithms such as neural networks33. These findings are corroborated by previous work demonstrating that ML algorithms can achieve better discrimination than conventional statistical models34,35. Additionally, we used methodology that strengthens model validity including calibration measurement, adherence to reporting standards (TRIPOD + AI statement), and collaboration between computer scientists, biostatisticians, and clinicians34,35. Third, the performance of our algorithms remained excellent across demographic/clinical subpopulations. This is notable given that algorithm bias against underrepresented populations remains an important concern within ML research36. These biases were likely avoided because ACS NSQIP is a multi-national database that captures diverse patient populations with rich sociodemographic information37,38. Fourth, a small proportion of patients in our cohort underwent revascularization for asymptomatic disease (~5%). The reasons for these interventions are unclear from our dataset but may be related to treatment of hemodynamically significant stenoses of previous revascularization procedures, patient preference, poor adherence to guideline-directed revascularization, or coding errors39.
Clinical decision-making can be supported by our ML algorithms in several ways. At the pre-operative stage, individuals predicted to be at elevated risk of adverse events could be assessed further in terms of both non-modifiable and modifiable factors26. Individuals with risks that are non-modifiable could be considered for alternative options including medical management alone, amputation, or close surveillance40,41,42. Those with modifiable risks could benefit from further evaluation and optimization including referrals to anesthesiologists, cardiologists, and/or internal medicine specialists when appropriate43,44. Conversely, low risk patients could be considered for open revascularization, which may be more durable45. Post-operatively, high risk patients could receive increased monitoring on the ward or intensive care unit46. Individuals at elevated risk for unplanned readmission or non-home discharge could receive increased support from allied health to ensure safe discharge planning47. These clinical decisions supported by the ML risk prediction tool may improve outcomes by mitigating complications related to endovascular revascularization for AIOD.
The ML algorithm programming code is openly available on GitHub. This tool may be utilized by clinicians caring for patients under consideration for endovascular revascularization for AIOD. The algorithm could be implemented at the >700 NSQIP-participating centers globally. There is also potential to implement the model at non-NSQIP sites because the input variables are commonly captured for the routine care of vascular patients48. Given the challenges of deploying prediction models into clinical practice, consideration of implementation science principles is critical49. Our ML models have the advantage of providing automated risk predictions, thereby improving practicality in busy clinical settings compared with traditional risk predictors that generally require manual input of variables11. Specifically, our algorithm can autonomously extract a patient’s NSQIP data to make risk predictions. The clinical applicability of our risk prediction tool stems from both its automated nature and highly accurate predictions that can guide vascular specialists in terms of patient/procedure selection, counseling, and peri-operative management to improve care for patients with AIOD being considered for endovascular revascularization. We advocate for dedicated health care data analytics teams at the institution level, as their benefits to patient care have been previously demonstrated and model implementation can be facilitated by these experts50.
Our study has various limitations. First, the algorithms were built using data from NSQIP. Additional studies are needed to evaluate whether generalization of predictive performance can be made to non-NSQIP centers. Second, our dataset captured 30-day endpoints. Assessment of ML algorithms on datasets with longer follow-up periods could support prediction of long-term risk. Third, several variables that may contribute to risk predictions were not captured in our dataset, including lesion characteristics and use of low-dose rivaroxaban, drug eluting technology, post-procedural antiplatelet therapy, intravascular lithotripsy, covered stents, and intravascular ultrasound. Future predictive models developed on datasets that record these variables could improve accuracy. Additionally, some data points were unknown, not reported, or not documented; further refinement of datasets with more complete clinical information may improve model performance. Fourth, the sample size was lower than expected over a 10-year period likely because ACS NSQIP is primarily a surgical database, and procedures performed by interventional radiologists or other non-surgical specialists may be under-captured. Additional investigation of endovascular aortoiliac revascularization performed by non-surgical specialists may increase the sample size for analysis.
Using the NSQIP targeted vascular database, we built ML algorithms that predict 30-day MALE/death and other clinically relevant outcomes following endovascular revascularization for AIOD with excellent performance using pre-operative data. Given that our ML algorithms performed better than existing tools and logistic regression, they have potential for important utility in the peri-operative management of patients being considered for endovascular aortoiliac revascularization to mitigate adverse outcomes. Prospective validation of our prediction models is warranted.
Methods
Design
This was an ML-based prognostic study and findings were reported based on the Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis + Artificial Intelligence (TRIPOD + AI) statement51. The methods were based on our previous work to demonstrate the robustness and reproducibility of our ML model development and evaluation process16.
Dataset
The ACS NSQIP database contains data on patients from >700 hospitals in ~15 countries worldwide52. Trained clinical reviewers prospectively collect data from electronic health records and ACS regularly audits the data for accuracy53. Targeted vascular registries in NSQIP were established in 2011 and comprise additional variables and outcomes specific to vascular procedures54. Research ethics board review was not required as the data source was a deidentified registry.
Cohort
All individuals in the NSQIP targeted vascular database who received endovascular revascularization for AIOD (angioplasty or stent of the aorta or iliac arteries) for PAD between 2011 and 2021 were included. Individuals who underwent the procedure for aortoiliac aneurysm, dissection, acute limb ischemia, malignancy, or trauma, patients with unreported symptom status or procedure type, and those who received concurrent major amputation or bypass were excluded.
Features
The input features for the ML models included 37 pre-operative variables. To maximize model performance, all pre-operative NSQIP variables were used, as ML methods excel at handling many inputs. Feature selection was not performed as it worsened model performance, and given the goal of automated predictions, reducing the number of features would decrease predictive performance without a significant change to model run time. Demographic features included age, body mass index, sex, race, and ethnicity. Comorbidities consisted of diabetes, hypertension, smoking status, chronic obstructive pulmonary disease, CHF, ESRD on dialysis, functional status, and high-risk physiologic factor [defined as ≥1 of: (1) ESRD, (2) age above 80, (3) CHF class III or IV (New York Heart Association), (4) ejection fraction below 30%, (5) unstable angina <30 days before intervention, or (6) myocardial infarction <30 days before intervention]. Medications consisted of statins, antiplatelets, and beta blockers. Labs included serum creatinine, blood urea nitrogen, serum sodium, albumin, hematocrit, white blood cell count, platelet count, partial thromboplastin time, and international normalized ratio. Limb hemodynamics based on ABI and toe pressure, and high-risk anatomic factors (defined as a prior bypass or endovascular intervention involving the currently treated segment), were noted. Concurrent procedures documented were minor amputation (below the ankle) and endovascular infrainguinal revascularization. Other characteristics were symptom status [CLTI (defined as tissue loss or rest pain), claudication, or asymptomatic], procedure type, urgency of intervention (elective, urgent, or emergent), ASA class, and specialty of the primary operator. Urgent and emergent interventions, generally performed within 24–48 h and <24 h of admission, respectively, were mostly performed for patients with severe ischemia resulting in CLTI. Isolated internal iliac artery interventions were included and generally performed on patients with lifestyle-limiting buttock claudication, rest pain, and/or tissue loss. Supplementary Table 1 provides a complete list of input features and definitions.
Outcomes
The primary endpoint was 30-day post-procedural MALE (defined as a composite of major vascular reintervention, untreated loss of patency, or major amputation) or death. The definition of major vascular reintervention was thrombectomy/thrombolysis involving the treated segment or a new or revision aortoiliac bypass or interposition graft. The definition of untreated loss of patency was an occlusion of the treated segment with no subsequent revascularization. The definition of major amputation was a transtibial or more proximal amputation of the ipsilateral leg. The definition of death was all-cause mortality. These outcomes and definitions align with the Society for Vascular Surgery reporting standards55.
The 30-day secondary outcomes consisted of individual components of MALE or death, MACE, bleeding requiring transfusion or secondary procedure, wound complication, other morbidity, non-home discharge, and unplanned readmission. MACE was a composite endpoint of myocardial infarction (ischemic changes on electrocardiogram, elevated troponin, or diagnosis by a physician or advanced provider), stroke (neurologic dysfunction exceeding 24 h in the setting of suspected stroke), or death. The definition of wound complication was a non-healing wound at the incision (if present), cellulitis, or dehiscence. Other morbidity was a composite endpoint of unplanned reintubation, failure to wean from the ventilator (cumulative time of ventilator-assisted respirations over 48 h), pneumonia, pulmonary embolism, acute kidney injury (rise in serum creatinine over 2 mg/dL from the pre-operative value or requirement of dialysis in a patient who was not on dialysis pre-operatively), cardiac arrest, urinary tract infection, Clostridium difficile infection, deep vein thrombosis requiring therapy, sepsis, or septic shock. The definition of non-home discharge was discharge to skilled care, rehabilitation, or other facility. These outcomes are defined by the ACS NSQIP data dictionary56.
Model development
We trained 6 ML models to predict 30-day primary and secondary outcomes: Naïve Bayes classifier, XGBoost, radial basis function support vector machine, random forest, multilayer perceptron artificial neural network, and logistic regression. We selected these models because of their demonstrated efficacy in predicting postoperative outcomes57,58,59. We used logistic regression for baseline comparison because traditional risk predictors most commonly use this modeling technique60.
We randomly split our data into two subsets: 70% training and 30% testing61. Testing data was reserved for model evaluation and not used for model training. We performed 10-fold cross-validation and grid search on training data to identify hyperparameters that were optimized for each ML model, including logistic regression62,63. The multivariable logistic regression model was developed and optimized with the same rigor as the advanced ML models with 10-fold cross-validation and grid search62,63. Preliminary analysis showed that the primary endpoint was rare, affecting 470/6601 (7.1%) of patients. To improve class balance, Random Over-Sample Examples (ROSE) was applied to training data64. ROSE uses smoothed bootstrapping to draw new samples from the feature space neighborhood around the minority class and is a commonly used method to facilitate predictive modeling of rare events64. The models were then assessed on test set data and ranked based on the primary discriminatory metric of AUROC. Our best performing model was XGBoost, which had the following optimized hyperparameters for our dataset: number of rounds = 200, maximum tree depth = 3, learning rate = 0.3, gamma = 0, column sample by tree = 0.6, minimum child weight = 1, and subsample = 0.9. The process for selecting these hyperparameters is described in Supplementary Table 2. Once the top-performing ML algorithm for the primary endpoint was identified, the same model was further trained to predict secondary endpoints.
Statistical analysis
Baseline features were reported as means and standard deviations or numbers and proportions. Differences between patients with vs. without 30-day MALE or death were evaluated with independent t-test (continuous variables) or chi-square test (categorical variables). All continuous data were normally distributed. To account for multiple comparisons, Bonferroni correction was used to set statistical significance. AUROC was the primary model evaluation metric as it considers both sensitivity and specificity65. Secondary model evaluation metrics were specificity, sensitivity, negative predictive value, positive predictive value, and accuracy. Model robustness was evaluated using a calibration curve and Brier score, which measures the agreement between predicted/observed event probabilities66. Feature importance was assessed using the variable importance score (gain), which measures the influence of each covariate in contributing to a prediction67. We reported feature importance for 3 groups: the overall cohort, asymptomatic/claudication patients, and individuals with CLTI. Model performance was assessed across subgroups based on sex, age, race, ethnicity, procedure type, symptom status, urgency, and concurrent infrainguinal endovascular revascularization.
Using a sample size calculator validated for prediction models, we determined that to achieve an AUROC over 0.8 with an outcome rate of approximately 7% and 37 input variables, 3815 patients with 268 events was the minimum cohort size required68,69. We included 6601 patients who had 470 primary events, which satisfied this condition. There was < 5% missing data for variables of interest; therefore, available case analysis was applied whereby only non-missing covariates for each patient were considered70,71. This is a valid method for analyzing datasets with minimal missing data70,71. Data were missing completely at random, and imputation was not performed to reflect modeling of real-world data, which generally has missing information that is not imputed72. All subjects included in the study (n = 6601) were used in all analyses. R version 4.3.073 was used for all analyses with the following packages: caret74, xgboost75, ranger76, naivebayes77, e107178, nnet79, and pROC80.
Data availability
The data used for this study comes from the American College of Surgeons National Surgical Quality Improvement Program (ACS NSQIP). Access to and use of the data requires approval through an application process available at https://www.facs.org/quality-programs/data-and-registries/acs-nsqip/participant-use-data-file/.
Code availability
The complete code used for model development and evaluation in this project is openly available on GitHub: https://github.com/benli12345/AIE-ML-NSQIP.
References
Fowkes, F. G. R. et al. Comparison of global estimates of prevalence and risk factors for peripheral artery disease in 2000 and 2010: a systematic review and analysis. Lancet Lond. Engl. 382, 1329–1340 (2013).
Agnelli, G., Belch, J. J. F., Baumgartner, I., Giovas, P. & Hoffmann, U. Morbidity and mortality associated with atherosclerotic peripheral artery disease: a systematic review. Atherosclerosis 293, 94–100 (2020).
Kim, M., Kim, Y., Ryu, G. W. & Choi, M. Functional status and health-related quality of life in patients with peripheral artery disease: a cross-sectional study. Int. J. Environ. Res. Public. Health 18, 10941 (2021).
Kohn, C. G., Alberts, M. J., Peacock, W. F., Bunz, T. J. & Coleman, C. I. Cost and inpatient burden of peripheral artery disease: findings from the National Inpatient Sample. Atherosclerosis 286, 142–146 (2019).
Heaton, J. & Khan, Y. S. Aortoiliac Occlusive Disease. in StatPearls (StatPearls Publishing, Treasure Island (FL), 2022).
Beckman, J. A., Schneider, P. A. & Conte, M. S. Advances in revascularization for peripheral artery disease: revascularization in PAD. Circ. Res. 128, 1885–1912 (2021).
Topfer, L.-A. & Spry, C. New technologies for the treatment of peripheral artery disease. in CADTH Issues in Emerging Health Technologies (Canadian Agency for Drugs and Technologies in Health, Ottawa (ON), 2016).
Jongkind, V., Akkersdijk, G. J. M., Yeung, K. K. & Wisselink, W. A systematic review of endovascular treatment of extensive aortoiliac occlusive disease. J. Vasc. Surg. 52, 1376–1383 (2010).
Conte, M. S. et al. Global vascular guidelines on the management of chronic limb-threatening ischemia. J. Vasc. Surg. 69, 3S–125S.e40 (2019).
Bertges, D. J. et al. The Vascular Quality Initiative Cardiac Risk Index for prediction of myocardial infarction after vascular surgery. J. Vasc. Surg. 64, 1411–1421.e4 (2016).
Bilimoria, K. Y. et al. Development and evaluation of the universal ACS NSQIP surgical risk calculator: a decision aid and informed consent tool for patients and surgeons. J. Am. Coll. Surg. 217, 833–842 (2013).
Sharma, V. et al. Adoption of clinical risk prediction tools is limited by a lack of integration with electronic health records. BMJ Health Care Inf. 28, e100253 (2021).
Baştanlar, Y. & Özuysal, M. Introduction to machine learning. Methods Mol. Biol. 1107, 105–128 (2014).
Ngiam, K. Y. & Khor, I. W. Big data and machine learning algorithms for health-care delivery. Lancet Oncol. 20, e262–e273 (2019).
Liew, B. X. W., Kovacs, F. M., Rügamer, D. & Royuela, A. Machine learning versus logistic regression for prognostic modelling in individuals with non-specific neck pain. Eur. Spine J. Off. Publ. Eur. Spine Soc. Eur. Spinal Deform. Soc. Eur. Sect. Cerv. Spine Res. Soc. 31, 2082–2091 (2022).
Li, B. et al. Predicting outcomes following open revascularization for aortoiliac occlusive disease using machine learning. J. Vasc. Surg. S0741-5214(23)01614–2 https://doi.org/10.1016/j.jvs.2023.07.006 (2023).
Bonde, A. et al. Assessing the utility of deep neural networks in predicting postoperative surgical complications: a retrospective study. Lancet Digit. Health 3, e471–e485 (2021).
Hers, T. M. et al. Inaccurate risk assessment by the ACS NSQIP risk calculator in aortic surgery. J. Clin. Med. 10, 5426 (2021).
Bonaca, M. P. et al. Rivaroxaban in peripheral artery disease after revascularization. N. Engl. J. Med. 382, 1994–2004 (2020).
Conte, M. S. et al. Society for Vascular Surgery practice guidelines for atherosclerotic occlusive disease of the lower extremities: management of asymptomatic disease and claudication. J. Vasc. Surg. 61, 2S–41S.e1 (2015).
Gerhard-Herman, M. D. et al. 2016 AHA/ACC guideline on the management of patients with lower extremity peripheral artery disease: executive summary: a report of the American College of Cardiology/American Heart Association Task Force on Clinical Practice Guidelines. Circulation 135, e686–e725 (2017).
Aboyans, V. et al. Editor’s Choice - 2017 ESC guidelines on the diagnosis and treatment of peripheral arterial diseases, in collaboration with the European Society for Vascular Surgery (ESVS). Eur. J. Vasc. Endovasc. Surg. J. Eur. Soc. Vasc. Surg. 55, 305–368 (2018).
Farber, A. et al. Surgery or endovascular therapy for chronic limb-threatening ischemia. N. Engl. J. Med. https://doi.org/10.1056/NEJMoa2207899 (2022).
Farber, A. Chronic limb-threatening ischemia. N. Engl. J. Med. 379, 171–180 (2018).
Stewart, A. L. et al. Functional status and well-being of patients with chronic conditions. Results from the medical outcomes study. JAMA 262, 907–913 (1989).
Shaydakov, M. E. & Tuma, F. Operative Risk. in StatPearls (StatPearls Publishing, Treasure Island (FL), 2022).
Stoltzfus, J. C. Logistic regression: a brief primer. Acad. Emerg. Med. J. Soc. Acad. Emerg. Med. 18, 1099–1104 (2011).
Kia, B. et al. Nonlinear dynamics based machine learning: Utilizing dynamics-based flexibility of nonlinear circuits to implement different functions. PloS One 15, e0228534 (2020).
Chatterjee, P., Cymberknop, L. J. & Armentano, R. L. Nonlinear systems in healthcare towards intelligent disease prediction. Nonlinear Syst. Theor. Asp. Recent Appl. 1, e88163 (2019).
Ravaut, M. et al. Predicting adverse outcomes due to diabetes complications with machine learning using administrative health data. Npj Digit. Med. 4, 1–12 (2021).
Wang, R., Zhang, J., Shan, B., He, M. & Xu, J. XGBoost machine learning algorithm for prediction of outcome in aneurysmal subarachnoid hemorrhage. Neuropsychiatr. Dis. Treat. 18, 659–667 (2022).
Fang, Z.-G., Yang, S.-Q., Lv, C.-X., An, S.-Y. & Wu, W. Application of a data-driven XGBoost model for the prediction of COVID-19 in the USA: a time-series study. BMJ Open 12, e056685 (2022).
Viljanen, M., Meijerink, L., Zwakhals, L. & van de Kassteele, J. A machine learning approach to small area estimation: predicting the health, housing and well-being of the population of Netherlands. Int. J. Health Geogr. 21, 4 (2022).
Shin, S. et al. Machine learning vs. conventional statistical models for predicting heart failure readmission and mortality. ESC Heart Fail 8, 106–115 (2021).
Cho, S. M. et al. Machine learning compared with conventional statistical models for predicting myocardial infarction readmission and mortality: a systematic review. Can. J. Cardiol. 37, 1207–1214 (2021).
Gianfrancesco, M. A., Tamang, S., Yazdany, J. & Schmajuk, G. Potential biases in machine learning algorithms using electronic health record data. JAMA Intern. Med. 178, 1544–1547 (2018).
Mazmudar, A., Vitello, D., Chapman, M., Tomlinson, J. S. & Bentrem, D. J. Gender as a risk factor for adverse intraoperative and postoperative outcomes of elective pancreatectomy. J. Surg. Oncol. 115, 131–136 (2017).
Halsey, J. N., Asti, L. & Kirschner, R. E. The impact of race and ethnicity on surgical risk and outcomes following palatoplasty: an analysis of the NSQIP pediatric database. Cleft Palate-Craniofacial J. Off. Publ. Am. Cleft Palate-Craniofacial Assoc. 10556656221078154 https://doi.org/10.1177/10556656221078154 (2022).
Rümenapf, G., Morbach, S., Schmidt, A. & Sigl, M. Intermittent claudication and asymptomatic peripheral arterial disease. Dtsch. Ärztebl. Int. 117, 188–193 (2020).
Bevan, G. H. & White Solaru, K. T. Evidence-based medical management of peripheral artery disease. Arterioscler. Thromb. Vasc. Biol. 40, 541–553 (2020).
Barrows, R. J., Krumsdorf, U., Zankl, A., Katus, H. & Tiefenbacher, C. P. Significance of close surveillance of patients with peripheral arterial disease. Angiology 60, 462–467 (2009).
Paulus, N., Jacobs, M. & Greiner, A. Primary and secondary amputation in critical limb ischemia patients: different aspects. Acta Chir. Belg. 112, 251–254 (2012).
O’Connor, D. B. et al. An anaesthetic pre-operative assessment clinic reduces pre-operative inpatient stay in patients requiring major vascular surgery. Ir. J. Med. Sci. 180, 649–653 (2011).
Davis, F. M. et al. The clinical impact of cardiology consultation prior to major vascular surgery. Ann. Surg. 267, 189–195 (2018).
Premaratne, S., Newman, J., Hobbs, S., Garnham, A. & Wall, M. Meta-analysis of direct surgical versus endovascular revascularization for aortoiliac occlusive disease. J. Vasc. Surg. 72, 726–737 (2020).
Gillies, M. A. et al. Intensive care utilization and outcomes after high-risk surgery in Scotland: a population-based cohort study. Br. J. Anaesth. 118, 123–131 (2017).
Patel, P. R. & Bechmann, S. Discharge Planning. in StatPearls (StatPearls Publishing, Treasure Island (FL), 2022).
Nguyen, L. L. & Barshes, N. R. Analysis of large databases in vascular surgery. J. Vasc. Surg. 52, 768–774 (2010).
Northridge, M. E. & Metcalf, S. S. Enhancing implementation science by applying best principles of systems science. Health Res. Policy Syst. 14, 74 (2016).
Batko, K. & Ślęzak, A. The use of Big Data Analytics in healthcare. J. Big Data 9, 3 (2022).
Collins, G. S. et al. TRIPOD+AI statement: updated guidance for reporting clinical prediction models that use regression or machine learning methods. BMJ 385, e078378 (2024).
ACS NSQIP. ACS https://www.facs.org/quality-programs/data-and-registries/acs-nsqip/.
Shiloach, M. et al. Toward robust information: data quality and inter-rater reliability in the American College of Surgeons National Surgical Quality Improvement Program. J. Am. Coll. Surg. 210, 6–16 (2010).
Cohen, M. E. et al. Optimizing ACS NSQIP modeling for evaluation of surgical quality and risk: patient risk adjustment, procedure mix adjustment, shrinkage adjustment, and surgical focus. J. Am. Coll. Surg. 217, 336–346.e1 (2013).
Stoner, M. C. et al. Reporting standards of the Society for Vascular Surgery for endovascular treatment of chronic lower extremity peripheral artery disease. J. Vasc. Surg. 64, e1–e21 (2016).
ACS NSQIP Participant Use Data File. ACS https://www.facs.org/quality-programs/data-and-registries/acs-nsqip/participant-use-data-file/.
Elfanagely, O. et al. Machine learning and surgical outcomes prediction: a systematic review. J. Surg. Res. 264, 346–361 (2021).
Bektaş, M., Tuynman, J. B., Costa Pereira, J., Burchell, G. L. & van der Peet, D. L. Machine learning algorithms for predicting surgical outcomes after colorectal surgery: a systematic review. World J. Surg. https://doi.org/10.1007/s00268-022-06728-1 (2022).
Senders, J. T. et al. Machine learning and neurosurgical outcome prediction: a systematic review. World Neurosurg. 109, 476–486.e1 (2018).
Shipe, M. E., Deppen, S. A., Farjah, F. & Grogan, E. L. Developing prediction models for clinical use using logistic regression: an overview. J. Thorac. Dis. 11, S574–S584 (2019).
Dobbin, K. K. & Simon, R. M. Optimally splitting cases for training and testing high dimensional classifiers. BMC Med. Genom.4, 31 (2011).
Jung, Y. & Hu, J. A K-fold averaging cross-validation procedure. J. Nonparametr. Stat. 27, 167–179 (2015).
Adnan, M., Alarood, A. A. S., Uddin, M. I. & Ur Rehman, I. Utilizing grid search cross-validation with adaptive boosting for augmenting performance of machine learning models. PeerJ Comput. Sci. 8, e803 (2022).
Wibowo, P. & Fatichah, C. Pruning-based oversampling technique with smoothed bootstrap resampling for imbalanced clinical dataset of COVID-19. J. King Saud. Univ. Comput. Inf. Sci. 34, 7830–7839 (2022).
Hajian-Tilaki, K. Receiver Operating Characteristic (ROC) curve analysis for medical diagnostic test evaluation. Casp. J. Intern. Med. 4, 627–635 (2013).
Redelmeier, D. A., Bloch, D. A. & Hickam, D. H. Assessing predictive accuracy: how to compare Brier scores. J. Clin. Epidemiol. 44, 1141–1146 (1991).
Loh, W.-Y. & Zhou, P. Variable importance scores. J. Data Sci. 19, 569–592 (2021).
Riley, R. D. et al. Calculating the sample size required for developing a clinical prediction model. BMJ m441 https://doi.org/10.1136/bmj.m441 (2020).
Ensor, J. pmsampsize: Sample Size for Development of a Prediction Model. The Comprehensive R Archive Network https://cran.r-project.org/package=pmsampsize (2023).
Kang, H. The prevention and handling of the missing data. Korean J. Anesthesiol. 64, 402–406 (2013).
Groenwold, R. H. H. et al. Missing covariate data in clinical research: when and when not to use the missing-indicator method for analysis. CMAJ Can. Med. Assoc. J. 184, 1265–1269 (2012).
Hughes, R. A., Heron, J., Sterne, J. A. C. & Tilling, K. Accounting for missing data in statistical analyses: multiple imputation is not always the answer. Int. J. Epidemiol. 48, 1294–1304 (2019).
Download R-4.3.0 for Windows. The R-project for statistical computing. https://cran.r-project.org/bin/windows/base/.
Kuhn, M. et al. caret: Classification and Regression Training. The Comprehensive R Archive Network https://CRAN.R-project.org/package=caret (2024).
Chen, T. & Guestrin, C. XGBoost: a scalable tree boosting system. Proc. 22nd ACM SIGKDD Int. Conf. Knowl. Discov. Data Min. KDD 16, 785–794 (2016).
Wright, M. N., Wager, S. & Probst, P. ranger: A Fast Implementation of Random Forests. The Comprehensive R Archive Network https://cran.r-project.org/package=ranger (2024).
naivebayes: High Performance Implementation of the Naive Bayes Algorithm version 0.9.7 from CRAN. https://rdrr.io/cran/naivebayes/.
https://www.rdocumentation.org/packages/e1071/versions/1.7-11/topics/svm svm function - RDocumentation.
Ripley, B. & Venables, W. nnet: Feed-Forward Neural Networks and Multinomial Log-Linear Models. The Comprehensive R Archive Network https://CRAN.R-project.org/package=nnet (2025).
Robin, X. et al. pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinform. 12, 77 (2011).
Acknowledgements
This work was partially funded by the Canadian Institutes of Health Research, Ontario Ministry of Health, PSI Foundation, and Schwartz Reisman Institute for Technology and Society at the University of Toronto (Dr. Li). Dr. Hussain is funded by a Brigham and Women’s Hospital Heart and Vascular Center Faculty Award. The funding sources did not play a role in the design or conduct of the research. The ACS NSQIP and the hospitals participating in the ACS NSQIP are the source of the data used herein; they have not verified, and are not responsible for, the statistical validity of the data analysis or the conclusions derived by the authors. Presented at the following scientific meetings: Society for Vascular Surgery 2024 Vascular Annual Meeting, June 19–22, 2024, Chicago, Illinois, United States. Canadian Society for Vascular Surgery 2024 Annual Meeting, September 13–14, 2024, St. John’s, Newfoundland, Canada. 25th Congress of the Asian Society for Vascular Surgery, December 3-6, 2024, Bangkok, Thailand.
Author information
Authors and Affiliations
Contributions
All authors meet all four criteria: (1) Substantial contributions to the conception or design of the work or the acquisition, analysis, or interpretation of the data, (2) Drafting the work or revising it critically for important intellectual content, (3) Final approval of the completed version, and (4) Accountability for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved. B.L. and M.A.O. had full access to and verified all the data in the study and take responsibility for the integrity of the data and accuracy of the data analysis. B.L., B.A., C.deM., M.M., and M.A.O. conceived of and designed the study. B.L. drafted the article, developed the models, and performed statistical analysis (with the support of data scientist D.B.). All authors (B.L., B.A., D.B., L.A.O., M.A.H., D.S.L., D.N.W., O.D.R., C.deM., M.M., and M.A.O.) acquired, analyzed, or interpreted data, and critically revised the article for important intellectual content. C.deM., M.M., and M.A.O. provided supervision. M.A.O. had the final responsibility for the decision to submit for publication. All authors (B.L., B.A., D.B., L.A.O., M.A.H., D.S.L., D.N.W., O.D.R., C.deM., M.M., and M.A.O.) had full access to the full data in the study, accept responsibility to submit for publication, and approved of the final manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Presented at the following scientific meetings:
Society for Vascular Surgery 2024 Vascular Annual Meeting, June 19-22, 2024, Chicago, Illinois, United States
Canadian Society for Vascular Surgery 2024 Annual Meeting, September 13-14, 2024, St. John’s, Newfoundland, Canada
25th Congress of the Asian Society for Vascular Surgery, December 3-6, 2024, Bangkok, Thailand.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Li, B., Aljabri, B., Beaton, D. et al. Predicting outcomes following endovascular aortoiliac revascularization using machine learning. npj Digit. Med. 8, 475 (2025). https://doi.org/10.1038/s41746-025-01865-y
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41746-025-01865-y