Introduction

Peripherally inserted central catheters (PICCs) are widely used for patients with cancer undergoing long-term chemotherapy. These catheters reduce the need for multiple venous punctures and protect peripheral blood vessels and surrounding tissues from the damaging effects of chemotherapeutic agent extravasation, thereby reducing the risk of chemical phlebitis and tissue necrosis1,2,3. Despite these significant advantages of PICC, unplanned catheter removal remains the most undesirable accident during the intended catheterization period4,5,6,7,8. PICC unplanned extubation (PICC-UE) refers to the premature removal of the catheter during the intended catheterization period due to severe complications or unintended dislodgement resulting from patient-related or operator-related factors13. The reported incidence of PICC-UE varies considerably across studies, with rates ranging from 2.5% to 40.7%, predominantly observed in cancer patients undergoing chemotherapy and requiring long-term catheterization9. PICC-UE not only exacerbates patient discomfort and imposes additional economic burdens but also disrupts post-extubation planned treatment, thus seriously affecting patients’ quality of life and, in some cases, posing a threat to their lives10,11. Therefore, identifying risk factors for PICC-UE is crucial to improving the safety and effectiveness of PICC use.

Previous studies have identified several risk factors for PICC-UE, including women, diabetes patients, a history of thrombosis, elevated D-dimer levels, undergoing targeted therapy, experiencing infusion failure, and using a valved or double-lumen catheter12. Self-management has also been recognized as a personal factor that may lead to PICC-UE after patients return home12. However, to date, no study has incorporated health-related quality of life (HRQOL) as a potential risk factor under investigation. Clinical observations indicate that discharged patients with a lower quality of life are more susceptible to PICC-UE events, primarily due to diminished capacity to adhere to recommended catheter care protocols. Therefore, PICC self-management ability and HRQOL should be considered complementary factors during the follow-up period. It is essential to conduct comprehensive studies on these risk factors to enable in-depth analysis and provide empirical evidence for developing effective strategies for preventing PICC-UE.

To systematically evaluate the contributions of various risk factors, this study applied four machine learning (ML) algorithms, recognized as an advanced data-driven approach to intelligent analysis and prediction13, to identify the most significant risk factors associated with PICC-UE. Given the low incidence of PICC-UE reported in the majority of existing studies and the resulting category imbalance in the dataset, the focal loss (FL) method was further employed to optimize the training loss function and improve algorithm performance. Finally, the SHapley Additive exPlanations (SHAP) tool was employed to provide a quantitative and visually interpretable assessment of the relationship between each risk factor and PICC-UE outcome.

Methods

Patients and grouping

During February 2021 and June 2022, cancer patients who had undergone PICC catheterization were consecutively enrolled from the Intravenous Catheter Care Clinic of Mianyang Central Hospital, School of Medicine, University of Electronic Science and Technology of China. Each patient was prospectively observed and followed up weekly until catheter removal. The inclusion criteria comprised patients: (i) who met the indications for and successfully underwent PICC placement, (ii) with placement sites in the upper arm, (iii) with a histopathological diagnosis of malignant tumor, and (iv) who participated voluntarily and provided written informed consent (patients or their family). The exclusion criteria comprised patients: (i) aged < 18 years, (ii) with incomplete medical records, (iii) who did not receive maintenance care at our hospital after catheter placement, (iv) with mental or communication disorders, and (v) who discontinued treatment or died during the study period.

The primary outcome, PICC-UE, was defined as PICC-UE owing to any PICC-related complication or accidental dislodgement by the patient during healthcare, necessitating the insertion of new vascular access for treatment. According to the presence or absence of PICC-UE, patients were categorized into the PICC-UE group and the non-PICC-UE group.

Quality control of PICC catheterization

Specially trained intravenous infusion specialist nurses performed PICC placement and maintenance according to standard procedures. Following catheterization, the position of the catheter tip was verified by chest X-ray. Throughout the treatment period, catheter maintenance was performed by oncology nurses who had undergone standardized training. All procedures during chemotherapy intervals were carried out by specialized nurses from the intravenous therapy center. For skin disinfection during catheter insertion and routine maintenance, iodine tincture (manufactured by Shanghai Licon Disinfection High-Tech Co., Ltd.) was used, and 3 M transparent dressings were applied as wound coverings. The researchers conducted prospective observations with weekly follow-ups for each patient until catheter removal, which occurred upon completion of treatment, unplanned removal, or death.

Compilation of data collection form

Based on literature review and clinical expertise, the potential influencing factors of PICC-UE were categorized and summarized to develop a data collection form. This form comprise four categories and 30 indicators: (1) sociodemographic information, including age, gender, height, weight, body mass index (BMI), education level, marital status, hypertension, diabetes, drug allergy history, left-sided PICC placement history, right-sided PICC placement history, and clinical diagnosis; (2) PICC placement data, including patient handedness, diameter of punctured vein, arm circumference, insertion length, exposed catheter length, current PICC placement side (left or right arm), number of puncture attempts, number of catheter lumens, and punctured vein; (3) recent laboratory data one week prior to extubation, including albumin (ALB), white blood cell count (WBC), platelet count (PLT), international normalized ratio (INR), prothrombin time (PT), and activated partial thromboplastin time (APTT); (4) on-site investigation data collected at extubation, including the scores of Cancer Patients PICC Self-management Scale and Health-related Quality of Life (HRQOL) questionnaire.

Data acquisition

The sociodemographic data of cancer patients were collected by researchers from electronic medical records, as well as the recent laboratory data one week before extubation. The PICC placement data were measured or recorded by the specialist nurse in the Venous Catheter Care Outpatient Clinic. The PICC self-management and HRQOL scores within one week before extubation were collected via face-to-face interviews conducted by the researchers at the time of extubation.

Questionnaires

Cancer Patients PICC Self-management Scale: This scale, developed by Ling Li et al.14, was used to reflect each patient’s PICC self-management ability through seven dimensions (each dimension included five items): daily life with PICC, exercise with PICC, adherence to PICC maintenance, daily catheter observation, handling of abnormalities, information acquisition for PICC maintenance, and PICC management confidence.

HRQOL questionnaire: The five-dimensional EuroQol scale (EQ-5D)15 was used to reflect patient HRQOL through the following five dimensions (each dimension included three items): mobility, self-care, daily activities, pain/discomfort, and anxiety/depression. Patients selected one level for each dimension according to their condition that day. A time trade-off conversion table was used to calculate the composite score of the five dimensions, i.e., the EQ-5D index, with values ranging between − 0.11 and 1.00, with scores closer to 1 indicating better health.

Data analysis

SPSS 26.0 and Python 1.2.1 software were used for statistical analysis. Measurement data were described using M (P25, P75) after a Shapiro–Wilk test indicated that the data were non-normally distributed, and a Mann–Whitney U test was used for difference analysis. Count data are described as n (%), and a Pearson’s chi-square test was used for difference analysis. Point prevalence was employed, which refers to the proportion of individuals presenting a clinical condition at a specific time. This was calculated using the following formula:

$$\begin{aligned} {\text{Prevalence }}\left( \% \right) & \,=\,{\text{the number of patients with PICC}} - {\text{UE }}/ \\ {\text{ }} & \;\;\;\;{\text{total number of evaluated patients}} \times {\text{1}}00. \\ \end{aligned}$$

To assess potential multicollinearity introduced by the interaction term (WQOL), the Variance Inflation Factor (VIF) was calculated. To mitigate structural multicollinearity between the product term and its individual components, we applied mean-centering to the HRQOL and self-management scores before calculating WQOL. A VIF value of less than 5 was considered indicative of no severe multicollinearity.

A machine-learning algorithm was implemented in the sci-kit-learn Python library (version 1.2.1) and XGBoost (version 1.7.3) software. If p < 0.050 was considered significant. Given the possible low incidence of PICC-UE and the extensive number of risk factor indicators collected, this study employed four machine learning algorithms with three variable encodings to improve the accuracy and robustness of risk factor identification. To address the challenge of data imbalance arising from the possible rare occurrence of PICC-UE, four sampling strategies for mitigating class imbalance were incorporated. Furthermore, to better explain the contribution of each identified risk factor, the SHAP (SHapley Additive exPlanations) tool was applied to provide both case-specific and global model interpretations.

Given the imbalance of the dataset (23 positive events), we employed Stratified 5-Fold Cross-Validation during the model evaluation phase to prevent overfitting and ensure the generalizability of the results. Furthermore, to assess the reliability of the predicted probabilities, we performed a calibration analysis. The agreement between predicted probabilities and observed outcomes was evaluated using the Brier score and visualized via calibration plots.

Results

Occurrence of PICC-UE in cancer patients

During the study period, 278 patients with cancer were admitted for PICC placement, and based on the inclusion and exclusion criteria, 212 patients were enrolled. The patient enrollment process is shown in Fig. 1.

Fig. 1
Fig. 1
Full size image

Flow chart of patient enrollment, study participation and data analysis. Note: PICC, peripherally inserted central catheter; PICC-UE, unplanned extubation of peripherally inserted central catheters.

The PICC-UE occurred in 23 of 212 patients (overall incidence: 10.8%). The primary causes of PICC-UE were accidental catheter dislodgement (6.6%, 14/212), catheter occlusion (2.4%, 5/212), severe contact dermatitis (1.4%, 3/212), and suspected catheter-associated bloodstream infections (0.5%, 1/212).

Synergistic effect of HRQOL and self-management

In the planar scatter plot depicting self-management and HRQOL scores, non-PICC-UE patients were predominantly clustered in the upper right quadrant (Fig. 2A), which is strongly suggestive of the ability to differentiate non-PICC-UE from PICC-UE. Therefore, we constructed a novel feature metric using the product of the self-management and HRQOL scores, named weighted quality of life (WQOL). Collinearity diagnostics showed that after mean-centering, the VIF values for WQOL, HRQOL, and self-management were all reduced to below 2.5 (ranging from 1.11 to 2.37, see Table 1), indicating that multicollinearity was effectively eliminated and would not inflate the feature importance estimates in the subsequent machine learning models. Further covariance analysis found that PICC-UE had higher goodness of fit with respect to self-management, HRQOL, and their synergistic effect (that is, WQOL) compared to models involving self-management and HRQOL alone (R2 = 0.93 vs. 0.89, ∆ R2 = 0.04) (Fig. 2B and C). The findings suggested that WQOL was a comprehensive indicator integrating self-management and HRQOL, and can effectively predict the occurrence of PICC-UE.

Fig. 2
Fig. 2
Full size image

Feature construction. Note: (A) The distribution of the sample in a two-d imensional planar plot of self-management and HRQOL total scores; (B) The effect of self-management and HRQOL indicators on subjects; (C) The effect of self-management, HRQOL and Self-management* indicators on subjects. HRQOL, health-related quality of life; PICC-UE, unplanned extubation.

Table 1 Collinearity assessment of risk factors (Variance inflation Factor) before and after mean-centering.

Univariate association of each factor with PICC-UE occurrence

Univariate analysis was performed on the 31 characteristic variables; 13 factors were significantly different between the PICC-UE and non-PICC-UE groups, including marital status, PICC history of left arm, PICC history of right arm, drug allergy history, body mass index, diagnosis, puncture vein, puncture vein diameter, arm circumference, exposed length, self-management, HRQOL, and WQOL (all p < 0.100) (Table 2).

Table 2 PICC-UE single factor analysis (n = 212).

Optimal independent association based on four machine-learning algorithms

The corresponding comparative analysis excluding WQOL incorporation showed that: (1) Among three mainstream encoding methods, the WoE encoding was superior to the One-Hot and CE encodings (AUC = 0.99 vs. 0.97 and 0.94) (Fig. 3A); (2) Among four imbalance-data processing methods, the FL method was superior to NearMiss, ENN, and SMOTE methods (AUC = 0.99 vs. 0.91–0.95) (Fig. 3B); (3) Among four machine-learning algorithms (RF, LR SVM, and XGB), the XGB had the highest AUC and recall (0.99 and 0.87, respectively) (Fig. 3C). Decision curve analysis also showed that the WoE encoding (Fig. 3E), FL imbalance-data processing (Fig. 3F), and XGB algorithm (Fig. 3G) had broader net benefits among the corresponding comparisons. So, the XGB algorithm with WoE encoding and FL method had a better ability to identify risk factors. Based on the above-mentioned discovery, further comparison between the XGB algorithm (or XGB + FL algorithm) and its combination incorporating WQOL (i.e., XGB + FC algorithm or XGB + FL + FC algorithm) indicated that the inclusion of WQOL enhanced the prediction of PICC-UE occurrence (AUC = 0.993 vs. 0.985 or 0.994 vs. 0.976; recall = 0.783 vs. 0.739 or 0.870 vs. 0.783) (Fig. 3D; Table 3), with a broader net benefit (Fig. 3H). It is evident that the predictive role of WQOL in relation to PICC-UE should not be overlooked. Besides, the calibration analysis yielded a Brier score of 0.03, indicating a low discrepancy between predicted probabilities and actual outcomes. As shown in Fig. 4(Calibration Plot), the calibration curve closely follows the ideal diagonal line, suggesting that the model provides reliable probability estimates and is not suffering from significant overfitting.

Fig. 3
Fig. 3
Full size image

Optimal multiple analysis based on machine-learning algorithm. Note: (A) Comparison of three mainstream variable encoding methods; (B) Comparison of four methods for data imbalance processing; (C) Comparison of four ML models; (D) Validation of the advantages and disadvantages of the four XGB models; (E) Decision curve analysis of three mainstream variable encoding methods; (F) Decision curve analysis of four methods for data imbalance processing; (G) Decision curve analysis of four ML models; (H) Decision curve analysis of the four XGB models. WoE, weight of encoding; CE, counting encoder; ENN, edited nearest neighbor; SMOTE, synthetic minority over-sampling technique; XGB, extreme gradient boosting; SVM, support vector machine; FC, feature construction; FL, focal loss.

Table 3 Comparison of models (n = 212).
Fig. 4
Fig. 4
Full size image

Calibration plot of the predictive model.

Contribution explanation of each factor to the occurrence of PICC-UE

To further clarify the contribution explanation of each risk factor, this study employed the SHAP tool for feature interpretation. Figure 5A shows the SHAP summary plot of the feature importance ranking of various significant factors for PICC-UE risk. Figure 5B shows the three highest-ranked feature importance factors. WQOL played a crucial role in PICC-UE risk, followed by self-management and upper arm circumference on the PICC side, with SHAP values of 1.02, 0.22, and 0.22, respectively.

Fig. 5
Fig. 5
Full size image

SHAP Interpretability analysis. Note: (A) SHAP summary plot of the feature importance ranking; (B) The three highest ranked feature importance metrics; (C) The waterfall plots from the SHAP analysis for one non-PICC-UE patients; (D) The waterfall plots from the SHAP analysis for one PICC-UE patients. WQOL, weighted quality of life; HRQOL, health-related quality of life; BMI, body mass index; SHAP, shapley additive explanations.

Figure 5C and D illustrated the SHAP waterfall plots to interpret the feature effects of all factors on PICC-UE risk. The arrow size of the waterfall represented the degree of impact of each factor on PICC-UE risk in an individual, and the color identified the increasing or decreasing risk (red or blue). Figure 5C depicted a non-PICC-UE individual, with a total score of -1.301, to predict a low risk of PICC-UE occurrence, while Fig. 5D depicted a PICC-UE individual, with a total score of 1.45, to predict a high risk of PICC-UE occurrence. Notably, only WQOL demonstrated significantly discriminatory power between the two completely different individuals (SHAP-value=-0.98 and 1.67) despite having identical self-management scores (both SHAP-value = 0.36) and similar interpretability of upper arm circumference on the PICC side (SHAP-value=-0.17 and − 0.07).

Discussion

The reported PICC-UE incidence varies considerably, ranging from 7.5% to 22.0% in China16 and 2.5% to 40.7% in other countries7. Our results showed a 10.8% PICC-UE incidence, which is lower than that reported in most previous studies7,16. Variability in PICC-UE incidence across studies is likely attributable to differences in study populations and sample sizes.

PICC maintenance often extends beyond the hospital setting into long-term home care for the majority of patients. In the home environment, the individual patient’s ability plays a crucial role in PICC-UE risk. Among patient-specific factors, quality of life reflects a patient’s functional capacity to perform certain activities, whereas self-management pertains to the actual execution of those activities. While prior research has highlighted the role of self-management in influencing PICC-UE outcomes, the association between quality of life and PICC-UE remains insufficiently examined. Our study found a significant correlation between quality of life and self-management, which accounts for the historical neglect of quality of life in previous studies. Furthermore, patient ability and willingness to engage in self-care tasks vary, encompassing four distinct categories: capable and willing, capable but unwilling, incapable but willing, and incapable and unwilling. PICC-UE risk likely varies across these scenarios. Therefore, we integrated patients’ quality of life and self-management, introducing WQOL to evaluate self-ability comprehensively. Our machine learning-based optimization analysis demonstrated that WQOL exhibited significantly greater feature importance compared to other indicators, serving as a more robust predictor of PICC-UE. In addition, the machine learning-based optimization algorithm also identified two other risk factors with relatively lower contributions, namely self-management and upper arm circumference. In the two typical non-PICC-UE and PICC-UE cases presented, the SHAP tool accurately interpreted the positive relationship between WQOL and PICC-UE risk. Notably, in both typical cases, good self-management was associated with a risk reduction, whereas upper arm circumference was linked to an increased risk. In contrast, WQOL, which integrates assessments of quality of life and self-management, demonstrates greater practical utility in predicting PICC-UE.

Conventional wisdom has long held that there is no direct association between quality of life and PICC-UE. However, our SHAP analysis demonstrates that the WQOL significantly influences PICC-UE occurrence. Specifically, higher WQOL values correspond to higher SHAP values and are associated with a lower PICC-UE probability. Our finding shows a negative correlation between WQOL and PICC-UE, and demonstrate a notable influence on PICC-UE occurrence. In this study, five cases with low WQOL (manifested as high self-management ability and low HRQOL) experienced significant declines in their activity and daily living abilities, hindering timely hospital visits for maintenance. This led to tube blockages and eventual PICC-UE. Therefore, in clinical decision-making, it is advisable to provide on-site services by intravenous therapy specialists for patients with low WQOL who cannot visit the hospital. This ensures timely medical assistance.

Better self-management is essential to prevent complications in patients taking catheters home during chemotherapy intervals. These patients return home during the interval between chemotherapy sessions. They are required to engage in PICC self-management, such as monitoring the puncture site for bleeding, curling of the transparent dressing, and catheter dislodgement. They also need to perform reasonable functional exercises for the upper limb on the PICC side17. Similar to our research12, Jiang L et al. also observed that patients with PICC exhibit low levels of catheter self-management at home, leading to a high incidence of PICC-UE. In our study, self-management’s SHAP value of 0.22 ranked second. Of 12 patients with home catheters, some experienced catheter removal due to inadequate protection during bathing and dressing, resulting in PICC-UE. This suggests patients frequently neglect catheter maintenance or lack the skills for proper self-care. Thus, it is crucial for nursing staff to provide comprehensive health education on PICC self-maintenance and train patients in necessary skills.

This study innovatively identified upper arm circumference at the PICC insertion site, which was underemphasized in previous research, as the third most significant factor associated with an increased risk of PICC-UE. Among the 23 patients who developed PICC-UE in our study, 14 were women aged 55 to 92 years with arm circumferences ranging from 27 to 32 cm. This indicates that PICC-UE occurred in more than 50% of older women with larger arm circumferences. Loose skin and greater upper arm circumference in older women may lead to greater local tissue mobility during activities owing to gravity, increasing the risk of dressing misalignment or curling, which can cause catheter dislodgement. Therefore, nursing staff should be alert to dislodgement risks in older women with larger arm circumferences undergoing PICC placement. We recommend using catheter immobilizers and advising such patients to avoid strenuous upper-limb activities to minimize the occurrence of PICC-UE.

Our study had several strengths. We used machine learning algorithms to optimize PICC-UE risk assessment, highlighting for the first time WQOL’s significant impact on PICC-UE as a measure of patients’ comprehensive abilities. To address data imbalance, we improved the focal loss function, ensuring greater attention to rare PICC-UE cases. Additionally, the SHAP tool interpreted the prediction model from both global and local perspectives.

Limitations: This study had some limitations. As a single-center study, the conclusions require further external validation to assess their clinical utility. While the newly constructed metric in this study serves as a predictor of PICC-UE risk, PICC self-management ability, and patient HRQOL are dynamic in nature. Thus, these factors must be assessed continuously, and interventions should be individualized. Given the considerable variability in HRQOL during long-term PICC use among cancer patients, this study conducted HRQOL assessments one week before catheter removal. Further research is warranted to evaluate alternative assessment time points. Of the two factors in relation to WQOL, only self-management demonstrated a negative association with PICC-UE, suggesting a potential reverse causality between HRQOL measured one week before removal and the occurrence of PICC-UE. However, it remains unclear whether this association is influenced by self-management behaviors. Future research should focus on multicenter studies with broader populations and incorporate more clinical risk factors to better guide clinical decision-making.

Conclusion

PICC-UE occurrence in patients with cancer and PICC is linked to poor clinical outcomes and increased healthcare resource use. Therefore, this study developed a newly constructed key metric (i.e., WQOL) to facilitate early PICC-UE risk prediction in such patients. The clinical applicability of the XGB algorithm-based PICC-UE risk prediction requires testing in actual clinical practice. Furthermore, the SHAP tool’s interpretability suggests WQOL has significant potential for future clinical application and could aid nursing staff in the early and dynamic identification of PICC-UE risk.