Fig. 4: CLL-TIM is robust to missing data on external test cohort. | Nature Communications

Fig. 4: CLL-TIM is robust to missing data on external test cohort.

From: Machine learning can identify newly diagnosed patients with CLL at high risk of infection

Fig. 4

a Kaplan–Meier graphs of infection-free, CLL treatment-free survival for CLL-TIMT20-30 predicted high-risk and low-risk groups on a subset of the external test cohort (BENCH-E, n = 281 with n = 142 high-confidence predictions). Patients in BENCH-E have full CLL-IPI and a full 2-year follow-up. P-value is by log-rank test. b Cumulative incidence plots for CLL-TIMT20-30 predicted high-risk and low-risk groups for CLL treatment (dark green), infection (bright green) and death (gray) as first events on BENCH-E. c Two-year outcome PR-AUC (Precision-Recall Area-Under-Curve) for CLL-TIM and CLL-IPI on BENCH-E. To allow for an equitable comparison, CLL-TIMT20-30 (i.e. with removal of uncertain predictions) was benchmarked against an additional two versions of CLL-IPI score; CLL-IPI with removal of patients in the intermediate-risk category, CLL-IPI NI_4+, and CLL-IPI with removal of the intermediate and high-risk category, CLL-IPI NIH_7+. For box-and-whisker plots, whiskers are 95% confidence intervals generated using 5000 bootstrapped datasets sampled from each respective cohort (See Methods), white square is the mean, centre line is the median, bounds of box are the interquartile range and black dots are outliers. We performed model comparison using a one-tailed Mann–Whitney U-test on the mean difference of the PR-AUC over the bootstrapped datasets. *** indicate p < 0.0005. d Average missing feature rate for patients in external test cohort. Shaded distributions are blue – low-risk high-confidence predictions, gray – low-confidence predictions, gold – high-confidence high-risk predictions. Missing feature rate is the percentage of CLL-TIM’s 228 features that were missing for the given patient. Data shown for CLL-TIM’s predictions on the external German test cohort (n = 365).

Back to article page