Fig. 5: Variation in lift value (ratio of cancer incidence in cohort to baseline incidence) with increasing cohort size.
From: Constructing multicancer risk cohorts using national data from medical helplines and secondary care

Lift curve values for cohorts from a 0.5% to 100% of the population b 0.5% to 22% of the population. Predictions were obtained from a XGBoost model trained on all variables, and on another trained only on demographic variables to showcase the improved accuracy in identifying high risk groups when additional variables such as comorbidities and symptoms related to 111 calls are added.