Fig. 2: Validation of the WID-buccal-, -cervical-, and -blood-BC indices.
From: Systems epigenetic approach towards non-invasive breast cancer detection

a Scatter plot of the WID-buccal-BC index and immune cell proportion or b subject age by disease status (blue = control [n = 93], red = BC case [n = 94]) in the validation set. c Box plot of WID-buccal-BC index residuals after accounting for age and immune cell proportion in the validation set. d ROC curves of the WID-buccal-BC index residuals (after adjusting for age and immune cell proportion) overall and stratifying samples by median immune cell proportion in the validation set. e Scatter plot of the WID-cervical-BC index and immune cell proportion or f subject age by disease status (blue = control [n = 93], red = BC case [n = 91]) in the validation set. g Box plot of the WID-cervical-BC index residuals after accounting for age and immune cell proportion in the validation set. h ROC curves of the WID-cervical-BC index residuals (after adjusting for age and immune cell proportion) overall and stratifying samples by median immune cell proportion in the validation set. i Scatter plot of the WID-blood-BC index and estimated neutrophil proportion by disease status (blue = control [n = 30], red = BC case [n = 50]) in GSE237036 (ref. 19). j Box plot of the WID-blood-BC index residuals, adjusted for neutrophil proportion, in GSE237036 (ref. 19); p = 0.8854. k ROC curves of the WID-blood-BC index residuals, adjusted for neutrophil proportion, overall and stratifying samples by median neutrophil proportion in GSE237036 (ref. 19). p values in c, g, j are derived from a two-sided Wilcoxon test. Shaded error regions in a, b, e, f, i correspond to 95% confidence intervals for predictions from linear models. Box plots follow the standard Tukey representation, with boxes indicating the median and interquartile range, and whiskers extending to the smallest and largest values within 1.5 times the interquartile range of the 25th and 75th percentiles, respectively. Individual data points are overlaid. No corrections for multiple testing were carried out.
BC breast cancer, WID women’s cancer risk identification, AUC area under the (receiver operating characteristic) curve. Source data are provided as a Source Data file.
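As an illustration of the adjustment described in this legend, the following is a minimal Python sketch, not the authors' pipeline. It assumes the residuals are obtained from an ordinary least-squares fit of the index on the covariates (for example age and immune cell proportion), and it computes the AUC as the Mann-Whitney probability that a case residual exceeds a control residual. The function names `residualize` and `auc` and any input values are hypothetical; the Wilcoxon p value is not computed here.

```python
def residualize(y, covariates):
    """Residuals of y after least squares on an intercept plus covariates.

    `covariates` is a list of equal-length columns, e.g. [age, immune_prop].
    Assumption: a plain linear model; the source does not give the exact fit.
    """
    n = len(y)
    cols = [[1.0] * n] + [[float(v) for v in c] for c in covariates]
    k = len(cols)
    # Augmented normal equations: (X^T X | X^T y).
    a = [
        [sum(cols[i][r] * cols[j][r] for r in range(n)) for j in range(k)]
        + [sum(cols[i][r] * y[r] for r in range(n))]
        for i in range(k)
    ]
    # Gauss-Jordan elimination with partial pivoting.
    for i in range(k):
        p = max(range(i, k), key=lambda r: abs(a[r][i]))
        a[i], a[p] = a[p], a[i]
        for r in range(k):
            if r != i:
                f = a[r][i] / a[i][i]
                a[r] = [x - f * z for x, z in zip(a[r], a[i])]
    beta = [a[i][k] / a[i][i] for i in range(k)]
    # Residual = observed index minus fitted value.
    return [y[r] - sum(beta[j] * cols[j][r] for j in range(k)) for r in range(n)]


def auc(case_values, control_values):
    """Area under the ROC curve via pairwise comparison (ties count 0.5)."""
    wins = sum(
        (c > d) + 0.5 * (c == d) for c in case_values for d in control_values
    )
    return wins / (len(case_values) * len(control_values))
```

In use, one would pass the index values with the age and immune cell proportion columns to `residualize`, split the residuals by disease status, and pass the two groups to `auc`. Stratifying by median immune cell proportion, as in panels d, h and k, would amount to repeating the `auc` call within each half of the samples.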