Fig. 5: GMWI2 performance on healthy and non-healthy external validation cohorts.

a GMWI2 scores from healthy (494 samples) and non-healthy (646 samples) groups. Scores are significantly higher in the healthy group compared to the non-healthy group (P = 1.6 × 10–43; two-sided Mann–Whitney U test). The effect size is represented by Cliff’s Delta (d = 0.48). The balanced accuracy of the classification is 72.1%. b GMWI2 scores across five healthy (H1–H5) and three non-healthy cohorts (AS4 ankylosing spondylitis, PD6 Parkinson’s disease, PC5 pancreatic cancer). The superscript numbers adjacent to phenotype abbreviations correspond to specific studies detailed in Supplementary Data 6. Asterisk (*) indicates significantly higher score in a healthy cohort compared to the corresponding non-healthy cohort (P < 0.01, two-sided Mann–Whitney U test. Exact P-values provided in Supplementary Data 6). Numbers next to each asterisk refer to the healthy cohort compared against each non-healthy condition. Sample size of each group or cohort are shown in parentheses. Standard box-and-whisker plots (i.e., center line, median; box limits, upper and lower quartiles; whiskers, 1.5× interquartile range; points, outliers in (a) or individual GMWI2 scores in (b)) are used to depict groups of numerical data.