Fig. 8: Similarity of coefficients between the gold standard and various forms of leakage. | Nature Communications

From: Data leakage inflates prediction performance in connectome-based machine learning models

The boxes are colored by leakage family: orange (non-leaky analysis choices), blue (feature leakage), green (covariate-related leakage), yellow (subject-level leakage). Boxplot elements are defined as follows: the center line is the median across 100 random iterations; box limits are the upper and lower quartiles; whiskers extend to 1.5 times the interquartile range; points are outliers. Certain values, such as leaky site correction in PNC, are omitted because the relevant fields (e.g., site) do not exist in that dataset. See also Supplementary Figs. 7 and 8. ABCD Adolescent Brain Cognitive Development, HBN Healthy Brain Network, HCPD Human Connectome Project Development, PNC Philadelphia Neurodevelopmental Cohort.
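
The boxplot elements described in the caption can be illustrated with a minimal sketch. This is not the authors' plotting code: the function name `boxplot_elements`, the synthetic similarity values, and the convention that whiskers end at the most extreme data point inside the 1.5 x IQR fences are all assumptions made for illustration.

```python
import random
import statistics

def boxplot_elements(values, whisker=1.5):
    """Summarize one box: median, quartiles, whisker ends, outliers.

    Assumption: whiskers end at the most extreme data points that lie
    within `whisker` x IQR of the box limits (a common plotting convention).
    """
    values = sorted(float(v) for v in values)
    # Inclusive quartiles use linear interpolation between data points.
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    low_fence = q1 - whisker * iqr
    high_fence = q3 + whisker * iqr
    inside = [v for v in values if low_fence <= v <= high_fence]
    outliers = [v for v in values if v < low_fence or v > high_fence]
    return {
        "median": median,
        "q1": q1,
        "q3": q3,
        "whisker_low": min(inside),
        "whisker_high": max(inside),
        "outliers": outliers,
    }

# Synthetic stand-in for coefficient similarity over 100 random iterations.
rng = random.Random(0)
similarities = [rng.gauss(0.8, 0.05) for _ in range(100)]
similarities[0] = 0.2  # one deliberately extreme iteration

stats = boxplot_elements(similarities)
print(stats["median"], stats["q1"], stats["q3"], stats["outliers"])
```

Each box in the figure would correspond to one such summary, computed per dataset and per leakage type.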
