Fig. 6: Comparative performance of Factor 23 versus individual items.
From: Principled distillation of UK Biobank phenotype data reveals underlying structure in human variation

The factor score is compared to each of the top 10 items and to an unweighted sum of z-scores of those items. a, Comparison of incremental R2 for mortality prediction in N = 217,393 individuals with complete data for the included items and sufficient accuracy in the independent-variable factor scores for Factor 23 (Methods). The comparative baseline model for each included covariates for the first 20 genetic PCs, age, chromosomal sex, age2, age × chromosomal sex, age2 × chromosomal sex, dummy variables representing the assessment centres of origin and days from baseline assessment to T0. b, Comparison of point estimates of heritability. Results show estimated observed-scale SNP heritability ±1 standard error from GWAS with the listed sample size (N). c, Comparison of variance explained by polygenic scores for Factor 23 vs its top 3 component items for 5 relevant traits in the external Add Health study. Barplots show estimated variance explained (change in R2 from adding polygenic scores to linear regression models for each outcome (Methods)) with error bars representing 95% bootstrapped confidence intervals, with a lower bound of 0 for visualization purposes. See Supplementary Table 13 for comparison to all top 10 items.