Fig. 4: Performance comparison of alternative methods for prediction of five binary traits in 23andMe. | Nature Communications

Fig. 4: Performance comparison of alternative methods for prediction of five binary traits in 23andMe.

From: An ensemble penalized regression method for multi-ancestry polygenic risk prediction

Fig. 4

We analyzed five binary traits, a any CVD, b depression, c migraine diagnosis, d morning person, and e SBMN. PRS are trained using 23andMe data that available for five populations: African American, Latino, EAS, EUR, and SAS, and then tuned in an independent set of individuals from 23andMe of the corresponding ancestry. Performance is reported based on adjusted AUC accounting for sex, age, PC1-5 in a held-out validation sample of individuals from 23andMe of the corresponding ancestry. The ratio of sample sizes for training, tuning and validation is roughly about 7:2:1, and detailed numbers are in Supplementary Data 7 and 8. The PRS-CSx package is restricted to SNPs from HM3, whereas other alternative methods use SNPs from either HM3 or MEGA. LDpred2 and its corresponding EUR and weighted methods are excluded to avoid misinterpretation, as a result of our collaboration restrictions with 23andMe, Inc., preventing us from updating these methods to the latest version of its package. Bars in the figure show the performance of adjusted AUC for each method in each dataset. Colors are described on the right side of the figure. Source data are provided in Supplementary Data 9.

Back to article page