Extended Data Fig. 1: Ancestry of PsychENCODE subjects.

Principal component analysis was performed using PLINK after merging the PsychENCODE genotype data with the 1000 Genomes Project reference panel. The PsychENCODE genotype data was available for a total 1,864 subjects to begin with. Each point represents an individual and points are color-coded by corresponding ethnicity. Global ancestry was inferred by k-nearest neighbors algorithm with the first five principal components. Downstream analyses were restricted to samples of European ancestry (n = 812).