Fig. 6

Quality control of genotype data. Genotype QC for sex (a,b) and ancestry inference (c,d) for MSSM-Penn-Pitt (a,c) and HBCC (b,d). (a,b) F statistic from PLINK’s check-sex function, plotted by reported sex. After data QC, reported sex and sex inferred from the F statistic are 100% concordant for both MSSM-Penn-Pitt (a) and HBCC (b). (c,d) The first two principal components (PCs) of genetic ancestry as inferred by GEMTOOLs. For both MSSM-Penn-Pitt (c) and HBCC (d), reported ethnicity agrees well with the genetic background clusters inferred by GEMTOOLs.
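The sex-concordance check in panels (a,b) can be illustrated with a minimal sketch. It assumes the PLINK 1.9 default cutoffs for the X-chromosome F statistic (F below 0.2 is called female, F above 0.8 is called male, anything in between is left uncalled); the caption does not state which cutoffs were actually used. The function names and sample values are hypothetical and serve only as illustration.

```python
from typing import List, Optional, Tuple

# Assumed cutoffs: the PLINK 1.9 --check-sex defaults, not values stated in the caption.
FEMALE_MAX_F = 0.2
MALE_MIN_F = 0.8


def infer_sex(f_stat: float) -> Optional[str]:
    """Call sex from the X-chromosome F statistic; None means ambiguous."""
    if f_stat < FEMALE_MAX_F:
        return "F"
    if f_stat > MALE_MIN_F:
        return "M"
    return None


def sex_concordance(samples: List[Tuple[str, float]]) -> float:
    """Fraction of samples whose reported sex matches the inferred sex.

    Each sample is a (reported_sex, f_stat) pair; ambiguous calls count
    as mismatches.
    """
    if not samples:
        raise ValueError("no samples supplied")
    matches = sum(1 for reported, f_stat in samples if infer_sex(f_stat) == reported)
    return matches / len(samples)


# Hypothetical samples, for illustration only.
example = [("F", 0.03), ("M", 0.98), ("F", -0.05), ("M", 0.95)]
rate = sex_concordance(example)
```

A concordance of 1.0 after QC corresponds to the 100% agreement reported for both cohorts; samples with discordant or ambiguous calls would lower it.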