Fig. 2: Population substructure analysis results of our dataset.
From: Human whole-exome genotype data for Alzheimer’s disease

Plots from principal components analysis showing principal component (PC) 1 vs. PC2, PC2 vs. PC3, and PC1 vs. PC3 for sets of samples initially clustered on self-reported race/ethnicity (samples shown in black dots) with respect to 1kG reference populations (all other symbols). a Individual self-reporting as non-Hispanic White and clustering within 3 SD of EUR sample populations were assigned the ancestry label “Non-Hispanic White” (NHW). This plot includes 32 individuals excluded as outliers. b Individuals self-reporting as non-Hispanic Black and clustering within 3 SD of EUR and AFR sample populations or distributed between the populations were assigned the ancestry label “African American” (AFA). This plot includes 29 individuals excluded as outliers. c Individuals clustering within 3 SD of EUR and AFR sample populations and Latin American sample populations groups in the 1000 Genomes/Human Genome Diversity Project collection and between those sample population groups were assigned the ancestry label “Caribbean Hispanic” (CHI), which was also reflective of the geographic sampling of samples in the source datasets. No subjects initially classified as CHI were excluded.