Fig. 5: Self-reported ancestry, and the accuracy gap in capturing individual-level cis-regulatory landscape by eQTL data.
From: Haplotype-aware modeling of cis-regulatory effects highlights the gaps remaining in eQTL data

The percentage of genes with an excess allelic imbalance by self-reported ancestry after accounting for known eQTLs as a function of power in adipose subcutaneous (A) (see Supplementary Data 4 for detailed statistics for each individual), and for genes with over 80% power across ten GTEx tissues with largest sample sizes (B, C). AA and EA denote African American, and European American individuals, respectively. The eQTL data used in panels (A, B) is from the official GTEx release, while for panel (C) the eQTLs were mapped using a subset of the samples with equal number of AA and EA individuals. Error bars in (B, C) represent 95% bootstrap confidence intervals of the median (see Supplementary Data 5 and Supplementary Data 6 for number of analyzed samples/genes per tissue/population). Boxplots in panel (A) represent first quartile, median, and third quartiles. Whiskers represent Q1–1.5* interquartile range (IQR) and, Q3 + 1.5*IQR. Outliers are hidden for ease of viewing.