Fig. 3: Greater differentiation is observed across Indigenous segments than across global populations.
From: Clinical genetic variation across Hispanic populations in the Mexican Biobank

a, Distribution of asFst values, computed using biomedically relevant SNPs only, for genomic segments of Indigenous and European ancestry in the MXB. In the boxplots, the center line represents the median, the box bounds correspond to the 25th and 75th percentiles, and the whiskers extend to the most extreme values within 1.5× the interquartile range; points beyond the whiskers are outliers. The violin plots display the kernel density estimate of the data distribution. A significant difference is observed (n = 42,769, P = 0.042, 95% confidence interval = −1.314061 × 10⁻⁴ to −2.337399 × 10⁻⁶, two-sided t-test), with Indigenous segments showing higher asFst values, indicating greater genetic differentiation within this ancestral component. b, Geographic distribution of asF for the C allele of rs4149056 in Indigenous genomic segments across Mexico. The map reveals regional variation in the ancestral Indigenous American substratum of Mexicans, with much higher frequencies in the ancestrally Mayan Yucatan. Image in b is adapted from ref. 38, used under a free use license. c, Bar plot showing the frequency of the C allele of rs4149056 in Indigenous American segments (asF) across MXB states, compared with its frequency in Indigenous Mexican populations from the Native Mexican Diversity Project (NMDP), showing a correspondence between each state and its local Indigenous groups. Frequencies for worldwide groups from the 1000 Genomes Project (1KGP) and the MCPS are provided for comparison. The sample size for each population is shown inside each dot.
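
The boxplot convention described for panel a (median, 25th and 75th percentiles, whiskers at the most extreme values within 1.5× the interquartile range, outliers beyond) can be illustrated with a minimal sketch. This is not the authors' plotting code: the function name `boxplot_summary` and the toy values are hypothetical, and the quartile interpolation method is an assumption, since plotting libraries differ slightly in how they compute percentiles.

```python
from statistics import median, quantiles


def boxplot_summary(values):
    """Summarize data using the boxplot convention described in the caption.

    Assumption: quartiles use linear interpolation over the observed data
    (statistics.quantiles with method="inclusive").
    """
    data = sorted(values)
    q1, _, q3 = quantiles(data, n=4, method="inclusive")
    iqr = q3 - q1

    # Fences sit 1.5 x IQR beyond the box bounds.
    low_fence = q1 - 1.5 * iqr
    high_fence = q3 + 1.5 * iqr

    # Whiskers end at the most extreme observed values inside the fences.
    inside = [v for v in data if low_fence <= v <= high_fence]

    return {
        "median": median(data),
        "q1": q1,
        "q3": q3,
        "whisker_low": inside[0],
        "whisker_high": inside[-1],
        # Points beyond the whiskers are drawn individually as outliers.
        "outliers": [v for v in data if v < low_fence or v > high_fence],
    }


# Hypothetical toy values, not data from the study.
summary = boxplot_summary([1, 2, 3, 4, 5, 6, 7, 8, 100])
print(summary)
```

In this toy input the value 100 falls outside the upper fence, so it is reported as an outlier and the upper whisker stops at 8.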