Fig. 4: Analysis of genetic clusters in relation to geographic ancestry.

a The figure depicts significant associations between genetic clusters and seven geographic variables as determined by chi-squared tests. Sky-blue bars indicate the strength of the associations in terms of \(-{\log }_{10}(P\,{\mbox{value}})\). A red dotted line marks the statistical significance threshold of P = 0.05. b Two models are compared using ROC curves. The sig model uses only geographic variables that have significant relationships in Figure 4a, while the all model includes all geographic data. c ADMIXTURE analysis results with the number of hypothetical ancestral populations (K) set to four are presented. Distinct ancestral genetic mixtures are labeled from a–d. d This chart displays how the previously defined genetic clusters are distributed across the ancestral groups found in the ADMIXTURE analysis.