Fig. 2: Callset statistics.

a, Overview of the samples for which variants are called from haplotype-resolved assemblies as well as their het:hom ratios. Color corresponds to the population from which the samples originate. b, The number of different substitutions reported for all samples. c, Length distribution of insertions and deletions across all samples (in basepairs). d, Total number of distinct variant alleles detected across all 11 samples (first row), as well as the number of bubbles in the corresponding pangenome graph (second row). We distinguished small (1–19 bp), midsize (20–49 bp) and large (≥50 bp) variants. Biallelic bubbles were classified as SNPs, insertions or deletions; complex corresponds to all remaining bubbles with more than two branches resulting from inserting overlapping variant calls into the graph.