Table 1 Source genomes for the RPGG.

From: Profiling variable-number tandem repeat variation across populations using repeat-pangenome graphs

| Genome | Continental population | Study | Coverage | Assembly N50 (Mb) | Fraction of VNTR annotated | Ancestry |
|---|---|---|---|---|---|---|
| AK1 | EAS | KG | 54 | 0.88 | 0.840 | Korean |
| HG00268 | EUR | DP | 67 | 3.51 | 0.967 | Finnish |
| HG00512 | EAS | HGSVG | 28 | 8.83 | 0.995 | Han Chinese |
| HG00513 | EAS | HGSVG | 30 | 1.57 | 0.993 | Han Chinese |
| HG00514 | EAS | HGSVG | 31 | 1.32 | 0.948 | Han Chinese |
| HG00731 | AMR | HGSVG | 31 | 2.18 | 0.995 | Puerto Rican |
| HG00732 | AMR | HGSVG | 16 | 1.3 | 0.992 | Puerto Rican |
| HG00733 | AMR | HGSVG | 46 | 6.88 | 0.992 | Puerto Rican |
| HG01352 | AMR | DP | 68 | 5.97 | 0.992 | Colombian |
| HG02059 | EAS | DP | 76 | 19.5 | 0.992 | Vietnamese |
| HG02106 | AMR | DP | 57 | 0.88 | 0.640 | Peruvian |
| HG02818 | AFR | DP | 56 | 0.66 | 0.802 | Gambian |
| HG04217 | SAS | DP | 60 | 0.86 | 0.269 | Telugu |
| NA12878 | EUR | DP | 54 | 4.67 | 0.971 | Central European |
| NA19238 | AFR | HGSVG | 23 | 2.64 | 0.991 | Yoruba |
| NA19239 | AFR | HGSVG | 35 | 4.87 | 0.994 | Yoruba |
| NA19240 | AFR | HGSVG | 49 | 3.4 | 0.989 | Yoruba |
| NA19434 | AFR | DP | 62 | 11 | 0.980 | Luhya |
| NA24385 | EUR | GIAB | 54 | 1.32 | 0.981 | Ashkenazim |

  1. Continental populations represented are East Asian (EAS), European (EUR), Admixed Amerindian (AMR), South Asian (SAS), and African (AFR). Coverage is the estimated diploid coverage based on alignment to GRCh38. Assembly N50 is reported for the haplotype-resolved assemblies. The fraction of VNTR annotated is the fraction of VNTR loci with at least 700 flanking bases assembled.
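
For readers unfamiliar with the Assembly N50 column: N50 is conventionally the contig length such that contigs of that length or longer cover at least half of the total assembled bases. The following is a minimal sketch of that standard definition. The function name `n50` and the example lengths are illustrative assumptions, not the authors' pipeline or data.

```python
def n50(contig_lengths):
    """Return the N50 of a list of contig lengths (same units as the input)."""
    # Walk contigs from longest to shortest, accumulating their lengths.
    lengths = sorted(contig_lengths, reverse=True)
    half_total = sum(lengths) / 2
    running = 0
    for length in lengths:
        running += length
        # The first contig that pushes the running sum to half the total is the N50.
        if running >= half_total:
            return length
    raise ValueError("contig_lengths must not be empty")


# Hypothetical contig lengths in Mb, for illustration only (not taken from the table).
example_lengths_mb = [8, 5, 4, 2, 1]
print(n50(example_lengths_mb))  # prints 5
```

A higher N50 indicates a more contiguous assembly, which is why the column is useful for comparing the source genomes listed above.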