Table 1 Summary of genome assembly and annotation for cotton

From: Reference genome sequences of two cultivated allotetraploid cottons, Gossypium hirsutum and Gossypium barbadense

| Genomic feature | G. hirsutum | G. barbadense |
| --- | --- | --- |
| Total length of contigs, bp | 2,281,853,441 | 2,222,525,789 |
| Total length of assemblies, bp | 2,347,017,486 | 2,266,656,771 |
| Estimated gap size, bp | 65,164,045 | 44,130,982 |
| Percentage of anchoring (by bp) | 98.94% | 97.68% |
| Percentage of anchoring and ordering (by bp) | 96.16% | 96.35% |
| Number of contigs [a] | 4,746 | 4,930 |
| Contig N50, bp | 1,891,906 | 2,151,565 |
| Number of scaffolds [b] | 2,190 | 3,032 |
| Scaffold N50, bp | 97,738,592 | 92,880,876 |
| GC content | 34.3% | 34.2% |
| Percentage of repeat sequences | 69.86% | 69.83% |
| Number of genes | 70,199 | 71,297 |
| Number of transcripts | 115,835 | 109,778 |

  [a] Hi-C + BioNano corrected contigs.
  [b] Hi-C-assembled genome sequences.
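
The three length rows in Table 1 can be cross-checked against each other. The sketch below assumes that the estimated gap size is simply the total assembly length minus the total contig length; the table does not state this definition, but the reported values agree with it exactly for both species. The names `TABLE`, `gap_from_totals`, and `gap_fraction` are hypothetical and used only for illustration.

```python
# Values copied from Table 1 (all in bp).
TABLE = {
    "G. hirsutum": {
        "contigs": 2_281_853_441,
        "assembly": 2_347_017_486,
        "gap": 65_164_045,
    },
    "G. barbadense": {
        "contigs": 2_222_525_789,
        "assembly": 2_266_656_771,
        "gap": 44_130_982,
    },
}


def gap_from_totals(row):
    """Gap size implied by the totals (assumption: assembly minus contigs)."""
    return row["assembly"] - row["contigs"]


def gap_fraction(row):
    """Reported gap size as a fraction of the total assembly length."""
    return row["gap"] / row["assembly"]


for species, row in TABLE.items():
    implied = gap_from_totals(row)
    status = "matches" if implied == row["gap"] else "differs from"
    print(
        f"{species}: implied gap {implied:,} bp {status} "
        f"reported {row['gap']:,} bp ({gap_fraction(row):.2%} of assembly)"
    )
```

Under this assumption, gaps make up only a few percent of either assembly.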