Table 1 Statistics and annotated analysis of the cauliflower genome assembly

From: Draft genome sequence of cauliflower (Brassica oleracea L. var. botrytis) provides new insights into the C genome in Brassica species

 

Number

Size

Sequence coverage (X)

Percentage

Estimate of genome size

 

603.04 Mb

  

PacBio reads

 

69.06 Gb

114.52

 

Illumina reads

 

45.99 Gb

76.26

 

Total reads

 

115.05 Gb

190.78

 

Contigs

1,484

584.60 Mb

  

Coverage of sequenced genome

   

96.94 %

N50 of contigs

82

2.11 Mb

  

Longest contig

 

9.81 Mb

  

GC content

   

36.76 %

Total repetitive sequences

 

331.20 Mb

 

56.65 %

Total protein-coding genes

47,772

108.40 Mb

 

18.54 %

Annotated protein-coding genes

46,628

  

97.60 %

Average length per gene (exon + intron)

 

2,035 bp

  

Average exons per gene

4.78

242 bp

  

Average length per intron

 

260 bp

  

Noncoding RNAs

8,106

1.32 Mb

 

0.23 %