Table 4 DBG2OLC assembly performance comparison on various genomes.

From: DBG2OLC: Efficient Assembly of Large Genomes Using Long Erroneous Reads of the Third Generation Sequencing Technologies

Genome

Size

Coverage

NG50

Contigs

NGA50

Identity

Misassemblies

Longest

Sum

A. thaliana

120 Mbp

10x PacBio

405,464

881

258,924

99.77%

704

1,549,329

119 Mb

  

20x PacBio

2,431,755

306

926,138

99.90%

117

6,015,430

120 Mb

  

40x PacBio

3,601,597

243

1,605,981

99.93%

131

15,473,059

129 Mb

H. sapiens

3.0 Gbp

10x PacBio

432,739

16,689

347,104

99.56%

—

3,507,306

2.97 G

  

20x PacBio

1,886,756

9,757

1,416,766

99.82%

—

14,597,500

3.13 Gb

  

30x longest PacBio

6,085,133

13,095

4,124,714

99.85%

—

23,825,526

3.21 Gb

E. coli

4.6 Mbp

30x Nanopore

4,680,635

1

1,850,974

99.77%

1

4,680,635

4.7 Mb