Table 2 Comparative compression ratios achieved by Assembltrie on the MPEG HTS benchmarking data set

From: Optimal compressed representation of high throughput sequence data via light assembly

Sample

L  / cov

Compression rates

Assembltrie

Assembltrie (corrected)

Orcom

BEETL

Mince

k-Path

SRR554369

100/25

0.369

0.345

0.518

1.133

0.484

0.673

SRR327342

63/80

0.272

0.291

0.304

0.986

0.312

0.384

MH0001.081026

44/NA

0.781

0.758

0.804

1.785

0.786

2.545

SRR870667

108/20

1.821

1.733

0.884

1.287

0.735

0.707

ERR174310

101/7

0.701

0.570

0.686

1.493

0.746

0.797

ERP001775

101/20

0.350

0.322

0.364

N/A

N/A

N/A

Sim. T. cacao

108/19

0.538

0.479

0.667

N/A

N/A

N/A

NA12878

101/7

0.444

N/A

0.650

N/A

N/A

N/A

  1. Compression rates in bits per base for each software tool (with 8 threads) and each MPEG benchmark sample. The second column provides the read length and coverage; and the last columns present the compression performances for different software tools. Assembltrie outperforms all of the existing sequence-only compressors with different level of improvement depending on the read length/coverage (possibly with the greedy strand correction heuristic), except on the sample SRR870667 from T.cacao (which has an unusually high error rate)