Table 4 The simulated and real sequencing datasets used in this study.

From: Benchmarking UMI clustering tools for accurate detection of low-frequency variants from deep sequencing

Data

Read length (bp)

VAF (%)

Coverage

Mean read pairs per UMI (bp)

No. of Variant

Data content

Simulated data

110

10

5,000X-25,000X

12

100

Targeted sequencing data

110

5

5,000X-25,000X

12

100

Targeted sequencing data

110

1

5,000X-25,000X

12

100

Targeted sequencing data

110

0.5

5,000X-25,000X

12

100

Targeted sequencing data

110

0.25

5,000X-25,000X

12

100

Targeted sequencing data

110

0.1

5,000X-25,000X

12

100

Targeted sequencing data

110

0.05

5,000X-25,000X

12

100

Targeted sequencing data

110

0.025

5,000X-25,000X

12

100

Targeted sequencing data

Reference data (N0015)

150

5

/

12

338

Targeted sequencing data

Sample data (M0253)

150 bp

0.5

/

12

37

Targeted sequencing data