Table 1 Comparison between the synthesis and selection-based methods and previously reported methods of oligo subset selection

From: Oligonucleotide subsets selection by single nucleotide resolution barcode identification

Access method

PCR

Hybridization

Proposed method

2 barocde (TC access)

Proposed method

3 barocde (TCA access)

Proposed method

4 barocde (TCAC access)

Maximum number of designable subsets

5,625

–

~ 4n*

~ 4n*

~ 4n*

Designed subsets

35

3

14

44

128

Index / data region length

40 bp / 110 bp (26.7% of 150 bp)

20 bp / 117 bp (14.6% of 137 bp)

2 bp / 158 bp (1.25% of 160 bp)

3 bp / 157 bp (1.86% of 160 bp)

4 bp / 156 bp (2.5% of 160 bp)

Target enrichment before access / after access (100% match)

3.86% / 45.23% (11.7X)

3.85% / ~ 62.5% (16.2X)

4.83% / 63.77% (13.2X)

2.00% / 40.81% (20.4X)

0.68% / 22.92% (33.7X)

Target enrichment before access / after access (98% match)

3.86% / 61.13% (15.8X)

–

5.85% / 78.31% (13.4X)

2.39% / 47.90% (20.0X)

0.84% / 29.19% (34.7X)

Ratio of missing sequence (100% match)

1.94%

–

0.26%

1.03%

1.00%

Ratio of missing sequence (98% match)

1.80%

–

0.26%

1.03%

1.00%

  1. For PCR and hybridization, the number of required primers or capture probes increases proportionally with the number of subsets. In contrast, the proposed method requires only universal primers and leverages a hierarchical data structure.
  2. *n = barcode length; the barcode length is determined by the size of the library. Homopolymers longer than 3 nucleotides are excluded.