Fig. 2: Model experiment demonstrating feasibility of cluster construction. | Communications Biology


From: Correcting errors in PCR-derived libraries for rare allele detection by reconstructing parental and daughter strand information


a Schematic illustration of the experiment. The oligonucleotide was designed to contain a 12-nt barcode for molecular identity. Primers were designed to carry a UID and adaptor sequences for Illumina sequencing. b Number of paired UIDs (nPairedUIDs). GC content (%) of left UIDs (c) and right UIDs (d). Simulated data were created using a pool of 100,000 randomly generated UID sequences. e Comparison of nPairedUIDs between the normal-GC (<80%) and high-GC (≥80%) groups. Groups were compared using the two-sided Wilcoxon rank-sum test (****P = 2.50 × 10⁻¹⁵²). f Cluster size distribution. g Number of reads per UID-pair and per cluster, with pairs and clusters shown in ranked order. h Distribution of UID-pairs per cluster. i Specificity (%) of clusters before and after barcode content correction within a Hamming distance of 2, with clusters shown in ranked order. j Distribution of redundancy for a given cluster size. k Representative lineages of clusters in which sequencing errors were observed.
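
The simulated GC baseline in panels c and d, and the Hamming-distance criterion in panel i, can be illustrated with a minimal Python sketch. It is not the authors' code. The UID length (`UID_LENGTH = 12`), the uniform base model, the fixed random seed, and all function names are assumptions made for illustration; the caption specifies only the pool size of 100,000, the 80% GC threshold, and the Hamming distance of 2.

```python
import random

UID_LENGTH = 12        # assumption: the caption does not state the UID length
POOL_SIZE = 100_000    # pool size given in the caption
HIGH_GC_CUTOFF = 80.0  # high-GC threshold (%) given in the caption
MAX_DISTANCE = 2       # Hamming distance used for barcode correction


def random_uid(length, rng):
    """Draw a random sequence with each base equally likely (assumed model)."""
    return "".join(rng.choice("ACGT") for _ in range(length))


def gc_percent(seq):
    """Return the percentage of G and C bases in a sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def hamming(a, b):
    """Count mismatched positions between two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    return sum(x != y for x, y in zip(a, b))


def within_correction_distance(a, b, max_distance=MAX_DISTANCE):
    """True if two barcodes differ at no more than max_distance positions."""
    return hamming(a, b) <= max_distance


rng = random.Random(0)  # fixed seed so the sketch is reproducible
pool = [random_uid(UID_LENGTH, rng) for _ in range(POOL_SIZE)]
gc_values = [gc_percent(uid) for uid in pool]

# Fraction of simulated UIDs that would fall into the high-GC group.
high_gc_fraction = sum(gc >= HIGH_GC_CUTOFF for gc in gc_values) / POOL_SIZE
print(f"high-GC fraction in simulated pool: {high_gc_fraction:.4f}")
```

Under these assumptions a 12-nt UID needs at least 10 G or C bases to reach 80% GC, so about 79/4096 (roughly 1.9%) of the simulated pool is expected to land in the high-GC group. A different UID length or base composition would change that baseline.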
