Fig. 2: Error-handling capabilities of DBGPS in comparison with MA and large-scale performance simulations.
From: Robust data storage in DNA by de Bruijn graph-based de novo strand assembly

With twenty sequence copies, the performance of DBGPS and MA in handling various rates of (a) DNA breaks, (b) DNA rearrangements, (c) indels, (d) substitutions, and (e) mixed errors. The mixed errors comprise DNA breaks, DNA rearrangements, substitutions, insertions, and deletions in a ratio of 1:1:2:1:1. f Strand reconstruction rates with various strand copies containing 3% error mixtures of substitutions (1.5%), insertions (0.75%), and deletions (0.75%). g Strand reconstruction time by DBGPS with data scales ranging from 1 MB to 1 GB. The small bar chart at the top shows the fold changes in reconstruction time per strand compared to that of the 1 MB scale. (h) Illustration of the differences between the DBG constructed with numerous short sequences and that constructed with long sequence(s). The nodes with different colors stand for various k-mers. Each node stands for a unique k-mer. Data are presented as mean values of three independent simulations in figures a–g. The standard deviation (SD) values, which are too small to be clearly visualized, are listed in the source data. Source data are provided as a Source Data file.