Fig. 1: Assembly errors across the assemblers. | Nature Biotechnology

From: Troubleshooting common errors in assemblies of long-read metagenomes

a, A schematic representation of long reads mapping to a contig with multiple types of read disagreement with the reference, including INDELs and SNVs representing more than half or all of the coverage, and clipping events spanning the entire coverage. All metrics for b–g are normalized by assembly size and exclude the two mock community metagenomes (n = 19 assemblies). For all box plots, the box represents the interquartile range (IQR), the central line indicates the median and whiskers extend to 1.5× the IQR. b, Number of clipping events supported by at least 10 reads. c, Number of regions over 1,000 bp with no apparent coverage. d,e, Number of SNVs representing >50% (d) or all (e) of the coverage at a given locus. f,g, Number of INDELs representing >50% (f) or all (g) of the coverage. h, Length distribution of circular contigs for each assembler. The darker color represents the distribution of circular contigs with at least one clipping event.
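As a rough illustration of the kind of metric shown in panel c, the sketch below scans a per-base read depth array for runs of zero coverage longer than 1,000 bp and then scales the count by assembly size. It is a minimal sketch under stated assumptions, not the authors' pipeline: the function names `zero_coverage_regions` and `per_megabase` are hypothetical, the depth array is assumed to come from some upstream read-mapping step, and the per-megabase unit is an assumption, since the caption says only that the metrics are normalized by assembly size.

```python
from typing import List, Tuple


def zero_coverage_regions(depth: List[int], min_len: int = 1000) -> List[Tuple[int, int]]:
    """Return half-open (start, end) intervals where depth is 0 for more than min_len bases.

    `depth` is a hypothetical per-base read depth list for one contig.
    """
    regions: List[Tuple[int, int]] = []
    start = None  # start of the current zero-depth run, if any
    for pos, d in enumerate(depth):
        if d == 0:
            if start is None:
                start = pos
        elif start is not None:
            # The run ended at pos; keep it only if strictly longer than min_len.
            if pos - start > min_len:
                regions.append((start, pos))
            start = None
    # Handle a zero-depth run that extends to the end of the contig.
    if start is not None and len(depth) - start > min_len:
        regions.append((start, len(depth)))
    return regions


def per_megabase(count: int, assembly_size_bp: int) -> float:
    """Scale a raw count by assembly size (assumed unit: events per Mb)."""
    return count * 1_000_000 / assembly_size_bp


# Toy contig: one 1,500 bp gap (counted) and one gap of exactly 1,000 bp (not counted).
toy_depth = [5] * 100 + [0] * 1500 + [3] * 100 + [0] * 1000 + [2] * 50
toy_regions = zero_coverage_regions(toy_depth)
toy_rate = per_megabase(len(toy_regions), assembly_size_bp=2_000_000)
```

The strict `>` comparison follows the caption's wording "over 1,000 bp"; a different threshold convention would change which runs are counted.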
