Extended Data Fig. 5: Impact of within-batch genome order on the compressibility of microbial collections. | Nature Methods

From: Efficient and robust search of microbial genomes via phylogenetic compression

While a substantial part of the benefits of phylogenetic compression comes from organizing genomes into batches of phylogenetically related genomes, proper genome reordering within individual batches is also crucial for maximizing data compressibility. The plots demonstrate that the impact of within-batch reordering grows with the amount of diversity included (GISP vs. NCTC3k) and with the number of genomes (GISP vs. SC2). Accurate phylogenies inferred using RAxML provided a small compression benefit for assemblies over trees computed using Mashtree (GISP).
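The ordering effect described in the caption can be reproduced on synthetic data. The following is a toy sketch, not the authors' pipeline. It assumes random DNA "lineages" with point substitutions, and it uses zlib only because its 32 KB window makes the effect easy to see. All names and parameters (`make_lineages`, genome length, mutation rate) are hypothetical. Here the "good" order simply places genomes of the same lineage next to each other, whereas the figure compares orders derived from Mashtree and RAxML trees.

```python
import random
import zlib

ALPHABET = "ACGT"

def mutate(seq, rate, rng):
    """Copy seq, redrawing each base with probability `rate` (may redraw the same base)."""
    return "".join(rng.choice(ALPHABET) if rng.random() < rate else base for base in seq)

def make_lineages(n_lineages, n_per_lineage, length, rate, rng):
    """Build toy lineages: each is a random ancestor plus independently mutated copies."""
    lineages = []
    for _ in range(n_lineages):
        ancestor = "".join(rng.choice(ALPHABET) for _ in range(length))
        lineages.append([mutate(ancestor, rate, rng) for _ in range(n_per_lineage)])
    return lineages

def compressed_size(genomes):
    """Size in bytes of the zlib-compressed concatenation of the genomes, in the given order."""
    return len(zlib.compress("".join(genomes).encode("ascii"), 9))

rng = random.Random(0)  # fixed seed so the run is repeatable
n_per_lineage = 5
lineages = make_lineages(n_lineages=4, n_per_lineage=n_per_lineage,
                         length=20_000, rate=0.01, rng=rng)

# Related genomes adjacent: the previous relative is 20,000 bytes back, inside zlib's window.
grouped = [g for lineage in lineages for g in lineage]

# Round-robin across lineages: the previous relative is 80,000 bytes back, outside the window.
interleaved = [lineage[i] for i in range(n_per_lineage) for lineage in lineages]

grouped_size = compressed_size(grouped)
interleaved_size = compressed_size(interleaved)
print(f"grouped: {grouped_size} bytes, interleaved: {interleaved_size} bytes")
```

Both orders contain exactly the same genomes, so any size difference comes from the order alone. Compressors with larger dictionaries are less sensitive at this scale, which fits the caption's observation that the impact of ordering grows with the number and diversity of genomes.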
