Fig. 4

Bar plots depicting the relative abundances of the species present in the mock community, as estimated by various programs and parameter settings. The column “True” displays the correct abundances of the clades in the mock. The less abundant true species are grouped into a single category labelled “Others”. Species that were inferred but are not actually included in the mock are categorized as “misclassified”. The x-axis lists the classification types; in their labels, “E” denotes the E-value threshold, “m” the coverage threshold and “c” the confidence level of the classification, depending on the options available for each classifier. The prefix “contigs” specifies classifications based on contigs rather than individual reads, for Kaiju and Kraken2. The prefixes “default”, “metalarge” and “custom” refer to the different MEGAHIT assembly settings.