Fig. 2: Bin recovery from the CAMI-high dataset using BASALT.

Contigs assembled by Opera-MS were processed with BASALT using default parameters. a–c Summary of MAGs with completeness ≥50 and contamination ≤10. The number of MAGs are indicated on side bars at the top/right of the main figure. d Percentage of rRNAs detected in bins after Gap Filling. e Summary of tRNAs in the processed bins after Gap Filling. The boxplot shows the distribution of data, the central dot in the box represents the median, the box bounds represent the 25th and 75th percentiles, and whiskers represent the minima to maxima values. f Completeness, contamination and quality rates after processing with Bin Selection (Bin selected, n = 290), Refinement (Refined, n = 298), and Gap Filling (Gap-filled, n = 352) modules. Data are presented as means ± SD from MAGs retrieved at each step.