Fig. 4: Contamination levels estimation using NC-detected sequences from external studies.
From: Negativeome characterization and decontamination in early-life virome studies

a Correlation of the percentage of strains shared between samples and internal versus external NCs. b Correlation of the percentage of strains shared between samples and internal NCs with the percentage of vOTUs shared between samples and NCs from external studies. In (a, b), each dot is a sample. Data are shown for 1254 samples. Per study distribution: Garmaeva et al. (n = 205), Liang et al. (n = 324), Maqsood et al. (n = 78), and Shah et al. (n = 647). The percentage of strains and vOTUs shared with NCs is calculated per sample as the number of strains or vOTUs shared with NCs divided by the sample richness. The solid line represents the fitted linear regression, and the shaded band denotes the 95% confidence interval of the model. Spearman correlation rho’s are depicted in the upper right corner of each panel.