Fig. 4: Inference of S. marcescens clusters in metagenomic samples.

Cluster-specific core genes with high specificity and sensitivity (see “Methods” section) were searched for in metagenomic samples from the MGnify database to identify phylogenetic clusters of S. marcescens across a broad set of biomes. A metagenomic assembly was considered positive for a cluster if >10% of that cluster's specific core genes were found within it. When a metagenomic assembly was positive for core S. marcescens genes, but not for genes associated with Cluster 1, Cluster 2 or Cluster 5, the assembly was defined as Other SMA. The heatmap shows the residuals of the χ² test used to investigate whether S. marcescens clusters are associated with samples from specific biomes. Statistically significant associations are marked with asterisks (‘∗’, p value < 0.05). Only biomes with more than 100 positive samples are shown here; see Fig. S11 for all biomes.
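
The assignment rule and the heatmap values described in this caption can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function names, the `species_core_positive` flag (how positivity for core S. marcescens genes is decided is not stated here) and the choice of Pearson residuals, (observed − expected)/√expected, are assumptions made for illustration. The caption does not specify which type of residual is plotted.

```python
from math import sqrt

# Clusters named in the caption; everything else below is illustrative.
TRACKED_CLUSTERS = ("Cluster 1", "Cluster 2", "Cluster 5")


def classify_assembly(found_genes, cluster_genes, species_core_positive,
                      threshold=0.10):
    """Label one metagenomic assembly (hypothetical helper).

    found_genes: set of gene IDs detected in the assembly.
    cluster_genes: dict mapping cluster name -> set of its specific core genes.
    species_core_positive: assumed boolean, True if the assembly is positive
        for core S. marcescens genes.
    """
    labels = [
        cluster for cluster in TRACKED_CLUSTERS
        # Strictly more than 10% of the cluster-specific core genes.
        if len(found_genes & cluster_genes[cluster]) / len(cluster_genes[cluster]) > threshold
    ]
    if not labels and species_core_positive:
        labels = ["Other SMA"]
    return labels


def pearson_residuals(table):
    """Pearson residuals of a contingency table (rows x columns of counts).

    Assumes every row and column total is non-zero.
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    residuals = []
    for i, row in enumerate(table):
        out_row = []
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            out_row.append((observed - expected) / sqrt(expected))
        residuals.append(out_row)
    return residuals
```

In such a sketch, the rows of `table` would be biomes and the columns the cluster labels; a positive residual means a cluster is found in a biome more often than expected under independence, and a negative one less often.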