Extended Data Fig. 2: 16S rRNA gene-based phylogenetic tree illustrating the diversity and origin of SMGA’s bacterial genomes and of their close relatives identified in the Silva database.

The cladogram depicts the taxonomic position of all sequences, coloured by genus (outer ring). The sequence IDs of the organisms used in the phylogeny (from the SMGA and the Silva database) are printed on top of the outer ring. Grey dots in the cladogram indicate bootstrap support higher than 70%. The genome of Prochlorococcus marinus subsp. marinus CCMP1375 (Silva ID: AE017126) was used as an outgroup. The scale bar indicates 5% estimated sequence divergence.