Figure 5

(a,b) Taxonomic overlap between NCOG and BioGEOTRACES, Tara Oceans, and Tara Polar for 16S (a) and Tara Oceans and Tara Polar for 18S-V9 (b). Size of circles indicates the mean number of ASVs identified per region per database. Edge color represents the database for the respective data (NCOG, BioGEOTRACES, Tara Ocean, or Tara Polar). Fill color represents the percentage of NCOG ASVs found in each respective region/dataset. (c,d) Relationship between regional richness (per database) and the % overlap between NCOG 16S ASVs and regional 16S and 18S-V9 ASVs. Each point represents a different region, as seen in (a,b). Colors represent the three biogeographic categories (Endemic, Generalist, Cosmopolitan). Linear fits between region-database richness and percent overlap were derived from the glm package in R.