Fig. 7: Overlapping occurrence of viral scaffolds, their percent G/C base content and CRISPR spacer to viral protospacer hits.
From: Marine viruses disperse bidirectionally along the natural water cycle

Overview of shared viral scaffolds (>10 kb length) between seawater, aerosol, and precipitation obtained from 55 metagenomes and determined by the mapping of reads. A viral genome was considered present in a sample if at least 75% of the genome were covered with reads at least 90% identical to the genome, in accordance with suggested viromics benchmarks78 (a). Percent of the bases guanine (G) and cytosine (C) in viral scaffolds from rain, aerosol, foam, surface microlayer (SML), and subsurface 1-m deep water (SSW) based on read mapping. Rain_only refers to viral genomes exclusively found in rain in this study. Stars indicate significant differences after Kruskal Wallis test and Dunn’s multiple comparison test (****, adjusted p = <0.0001). In each pairwise comparison, the marine groups were significantly different from the atmospheric groups. Rain_total was also significantly different from Rain_only (***, adjusted p = 0.0002), which is not indicated to reduce complexity of the figure. The line of the box plot represents the median, the box extends from the 25th to 75th percentiles, whiskers indicate the min. to the max. value (b). CRISPR spacers (origin indicated as square) matching assembled viral scaffolds (circles) derived from different ecosystems (c). Source data are provided as a Source Data file.