Fig. 1: Overview of Stordalen Mire soil viruses.
From: Host-linked soil viral ecology along a permafrost thaw gradient

a, An accumulation curve of viral populations in bulk soil metagenomes (n = 201). The means are represented by red circles and 200 randomizations of sample order are shown in teal. b, A network of shared predicted protein content among Stordalen Mire viruses (n = 1,907), RefSeq prokaryotic viral genomes (n = 2,010) and soil-associated viral contigs >10 kb from Paez-Espino et al. (ref. 19; n = 3,112) and Roux et al. (ref. 18; n = 2,040). Nodes (circles) represent genomes and contigs, and edges (lines) indicate shared protein content. c, Pie charts indicate per cent relative abundances of Stordalen Mire viral populations (n = 828, 782 and 475 populations detected in palsa, bog and fen, respectively; palsa: n = 72 samples, bog: n = 65 samples and fen: n = 64 samples) that: have predicted taxonomy (green), have unknown taxonomy but share a viral cluster (VC) with viruses from public datasets (from b, blue), or were previously unknown (in a Stordalen Mire-exclusive VC, yellow). The bar graphs indicate the per cent relative abundances of viral taxa in each habitat, considering only viruses with predicted taxonomy (n = 323). d, Principal coordinates analysis (PCoA) of viral community composition, as derived from read mapping to viral contigs (n = 1,907) and Bray–Curtis dissimilarities; each point is one sample (n = 201). The analysis of similarity (ANOSIM) statistics consider viral community composition grouped by habitat (palsa: n = 72 samples, bog: n = 65 samples and fen: n = 64 samples).
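
As an illustration of two of the calculations named in panels a and d, the following minimal Python sketch builds accumulation curves by randomizing sample order and computes a Bray–Curtis dissimilarity between two samples. It is not the authors' pipeline: the function names, the input format (one set of population IDs per sample) and the toy data are all hypothetical, and the PCoA and ANOSIM steps are not shown.

```python
import random


def accumulation_curves(samples, n_randomizations=200, seed=0):
    """Return one cumulative-richness curve per random ordering of samples.

    samples: list of sets, each holding the viral population IDs detected
    in one metagenome (assumed input format, for illustration only).
    """
    rng = random.Random(seed)
    curves = []
    for _ in range(n_randomizations):
        order = samples[:]  # copy so the caller's list is left untouched
        rng.shuffle(order)
        seen = set()
        curve = []
        for detected in order:
            seen |= detected         # add populations new to this ordering
            curve.append(len(seen))  # cumulative number of populations
        curves.append(curve)
    return curves


def mean_curve(curves):
    """Average the curves position by position (the mean per sample count)."""
    n = len(curves)
    return [sum(column) / n for column in zip(*curves)]


def bray_curtis(a, b):
    """Bray-Curtis dissimilarity between two equal-length abundance vectors."""
    total = sum(a) + sum(b)
    if total == 0:
        return 0.0  # convention chosen here for two empty samples
    return sum(abs(x - y) for x, y in zip(a, b)) / total


# Toy data (hypothetical): four samples, five populations in total.
toy_samples = [{"v1", "v2"}, {"v2", "v3"}, {"v1", "v4"}, {"v5"}]
curves = accumulation_curves(toy_samples, n_randomizations=200, seed=1)
print(mean_curve(curves))
print(bray_curtis([10, 0, 5], [5, 5, 0]))
```

Every randomized curve ends at the same total number of populations; only the path to that total depends on sample order, which is why the individual randomizations spread around the mean at intermediate sample counts.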