Fig. 4: Viral protein-coding genes and AMGs across EHSs.
From: Ecosystem health shapes viral ecology in peatland soils

a, UpSet plot showing amino acid identity-based clustering of all viral protein-coding genes (n = 77,662 genes). Intersections represent protein clusters with viral proteins from multiple EHSs, while non-intersecting groups represent proteins unique to a single EHS. The distribution of PHROG22 functional categories across these intersections is shown in the stacked bar plot at the top. b, UpSet plot of unique KEGG KOfams24 (n = 59 families) among viral AMGs (n = 100 genes) across EHSs. Intersections indicate KOfams shared across different EHSs, while non-intersecting groups highlight KOfams unique to a single EHS. The stacked bar plot at the top illustrates the distribution of KEGG metabolism categories associated with these viral KOfams across the intersections.