Fig. 1: Annotation of the high-quality viral genomes recovered from metagenomes collected from built environments.

a Boxplots of the contig lengths of the predicted viral and non-viral contigs >1 kb, as determined by Virsorter2 (Vs2) and DeepVirFinder (DVF) and assessed by CheckV. The number of contigs (n) is indicated. Boxplots represent the median, the first quartiles and third quartiles with whiskers drawn within the 1.5 interquartile range value. Points outside the whiskers are outliers. b Accumulation curves of the viral genomes in the combined, pier, public facility, residence, and subway datasets. c Metadata and taxonomy of 1174 viral genomes with >50% completeness. d Principal coordinate analysis of the Bray–Curtis dissimilarity matrix for all of the samples. The color and shape of the symbols indicate the built environments and surface materials, respectively.