Extended Data Fig. 5: Host predictions for Chesapeake Bay viral populations from 2018 virome libraries.
From: Interaction dynamics and virus–host range for estuarine actinophages captured by epicPCR

Host predictions for Chesapeake Bay viral populations from spring (May), summer (August), and winter (December) 2018 virome libraries. a, Predicted host and phage family based on RNR homology. 634 RNR gene homologs were found across 10,858 total viral populations. RNR nucleotide sequences were aligned with MAFFT using the peptide sequences as a guide in TranslatorX. Only the 437 RNR sequences that spanned the same region (460A to 693I in the E. coli class I alpha RNR peptide) were further analyzed to avoid possibly double-counting partial RNR sequences on separate contigs. RNR homology was queried by BLASTn against the NCBI nr database. Only top hits with e values < 1E-10 were classified. The total number of RNR sequences analyzed at each time point are indicated in parentheses. b, Mean nucleotide identity between Chesapeake Bay RNR sequences and reference sequence top hits. Only Chesapeake Bay RNR sequences sharing at least 65% nucleotide identity over 90% of the sequence with a reference sequence were assigned a reference hit. The number of RNR sequences are indicated in parentheses. Box and whisker markers represent the minimum, first quartile, median, third quartile, and maximum values.