Table 1 CrAssphage ORFs with homology to known proteins or domains.

From: A highly abundant bacteriophage discovered in the unknown sequences of human faecal metagenomes

ORF

Function of top hits

Species

Host phylum

orf00014

Hypothetical protein

Phages

Ambiguous

orf00017

Uracil-DNA glycosylase

Acetivibrio cellulolyticus

Firmicutes

orf00016

DNA helicase

Francisella philomiragia

Proteobacteria

orf00018

DNA polymerase

Labrenzia

Proteobacteria

orf00025

DNA primase/helicase

Veillonella sp.

Firmicutes

orf00029

DNA ligase

Erwinia phage

Proteobacteria

orf00031

Deoxynucleoside monophosphate kinase

Enterobacteria phage

Proteobacteria

orf00032

Baseplate hub

Aeromonas phage

Proteobacteria

orf00033

Thymidylate synthase complementing protein ThyX

Prevotella sp.

Bacteroidetes

orf00035

Hypothetical protein

Bacteroides sp.

Bacteroidetes

orf00037

Phage/plasmid-related protein

Mucilaginibacter

Bacteroidetes

orf00038

Deoxyuridine 5'-triphosphate nucleotidohydrolase

Acinetobacter phage

Proteobacteria

orf00039

Endonuclease

Paenibacillus sp.

Firmicutes

orf00040

Deoxyuridine 5'-triphosphate nucleotidohydrolase

Salmonella phage

Proteobacteria

orf00042

Glutaredoxin/thioredoxin

Phages

Ambiguous

orf00047

Hypothetical protein

Clostridium bolteae

Firmicutes

orf00050

Plasmid replication protein domain

Firmicutes

Firmicutes

orf00052

Phage-structural protein

Cellulophaga phage

Bacteroidetes

orf00053

Phage-structural protein

Synechococcus phage

Cyanobacteria

orf00056

Hypothetical protein

Escherichia phage

Proteobacteria

orf00065

Hypothetical protein

Alistipes putredinis

Bacteroidetes

orf00066

Phage-related protein

Escherichia phage

Proteobacteria

orf00070

Predicted protein

Bacteroides sp.

Bacteroidetes

orf00071

Predicted protein

Bacteroides sp.

Bacteroidetes

orf00072

Hypothetical protein

Acinetobacter schindleri

Proteobacteria

orf00073

Phage-related protein

Bacteroides stercoris

Bacteroidetes

orf00074

Phage-structural protein, contains BACON domains

Bacteroides sp.

Bacteroidetes

orf00075

Phage-structural protein

Mycobacterium phage

Actinobacteria

orf00077

Recombination endonuclease sunbunit

Bacteroides vulgatus

Bacteroidetes

orf00076

Phage-related protein

Desulfitobacterium hafniense

Firmicutes

orf00086

Phage-structural protein

Veillonella sp.

Firmicutes

orf00088

Phage-structural protein

Pseudomonas phage

Proteobacteria

orf00091

Phage-structural protein

Cellulophaga phage

Bacteroidetes

orf00092

Hypothetical protein

Veillonella sp.

Firmicutes

orf00093

DNA helicase

Staphylococcus phage

Firmicutes

orf00094

Endolysin

Marinilabilia salmonicolor

Bacteroidetes

orf00095

Endolysin

Phage

Proteobacteria

orf00096

Phage-related protein

Marinimicrobia sp.

Marinimicrobia

orf00102

Plasmid replication protein domain

Firmicutes

Firmicutes

  1. ORF, open reading frame.
  2. Function and taxonomy information of the hits are displayed. For details see Supplementary Data 2.