Figure 1 | Scientific Reports

Figure 1

From: Discovering viral genomes in human metagenomic data by predicting unknown protein families

Flowchart of the ORFan protein family prediction pipeline. The diagram starts with the raw reads from the libraries described in Table 1. Blue squares denote the preprocessing steps performed to obtain a data set of unannotated sequences. These unannotated sequences were then processed through our prediction pipeline (in green), resulting in 32 predicted protein families.
