Figure 1

Flowchart of major steps for SMDB construction. First, a core database was constructed for selected genes by retrieving protein sequences from UniProt databases using keywords. Second, a full database was constructed by integrating target genes from databases, including COG, eggNOG, KEGG, M5nr, and NR. Third, a PERL script was developed to generate functional and taxonomic profiles for shotgun metagenomes using searching tools.