Extended Data Figure 2: Binning of ‘Entotheonella factor’ contigs and natural product biosynthetic genes.
From: An environmental bacterial taxon with a large and distinct metabolic repertoire

a, Plot of G+C content versus relative frequency for all 18,093 contigs assembled from sequencing of the enriched filamentous bacterial fraction from Theonella swinhoei Y. BLASTX analysis was performed against the RefSeq database, and the taxonomic domain classification for the best hit was assigned to each contig. Scaffolds attributed to bacterial sources are shown in red, contigs attributed to Archaea are shown in blue, those exhibiting eukaryotic features in green, and unclassified contigs in black. b, Filtered scaffolds, in which contaminating non-bacterial contigs and bacterial contigs that could not be assembled into scaffolds were removed. Scaffolds containing ORFs predicted to code for natural product biosynthesis are indicated in blue. The Bacteria-associated contigs and scaffolds clustered into two main groups designated ‘Entotheonella factor TSY1’ for the dominant organism, ‘Entotheonella factor TSY2’ for the less-abundant organism, and pTSY for plasmid-associated scaffolds. Scaffolds containing biosynthetic genes associated with the known compound classes, cyclotheonamides (cth), konbamides (kon), keramamides (ker), and nazumamides (naz) all cluster with TSY1. The onnamide (onn) and polytheonamide (poy) biosynthetic genes are located on the plasmid, pTSY.