Fig. 1: Recovery of genomes from globally distributed gut metagenomes.
From: New insights from uncultivated genomes of the global human gut microbiome

a, Geographical distribution of metagenomes. Sample sizes are indicated in parentheses, and pin colour indicates the majority age group and lifestyle (infants, ā¤3Ā years old; adults, ā„18Ā years old). Several locations are represented by multiple studies; several studies were conducted in multiple locations. b, Computational pipeline for assembling MAGs. c, Pipeline for identifying and removing incorrectly binned contigs. d, Quality metrics across low- (nĀ =Ā 101,651), medium- (med., nĀ =Ā 36,319) and high-quality (nĀ =Ā 24,345) MAGs. e, Barriers to MAG recovery. Single nucleotide polymorphisms (SNPs) were called for MAGs with sufficient read depth (nĀ =Ā 17,671), and compared with N50. Red line is from a Spearman correlation (ĻĀ =Ā ā0.61). f, At least 10ā20Ć depth is required to assemble a MAG, but assembly rates vary between taxa. AB, Actinobacteria; AR, Archaea; BD, Bacteroidetes; FR, Firmicutes; VM, Verrucomicrobia; PR, Proteobacteria; SP, Spirochaetes. Sequencing read depth was estimated using IGGsearch (seeĀ Methods), and curves were fit using logistic regression. For box plots, the middle line denotes the median; the box denotes the interquartile range (IQR); and the whiskers denote 1.5Ć IQR.