Fig. 1: The reconstruction of sequence catalog from the early-life human gut microbiome.

a The number and proportion of fecal metagenomes stratified by clinical features including age, gender, delivery mode, gestational age, and feeding patterns. b Overview of the computational pipeline to generate ELGG and ELGP catalogs. c Quality metrics across near-complete (n = 25,303), medium with quality score (QS) > 75 (n = 2063) and medium with QS ≤ 75 (n = 4911) MAGs. CPM copies per million reads. Boxes show the interquartile range (IQR), with the horizonal line as the median, the whiskers indicating the range of the data (up to 1.5× IQR), and points beyond the whiskers as outliers. d Completeness and contamination scores for each of 32,277 genomes. QS = completeness–5 × contamination.