Fig. 3

Overview of quality of metagenomic libraries. (a) Dot plot showing read-quality of all metagenomic libraries with a median and interquartile range of Phred -quality score. (b) The percentage of reads containing predicted features in raw reads after pair-merging, artefact removal, host DNA removal, and feature extraction. (c) The taxonomic prediction of raw reads is shown at the domain level. The “others” shown here means reads that contain the virus, archaea, unclassified taxa, and other sequences. (d) Correlation of taxonomic compositions (top 10) between 16 S rRNA sequencing and shotgun metagenomic sequencing. The relationship between the fold-change of taxonomic abundance from 16 S rRNA sequencing data and the fold-change of taxonomic abundance from metagenomic sequencing data was analyzed for top 10 taxonomic units at genus scale. “R” and “P” indicate the Pearson’s R and significance of the pairing, respectively. Significance was assessed by two-tailed P-values and we used an α level of 0.05 for all statistical tests. “AK” indicate the genus Akkermansia.