Extended Data Fig. 4: Topographical analyses of metagenome data.

Comparison of a) alpha diversity (Shannon Index, each dot denotes the Shannon diversity of a sample while the box inter-quartile range with median at the center and whiskers represent maximum and minimum value) and b) beta diversity (Bray Curtis Dissimilarity index, across 5 background negative controls (bronchoscope), 118 bronchoalveolar lavage (BAL) and 64 upper airway (UA) samples (Kruskal-Wallis p-value = 0.00000000000000022 and PERMANOVA p-value= 0.001, without multiple comparisons, respectively). (c) Boxplots showing the relative abundance values in log10 across all metagenome samples for the 118 BAL and 64 Upper Airway samples. The 50 taxa with the highest relative abundance values in the BAL metagenome are displayed; the top 10 in the BAL are highlighted in bold. Each column consists of two plots displaying the most abundant bacteria and fungi identified. Numbers in parentheses next to the taxa labels displays its ranking in relative abundance for either the BAL or UA metagenome samples, respectively. Each dot denotes the relative abundance of a taxa per sample while the box inter-quartile range with median at the center and whiskers represent maximum and minimum value.