Figure 1

The general pipeline of the study. Reads from the iHART dataset that were unmapped or poorly aligned to GRCh38 were extracted and classified with Kraken against a database of viruses, bacteria, and archaea. An F-regression was then performed on the bacterial and viral counts against various sample-level metadata.
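
As an illustration of the final step, the following is a minimal sketch of a univariate F-test between per-sample counts and one numeric metadata variable. The caption does not state which implementation was used. The sketch assumes the statistic is the usual one for simple linear regression, F = r² / (1 − r²) · (n − 2), which is the quantity that scikit-learn's `f_regression` reports. The function names `f_statistic` and `rank_taxa` and the data layout are hypothetical.

```python
import math


def f_statistic(x, y):
    """Univariate F-statistic for regressing y on a single feature x.

    Assumption: simple linear regression with an intercept, so
    F = r^2 / (1 - r^2) * (n - 2), where r is the Pearson correlation.
    """
    n = len(x)
    if n != len(y) or n < 3:
        raise ValueError("x and y must have equal length of at least 3")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        # A constant feature or target carries no linear signal.
        return 0.0
    r2 = (sxy * sxy) / (sxx * syy)
    if r2 >= 1.0:
        # Perfect linear fit: the residual variance is zero.
        return math.inf
    return r2 / (1.0 - r2) * (n - 2)


def rank_taxa(counts_by_taxon, covariate):
    """Score each taxon's per-sample counts against one metadata variable.

    counts_by_taxon: dict mapping a taxon name to a list of per-sample counts
    covariate: list of numeric metadata values, in the same sample order
    Returns (taxon, F) pairs sorted from the largest F to the smallest.
    """
    scores = [(taxon, f_statistic(counts, covariate))
              for taxon, counts in counts_by_taxon.items()]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)
```

A larger F suggests a stronger linear association between a taxon's counts and the metadata variable. Converting F to a p-value would require the F(1, n − 2) distribution, which is omitted here to keep the sketch dependency-free.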