Fig. 4: Bacterial genome-wide association analysis.

The phylogeny of global confirmed S. mitis whole-genome sequences was built using 473,175 SNPs out of 1,237,113 nucleotide bases and annotated with disease status. The tree was pruned to select pairs of genetically closest carriage and invasive disease isolates. This phylogeny-based approach provided an approximate matching of the isolates for the bacterial genome-wide association analysis. Source data are provided as a Source Data file.