Fig. 5: Population structure of 493 Shigella and E. coli reference genomes based on core-genome SNVs.

This maximum-likelihood phylogenetic tree is based on 92,688 core-genome single-nucleotide variants (SNVs). Nodes supported by bootstrap values ≥95% are indicated by red dots. Phylogenetic clades containing Shigella genomes are labelled with the same nomenclature (S1-S3, SON, SD1, SD8, SD10, and SB12) as in Fig. 1. All Shigella genomes are also labelled on the right with their cgMLST HC2000 and HC1100 cluster assignments. The scale bar indicates the number of nucleotide substitutions per variable site (SNV).