Figure 3

RT106 strains harbor a complete and unique 46 kb genomic island 1. The relatedness of the 265 C. difficile strains that carry GI1 segments (> 7.7 kb, 98% identity) is shown in a maximum likelihood tree (log likelihood = − 479,911.97) based on 40,879 core SNPs identified using Panseq. The percentage of replicate trees in which the associated taxa clustered together in the bootstrap test (1000 replicates) are shown next to the branches. GI1 is drawn to scale on the right to illustrate regions present in different sequence types (ST). Tree scale: 0.01 represents 0.01 substitutions per nucleotide site. The complete 46 kb GI1 is present in RT106/ST28/ST42. Genes were colored based on functional categories from gene ontology (GO) analysis. A 7.1 kb region carried by all the strains (black dashed box) was used for determining progenitor STs of the element in the molecular clock analysis in Fig. 4.