Fig. 5: Phylogenetic distance by across sources and sequence types.
From: Parameters for one health genomic surveillance of Escherichia coli from Australia

This series of box plots compare the phylogenetic distances of isolate pairs (the unit under study). STs for a given pair are shown on the y-axis, SNP distances are shown on the x-axis and the colour of a data point representing a pair of isolates indicates the cgMLST distance for that pair. Counts of strain pairs for given combinations of STs and pairs of sources are available in Supplementary Data 3. Central lines within boxplots represents the mean, while the bounds of the box indicate the first and third quartiles (25th and 75th percentiles). Whiskers extend to the minimum and maximum values. A red dotted line at 100 SNPs indicates a threshold of relatedness used to indicate moderate phylogenetic overlap. Panels visualise these parameters for pairs of isolates from the denoted pairs of sources – only isolate pairs from different sources are shown.