Fig. 6: Distribution of metabolic regulator start codons across the Enterobacteriaceae family.

a, Non-ATG start codons are conserved among several genera of the Enterobacteriaceae family. Shigella and Citrobacter genomes have a similar level of lacI GTG start codon conservation as Escherichia. The bacterial families were grouped according to their phylogenetic lineage. b, Alternative start codons are found in multiple carbohydrate-related regulators within the Enterobacteriaceae family. The data shown in this figure rely on existing annotations, which can be ambiguous or missing. The conservation level is represented by a circle-filled colour code that varies depending on the regulator and genus. c, Percentage of non-ATG start codons in carbohydrate-related (n = 32) and carbohydrate-unrelated (n = 29) transcriptional regulators in Enterobacteriaceae. A two-tailed Mann–Whitney U test was used to compare the two groups (****P = 1.5 × 10−11). Data are shown as median (black horizontal line) and 25% and 75% percentiles (hinges). Whiskers extend from the hinges to the maxima and minima, no further than 1.5× distance of the interquartile range.