Extended Data Fig. 4: Maximum-likelihood phylogenetic tree of the aldehyde dehydrogenase superfamily.

(a-c) Relative levels of G3P (a), SA (b) and SAG (c) in mock- and Pst avrRpt2-inoculated Col-0 and 35S–PipOX plants. The leaves were sampled 24 h post treatment. The error bars represent SD (n = 3 or 4 biological replicates). Asterisks denote a significant difference (multiple unpaired t-tests, **P < 0.002; ***P < 0.0002; ****P < 0.0001). These experiments were repeated at least twice with similar results. (d) Phylogenetic analysis showing the major clades in the ALDH family. Several conserved clades were collapsed into filled triangles for convenience. Only key clades are labeled; clades that contain Arabidopsis or human proteins are labeled with the gene names and, where known, the biochemical activities of those proteins. Also shown are the metabolites, such as amino acids, lignin and ethanol, whose pathways involve the corresponding ALDH. The raw data for these trees are available in the supplementary material (Supplementary Fig. 2). Clades with bootstrap values > 80% are marked with filled circles. The color of each collapsed triangle reflects the phyletic distribution of the branches in that clade, as given in the key.