Fig. 5: Increased old yellow enzyme (OYE) gene family size promotes ROS resistance in yeasts.
From: Machine learning reveals genes impacting oxidative stress resistance across yeasts

A A phylogenetic tree of the species included in the ML model, with bars showing the size of the OYE gene family in each species. Filled circles indicate ROS-resistant species and empty circles indicate ROS-sensitive species. B A categorical comparison showing that ROS-resistant yeasts (Class 1) had, on average, more copies of OYE genes than ROS-sensitive yeasts (Class 0). Statistical significance was determined using a two-sided t-test, and the dots represent the individual species included in the model (n = 57 per class). C The correlation between the number of OYE genes in each species and its relative growth (empirical area under the curve, EAUC) at either 1 or 2 mM TBOOH, as specified. The shaded area represents the 95% confidence interval. The p-values shown are based on t-tests of regression coefficients phylogenetically corrected with a generalized least squares model, and the R values are adjusted for phylogenetic effects. D A spot assay showing growth of the K. lactis strains transformed with the following plasmids: an empty vector (pIL75), three independent transformants overexpressing the OYE ortholog KYE1 (pIL75-PKlTEF1-KYE1) or GFP (pIL75-PKlTEF1-GFP) to control for the effect of protein overexpression. Plates were supplemented with varying concentrations of TBOOH as indicated and incubated at 30 °C for four days before imaging. Spot assays are representative images of three replicates. E The K. lactis strains were grown in liquid SC + MSG + G418 medium with or without 0.5 mM TBOOH in a 96-well plate format. The ROS resistance of these strains was compared using the EAUC in 0.5 mM TBOOH relative to no-TBOOH controls. The points in the boxplots represent biological replicates (n = 3 for pIL75, n = 6 for KYE1 and n = 3 for GFP), and the p-values are based on two-sided t-tests relative to the GFP control.
For all boxplots in this figure, the center line represents the median, the bounds of the boxes represent the interquartile range, and the whiskers represent the spread of the data. Source data are provided as a Source Data file.
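
The relative growth measure used in panels C and E (EAUC with TBOOH relative to the no-TBOOH control) can be illustrated with a minimal sketch. The caption does not say how the area was integrated, so the trapezoidal rule here is an assumption, and the function names `empirical_auc` and `relative_eauc` and all readings are hypothetical illustrations, not values from the Source Data file.

```python
from typing import Sequence


def empirical_auc(times: Sequence[float], od: Sequence[float]) -> float:
    """Area under a growth curve, integrated with the trapezoidal rule.

    The trapezoidal rule is an assumed choice; the caption does not
    specify the integration method.
    """
    if len(times) != len(od) or len(times) < 2:
        raise ValueError("need at least two matching time/OD points")
    area = 0.0
    for i in range(1, len(times)):
        width = times[i] - times[i - 1]
        area += width * (od[i] + od[i - 1]) / 2.0
    return area


def relative_eauc(
    times: Sequence[float],
    od_treated: Sequence[float],
    od_control: Sequence[float],
) -> float:
    """EAUC of the treated culture divided by EAUC of its untreated control."""
    control_area = empirical_auc(times, od_control)
    if control_area <= 0:
        raise ValueError("control culture shows no growth")
    return empirical_auc(times, od_treated) / control_area


# Made-up optical density readings, for illustration only.
hours = [0.0, 2.0, 4.0, 6.0]
od_control = [0.0, 0.5, 1.0, 1.0]    # hypothetical no-TBOOH culture
od_treated = [0.0, 0.25, 0.5, 0.5]   # hypothetical culture with TBOOH

ratio = relative_eauc(hours, od_treated, od_control)
print(ratio)
```

Under this definition, a ratio near 1 means the stressor barely affected growth, and a ratio near 0 means growth was almost fully inhibited.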