Abstract
Phenotypically agnostic screens for positive selection in pathogen populations provide a means of pinpointing genes and regulatory regions involved in adaptation to the local environment or host population. We screened a large (n = 2506) collection of Vietnamese Mycobacterium tuberculosis (Mtb) isolates, finding targets of selection to be lineage-specific, and encompass diverse functions, including dormancy (Rv0080), zinc homeostasis (zur), and virulence (ESX-1 structure). Extending our screen to the wider Mtb complex (MTBC) phylogeny demonstrated Rv0080 to display an extraordinarily dynamic evolutionary history, acquiring premature stop codons or putative functional mutations on branches upstream of 8 of the 10 human-adapted lineages, and undergoing positive selection in the remaining 2. Lineage 1, which is one of two such lineages retaining the ancestral Rv0080 sequence, displays a rate of selection for this gene (dN/dS=9.37) exceeding any other in the Mtb genome, save a transcription factor linked to its expression (Rv0042c; dN/dS=11.02). Deletion of Rv0080’s M. smegmatis orthologue confers a survival advantage in hypoxic conditions, as does the evolution of nonsense or missense mutations on an ancestral Rv0080 background. We show the dormancy survival regulon experienced recurrent episodes of reductive evolution across the MTBC phylogeny, illuminating a novel mechanism via which it adapted to human populations.
Similar content being viewed by others
Data availability
All data analysed in this manuscript were drawn from previously published studies and are publicly available. All relevant sources from which data were obtained are referenced in the text.
Code availability
Code for performing the Rosetta DDG prediction is available on GitHub (https://github.com/jlingford/ddg_rosetta).
References
Aguileta, G., Refrégier, G., Yockteng, R., Fournier, E. & Giraud, T. Rapidly evolving genes in pathogens: Methods for detecting positive selection and examples among fungi, bacteria, viruses and protists. Infect. Genet. Evol. 9, 656–670 (2009).
Xu, Z. et al. Positive selection signals of hepatitis B virus and their association with disease stages and viral genotypes. Infect. Genet. Evol. 19, 176–187 (2013).
Chiner-Oms, A., Lopez, M. A., Moreno-Molina, M., Furió, V. & Comas, I. Gene evolutionary trajectories in Mycobacterium tuberculosis reveal temporal signs of selection. Proc. Natl. Acad. Sci. USA 119, e2113600119 (2022).
Liu, Q. et al. Tuberculosis treatment failure associated with evolution of antibiotic resilience. Science 378, 1111–1118 (2022).
Chiner et al. Genomic determinants of speciation and spread of the Mycobacterium tuberculosis complex. Sci. Adv. 5, eaaw3307 (2019).
Vargas, R. et al. Phase variation as a major mechanism of adaptation in Mycobacterium tuberculosis complex. Proc. Natl. Acad. Sci. 120, e2301394120 (2023).
Wilson, D. J. et al. GenomegaMap: within-species genome-wide dN/dS estimation from over 10,000 genomes. Mol. Biol. Evol. 37, 2450–2460 (2020).
Liu, Q. et al. Local adaptation of Mycobacterium tuberculosis on the Tibetan Plateau. Proc. Natl. Acad. Sci. USA 118, e2017831118 (2021).
Tantivitayakul, P. et al. Homoplastic single nucleotide polymorphisms contributed to phenotypic diversity in Mycobacterium tuberculosis. Sci. Rep. 10, 8024 (2020).
Grandjean, L. et al. Convergent evolution and topologically disruptive polymorphisms among multidrug-resistant tuberculosis in Peru. PLOS ONE 12, e0189838 (2017).
Zwyer, M. et al. Back-to-Africa introductions of Mycobacterium tuberculosis as the main cause of tuberculosis in Dar es Salaam, Tanzania. PLOS Pathog. 19, e1010893 (2023).
Koleske, B. N., Jacobs, W. R. & Bishai, W. R. The Mycobacterium tuberculosis genome at 25 years: lessons and lingering questions. J. Clin. Investig. 133, e173156 (2023).
Gröschel, M. I. et al. Differential rates of Mycobacterium tuberculosis transmission associate with host–pathogen sympatry. Nat. Microbiol. 9, 2113–2127 (2024).
Guyeux, C. et al. Newly Identified Mycobacterium africanum Lineage 10, Central Africa. Emerg. Infect. Dis. 30, 560–563 (2024).
Silcocks, M. & Dunstan, S. J. Parallel signatures of Mycobacterium tuberculosis and human Y-chromosome phylogeography support the Two Layer model of East Asian population history. Commun. Biol. 6, 1037 (2023).
Matsumura, H. et al. Craniometrics reveal “Two Layers” of prehistoric human dispersal in eastern eurasia. Sci. Rep. 9, 1451 (2019).
Holt, K. E. et al. Frequent transmission of the Mycobacterium tuberculosis Beijing lineage and positive selection for the EsxW Beijing variant in Vietnam. Nat. Genet. 50, 849–856 (2018).
Coscolla, M. & Gagneux, S. Consequences of genomic diversity in Mycobacterium tuberculosis. Semin. Immunol. 26, 431–444 (2014).
Gagneux, S. Ecology and evolution of Mycobacterium tuberculosis. Nat. Rev. Microbiol. 16, 202–213 (2018).
Marin, M. et al. Benchmarking the empirical accuracy of short-read sequencing across the M. tuberculosis genome. Bioinformatics 38, 1781–1787 (2022).
Sagulenko, P., Puller, V. & Neher, R. A. TreeTime: maximum-likelihood phylodynamic analysis. Virus Evol. 4, vex042 (2018).
Edwards, D. J., Duchene, S., Pope, B. & Holt, K. E. SNPPar: identifying convergent evolution and other homoplasies from microbial whole-genome alignments. Microb. Genomics 7, 000694 (2021).
Kapopoulou, A., Lew, J. M. & Cole, S. T. The MycoBrowser portal: a comprehensive and manually annotated resource for mycobacterial genomes. Tuberculosis 91, 8–13 (2011).
Brynildsrud, O. B. et al. Global expansion of Mycobacterium tuberculosis lineage 4 shaped by colonial migration and local adaptation. Sci. Adv. 4, eaat5869 (2018).
Hicks, N. D. et al. Clinically prevalent mutations in Mycobacterium tuberculosis alter propionate metabolism and mediate multidrug tolerance. Nat. Microbiol. 3, 1032–1042 (2018).
Hicks, N. D. et al. Mutations in dnaA and a cryptic interaction site increase drug resistance in Mycobacterium tuberculosis. PLOS Pathog. 16, e1009063–e1009063 (2020).
Chen, T., He, L., Deng, W. & Xie, J. The Mycobacterium DosR regulon structure and diversity revealed by comparative genomic analysis. J. Cell. Biochem. 114, 1–6 (2013).
Galagan, J. E. et al. The Mycobacterium tuberculosis regulatory network and hypoxia. Nature 499, 178–183 (2013).
Kumar, A. et al. Mycobacterium tuberculosis DosR regulon gene Rv0079 encodes a putative, ‘Dormancy Associated Translation Inhibitor (DATIN). PLoS ONE 7, e38709 (2012).
He, H., Bretl, D. J., Penoske, R. M., Anderson, D. M. & Zahrt, T. C. (2011). Components of the Rv0081-Rv0088 locus, which encodes a predicted formate hydrogenlyase complex, are coregulated by Rv0081, MprA, and DosR in Mycobacterium tuberculosis. J. Bacteriol. 193, 5105–5118 (2011).
Selvaraj, S., Sambandam, V., Sardar, D. & Anishetty, S. In silico analysis of DosR regulon proteins of Mycobacterium tuberculosis. Gene 506, 233–241 (2012).
Hershberg, R. et al. High functional diversity in Mycobacterium tuberculosis driven by genetic drift and human demography. PLoS Biol. 6, e311 (2008).
Ahmed, F. H. et al. Sequence–structure–function classification of a catalytically diverse oxidoreductase superfamily in Mycobacteria. J. Mol. Biol. 427, 3554–3571 (2015).
Harold, L. K. et al. FAD-sequestering proteins protect mycobacteria against hypoxic and oxidative stress. J. Biol. Chem. 294, 2903–5814 (2019).
Leman, J. K. et al. Macromolecular modeling and design in Rosetta: recent methods and frameworks. Nat. Methods 17, 665–680 (2020).
Sora, V. et al. RosettaDDGPrediction for high-throughput mutational scans: from stability to binding. Protein Sci. 32, e4527 (2023).
Rustad, T. R. et al. Mapping and manipulating the Mycobacterium tuberculosis transcriptome using a transcription factor overexpression-derived regulatory network. Genome Biol. 15, 502 (2014).
Berney, M., Greening, C., Conrad, R., Jacobs, W. R. & Cook, G. M. An obligately aerobic soil bacterium activates fermentative hydrogen production to survive reductive stress during hypoxia. Proc. Natl. Acad. Sci. USA 111, 11479–11484 (2014).
Supply, P. et al. Genomic analysis of smooth tubercle bacilli provides insights into ancestry and pathoadaptation of Mycobacterium tuberculosis. Nat. Genet. 45, 172–179 (2013).
Riccardi, G., Milano, A., Pasca, M. R. & Nies, D. H. Genomic analysis of zinc homeostasis in Mycobacterium tuberculosis. FEMS Microbiol. Lett. 287, 1–7 (2008).
Maciag, A. et al. Global analysis of the Mycobacterium tuberculosis Zur (FurB) regulon. J. Bacteriol. 189, 730–740 (2007).
Goethe, E. et al. Critical role of Zur and SmtB in zinc homeostasis of Mycobacterium smegmatis. MSystems 5, e00880-19 (2020).
Mitra, R. et al. Geometric deep learning of protein–DNA binding specificity. Nat. Methods 21, 1674–1683 (2024).
Gilston, B. A. et al. Structural and mechanistic basis of zinc regulation across the E. coli Zur regulon. PLoS Biol. 12, e1001987 (2014).
Sonnenkalb, L. et al. Bedaquiline and clofazimine resistance in Mycobacterium tuberculosis: an in-vitro and in-silico data analysis. Lancet Microbe 4, e358–e368 (2023).
Culviner, P. H. et al. Evolution of Mycobacterium tuberculosis transcription regulation is associated with increased transmission and drug resistance. Cell 188, 6620–6635.e14 (2025).
Walker, T. M. et al. The 2021 WHO catalogue of Mycobacterium tuberculosis complex mutations associated with drug resistance: a genotypic analysis. Lancet Microbe 3, e265–e273 (2022).
Neyrolles, O., Wolschendorf, F., Mitra, A. & Niederweis, M. Mycobacteria, metals, and the macrophage. Immunol. Rev. 264, 249–263 (2015).
Menardo, F. et al. Local adaptation in populations of Mycobacterium tuberculosis endemic to the Indian Ocean Rim. F1000Research 10, 60 (2021).
Ford, C. B. et al. Use of whole genome sequencing to estimate the mutation rate of Mycobacterium tuberculosis during latent infection. Nat. Genet. 43, 482–486 (2011).
Lillebaek, T. et al. Substantial molecular evolution and mutation rates in prolonged latent Mycobacterium tuberculosis infection in humans. Int. J. Med. Microbiol. 306, 580–585 (2016).
Lin, P. L. et al. Quantitative comparison of active and latent tuberculosis in the cynomolgus macaque model. Infect. Immun. 77, 4631–4642 (2009).
Colangeli, R. et al. Whole genome sequencing of Mycobacterium tuberculosis reveals slow growth and low mutation rates during latent infections in humans. PLoS ONE 9, e91024 (2014).
Colangeli, R. et al. Mycobacterium tuberculosis progresses through two phases of latent infection in humans. Nat. Commun. 11, 4870 (2020).
Silcocks, M. et al. Evolution and transmission of antibiotic resistance is driven by Beijing lineage Mycobacterium tuberculosis in Vietnam. Microbiol. Spectr. 11, e0256223 (2023).
Martin, M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet. J. 17, 10 (2011).
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
McKenna, A. et al. The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 20, 1297–1303 (2010).
Walker, B. J. et al. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS ONE 9, e112963 (2014).
Sobkowiak, B. et al. Identifying mixed Mycobacterium tuberculosis infections from whole genome sequence data. BMC Genomics 19, 613 (2018).
Li, H. A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics 27, 2987–2993 (2011).
Freschi, L. et al. Population structure, biogeography and transmissibility of Mycobacterium tuberculosis. Nat. Commun. 12, 6099 (2021).
Stamatakis, A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30, 1312–1313 (2014).
Comas, I. et al. Human T cell epitopes of Mycobacterium tuberculosis are evolutionarily hyperconserved. Nat. Genet. 42, 498–503 (2010).
Sayers, E. W. et al. Database resources of the national center for biotechnology information. Nucleic Acids Res. 50, D20–D26 (2022).
Sievers, F. et al. Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol. Syst. Biol. 7, 539 (2011).
Camacho, C. et al. BLAST+: architecture and applications. BMC Bioinforma. 10, 421 (2009).
Edokimov, K. et al. Whole-genome sequencing of Mycobacterium tuberculosis from Cambodia. Sci. Rep. 12, 7693 (2022).
Ajawatanawong, P. et al. A novel Ancestral Beijing sublineage of Mycobacterium tuberculosis suggests the transition site to Modern Beijing sublineages. Sci. Rep. 9, 13718 (2019).
Phelan, J. E. et al. Mycobacterium tuberculosis whole genome sequencing provides insights into the Manila strain and drug-resistance mutations in the Philippines. Sci. Rep. 9, 9305 (2019).
Bainomugisa, A. et al. Genomic epidemiology of tuberculosis in eastern Malaysia: insights for strengthening public health responses. Microb. Genomics 7, 000573 (2021).
Hang, N. T. L. et al. Whole genome sequencing, analyses of drug resistance-conferring mutations, and correlation with transmission of Mycobacterium tuberculosis carrying katG-S315T in Hanoi, Vietnam. Sci. Rep. 9, 15354 (2019).
Munir, A. et al. Identification and characterization of genetic determinants of isoniazid and rifampicin resistance in Mycobacterium tuberculosis in Southern India. Sci. Rep. 9, 10283 (2019).
Coscolla, M. et al. Phylogenomics of Mycobacterium africanum reveals a new lineage and a complex evolutionary history. Microb. Genomics 7, 000477 (2021).
Brites, D. et al. A new phylogenetic framework for the animal-adapted Mycobacterium tuberculosis complex. Front. Microbiol. 9, 2820 (2018).
Wang, Q. et al. PE/PPE proteins mediate nutrient transport across the outer membrane of Mycobacterium tuberculosis. Science 367, 1147–1151 (2020).
McNeil, M. B., Ryburn, H. W., Tirados, J., Cheung, C.-Y. & Cook, G. M. Multiplexed transcriptional repression identifies a network of bactericidal interactions between mycobacterial respiratory complexes. IScience 25, 103573–103573 (2021).
Jumper, J. et al. Highly accurate protein structure prediction with AlphaFold. Nature 596, 583–589 (2021).
Evans, R. et al. Protein complex prediction with AlphaFold-Multimer. Preprint at https://www.biorxiv.org/content/10.1101/2021.10.04.463034v2 (2021).
Mirdita, M. et al. ColabFold: making protein folding accessible to all. Nat. Methods 19, 679–682 (2022).
Abramson, J. et al. Accurate structure prediction of biomolecular interactions with AlphaFold 3. Nature 630, 493–500 (2024).
Pettersen, E. F. et al. UCSF ChimeraX: structure visualization for researchers, educators, and developers. Protein Sci. 30, 70–82 (2021).
DePristo, M. A., Weinreich, D. M. & Hartl, D. L. Missense meanderings in sequence space: a biophysical view of protein evolution. Nat. Rev. Genet. 6, 678–687 (2005).
Li, B., Yang, Y. T., Capra, J. A. & Gerstein, M. B. Predicting changes in protein thermodynamic stability upon point mutation with deep 3D convolutional neural networks. PLOS Comput. Biol. 16, e1008291 (2020).
Yang, Z. PAML 4: phylogenetic analysis by maximum likelihood. Mol. Biol. Evol. 24, 1586–1591 (2007).
R Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org/ (2021).
Acknowledgements
The authors acknowledge support from the National Health and Medical Research Council, Australia APP1172853 (S.J.D.), the US National Institutes of Health U19AI162583 (S.J.D.), and the Australian Research Council DE250101214 (M.S.).
Author information
Authors and Affiliations
Contributions
M.S. and S.J.D. conceived the initial genomic study, with J.P.L., C.G., C.A.M., J.S.C. and M.B.M. playing conceptual roles in the structural and functional analyses that followed. M.S. completed the genomic and bioinformatic analyses, under the supervision of S.J.D., and with contributions from D.E. and K.E.H. relating to the use of SNPPar software. J.P.L. completed the protein structure prediction analysis for Rv0080/Fsq under the supervision of C.G., and C.A.M. completed the protein structure prediction of zur, with contributions from S.L.N. and M.J.M. Mycobacterial functional experiments were completed by C.Y.C. W.J.J. R.M.M. C.D.R. and L.K.H. under the supervision of M.B.M., J.S.C., and G.M.C. Whole genome sequencing data from Vietnamese patients was generated and contributed by P.V.K.T., K.E.H., N.T.T.T., M.C. and S.J.D. Assistance with bioinformatic or statistical aspects of this analysis was provided by E.P.T., A.C.B., X.C., N.S.B. and T.R.H. The initial draft of this manuscript was prepared by M.S., with relevant sections contributed by J.P.L, C.A.M and M.B.M. The final draft of the manuscript was edited and approved by all authors.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks Thierry Wirth, and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Silcocks, M., Lingford, J.P., Cheung, CY. et al. Dormancy regulon reduction was pivotal to the evolution of Mycobacterium tuberculosis. Nat Commun (2026). https://doi.org/10.1038/s41467-026-71566-x
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-026-71566-x


