Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Article
  • Published:

Diversity-generating retroelements for programmable targeted hypermutagenesis

Abstract

Diversity-generating retroelements (DGRs) are natural systems that accelerate the evolution of diverse bacterial functions through targeted hypermutation. We establish a method using DGRs coupled to recombineering (DGRec), which enables the diversification of any sequence of interest in Escherichia coli. Detailed characterization of reverse transcriptase sequence biases demonstrates how it maximizes the exploration of the sequence space while avoiding nonsense mutations. By leveraging the high error rate of the DGR reverse transcriptase at adenines, DGRec can efficiently diversify user-defined sequence windows of 50–200 bp. Mutations can be focused at specific positions, with rates reaching up to 1.38 × 10−2 per base per generation, allowing up to 24 mutations to accumulate within a single target sequence after 48 h. We apply DGRec to phage λ host-range engineering, to the evolution of dCas9 variants and to accelerated evolution of specific nanobodies through a bacterial display setup. Lastly, we establish the feasibility of DGR-mediated mutagenesis in yeast by adapting a recombination and selection strategy previously developed for retrons.

This is a preview of subscription content, access via your institution

Access options

Buy this article

USD 39.95

Prices may be subject to local taxes which are calculated during checkout

Fig. 1: DGRec targeted hypermutagenesis in E. coli.
The alternative text for this image may have been generated using AI.
Fig. 2: Impact of the close context on bRT mutation rates and mutation biases.
The alternative text for this image may have been generated using AI.
Fig. 3: Modeling of DGRec mutagenesis efficiency based on dgrRNA secondary structures.
The alternative text for this image may have been generated using AI.
Fig. 4: DGRec reproduction of a phage–bacterial receptor arms race dynamics.
The alternative text for this image may have been generated using AI.
Fig. 5: DGRec evolution of dCas9 and nanobodies.
The alternative text for this image may have been generated using AI.
Fig. 6: Proof-of-concept implementation of DGR-mediated mutagenesis in yeast.
The alternative text for this image may have been generated using AI.

Similar content being viewed by others

Data availability

Illumina sequencing data were deposited to the Sequence Read Archive under Bioproject PRJNA1140560.

Code availability

Software used to analyze the data is available as a Python package (https://github.com/dbikard/dgrec). Jupyter notebooks showing the analyses of the manuscript are also available (https://gitlab.pasteur.fr/dbikard/dgrec_analysis).

References

  1. Packer, M. S. & Liu, D. R. Methods for the directed evolution of proteins. Nat. Rev. Genet. 16, 379–394 (2015).

    Article  CAS  PubMed  Google Scholar 

  2. Lynch, M. Evolution of the mutation rate. Trends Genet. 26, 345–352 (2010).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  3. Morrison, M. S., Podracky, C. J. & Liu, D. R. The developing toolkit of continuous directed evolution. Nat. Chem. Biol. 16, 610–619 (2020).

    Article  CAS  PubMed  Google Scholar 

  4. Molina, R. S. et al. In vivo hypermutation and continuous evolution. Nat. Rev. Methods Primers 2, 36 (2022).

    Article  CAS  Google Scholar 

  5. Ravikumar, A., Arzumanyan, G. A., Obadi, M. K. A., Javanpour, A. A. & Liu, C. C. Scalable, continuous evolution of genes at mutation rates above genomic error thresholds. Cell 175, 1946–1957 (2018).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  6. Tian, R. et al. Engineered bacterial orthogonal DNA replication system for continuous evolution. Nat. Chem. Biol. 19, 1504–1512 (2023).

    Article  CAS  PubMed  Google Scholar 

  7. Tian, R. et al. Establishing a synthetic orthogonal replication system enables accelerated evolution in E. coli. Science 383, 421–426 (2024).

    Article  CAS  PubMed  Google Scholar 

  8. Halperin, S. O. et al. CRISPR-guided DNA polymerases enable diversification of all nucleotides in a tunable window. Nature 560, 248–252 (2018).

    Article  CAS  PubMed  Google Scholar 

  9. Hurtado, J. E. et al. Nickase fidelity drives EvolvR-mediated diversification in mammalian cells. Nat. Commun. 16, 3723 (2025).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  10. Moore, C. L., Papa, L. J. III & Shoulders, M. D. A processive protein chimera introduces mutations across defined DNA regions in vivo. J. Am. Chem. Soc. 140, 11560–11564 (2018).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  11. Janeway, C. A., Jr, Travers, P., Walport, M. & Shlomchik, M. J. The Generation of Diversity in Immunoglobulins 5th edn (Garland Science, 2001).

  12. Liu, M. et al. Reverse transcriptase-mediated tropism switching in Bordetella bacteriophage. Science 295, 2091–2094 (2002).

    Article  CAS  PubMed  Google Scholar 

  13. Doulatov, S. et al. Tropism switching in Bordetella bacteriophage defines a family of diversity-generating retroelements. Nature 431, 476–481 (2004).

    Article  CAS  PubMed  Google Scholar 

  14. Paul, B. G. et al. Retroelement-guided protein diversification abounds in vast lineages of Bacteria and Archaea. Nat. Microbiol. 2, 1–7 (2017).

    Article  Google Scholar 

  15. Roux, S. et al. Ecology and molecular targets of hypermutation in the global microbiome. Nat. Commun. 12, 3076 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  16. Macadangdang, B. R. et al. Targeted protein evolution in the gut microbiome by diversity-generating retroelements. Science 390, eadv2111 (2025).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. Wu, L. et al. Diversity-generating retroelements: natural variation, classification and evolution inferred from a large-scale genomic survey. Nucleic Acids Res. 46, 11–24 (2018).

    Article  PubMed  PubMed Central  Google Scholar 

  18. Dai, W. et al. Three-dimensional structure of tropism-switching Bordetella bacteriophage. Proc. Natl Acad. Sci. USA 107, 4347–4352 (2010).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  19. Guo, H. et al. Target site recognition by a diversity-generating retroelement. PLoS Genet. 7, e1002414 (2011).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  20. Handa, S. et al. Template-assisted synthesis of adenine-mutagenized cDNA by a retroelement protein complex. Nucleic Acids Res. 46, 9711–9725 (2018).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  21. McMahon, S. A. et al. The C-type lectin fold as an evolutionary solution for massive sequence variation. Nat. Struct. Mol. Biol. 12, 886–892 (2005).

    Article  CAS  PubMed  Google Scholar 

  22. Handa, S. et al. RNA control of reverse transcription in a diversity-generating retroelement. Nature 638, 1122–1129 (2025).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  23. Macadangdang, B. R., Makanani, S. K. & Miller, J. F. Accelerated Evolution by Diversity-Generating Retroelements. Annu Rev Microbiol. 76, 389–411 (2022).

    Article  CAS  PubMed  Google Scholar 

  24. Handa, S., Paul, B. G., Miller, J. F., Valentine, D. L. & Ghosh, P. Conservation of the C-type lectin fold for accommodating massive sequence variation in archaeal diversity-generating retroelements. BMC Struct. Biol. 16, 13 (2016).

    Article  PubMed  PubMed Central  Google Scholar 

  25. Farzadfard, F. & Lu, T. K. Genomically encoded analog memory with precise in vivo DNA writing in living cell populations. Science 346, 1256272 (2014).

    Article  PubMed  PubMed Central  Google Scholar 

  26. Simon, A. J., Morrow, B. R. & Ellington, A. D. Retroelement-based genome editing and evolution. ACS Synth. Biol. 7, 2600–2611 (2018).

    Article  CAS  PubMed  Google Scholar 

  27. Schubert, M. G. et al. High-throughput functional variant screens via in vivo production of single-stranded DNA. Proc. Natl Acad. Sci. USA 118, e2018181118 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  28. González-Delgado, A., Lopez, S. C., Rojas-Montero, M., Fishman, C. B. & Shipman, S. L. Simultaneous multi-site editing of individual genomes using retron arrays. Nat Chem Biol. 20, 1482–1492 (2024).

    Article  PubMed  PubMed Central  Google Scholar 

  29. Fishman, C. B. et al. Continuous multiplexed phage genome editing using recombitrons. Nat Biotechnol. 43, 1299–1310 (2025).

    Article  CAS  PubMed  Google Scholar 

  30. Lopez, S. C., Crawford, K. D., Lear, S. K., Bhattarai-Kline, S. & Shipman, S. L. Precise genome editing across kingdoms of life using retron-derived DNA. Nat Chem Biol. 18, 199–206 (2022).

    Article  CAS  PubMed  Google Scholar 

  31. Wannier, T. M. et al. Improved bacterial recombineering by parallelized protein discovery. Proc. Natl Acad. Sci. USA 117, 13689–13698 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  32. Naorem, S. S. et al. DGR mutagenic transposition occurs via hypermutagenic reverse transcription primed by nicked template RNA. Proc. Natl Acad. Sci. USA 114, E10187–E10195 (2017).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  33. Kou, R. et al. Benefits and challenges with applying unique molecular identifiers in next generation sequencing to detect low frequency mutations. PLoS One 11, e0146638 (2016).

    Article  PubMed  PubMed Central  Google Scholar 

  34. Handa, S., Reyna, A., Wiryaman, T. & Ghosh, P. Determinants of adenine-mutagenesis in diversity-generating retroelements. Nucleic Acids Res. 49, 1033–1045 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  35. Wannier, T. M. et al. Recombineering and MAGE. Nat. Rev. Methods Primers 1, 7 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  36. Fishman, C. B. et al. Continuous multiplexed phage genome editing using recombitrons. Nat. Biotechnol. 43, 1299–1310 (2025).

    Article  CAS  PubMed  Google Scholar 

  37. Lorenz, R. et al. ViennaRNA Package 2.0. Algorithms Mol. Biol. 6, 26 (2011).

    Article  PubMed  PubMed Central  Google Scholar 

  38. Brockhurst, M. A., Koskella, B. & Zhang, Q. G. Bacteria–phage antagonistic coevolution and the implications for phage therapy. In Bacteriophages (eds Harper, D. R., Abedon, S. T., Burrowes, B. H. & McConville, M.L.) (Springer, 2017).

  39. Borin, J. M. et al. Rapid bacteria–phage coevolution drives the emergence of multiscale networks. Science 382, 674–678 (2023).

    Article  CAS  PubMed  Google Scholar 

  40. Meyer, J. R. et al. Repeatability and contingency in the evolution of a key innovation in phage lambda. Science 335, 428–432 (2012).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  41. Andrews, B. & Fields, S. Balance between promiscuity and specificity in phage λ host range. ISME J. 15, 2195–2205 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  42. Ge, X. & Wang, J. Structural mechanism of bacteriophage lambda tail’s interaction with the bacterial receptor. Nat. Commun. 15, 4185 (2024).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  43. Salema, V. et al. Selection of single domain antibodies from immune libraries displayed on the surface of E. coli cells with two β-domains of opposite topologies. PLoS ONE 8, e75126 (2013).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  44. Salema, V. & Fernández, L. Á Escherichia coli surface display for the selection of nanobodies. Microb. Biotechnol. 10, 1468–1484 (2017).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  45. Casasnovas, J. M. et al. Nanobodies protecting from lethal SARS-CoV-2 infection target receptor binding epitopes preserved in virus variants other than Omicron. Front. Immunol. 13, 863831 (2022).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  46. Lopez, S. C., Crawford, K. D., Lear, S. K., Bhattarai-Kline, S. & Shipman, S. L. Precise genome editing across kingdoms of life using retron-derived DNA. Nat. Chem. Biol. 18, 199–206 (2022).

    Article  CAS  PubMed  Google Scholar 

  47. González-Delgado, A., Lopez, S. C., Rojas-Montero, M., Fishman, C. B. & Shipman, S. L. Simultaneous multi-site editing of individual genomes using retron arrays. Nat. Chem. Biol. 20, 1482–1492 (2024).

    Article  PubMed  PubMed Central  Google Scholar 

  48. Ni, Y. et al. Reducing competition between msd and genomic DNA improves retron editing efficiency. EMBO Rep. 25, 5316–5330 (2024).

    Article  PubMed  PubMed Central  Google Scholar 

  49. Esvelt, K. M., Carlson, J. C. & Liu, D. R. A system for the continuous directed evolution of biomolecules. Nature 472, 499–503 (2011).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  50. Miller, S. M., Wang, T. & Liu, D. R. Phage-assisted continuous and non-continuous evolution. Nat. Protoc. 15, 4101–4127 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  51. Filsinger, G. T. et al. A diverse single-stranded DNA-annealing protein library enables efficient genome editing across bacterial phyla. Proc. Natl Acad. Sci. USA 122, e2414342122 (2025).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  52. St-Pierre, F. et al. One-step cloning and chromosomal integration of DNA. ACS Synth. Biol. 2, 537–541 (2013).

    Article  CAS  PubMed  Google Scholar 

  53. Thomason, L. C., Costantino, N. & Court, D. L. E. coli genome manipulation by P1 transduction. Curr. Protoc. Mol. Biol. Chapter 1, 1.17.1–1.17.8 (2007).

    PubMed  Google Scholar 

  54. Gibson, D. G. et al. Enzymatic assembly of DNA molecules up to several hundred kilobases. Nat. Methods 6, 343–345 (2009).

    Article  CAS  PubMed  Google Scholar 

  55. Engler, C., Gruetzner, R., Kandzia, R. & Marillonnet, S. Golden Gate shuffling: a one-pot DNA shuffling method based on type IIs restriction enzymes. PLoS One 4, e5553 (2009).

    Article  PubMed  PubMed Central  Google Scholar 

  56. Hartley, J. L., Temple, G. F. & Brasch, M. A. DNA cloning using in vitro site-specific recombination. Genome Res. 10, 1788–1795 (2000).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  57. Jakočiūnė, D. & Moodley, A. A rapid bacteriophage DNA extraction method. Methods Protoc. 1, 27 (2018).

    Article  PubMed  PubMed Central  Google Scholar 

  58. Anzalone, A. V. et al. Programmable deletion, replacement, integration and inversion of large DNA sequences with twin prime editing. Nat. Biotechnol. 40, 731–740 (2022).

    Article  CAS  PubMed  Google Scholar 

  59. Clement, K., Farouni, R., Bauer, D. E. & Pinello, L. AmpUMI: design and analysis of unique molecular identifiers for deep amplicon sequencing. Bioinformatics 34, i202–i210 (2018).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  60. Clement, K. et al. CRISPResso2 provides accurate and rapid genome editing sequence analysis. Nat. Biotechnol. 37, 224–226 (2019).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

We thank L. Cerdan and L.-A. Fernandez for providing VHH-1.29 and the pNeae vector, H. Mouquet, C. Planchais and P. Rosenbaum for providing the SARS-CoV-2 RBDs and expertise on antibody engineering, T. Rose for guidance on the immunotube panning experiments, P. Lafaye and G. Ayme for their expertise on nanobody selection, P. England for the BLI experiments and S. Volant and E. Jacquemet for the help in setting up a TR functionality model. This study was funded by the European Research Council (101044479), Agence Nationale de la Recherche (ANR-10-LABX-62-IBEID) and Ecole Doctorale Complexité du Vivant, Sorbonne Université (Contrat doctoral 4481/2022) to P.R, the Ecole Doctorale Frontières de l’Innovation en Recherche et Education funded by the Bettencourt Schueller foundation and the Ecole Universitaire de Recherche Interdisciplinaire de Paris graduate program (ANR-17-EURE-0012) and Fondation pour la Recherche Médicale (ECO202206015569) to E.L.R. and the National Science Foundation (2509382) to S.L.S.

Author information

Authors and Affiliations

Authors

Contributions

R.L., conceptualization, methodology, investigation, writing, visualization and supervision. P.R., conceptualization, methodology, investigation, writing, software and visualization. E.L.R., conceptualization, methodology, investigation, writing and visualization. D.W., methodology and investigation. L.R., methodology, investigation and software. C.F., methodology and investigation. A.M., methodology and investigation. W.R., methodology and investigation. L.W., methodology and investigation. I.N., methodology and investigation. P.V., investigation. N.B., methodology and investigation. K.M.C.M., investigation. A.B., investigation. T.C., investigation. O.S., methodology and investigation. N.W., methodology and supervision. L.R., methodology and investigation. R.M., methodology, supervision and project administration. S.C., methodology and supervision. S.L.S., methodology and supervision. D.B., conceptualization, methodology, writing, software, visualization, supervision, project administration and funding acquisition.

Corresponding authors

Correspondence to Raphael Laurenceau or David Bikard.

Ethics declarations

Competing interests

The following patent applications related to this work have been filed by Institut Pasteur with D.B., R.L. and W.R. as inventors: EP4294922A1 and WO2024038003A1. The remaining authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 Mutation rate per base per generation for the different nucleotides.

Dot plot representing mutation rate per base per generation of each base type inside and outside the targeted region (VR), with an active or a catalytically dead bRT. Statistical significance was calculated by a two-sided student test with unequal variance (ns = p > 0.05; * = p < 0.05; ** = p < 0.01). A, T, G, C bases are respectively n = 9, 25, 18, 15 inside the VR, and n = 13, 14, 8, 11 outside the VR, points are shown only when at least one mutation is observed at the position.

Extended Data Fig. 2 Amplicon sequencing characterization of DGRec components.

Amplicon sequencing of the sacB chromosomal gene locus targeted by the TR-AM009, using the UMI correction procedure as in Fig. 1. The plot area was restricted to the targeted region (VR). All tested combinations are shown with 2 biological replicates. All panels have the TR-AM009 expressed from the pRL021 DGRec backbone, except for ΔCspRecT and MutL* where it is expressed from the pRL038 backbone. Yellow ticks on the X-axis indicate adenine positions from the TR.

Extended Data Fig. 3 bRT variants with altered error rates.

A) DGRec mutagenesis on sacB performed by pRL021_TR-AM009 accompanied by pRL014 (wild type RT), pRL037 (bRT with I181N mutation) or pRL036 (bRT with R74A mutation). Mutagenesis was below the detection level for the R74A variant and was significantly reduced for the I181N variant. B) Closer view on wild type versus I181N variant mutagenesis profile, showing how the bias in the incorporation of nucleotides at adenines is altered.

Extended Data Fig. 4 Position of the DGR RNA, and dual targeting within cells.

A) Plasmid maps of pRL038 and pRL021, two compatible DGRec plasmids containing each a dgrRNA locus. CmR: chloramphenicol resistance gene; KanR: kanamycin resistance gene. Plasmid maps created with BioRender.com. B) Biological duplicates of cells expressing the DGR RNA with TR-AM009 either on the pRL038 or the pRL021 backbone. C) Dual targeting with a distinct DGR RNA placed on the two plasmids in the same cells.

Extended Data Fig. 5 Self-targeting of the DGR RNA in the DGRec system.

A) schematic representation of dgrRNA chromosomal VR targeting (1) versus self-targeting (2). B) Amplicon sequencing after 48 h mutagenesis with TR-AM009, compared at two positions in the same culture: around the targeted VR inside sacB in the chromosome, and around the TR on the DGR RNA inside the plasmid.

Extended Data Fig. 6 Effect of base -5 to base +5 on the bRT biases.

Ternary scatter plots of the rate of A to T mutations (bottom axis), A to C mutations (right axis), A to G mutations (left axis) depending on the base -5 up to +5. Each point represents the barycenter of the distribution for a given base. The contour indicates the convex hull enclosing 75% of the data points (nA=984).

Extended Data Fig. 7 Amino acid mutation table of the bRT.

Frequency at which different amino acids are reached by bRT mutagenesis depending on the starting codon. Red dots highlight the amino acid encoded by the starting codon.

Extended Data Fig. 8 Position of DGRec mutations for TR of different sizes.

TR cloned from fragmented E. coli DNA were assayed in a self-mutagenesis assay. The fraction of DGRec mutants that carry a mutation at each adenine position is reported. The grey box highlights a region for which no sequencing data was obtained.

Extended Data Fig. 9 Distributions of adenine mutations.

A) Example of the genotype with the highest count of adenine mutagenized (24) from the high-throughput library of 378 model-selected TRs. Adenines mutagenized are colored in red, Adenines not mutagenized are colored in green. B) Example of the genotype with the highest percentage of adenine mutagenized (90%) from the high-throughput library of 378 model selected TRs. Adenines mutagenized are colored in red, Adenines not mutagenized are colored in green. Thymine mutagenized are colored in blue. C to F) Respectively, distribution of the maximum count, maximum percentage, average count and average percentage of mutagenized adenines in mutagenized genotypes for a given TR (n = 378 TRs).

Extended Data Fig. 10 Plasmid maps of all the DGRec backbone plasmids used in this study.

CmR: chloramphenicol resistance gene; KanR: kanamycin resistance gene. In the DGRec 2 plasmid systems, pRL014 or pRL038 are used in combination with pRL021, while in the single-plasmid system, pPR150 alone is needed. Figure created with BioRender.com.

Supplementary information

Supplementary Information (download PDF )

Supplementary Methods, Figures 1–22 and Tables 1–6.

Reporting Summary (download PDF )

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Rochette, P., Lopez-Rodriguez, E., Wen, D.J. et al. Diversity-generating retroelements for programmable targeted hypermutagenesis. Nat Biotechnol (2026). https://doi.org/10.1038/s41587-026-03078-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • Version of record:

  • DOI: https://doi.org/10.1038/s41587-026-03078-4

Search

Quick links

Nature Briefing: Translational Research

Sign up for the Nature Briefing: Translational Research newsletter — top stories in biotechnology, drug discovery and pharma.

Get what matters in translational research, free to your inbox weekly. Sign up for Nature Briefing: Translational Research