Abstract
Genome size variation is a fundamental feature of plant genomes and plays an important role in phenotypic diversity, ecological adaptation, and plant evolution across angiosperms. In the Coffea genus (Rubiaceae, 141 species/taxa), significant genome size variations have been observed. There has been nearly a twofold increase between species from East and West Africa and a notable increase from northwest to southeast Madagascar, resulting in geographic gradients. Previous studies suggest a role of Long Terminal Repeat (LTR) retrotransposons in these variations; however, the low resolution of the data to support this hypothesis did not allow for a clear understanding of LTR retrotransposons dynamics within the genus. Here, we present an analysis of the genomes of 22 Coffea species mainly from Africa and Madagascar and their genome size variations within a robust phylogenetic framework. Our results show that genome size and Transposable Elements (TE) landscape are first structured by phylogenetic relationships, reflecting shared evolutionary history and lineage-specific LTR retrotransposon dynamics particularly involving the Tekay/Del, TAT, and SIRE lineages. These lineages contribute to the differentiation of phylogeographic groups, reflecting specific patterns of genomic divergence linked to species adaptation and speciation. We also detected significant association between specific TE families and environmental variables (such as isothermality and annual precipitation). These correlations suggest that environmental factors modulate repeatome evolution and a potential adaptive role of these TEs. These findings highlight the importance of TEs in genome dynamics at the intersection of evolutionary processes and environmental adaptations and open new perspectives on their adaptive role within the Coffea genus.
Data availability
The data used in this study is available with bioproject accession numbers PRJEB100521 at Eu-ropean Nucleotide Archive (ENA, EMBL-EBI) and PRJNA898910, PRJNA242989 at Nation-al Center for Biotechnology Information (NCBI).
References
He, B. et al. Evolution of plant genome size and composition. Genom. Proteom. Bioinform. 22, qzae078 (2024).
Stitzer, M. C., Anderson, S. N., Springer, N. M. & Ross-Ibarra, J. The genomic ecosystem of transposable elements in maize. PLoS Genet. 17, e1009768 (2021).
Ibarra-Laclette, E. et al. Architecture and evolution of a minute plant genome. Nature 498, 94–98 (2013).
Orozco-Arias, S., Isaza, G. & Guyot, R. Retrotransposons in plant genomes: structure, Identification, and classification through bioinformatics and machine learning. IJMS 20, 3837 (2019).
Piegu, B. et al. Doubling genome size without polyploidization: dynamics of retrotransposition-driven genomic expansions in Oryza australiensis, a wild relative of rice. Genome Res. 16, 1262–1269 (2006).
Phillips, A. L. et al. The first long-read nuclear genome assembly of Oryza australiensis, a wild rice from Northern Australia. Sci. Rep. 12, 10823 (2022).
Vicient, C. M. & Casacuberta, J. M. Impact of transposable elements on polyploid plant genomes. Ann. Botany. 120, 195–207 (2017).
Nadir, S. et al. A novel discovery of a long terminal repeat retrotransposon-induced hybrid weakness in rice. J. Exp. Bot. 70, 1197–1207 (2019).
Serrato-Capuchina, A. & Matute, D. The role of transposable elements in speciation. Genes 9, 254 (2018).
Borredá, C., Pérez-Román, E., Ibanez, V., Terol, J. & Talon, M. Reprogramming of retrotransposon activity during speciation of the genus citrus. Genome Biol. Evol. https://doi.org/10.1093/gbe/evz246 (2019).
Zhang, Q. J. & Gao, L. Z. Rapid and recent evolution of LTR retrotransposons drives rice genome evolution during the speciation of AA-genome Oryza species. G3 Genes|Genomes|Genetics. 7, 1875–1885 (2017).
Galindo-González, L., Mhiri, C., Deyholos, M. K. & Grandbastien M.-A. LTR-retrotransposons in plants: engines of evolution. Gene 626, 14–25 (2017).
Casacuberta, E. & González, J. The impact of transposable elements in environmental adaptation. Mol. Ecol. 22, 1503–1517 (2013).
Baduel, P. & Quadrana, L. Jumpstarting evolution: how transposition can facilitate adaptation to rapid environmental changes. Curr. Opin. Plant. Biol. 61, 102043 (2021).
Schrader, L. & Schmitz, J. The impact of transposable elements in adaptive evolution. Mol. Ecol. 28, 1537–1549 (2019).
Schley, R. J. et al. The ecology of palm genomes: repeat-associated genome size expansion is constrained by aridity. http://biorxiv.org/lookup/doi/; https://doi.org/10.1101/2021.11.04.467295 (2021).
Bezandry, R. et al. The evolutionary history of three Baracoffea species from Western Madagascar revealed by Chloroplast and nuclear genomes. PLoS One. 19, e0296362 (2024).
Guyot, R. et al. WCSdb: a database of wild Coffea species. Database. 2020, baaa069 (2020).
Davis, A. P., Tosh, J., Ruch, N. & Fay, M. F. Growing coffee: Psilanthus (Rubiaceae) subsumed on the basis of molecular and morphological data; implications for the size, morphology, distribution and evolutionary history of coffea: Psilanthus subsumed in coffea. Bot. J. Linn. Soc. 167, 357–377 (2011).
Rimlinger, A. et al. Phenotypic diversity assessment within a major ex situ collection of wild endemic coffees in Madagascar. Ann. Botany. 126, 849–863 (2020).
Couturon, E. et al. Caféiers sauvages: un trésor en péril au coeur des forêts tropicales!= Wild coffee-trees: a threatened treasure in the heart of tropical forests! (2016) Montpellier: Association Biodiversité, Ecovalorisation et Caféiers, 117 p. ISBN 978-2-7466-9109-4.
Hamon, P. et al. Genotyping-by-sequencing provides the first well-resolved phylogeny for coffee (Coffea) and insights into the evolution of caffeine content in its species: GBS coffee phylogeny and the evolution of caffeine content. Mol. Phylog. Evolut. 109, 351–361. https://doi.org/10.1016/j.ympev.2017.02.009. Epub 2017 Feb 16 (2017).
Yu, Q. et al. Micro-collinearity and genome evolution in the vicinity of an ethylene receptor gene of cultivated diploid and allotetraploid coffee species (Coffea): recent speciation event of coffea Arabica. Plant J. 67, 305–317 (2011).
Salojärvi, J. et al. The genome and population genomics of allopolyploid coffea Arabica reveal the diversification history of modern coffee cultivars. Nat. Genet. 56, 721–731 (2024).
Razafinarivo, N. J. et al. Geographical gradients in the genome size variation of wild coffee trees (Coffea) native to Africa and Indian ocean Islands. Tree. Genet. Genomes. 8, 1345–1358 (2012).
Noirot, M. Genome size variations in diploid African coffea species. Ann. Botany. 92, 709–714 (2003).
Guyot, R. et al. Partial sequencing reveals the transposable element composition of coffea genomes and provides evidence for distinct evolutionary stories. Mol. Genet. Genomics. 291, 1979–1990 (2016).
Jingade, P., Huded, A. K. C. & Mishra, M. K. First report on genome size and ploidy determination of five Indigenous coffee species using flow cytometry and stomatal analysis. Braz J. Bot. https://doi.org/10.1007/s40415-021-00714-y (2021).
Charr, J. C. et al. Complex evolutionary history of coffees revealed by full plastid genomes and 28,800 nuclear SNP analyses, with particular emphasis on coffea canephora (Robusta coffee). Mol. Phylogenet. Evol. 151, 106906 (2020).
Denoeud, F. et al. The coffee genome provides insight into the convergent evolution of caffeine biosynthesis. Science 345, 1181–1184 (2014).
Katoh, K. & Standley, D. M. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 30, 772–780 (2013).
Price, M. N., Dehal, P. S. & Arkin, A. P. FastTree 2—approximately maximum-likelihood trees for large alignments. PLoS One. 5, e9490 (2010).
Stamatakis, A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30, 1312–1313 (2014).
Tosh, J. et al. Evolutionary history of the Afro-Madagascan Ixora species (Rubiaceae): species diversification and distribution of key morphological traits inferred from dated molecular phylogenetic trees. Ann. Botany. 112, 1723–1742 (2013).
Novák, P., Neumann, P., Pech, J., Steinhaisl, J. & Macas, J. RepeatExplorer: a Galaxy-based web server for genome-wide characterization of eukaryotic repetitive elements from next-generation sequence reads. Bioinformatics 29, 792–793 (2013).
Zimin, A. V. et al. The MaSuRCA genome assembler. Bioinformatics 29, 2669–2677 (2013).
Neumann, P., Novák, P., Hoštáková, N. & Macas, J. Systematic survey of plant LTR-retrotransposons elucidates phylogenetic relationships of their polyprotein domains and provides a reference for element classification. Mob. DNA. 10, 1 (2019).
Fick, S. E. & Hijmans, R. J. WorldClim 2: new 1-km spatial resolution climate surfaces for global land areas. Int. J. Climatol. 37, 4302–4315 (2017).
Raharimalala, N. et al. The absence of the caffeine synthase gene is involved in the naturally decaffeinated status of coffea humblotiana, a wild species from Comoro Archipelago. Sci. Rep. 11, 8119 (2021).
Michael, T. P. Plant genome size variation: bloating and purging DNA. Briefings Funct. Genomics Proteom. 13, 308–317 (2014).
Laten, H. M., Majumdar, A. & Gaucher, E. A. SIRE-1, a copia/Ty1-like retroelement from soybean, encodes a retroviral envelope-like protein. Proc. Natl. Acad. Sci. USA. 95, 6897–6902 (1998).
Pearce, S. SIRE-1, A putative plant retrovirus is closely related to a legume TY1-copia retrotransposon family. Cell. Mol. Biol. Lett. https://doi.org/10.2478/s11658-006-0053-z (2007).
Nascimento, J., Sader, M., Ribeiro, T. & Pedrosa-Harand, A. Influence of Ty3/gypsy and Ty1/copia LTR-retrotransposons on the large genomes of alstroemeriaceae: genome landscape of Bomarea Edulis (Tussac). Herb. Protoplasma. 262, 881–894. https://doi.org/10.1007/s00709-025-02036-2 (2025).
Gorinšek, B., Gubenšek, F. & Kordiš, D. Evolutionary genomics of chromoviruses in eukaryotes. Mol. Biol. Evol. 21, 781–798 (2004).
Cruz, G. M. Q. et al. Virus-Like attachment sites and plastic CpG islands: landmarks of diversity in plant Del retrotransposons. PLoS ONE. 9, e97099 (2014).
Castro, N. et al. Repeatome evolution across space and time: unravelling repeats dynamics in the plant genus Erythrostemon Klotzsch (Leguminosae Juss). Mol. Ecol. https://doi.org/10.1111/mec.17510 (2024).
Lee, J. et al. Rapid amplification of four retrotransposon families promoted speciation and genome size expansion in the genus Panax. Sci. Rep. 7, 9045 (2017).
Cerca, J. et al. Evolutionary genomics of oceanic Island radiations. Trends Ecol. Evol. 38, 631–642 (2023).
Yang, H. et al. Consistent accumulation of transposable elements in species of the Hawaiian Tetragnatha spiny-leg adaptive radiation across the Archipelago chronosequence. Evolutionary J. Linn. Soc. 3, kzae005 (2024).
Craddock, E. M. Profuse evolutionary diversification and speciation on volcanic islands: transposon instability and amplification bursts explain the genetic paradox. Biol. Direct. 11, 44 (2016).
Wright, D. A. & Voytas, D. F. Potential retroviruses in plants: Tat1 is related to a group of Arabidopsis Thaliana Ty3/gypsy retrotransposons that encode Envelope-Like proteins. Genetics 149, 703–715 (1998).
Cintra, L. A. et al. An 82 bp tandem repeat family typical of 3′ non-coding end of Gypsy/TAT LTR retrotransposons is conserved in Coffea spp. Pericentromeres. Genome 65, 137–151 (2022).
Zhang, Q. J. et al. The chromosome-level reference genome of tea tree unveils recent bursts of non-autonomous LTR retrotransposons in driving genome size evolution. Mol. Plant. 13, 935–938 (2020).
Ito, H. Environmental stress and transposons in plants. Genes Genet. Syst. 97, 169–175 (2022).
Cacho, N. I., McIntyre, P. J., Kliebenstein, D. J. & Strauss, S. Y. Genome size evolution is associated with climate seasonality and glucosinolates, but not life history, soil nutrients or range size, across a clade of mustards. Ann. Botany. 127, 887–902 (2021).
Carta, A. & Peruzzi, L. Testing the large genome constraint hypothesis: plant traits, habitat and climate seasonality in liliaceae. New Phytol. 210, 709–716 (2016).
Acknowledgements
The authors thank the French National Research Agency (ANR, Bridges_Coffea project, Grant Number ANR-23-CE20-0047-01) and FAPESP (Grant Number #2023/03353-3) for financial support. We would also like to thank the Rufford Foundation (Small Grant 39692-1) and the following HPC bioinformatics platform for its support: the French Bioinformatics Institute (IFB, funded by ANR, ANR-11-INBS-0013).
Funding
ANR, Bridges_Coffea project, Grant Number ANR-23-CE20-0047-01. Fapesp Grant Number # 2023/03353-3.
Author information
Authors and Affiliations
Contributions
MD, LGG and SOA conducted the main analyses; RB, NR, LFPP, DC, PDB, CF, LB, PD, PH participated to data acquisition (sample and sequencing); DSD and RG designed and conceived the study and wrote the draft manuscript. All authors participated to revise the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
41598_2026_40031_MOESM1_ESM.csv
Sup. Data 1. GPS positions, genome size and bioclimatic data (Worlclim) for the species used in this study. Lat: latitude, long: longitude, group: phylogeographic group, bio1 to bio19 (worldclim data), All to Satellite columns: RepeatExplorer results (number of reads per elements).
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Dupeyron, M., Gonzalez-Garcia, L., Orozco-Arias, S. et al. Evolutionary history and climate-driven dynamics of transposable elements has shaped genome evolution in the Coffea genus. Sci Rep (2026). https://doi.org/10.1038/s41598-026-40031-6
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-026-40031-6