Abstract
Gene exchange between viruses and their hosts acts as a key facilitator of horizontal gene transfer and is hypothesized to be a major driver of evolutionary change. Our understanding of this process comes primarily from bacteria and phage co-evolution, but the mode and functional importance of gene transfers between eukaryotes and their viruses remain anecdotal. Here we systematically characterized viral–eukaryotic gene exchange across eukaryotic and viral diversity, identifying thousands of transfers and revealing their frequency, taxonomic distribution and projected functions. Eukaryote-derived viral genes, abundant in the Nucleocytoviricota, highlighted common strategies for viral host-manipulation, including metabolic reprogramming, proteolytic degradation and extracellular modification. Furthermore, viral-derived eukaryotic genes implicate genetic exchange in the early evolution and diversification of eukaryotes, particularly through viral-derived glycosyltransferases, which have impacted structures as diverse as algal cell walls, trypanosome mitochondria and animal tissues. These findings illuminate the nature of viral–eukaryotic gene exchange and its impact on the evolution of viruses and their eukaryotic hosts.
This is a preview of subscription content, access via your institution
Access options
Access Nature and 54 other Nature Portfolio journals
Get Nature+, our best-value online-access subscription
$32.99 / 30 days
cancel any time
Subscribe to this journal
Receive 12 digital issues and online access to articles
$119.00 per year
only $9.92 per issue
Buy this article
- Purchase on SpringerLink
- Instant access to the full article PDF.
USD 39.95
Prices may be subject to local taxes which are calculated during checkout




Similar content being viewed by others
Data availability
All data, including proteomes, protein families, annotations, alignments, phylogenies and summaries of detected HGTs (both before and after contamination filtering) are available from Dryad (https://datadryad.org/stash/dataset/doi:10.5061/dryad.z08kprrc9).
Code availability
All the code used for phylogenetic interpretation, contamination scoring, and functional enrichments and analyses are available from Dryad (https://datadryad.org/stash/dataset/doi:10.5061/dryad.z08kprrc9).
References
Zimmerman, A. E. et al. Metabolic and biogeochemical consequences of viral infection in aquatic ecosystems. Nat. Rev. Microbiol. 18, 21–34 (2019).
Koonin, E. V. & Krupovic, M. The depths of virus exaptation. Curr. Opin. Virol. 31, 1–8 (2018).
Frank, J. A. & Feschotte, C. Co-option of endogenous viral sequences for host cell function. Curr. Opin. Virol. 25, 81–89 (2017).
Moreira, D. & Brochier-Armanet, C. Giant viruses, giant chimeras: the multiple evolutionary histories of Mimivirus genes. BMC Evol. Biol. 8, 12 (2008).
Filée, J., Pouget, N. & Chandler, M. Phylogenetic evidence for extensive lateral acquisition of cellular genes by Nucleocytoplasmic large DNA viruses. BMC Evol. Biol. 8, 320 (2008).
Monier, A. et al. Host-derived viral transporter protein for nitrogen uptake in infected marine phytoplankton. Proc. Natl Acad. Sci. USA 114, E7489–E7498 (2017).
Monier, A. et al. Horizontal gene transfer of an entire metabolic pathway between a eukaryotic alga and its DNA virus. Genome Res. 1441–1449 (2009).https://doi.org/10.1101/gr.091686.109
Schulz, F. et al. Giant virus diversity and host interactions through global metagenomics. Nature 578, 432–436 (2020).
Aswad, A. & Katzourakis, A. Cell-derived viral genes evolve under stronger purifying selection in rhadinoviruses. J. Virol. 92, e00539-18 (2018).
Schulz, F. et al. Giant viruses with an expanded complement of translation system components. Science 356, 82–85 (2017).
Guglielmini, J., Woo, A. C., Krupovic, M., Forterre, P. & Gaia, M. Diversification of giant and large eukaryotic dsDNA viruses predated the origin of modern eukaryotes. Proc. Natl Acad. Sci. USA 116, 19585–19592 (2019).
Enav, H., Mandel-Gutfreund, Y. & Béjà, O. Comparative metagenomic analyses reveal viral-induced shifts of host metabolism towards nucleotide biosynthesis. Microbiome 2, 9 (2014).
Vardi, A. et al. Host-virus dynamics and subcellular controls of cell fate in a natural coccolithophore population. Proc. Natl Acad. Sci. USA 109, 19327–19332 (2012).
Biémont, C. A brief history of the status of transposable elements: from junk DNA to major players in evolution. Genetics 186, 1085–1093 (2010).
Gornik, S. G. et al. Loss of nucleosomal DNA condensation coincides with appearance of a novel nuclear protein in dinoflagellates. Curr. Biol. 22, 2303–2312 (2012).
Medina, E. M., Turner, J. J., Gordân, R., Skotheim, J. M. & Buchler, N. E. Punctuated evolution and transitional hybrid network in an ancestral cell cycle of fungi. Elife 5, e09492 (2016).
Pastuzyn, E. D. et al. The neuronal gene Arc encodes a repurposed retrotransposon Gag protein that mediates intercellular RNA transfer. Cell 172, 275–288 (2018).
Mi, S. et al. Syncytin is a captive retroviral envelope protein involved in human placental morphogenesis. Nature 403, 785–789 (2002).
Cornelis, G. et al. An endogenous retroviral envelope syncytin and its cognate receptor identified in the viviparous placental Mabuya lizard. Proc. Natl Acad. Sci. USA 114, E10991–E11000 (2017).
Irwin, N. A. T. et al. Viral proteins as a potential driver of histone depletion in dinoflagellates. Nat. Commun. 9, 1535 (2018).
Forterre, P. & Prangishvili, D. The major role of viruses in cellular evolution: facts and hypotheses. Curr. Opin. Virol. 3, 558–565 (2013).
Delaroque, N., Maier, L., Knippers, R. & Müller, D. G. Persistent virus integration into the genome of its algal host, Ectocarpus siliculosus (Phaeophyceae). J. Gen. Virol. 80, 1367–1370 (1999).
Blanc, G., Gallot-Lavallée, L. & Maumus, F. Provirophages in the Bigelowiella genome bear testimony to past encounters with giant viruses. Proc. Natl Acad. Sci. USA 112, E5318–E5326 (2015).
Leonard, G. et al. Comparative genomic analysis of the ‘pseudofungus’ Hyphochytrium catenoides. Open Biol. 8, 170184 (2018).
McCrone, J. T. & Lauring, A. S. Genetic bottlenecks in intraspecies virus transmission. Curr. Opin. Virol. 28, 20–25 (2018).
Awadalla, P. The evolutionary genomics of pathogen recombination. Nat. Rev. Genet. 4, 50–60 (2003).
Koonin, E. V. & Yutin, N. Origin and evolution of eukaryotic large nucleo-cytoplasmic DNA viruses. Intervirology 53, 284–292 (2010).
Koonin, E. V., Dolja, V. V. & Krupovic, M. Origins and evolution of viruses of eukaryotes: the ultimate modularity. Virology 479–480, 2–25 (2015).
Filée, J. & Chandler, M. Gene exchange and the origin of giant viruses. Intervirology 53, 354–361 (2010).
Sun, T.-W. et al. Host range and coding potential of eukaryotic giant viruses. Viruses 12, 1337 (2020).
Fraser, M. J., Smith, G. E. & Summers, M. D. Acquisition of host cell DNA sequences by Baculoviruses: relationship between host DNA insertions and FP mutants of Autographa californica and Galleria mellonella nuclear polyhedrosis viruses. J. Virol. 47, 287–300 (1983).
Liu, H. et al. Widespread horizontal gene transfer from double-stranded RNA viruses to eukaryotic nuclear genomes. J. Virol. 84, 11876–11887 (2010).
Bordenstein, S. R. & Bordenstein, S. R. Eukaryotic association module in phage WO genomes from Wolbachia. Nat. Commun. 7, 13155 (2016).
Richards, T. A., Hirt, R. P., Williams, B. A. P. & Embley, T. M. Horizontal gene transfer and the evolution of parasitic Protozoa. Protist 154, 17–32 (2003).
Hayward, A., Cornwallis, C. K. & Jern, P. Pan-vertebrate comparative genomics unmasks retrovirus macroevolution. Proc. Natl Acad. Sci. USA 112, 464–469 (2015).
Cock, J. M. et al. The Ectocarpus genome and the independent evolution of multicellularity in brown algae. Nature 465, 617–621 (2010).
Moniruzzaman, M., Weinheimer, A. R., Martinez-Gutierrez, C. A. & Aylward, F. O. Widespread endogenization of giant viruses shapes genomes of green algae. Nature 588, 141–145 (2020).
Maumus, F. & Blanc, G. Study of gene trafficking between Acanthamoeba and giant viruses suggests an undiscovered family of amoeba-infecting viruses. Genome Biol. Evol. 8, 3351–3363 (2016).
Fischer, M. G. & Hackl, T. Host genome integration and giant virus-induced reactivation of the virophage mavirus. Nature 540, 288–291 (2016).
Maumus, F., Epert, A., Nogué, F. & Blanc, G. Plant genomes enclose footprints of past infections by giant virus relatives. Nat. Commun. 5, 4268 (2014).
Patel, M. R., Emerman, M. & Malik, H. S. Paleovirology – ghosts and gifts of viruses past. Curr. Opin. Virol. 1, 304–309 (2011).
Williams, T. A., Embley, T. M. & Heinz, E. Informational gene phylogenies do not support a fourth domain of life for nucleocytoplasmic large DNA viruses. PLoS ONE 6, e21080 (2011).
Tonetti, M., Sturla, L., Bisso, A., Benatti, U. & de Flora, A. Synthesis of GDP-L-fucose by the human FX protein. J. Biol. Chem. 271, 27274–27279 (1996).
Forterre, P., Gribaldo, S., Gadelle, D. & Serre, M. C. Origin and evolution of DNA topoisomerases. Biochimie 89, 427–446 (2007).
Malik, S. S., Azem-e-Zahra, S., Kim, K. M., Caetano-Anollés, G. & Nasir, A. Do viruses exchange genes across superkingdoms of life? Front. Microbiol. 8, 2110 (2017).
Gilbert, C. & Cordaux, R. Viruses as vectors of horizontal transfer of genetic material in eukaryotes. Curr. Opin. Virol. 25, 16–22 (2017).
Chen, J. et al. Genome hypermobility by lateral transduction. Science 362, 207–212 (2018).
Touchon, M., Moura de Sousa, J. A. & Rocha, E. P. Embracing the enemy: the diversification of microbial gene repertoires by phage-mediated horizontal gene transfer. Curr. Opin. Microbiol. 38, 66–73 (2017).
Krupovic, M. & Koonin, E. V. Multiple origins of viral capsid proteins from cellular ancestors. Proc. Natl Acad. Sci. USA 114, E2401–E2410 (2017).
Moreira, D. & López-García, P. Ten reasons to exclude viruses from the tree of life. Nat. Rev. Microbiol. 7, 306–311 (2009).
Almagro Armenteros, J. J., Sønderby, C. K., Sønderby, S. K., Nielsen, H. & Winther, O. DeepLoc: prediction of protein subcellular localization using deep learning. Bioinformatics 33, 3387–3395 (2017).
Duffy, S., Shackelton, L. A. & Holmes, E. C. Rates of evolutionary change in viruses: patterns and determinants. Nat. Rev. Genet. 9, 267–276 (2008).
Jung, J. U., Stäger, M. & Desrosiers, R. C. Virus-encoded cyclin. Mol. Cell. Biol. 14, 7235–7244 (1994).
Erives, A. J. Phylogenetic analysis of the core histone doublet and DNA topo II genes of Marseilleviridae: evidence of proto-eukaryotic provenance. Epigenetics Chromatin 10, 55 (2017).
Schvarcz, C. R. & Steward, G. F. A giant virus infecting green algae encodes key fermentation genes. Virology 518, 423–433 (2018).
Pagarete, A., Allen, M. J., Wilson, W. H., Kimmance, S. A. & De Vargas, C. Host-virus shift of the sphingolipid pathway along an Emiliania huxleyi bloom: survival of the fattest. Environ. Microbiol. 11, 2840–2848 (2009).
Schneider-Schaulies, J. & Schneider-Schaulies, S. Sphingolipids in viral infection. Biol. Chem. 396, 585–595 (2015).
Fernández de Castro, I., Tenorio, R. & Risco, C. Virus assembly factories in a lipid world. Curr. Opin. Virol. 18, 20–26 (2016).
Hiramatsu, S., Ishihara, M., Fujie, M. & Usami, S. Expression of a chitinase gene and lysis of the host cell wall during Chlorella virus CVK2 infection. Virology 260, 308–315 (1999).
Klenk, H. D. & Garten, W. Host cell proteases controlling virus pathogenicity. Trends Microbiol. 2, 39–43 (1994).
Thaker, S. K., Ch’ng, J. & Christofk, H. R. Viral hijacking of cellular metabolism. BMC Biol. 17, 59 (2019).
Mahalingam, S., Meanger, J., Foster, P. S. & Lidbury, B. A. The viral manipulation of the host cellular and immune environments to enhance propagation and survival: a focus on RNA viruses. J. Leukoc. Biol. 72, 429–439 (2002).
Ravindran, M. S., Bagchi, P., Cunningham, C. N. & Tsai, B. Opportunistic intruders: how viruses orchestrate ER functions to infect cells. Nat. Rev. Microbiol. 14, 407–420 (2016).
de Chassey, B., Meyniel-Schicklin, L., Vonderscher, J., André, P. & Lotteau, V. Virus–host interactomics: new insights and opportunities for antiviral drug discovery. Genome Med. 6, 115 (2014).
Puschnik, A. S., Majzoub, K., Ooi, Y. S. & Carette, J. E. A CRISPR toolbox to study virus–host interactions. Nat. Rev. Microbiol. 15, 351–364 (2017).
Michel, G., Tonon, T., Scornet, D., Cock, J. M. & Kloareg, B. Central and storage carbon metabolism of the brown alga Ectocarpus siliculosus: insights into the origin and evolution of storage carbohydrates in eukaryotes. New Phytol. 188, 67–81 (2010).
Durak, G. M. et al. A role for diatom-like silicon transporters in calcifying coccolithophores. Nat. Commun. 7, 10543 (2016).
Armstrong, P. B., Armstrong, M. T., Pardy, R. L., Child, A. & Wainwright, N. Immunohistochemical demonstration of a lipopolysaccharide in the cell wall of a eukaryote, the green alga, Chlorella. Biol. Bull. 203, 203–204 (2002).
Laurent, T. C. & Fraser, J. R. E. Hyaluronan. FASEB 6, 2397–2404 (1992).
Loomis, W. F., Wheeler, S. A., Springer, W. R. & Barondes, S. H. Adhesion mutants of Dictyostelium discoideum lacking the saccharide determinant recognized by two adhesion-blocking monoclonal antibodies. Dev. Biol. 109, 111–117 (1985).
Bandini, G. et al. An essential, kinetoplastid-specific GDP-Fuc: β-D-Gal α-1,2- fucosyltransferase is located in the mitochondrion of Trypanosoma brucei. eLife 10, e70272 (2021).
Schrom, M. & Bablanian, R. Altered cellular morphology resulting from cytocidal virus infection. Arch. Virol. 70, 173–187 (1981).
Raaben, M. et al. The ubiquitin-proteasome system plays an important role during various stages of the coronavirus infection cycle. J. Virol. 84, 7869–7879 (2010).
Leier, H. C. et al. A global lipid map defines a network essential for Zika virus replication. Nat. Commun. 11, 3652 (2020).
Chuong, E. B. The placenta goes viral: retroviruses control gene expression in pregnancy. PLoS Biol. 16, e3000028 (2018).
Li, W. & Godzik, A. Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics 22, 1658–1659 (2006).
Keeling, P. J. et al. The Marine Microbial Eukaryote Transcriptome Sequencing Project (MMETSP): illuminating the functional diversity of eukaryotic life in the oceans through transcriptome sequencing. PLoS Biol. 12, e1001889 (2014).
Simão, F. A., Waterhouse, R. M., Ioannidis, P., Kriventseva, E. V. & Zdobnov, E. M. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31, 3210–3212 (2015).
Enright, A. J., van Dongen, S. & Ouzounis, C. A. An efficient algorithm for large-scale detection of protein families. Nucleic Acids Res. 30, 1575–1584 (2002).
Buchfink, B., Xie, C. & Huson, D. H. Fast and sensitive protein alignment using DIAMOND. Nat. Methods 12, 59–60 (2014).
Katoh, K. & Standley, D. M. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 30, 772–780 (2013).
Mistry, J., Finn, R. D., Eddy, S. R., Bateman, A. & Punta, M. Challenges in homology search: HMMER3 and convergent evolution of coiled-coil regions. Nucleic Acids Res. 41, e121 (2013).
Capella-Gutiérrez, S., Silla-Martínez, J. M. & Gabaldón, T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics 25, 1972–1973 (2009).
Anisimova, M., Gil, M., Dufayard, J. F., Dessimoz, C. & Gascuel, O. Survey of branch support methods demonstrates accuracy, power, and robustness of fast likelihood-based approximation schemes. Syst. Biol. 60, 685–699 (2011).
Shimodaira, H. & Hasegawa, M. Multiple comparisons of log-likelihoods with applications to phylogenetic inference. Mol. Biol. Evol. 16, 1114–1116 (1999).
Nguyen, L. T., Schmidt, H. A., Von Haeseler, A. & Minh, B. Q. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 32, 268–274 (2015).
Tria, F. D. K., Landan, G. & Dagan, T. Phylogenetic rooting using minimal ancestor deviation. Nat. Ecol. Evol. 1, 0193 (2017).
Kalyaanamoorthy, S., Minh, B. Q., Wong, T. K. F., von Haeseler, A. & Jermiin, L. S. ModelFinder: fast model selection for accurate phylogenetic estimates. Nat. Methods 14, 587–589 (2017).
Letunic, I. & Bork, P. Interactive Tree of Life (iTOL) v4: recent updates and new developments. Nucleic Acids Res. 47, 256–259 (2019).
Susko, E. & Roger, A. J. On reduced amino acid alphabets for phylogenetic inference. Mol. Biol. Evol. 24, 2139–2150 (2007).
Huerta-Cepas, J., Serra, F. & Bork, P. ETE 3: reconstruction, analysis, and visualization of phylogenomic data. Mol. Biol. Evol. 33, 1635–1638 (2016).
Federhen, S. The NCBI Taxonomy database. Nucleic Acids Res. 40, 136–143 (2012).
Walker, P. J. et al. Changes to virus taxonomy and to the international code of virus classification and nomenclature ratified by the international committee on taxonomy of viruses (2021). Arch. Virol. 166, 2633–2648 (2021).
Huerta-Cepas, J. et al. Fast genome-wide functional annotation through orthology assignment by eggNOG-Mapper. Mol. Biol. Evol. 34, 2115–2122 (2017).
Jones, P. et al. InterProScan 5: genome-scale protein function classification. Bioinformatics 30, 1236–1240 (2014).
Supek, F., Bošnjak, M., Škunca, N. & Šmuc, T. REVIGO summarizes and visualizes long lists of gene ontology terms. PLoS ONE 6, e21800 (2011).
Dean, S., Sunter, J. D. & Wheeler, R. J. TrypTag.org: a trypanosome genome-wide protein localisation resource. Trends Parasitol. 33, 80–82 (2017).
Aslett, M. et al. TriTrypDB: a functional genomic resource for the Trypanosomatidae. Nucleic Acids Res. 38, 457–462 (2009).
Burki, F., Roger, A. J., Brown, M. W. & Simpson, A. G. B. The new tree of eukaryotes. Trends Ecol. Evol. 35, 43–55 (2020).
Acknowledgements
We thank R. Wheeler for providing fluorescent micrographs of Trypanosoma brucei as part of TrypTag. This work was supported by grants from the Natural Sciences and Engineering Research Council of Canada (NSERC, RGPIN-2014-03994) and from the Gordon and Betty Moore Foundation (https://doi.org/10.37807/GBMF9201) to P.J.K. N.A.T.I. was supported by a Junior Research Fellowship from Merton College, Oxford and an NSERC Canadian Graduate Scholarship. A.A.P. was supported by a European Molecular Biology Organization (EMBO) Long-term Fellowship (ALTF 118-2017). T.A.R. is supported by a Royal Society University Research Fellowship (UF130382).
Author information
Authors and Affiliations
Contributions
N.A.T.I. and A.A.P. conceptualized the project; P.J.K. and T.A.R. acquired funding; N.A.T.I. and A.A.P. conducted the investigations; P.J.K. and T.A.R. provided resources and supervised the project; N.A.T.I. wrote the paper with input from all authors.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Peer review information Nature Microbiology thanks Valerian Dolja, Jolien van Hooff and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Extended data
Extended Data Fig. 1 Dataset assembly and statistics.
a. A schematic representation of the dataset assembly pipeline. b. The numbers of eukaryotic, viral, and prokaryotic genomes and proteins examined and included in the initial and final dataset. The final dataset reflects the dataset upon which the HGT analysis was conducted. c. The representation of viral phyla and other groups (that is, those lacking phyla classifications) in the initial and final datasets. d, e. Summary statistics for the final clustered protein families including the number of sequences present (d) and the trimmed alignment lengths (e). See Methods for additional information.
Extended Data Fig. 2 Phylogenetic pipeline and contamination analysis overview.
a, b. Schematic representation of the HGT identification and phylogenetic analysis pipeline (a) and the contamination scoring protocol (b). c. The distribution of eukaryotic contig lengths that contain viral HGTs. d. The distribution of contamination scores across eukaryotic recipient sequences. e. The number of well-supported HGTs that are identified using different contamination score thresholds. Dashed lines denote the defined scoring threshold (≥ 2). See Methods for additional information.
Extended Data Fig. 3 Eukaryote-to-virus transfers across eukaryotic supergroups.
a-j. Expanded versions of Fig. 1e displaying transfers in the Opisthokonta (a), Archaeplastida (b), Rhodophyta (c), SAR (d), Cryptophyceae (e), Haptista (f), Metamonada (g), Amoebozoa (h), Diplomonada (i), and Apusozoa (j). Bar charts represent HGTs present in an individual genome, whereas pie charts present inferred ancestral HGTs. Bar height and pie diameter reflect transfer frequency and colours denote viral taxonomy. For clarity, viral taxa were mapped to their nearest family, phylum, or higher-level classification. Because of this, multiple families from the same phylum are shown, such as the NCLDV lineages, which are denoted with an asterisk (note that some unclassified viruses include candidate NCLDV lineages without formal taxonomic descriptions). Taxonomic information and phylogenies are based on the NCBI (National Center for Biotechnology Information) Taxonomy database92.
Extended Data Fig. 4 Virus-to-eukaryote transfers across eukaryotic supergroups.
a-j. Expanded versions of Fig. 1f displaying transfers in the Opisthokonta (a), Archaeplastida (b), Rhodophyta (c), SAR (d), Cryptophyceae (e), Haptista (f), Metamonada (g), Amoebozoa (h), Diplomonada (i), and Apusozoa (j). Bar charts represent HGTs present in an individual genome, whereas pie charts present inferred ancestral HGTs. Bar height and pie diameter reflect transfer frequency and colours denote viral taxonomy. For clarity, viral taxa were mapped to their nearest family, phylum, or higher-level classification. Because of this, multiple families from the same phylum are shown, such as the NCLDV lineages, which are denoted with an asterisk (note that some unclassified viruses include candidate NCLDV lineages without formal taxonomic descriptions). Taxonomic information and phylogenies are based on the NCBI (National Center for Biotechnology Information) Taxonomy database92.
Extended Data Fig. 5 Examples of ancient viral-eukaryotic gene transfers occurring prior to the last eukaryotic common ancestor.
Phylogenetic analyses were conducted in IQ-Tree after recoding alignments with a 4-bin Dayhoff matrix. The number of sequences within collapsed clades are noted and more recent HGTs were removed for clarity but did not affect the tree topologies. Statistical support was assessed using SH-aLRT (n = 1,000) and substitution models were selected using ModelFinder and included GTR+F+ASC+R10 (a), GTR+F+R10 (b), GTR+F+R7 (c), GTR+F+R8 (d), and GTR+F+R8 (e).
Extended Data Fig. 6 Gene origin identification and transduction examples.
a. A schematic illustrating how viral gene origins were approximated by moving up through the phylogeny from the donor towards the root until a cellular lineage was encountered. The pie chart reflects the proportion of well supported virus-to-eukaryote HGTs that were assigned a given origin. b, c. Example phylogenies illustrating cases of eukaryote-to-eukaryote and prokaryote-to-eukaryote transduction. The number of sequences within collapsed clades are noted. Phylogenies were generated in IQ-Tree using the LG+R7 (b) or LG+R9 (c) substitution models as selected using ModelFinder and statistical support was assessed using SH-aLRT (n = 1,000)86,88.
Extended Data Fig. 7 Phylogenies for glycosyltransferases denoted in Fig. 4.
Phylogenetic analyses were conducted in IQ-Tree with statistical support assessed using SH-aLRT (n = 1,000). The number of sequences within collapsed clades are noted. Substitution models were selected using ModelFinder and included LG+R10 (a), LG+F+R10 (b), LG+R9 (c), LG+F+R8 (d, e), LG+R6 (f), LG+F+R7 (g), LG+F+G4 (h), LG+F+R5 (i, j), and LG+I+G4 (k).
Supplementary information
Supplementary Table (download XLSX )
Supplementary Tables 1–5, separated in different tabs.
Rights and permissions
About this article
Cite this article
Irwin, N.A.T., Pittis, A.A., Richards, T.A. et al. Systematic evaluation of horizontal gene transfer between eukaryotes and viruses. Nat Microbiol 7, 327–336 (2022). https://doi.org/10.1038/s41564-021-01026-3
Received:
Accepted:
Published:
Version of record:
Issue date:
DOI: https://doi.org/10.1038/s41564-021-01026-3
This article is cited by
-
Mathematical modeling of plasmid-mediated antibiotic resistance dissemination in dual cell populations
Advances in Continuous and Discrete Models (2026)
-
Revisiting giant virus-host dynamics in brown algae: old stories and new perspectives
The EMBO Journal (2026)
-
A bacterial gene acquired by parasitoid wasps contributes to venom secretion against host defence
The EMBO Journal (2026)
-
Diversity, distribution and ecological potential of Oceanospirillales-associated viruses in the global ocean
Science China Earth Sciences (2026)
-
Reassessing viral origins and evolutionary placement in the tree of life
Antonie van Leeuwenhoek (2026)


