Abstract
Flaviviridae is a family of non-segmented positive-sense RNA viruses that includes major pathogens such as hepatitis C virus, dengue viruses and yellow fever virus. Recent large-scale metagenomic surveys have identified many RNA viruses related to members of this family, such as orthoflaviviruses and pestiviruses. These viruses diverge by having different genome lengths and configurations, and host range. Here we performed an analysis of RNA-directed RNA polymerase (RdRP) hallmark gene sequences of flaviviruses and ‘flavi-like’ viruses. We uncovered four divergent clades and multiple lineages that are congruent with phylogenies of their helicase genes, protein profile hidden Markov model profiles, and evolutionary relationships based on predicted RdRP protein structures. These results support their classification into three families (Flaviviridae, Pestiviridae and Hepaciviridae) and 12 genera in the established order Amarillovirales, with groupings correlating with genome properties and host range. This taxonomy provides a framework for future evolutionary studies on this important viral family.
This is a preview of subscription content, access via your institution
Access options
Access Nature and 54 other Nature Portfolio journals
Get Nature+, our best-value online-access subscription
$32.99 / 30 days
cancel any time
Subscribe to this journal
Receive 12 digital issues and online access to articles
$119.00 per year
only $9.92 per issue
Buy this article
- Purchase on SpringerLink
- Instant access to the full article PDF.
USD 39.95
Prices may be subject to local taxes which are calculated during checkout





Data availability
Databases, sequence alignments and raw sequence distance data are provided in Supplementary Information. All RdRP predicted structures and resultant structure-based trees can be found in GitHub at https://github.com/GroveLab/Flavi_RdRp_Structures_Simmonds_2025 (ref. 59).
Code availability
All code used in the analysis is freely available from the sources cited in the manuscript. Correspondence about the analysis should be addressed to P.S. or J.C.O.M. (RdRP sequence analysis), J.G. (RdRP structure analysis), R.M. (GRAViTy analysis) or A.B. (helicase analysis).
References
Simmonds, P. et al. ICTV virus taxonomy profile: Flaviviridae. J. Gen. Virol. 98, 2–3 (2017).
Choo, Q.-L. et al. Genetic organization and diversity of the hepatitis C virus. Proc. Natl Acad. Sci. USA 88, 2451–2455 (1991).
Scheel, T. K. H., Simmonds, P. & Kapoor, A. Surveying the global virome: identification and characterization of HCV-related animal hepaciviruses. Antivir. Res. 115, 83–93 (2015).
Stapleton, J. T., Foung, S., Muerhoff, A. S., Bukh, J. & Simmonds, P. The GB viruses: a review and proposed classification of GBV-A, GBV-C (HGV), and GBV-D in genus Pegivirus within the family Flaviviridae. J. Gen. Virol. 92, 233–246 (2011).
Wu, Z. et al. The first nonmammalian pegivirus demonstrates efficient in vitro replication and high lymphotropism. J. Virol. 94, e01150-20 (2020).
Postler, T. S. et al. Renaming of the genus Flavivirus to Orthoflavivirus and extension of binomial species names within the family Flaviviridae. Arch. Virol. 168, 224 (2023).
Shi, M. et al. Divergent viruses discovered in arthropods and vertebrates revise the evolutionary history of the Flaviviridae and related viruses. J. Virol. 90, 659–669 (2016).
Qin, X.-C. et al. A tick-borne segmented RNA virus contains genome segments derived from unsegmented viral ancestors. Proc. Natl Acad. Sci. USA 111, 6744–6749 (2014).
Ladner, J. T. et al. A multicomponent animal virus isolated from mosquitoes. Cell Host Microbe 20, 357–367 (2016).
Zhang, S. et al. Conserved untranslated regions of multipartite viruses: natural markers of novel viral genomic components and tags of viral evolution. Virus Evol. 10, veae004 (2024).
Paraskevopoulou, S. et al. Viromics of extant insect orders unveil the evolution of the flavi-like superfamily. Virus Evol. 7, veab030 (2021).
Colmant, A. M. G., Charrel, R. N. & Coutard, B. Jingmenviruses: ubiquitous, understudied, segmented flavi-like viruses. Front. Microbiol. 13, 997058 (2022).
Bamford, C. G. G., de Souza, W. M., Parry, R. & Gifford, R. J. Comparative analysis of genome-encoded viral sequences reveals the evolutionary history of flavivirids (family Flaviviridae). Virus Evol. 8, veac085 (2022).
Simmonds, P. et al. Four principles to establish a universal virus taxonomy. PLoS Biol. 21, e3001922 (2023).
Brüssow, H. The not so universal tree of life or the place of viruses in the living world. Phil. Trans. R. Soc. Lond. B Biol. Sci. 364, 2263–2274 (2009).
Krupovic, M., Dolja, V. V. & Koonin, E. V. Origin of viruses: primordial replicators recruiting capsids from hosts. Nat. Rev. Microbiol. 17, 449–458 (2019).
Nasir, A., Romero-Severson, E. & Claverie, J.-M. Investigating the concept and origin of viruses. Trends Microbiol. 28, 959–967 (2020).
Koonin, E. V. et al. Global organization and proposed megataxonomy of the virus world. Microbiol. Mol. Biol. Rev. 84, e00061-19 (2020).
Kuhn, J. H. et al. Classify viruses—the gain is worth the pain. Nature 566, 318–320 (2019).
Varsani, A. et al. Summary of taxonomy changes ratified by the International Committee on Taxonomy of Viruses (ICTV) from the Animal DNA Viruses and Retroviruses Subcommittee, 2025. J. Gen. Virol. 106, 002113 (2025).
Arhab, Y., Bulakhov, A. G., Pestova, T. V. & Hellen, C. U. T. Dissemination of internal ribosomal entry sites (IRES) between viruses by horizontal gene transfer. Viruses 12, 612 (2020).
Belsham, G. J. Divergent picornavirus IRES elements. Virus Res. 139, 183–192 (2009).
Mifsud, J. C. O. et al. Mapping glycoprotein structure reveals Flaviviridae evolutionary history. Nature 633, 695–703 (2024).
Nguyen, L.-T., Schmidt, H. A., von Haeseler, A. & Minh, B. Q. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 32, 268–274 (2015).
Kartashov, M. Y. et al. Novel flavi-like virus in ixodid ticks and patients in Russia. Ticks Tick Borne Dis. 14, 102101 (2023).
Debat, H. & Bejerman, N. Two novel flavi-like viruses shed light on the plant-infecting koshoviruses. Arch. Virol. 168, 184 (2023).
Schönegger, D., Marais, A., Faure, C. & Candresse, T. A new flavi-like virus identified in populations of wild carrots. Arch. Virol. 167, 2407–2409 (2022).
Jumper, J. et al. Highly accurate protein structure prediction with AlphaFold. Nature 596, 583–589 (2021).
Varadi, M. et al. AlphaFold protein structure database in 2024: providing structure coverage for over 214 million protein sequences. Nucleic Acids Res. 52, D368–D375 (2024).
van Kempen, M. et al. Fast and accurate protein structure search with Foldseek. Nat. Biotechnol. 42, 243–246 (2024).
Moi, D. et al. Structural phylogenetics unravels the evolutionary diversification of communication systems in gram-positive bacteria and their viruses. Nat. Struct. Mol. Biol. https://doi.org/10.1038/s41594-025-01649-8 (2025).
Mirdita, M. et al. ColabFold: making protein folding accessible to all. Nat. Methods 19, 679–682 (2022).
Chai Discovery Team et al. Chai-1: decoding the molecular interactions of life. Preprint at bioRxiv https://doi.org/10.1101/2024.10.10.615955 (2024).
Lin, Z. et al. Evolutionary-scale prediction of atomic-level protein structure with a language model. Science 379, 1123–1130 (2023).
Mayne, R., Aiewsakun, P., Turner, D., Adriaenssens, E. M. & Simmonds, P. GRAViTy-V2: a grounded viral taxonomy application. NAR Genom. Bioinform. 6, lqae183 (2024).
Marin, M. S., Zanotto, P. M. D. A., Gritsun, T. S. & Gould, E. A. Phylogeny of TYU, SRE, and CFA virus: different evolutionary rates in the genus Flavivirus. Virology 206, 1133–1139 (1995).
Mifsud, J. C. O. et al. Transcriptome mining extends the host range of the Flaviviridae to non-bilaterians. Virus Evol. 9, veac124 (2023).
Kobayashi, K. et al. Gentian Kobu-sho-associated virus: a tentative, novel double-stranded RNA virus that is relevant to gentian Kobu-sho syndrome. J. Gen. Plant Pathol. 79, 56–63 (2013).
Wolf, Y. I. et al. Origins and evolution of the global RNA virome. mBio 9, e02329-18 (2018).
Fletcher, S. P. & Jackson, R. J. Pestivirus internal ribosome entry site (IRES) structure and function: elements in the 5’ untranslated region important for IRES function. J. Virol. 76, 5024–5033 (2002).
Reusken, C. B. E. M., Dalebout, T. J., Eerligh, P., Bredenbeek, P. J. & Spaan, W. J. M. Analysis of hepatitis C virus/classical swine fever virus chimeric 5’NTRs: sequences within the hepatitis C virus IRES are required for viral RNA replication. J. Gen. Virol. 84, 1761–1769 (2003).
Capella-Gutiérrez, S., Silla-Martínez, J. M. & Gabaldón, T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics 25, 1972–1973 (2009).
Kalyaanamoorthy, S., Minh, B. Q., Wong, T. K. F., von Haeseler, A. & Jermiin, L. S. ModelFinder: fast model selection for accurate phylogenetic estimates. Nat. Methods 14, 587–589 (2017).
Hoang, D. T., Chernomor, O., von Haeseler, A., Minh, B. Q. & Vinh, L. S. UFBoot2: improving the ultrafast bootstrap approximation. Mol. Biol. Evol. 35, 518–522 (2018).
Kumar, S., Stecher, G. & Tamura, K. MEGA7: Molecular Evolutionary Genetics Analysis Version 7.0 for bigger datasets. Mol. Biol. Evol. 33, 1870–1874 (2016).
Katoh, K. & Standley, D. M. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 30, 772–780 (2013).
Scornavacca, C., Zickmann, F. & Huson, D. H. Tanglegrams for rooted phylogenetic trees and networks. Bioinformatics 27, i248–i256 (2011).
Galili, T. dendextend: an R package for visualizing, adjusting and comparing trees of hierarchical clustering. Bioinformatics 31, 3718–3720 (2015).
Aiewsakun, P. & Simmonds, P. The genomic underpinnings of eukaryotic virus taxonomy: creating a sequence-based framework for family-level virus classification. Microbiome 6, 38 (2018).
Lesburg, C. A. et al. Crystal structure of the RNA-dependent RNA polymerase from hepatitis C virus reveals a fully encircled active site. Nat. Struct. Biol. 6, 937–943 (1999).
Liu, W., Shi, X. & Gong, P. A unique intra-molecular fidelity-modulating mechanism identified in a viral RNA-dependent RNA polymerase. Nucleic Acids Res. 46, 10840–10854 (2018).
Noble, C. G. et al. A conserved pocket in the dengue virus polymerase identified through fragment-based screening. J. Biol. Chem. 291, 8541–8548 (2016).
Liu, Z. et al. Crystal structures of RNA-dependent RNA polymerases from Jingmen tick virus and Alongshan virus. hLife 2, 18–31 (2024).
Mariani, V., Biasini, M., Barbato, A. & Schwede, T. lDDT: a local superposition-free score for comparing protein structures and models using distance difference tests. Bioinformatics 29, 2722–2728 (2013).
Howe, K., Bateman, A. & Durbin, R. QuickTree: building huge neighbour-joining trees of protein sequences. Bioinformatics 18, 1546–1547 (2002).
Letunic, I. & Bork, P. Interactive Tree of Life (iTOL) v6: recent updates to the phylogenetic tree display and annotation tool. Nucleic Acids Res. 52, W78–W82 (2024).
Bittrich, S., Segura, J., Duarte, J. M., Burley, S. K. & Rose, Y. RCSB protein data bank: exploring protein 3D similarities via comprehensive structural alignments. Bioinformatics 40, btae370 (2024).
Meng, E. C. et al. UCSF ChimeraX: tools for structure building and analysis. Protein Sci. 32, e4792 (2023).
GroveLab. Flavivirus RdRP Structures - Simmonds et al. 2025. GitHub https://github.com/GroveLab/Flavi_RdRp_Structures_Simmonds_2025 (2025).
Acknowledgements
We thank A. Crane for critically editing the manuscript. This work was supported in part through a Laulima Government Solutions, LLC, prime contract with the National Institute of Allergy and Infectious Diseases (Contract No. HHSN272201800013C). J.H.K. performed this work as an employee of Tunnell Government Services (TGS), a subcontractor of Laulima Government Solutions, LLC, under Contract No. HHSN272201800013C. N.V. acknowledges partial support from the Centers for Research in Emerging Infectious Diseases (CREID) Coordinating Research on Emerging Arboviral Threats Encompassing the NEOtropics (CREATE-NEO) U01AI151807 grant by the National Institutes of Health (NIH). A.B. was supported by a postdoctoral fellowship from Foundation pour la Recherche Mèdicale (grant number SPF202110014092). J.G. was supported by a Wellcome Trust/Royal Society Sir Henry Dale Fellowship (107653/Z/15/Z) and MRC-University of Glasgow Centre for Virus Research core support from the Medical Research Council (MC_UU_00034/1). J.T.S. was supported by Veterans Administration Merit Review BX000207 and VA SEQCure Network grants. The views and conclusions contained in this document are those of the authors and should not be interpreted as necessarily representing the official policies, either expressed or implied, of the US Department of Health and Human Services or of the institutions and companies affiliated with the authors. Mention of trade names, commercial products or organizations does not imply endorsement by the US Government.
Author information
Authors and Affiliations
Contributions
P.S. and J.H.K., in correspondence with other members of the ICTV Flaviviridae Study Group (M.B., J.B., J.F.D., A.K., V.L., J.T.S., D.B.S. and N.V.), conceived the study. P.S., A.B., J.G., R.M, J.C.O.M. and J.H.K. conceptualized the experimental section. P.S., A.B., J.G., R.M., D.B.S. and J.C.O.M. performed analyses. All authors wrote/revised the manuscript and P.S. and J.H.K. supervised the work. All authors read and approved the final manuscript.
Corresponding authors
Ethics declarations
Competing interests
All authors declare no competing interests.
Peer review
Peer review information
Nature Microbiology thanks Patrick Dolan, Alexander Ploss and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Extended data
Extended Data Fig. 1 Comparison of phylogenetic trees of RNA-directed RNA polymerase domain (NS5/NS5B) produced by different tree-building methods.
Phylogenetic trees constructed by likelihood (a, b) and distance-based (c) methods using flavivirus and ‘flavi-like’ RNA-directed RNA polymerase (RdRP) domain amino-acid sequences. Clades (I–IV) and lineages (a–w) labelled in each tree are based on those in Fig. 1b. Tentative threshold levels of divergence separating clades and lineages are shown as red dotted lines in BEAST and UPGMA trees. An alternative threshold corresponding to the assignment of lineages Ia-Id to a common lineage is shown in a blue dotted line. Abbreviations: BP, before present; BEAST, Bayesian evolutionary analysis by sampling trees cross-platform program; JTT, Jones-Taylor-Thornton matrix; ML, maximum likelihood; UPGMA, unweighted pair group method with arithmetic mean.
Extended Data Fig. 3 Mean pairwise amino acid sequence identities between lineages in clades I–III.
Mean pairwise sequence identities of RdRP domain amino acid sequences between and within lineages of clades I–III. Dotted line indicates an approximate threshold dividing within- and between-lineage distances; between-lineage comparisons above threshold shaded in grey; within-lineage distances below the inter-lineage threshold shown in black.
Supplementary information
Supplementary Information (download PDF )
Labelled version of phylogenetic trees and dendrograms, and results from an analysis of protein structure relationships using a method different from that shown in Fig. 3.
Supplementary Data 1
FASTA alignment.
Supplementary Data 2
Tree file with sequence label annotations.
Supplementary Data 3
FASTA alignment.
Supplementary Data 4
GRAViTy run parameters.
Supplementary Table 5 (download XLSX )
Sequence listing.
Rights and permissions
About this article
Cite this article
Simmonds, P., Butković, A., Grove, J. et al. Taxonomic expansion and reorganization of Flaviviridae. Nat Microbiol 10, 3026–3037 (2025). https://doi.org/10.1038/s41564-025-02134-0
Received:
Accepted:
Published:
Version of record:
Issue date:
DOI: https://doi.org/10.1038/s41564-025-02134-0