Abstract
Hoverflies (Diptera: Syrphidae) are globally beneficial insects that provide key ecological services, such as pollination and biological pest control. Importantly, extensive research has revealed that numerous species within this group engage in long-distance migrations, thereby enabling these services to operate across broad spatial scales. Hence, the conservation and sustainable use of this functionally important group is crucial for maintaining ecosystem balance and stability, especially in the face of ongoing losses of ecosystem functioning. However, limited genomic and transcriptomic resources hinder the advancement of research on this significant group. To address this gap, we generated a comprehensive developmental- and tissue-resolved transcriptome for Episyrphus balteatus, the dominant and most extensively researched migratory hoverfly species. We sequenced 30 RNA-seq libraries across three life stages (egg, larva, and pupa) and seven adult tissues (antenna, proboscis, head, thorax, body, leg, and wing), yielding 133.87 Gb of clean reads. The subsequent de novo assembly yielded 85,676 unigenes (N50 = 1,028 bp) exhibiting high completeness (97.9% BUSCO), and 45,479 unigenes (53.08%) were functionally annotated. Notably, differential expression analysis identified key gene sets and enriched pathways linked to development, sensory perception, and environmental responsiveness, which are molecular characteristics that may facilitate long-distance migration and ecosystem service delivery. Together, this dataset serves as a valuable community resource for ecological, evolutionary, and comparative research on hoverflies and other beneficial insects.
Similar content being viewed by others
Data availability
The raw data for the 30 samples of E. balteatus in this study are available at the SRA database, with accession number SRP640040 (https://identifiers.org/ncbi/insdc.sra:SRP640040)24. The assembled E. balteatus transcriptome assembly has been submitted to DDBJ/EMBL/GenBank under the accession GLKO00000000 (http://identifiers.org/ncbi/insdc:GLKO01000000)25. The read counts of unigenes, transcriptome assembly and annotation files have been submitted to the GEO database with accession number GSE324442 (https://identifiers.org/geo/GSE324442)26.
Code availability
No custom code was used in this study. Fastp (Version 0.19.5): https://github.com/OpenGene/fastp. Trinity (Version v2.8.5): https://github.com/trinityrnaseq/trinityrnaseq. Transrate (Version v1.0.3): http://hibberdlab.com/transrate/index.html. BUSCO (Version 3.0.2): https://busco.ezlab.org/. RSEM (Version 1.3.1): http://deweylab.biostat.wisc.edu/rsem/. DESeq2 (Version 1.24.0): http://bioconductor.org/packages/stats/bioc/DESeq2/.
References
Bauer, S. & Hoye, B. J. Migratory animals couple biodiversity and ecosystem functioning worldwide. Science 344, 1242552 (2014).
Chapman, J. W., Reynolds, D. R. & Wilson, K. Long-range seasonal migration in insects: mechanisms, evolutionary drivers and ecological consequences. Ecol. Lett. 18, 287–302 (2015).
Satterfield, D. A., Sillett, T. S., Chapman, J. W., Altizer, S. & Marra, P. P. Seasonal insect migrations: massive, influential, and overlooked. Front. Ecol. Environ. 18, 335–344 (2020).
Zhou, Y. et al. Long-term insect censuses capture progressive loss of ecosystem functioning in East Asia. Sci. Adv. 9, eade9341 (2023).
Hu, G. et al. The east asian insect flyway: geographical and climatic factors driving migration among diverse crop pests. Annu. Rev. Entomol. 70, 1–22 (2025).
Potts, S. G. et al. Global pollinator declines: trends, impacts and drivers. Trends Ecol. Evol. 25, 345–353 (2010).
Seibold, S. et al. Arthropod decline in grasslands and forests is associated with landscape-level drivers. Nature 574, 671–674 (2019).
Artamendi, M., Martin, P. A., Bartomeus, I. & Magrach, A. Loss of pollinator diversity consistently reduces reproductive success for wild and cultivated plants. Nat. Ecol. Evol. 9, 296–313 (2025).
Wotton, K. R. et al. Mass seasonal migrations of hoverflies provide extensive pollination and crop protection services. Curr. Biol. 29, 2167–2173.e5 (2019).
Doyle, T. et al. Pollination by hoverflies in the Anthropocene. Proc. Biol. Sci. 287, 20200508 (2020).
Jia, H. et al. Windborne migration amplifies insect-mediated pollination services. eLife 11, e76230 (2022).
Lack, E. Migration of insects and birds through a Pyrenean pass. J. Anim. Ecol. 63–67 (1951).
Menz, M. H. M., Brown, B. V. & Wotton, K. R. Quantification of migrant hoverfly movements (Diptera: Syrphidae) on the West Coast of North America. R. Soc. Open Sci. 6, 190153 (2019).
Finch, J. T. D. & Cook, J. M. Flies on vacation: evidence for the migration of Australian Syrphidae (Diptera). Ecol. Entomol. 45, 896–900 (2020).
Ouin, A. et al. Can deuterium stable isotope values be used to assign the geographic origin of an auxiliary hoverfly in south-western France? Rapid Commun. Mass Spectrom. 25, 2793–2798 (2011).
Raymond, L., Vialatte, A. & Plantegenest, M. Combination of morphometric and isotopic tools for studying spring migration dynamics in Episyrphus balteatus. Ecosphere 5, 1–16 (2014).
Gao, B. et al. Adaptive strategies of high-flying migratory hoverflies in response to wind currents. Proc. Biol. Sci. 287, 20200406 (2020).
Grabherr, M. G. et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat. Biotechnol. 29, 644–652 (2011).
Smith-Unna, R., Boursnell, C., Patro, R., Hibberd, J. M. & Kelly, S. TransRate: reference-free quality assessment of de novo transcriptome assemblies. Genome Res. 26, 1134–1144 (2016).
Simão, F. A., Waterhouse, R. M., Ioannidis, P., Kriventseva, E. V. & Zdobnov, E. M. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31, 3210–3212 (2015).
Li, B. & Dewey, C. N. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinformatics 12, 323 (2011).
Conesa, A. et al. A survey of best practices for RNA-seq data analysis. Genome Biol. 17, 13 (2016).
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 550 (2014).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRP640040 (2025).
Yuan, H. et al. Episyrphus balteatus, transcriptome shotgun assembly. Transcriptome Shotgun Assembly Sequence Database NCBI http://identifiers.org/ncbi/insdc:GLKO01000000 (2025).
NCBI Gene Expression Omnibus https://identifiers.org/geo/GSE324442 (2026).
Acknowledgements
This study was supported by Key R&D Program of Zhejiang (2024SSYS0105), the Postdoctoral Fellowship Program (Grade C) of China Postdoctoral Science Foundation (Grant No. GZC20241954) and Zhejiang Provincial Natural Science Foundation of China under Grant No. LQN25C140001.
Author information
Authors and Affiliations
Contributions
H.Y., H.R.J. and K.M.W conceived the research project. H.Y., H.R.J. and X.Y.Z. participated in the data analysis. H.R.J. and H.L. collected the samples. H.Y., H.R.J. and X.Y.Z. wrote the manuscript. H.Y., H.R.J., X.Y.Z. and K.M.W. revised the manuscript. All authors have read and approved the final manuscript for submission.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Yuan, H., Jia, H., Zhou, X. et al. Comprehensive stage- and tissue-specific transcriptome of the global ecosystem service insect, marmalade hoverfly Episyrphus balteatus. Sci Data (2026). https://doi.org/10.1038/s41597-026-07148-9
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41597-026-07148-9


