Abstract
The wasps of Rhynchium exemplify solitary vespid predators controlling Lepidopteran pests through venom-mediated paralysis, with their venom possessing significant medicinal potential. As dominant models within the species-rich subfamily Eumeninae representing 70% of Vespidae diversity, they provide critical insights into sociality evolution and biocontrol mechanisms. To boost research on Vespidae, we used PacBio long-read, short-read RNA-seq (Illumina) and Hi-C scaffolding technologies to create a high-quality chromosome-level genome assembly for Rhynchium brunneum, an important solitary insect. We obtained a 328.90 Mb assembly with a Scaffold N50 size of 15.98 Mb. We detected 96.2% Benchmarking Universal Single-Copy Orthologues (BUSCO) in the genome assembly, which contains 51.77% repetitive sequences and has 12,999 protein-coding genes annotated. In R. brunneum, we identified 173 gene expansions and 274 genes that underwent contraction or loss. The high-quality genome of R. brunneum provides a valuable genetic resource for future research in evolution, molecular biology, and applied studies.
Data availability
The raw sequencing data of Rhynchium brunneum has been deposited at the National Center for Biotechnology Information (NCBI). The PacBio, Illumina, Hi-C, and transcriptome data are available under accession numbers SRR35603922-SRR3560393159,60,61,62,63,64,65,66,67,68. The assembled genome has been deposited in the NCBI assembly with the accession number GCA_055773155.169. The genome annotation information has been deposited in the Figshare database70.
Code availability
No custom scripts or code were generated in this study. All data analyses were performed according to the manual and protocols of the published bioinformatic tools.
References
Lee, S. H., Baek, J. H. & Yoon, K. A. Differential Properties of Venom Peptides and Proteins in Solitary vs. Social Hunting Wasps. Toxins 8, 32, https://doi.org/10.3390/toxins8020032 (2016).
Klein, A., Steffan-Dewenter, I. & Tscharntke, T. Foraging trip duration and density of megachilid bees, eumenid wasps and pompilid wasps in tropical agroforestry systems. Journal of Animal Ecology 73, 517–525, https://doi.org/10.1111/j.0021-8790.2004.00826.x (2004).
Dang, H. & Nguyen, L. Nesting biology of the potter wasp Rhynchium brunneum brunneum (Fabricius, 1793) (Hymenoptera: Vespidae: Eumeninae) in North Vietnam. Journal of Asia-Pacific Entomology 22, https://doi.org/10.1016/j.aspen.2019.02.003 (2019).
Carpenter, J. M. The phylogenetic relationships and natural classification of the Vespoidea (Hymenoptera). Systematic Entomology 7, 11–38, https://doi.org/10.1111/j.1365-3113.1982.tb00124.x (1982).
Hermes, M. G., Melo, G. A. R. & Carpenter, J. M. The higher-level phylogenetic relationships of the Eumeninae (Insecta, Hymenoptera, Vespidae), with emphasis on Eumenes sensu lato. Cladistics 30, 453–484, https://doi.org/10.1111/cla.12059 (2014).
Li, T., Barthelemy, C. & Carpenter, J. The Eumeninae (Hymenoptera, Vespidae) of Hong Kong (China), with description of two new species, two new synonymies and a key to the known taxa. Journal of Hymenoptera Research 72, 127–176, https://doi.org/10.3897/jhr.72.37691 (2019).
Brozoski, F., de Lima, V. A., Ferrari, R. R. & Buschini, M. L. T. Nesting Biology of the Potter Wasp Ancistrocerus flavomarginatus (Hymenoptera, Vespidae, Eumeninae) Revealed by Trap-Nest Experiments in Southern Brazil. Neotropical Entomology 52, 11–23, https://doi.org/10.1007/s13744-022-01004-2 (2023).
West-Eberhard, M. Behavior of the primitively social wasp Montezumia cortesioides Willink (Vespidae Eumeninae) and the origins of vespid sociality. Ethology Ecology & Evolution 17, 201–215, https://doi.org/10.1080/08927014.2005.9522592 (2005).
Hermes, M., Somavilla, A. & Garcete Barrett, B. R. On the nesting biology of Pirhosigma Giordani Soika (Hymenoptera, Vespidae, Eumeninae), with special reference to the use of vegetable matter. Revista Brasileira de Entomologia 57, 433–436, https://doi.org/10.1590/S0085-56262013005000044 (2013).
Kelstrup, H. C., West-Eberhard, M. J., Nascimento, F., Riddiford, L. & Hartfelder, K. Behavior, ovarian status, and juvenile hormone titer in the emblematic social wasp Zethus miniatus (Vespidae, Eumeninae). Behavioral Ecology and Sociobiology 77, https://doi.org/10.1007/s00265-023-03334-6 (2023).
Peng, Y.-L., He, S.-L., Chen, B. & Li, T.-J. An Integrative Phylogenetic Analysis of the Genus Rhynchium Spinola (Hymenoptera: Vespidae: Eumeninae) from China Based on Morphology, Genomic Data and Geographical Distribution. Insects 16, https://doi.org/10.3390/insects16020217 (2025).
Fateryga, A. & Amolin, A. Nesting and biology of Jucancistrocerus caspicus (Hymenoptera, Vespidae, Eumeninae). Entomological Review 94, 73–78, https://doi.org/10.1134/S0013873814010084 (2014).
Dolezel, J. & Bartos, J. Plant DNA flow cytometry and estimation of nuclear genome size. Annals of Botany 95, 99–110, https://doi.org/10.1093/aob/mci005 (2005).
Dolezel, J., Greilhuber, J. & Suda, J. Estimation of nuclear DNA content in plants using flow cytometry. Nature Protocols 2, 2233–2244, https://doi.org/10.1038/nprot.2007.310 (2007).
Ranallo-Benavidez, T. R., Jaron, K. S. & Schatz, M. C. GenomeScope 2.0 and Smudgeplot for reference-free profiling of polyploid genomes. Nature Communications 11, 1432, https://doi.org/10.1038/s41467-020-14998-3 (2020).
Cowan, D. P. Sibling matings in a hunting wasp: adaptive inbreeding? Science 205, 1403–1405, https://doi.org/10.1126/science.205.4413.1403 (1979).
Chapman, T. & C Stewart, S. Extremely high levels of inbreeding in a natural population of the free-living wasp Ancistrocerus antilope (Hymenoptera: Vespidae: Eumeninae). Heredity 76, https://doi.org/10.1038/hdy.1996.8 (1996).
Aron, S., de Menten, L., Van Bockstaele, D. R., Blank, S. M. & Roisin, Y. When Hymenopteran Males Reinvented Diploidy. Current Biology 15, 824–827, https://doi.org/10.1016/j.cub.2005.03.017 (2005).
Cheng, H., Concepcion, G. T., Feng, X., Zhang, H. & Li, H. Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm. Nature Methods 18, 170–175, https://doi.org/10.1038/s41592-020-01056-5 (2021).
Astashyn, A. et al. Rapid and sensitive detection of genome contamination at scale with FCS-GX. Genome Biology 25, 60, https://doi.org/10.1186/s13059-024-03198-7 (2024).
Guan, D. et al. Identifying and removing haplotypic duplication in primary genome assemblies. Bioinformatics 36, 2896–2898, https://doi.org/10.1093/bioinformatics/btaa025 (2020).
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760, https://doi.org/10.1093/bioinformatics/btp324 (2009).
Zhou, C., McCarthy, S. A. & Durbin, R. YaHS: yet another Hi-C scaffolding tool. Bioinformatics 39, https://doi.org/10.1093/bioinformatics/btac808 (2023).
Dudchenko, O. et al. De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds. Science 356, 92–95, https://doi.org/10.1126/science.aal3327 (2017).
Simão, F. A., Waterhouse, R. M., Ioannidis, P., Kriventseva, E. V. & Zdobnov, E. M. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31, 3210–3212, https://doi.org/10.1093/bioinformatics/btv351 (2015).
Mei, Y. et al. InsectBase 2.0: a comprehensive gene resource for insects. Nucleic Acids Research 50, D1040–d1045, https://doi.org/10.1093/nar/gkab1090 (2022).
Pflug, J. M., Holmes, V. R., Burrus, C., Johnston, J. S. & Maddison, D. R. Measuring Genome Sizes Using Read-Depth, k-mers, and Flow Cytometry: Methodological Comparisons in Beetles (Coleoptera). G3: Genes, Genomes, Genetics 10, 3047–3060, https://doi.org/10.1534/g3.120.401028 (2020).
He, K., Lin, K., Wang, G. & Li, F. Genome Sizes of Nine Insect Species Determined by Flow Cytometry and k-mer Analysis. Frontiers In Physiology 7, 569, https://doi.org/10.3389/fphys.2016.00569 (2016).
Nandakumar, S., Grushko, O. & Buttitta, L. A. Polyploidy in the adult Drosophila brain. Elife 9, https://doi.org/10.7554/eLife.54385 (2020).
Ren, D., Song, J., Ni, M., Kang, L. & Guo, W. Regulatory Mechanisms of Cell Polyploidy in Insects. Frontiers in Cell and Developmental Biology 8, https://doi.org/10.3389/fcell.2020.00361 (2020).
Rangel, J., Strauss, K., Seedorf, K., Hjelmen, C. E. & Johnston, J. S. Endopolyploidy changes with age-related polyethism in the honey bee, Apis mellifera. PLoS One 10, e0122208, https://doi.org/10.1371/journal.pone.0122208 (2015).
Flynn, J. M. et al. RepeatModeler2 for automated genomic discovery of transposable element families. Proceedings of the National Academy of Sciences 117, 9451–9457, https://doi.org/10.1073/pnas.1921046117 (2020).
Ou, S. & Jiang, N. LTR_FINDER_parallel: parallelization of LTR_FINDER enabling rapid identification of long terminal repeat retrotransposons. Mobile DNA 10, 48, https://doi.org/10.1186/s13100-019-0193-0 (2019).
Ellinghaus, D., Kurtz, S. & Willhoeft, U. LTRharvest, an efficient and flexible software for de novo detection of LTR retrotransposons. BMC Bioinformatics 9, 18, https://doi.org/10.1186/1471-2105-9-18 (2008).
Ou, S. & Jiang, N. LTR_retriever: A Highly Accurate and Sensitive Program for Identification of Long Terminal Repeat Retrotransposons. Plant Physiology 176, 1410–1422, https://doi.org/10.1104/pp.17.01310 (2018).
Gabriel, L. et al. BRAKER3: Fully automated genome annotation using RNA-seq and protein evidence with GeneMark-ETP, AUGUSTUS, and TSEBRA. Genome Research 34, 769–777, https://doi.org/10.1101/gr.278090.123 (2024).
Brůna, T., Lomsadze, A. & Borodovsky, M. GeneMark-ETP significantly improves the accuracy of automatic annotation of large eukaryotic genomes. Genome Research 34, 757–768, https://doi.org/10.1101/gr.278373.123 (2024).
Stanke, M. et al. AUGUSTUS: ab initio prediction of alternative transcripts. Nucleic Acids Research 34, W435–439, https://doi.org/10.1093/nar/gkl200 (2006).
Gabriel, L., Hoff, K. J., Brůna, T., Borodovsky, M. & Stanke, M. TSEBRA: transcript selector for BRAKER. BMC Bioinformatics 22, 566, https://doi.org/10.1186/s12859-021-04482-0 (2021).
Kovaka, S. et al. Transcriptome assembly from long-read RNA-seq alignments with StringTie2. Genome Biology 20, 278, https://doi.org/10.1186/s13059-019-1910-1 (2019).
Kim, D., Paggi, J. M., Park, C., Bennett, C. & Salzberg, S. L. Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nature Biotechnology 37, 907–915, https://doi.org/10.1038/s41587-019-0201-4 (2019).
Tang, S., Lomsadze, A. & Borodovsky, M. Identification of protein coding regions in RNA transcripts. Nucleic Acids Research 43, e78, https://doi.org/10.1093/nar/gkv227 (2015).
Jones, P. et al. InterProScan 5: genome-scale protein function classification. Bioinformatics 30, 1236–1240, https://doi.org/10.1093/bioinformatics/btu031 (2014).
Huerta-Cepas, J. et al. Fast Genome-Wide Functional Annotation through Orthology Assignment by eggNOG-Mapper. Molecular Biology and Evolution 34, 2115–2122, https://doi.org/10.1093/molbev/msx148 (2017).
Huerta-Cepas, J. et al. eggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses. Nucleic Acids Research 47, D309–d314, https://doi.org/10.1093/nar/gky1085 (2019).
Emms, D. M. & Kelly, S. OrthoFinder: phylogenetic orthology inference for comparative genomics. Genome Biology 20, 238, https://doi.org/10.1186/s13059-019-1832-y (2019).
Katoh, K. & Standley, D. M. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Molecular Biology and Evolution 30, 772–780, https://doi.org/10.1093/molbev/mst010 (2013).
Capella-Gutiérrez, S., Silla-Martínez, J. M. & Gabaldón, T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics 25, 1972–1973, https://doi.org/10.1093/bioinformatics/btp348 (2009).
Kück, P. & Longo, G. C. FASconCAT-G: extensive functions for multiple sequence alignment preparations concerning phylogenetic studies. Frontiers in Zoology 11, 81, https://doi.org/10.1186/s12983-014-0081-x (2014).
Minh, B. Q. et al. IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era. Molecular Biology and Evolution 37, 1530–1534, https://doi.org/10.1093/molbev/msaa015 (2020).
Kalyaanamoorthy, S., Minh, B. Q., Wong, T. K. F., von Haeseler, A. & Jermiin, L. S. ModelFinder: fast model selection for accurate phylogenetic estimates. Nature Methods 14, 587–589, https://doi.org/10.1038/nmeth.4285 (2017).
Hoang, D. T., Chernomor, O., von Haeseler, A., Minh, B. Q. & Vinh, L. S. UFBoot2: Improving the Ultrafast Bootstrap Approximation. Molecular Biology and Evolution 35, 518–522, https://doi.org/10.1093/molbev/msx281 (2018).
Guindon, S. et al. New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Systematic Biology 59, 307–321, https://doi.org/10.1093/sysbio/syq010 (2010).
Yang, Z. PAML 4: phylogenetic analysis by maximum likelihood. Molecular Biology and Evolution 24, 1586–1591, https://doi.org/10.1093/molbev/msm088 (2007).
dos Reis, M. & Yang, Z. Approximate likelihood calculation on a phylogeny for Bayesian estimation of divergence times. Molecular Biology and Evolution 28, 2161–2172, https://doi.org/10.1093/molbev/msr045 (2011).
Han, M. V., Thomas, G. W., Lugo-Martinez, J. & Hahn, M. W. Estimating gene gain and loss rates in the presence of error in genome assembly and annotation using CAFE 3. Molecular Biology and Evolution 30, 1987–1997, https://doi.org/10.1093/molbev/mst100 (2013).
Yu, G., Wang, L., Han, Y. & He, Q. Clusterprofiler: An R Package for Comparing Biological Themes Among Gene Clusters. OMICS: A Journal of Integrative Biology 16, 284–287, https://doi.org/10.1089/omi.2011.0118 (2012).
Wang, Y. et al. MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Research 40, e49, https://doi.org/10.1093/nar/gkr1293 (2012).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR35603922 (2025).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR35603923 (2025).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR35603924 (2025).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR35603925 (2025).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR35603926 (2025).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR35603927 (2025).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR35603928 (2025).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR35603929 (2025).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR35603930 (2025).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR35603931 (2025).
Wang, J., He, S. L., Chen, B. & Li, T. J. Genbank https://identifiers.org/insdc.gca:GCA_055773155.1 (2026).
Wang, J. Chromosome-level genome assembly of Rhynchium brunneum (Fabricius, 1787) (Hymenoptera: Vespidae). figshare https://doi.org/10.6084/m9.figshare.30227221 (2025).
Acknowledgements
We are grateful to Chun-Lin He (Henan University of Science and Technology, Luoyang, China) for providing us with some specimens for this research. This study was funded by the Science & Technology Fundamental Resources Investigation Program (No. 2022FY202100) and the National Natural Science Foundation of China (No. 31772490, 31372247, 31000976).
Author information
Authors and Affiliations
Contributions
T.-J.L. and B.C. contributed to the research design. J.W. and S.-L.H. analyzed the data. J.W., S.-L.H. and T.-J.L. wrote the draft manuscript and revised the manuscript. All co-authors contributed to this manuscript and approved it.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Wang, J., He, S., Chen, B. et al. Chromosome-level genome assembly of Rhynchium brunneum (Fabricius, 1787) (Hymenoptera: Vespidae). Sci Data (2026). https://doi.org/10.1038/s41597-026-07168-5
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41597-026-07168-5