Introduction

Initially documented by Oyama1, Volva habei is a species affiliated with the Ovulidae family. Its key features encompass a smaller shell size compared to preceding species, relatively short anterior and posterior siphonal grooves, pronounced inflation in the central region, and a shell surface exhibiting a light pink or white hue. The siphonal grooves terminate without pigmentation, and the shell size ranges between 48 and 110 mm. This species predominantly resides in China, spanning from Zhejiang to Guangdong, and can also be found internationally, including regions such as Japan. It commonly inhabits shallow sandy seabeds within the subtidal zone.

Ovulidae (Fleming, 1822), commonly known as egg cowries, constitutes a widely distributed family of specialized carnivorous mollusks with a proclivity for grazing on the polyps and tissues of Anthozoa, particularly soft corals and sea fans. Predominantly found in tropical regions, with only a limited number of species inhabiting temperate waters, Ovulidae2 traces its origins back to the early Eocene3,4 and underwent a significant increase in diversity during the Pleistocene. This diversification was especially notable in the warm seas of the central Indo-West Pacific Region, currently boasting the highest documented specific diversity within the family. Despite a scarcity of detailed anatomical studies5, a close affinity with Cypraeidae, the ‘true’ cowries, has consistently been inferred, as indicated by prior works6. In the molecular systematics of this taxonomic family, Ovulidae is acknowledged as the group closest to Cypraeidae7,8. The taxonomic classification of Ovulidae has undergone multiple revisions9,10. Historically, based on shell morphology, the family has been dichotomized into two subfamilies (Schilder 1932): (1) Ovulinae, represented by genera such as Ovula, Primovula, Diminovula, Margovula, Calpurnus and Procalpurnus, exhibit pyriform, ovate and inflated shells. These shells typically have short extremities, a posterior canal turning leftward and well-developed teeth on the outer lip. (2) Volvinae, including genera like Volva, Simnia, Phenacovolva, Cyphoma and Neosimnia, showcase narrow fusiform shells with prominent extremities, and may lack or have poorly developed teeth.

The mitochondrial genome stands as a paradigm for genome evolution and serves as a powerful tool in elucidating evolutionary relationships11. Recognized as a valuable resource, the complete mitogenome has been pivotal in resolving phylogenetic relationships across diverse mollusk groups12,13,14,15,16. Owing to maternal inheritance, the absence of recombination and a high rate of base substitution, mitogenomes have found extensive application in phylogenetic analyses17,18,19. In contrast to gene fragments, mitogenomes encapsulate a wealth of information, demonstrating utility in recovering internal nodes with robust statistical support. This attribute has been widely applied in reconstructing phylogenetic relationships within various gastropod groups. A common characteristic of metazoan mitogenomes is the presence of a covalently circular molecule that encodes a total of 37 genes. These include 13 protein-coding genes (PCGs), two ribosomal RNA (rRNA) genes and 22 transfer RNA (tRNA) genes. Despite exceptions reported20,21, the mitogenome, when compared to single gene fragments, yields a substantial amount of variant information. This information has played a crucial role in species identification13,22 and phylogenetic reconstruction14,23 within Mollusca. Recent technological advancements, such as rolling circle amplification24,25 and next-generation sequencing technologies26, coupled with decreasing costs of DNA sequencing, have standardized the amplification and sequencing of entire mt genomes.

This investigation seeks to enhance taxonomic representation within the Ovulidae family by sequencing the complete mitogenome of V. habei. Moreover, a comprehensive examination of the mitogenome of V. habei was conducted, exploring intricate genomic details and evolutionary dynamics. This investigation encompassed aspects such as gene order, nucleotide composition, codon utilization, and the secondary structure of tRNA. The precise positioning of V. habei within the phylogenetic framework was discerned through a thoroughgoing phylogenetic analysis of the Littorinimorpha. In order to attain a more profound comprehension of the evolutionary interconnections within the Littorinimorpha, the gene arrangements of mitogenomes across the entire Littorinimorpha family were seamlessly integrated. Moreover, the recently acquired mitogenome was assimilated into the Ovulidae family dataset, aimed at resolving contentious phylogenetic relationships associated with the family. The phylogenetic tree of V. habei carries noteworthy implications for comprehensive genomic and systematic inquiries at elevated taxonomic strata, particularly given the prevailing uncertainties surrounding deep-seated relationships within the Ovulidae family.

Result

Genome structure and composition

The complete mitogenome sequence of V. habei represents a typical closed-loop molecule with a length of 16,519 bp (GenBank accession number OR492307) (Fig. 1). This genome encompasses a total of 37 genes, comprising 13 PCGs, 22 tRNAs, two rRNAs (16 S rRNA and 12 S rRNA) and three concise non-coding regions. All these genes have been identified and are delineated in Fig. 1; Table 1. Among the 37 genes, the majority are located on the heavy strand, with only eight tRNAs (trnM, trnY, trnC, trnW, trnQ, trnG, trnE and trnT) situated on the light strand. Within the 13 PCGs, nad5 is the longest (1695 bp), while atp8 is the shortest (159 bp). The complete mitogenome of V. habei harbors eight overlapping regions, with two regions exceeding or equal to 10 bp in length, specifically cox2 and 16 S rRNA. Regions with fewer than 10 bp include trnW, trnG, 12 S rRNA, trnV, trnL and trnS. In these overlapping regions, two pairs of PCGs overlap (atp8 and atp6; nad4L and nad4). Additionally, the mitogenome of V. habei includes 27 intergenic regions spanning 952 bp, with lengths ranging from 1 to 232 bp (Table 1). Regarding nucleotide composition, the mitogenome of V. habei comprises A (27.97%), T (40.66%), G (17.82%) and C (13.54%), demonstrating a notable AT bias. The cumulative A + T (%) content of the mitogenome is 68.63%. In the selected complete mitogenome, the mitogenome of V. habei shows a negative AT skewness (-0.185) and a positive GC skewness (0.137), suggesting a prevalence of T and C compared to A and G (Table 2).

Fig. 1
figure 1

Maps of the mitochondrial genomes of Volva habei.

Table 1 Mitochondrial genome organization of Volva habei.
Table 2 Nucleotide contents of the coding regions of the mitochondrial genome of Volva Habei, indicating AT-, GC-skew ratios.

Transfer RNAs, ribosomal RNAs

Similar to widespread patterns observed in many invertebrate species27,28, the mitogenome of V. habei comprises 22 tRNA genes, with 14 encoded by the heavy strand and eight by the light strand. Throughout the entire mitogenome, the size of tRNA molecules ranges from 65 to 71 bp, resulting in a total length of 1478 bp for the 22 tRNA genes. Notably, both trnS and trnL exist in duplicate copies. In the mitogenome sequence of V. habei, the lengths of the two rRNA genes, 16 S rRNA and 12 S rRNA, are 1396 and 882 bp, respectively. They are positioned between trnV and two additional tRNA genes, trnL1 and trnE, with mutual separation facilitated by trnV (Fig. 1; Table 1). Furthermore, the tRNAs exhibit a distinct AT bias (67.12%). The values for AT skewness and GC skewness are recorded as 0.014 and 0.193, respectively, indicating a slight preference for adenine and a pronounced preference for guanine (Table 2). After predicting the secondary structure of the 22 tRNA genes of V. habei, it was observed that trnS2 cannot form the typical cloverleaf structure due to the absence of dihydrouracil (DHU) arms. This structural variation in trnS2 is consistent with similar observations in mitogenomes of other invertebrates29,30,31. In contrast, other tRNAs have the capability to fold into the conventional cloverleaf structure. Additionally, all tRNAs exhibit G-C mismatches. On the Acceptor stem, U-U mismatches are observed in trnI due to C-T transitions, and C-U mismatches are observed in trnC due to A-C transitions (Fig. 2).

Fig. 2
figure 2

Secondary structure of the tRNA genes in the mitogenome of Volva habei.

PCGs and codon usage

The results of the start and stop codons for all PCGs in V. habei are presented in Table 2. The mitogenome of V. habei comprises a total of 13 PCGs, including one cytochrome b (cob), two ATPases (atp6 and atp8), three cytochrome oxidases (cox1-3), and seven NADH dehydrogenases (nad16 and nad4l). This configuration aligns with established structural patterns observed in the Cypraeoidea27. The total length of these 13 PCGs is 11,025 bp. Within this set, individual PCGs vary in length from 159 to 1695 bp. Notably, the average A + T content is 67.05%, with a range from 63.84% (cox3) to 72.95% (atp8). The AT skewness and GC skewness values are calculated as -0.260 and 0.129, respectively (Table 2). Most PCGs initiate with the start codon ATG, except for nad2, nad5 and nad6, which use ATT, and nad1, which uses ATA as their start codon. Additionally, the predominant stop codons are TAG or TAA (Table 1). In V. habei, Phe is the most frequently occurring amino acid, while Gln is the least common (Fig. 3). The relative synonymous codon usage (RSCU) values are detailed in Table 4; Fig. 4. Among these, UUA (Leu) is the most prevalent codon, while CUG (Leu) is the least common (Fig. 4; Table 3).

Fig. 3
figure 3

Amino acid composition of Volva habei mitochondrial genomes.

Fig. 4
figure 4

Relative synonymous codon usage (RSCU) in the mitogenomes of Volva habei.

Table 3 Relative synonymous codon usage (RSCU) in the mitogenomes of Volva habei.
Table 4 List of species of Littorinimorpha analysed in this study and their GenBank accession numbers.

Gene re-arrangement

Mitogenome rearrangements in gene order provide an independent dataset for resolving evolutionary relationships. Patterns of mitochondrial gene order rearrangements shared among different taxonomic groups may indicate shared ancestral relationships rather than products of shared evolution32,33. In comparison to the ancestral gastropod gene arrangement, there are notable rearrangements in the mitogenome of V. habei. As illustrated in Fig. 5, at least three gene clusters (or genes) differ markedly from the conventional arrangement, including 15 tRNA genes (M, Y, C, W, Q, G, E, V, L, L, P, S, H, F and T) and eight PCGs (16 S rRNA, 12 S rRNA, nad1, nad6, cob, nad4l, nad4 and nad5). The rearrangements of these three gene clusters (or genes) are detailed below (Fig. 5): (1) The M-Y-C-W-Q-G-E cluster has been relocated downstream of atp6; (2) The T cluster has shifted downstream of F; (3) A rearrangement and translocation have occurred in the F-nad5-H-nad4-nad4l-S-cob-nad6-P-nad1-L-16 S-V-12 S cluster.

Fig. 5
figure 5

Comparison of mitochondrial gene rearrangements of the Volva habei. The blue squares represent PCGs, the green squares represent tRNAs, and the orange squares represent rRNAs.

Phylogenetic relationships

The comparative analysis of individual genes or gene segments provides a more comprehensive understanding of genomic-level features, including genetic resources, molecular evolution, genome evolution, and phylogenetic relationships. In this study, we conducted a phylogenetic analysis using the sequences of 13 PCGs to gain insights into the relationships within the Littorinimorpha clade, with a particular focus on V. habei. Additionally, we included 66 other Littorinimorpha species in the analysis, using A. heterodon and A. varicosa as outgroups. Maximum Likelihood (ML) and Bayesian Inference (BI) methods were employed to construct two phylogenetic trees with identical topologies but different support values. The topology based on the support values from the maximum likelihood tree was selected for presentation and display (Fig. 6).

Fig. 6
figure 6

The phylogenetic tree was inferred from the nucleotide sequences of 13 mitogenome PCGs using BI and ML methods. Numbers on branches indicate posterior probability (BI) and bootstrap support (ML).

Within the phylogenetic tree encompassing the 20 families, each family formed a monophyletic group, supported by high confidence values. Phylogenetic analysis revealed relationships within nine superfamilies in Littorinimorpha as follows: (((Truncatelloidea + Rissooidea) + (Littorinoidea + Naticoidea)) + ((Stromboidea + Xenophoroidea) + (Tonnoidea + Cypraeoidea)) + Vermetoidea). All nine were identified as monophyletic groups, and some previous studies have supported this notion34,35. The phylogenetic tree indicates that Cypraeidae forms a clade, the Ovulidae family, where V. habei belongs, constitutes another clade, and the families Ovulidae and Cypraeidae are sister groups, collectively forming the Cypraeoidea.

Discussion

In comparison to previously reported mitogenomes of other Cypraeoidea members36,37, the sequenced mitogenome in this study exhibits highly similar patterns in terms of genome size, nucleotide composition, and AT content. Investigation into the nucleotide composition of the V. habei mitogenome reveals a distinct AT bias within its genomic structure. This feature is reminiscent of the mitochondrial composition observed in the majority of bivalve mollusks and is a shared characteristic across various animal taxa, including fish and crustaceans38,39. V. habei also exhibits overlapping adjacent genes, a phenomenon commonly observed in molluscan mitogenomes40,41,42. All thirteen protein-coding genes in the V. habei mitogenome are encoded on the heavy strand, aligning with a pattern observed in certain gastropod mollusks, such as Nassariids43, which also encode proteins on the heavy strand. V. habei possesses a total of thirteen intact PCGs, consistent with the quantity and types of PCGs found in the majority of gastropod mollusks. This distinguishes it from some bivalve mollusks, such as true oysters, scallops, and other bivalves, which only possess twelve PCGs44.

In the context of gene rearrangement patterns, three main types are typically identified45: (1) Shuffle, where genes move from their original positions to adjacent positions on the same strand, often without crossing PCGs; (2) Transposition, where genes cross multiple genes, often including protein-coding genes, moving from their original positions to new locations; (3) Inversion, involving genes switching from one strand to the other. Based on the features of mitochondrial sequences, our analysis suggests that inversion and transposition are the primary rearrangement types observed in V. habei. However, mitochondrial gene rearrangements are often associated with an elevated rate of evolution46. Previous studies have found a significant positive correlation between the rate of gene order rearrangement in mitogenomes and an accelerated rate of evolution47. Interestingly, in contrast to the extensive gene rearrangements observed in Lottiidae, the differences in gene order within Littorinimorpha are relatively minor48, excluding Vermetoidea, Truncatelloidea and Stromboidea. We speculate that this could be attributed to the relatively small variations in genome size among Littorinimorpha species, ranging from 15,078 bp to 17,504 bp (Table 1), compared to the mitogenome size in Lottiidae, which spans from 16,319 bp to 26,835 bp. Further research is needed to explore this correlation in a broader range of gastropod taxa. Additionally, we compared the gene order in V. habei with other superfamilies within Littorinimorpha. Except for Vermetoidea, Truncatelloidea, and Stromboidea, the gene order in other superfamilies remains relatively consistent. Notably, in Truncatelloidea, the species Helicostoa sinensis and in Stromboidea, the species Ministrombus variabilis exhibit the absence of trnF in their gene orders. In Vermetoidea, deviations from the conserved order primarily involve tRNAs. Furthermore, the M-Y-C-W-Q-G-E cluster in the mitogenome of Vermetoidea undergoes inversion, a phenomenon observed in other gastropod mitogenomes12, resulting in the disruption and rearrangement of the cluster. Interestingly, a closely related set of genes experienced rearrangement in the common ancestor of Caenogastropoda, although the integrity of the M-Y-C-W-Q-G-E cluster was maintained40,49,50. These findings align with conclusions drawn from phylogenetic analyses based on gene order, emphasizing the value of comparing mitochondrial gene rearrangements as a tool in systematic studies.

The results of the phylogenetic analysis indicate that Naticoidea and Littorinoidea are the closest sister groups, consistent with findings by Santos51. Vermetoidea is positioned at the base of the monophyletic Littorinimorpha, aligning with previous research50. This placement may also be linked to results of gene rearrangement, as the gene sequences of Vermetoidea significantly differ from those of other species, leading to its basal position in the phylogenetic tree.

Stromboidea, encompassing the highest number of families, constitutes a highly diverse group. Currently recognized to include six extant families: Aporrhaidae, Rostellariidae, Seraphsidae, Strombidae, Struthiolariidae and Xenophoridae. At the family level, each family’s species forms a distinct clade. Strombidae, Seraphsidae, and Rostellariidae exhibit close phylogenetic relationships, while Aporrhaidae, Struthiolariidae and Xenophoridae form another closely related group. This is consistent with the findings of Irwin52, who suggested that in mitochondrial gene analysis, combined with nuclear gene analysis, Xenophoridae resolves as the sister group with the highest support to Struthiolariidae + Aporrhaidae. The positions of Rostellariidae and Seraphsidae vary in different analyses. Prokaryotic analyses often recover Rostellariidae as the sister group to Strombidae, while Bayesian inference analysis places it as the sister group to Seraphsidae. Only nuclear gene analysis recovers Seraphsidae as the sister group to Strombidae, with varying support levels52.

Previous studies have established Tonnoidea as a monophyletic group, identifying Ficoidea as its sister group. Our study not only reaffirms the monophyly of Tonnoidea but also uncovers that Tonnoidea is the sister group to Cypraeoidea. Although this study does not include species from the superfamily Ficoidea, we hypothesize, based on existing data, that Ficoidea species may also share a sister group relationship with Cypraeoidea, similar to Tonnoidea53,54,55. In 1973, Cate classified the Ovulidae within the Cypraeoidea taxonomy, a classification that is further substantiated by the phylogenetic analysis results obtained in this study10. Due to the faster evolutionary rate of Ovulidae compared to other gastropod groups, they exhibit high phenotypic plasticity56,57,58,59, making classification based solely on morphology more challenging. Recent studies have utilized cox1 and 16 S rRNA to investigate the phylogeny of Ovulidae, revealing the ability of both markers to distinguish specimens at the species level. This led to the discovery of synonymy among Ovulidae species58,59,60,61 employed barcode technology to analyze sequences of over 2000 individuals from 263 taxonomic groups within Cypraeidae, achieving a 98% accuracy in identifying Evolutionarily Significant Units (ESUs) using neighbor-joining methods. While some have discussed the limitations of mitogenomics for inferring deep-sea mollusk relationships62,63, the presented phylogenetic tree offers a robust framework, enhancing our understanding of Caenogastropoda relationships.

Conclusion

In this investigation, we employed next-generation sequencing technology to meticulously sequence the mitogenome of V. habei, with the goal of obtaining pertinent mitochondrial data for Ovulidae. An extensive analysis of the mitogenome of V. habei unveiled considerable similarity with other members of Littorinimorpha, marked by distinctive features such as AT bias and specific preferences in codon usage. Through comparative analysis with ancestral gastropods, notable rearrangements in the gene order of the mitogenome of V. habei were identified. Meanwhile, Littorinimorpha exhibited five distinct rearrangement patterns. The consistent similarity in these rearrangements underscores their phylogenetic relationships. Our phylogenetic tree, when compared to previous studies, demonstrates both consistency and some variations. Phylogenetic analysis indicates that V. habei forms a distinct Calyptraeoidea clade, while (((Truncatelloidea + Rissooidea) + (Littorinoidea + Naticoidea)) + ((Stromboidea + Xenophoroidea) + (Tonnoidea + Cypraeoidea)) + Vermetoidea) constitutes a separate clade. Despite the limitation in the number of species, our phylogenetic tree garners statistical support, providing a rational framework for future phylogenetic studies in the Calyptraeoidea field. These findings not only contribute insights into the gene arrangement characteristics in the mitogenome of Littorinimorpha but also establish the foundation for a more profound exploration of the phylogeny within Littorinimorpha. The study sheds light on features of mollusk mitogenomes, advancing our understanding of the phylogeny within the gastropod order, crucial for further species identification research and the implementation of effective management and control strategies.

Materials and methods

Sampling and DNA extraction

The specimens were meticulously collected in Zhejiang, adhering to strict sampling protocols. After collection, they were immersed in ethanol and preserved at -20 °C. Expert taxonomists at the Marine Biological Museum of Zhejiang Ocean University conducted a preliminary morphological assessment to authenticate their taxonomic classification. To achieve taxonomic precision, complete genomic DNA was carefully extracted from muscle tissues using the salt-extraction method and stored at -20 °C64, laying the foundation for subsequent in-depth analyses in mitochondrial whole-genome research.

Genome sequencing, assembly and annotation

The mitogenomes of V. habei were meticulously sequenced by Origin Gene Co. Ltd. in Shanghai, China, leveraging the advanced Illumina HiSeq X Ten sequencing platform. Following a protocol akin to that employed for V. habei, HiSeq X Ten libraries were meticulously crafted, featuring insert sizes spanning 300 to 500 bp, sourced from genomic DNA samples, resulting in an impressive 10 gb of raw data. Stringent preprocessing procedures were implemented, encompassing the exclusion of low-quality reads, adapters, sequences containing a notable proportion of ambiguous bases (“N” bases), and those with a length below 25 bp.The assembly process was executed with the aid of the sophisticated NOVOPlasty software65, accessible at https://github.com/ndierckx/NOVOPlasty. Annotation and manual refinement of the assembly were conducted with meticulous attention to established mitogenome datasets. De novo assembled mitogenomes were generated using the powerful MITOS tools66, accessed through the MITOS Web Server at uni-leipzig.de. Sequence accuracy was systematically validated through alignment with mitochondrial genes from other Muricidae species, further confirmed by employing the COI barcode sequence and conducting NCBI BLAST searches67.

High-throughput sequencing reads underwent de novo assembly, and the comprehensive annotation of the mitogenomes was meticulously performed using Sequin version 16.0. The intricate landscape of the V. habei mitogenome was vividly portrayed using the web-based tool Poksee, accessible at https://proksee.ca68, enhancing our understanding of its structural organization.The tertiary structures of tRNA genes were anticipated and visually presented through the sophisticated MITOS Web Server. In-depth insights into coding sequence characteristics were garnered through the computation of relative synonymous codon usage (RSCU) values, and an evaluation of substitution saturation for the 13 PCGs was executed using DAMBE 5. Subsequent analysis and interpretation of these values were meticulously conducted using MEGA 769. Furthermore, a nuanced exploration of base compositional disparities and strand asymmetry among samples was undertaken, involving the calculation of GC-skews and AT-skews. These parameters were intricately determined using the following formulas: AT-skew=[A − T]/[A + T] and GC skew=[G − C]/[G + C]. The extent of substitution saturation for the 13 PCGs was quantified with precision using DAMBE 570. This comprehensive analytical approach provides a robust foundation for understanding the intricacies of the V. habei mitogenome.

Gene order analysis

Beyond the mitogenomes subjected to sequencing in this investigation, an additional set of 66 complete mitogenomes belonging to the Littorinimorpha was procured from GenBank (Table 4) to facilitate comparative analyses. This expansive dataset underwent meticulous scrutiny, as all 66 mitogenomes were juxtaposed against the ancestral Gastropoda gene arrangement. The aim was to pinpoint potential novel gene orders that had not been documented previously. To mitigate the risk of misinterpretation arising from potential misannotations, any mitogenomes within the Littorinimorpha exhibiting deviations from the ancestral pattern underwent a comprehensive reannotation process employing MITOS66. This rigorous approach ensures the accuracy and reliability of observed gene order differences, thereby enhancing the robustness of our comparative analyses.

Phylogenetic analysis

The study aimed to investigate the phylogenetic relationships within Littorinimorpha using 13 PCGs. A dataset of 69 mitogenome sequences was compiled from GenBank, including Alasmidonta heterodon and Alasmidonta varicosa as outgroups. Both ML and BI methods were applied71,72,73. IQ-TREE 1.6.2 with 1000 bootstrap replicates identified the optimal TIM2 + F + R10 model based on Bayesian Information Criterion (BIC). Interdisciplinary associations were established using MrMTgui74, integrating PAUP, ModelTest, and MrModelTest 2.3. Bayesian phylogenetic analyses in MrBayes 3.2 used the GTR + I + G model chosen by Akaike Information Criterion (AIC). Markov Chain Monte Carlo (MCMC) chains ran for 2,000,000 generations, sampling every 1,000 steps, with a 25% burn-in. Convergence was assessed by mean standard deviation of split frequencies (< 0.01). Resulting phylogenetic trees, revealing nuanced Littorinimorpha relationships, were visualized and refined using Figure Tree v.1.4.375. This analytical framework enhances understanding of evolutionary dynamics in the studied taxa.