Unveiling the mitochondrial genome of Salvia splendens insights into the evolutionary traits within the genus Salvia

Yang, Heyu; Ni, Yang; Li, Jingling; Chen, Haimei; Liu, Chang

doi:10.1038/s41598-025-96637-9

Download PDF

Article
Open access
Published: 17 April 2025

Unveiling the mitochondrial genome of Salvia splendens insights into the evolutionary traits within the genus Salvia

Heyu Yang¹,
Yang Ni¹,
Jingling Li¹,
Haimei Chen¹ &
…
Chang Liu ORCID: orcid.org/0000-0003-3879-7302¹

Scientific Reports volume 15, Article number: 13344 (2025) Cite this article

2004 Accesses
Metrics details

Subjects

Abstract

Previously, we resolved the complete sequences of the mitochondrial genomes (mitogenome) of two Salvia species (S. miltiorrhiza and S. officinalis). The major configurations of these two species were two circular chromosomes. In this study, we further studied the mitogenome of a floral species of Salvia (Salvia splendens) to understand the diversity and evolution of the Salvia mitogenomes. We sequenced the total DNAs of S. splendens using the Nanopore and Illumina platforms and assembled the mitogenome using a hybrid assembly strategy. The major configurations of the S. splendens were two circular chromosomes with lengths of 182,239 and 165,055 bp. There were 32 protein-coding genes (PCGs), three rRNA genes, and 18 tRNA genes annotated in the S. splendens mitogenome. We found 56 pairs of repetitive sequences in the S. splendens mitogenome. Three of them (R01, 04, and 07) could mediate recombination, whose products could be identified by the mapping of Nanopore reads, PCR amplifications, and Sanger sequencing of the PCR products. 457 RNA editing sites were identified in the S. splendens mitochondrial RNAs when comparing the RNA-seq data with their corresponding DNA templates. We showed that S. splendens was a sister taxon to S. miltiorrhiza based on the mitogenomes, consistent with the phylogeny determined with the plastome sequences. Crucially, we developed 12 mitochondrial markers sourced from mitochondrial intron regions to facilitate the identification of three Salvia species. Our study offers a comprehensive view of the structure of the Salvia mitogenomes and provides robust mitochondrial markers for Salvia species identification.

Insights into structure, codon usage, repeats, and RNA editing of the complete mitochondrial genome of Perilla frutescens (Lamiaceae)

Article Open access 17 June 2024

Assembly and analysis of the complete mitochondrial genome of Carya illinoinensis to provide insights into the conserved sequences of tRNA genes

Article Open access 19 November 2024

The characterisation of the complete mitochondrial genome of Polygonatum Kingianum reveals recombination mediated by repeats associated with DNA replication

Article Open access 06 June 2025

Introduction

Salvia splendens Ker-Gawl, also called scarlet sage, is a perennial herbaceous plant from the Lamiaceae family. It was native to Brazil and was popular as a bedding plant all over the world. The phenolic metabolites and flavone triglycoside from the methanol extract of S. splendens leaves were shown to have significant hypoglycemic, anti-inflammatory effects, and in vitro antioxidant activity¹. The extracts of S. splendens were found able to phytosynthesize AgNPs². Moreover, the AgNPs derived from S. splendens had cytotoxicity against human lung cancer cell line A549³. Furthermore, the diterpenes found in S. splendens may be useful as neuropharmacological agents⁴.

Apart from these pharmacophylogenetic and synthetic studies on S. splendens, the genome resources of S. splendens have also been reported. Two versions of nuclear genome assemblies were published with genome sizes of 808 and 807 Mb, respectively^5,6. The plastid genome (plastome) of S. splendens was also characterized by a length of 150,604 bp^7,8. During the progress of this study, a mitochondrial genome sequence of S. splendens was released in the public database (GenBank: PNBA02000024.1). However, the genome is not even annotated, and we have not found any corresponding publication. As a result, the mitochondrial genome of S. splendens remains to be analyzed.

Mitochondria produce cellular ATP through oxidative phosphorylation in animal, fungi, and plant cells. The plant mitogenomes were unique compared to their counterparts of animals or fungi⁹. The size of plant mitogenomes ranged from 66 kb for Viscum scurruloideum to 11,000 kb for Silene conica, larger than those of animals or fungi^10,11. Furthermore, many structural rearrangements were found in the plant mitogenomes resulting from intra- or inter-molecular homologous recombination mediated by repetitive sequences^12,13. Specifically, the plant mitogenome was found to have complex gene expression regulation mechanisms demonstrated by cis- and trans-splicing, and RNA editing events^10,14. While plant mitochondrial DNA (mtDNA) exhibits a slower nucleotide substitution rate¹⁵, mitochondrial markers have been developed for both phylogenetic analysis and species identification. These markers are derived from the less conserved regions of the mitochondrial genomes, including intron and intergenic sequences^16,17.

By March 2023, the complete mitogenome sequences of 589 plant species have been deposited in the GenBank (https://www.ncbi.nlm.nih.gov/genome/browse/#!/organelles/). Structural differences and size variations were observed among the mitogenomes of these plant species as previously described^18,19. Most genes were conserved except those lost in the mitogenomes and transferred to the nuclear genomes^20,21.

Previously, we determined the complete mitogenome sequences of two Salvia species S. miltiorrhiza²² and S. officinalis²³. To enhance our understanding of the structural differences, size variations, and divergence of gene contents and orders of Salvia mitogenomes, we here to study the mitogenome of S. splendens. We sequenced the S. splendens total DNAs using the Illumina and Nanopore sequencing platforms and assembled the S. splendens mitogenome using a hybrid strategy. The genome structure, gene contents, extent of RNA editing events, and repeat-mediated homologous recombination were compared among these three Salvia species, which shed light on the mechanisms driving the evolution of Salvia mitogenomes. Furthermore, we have identified 12 mitochondrial markers derived from intron regions of the mitochondrial genome. These polymorphic markers serve as reliable tools for distinguishing the three Salvia species.

Materials and methods

Plant materials and nucleic acid extraction

Young and fresh leaves of S. splendens were collected from a cultivated line (ss01) from Songjiang, Shanghai city (30^◦56′49.5″N, 121^◦15′23.3″E), China. S. splendens is neither an endangered nor protected species. The permission to collect S. splendens has been obtained through the Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences and Peking Union Medical College. We deposited the voucher specimen under the accession number ss-vs01 at the Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences, Peking Union Medical College, Beijing, China. We stored the leaves of S. splendens at −80 °C until use. The total DNA and RNA of 0.1 g leaves per sample were extracted using a plant genomic DNA kit (Tiangen Biotech, Beijing, Co., Ltd.) and an RNAprep Pure Plan Kit (Tiangen Biotech Beijing CO., LTD), respectively. We assessed the purity of DNA and RNA with electrophoresis using the 1.5% agarose gel and quantified the DNA and RNA concentration using the Nanodrop spectrophotometer 2000 (Thermo Fisher Scientific, America).

DNA sequencing and mitogenome assembly

We generated the DNA sequencing data by Oxford Nanopore and Illumina sequencing technologies. In Nanopore sequencing, the DNA yield of ss01 was evaluated using a Qubit instrument, and DNA quality was evaluated via a Nanodrop instrument. Further DNA fragmentation and selection of DNA fragments with the size of 10–50 kb were conducted using the Blue-pippin system (Sage Science, MA, USA). For sequencing library preparation, we used the 1D ligation kit (SQK-LSK109) and then performed the sequencing on a PromethION Flow Cell (Oxford Nanopore Technologies, UK). Base calling was conducted using Guppy v3.2.2 18 with default parameters. The high-quality reads (Q-score ≥ 7) were obtained for the mitogenome assembly.

For Illumina sequencing, the sequencing libraries were prepared using a Truseq Nano DNA HT Sample Preparation Kit (Illumina) following the manufacturer’s instructions. Briefly, the DNA sample of the same line (ss-01) was fragmented to an average length of 350 bp by sonication, end polished, and ligated with the adapter for PCR amplification. Further PCR amplicons were purified using the AMPure XP system (Beckman Coulter, CA, USA) and the size distribution of PCR amplicons was analyzed using an Agilent 2100 Bioanalyzer (Agilent Technologies, Palo Alto, CA, USA). Finally, libraries were sequenced on an Illumina NovaSeq 6000 platform and 150 bp paired-end reads were obtained. The above steps of sequencing were conducted at Grandomics Biotechnology Co., Ltd (Wuhan, China).

For RNA sequencing, the rRNAs in the total RNA were depleted via a Ribo-Zero™ Magnetic Kit (Epicenter, Madison, WI, USA). The rRNA-depleted sequencing library was constructed using a VAHTS Universal V8 RNA-seq Library Prep Kit (Vazyme, Nanjing, China) following the manufacturer’s recommendations. The library sequencing was performed on an Illumina HiSeq 2500 sequencer (2 × 150 bp reads) at Grandomics Biotechnology Co., Ltd.

We assembled the mitogenome of S. splendens using a hybrid assembly strategy. We first extended and achieved the mitochondrial reads from Illumina sequencing data using GetOrganelle (v1.6.4)²⁴. The extended reads were assembled into a unitig graph using the SPAdes packaged in the Unicycler (v0.4.9)²⁵. Then the double-bifurcation structures (DBSs) in the unitig graph were resolved by the long Nanopore reads using the Unicycler software.

To validate the assembly results of the unicycler, we obtained the DBS sequences and their flanking sequences of 1000 bp long. These two sequences were named configurations 1 and 2 (c1 and c2 in short). Their flanking sequences were then switched to mimic the homologous recombination products. The resulting two sequences were named configurations 3 and 4 (c3 and c4 in short). Then the nanopore long reads were mapped to these sequences using BWA (v0.7.12-r1039)²⁶ with default parameters. The number of reads mapped to each configuration was counted. Among the four configurations, the two configurations supported with more Nanopore long reads were selected in the final assembly of the S. splendens mitogenome.

Then we mapped the Nanopore and Illumina sequencing reads to the final assembly of the S. splendens mitogenome using BWA with default parameters. The coverage of every site in the mitogenome and the median coverage were calculated using the samtools (v1.3.1)²⁷.

The collinearity analysis of mitogenomes assembled in this study and the one released in GenBank (PNBA02000024.1) were performed using the nucmer module in the Mummer package (v3)^28,29. The sequences were aligned using the many-to-many model with an identity threshold of 85%. Then we visualized the collinearity results using the Rideogram package of R³⁰. The variable sites between the two assemblies were identified using BLASTN (v 2.10.1 +)³¹ and the positions of variable sites were detected using the blastn2snp tool from JVarkit (v.1.0)³². Moreover, the collinearity of S. splendens mitogenome and those of S. miltiorrhiza²² and S. officinalis²³ were performed using the method described above.

Mitogenome annotation

We annotated the final assembly of the S. splendens mitogenome using the MGA pipeline (http://www.1kmpg.cn/mga). The tRNA genes were found via tRNAscan-SE³³. The positions of the start and stop codons and intron/exon boundaries of each gene were manually checked using the Apollo program³⁴. We drew the circular map of the final assembly using PMGView (http://www.1kmpg.cn/pmgview). The sequences of the S. splendens mitogenome were deposited in the GenBank under the accession numbers OQ675154 and OQ675155.

Analysis of the repeat-mediated recombination in the mitogenome

To explore possible recombination events mediated by the repetitive sequences in the S. splendens mitogenome, we identified the repetitive sequences using ROUSfinder v2³⁵. We then extracted the 1000 bp-long flanking sequences of each repetitive sequence and constructed the sequences corresponding to the four configurations c1, c2, c3, and c4. The Nanopore sequencing data were mapped to these four configurations and the numbers of repeat-spanning reads for each configuration were calculated. The Integrative Genomics Viewer (IGV) software (v 2.15.1)³⁶ was used to visualize the mapping results.

We extracted the repetitive sequences and their 200 to 700 bp-long flanking sequences as template sequences. The primers were designed to amplify the s template sequences using the Primer-BLAST. PCR amplification was conducted in a reaction of 50 µl, including 23 µL water, 25 µL 2 × Taq PCR Master Mix, 1 µL of each primer, and 1 µL total DNA. We performed the PCR amplification on a Pro-Flex PCR system (Applied Biosystems, Waltham, MA, USA) under the following conditions: denaturation at 94 °C for 2 min, followed by 35 cycles of 94 °C for 30 s, 57 °C for 30 s, 72 °C for 60 s, and 72 °C for 2 min of the final extension. The long fragments of four configurations associated with R01 were amplified using KOD One™ PCR Master Mix (TOYOBO, Japan). All of the PCR amplicons were analyzed using electrophoresis on a 1.5% agarose gel. Amplicons of R04 and R07 of the expected size were sequenced using the Sanger sequencing methods.

Analysis of repetitive sequences

We used the MISA web service³⁷ to identify the SSRs in the S. splendens mitogenome with the thresholds of 10, 5, and 4 for the number of mono-, di-, trinucleotide repeat units, respectively, and 3 for the numbers of tetra-, penta-, and hexanucleotide repeat units. We detected the tandem repeats in the S. splendens mitogenome using the Tandem Repeats Finder (v 4.09) [70]. The parameters of the Tandem Repeats Finder were set as follows: 2 for matches, 7 for mismatches and indels, and 50 and 500 for the minimum alignment score and maximum period size, respectively.

Identification of mitochondrial plastid sequences (MTPTs)

We assembled the plastome of the S. splendens using GetOrganelle with the parameters: “-R 15 -k 21,45,65,85,105 -F embplant_pt”. The plastome sequence obtained in this study was similar to the plastome sequence (OM617847.1) reported in our previous study⁷. We determined the MTPTs of S. splendens by searching the homologous sequences between the plastome and the mitogenome using BLASTn (v 2.10.1 +) with the parameters of e-value < 1e-6 and word size of 7²². Hits less than 100 bp long and having less than 80% sequence identity to the query sequences were excluded. Then, the sequences consisting of MTPTs and their 2000 bp long flanking regions were extracted. The Nanopore long reads were mapped to these sequences and the mapping results were visualized using the Integrative Genomics Viewer (IGV) software (v 2.15.1)³⁶. All sequences of MTPTs were annotated using CPGAVAS2 (http://www.1kmpg.cn/cpgavas2) to identify the genes located in the MTPTs.

Phylogenetic analysis of the twelve Lamiales species based on common mitochondrial genes

To resolve the phylogenetic relationship of the Lamiales species, we obtained the whole mitogenome sequences of twelve Lamiales species and two outgroup species. The twelve Lamiales included Ajuga reptans (NC_023103.1), Rotheca serrata (NC_049064.1), Scutellaria tsinyunensis (MW553042.1), S. miltiorrhiza (MN_585275/6.1), S. officinalis (OQ001564 and OQ001565), S. splendens (OQ675154 and OQ675155), Erythranthe lutea (NC_018041.1), Castilleja paramensis (NC_031806.1), Utricularia reniformis (NC_034982.1), Dorcoceras hygrometricum (NC_016741.1), Osmanthus fragrans (NC_060346.1), and Hesperelaea palmeri (NC_031323.1). The two outgroup species were Nicotiana tabacum (NC_006581.1) and Solanum lycopersicum (NC_035963.1).

Then we extracted the CDS of 26 PCGs (atp1, atp4, atp6, atp8, atp9, ccmB, ccmC, ccmFc, ccmFn, cob, cox1, cox2, cox3, matR, mttB, nad1, nad2, nad3, nad4, nad4L, nad5, nad6, nad7, nad9, rps12, and rps13) shared among all fourteen species via PhyloSuite (v1.2.1)³⁸. We aligned these CDS with MAFFT (v7.450)³⁹ and concatenated the aligned results into a data matrix using PhyloSuite. We established the ML tree using the data matrix and RAxML (v8.2.4)⁴⁰ with the parameters “raxmlHPC-PTHREADS-SSE3 -f a -N 1000 -m PROTGAMMACPREV—× 551314260 -p 551314260 -o Nicotiana_tabacum, Solanum_lycopersicum -T 20.” We obtained the bootstrap support values of each branch in the ML tree after 1000 replicates. We also built the BI tree using MrBayes (v3.2.7)⁴¹ with the model TVM + I + G and the parameters calculated by jMdoleTest (v2.1.0)⁴². We visualized the two trees using the iTOL web server (https://itol.embl.de).

Moreover, we extracted the CDS of 56 PCGs (atpI, atpF, rps15, atpE, rpl22, rpl16, psbK, psbF, petD, rps3, petA, psaC, rpl14, clpP, rbcL, rps2, rps16, psbH, petL, atpA, rpoB, psbJ, petN, rpl20, rps11, ycf4, accD, rpl2, psbA, psbM, rps4, psbD, rpoC2, petG, matK, rpoA, petB, ycf1, rpl23, psbL, rps8, ycf3, psbC, psbN, rps18, ycf2, rps14, rpl33, atpH, psbE, rpl36, psaB, psbT, psaA, rps7, and rpoC1) shared among the chloroplast genomes of these fourteen species. The corresponding accession numbers are Ajuga reptans (NC_023102.1), Rotheca serrata (MN814867), Scutellaria tsinyunensis (NC_050161.1), S. miltiorrhiza (NC_023431.1), S. officinalis (NC_038165.1), S. splendens (NC_050901.1), Erythranthe lutea (NC_030212.1), Castilleja paramensis (NC_031805.1), Utricularia reniformis (NC_029719.1), Dorcoceras hygrometricum (NC_016468.1), Osmanthus fragrans (NC_042377.1), Hesperelaea palmeri (NC_025787.1), Nicotiana tabacum (NC_001879.1) and Solanum lycopersicum (NC_007898.1). The ML and BI trees were constructed and visualized using the same methods described above.

Detection of RNA editing sites

To determine potential RNA editing sites in the S. splendens mitogenome, we extracted the sequences for each CDS and their 100 bp long flanking sequences as the reference sequences. We mapped the strand-specific RNA-seq reads to these reference sequences using HISAT2 (v 2.2.1)⁴³. The parameters of the HISAT2 were “-rna-strandness RF -sensitive -no-mixed -no-discordant” as described previously⁴⁴. The RNA editing sites were identified using the REDItools (v 2.0)⁴⁵ with the parameters of coverage ≥ 5 and frequency ≥ 0.1⁴⁶. We further checked and visualized the mapping results using the IGV software (v 2.15.1)³⁶ with a minor variant frequency ≥ 0.1.

We identified the SNP sites in the CDS of S. splendens mitogenome using the same method described previously²³. Briefly, we mapped the Illumina sequencing data to the template sequences described above using BWA with default parameters. The SNP sites were determined using REDItools (v 2.0) with parameters of coverage ≥ 5 and frequency ≥ 0.1.

Development of genetic markers for interspecific identification based on mitochondrial introns

To develop the mitochondrial intron markers, we extracted the intron sequences of the mitochondrial genomes of S. splendens obtained in this study and S. miltiorrhiza²², S. officinalis²³ using a custom Python script. Alignments of intron sequences were carried out using ClustalW2 and interspecific polymorphic sites of each intron were identified using custom Python script. Subsequently, we extracted the polymorphic sites and their 300 bp long flanking regions, employing them as target regions for primer design. Detailed methods of primer design, PCR amplification, and Sanger sequencing are elucidated above. We conducted the PCR amplification using the genomic DNA samples of three different individuals as templates for each species.

Results

Analysis of the S. splendens mitogenome structure

We sequenced the S. splendens samples using the Nanopore and Illumina sequencing platforms. The statistical results of the sequencing data are shown in (Table S1). The Nanopore and Illumina reads were deposited in GenBank under the accession numbers: SRR23936021 and SRR23936021, respectively. The resulting sequences were assembled using a hybrid assembly strategy. The assembly steps are depicted in (Fig. 1A). Firstly, we assembled the Illumina reads using Spades and obtained a unitig graph. Three double-bifurcation structures (DBSs) were identified. Each DBS structure has four configurations, named c1–4. Here c1 and c2 represented the major configurations. And c3 and c4 represent the configurations resulting from c1 and c2’s recombinations. We then resolved the DBS structures using Unicycler, resulting in two circular chromosomes (Fig. 1B).

To confirm that the two circular chromosomes were assembled correctly, we mapped the long reads back to the core sequences of the DBSs and their 1000 bp long flanking sequences (Fig. S1). The mapping results were consistent with the two circular chromosomes as that resolved by Unicycler (Fig. S1). The location of these DBS structures on MC1 and MC2 is shown in (Table 1). The two circular chromosomes were named MC1 and MC2. MC stands for mitochondrial chromosome. MC1 and MC2 were 182, 239 and 165, 055 bp long with a GC content of 44.84 and 44.43%, respectively.

Table 1 Mapping results of Nanopore long reads to the four possible configurations associated with three DBS (DBS01–03) of the S. splendens mitogenome, which correspond to repetitive sequences (R01, R04, and R07).

Full size table

We then mapped the short reads and long reads to the sequences of MC1 and MC2, simultaneously. We detected the median coverage to be 73 and 80 for MC1 and MC2, respectively (Fig. S2A,B). The short reads were mapped back to the MC1 and MC2 with the median coverage being 191 and 212 for MC1 and MC2, respectively (Fig. S2C,D).

To determine the intra-specific and inter-specific variations of Salvia mitogenomes, we compared the mitogenome sequences of S. splendens in this study with those of S. splendens deposited in Genbank and S. officinalis and S. miltiorrhiza²² using Mummer (v3)²⁸ with the identity threshold of 85% and the many-to-many model. For comparison of the sequences of S. splendens mitogenome assembled in this study and the one released in GenBank (PNBA02000024.1), the alignable regions covered all sequences of the two whole mitogenomes under the identity threshold (Fig. 2A). In total, there were five alignable regions between the two sequences with a length of 31,723 to 104,986 bp and identity of 99.81 to 99.99% (Table S2). We then detected 166 variable sites between these two assemblies using BLASTN (v 2.10.1 +) and Jvarkit (v.1.0) (Table S3).

We compared the sequences of S. splendens and S. miltiorrhiza mitogenomes, the lengths of the alignable regions were 205,371 bp (59.13% of the complete mitogenome sequences) and 204,958 bp (49.49% of the complete mitogenome sequences) of S. splendens and S. miltiorrhiza mitogenome, respectively (Fig. 2B). We then compared the sequences of S. splendens and S. officinalis mitogenomes, the length of the alignable regions was 169,505 bp (48.81% of the complete mitogenome sequences) and 169,816 bp (55.11% of the complete mitogenome sequences) of S. splendens and S. officinalis mitogenomes, respectively (Fig. 2C).

We annotated 51 genes in the S. splendens mitogenome, including 32 PCGs, three rRNA genes, and 18 tRNA genes. Among the 32 PCGs, 24 PCGs were identified in most angiosperm plants and were classified as core genes in the previous report⁴⁷. And 8 PCGs were found in some of the angiosperm plants and were classified as variable genes (Figs. 3, Fig. 4,Table 2). As shown in Fig. 4, the variable genes rps7 and sdh3 were both missing in the six mitogenomes of Lamiaceae. The total length of the coding sequences (CDS) of genes was 49,955 bp, accounting for 14.38% of the total length of the S. splendens mitogenome.

Table 2 Genes predicted in the mitogenome of S. splendens.

Full size table

Repetitive sequences in the S. splendens mitogenome and their involvement in recombination

We found 56 pairs of repetitive sequences in the S. splendens mitogenome using the ROUSFinder2.0.py and checked the recombination frequency of these repetitive sequences using the Nanopore long reads (Table 1, Table S4). Among them, we found the recombination products associated with three repetitive sequences (R01, 04, and 07) based on the mapping results of the long Nanopore reads (Table 1). The length of the longest repetitive sequence (R01) of S. splendens mitogenome was 13,331 bp and it had the highest recombination frequency (40.74%). The other two repetitive sequences (R04 and R07) had lower recombination frequencies of 4.26% and 0.60%, respectively.

We used PCR amplification and Sanger sequencing to validate the presence of the four configurations associated with the repetitive sequences of R04 and R07. The presence of the four configurations associated with R01 was validated using PCR amplification. The primer pairs were designed to amplify sequences corresponding to the four configurations (c1–c4). The primer sequences are shown in (Table S5). The PCR amplicons had the expected sizes (Fig. 5). And the Sanger sequencing results of PCR amplicons associated with R04 and R07 were similar to those of the templates (Fig. S3).

Using the major configuration as the baseline, we can infer possible minor configurations resulting from the recombination mediated by the three repetitive sequences. As shown in Fig. 6, the major configuration (Mac1) is placed in the middle. Seven configurations (Mic1-7) resulting from repeat-mediated recombination from Mac1 are shown around Mac1. Genome configurations Mac1 and Mic5 have two chromosomes. Each of the other six genome configurations (Mic1,2,3,4,6,7) has one chromosome only.

We also detected the tandem repeats and simple sequence repeats (SSRs) that can be used for species authentication, genetic variation, and evolution studies^{48,49,50,51,52} in the S. splendens mitogenome. Using the MISA web service and Linux version of the Tandem Repeats Finder (v4.09), we identified 89 SSRs and ten long tandem repeats (Tables S6–10). We identified 51 and 38 SSRs on the MC1 and MC2, respectively. The most abundant type of SSR was tetranucleotide repeats, accounting for 43.83% of all SSRs in the S. splendens mitogenome (Table S7). Ten long tandem repeats 8 and 2 tandem repetitive sequences were identified on the MC1 and MC2, respectively (Table S8). Their repeat units ranged from 11–26 nt in size (Table S8).

Identification of mitochondrial plastid sequences (MTPTs)

The mitochondrial plastid DNAs (MTPTs) are plastid DNAs (ptDNA) migrated into the mitochondrial DNA (mtDNA)^53,54. We detected MTPTs in the S. splendens by comparing the complete mtDNA and ptDNA sequences of S. splendens using BLASTn with the parameters of e-value < 1e-6 and the word size of 7²². We found 28,175 bp of plastid-derived DNAs distributed among 15 different loci in the S. splendens mtDNAs (Table S9). All these 15 MTPTs occupied 8.11 and 18.71% of the complete ptDNA and mtDNA sequences of S. splendens. The length of these 15 different loci ranged from 103 (MTPT11) to 13,674 bp (MTPT12) (Table S9). The largest MTPTs (MTPT12) inserted the regions of MC2 from positions 77,995 to 91,668 (Table S9).

Most MTPTs represented noncoding DNA or fragments of the CDS, such as rpoB and atpB fragments in the MTPT02 and MTPT15 (Table S9). However, the MTPTs contained functional ptDNA-encoded genes such as five transfer RNA genes (trnI-GAU, trnD-GUC, trnS-GGA, trnV-GAC, and trnW-CCA) and the intact CDSs of 11 genes (psbB, psbH, psbN, psbT, rpl14, rpl2, rpl22, rpoA, rps11, rps19, rps8) (Table S9). We then mapped the long reads to the sequences containing the MTPTs and their 1000 bp of 5′ and 3′ flanking regions to confirm their presence. As a result, all 15 MTPTs including the longest MTPT (MTPT12) were supported by the mapping results of long reads (Fig. S4A–O).

Identification of RNA editing sites

In angiosperms, RNA editing substitutes cytidines (C) for uridines (U) post-transcriptionally^55,56, resulting in a change of the amino acid sequences⁵⁵. These changes were likely to increase the overall diversity of the mitogenomic proteomes⁵⁷, reshape the proteins’ physicochemical characteristics⁵⁸, and modify proteins’ folding patterns⁵⁹. Therefore, RNA editing plays an important role in the biological processes of plant mitochondria.

We identified the RNA editing events in the S. splendens mitochondrial RNAs based on the results of mapping the RNA-seq data to the reference genome. In total, 457 RNA sites were detected in the S. splendens mitochondrial RNAs. The detailed information for RNA editing sites is shown in (Table S10). The mapping results of the RNA-seq reads to the reference genome are shown in (Fig. S5A–Z). Among them, 46 (10.07%) and 411 (75.14%) RNA editing events resulted in synonymous and non-synonymous codon changes, respectively (Table S10). The non-synonymous codon changes mainly contained three amino acid alterations. There were 133 (29.10%) RNA editing events changing Pro to Leu, 89 (19.47%) editing events changing Ser to Leu, and 65 (14.22%) RNA editing events changing Ser to Phe (Table S10). According to the physicochemical properties, 35.89% of non-synonymous codon changes were from hydrophilic to hydrophobic amino acids. Editing events mainly occurred at the first and second codon positions of the CDS, which accounted for 33.04% (151 sites) and 56.46% (258 sites) of all RNA editing sites, respectively (Table S10). We compared the RNA editing sites between S. splendens and S. officinalis. We found that S. splendens and S. officinalis shared 387 RNA editing sites. In contrast, S. splendens and A. thaliana mitogenomes shared only 201 RNA editing sites. The difference is consistent with their phylogenetic relationship.

Two of the most dramatic RNA editing events are the creation of new start and stop codons. We define the stop-gain as a stop codon obtained as a result of RNA editing. Two stop-gain resulting from the RNA editing events cox2-615 and nad9-289 were identified in the S. splendens mitochondrial RNAs (Table S10). The codon changes of the cox2-615 and nad9-289 were CAA to TAA and CGA to TGA (Table S10, Fig. S5K,W). The frequencies of these two events were 0.5 and 0.35, respectively.

To exclude the interference of SNPs on the identification of the RNA editing sites, we also detected the SNPs in the CDS of S. splendens mitogenome using the Illumina short reads. We detected 37 SNP sites in the CDS of the S. splendens mitogenome (Table S11). None of these SNP sites overlapped with the predicted RNA editing sites.

Phylogenetic analysis based on the mitogenomes and plastomes of the Liamiales

To infer the phylogeny of twelve Liamiales species based on mitogenomic sequences, we performed the phylogenetic analysis of twelve Lamiales species based on the CDS of 26 common mitogenomic genes in the 12 Lamiales mitogenomes. Based on the data matrix of the CDS, the maximum likelihood (ML) and the Bayesian inference (BI) trees were constructed by RAxML (v8.2.4) and MrBayes (v3.2.7)⁴¹, respectively. As shown, S. splendens was sister to S. miltiorrhiza and they were clustered in a branch with a bootstrap support value of 99 and a posterior probability of 1.00 for the ML and BI analyses, respectively (Fig. 7A). The branch of the S. splendens and S. miltiorrhiza was then clustered with the S. officinalis to form a branch with a bootstrap support value of 100 and a posterior probability of 1.00 (Fig. 7A). In parallel, we constructed the phylogenetic trees using the CDS of 56 plastid genes with the same methods. The phylogeny of 12 Liamiales species based on the mitogenomic sequences was consistent with that based on the plastome sequences (Fig. 7B).

Genetic markers for interspecific identification based on mitochondrial introns

Initially, through the alignment of mitochondrial intron sequences of S. miltiorrhiza, S. officinalis, and S. splendens, multiple polymorphic loci were identified. Among them, 48 loci exhibiting variation between two or three species in the 12 introns (cox1i12, cox2i12, nad1i23, nad2i12, nad2i34, nad2i45, nad4i12, nad4i34, nad5i45, nad7i12, nad7i23, and nad7i34) were selected (Table S12). The naming convention for the introns follows the obvious study⁶⁰. Out of these, 25 loci were single nucleotide polymorphisms (SNPs), while 23 were insertion-deletion type loci. Remarkably, one of the insertion/deletion polymorphic loci exhibited a substantial base number alteration of 65 bp, hence was not listed in the table.

To further explore these polymorphic loci, 12 primer pairs were designed to amplify these polymorphic loci (Table S13), and the resulting amplicons were sequenced employing Sanger sequencing. Through PCR amplification and Sanger sequencing, the variation loci were discerned within DNA samples obtained from three individuals of each species. Figure 8 shows the electrophoretic gel plot of amplicons generated from all 12 primer pairs, represented by two individuals from each of the three species. Sanger sequencing succeeded in revealing all the detected loci within the 12 introns (Fig. S6). All the variation loci within an intron of these 12 introns enabled three species to be distinguished. As an example, intron cox2i12 revealed nine polymorphic loci, enabling three species to be distinguished through two or more loci among these nine loci (Fig. 9, Fig. S6).

Discussions

Investigations on the genome size, structure, gene content and orders, and intron content of plant mtDNA were crucial for understanding the diversity and evolution of plant mitogenomes. We recently analyzed the mitogenomes of two Salvia species: S. miltiorrhiza and S. officinalis^22,23. To further determine the characteristics of the Salvia mitogenomes, we report here on the structure, gene contents, homologous recombination mediated by the repetitive sequences, and RNA editing events of the S. splendens mitogenome. In addition, a comparative analysis was conducted on the three mitogenomes to determine genetic diversity among the Salvia genus.

All three Salvia species’ mitogenomes contained two circular molecules

We found that the major mitogenome configuration of S. miltiorrhiza, S. officinalis, and S. splendens contained two circular chromosomes. Moreover, we identified 9, 3, and 3 repetitive sequences in the three genomes that can mediate the homologous recombination The length of the longest repetitive sequence of S. splendens mitogenome was 13,331 bp (R01), which was more than two folds of the longest repetitive sequence of S. miltiorrhiza (5835 bp)²² and nearly fifteen folds of the longest repetitive sequence of S. officinalis (892 bp)²³. All these long repetitive sequences were found to be able to mediate the homologous recombination, supported by the mapping results of Pacbio and Nanopore long reads. However, sequence comparison identified little sequence similarity among these repetitive sequences. In the future, they should be examined for possible conservation of high-level structure. We validated the homologous recombination mediated by the long repetitive sequence (13,331 bp) in the S. splendens mitogenome using PCR. Similarly, the recombination mediated by a ~ 16 kp repeat pair was also validated using PCR in the mitogenome of a holoparasitic plant Aeginetia indica. These two results showed that the long repetitive sequences also mediated the homologous recombination in plant mitogenome.

MTPTs were found to be most abundant in the complete S. splendens mitogenome among the three Salvia species

Previous studies showed that MTPTs abundance correlates with both mitochondrial genome size and mtDNA noncoding content⁶¹. Here, we compared the MTPTs in the three Salvia species and found the total length of the MTPTs, ranging from 12,583 bp of S. miltiorrhiza⁶² to 28,175 bp of S. splendens. The total length of the MTPTs was 14,495 bp of S. officinalis²³. However, the mitochondrial genome size of S. miltiorrhiza was 414,114 bp, larger than the 347,294 bp of S. splendens and 308,168 bp of S. officinalis. In these three Salvia species, the lengths of the MTPTs did not correlate with the mitochondrial genome size. For example, the Carica papaya mitogenome was 476,890 bp, larger than the S. splendens. However, the total MTPT length of Carica papaya was 15.1 kb, shorter than that of the S. splendens. Moreover, the length of the longest MTPT of S. splendens mitogenome was 13,674 bp (MTPT12), more than three folds of the longest MTPTs of S. miltiorrhiza (4,261 bp), and S. officinalis (4,987 bp).

Previous studies have suggested that the primary insertions of cpDNAs were large and then diverged and fragmented over evolutionary time⁶³. Thus, it is possible that the MTPT12 was the primary insertion of cpDNA and didn’t fragment in the evolutionary process. Future studies are needed to determine the exact mechanism leading to the long MTPT in S. splendens and size variations in the congeneric mitogenomes.

RNA editing events were conserved among two Salvia species

To gain a better understanding of RNA editing in Salvia mitochondria, we identified the editing sites of mitochondrial RNAs of S. splendens and compared them with those found in other Salvia species (S. officinalis) and Arabidopsis thaliana. 386 C-to-U modifications were shared between the CDS regions of the mitogenomes of S. splendens and S. officinalis. 201 C-to-U modifications were shared between the CDS regions of the mitogenomes of S. splendens and A. thaliana. Therefore, we can conclude that the RNA editing sites were more conserved on the intra-species level than the inter-species level. 201 C-to-U modifications were shared by CDS regions of the mitogenomes of two Salvia species (S. splendens and S. officinalis) and A. thaliana. This suggested that the inter-genus species hold nearly 50% of conserved RNA editing sites in their CDS regions of mitogenomes, in contrast, cogeneric species share 84% of conserved RNA editing sites.

Mitochondrial intron markers for interspecific identification

Compared to nuclear DNA, organelle DNA offers advantages like multiple copies and uniparental inheritance, making sequences like the chloroplast barcode combination (rbcL + matK + trnH-psbA) popular as molecular markers. However, DNA migration events in plant organelles cause some chloroplast DNAs to move to mitochondrial genomes, forming fragments known as MTPTs. The similarity between MTPTs and corresponding chloroplast DNA can be as high as 99.6%⁶⁴. This leads to the co-amplification of chloroplast barcodes with MTPTs, resulting in unexpected marker sequences and potential plant misidentification⁶⁴.

Recent advancements in plant mitochondrial genome research have spurred the development of molecular markers for interspecific identification based on polymorphisms in mitochondrial intron sequences. These markers can eliminate misidentification risks associated with chloroplast genome barcodes. For instance, one mitochondrial marker was developed to identify seven Acer species using a 33 bp insertion-deletion in the nad1 intron⁶⁵. Additionally, two other markers were generated based on variant sites in the introns of the nad2 and nad4 genes to differentiate five Amorphophallus species¹⁷. In this study, we developed 12 intron markers derived from the polymorphisms in mitochondrial intron sequences, tailored for the identification of S. miltiorrhiza, S. officinalis, and S. splendens. These markers are superior to traditional chloroplast markers and may also be used for the identification of more species in the Salvia genus in the future.

Conclusion

Our study showed that the major configuration of the S. splendens mitogenome contains two circular chromosomes. Multiple configurations are likely to coexist, which are converted from the major configuration through repeat-mediated recombination. 457 RNA editing sites including two stop codon gain sites were identified in the S. splendens mitogenome resulting from RNA editing. We developed 12 intron markers derived from the polymorphisms in mitochondrial intron sequences for the identification of S. miltiorrhiza, S. officinalis, and S. splendens.

Data availability

The raw sequencing data from the Illumina and Nanopore platforms generated during the current study are available in GenBank. The associated BioProject, BioSample, and SRA numbers and the associated link are PRJNA947342, SAMN33848326, SRR23935050 for Illumina sequencing reads, and SRR23936021 for Nanopore sequencing reads. The mitogenome sequences along with the annotation information of them have been deposited in GenBank (https://www.ncbi.nlm.nih.gov/) with accession numbers: OQ675154 and OQ675155. The plant sample has been stored at the Herbarium of the Institute of Medicinal Plant Development, Beijing, China (Voucher Numbers: Implad20230116).

References

Moharram, F.A.-E., Marzouk, M. S., El-Shenawy, S. M., Gaara, A. H. & El Kady, W. M. Polyphenolic profile and biological activity of Salvia splendens leaves. J. Pharm. Pharmacol. 64, 1678–1687. https://doi.org/10.1111/j.2042-7158.2012.01544.x (2012).
Article CAS PubMed Google Scholar
Rajendran, R. & Prabha, A. L. AgNPs synthesis, characterization and antibacterial activity from Salvia splendens Sellow ex Roem. & Schult. plant extract. Int. J. Sci. Res. 4, 1086–1090 (2015).
Google Scholar
Rajendran, R., Pullani, S., Thavamurugan, S., Radhika, R. & Lakshmi Prabha, A. Green fabrication of silver nanoparticles from Salvia species extracts: characterization and anticancer activities against A549 human lung cancer cell line. Appl. Nanosci. https://doi.org/10.1007/s13204-021-02130-w (2022).
Article Google Scholar
Fontana, G. et al. Synthetic studies of neoclerodane diterpenoids from Salvia splendens and evaluation of opioid receptor affinity. Tetrahedron 64, 10041–10048. https://doi.org/10.1016/j.tet.2008.08.043 (2008).
Article CAS PubMed PubMed Central Google Scholar
Dong, A. X. et al. High-quality assembly of the reference genome for scarlet sage, Salvia splendens, an economically important ornamental plant. GigaScience https://doi.org/10.1093/gigascience/giy068 (2018).
Article PubMed PubMed Central Google Scholar
Jia, K. H. et al. Chromosome-scale assembly and evolution of the tetraploid Salvia splendens (Lamiaceae) genome. Horticult. Res. https://doi.org/10.1038/s41438-021-00614-y (2021).
Article Google Scholar
Du, Q. et al. Comparative genomics and phylogenetic analysis of the chloroplast genomes in three medicinal Salvia species for bioexploration. Int. J. Mol. Sci https://doi.org/10.3390/ijms232012080 (2022).
Article PubMed PubMed Central Google Scholar
Zhao, F. et al. The chloroplast genome of Salvia: genomic characterization and phylogenetic analysis. Int. J. Plant Sci. 181, 812–830. https://doi.org/10.1086/710083 (2020).
Article Google Scholar
Handa, H. The complete nucleotide sequence and RNA editing content of the mitochondrial genome of rapeseed (Brassica napus L.): comparative analysis of the mitochondrial genomes of rapeseed and Arabidopsis thaliana. Nucleic Acids Res. 31, 5907–5916. https://doi.org/10.1093/nar/gkg795 (2003).
Article CAS PubMed PubMed Central Google Scholar
Mower, J. P. Variation in protein gene and intron content among land plant mitogenomes. Mitochondrion 53, 203–213. https://doi.org/10.1016/j.mito.2020.06.002 (2020).
Article CAS PubMed Google Scholar
Wu, Z. Q., Liao, X. Z., Zhang, X. N., Tembrock, L. R. & Broz, A. Genomic architectural variation of plant mitochondria—A review of multichromosomal structuring. J. Syst. Evol. 60, 160–168. https://doi.org/10.1111/jse.12655 (2020).
Article Google Scholar
Cole, L. W., Guo, W., Mower, J. P. & Palmer, J. D. High and variable rates of repeat-mediated mitochondrial genome rearrangement in a genus of plants. Mol. Biol. Evol. 35, 2773–2785. https://doi.org/10.1093/molbev/msy176 (2018).
Article CAS PubMed Google Scholar
Sanchez-Puerta, M. V., Zubko, M. K. & Palmer, J. D. Homologous recombination and retention of a single form of most genes shape the highly chimeric mitochondrial genome of a cybrid plant. New Phytol. 206, 381–396. https://doi.org/10.1111/nph.13188 (2015).
Article CAS PubMed Google Scholar
Lukes, J., Kaur, B. & Speijer, D. RNA editing in mitochondria and plastids: Weird and widespread. Trends Genet. 37, 99–102. https://doi.org/10.1016/j.tig.2020.10.004 (2021).
Article CAS PubMed Google Scholar
Muse, S. V. Examining rates and patterns of nucleotide substitution in plants. Plant Mol. Biol. 42, 25–43. https://doi.org/10.1023/A:1006319803002 (2000).
Article CAS PubMed Google Scholar
Froelicher, Y. et al. New universal mitochondrial PCR markers reveal new information on maternal citrus phylogeny. Tree Genet. Genom. 7, 49–61. https://doi.org/10.1007/s11295-010-0314-x (2011).
Article Google Scholar
Shan, Y., Li, J., Zhang, X. & Yu, J. The complete mitochondrial genome of Amorphophallus albus and development of molecular markers for five Amorphophallus species based on mitochondrial DNA. Front. Plant Sci. 14, 1180417. https://doi.org/10.3389/fpls.2023.1180417 (2023).
Article PubMed PubMed Central Google Scholar
Alverson, A. J., Rice, D. W., Dickinson, S., Barry, K. & Palmer, J. D. Origins and recombination of the bacterial-sized multichromosomal mitochondrial genome of cucumber. Plant Cell 23, 2499–2513. https://doi.org/10.1105/tpc.111.087189 (2011).
Article CAS PubMed PubMed Central Google Scholar
Alverson, A. J. et al. Insights into the evolution of mitochondrial genome size from complete sequences of Citrullus lanatus and Cucurbita pepo (Cucurbitaceae). Mol. Biol. Evol. 27, 1436–1448. https://doi.org/10.1093/molbev/msq029 (2010).
Article CAS PubMed PubMed Central Google Scholar
Adams, K. Evolution of mitochondrial gene content: gene loss and transfer to the nucleus. Mol. Phylogenet. Evol. 29, 380–395. https://doi.org/10.1016/s1055-7903(03)00194-5 (2003).
Article CAS PubMed Google Scholar
Figueroa, P., Gomez, I., Holuigue, L., Araya, A. & Jordana, X. Transfer of rps14 from the mitochondrion to the nucleus in maize implied integration within a gene encoding the iron-sulphur subunit of succinate dehydrogenase and expression by alternative splicing. Plant J. 18, 601–609. https://doi.org/10.1046/j.1365-313x.1999.00485.x (1999).
Article CAS PubMed Google Scholar
Yang, H. et al. De novo hybrid assembly of the Salvia miltiorrhiza mitochondrial genome provides the first evidence of the multi-chromosomal mitochondrial DNA structure of Salvia species. Int. J. Mol. Sci. https://doi.org/10.3390/ijms232214267 (2022).
Article PubMed PubMed Central Google Scholar
Yang, H. et al. Mitochondrial genome sequence of Salvia officinalis (lamiales: lamiaceae) suggests diverse genome structures in cogeneric species and finds the stop gain of genes through RNA editing events. Int. J. Mol. Sci. 24, 5372. https://doi.org/10.3390/ijms24065372 (2023).
Article CAS PubMed PubMed Central Google Scholar
Jin, J. J. et al. GetOrganelle: a fast and versatile toolkit for accurate de novo assembly of organelle genomes. Genome Biol. 21, 1–31. https://doi.org/10.1186/s13059-020-02154-5 (2020).
Article Google Scholar
Wick, R. R., Judd, L. M., Gorrie, C. L. & Holt, K. E. Unicycler: Resolving bacterial genome assemblies from short and long sequencing reads. PLoS Comput. Biol. 13, e1005595. https://doi.org/10.1371/journal.pcbi.1005595 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760. https://doi.org/10.1093/bioinformatics/btp324 (2009).
Article CAS PubMed PubMed Central Google Scholar
Li, H. et al. The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079. https://doi.org/10.1093/bioinformatics/btp352 (2009).
Article CAS PubMed PubMed Central Google Scholar
Kurtz, S. et al. Versatile and open software for comparing large genomes. Genome Biol. 5, R12. https://doi.org/10.1186/gb-2004-5-2-r12 (2004).
Article PubMed PubMed Central Google Scholar
Zhong, Y., Yu, R., Chen, J., Liu, Y. & Zhou, R. Highly active repeat-mediated recombination in the mitogenome of the holoparasitic plant Aeginetia indica. Front. Plant Sci. 13, 988368. https://doi.org/10.3389/fpls.2022.988368 (2022).
Article PubMed PubMed Central Google Scholar
Hao, Z. et al. RIdeogram: drawing SVG graphics to visualize and map genome-wide data on the idiograms. PeerJ Comput. Sci. https://doi.org/10.7717/peerj-cs.251 (2020).
Article PubMed PubMed Central Google Scholar
Camacho, C. et al. BLAST+: architecture and applications. BMC Bioinformat. 10, 1–9 (2009).
Article Google Scholar
Lindenbaum, P., Jvarkit (v1.0): java utilities for bioinformatics. Github. https://github.com/lindenb/jvarkit (2015).
Lowe, T. M. & Eddy, S. R. tRNAscan-SE: A program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 25, 955–964. https://doi.org/10.1093/nar/25.5.955 (1997).
Article CAS PubMed PubMed Central Google Scholar
Lee, E., Harris, N., Gibson, M., Chetty, R. & Lewis, S. Apollo: a community resource for genome annotation editing. Bioinformatics 25, 1836–1837. https://doi.org/10.1093/bioinformatics/btp314 (2009).
Article CAS PubMed Google Scholar
Wynn, E. L. & Christensen, A. C. Repeats of unusual size in plant mitochondrial genomes: Identification, incidence and evolution. BG3 (Bethesda) 9, 549–559. https://doi.org/10.1534/g3.118.200948 (2019).
Article CAS Google Scholar
Milne, I. et al. Tablet–next generation sequence assembly visualization. Bioinformatics 26, 401–402. https://doi.org/10.1093/bioinformatics/btp666 (2010).
Article CAS PubMed Google Scholar
Beier, S., Thiel, T., Munch, T., Scholz, U. & Mascher, M. MISA-web: a web server for microsatellite prediction. Bioinformatics 33, 2583–2585. https://doi.org/10.1093/bioinformatics/btx198 (2017).
Article CAS PubMed PubMed Central Google Scholar
Zhang, D. et al. PhyloSuite: an integrated and scalable desktop platform for streamlined molecular sequence data management and evolutionary phylogenetics studies. Mol. Ecol. Resour. 20, 348–355. https://doi.org/10.1111/1755-0998.13096 (2020).
Article PubMed Google Scholar
Katoh, K., Rozewicki, J. & Yamada, K. D. MAFFT online service: multiple sequence alignment, interactive sequence choice and visualization. Br.Bioinform. 20, 1160–1166. https://doi.org/10.1093/bib/bbx108 (2019).
Article CAS Google Scholar
Stamatakis, A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30, 1312–1313. https://doi.org/10.1093/bioinformatics/btu033 (2014).
Article CAS PubMed PubMed Central Google Scholar
Ronquist, F. et al. MrBayes 3.2: efficient Bayesian phylogenetic inference and model choice across a large model space. Syst. Biol. 61, 539–542. https://doi.org/10.1093/sysbio/sys029 (2012).
Article PubMed PubMed Central Google Scholar
Darriba, D., Taboada, G. L., Doallo, R. & Posada, D. jModelTest 2: more models, new heuristics and parallel computing. Nat. Methods 9, 772–772. https://doi.org/10.1038/nmeth.2109 (2012).
Article CAS PubMed PubMed Central Google Scholar
Kim, D., Langmead, B. & Salzberg, S. L. HISAT: a fast spliced aligner with low memory requirements. Nat. Methods 12, 357–360. https://doi.org/10.1038/nmeth.3317 (2015).
Article CAS PubMed PubMed Central Google Scholar
Yu, R. et al. The minicircular and extremely heteroplasmic mitogenome of the holoparasitic plant Rhopalocnemis phalloides. Curr. Biol. 32, 470–479. https://doi.org/10.1016/j.cub.2021.11.053 (2021).
Article CAS PubMed Google Scholar
Picardi, E. & Pesole, G. REDItools: high-throughput RNA editing detection made easy. Bioinformatics 29, 1813–1814. https://doi.org/10.1093/bioinformatics/btt287 (2013).
Article CAS PubMed Google Scholar
Wu, B. et al. Identification of symmetrical RNA editing events in the mitochondria of Salvia miltiorrhiza by strand-specific RNA sequencing. Sci. Rep. 7, 1–11. https://doi.org/10.1038/srep42250 (2017).
Article ADS CAS Google Scholar
Skippington, E., Barkman, T. J., Rice, D. W. & Palmer, J. D. Miniaturized mitogenome of the parasitic plant Viscum scurruloideum is extremely divergent and dynamic and has lost all nad genes. Proc. Natl. Acad. Sci. USA 112, E3515–E3524. https://doi.org/10.1073/pnas.1504491112 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Buchanan, F. C., Adams, L. J., Littlejohn, R. P., Maddox, J. F. & Crawford, A. M. Determination of evolutionary relationships among sheep breeds using microsatellites. Genomics 22, 397–403. https://doi.org/10.1006/geno.1994.1401 (1994).
Article CAS PubMed Google Scholar
Gupta, P. K. et al. Transferable EST-SSR markers for the study of polymorphism and genetic diversity in bread wheat. Mol. Genet. Genom. 270, 315–323. https://doi.org/10.1007/s00438-003-0921-4 (2003).
Article CAS Google Scholar
Martin, P., Makepeace, K., Hill, S. A., Hood, D. W. & Moxon, E. R. Microsatellite instability regulates transcription factor binding and gene expression. Proc. Natl. Acad. Sci. USA 102, 3800–3804. https://doi.org/10.1073/pnas.0406805102 (2005).
Article ADS CAS PubMed PubMed Central Google Scholar
Ramsay, L. et al. A simple sequence repeat-based linkage map of barley. Genetics 156, 1997–2005. https://doi.org/10.1093/genetics/156.4.1997 (2000).
Article CAS PubMed PubMed Central Google Scholar
Vinces, M. D., Legendre, M., Caldara, M., Hagihara, M. & Verstrepen, K. J. Unstable tandem repeats in promoters confer transcriptional evolvability. Science 324, 1213–1216. https://doi.org/10.1126/science.1170097 (2009).
Article ADS CAS PubMed PubMed Central Google Scholar
Ellis, J. Promiscuous DNA—Chloroplast genes inside plant mitochondria. Nature 299, 678–679. https://doi.org/10.1038/299678a0 (1982).
Article ADS CAS PubMed Google Scholar
Knoop, V. The mitochondrial DNA of land plants: peculiarities in phylogenetic perspective. Curr. Genet. 46, 123–139. https://doi.org/10.1007/s00294-004-0522-8 (2004).
Article CAS PubMed Google Scholar
Covello, P. S. & Gray, M. W. RNA editing in plant mitochondria. Nature 341, 662–666. https://doi.org/10.1038/341662a0 (1989).
Article ADS CAS PubMed Google Scholar
Edera, A. A., Gandini, C. L. & Sanchez-Puerta, M. V. Towards a comprehensive picture of C-to-U RNA editing sites in angiosperm mitochondria. Plant Mol. Biol. 97, 215–231. https://doi.org/10.1007/s11103-018-0734-9 (2018).
Article CAS PubMed Google Scholar
Mower, J. P. The PREP suite: predictive RNA editors for plant mitochondrial genes, chloroplast genes and user-defined alignments. Nucleic Acids Res. 37, W253-259. https://doi.org/10.1093/nar/gkp337 (2009).
Article CAS PubMed PubMed Central Google Scholar
Jobson, R. W. & Qiu, Y.-L. Did RNA editing in plant organellar genomes originate under natural selection or through genetic drift?. Biol. Direct. 3, 43. https://doi.org/10.1186/1745-6150-3-43 (2008).
Article CAS PubMed PubMed Central Google Scholar
Yura, K. & Go, M. Correlation between amino acid residues converted by RNA editing and functional residues in protein three-dimensional structures in plant organelles. BMC Plant Biol. 8, 79. https://doi.org/10.1186/1471-2229-8-79 (2008).
Article CAS PubMed PubMed Central Google Scholar
Grosser, M. R. et al. Plant mitochondrial introns as genetic markers—Conservation and variation. Front. Plant Sci. 14, 1116851. https://doi.org/10.3389/fpls.2023.1116851 (2023).
Article PubMed PubMed Central Google Scholar
Smith, D. R. Extending the limited transfer window hypothesis to inter-organelle DNA migration. Genome Biol. Evol. 3, 743–748. https://doi.org/10.1093/gbe/evr068 (2011).
Article CAS PubMed PubMed Central Google Scholar
Nguyen, V. B. et al. Comprehensive comparative analysis of chloroplast genomes from seven Panax species and development of an authentication system based on species-unique single nucleotide polymorphism markers. J. Ginseng Res. 44, 135–144. https://doi.org/10.1016/j.jgr.2018.06.003 (2020).
Article PubMed Google Scholar
Wang, D. et al. Transfer of chloroplast genomic DNA to mitochondrial genome occurred at least 300 MYA. Mol. Biol. Evol. 24, 2040–2048. https://doi.org/10.1093/molbev/msm133 (2007).
Article CAS PubMed Google Scholar
Park, H. S. et al. Mitochondrial plastid DNA can cause DNA barcoding paradox in plants. Sci. Rep. 10, 6112. https://doi.org/10.1038/s41598-020-63233-y (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Ma, Q. et al. Assembly and comparative analysis of the first complete mitochondrial genome of Acer truncatum Bunge: a woody oil-tree species producing nervonic acid. BMC Plant Biol. 22, 29. https://doi.org/10.1186/s12870-021-03416-5 (2022).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We would like to thank GrandOmics Biotechnology Co., Ltd for genomic sequencing service.

Funding

This work was supported by the Chinese Academy of Medical Sciences Innovation Funds for Medical Sciences (CIFMS) [2021-I2M-1-071, 2021-I2M-1-022], the National Natural Science Foundation of China [81872966], and the National Science &Technology Fundamental Resources Investigation Program of China [2018FY100705]. The funders were not involved in the study design, data collection, analysis, decision to publish, or manuscript preparation.

Author information

Authors and Affiliations

Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, 100193, People’s Republic of China
Heyu Yang, Yang Ni, Jingling Li, Haimei Chen & Chang Liu

Authors

Heyu Yang
View author publications
Search author on:PubMed Google Scholar
Yang Ni
View author publications
Search author on:PubMed Google Scholar
Jingling Li
View author publications
Search author on:PubMed Google Scholar
Haimei Chen
View author publications
Search author on:PubMed Google Scholar
Chang Liu
View author publications
Search author on:PubMed Google Scholar

Contributions

CL conceived the work; HYY, YN, JLL performed acquisition and analysis of data: CL, HYY, and HMC wrote the draft paper: HMC critically reviewed the manuscript. All authors have approved the submitted version (and any substantially modified version that involves the author’s contribution to the study). All authors have agreed both to be personally accountable for the author’s own contributions and to ensure that questions related to the accuracy or integrity of any part of the work, even ones in which the author was not personally involved, are appropriately investigated, resolved, and the resolution documented in the literature.

Corresponding authors

Correspondence to Haimei Chen or Chang Liu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Ethical statement and permission

This study has obtained permission to collect S. splendens through the Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences and Peking Union Medical College. We conducted the study in accordance with relevant institutional, national and international guidelines and legislation.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information 1.

Supplementary Information 2.

Supplementary Information 3.

Supplementary Information 4.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Yang, H., Ni, Y., Li, J. et al. Unveiling the mitochondrial genome of Salvia splendens insights into the evolutionary traits within the genus Salvia. Sci Rep 15, 13344 (2025). https://doi.org/10.1038/s41598-025-96637-9

Download citation

Received: 20 February 2024
Accepted: 31 March 2025
Published: 17 April 2025
Version of record: 17 April 2025
DOI: https://doi.org/10.1038/s41598-025-96637-9

Subjects

Abstract

Similar content being viewed by others

Insights into structure, codon usage, repeats, and RNA editing of the complete mitochondrial genome of Perilla frutescens (Lamiaceae)

Assembly and analysis of the complete mitochondrial genome of Carya illinoinensis to provide insights into the conserved sequences of tRNA genes

The characterisation of the complete mitochondrial genome of Polygonatum Kingianum reveals recombination mediated by repeats associated with DNA replication

Introduction

Materials and methods

Plant materials and nucleic acid extraction

DNA sequencing and mitogenome assembly

Mitogenome annotation

Analysis of the repeat-mediated recombination in the mitogenome

Analysis of repetitive sequences

Identification of mitochondrial plastid sequences (MTPTs)

Phylogenetic analysis of the twelve Lamiales species based on common mitochondrial genes

Detection of RNA editing sites

Development of genetic markers for interspecific identification based on mitochondrial introns

Results

Analysis of the S. splendens mitogenome structure

Repetitive sequences in the S. splendens mitogenome and their involvement in recombination

Identification of mitochondrial plastid sequences (MTPTs)

Identification of RNA editing sites

Phylogenetic analysis based on the mitogenomes and plastomes of the Liamiales

Genetic markers for interspecific identification based on mitochondrial introns

Discussions

All three Salvia species’ mitogenomes contained two circular molecules

MTPTs were found to be most abundant in the complete S. splendens mitogenome among the three Salvia species

RNA editing events were conserved among two Salvia species

Mitochondrial intron markers for interspecific identification

Conclusion

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Ethical statement and permission

Additional information

Publisher’s note

Supplementary Information

Supplementary Information 1.

Supplementary Information 2.

Supplementary Information 3.

Supplementary Information 4.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Quick links