Introduction

With their rich nutrients, fantastic flavor and taste, excellent medicinal value and health care functions, aquatic vegetables have been continuously well received by consumers as important economic crops. There are various kinds of aquatic vegetables, of which N. nucifera, C. esculenta and S. trifolia are important and widely planted in China, Southeast Asia, Australia, and Oceania; these aquatic vegetables have high economic value and a long history of cultivation1,2.

SSR fingerprints are inherent in genomes. SSR markers are not affected by external and internal environments, including growth and development time. The advantages of SSR markers are that they are low-cost, fast (within hours), accurate, and reliable. On the one hand, the authenticity of a variety can be effectively identified according to the expected specificity of the DNA fragment. On the other hand, the quality and purity of seeds can be determined by the proportion in the population. Currently, many crop varieties have DNA fingerprints constructed by SSR, such as rice3, corn4, wheat5,6, cotton7, and potato8. In addition, a set of crop varieties have DNA fingerprints constructed based on EST-SSR markers, such as Chinese cabbage9, Cherry10, Hemarthria sibirica11, Blueberry12, Eriobotrya japonica13, and Asparagus14.

The authenticity of the variety and quality and purity of seeds are extremely significant for crop and vegetable production15. However, currently, seed purity identification in China is still dominated by morphological identification16. The method of morphological identification is not only time-consuming and expensive but also subject to environmental factors. Therefore, morphological identification cannot quickly and accurately determine purity or variety. However, identification by SSR markers is accurate, can be accomplished within a short time, is low-cost, and is immune to environmental and self factors. Additionally, EST-SSR markers facilitate easy access to base information and metastasis among related species, with a low cost. Thus, EST-SSR markers have vast potential for future development17.

Due to the limited research on molecular markers, most aquatic vegetables do not have established molecular marker fingerprints or DNA fingerprints constructed from EST-SSR markers. Furthermore, there is no way to identify the authenticity of varieties or the quality and purity of seeds by molecular markers. Therefore, the purpose of this study is to construct DNA fingerprints for aquatic vegetables (sacred lotus, taro and arrowhead) by using a large number of EST-SSR molecular markers developed in our laboratory and to identify the purity of these aquatic vegetables. We aim to provide theoretical knowledge for the construction of DNA fingerprints and purity identification of other aquatic vegetables.

Results and Analysis

Screening of SSR core and candidate primers for three important aquatic vegetables

A total of 11,178 SSR sites were detected by MISA software, and 6,568 pairs of primers were designed by Primer3.0 software. We used a Perl script to synthesize 38 pairs of primers according to electronic polymorphism prediction, and seventy-two pairs of primers were randomly synthesized. Three hundred twenty-five clear bands were obtained by amplifying DNA from 22 scared lotus cultivars with 80 pairs of primers, with an average of 4 bands per pair of primers. From these 80 pairs of primers, 6 pairs of core SSR-labeled primers with polymorphisms that were able to distinguish all 22 materials were selected (Table 1). Three pairs of EST-SSR primers with high PIC (polymorphism information content) were selected as candidate primers for variety purity identification (Table 2). The PIC of each EST-SSR primer pair was calculated using the following formula: \({\rm{PIC}}=1-(\,\mathop{\sum }\limits_{{\rm{i}}=1}^{{\rm{n}}}{{\rm{q}}}_{{\rm{i}}}^{2})-(\mathop{\sum }\limits_{{\rm{i}}=1}^{{\rm{n}}-1}\mathop{\sum }\limits_{{\rm{j}}={\rm{i}}+1}^{{\rm{n}}}2{{\rm{q}}}_{{\rm{i}}}^{2}{{\rm{q}}}_{{\rm{j}}}^{2}\,)\), where n is the number of alleles and q is the ith and jth allele frequency18.

Table 1 Basic information of the six lotus core primers.
Table 2 Basic information of the three lotus candidate primers.

In total, 5,278 SSR loci were detected by MISA software, and 2,858 pairs of primers were designed by Primer3.0 software. We randomly synthesized 100 pairs of primers. Two hundred thirteen clear bands were obtained by amplifying DNA from 46 taro cultivars with 72 pairs of primers, with an average of 3 bands per pair of primers. From these 72 pairs of primers, 20 pairs of core EST-SSR primers with a polymorphism frequency that could distinguish all 46 taro materials were selected (Table 3).

Table 3 Basic information of the twenty taro core primers.

A total of 3,861 SSR loci were detected by MISA software, and 2,476 pairs of primers were designed by Primer3.0 software. We randomly synthesized 100 pairs of primers. One hundred ninety-three clear bands were obtained by amplifying DNA from 10 arrowhead cultivars with 78 pairs of primers, with an average of 2.5 bands per pair of primers. From these 78 pairs of primers, 3 pairs of core EST-SSR primers with a polymorphism frequency that could distinguish all 10 arrowhead materials were selected (Table 4). Nine pairs of EST-SSR primers with high PIC were selected as candidate primers for variety purity identification (Table 5). For every aquatic vegetable, at least one figure of polymorphism showed in supplementary file 2.

Table 4 Basic information of the three arrowhead core primers.
Table 5 Basic information of the nine arrowhead candidate primers.

DNA Fingerprints of three important aquatic vegetables

According to the sequences of the primer alleles presented in Table 4, the scored results, marked as “0” and “1” for the SSR molecular marker bands of sacred lotus core primers, were counted into binary data as 17-bit binary molecular identity cards of 22 sacred lotus cultivars (Table 6).

Table 6 Binary molecular identity of twenty-two lotus varieties.

According to the sequences of the primer alleles presented in Table 5, the scored results, marked as “0” and “1” for the SSR molecular marker bands of taro core primers, were counted into binary data as 87-bit binary molecular identity cards of 46 taro cultivars (Table 7).

Table 7 Binary molecular identity of forty-six taro varieties.

According to the sequences of the primer alleles presented in Table 6, the scored results, marked as “0” and “1” for the SSR molecular marker bands of arrowhead core primers, were counted into binary data as 14-bit binary molecular identity cards of 10 arrowhead cultivars (Table 8).

Table 8 Binary molecular identity of ten arrowhead varieties.

Variety purity identification of three important aquatic vegetables

Among 90 samples of the No.36 space lotus, the bands of 84 samples were regarded as the same characteristic band of the cultivar. The other 6 samples judged to be different cultivars were compared with the characteristic band. Three samples of differential alleles had eight bands, and the other 3 samples had one band. The purity of No.36 space lotus was 93.3% (84/90).

Among 90 samples of Shangdong 8502 taro, the bands of 89 samples were regarded as the same characteristic band of the cultivar. The remaining sample judged to be a different cultivar with 4 differential alleles was compared with the characteristic band. The purity of Shangdong 8502 taro was 98.9% (89/90).

The bands of all 90 Wuhan arrowhead samples were regarded as the same characteristic band of the cultivar, so the 90 samples were identified as the same variety. The purity of Wuhan arrowhead was 100% (90/90).

Genetic diversity analysis of three important aquatic vegetables

A phylogenetic tree of three important aquatic vegetables was constructed (Fig. 1). The Jaccard similarity coefficient ranged from 0.50 to 0.99. Twenty-two individual plants were separated into three major groups, I, II, and III, at a Jaccard similarity coefficient level of 0.65 in sacred lotus (Fig. 1A). The results showed that there was genetic differentiation between lotus with flowers and lotus with roots; lotus with roots was concentrated in group I, while lotus with flowers was mainly concentrated in group III. Forty-six taros were separated into four major groups at a Jaccard similarity coefficient level of 0.74 (Fig. 1B). Two multi-headed taro Jiangan Baba taro and Dongxiang Goutou taro were clustered together, which indicated that there was obvious genetic differentiation between multi-headed and other taro cultivars. Some head taros clustered together in group III, and some clustered together with other types of materials; more than half of the multi-cormels taros were clustered in group II. Arrowhead was separated into two groups (Fig. 1C). A dendrogram with a scale from 0.70 to 0.98 based on Jaccard’s similarity coefficient was constructed and clearly separated the 8 accessions from Jiangsu and Zhejiang provinces into 4 main clusters, while the remaining Guangxi sand arrowhead and Guangzhou arrowhead were classified into group II.

Figure 1
figure 1

Dendrogram showing the genetic relationships of three important aquatic vegetables. (A) Genetic relationships among 22 individual sacred lotus plants based on EST-SSR markers. (B) Genetic relationships among 46 individual taros based on EST-SSR markers. (C) Genetic relationships among 10 arrowheads based on EST-SSR markers.

Discussion

DNA fingerprints are generally constructed by three methods: the characteristic locus method, the single primer method and the core primer combination method, the most efficient of which is the core primer combination method4,19,20,21,22. Core primers can play important roles in preliminary research, generally leading to a good polymorphism frequency, stability, reproducibility and discriminability. Identifying the the core primers of sacred lotus, taro and arrowhead is a key step for SSR fingerprint construction and purity identification, and it is also a prerequisite for the commercial application of SSR fingerprints. In this study, polymorphic SSR primers were first screened as alternative primers, and a set of SSR primers were found to be the core primers for these germplasm resources, with the smallest number that could completely distinguish all germplasm resources. According to the expansion and complexity of samples and genomes, the number of core primers should be adjusted accordingly. For variety identification, it is better to identify more samples with fewer primers; therefore, the ideal method is to distinguish the most varieties with a single primer. However, this was difficult to achieve, even in arrowhead, the cultivar with the smallest number. As a result, the core primer combination method was adopted in this study to construct fingerprints. In this study, 6 core primers and 3 candidate primers were selected for sacred lotus, 20 core primers for taro, and 3 core primers and 9 candidate primers for arrowhead. Then, 17-, 87- and 14-bit binary molecular identification cards were successfully constructed for 22 sacred lotus cultivars, 46 taro cultivars and 10 arrowhead cultivars, respectively. The DNA fingerprint of sacred lotus was more useful than those of the others23,24. The construction of SSR fingerprints for these three important aquatic vegetables is still in its infancy, and in particular, SSR fingerprints for taro and arrowhead have not been reported. SSR fingerprints were constructed by the core primers combination method, which provides useful methodological guidance for the construction of a standard DNA fingerprint database and performing aquatic vegetable mapping analysis in the future.

As a kind of molecular marker with the advantages of high allelic variation, codominance, simple and rapid detection, and good stability, SSRs have been maturely applied in many regions, such as genetic diversity analysis, fingerprint construction, trait marker and genetic linkage map construction. Many agronomists and geneticists have carried out extensive research and application of SSRs25. For example, 30 pairs of core primers were used to construct a fingerprint and test the authenticity and purity of Zhongmian Institute 63 cotton. Compared with field trial identification, the results showed that 30 pairs of core primers achieved good identification26. From 36 pairs of primers, 3 core primers, MCPI-5, MCPI-16 and MCPI-17, were selected. Identified by MCPI-5 and MCPI-16, the purity of 267 watermelon T-1 hybrids was 97.75%, consistent with the field morphological identification results. Hence, the core primers selected could be used for purity identification of watermelon T-1 hybrids27. Other crops, such as rice28, rape29, potato30, maize31, eggplant32,33, cabbage34,35, and soybean36, have also been able to be identified.

EST-SSR markers are new molecular markers based on the expression of sequence tag data combined with the characteristics of SSR. These markers combine the dual advantages of SSR and EST and expand the application prospects of SSR. In 2009, EST-SSR markers were applied to the identification of safflower purity, which became the first successful example of the use of EST-SSR markers for purity identification of a crop hybrid population37. Subsequently, studies of the purity identification of rice, cabbage, melon (cucumber, melon, watermelon) and other crops have been carried out. The results showed that EST-SSR markers are superior and more sensitive than plant morphology identification in the field, and the purity of detection is generally 2–3% higher than the results from the field34,38,39,40. However, the most commonly used method for the authenticity and purity detection of aquatic vegetables is still morphological identification, which is accomplished by comparison with standard morphological characteristics or determination of morphological characteristic consistency in tested varieties. Due to the limited molecular genetics research, there are a few available SSR markers and no reports of the use of EST-SSR markers to detect the authenticity and purity of aquatic vegetables. Therefore, based on lotus, taro and arrowhead cultivars from all over China, the purity of No.36 space lotus, Shandong 8502 taro and Wuhan arrowhead was determined by EST-SSR and core primer PCR amplification. The purities of these three cultivars were 93.3%, 98.9% and 100%, respectively. It can be seen that these varieties all developed with rigorous breeding and screening. With less genetic variation and higher consistency, the three important aquatic vegetables were close to sacred lotus, taro and arrowhead pure varieties. Because of the genetic differentiation in this cultivar, No.36 space lotus with lower purity is a hybrid cultivar.

In this study, the genetic relationship of a lotus germplasm resources is not related to its geographical origin, as reported in previous reports41. At present, lotus can be divided into three types according to its morphological characteristics and its main parts, namely, lotus with seeds, lotus with flowers and lotus with roots41,42,43. The differentiation of lotus with roots and lotus with flowers in this study was consistent with previous research44. There is a certain correlation between the genetic relationship and phenotypic traits of lotus. This type of diversity evaluation will provide important reference materials for breeding research of these cultivars in the future. Forty-six taros were separated into four major groups. The main branches were a mixture of materials from different regions and types, and this result indicated that there is no obvious correlation between morphological characteristics or geographic distribution and kinship, possibly due to the cultivar classification of taro based on the variation of its morphological characteristics, which was probably caused by mutations of a few major genes. There is a certain correlation between the regional differentiation of arrowhead cultivars and their genetic differentiation. In Jiangsu and Zhejiang provinces, which belong to the Yangtze River basin, there is a certain differentiation of arrowhead cultivars, and the results of this study suggest that arrowhead is limited by region, especially water limitation. Lower gene exchange led to abundant genetic diversity, which was beneficial to arrowhead breeding. This study can server as a significant reference for the authenticity and purity detection of aquatic vegetable cultivars. Moreover, EST-SSR markers have higher species transferability45 and can also be used to study other related species.

Materials and Methods

Plant materials

Twenty-two sacred lotus cultivars were used to construct SSR fingerprints, including 14 sacred lotus with seeds cultivars, 7 sacred lotus with flowers cultivars and 1 sacred lotus with root cultivar (Table 9). Forty-six taro cultivated varieties (Table 10) and 10 arrowhead cultivars (Table 11) were used to construct SSR fingerprints.

Table 9 Twenty-two lotus varieties used in this study.
Table 10 Forty-six taro varieties used in this study.
Table 11 Ten arrowhead varieties used in this study.

Extraction of genomic DNA from three important aquatic vegetables

From the above young leaves of these materials, genomic DNA46 was extracted by the modified CTAB method. Among the materials listed above, genomic DNA from the varieties used for purity identification was collected and extracted from the young leaves of 90 plant samples. DNA was extracted by 1.0% agarose gel electrophoresis and stored at −20 °C for later use.

Primer amplification

SSR marker detection was performed using a Perl program known as MicroSatellite (MISA, http://pgrc.ipk-gatersleben.de/misa) from the S. sagittifolia transcriptome, C. esculenta transcriptome and N. nucifera transcriptome. Using effective EST-SSR primers developed in our laboratory, genomic DNA was amplified by PCR. The primer sequences for sacred lotus and arrowhead are shown in supplementary file 1. Based on the obtained bands, EST-SSR markers were selected as the core primers with a high polymorphism frequency that could distinguish all cultivars. These core primers were employed to amplify each of the 90 samples and 5 control samples of genomic DNA of No.36 space sacred lotus, Shandong 8502 taro, and Wuhan arrowhead. Because of the lack of core primers for sacred lotus and arrowhead, 3 and 9 pairs of candidate primers were added to the samples. The reaction mixture (15 μL) contained 1.5 μL of buffer (10 × dilution), 0.8 μL of MgCl2 (20 mM), 0.6 μL of dNTP (10 mM), 0.5 μL of TaqDNA polymerase, 1 μL of forward primer (10 μM), 1 μL of reverse primer (10 μM), 2 μL of template DNA and 7.6 μL of ddH2O. The amplification conditions included an initial denaturation step at 94 °C for 5 min, followed by 35 cycles of denaturation at 94 °C for 30 s, and annealing extension at 54–65 °C, elongation at 72 °C for 30 s and a final step at 72 °C for 5 min. Ten microliters of PCR bromophenol blue buffer was added to the PCR products and placed in the gene amplification instrument (BIOER co.,LTD., Hangzhou, China) at 94 °C to degenerate for 5 min, then quickly cooled on ice and tested with a polyacrylamide gel. The silver staining method was used to display the color, and the electrophoresis results were observed on a slide lamp.

Data statistics and analysis

According to the polyacrylamide electrophoresis results, a band of the same fragment size was recorded as a marker allele and scored as 1, 0 and 2 for a band, no band and a deletion, respectively. The EST-SSR locus allelic variation was counted for sacred lotus, taro and arrowhead, and the statistical data were input into a computer. Data analyses were performed by using the NTSYSpc package version 2.147. The SSR markers were identified as core primers with indicated polymorphisms that could distinguish all cultivars. A set of binary data were obtained from the “0” and “1” data produced by these markers as the binary molecular identity cards of these cultivars.

The characteristic band of a variety is the common characteristic band of most sample individuals. The band of each sample was compared with the characteristic band, and different allele numbers between cultivars (>1) were determined to indicate different varieties, while (=0) was determined to indicate the same variety. P = X/Y × 100%, where X is the number of individuals of the same variety, Y is the total number of individuals identified and P is the value of purity.