Introduction

The superfruit market has had significant growth in the marketplace putting demand on suppliers. Consumer demand for superfoods such as superfruit is increasing and attracting the attention of multiple industry sectors including food products, dietary supplements and natural health products1. Industry has responded to this demand by marketing their products with claims that superfoods mitigate ailments and promote considerable health benefits2,3. The superfruit market alone was valued at 140.3 billion USD in 2025 and is expected to grow over 200 billion USD in the next decade4. This is partly driven by the demand from contemporary consumers that value a healthier and environmentally friendly source of nutrition. Superfoods are highly nutritional with bioavailable nutrients and documented bioactivity within the body due to the concentration of nutrients5. Few fruits meet the criteria of high nutrient value with notable nutrients such as vitamin C and can be sourced at a reasonable cost by large manufactures that focus on global markets.

Acerola berries are designated as a superfruit that can be sustainably grown by producers and sourced in large quantities in the marketplace6. Acerola berries are also known as Barbados cherry and West Indian cherry and are grown commercially as small trees or shrubs. Taxonomically there are two species (Malpighia glabra L.; Malpighia emarginata DC.) from the family Malpighiaceae that contain 45 species7,8. There is mention in the literature of another commercial species Malpighia punicifolia L., which is a synonym for Malpighia glabra. The fruit of these two species is indistinguishable as a red fleshy drupe that produces a juicy acidic pulp. The common names are used inconsistently for the respective species causing some confusion when sourcing acerola ingredients from the global supply chain9. Malpighia glabra is native to the Americas, including Mexico, Central America, South America, and the Caribbean with more recent introduction and cultivation in USA (Florida, California, Texas), Cuba, Jamaica, and Puerto Rico. Malpighia emarginata is originally from Mexico in the Yucatán peninsula, but is now cultivated in Central America, the Caribbean, and South America as far south as Peru10,11 Brazil is the world’s largest producer of acerola berries with most of the production in the northeast states of Pernambuco, Paraíba, Bahia, and Ceará6,12. Both species have been introduced for cultivation in Southeast Asia and southern India. In India, cultivation is reported in backyard gardens across the states of Tamil Nadu, Karnataka, Kerala and Maharashtra. More recently production has been recorded in the Andaman and Nicobar Islands13. Production is relatively small in the southern USA. The largest markets for acerola products include United States, Germany, France, and Japan.

Acerola fruit has considerable nutritional and health benefits. There is a plethora of research on acerola to support pharmacological, medicinal and nutritional claims of which have been well documented in several scientific reviews1,6,14. Acerola health benefits include high antioxidant capacity7,15,16antitumor17antimutagenic18,19,20,21,22,23antidiabetic24hepatoprotective25 and functional properties like skin anti-aging26skin whitening, and multidrug resistant reversal activity12,27. This body of research is supported by many biochemistry studies that have recorded significant phytonutrients such as flavonoids, anthocyanins, carotenoids, phenolics, benzoic acid derivatives and phenylpropanoids within comprehensive scientific reviews28,29. The presence of these bioactive compounds provides multiple hypothetical mechanisms to support the numerous health claims27,28. This includes various biological activities using in vitro and in vivo models in which probable mechanisms have been determined14. One of the most studied areas of research underpins the claim that acerola is one of the richest natural sources of ascorbic acid up to 100 times more than that of oranges and lemons14,30,31,32. It has been reported that the vitamin C of acerola is more easily absorbed than other forms including synthetic ascorbic acid27,33. Acerola berries are consumed and prepared as a nutritional drink and traditional remedy in Brazil. In Europe (e.g., France, Germany, and Hungary) the fruit is utilized as juice, whilst in multiple forms within the United States where it is consumed as a rich source of ascorbic acid (vitamin C) within supplements and natural health products27.

There is a lack of published research on quality assurance tools in the acerola industry. These QA tools are needed to ensure species ingredient authenticity and prevent adulteration with other cheaper fruit species. Morphological techniques are inadequate because they cannot differentiate among the species of Malpighia that are traded as berries. DNA-based markers have become a popular method for identifying and verifying botanical ingredients. This is because the genetic makeup of each species is unique, regardless of factors such as sample age, physiological condition, environmental growth conditions, harvesting methods, storage, and processing34. The DNA extracted from the fruit, leaves, stems or roots of plants all carry the same genetic sequences of which can be stored for long periods of time as they are stable35. Multiple DNA regions have been utilized to develop specific markers for identifying and authenticating botanical ingredients36,37. Several recent reviews provide the benefits and challengers of using DNA-based methods for quality assurance of botanical ingredients38,39,40,41. The use of chemical authentication has been used extensively for quality assurance of botanical ingredients42. Traditional analytical chemistry methods are well established and the advantages and limitations are discussed in numerous publications43,44,45. More advanced methods have been utilized providing rigorous approach to quality assurance of highly processed and mixtures of botanical ingredients that overcomes the challenges of traditional analytical methods46,47,48. The development of orthogonal molecular methods for ingredients with degraded DNA in processed samples allows an alternative for quality assurance testing using metabolomic methods such as nuclear magnetic resonance (NMR)49,50,51,52,53,54. The use of phytochemical analysis and metabolomics methods such as chemical profiling by NMR are fit-for-purpose in identifying the active ingredients labelled on herbal products55,56. Analytical chemistry and DNA methods are not fit-for-purpose when evaluating complex samples, such as herbal dietary supplements that can include a mixture of several plant species in one product36,43,54,57. Next generation sequencing has been suggested as DNA-based tool for admixtures, but there are considerable challenges preventing this method application including: requirement for high quality DNA, sensitivity issues (contamination), quantification inaccuracy, need for complex bioinformatics, reference database limitations, inter-species DNA competition, PCR bias and primer limitations58,59,60. Traditional analytical chemistry utilizes targeted methods for each molecule of interest. Most QA requirements need to assess several molecules of interest requiring a multi-test approach costing considerable time and funding. Analytical chemistry methods are limited by ingredients that are mixtures of species due to several critical issues including: similarity in chemical signatures, incomplete profiling, chemical marker dependence, and processing induced variation causing inconsistent results43,48. NMR methods overcome many of these barriers and have a wide range of applications in food and botanical industry, as ingredients authentication and quantification, including active ingredient quantification and differentiating the geographic origins ingredients61,62,63,64,65,66,67,68. NMR fingerprints use many metabolites to identify species overcoming the barrier of traditional analytical chemistry analysis that focuses on a single molecule that is not species specific44. The extraction methods used to acquire metabolite profiles are robust providing consistent NMR fingerprints with sufficient variation to discriminate species and sometimes geographic variants63,64,65,66,67,68. Although NMR methods cannot be used to identify multiple species in a mixture, it can be used to identify the existence of an unknown adulterant species. This is because NMR fingerprints are specific to each species ingredient and the fingerprint would be considerably different if another species was added. These molecular methods need to be assessed in the context of specific species ingredients to determine fit-for-purpose in the quality assessment of sourced botanical ingredients.

The objective of this study is to develop of Nuclear Magnetic Resonance fingerprints and DNA sequence methods to distinguish Acerola species (Malpighia glabra, M. emarginata) for quality assurance of Food, dietary supplements and natural health products. More specifically we used two methods including, (1) DNA sequencing the ITS nuclear region for 16 samples and, (2) 1 H-NMR metabolite fingerprinting to analyze 16 samples consisting of 10 samples of two Acerola species (Malpighia glabra, M. emarginata) and 6 samples of common adulterant cherry (Prunus spp.) species. The validation of genomics and metabolomics approaches in this study may be valuable tool for quality assurance assessment of both product identity and purity.

Materials and methods

Botanical samples

This study developed two separate orthogonal methods using 32 Samples including, (1) DNA sequencing of 16 samples representing two acerola species (Malpighia glabra, M. emarginata) of which 13 were leaf samples and 3 were berry samples (Table S1), and (2) NMR metabolite fingerprinting of 16 samples representing 10 berry samples of two acerola species (Malpighia glabra, M. emarginata) and 6 berry samples of cherry representing two species (Prunus cerasus L.; Prunus avium (L.) L.) (Table S2). Only berry samples were used for NMR metabolite fingerprinting representing the form of acerola product in the supply chain because NMR fingerprints are known to be different among leaves and fruit69,70. It is known that DNA sequences for specific regions such as ITS do not change for different tissues such as leaves and fruit. Therefore, we used mostly leaves in this study for DNA sequencing. However, we did sequence five vouchers (e.g., leaves and berries) from the NMR study including samples for each Malpighia species to verify that the berries used for NMR metabolite fingerprinting were the correct species (Table S1). We did not attempt to sequence the cherry samples due to the known challenges of barcoding cherry (Prunus spp.) in the published literature including ITS and other regions71,72,73. The identification of the herbarium voucher samples was confirmed by qualified taxonomist on staff at the University of Guelph; the voucher metadata are listed in supplementary Table S3. All samples are housed at the Natural Health Products Research Alliance, University of Guelph.

Genomic DNA extraction

Genomic DNA from the samples were extracted using the Nucleospin Plant II kit (Macherey- Nagel GmbH & Co. KG, Düren, Germany) to get high-quality DNA. DNA extractions were conducted using 100 mg of each sample, following the manufacturer’s instructions. DNA quantification was carried out with the QubitTM 3.0 Fluorometer (Invitrogen, Carlsbad, CA).

DNA amplification of ITS

The ITS region was used because of its ability to discriminate con generic species3,74,75,76. Alternative DNA regions were considered but due to well documented limitations of other DNA regions, the ITS region was used in this study3,77,78,79. PCR was performed using species-specific ITS primer pairs as described by Fazekas et al. (2012)80. The reaction mixtures (20 µL) contained 1 U AmpliTaq Gold Polymerase, GeneAmp buffer II (100 mM Tris-HCl pH 8.3 and 500 mM KCl), 2.5 mM MgCl2, 0.2 mM dNTPs, 0.1 mM of each primer, and 20 ng of template DNA.

DNA sequencing of ITS region

The ITS amplified products were sequenced in both directions following the protocols established at the University of Guelph Genomics Facility (www.uoguelph.ca/~genomics). The products from each specimen were purified using Sephadex columns and analyzed on a ABI 3730 sequencer (Applied Biosystems). Bidirectional sequence reads were acquired for all the PCR products. The sequences were assembled using CodonCode Aligner version 11.0.1 (CodonCode Corp) and manually aligned with BioEdit version 7.0.9. Subsequently, the aligned sequences were processed using Clustal W81. The genetic distances were calculated using the Kimura2Parameter (K2P) model in Mega11 89. 18 DNA sequences from GenBank were used to assess the specificity of the ITS region and compare interspecific variation among the two species of Malpighia including, M. glabra, M. emarginata. These 18 sequences were used in a BLAST analysis within GenBank to assess the specificity of these sequences to other Malpighia species and that of any potential adulterants such as Prunus species.

NMR sample Preparation

Samples for NMR analysis were prepared by weighing 300 mg of finely ground plant tissue (Table S2). To ensure complete homogenization, the plant material was ground in liquid nitrogen using a mortar and pestle, then dissolved in a solvent mixture of 90% regular methanol and 10% deuterated methanol (CD₃OD) for its broad solubility range, which aids in NMR acquisition83,84. Each sample was prepared in triplicate for consistency, sonicated in a water bath at room temperature for 10 min to enhance extraction, and centrifuged at 6000 rpm for 5 min. Finally, 650 µL of the clear supernatant was transferred into a 5 mm Wilmad® NMR tube for spectral acquisition.

NMR spectral acquisition

The proton (¹H) NMR spectra were recorded on a Bruker Avance III 400 MHz spectrometer, which features a 5 mm Broadband Inverse (BBI) probe designed for room temperature use. We used a proton NOESY (Nuclear Overhauser Effect Spectroscopy) pulse sequence with pre-saturation to improve the results. This method included dual solvent suppression for both water and methanol, helping to reduce any interference from the solvent signals. The key acquisition parameters included: Spectrometer frequency: 400.3 MHz; Number of scans: 64; Acquisition time: 2.27 s; Relaxation delay: 12.73 s; Spectral width: 8223.68 Hz. The use of dual solvent suppression was utilized to improve the spectral quality by reducing peak overlap from residual solvent signals, enhancing metabolite detection in complex biological matrices85. Temperature stabilization was maintained at 300 K throughout data acquisition.

NMR data processing and analysis

In this study, NMR spectral data were processed with TopSpin 3.6.3 for consistency and accuracy, using automated phase and baseline corrections and referencing the tetramethylsilane (TMS) peak at 0.00 ppm for calibration. A rectangular binning approach with a bin width of 0.01 ppm was utilized over a chemical shift range from − 1 to 12 ppm, while residual solvent peaks corresponding to water (4.75–5.06 ppm) and methanol (3.16–3.45 ppm) were excluded to minimize spectral distortion. Each spectrum was normalized via a scaling method wherein values below the mean intensity were set to zero, and intensities above the mean were scaled to fit within a range of 1 to 100. For statistical analysis, we performed multivariate analyses, including Hierarchical Clustering Analysis (HCA) and Hierarchical Clustering on Principal Components (HCPC), based on an Euclidean dissimilarity matrix while employing Ward’s clustering method in R86. We converted spectral intensities into chemical fingerprints, which allowed us to use hierarchical clustering to distinguish between different botanical species. This NMR-based metabolomics approach provides reliable differentiation and ensures high reproducibility for future analyses.

Results

DNA extraction and sequencing

The DNA extraction protocols yielded good quality, high molecular weight genomic DNA from the dried berry and leaf samples of M. glabra and M. emarginata. The procedure yielded 40–60ng of DNA per ng/ul of leaf tissue and for a whole dried berries and powders yielded 2 to 4.5 ng/ul. The amplification of the complete ITS region (comprising ITS1, the 5.8 S rRNA gene, and ITS2) was successfully achieved using the universal primers ITS5 (forward) and ITS4 (reverse). This process generated an amplicon of approximately 700 bp for all three species investigated.

Direct sequencing of amplicon yielded a ~ 700 bp ITS sequence for M. glabra and M. emarginata. GenBank accession numbers are recorded (Table S1). Searches and BLAST analysis in GenBank indicated that the sequences of M. emarginata matched that of those in GenBank and that those sequences for M. glabra were novel and added to the GenBank library. Alignment of all the sequences revealed that there was enough interspecific variation to distinguish all the samples of M. glabra from M. emarginata. Interspecific variation among the two species was 1% including seven species specific single nucleotide polymorphisms (SNPs) within the 700 bp nucleotide sequence. Sequence variation indicated that the ITS1, 5.8 S rRNA gene and ITS2 regions of each respective species were unique. BLAST analysis revealed that all the acerola species sequences did not match any other potential adulterant species including cherry (Prunus spp.).

NMR fingerprinting

Metabolite diversity among the NMR fingerprints was considerable. Hierarchical clustering analysis (HCA) of the ¹H NMR spectral data was used to assess metabolite diversity within a heatmap of metabolite diversity (Figure: 1). The resulting dendrogram clearly distinguishes the samples of M. glabra and M. emarginata highlighting their distinct metabolite profiles. However, M. glabra exhibits closer metabolic resemblance to Prunus spp. compared to M. emarginata, suggesting a greater risk of potential adulteration. The hierarchical clustering of cherry samples groups respective Prunus species from all acerola species including distinct differences from M. glabra based on their metabolic composition. Clusters labeled A and D correspond to M. glabra and M. emarginata, respectively, while clusters B and C include Prunus spp. The distinct clustering of M. emarginata underscores its unique metabolite profile, which is markedly different from M. glabra and Prunus species. This separation suggests that while M. glabra may share some chemical characteristics with Prunus, M. emarginata remains phytochemically distinct.

Fig. 1
figure 1

Hierarchical clustering of NMR fingerprints. Color key represents spectral intensity represented as variance in metabolites among the taxa.

Comparative ¹H NMR spectral analysis

Prunus spp. vs. Malpighia spp. ¹H NMR spectra of M. emarginata, Prunus cerasus, and Prunus avium demonstrate clear differences in metabolite distribution (Figure: S1). Although some spectral overlaps exist, there is key variation observed in the regions associated with carbohydrates (3.0–5.5 ppm), organic acids (2.0–3.0 ppm), and aromatic compounds (6.0–8.0 ppm). The most notable differences include:

  • The prominence of organic acids such as malic acid (2.4–3.0 ppm) in M. emarginata, which is significantly reduced or absent in Prunus spp.

  • Higher concentrations of ascorbic acid (4.0–4.3 ppm) in Acerola, confirming its known vitamin C content.

  • The presence of ferulic acid (6.5–7.5 ppm) in M. emarginata, a unique authentication marker absent in Prunus spp.

These spectral differences demonstrate the use of ¹H NMR fingerprinting as a reliable method for species authentication, ensuring that Malpighia species can be accurately distinguished from potential adulterants such as cherry (Prunus spp.).

Metabolite assignments and authentication markers

Elucidation of specific metabolites identifies differences among the key metabolite assignments for M. glabra and M. emarginata, respectively (Figs. 2 and 3). The assigned peaks correspond to major bioactive compounds that play a role in species differentiation (Figs. 2 and 3). The presence of ferulic acid and amino acids in M. emarginata sets it apart from M. glabra and Prunus spp. (Figs. 2 and 3).

Fig. 2
figure 2

Bioactive molecule elucidation of M. glabra. Labelled peaks correspond with metabolite molecular structure.

Fig. 3
figure 3

Bioactive molecule elucidation of M. emarginata. Labelled peaks correspond with metabolite molecular structure.

The metabolites identified in M. glabra include α-D-glucose (5.2–5.4 ppm), β-D-glucose (4.6–4.8 ppm), fructose (4.0–4.5 ppm), ascorbic acid (4.0–4.3 ppm), and malic acid (2.4–3.0 ppm). In comparison, M. emarginata metabolites encompass ferulic acid (6.5–7.5 ppm), notably absent in Prunus spp., along with α-glucose (5.2–5.4 ppm), β-glucose (4.6–4.8 ppm), fructose (4.0–4.5 ppm), and malic acid (2.4–3.0 ppm), the latter serving as a distinguishing marker. Additionally, amino acids such as valine, leucine, and isoleucine (0.8–1.5 ppm) are specifically identified in M. emarginata31,87. These differences are considerable in the specific identification of M. glabra from M. emarginata in commercial samples.

Discussion

One of the impediments in the acceptance of herbal formulations in the medical and pharmaceutical community is the lack of standardization and quality control. This is due to unacceptable risks to human health and unjustified health claims. It is challenging to develop robust quality control parameters using analytical chemical methods because of the complex nature and inherent variability of the chemical ingredients and molecular constituents used in supplements and natural health products, (World Health Organization 2000). These challenges including that of other conventional pharmacognosy methods such as morphology, microscopy, DNA-based and phytochemical analysis have led to the demand for more robust, fit-for-purpose methods of authentication of botanical species ingredients3,88,89,90. The need for more rigorous QA methods is also driven by consumer demand that seek pure, unadulterated natural remedies and nutrition that support health lifestyles without the use of drugs and synthetic health supplements91.

There are considerable challenges in authentication of acerola species ingredients from the commercial supply chain. Our experience and collaboration with industry members working directly with procuring M. glabra and M. emarginata materials indicates that these species cannot be distinguished based on their morphology or histology. Conventional analytical chemistry methods are often providing inconclusive reports. For example, high performance thin layer chromatography (HPTLC) analysis of acerola berries of M. glabra and M. emarginata often have identical or closely overlapping profiles with similar or highly variable acerolin content depending on product processing used by different suppliers (NHPRA members pers. Comm.). More recently the industry has moved to trading fruit as dried extracts of which there is a lack of standard methods for quality assurance protocols such as reference standards and product species verification. This problem is intensified when considering that acerola is considered a superfruit in high demand due to its high vitamin C content of which is highly perishable in transit and storage92,93. Dehydrating acerola juice using a spray dryer has become a common solution in the supply industry to extend the product’s shelf life and subsequent profit margins94,95. The NHP community has repeatedly stated there is a gap in acerola quality assurance methods and need for research on the development of new methods.

This study provides two orthogonal molecular methods for the authentication of acerola species ingredients procured in the industry supply chain. The first method utilized common DNA sequencing. Several studies have used ITS sequences as genetic markers for many botanical species3,74,75,76,96. The sequence variation of ribosomal RNA gene repeats is believed to rapidly turn over thus providing considerable interspecific variation among congeneric species97. Universal PCR primers designed from highly conserved regions flanking the ITS region are available and are relatively small in size (600–700 bp) with high copy number provide relative ease of amplification of the ITS region. In our study we had successful sequencing and found sufficient variation to distinguish the two commercial acerola species. However, the materials we used contained considerable amounts of DNA unlike that of highly processed extracts, which have considerably less DNA presenting challenges for PCR and sequencing34,98. Another consideration for extract materials is the presence of PCR amplification inhibiters such as concentrated phenolic compounds, acidic polysaccharides and pigments, which can be overcome by choosing appropriate DNA extraction methods that can reduce or eliminate PCR inhibitors35. Further research is needed to develop species specific probes based on unique SNPs for M. glabra and M. emarginata that could be used within specific QPCR methods. The use of DNA probes has been successfully used for species specific verification of highly processed supplements in the form of leaf or berry extracts50,53. ITS sequences were unique to commercial acerola species and not potential adulterants such as cherry species. However, standard operating protocols aimed at authenticating target species ingredients and preventing adulteration would require full ITS sequencing, which requires considerable resources (e.g., time, cost). If the materials are highly processed extracts, then targeted QPCR would be required including specific QPCR assays for all the potential adulterants, which is resource intensive (e.g., time, cost).

In this study, ¹H Nuclear Magnetic Resonance (NMR) metabolomic fingerprinting combined with chemometric clustering distinguished all the samples for two Acerola species, M. glabra and M. emarginata, from potential adulterants belonging to the Prunus genus (Prunus cerasus and Prunus avium). The results provide a metabolic fingerprint that can serve as a scientific authentication tool based on primary and secondary metabolite composition. ¹H NMR fingerprinting, coupled with multivariate chemometric analysis, provides a authentication tool for differentiating botanical species. Unlike chromatographic techniques, which can be manipulated by introducing synthetic analogs, NMR-based authentication relies on the holistic metabolite spectrum, making it more resistant to adulteration as it also can differentiate synthetic ingredients85. The authentication of commercial acerola is particularly significant due to its high economic value and susceptibility to adulteration with cherry derivatives. The hierarchical clustering of metabolites indicates that while M. glabra exhibits some spectral resemblance to Prunus spp., the spectra are considerably different as M. emarginata remains chemically distinct. By integrating metabolite-specific peak ratios, particularly malic acid and ascorbic acid content, a more rigorous authentication framework can be established. This approach of provides both a species ingredient identification tools while providing verification of specific bioactive metabolites that are of value to health minded consumers. The elucidation of specific bioactive metabolites is particularly valuable for product claims and specific ingredients on product labels.

Conclusion

This study provides two novel methods for the quality assurance of acerola species procured in the industry supply chain. Previously, there were no published methods for the authentication of acerola species used on commercial products. The DNA methods developed in this study have several limitations for berry extracts due to the lack of high quality DNA. These DNA methods may be cost prohibitive if the goal is to detect species adulteration from other taxa because multiple species test would be required for each potential adulterant. NMR fingerprinting developed in this study can be used for berry extracts and has the benefit of authenticating acerola species ingredients whilst detecting any adulterants in a single analysis. The resulting NMR spectra also provide quantitative estimates of bioactive phytochemicals negating the need for further analytical chemistry methods. However, there is limited adoption of NMR methods in the quality control of botanical ingredients, which poses a financial barrier to broad use of these methods. Further research is needed to develop more NMR fingerprinting methods for other botanical species ingredients. This study highlights the potential application of NMR fingerprinting as a critical quality control measure in the natural health product and food industry.