Introduction

Medicinal plants are valuable sources of phytochemicals that are utilized for the treatment of both infectious and non-infectious diseases1,2,3. Research studies have demonstrated the potential of plant extracts, particularly aqueous, ethanolic extract, and even hydroethanolic extract, as promising sources of antiviral agents, as they contain hydroxybenzoic acids, hydroxycinnamic acids, phenylpropanoids and terpenoids, which possess multiple-target actions disrupting various stages of viral life cycles, including attachment, entry, replication, spread, and even particle maturation4,5,6,7. An aqueous extract of Toona sinensis, for instance, primarily containing gallic acid and catechin, was found to disrupt multiple stages of the influenza virus (e.g., entry, spread, and replication) by concurrently targeting viral machinery proteins such as neuraminidase (NA), hemagglutinin (HA), and RNA polymerase7,8,9. Phenolic substances isolated from Aronia melanocarpa, including isoquercetin, kaempferol, ferulic acid, caffeic acid, ellagic acid, and myricetin, have been shown to inhibit the growth of influenza viruses, while also improving the survival rate of mice infected with the rPR8-GFP virus due to their antioxidant and anti-inflammatory properties10,11. These results prompt us to seek indigenous plants that can provide a diverse range of secondary metabolites as new, safe, affordable, and effective antiviral agents.

Caesalpinia mimosoides Lamk is a member of the Caesalpiniaceae family, where its young aerial parts, such as shoots and leaves, are commonly consumed as fresh or cooked vegetables in daily diets12. As per traditional medicine, it has also been used as a carminative to alleviate dizziness and fainting, indicating its potential therapeutic value and safety. Despite exhibiting various promising biological activities such as anticancer, antibacterial, antioxidant, anti-inflammatory, and neuroprotective effects13,14, research on the antiviral properties of medicinal plants in this family has been relatively limited. Although its virucidal effects need further exploration, our recent study demonstrates that the aqueous extract of C. mimosoides, primarily containing gallic acid and other shikimate-derived products (e.g., benzoic acid, quinic acid, shikimic acid, protocatechuic acid), can inhibit the influenza A/PR/8/34 virus with an impressive IC50 of 5.14 µg/mL (CC50 > 100 µg/mL) without inducing hemolytic effects on human red blood cells (hRBCs) at concentrations up to 800 µg/mL15. Molecular docking studies also suggested that these compounds preferentially interacted with the oseltamivir recognition site and the 430-hydrophobic cavity of NA structure, thereby confirming the obtained results. Likewise, alcoholic extracts of C. minax and C. latisiliqua, both belonging to the same genus, have been found to exhibit significant antiviral activities due to the presence of flavonoids and stilbenes, whose IC50 values ranging from 5.09 to 12.20 µM16. Owing to the potential of sharing an identical chemotaxonomic classification among the three plant species, there is growing interest in further developing C. mimosoides extracts with a myriad range of phytochemicals, ultimately acquiring more effective antiviral agents for pharmaceutical use.

High-resolution LC-MS/MS plays a crucial role in metabolomics analysis due to its exceptional capacity and sensitivity in detecting and tentatively identifying a wide range of metabolites in complex mixtures through untargeted analysis17,18. Several computational tools are developed for the structural annotation of natural products to enhance confidence levels in metabolite identification, primarily based on submitting raw mass spectra19,20. Of much software developed, SIRIUS, can derive the correct molecular formula of queried subjects mainly through the high-resolution isotopic pattern analysis21. Additionally, it is integrated with two essential tools, CSI: FingerID web service and CANOPUS, to provide more details about probable structures, substructures, and classifications of the query metabolites22. Using multiple software tools enables high-level structural annotation of bioactive constituents such as gallic acid, quinic acid, shikimic acid, pyrogallol, protocatechuic acid, bergenin, and various chain-length dicarboxylic acids found in the C. mimosoides aqueous extract15. Wang et al.23 utilized computational tools to identify numerous phytochemicals in Salvia miltiorrhiza Bunge extract, identifying 34 tanshinones, 31 phenolics, and two new salvianolic acids for the first time via their untargeted metabolomics data24. This approach also enabled detailed structural annotation of 7 unknown and 17 known bioactive metabolites in the extract of Urtica dioica leaves25. For these reasons, it is thus crucial to adopt computational mass spectrometric approaches to structurally annotate bioactive constituents in the hydroethanolic extract of C. mimosoides, subsequently benefit in postulating precise antiviral mechanisms and identifying lead compounds for upcoming drug discovery programs. The objective of this study was to formulate a hydroethanolic extract of C. mimosoides utilizing leaves, stalks, and trunks (30% v/v ethanol solution). The study aimed to assess whether this solvent system would enhance the phytochemical profiling compared to the conventional aqueous extract. Additionally, the study evaluated the extract’s antioxidant properties, identified gallic acid as a representative marker using HPLC analysis, and followed by untargeted metabolomics analysis (UPLC-ESI(±)-QTOF-MS/MS) to identify flavonoids and other active constituents such as terpenoids and alkaloids. The structural annotation of the deduced metabolites was performed using a comprehensive assessment of in-silico tools, including MetFrag, SIRIUS (v.5.6.8), CSI: FingerID, and CANOPUS, to achieve a high level of confidence in metabolite annotation (level 2), in accordance with metabolomics initiative guidelines26,27. As proposed earlier, we assessed the anti-influenza A/PR/8/34 virus and hemolytic activities of the hydroethanolic plant extract to ensure its efficacy and safety. Molecular docking against three viral machinery proteins, HA, NA, and PB2, was employed to elucidate the potential mechanisms responsible for antiviral activity.

Materials and methods

Preparation of C. mimosoides aqueous-ethanolic extract

Fresh aerial parts of C. mimosoides, including leaves and stalks, were harvested from an open field in Nam Phong subdistrict, Khon Kaen province, Thailand, at coordinates 16°37’59.9” north latitude and 102°46’14.3 east longitude. The harvested plant materials underwent thorough washing with tap water and double soaking in deionized water for 60 min each to ensure complete removal of any unwanted residues. Subsequently, they were dried at 50°C for 2–3 days using an oven incubator, ground, and milled into fine powder using a blender. A hydro-ethanolic extract was prepared by taking 20 g of plant material, placing it in a 500 mL Erlenmeyer flask, and extracting it with 200 mL of 30% v/v hydroethanolic solution. The extraction process involved gentle shaking at 100 rpm and 25°C for 2 days to minimize the generation of reactive oxygen species (ROSs). The resulting mixture was then centrifuged at 8,000 rpm for 20 minutes at 25°C, producing a clear supernatant that was carefully transferred into a round-bottom flask. Approximately 100 mL of ethanol was evaporated using a BUCHI Rotavapor R-3 evaporator, followed by removal of the remaining water using a freeze dryer at -110°C (CoolSafe, Labogene, Lynge, Denmark) for 48 hours. The freeze-dried extract was subsequently stored at -80°C until analysis.

Total phenolic content (TPC)

Total phenolic content was determined using Folin and Ciocalteu (1927) method with some modifications28 and the reaction took place in a 96-well plate. An aliquot (20 µL) of the extract, purified GA (1) and quercetin (> 95% purity Sigma-Aldrich, St. Louis MO, USA), or blank (deionized water) was carefully loaded into each well. After that, 100 µL of 0.2 M Folin–Ciocalteu reagent and 80 µL of 7% (w/v) sodium carbonate were added. Each set of samples was run in triplicate. The mixtures were incubated at 25 °C in constant darkness for 30 min before being analyzed using a microplate reader (EnSight Multimode Plate Reader, PerkinElmer, USA) at 760 nm. The presence of phenolics was calculated as gallic acid and quercetin equivalents (GAE) per gram of dry weight.

DPPH assay

DPPH (2,2-diphenyl-1-picrylhydrazyl) assay was used to analyze the antioxidant property of the C. mimosoides hydroethanolic extract, as this property is closely associated with its antiviral activities. Gallic acid (Sigma-Aldrich, USA) and quercetin (Sigma-Aldrich, USA) were the reference standards DI water and absolute ethanol used as blank samples, respectively. Samples were mixed with 100 µl of 0.2 mM DPPH reagent, reactions were conducted in a dark environment for 30 min, and absorbance was measured at 517 nm using a microplate reader (EnSight Multimode Plate Reader). Results were expressed as IC50 values and/or percentages of DPPH radical scavenging activity. The inhibition ratio (%) was calculated as follows: (Acontrol - Asample) x 100/Acontrol, where Acontrol is the absorbance of the blank samples (DI water and ethanol), and Asample is the absorbance of the reaction with the sample solution.

HPLC analysis of gallic acid

Gallic acid was selected as a representative chemical marker due to its reported anti-influenza activity29. High-performance liquid chromatography with a diode-array detector (RP-HPLC-DAD) was employed to qualitatively and quantitatively analyze the presence of gallic acid in the C. mimosoides hydroethanolic extract. The analysis utilized an HPLC system from Agilent Technologies 1260 Infinity coupled with a PDA detector. Separation was achieved using a C-18 column (100 mm × 4.6 mm, 5 μm particle size, Phenomenex, USA). The mobile phases consisted of 0.5% phosphoric acid (solvent A) and methanol (solvent B). A linear gradient of solvent B was applied for metabolite separation, starting with 5% B for the first 2 min, increasing to 95% B over 15 min, holding at 95% B for 3 min, and returning to 5% B for the final 10 min, resulting in a total run time of 30 min at a flow rate of 0.8 ml/min. The sample and column temperatures were maintained at 25 °C, and detection was performed at 270 nm using a UV detector. Quantitative analysis was based on the five-point standard curve of gallic acid (62.5–1000 µg/mL) with the equation y = 68.215x + 1534.2 (R² = 0.9976).

UPLC-ESI(±)-QTOF-MS/MS

Untargeted metabolomics analyses both negative and positive ion modes were implemented to gain full details of phenolics and non-phenolic compounds in the extract. An ultrahigh-performance liquid chromatograph (UHPLC) (UltiMate 3000 RSLCnano UHPLC System, Thermo Scientific, US), which is equipped with Acclaim® RSLC120 C18(100 × 2.1 mm, 2.2 μm 120Å, Thermo Scientific, US) column were used in the separation processes. The 0.1% formic acid (solvent A) and acetonitrile (solvent B) were the mobile phases with a flow rate of 0.4 mL/min. A linear gradient of the solvent B as following conditions: 2% for 0–2 min, 0–90% for 1–12 min and hold for 2 min, and back to 2% for 4 min for a total run time of 18 min was implemented. The 1 µL of extract (final concentration of 10 ppm) was separated through the RP-column, where the temperature was maintained at 35 °C. The product identification of a mass spectrometer (TripleTOF6600+, AB SCIEX, Framingham, MA, USA) connected with the HPLC instrument. ESI source conditions were as follows: the ion source gas 1 50, ion source gas 2 60, curtain gas 30, a temperature of 150 °C, with the ion spray voltage floating of -4500 V in negative mode and 5500 V in positive mode. A TOF MS scan range of 100–800 amu. The product ion scan range of 50–800 Da. The scan accumulation time was 0.2 s, while the parent ion scan accumulation time was 0.25. The mass spectra were generated using a delustering potential (DP) of 80 V with the collision energy of 40 ± 10 eV. Metabolite matching was based on spectral similarity with the high-quality MS/MS spectrum deposited in the Natural Products HR-MS/MS library (version 2.0) and NIST 2017, which contains approximately 13,800 substances.

Structural annotation using MetFrag webservice

The raw MS/MS spectra of selected metabolites from C. mimosoides aqueous-ethanolic extract were initially analyzed using MetFrag software (https://msbi.ipb-halle.de/MetFrag/) to obtain detailed information on the neutral formula, mass, and potential structure of the query metabolites. This process involves two steps: retrieving candidates and performing fragmentation setting and processing. First, a specific formula and corresponding m/z value must be provided, where various types of suspected databases to be selected to enhance annotation accuracy. In the case of metabolites originating from plant species, biological databases such as KEGG and NORMAN are strongly recommended30,31, as using the largest structural database (e.g., PubChem) lacks the capability to differentiate compounds possessing the same mass and chemical formula. After retrieving potential candidates, the experimental MS/MS data were matched against the in-silico generated spectra of these candidates. Proposed compounds with an F score close to or equal to 1.0 were primarily considered potential candidates for the query subject.

Metabolite annotation using SIRIUS (v.5.6.8)

The same raw mass dataset of selected metabolites was uploaded into the software. The MS2 level was set to a collision-induced dissociation (CID) energy of 30–50 eV, specifying the precursor ion (e.g., m/z 303.0522) and adduct type. Occasionally, the expected formula (e.g., C15H12O7) was used to improve the precision of substance annotation. After clicking ‘compute’, SIRIUS, CSI: FingerID, and CANOPUS were selected to acquire more detailed information about natural products. Including biological databases like KEGG, NORMAN, PlantCyc, and Natural Products enhances the accuracy of metabolite annotation, especially when the query subjects are derived from natural resources.

Anti-influenza virus screening assay

The influenza virus strain A/Puerto Rico/8/34 (H1N1) was used as a model to test the potential of a hydro-ethanolic plant extract to disrupt the viral cycle in a 96-well plate assay. MDCK cells were seeded at 1.5 × 105 cells/well in serum-free OptiMEM (Invitrogen) with 2 µg/mL TPCK-treated trypsin and incubated at 37 °C with 5% CO2 for 24 h. Assay involved mixing various concentrations of the extract (50–1.56 µg/mL) with viral particles (final concentration 100 pfu/well) at a 1:1 ratio at room temperature for 1 h. The virus-extract mixture was then added to MDCK cells, washed with PBS, and incubated for another 24 h. After fixing with ice-cold acetone, cells were blocked with PBST (PBS containing 0.5% Tween 20) and 2% BSA, followed by incubating with primary mouse anti-influenza A NP-UNLB antibody (BioLegend, 1:2000 dilution) and secondary HRP-conjugated goat anti-mouse antibody (BioLegend, 1:10,000 dilution). A colorimetric assay with TMB (3,3’,5,5’-Tetramethylbenzidine) substrate was conducted at room temperature for 8 min, and absorbance at 450 nm was measured using a microplate reader. Prior to antiviral testing, cytotoxicity was assessed using the MTT assay across a range of extract concentrations (100 to 0.05 µg/mL) to ensure safety and distinguish between extract effects on cell viability and its antiviral activity.

Hemolytic activity assay

Hemolytic activity of the C. mimosoides hydroethanolic extract was evaluated using human red blood cells (hRBCs) as described in our recent publication15. The study protocol was approved by the Ethics Committee of the Faculty of Biological Sciences, Islamic University, Kushtia, Bangladesh (Approval No. ERC/FBS/I.U./2022/09). Written informed consent was obtained from the participant before blood collection, and all experiments followed the guidelines and regulations of the Declaration of Helsinki. Briefly, hRBCs were resuspended in PBS (pH 7.4) at a final concentration of 4% (v/v). Due to the fragile nature of hRBCs, the suspension was carefully transferred to a new tube, mixed with 10 µl of the plant extract, and incubated at 37 °C for 60 min. Clear supernatants were obtained by centrifugation at 10,000x g for 5 min and immediately transferred into a 96-well plate before measuring the signal intensity at 415 nm. Hemolytic activity was calculated using the equation: % hemolysis = (S/P) x 100, where S represents the absorbance of the samples and P represents the absorbance of the positive control (0.1% Triton X-100). Hemolytic activity of less than 15% indicates a promising safety profile for human use32.

Molecular docking

The docking study aimed to elucidate the mechanisms underlying the potent antiviral activity of various phytochemicals from the C. mimosoides hydroethanolic extract. Hemagglutinin (HA) and neuraminidase (NA) play crucial roles in the entry and spread of influenza viruses33. Crystal structures of influenza A (H1N1) HA (1RU7; resolution 2.30 Å) and NA (6HP0; resolution: 1.88 Å) were obtained from the Protein Data Bank (PDB). The NA structure is co-crystallized with GJT ((3R,4R,5 S)-4-acetamido-5-[4-(hydroxymethyl)triazol-1-yl]-3-pentan-3-yloxycyclohexene-1-carboxylic acid; PubChem CID: 139030257), enabling establishment of an accurate RMSD (Root Mean Square Deviation) value using GOLD (Genetic Optimization of Ligand Docking) docking software (v.5.2.2). HA, however, lacks co-crystallized ligands. CB-DOCK2 (https://cadd.labshare.cn/cb-dock2/php/index.php), a cavity-based docking tool, was used to identify an appropriate binding pose, using sialic acid as the natural substrate. Subsequently, Self-docking was performed for both the HA/sialic acid and NA/GJT complexes to confirm that their RMSD values were below 2.0 Å, ensuring a highly reliable docking pose. HA and NA achieved RMSD values of 1.2912 and 1.3602 Å, respectively. For the last viral protein model which is influenza polymerase basic protein 2 (PB2, PDB: 4NCE) with an RMSD of 1.7851, it was also included in the study. The Swiss PDB Viewer (v.4.1.0) was employed to replace missing residues in NA and PB2 prior to docking experiments.

Results and discussion

Total phenolic content (TPC) and radical scavenging activity assays

Evidence indicates a significant link between phenolic substances found in specific plant extracts and their antiviral capabilities, as they found to interact with active sites of viral proteins (e.g., NA, HA, and PB2), forming strong hydrogen bonds and hydrophobic interactions15,29,34,35. Assessment of TPC and radical scavenging activity is a primary concern. TPC value of hydroethanolic plant extract was 337 (GAE)/g DE. As measured by the DPPH assay, its IC50 value of 15.01 µg/mL, highly resembling to the authentic quercetin (IC50 = 12.72 µg/mL) and approximately 4.91 times higher than that of standard gallic acid (IC50 = 3.06 µg/mL) (Table 1). This indicated its potential as a source of phytochemicals, where the chemical details must be unraveled to fully understand the mechanisms underlying its antiviral properties, shedding light on its future uses.

Table 1 Radical scavenging property of various samples.

HPLC analysis of gallic acid in C. mimosoides hydroethanolic extract

Drawing from our previous conclusions, gallic acid is one of the major constituents typically detected in C. mimosoides extracts (e.g., water, methanol, ethanol, ethyl acetate, and hexane)15. It is proposed to be involved in the anti-influenza activities of various plant extracts36. Hence, this compound served as a representative marker for the quality control of the C. mimosoides aqueous ethanolic extract detailed here. As expected, a peak (Rt = 10.048 min) matching the authentic gallic acid (Rt = 10.072 min) was found in the 30% hydroethanolic extract (Fig. 1). Based on comparison with the five-point calibration curve (R² = 0.9976), the yield of gallic acid was found to be 7.12 mg/g DW (dried weight) of plant material, which was lower than that detected in the C. mimosoides aqueous extract (8.77 mg/g DW) (Table 2). Since the gallic acid peak comprises 37.47% of the relative peak area, the remaining substances (approximately 62.53%) need to be explored to gain insight into other polyphenols and compounds present in the extract that may synergistically govern antioxidant and anti-influenza properties.

Fig. 1
figure 1

HPLC analysis of gallic acid as representative marker. (A) 30% hydroethanolic extract. (B) Aqueous extract. (C) standard gallic acid (1 mg/mL). (D) Standard curve of gallic acid.

Table 2 Quantitative analysis of gallic acid in C. mimosoides extracts using HPLC technique.

Metabolic profiling and structural annotation of metabolites using cheminformatics tools

The untargeted metabolomics analysis tentatively identified forty-eight bioactive metabolites in the hydroethanolic extract of C. mimosoides (Table 3). They exhibited reliable matching library scores ranging from 71.4 to 100% (with a mass error of < 20 ppm), though some deviated from these criteria. They were additionally annotated using a series of in-silico tools to detail regarding the correct molecular formula, candidate structure, substructure, and even compound class, achieving a high level of confidence in metabolite annotation (level-2). Upon stepwise examination using the MetFrag and SIRIUS, 35 out of 48 metabolites could be annotated as top-ranked candidates (1st and 2nd level structural reliability) among hundreds of potential structures retrieved from biological databases, gratifying the metabolomics initiative guidelines20,22. According to compound classification by CANOPUS (Class Assignment and Ontology Prediction Using Mass Spectrometry), they were simple phenolics, flavonoids with glycoside derivatives, carboxylic acids, terpenoids, and alkaloids, highlighting the remarkable structural diversity of the newly prepared extract and strengthening our supposition that C. mimosoides could be a promising source of natural antiviral agents, particularly shikimate- and phenylpropanoid-derived natural products, similar to C. minax and C. latisiliqua16. Notably, the correctness of structural annotation of all detected flavonoids (e.g., quercetin, quercitrin, and taxifolin) is likely confirmed by the presence of p-coumaric acid (m/z 163.0411), which is known to be subsequently metabolized by 4-coumarate CoA-ligase (4CL), chalcone synthase (CHS), and chalcone isomerase (CHI) to produce naringenin, a key branching point in the biosynthesis of flavonoids widely distributed in the plant kingdom28. Meanwhile, the detection of negative ion of gallic acid (m/z 169.0178) and its related metabolites, including ethyl gallate (m/z 197.0522), pyrogallol (m/z 125.0270), shikimic acid (m/z 173.0485), quinic acid (m/z 191.0651), ellagic acid (m/z 300.9999), and chlorogenic acid (m/z 353.0910) as top candidates (1st ), supports a previous study showing that they originated from the shikimic route37, thus validating their reliability for structural annotation. The putative pyrogallol (Rt = 2.16 min), with a lower relative peak area (9.16%), was identified as co-eluting with gallic acid (29.64%: Rt = 2.16 min), suggesting they share similar physicochemical features and biosynthetic pathways. Although untargeted metabolomics on phenolic-rich extracts are often more effective in negative ion mode38, the use of positive ion mode confirmed various flavonoids, including myricetin, guaijaverin, (-)-gallocatechin, quinic acid, quercetin, luteolin 7-glucuronide, and afzelin, which formerly detected in the negative ionization. Notably, astragalin and luteolin 7-glucuronide, tentatively characterized by m/z 449.1102 and m/z 463.0890 respectively, could also be detected exclusively. An increase in the ethanol concentration to 30% v/v in the extract solvent enabled the detection of putative germacrone (m/z 463.0890), because of its hydrophobic properties. Unfortunately, the remaining 13 metabolites had to be classified as ‘improper structures; N.D.’ because neither MetFrag nor SIRIUS could annotate them as previously explained by the MS/MS-libraries, resulting in low-confidence identification. Potential factors contributing to these inconsistencies likely include inadequate collision-induced energies, significant mass errors (> 20 ppm), and misidentification due to deficiencies within the spectral libraries26. The examples of metabolites successfully annotated by SIRIUS, in tandem with the connected tools, classified by compound classification, are as follows:

Ellagic acid

Of the ten proposed formulas, high-resolution isotope pattern analysis confirmed that the query metabolite at m/z 300.9999 has the formula C14H6O8 (Fig. 2), conforming with the entry in the database (https://pubchem.ncbi.nlm.nih.gov/compound/Ellagic-Acid) By comparing against biological databases, CSI: FingerID subsequently identified this natural substance as ellagic acid (1st ), presumably due to the pre-existing in the training dataset. Numerous substructures illustrating the successive dimerization of gallic acids followed by cyclization were also detected in the most potential candidate, such as “[#6]c1c([#8])cccc1 (Cc1c(O)ccc1) (gallic acid)”, “[#8]=,:[#6]-,[#6]-,:[#6]-,[#6]-,:#6-,:[#6] (O = C-C-C-C-C(O)-C) (another gallic acid)”, “[#8]=,:[#6]-,:[#6]-,:[#6]-,:[#6]-,:[#6]=,:[#8] (O = C-C-C-C-C = O) (cyclized region)”, and “c(:c: c:o): c:1 (ring closure unit)”. Of equal importance, substructures that indicated the successive formation of ellagic acid, described by “c(:c: c:c(:c1)c(:c(:c: c:c2)~[!#1]): c2~[!#1]): c1” and “c(:c(:c(:c: c1):o: c:c:2):c2):c:1 (coumarin-like structure)” were also detected. This query subject exhibited a mass fragmentation pattern highly similar to that of authentic ellagic acid established by Yan et al.39, characterized by the product ions with m/z values of 185.026, 229.0157, 257.0106, and 283.998. They were suggested to the same substance. CANOPUS also affirmed that gallotannins, phenol acids (C6-C1), and shikimates & phenylpropanoids were the class, superclass, and biosynthetic origin of this natural product, respectively.

Fig. 2
figure 2

Structural annotation of putative ellagic acid.

Chlorogenic acid

Among the one hundred potential metabolites retrieved from Biodatabases, the negative precursor ion at m/z 353.091 was best annotated as chlorogenic acid (1st), owing to its pre-inclusion in the CSI: FingerID structure training set (https://www.csi-fingerid.uni-jena.de/v2.6/api/fingerid/trainingstructures? predictor=2). Numerous fingerprints illustrating the successive ester bond formation between quinic acid and caffeic acid to yield chlorogenic acid were found in the query subject. For instance, substructures encoded by “C(CC(CC-1~[#8])~[#8])(C-1~[#8]~[#8]; (100%)” and “[#8]c1c([#8])cccc1 (Oc1c(O)cccc1); (98%)”, signifying the presence of quinic acid and caffeic acid, respectively. Meanwhile, the ester bonds (R–COO–R′) formed by the two phenolic substances, ‘ECFP6: 1639092370 (100%)’ and ‘ECFP6:-169346356 (100%)’, were detected in the same structural candidate (Fig. 3). The fragment ions at m/z 191.0585 (base peak) and m/z 179.0358 emerged in the MS/MS spectrum indicate the consecutive loss of quinic acid from the caffeic moiety, conforming to its authentic structure in PubChem (https://pubchem.ncbi.nlm.nih.gov/compound/Chlorogenic-Acid#section = Computed-Properties). Compound classifications show that cinnamic acids/derivatives, phenylpropanoids (C6-C3), and shikimates/phenylpropanoids are potential biosynthetic routes for this query metabolite, supporting the obtained results (Fig. 3).

Fig. 3
figure 3

Structural annotation of putative chlorogenic acid.

Taxifolin

Structural annotation correctness also improved for the negative parent ion at m/z 303.0502, which ranked taxifolin as the best match (1st) among hundreds of structures from Biodatabases (Fig. 4). Molecular fingerprint detection clearly showed an existing of basic flavonol ring system, characterized by C6(ring A)-C3(ring C)–C6(ring B), in this query subject. Substructures encode by “C(C(Coc:1:c: c(:c: c:2)~[#8])~[#8])c:1:2” (ring A), “CCC(= O)c1ccccc1” (ring B), “ECFP6: 1580559118” (relationship between rings C and B), and “[#6] ~ C(c(:c: c:c(:c1)~[!#1]): c1)~[!#1] (ring B)” illustrate the atomic correlation between rings A, C, and B within the taxifolin structure. The MS/MS spectrum closely matches those in PubChem https://pubchem.ncbi.nlm.nih.gov/image/ms.cgi? peaks=273.041:999,125.024:583,255.03:284,301.036:238,227.036:212), especially the base peak at m/z 125.0252, indicating a common fragmentation pathway. Dihydroflavonols, flavonoids, and shikimate/phenylpropanoid pathways were predicted as its natural product classes, supporting the biosynthetic origins of this natural product.

Fig. 4
figure 4

Structural annotation of putative Taxifolin.

Luteolin 7-glucuronide

A similar predicted pattern was observed with the positive precursor ion (m/z 463.0890), which was identified as luteolin 7-glucuronide (1st) by both SIRIUS and the MetFrag. In this instance, fingerprints indicating the attachment of the glucuronide moiety to the aglycone core structure such as “CC(O)C(O)C(O)C(O)CO (glucuronide moiety)”, “OCC(= O)O (glucuronide)”, “[#6] ~ O[CH1]([CH1]([CH1]([CH1]([CH391]1[CH0].

(~[OH1D1])~[OH1D1]) ~ O) ~ O)[OH0]1 (glucuronide)”, and “c(:c(:c: c(:c:1)~[#8])~[#8]): c:1~[#] (ring A attached to glucuronide unit)” were clearly detected (Fig. 5). The emergence of the fragment ion with m/z 287.0578 (base peak) signifies the complete loss of the glucuronide from the luteolin moiety. This is highly similar to the MS/MS spectrum of authentic luteolin 7-glucuronide from PubChem (https://pubchem.ncbi.nlm.nih.gov/compound/Luteolin-7-glucuronide#section=Spectral-Information), suggesting they were the same substance.

Fig. 5
figure 5

Structural annotation of putative luteolin 7-glucuronide.

Germacrone

Germacrone (m/z 219.1393) was also supposed to occur in the C. mimosoides aqueous ethanolic extract. Despite a greater mass error deviation (−162. 43 ppm) and not achieving a top rank in MetFrag (7th ; KEGG), narrowing the search to appropriate databases increases the reliability of predicting this natural product. According to the CSI: FingerID, the cyclic sesquiterpene (C15H22O; SIRIUS score 100%) was the top-ranked candidate in the ZINC, MeSH, and PubMed databases, possibly due to its existence in the structural training set (https://www.csi-fingerid.uni-jena.de/.

v3.0/api/fingerid/trainingstructures? predictor = 1). Molecular fingerprints illustrating the successive cyclization of three C5 isoprene units to form a C15 sesquiterpene were deduced with high posterior probability scores, such as ‘CCC = C(C)C (83%)’, ‘CC(= O)C (79%)’, and ‘[#6]=#6~[#6] (90%)’. Sesquiterpenoids and terpenoids are predicted to be the principal natural product classes, thereby supporting the biosynthetic origins of this query subject (Fig. 6).

Fig. 6
figure 6

Structural annotation of putative mass peak germacrone that is resembled to those simulated in database (https://foodb.ca/spectra/ms_ms/74446#spectrum).

Table 3 Molecular formula and rank annotation of bioactive metabolites present in the hydroethanolic extract of C. mimosoides using UPLC-ESI(±)-QTOF-MS/MS coupled with Sirus (v.5.8.6) and MetFrag webservice.

Anti-viral and hemolytic activities of the plant extracts

Cytotoxicity assays against MDCK cells were conducted, revealing a cytotoxicity concentration (CC50) of greater than 50 µg/mL and identifying a range of extract concentrations found to be non-toxic. After diluting within the non-toxic range, the extracts were immediately pre-mixed with the virus before being added to MDCK cells for infection. The mixture was incubated with the cells for 24 h, allowing for the antiviral assessment throughout the viral life cycle. The C. mimosoides hydroethanolic extract demonstrated an IC50 of 2.33 µg/mL, showing approximately 3.35-fold and 1.96-fold higher potency compared to the controls—the aqueous extract and gallic acid, respectively (Fig. 7; Table 4). This result might indicate that the increase in gallic acid content in the plant extracts did not correlate with enhanced antiviral activity, and that other phytochemicals may provide a synergistic effect in this phenomenon. Since the plant extract was present from the attachment through the replication and budding stages, its inhibitory effect may arise at various stages of the viral life cycle. The significant difference between IC50 and CC50 values also demonstrates the extract’s effectiveness without detrimental effects on MDCK cells. It remarkably did not show hemolytic activity towards hRBCs, even at the highest concentration tested (2000 µg/mL). Therefore, the hydroethanolic plant extract could be a safe and effective antiviral solution (Table 4).

Fig. 7
figure 7

Antiviral activity of the C. mimosoides extract. Lower IC50 values indicate greater antiviral potency. CC50 is defined as the concentration needed to reduce MDCK cell viability by 50%. The extract’s IC50 is much lower than its CC50, demonstrating effective antiviral action without harming host cells. Triton-X (0.1% v/v) and hydroethanolic extract of Oroxylum indicum were positive controls in this study.

Table 4 Comparison of anti-influenza and hemolytic activities of C. mimosoides.

Target specificity of chosen metabolites against the H1N1 proteins

Molecular docking was then carried out to explain mechanisms of actions underlying anti influenza activity of the C. mimosoides aqueous ethanolic extract. Since the plant extract was exposed to multiple stages of the viral life cycle (e.g., attachment, entry, replication, and release), HA, NA, and PB2 have been chosen as promising targets in this study. Meanwhile, many flavonoids (e.g., quercetin, isorhamnetin, rutin, and quercetin 7-O-glucoside) have been demonstrated to show multiple inhibitory effects with the aforementioned proteins35,40,41. While specific inhibition of other subunits of influenza RNA polymerase (e.g., PA, NP) remains unclear, Gansukh and colleagues demonstrated that quercetin derivatives strongly interact with PB2, resulting in decreased expression of genes encoding M2 and NP, suggesting that PB2 is a probable target in addition to NA42. This finding appears to correlate with in silico studies showing that quercetin interacts with NA at a higher binding energy (−6.8 kcal/mol) compared to its interaction with the M2 protein (−5.48 kcal/mol)29. Hence, it may be possible to consider M2 as an unlikely target based on our findings. Molecular docking studies revealed that all docked metabolites preferentially interacted with NA and PB2, as indicated by the minimal differences in fitness scores (GOLD score). A high fitness score predicts the ligand binding position with strong binding affinity between two molecules, calculated based on chemical binding energies, including hydrogen bonding, van der Waals forces, metal interactions, and ligand torsional strain43. Comparing the three chosen target-specific proteins, the fitness scores (Gold scores) calculated via the GOLD program for quercetin interacting with NA and PB2 (57.29 and 59.86, respectively) are higher than that for HA (47.66) (Table 5). This result may suggest that the HA-ligand complex has a lower binding affinity compared to the interactions of NA and PB2 with the same metabolite. Therefore, HA might serve as an off-target inhibitor in our case. Notably, since gallic acid, catechin, quercetin, chlorogenic acid, their glycoside derivatives (e.g., quercetin-7-O-glucoside), and polyphenol-rich extracts (PRE) can reduce HA mRNA transcription and NP protein synthesis in a dose-dependent manner in IAV-infected cells8,40,41,44, this inhibitory effect may potentially arise from interference with the influenza RNA polymerase complex—composed of PA, PB1, and PB2—rather than from direct inhibition of HA and NP proteins, due to its regulatory role in controlling the transcription of the machinery proteins29. While specific inhibition of shikimate- and phenylpropanoid-derived natural products against various subunits of the RNA-dependent RNA polymerase (RdRp) of influenza A remains limited, recent in silico studies indicate that quercetin derivatives exhibit high binding activity at the cap-binding site of the PB2 subunit, comparable to favipiravir and mGTP42.All the above considerations lead us to suggest that NA and PB2 are more likely targets of the identified metabolites found in the C. mimosoides hydroethanolic extract than HA and even M2 coding proteins. Although shikimic acid, pyrogallol, and quinic acid—among the top five most abundant phenolics in the hydroethanolic extract—showed limited interaction with the two target proteins (compared with gallic acid), they may still play a protective role by neutralizing oxygen radicals produced during viral infection due to their antioxidant activities45.

Table 5 Target specificity of query metabolites towards HA, NA, and PB2-subunit of influenza A/PR/8/34 virus.

Molecular docking against neuraminidase (NA)

As outlined in recent literature, gallic acid and its two alkylated derivatives, methyl gallate and ethyl gallate, which were isolated from the aqueous extract of Radix paeoniae Alba, show potential as neuraminidase inhibitors although they are slightly less potent than oseltamivir acid36. The binding affinity of gallic acid and its methylated analogs will determine the inhibitory effects of other phytochemical constituents. Any compounds with a predicted fitness score higher or slightly different from 46.75 will be considered to have biological activity. They should also specifically interact with the catalytic residues by forming conventional hydrogen and/or electrostatic bonds, with bond distances not exceeding 3.00 and 5.00 Å, respectively46. Meeting these criteria, 26 out of 35 docked metabolites were suggested to have anti-influenza activities, and their details are provided below.

Simple phenolic and flavonoid derivatives

Prior research has demonstrated the potent anti-influenza activity of shikimate and phenylpropanoid-derived natural products, which can bind strongly to the active site of neuraminidase (NA) and inhibit viral growth cycles more effectively than conventional neuraminidase inhibitors, such as oseltamivir and zanamivir32,47,48. In our study, the phenolics found in the hydroethanolic extract of C. mimosoides, specifically flavonoids and glycosylated derivatives, displayed promising binding affinities towards NA structure, comparable to the referent drug, oseltamivir-1-carboxylic acid, and they share the same binding region. Guaijaverin, the most potent phenolic, interacted with ten critical residues including Arg118, Glu119, Ile149, Lys150, Arg152, Arg278, Arg293, and Arg368 in the inner shell of NA structure through H-bond (1.69–2.93 Å), π-lone pair, and π-cation interactions with Thr438 (2.69 Å), Ile149 (2.57 Å), and Arg156 (3.15 Å), respectively (Fig. 8). Additionally, quercetin 7-O-glucoside exhibited comparable binding characteristics to guaijaverin by forming four H-bonds with Asp151, Glu277, Arg293, and Arg368, and engaging in π-cation, π-alkyl, and π-donor H-bonds with Arg118, Lys150, and Tyr402, playing a crucial role in binding with sialic acid and oseltamivir49. Two glycosylated kaempferol derivatives, astragalin and afzelin, displayed even more potent binding affinities than the two positive referent ligands, oseltamivir carboxylic acid. Astragalin interacted with six critical residues (Arg118, Ile149, Asp151, and Arg152) using its glucose moiety and C ring system (2.14–2.91 Å) and formed various hydrophobic interactions with Lys150 (3.10 Å), Glu278 (4.56–4.78 Å), Arg293 (4.02–4.36 Å), and Tyr402 (5.96 Å), respectively. Afzelin interacted with the highly conserved residues (Glu119, Trp179, Ser180, Lys150, Arg293, and Arg368) through ten H-bonds and formed π-cation, π-π-T-shaped, and π-alkyl interactions through its flavonol basic ring system. Since gallic acid, quercetin, rhamnetin, and isorhamnetin have also been found to possess anti-influenza properties by exhibiting significant binding capabilities to NA from H1N1 (A/PR/8/34), similar to the drug zanamivir36,42, the same inhibitory effects of other flavonoids, including myricetin, myricitrin, dihydromyricetin, quercitrin, scutellarin, and luteolin 7-glucuronide, are highly promising. Upon closer examination, they exhibited strong interactions against the key amino acid residues, resulting in remarkable binding values ranging from 61.85 to 72.59. For instance, luteolin 7-glucuronide forms five hydrogen bonds with active amino acids, including Asp151, Arg152, Trp179, Arg293, and Arg368, utilizing its free carboxyl and hydroxyl groups. Its aromatic core structure also engaged in numerous hydrophobic interactions with Arg118, Ile149, Asp151, and Tyr402. In light of our observations, glycosylated flavonoids generally exhibit stronger binding properties than their parent compounds in biosynthetic pathways. For instance, myricitrin demonstrated superior binding compared to myricitrin and dihydromyricetin presumably due to the existence of a rhamnopyranosyl moiety that enables the creation of three exclusive H-bonds with Arg368 and Arg293, not existing in its parent structures. Mothlhatlego et al.50 demonstrated that myricetin 3-O-rhamnoside, isolated from Newtonia buchananii, inhibits various stages of the influenza A life cycle, including attachment and entry, suggesting NA as a promising antiviral target here. It could also be reinforced by a recent study indicating the synergistic effects of multiple flavonoids—myricetin, quercetin, quercitrin, amentoflavone, and afzelin—found in Thuja orientalis folium extract, leading to substantial inhibition of influenza A virus (A/PR/8/34) attachment and entry, mediated by the inhibition of NA activity46. Similar reasons may be relevant for (-)-catechin and gallocatechin, where the latter created nine H-bonds with the putative catalytic inner residues, a higher number compared to (-)-catechin, which formed five H-bonds in the expected region. We also observed their proximity to oseltamivir acid binding site, suggesting their potential to inhibit the enzyme activity (Fig. 9). The article reviewed by Ide et al.51,52 also shows that various catechins (e.g., EGCG, epicatechin gallate, and catechin-5-gallate) exhibit anti-influenza virus effects by interacting differently with the neuraminidase structure, as proposed in molecular docking studies. Our findings imply that the binding efficiency of various flavonoids found in the C. mimosoides hydroethanolic extract depends on the number and strength of interactions with the target region, thus highlighting polyphenol-rich extracts as natural anti-neuraminidase agents53.

Fig. 8
figure 8

Ligand – receptor interaction of chosen polyphenols with the catalytic residues of NA structure.

Fig. 9
figure 9

Structural orientation of flavonoids and glycoside derivatives within the oseltamivir recognition site in NA structure (PDB: 6HP0).

Concerning simple phenolics (Fig. 10), chlorogenic acid can reveal six H-bonds with the important residues, Thr148, Glu119, Arg152, Trp179, Glu278, and Tyr402, where hydrophobic interactions and C-H bonds were formed by the residues Arg156, Thr148, and Gly147. Ding et al. (2017) provided clear evidence supporting our assumption, showing that chlorogenic acid acts as a neuraminidase blocker, effectively inhibiting influenza A virus in both in vitro and in vivo models54. A recent study also indicated that the higher levels of chlorogenic acid in decaffeinated coffee beans, combined with lower caffeine levels, significantly impact their ability to inhibit neuraminidase55. Despite gallic acid, ethyl gallate, and ellagic acid displaying lower fitness score compared to oseltamivir carboxylic acid (the referent ligand), they should still be recognized as bioactive due to their ability to interact with NA catalytic residues, supported by previous in vitro testing models56. In doing this, gallic acid formed six promising H-bonds with Arg118, Glu119, Arg293, and Arg368 (2.02–2.65 Å) using its three hydroxyl groups. Moreover, its π-system localizing in benzene ring facilitated hydrophobic bonds with Arg293 (3.86 Å), Lys150 (3.72 Å), Glu278 (3.66 Å), and Tyr402 (2.46 Å) located around the central ligand. A comparable trend was observed with ethyl gallate, except it likewise formed C-H bonds with Glu227 due to the ethoxy group, gaining a higher binding energy than its chemical ancestor. Ellagic acid generated five H-bonds with active residues, including Arg118, Glu227, Glu228, Arg293, and Arg368. Its dimeric ring system efficiently created numerous π-interactions with Lys150, Arg225, Glu278, and Tyr402 (Fig. 10B). Also, it was found that gallic acid, ethyl gallate, and ellagic acid are oriented near oseltamivir acid, implying they have the potential to inhibit the neuraminidase enzyme (Fig. 10A). Our results resemble to findings that ellagic acid, one of the four major polyphenols present in Punica granatum extract (PPE), individually exhibits anti-influenza activity and shows a synergistic effect when combined with oseltamivir57.

Fig. 10
figure 10

(A) Structural orientation of simple phenolics located in the active site of NA structure. (B) 2D and 3D structures that illustrating their binding potential with the drug recognition site.

Non-polyphenols against NA

Studies have found that phenolic and non-phenolic substances inhibit the viral cycles synergistically, although they might have limited antiviral properties independently. It is, therefore, essential to investigate compound interactions for potential therapeutic implications. Herein, various non-polyphenols, including dicarboxylic acids (e.g., hexa-, tetra-, and undecane-dioic acids), terpenoids (germacrone), and alkaloids (trigonelline) were simultaneously detected in the hydroethanolic C. mimosoides extract. They were all found to have comparable or remarkably higher fitness scores equated with the reference ligands, e.g., gallic acid, quercetin, and rhamnetin) (Table 5; Fig. 11). Considering dicarboxylic acids, with fitness scores range from 51.36 to 64.22, occupy with the highly conserved residues of NA structure through both H-bonding and alkyl interactions, maintaining distances within the proper ranges. Hexadecanedioic acid, for example, formed three H-bonds with Lys150, Arg156, and Arg156 together with three alkyl-alkyl interactions with arginine triad (118, 152, and 225) (Fig. 11B). Similarly, tetradecanedioic acidwhose fitness score is 60.22, reveals three H-bonds (Lys150, Arg156, and Glu278) together with forming alkyl bonds with those lining catalytic inner shells, including Arg118, Leu134, and Arg152. Azelaic acid and undecanedioic acid, the shorter chain fatty acids, forming four H-bonds with Arg152, Arg156, Arg293, Arg368, and Lys150, where they also used the alkyl systems generating alkyl interactions with Lys150 and Arg118, hence receiving the higher binding scores over specific referent ligand (e.g., gallic acid). These results also indicated their binding affinities diminished in the following order: Hexadecanedioic acid > tetradecanedioic acid > undecanedioic acid > azelaic acid, significantly correlated with the number of carbon atoms and interactions between the ligand and target protein (NA). Trigonelline, an alkaloid, could form two C-H bonds with Glu228 while establishing multiple attractive charges with NA inner shell. Germacrone, with binding energy almost identical to gallic acid, formed C-H, π-alkyl, and alkyl interactions with Lys150, Arg118, and Arg152, maintaining appropriate distances (≤ 3.00 Å for H-bonds, ≤ 5.00 Å for hydrophobic interactions) and shared the binding pocket with selected neuraminidase inhibitors, as revealed by binding orientation analysis (Fig. 11A). This evidence is supported by Liao et al.58, showing that germacrone, derived from Rhizoma curcuma essential oil, inhibits the influenza life cycle’s early stages and replication processes.

Fig. 11
figure 11

(A) Non-phenolic substances are predicted to orient within NA catalytic site. (B) Their specific interactions with the drug/sialic acid recognition pocket.

Molecular docking of against cap-binding domain (PB2)

A significant number of phenylpropanoid-derived natural products have been identified as effective inhibitors of influenza virus replication, in addition to their role as neuraminidase blockers59. By mimicking the binding properties of mGTP and Favipiravir-RTP (reference ligands), they achieve this by forming various hydrophobic interactions (e.g., alkyl, π-π T-shaped, π-π stacked) with highly conserved amino acid residues (such as Phe323, His357, Phe363, and Phe404) within the cap-binding domain, thereby disrupting the viral cycle60. Besides exhibiting the probability to penetrate the viral envelope61, characterized by logP values ranging from 0.21 to 1.8 (Table 3), the target specificity assay revealed that a variety of simple phenolics and flavonoids exhibit strong interaction with PB2 subunit, with comparable scores to those observed with neuraminidase (Table 5). You et al.8 reported that gallic acid from Toona sinensis leaf extract also showed polypharmacological effects, inhibiting influenza virus replication with IC50 and CC50 values of 18.14 and > 100 µg/mL, respectively. Therefore, compounds scoring close to or above 42.71 are likely active and expected to interact with the cap-binding domain. This involves forming a range of H-bonds and hydrophobic interactions, with bond distances not exceeding 3.00 Å and 5.00 Å, respectively46. Based on these criteria, 27 metabolites were selected to explain the potential mechanisms underlying anti-influenza activity, as detailed below.

Simple phenolic compounds against cap-binding protein (PB2)

Twenty polyphenols were identified as capable of interacting with the crucial hydrophobic residues within the cap-binding domain of PB2 subunit, displaying a binding orientation highly resembling those of mGTP, favipiravir-RTP, and quercetin (Fig. 12). They importantly engaged in π-stacking interactions with Phe363 and Phe404, crucial for cap-binding by influenza RNA polymerases53, and could interact with nearby aromatic amino acids, thus resulting in enhanced complex stabilization. To begin with, consider the simple phenolics (gallic acid, ethyl gallate, p-coumaric acid, ellagic acid, and chlorogenic acid) with binding scores ranging from 42.26 to 63.87 (Table 5). Gallic acid, the second most abundant phenolic detected in hydroethanolic extract, created three π-π stacked interactions with Phe323 (5.32 Å), His357 (4.02 Å), and Phe404 (3.70 Å) while forming strong H-bonds with Glu361, Lys376, and Phe404 (1.95–2.16 Å) (Fig. 12B). Notably, a significant number of hydrophobic interactions are observed in its more lipophilic derivatives—ethyl gallate and ellagic acid, with logP values of 1.21 and 0.79, respectively. The former exhibited multiple types of π-interactions with His357 (3.49–5.04 Å), Phe323 (2.66–4.34 Å), and Phe363 (5.47 Å) while establishing multiple H-bonds with Ser337 (2.86 Å), Glu361 (1.83 Å), Lys376 (2.73 Å), and Phe404 (2.37 Å), effectively anchoring in the sialic acid/drug binding pose. These results align with previous studies, showing that extracts from Radix paeoniae Alba and Punica granatum, rich in gallic acid and its alkyl derivatives, inhibit influenza A virus replication in MDCK cells36. The latter, a dimeric form of gallic acid, generated thirteen crucial hydrophobic interactions with Phe323, Phe325, Phe357, Phe363, and Phe404, due to the increased number of π-systems, along with three hydrogen bonds with Ser337 (3.05 Å) and Glu361 (2.05 and 2.08 Å).This elaborated the existing finding that Aronia melanocarpa extract, rich in ellagic acid and myricetin, shows antiviral activity against influenza viruses and protects mice from lethal influenza challenges10. 4-Coumaric acid capable of interacting with His357 and Phe404 via two π-π-stacked interactions (4.0 and 3.73 Å), coexisting with two rigid H-bonds with Phe404 and Arg332 (1.56–2.64 A). As found in the literature, 4-coumaric acid, a key component of Brazilian propolis (AF-08), demonstrated anti-influenza activity (A/PR/8/34(H1N1), with IC50 and CC50 values of 31.5 ± 1.3 µg/mL and > 100 µg/mL, respectively, highlighting its potential antiviral properties56. Although chlorogenic acid is known as a neuraminidase blocker, its multiple effects on the cap-binding domain should be explored due to its fitness score (59.60) and moderate hydrophobicity. This compound established three promising π-π stacked interactions with His357, Phe404, and Phe323 (3.78–5.74 Å), along with multiple H-bonds with Arg332, Ser337, Lys339, His367, and Phe404 (1.59–2.93 Å), resembling the molecular recognition and interaction of mGTP and favipiravir-RTP (Fig. 12A). While direct evidence to support our supposition remains unclear, a computational study conducted by Luo et al.62 showed that chlorogenic acid, one of the major phenolic substances in Lonicera japonica, exerts a synergistic effect against both NA and PB2, which are crucial for the influenza life cycle, shedding light on the multiple effects of natural products reported herein.

Fig. 12
figure 12

(A) Structural orientation of phenolic substances and the two positive ligands —mGTP and favipiravir-RTP—in the cap-binding domain of PB2 subunit of influenza RNA polymerase. (B) Their specific interactions with the hydrophobic pocket.

Flavonoids against cap-binding protein (PB2)

Quercetin and its glycoside derivatives, such as quercetin 3’-glucuronide and quercetin-7-O-glucoside, effectively block the RNA polymerase (PB2) of influenza viruses by precisely occupying the m7GTP binding site, resulting in the disruption of the viral life cycle41,42. Consistently, all putative flavonoids detected in the C. mimosoides extract exhibited a similar orientation pattern to those positive ligands (e.g., quercetin, mGTP and favipiravir-RTP) with binding scores ranging from 59.86 to 73.87 (Table 5; Fig. 13). Among them, two flavone glycosides—scutellarin and luteolin 7-glucuronide—demonstrated the highest binding affinities, scoring 73.69 and 73.87 in fitness scores, respectively. By utilizing the ring systems A and C, the former can generate H-bonds (2.38–2.5 Å) and hydrophobic interactions with Ser321, Phe323, His357, Phe363, Lys376, and Phe404. Meanwhile, its glucuronide moiety formed three H-bonds with Arg355 and Asn429 (1.70–2.99 Å) (Fig. 14). Likewise, luteolin 7-glucuronide, with logP value of 1.55, formed multiple robust hydrophobic interactions with the target residues, where its glucosiduronic acid moiety revealed three H-bonds with Arg355 (2.43 Å) and Asn429 (1.64–2.91 Å). In light of our results, glycosylated flavonoids always exhibited greater binding energies over their aglycone scaffolds. For instance, the fitness scores of quercetin and its three derivatives decreased in order of substitution degree: Quercitrin (quercetin 3-rhamnoside: Q3R) > Guaijaverin (quercetin-3-O-arabinoside) > Taxifolin (dihydroquercetin) > Quercetin. Notably, quercitrin exhibited energy levels comparable to the reference ligand, quercetin-7-O-glucoside, which is known to directly inhibit viral RNA polymerase by occupying the m7GTP binding site [35]. Q3R formed five strong H-bonds (1.69–2.92 Å) and engaged in six promising π-interactions with Phe404, Phe323, Phe363, and His357, suggesting it is well positioned in the target region. Following this pattern, myricitrin (myricetin 3-O-rhamnoside) exhibited superior binding property compared to its analogs—myricetin and dihydromyricetin—through seven hydrophobic contacts, alongside two H-bonds (1.74 to 2.28 Å). The rhamnoside moiety also enhanced stability by forming three H-bonds (1.98 to 2.97 Å) and three alkyl-bonds with Lys339, Arg355, Phe325, and Ser324 (3.98 to 4.88 Å) in the expected region. While the selective inhibition of myricitrin remains ambiguous, Sang et al.60 demonstrated that its molecular scaffold—myricetin—significantly disrupts influenza A virus replication by occupying at PB2 subunit. Consequently, flavonoids that share the same biosynthetic origin, characterized by (C6-C3-C6) skeleton, including myricetin, dihydromyricetin, and even two glycosidic kaempferol derivatives (astragalin and afzelin) apparently active due to their more significant number of H-bonds and hydrophobic interactions with the mGTP recognition residues. Based on the basic flavonoid structure, catechin and gallocatechin are likewise predicted to occupy PB2 subunit of influenza RNA polymerase, forming various hydrophobic interactions and hydrogen bonds with the key residues Phe323, His357, and Phe404 (Fig. 14). Despite further elucidation is needed, previous studies have found that plant extracts containing various catechins (e.g., (+)-catechin, ECGC, EGC, ECG, EC, and GCG) exert antiviral activity against flu viruses grown in MDCK cells by disrupting RNA synthesis in addition to the inhibitory effect on NA and HA proteins63.

Fig. 13
figure 13

Structural orientation of flavonoids and other glycoside analogues with the two positive ligands —mGTP and favipiravir-RTP—in the cap-binding domain of PB2 subunit of influenza RNA polymerase.

Fig. 14
figure 14

Ligand-receptor interactions of polyphenols with the highly conserved hydrophobic residues of PB2 subunit. Their interactions with the hydrophobic pocket.

Non-phenolic against PB2 (4NCE)

Germacrone, a major component of the essential oil in Rhizoma curcuma, not only inhibits the early stages (e.g., entry and spreading) of the influenza life cycle but also disrupts the replication processes58. Here, we found that the putative germacrone (m/z 219.1393) interacted with the cap-binding site with energy levels similar to gallic acid (Fig. 15B). This cyclic sesquiterpene formed several strong intermolecular interactions with highly conserved residues including Phe323, Phe325, Arg355, His357, Phe363, Phe404, and Met431 (3.56 to 5.32 Å), suggesting its potential polypharmacological effects in addition to neuraminidase (Fig. 15B). Therefore, the inhibitory effects of dicarboxylic acids were postulated to be active due to the strong binding affinity (Table 5) and integral lipophilicity, with LogP values ranging from 1.44 to 2.98 (Table 3). Tetradecanedioic acid, for instance, established four promising H-bonds with Glu341 (1.94 Å), Lys353 (3.04 Å), Ile354 (2.78 Å), and Glu361 (2.02 Å), along with several hydrophobic interactions with the key residues (Phe404, His357, Pro430, Arg355, and Met351), highly similar to mGTP and favipiravir-RTP (Fig. 15A). Given the lack of evidence regarding the anti-influenza effects of longer-chain dicarboxylic acids, using alternative strategies, such as MD simulation, is crucial to prospectively assess stability in various complex structures.

Fig. 15
figure 15

(A) Non-phenolics substances predicted to structurally orientate in the cap-binding region of influenza A RNA polymerase (PB2). (B) Their interactions with the hydrophobic pocket.

Conclusion

For the first time, we demonstrated that increasing the ethanol concentration to 30% (v/v) in the extraction solvent led to a substantial increase in the phytochemical profiling, antioxidant, and anti-influenza activities of the C. mimosoides hydroethanolic extract. Gallic acid could be a representative marker based on RP-HPLC analysis. After being characterized by UPLC-ESI(±)-QTOF-MS/MS and verified using multiple mass spectra annotation tools, simple phenolic acids and flavonoids were found to be predominant in the newly established plant extract. It exhibited potent anti-influenza activity, approximately 3.35 times higher than that of the aqueous extract, where flavonoids were barely detected. The absence of hemolytic activity indicates its potential use as a safe antiviral agent. Docking studies affirmed that gallic acid, ethyl gallate, ellagic acid, and chlorogenic acid, along with all flavonoids, interacted well with both NA and PB2, supporting the boosted antiviral activity. Molecular dynamics simulations and further in vitro assays are needed to underscore its potential uses. Owing to the higher lipophilicity of flavonoids, increasing the ratio of ethanol in the solvent used might lead to enhanced antiviral activity of the C. mimosoides aqueous-ethanolic extract in the long run.