Introduction

Carrots are an excellent source of carotenoids, vitamins, dietary fiber, minerals, and antioxidants, making them a highly nutritious and versatile food1,2. Globally, they are among the ten most widely produced and consumed vegetables3. The global carrot market, in terms of exports, is currently valued at $1.6 billion4, and carrots are a significant source of employment for farmers; in fact, vegetable production provides approximately 300,000 direct and indirect jobs in Colombia5.

With the population increase, global food demand is putting strong pressure on current agricultural systems. Parallelly, food loss and waste also increase in the food supply chain. Researchers estimate that the global percentages of food loss in production, postharvest, and consumption stages are 24, 24, and 35%, respectively6. Carrot production has increased in recent decades, reaching 41.7 million tons worldwide in 20214 . However, approximately 30% of total carrot production is discarded during the postharvest stage, as it does not meet the quality standards required for commercialization, including reduced size, irregular length, inconsistencies in shape, color, and the presence of mechanical damage7. Consequently, there are approximately 12.51 million tons of discarded biomass potentially usable by carrot value chain. In rural areas where carrots are commonly grown, the residual biomass is often used as animal feed or compost material. On the other hand, it is incinerated or disposed of in open fields, negatively impacting the environment by contributing to air pollution and groundwater contamination8,9. Additionally, this leads to economic losses for producers and additional costs associated with disposing of discarded material, potentially discouraging agricultural efforts. Reducing the amount of organic waste generated, in terms of losses and production surpluses, can improve food security without increasing the environmental burden of agricultural practices3.

Colombia’s agricultural vocation makes it an interesting model for exploring waste valorization strategies, especially in the case of carrots, where losses reach around 30% in the country—similar to figures reported globally. In Colombia, carrots are classified into three categories for commercialization purposes. The first category, labeled “extra”, includes carrots free from morphological defects, with regular shape, and without bruises or fissures. The second category, labeled “first,” includes carrots with a good appearance but allows minor defects, such as handling or washing-related injuries. Finally, the “second” category includes carrots with defects such as healed wounds and burn marks10,11. Carrots are discarded and considered residual biomass or surplus production when quality variations arise, such as cracked, deformed, or pathologically damaged. These surpluses have been considered renewable and abundant sources of chemical compounds of interest and have even been used as raw materials in biorefinery processes focused on producing fermentable sugars, carotenoids, and biofuels7,12,13.

The utilization of food waste as a source for recovery of value-added compounds through sustainable technologies and bioprospecting is key to improving food security and having a sustainable environment, as food losses contribute to the release of greenhouse gases, poor air quality, land, and water pollution14. Food waste can serve as a platform for chemicals, nutraceuticals, sugars, biofuels, biogas, and biochar via thermo-chemical conversion, anaerobic digestion, and fermentation processes. Now, food waste may start at the source of origin where the harvest is done, and farmers select which products are suitable for commercial purposes; for this reason, the use of agricultural wastes and rejects allows not only nutritional recovery but also economic gain for the producer3,7. At the same time, plant metabolism is influenced by local changes in temperature, water availability, and atmospheric CO2 concentration15. Then, understanding how the local environment influences the biochemical profile of food waste is crucial to determining its potential in bioprospecting and waste valorization.

The chemical composition of an agro-food matrix, such as carrots, may vary due to factors associated with the growing location and their microclimates. The number, proportions, composition, and quality of chemical compounds in plants are influenced by various environmental factors, including geographic location, solar radiation, precipitation, and average temperature, which in intertropical regions like Colombia are primarily determined by altitude. These factors can directly or indirectly affect the concentration of secondary metabolites, and the biological activity associated with these compounds16,17,18. Several studies have explored food matrices such as mint, mango, or dragon fruit, determining a direct influence of altitude and other climate characteristics on levels of phenolic compounds, flavonoids, alkaloids, amino acids, among others, and their inherent bioactivities16,17,18,19,20. However, few studies have focused on evaluating the chemical variability of organic waste, losses, and production surpluses concerning their locality, especially in residual carrots; therefore, it is necessary to consider the high crop heterogeneity and their chemical variability influenced by locality, along with the issues related to underuse of carrot crop surpluses, such as the decrease in production yields in harvests and the inherent losses for producers7,12, as a possible opportunity to find cultivation areas with differential qualities18,19 and achieve the whole production use.

Based on the above, it is essential to conduct bioprospecting studies to identify molecules of interest in residual carrot biomass, facilitating the determination of new uses and discovering specific markets and production niches. For instance, production surpluses with higher carotenoid content could be used specifically in natural colorant production, bioactive ingredients in pharmaceuticals, and cosmetic compounds12,21. Alternatively, production surpluses from a particular locality with higher phenolic and/or antioxidant content could be directed towards generating functional ingredients for various applications8,22 or for fermentation or biorefinery processes if a higher content of fermentable sugars or polysaccharides is identified7,22; thus adding value to the production chain and enabling the replication of such applications or transformations in different localities under similar climate conditions.

Bioprospecting studies can be conducted through metabolomic techniques, which enable high-resolution phenotypic identification23. Metabolomics is the comprehensive study of low-molecular-weight metabolites in a biological system24. Metabolomics, therefore, provides a broad profile of the metabolic richness of a food matrix25. The recent advances in analytical techniques, such as liquid chromatography coupled with mass spectrometry (LC-MS), allow for the identification of hundreds of metabolites and the evaluation of differences and similarities between samples26. Indeed, lipidomic techniques, well known as a subfield of metabolomics25,27, may be seen as a key tool to decipher the metabolite composition of carrots, having into account there are an essential amount of lipid-based compounds associated with color and antioxidant activity, such as carotenoids21,28; however, there are few lipidomic studies on carrots that focus on evaluating agroecological effects on the metabolome29,30, and none so far related to carrot residues, surpluses, or residual biomass. This study aims to determine the effect of microclimate in the metabolic profiles of four types of carrot biomass—cracked (R), deformed (D), and pathologically damaged (P) and control without damages (C)—collected from three distinct localities in the eastern municipalities of the Department of Antioquia, Colombia: Rionegro, Marinilla, and El Santuario. Each locality presents unique climatic conditions; thus, understanding how these climatic gradients influence the metabolic profiles of residual biomass can provide valuable insights in bioprospecting. Furthermore, this knowledge may help in reducing losses and optimize processes in the food supply chain and facilitates the development of targeted strategies to enhance the re-valorization of agro-industrial waste. Such advancements contribute to shifting from a linear production model to a circular economy approach, promoting sustainability and efficiency9. Finally, the findings of this study could support the replication of bioprospecting initiatives across other regions under similar climatic conditions, enabling the generation of bioactive compounds for diverse applications in pharmaceuticals, nutraceuticals, and agro-industrial processes.

Materials and methods

Plant material sampling

Carrot (Daucus carota L.) samples of the Bangor variety were harvested four months after planting from three different localities in Antioquia (Colombia): El Santuario (6°9’12’’N, 75°16’38,6’’E), Marinilla (6°11’8,5’’N, 75°15’26’’E), and Rionegro (6°7’53,876’’N, 75°24’52,166’’E), collected in November 2023 (Fig. 1A.). One general sampling of production surpluses was performed at each locality, considering four carrot types to know: control (C), carrots with conical shape without secondary roots or branches, free from spots, fissures, or wounds (Fig. 1B.); cracked (R), carrots with superficial damage in the form of fissures (Fig. 1C.); deformed (D), showing evident malformations or bifurcated appearance (Fig. 1D.); and pathologically damaged (P), carrots with superficial deterioration associated with color changes or the appearance of spots caused by pathogen attacks (Fig. 1E.).

Fig. 1
Fig. 1
Full size image

Georeferenced map of the harvesting locations (A) and carrot residual biomass typologies analyzed: Control (B), Cracked (C), Deformed (D), Pathologically damaged (E). Map acquired under the Creative Commons Attribution-Share Alike 3.0 Unported license, source from: https://commons.wikimedia.org/wiki/File:Colombia_-_Antioquia.svg#.

Climatic data collection

The climatic characteristics of the three localities are presented in Table 1. For each georeferenced sampling point, we obtained high-resolution climatic variables from the CHELSA v2.1 platform (https://chelsa-climate.org/), which provides spatially interpolated monthly climate data at ~ 1 km² resolution. Specifically, we extracted data for the period 2009–2018, focusing on the months that matched the experimental sampling. Historical climatic datasets offer a robust baseline for ecological comparisons, particularly relevant here since the agricultural materials have long adapted to local climatic conditions31. An analysis of mean differences for each climatic variable across localities was conducted using ANOVA, followed by Tukey’s post-hoc test to identify significant differences between microclimates. In addition, an altitudinal gradient analysis was performed using the rast and ggplot2 functions (R environment, https://cran.r-project.org/bin/windows/base/) aligned with the climatic data set described above.

Table 1 Climatic characteristics of the evaluated harvested localities.

Sample Preparation

Five kilograms of carrots were harvested and processed for each residual biomass typology at each of the three localities. The carrots were thoroughly washed, disinfected by immersion in a sodium hypochlorite solution (200 ppm for 5 min), and then rinsed to remove the excess of disinfectant. Carrots were peeled, and crowns and tips were removed. The carrots were then chopped and homogenized using an industrial blender (Oster; Milwaukee, WI, USA); until the particle size was less than 1 cm³. From this material, 300 g were lyophilized at -50 °C for 40 h (Biobase; Jinan, China), yielding approximately 30 g. This process was performed in quadruplicate. The lyophilized material was stored under vacuum in multilayer laminated bags (aluminum/polyethylene/low-density polyethylene) at -20 °C until extraction.

A total of 10 mg of each lyophilized sample was mixed with 150 µL of cold acetone and homogenized by vortexing for 30 s (DLab Scientific; Beijing, China); acetone was selected as the extraction solvent aligned with a lipid-oriented untargeted metabolomic profile (lipidomic), due to its ability to solubilize semi-polar and moderately non-polar metabolites, including carotenoids, fatty-acid derivatives, alkaloids, phenolic derivatives, amides, and other lipid-related secondary metabolites known to occur in carrot tissues. The sample was centrifuged at 27,000 xg (16000 rpm) for 10 min at 4 °C (Thermo Fisher Scientific; Waltham, MA, USA). The supernatant was recovered, and another 150 µL of cold acetone was added to the precipitate, repeating twice the process. The three supernatants were combined and transferred to an Eppendorf tube. A 300 µL aliquot of supernatant was mixed with 300 µL of MTBE and vortexed for 5 s; then, 100 µL of water was added and vortexed for 5 s. The mixture was centrifuged at 27,000 xg (16000 rpm) for 10 min at 4 °C. The supernatant was recovered, and another 150 µL of MTBE was added to the precipitate, repeating twice the extraction process. The resulting extract, containing an acetone: MTBE solvent mix, was filtered using a 0.2 μm acrodisc filter.

LC/MS and RP-LCMS/MS-QTOF conditions

Metabolomic analysis was performed using an Agilent Technologies 1260 liquid chromatography system coupled with a Q-TOF 6545 quadrupole time-of-flight mass analyzer with electrospray ionization (Agilent Technologies, Palo Alto, CA, USA). One µL of each metabolite extract was injected into a C18 column (InfinityLab Poroshell 120 EC-C18 (100 × 3.0 mm, 2.7 μm)) at 50 °C, with an elution gradient of ammonium acetate solution (10 mM) dissolved in an H2O mix (90:10) (phase A) and ammonium acetate solution (10 mM) dissolved in an ACN: MeOH mix (20:30:50) (phase B), with a constant flow rate of 0.6 mL/min. Mass spectrometry detection was performed in positive ESI mode in a full scan from 100 to 1700 m/z. During analysis, a mass reference mixture was calibrated: m/z 121.0509 (C5H4N4) and m/z 922.0098 (C18H18O6N3P3F24). The chromatographic elution gradient began with 70% of phase B for 1 min, increased to 86% of phase B over 3.5 min, remaining in these conditions for 6.5 min; then increased to 100% of phase B in 1 min and held for 6 min. Afterward, it returned to the initial conditions in 0.1 min and was equilibrated for 3 min.

ESI-MS/MS analyses were performed in positive ionization mode with a full scan from 100 to 1700 Da. The ESI-QTOF-MS/MS was operated under the following conditions: Dual AJS ESI source, Vcap 3500 V, drying gas 10 L/min, gas temperature 200 °C, nebulizer 50 psi, sheath gas temperature 300 °C, and sheath gas flow rate 12 L/min. For MS-TOF, the conditions were: fragmentation 175 V, skimmer 65 V, and OCT RF Vpp 750 V. For MS/MS conditions, collision energy was set at 40 eV, scan range 100–1700 m/z, acquisition speed of 1 spectrum/s, and acquisition time of 1000 ms/spectrum. The runtime was 20 min. For quality control, a set of samples was measured at the beginning of the run. A QC group was prepared by mixing 150 µL of each sample, injecting two QCs at the beginning of the sequence and, subsequently, every four samples to ensure no analytical variability from the instrument. The samples were randomized using Excel’s random function and injected in that order.

A re-fragmentation process was performed on a series of previously selected masses based on their abundance levels or degree of significance. For this purpose, the samples were analyzed using identical liquid chromatography and mass analyzer system configurations. One µL of extracts was injected into a C18 column (100 × 3.0 mm, 2.7 μm) at 60 °C, with a gradient elution composed of 10 mM ammonium formate in 60:40 CAN plus 0.1% formic acid (Phase A) and 10 mM ammonium formate in IPA 90:10 plus 0.1% formic acid (Phase B), at a constant flow rate of 0.6 mL/min. Mass spectrometry detection was performed in positive ESI mode. For mass correction, two reference masses were used during the entire analysis: m/z 121.0509 (C5H4N4) and m/z 922.0098 (C18H18O6N3P3F24). The directed MS/MS analysis was set to a collision energy of 20 V, and the chromatographic elution gradient started with 15% of Phase B, increasing to 30% over 2 min, then reaching 48% over another 2.5 min, subsequently increasing to 82% over 11 min and then to 99% over 23.5 min. Finally, the gradient decreased to 15% in an initial stage of 12.1 min and a final stage of 18 min. Method adapted from previous proposed methodologies32.

Metabolomic data processing and analysis

The compounds present were analyzed and manually inspected using Agilent MassHunter Profinder software, version 10.0, employing the Recursive Molecular Extraction algorithm with an extraction time of 17 min, 5000 counts, using positive ion species (+ H, +Na, -H2O, NH4, K) and without applying mass or signal filters. This algorithm deconvolutes the chromatogram and integrates the molecular features in the samples and solvent blank based on mass and retention time. The results obtained were subjected to a reproducibility-based filtering process; during this procedure, the area’s coefficient of variation (CV) in the quality control (QC) samples was calculated, and molecular properties with a CV higher than 20% were excluded.

Initial MS data was converted to mzXML format using ProteoWizard (https://proteowizard.sourceforge.io/) and then to abf files using ABF converter (https://www.reifycs.com/AbfConverter/). Data analysis was conducted using MS-DIAL software (http://prime.psc.riken.jp/compms/msdial/main.html) with the following parameters: mass range (0-2000 Da), MS1 alignment tolerance (0.01 Da), MS2 alignment tolerance (0.025 Da), retention time (0–100 min), minimum peak height (1000), sigma (0.5), and MS accurate mass tolerance (0.015 Da). Method adapted from previous proposed methodologies32.

The confirmation of statistically significant molecular features was achieved during the re-fragmentation process by determining the exact mass, the molecular formula, and the mass spectra analysis. The annotation involved a combination of platforms, including MS-DIAL 5.0 using libraries from MoNA - MassBank of North America: MoNA-export-LC-MS/MS_Positive_Mode, MoNA-export-GNPS, and MoNA-export-Vaniya-Fiehn_Natural_Products_Library, as well as SIRIUS 5.8.6 software, and in silico tools like CFM ID (cfmid3.wishartlab.com/). Additionally, the refragmented masses were reviewed one by one using their m/z values and intensities or retention times against complementary databases such as MassBank (https://massbank.eu/MassBank/), LipidMaps (https://lipidmaps.org/), Plant Metabolome Repository (https://metabolites.in/plants/), and CEU Mass Mediator (https://ceumass.eps.uspceu.es/index.xhtml), employing spectral, similarity, peak, and other searches with a threshold of at least 0.80 or tolerances no greater than 10 ppm in peak masses. Each metabolite was reported with an ID confidence level according to the Metabolomics Standards Initiative (MSI) guidelines, where Level 0 indicates an unequivocal 3D structure, Level 1 a reliable 2D structure compared with a reference standard, Level 2 a probable structure corroborated by literature or database evidence, Level 3 a possible structure or class identified by molecular formula, and Level 4 an unidentified feature that indicates presence without specific identification.

Metabolite enrichment analysis

The retrieved annotated metabolite list was subjected into MetaboAnalyst 6.0 (https://www.metaboanalyst.ca/) for an Over Representation Analysis (ORA) and metabolic pathway enrichment analysis. The overall annotation of metabolite features was performed by “Annotated Features” module using “Pathway Enrichment” and “Network Analysis” sub-modules, by the RaMP-DB (metabolite and lipid pathways, integrating KEGG via HMDB, Reactome and WikiPathways) pathway-based metabolite library. Target analysis was done with Arabidopsis thaliana from the Kyoto Encyclopedia of Genes and Genomes (KEGG) database as the target organism33,34,35.

Metabolomic statistical analysis

Abundance peaks from the final identification and metabolites were exported from MS-DIAL and subjected to multidimensional statistical analysis using MetaboAnalyst 6.0 (https://www.metaboanalyst.ca/), by the “Statistical Analysis” module and the “one factor” sub-module. The data were preprocessed before performing multidimensional statistical analysis, no missing values were detected from the original dataset, and there was no data filtering. The different normalization procedures were performed, and all the data scaling, transformation, and normalization were tested in order to find the best model for representing the data and its variability. Therefore, the data were preprocessed through median normalization, log transformation (base 10), and Pareto scaling, followed by univariate and multivariate statistics. Univariate analysis included fold change (FC) analysis (FC threshold value = 2) to identify up and down-regulated metabolite features, and volcano plot (FC threshold value = 2 and p-value threshold = 0.05). Multivariate analysis included principal component analysis (PCA) and orthogonal partial least squares discriminant analysis (OPLS-DA) with a p-value threshold = 0.05; and a hierarchical clustering analysis (HCA), using normalized data and autoscale samples, additionally, having on count a euclidian distance and a ward clustering method. The VIP score obtained in the OPLS-DA model and fold-change facilitated the assessment of the influence of intensity and explanatory ability of each of the metabolites in classifying and discriminating clusters of samples on the basis of biologically significant metabolite features. The model parameters (R²Y and Q²) were calculated automatically as part of the OPLS-DA model evaluation, specifically, R²Y represents the fraction of variation in the response variable explained by the model, while Q² indicates the model’s predictive ability estimated by cross-validation.

Results

Exploratory analysis of microclimatic patterns

The climatic diversity across the three localities revealed distinct microclimatic clusters that may influence carrot metabolic profiles. The three localities—El Santuario, Marinilla, and Rionegro—in Antioquia, Colombia, exhibited unique climatic conditions, as detailed in Table 1. Marinilla, situated at the highest altitude, presented distinctive climatic characteristics, including the highest values for precipitation, wind speed, cloud area fraction, and climate moisture index. Rionegro and El Santuario, located at intermediate to lower altitudes, displayed similar climatic conditions driven by relative humidity, potential evapotranspiration, temperature-related variables, and solar radiation, suggesting moderate and less extreme variation for Rionegro and El Santuario.

Although each locality was localized at different altitudes, some climatic characteristics were shared between localities. Marinilla stood out with its higher wind speeds (2.008 m/s) and precipitation (2929.0 mm annually), emphasizing its wetter and windier climate compared to Rionegro and El Santuario. On the other hand, El Santuario clustered closer to Rionegro, indicating similar profiles with moderate to high temperatures, higher solar radiation, and a more significant vapor pressure deficit (0.647 and 0.643 kPa, respectively), indicative of less cloudy conditions and a warmer and drier microclimate compared to Marinilla. These patterns align with the altitudinal gradient shown in Fig. 2A, where Marinilla is located at a greater altitude (2284.0 m.a.s.l.) compared to El Santuario (2160.0 m.a.s.l.) and Rionegro (2119.0 m.a.s.l.), which highlights the relationship between elevation and climatic variation across El Santuario and Rionegro localities against Marinilla. 15,33,34

Statistical analysis using ANOVA revealed significant differences in key climatic variables across the localities. Tukey’s post-hoc test identified specific pairwise differences (Fig. 2B-G), thus reinforcing the distinctions associated with the altitude. Based on the above, we decided to group the localities into one cluster (C1) composed of Rionegro and El Santuario and a second cluster (C2) composed of Marinilla. The analyses further demonstrate that El Santuario and Rionegro can be grouped together based on their climatic similarities, whereas Marinilla exhibits distinct climatic profiles shaped by a unique combination of environmental factors. Climatic differences likely influence the metabolic profiles of residual carrot biomass samples. Factors such as altitude, temperature, precipitation, and other climatic characteristics, along with region-specific biotic factors (e.g., the influence of the growth locality), may explain variations in chemical composition and bioactive compounds16,19. Consequently, microclimatic diversity underscores the importance of considering local environmental conditions in agricultural and metabolic research.

Fig. 2
Fig. 2
Full size image

Climate analysis associated with the three evaluated localities. Altitudinal gradient plot (A), and significant differences in key climatic variables, ANOVA analysis followed by Tukey’s post-hoc test (B-G); altitude (B), precipitation (C), near-surface wind speed (D), near-surface air temperature (E), surface downwelling shortwave radiation (F), and cloud area fraction (G); different letters above each boxplot indicate statistically significant differences. Map generated from R software (version 4.5.1) by ggplot2 and raster R packages (https://cran.r-project.org/bin/windows/base/).

The results demonstrate a marked differentiation in climatic conditions among the studied locations (Marinilla, Rionegro, and El Santuario). Marinilla, for instance, displays distinct climate characteristics, such as higher wind speed, precipitation, and cloud cover. Some of these climatic variables have been shown to significantly influence the biosynthesis of metabolites in organic waste, aligning with previous findings on the effects of climate on crop quality under abiotic stress36. In this context, clarifying which variables contribute most to climatic differentiation at both univariate and multivariate levels is essential to understanding their effect on the metabolomic profile of residual carrot biomass.

Metabolomic profile of residual carrots from different localities

The metabolomic profiles of different carrot types—cracked (R), deformed (D), pathologically damaged (P), and healthy as a control (C)—from various localities (Marinilla, El Santuario, and Rionegro) were evaluated (Supplementary Dataset S1). PCA analysis (Supplementary Fig. S1) did not reveal a clear separation among the carrot typologies. However, distinct clustering emerged when analyzing the climatic patterns of the three localities, which were analyzed independently through an exploratory multivariate analysis (PCA), revealing that Marinilla consistently separated from the other two localities, Rionegro and El Santuario (Supplementary Fig. S2).

A total of 90 metabolites were identified with level 2 and 3 identification confidence level (Supplementary Dataset S2); no metabolite was confirmed using verified reference standards, therefore the results should be interpreted cautiously. In addition, the chemical classes identified (carotenoids, fatty-acid derivatives, alkaloids, flavonoids, terpenoids, and other semi-polar compounds) are consistent with the solvent system extraction profile, indicating that it is appropriate for the targeted metabolomic analysis of residual carrot biomass. Data of abundance were analyzed through hierarchical cluster analysis (HCA) and orthogonal partial least squares discriminant analysis (OPLS-DA). Data of abundance from all experimental and quality control samples were subjected to OPLS-DA; the model was established by comparing Rionegro and El Santuario, designated as cluster 1 (C1), against Marinilla, designated as cluster 2 (C2). The model evaluation parameters were R²Y with a value of 0.556 and Q² with 0.453, and a permutation test (n = 100) was conducted to assess the robustness and statistical significance of the OPLS-DA model (Suplementary Fig. S3).

Fig. 3
Fig. 3
Full size image

OPLS-DA score plot (A), VIP score plot (B), and heatmap plot (C) between the Marinilla locality (C2) concerning Rionegro and El Santuario (C1).

The OPLS-DA score plot (Fig. 3A) shows a clear separation with a 95% confidence level between Marinilla carrot samples and those from Rionegro and El Santuario, indicating metabolic differences between the clusters. Marinilla samples are more tightly clustered, suggesting greater homogeneity regarding chemical richness or metabolomic characteristics. In contrast, samples from Rionegro and El Santuario show a greater dispersion, indicating a higher degree of metabolomic variability.

Additionally, Variable Importance in the Projection (VIP) values were generated to evaluate the overall metabolite impact on cluster differentiation. Metabolites with VIP values > 1 are the most influential for the model and were used to measure the strength and explanatory capacity of each metabolite’s expression pattern in classifying and distinguishing between Rionegro and El Santuario against Marinilla. Figure 3B shows the top twenty VIP scores (VIP score > 1.0), with a p-value < 0.05. The metabolites with the highest VIP scores were Nuciferine, Feruloyltyramine, and Microcystin LW with VIP scores > 2.5, while Cryptotanshinone, N-Hexadecanoylpyrrolidine, and 4’-Methoxyflavonol with VIP score between 2.0 and 2.5, contributed most significantly to the differentiation between the clusters.

To more comprehensively, generally, and intuitively evaluate relationships between samples and differences in metabolite expression patterns in different clusters, data on relative metabolite abundances were analyzed through hierarchical clustering using a heatmap. This process allowed for more accurate identification of marker metabolites and, thus, a better understanding of the related metabolic processes. Grouped metabolites simultaneously showed similar expression patterns and could participate in closely related reaction steps in metabolic processes. Figure 3C illustrates the hierarchical clustering results of the top 28 VIPs. The color scale reveals a clear separation between C1 and C2, with some differential patterns. Specifically, Nuciferine and Cryptotanshinone were more abundant in C1 (Rionegro and Santuario), whereas 4’-Methoxyflavonol, N-Hexadecanoylpyrrolidine, Microcystin LW, and Feruloyltyramine showed higher abundance in carrots from C2 (Marinilla). Additionally, more metabolites were enriched in C2 of Marinilla than in C1 (Rionegro and Santuario)15,16,19,30. Furthermore, we evaluated the foldchange of the most important VIPs in the volcano plot. Here, colored dots represent metabolites with fold change (FC) > 1 or FC < -1 and a p-value threshold on FDR < 0.05, indicating differentially expressed metabolites; FC > 1 suggests an upregulation of the metabolite, and FC < -1 indicates downregulation of the metabolite. The univariate statistical analysis results are shown in Fig. 4.

Fig. 4
Fig. 4
Full size image

Volcano plot highlighting differential metabolites between cluster 2 (Marinilla) and cluster 1 (Rionegro and El Santuario).

The blue dots on the right side of the x-axis represent metabolites that are more abundant in C1 (Rionegro and El Santuario), such as Nuciferine with the highest foldchange value (> 2.5) and with a clear significant difference from superior abundance in C1 compared to C2. Cryptotanshinone with a fold change value between 1.0 and 1.5 stands out as the most significant metabolites in C1, suggesting a high accumulation of these compounds in those localities. This variability may be influenced by specific agroclimatic factors in Rionegro and Santuario that promote the accumulation of these metabolites, possibly related to environmental stress responses or unique cultivation conditions. On the left side of the x-axis, the more negative values indicate metabolites more abundant in C2 (Marinilla), such as Feruloyltyramine, Microcystin LW, N-Hexadecanoylpyrrolidine, and 4’-Methoxyflavonol, with foldchange values between − 0.5 and − 1, reporting the higher abundance values of in C2, and a significant difference concerning C1; especially, Feruloyltyramine and Microcystin LW which presented the highest abundance values in C2.

Additionally, a metabolic enrichment analysis was conducted to understand better the metabolic pathways associated with the identified metabolites, based on significance level (p-value < 0.05) and enrichment score. Figure 5 shows the most significant metabolic pathways, with the α-linolenic acid metabolism pathway at the top of the list, followed by the vitamin A and carotenoid metabolism pathway; these two primary pathways have the lowest p-values, suggesting that the metabolites present in these pathways significantly impact the chemical richness of carrots from the evaluated localities.

Fig. 5
Fig. 5
Full size image

Metabolomic enrichment analysis of carrot residual biomass samples from the evaluated locations.

Valorization potential of residual carrot biomass based on metabolomic profiles.

The metabolomic analysis revealed a high degree of chemical homogeneity among different morphological types of residual carrot biomass—namely, cracked, deformed, pathologically damaged, and control samples—within each evaluated locality. This consistency indicates that external morphological defects do not significantly alter the metabolic composition of the biomass, supporting the notion that such residues maintain their biochemical integrity and retain substantial potential for Agroindustrial valorization, regardless of their visual quality.

In contrast, metabolomic differentiation was clearly associated with the production locality, reflecting the influence of distinct microclimatic conditions. Samples from Cluster C1 (Rionegro and El Santuario) showed elevated levels of nuciferine and cryptotanshinone, two alkaloids with well-documented pharmacological properties including anti-inflammatory, antioxidant, anticancer, and cardiovascular effects37,38,39. These compounds are considered promising candidates for pharmaceutical and nutraceutical applications (Table 2), and their occurrence under specific climatic patterns suggests potential for targeted agronomic optimization in regions with similar environmental profiles.

Conversely, the Marinilla cluster (C2) exhibited a broader metabolic diversity, with higher relative abundance of 4’-methoxyflavonol, N-hexadecanoylpyrrolidine, feruloyltyramine, and microcystin LW. These metabolites are linked to plant defense mechanisms, stress tolerance, and antioxidant activity46,47,48,49,50,51,52,53,54,55,56. Notably, N-hexadecanoylpyrrolidine presents a broad spectrum of bioactivities—including antimicrobial, anticancer, and neuropharmacological effects—making it an attractive molecule for nutraceutical and agrochemical exploration40,41,42,43. Despite the toxicological concerns associated with microcystin LW44,45, its detection opens avenues for studying plant–cyanobacteria interactions and assessing potential risks or remediation strategies in agroecosystems. Feruloyltyramine, involved in cell wall reinforcement and abiotic stress response, also underscores the adaptive metabolic responses driven by environmental conditions in high-altitude and high-precipitation zones. These results indicate that microclimate exerts a stronger influence than morphological typology on the metabolomic architecture of residual carrots, and thereby, on their potential applications. This highlights the strategic importance of territoriality in guiding the differentiated valorization of agro-industrial residues. While the identification of bioactive metabolites is promising, the study is constrained by the lack of quantitative metabolite data and absence of validation through analytical standards, which limits immediate industrial scalability; in addition, the observed results should be interpreted with caution, as they did not include negative ionization. Future research should address these limitations through metabolite quantification, standard validation, and bioactivity assays, thus enabling the development of microclimate-specific valorization pathways within the framework of sustainable and circular bioeconomy models.

Table 2 Metabolite description and bioprospection potential of Carrot residual biomass.

Discussion

There was no discernible separation between the carrot typologies; however, examining the climate patterns revealed a clear clustering of the three localities. This suggests that pedoclimatic factors and cultural practices exert a greater influence on metabolomic profiles between localities than within each locality20,52. Related studies have reported consistent chemical compound contents, such as α-carotene and β-carotene, across different carrot residual types from the same harvest locality7. Moreover, geographic origin has been shown to drive metabolomic segregation within the same crop variety53. Altitude seems to be an essential factor, as altitude differences can influence local climate conditions, thus affecting the metabolomic variability of agricultural products planted at a specific altitude15,36,54. These findings indicate that environmental differences, particularly those influenced by altitude, have a more significant impact on metabolomic variability than the physical or pathological conditions of carrot typologies16,18,55.

The distinct separation of clusters C1 and C2 based on specific metabolite abundances indicates significant metabolic divergence. This divergence is likely driven by unique environmental factors associated with each cluster, such as altitude, temperature, precipitation, and other climatic characteristics, along with region-specific biotic factors (e.g., the influence of the growth locality), 16,19 which may activate different metabolic pathways, which may then explain variations in chemical composition and bioactive compounds15,16,19,30. Additionally, more metabolites were enriched in C2 of Marinilla than in C1 (Rionegro and Santuario). A possible reason for this differential enrichment may be related to the altitude. To adapt to high altitudes, carrot plants may invest more resources in producing secondary metabolites than primary ones56; indeed, the secondary metabolite accumulation and the increased activity of antioxidant enzymes are related to a survival strategy for high altitude adaptation57,58. Furthermore, distinct climate characteristics such as higher wind speed, precipitation, and cloud cover have been shown to significantly influence the biosynthesis of metabolites in organic waste, aligning with previous findings on the effects of climate on crop quality under abiotic stress36.

Parallelly, we decided to analyze the biological functions of selected VIPs further. In particular, Nuciferine and Cryptotanshinone had contrasting abundances for both clusters and were higher in C1 than C2. In general, alkaloids, including Nuciferine, exhibit diverse biological activities depending on cellular conditions and receptor interactions, ranging from protective roles to metabolic regulation (e.g., chemical defense or attack) and metabolic regulation59. For instance, alkaloids like Nuciferine may be induced in plants under stress conditions and play a protective role against heat and UV light stress59,60. Stress-inducing climatic factors, including higher temperatures and greater solar radiation are characteristic of C1 localities, potentially contributing to the enhanced accumulation of Nuciferine in these regions.

Nuciferine is an alkaloid of the aporphine class, previously reported from the lotus plant, that exhibits a broad spectrum of biological activities with significant potential for therapeutic applications. Nuciferine has been associated with activities including anti-hyperlipidemic, anti-obesity, anti-fatty liver, antidiabetic, anti-hyperuricemic, anticancer, anti-inflammatory, antiviral, and antioxidant properties61. Additionally, Nuciferine has been linked to promoting longevity, as evidenced by studies in Caenorhabditis elegans, where it extended lifespan by approximately 14.86% at a concentration of 100 µM. This effect is attributed to the downregulation of the insulin/IGF-1 signaling pathway, enhancing stress tolerance and overall health62. Its anti-inflammatory effects are particularly noteworthy, as Nuciferine modulates oxidative stress and metabolic signaling pathways, playing a crucial role in managing conditions such as obesity, diabetes, and cardiovascular diseases63. Cardiovascular protection is another prominent benefit of Nuciferine. By alleviating vascular cell adhesion molecule 1 (VCAM1) activation through autophagy activation and inhibition of the Akt/mTOR signaling pathway, Nuciferine reduces risks associated with cardiovascular events64. Moreover, its hepatoprotective effects have been demonstrated in bovine hepatocytes, where it mitigates free fatty acid-induced oxidative damage by activating the TFEB/PGC-1α signaling pathway. This mechanism enhances cellular resilience and decreases oxidative stress markers65. Previous findings position Nuciferine as a valuable compound for addressing oxidative damage, metabolic disorders, and inflammatory diseases.

On the other hand, the accumulation of Cryptotanshinone (CPT) in plants appears to be influenced by environmental factors such as radiation, temperature, and precipitation66,67. Extreme climatic conditions, including elevated temperatures, intense solar radiation, and high precipitation levels (> 1877 mm), have been associated with increased CPT synthesis68. These observations align with the climatic characteristics of higher temperatures and greater solar radiation found in C1 localities. Such conditions likely contribute to the enhanced production of CPT, positioning residual carrots from C1 as a valuable source for bioprospecting and therapeutic exploration. The Cryptotanshinone (CPT), a quinone-diterpene from the tanshinone class, exhibits various biological activities, including antioxidant, anti-inflammatory, cardiovascular protective effects, and potential roles in cancer therapy69,70,71. Its mechanism of action is multifaceted, impacting various cellular processes and signaling pathways critical to its efficacy in diverse applications. In the context of antimicrobial activity, CPT demonstrates significant antibacterial properties by interacting with cell membranes, leading to membrane permeabilization and effectively combating various infectious bacteria72 In terms of anticancer effects, CPT inhibits the proliferation of melanoma cells by activating the AMPK pathway, which disrupts glycolysis and promotes apoptosis73. Additionally, in ovarian cancer, CPT induces metabolic disruption by reducing ATP production and enhancing protein catabolism, resulting in cell cycle arrest74. CPT also plays a vital role in metabolic regulation. It has been shown to improve metabolic functions in conditions such as polycystic ovary syndrome (PCOS) by modulating estrogen signaling pathways, which may enhance reproductive health75. Moreover, CPT regulates gut microbiota and bile acid metabolism, demonstrating promise in alleviating conditions such as radiation-induced lung fibrosis76.

In contrast, Marinilla had a larger number of metabolites with VIP scores above 2, such as 4’-Methoxyflavonol, N-Hexadecanoylpyrrolidine, Microcystin LW, and Feruloyltyramine. 4’-Methoxyflavonol belongs to the flavonoid family, which in plants have different functions, mainly as a defense system77., a higher flavonoid content has been associated with higher altitudes78, which may be related to the higher altitude in Marinilla compared to the other locations and this molecules has been related to antioxidant, anti-acetylcholinesterase, antibacterial, and antifungal activities in halophyte plants39. Additionally, 4’-Methoxyflavonol has been used in CO-releasing activities, which benefits the cardiovascular system and inflammation37,38. IN-Hexadecanoylpyrrolidine is an N-acylpyrrolidine, as part of pyrrolidine alkaloids (PAs). The PAs are heterocyclic organic compounds synthesized by plants with a wide variety of biological and pharmacological activities, including antioxidant, anti-inflammatory, antibacterial, antifungal, antiparasitic and anthelmintic, anticancer, anti-hyperglycemic, organ protective, and neuropharmacological activities40,41, and are associated with defense compounds against herbivores and insects42,43. The richness of PAs in plant roots has been shown to increase under conditions of higher soil moisture. This suggests that elevated precipitation intensifies pathogenic pressure, thereby enhancing PA synthesis as a defense mechanism against below-ground threats79. This phenomenon may explain the higher abundance of N-Hexadecanoylpyrrolidine observed in C2, correlating with the elevated precipitation levels in this locality. While PAs offer exciting opportunities for drug development and agricultural applications, their toxicological profiles necessitate careful evaluation. Balancing their therapeutic benefits against potential risks remains a critical area for future research, aiming to harness their potential safely and effectively.

Additionally, natural pyrrolidine derivatives have been identified in wild carrots80. PAs are characterized by their dual nature, offering both therapeutic applications and toxicological concerns. Therapeutically, PAs demonstrate antioxidant properties, effectively mitigating oxidative stress and inflammation; they also exhibit antimicrobial activity, making them promising candidates for antibacterial and antifungal treatments40. Furthermore, some PAs show anticancer potential, with mechanisms involving cell cycle regulation and apoptosis induction40,81.

However, PAs are not without risks. Toxicological concerns include hepatotoxicity, where specific compounds are linked to liver damage and hepatocarcinogenesis81,82. Neurotoxicity has also been observed, as PAs can antagonize muscarinic acetylcholine receptors, leading to adverse effects on the nervous system83. Additionally, PAs have been associated with genotoxic effects, such as DNA damage and cell cycle disruptions, raising concerns about their mutagenic potential81. Microcystin LW belongs to the microcystin family, a family of cyclic heptapeptides produced and released to the surface water of lakes and reservoirs during cyanobacterial blooms44,45. Some climatic conditions related to severe storms and rainfall events, rising CO2 concentrations, global climate change, or eutrophic water conditions contribute to this occurrence45,84. Root vegetable crops like carrots may act as an exposure medium when they are irrigated with water containing cyanobacteria. The cyanobacteria cell may stick to plant tissues, and therefore microcystins can be absorbed84,85. Great precipitations and agricultural practices possibly related to irrigation water systems and reservoir water used in C2 could influence the presence of Microcystin LW in those residual biomass carrots. Microcystin LW is a potent biological toxin with diverse effects on living organisms, extending beyond its well-known hepatotoxicity to include neurotoxic and ecotoxicological impacts. Hepatotoxicity remains its most recognized effect, as Microcystin LW disrupts liver function, inducing hepatocyte cell death. Long-term exposure to contaminated water can result in severe liver damage in both humans and animals86,87. Neurotoxic effects have also been observed, with studies indicating selective cytotoxicity in rat astrocytes, leading to reduced cell viability and increased apoptosis44. Furthermore, the toxin disrupts cytoskeletal integrity in astrocytes, compromising homeostasis critical for proper brain function44.

Ecologically, Microcystin LW poses significant risks as it accumulates in aquatic organisms and can enter the food chain, impacting higher trophic levels, including humans86. Its presence in agricultural systems, such as through contaminated irrigation water, can also negatively affect crops, resulting in broader ecological consequences86. These findings underscore the importance of monitoring and managing water sources used for irrigation, particularly in regions like C2, where high precipitation levels may exacerbate cyanobacterial contamination and toxin absorption in crops.

Feruloyltyramine is a phenolamine reported as a polyphenol compound with antioxidant activity88,89. Phenolamines are secondary metabolites which are derived from the conjugation of a phenolic fraction with polyamines. Their function in plants is related to cell wall reinforcement and defense through toxicity for pathogens and predators90,91. Polyamines, in general, have a role in developing stress tolerance, including resistance to salt, high and low temperatures, hypoxia, hyperosmosis, and atmospheric pollutants91. Local climatic characteristics may increase abiotic stress conditions in C2, considering the greater altitude, precipitation, and higher wind speed, which may mediate Feruloyltyramine production. Beyond its role in plants, N-trans-feruloyltyramine (NTF), a derivative of Feruloyltyramine, exhibits a range of biological activities, including neuroprotection, antioxidant effects, and anti-inflammatory properties. Research indicates that NTF can mitigate cholinergic dysfunction and memory impairment, particularly in neurodegeneration models such as scopolamine-induced cognitive decline in rats92. This neuroprotective effect involves the reduction of reactive oxygen species (ROS) and apoptosis markers in brain tissues92. NTF also demonstrates significant antioxidant capabilities by lowering intracellular ROS levels in neuroblastoma cells. It reverses H2O2-induced apoptosis through modulation of pro-apoptotic and anti-apoptotic proteins, highlighting its cytoprotective potential93. Additionally, NTF exhibits anti-inflammatory mechanisms by suppressing inflammatory mediators such as iNOS and COX-2 in macrophages through the inhibition of AP-1 and MAPK signaling pathways94. While the therapeutic potential of NTF is promising, further studies are needed to explore its full pharmacological profiles and safety in clinical applications. The interplay between local abiotic stress factors in C2 and the enrichment of Feruloyltyramine underscores its potential as a source of bioactive compounds. These findings suggest that climatic factors could drive the production of metabolites like Feruloyltyramine with significant applications in neuroprotection, antioxidant therapies, and inflammation management.

Some studies conducted on the carrot metabolome have revealed a similar number of annotated metabolites, with very few works focusing on the evaluation of the direct effects of locality or microclimate on the metabolome, reporting annotations of up to 86 metabolites, which include organic acids, phenolic compounds, carotenoids, among others29,30, but none of the aforementioned. Other studies have particularly focused on analyzing the richness in terpenes as an impact on the flavor of carrots95, or the effect of different production systems or simple transformation processes96,97; However, so far, no study has been reported that focuses its efforts on the effect of the microclimate associated with the harvesting locality on the residual biomass of carrots.

The α-linolenic acid metabolism involved in unsaturated fatty acid biosynthesis has been reported to exhibit particular enrichment under cold or low-temperature stress and reduced water availability98,99. This pathway, which includes oleic acid and linoleic acid, serves as a gateway to the biosynthesis of various unsaturated fatty acids99, and is closely linked to antioxidant production and jasmonic acid biosynthesis98. On the other hand, high levels of atmospheric humidity and greater precipitations have varying effects on carotenoid accumulation, with a negative impact on β-carotene content and a positive effect on α-branch carotenoid and total carotenoids, respectively. Additionally, photoperiod and light intensity duration have been reported as a differential factor on α-branch or β-branch carotenoid biosynthesis, with high radiation duration and photoperiod having a more significant positive effect on α-branch carotenoid accumulation, while exposure to low light intensity increases β-carotene content52.

Other pathways, such as ferroptosis, iron-dependent programmed cell death, and omega-3/omega-6 fatty acid synthesis, also showed considerable enrichment. Ferroptosis, such as an iron-dependent, oxidative cell death process, has a physiological role in cell death regulation in response to heat stress in plants100,101; moreover, cellular susceptibility to ferroptosis is significantly influenced by the polyunsaturated fatty acids absorption and metabolism, and synthesis of polyunsaturated fatty acids-phospholipids102, being linolenic acid, the primary polyunsaturated fatty acids obtained from plant food sources100. Other pathways with lower significance levels, such as those related to vitamin and micronutrient metabolism, are also statistically relevant. For instance, the alpha-tocopherol role in ferroptosis inhibition is the typical endogenous lipophilic radical-trapping antioxidant102. This result suggests that there may also be differential regulation of these pathways, albeit with less impact on the observed differences.

The physiological adaptability of plants to their local microclimates aligns with the capacity of certain plants to modify biosynthetic pathways of secondary compounds such as polyphenols and terpenoids, adapting to environmental variations through a process known as phenotypic plasticity. This plasticity enables plants to maintain stability and synthesize bioactive compounds with antioxidant and nutraceutical properties103,104. More remarkably, plasticity has been primarily noted for terpene molecules and phenolic compounds (Myristicin, 6-methoxymellein, and D-germacrene) in different carrot varieties and locations, suggesting a co-regulation mechanism in their accumulation, which indicates that these compounds appear to have a similar pattern with common localities and varieties implicated in accumulation29.

Moreover, the enriched metabolic pathways offer significant opportunities for bioprospecting, mainly through α-linolenic acid metabolism, carotenoid biosynthesis, ferroptosis, and omega-3/omega-6 fatty acid synthesis. The α-linolenic acid pathway plays a central role in the biosynthesis of unsaturated fatty acids, which are precursors for antioxidant compounds98,99. This pathway could be explored to develop bio-functional or nutraceutical products based on carrot residual biomass. Carotenoid biosynthesis, influenced by environmental factors such as light intensity and humidity, has potential in biofortification programs that optimize α- and β-carotenoid levels in carrot crops52. This potential is particularly relevant for improving dietary vitamin A intake and antioxidant activity under varying agroclimatic conditions. The ferroptosis pathway, mediated by iron-dependent oxidative stress, allows studying stress-induced cell death mechanisms in carrot plants. Its interplay with polyunsaturated fatty acid metabolism and the role of alpha-tocopherol as an endogenous ferroptosis inhibitor highlight its potential in developing biotechnological tools to enhance stress tolerance and crop resilience100,102. Finally, the omega-3/omega-6 fatty acid synthesis pathway underscores its importance in using carrot residual biomass to produce essential fatty acids for human health. Together with micronutrient metabolism, these pathways could serve as key targets for agricultural, nutraceutical, and pharmaceutical applications, supporting sustainable crop production and human nutrition.

Conclusion

This study underscores the significant impact of agroclimatic conditions on the metabolomic profiles of residual carrot biomass from three Colombian localities. The metabolomic differences between clusters C1 (Rionegro and El Santuario) and C2 (Marinilla) highlight the influence of environmental factors such as altitude, precipitation, temperature, and solar radiation in shaping metabolic profiles. Notably, metabolites like Nuciferine and Cryptotanshinone were more abundant in C1, consistent with the warmer temperatures and higher solar radiation in these localities. Conversely, metabolites such as 4′-methoxyflavonol, N-Hexadecanoylpyrrolidine, Microcystin LW, and Feruloyltyramine were enriched in C2, correlating with Marinilla’s higher altitude and precipitation levels.

The bioprospecting potential of these metabolites is substantial. Compounds such as Nuciferine and Cryptotanshinone show promise for pharmaceutical and nutraceutical applications due to their antioxidant, anti-inflammatory, and anticancer properties. Similarly, metabolites enriched in C2, including 4′-methoxyflavonol and N-Hexadecanoylpyrrolidine, hold potential for use in functional foods and agrochemical innovations. Pathways like α-linolenic acid metabolism and carotenoid biosynthesis further highlight the potential of residual carrots as a sustainable resource for producing bioactive compounds. However, a future research phase is necessary to further validate with reference standards and quantify the metabolites identified, thereby providing a more comprehensive understanding of the metabolomic profiles and their potential applications or precautions. Additionally, metabolomic studies (not lipidomic) are recommended to achieve a deeper understanding of the differences in carrot biomass typologies.

These findings advance the valorization of agricultural waste, offering a foundation for future biotechnological and agricultural innovations; moreover, this study is pioneering in evaluating the effects of microclimate on carrot residual biomass associated with the harvest location. By optimizing the use of residual biomass through targeted bioprospecting, this research bridges the gap between food waste management and the development of high-value bio-based products. This approach supports circular economy models, promoting environmental sustainability while driving economic growth.

The supplementary data can be found in Mendeley Data online (“Carrot residual biomass supplementary metabolomic data”, Mendeley Data, V1, https://data.mendeley.com/datasets/7vbbkv6n9t/2).

Acknowledgements.