Introduction

The wood apple (Feronia limonia) is a native fruit plant of India that is a member of the Rutaceae family. India, Pakistan, Bangladesh, China, Africa, and Southeast Asia are among the tropical and subtropical locations where it is extensively found1. It goes by several regional names in India, including Kavath, Kaith, Kotha, and Kabeet2. The wood apple is a hardy plant that grows naturally in the western Himalayas up to an elevation of roughly 500–600 m. The species is primarily distributed across the semi-arid zones and rain-fed regions of India, with major occurrences in Rajasthan, Gujarat, Maharashtra, Uttar Pradesh, Madhya Pradesh, Tamil Nadu and Karnataka. All parts of the plant are of economic importance, with fruits used in food and beverage manufacturing, and leaves, bark, stem, and roots employed for various medicinal purposes. Fruit ripens between early October to March3. The plant is known for its adaptability to harsh climatic conditions, thriving in areas with a monsoon climate and distinct dry seasons4.

Among the many benefits of wood apple are its uses in beverages, processed foods (such as murabba, preserves, and chutneys). It has been prized for its digestive, carminative, and anti-diarrheal qualities in Ayurvedic and traditional medicine5. Furthermore, the plant includes bioactive substances that are thought to have medicinal properties, including triterpenoids, alkaloids, phenols, and flavonoids6,7,8,9. Despite its high utility, wood apple remains underutilized commercially in hot semi-arid regions of India and worldwide (Bangladesh, Pakistan, Myanmar, Cambodia, Thailand ad north eastern Africa). Its resilience and soil reclamation potential position it as a promising crop for future horticultural development in challenging environments10 and can guide varietal improvement, enhance productivity, and promote the adoption of wood apple as a viable commercial crop in resource-limited environments for small and marginal farmers. Fruit pulp contents ranging from 40.4 to 60.6%, sugar contents ranging from 3.26 to 14.3%, and ascorbic acid levels ranging from 2.7 to 20.45 mg, wood apple fruits offer a pleasing sweet-acidic flavor11,12. Wood apples have a lot of potential to improve fruit production and the creation of value-added goods because of their versatility, nutritional advantages, and therapeutic qualities.

For upcoming crop improvement initiatives, it is crucial to characterize the genetic diversity and phenotypic variability in minor fruit crops. Traditional breeding techniques have produced superior variants of many fruit species, providing a solid basis for boosting horticultural production around the world3,11. Its hardy perennial nature and ease of propagation through grafting and budding make wood apple a promising commercial crop for hot semi-arid regions, offering income-generation opportunities for small and marginal farmers.

The nutritional and quality traits evaluated in this study demonstrated the substantial importance for both consumer acceptance and market value of wood apple. Attributes like total soluble solids, acidity, pulp percentage, sugar content and ascorbic acid concentration directly influence flavor, processing suitability, and nutritional benefits. High pulp yield and desirable flavor profiles enhance the potential for value-added products like chutney, fruit juice and pickles while elevated ascorbic acid and other micronutrients contribute to the crop’s functional food potential and enrich antioxidant profile of fruit. By identifying genotypes with superior quality and nutritional profiles, the study provides a foundation for developing cultivars that meet both market preferences and dietary needs, thereby supporting commercialization and promoting wider adoption of wood apple in semi-arid regions.

Although the crop has substantial nutritional and therapeutic potential, it has not yet been thoroughly investigated or exploited for large-scale production. The majority of wood apple plants are seedlings, which provide an excellent opportunity to choose elite genotypes with outstanding fruit quality and yield potential. The purpose of this study is to assess superior genotypes for adaptation, value-added goods, and quality yield production in Western India’s rainfed environment. In addition to assessing the qualitative characteristics of fruits and derived products, it is essential to disseminate this knowledge to farmers and promote the commercial cultivation of superior genotypes under problematic soil conditions. Such efforts would facilitate the wider adoption of wood apple in semi-arid regions, thereby enhancing local economies and improving agricultural productivity.

Materials and methods

Study sites and experimental materials

The research was conducted during 2022–2024 at the experimental wood apple Germplasm Block of the Central Horticultural Experiment Station in Godhra, Gujarat, India (Fig. 1), situated at a latitude of 22.692948°N, longitude of 73.536038° E, and an elevation of 113.12 m above sea level. The various Wood Apple genotypes were created via in situ grafting and were subsequently planted at a 10 × 5 m spacing.

Fig. 1
figure 1

Germplasm block of wood apple at ICAR-central horticultural experimental station Godhra, Gujarat.

Five plants were used to represent each genotype and these plant developed by used scion of marked mother plant maintained in mother block at the station, with each plant functioning as a replicate. The planting layout conformed to a randomized block pattern. The ages of the trees within the orchard ranged from 13 to 14 years. The minimal descriptors for the Wood Apple germplasm, used for development, consist of 46 genotypes. Genetic profiles were gathered from regions of high biodiversity through surveys and from research institutions in different agroecological zones across the country. The collection sites were visited on two occasions during the flowering stage and again during the ripening phase to assess the morphological and qualitative attributes of the fruits. Shoots from scion trees exhibiting the desired characteristics were harvested in March and April, then grafted in situ at the experimental field of the Station.

The key characteristics used for identifying genotypes involved attributes are presented in supplementary Table 1. The important traits analyzed in the study included fruit length, fruit width, fruit weight, rind thickness, seeds per fruit, fresh seed weight per fruit, dry seed weight per fruit, pulp percentage, reducing sugars, total sugars, total soluble solids (TSS) in the pulp, TSS in the peel, acidity in the pulp, acidity in the peel, vitamin C content in the pulp, vitamin C content in the peel, mineral nutrient composition, protein content in the pulp, seed protein, leaf protein, and yield.

The various genotypes were evaluated under the agricultural climate conditions of a semi-arid region with average annual rainfall of around 750 mm, and maximum temperature fluctuations spanning from 33 °C to 47 °C in the summer to 8–23 °C in the winter. The experimental site had soil consisting of shallow black cotton-to-clay loam, with 143.60 kg/ha of available nitrogen, 7.20 kg/ha of phosphorus, and 144.21 kg/ha of potassium present in the 0–22 cm soil depth. The soil had a pH of 7.20, EC value of 0.39 dS/m, bulk density of 1.32 g/cc, hydraulic conductivity of 0.38 cm/h, and organic carbon content of 0.41%. Throughout the experimentation, each tree was subjected to a consistent application of 40 kg of farmyard manure, 1.00 kg of nitrogen, 0.75 kg of phosphorus, and 1.00 kg of potash in 2nd week of august month in both year.

We took photographs of plants or plant parts using a Sony Cyber-Shot digital camera (Japan) from the field repository for creating variability. For various observations, three trees free from pests and diseases were selected to ensure better results. The variability and morphological variation in wood apple germplasm in tree, fruit, leaves, flower, and bark are presented in Figs. 2, 3, 4, 5, 6, 7, 8 and 9 and supplementary Table 1. The harvested samples were cleaned with a moist cloth to remove dust and control the field heat. A standard procedure was used for the estimation of tartaric acid (%), reducing sugars (%), non-reducing sugars (%), and total sugars (%), as described in Association of Official Agricultural Chemists (AOAC)13.

Fig. 2
figure 2

Variation in growth habit in wood apple genotypes (A) Spreading, (B) Drooping, (C) Semi-spreading, (D) Erect.

Fig. 3
figure 3

Variability in bark colour in wood apple genotypes (A) Whitish gray, (B) Grey, (C) Blackish grey.

Fig. 4
figure 4

Variability in leafsize, leaf petiole and leaflets (A) and thorn (B) of wood apple genotypes.

Fig. 5
figure 5

Variation in Flower morphology: (A) Staminate flower; (B) Pistillate flower.

Fig. 6
figure 6

Different fruit development stages.

Fig. 7
figure 7

variability in fruit shape, surface texture, and colour of wood apple genotypes.

Fig. 8
figure 8

variability in fruit pulp colour and texture among wood apple genotypes.

Fig. 9
figure 9

Seed colour and shape variability in wood apple genotypes.

The dinitrophenylhydrazine (DNPH) procedure was used to measure the ascorbic acid concentration. The value14 was given as mg/100 g fw. Total phenolics were estimated using Folin–Ciocalteu reagent15. To 100 µL of the sample extract (80% ethanol) 2.9 mL of deionized water, 0.5 mL of Folin–Ciocalteu reagent and 2.0 mL of 20% Na2CO3 solution were added. The mixture was allowed to stand for 90 min and absorption was measured at 760 nm against a reagent blank in spectrophotometer. Results were expressed as Gallic acid equivalent (mg GAE/100 g fw).

From the digested solution, the plant nutrients including Ca, Mg, and Fewere determined according to the method proposed by Tandon16. Potassium (K) content was estimated by a flame photometer8. The results were expressed in mg/100 g dw.

Statistical analysis

The data and plot analysis were carried out using various packages of Rstudio version 2023.3.0 + 386. The latest available versions of FactoMineR, factoextra, and ggplot2 were used for Principal component analysis17,18. Cluster analysis was carried out with the help of the cluster, factoextra, dendextend, and ggplot2 packages19. The variability analyzed by variability and agricolae and correlation by corrplot20.

These studies sought to better understand genetic diversity, phenotypic relationships, and clustering patterns among the accessions, thus providing data for sensible breeding techniques. First, calculated heritability (2), genotypic coefficient of variation (GCV), phenotypic coefficient of variation (PCV), genetic advance (GA), and genetic advance as a percentage of mean (GAM). Using the Pandas and NumPy tools, Python constructed these measures to handle and probe the data. The probability of genetic improvement depends on an evaluation of variability; greater values of heritability and GAM indicate features that are fit for selection.

The correlations among these factors were assessed using correlation analysis. Genotypic and phenotypic associations were computed and visualized using heatmaps generated with Python’s seaborn library to reveal trait interdependence, aiding indirect selection in breeding programs. Dimensionality reduction was performed via principal component analysis (PCA) using the scikit-learn suite21,22. Qualitative traits were coded as ordinal categories to integrate with quantitative data, ensuring meaningful contribution to the analysis. Kaiser normalization was applied for scale standardization, and Varimax orthogonal rotation was used to maximize variance in factor loadings, enhancing interpretability. Components with eigenvalues > 1 (Kaiser criterion) were retained23,24. PCA outputs included biplots, 2D score plots, and scree plots, enabling identification of traits contributing most to variance among accessions. Hierarchical clustering, implemented via the SciPy library with seaborn-generated dendrograms and heatmaps, grouped accessions based on trait profiles. Cluster mean analysis and inter- and intra-cluster distances, computed from the D2D^2D2 statistic, quantified genetic diversity and distinguished characteristic features of each group25,26.

Results

The ANOVA results presented in supplementary Table 2 show significant variability in the attributes of forty-six wood apple (Feronia limonia) accessions, with several traits demonstrating highly significant differences.Fruit weight (FW), seed number per fruit (SNF), total soluble solids (TSSP), potassium (K), magnesium (Mg), and yield per plant (YP) are among the highly significant features (p < 0.01). Significant variation between the accessions is indicated by these attributes’ substantial F-values for both the entries and replications. Fruit length (FL), fruit width (FB), and pulp weight (Pu) are traits that exhibit significant diversity across the accessions and have significant differences at the 5% level. Iron (Fe), calcium (Ca), total solids (TS), and disease resistance (RS) are non-significant features for which no discernible variations were seen among the accessions. Notably, though, substantial variations were discovered at the entry level for iron (Fe) (p < 0.01).The coefficient of variation (C.V.) for the traits varied from 1.94% (RT) to 132.07% (Fe), indicating that some traits have more stable values across the accessions while others show greater variation.

A comprehensive overview of the mean, maximum, minimum, and variance components (genotypic, phenotypic, and environmental variance) for twenty-four attributes of forty-six wood apple (Feronia limonia) accessions is shown in Table 1. The mean values of the qualities vary significantly. For instance, the mean potassium (K) value is 1.59, yet the mean fruit weight (FW) is 244.41 g. Fruit weight (FW) and seed number per fruit (SNF) are two examples of qualities that have very high maximum values (382.92 g and 432.76, respectively), but the minimum values are significantly lower (128.62 g for FW and 192.45 for SNF).

Table 1 Estimates of mean, maximum, maximum and different variables for twenty-four attributes in forty-six wood Apple (Feronia limonia) accessions.

Estimates of environmental (Env. Var.), phenotypic (Pheno. Var.), and genotypic (Geno. Var.) variance are also shown in the Table 1. For example, fruit weight (FW) has a genotypic variance of 2590.57, far larger than its environmental variance of 24.58. This suggests that the variation in fruit weight is mostly due to genetic factors. However, in addition to their genotypic variance, some variables, such as fruit length (FL) and fruit width (FB), also exhibit significant environmental variance, suggesting that environmental factors significantly influence their variation. Low Environmental Variance: Characteristics such as pulp weight (Pu), total soluble solids (TSSPe), and seed number per fruit (SNF) exhibit low environmental variance, suggesting that they are comparatively stable across settings.

Genetic traits

Genetic Coefficient of Variation (GCV), Phenotypic Coefficient of Variation (PCV), Environmental Coefficient of Variation (ECV), Heritability (h2), Genetic Advance (GA), and GA as a percentage of the mean are among the twenty-four genetic parameter estimates for 46 wood apple (Feronialimonia) accessions shown in Table 2. This shows how much genetic variation there is in each attribute. values such as fruit weight (FW, 20.82%) and seed number per fruit (SNF, 17.79%) exhibit significant GCV, indicating substantial genetic variation, but values like iron content (Fe, 86.21%) show an extraordinarily high GCV but may be prone to other complexity. Significant overall variance is also seen in traits with high PCV values, such as fruit weight (FW, 20.92%) and seed number per fruit (SNF, 18.20%).

Table 2 Estimates of genetic parameters for twenty-four attributes in forty-six wood Apple (Feronia limonia) accessions.

An estimate of environmental variation is the Environmental Coefficient of Variation (ECV). Low ECV variables, including fruit weight (FW, 2.03%) and seed number (SNF, 3.84%), indicate that environmental influences on these qualities are negligible and that genetic factors play a major role in determining these characteristics. A trait that has a heritability value of 1 (100%) is primarily determined by genetics. For instance, characteristics with high heritability, such as fruit weight (FW, 0.9906) and seed number (SNF, 0.9555), are suitable candidates for breeding selection. On the other hand, calcium (Ca, 0.2699) exhibits low heredity, indicating that environmental variables have a significant impact on its expression.

The anticipated improvement in the trait brought about by selection is known as the genetic progress. Fruit weight (FW, 104.36) and seed number (SNF, 108.39) are two traits that exhibit considerable genetic advancement and suggest substantial room for development through selective breeding. GA expressed as a proportion of the mean: This metric aids in comprehending the relative improvement in genetics. While low values like iron (Fe, 97.07%) demonstrate the difficulty of enhancing such qualities, high values like fruit weight (FW, 42.70%) and yield per plant (YP, 57.75%) indicate that these attributes can undergo significant improvement with selection.

Correlation

Numerous noteworthy positive and negative associations for different Wood Apple features were found in the study. FB (0.8383 **), DSW (0.7522 **), FW (0.6842 **), Pu (0.5072 **), K (0.4151 **), PS (0.4096 **), SNF (0.3349 *), Mg (0.3222 *), and RT (0.317 *) were all strongly positively correlated with the trait FL, for example. Furthermore, FL exhibited positive but non-significant correlations with characteristics such as Ca (0.2564 NS), FSW (0.2546 NS), and TSSPe (0.0782 NS), as well as a significant negative association with Fe (−0.2921 *). Likewise, FB exhibited a substantial negative association with Fe (−0.408 **) and a significant positive correlation with DSW (0.7703 **), FW (0.7652 **), Pu (0.4618 **), RT (0.4024 **), and Mg (0.3605 *).Mg (FW: 0.3576 *, DSW: 0.447 **), Pu (FW: 0.4786 **, DSW: 0.7199 **), DSW (FW: 0.7119 **, DSW: 0.7119 **), and FB (FW: 0.7652 **, DSW: 0.7703 **) all showed favourable associations withother qualities like FW and DSW. On the other hand, negative associations with Fe were noted in DSW (−0.1125 NS) and FW (−0.0444 NS). The study also showed that while SNF had a positive significant correlation with FSW (0.9074 **), it had a negative significant correlation with PP (−0.3167 *) and ViCPe (−0.2963 *). In contrast, traits like RT had positive but non-significant correlations with elements like TSSPe (0.2527 NS), Ca (0.2079 NS), and Mg (0.1651 NS).

Other important characteristics, such as Ca, also demonstrated strong positive associations with Pu (0.7214 **), Mg (0.6235 **), ViCPe (0.62 **), and K (0.6034 **). On the other hand, RS showed a negative non-significant association (−0.2639 NS). While TSSPe demonstrated substantial positive correlations with K (0.3847 **) and FW (0.3331 *), traits such as TSSP also shown a positive significant association with TSSPe (0.6962 **) and AcP (0.299 *). These results demonstrate the complex interrelationships among Wood Apple characteristics, where both strong positive and negative correlations are crucial to the fruit’s development.

Principal component analysis

The Principal Component Analysis (PCA) conducted on 24 traits revealed 7 principal components (PCs) with eigenvalues greater than 1 (Fig. 10), accounting for varying degrees of total variance,22.74% for PC1, 12.84% for PC2, 10.18% for PC3, 8.96% for PC4, 7.39% for PC5, 6.03% for PC6, and 4.91% for PC7. Traits like DSW, FL, FB, FW, and Pu showed a positive correlation with the first principal component (PC1), but TSSP, YP, and Fe showed a negative correlation (Fig. 11). PC2 exhibited negative relationships with K, Pu, AcPe, and a number of other characteristics, but strong positive connections with SNF, FSW, FB, and FL. AcPe, AcP, TSSP, ViCP, and TSSPe all showed positive associations in PC3, but PL, TS, PS, YP, and RS showed negative correlations. PC4 had a negative correlation with YP, DSW, Pu, and FB, but a positive correlation with PL, Fe, TSSPe, TSSP, and PP.

Fig. 10
figure 10

Principal component analysis of Feronia limonia (A) Scree plot eigenvalues; (B) Scree plot for twenty-four traits.

Fig. 11
figure 11

Correlation of principal components and variables among twenty-four traits of forty-six Feronia limonia.

FL, PP, ViCPe, and AcPe were found to have negative relationships with PC5, whereas TSSPe, RS, RT, TSSP, and Mg displayed positive connections. ViCPe, ViCP, Pu, SNF, and K showed negative correlations with PC6, while YP, Fe, Mg, FW, and RT showed substantial positive relationships. Finally, PC7 showed negative associations with YP, ViCPe, TSSP, RT, Mg, Fe, and Ca, but positive correlations with Pu, AcPe, AcP, FW, PL, and PP. These results demonstrate the main characteristics affecting the quality and development of Wood Apple fruit.

PCA biplot of traits

This principal component analysis (PCA) biplot illustrates the relationships between different phenotypic and biochemical traits and their contribution to variability (Fig. 12). According to the plot, Principal Component 1 (PC1) accounts for 22.736% of the overall variation, whereas Principal Component 2 (PC2) explains 12.837%. With their lengths signifying the degree of linkage, arrows show the contribution and correlation of attributes. On the positive axis of PC1, traits including FW (fruit weight), FL (fruit length), DSW (dry seed weight), and K (potassium) cluster together and exhibit high relationships, suggesting their relevance and mutual influence. Different dimensions are influenced by traits on the opposing axis, such as FSW (fresh seed weight) and SNF (seed number per fruit). These traits appear to capture orthogonal components of phenotypic variance, based on the divergence of arrows.

Fig. 12
figure 12

Principal component analysis of Feronialimonia (A) biplot variables; (B) biplot genotypes for twenty-four attributes in forty-six accessions.

PCA scatter plot for accessions

Based on their morphological and biochemical traits, 46 accessions (CHESW-1 to CHESW-46) are distributed along the PC1 and PC2 axes in the scatter plot. The fact that the accessions are dispersed over the four quadrants suggests a high degree of genetic and phenotypic variety. While CHESW-26, CHESW-11, and CHESW-8 are located on the extreme negative side of PC1, accessions like CHESW-1, CHESW-39, and CHESW-32 are situated on the extreme positive axis of PC2. This variance draws attention to the diversity among the accessions and may also show how adaptable or appropriate they are for various environmental settings or breeding goals.

Genotypic path analysis

Variable linkages were found in the genotypic path analysis of the qualities influencing yield potential (Supplementary Table 3). The residual showed that the independent variables accounted for 71.96% of the variation in YP, while the model was unable to explain 28.04% of it. Numerous variables, including FL, FB, FW, and RT, had non-significant direct impacts, and some of them had negative correlations with YP (e.g., FL with r = −0.0931 and RT with r = −0.0834). Nonetheless, a number of characteristics showed favourable indirect impacts via other variables. For instance, despite its negative indirect effects via other qualities like SNF and K, FB had a favourable direct influence on YP (0.2208) and positive indirect effects through variables like PS, DSW, and FSW. Similarly, TSSP had a negative direct effect on YP (−0.20565) but showed positive indirect effects through TSSPe, FSW, and AcP.

Cluster analysis

Clustering of the genetic material using Tocher’s method and Mahalanobis D2 distance revealed distinct groupings, with cluster 2 housing the highest number of genotypes21, followed by cluster 3 with 8 genotypes (Fig. 13). On the other hand, clusters 12, 13, 10, 14, 15, 16, and 11 each contained a single genotype, indicating a high degree of uniqueness or divergence for those genotypes. Traits contributing the most to genetic diversity were FW, with a contribution of 1125.73%, followed by FB at 732.83%, emphasizing the importance of these traits in driving genetic variability within the study material.

Fig. 13
figure 13

Cluster analysis for twenty-four traits of forty-six Feronialimonia accessions.

Significant diversity was found for several features in terms of trait means across the clusters. The highest FL (99.04), FW (375.58), and YP (119.13) values were found in Cluster 2, suggesting a high potential for productivity. The lowest values for variables such as FB (63.81), FW (143.59), and DSW (4.94) were found in cluster 15, indicating a less favourable genotype for qualities associated to yield. Additionally, Cluster 3 displayed high values for features such as ViCP (33.96), suggesting that it has a significant potential for agronomic attributes. The need of selecting for numerous qualities at the same time in breeding programs to optimize genetic potential and yield outcomes is highlighted by the variety in traits between clusters.

Discussion

The findings from the ANOVA analysis provide insights into the genetic and phenotypic diversity present in the wood apple accessions for various traits. There is a lot of genetic variety among the accessions, which can be used in breeding programs targeted at enhancing important agronomic qualities like fruit weight (FW) and yield per plant (YP), as indicated by the high variance in these traits. Significant differences between accessions are also implied by the substantial F-values for parameters like fruit length (FL) and seed number per fruit (SNF), which could be explained by varying genetic backgrounds or environmental factors. Because they may suggest homogeneity or a lack of diversity in certain qualities among the accessions, the non-significant traits like RS, TS, and Fe are interesting. Either a stable characteristic within the species or external conditions reducing the trait’s variability could be the cause of this. Certain elements, including calcium (Ca) and iron (Fe), have significant coefficients of variation, which implies that they may not be genetically varied within this group of accessions or may be more influenced by external environmental influences27,28. Conversely, low-variation traits like RT and K might be a sign of environmental or genetic stability. All things considered, these findings offer valuable information for locating accessions with desired characteristics for breeding initiatives and additional research into the genetic and environmental elements affecting these characteristics in wood apples.

The data presented in Table 1 suggests substantial genetic and environmental variation in the traits of wood apple accessions, which is important for understanding the potential for selection in breeding programs. The genotypic variance is significantly greater than the environmental variance for numerous characteristics, including fruit weight (FW), seed number (SNF), and total soluble solids (TSSP). This suggests that selection in breeding programs could successfully improve these qualities, which are mostly determined by genetic factors. Alongside genotypic diversity, some variables, such as fruit length (FL) and fruit breadth (FB), exhibit significant environmental variance. This implies that environmental variables have a major influence on these features in addition to genetic influences. Therefore, when choosing for these features, breeders should take the environment into account. Traits such as seed number per fruit (SNF), pulp weight (Pu), and total soluble solids (TSSPe) show relatively low environmental variance, implying that they are more stable and less influenced by environmental fluctuations. This stability can be an advantage for selecting consistent traits across different growing conditions8,27,29,30,31,32,33,34.

The significant variation in traits like fruit weight (FW) and yield per plant (YP) suggests that these attributes offer opportunities for selection and improvement in breeding initiatives aimed at boosting productivity. However, unless exact nutrient content targets are needed, low variance traits like calcium (Ca) and iron (Fe) would not be as useful for breeding. All things considered; this table provides breeders with a starting point for rating highly genetically variable traits while accounting for environmental factors. They are able to create better wood apple varieties by making informed decisions as a result.

Genetic traits

The genetic parameters in Table 2 provide valuable insights into the potential for improving various traits of wood apple through breeding. Fruit weight (FW) and seed number per fruit (SNF) are examples of features that show high GCV and heritability, suggesting that they are mostly genetically influenced. Programs to breed for these features would probably be effective since genetic selection would produce notable advancements. Furthermore, these features’ high genetic advance (GA) indicates that selection might provide significant genetic gains. Conversely, low heritability ratings for variables like calcium (Ca) and iron (Fe) suggest that environmental factors, rather than genetics, have a greater influence on their manifestation. This implies that it might be more difficult to improve certain traits through breeding, and breeding programs aimed at them would have to take environmental factors into consideration or consider other breeding techniques like marker-assisted selection9,14,27,33,34,35,36,37,38,39,40,41

Certain traits, like pulp weight (Pu) and total soluble solids (TSSPe), exhibit increased environmental variance and relatively low heritability, suggesting that environmental factors may have a bigger impact on these traits. It could be necessary to carefully balance genetic and environmental factors when breeding for these features. Fruit weight (FW) and yield per plant (YP) are two traits that show strong genetic progress as a percentage of the mean, indicating that selection can greatly enhance these features. Breeding efforts seeking to boost productivity should give priority to such features because the anticipated benefits would be significant. The information shows that several wood apple accessions’ features are highly heritable and genetically varied, which makes them perfect candidates for selection-based improvement. To maximize their expression, characteristics with low heritability—like calcium and iron content—may need to be bred using different methods or concentrate on environmental management techniques.

Correlation

The study on Feronia limonia highlights the complex interrelationships among various fruit traits, providing valuable insights for improving cultivation and fruit quality (Fig. 14). Fruit Length (FL) is positively correlated with Fruit Breadth (FB), Fresh Weight (FW), Dry Seed Weight (DSW), and Potassium (K), indicating that larger fruits typically have more developed physiological characteristics. This confirms earlier research that found a correlation between fruit size and increased weight and nutrient accumulation42,43. Remarkably, the negative association (−0.2921) between FL and iron (Fe) suggests that larger fruits may contain less Fe, necessitating additional research on nutrient dynamics44.The notion that seeds development and total fruit size are strongly related is supported by the positive correlation between DSW and FW (0.7119)45. Furthermore, findings from prior studies on the synergistic effects of these nutrients are supported by the substantial positive correlations of potassium (K) and magnesium (Mg) with features like FB, DSW, and FW, which emphasize their involvement in stimulating growth and fruit development27,46,47,48. Calcium (Ca) is essential for fruit quality, perhaps improving texture and nutrient absorption, according to correlations between Ca and pulp content (Pu)49. Moreover, the inverse relationship between calcium and root disease (RS) would suggest that calcium may have a protective effect against problems with root health.

Fig. 14
figure 14

Pearson correlation coefficients among twenty-four traits of Feronialimonia in forty-six accessions.

The relevance of sugars and vitamins in fruit quality is highlighted by the significant connections found between Total Soluble Solids (TSSP) and Vitamin C concentration (ViCPe). According to the correlation between TSSP and Acidity (AcP) (0.299), sugar buildup and organic acid metabolism are intimately related, and this has an impact on the fruit’s flavour and nutritional makeup9,36,37,50,51,52,53,54,55,56,57,58.

PCA biplot

The PCA biplot’s arrow grouping and direction reveal important information about the relationships between the attributes. Fruit weight, fruit length, and potassium levels are examples of traits that cluster together, indicating that selection for one may have a beneficial effect on others. This is an important finding for breeding programs aimed at increasing output. On the other hand, the separation of other characteristics, such as the quantity of seeds per fruit, emphasizes the existence of trade-offs that might need to be controlled throughout the selection process. The clustering pattern highlights the significance of phenotypic flexibility in complex contexts, which is consistent with findings by27,48,59 in investigations on other underused crops. While some variables are important predictors of variation, PC1 and PC2’s contributions show that additional PCs could be necessary to capture subtle phenotypic changes.

The effectiveness of the dimensionality reduction made possible by PCA is seen by the scree plot. Principal components with eigenvalues larger than one are frequently kept for significant analysis in accordance with Kaiser’s criterion. In this instance, most of the crucial variation in the data is captured by the first few components, especially PC1 and PC2. From PC3 onward, the explained variance gradually decreased, indicating that these components primarily provide noise or redundant information rather than important information for the differentiation of phenotypic or biochemical traits22,60. Selecting only a few principal components for efficient trait selection in breeding programs and genetic diversity assessments is further supported by the “elbow point” at PC7, where the curve starts to level out. These findings align with previous research27, where PCA was instrumental in reducing complexity for genotype characterization.

Path analysis

The genotypic path analysis results in this study revealed complex relationships between various traits and yield potential (YP), indicating that indirect effects through other traits often played a more significant role than direct effects in influencing YP. In line with recent research that found indirect effects to be more relevant in explaining variance in crop yield potential, it was observed that various traits, including FL, FB, and FW, had non-significant effects on YP61,62. The significance of comprehending not just direct trait interactions but also how they interact with other agronomic factors is highlighted by the fact that, although FB had a favourable direct effect on YP, it had a negative indirect effect through traits like SNF and K. According to earlier research, microelements like Fe have been demonstrated to directly increase yield potential by improving photosynthetic efficiency and nitrogen uptake. Similarly, characteristics like Fe and Mg showed beneficial direct benefits on YP63.

The fact that PS had the biggest direct beneficial impact on YP (1.3176), indicating a crucial role in boosting yield potential, was one of the study’s most important discoveries. This is consistent with earlier studies that highlight how important photosynthesis (PS) and associated characteristics are in determining crop productivity64,65,66. Furthermore, through indirect pathways, features like DSW, TS, and PL also favorably impacted YP, supporting results from previous research that showed how traits like plant structure and seed weight indirectly affect yield67,68,69,70.

The study also revealed that although some variables, such as TSSPe, had favorable direct impacts, their detrimental indirect effects—especially through PP, FL, and FSW—emphasized the intricacy of the interactions between traits that can either increase or decrease yield. These findings pointout the critical role of precise trait selection in perennial crops, where long breeding cycles demand careful identification and prioritization of yield-associated traits. Emphasizing selection over broader breeding approaches ensures the retention and enhancement of traits that contribute most directly to long-term productivity and crop improvement.71,72.

Cluster

Significant genotype diversity was found when the genetic material was clustered using Tocher’s technique and Mahalanobis D2 distance. Cluster 2 had the most genotypes21, followed by cluster 3 with 8. A higher degree of divergence or uniqueness among these groups was indicated by the single genotypes found in the remaining clusters, which included 12, 13, 10, 14, 15, 16, and 11. These results are consistent with earlier research that effectively used Tocher’s method to classify genotypes according to their genetic distance, enabling a deeper comprehension of the genetic structure71,73. Furthermore74, contribution study showed that variables like FW and FB, were important in determining the genetic diversity. Such results reinforce the significance of these traits in genetic improvement programs and their role in enhancing productivity and performance of crops7,27.

Significant variance was found in several significant agronomic traits when the means of the attributes across the clusters were analysed. Similar tendencies have been noted in other crops, and cluster 2 showed the highest values for FL (99.04), FW (375.58), and YP (119.13), suggesting that it may have higher production qualities75,76,77,78. The genotype with the lowest yield potential, however, was cluster 15, which showed the lowest values for FB (63.81), FW (143.59), and DSW (4.94). These outcomes are in line with research demonstrating the importance of characteristics such as FW and FB as potent productivity predictors59. Additionally, Cluster 3 showed high values for certain traits, such ViCP (33.96), which would suggest that it could improve quality traits. According to79, the variability seen in these clusters emphasizes the need to choose genotypes with a wide variety of desirable traits to maximize breeding techniques meant to increase both yield and quality. New genotypes with improved genetic potential for a variety of agronomic traits may result from this strategy.

The genotypes CHESW-6, CHESW-10, CHESW-22, CHESW-27, CHESW-33, CHESW-38, and CHESW-42 exhibited distinctive morphological and nutritional attributes of significance for breeding and value addition. Morphologically, CHESW-6 possessed large leaves and a semi-spreading habit, indicating high photosynthetic potential. CHESW-10 and CHESW-33, both with flattened fruits, represented rare shape variation, with CHESW-33 additionally showing extremely compact foliage suitable for dwarf or container cultivation. CHESW-22, with small leaves and long leaflets on a spreading canopy, offers advantages for high-density planting. Green-fruited types CHESW-27, CHESW-38, and CHESW-42 present unique market appeal; CHESW-27 and CHESW-42 exhibit compact, erect growth with small leaves, while CHESW-38 combines a spreading habit with moderate foliage. Nutritionally, CHESW-6 combined high fruit weight, elevated pulp TSS, and appreciable vitamin C, making it suitable for beverages. CHESW-10 showed a balanced TSS–acidity ratio and attractive fruit surface, enhancing consumer appeal. CHESW-22 was notable for high vitamin C and above-average TSS, ideal for functional foods. CHESW-27 demonstrated moderate acidity, good TSS, and early bearing (third year after planting). CHESW-33 had high pulp recovery and total sugars, suiting dessert and processing purposes. CHESW-38 maintained consistent TSS and vitamin C across seasons. CHESW-42 recorded the highest vitamin C content, erect dwarf growth for high-density planting, bunch-bearing fruits, late maturity, and high yield despite smaller fruit size. The greenish fruit surface of CHESW-27 and CHESW-42 at maturity enhances visual freshness and market appeal. Collectively, these genotypes offer a valuable genetic pool for developing wood apple cultivars suited to specific orchard systems, processing needs, nutritional goals and consumer preferences as well as crop improvement programme in future.

Conclusion

The study on the genetic diversity, morphological, and quality traits of Feronialimonia (wood apple) in semi-arid regions reveals significant variation among the 46 accessions in various traits. High levels of genetic diversity were shown by traits such fruit weight (FW), seed number per fruit (SNF), total soluble solids (TSSP), potassium (K), magnesium (Mg), and yield per plant (YP), suggesting that they might be used in selective breeding programs. Certain features, such fruit length (FL) and pulp weight (Pu), have significant environmental influences, according to the analysis, indicating that environmental factors can have a big impact on how those qualities manifest. The significance of variables like FW and SNF, which shown high heritability and genetic advancement, making them appropriate candidates for selection in breeding programs aimed at enhancing yield and quality, was further highlighted by the genetic analysis. The principal component analysis (PCA) revealed the intricate correlations between many features and highlighted the major elements affecting the development of wood apple fruit. Additionally, the genetic diversity among the accessions was shown by cluster analysis, which identified unique clusters that might direct the creation of better genotypes for breeding. These findings emphasize the value of preserving a varied Feronialimonia genetic pool for upcoming breeding initiatives and the possibility of enhancing fruit quality and productivity by focused selection. It would be beneficial to conduct additional research on characteristics, such disease resistance and environmental stress tolerance, to create more resilient cultivars that are appropriate for semi-arid environments.