Introduction

Guava has a special position among tropical fruits in Brazil, mainly due to the great variety of products and by-products, uses, and forms in which it is consumed. In the state of Rio de Janeiro, guava is a suitable fruit in view of the edaphoclimatic conditions, favourable soils, and proximity to the port facilities, thereby benefiting the commercial production of guava trees. As such, the Universidade Estadual do Norte Fluminense Darcy Ribeiro (UENF) has been conducting an breeding program for guava trees for approximately ten years, with positive results so far in the selection and management of new genotypes1,2,3,4,5.

Under this scenario, guava tree breeding with a focus on the selection of elite genotypes via segregating populations has demonstrated to be an efficient selection strategy. It was, however, necessary to evaluate partial inbreeding on the various traits of importance for the guava tree breeding, mainly in the maintenance and fixation of the desired agronomic characters in the genotypes evaluate6.

According to Ambrósio et al.6 using S1 inbred families, in which plants with high yield and precocity could be observed, in addition to high homogeneity within the families, was shown to be effective. On the basis of what was stated above, it became necessary to conduct studies to obtain lines, to fix favorable alleles in genotypes of interest, and also to obtain hybrids, making available to producers a homogeneous and of high-quality material for orchards in various growing regions of Brazil.

In this assumption, one of the aspects of utmost importance for the plant breeding to select superior cultivars is the genetic divergence study, because it allows knowing the magnitude of the genetic variability of populations, enabling the monitoring of crosses, self-fertilization, and acquisition of viable information for the preservation and use of plant genetic resources7. In perennial species, such as P. guajava, it can be carried out on the basis of morphological and agronomic descriptors, characteristics directly related to production and market. Among the methodologies to establish the relevance of descriptors in the population characterization, the use of principal component analysis, method of Singh, and correlation estimates are emphasized. These methodologies have been used in the study of genetic diversity in fruit trees8,9.

In this context, the use of multivariate techniques has made possible studies on genetic divergence among genotypes. Most of these techniques are based on algorithms, or distance measurements, which simultaneously consider numerous characteristics and allow unifying multiple information of a set of characters10. The Gower’s technique (1971) was proposed for simultaneous analysis of quantitative and qualitative data. As such, this combined data analysis leads to a better understanding of the aspects considered and, especially, more accurate and effective conclusions, from the statistical point of view, about the genetic relationship among the accesses studied11.

Another high impact technique to quantify genetic diversity among genotypes is the Ward-Modified Location Model—Ward-MLM methodology proposed by Franco et al.12. This method permits the simultaneous analysis of both quantitative and qualitative variables, accessing a significant part of the available germplasm information. By using this technique, the optimal number of clusters can be defined, and the real probability of each access be assigned to a given cluster can be reliably identified13. This procedure has been applied to some studies, such as with maize14; fodder turnip15; tomato13; beans16; pepper/sweet pepper17; banana18; and guava2.

In view of the considerations above, this study aimed at evaluating the genetic divergence among genotypes of S1 inbred guava families by means of the Gower’s technique and the Ward-MLM methodology, in addition to verifying the correlation and relative contribution of the traits, as well as identifying the descriptors with minimum efficiency for this species.

Material and methods

Self-pollination and obtaining the evaluated population

The 18 inbred families evaluated here came from populations developed by Pessanha et al.1 in a pre-breeding study. After molecular analysis, intraspecific crosses of Psidium guajava accesses were performed. These crosses were performed among the seven superior and contrasting parents, found via RAPD molecular analysis, resulting in 17 segregating families. This segregating population of wide genetic variability was subsequently evaluated and selected by Quintal et al.5 via REML/BLUP, in which the most productive progenies were selected and self-fertilized to originate the 18 inbred families that comprise this experiment (Table 1).

Table 1 Origin and development of S1 families of guava. UENF, Campos dos Goytacazes, RJ, 2022.

The study by Quintal et al.5 aimed to produce new varieties of guava with superior characteristics by the REML/BLUP procedure at individual level. The families selected by the authors can be highlighted by their first positions in the ranking for most of the agronomic traits and fruit quality evaluated.

Notably, guava is considered a mixed reproduction species (autogamous-allogamous)19,20 and has a self-pollination rate of 60%. Therefore, for being a self-compatible species21, inbred populations can be developed, either by self-pollination or crossing between related individuals, without inbreeding depression22.

Guava flowers are actinomorphic, hermaphroditic, being a potential factor for autogamy22. Faced with this possibility, plants were self-pollinated in the field and obtained by protecting flowers, covering buds before anthesis. The buds were identified and fruits were later protected with a paper bag. After harvesting the fruits, seeds were removed by rubbing them against a steel mesh sieve under running water. The removed seeds were left to dry at room temperature for 48 h, and constantly turned over to provide uniform drying.

Seeds from the fruits of self-fertilization were sown in tubettes (three seeds each tubette) and maintained in a greenhouse under polypropylene screens (Sombrite® 50%) at the Research Support Unit (UAP), which is located in the State University of North Fluminense Darcy Ribeiro, campus of Campos dos Goytacazes-RJ (Brazil). Ambient humidity (greenhouse) was controlled by an automatic misting system activated when the ambient temperature reached 27 °C and remained in this environment for 60 days. Afterwards, the plants were transferred to the field for experiment implementation.

Location and experimental design

The experiment was conducted at the experimental station of Ilha Barra do Pomba (Barra do Pomba Island), in the municipality of Itaocara (northwest region) (21°40′ south latitude, 42°04′ west longitude, and 76-m altitude), state of Rio de Janeiro, Brazil. The soil in the area is classified as Red-Yellow Argisol23. An experimental randomized block design with 18 S1 families, three replicates, and ten plants per plot was used.

After the development of the seedlings in the greenhouse, they were planted in the field in July 2014 at a 4-m spacing between rows and 1.5-m between plants. Liming, planting and covering fertilizers were carried out according to soil analysis, in accordance with the recommendations of Costa and Costa24; it was also used drip irrigation.

Nineteen months after planting the 540 plants of the experiment, the earliest and most productive plants were evaluated. As a result, 61 (individual plants) of guava trees were evaluated by means of the descriptors defined for the species P. guajava, in line with the International Union for the Protection of New Varieties of Plants—UPOV25 and described in Campos et al.2 (Table 3).

Phenotyping

The following morphoagronomic traits were evaluated: (1) Stem—Stem diameter at 10 cm from the soil; branch height; and color of the stem in young buds in the 61 genotypes, as well as five young and fully developed leaves and five fruits per plant; (2) Leaves—Qualitative descriptors in the young leaves were the presence of anthocyanin and its intensity in the staining. In the fully developed leaves, the shape; curvature in the cross section and central vein; green color; base and tip shape were evaluated. Length; width; length/width ratio; and secondary leaf vein spacing were also examined; (3) Fruits—Final stalk shape; skin color; surface texture; calyx cavity diameter in relation to the one of the fruits; flesh color; and external flesh thickness in relation to the core diameter were all evaluated. Quantitative descriptors for fruit were number; fruit weight; fruit placental mass; fruit flesh mass; length; diameter; shape index; and flesh and placental thickness of the fruit.

Analysis of genetic diversity by the Gower’s technique and clustering by the UPGMA method

In the analysis of genetic diversity, multivariate methods were adopted, applying quantitative and qualitative variables. Firstly, the Gower index26 was used to evaluate the quantitative and qualitative traits. Matrices and later dendrograms were constructed. At first, the main matrix, which contains the 29 descriptors, was constructed and served as a reference for data analysis. Subsequently, the matrices comprising the descriptors of stem; leaves; and fruits were constructed separately. An estimate of the dissimilarity index, ranging from 0 to 1, was generated.

The dissimilarity was obtained by

$$ Sij = \frac{{\sum\limits_{k = 1}^{p} {W_{ijk} } \cdot S_{ijk} }}{{\sum\limits_{K = 1}^{p} {W_{ijk} } }}, $$
(1)

in which:i and j = individuals to be compared regarding characteristic k;p = total number of characteristics; andSij = contribution of variable k to the total distance.

If the variable was qualitative, Sijk considered value 1 when there was positive or negative agreement for characteristic k between individuals i and j; otherwise, when the variable is quantitative, there is

$$ S_{ij} = \frac{{\left| {Y_{ik} - Y_{jk} } \right|}}{{R_{k} }}, $$
(2)

in which:Rk = the amplitude of variation of variable k, assuming values 0 and 1 or integers between them.

The wijk value was a weight applied to define individual Sijk’s contributions. Under this aspect, when the value of variable k was absent in one or both individuals, wijk = 0; or, if not, it was 1.

On the basis of the distance matrices generated, individuals were clustered by the Unweighted Pair Group Method with Arithmetic Mean—UPGMA, and the distance matrices with the 29 variables were compared with the distance matrices with fewer variables (leaves and fruits) using the Dendextend package in the R program (http://www.r-project.org).

Relative importance of traits

As a suggestion for discarding the less informative quantitative descriptors, it was applied a method based on the relative importance of the traits27, in which this relative importance was estimated by means of the participation of the components, concerning each of the traits, in the total dissimilarity observed. The analysis was conducted by using the Genes computer program7.

Pearson correlation

Pearson correlation was employed to verify the correlation among the quantitative descriptors, considering that the traits that can be discarded must be correlated to others selected. The correlation coefficient significance was examined by the t test. Statistical analyses were carried out with the help of the R Program28.

Analysis of genetic diversity by the ward-MLM method

After the analysis carried out by the Gower Index, the Ward-MLM (Ward-Modified Location Model) method proposed by Franco et al.12 was used, as reported by Viana and Resende29. Next, it was established the ideal number of clusters following the criteria of pseudo–F and pseudo T2 using the Ward clustering method30. Considering the optimal number of clusters, the hierarchical classification was obtained by the Ward method, which gives the parameters required to implement the final step of the MLM model31.

Differences among the clusters, the correlation between the variables and canonical variable (CV) were graphically analyzed. These analyses were made using the SAS statistical software32. Diagrams were obtained using the Sigma Plot software, version 11.0.

Ethical approval and informed consent

The research does not need ethical standards, because it has no Human and/or Animal participants. The article meets ethical standards. The research does not involve human participants and animal welfare.

Results and discussions

Comparative analysis and clustering using the UPGMA method

From the dissimilarity matrix generated by the quantitative and qualitative variables, it could be established the discrimination among the genotypes of the S1 inbred families of Psidium guajava. In the clustering analysis of the 61 genotypes, the UPGMA method enabled the formation of three distinct clusters in which all the 29 morphoagronomic descriptors were considered, indicating genetic diversity among the genotypes under evaluation. Cluster I comprised 42 genotypes; cluster II, 7 genotypes; and cluster III, 12 (Fig. 1).

Figure 1
Figure 1
Full size image

Entanglement among 61 genotypes of S1 inbred families of Psidium guajava, obtained by the Dendextend package, based on Gower Distance considering the morphoagronomic variables of fruit; leaf and stem; and only of leaf/stem. UENF, municipality of Campos dos Goytacazes, 2022.

The purpose of this clustering analysis is to gather genotypes into clusters so that there is homogeneity within the cluster and heterogeneity among them. Among the multivariate analysis methods to quantify genetic diversity, the UPGMA is highlighted for being a technique that uses unweighted means of dissimilarity measurements, which avoids characterizing dissimilarity by extreme values (minimum and maximum) among the genotypes considered7.

In the comparative analysis between dendrograms with all descriptors, and dendrograms with only leaf/stem and fruit descriptors, the need for different descriptors to characterize the genetic divergence of Psidium guajava is evident, since both the number of clusters and the arrangement among genotypes were not kept equal in the analyses. The entanglement value, which measures the matching of genotypes between distinct dendrograms, ranges from 0 to 1, in which 0 means fully matching dendrograms and 1, dendrograms without any matching. In this way, in the comparison of the dendrograms, it was seen the greatest entanglement was 0.28 in the dendrogram showing the leaf/stem descriptors, demonstrating divergence in the distribution of the genotypes in the dendrograms (Fig. 1).

Clusters I and II were formed by 51 and 10 genotypes, respectively. When compared with the dendrogram containing only fruit descriptors, it presented entanglement of 0.24 (Fig. 2). The formation of three clusters was also identified, with cluster I consisting of 43 individuals; cluster II, of 13; and cluster III, of 5 genotypes.

Figure 2
Figure 2
Full size image

Entanglement among 61 genotypes of S1 inbred families of Psidium guajava, obtained by the Dendextend package, according to Gower Distance considering the morphoagronomic variables of fruit; leaf and stem; and only of fruit. UENF, municipality of Campos dos Goytacazes, 2022.

In view of the results achieved by the comparative analysis between the dendrogram that includes all 29 descriptors with the dendrogram that only contains leaf/stem or fruit descriptors, it is necessary to use different descriptors to characterize the genetic divergence among the S1 inbred families of Psidium guajava.

Therefore, there is no possibility of indicating minimum descriptors for the culture given the results found. Similarly, by using the Dendextend package when comparing the dendrograms, with all the descriptors and the dendrogram containing only flower, leaf, or fruit descriptors, Santos et al.33 made it clear that different descriptors are required to characterize genetic diversity in Passiflora, because both the number of clusters and the arrangement between genotypes did not remain the same.

Relative importance of traits

Table 2 depicts the relative contribution of the traits evaluated for diversity (Sj) and their percentage values, which constitute the measure of the relative importance of variable j for the study of genetic diversity. Following the Singh method27, adopted to evaluate the relative contribution of the 14 quantitative traits, it was determined that three of these traits contributed to the genetic divergence with 98.80%, while the others contributed with only 1.20%. The traits with the highest relative contribution values and which contributed most to the genotype differentiation were fruit weight (91.95%); placental weight (3.68%); and fruit flesh mass (3.15%). As such, under this criterion, it can be confirmed that these descriptors are important in the characterization of the guava genotypes evaluated, since they contribute significantly (more than 1.0% of the total variation) to the divergence discrimination.

Table 2 Relative contribution of 14 quantitative morphoagronomic descriptors of Psidium guajava for genetic divergence by the Singh method. UENF, municipality of Campos dos Goytacazes, 2022.

As the fruit shape; leaf length/width ratio; and secondary leaf vein spacing traits did not provide relative contribution, they can be recommended for disposal. The other variables were poorly informative to evaluate the genetic dissimilarity of guava trees, considering that they had estimates of relative contribution of small magnitudes. In the case of some descriptors, however, low variability can be explained by the first generation of self-fertilization made, which provides an increase in homozygosity and a decrease in heterozygosity in the progeny, allowing, in this manner, the achievement of more homogeneous genotypes and a consequent allele fixation34.

Pearson correlation

Estimates of phenotypic correlation values, resulting from the evaluation of 61 guava genotypes, are depicted in Fig. 3. To explain the relationships between the traits of economic importance, correlation estimates should be considered satisfactory, that is, r >  ± 0.5035. The fruit weight was highly correlated with fruit flesh mass (0.54); placental weight (0.88); and fruit diameter (0.64). The fruit flesh mass was highly correlated with fruit length (0.87); fruit diameter (0.92); flesh thickness (0.67); and placental thickness (0.62).

Figure 3
Figure 3
Full size image

Linear correlation matrix (Pearson) among Psidium guajava traits obtained from 61 genotypes. UENF, municipality of Campos dos Goytacazes, 2022. Fruit number (FN); Fruit weight (FW); Fruit flesh mass (FFM); Fruit placental mass (FPM); Fruit length (FL); Fruit diameter (FD); Fruit shape index (FSI); Fruit flesh thickness (FFT); Fruit placental thickness (FPT); Stem diameter at 10 cm from the soil (SD); Leaf length (LF); Leaf width (LW); Leaf length/width ratio (LF/WR); Secondary leaf vein spacing (SLVS).

These correlations with fruit weight were expected, considering that genotypes with heavier fruits typically produce wider fruits with, consequently, larger flesh mass. Hence, these correlation estimates allow forecasting the response of a trait when performing the selection in another correlated one, in other words, it enables, for example, the selection in a trait of easy measurement to obtain gains in another of difficult measurement36.

Clustering analysis by means of the Ward-MLM strategy

In accordance with the Ward-MLM strategy, two clusters were formed following the criteria of pseudo-F and pseudo t2 (Fig. 4). The ideal number of clusters where there was the greatest increase in logarithmic function was verified, and the highest absolute value of 41.98 was confirmed. The indication of the number of clusters is an innovative aspect of the Ward-MLM procedure, in comparison with other hierarchical methods, which results in a more precise and less subjective cluster formation13,37.

Figure 4
Figure 4
Full size image

Log-Likelihood function, indicating the number of clusters formed by the Ward-MLM strategy. UENF, municipality of Campos dos Goytacazes, 2022.

The study conducted by Campos et al.2 with 138 guava genotypes obtained from biparental controlled crossings determined 8 the ideal number of clusters, with an increment value of 67.51. The authors noted a considerable degree of heterozygosity and wide genetic variability in these genotypes for being a segregating population. As such, it is worth mentioning that the results are different from the ones in this study, in which only the formation of two clusters was verified. Such fact seen among the genotypes of the S1 families is due to the reduction of genetic variability within the families and, consequently, to the increase in the frequencies of homozygosity in the first generation of self-fertilization.

Clusters I and II were formed by 33 and 28 genotypes, respectively. Genotypes were classified into the 49 phenotypic classes of the 15 qualitative descriptors evaluated, demonstrating a low genetic variability among the genotypes of the S1 inbred families of guava trees studied (Table 3). As for the qualitative traits, the descriptor about the branch altitude was (6.5%) erect and (65.5%) pending. The color trait of the stem was (39.2%) green; (42.6%) yellow green; and (18.1%) dark red.

Table 3 Phenotypic classes of plant descriptors; leaf and fruit; and number of genotypes of S1 inbred guava families per cluster formed by the Ward-MLM strategy.

There is, in the descriptors, a strong differentiation in the magnitude of the anthocyanin presence in leaves, with percentages of 16.4% and 83.6%, respectively. With respect to the intensity of anthocyanin staining in leaves, it can be seen that, in 52.5%, there was a weak shade, while in 47.5%, it was medium. In fully developed leaves, most genotypes had an obovate shape (57.4%), with the base shape (63.9%) and the tip (83.6%) rounded. In addition, 59% of genotypes had no curvature of the central vein in the leaves. Among those who did, 65.6% were classified as strong central vein curvature.

For the predominant leaf color, green gray was found in 47.5% of the genotypes, followed by green (26.2%); yellow green (23%); and dark green (3.3%).

In relation to fruit descriptors, the genotypes presented final truncated stalk shape (54.1%); rounded (26.2%); broadly rounded (18.1%); and neck-shaped (1.6%). In terms of skin texture, 93.4% of the genotypes showed a soft texture and 6.6% a rough one. On the other hand, 59% of the genotypes had pale yellow skin color; 23%, pale green-yellow; 14.7%, dark yellow; and 3.3%, orange. Pereira and Nachtigal38 pointed out that fruits of yellowish green or yellow color, when ripe, are more commercially appreciated.

For the flesh color descriptor, genotypes were classified into five categories, with a range from cream to orange pink, but with a predominance of dark pink, which was found in 37.7% of the genotypes. Pereira and Nachtigal38 say that pink or reddish flesh fruits are preferred not only for the industry but also for the consumption in natura in the Brazilian market.

In the calyx cavity diameter regarding the fruit, 63.9% of the genotypes were classified as medium (68.8%); thick (21.3%); thin (6.6%); and very thin (3.3%). These descriptors prove to be relevant when choosing genotypes for consumption in natura, given that they are related to factors that depreciate the product if they are not attractive to consumers.

With respect to the quantitative descriptors, cluster II had the highest number of fruits per plant (125), followed by cluster I (111) (Table 4). In both, however, there was great variation, as seen by the minimum and maximum values of each group.

Table 4 Minimum, maximum, and mean values of the descriptors for each of the two groups formed by the Ward-MLM strategy in 61 genotypes of inbred S1 families of guava trees.

There is still less potential for production when comparing with the Paluma cultivar, which produces an average of 188 fruits per plant. Nevertheless, as the evaluation was carried out in the first year of cultivation, this was already expected. As stated by Natale et al.39, guava trees bear fruits from the first year onwards and, over the course of the harvest, production gradually increases until it stabilizes. Even though the number of fruits produced by some genotypes does not come close to being found in some cultivars, the families in this study are in their first harvests with the possibility of an increase in production over the cycles.

Fruit weight trait also varied within the clusters, with cluster II having the highest value (329 g). There was high magnitudes of fruit weight, reflecting the first production, when the plants still have not expressed their full productive potential and thus produce larger fruits. In general, the trend is towards the fruit mass decreasing with the stabilization of commercial production, when the plant tends to yield a greater number of fruits. Gonzaga Neto et al.40 consider the increase in the average weight of the fruits to be related to the number of fruits produced by each plant; as such, the greater quantity of fruits in the plant may influence it to produce smaller fruits in weight and size because the available reserves would be used to fill a greater number of fruits, limiting the size of each one of them.

Cluster I, however, had an average fruit weight of 177.2 g. Hence, these values are in accordance with the standard recommended for consumption because, according to Lima et al.41 fruits ranging from 100 to 200 g can be utilized for dual purposes, in other words, they can be used both for industrial processing and for consumption in natura.

Cluster II also showed a higher value for the fruit flesh mass (242 g) and fruit placental mass (47.9 g), resulting in a greater fruit length (101 mm) and fruit diameter (82.48 mm). Flesh mass trait is of utmost relevance, once it contributes to the definition of the flesh yield. According to Gonzaga Neto et al.40, the average fruit mass is an important trait, given that, in general, fruits with the greatest flesh mass are also the largest and, in turn, they are more attractive to the consumer.

For fruit shape index, values were close among the clusters, in which the mean was of 1.14 e 1.09 for cluster 1 and 2 respectively. The analysis of these variables, separately, has little importance for the fruit characterization of guava tree genotypes41. On the other hand, the relationship among these variables is highly representative, since it indicates the fruit shape. Pyriform or oval fruits (LD/CD ratio greater than 1) are suitable for consumption in natura, and those with rounded forms (LD/CD ratio close to 1) are better indicated for industrialization42.

The clusters presented close means between them for placental thickness and flesh thickness of the fruit, with mean values of 39.35 and 12.96 for cluster 1, and 14.77 and 41.77 for cluster 2 respectively. For the stem diameter at 10 cm from the soil, cluster I reported a mean of 49.56 mm followed by cluster II, with 53.12 mm. For descriptors related to the leaf, clusters I and II were similar for length, width, length/width ratio, and vein spacing, which resulted in a lower mean length/width ratio of (1.89 and 1.94), presenting the characteristic of a rounder leaf. For being correlated with the leaf area of the plant, the leaf length and leaf width are significant. The leaf area is directly related to the taking advantage of solar energy, which is converted into chemical energy during the process of photosynthesis43.

The first two canonical variables (CV) obtained by the Ward-MLM strategy explained 100% of the total variation, being 83% of CV1 and 17% of CV2 (Fig. 5). As such, a two-dimensional representation is the most appropriate to represent the data set. The Ward-MLM procedure was applied to quantify genetic variability in studies on maize14; pepper/sweet pepper15; tomato13; common beans16; pepper/sweet pepper17; and banana18. These authors noticed the first two canonical variables explained the variability between the clusters above 80%, and the two-dimensional graph was adequate to visualize the relationship between the clusters. In this way, this high value shows the graphic representation of the first two canonical variables is adequate to verify the relationship among clusters and individuals within a cluster, enhancing the reliability of the results.

Figure 5
Figure 5
Full size image

Dispersion of the first two canonical variables (CV1 and CV2) representing the formation of two clusters by the Ward-MLM strategy, based on 14 quantitative and 15 qualitative descriptors in S1 inbred families of guava trees. UENF, municipality of Campos dos Goytacazes, 2022.

The quantitative descriptors that most contributed to quantify the genetic diversity among the genotypes, in other words, the ones that displayed the greatest correlations with the first canonical variable were fruit flesh mass (0.549); fruit weight (0.533); fruit diameter (0.515); fruit flesh thickness (0.462); fruit placental mass (0.455); and fruit length (0.295) (Table 5). In the study developed by Campos et al.2, with guava accesses, the authors found that the greatest correlations of the variables with the first canonical variable were flesh mass; fruit mass; fruit diameter; mean fruit weight; mesocarp thickness; and fresh placental mass.

Table 5 Canonical variables of 29 plant descriptors; fruit; leaf; and stem in 61 genotypes of S1 inbred guava families.

In view of the above, it is worth emphasizing that knowing the morphoagronomic traits evaluated herein is fundamental in plant breeding programs, especially to support future crosses and/or self-fertilization of the genetic breeding program of guava trees. By characterizing the genetic diversity of the S1 inbred families of guava tree, the identification of genotypes agronomically superior was made possible, providing better gains as a result of their selection. This study provided a more accurate basis for choosing the best genotypes that can be selected and self-fertilized, so as to achieve the most promising genotypes within the breeding program.

Final considerations

Comparative analysis between dendrograms revealed the use of all 29 descriptors led to greater efficiency in discriminating clusters, enabling genetic dissimilarity to be achieved more efficiently among the S1 inbred families of guava trees.

The Gower’s technique is an effective tool to detect genetic divergence and to cluster accesses using qualitative and quantitative variables simultaneously.

The traits with the greatest relative contribution to genetic variability in the families studied were fruit weight; placental weight; and fruit flesh mass.

The Ward-MLM statistical procedure is a useful tool in detecting genetic divergence and in clustering genotypes, and, by means of it, the descriptors that most contributed to the divergence among the guava genotypes were fruit flesh mass; fruit weight; fruit diameter; fruit flesh thickness; fruit placental mass; and fruit length.

The most promising genotypes for selection in the breeding program belong to group II, due to their good performance for all traits, especially fruit number, fruit weight, fruit pulp mass, and fruit diameter and length.

Therefore, the most divergent genotypes with high production potential can be recommended for further crosses or self-fertilization to acquire new lines in the guava breeding program at UENF.