Abstract
Within large-fruited germplasm, fruit size is influenced by flat and globe shapes. Whereas flat fruits are smaller and retain better marketability, globe fruits are larger and more prone to cuticle disorders. Commercial hybrids are often developed from crosses between flat and globe shaped parents because flat shape is thought to be dominant and fruit size intermediate. The objectives of this study were to determine the genetic basis of flat/globe fruit shape in large-fruited fresh-market tomato germplasm and to characterize its effects on several fruit traits. Twenty-three advanced single plant selections from the Fla. 8000 × Fla. 8111B cross were selectively genotyped using a genome-wide SNP array, and inclusive composite interval mapping identified a single locus on the upper arm of chromosome 12 associated with shape, which we termed globe. A 238-plant F2 population and 69 recombinant inbred lines for this region from the same parents delimited globe to approximately 392-kilobases. A germplasm survey representing materials from multiple breeding programs demonstrated that the locus explains the flat/globe shape broadly. A single base insertion in an exon of Solyc12g006860, a gene annotated as a brassinosteroid hydroxylase, segregated completely with shape in all populations tested. CRISPR/Cas9 knock out plants confirmed this gene as underlying the globe locus. In silico analysis of the mutant allele of GLOBE among 595 wild and domesticated accessions suggested that the allele arose very late in the domestication process. Fruit measurements in three genetic backgrounds evidenced that globe impacts fruit size and several fruit shape attributes, pedicel length/width, and susceptibility of fruit to weather check. The mutant allele of GLOBE appears mostly recessive for all traits except fruit size where it acts additively.
Similar content being viewed by others
Introduction
Tomato is a major vegetable crop worldwide, valued at more than $95 billion annually1. Fresh-market tomato has a production value of more than 1.2 billion dollars in the US, with the state of Florida accounting for more than 36% of that value2. Fruit shape and size are important traits in fresh market tomato, influencing consumer preference, packing demands, and market value3,4.
Tomato originated in the Andean region of South America5 and was domesticated in Central America, possibly in Mexico6. The size of cultivated tomato fruit has dramatically increased relative to wild species. Fruit of wild tomatoes are two-locular and weigh a few grams. In contrast, a modern tomato fruit may be multilocular and weigh up to 1 kg6,7. There is also tremendous variation in fruit shape: whereas fruit of wild tomatoes are simply round, fruit of cultivated tomatoes can be flat, round, fasciated, elongated, pear-shaped, plum, bell pepper-shaped, etc.6,7,8. It is thought that such variation emerged early in the domestication process by selecting alleles with larger fruits and variable shapes, and that these alleles accumulated over time, giving rise to the present-day tomatoes6,7.
As many as 30 QTLs likely explain the overall morphological variation in the fruit of tomato7. However, most of the variation is explained by mutations in four genes: SUN, OVATE, LC, and FAS9. While SUN and OVATE are involved in fruit elongation, LC and FAS regulate the number of locules in the fruit, which in turn impacts both fruit shape and size. On the other hand, mutations at two loci, FW2.2 and FW3.2, are implicated in the increase in fruit size to a great extent.
The short arm of chromosome 7 harbors sun, where a 24.7 kb insertion is linked to elongated fruit shape10,11. It was determined that this segment was copied from chromosome 10 by Rider, a Copia-like retrotransposon, present at both loci11. This rearrangement put SUN under the control of a defensin (DEFL1) promoter, leading to its increased expression in the fruit11,12. This mutation results in increased number of cells in the longitudinal area and reduced in the transversal area of the fruit12, thus not impacting fruit weight.
OVATE encodes a member of the Ovate Family Proteins (OFP), which are believed to be a class of regulatory genes important for plant development13. A SNP in the second exon of the OVATE gene leads to a premature stop codon13, resulting in fruit that can be pear-shaped, round, elongated, or ellipsoid—depending on the genetic background9,13,14. OVATE interacts with the sov1 and sov2 loci on chromosomes 10 and 11, respectively, the first influencing the degree of pear shape and the second fruit elongation14,15.
FAS and LC, located on chromosomes 2 and 11, respectively, regulate the number of locules in the fruit, which in turn impacts both fruit shape and size6,16,17,18,19. The fas mutation results from a 294-kb inversion that impairs the expression of SlCLV320. It is thought that FAS impacts meristem organization and boundary formation within the floral meristem, resulting in changes in the number of locules in the fruit16,21. On the other hand, SlWUS is the gene underlying the lc mutant, which controls the inflorescence and floral meristem formation phase ultimately affecting the final size of the fruit15,19. Two SNPs 1080 bp downstream of this gene are believed to cause the phenotype by disrupting the regulatory region of this gene, resulting in increased expression15,19. Potentially involved in the SlWUS-SlCLV3 network is ENO, a recently identified transcription factor of the AP2/ERF superfamily which may act by regulating SlWUS expression22. A natural 85-bp deletion in the promoter of ENO leads to decreased expression of this gene and increased locule number, and evidence suggests that this mutation occurred prior to tomato domestication and is now fixed in cultivated tomato22.
fw2.2 is responsible for as much as 30% of the difference in fruit size between modern and wild tomatoes23. fw2.2 increases fruit weight by causing the expansion of the placenta and columella of the fruit15. FW2.2 is a cell number regulator (CNR) located on the bottom of chromosome 2, and it is believed that a mutation in the promoter region increases fruit size by influencing the rate of cell division23,24.
FW3.2 is located on chromosome 3 and is as a member of the Cytochrome P450 gene family (formerly identified as SlKLUH) whose members are known to control organ and plant size25,26,27. The increase in fruit weight is the result of pericarp and septum expansion. SlKLUH likely effects these changes by influencing the duration of cell division, thereby increasing the number of cells15,27. Although, it was initially believed that a SNP in the promoter region of the SlKLUH gene was responsible for fw3.2, it was recently discovered that a ~50-Kbp tandem duplication at this locus resulted in two copies of SlKLUH, thus affecting fruit size through increased gene expression27,28. The expression of SlKLUH is exceptionally high in the growing seeds27, suggesting that a seed-produced compound may be involved in the increase in fruit mass27. Other important fruit size and shape QTLs are fs8.1, controlling fruit elongation, and fw11.3 which explained from 8 to 13% of fruit weight variation in populations between domesticated and wild tomato accessions29,30.
Whereas the above-mentioned loci account for major fruit shape and size variation between wild and cultivated germplasm and among some market classes of cultivated tomato, the genetics of shape and size variation within specific market classes is less well understood. A study evaluated the variation for six major fruit shape and size loci (sun, lc, ovate, fas, fw2.2, and fw3.2) and found that there was no variation among sampled fresh market large-fruited tomato inbred lines from the UF/IFAS and NCSU breeding programs31. Specifically, the set of breeding lines genotyped from both programs are fixed for the domesticated allele at FW2.2, FW3.2, FW11.3, and LC, and fixed for the wild allele at FAS, OVATE, and SUN31. Hence variation in shape/size within this market class is explained by another, as yet unknown, locus(loci).
In large-fruited, fresh market tomato, flat and globe fruit shapes are often used by breeders in the development of hybrid cultivars. Flat fruit are wide and ‘blocky’ and are usually smaller than globe fruit, which are deep and round (Fig. S1). Globe fruit shape is described in some cultivar releases as being associated with susceptibility to cuticle disorders, including radial cracking and weather check (cuticle cracking that occurs primarily on fruit shoulders under high moisture conditions, Fig. S1)32,33,34. Hybrids between flat-fruited and globe-fruited parents, however, are thought to be intermediate in fruit size, while retaining the fruit shape and marketability characteristics of the flat parent (Fig. S1). The genetics of flat and globe fruit shape are currently unknown, but identification of the underlying loci may be of particular use for breeders.
The source of globe fruit shape in the tomato breeding program of the University of Florida Institute of Food and Agricultural Sciences (UF/IFAS) is 74VF18, an inbred line from the University of California at Davis (J.W. Scott, personal communication). 74VF18 is likewise in the pedigree of Fla. 7060, the globe parent in the hybrid ‘Solar Set’ and the source of globe fruit shape in the North Carolina State University (NCSU) tomato breeding program (R.A. Gardner, personal communication). Previous observations suggest that flat/globe fruit shape is simply inherited (R.A. Gardner, personal communication). Moreover, globe-fruited germplasm from the UF/IFAS and the NCSU tomato breeding programs has been distributed to many public and private breeding programs over the years, and it is possible that the genetic control of this fruit shape characteristic is common among various tomato breeding programs.
The objectives of this research were: (1) to characterize the genetic basis of flat and globe fruit shapes in large-fruited fresh-market tomato, and to identify and fine-map the underlying locus(loci); (2) as genetic control is determined, to explore the extent to which this locus(loci) may explain fruit shape variation among a diverse panel of germplasm from the UF/IFAS and NCSU public breeding programs, and from several private breeding programs; (3) to identify the gene(s) underlying globe using CRISPR/Cas9 knock outs and to inquire its origin in the domestication process; and (4) to quantitatively characterize the locus(loci) for its effect on fruit size, fruit shape, fruit disorders, and other plant traits.
Results
Identification of globe, a major locus controlling flat and globe fruit shapes
Phenotyping of 83 F6 RILs identified 39 with flat and 44 with globe fruit shape, and a chi-square test for a 1:1 suggested that fruit shape was controlled by a single locus (X2 = 0.15, P-value = 0.7). From this group, 11 flat and 12 globe-shaped RILs were selected and individually genotyped with a 7720 SNP array, identifying 751 polymorphic loci. Inclusive composite interval mapping (ICIM) analysis with this information detected a single locus in the upper arm of chromosome 12 that was significantly associated with fruit shape (Fig. 1A).
A Inclusive composite interval mapping (ICIM) analysis based on 751 markers segregating among 12 globe and 11 flat F6 recombinant inbred lines from the Fla. 8000 × Fla. 8111B population. B ICIM analysis based on eight markers located within the target interval on chromosome 12 and tested on 238 F2 plants from the Fla. 8000 × Fla. 8111B population. Marker names are presented to the right of the chromosome figure, and centi-Morgan positions are provided to the left. The thick red band on the chromosome identifies the QTL position
A 238-plant F2 population, comprised of 176 flat and 62 globe individuals provided an acceptable fit to the phenotypic ratio 3:1, supporting a single locus (X2 = 0.07, P-value = 0.79) and indicating dominant gene action by the flat shape. ICIM analysis on this population, using eight new sequence-based markers spanning the chromosome 12 interval from 390,069 to 2,877,860 bp confirmed the presence of a major QTL explaining 99.58% of the phenotypic variation for fruit shape in this region (Fig. 1B; Table S1). This locus will hereafter be termed the globe locus.
Fine mapping of the globe locus
F3 and F4 segregating populations were screened with seven markers saturating the region between 936,308 and 2,049,214 bp, and 69 recombinant plants were identified. From these, RILs for fine mapping RILs (FM-RILs) were developed, phenotyped for fruit shape, and genotyped with the seven markers. Genotypes and phenotypes of the 69 FM-RILs are presented in Fig. 2. The results clearly delimited the globe locus to an approximately 392 Kbp region between 1,031,490 (marker M2.1) and 1,423,513 bp (marker 16EP133). Marker M3 demonstrated complete co-segregation with the phenotype throughout the entire FM-RIL population.
Results delimited the locus to an approximately 392 Kbp region (delimited by the vertical red dashed lines). The physical position of the markers is based on the Tomato Genome Build SL4.0. Each distinctive recombinant genotype is represented by a horizontal bar, and the number of RILs with this genotype is indicated to the left. Shading of the bars represents the two haplotypes of each recombinant group: dark gray, homozygous for the Fla. 8000 (flat) allele; white, homozygous for the Fla. 8111B (globe) allele; diagonal stripes, undetermined
Validation of the globe locus
Results were validated using a separate F2 population from the cross between Fla. 7946 (flat) and Fla. 8442 (globe). One hundred and three plants were phenotyped for fruit shape and genotyped with the M3 marker, among which, 81 were flat, and 22 were globe, fitting a 3:1 ratio and supporting a single dominant locus (X2 = 0.38, P-value = 0.54). All plants with globe fruit shape were homozygous for the globe allele at marker M3; and among the plants with flat fruit shape, 20 were homozygous for the flat allele, and 61 were heterozygous. These genotypes also fit a 1:2:1 ratio (X2 = 1.82, P-value = 0.40), indicating no distortion in Mendelian segregation.
To investigate the efficacy of this QTL across backgrounds, fruit shape phenotype and genotype at the globe locus were determined for three globe-shaped inbred lines (Fla. 8735, Fla. 7776, and Fla. 8022), for three flat near isogenic lines (NILs) that corresponded to each globe inbred, and for F1s resulting from the cross between each globe recurrent parent and its flat NIL. Each NIL was developed by a modified backcrossing approach, without the use of molecular markers, to introgress the flat fruit shape phenotype into the recurrent parent. Notably, whereas each of the globe parents were homozygous for the globe allele, all three of the flat NILs were homozygous for the flat allele, and each F1 was heterozygous and had flat fruit shape. These results further support the ability of the globe locus in explaining variation of flat and globe fruit shape in multiple backgrounds.
Distribution of globe among tomato breeding programs
To explore the extent to which the globe locus explains fruit shape variation across a broader set of germplasm, a panel of 204 inbred lines from multiple breeding programs were genotyped and surveyed for fruit shape. Of 176 large-fruited samples surveyed, 129 had flat fruit shape, and 47 had globe fruit shape (Table S4). Each of the inbred lines were genotyped with six markers located on chromosome 12 between 0.971 and 1.895 Mbp (Table S1). Results identified nine different marker haplotypes among 129 flat-fruited lines (Table S4). Haplotypes of the flat inbred lines differed for genotype at all markers except M3, where all inbreds had the flat allele. On the other hand, five different haplotypes were identified among globe inbred lines. As with the flat inbred lines, haplotypes of the globe lines differed for genotype at all markers except M3, where each showed the globe genotype. Thus, genotype at marker M3 consistently corresponded to fruit shape phenotype for all 176 of the large-fruited inbred lines that were surveyed. Five of the marker haplotypes observed among large-fruited inbred lines were also found in cherry, grape, and plum types. Each of the five grape/cherry inbred lines were homozygous for the flat allele at marker M3. However, genotype at this marker was not consistent among the 22 plum lines surveyed, where the flat and globe alleles were equally frequent.
To ascertain the extent to which large-fruited commercial hybrids involve crossings between flat and globe parents, 35 hybrids were surveyed with marker M3 (Table S5). From those hybrids, 17 were homozygous for the flat allele, and 18 were heterozygous. None of the hybrids surveyed were homozygous for the globe allele. These results suggest that globe-fruited inbred lines are commonly used as parents of commercial hybrids, but hybrids are being developed from crosses between two flat parents with equal frequency.
Candidate gene identification, confirmation of the GLOBE gene, and insights into its origins during the domestication process
Solyc12g006860 identified as the only candidate gene
Whole-genome sequencing alignments for 6 flat and 4 globe inbred lines were investigated for sequence polymorphisms occurring in the 392 Kbp fine-mapped interval at the globe locus. Comparative analysis identified nine polymorphisms between markers M2.1 and 16EP133: 4 SNPs, 3 simple sequence repeats (SSR), and 2 INDELs (Fig. 3). Eight of those polymorphisms were not consistently polymorphic between flat and globe lines and showed genotypic variation within either group. However, the indel corresponding to marker M3, which demonstrated complete co-segregation with fruit shape in all FM-RILs and consistently corresponded with fruit shape in the germplasm survey, was also consistently polymorphic between the flat and the globe lines that were sequenced.
Select polymorphisms that were used for marker development, and all polymorphisms within the 392 Kbp fine mapped interval (indicated by the box) for the globe locus are displayed. Physical positions (bp) are displayed along the left of the figure and correspond to version SL4.0 of the Heinz 1706 genome assembly. Loci corresponding to markers used for mapping and fine mapping are indicated by brackets. Location of polymorphisms relative to annotated genes are displayed to the right and are indicated by: P = within the promoter region, corresponding to approximately 1.0 Kbp upstream of the annotated gene; Int = within an intron of the annotated gene; Ex = within an exon of the annotated gene; Inter = located intergenically
Fifty annotated genes are located within the 392 Kbp fine mapped interval (ITAG4.1, Table S6). Among the polymorphisms identified in this region, four are intergenically located, two are located in an intron of genes Solyc12g006520 and Solyc12g006540, and two are located within the promoter region of genes Solyc12g006530 and Solyc12g006720. However, none of these consistently corresponded to fruit shape among the sequenced lines. Alternatively, the indel at position 1,306,260, which corresponds to marker M3, is located within the final exon of Solyc12g006860, a gene annotated as a brassinosteroid hydroxylase (Fig. 3). The coding sequences of the wild-type and mutant alleles of Solyc12g006860 were translated into amino acid sequences and aligned for comparison (Fig. S2). Whereas the predicted protein of the flat allele is 555 amino acids in length, the single base pair insertion in the globe allele causes a frameshift mutation, resulting in 16 amino acid substitutions followed by a premature stop codon which truncates the protein by eight amino acids.
Because no other mutations were found in the coding regions of any predicted genes in this area, and the indel at this gene is the only polymorphism demonstrating complete segregation with shape, these results collectively identified Solyc12g006860 as the most likely candidate gene underlying the flat/globe locus.
To confirm Solyc12g006860 as the GLOBE gene, CRISPR/Cas9 targeted mutagenesis was employed to knock out the gene in Fla. 8059, a flat-shaped line. A single mutation event was identified, containing two deletions within the gene: a 59 bp deletion at the beginning of the coding sequence, and a 3 bp deletion towards the end of the gene (Fig. 4A). Both mutations were present in the mutant plants measured. A T0 plant was selfed to generate homozygous mutants that remained Cas9 positive, and fruit shape and pedicel length were measured in the T1 generation. As expected, plants mutant for Solyc12g006860 had a higher fruit shape index and increased pedicel length compared to wild-type plants (Fig. 4B, C), resembling the phenotype of the globe-shaped lines (further discussed in the phenotypic characterization section). These results confirmed Solyc12g006860 as GLOBE and support that the single base pair insertion in the last exon of the natural mutant results in a gene knock out, thereby producing the globe phenotype.
A cDNA sequence of Solyc12g006860 showing the two guide RNAs (1, yellow shade; and 2, gray shade) and the two deletions caused in the gene (the first of 59 bp, the second of 3 bp). The inverted red triangle shows the location of the natural globe mutation (insertion of a thymine). B Phenotypic measurements of fruit shape index and pedicel length in Fla. 8059 (WT) and CRISPR/Cas9 knocked out plants (MUT). MUT and WT are significantly different with P < 0.001, error bars represent the standard error of the mean (n = 5 plants; each observation was derived from the average of at least 3 fruits). C Sliced fruit of each of the wild-type (upper) and mutant plants (lower). Scale bar is 2.5 cm
Insights into the origin of globe during domestication
The presence of the mutant allele of GLOBE in a broad set of germplasm was assessed in silico using publicly available data. The sequencing data provided genotypic information on a total of 595 unique accessions, including tomato wild relative species, semidomesticated cultivars (S. lycopersicum L. var. cerasiforme) and fully domesticated tomato varieties (S. lycopersicum L. var. lycopersicum). Among these accessions, only three cultivated accessions harbored the mutant allele: Fla. 7060, TS-103 (EA00389), and TS-306 (Supplementary file 2). Fla. 7060 is a globe-shaped breeding line from the UF/IFAS breeding program, which was homozygous for mutant allele as expected. TS-103 (EA00389) was heterozygous and TS-306 appeared homozygous (based on a single read at this position). No ancestral accessions contained the mutant allele, suggesting that the mutation arose very recently during the domestication process.
Phenotypic characterization
Fruit size
The impacts of flat and globe fruit shape were evaluated in three near isogenic backgrounds. The analysis indicated that there were significant differences in fruit weight between genotypes, backgrounds, and their interaction. The average fruit weight of globe genotypes was consistently greater than that of the flat genotypes in all backgrounds, and the heterozygote fruit weight was intermediate between both homozygotes for two of the backgrounds, indicating a possible additive effect for this trait (Fig. 5). The overall mean fruit weight was 143 (±2.60) for flat, 159 (±2.76) for heterozygous, and 183 (±2.60) g for globe genotypes, indicating an increase in weight of approximately 11% in heterozygous and 27% in globe genotypes relative to the flat genotype. Likewise, yields of extra-large fruit were generally highest for globe and heterozygous genotypes, and yields of large fruit were generally highest for flat and heterozygous genotypes (Table S7).
The bars represent the standard error of the mean. Within each background, different letters represent significantly different means (Tukey HSD test, alpha = 0.05, based on a 4-replicate RCBD, trialed three seasons [N = 12]). The response value of each experimental unit comes from: the average fruit weight from four or five harvests each season [yield/total fruit per plot] (weight), or average values of 24 fruits per experimental unit from second to fourth harvest (fruit shape attributes)
Marketability of the fruit
Marketable yield results showed high variation between the spring 2018 and both fall seasons. In fall seasons, whereas the heterozygous genotypes had higher yield compared to the homozygous flat genotypes (although not always significantly different), the homozygous globe genotype had the highest yield in two of the three backgrounds. On the other hand, in spring 2018, homozygous globe genotypes consistently had the lowest marketable yield (Table S7).
With respect to culls, fall 2017 results indicated no significant differences among genotypes except for the Fla. 7776 background, where the homozygous flat genotypes had the highest percentage of culls. In contrast, the homozygous globe genotype had the highest percentage of culls in all backgrounds in spring 2018 and fall 2018. These differences were significant in all backgrounds except for Fla. 8022 in spring 2018 (Table S7).
The two fruit disorders that contributed the most to culls were radial cracking and weather check. With respect to radial cracking, there were no significant differences among genotypes in any background in fall 2017. In spring and fall 2018, the homozygous globe genotype typically had the highest percentage of radial cracking, and the heterozygous genotype tended to have the smallest percentage or was not significantly different from the homozygous flat genotype (Table S7). With respect to weather check, the homozygous globe genotype in the Fla. 7776 background demonstrated the highest percentage of weather check across all seasons, having more than six times the culls as the other two genotypes. For the other two backgrounds, results were not consistent in the fall seasons, but in spring 2018, the homozygous globe genotype had the highest percentage of culls among backgrounds (albeit the difference was not significant in the Fla. 8022 background, Table S7).
The relationship between weather (rain and temperature) and percentage of fruit with weather check damage was assessed by collecting weather data for each season and plotting it against the percentage of culled fruit with weather check damage by genotype (Fig. S3). Fruit with weather check damage appeared a few days after a rain events were recorded; furthermore, the percentage of damaged fruit appeared to be correlated to the amount of rainfall. Notably, in spring 2018, rainfall was frequent, and the proportion of damaged fruit was the highest of all three seasons. Moreover, the homozygous globe genotypes were by far the most affected. On the other hand, in both fall seasons, where the rain was less frequent, the percentage of damaged fruit was lower, yet still highest for the globe genotypes. These results support the significance of rainfall in the susceptibility of fruit to weather check damage, especially in globe genotypes.
Fruit shape
All basic shape measurements (fruit height, width, area, and perimeter; data not shown for perimeter) were significantly affected by genotype at the globe locus, with the homozygous globe genotype consistently having the highest values, and the homozygous flat genotype having the lowest (Figs. 5 and S4). The heterozygous genotype was intermediate (though not always significantly different) to the homozygous genotypes. Additionally, the differences in fruit width among genotypes were relatively small compared to those observed for fruit height, indicating that changes in fruit weight are primarily due to differences along the longitudinal axis of the fruit.
The analysis of the fruit shape index (FSI) resulted in significant differences among genotypes and interaction between the background and the genotype (Fig. 5). Still, the homozygous globe genotype consistently had the highest FSI across all backgrounds. On the other hand, the FSI of the heterozygous genotype was not significantly different from the homozygous flat genotype in two of the three backgrounds (in the Fla. 7776 background it appeared intermediate).
The globe locus significantly affected fruit shape at the proximal (stem) end of fruit. Although the analysis suggested a significant interaction between genotype and background for all proximal end traits (shoulders, proximal end angles, and proximal indentation area, Figs. 5 and S4), the homozygous globe genotype always resulted in significantly smaller values, and the Fla. 8022 and Fla. 8735 backgrounds resulted in similar values for both homozygous flat and heterozygous genotypes. This is in agreement with the less prominent shoulders observed in globe fruits. On the other hand, in the Fla. 7776 background, the heterozygous genotype appeared significantly different and intermediate between both homozygotes for all traits except for shoulder height. Contrary to the proximal end, the locus did not appear to have a large impact in the distal end of the fruit, where only the Fla. 7776 background had significant and consistent differences among genotypes for distal angle (Fig. S4).
According to the blockiness descriptors (proximal, distal, and fruit shape triangle Figs. 5 and S4), the proximal width of the fruit appeared to be more affected compared to the distal width. Notably, the analysis of the proximal blockiness and fruit shape triangle resulted in a non-significant genotype by background interaction, with the homozygous globe genotypes consistently having the lowest ratios for both descriptors, and the heterozygous genotype values similar to those of the homozygous flat genotype. On the other hand, the distal blockiness was significantly affected by the genotype only in two of the backgrounds, and the differences between genotypes appeared smaller (Fig. S4). This further supports the lower impact of the locus on the distal end, compared to the proximal.
The greater impact of the locus on the proximal blockiness was also evidenced by the asymmetry descriptors. While no consistent differences were found for the vertical asymmetry, the horizontal asymmetry and the ovate descriptors resulted in significant differences among genotypes with no significant interaction with background (Fig. S4, data not shown for vertical asymmetry). The homozygous flat and heterozygous genotypes were consistently similar and significantly different from the homozygous globe genotype across all backgrounds, with globes being more symmetric and less top-heavy.
The difference in shape between the globe genotype and the flat and heterozygous genotypes was also demonstrated by the elliptic and circular fit functions which yielded very similar results. For these descriptors, the globe genotype had the highest R2 across backgrounds. The heterozygous genotype appeared comparable to the homozygous flat in all backgrounds except for the Fla. 7776, where it appeared intermediate between the homozygous genotypes (Fig. S4, data shown only for elliptic fit).
Effects on plant biomass and fruit pedicel
The plant biomass of the homozygous globe genotype was significantly lower in two of the three backgrounds (Fla. 7776 and Fla. 8735), compared to the heterozygous and homozygous flat genotypes. In those two backgrounds, the plant biomass was reduced by 12–16% in the globe genotypes compared to the flat. Although the mean of the globe genotype was the lowest in the Fla. 8022 background, the analysis showed no significant differences among genotypes in this background (Table S8).
Fruit pedicel size was significantly different among genotypes. For pedicel length, although the analysis suggested that the interaction between genotypes and backgrounds was significant, the pedicel was consistently longest in the homozygous globe genotype (~19 mm), and intermediate in the heterozygous (~10–12 mm) but closer in length to the homozygous flat genotype (~9–11 mm). Similarly, the homozygous globe genotype consistently had the thinnest pedicel among backgrounds (~3.3 mm). The pedicels of the homozygous flat and heterozygous genotypes had similar values in all backgrounds (~4–4.5 mm) except for the Fla. 7776 background, where the heterozygous was significantly different from both homozygous genotypes (Table S8).
Discussion
Previous studies have identified many major and minor QTLs controlling fruit size and/or shape. Such studies were mainly carried out by crossing domesticated tomato with wild accessions. Thus, the QTLs represent critical fruit shape/size loci involved in the transition from wild to modern tomato. In contrast, the present study focused on fruit shape and size variation existing within large-fruited cultivated tomato. Evidence suggests that there is no allelic variation at known major fruit shape and size loci (sun, lc, ovate, fas, fw2.2, and fw3.2) in fresh market large-fruited germplasm from the UF/IFAS breeding program31. In fact, the study by Blanca et al.31 included the parents of the mapping population (Fla. 8000 and Fla. 8111B) and found that all materials were fixed for the domesticated allele at FW2.2, FW3.2, FW11.3 and fixed for the wild-type allele at SUN, FAS, and OVATE, suggesting that other locus(loci) explain variation in fruit shape and size in these materials.
This study identified GLOBE as a major fruit shape gene on the upper arm of chromosome 12. This chromosome is not known to harbor any major effect shape or size genes. However, former studies detected fruit shape/size QTLs in this area of chromosome 12. For instance, one review summarized QTLs controlling fruit shape and size that were mapped during the 15 years prior to the study7. Seven QTLs affecting fruit shape or size in chromosome 12 were noted, two of which were mapped to the upper arm: fs12.a and fw12.1. In another study, two QTLs controlling FSI and distal end angle in the upper arm of chromosome 12 were observed in F2 and BC1 populations from a cross between the elongated fruit shape Howard German cultivar and S. pimpinellifolium35. Fruit shape QTLs in this region were also detected in an F2 population between the elongated-fruit tomato cultivar Sausage and S. pimpinellifulium36. While different fruit shape traits were found in each study, a common QTL may segregate in all studies. However, as demonstrated by the in-silico germplasm survey, the globe mutation likely arose late in domestication, and was not present in any of the ancestral accessions surveyed. Thus, although fs12.a and fw12.1 could relate to the globe locus, the QTLs detected in studies involving S. pimpinellifolium seem to be distinct.
In our study, the single-base thymine insertion towards the end of the coding sequence of Solyc12g006860 completely co-segregated with fruit shape in all populations tested and among all large-fruited inbred lines surveyed, placing this gene as the only likely candidate. CRISPR/Cas9 mutants confirmed that this gene harbors the globe mutation. The allele for flat fruit shape, which is identical to that of the Heinz 1706 reference genome and seemingly corresponds to the wild-type allele, translates into a 555-amino acid peptide. Both wild-type and mutant proteins have the same peptide sequence up to the 531th position. Beyond this point, the mutant allele differs in sequence for 16 consecutive amino acids before truncating prematurely at position 547 (Fig. S2). The CRISPR/Cas9 mutant created in this study had two deletions in Solyc12g006860: one of 59 bp at the beginning of the coding sequence and another of 3 bp towards the end. The mutations knocked out the gene and created the globe phenotype in a flat-shaped breeding line. This evidence confirmed that the globe mutation is a loss of function mutation.
Solyc12g006860 is annotated as brassinosteroid hydroxylase (ITAG4.1 gene annotation) and belongs to the Cytochrome P450 family of genes. Brassinosteroids are steroidal hormones distributed ubiquitously in plants and are involved in the regulation of important processes like cell development growth and multiplication37. Previously, the expression of Solyc12g006860 was characterized and was determined that it functions in brassinosteroid catabolism38. This suggests that altered regulation of brassinosteroids by the different alleles at GLOBE may influence cell elongation and division, leading to the change in fruit morphology.
Interestingly, FW3.2 (SlKLUH) is also a member of the Cytochrome P450 gene family. It is presently understood that increased expression of SlKLUH originated by the tandem duplication at the FW3.2 locus is what drives increased fruit mass28. In contrast to SlKLUH, Solyc12g006860 contains a mutation within the coding region resulting in a loss of function rather than in a change of gene expression.
Although the effect of SIKLUH on fruit size is observed in the pericarp of fruit, its expression is particularly high in developing seeds, and it is therefore speculated that SlKLUH regulates fruit weight through a seed-produced compound, similar to KLUH in Arabidopsis27,39. Interestingly, a transcriptome mapping study of tomato fruit development and ripening determined that expression of Solyc12g006860 is likewise highest in developing seeds40. Besides providing circumstantial evidence supporting Solyc12g006860 as the GLOBE gene, these findings suggest that SlKLUH and Solyc12g006860 may function similarly in the regulation of fruit size and shape. Further research is needed to determine the physiological basis of globe on fruit morphology.
The globe mutation provides a useful molecular resource for breeders and geneticists who may desire to distinguish flat and globe genotypes. The survey of commercial hybrids indicates that although globe parents are being used in some crosses, other hybrids are being developed by crossing two flat parents. Hybrid cultivar releases from UF/IFAS and NCSU alike remark on the advantage of intercrossing flat and globe parents to boost fruit size in the hybrid.
In fresh-market tomato, breeders have for many years inter-crossed flat and globe-shaped inbred lines to create commercial hybrids that, in theory, capitalize on the larger size of the globe parent and the fruit quality characteristics of the flat parent. Yet although this trait has been utilized for so many years, the genetics of flat and globe fruit shape was formerly unknown, and its actual impacts on fruit size and quality were not formerly characterized. Our results demonstrated that although globe has a relatively additive effect on fruit size, it acts in a recessive manner on many fruit shape attributes. However, this should be interpreted with care, as illustrated by the more additive gene action in the Fla. 7776 background for several fruit shape descriptors. Other genetic factors in this background may modify the effects of the locus on fruit shape. Results also indicate that although the globe fruit shape is more prone to imperfections that reduce marketability (like weather check, which is also dependent on the environment), heterozygosity for the QTL results in similar marketability rates as flat fruit shape. Thus, compared to the flat genotype, the heterozygote has increased size and similar shape and quality, with no penalty in yield. Furthermore, in light of the findings that a large number of current commercial cultivars do not involve crosses between flat and globe parents (Table S5), these data provide further evidence that fruit size could be increased in such hybrids through the introduction of the globe allele into one of the hybrid’s parents.
globe had a greater effect on fruit height than on fruit width, and this is likely the primary factor driving changes in fruit size. Furthermore, fruit shape attributes related to the proximal end of the fruit were those primarily affected by this locus. In comparison, fw3.2 increases fruit weight mainly by increasing cell number in pericarp and septum tissues27, and it has a minor effect on shape—primarily affecting the proximal end of the fruit41. On the other hand, there are three major QTLs associated with fruit elongation: sun, ovate, and fs8.1 (ref. 29,36). sun does not affect fruit size because the increase in height is compensated by a reduction in width12; fs8.1 causes an increase in fruit height with no reduction in width29; ovate results in different elongated shapes depending on the genetic background with a slight reduction in fruit weight13. Thus, globe has an apparently unique effect, as it impacts both fruit shape and size, and it seems to have a greater proximal end impact than fw3.2.
Besides the effect on fruit size, it was found that transgenic lines for FW3.2 were affected for plant height, leaf size, and seed number, implying a pleiotropic effect on development27. globe also has an effect on plant biomass and pedicel length, where homozygosity for the mutant allele results in a reduced plant biomass and longer and thinner fruit pedicels. Furthermore, differences in seedling height were observed between the globe breeding lines Fla. 7776, Fla. 8022 and Fla. 8735, and their respective flat NIL and F1, where seedlings homozygous for the globe allele were taller (7–34%) and appeared thinner and leggier at 27 days after sowing than either homozygous flat or heterozygous seedlings (24 < n < 77, data not shown). Thus, similar to FW3.2, GLOBE may also have pleiotropic effects on plant growth. More work is necessary to elucidate the effect of this gene on seedling and young plant growth, and cell size and cell number in the fruit.
Materials and methods
Plant materials for mapping and fine mapping
Several populations developed from the cross between the breeding lines Fla. 8000 and Fla. 8111B were used for mapping and fine mapping the locus controlling flat and globe fruit shape. Fla. 8000 and Fla. 8111B have flat and globe fruit shape, respectively. Eighty-three F6 RILs were grown in the field and phenotyped for fruit shape. From those, 23 RILs (12 with globe fruit shape, 11 with flat fruit shape) were sampled for initial genotyping using the Illumina Infinium SolCAP tomato SNP array (described below). A 238-plant F2 population from the same breeding lines was likewise grown in the field and phenotyped for fruit shape for confirmation of the mapping results from the RILs. Fine mapping was conducted in F4 and F5 populations from the same cross following a map-based approach.
Validation of the identified locus was conducted using additional populations. These included a 103-plant F2 population from the cross between Fla. 7946 (flat) and Fla. 8442 (globe), as well as three near isogenic backgrounds. Recurrent parents, NILs, and F1 progeny resulting from their cross were then genotyped with markers specific to the globe locus.
To determine the extent to which the globe locus explains fruit shape across a broader collection of cultivated germplasm, a survey was conducted including large-fruited inbred lines from the UF/IFAS and NCSU breeding programs, and inbred lines and hybrids from multiple private sector breeding programs.
Further details about the development of the populations used for mapping and germplasm surveyed are provided in supplementary methods.
DNA extraction
DNA was extracted using either a modified Cetyl Trimethylammonium Bromide (CTAB) method42 or a NaOH rapid DNA extraction method43. Samples were stored at 4 °C until PCR was performed. For either extraction method, tissue was collected from young leaves using a 3 mm diameter hole punch (2 punches for CTAB extraction, 1 punch for NaOH extraction).
Whole-genome re-sequencing data, SNP detection, and gene annotations
Whole-genome resequencing data previously described44 was used to identify polymorphic loci for confirmation and fine mapping of the globe locus. In addition to SNPs, insertions/deletions (indels) were predicted from aligned read data using FreeBayes (version 0.9.16-1-gf46d24f) 45. Data were visualized using JBrowse 1.11.5 (ref. 46). Loci that were polymorphic between Fla. 8000 and Fla. 8111B were considered for marker development (described below). Haplotypes at additional polymorphic loci within the fine mapped interval were compared among other breeding lines for further sequence characterization. Sequences were compared for six flat-fruited breeding lines, Fla. 8000, Fla. 8624, Fla. 8820, Fla. 8916, Fla. 8924, and Fla. 8932; and for four globe-fruited breeding lines, Fla. 8111B, Fla. 8297, Fla. 8872B, and Fla. 8925. Gene annotations were based on version 4.1 of the International Tomato Annotation Group (ITAG4.1). Genome physical locations were based on the version SL4.0 of the Tomato Genome Assembly and the sequences were accessed through the Sol Genomics Network database (https://solgenomics.net/)47. The wild-type and mutant sequences of the candidate gene were translated into peptide sequences using the ExPASy Translate Tool (https://web.expasy.org/translate/).
Marker development and testing
Selective genotyping for initial mapping was performed using the Illumina Infinium SolCAP tomato SNP array (Illumina Inc., San Diego, CA), which encompasses 7720 Single Nucleotide Polymorphism (SNP) sites distributed across the genome48. DNA samples were extracted using the CTAB method described above, and SNP chip processing was performed by the Michigan State University Potato Breeding and Genetics SNP genotyping facility. An Illumina iScan system was used for SNP chip processing, and SNP analysis was done using GenomeStudio 2.0 software (Illumina Inc., San Diego, CA).
Two hundred and thirty-eight F2 plants from the Fla. 8000 × Fla. 8111B cross were used for confirmation of the mapped flat/globe locus. Kompetitive allele-specific PCR (KASP; LGC Genomics, Beverly, MA), Cleaved Amplified Polymorphic Sequences (CAPS), derived Cleaved Amplified Polymorphic Sequences (dCAPS) markers, and an indel-based marker, were designed from polymorphisms spanning the mapped globe locus and used to genotype this population. Subsequent mapping and fine mapping relied upon High resolution Melting Analysis (HRM) and additional dCAPS markers. Marker protocols are described further in supplementary methods.
Genetic data analysis
Monogenic inheritance of fruit shape was tested in the Fla. 8000 × Fla. 8111B F6 RIL population using a chi-square test for goodness of fit to 1:1 ratio of flat:globe phenotypes. Genotype data generated by the SolCAP SNP chip was used for initial mapping of the flat/globe fruit shape locus. Data were filtered, and SNPs identified as polymorphic between Fla. 8000 and Fla. 8111B and segregating among the 23 RILs were utilized for genetic map construction with JoinMap® 4.1 using the Kosambi mapping function49,50. RIL phenotypes, along with SNP genotypes and calculated genetic positions were then used to perform inclusive composite interval mapping analysis with QTL IciMapping 4.1 (refs. 51,52). The ICIM-ADD mapping method was used with a scanning step of 1 cM and a probability in stepwise regression of 0.001 as implemented in the software.
To confirm monogenic inheritance of fruit shape in the Fla. 8000 × Fla. 8111B F2 population, the frequency of flat:globe phenotypes was tested for goodness of fit to a 3:1 ratio using a chi-square. Phenotypes of these F2 individuals along with their genotypes at eight markers were used to construct a genetic map and perform inclusive composite interval mapping as described above.
Generation of CRISPR/Cas9 knock outs and measurements
Plant materials and growth conditions
Fla. 8059 seeds were sown directly in SunGro Germination mix soil in 72 count 6 pack trays and grown in a growth chamber under 16-h light/8-h dark photoperiod for 4 weeks under lithonia ibz 632 wd with ProLume eco-shield F32T8-850-ECO bulbs (32 watt/ 3,050 lumens full spectrum daylight/5000 K). Four weeks-old seedlings were transplanted into 6.06 L pots and were cultivated in an air-conditioned greenhouse under 16-h light/8-h dark photoperiod with daytime temperatures 75–80F (8am–6pm) and nighttime temperatures 60/65F (9pm–6am) in Athens, GA, USA.
Guide RNA design, CRISPR/Cas9 construct, and plant transformation
The gRNAs targeting Solyc12g006860 were designed using the CRISPR-P tool (http://cbi.hzau.edu.cn/cgi-bin/CRISPR)53. The gRNAs were selected to have no fewer than four mismatches within the coding region of potential off target genes. The CRISPR/Cas9 constructs were assembled using the NEBuilder Assembly of CRISPR vectors using ssDNA oligos as previously described54. Electroporation was used to introduce the final binary vectors into Agrobacterium tumefaciens strain LBA4404, which was kindly provided by Dr. Joyce Van Eck, Cornell University. The LBA4404 harboring the binary vector was used for the transformations of Fla. 8059. The genetic transformations were performed as described55. The gRNAs were cloned and transgenic plants were genotyped using primers listed in Table S2. A single T0 plant (with two gRNA targeting the globe gene, both heterozygous) was selfed. Five T1 homozygous mutant plants were selected and used for measurements along with five WT Fla. 8059.
Fruit shape and pedicel length measurements
Ripe fruits were halved and scanned at 200dpi using an HP Scanjet G4050. These images were analyzed using Tomato Analyzer to determine shape index (Length/width)56. Pedicel length was measured using calipers. At least three (and up to eleven) measurements were performed for each of five WT plants and five mutant plants. The average of the shape index and pedicel length per plant was determined and used to perform a t-test to look into statistically significant differences.
In silico survey of germplasm
The distribution of the globe allele across a broad collection of wild and domesticated tomato germplasm was determined in silico using BLAST search57 against publicly available tomato sequencing projects58,59,60. Data was retrieved from the Sequencing Read Archive (https://www.ncbi.nlm.nih.gov/sra). A 61-nucleotide sequence containing the single base pair insertion in the globe allele (consisting of 30 bp upstream and downstream of the insertion) was used as a query. Blastn program was chosen, leaving all other parameters to default values.
Plant materials for phenotypic characterization
The three flat NILs described previously along with the globe recurrent parents and corresponding backcross F1s were used for the characterization trials. The trials were conducted during three seasons (fall 2017, spring 2018, and fall 2018) at the UF/IFAS Gulf Coast Research and Education Center in Balm, FL. The plants were established in the field in 12-plant plots arranged in a randomized complete block design with four replicates. Eight plants were harvested from each plot. Due to pollination problems, the backcross F1 for the Fla. 8022 background was not evaluated in spring 2018.
Fruit size, shape and marketability measurements
Fruit weight was assessed from four to five weekly harvests of vine-ripened fruit in each of the three trials. The fruit from each experimental unit were graded for marketability and size according to the US Standards for Grades of Fresh Tomatoes (USDA-Agricultural Marketing Service). The fruit weight was obtained by averaging the total weight of marketable fruit over the total number of fruits per plot. Culled fruit included, but were not limited to, fruit with radial cracking, weather check, zippering, zippering with windows, and rough scars. Radial cracking and weather check are illustrated in Fig. S1. Culled fruits were grouped according to the defect, and fruits in each group were counted and weighed. Marketable fruits were sized into small (S), medium (M), large (L), and extra-large (XL) categories, and fruits of each category were counted and weighed. The average weight per plant for each marketable size category was used for statistical analysis. For the cull categories, the percentage of total harvest was used for analysis.
For fruit shape, 24 mature fruit per plot, from the second to fourth harvest, were randomly selected and cut in half longitudinally. One half of each fruit was scanned into a 200-dpi (dots per square inch) image. Several fruit shape attributes were measured from images with the Tomato Analyzer 2.2 software56. The default settings of the software were used, and boundaries and end points were corrected as needed. The attributes assessed in this study are listed in Table S3. The average value of each attribute per plot was used for statistical analysis.
Further details on data collection for plant biomass and fruit pedicel are described in the supplementary methods.
Statistical analysis
A linear mixed effect analysis was performed to assess the association of the genotype at the globe locus with each of the traits measured. The model was defined as:
where \(L_p\), \(b_{pq}(L)\), and εpqij represent the variance due to season, block (nested within season), and the residual, respectively, and conform the design structure effects, hence were set as random. On the other hand, μij is the mean structure for the fixed effects given by:
where μ is the overall mean, αi is the main effect for the ith genotype at the fruit shape locus (globe line G/G, flat NIL F/F, F1 F/G), βj is the main effect for the jth genetic background (Fla. 7776, Fla. 8022, Fla. 8735), and αβij is the interaction between genotype and background. The design structure effects have nothing to do with the mean structure and interactions between season and treatments were not included for this reason61.
Since it is known that, in tomato, fruit yield and marketability usually show a high variation across seasons, and fruit disorders like weather check are known to be highly dependent on weather, the analysis of the marketability traits was made individually for each season, hence the Lp, and bpq(L) terms in the design structure of the model were dropped and bq, representing the block effect, was added instead.
The R programing language (R Core Team 2017) was used to perform the analysis62. The lmer function of the lme4 package63 was used to fit the model. The model assumptions were checked by visually inspecting the QQ plot, histogram, and distribution of the standardized residuals. Weather check data in both fall seasons was log-transformed in order to meet the model assumptions.
The model results were later entered into the lsmeans function to calculate the fixed-effects estimates and p-values of the pairwise differences adjusting with the Tukey HSD method (P ≤ 0.05). The significance letters were generated with the cld function. Both functions belong to the lsmeans package64. The pairwise differences were calculated within and among backgrounds.
Data availability
Supporting data for this research may be requested from the corresponding author.
References
FAO. FAOSTAT. Value of Agricultural Production. Last update June 13, 2018 (2018). Accessed Oct 8. http://www.fao.org/faostat/en/#data/QV.
USDA-NASS. Vegetables 2016 summary (2017).
Causse, M. et al. Consumer preferences for fresh tomato at the European scale: a common segmentation on taste and firmness. J. Food Sci. 75, S531–S541 (2010).
Simone, A. H., Bridget, B. K. & Marshal, M. M. Consumers prefer low-priced and high-lycopene-content fresh-market tomatoes. Hort. Technology 16, 674–681 (2006).
Paran, I. & van der Knaap, E. Genetic and molecular regulation of fruit and plant domestication traits in tomato and pepper. J. Exp. Bot. 58, 3841–3852 (2007).
Tanksley, S. D. The genetic, developmental, and molecular basis of fruit size and shape variation in tomato. Plant Cell 16, S181–S189 (2004).
Grandillo, S., Ku, H. M. & Tanksley, S. D. Identifying the loci responsible for natural variation in fruit size and shape in tomato. Theor. Appl. Genet. 99, 978–987 (1999).
van der Knaap, E., Lippman, Z. B. & Tanskley, S. D. Extremely elongated tomato fruit controlled by four quantitative trait loci with epistatic interactions. Theor. Appl. Genet. 104, 241–247 (2002).
Rodriguez, G. R. et al. Distribution of SUN, OVATE, LC, and FAS in the tomato germplasm and the relationship to fruit shape diversity. Plant Physiol. 156, 275–285 (2011).
van der Knaap, E., Sanyal, A., Jackson, S. A. & Tanksley, S. D. High-resolution fine mapping and fluorescence in situ hybridization analysis of sun, a locus controlling tomato fruit shape, reveals a region of the tomato genome prone to DNA rearrangements. Genetics 168, 2127–2140 (2004).
Xiao, H., Jiang, N., Schaffner, E., Stockinger, E. J. & van der Knaap, E. A retrotransposon-mediated gene duplication underlies morphological variation of tomato fruit. Science 319, 1527–1530 (2008).
Wu, S., Xiao, H., Cabrera, A., Meulia, T. & van der Knaap, E. SUN regulates vegetative and reproductive organ shape y changing cell division patterns. Plant Physiol. 127, 1175–1186 (2011).
Liu, J., Eck, J. V., Cong, B. & Tanksley, S. D. A new class of regulatory genes underlying the cause of pear-shaped tomato fruit. Proc. Natl Acad. Sci. USA 99, 13302–13306 (2002).
Rodriguez, G. R., Kim, H. J. & van der Knaap, E. Mapping of two suppressors of OVATE (sov) loci in tomato. Heredity 111, 256–264 (2013).
van der Knaap, E. et al. What lies beyond the eye: the molecular mechanisms regulating tomato fruit weight and shape. Front. Plant Sci. 5, 227 (2014).
Lippman, Z. & Tanksley, S. D. Dissecting the genetic pathway to extreme fruit size in tomato using a cross between the small fruited wild species Lycopersicon pimpinellifolium and L. esculentum var. Giant Heirloom. Genetics 158, 413–422 (2001).
Barrero, L. S. & Tanksley, S. D. Evaluating the genetic basis of multiple-locule fruit in a broad cross section of tomato cultivars. Theor. Appl. Genet. 109, 669–679 (2004).
Barrero, L. S., Cong, B., Wu, F. & Tanksley, S. D. Developmental characterization of the fasciated locus and mapping of Arabidopsis candidate genes involved in the control of floral meristem size and carpel number in tomato. Genome 49, 991–1006 (2006).
Muños, S. et al. Increase in tomato locule number is controlled by two single-nucleotide polymorphisms located near WUSCHEL. Plant Physiol. 156, 2244–2254 (2011).
Xu, C. et al. A cascade of arabinosyltransferases controls shoot meristem size in tomato. Nat. Genet. 47, 784–792 (2015).
Cong, B., Barrero, L. S. & Tanksley, S. D. Regulatory change in YABBY-like transcription factor led to evolution of extreme size during tomato domestication. Nat. Genet. 40, 800–804 (2008).
Yuste-Lisbona, F. J. et al. ENO regulates tomato fruit size through the floral meristem development network. PNAS 117, 8187–8195 (2020).
Frary, Anne et al. fw2.2: a quantitative trait locus key to the evolution of tomato fruit size. Science 289, 58–88 (2000).
Cong, B., Lio, J. & Tanskley, S. D. Natural alleles at a tomato fruit size quantitative trait locus differ by heterochromatic regulatory mutations. PNAS 99, 13606–13611 (2002).
Ito, T. & Meyerowitz, E. Overexpression of a gene encoding a cytochrome P450, CYP78A9, induces large and seedless fruit in Arabidopsis. Plant Cell 12, 1541–1550 (2000).
Fang, W., Wang, Z., Cui, R., Li, J. & Li, Y. Maternal control of seed size by EOD3/CYP78A6 in Arabidopsis thaliana. Plant J. 70, 929–939 (2012).
Chakrabarti, M. et al. A cytochrome P450 regulates a domestication trait in cultivated tomato. PNAS 110, 17125–17130 (2013).
Alonge, M. et al. Major impacts of widespread structural variation on gene expression and crop improvement in tomato. Cell 182, 145–161.e23 (2020).
Sun, L. et al. Candidate gene selection and detailed morphological evaluations of fs8.1, a quantitative trait locus controlling tomato fruit shape. J. Exp. Bot. 66, 6471–6482 (2015).
Mu, Q. et al. Fruit weight is controlled by Cell Size Regulator encoding a novel protein that is expressed in maturing tomato fruits. PLOS Genet. 13, e1006930 (2017).
Blanca, J. et al. Genomic variation in tomato, from wild ancestors to contemporary breeding accessions. BMC Genom. 16, 257 (2015).
Gardner, R. G. ‘Carolina Gold’, a hybrid tomato, and its parents, NC 1Y and NC2Y. Hort. Science 35, 966–967 (2000).
Scott, J. W. et al. ‘Solar Fire’ hybrid tomato: Fla. 7776 tomato breeding line. Hort. Science 41, 1504–1505 (2006).
Panthee, D. R. & Gardner, R. G. ‘Mountain Majesty’: a tomato spotted wilt virus-resistant fresh-marker hybrid tomato and its parents NC 714 and NC 1CS. Hort.Science 46, 1321–1323 (2011).
Brewer, M. T., Moyseenko, J. B., Monforte, A. J. & van der Knaap, E. Morphological variation in tomato: a comprehensive study of quantitative trait loci controlling fruit shape and development. J. Exp. Bot. 58, 1339–1349 (2007).
Gonzalo, M. K. & van der Knaap, E. A comparative analysis into the genetic bases of morphology in tomato varieties exhibiting elongated fruit shape. Theor. Appl. Genet. 116, 647–656 (2008).
Bishop, G. & Yokota, T. Plant steroid hormones, brassinosteroids: current highlights of molecular aspects on their synthesis/metabolism, transport, perception and response. Plant Cell Physiol. 42, 114–120 (2001).
Ohnishi, T. et al. Tomato cytochrome P450 CYP734A7 functions in brassinosteroid catabolism. Phytochemestry 67, 189–1906 (2006).
Anastasiou, E. et al. Control of plant organ size by KLUH/CYP78A5-dependent intercellular signaling. Dev. Cell 13, 846–856 (2007).
Shinozaki, Y. et al. High-resolution spatiotemporal transcriptome mapping of tomato fruit development and ripening. Nat. Commun. 9, 364 (2018).
Zhang, N., Brewer, M. & van der Knaap, E. Fine mapping of fw3.2 controlling fruit weight in tomato. Theor. Appl. Genet. 125, 273–284 (2012).
Fulton, M., Chunwongse, J. & Tanksley, S. Microprep protocol for extraction of DNA from tomato and other herbaceous plants. Plant Mol. Biol. Report. 13, 207–209 (1995).
Lee et al. A high-throughput genotyping system combining rapid DNA extraction and high-resolution melting analysis in allo-octoploid strawberry. Acta Horticulturae 1156, 89–94 (2016).
Lee, T. G., Shekasteband, R., Menda, N., Mueller, L. A. & Hutton, S. F. Molecular markers to select for the j-2-mediated jointless pedicel in tomato. Hort. Science 53, 153–158 (2018).
Garrison, E. & Marth, G. Haplotype-based variant detection from short-read sequencing. arXiv:1207.3907 (2012).
Buels, R. et al. JBrowse: a dynamic web platform for genome visualization and analysis. Genome Biol. 17, 66 (2016).
Fernandez-Pozo, N. et al. The Sol Genomics Network (SGN)—from genotype to phenotype to breeding. Nucleic Acids Res. 43, D1036–D1041 (2015).
Sim, S. C. et al. Development of a large SNP genotyping array and generation of high-density genetic maps in tomato. PLoS ONE 7, e40563 (2012).
Kosambi, D. D. The estimation of map distances from recombination values. Ann. Eugen. 12, 172–175 (1944).
van Ooijen, J. W. JoinMap ® 4, software for the calculation of genetic linkage maps in experimental populations. Kyazma BV, Wageningen, Netherlands (2006).
Li, H., Ye, G. & Wang, J. A modified algorithm for the improvement of composite interval mapping. Genetics 175, 361–374 (2007).
Wang, J., Li, H., Zhang, L. & Meng, L. Users’ Manual of QTL IciMapping. The Quantitative Genetics Group, Institute of Crop Science, Chinese Academy of Agricultural Sciences (CAAS), Beijing 100081, China, and Genetic Resources Program, International Maize and Wheat Improvement Center (CIMMYT), Apdo. Postal 6-641, 06600 Mexico, D.F., Mexico (2016).
Lei, Y. et al. CRISPR-P: a web tool for synthetic single-guide RNA design of CRISPR-system in plants. Mol. Plant 7, 1494–1496 (2014).
Jacobs, T. B., LaFayette, P. R., Schmitz, R. J. & Parrot, W. A. Targeted genome modifications in soybean with CRISPR/Cas9. BMC Biotechnol. 15, 16 (2015).
Gupta, S. & Van Eck, J. Modification of plant regeneration medium decreases the time for recovery of Solanum lycopersicum cultivar M82 stable transgenic lines. Plant Cell Tissue Organ Cult. 127, 417–423 (2016).
Brewer, M. T. et al. Development of a controlled vocabulary and software application to analyze fruit shape variation in tomato and other plant species. Plant Physiol. 141, 15–25 (2006).
Altschul, S. F. et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25, 3389–3402 (1997).
Aflitos, S. et al. Exploring genetic variation in the tomato (Solanum section Lycopersicon) clade by whole‐genome sequencing. Plant J. 80, 136–148 (2014).
Lin, T. et al. Genomic analyses provide insights into the history of tomato breeding. Nat. Genet 46, 1220–1226 (2014).
Razifad, H. et al. Genomic evidence for complex domestication history of the cultivated tomato in Latin America. Mol. Biol. Evol. 37, 1118–1132 (2020).
Blouin, D. C., Webster, E. P. & Bond, J. A. On the analysis of combined experiments. Weed Technol. 25, 165–169 (2011).
R Core Team. R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org/ (2017).
Bates, D., Maechler, M., Bolker, B. & Walker, S. Fitting linear mixed-effects models using lme4. J. Stat. Softw. 67, 1–48 (2015).
Lenth, R. V. Least-squares means: the R package lsmeans. J. Stat. Softw. 69, 1–33 (2016).
Acknowledgements
This work was supported by funding from USDA NIFA AFRI 2016-05951 and by a scholarship from the Fulbright García-Robles program. The authors thank Dr. Randy Gardner with North Carolina State University; Mark Barineau with Lipman Family Farms; David Wolff with Sakata Seed Company; Eka Sari and Jeremy Sisson with Enza Zaden; and Susan Inman, Jocepascual Martinez and Marcio Pais de Arruda with HM.CLAUSE for their assistance with the germplasm survey.
Author information
Authors and Affiliations
Contributions
S.F.H. and E.S.O. conceived the project and designed the experiments. E.S.O., R.S., E.I.B., A.S. and S.F.H. collected data. E.V.D.K. and A.S. designed the CRISPR/Cas9 guides and performed the CRISPR/Cas9 knockouts. E.S.O. analyzed data and wrote the manuscript with input and contributions on data interpretation from S.F.H., E.V.D.K. and T.G.L. All authors read, edited, and approved the manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no competing interests.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Sierra-Orozco, E., Shekasteband, R., Illa-Berenguer, E. et al. Identification and characterization of GLOBE, a major gene controlling fruit shape and impacting fruit size and marketability in tomato. Hortic Res 8, 138 (2021). https://doi.org/10.1038/s41438-021-00574-3
Received:
Revised:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41438-021-00574-3
This article is cited by
-
Comprehensive analysis of transcriptome and metabolome identified the key gene networks regulating fruit length in melon
BMC Plant Biology (2025)
-
Genome-wide meta-QTL approach identifies consensus loci associated with agriculturally important tomato fruit traits
Theoretical and Applied Genetics (2025)
-
Modulating the fruit morphology of traditional melon varieties: insights into IL × genetic background interactions
Euphytica (2025)
-
Identification of CaPCR1, an OFP gene likely involved in pointed versus concave fruit tip regulation in pepper (Capsicum annuum L.) using recombinant inbred lines
Theoretical and Applied Genetics (2024)
-
Genetic architecture of fresh-market tomato yield
BMC Plant Biology (2023)







