Abstract
Wheat, a major staple crop, contributes significantly to global protein and calorie intake. However, the increasing challenges posed by climate change and a growing population threaten its stable production. Pre-harvest sprouting (PHS), triggered by prolonged rainfall and humidity before harvest, significantly reduces wheat grain yield and quality. This study is the first to assess PHS tolerance (PHST) in a global collection of 116 T. sphaerococcum accessions, which were characterized at three different locations. A 35 K Axiom single nucleotide polymorphism (SNP) array was used to genotype these accessions, and 15,308 high-quality SNPs were retained for genome-wide association studies (GWAS) employing two single-locus GWAS (SL-GWAS) and four multi-locus GWAS (ML-GWAS) models. Twelve marker-trait associations (MTAs) controlling PHST were identified using the SL-GWAS and ML-GWAS models (p < 0.001). Among these, five MTAs (AX-94415302, AX-94919611, AX-94403953, AX-95220897, and AX-94756068) were consistently found across all the tested environments. In silico analysis revealed that these SNPs were located within candidate genes (CGs) containing domains such as LRR, NAC, serine/threonine kinase, F-box, WRKY, SANT/Myb, cytochrome P450, homeobox-like, and WD40, which are involved in regulating seed germination, dormancy, and abiotic stress tolerance. Furthermore, haplotype analysis identified a variable number of haplotypes across 10 MTAs. Notably, three haplotypes, namely H005, H006, and H007, were present in the PHS-tolerant accessions TS28, TS64 and TS81, respectively, representing favourable allelic combinations for PHST. These findings provide valuable genetic resources and potential targets for breeding strategies to enhance PHST in wheat.
Introduction
Wheat, a primary cereal crop cultivated globally across diverse agro-climatic conditions, provides about 20% of human protein and calorie needs. The foremost challenges to the world’s food security stem from the ever-changing climate and the rapidly growing human population. To meet the projected demand of 140 million tonnes of wheat by 2050, breeders must develop highly productive wheat varieties resilient to both abiotic and biotic stresses. PHS is one of the major abiotic stresses that adversely affects the yield and quality of wheat grains, thus lowering farmers’ income. Worldwide losses caused by PHS are estimated at about US$ one billion per year1. In China alone, over 2.5 million tons of wheat are affected by PHS each year, causing a significant reduction in grain quality and market value, and in India certain regions experience sudden, unpredictable spells of high humidity and rainfall during the harvest period2. PHS in wheat is characterized by germination of seeds on the spike of the mother plant before harvest due to prolonged rainfall and high humidity3,4,5; it is a significant abiotic constraint that impedes wheat productivity at physiological maturity6,7. The yield reduction in germinated seeds is attributed to the hydrolysis of reserves stored in the endosperm, such as starch granules and protein, which decreases both the thousand-grain weight and bulk weight. Additionally, the increased α-amylase activity in germinated seeds lowers grain quality by reducing starch and protein content. This deterioration in quality not only affects the overall yield but also hampers seedling quality by diminishing seed vigour8,9,10,11. Beyond quality losses, yield losses from PHS can reach 20–40% in susceptible varieties under humid and wet pre-harvest conditions12. These losses arise from both grain shattering and embryo degradation, which reduce the physiological viability of seeds13,14. Consequently, PHS is increasingly recognized as a severe climate-induced threat that compromises the stability and safety of global wheat production. Major wheat-producing countries have undertaken extensive research to address this issue15,16.
PHS, a complex trait, is influenced by several key factors, including environmental conditions, seed dormancy, seed colour, α-amylase activity, seed coat permeability, levels of endogenous hormones, functional proteins, genes, and quantitative trait loci (QTL), among other factors17. PHS has been observed in a variety of cereal crops, such as maize, wheat, rice, barley, and sorghum, across numerous regions, including Japan, India, China, the United States, Australia, Canada, North Africa, and various parts of Europe14,18. It is recognized as a widespread concern, occurring approximately once every 10 years in major wheat-producing areas worldwide19. The primary genetic factor determining the level of PHS resistance in wheat is seed dormancy. Therefore, investigations into the mechanisms of PHS resistance often focus on the genetic regulation of seed dormancy. Biparental genetic linkage analyses have reported QTL for PHS resistance on all 21 chromosomes of wheat5,20,21,22. However, the consistently identified regions lie predominantly on chromosome 3A in several studies23,24,25,26,27 and on chromosome 4A in others5,28,29,30. Consequently, the predominant strategy for mitigating the risk of PHS is to develop and deploy PHS-resistant wheat varieties through selective breeding. Hence, a main goal of molecular breeding research on PHST is the exploration and identification of key genes and loci.
Species related to wheat are considered potential reservoirs of untapped grain yield and quality traits31. Thus, there is an urgent need to characterize other species of wheat, such as T. sphaerococcum (AABBDD, 2n = 6x = 42), an ancient wheat of Indian origin, to identify promising lines with improved quality traits. This wheat possesses several important characteristics, including short and robust culms, hemispherical grains, higher protein content than bread wheat, and resilience to biotic and abiotic stresses31. Despite these merits, sphaerococcum wheat has been inadequately studied32,33. The introduction of high-yielding wheat varieties during the Green Revolution, together with its susceptibility to rust, drastically reduced its distribution and cultivation after 196034. Despite its fading cultivation, the genetic diversity of T. sphaerococcum remains important because it can be used to improve the nutritional properties and resilience of wheat35. Being a hexaploid species, it holds significant potential for the improvement of bread wheat, in line with the food and nutritional security goals of the United Nations sustainable development agenda36.
GWAS have emerged as powerful tools for dissecting the genetic architecture of complex traits like PHST in diverse wheat germplasm, including Indian origin sphaerococcum wheat. The wheat cultivars AUS1408 and CN1905537; Lok138; SPR819839 have been used as donors for introgression of the tolerance trait in the Indian breeding program. Recent GWAS have identified several QTLs and candidate genes associated with PHST in T. aestivum, such as TaMKK3-A on chromosome 4A and TaVp1 on group 3 chromosomes4,40. Over the last few decades, researchers have extensively investigated the genetic basis of PHST in wheat using bi-parental mapping and association mapping. These comprehensive studies have revealed that PHST is a complex trait influenced by numerous QTLs and genes spread across all 21 wheat chromosomes41,42,43,44,45,46,47,48,49,50. A recent review of interval mapping and association mapping for PHST identified 575 known QTLs and MTAs in wheat12. However, sphaerococcum wheat has not been used extensively for the identification of QTLs/genes associated with PHST. Furthermore, the limited genetic diversity within elite aestivum germplasm poses a bottleneck for breeding durable PHS tolerance. Ancient wheat species like sphaerococcum offer a reservoir of untapped alleles, which could be introgressed to broaden the genetic base and improve resilience under variable climatic conditions31,34. Given that QTLs identified in T. aestivum often show variable expression across environments, the discovery of robust and environment-stable QTLs in sphaerococcum wheat could complement existing breeding strategies5,20.
Haplotype analysis is a powerful method for refining MTAs and identifying superior allelic combinations in wheat. In contrast to single-SNP analysis, haplotypes take into account the combined effects of tightly linked variants within linkage disequilibrium (LD) blocks, thereby offering superior resolution in mapping studies. This approach is particularly effective in crops like wheat, which possess a large and complex genome with extensive LD and polyploidy51. Haplotype-based approaches have been widely used in wheat to analyse quantitative traits such as PHS52, grain yield53, disease resistance54, glume colour55, nitrogen-use efficiency56, and glume pubescence57.
In view of the above, the current study was conceptualized to identify potential markers/genes associated with PHST in Indian dwarf wheat (Triticum sphaerococcum) using GWAS in a panel of 116 T. sphaerococcum wheat accessions, a diverse collection procured from three different gene banks across the world using the Affymetrix 35 K Axiom Wheat Breeders’ Array. The findings of our current study will improve our understanding and offer valuable gene resources for improving the PHST of Indian Dwarf Wheat.
Materials and methods
Plant material and data recording
A global collection of 116 accessions of T. sphaerococcum was evaluated for PHST in three environments, ICAR-National Bureau of Plant Genetic Resources (NBPGR), New Delhi (E1), ICAR-Indian Agricultural Research Institute (IARI), Wellington (E2) and Mahatma Phule Krishi Vidyapeeth (MPKV), Rahuri, Maharashtra (E3), with three biological replicates of each accession in 2023-24. The passport data of the accessions are given in the supplementary table (Table S1). Five spikes from each accession were harvested at physiological maturity, indicated by the loss of green colour from the spike58. PHS was scored on a scale of 1 to 9; genotypes with no visible sprouting were given a score of 1, while genotypes with full sprouting were given a score of 9. This scoring system was adapted from59. Within one hour of harvest, spikes were soaked in water for 4–6 h. Subsequently, following the moist-chamber laboratory assay described by Baier (1987)60, the spikes were incubated in a closed chamber in the laboratory at approximately 20–25 °C and near-saturated (90–100%) relative humidity on a 7.5 cm thick layer of moist sand and covered with two layers of wet jute bags. To prevent drying, the spikes were watered every 3–4 h. Sprouting was recorded ten days after the spikes were harvested and first submerged in water.
Statistical analysis
SAS v9.3 software was used to conduct analysis of variance (ANOVA) to evaluate the variance components associated with genotype, environment, and their interaction for PHST. Best Linear Unbiased Predictions (BLUPs) were estimated for the combined phenotypic data of the three environments (CE) with a linear mixed model in which genotypes were treated as random effects and environments as fixed effects, using the lme4 package in R61. This approach accounts for environmental variation and provides unbiased predictions of genotypic performance across environments, following the methodology described by62. Descriptive statistics for the PHST trait were computed for the four environments (E1, E2, E3, and CE) (Table S2).
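As an illustration of the shrinkage idea behind BLUPs, the following minimal Python sketch computes genotype BLUPs for a balanced genotype-by-environment table. It is not the lme4 analysis used in this study: it assumes one value per genotype per environment, estimates variance components by the ANOVA method of moments, and uses hypothetical toy scores and a hypothetical function name (balanced_blups).

```python
def balanced_blups(table):
    """Genotype BLUPs from a balanced table: table[i][j] = genotype i in environment j.

    Assumed model: y_ij = mu + env_j (fixed) + g_i (random) + error.
    Variance components come from ANOVA mean squares (method of moments),
    a shortcut that is only valid for balanced, complete data.
    """
    n_gen, n_env = len(table), len(table[0])
    grand = sum(map(sum, table)) / (n_gen * n_env)
    gen_means = [sum(row) / n_env for row in table]
    env_means = [sum(row[j] for row in table) / n_gen for j in range(n_env)]

    # Mean square for genotypes and for the residual (interaction) term.
    ms_gen = n_env * sum((m - grand) ** 2 for m in gen_means) / (n_gen - 1)
    ss_err = sum(
        (table[i][j] - gen_means[i] - env_means[j] + grand) ** 2
        for i in range(n_gen)
        for j in range(n_env)
    )
    ms_err = ss_err / ((n_gen - 1) * (n_env - 1))

    # Shrinkage = var_g / (var_g + var_err / n_env) = (MS_gen - MS_err) / MS_gen,
    # truncated at zero when the genotypic variance estimate is negative.
    shrink = max(0.0, (ms_gen - ms_err) / ms_gen)
    return [shrink * (m - grand) for m in gen_means]


# Hypothetical PHS scores: 3 genotypes (rows) x 3 environments (columns).
toy_scores = [[2, 3, 1], [5, 6, 4], [8, 7, 9]]
blups = balanced_blups(toy_scores)
```

Each BLUP is the deviation of the genotype mean from the grand mean, pulled towards zero in proportion to the residual noise; with unbalanced or replicated data a mixed-model fit is required instead of this shortcut.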
SNP genotyping of wheat accessions
Genomic DNA was extracted separately from 15-day-old seedlings of each of the 116 lines following the CTAB procedure63. The association panel of 116 wheat accessions was genotyped using a 35 K Axiom Wheat Breeders Array, following Affymetrix’s protocol (Axiom 2.0 Assay for 384 samples P/N 703154 Rev. 2) for wheat. This process resulted in the identification of 35,143 SNPs. To refine the dataset for downstream analysis, SNPs with a minor allele frequency (MAF) below 0.05 were excluded. Ultimately, 15,308 polymorphic SNPs were retained for the subsequent GWAS analysis.
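The MAF filter described above can be sketched in a few lines of Python. This is an illustrative sketch only, not the pipeline used in the study: it assumes genotype calls coded as 0, 1 or 2 copies of one allele with None for missing calls, and the SNP names and data are hypothetical.

```python
def minor_allele_frequency(calls):
    """MAF from diploid calls coded 0/1/2 (copies of one allele); None = missing."""
    observed = [g for g in calls if g is not None]
    if not observed:
        return 0.0
    allele_freq = sum(observed) / (2 * len(observed))
    return min(allele_freq, 1 - allele_freq)


def filter_by_maf(genotypes, threshold=0.05):
    """Keep SNPs whose MAF is at least `threshold` (0.05 in the text)."""
    return {
        snp: calls
        for snp, calls in genotypes.items()
        if minor_allele_frequency(calls) >= threshold
    }


# Hypothetical toy panel: SNP id -> genotype calls across accessions.
toy_genotypes = {
    "snp_a": [0, 0, 2, 2, 0, 2, None, 0],  # common variant, kept
    "snp_b": [0] * 39 + [2],               # allele frequency 0.025, dropped
    "snp_c": [2, 2, 2, 2],                 # monomorphic, dropped
}
kept = filter_by_maf(toy_genotypes)
```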
Population structure and linkage disequilibrium (LD)
Population structure and PCA were analysed in our previous study using STRUCTURE v 2.3.464. Intra-chromosomal LD between all possible pairs of SNPs was computed in TASSEL v5.0 as the squared allele frequency correlation (r2)65. The background LD was measured to determine a significant distance for LD decay. The average pattern of genome-wide LD decay was evaluated using a scatter plot of r2 values against the corresponding physical distance between markers. The degree of LD decay was evaluated using the LOESS (Locally Weighted Scatter-plot Smoother) model66. The 95th percentile of the square-root-transformed r2 values of unlinked markers was used as the critical r2 value67.
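The two quantities described in this section, pairwise r2 and the background (critical) r2, can be illustrated with a short Python sketch. It is an approximation under stated assumptions rather than the TASSEL computation: r2 is taken as the squared Pearson correlation of 0/1/2 genotype codes, the percentile uses the nearest-rank definition, and the result is back-transformed to the r2 scale; the function names and toy data are hypothetical.

```python
import math


def r_squared(x, y):
    """Squared Pearson correlation between two genotype vectors (0/1/2 coding)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:  # a monomorphic marker carries no LD information
        return 0.0
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    return sxy * sxy / (sxx * syy)


def critical_r2(unlinked_r2, percentile=95):
    """Nearest-rank percentile of sqrt-transformed r2, returned on the r2 scale."""
    roots = sorted(math.sqrt(v) for v in unlinked_r2)
    rank = max(1, -(-percentile * len(roots) // 100))  # integer ceiling
    return roots[rank - 1] ** 2


# Hypothetical examples.
complete_ld = r_squared([0, 0, 2, 2], [2, 2, 0, 0])
no_ld = r_squared([0, 2, 0, 2], [0, 0, 2, 2])
threshold = critical_r2([0.01 * i for i in range(1, 21)])
```

Marker pairs whose r2 exceeds the critical value are then treated as being in LD because of physical linkage rather than background relatedness.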
GWAS and pyramiding effect of desirable alleles
MTAs for the PHST trait were identified using each of the following models: (i) Compressed Mixed Linear Model (CMLM), (ii) Bayesian-information and linkage-disequilibrium iteratively nested keyway (BLINK), (iii) Fixed and random model Circulating Probability Unification (FarmCPU), (iv) Multiple loci mixed model (MLMM), (v) Mixed Linear Model (MLM) and (vi) General Linear Model (GLM). All these models were implemented in R using the GAPIT software package68. The CMLM and GLM allowed SL-GWAS, while FarmCPU, BLINK, MLMM and MLM allowed ML-GWAS analysis. Further, GAPIT was used to compute a marker-based kinship matrix (K). While fitting the GWAS models, the kinship matrix (K) and population structure (Q) were included as covariates. The P value was used as the criterion for identifying significant marker-trait associations, while the coefficient of determination (R2) was used to assess the magnitude of the marker effects. Further, the false discovery rate (FDR) was used to correct for multiple hypothesis testing. A threshold of p < 0.001 was used to declare significant QTLs in the current study. MTAs detected by at least two models or in at least two environments were designated as consistent QTLs. The pyramiding effect of SNPs associated with PHST was evaluated using linear regression analysis. The number of desirable SNP alleles was used as the independent variable, while the corresponding trait values of genotypes carrying varying numbers of these alleles served as the dependent variable69.
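For readers comparing the multiple-testing criteria mentioned above, the following Python sketch shows a Bonferroni threshold and Benjamini-Hochberg FDR-adjusted p-values. It is a generic illustration, not GAPIT's implementation; the Benjamini-Hochberg procedure is assumed here as the FDR method, the 0.05 family-wise level is an assumption, and the p-values are hypothetical.

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test p-value cutoff that controls the family-wise error rate at alpha."""
    return alpha / n_tests


def benjamini_hochberg(pvalues):
    """Benjamini-Hochberg adjusted p-values, returned in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotone adjusted values.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# With 15,308 SNPs, a Bonferroni cutoff at an assumed alpha of 0.05 is far
# stricter than the exploratory threshold of p < 0.001.
cutoff = bonferroni_threshold(0.05, 15308)

# Hypothetical p-values for four markers.
adjusted = benjamini_hochberg([0.01, 0.04, 0.03, 0.005])
```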
Mining candidate regions for key genes and haplotype blocks
The physical position of each SNP was used as input in the EnsemblPlants database (http://www.ensembl.org/info/docs/tools/vep/index.html). For each SNP, the corresponding chromosomal region was extended by 1 Mb upstream and downstream, generating a 2 Mb interval for mining potential candidate genes (CGs) associated with seed germination and dormancy. The Biomart tool, available at the EnsemblPlants database, was used to extract information on proteins encoded by the genes. To determine the potential involvement of the identified CGs in regulating the PHS trait, their annotations were confirmed through published papers. The GO annotations (including molecular function and biological process) for each CG were extracted from the IWGSC website (http://www.wheatgenome.org/).
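The window-based search for candidate genes can be sketched as follows. This Python example only illustrates the interval logic (1 Mb on either side of an SNP, giving a 2 Mb window); it does not query EnsemblPlants or BioMart, and the gene records, identifiers and SNP position are hypothetical.

```python
def candidate_window(position_bp, flank_bp=1_000_000):
    """Return (start, end) of the interval extending flank_bp either side of an SNP."""
    return max(1, position_bp - flank_bp), position_bp + flank_bp


def genes_in_window(genes, chrom, position_bp, flank_bp=1_000_000):
    """IDs of genes on `chrom` that overlap the window around the SNP."""
    start, end = candidate_window(position_bp, flank_bp)
    return [
        g["id"]
        for g in genes
        if g["chrom"] == chrom and g["start"] <= end and g["end"] >= start
    ]


# Hypothetical annotation records (coordinates in base pairs).
toy_genes = [
    {"id": "gene_A", "chrom": "1B", "start": 4_200_000, "end": 4_203_000},
    {"id": "gene_B", "chrom": "1B", "start": 6_500_000, "end": 6_504_000},
    {"id": "gene_C", "chrom": "6B", "start": 5_000_000, "end": 5_002_000},
]
hits = genes_in_window(toy_genes, "1B", 5_000_000)
```

The start coordinate is clipped at 1 so that SNPs near a chromosome end do not produce negative positions.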
To investigate haplotype variation within these key regions, haplotype analysis was performed for MTAs with detectable haplotype blocks in their surrounding LD region using the geneHapR package in R70,71. Two MTAs (AX-94415302, AX-95097524) did not show any haplotype blocks in their surrounding LD regions and were therefore excluded from this analysis.
Identification of superior haplotypes by Haplo-Pheno analysis
To identify superior haplotypes associated with the PHS trait, haplo-pheno analysis was performed using the geneHapR package in R70,71. This analysis evaluated the phenotypic effects of all 12 significant MTAs from the GWAS, including the two excluded from haplotype analysis, and grouped genotypes based on extreme haplotypes for trait evaluation. Extreme phenotypic classes for PHST (highly resistant and highly susceptible accessions) were selected to ensure clear differentiation. Haplo-pheno analysis allowed comparison of PHST values across haplotype groups, thereby distinguishing favourable allelic combinations. Genotypes carrying haplotypes with significantly higher PHST scores were considered superior, representing promising candidates for breeding.
Results
Statistical analyses
ANOVA was performed to assess the effects of genotype and environment on the observed trait. The results revealed that genotypic differences were highly significant (F = 15.97, p < 2e–16), indicating substantial variation among genotypes for the trait under study. In contrast, the effect of the environments (E1, E2 and E3) was statistically non-significant (p = 0.534), suggesting that variation among environments was minimal (Table S3). The residual variance accounted for the remaining unexplained variability. Overall, these results highlight the strong genetic influence on the trait, confirming the potential for selection and genetic improvement (Table S3). Notched box plots were used to visualize the distribution of PHS values across the three environments (E1, E2, E3) (Fig. 1a). The PHS trait showed consistent median values and distributions across the three environments, suggesting good repeatability and minimal environmental variability. Phenotypic data from the three environments were combined using BLUPs to account for genotype-by-environment interactions and improve the reliability of trait estimates. These BLUP values were then used for GWAS to identify markers consistently associated with the PHST trait across varying environmental conditions37,72,73. Hierarchical clustering divided the accessions into four distinct phenotypic groups. Lower BLUP values indicate higher tolerance to PHS. Genotypes in Cluster 1 had the lowest values, suggesting they are highly tolerant. Cluster 3 genotypes showed slightly higher values, indicating moderate tolerance. In contrast, Cluster 4 genotypes were moderately susceptible, and the highest values were observed in Cluster 2, which suggests that these genotypes are the least tolerant to PHS (Fig. 1b). The visual scoring method used for PHS on a scale of 1–9 is shown in Fig. 1c.
(a) Notched box plot showing the distribution of PHS score in different environments, (b) genotypic variation in PHST trait value across the clusters, (c) scale used for measuring PHS trait.
Population structure, marker coverage and LD analysis
Population structure analysis led to the identification of four sub-populations amongst the 116 T. sphaerococcum accessions as reported in our previous paper64. After filtering, 15,308 polymorphic SNPs out of 35,144 SNPs were utilized for association mapping. Out of the 15,308 filtered SNPs, 4802 were mapped on the A sub-genome, 5925 on the B sub-genome, and 4581 on the D sub-genome (Fig. 2a). The number of SNPs mapped on individual chromosomes ranged from 255 (Chr4D) to 1,054 (Chr2B). The distribution of SNPs on three sub-genomes showed that A sub-genome has the maximum SNPs on Chr7A (823), followed by Chr2A and Chr5A (787); the B sub-genome has the maximum SNPs on Chr2B (1054), followed by Chr1B (992), whereas the D sub-genome has the maximum SNPs on Chr2D (1033), followed by Chr1D (742) (Fig. 2b).
(a) Distribution of filtered SNPs across chromosomes and sub-genomes used for the GWAS analysis, (b) marker density plot showing the distribution of SNPs across the 21 wheat chromosomes.
Chromosome-wise LD plots were also drawn for the 15,308 SNP markers to investigate pairwise linkage among markers. The average genome-wide r2 was 0.197 for sub-genome A, 0.172 for B and 0.177 for D. SNP markers, with their assigned physical positions on the map, were further used to estimate intra-chromosomal LD. Across the 21 wheat chromosomes, the r2 value for LD was lowest for chromosome 4D (0.175) and highest for chromosome 2D (0.315). The fastest LD decay was observed for the D sub-genome, followed by the B and A sub-genomes (Fig. 3). In the D sub-genome, LD decayed within 6.81 Mb, compared with 11.24 Mb in the B and 12.06 Mb in the A sub-genome. A detailed summary of markers, including chromosome distribution, average LD score, and other associated statistics, is presented in Table 1.
Estimation of Linkage Disequilibrium (LD) decay rate for the (a) A sub-genome, (b) B sub-genome, (c) D sub-genome, and (d) whole genome.
Genome-wide association analysis
A total of twelve MTAs for PHST were identified using single-locus and multi-locus models (p < 0.001) (Table 2). The genome-wide significant p-value threshold was adjusted based on Bonferroni correction. The MTAs were distributed on eight chromosomes (1A, 1B (4), 1D, 2A (2), 3A, 3D, 5A, 6B). The A and B genomes each harboured five MTAs, followed by the D genome with two. Five stable MTAs, AX-94415302, AX-94919611 and AX-94403953 on chromosome 1B, AX-95220897 on chromosome 6B and AX-94756068 on chromosome 2A, were detected in all three environments and CE and were common to three models (FarmCPU, BLINK and GLM). AX-94414200 on chromosome 1B was found in two environments (E1 and E2) and CE. AX-94823205 (Chr1A), AX-94939596 (Chr1D), AX-94523390 (Chr2A), AX-95003297 (Chr3A), AX-94580041 (Chr3D) and AX-95097524 (Chr5A) were consistent in only two environments (E1 and E3). Notably, the marker AX-94919611 (Chr1B) was detected consistently by all six methods across all three locations and was therefore considered a stable and major locus. Manhattan plots and QQ plots of the GWAS results for CMLM, FarmCPU, BLINK, and MLMM are shown in Fig. 4.
Circular Manhattan plots obtained from (a) BLINK, (b) FarmCPU, (c) CMLM, (d) MLMM. In each circular plot, the inner, middle and outer tracks represent the E1, E2, and E3 environments, respectively. The threshold value, −log10(p) ≥ 3, is indicated as a red dotted circle. For each case, multi-track Q-Q plots are shown alongside the circular Manhattan plots.
Pyramiding effect of desirable alleles
The pyramiding effect of desirable alleles from multiple associated SNPs was evaluated using linear regression analysis. In the case of PHST, twelve MTAs exhibited significant associations across various environments. A progressive accumulation of up to eight favourable alleles corresponded with a marked reduction in PHS levels, as illustrated in Fig. 5. The estimated regression coefficients for these associations ranged from 0.173 to 0.201.
Linear regression analysis depicting the relationship between the number of desirable SNP alleles (independent variable) and PHS score (dependent variable). R2 = coefficient of determination; ** denotes significance at the 0.0001 level.
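The pyramiding regression can be reproduced in outline with ordinary least squares. The Python sketch below is a minimal illustration, not the analysis code of this study: it regresses a PHS score on the number of desirable alleles and reports the slope, intercept and R2, using hypothetical data and a hypothetical function name (fit_line).

```python
def fit_line(x, y):
    """Ordinary least-squares fit of y on x; returns (slope, intercept, r2)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    ss_tot = sum((b - mean_y) ** 2 for b in y)
    ss_res = sum((b - (intercept + slope * a)) ** 2 for a, b in zip(x, y))
    return slope, intercept, 1 - ss_res / ss_tot


# Hypothetical accessions: number of desirable alleles and PHS score (1-9 scale).
n_alleles = [0, 2, 4, 6, 8]
phs_score = [9, 6, 5, 4, 1]
slope, intercept, r2 = fit_line(n_alleles, phs_score)
```

A negative slope means that each additional desirable allele is associated with a lower PHS score, that is, with higher tolerance.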
Identification of candidate genes and haplotype diversity
A 1 Mb region flanking either side of each significant MTA linked to the PHST trait was used to pinpoint CGs in the annotated wheat reference sequence (Wheat Chinese Spring IWGSC Ref Seq v2.1 genome assembly, 2021). A total of 176 PHS-related CGs were identified, of which 47 genes, associated with 9 markers, were directly involved in regulating seed germination and dormancy. The remaining 129 genes are likely to influence PHS indirectly through pathways related to stress response, hormone signalling, and transcriptional regulation. These 47 potential CGs encoded proteins that contained 15 different types of domains. Some of the important domains include the following: (i) leucine-rich-repeat (LRR) superfamily, (ii) NAC domain superfamily, (iii) serine/threonine protein kinase, (iv) F-box domain, (v) WRKY domain, (vi) SANT/Myb domain, (vii) cytochrome P450, (viii) homeobox-like domain, (ix) WD40 repeat. A set of 14 CGs underlying 4 MTAs located on 4 different wheat chromosomes encoded proteins that contained an F-box domain (Table 3). Similarly, 10 CGs associated with 2 MTAs on nine different chromosomes encoded proteins that contained a SANT/Myb domain. Detailed information on the 176 CGs and their functional annotations is presented in Table S4.
Haplotype analysis across the LD block regions of 10 significant MTAs revealed variable haplotype blocks, reflecting genetic diversity around these candidate regions. The highest diversity was observed in the candidate region of MTA AX-94523390, with 32 haplotype blocks, followed by 27 haplotype blocks in the LD regions of AX-94414200 and AX-94409353. In contrast, the LD region of MTA AX-94823205 exhibited the lowest haplotype diversity, with only seven haplotype blocks (Fig. 6).
Bar graph representing the number of haplotypes surrounding each MTA.
Superior haplotypes regulating PHST
Haplo-pheno analysis of the significant MTAs revealed eight distinct haplotypes and allele patterns underlying variation in pre-harvest sprouting tolerance (PHST) (Fig. 7). Among these, five haplotypes (H001–H004 and H008) were predominantly associated with PHS susceptible accessions, including TS67, TS3, TS26, TS10, and TS14. In contrast, three haplotypes - H005 (TS28), H006 (TS64), and H007 (TS81) - were consistently present in PHS tolerant accessions, representing favorable allelic combinations for PHS tolerance. These superior haplotypes exhibited significantly higher PHST values compared to the susceptible groups, clearly distinguishing tolerant and susceptible genotypes. The tolerant haplotypes identified here provide valuable targets for marker-assisted selection and can serve as novel genetic resources for introgression of PHS tolerance into elite wheat breeding lines (Fig. 8).
Haplotype analysis identified 8 haplotypes, of which 5 (H001, H002, H003, H004, H008) were associated with susceptibility to PHS, while the remaining 3 (H005, H006, H007) were associated with tolerance to PHS.
Developing PHS tolerant T. sphaerococcum varieties with superior haplotypes.
Discussion
Triticum sphaerococcum, often known as Indian dwarf wheat, had been cultivated for thousands of years prior to the Green Revolution. This wheat, which is native to India and Pakistan, was preferred by local farmers due to its unique characteristics, which include round grains, erect leaves, sturdy short stems, and a high tolerance to abiotic stressors. These characteristics have significant potential for modern wheat improvement74. However, modern wheat cultivars have replaced Indian dwarf wheat nationwide due to the advent of domestication and the Green Revolution. Despite this decline, Indian dwarf wheat still harbors numerous unexplored genetic variations31,34,74 that would help in overcoming the limitations of bread wheat for numerous traits. Among these, PHS represents a major challenge, as it reduces grain quality and causes substantial economic losses, particularly in regions with humid harvest conditions75. Interestingly, sphaerococcum wheat, characterized by its distinctive round grains, has recently gained attention for its potential tolerance to PHS76. Therefore, it would be wise to investigate the possibility of introgressing PHS tolerance alleles from T. sphaerococcum to bread wheat as a promising strategy for wheat improvement. In the current study, T. sphaerococcum accessions were genotyped using the 35 K Axiom Wheat Breeders’ Array, chosen for its high density and broad genome coverage. This genotyping not only provides valuable insights into the genetic makeup of the species but also offers wheat improvement programs access to diverse novel alleles that can be harnessed for enhancing PHS tolerance. To our knowledge, this study provides the first insights into PHS tolerance in T. sphaerococcum, as no previous reports are available for this species. Therefore, the findings are interpreted in comparison with those reported previously for T. aestivum.
The conventional test that simulates field conditions was used to assess PHS in this study, as in several previous investigations14,25,77,78. The test required immersing the spikes in water and keeping them moist long enough for sprouting to take place in susceptible accessions. Other parameters, such as falling number (FN) and alpha-amylase activity, could also be used to assess susceptibility to PHS, but each has its limitations79. These parameters have been shown to correlate strongly with PHS and may thus be used to assess it. However, although they are associated with PHS, they may not be its cause, because FN and alpha-amylase activity measure the quality of the endosperm after sprouting rather than PHS itself25. In our experiment, the germination test with intact spikes was applied to check how well the seeds resist sprouting, rather than to examine the condition of their endosperm. The results of the present study indicate that a germination test is suitable for determining whether accessions are likely to sprout in the field under uncertain climatic conditions at maturity.
The overall genetic richness and diversity of wheat genome is assessed by the distribution and density of polymorphic markers. Out of 15,308 SNPs used in our analysis, the lowest frequency was observed in the D sub-genome, and the B sub-genome harboured the maximum frequency, which is consistent with the previous studies80. The B and A sub-genomes, which are considered to be older, contain a higher number of SNPs, probably due to their domestication events earlier by gene flow and gene duplications, which have caused more mutations to accumulate over time81,82.
Since phenotypes are affected by genes and the environment, using SNPs to group accessions is better than using phenotypes alone. Accounting for population structure lowers the chance of finding false associations, so population structure analysis improves both the understanding of genetic diversity and the accuracy of association mapping83,84. We found four subpopulations in our analysis that varied in their allele frequencies, and these could be linked to genetic bottlenecks, recombination events over time, and selection pressures (artificial and natural)85,86.
The precision of association mapping depends on the selection of an appropriate number of markers, the extent of LD, and its decay rate in the mapping panel used87. Thus, it is important to determine the range of LD in the species under study. In our study, we examined the range of LD in each of the three sub-genomes separately and across the whole genome to gain a deeper understanding of the genetic architecture. According to earlier reports, LD decays faster in sub-genome B than in sub-genomes A and D88,89. However, in our study, LD decay was fastest in sub-genome D, followed by sub-genomes B and A, which aligns with results from other previous studies90,91. The D sub-genome usually exhibits faster LD decay because of its higher and more consistent recombination distribution along the chromosomes92. The most likely cause of the differences in LD decay patterns among the sub-genomes across studies is the use of different study materials with distinct population stratification, selection pressures, and levels of gene flow34.
Twelve MTAs for PHS in wheat were found on chromosomes 1A, 1B, 1D, 2A, 3A, 3D, 5A, and 6B, shedding light on the genetic makeup of this intricate trait. Interestingly, most of these markers were found in the A and B genomes, which is consistent with earlier research that found important PHS-related quantitative trait loci (QTLs) in these genomes. For example, Chao et al.93 highlighted the important role of chromosomes 2B, 3A, and 4A in PHS resistance by identifying key PHS-related QTLs on them. Munkvold et al.94 also identified QTLs on chromosome 1B, highlighting the B genome’s significance in PHST. Furthermore, Kulwal et al.95 identified a significant QTL for PHS tolerance on chromosome 3A that explained up to 78.03% of the phenotypic variance. Together, these investigations highlight the critical roles that the A and B genomes play in wheat’s tolerance to PHS. Similarly, the markers on chromosomes 1A, 1D, 2D, 3A, 3D, and 5A, which were consistently detected in two environments, and AX-94414200 on chromosome 1B, which was found in three environments, are important for PHST breeding. The consistent detection of markers AX-94415302, AX-94919611, and AX-94403953 on chromosome 1B, AX-95220897 on chromosome 6B, and AX-94756068 on chromosome 2A across all four environments and three models shows that they are stable and reliable indicators of PHST in wheat.
QTLs for PHST/dormancy were found on up to 20 distinct chromosomes in previous wheat studies, with chromosome 1D being the only exception23,76,96,97,98,99,100,101,102,103. Interestingly, no significant associations were detected on chromosome 4A in our study, despite it being widely recognized as carrying a major locus for PHS tolerance97,104,105,106,107,108,109,110,111,112,113,114. Although T. sphaerococcum and T. aestivum are both hexaploid wheats, the absence of an association at the well-known 4A locus for PHS resistance in bread wheat may be due to species-specific differences in genomic architecture and allelic composition. It is plausible that the allelic variation on chromosome 4A that confers PHST in T. aestivum is absent or fixed in T. sphaerococcum, or that its role is taken over by other genomic regions. Such species-specific divergence suggests that PHST in T. sphaerococcum may be governed by novel loci not previously reported in bread wheat, thereby underscoring the unique contribution of this species to broadening the genetic base for PHST.
The pyramiding effect of multiple associated SNPs was found to be significant, and the genotypes carrying a greater number of favourable alleles consistently exhibited superior phenotypic performance when compared to those with fewer favourable alleles (Fig. 5). Although the regressions for the pyramiding effect were statistically significant, their coefficients of determination (R² values) were relatively low (0.173–0.201), indicating that the pyramided alleles accounted for only about 17–20% of the total phenotypic variance. This highlights the complex and polygenic nature of PHS tolerance, suggesting that additional genetic factors and interactions beyond the pyramided alleles contribute substantially to the trait.
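To make the link between R² and explained variance concrete, the sketch below fits a simple least-squares line of a phenotype score on the number of favourable alleles carried by each accession and returns the slope, intercept, and R². It is a minimal illustration under stated assumptions: the function name `pyramiding_r2` and the toy numbers are hypothetical and are not data from this study, and the paper's own regression may have been specified differently.

```python
from statistics import mean


def pyramiding_r2(allele_counts, phenotypes):
    """Ordinary least-squares fit of phenotype on favourable-allele count.

    allele_counts: number of favourable alleles carried by each accession.
    phenotypes: phenotype value (e.g. a sprouting score) for the same accessions.
    Returns (slope, intercept, r_squared).
    """
    if len(allele_counts) != len(phenotypes) or len(allele_counts) < 3:
        raise ValueError("need two equal-length sequences with at least 3 accessions")

    x_bar = mean(allele_counts)
    y_bar = mean(phenotypes)
    sxx = sum((x - x_bar) ** 2 for x in allele_counts)
    syy = sum((y - y_bar) ** 2 for y in phenotypes)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(allele_counts, phenotypes))
    if sxx == 0:
        raise ValueError("all accessions carry the same number of favourable alleles")

    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    # For a single predictor, R^2 is the squared correlation between x and y,
    # i.e. the share of phenotypic variance explained by the allele count.
    r_squared = (sxy ** 2) / (sxx * syy) if syy > 0 else 0.0
    return slope, intercept, r_squared


# Toy (hypothetical) example: more favourable alleles, lower sprouting score
toy_counts = [0, 1, 2, 3]
toy_scores = [10.0, 8.0, 6.0, 4.0]
toy_slope, toy_intercept, toy_r2 = pyramiding_r2(toy_counts, toy_scores)
```

With real, noisy data the fit can be statistically significant while R² stays near 0.2, which is the situation described above: the trend with allele number is real, but most of the phenotypic variance is left unexplained.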
The 47 potential CGs identified in this study are linked to several molecular processes underlying seed dormancy and PHS tolerance in wheat. Gene ontology analysis revealed that these CGs are involved in several key functions such as abscisic acid (ABA) signalling, gibberellin (GA) biosynthesis, and seed dormancy and germination. A significant portion of the CGs identified in this study are involved in the ABA biosynthesis and signalling network and are therefore important for understanding the molecular mechanisms driving PHST in wheat (Table 3). These ABA-related genes are involved in glucose signalling, metabolism, root growth, faulty embryo formation, catalytic activity, and ABA signalling breakdown or deactivation96,97,98,99,100,115,116,117,118,119. Leucine-rich repeat (LRR) genes are well known for playing crucial roles in plant development, immunity, and stress responses because of their involvement in signal transduction pathways. Although there is currently little direct functional evidence, LRR-containing proteins in wheat are increasingly being linked to seed dormancy and PHST. The wheat genome contains several QTLs linked to PHST, particularly on chromosomes 3A and 4A, in regions that are frequently enriched with genes encoding LRR proteins95,120. For example, the QPhs.ccsu-3A.1 QTL was introgressed into the PHS-susceptible cultivar HD2329, and the region contains CGs, some with LRR domains, that may influence hormonal pathways such as ABA signalling, which is critical for maintaining seed dormancy102,121. In many plant species, two main endogenous hormones, ABA and GA, are generally believed to regulate seed dormancy and germination (i.e., PHST) antagonistically103,122. While GA promotes germination, ABA promotes dormancy123,124.
The exact regulatory mechanisms controlling the balance between ABA and GA in seed dormancy and germination are still not fully understood, despite years of intensive research on the two hormones. A majority of the CGs found in our analysis contain the F-box domain (Table 3). In wheat, the F-box protein gene TaFBA1 plays an important role in abiotic stress tolerance. Given the key role of ABA in maintaining seed dormancy and preventing premature germination, F-box proteins like TaFBA1 might influence PHS tolerance by modulating ABA signalling pathways. Moreover, the interaction of F-box proteins with key components of the ABA signalling cascade, such as RCAR1 and ABI5, emphasizes their potential regulatory role in seed dormancy mechanisms106,125. Thus, F-box domain-containing genes in wheat are promising candidates for future research aimed at improving PHST through genetic and biotechnological approaches. These findings imply that the genetic regulation of PHS and seed dormancy is extremely complex, and a more thorough investigation is required to fully understand how these CGs contribute to PHS tolerance. Myb10-D proteins are also known to confer PHST by enhancing ABA biosynthesis, thereby delaying germination in wheat11.
Our haplotype analysis also revealed a contrasting distribution of alleles between PHS-tolerant and PHS-susceptible genotypes, which underscores the potential utility of H005, H006, and H007 as reliable markers for MAS and haplotype-based breeding for PHST in wheat. Furthermore, the SNPs underlying the haplotypes identified in our study are located in genomic regions harbouring genes annotated for seed dormancy regulation, hormone signalling pathways such as ABA and GA, and cell wall metabolism, all of which have been previously implicated in PHS resistance105,108. The superior haplotypes (H005, H006 and H007) may therefore represent allelic variants that enhance dormancy or strengthen protective barriers, conferring greater tolerance to PHS under humid conditions. In contrast, the inferior haplotypes (H001, H002, H003, H004 and H008) may either lack these favourable alleles or carry alternate variants that reduce dormancy and increase susceptibility to PHS. These results imply that the superior haplotypes may represent favourable allele combinations rather than just statistical relationships, though functional validation will be necessary. Such insights strengthen their value as targets for MAS and for introgression into elite wheat lines to broaden the genetic base of PHST.
Conclusion
This study provides the first evidence of novel genetic variation for PHST in Indian dwarf wheat, establishing a foundation for future breeding strategies in wheat improvement. Genetic diversity analysis of T. sphaerococcum using the 35K Axiom array revealed significant genetic variability across 116 accessions. By evaluating these accessions across three environments, we identified twelve significant MTAs, of which several were consistent across environments and models, predominantly in the A and B sub-genomes, which further supports the potential role of these genomic regions in regulating PHST. Candidate gene analysis revealed key functional categories, including ABA/GA signalling components, F-box proteins, LRR domain-containing genes, and Myb10 transcription factors, all of which are implicated in the regulation of seed dormancy and sprouting responses. The abundance of genes associated with ABA and GA signalling, particularly those with LRR and F-box domains, points to the complex hormonal interplay that regulates dormancy and sprouting responses. Additionally, the presence of favourable haplotypes (H005, H006, and H007), together with the pyramiding effect of favourable alleles, indicates the potential of haplotype-assisted selection to enhance PHST in wheat. Our study highlights the value of this neglected wheat germplasm for identifying new alleles and genetic regulatory mechanisms, offering significant potential for developing wheat cultivars that are more tolerant to PHS in the face of climate change. Future functional validation and genomic studies are essential to further unravel the complexity of PHST and effectively translate these discoveries into applied breeding strategies.
Data availability
The datasets generated and/or analysed during the current study are available in the Indian Array Data Archive of Indian Biological Data Centre (Project ID: IADA-PRjisbyxke).
Abbreviations
- ANOVA: Analysis of variance
- BLINK: Bayesian-information and linkage-disequilibrium iteratively nested keyway
- BLUP: Best linear unbiased prediction
- CE: Combined environment
- CG: Candidate gene
- CMLM: Compressed mixed linear model
- CTAB: Cetyl trimethyl ammonium bromide
- FarmCPU: Fixed and random model circulating probability unification
- FDR: False discovery rate
- GAPIT: Genome association and prediction integrated tool
- GLM: General linear model
- GO: Gene ontology
- GWAS: Genome wide association studies
- IWGSC: International wheat genome sequencing consortium
- K: Kinship matrix
- LD: Linkage disequilibrium
- LOESS: Locally weighted scatterplot smoother
- LRR: Leucine-rich repeat
- LSD: Least significant difference
- MAF: Minor allele frequency
- MAS: Marker assisted selection
- Mb: Megabase
- MLM: Mixed linear model
- MLMM: Multiple loci mixed model
- MTA: Marker trait association
- PCA: Principal component analysis
- PHS: Pre harvest sprouting
- PHST: Pre harvest sprouting tolerance
- Q: Population structure matrix
- QQ Plot: Quantile-quantile plot
- QTL: Quantitative trait loci
- r²: Squared allele frequency correlation
- SNP: Single nucleotide polymorphism
- UN: United nations
References
Ali, A. et al. Unraveling molecular and genetic studies of wheat (Triticum aestivum L.) resistance against factors causing pre-harvest sprouting. Agronomy 9, 117. https://doi.org/10.3390/agronomy9030117 (2019).
Kruger, J. In Preharvest Field Sprouting in Cereals. 1–14 (CRC Press, 2018).
Andreoli, C., Bassoi, M. C. & Brunetta, D. Genetic control of seed dormancy and pre-harvest sprouting in wheat. Scientia Agricola. 63, 564–566. https://doi.org/10.1590/S0103-90162006000600009 (2006).
Zhang, Y., Xia, X. & He, Z. The seed dormancy allele TaSdr-A1a associated with pre-harvest sprouting tolerance is mainly present in Chinese wheat landraces. Theor. Appl. Genet. 130, 81–89. https://doi.org/10.1007/s00122-016-2793-0 (2017).
Cabral, A. L. et al. Identification of candidate genes, regions and markers for pre-harvest sprouting resistance in wheat (Triticum aestivum L). BMC Plant Biol. 14, 1–12. https://doi.org/10.1186/s12870-014-0340-1 (2014).
Wang, X. et al. Phenotypic and genotypic characterization of near-isogenic lines targeting a major 4BL QTL responsible for pre-harvest sprouting in wheat. BMC Plant Biol. 19, 1–10. https://doi.org/10.1186/s12870-019-1961-1 (2019).
Patwa, N. & Penning, B. W. Environmental impact on cereal crop grain damage from pre-harvest sprouting and late maturity alpha-amylase. In Sustainable Agriculture in the Era of Climate Change. https://doi.org/10.1007/978-3-030-45669-6_2 (2020).
Miao, X. et al. Mapping quantitative trait loci for pre-harvest sprouting resistance in white-grained winter wheat line CA 0431. Crop Pasture Sci. 64, 573–579. https://doi.org/10.1071/CP13102 (2013).
Liu, C. et al. Reprogramming of seed metabolism facilitates pre-harvest sprouting resistance of wheat. Sci. Rep. 6, 20593. https://doi.org/10.1038/srep20593 (2016).
Brown, L. K., Wiersma, A. T. & Olson, E. L. Preharvest sprouting and α-amylase activity in soft winter wheat. J. Cereal Sci. 79, 311–318. https://doi.org/10.1016/j.jcs.2017.11.016 (2018).
Lang, J. et al. Myb10-D confers PHS‐3D resistance to pre‐harvest sprouting by regulating NCED in ABA biosynthesis pathway of wheat. New Phytol. 230, 1940–1952. https://doi.org/10.1111/nph.17312 (2021).
Singh, C. et al. Pre-harvest sprouting in wheat: current status and future prospects. J. Cereal Res. https://doi.org/10.25174/2582-2675/2021/114484 (2021).
Mares, D. Pre-harvest sprouting in wheat. I. Influence of cultivar, rainfall and temperature during grain ripening. Aust. J. Agric. Res. 44, 1259–1272. https://doi.org/10.1071/AR9931259 (1993).
Biddulph, T., Plummer, J., Setter, T. & Mares, D. Seasonal conditions influence dormancy and preharvest sprouting tolerance of wheat (Triticum aestivum L.) in the field. Field Crops Res. 107, 116–128. https://doi.org/10.1016/j.fcr.2008.01.003 (2008).
Lei, L. et al. TaMFT-A1 is associated with seed germination sensitive to temperature in winter wheat. PloS One. 8, e73330. https://doi.org/10.1371/journal.pone.0073330 (2013).
Xiao, S. H., Zhang, X. Y., Yan, C. S. & Lin, H. Germplasm improvement for preharvest sprouting resistance in Chinese white-grained wheat: an overview of the current strategy. Euphytica 126, 35–38. https://doi.org/10.1023/A:1019679924173 (2002).
Gao, X. et al. Factors affecting pre-harvest sprouting resistance in wheat (Triticum aestivum L.): A review. (2013).
Nakamura, S. Grain dormancy genes responsible for preventing pre-harvest sprouting in barley and wheat. Breed. Sci. 68, 295–304 (2018).
Olaerts, H. & Courtin, C. M. Impact of preharvest sprouting on endogenous hydrolases and technological quality of wheat and bread: a review. Compr. Rev. Food Sci. Food Saf. 17, 698–713. https://doi.org/10.1111/1541-4337.12347 (2018).
Mohan, A. et al. Genome-wide QTL analysis for pre-harvest sprouting tolerance in bread wheat. Euphytica 168, 319–329. https://doi.org/10.1007/s10681-009-9935-2 (2009).
Cao, L. et al. Detection of QTLs for traits associated with pre-harvest sprouting resistance in bread wheat (Triticum aestivum L). Breed. Sci. 66, 260–270 (2016).
Fakthongphan, J., Graybosch, R. & Baenziger, P. Combining ability for tolerance to pre-harvest sprouting in common wheat (Triticum aestivum L). Crop Sci. 56, 1025–1035. https://doi.org/10.2135/cropsci2015.08.0490 (2016).
Osa, M. et al. Mapping QTLs for seed dormancy and the Vp1 homologue on chromosome 3A in wheat. Theor. Appl. Genet. 106, 1491–1496. https://doi.org/10.1007/s00122-003-1208-1 (2003).
Kato, K., Nakamura, W., Tabiki, T., Miura, H. & Sawada, S. Detection of loci controlling seed dormancy on group 4 chromosomes of wheat and comparative mapping with rice and barley genomes. Theor. Appl. Genet. 102, 980–985. https://doi.org/10.1007/s001220000494 (2001).
Kulwal, P., Singh, R., Balyan, H. & Gupta, P. Genetic basis of pre-harvest sprouting tolerance using single-locus and two-locus QTL analyses in bread wheat. Funct. Integr. Genom. 4, 94–101. https://doi.org/10.1007/s10142-004-0105-2 (2004).
Mori, M., Uchino, N., Chono, M., Kato, K. & Miura, H. Mapping QTLs for grain dormancy on wheat chromosome 3A and the group 4 chromosomes, and their combined effect. Theor. Appl. Genet. 110, 1315–1323. https://doi.org/10.1007/s00122-005-1972-1 (2005).
Liu, S., Bai, G., Cai, S. & Chen, C. Dissection of genetic components of preharvest sprouting resistance in white wheat. Mol. Breeding. 27, 511–523. https://doi.org/10.1007/s11032-010-9448-7 (2011).
Mares, D. et al. A QTL located on chromosome 4A associated with dormancy in white-and red-grained wheats of diverse origin. Theor. Appl. Genet. 111, 1357–1364. https://doi.org/10.1007/s00122-005-0065-5 (2005).
Chen, C. X., Cai, S. B. & Bai, G. H. A major QTL controlling seed dormancy and pre-harvest sprouting resistance on chromosome 4A in a Chinese wheat landrace. Mol. Breeding. 21, 351–358. https://doi.org/10.1007/s11032-007-9135-5 (2008).
Singh, R., Matus-Cádiz, M., Båga, M., Hucl, P. & Chibbar, R. N. Identification of genomic regions associated with seed dormancy in white-grained wheat. Euphytica 174, 391–408. https://doi.org/10.1007/s10681-010-0137-8 (2010).
Adhikari, S. et al. Unlocking the potential of ancient hexaploid Indian dwarf wheat, Triticum sphaerococcum for grain quality improvement. PeerJ 11, e15334. https://doi.org/10.7717/peerj.15334 (2023).
Josekutty, P. C. Defining the genetic and physiological basis of Triticum sphaerococcum Perc. (2008).
Matsuoka, Y. Evolution of polyploid Triticum wheats under cultivation: the role of domestication, natural hybridization and allopolyploid speciation in their diversification. Plant Cell Physiol. 52, 750–764. https://doi.org/10.1093/pcp/pcr018 (2011).
Gaikwad, K. B. et al. Trait phenotyping in an ancient Indian landrace of wheat Triticum sphaerococcum under optimum, terminal heat stress and deficit irrigation conditions. Genet. Resour. Crop Evol. 71, 2779–2795. https://doi.org/10.1007/s10722-023-01817-z (2024).
Gaikwad, K. B. et al. Evaluating heat and drought resilience in ancient Indian dwarf wheat Triticum sphaerococcum Percival using stress tolerance indices. Sci. Rep. 15, 1–22. https://doi.org/10.1038/s41598-025-02502-0 (2025).
Mazumder, A. K. et al. Discovering novel genomic regions explaining adaptation of bread wheat to conservation agriculture through GWAS. Sci. Rep. 14, 16351. https://doi.org/10.1038/s41598-024-66903-3 (2024).
Gautam, T. et al. Development of white-grained PHS-tolerant wheats with high grain protein and leaf rust resistance. Mol. Breeding. 41, 42. https://doi.org/10.1007/s11032-021-01234-z (2021).
Jan, I. et al. Development of MAS-derived wheat genotypes with high GPC, PHST and rust resistance. (2023).
Kumar, J. et al. Marker-assisted selection for pre‐harvest sprouting tolerance and leaf rust resistance in bread wheat. Plant. Breed. 129, 617–621. https://doi.org/10.1111/j.1439-0523.2009.01758.x (2010).
Shorinola, O. et al. The wheat Phs-A1 pre-harvest sprouting resistance locus delays the rate of seed dormancy loss and maps 0.3 cM distal to the PM19 genes in UK germplasm. J. Exp. Bot. 67, 4169–4178. https://doi.org/10.1093/jxb/erw194 (2016).
Jaiswal, V., Mir, R., Mohan, A., Balyan, H. & Gupta, P. Association mapping for pre-harvest sprouting tolerance in common wheat (Triticum aestivum L). Euphytica 188, 89–102. https://doi.org/10.1007/s10681-012-0713-1 (2012).
Kulwal, P. et al. Association mapping for pre-harvest sprouting resistance in white winter wheat. Theor. Appl. Genet. 125, 793–805. https://doi.org/10.1007/s00122-012-1872-0 (2012).
Rehman Arif, M. et al. An association mapping analysis of dormancy and pre-harvest sprouting in wheat. Euphytica 188, 409–417. https://doi.org/10.1007/s10681-012-0705-1 (2012).
Kumar, S. et al. Maximizing the identification of QTL for pre-harvest sprouting resistance using seed dormancy measures in a white-grained hexaploid wheat population. Euphytica 205, 287–309. https://doi.org/10.1007/s10681-015-1460-x (2015).
He, J. et al. Identification of QTLs and a candidate gene for reducing pre-harvest sprouting in Aegilops tauschii–Triticum aestivum chromosome segment substitution lines. Int. J. Mol. Sci. 22, 3729. https://doi.org/10.3390/ijms22073729 (2021).
Liton, M. U. A. et al. Identification of loci for pre-harvest sprouting resistance in the highly dormant spring wheat RL4137. Theor. Appl. Genet. 134, 113–124. https://doi.org/10.1007/s00122-020-03685-y (2021).
Li, Y. et al. Genome-wide identification and expression analysis of ADP-ribosylation factors associated with biotic and abiotic stress in wheat (Triticum aestivum L). PeerJ 9, e10963. https://doi.org/10.7717/peerj.10963 (2021).
Dhariwal, R. et al. Mapping pre-harvest sprouting resistance loci in AAC Innova× AAC tenacious spring wheat population. BMC Genom. 22, 1–20. https://doi.org/10.1186/s12864-021-08209-6 (2021).
Khumalo, T. P., Hlongoane, T., Barnard, A. & Tsilo, T. J. Genomic regions influencing preharvest sprouting tolerance in two doubled-haploid wheat populations (Triticum aestivum L). Agronomy 12, 832. https://doi.org/10.3390/agronomy12040832 (2022).
Tai, L. et al. Pre-harvest sprouting in cereals: genetic and biochemical mechanisms. J. Exp. Bot. 72, 2857–2876. https://doi.org/10.1093/jxb/erab024 (2021).
Sehgal, D. et al. Haplotype-based, genome-wide association study reveals stable genomic regions for grain yield in CIMMYT spring bread wheat. Front. Genet. 11, 589490. https://doi.org/10.3389/fgene.2020.589490 (2020).
Lin, M. et al. Genome-wide association analysis on pre-harvest sprouting resistance and grain color in US winter wheat. BMC Genom. 17, 1–16. https://doi.org/10.1186/s12864-016-3148-6 (2016).
Juliana, P. et al. Genomic selection for grain yield in the CIMMYT wheat breeding program—status and perspectives. Front. Plant Sci. 11, 564183. https://doi.org/10.3389/fpls.2020.564183 (2020).
Yao, F. et al. Genome-wide association analysis of stable stripe rust resistance loci in a Chinese wheat landrace panel using the 660K SNP array. Front. Plant Sci. 12, 783830. https://doi.org/10.3389/fpls.2021.783830 (2021).
Abrouk, M. et al. Population genomics and haplotype analysis in spelt and bread wheat identifies a gene regulating glume color. Commun. Biol. 4, 375. https://doi.org/10.1038/s42003-021-01908-6 (2021).
Koua, A. P. et al. Genome-wide dissection and haplotype analysis identified candidate loci for nitrogen use efficiency under drought conditions in winter wheat. Plant. Genome. 17, e20394. https://doi.org/10.1002/tpg2.20394 (2024).
Hu, X. et al. Genomic insights into glume pubescence in durum wheat: GWAS and haplotype analysis implicates TdELD1-1A as a candidate gene. Gene 909, 148309. https://doi.org/10.1016/j.gene.2024.148309 (2024).
Trethowan, R. Evaluation and selection of bread wheat (Triticum aestivum L.) for preharvest sprouting tolerance. Aust. J. Agric. Res. 46, 463–474. https://doi.org/10.1071/AR9950463 (1995).
McMaster, G. & Derera, N. Methodology and sample preparation when screening for sprouting damage in cereals. Cereal Res. Communications 4, 251–254 (1976).
Baier, A. Pre-harvest sprouting. Annu. Wheat Newsl. 33, 274 (1987).
Bates, D., Mächler, M., Bolker, B. & Walker, S. Fitting linear mixed-effects models using lme4. J. Stat. Softw. 67, 1–48. https://doi.org/10.18637/jss.v067.i01 (2015).
Henderson, C. R. Best linear unbiased estimation and prediction under a selection model. Biometrics 31, 423–447 (1975).
Doyle, J. J. Isolation of plant DNA from fresh tissue. Focus 12, 13–15 (1990).
Mazumder, A. K. et al. Exploring the genetic diversity and population structure of an ancient hexaploid wheat species Triticum sphaerococcum using SNP markers. BMC Plant Biol. 24, 1188. https://doi.org/10.1186/s12870-024-05968-8 (2024).
Bradbury, P. J. et al. TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics 23, 2633–2635 (2007).
Cleveland, W. S. Robust locally weighted regression and smoothing scatterplots. J. Am. Stat. Assoc. 74, 829–836 (1979).
Breseghello, F. & Sorrells, M. E. Association mapping of kernel size and milling quality in wheat (Triticum aestivum L.) cultivars. Genetics 172, 1165–1177 (2006).
Lipka, A. E. et al. GAPIT: genome association and prediction integrated tool. Bioinformatics 28, 2397–2399. https://doi.org/10.1093/bioinformatics/bts444 (2012).
Jaiswal, V. et al. Genome-wide association study (GWAS) delineates genomic loci for ten nutritional elements in foxtail millet (Setaria italica L). J. Cereal Sci. 85, 48–55. https://doi.org/10.1016/j.jcs.2018.11.006 (2019).
Zhang, R., Jia, G. & Diao, X. GeneHapR: an R package for gene haplotypic statistics and visualization. BMC Bioinform. 24, 199. https://doi.org/10.1186/s12859-023-05318-9 (2023).
Kumar, K. et al. Identification of superior haplotypes for flowering time in Pigeonpea through candidate gene-based association study of a diverse minicore collection. Plant Cell Rep. 43, 156. https://doi.org/10.1007/s00299-024-03230-x (2024).
Gaur, A. et al. GWAS elucidated grain yield genetics in Indian spring wheat under diverse water conditions. Theor. Appl. Genet. 137, 177. https://doi.org/10.1007/s00122-024-04680-3 (2024).
Yang, Y. et al. Multi-locus GWAS of quality traits in bread wheat: mining more candidate genes and possible regulatory network. Front. Plant Sci. 11, 1091. https://doi.org/10.3389/fpls.2020.01091 (2020).
Gupta, A. et al. Multiple origins of Indian Dwarf wheat by mutations targeting the TREE domain of a GSK3-like kinase for drought tolerance, phosphate uptake, and grain quality. Theor. Appl. Genet. 134, 633–645. https://doi.org/10.1007/s00122-020-03719-5 (2021).
Mares, D. J. & Mrva, K. Wheat grain preharvest sprouting and late maturity alpha-amylase. Planta 240, 1167–1178. https://doi.org/10.1007/s00425-014-2172-5 (2014).
Kumar, M. et al. GWAS and genomic prediction for pre-harvest sprouting tolerance involving sprouting score and two other related traits in spring wheat. Mol. Breeding. 43, 14. https://doi.org/10.1007/s11032-023-01357-5 (2023).
Wu, J. & Carver, B. F. Sprout damage and preharvest sprout resistance in hard white winter wheat. Crop Sci. 39, 441–447. https://doi.org/10.2135/cropsci1999.0011183X0039000200024x (1999).
Okuyama, L., Riede, C. & Kohli, M. Association between falling number and grain characteristics to evaluate preharvest sprouting in wheat. J. Exp. Biol. Agric. Sci. https://doi.org/10.18006/2020.8(6).758.764 (2020).
Zanetti, S., Winzeler, M., Keller, M., Keller, B. & Messmer, M. Genetic analysis of pre-harvest sprouting resistance in a wheat x spelt cross. Crop Sci. 40, 1406–1417 (2000).
Xu, H. et al. Genome-wide association study and genomic selection of spike-related traits in bread wheat. Theor. Appl. Genet. 137, 131. https://doi.org/10.1007/s00122-024-04640-x (2024).
Glover, N. M. et al. Small-scale gene duplications played a major role in the recent evolution of wheat chromosome 3B. Genome Biol. 16, 1–13. https://doi.org/10.1186/s13059-015-0754-6 (2015).
Rimbert, H. et al. High throughput SNP discovery and genotyping in hexaploid wheat. PloS One 13, e0186329. https://doi.org/10.1371/journal.pone.0186329 (2018).
Zimmerman, S. J., Aldridge, C. L. & Oyler-McCance, S. J. An empirical comparison of population genetic analyses using microsatellite and SNP data for a species of conservation concern. BMC Genom. 21, 1–16. https://doi.org/10.1186/s12864-020-06783-9 (2020).
Sesia, M., Bates, S., Candès, E., Marchini, J. & Sabatti, C. False discovery rate control in genome-wide association studies with population structure. Proc. Natl. Acad. Sci. 118, e2105841118. https://doi.org/10.1073/pnas.2105841118 (2021).
Joukhadar, R., Daetwyler, H. D., Gendall, A. R. & Hayden, M. J. Artificial selection causes significant linkage disequilibrium among multiple unlinked genes in Australian wheat. Evol. Appl. 12, 1610–1625. https://doi.org/10.1111/eva.12807 (2019).
Danguy des Déserts, A., Bouchet, S., Sourdille, P. & Servin, B. Evolution of recombination landscapes in diverging populations of bread wheat. Genome Biol. Evol. 13, evab152. https://doi.org/10.1093/gbe/evab152 (2021).
Flint-Garcia, S. A., Thornsberry, J. M. & Buckler, E. S. IV. Structure of linkage disequilibrium in plants. Annu. Rev. Plant Biol. 54, 357–374. https://doi.org/10.1146/annurev.arplant.54.031902.134907 (2003).
Sheoran, S. et al. Uncovering genomic regions associated with 36 agro-morphological traits in Indian spring wheat using GWAS. Front. Plant Sci. 10, 527. https://doi.org/10.3389/fpls.2019.00527 (2019).
Krishnappa, G. et al. Genome-wide association study for grain protein, thousand kernel weight, and normalized difference vegetation index in bread wheat (Triticum aestivum L). Genes 14, 637. https://doi.org/10.3390/genes14030637 (2023).
Pang, Y. et al. High-resolution genome-wide association study identifies genomic regions and candidate genes for important agronomic traits in wheat. Mol. Plant. 13, 1311–1327. https://doi.org/10.1016/j.molp.2020.07.008 (2020).
Chou, C. H., Lin, H. S., Wen, C. H. & Tung, C. W. Patterns of genetic variation and QTLs controlling grain traits in a collection of global wheat germplasm revealed by high-quality SNP markers. BMC Plant Biol. 22, 455. https://doi.org/10.1186/s12870-022-03844-x (2022).
Hao, C. et al. Resequencing of 145 landmark cultivars reveals asymmetric sub-genome selection and strong founder genotype effects on wheat breeding in China. Mol. Plant. 13, 1733–1751. https://doi.org/10.1016/j.molp.2020.09.001 (2020).
Chao, S., Xu, S. S., Elias, E. M., Faris, J. D. & Sorrells, M. E. Identification of chromosome locations of genes affecting preharvest sprouting and seed dormancy using chromosome substitution lines in tetraploid wheat (Triticum turgidum L). Crop Sci. 50, 1180–1187. https://doi.org/10.2135/cropsci2009.10.0589 (2010).
Munkvold, J. D., Tanaka, J., Benscher, D. & Sorrells, M. E. Mapping quantitative trait loci for preharvest sprouting resistance in white wheat. Theor. Appl. Genet. 119, 1223–1235. https://doi.org/10.1007/s00122-009-1123-1 (2009).
Kulwal, P. et al. Mapping of a major QTL for pre-harvest sprouting tolerance on chromosome 3A in bread wheat. Theor. Appl. Genet. 111, 1052–1059. https://doi.org/10.1007/s00122-005-0021-4 (2005).
Anderson, J. A., Sorrells, M. E. & Tanksley, S. D. RFLP analysis of genomic regions associated with resistance to preharvest sprouting in wheat. Crop Sci. 33, 453–459. https://doi.org/10.2135/cropsci1993.0011183X003300030008x (1993).
Flintham, J., Adlam, R., Bassoi, M., Holdsworth, M. & Gale, M. Mapping genes for resistance to sprouting damage in wheat. Euphytica 126, 39–45. https://doi.org/10.1023/A:1019632008244 (2002).
Groos, C. et al. Study of the relationship between pre-harvest sprouting and grain color by quantitative trait loci analysis in a white× red grain bread-wheat cross. Theor. Appl. Genet. 104, 39–47. https://doi.org/10.1007/s001220200004 (2002).
Kato, K., Nakamura, W., Tabiki, T., Miura, H. & Sawada, S. Detection of loci controlling seed dormancy on group 4 chromosomes of wheat and comparative mapping with rice and barley genomes. Theor. Appl. Genet. 102, 980–985. https://doi.org/10.1007/s001220000494 (2001).
Mares, D. & Mrva, K. Mapping quantitative trait loci associated with variation in grain dormancy in Australian wheat. Aust. J. Agric. Res. 52, 1257–1265. https://doi.org/10.1071/AR01049 (2001).
Miura, H., Sato, N., Kato, K., Amano, Y. & Mcintosh, R. Detection of chromosomes carrying genes for seed dormancy of wheat using the backcross reciprocal monosomic method. Plant. Breed. 121, 394–399. https://doi.org/10.1046/j.1439-0523.2002.741382.x (2002).
Roy, J. et al. Identification of a microsatellite on chromosomes 6B and a STS on 7D of bread wheat showing an association with preharvest sprouting tolerance. Theor. Appl. Genet. 99, 336–340. https://doi.org/10.1007/s001220051241 (1999).
Zanetti, S., Winzeler, M., Keller, M., Keller, B. & Messmer, M. Genetic analysis of pre-harvest sprouting resistance in a wheat x spelt cross. Crop Sci. 40, 1406–1417. https://doi.org/10.2135/cropsci2000.4051406x (2000).
Appels, R., Francki, M. & Chibbar, R. Advances in cereal functional genomics. Funct. Integr. Genom. 3, 1. https://doi.org/10.1007/s10142-002-0073-3 (2003).
Barrero, J. M. et al. Transcriptomic analysis of wheat near-isogenic lines identifies PM19-A1 and A2 as candidates for a major dormancy QTL. Genome Biol. 16, 93. https://doi.org/10.1186/s13059-015-0665-6 (2015).
Bailey, P. et al. Genetic map locations for orthologous Vp1 genes in wheat and rice. Theor. Appl. Genet. 98, 281–284 (1999).
Kulwal, P., Singh, R., Balyan, H. & Gupta, P. Genetic basis of pre-harvest sprouting tolerance using single-locus and two-locus QTL analyses in bread wheat. Funct. Integr. Genom. 4, 94–101. https://doi.org/10.1007/s10142-004-0105-2 (2004).
Torada, A. et al. A causal gene for seed dormancy on wheat chromosome 4A encodes a MAP kinase kinase. Curr. Biol. 26, 782–787. https://doi.org/10.1016/j.cub.2016.01.063 (2016).
Chen, C. X., Cai, S. B. & Bai, G. H. A major QTL controlling seed dormancy and pre-harvest sprouting resistance on chromosome 4A in a Chinese wheat landrace. Mol. Breeding. 21, 351–358. https://doi.org/10.1007/s11032-007-9135-5 (2008).
Ogbonnaya, F. C. et al. Genetic and QTL analyses of seed dormancy and preharvest sprouting resistance in the wheat germplasm CN10955. Theor. Appl. Genet. 116, 891–902. https://doi.org/10.1007/s00122-008-0712-8 (2008).
Dallinger, H. G. et al. Genome-wide association mapping for pre‐harvest sprouting in European winter wheat detects novel resistance QTL, pleiotropic effects, and structural variation in multiple genomes. Plant. Genome. 17, e20301. https://doi.org/10.1002/tpg2.20301 (2024).
Lin, M. et al. Genome-wide association analysis on pre-harvest sprouting resistance and grain color in US winter wheat. BMC Genom. 17, 794. https://doi.org/10.1186/s12864-016-3148-6 (2016).
Li, L. et al. Genome-wide linkage mapping for preharvest sprouting resistance in wheat using 15K single-nucleotide polymorphism arrays. Front. Plant Sci. 12, 749206. https://doi.org/10.3389/fpls.2021.749206 (2021).
Cabral, A. L. et al. Identification of candidate genes, regions and markers for pre-harvest sprouting resistance in wheat (Triticum aestivum L.). BMC Plant Biol. 14, 340. https://doi.org/10.1186/s12870-014-0340-1 (2014).
Zhou, K., Yang, J., Wang, Z. X. & Wang, J. R. Sequence analysis and expression profiles of TaABI5, a pre-harvest sprouting resistance gene in wheat. Genes Genomics. 39, 161–171. https://doi.org/10.1007/s13258-016-0483-6 (2017).
Zhou, Y. et al. Genome-wide association study for pre-harvest sprouting resistance in a large germplasm collection of Chinese wheat landraces. Front. Plant Sci. 8, 401. https://doi.org/10.3389/fpls.2017.00401 (2017).
Park, J. et al. Epigenetic switch from repressive to permissive chromatin in response to cold stress. Proc. Natl. Acad. Sci. 115, E5400–E5409. https://doi.org/10.1073/pnas.1721241115 (2018).
Wang, Z. et al. Counteraction of ABA-mediated inhibition of seed germination and seedling establishment by ABA signaling terminator in Arabidopsis. Mol. Plant. 13, 1284–1297. https://doi.org/10.1016/j.molp.2020.06.011 (2020).
Rikiishi, K., Sugimoto, M. & Maekawa, M. Transcriptomic analysis of developing seeds in a wheat (Triticum aestivum L.) mutant RSD32 with reduced seed dormancy. Breed. Sci. 71, 155–166 (2021).
Liu, S. et al. Cloning and characterization of a critical regulator for preharvest sprouting in wheat. Genetics 195, 263–273. https://doi.org/10.1534/genetics.113.152330 (2013).
Singh, A. et al. Genetics of pre-harvest sprouting resistance in a cross of Canadian adapted durum wheat genotypes. Mol. Breeding. 33, 919–929. https://doi.org/10.1007/s11032-013-0006-y (2014).
Chen, H. et al. AtPER1 enhances primary seed dormancy and reduces seed germination by suppressing the ABA catabolism and GA biosynthesis in Arabidopsis seeds. Plant J. 101, 310–323. https://doi.org/10.1111/tpj.14542 (2020).
Hilhorst, H. & Karssen, C. Seed dormancy and germination: the role of abscisic acid and gibberellins and the importance of hormone mutants. Plant. Growth Regul. 11, 225–238. https://doi.org/10.1007/BF0002456 (1992).
Sohn, S. I. et al. Seed dormancy and pre-harvest sprouting in rice—an updated overview. Int. J. Mol. Sci. 22, 11804. https://doi.org/10.3390/ijms222111804 (2021).
An, J. et al. Wheat F-box protein TaFBA1 positively regulates plant drought tolerance but negatively regulates stomatal closure. Front. Plant Sci. 10, 1242. https://doi.org/10.3389/fpls.2019.01242 (2019).
Acknowledgements
The authors are highly thankful to the Director, ICAR-NBPGR, for his constant support of the DBT Wheat Network Project. The authors are also thankful to the Department of Biotechnology for providing the opportunity to work on this project.
Funding
The authors are grateful to the Indian Council of Agricultural Research for supporting this research under the ICAR-NBPGR-DBT Wheat Network Project (No. BT/Ag/Network/Wheat/2019-20).
Author information
Contributions
DS: Conceptualization, data curation, formal analysis, investigation, methodology, writing – original draft, writing – review and editing; AM: Data curation, software, investigation, methodology; TS: Software, formal analysis, writing – original draft; AK: Software, formal analysis, writing – original draft; LB: Data curation, software; AY: Data curation, methodology; RRM: Supervision, writing – review and editing; VG: Supervision, writing – original draft, writing – review and editing; PK: Investigation, supervision, writing – review and editing; NB: Conceptualization, software, writing – review and editing; VKV: Investigation, supervision, writing – review and editing; KBG: Writing – review and editing; AKS: Investigation, supervision, writing – review and editing; GPS: Investigation, supervision, writing – review and editing; SK: Conceptualization, investigation, writing – review and editing.
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Sharma, D., Mohapatra, A., Sivakumar, T. et al. Dissecting the genetic architecture of pre-harvest sprouting tolerance in Indian dwarf wheat (Triticum sphaerococcum) by multi-locus association analysis. Sci Rep 15, 43929 (2025). https://doi.org/10.1038/s41598-025-27797-x
DOI: https://doi.org/10.1038/s41598-025-27797-x