Introduction

Rice (Oryza sativa L.) stands as a crucial food crop globally, with the imperative to enhance its yield to cater to the increasing demands of the world’s population1. As pan-genomic research progresses, there is a burgeoning abundance of variant information in terms of quantity and diversity. Delving into these genetic variants to unearth new beneficial genes forms a pivotal basis for rice genetic enhancement2,3,4. Among various population genetics techniques, GWAS remains the predominant and representative approach for unraveling complex traits. It can be used to explore and assess trait-associated functional genes efficiently across extensive germplasm resources5,6,7. Nonetheless, GWAS has several limitations: it is sensitive to population structure and sample size, and its capability to analyze low-frequency variants is constrained, hampering its efficacy in pinpointing new QTLs8.

Meta-analysis can provide a remedy by integrating outcomes from multiple independent GWAS experiments, thereby enlarging the sample size and notably enhancing the precision and scope of gene mining research. Thus far, meta-GWAS has found extensive application across diverse animal and plant species, including humans9, tomatoes10, and cattle11, underscoring the benefits of meta-GWAS for in-depth gene exploration. In recent decades, extensive GWAS research data have accumulated for rice via various germplasm populations, resulting in the creation of numerous outstanding databases12,13,14. Nevertheless, to the best of our knowledge, meta-GWAS studies that leverage large-scale population data for comprehensive gene mining in rice are still uncommon.

In this study, we carried out a large-scale meta-GWAS analysis to identify new functional genes associated with important rice agronomic traits, utilizing a total of 7765 cultivated rice (Oryza sativa) accessions. Independent GWAS studies were conducted for six distinct agronomic traits across these six panels. Following this, a meta-analysis was performed to consolidate the findings from all panels for each trait, resulting in the identification of 156 QTLs, 116 of which were entirely novel. We subsequently undertook a detailed analysis of two novel QTLs associated with grain size to pinpoint and validate two new genes responsible for rice grain width and grain length. Our research clearly highlights the feasibility and potential benefits of meta-GWAS in the study of rice population genetics, offering a robust tool and practical model for harnessing large-scale and multi-population data to mine favorable genes related to crucial agronomic traits in rice.

Results

Meta-analysis

In this study, we report the largest meta-GWAS analysis conducted in rice research to date (Fig. 1). The raw genomic sequencing data and phenotypic data were sourced from six independent rice population panels, denoted as Panel A–F. These panels encompassed a total of 7765 cultivated rice accessions derived from 126 countries or regions worldwide, including 295415, 245016, 2024, 114017, 49418 and 52519 rice accessions in Panel A–F, respectively (Supplementary Data. 1). Quality control was conducted on the raw sequencing data for all panels using consistent standards, followed by reads mapping and variant calling processes. The raw SNPs from each panel were filtered (MAF 0.05, missing rate 0.1), resulting in 6,604,898 high-quality SNPs, with each panel containing between 2,890,946 and 4,786,218 SNPs. To obtain a high-quality structural variant dataset, the sequencing reads were mapped to a pan-genome, containing 356 rice accessions as described in our previous study4,13, yielding 42,871 high-quality presence/absence variants (PAVs), varying from 9041–34,175 across panels (Supplementary Fig. 1). Six agronomic traits, including grain width, grain length, thousand-grain weight, plant height, heading date, and panicle number, were collected from the six panels (Supplementary Fig. 2). Independent GWAS was carried out between the combined SNP + PAV dataset and phenotypic dataset of each trait for all panels (Supplementary Figs. 3-8). Bonferroni’s correction was applied to GWAS outcomes to set the thresholds (0.05/n, where n is the total variants count) for false positive control and trustworthy conclusions. Across all panels, 9004 significant variants were identified, with individual panel variants ranging from 0–2600, resulting in a total of 40 QTLs for the six agronomic traits (Supplementary Data 2).

Fig. 1: Flowchart of study design.
Fig. 1: Flowchart of study design.The alternative text for this image may have been generated using AI.
Full size image

PAV presence/absence variations, GW grain width, GL grain length, TGW thousand-grain weight, PH plant height, DTH days to heading, PN panicle number, GWAS genome-wide association study, QTL quantitative trait locus.

For each trait, a meta-analysis was performed to consolidate all the available GWAS findings from different panels using the fixed-effect model. By merging multiple independent GWAS experiments, meta-analysis significantly enhanced the significance of the association P-values for the trait-associated variants across nearly all the QTLs (Fig. 2a, Supplementary Fig. 9) leading to an average of 6.79 orders of magnitude increase (ranging from -1.14–98.45). We further compared the identified QTLs between the GWAS and meta-analysis (Supplementary Figs. 1011). Compared with independent GWAS experiments, the meta-GWAS approach detected an equal or greater number of QTLs (ranging from 0%–537.5%) and explained more or a similar degree of heritability (ranging from 0% to 107.3%), utilizing a broader range of rice panels (Fig. 2c, d). This enhancement demonstrated a notably positive correlation with both the quantity of QTLs (R = 0.92, P = 0.0084) and their explained heritability (R = 0.94, P = 0.0051). Notably, almost all the QTLs (40/40) observed in the individual GWAS results were recovered via meta-analysis, but different GWAS may contribute differently to the meta-analysis; e.g., for panels A–F, a total of 4, 6, 1, 1, 2 and 3 GWAS-based QTLs related to grain width were recovered via the meta-analysis, respectively (Supplementary Figs. 38, Supplementary Data 2).

Fig. 2: Statistics of trait-associated variants detected by meta-analysis of GWAS results in 6 rice panels.
Fig. 2: Statistics of trait-associated variants detected by meta-analysis of GWAS results in 6 rice panels.The alternative text for this image may have been generated using AI.
Full size image

a Lead variants of QTLs from GWAS and meta-analysis for different phenotypes. b Illustrated of the 156 QTLs along the rice chromosomes. The colors of the QTL labels represent different phenotypes, as listed in the legend. The density of SNP + PAV is measured in a 100 Kb window for each chromosome. QTLs labeled with underscores were identified by both GWAS and meta-analysis. Inverted triangles in red indicate that overlapping QTLs of different phenotypes. c Percentage increase in QTL number by meta-analysis relative to individual GWAS results. d Percentage increase in heritability explained by QTLs of meta-analysis relative to individual GWAS results. In (c, d), the error bands indicate 95% confidence intervals, p-values were calculated by two-sided spearman correlation test.

The meta-analysis identified a total of 156 QTLs associated with the six traits, with 116 being unique and not detected in the independent GWAS experiments (Supplementary Data. 3). The QTLs are distributed across all 12 chromosomes (Fig. 2b) and their rate of variation explained for each of the six phenotypes was estimated separately (Supplementary Data. 4). We further screened the functional genes located in the QTLs and identified a total of 141 known genes related to the corresponding traits in rice. Specifically, 29 genes were related to grain width, such as GW5 (qGW5-3, P = 8.81e−180), GW6 (qGW6-1, P = 4.39e−11), GW7 (qGW7-3, P = 5.73e−14), etc.; 37 genes were related to grain length, such as GS3 (qGL3-8, P = 1.23e−85), GL6 (qGL6-3, P = 1.15e−11), GL7 (qGL7-4, P = 3.36e−13), and GL10 (qGL10-4, P = 1.06e−11), etc.; 46 genes were related to thousand-grain weight, such as GIF1 (qTGW4-1, P = 1.49e−11), GSE5 (qTGW5-3, P = 1.77e−22), OsFD2 (qTGW6-3, P = 1.40e−13), and FZP (qTGW7-5, P = 9.52e−51), etc.; and 26 genes were related to plant height, such as SD1 (qPH1-4, P = 1.59e−23), OsCCT22 (qPH6-1, P = 2.92e−12), OsMADS15 (qPH7-1, P = 9.22e−10), etc.; and three genes were related to days to heading, such as DTH3 (qDTH3-1, P = 2.11e−20), OsCCT22 (qDTH6-1, P = 1.01e−12), Ehd1 (qDTH10-1, P = 2.45e−10). It is worth mentioning that there were 63 QTLs in which no genes known to be associated with phenotypes were found, and these QTLs are valuable for further analysis. Among the six phenotypes, the grain shape can affect both the yield and rice quality traits of rice, thus QTLs for grain width and length were prioritized for further in-depth analysis. The meticulous exploration of these genes is essential for understanding intricate traits. These findings underscore the precision efficacy, and potential benefits of meta-GWAS analysis in mining genes within rice.

Identification and validation of the candidate gene associated with grain width

Among the 32 grain-width-associated QTLs, 23 were found exclusively in the meta-analysis and could not identified in individual GWAS (Fig. 3a). Among the meta-exclusive QTLs, 15 reported QTLs related to grain width (Supplementary Data. 5). For the remaining 9 meta-exclusive novel QTLs, qGW10-2 presented the lowest associated P-value (lead variant: Chr10_20159929, A > C, P = 1.11e-11) and was selected for further examination (Fig. 3b). Four steps were performed to screen candidate genes for qGW10-2 within a 200 kb vicinity of this variant; (1), Variants in all the genes in this region were assessed, focusing on missense, frameshift, and premature-stop-codon mutations for subsequent haplotype analysis; (2), Genes with distinct haplotypes exhibiting significantly different traits across the six panels were selected for further evaluation using haplotype analysis; (3), Genes lacking proper annotation and with a median expression below 0 in seeds were excluded by integrating gene function data form ricedata.cn/gene/, and transcriptome information from plantrnadb.com/ricerna/20. Two genes were identified (LOC_Os10g37280 and LOC_Os10g37640). (4), Co-expression networks of these two genes were further evaluated, revealing that LOC_Os10g37280 was co-expressed with two known genes, LOC_Os02g4797021 related to leaf shape and LOC_Os11g3290022 related to pollen development, whereas LOC_Os10g37640 exhibited co-expression with a known gene, LOC_Os07g4636023 related to grain development. Additionally, we noted that LOC_Os10g37640 was involved in auxin signaling pathway regulation (GO:0010928) and cell elongation regulation (GO:0051510), which was also noted for two known genes related to grain size, GS924 and GLW76. Consequently, we identified LOC_Os10g37640 as a promising candidate gene for qGW10-2 and designated it GW10.2.

Fig. 3: Meta-GWAS for grain width and its functional analysis.
Fig. 3: Meta-GWAS for grain width and its functional analysis.The alternative text for this image may have been generated using AI.
Full size image

a Manhattan plot for the meta-analysis of genome-wide association study of grain width. The genes labeled in the figure are the known grain width genes associated with meta-GWAS, and the red one is the identified candidate gene. The black line represents the genome-wide significance threshold at 7.52e−9. b P-values of Chr10:20159929 in GWAS and meta-analysis. c Gene structure, showing 3 missense mutation SNPs in the exon region. d Boxplot showing the grain width at different haplotypes in each panel. Statistical differences according to two-tailed Student’s t-test. The pie chart displays the subpopulation distribution of the three haplotypes in each panel (Osi, Oryza sativa indica; Osj, Oryza sativa japonica). e The difference of grain width of Chromosome Segment Substitution Lines in different accessions. g Difference in grain width between wild type and mutant. Scale bars: 1 cm. Values are presented as means ± SD in f and h, Student’s t-test. CSSL-1hap3 (n = 246) and CSSL-2hap2 (n = 191) in f. Wild type (n = 5) and mutants (n = 5) in h. In d, the horizontal line represents the median value, the lower and upper quartiles represent the 25th and 75th percentile, and the whiskers show the maximum and minimum values. Source data are provided as a Source Data file.

Three SNPs resulting in missense mutations were detected for GW10.2 in its exon region (Fig. 3c), leading to the formation of three haplotypes (Supplementary Table. 1). Among them, hap2 was the predominant haplotype, displaying a significantly larger grain width than the other two haplotypes across all six panels. Additionally, we investigated the frequency distribution of these three haplotypes in the indica and japonica subgroups. The result revealed that the hap2 had a notably higher frequency averaging 95.14% in japonica rice, only 1.75% in indica rice and 3.12% in other rice (Fig. 3d). This suggests that hap2 plays a role in the divergence of grain size between the two subspecies in rice.

The function of GW10.2 was further verified by using a chromosome segment substitution line (CSSL) population (PA64s × Nipponbare) and mutant plants. From a CSSL population reported in our previous study25, we selected and evaluated a CSSL (CSSL-2hap2) containing the hap2 of GW10.2 in a Nipponbare-derived fragment and a CSSL (CSSL-1hap3) containing the hap3 in a PA64s-derived fragment (Supplementary Fig. 12). The CSSL-2hap2 showed a significantly larger (8.56%) grain width than that of CSSL-1hap3 (Fig. 3e, f). Furthermore, we utilized the clustered regularly interspaced palindromic repeats/CRISPR-associated protein 9 (CRISPR/Cas9) system to knock out GW10.2 in Zhonghua11 (ZH11), resulting in homozygous individuals with a GW10.2 mutation (Supplementary Fig. 13a). The mutant plants exhibited a significant reduction in grain width, with decreases of 4.3% for those cultivated in Shenzhen (2023) and 6.4% for those grown in Hangzhou (2024), compared to the wild type (Fig. 3g, h; Supplementary Fig. 14). These findings confirm that GW10.2 could play a positive role in regulating grain width in rice.

Identification and validation of the candidate gene for grain length

Among the 51 grain-length-associated QTLs examined, 43 were novel and were exclusively detected through meta-analysis (Fig. 4a). A novel QTL qGL11-1 for further analysis using the same strategy as above for grain width (Supplementary Data. 5), with its lead variant being Chr11:2627276 (A > G, p = 2.66e−11), showing relatively low significance across all panels (Fig. 4b). Employing the same strategy as previously described, we obtained five genes within the 200 kb region surrounding the variants (LOC_Os11g05480, LOC_Os11g05552, LOC_Os11g05570, LOC_Os11g05640, LOC_Os11g05930). Analysis of co-expression networks of these five genes indicated that only LOC_Os11g05930 was co-expressed with three known genes (OsPRR1, OsNST1, and OsFKF1)26,27,28 involved in regulating grain development. Consequently, LOC_Os11g0593029 was considered the candidate gene for qGL11-1 and was designated GL11.

Fig. 4: Meta-GWAS for grain length and its functional analysis.
Fig. 4: Meta-GWAS for grain length and its functional analysis.The alternative text for this image may have been generated using AI.
Full size image

a Manhattan plot for meta-analysis of genome-wide association study of grain length. The genes labeled in the figure are the known grain length genes associated with meta-GWAS, and the red one is the identified candidate gene. The black line represents the genome-wide significance threshold at 7.52e−9. b P-values of Chr11:2627276 in GWAS and meta-analysis. c Gene structure, showing 2 missense mutation SNPs in the exon region. d Boxplot showing the grain length at different haplotypes in each panel. Statistical differences according to two-tailed Student’s t-test. The pie chart displays the subpopulation distribution of the three haplotypes in each panel (Osi, Oryza sativa indica; Osj, Oryza sativa japonica). e Difference in grain length between wild type (n = 19) and mutants (n = 19). Scale bars: 1 cm. Values are presented as means ± SD in f, Student’s t-test. In d, the horizontal line represents the median value, the lower and upper quartiles represent the 25th and 75th percentile, and the whiskers show the maximum and minimum values. Source data are provided as a Source Data file.

Two SNPs resulting in missense mutations were detected for GL11 in its exon region (Fig. 4c), forming three haplotypes in all accessions, with hap1 and hap2 being the predominant ones (Supplementary Table. 1). Across all panels, hap2 emerged as the dominant haplotype, exhibiting a significantly larger grain length (average 4.42%) compared to the other two haplotypes across all six panels. Moreover, hap2 had a markedly higher frequency of 91.83% in indica rice, only 0.79% in japonica rice and 7.39% in other rice accessions (Fig. 4d).

To further validate the biological function of GL11, we created knock-out mutants for GL11 by using CRISPR/Cas9 technology (Supplementary Fig. 13b). These mutant plants showed a notable increase (9.83%) in grain length compared with the wild type (Fig. 4e, f), indicating that GL11 functions as a negative regulator of rice grain length.

Discussion

Rice yield is typically influenced by factors such as the effective tiller number, grains per panicle, and grain weight30. Grain size is closely associated with grain weight and overall rice quality31. Over the past decade, numerous genes that regulate grain size have been identified. However, as grain size and rice quality are typical quantitative trait regulated by multiple genes, the molecular regulatory mechanisms involving most of these genes remain poorly understood32. Genome-wide association studies (GWAS), which have been widely employed for decades33,34,35, have limitations in mining genes from populations with limited size and complex population structures, particularly for complex phenotypes such as yield-related traits. Owing to insufficient statistical power and the influence of complex environmental factors, GWAS can explain only a small portion of the missing heritability36. As previously reported, meta-analysis facilitated genetic analysis across different populations, enhancing the identification of complex traits37. Furthermore, it enhances the detection rate of rare mutations and subsequently improve their contribution to phenotypic heritability38. Thus, the meta-analysis serves as an effective tool for these endeavors.

In this study, we systematically applied and comprehensively evaluated the meta-GWAS strategy for mining genes associated with important agronomic traits in rice. We identified 156 QTLs, 116 of which are being novel and present valuable genetic resources for mining new favorable genes in rice. Two of these novel QTLs were chosen for further analysis, leading to the identification of two candidate genes, GW10.2 and GL11 related to grain size. Prior research has indicated that auxin can regulate cell size, thereby influencing grain size39. GW10.2 is involved in biological processes such as the regulation of the auxin signaling pathway and cell elongation, suggesting that GW10.2 might modulate grain width through the auxin pathway. Numerous studies have highlighted that several genes, including OsGATA6 and Ghd740,41, can coordinate the regulation of heading time and grain size. GL11 may be a novel factor in coordinating the regulation of grain length in rice. Additionally, previous studies have shown that GW10.2 can regulate leaf pillow development, leaf inclination and ear development42 and GL11 inhibits heading under long day conditions and promotes heading under short day conditions29. Consequently, we hypothesize that both GW10.2 and GL11 act as pleiotropic genes for different important agronomic traits and can provide useful gene resources for the molecular design of rice breeding.

On the basis of our findings, several advantages of meta-analysis in our study are evident: (1) integration of rich phenotypic data and a large sample size from diverse populations; (2) attainment of a more robust and precise statistical effect through the amalgamation of multiple independent experiments, particularly for variants with low frequency; (3) facilitation of a more thorough and systematic analysis of complex agronomic traits; and (4) significant cost savings in sequencing and phenotypic management by leveraging the abundant genotypic and phenotypic datasets amassed in rice research. Furthermore, we also observed that meta-GWAS results from multiple panels proved highly effective in detecting QTLs, underscoring the importance of collecting as many GWAS dataset as possible to fully utilize the advantages of meta-analysis. To date, the field of rice GWAS has been abundant43,44,45, which have established a good foundation for subsequent meta-analyses. Our study can thus serve as a research model for applying meta-GWAS in rice population genetics and gene mining studies. Nevertheless, given the filtering of low-frequency variant sites (e.g., MAF < 0.05) in the GWAS workflow, GWAS and its meta-analysis based on natural populations may have difficulty effectively identifying rare QTLs within populations. Moreover, parameters such as the selection of the threshold for P-values of associated loci and the size and composition of the population, which affect the efficiency of GWAS analysis, are crucial factors influencing meta-analysis.

In summary, we conducted a meta-GWAS study encompassing six rice agronomic trait by integrating genotypic and phenotypic datasets from six rice panels. This effort led to the identification of 156 potential QTLs, in which two QTLs related to grain size were selected for in-depth analysis. Through validation experiments, we screened and validated two functional genes associated with grain width and grain length, denoted as GW10.2 and GL11, respectively. Our study demonstrates the successful application of meta-GWAS analysis strategies in gene mining from multiple population panels to glean valuable gene insights for modern rice breeding. Furthermore, this research provides a novel approach to fully utilize the wealth of rice genotype and phenotypic datasets amassed over recent decades, significantly propelling efforts to elucidate pivotal agronomic traits in rice.

Methods

Plant materials and growth conditions

The 202 rice accessions of panel-C were grown in Shenzhen, China (N22°32′, E114°03′). Seeds were sown on 5 July 2022. At 25 days after sowing, the seedings were transplanted into a five-row plot with 5 individuals in each row at a spacing of 25 cm × 25 cm. The field management practices, including irrigation, fertilization, and pest control, adhere to standard agricultural protocols. The grain traits, including grain width, grain length and thousand-grain weight were measured in the laboratory after harvest. The phenotypes of other five panels were as described in their original refs. 15,16,17,18,19, and we integrated the location and year of planting in Supplementary Table 2. All phenotypic data can be found in the Source Data.

Variant calling

We collected raw short sequencing data and phenotypic data from six independent experiments. Trimmomatic46 was used for quality control of the short sequencing data. (Version: 0.39, parameter: MINLEN: 75 LEADING: 20 TRAILING: 20 SLIDINGWINDOW: 5:20; MINLEN = 40, while the read length is <75 bp). BWA47 was used to map the quality control data to the Nipponbare genome (MSU v7.0)48. (Version: v.0.7.17-r1188, default parameter). SNP calling was subsequently conducted via Sentieon pipeline49 (Version: genomics-202112.02, default parameter). Based on the pan-genome we constructed in our previous studies4,13, PAV calling was conducted with quality control data and the pan-genome by using VG Giraffe50 and SURVIVOR (Version: v.1.0.7, parameters: 1000 1 1 -1 -1 50). PAVs of low-quality or unexpected length (>1 Mb or <50 bp) were removed. Quality control of removed PAVs was performed using VCFtools51 for subsequent analysis. (Version: 0.1.16, parameter --maf 0.05).

SNP quality control and genotype imputation

We used VCFtools51 to filter the SNPs with same standard for each panel. (Version: 0.1.16, parameter: --maf 0.05 --max-missing 0.9). Then, for three panels (Panel-D, Panel-E, Panel-F) with low sequencing depth (average 7× or below)52, we used an online method for genotype imputation (https://gong_lab.hzau.edu.cn/Plant_imputeDB/)53 after quality control to ensure the accuracy of the imputation. In the end, we obtained high quality SNPs for each panel, panel-A, 3,038,707; panel-B, 2,890,946; panel-C, 4,786,218; panel-D, 3,353,170; panel-E, 4,325,815 and panel-F, 4,321,409. A total of 6,604,898 high-quality SNPs were eventually retained for meta-analysis.

Genome-wide association analysis

A dataset of SNPs and PAVs was used for GWAS analysis. Principal component analysis (PCA) was conducted by PLINK54 (Version: v1.90b6.21, parameters: --pca 5). Kinship was conducted by GEMMA55 (version: gemma-0.98.5-linux-static-AMD64, parameters: -gk 2). The first five principal components and standardized matrix of kinship were used as covariates. GWAS was performed using a mixed linear model in genome-wide efficient mixed model association software (GEMMA, parameters: lmm 1 -k -c). The threshold for GWAS was calculated using Bonferroni56 test (0.05/n, where n is the total number of variants). CMplot R-package57 was used to plot Manhattan plots and QQ plots to show the results of all GWSA results.

Meta-analysis

We performed an independent meta-analysis of all phenotypes by collecting the GWAS summary data available from each panel. Genome inflation is a frequent occurrence in association analysis. There are many reasons for genome inflation, such as population structure, cryptic relatedness and others58, and the GenABEL R-package is used to calculate the genome inflation factor59. Meta-analysis was performed using the inverse variance-weighted fixed-effect model in METAL60 with genomic control10. Genome inflation control was performed via the following equation:

$${SEcorrected}={SE}\times \sqrt{\lambda }$$
(1)

where SE denotes the standard deviation obtained via GWAS, and where \(\lambda\) denotes the genome inflation factor for each GWAS finding.

Significant sites and QTLs statistics

The genotype frequencies of the significant variants obtained from GWAS and meta-analysis of all phenotypes were calculated with VCFtools51 (parameter: --freq). The QTL was defined as a chromosomal region where adjacent pairs of significant variants were <200 kb from each other, according to the average LD decay of six panels (195.5 kb, Supplementary Fig. 15) estimated by plink (Version: v1.90b6.21, parameters: --r2 -ld-window-kb 1000 –ld-window 99999 ---ld-window-r2 0). Within each QTL, the most significant variant was taken as the lead variant.

Heritability estimation

Based on the lead variant in each QTL, all the variants and phenotypes in the upstream and downstream 200 kb of the lead variant were used to estimate heritability. Plink54 was used for principal component analysis of the obtained variants, and the first five principal components were used as covariables for heritability estimation. The GRM matrix is then constructed using GCTA software61. (Version: gcta-1.94.1, parameter: --make-grm-alg 1 --make-grm) GTCA was used to calculate the heritability of QTLs obtained by GWAS and meta-analysis for each panel, respectively. (parameter: --reml --qcovar --grm). Heritability of QTLs was performed via the following equation:

$${{\rm{h}}}^{2}={{\rm{V}}}_{{\rm{G}}}/{{\rm{V}}}_{{\rm{P}}}$$
(2)

where VG is the genotypic variance and VP is the phenotypic variance.

Construction of CRISPR/Cas9 vectors and transformation

The guide RNA (sgRNA) sequences were designed according to the predictions of the CRISPR-P program62. The construction of the CRISPR/Cas9 plant expression vector VK005-01 followed the procedures provided by the company’s reagent kit (Sbigen Biotech, Beijing, China). The Cas9/sgRNA constructs were transformed into Agrobacterium strain EHA105. Two conventional japonica rice varieties Xiushui134 (XS134) and ZhongHua11 (ZH11) were used in this study. Rice callus tissue of XS134 and ZH11 were transformed with the agrobacterium strains containing the Cas9/sgRNA constructs as previously described63 to generate transgenic rice plants. Transgene-free plants were grown in a standard greenhouse and in a paddy field at the Shenzhen experimental station (N22°32′, E114°03′), Guangdong Province, China and Hangzhou (N30°18′, E120°12′), Zhejiang Province, China. Transgenic rice plants were grown in a standard greenhouse.

Primers

All primers used in this study are provided in Supplementary Table 3.

Genotype and phenotype analysis of wild and mutant type of rice plants

Genotyping was performed for mutations in the target genes of the transgenic plants. The PCR products were sequenced via the Sanger method. Decoding of the sequencing chromatograms was carried out as described previously64. DNA sequences were aligned using Clustal65. Agronomic traits were studied in the wild-type parent and T1 generation plants. Rice plants were grown at the Shenzhen experimental station, Guangdong Province, under natural conditions. Plant management followed normal agricultural practices.

Reporting summary

Further information on the research design is available in the Nature Portfolio Reporting Summary linked to this article.