A pangenome of maize provides genetic insights into drought resistance

Yang, Shiping; Wang, Yijie; Huang, Qin; Wang, Mengyuan; Wang, Shuhui; Fu, Xiaomeng; Zhu, Chaohui; Cheng, Jinkui; Liu, Shengxue; Yang, Zhirui; Yang, Ning; Yan, Jianbing; Yang, Xiaohong; Qin, Feng

doi:10.1038/s41588-025-02378-w

Article
Published: 27 October 2025

Nature Genetics volume 57, pages 2831–2841 (2025)

Abstract

Drought poses a severe threat to the stability of crop yields. It is crucial to identify genetic resources and decipher the molecular mechanisms underlying drought resistance in crops. Here we generated high-quality genome assemblies of 25 maize germplasms exhibiting substantial variation in drought resistance. Combined with 31 additional maize genome sequences, a comprehensive pangenome analysis was performed. Rare allelic variations and extensive regulatory diversity were revealed in abscisic acid-related or drought-related genes, which may contribute to the diversity in drought resistance among germplasms. Furthermore, we identified three genes, ZmUGE2, ZmSIL2 and ZmASI3, that enhance maize drought resistance by strengthening mechanical support of the cell wall, regulating stress-responsive gene expression and coordinating male and female inflorescence development, respectively. Thus, this study provides valuable insight into the genetic control of drought resistance in maize at different growing stages. The expanded maize pangenome information serves as a valuable resource for maize genomic research.

**Fig. 1: Phylogenetic analysis of 25 selected maize germplasms and characterization of their drought-related phenotypes.**

**Fig. 2: Maize pangenome analysis and variations in ABA-related and drought-related genes.**

**Fig. 3: *ZmUGE2* positively regulates drought resistance.**

**Fig. 4: *ZmSIL2* negatively regulates drought resistance at the seedling stage.**

**Fig. 5: Natural variation in *ZmASI3* contributes to drought resistance at the flowering stage.**

Data availability

All raw sequencing data referenced in this study are available at the National Center for Biotechnology Information (NCBI) Sequence Read Archive under BioProject accession number PRJNA1103102, including ONT long-read data, Illumina DNA and/or RNA sequencing data, and RNA-seq and ChIP–seq data for transgenic plants. Genome sequences and annotations for the 25 newly assembled maize germplasms and SV information are available via Zenodo at https://doi.org/10.5281/zenodo.16576184 (ref. ⁹⁹). GWAS results related to ZmUGE2, ZmSIL2 and ZmASI3 are available via Zenodo at https://doi.org/10.5281/zenodo.17138577 (ref. ¹⁰⁰). Long-read sequencing data for the previously published genomes were obtained from the NCBI database, including those of the NAM founders¹⁶ (accession number PRJEB31061), Mo17²⁷ (accession number PRJNA358298), SK³¹ (accession number PRJNA531547), CIMBL55²⁵ (accession number PRJNA765111), K0326Y³² (accession number PRJNA539996) and A188³³ (accession number PRJNA635654). Other public maize, teosinte and T. dactyloides genome sequences used in this work are available at https://download.maizegdb.org/. Source data are provided with this paper.

Code availability

All customized scripts used in this study are available via GitHub at https://github.com/YangBioinformatics/Maize-Pan-genome and via Zenodo at https://doi.org/10.5281/zenodo.16631382 (ref. ¹⁰¹).

References

1. The Impact of Disasters and Crises on Agriculture and Food Security (FAO, 2021).
2. Erenstein, O., Chamberlin, J. & Sonder, K. Estimating the global number and distribution of maize and wheat farms. Glob. Food Secur. 30, 100558 (2021).
3. Harrison, M. T., Tardieu, F., Dong, Z., Messina, C. D. & Hammer, G. L. Characterizing drought stress and trait influence on maize yield under current and future conditions. Glob. Chang. Biol. 20, 867–878 (2014).
4. Lobell, D. B., Deines, J. M. & Tommaso, S. D. Changes in the drought sensitivity of US maize yields. Nat. Food 1, 729–735 (2020).
5. Lobell, D. B. et al. Greater sensitivity to drought accompanies maize yield increase in the U.S. Midwest. Science 344, 516–519 (2014).
6. Chen, L. et al. Genome sequencing reveals evidence of adaptive variation in the genus Zea. Nat. Genet. 54, 1736–1745 (2022).
7. Wang, X. et al. Genetic variation in ZmVPP1 contributes to drought tolerance in maize seedlings. Nat. Genet. 48, 1233–1241 (2016).
8. Bolaños, J. & Edmeades, G. O. The importance of the anthesis-silking interval in breeding for drought tolerance in tropical maize. Field Crops Res. 48, 65–80 (1996).
9. Liu, B. et al. Manipulating ZmEXPA4 expression ameliorates the drought-induced prolonged anthesis and silking interval in maize. Plant Cell 33, 2058–2071 (2021).
10. Danilevskaya, O. N. et al. Developmental and transcriptional responses of maize to drought stress under field conditions. Plant Direct 3, e00129 (2019).
11. Fuad-Hassan, A., Tardieu, F. & Turc, O. Drought-induced changes in anthesis-silking interval are related to silk expansion: a spatio-temporal growth analysis in maize plants subjected to soil water deficit. Plant Cell Environ. 31, 1349–1360 (2008).
12. Liu, S. et al. Mapping regulatory variants controlling gene expression in drought response and tolerance in maize. Genome Biol. 21, 163 (2020).
13. Mao, H. et al. A transposable element in a NAC gene is associated with drought tolerance in maize seedlings. Nat. Commun. 6, 8326 (2015).
14. Ricci, W. A. et al. Widespread long-range cis-regulatory elements in the maize genome. Nat. Plants 5, 1237–1249 (2019).
15. Jiao, Y. et al. Improved maize reference genome with single-molecule technologies. Nature 546, 524–527 (2017).
16. Hufford, M. B. et al. De novo assembly, annotation, and comparative analysis of 26 diverse maize genomes. Science 373, 655–662 (2021).
17. Alonge, M. et al. Major impacts of widespread structural variation on gene expression and crop improvement in tomato. Cell 182, 145–161 (2020).
18. Liu, Y. C. et al. Pan-genome of wild and cultivated soybeans. Cell 182, 162–176 (2020).
19. Qin, P. et al. Pan-genome analysis of 33 genetically diverse rice accessions reveals hidden genomic variations. Cell 184, 3542–3558 (2021).
20. Wang, B. et al. De novo genome assembly and analyses of 12 founder inbred lines provide insights into maize heterosis. Nat. Genet. 55, 312–323 (2023).
21. Jayakodi, M. et al. Structural variation in the pangenome of wild and domesticated barley. Nature 636, 654–662 (2024).
22. Jiao, C. et al. Pan-genome bridges wheat structural variations with habitat and breeding. Nature 637, 384–393 (2024).
23. Gui, S. et al. A pan-Zea genome map for enhancing maize improvement. Genome Biol. 23, 178 (2022).
24. Zhou, Y. et al. Graph pangenome captures missing heritability and empowers tomato breeding. Nature 606, 527–534 (2022).
25. Tian, T. et al. Genome assembly and genetic dissection of a prominent drought-resistant maize germplasm. Nat. Genet. 55, 496–506 (2023).
26. Yang, X. et al. Characterization of a global germplasm collection and its potential utilization for analysis of complex quantitative traits in maize. Mol. Breed. 28, 511–526 (2010).
27. Sun, S. et al. Extensive intraspecific gene order and gene structural variations between Mo17 and other maize genomes. Nat. Genet. 50, 1289–1295 (2018).
28. Chen, J. et al. A complete telomere-to-telomere assembly of the maize genome. Nat. Genet. 55, 1221–1231 (2023).
29. Simao, F. A., Waterhouse, R. M., Ioannidis, P., Kriventseva, E. V. & Zdobnov, E. M. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31, 3210–3212 (2015).
30. Ou, S., Chen, J. & Jiang, N. Assessing genome assembly quality using the LTR Assembly Index (LAI). Nucleic Acids Res. 46, e126 (2018).
31. Yang, N. et al. Genome assembly of a tropical maize inbred line provides insights into structural variation and crop improvement. Nat. Genet. 51, 1052–1059 (2019).
32. Li, C. et al. Long-read sequencing reveals genomic structural variations that underlie creation of quality protein maize. Nat. Commun. 11, 17 (2020).
33. Lin, G. et al. Chromosome-level genome assembly of a regenerable maize inbred line A188. Genome Biol. 22, 175 (2021).
34. Huang, Y. et al. OsNCED5, a 9-cis-epoxycarotenoid dioxygenase gene, regulates salt and water stress tolerance and leaf senescence in rice. Plant Sci. 287, 110188 (2019).
35. Uga, Y. et al. Control of root system architecture by DEEPER ROOTING 1 increases rice yield under drought conditions. Nat. Genet. 45, 1097–1102 (2013).
36. Zhang, F. et al. Genomic basis underlying the metabolome-mediated drought adaptation of maize. Genome Biol. 22, 260 (2021).
37. Wu, X. et al. Using high-throughput multiple optical phenotyping to decipher the genetic architecture of maize drought tolerance. Genome Biol. 22, 185 (2021).
38. Zhang, Y. et al. BRITTLE PLANT1 is required for normal cell wall composition and mechanical strength in rice. J. Integr. Plant Biol. 63, 865–877 (2021).
39. Liu, S. et al. The rice BZ1 locus is required for glycosylation of arabinogalactan proteins and galactolipid and plays a role in both mechanical strength and leaf color. Rice 13, 41 (2020).
40. Rosti, J. et al. UDP-glucose 4-epimerase isoforms UGE2 and UGE4 cooperate in providing UDP-galactose for cell wall biosynthesis and growth of Arabidopsis thaliana. Plant Cell 19, 1565–1579 (2007).
41. Kaplan-Levy, R. N., Brewer, P. B., Quon, T. & Smyth, D. R. The trihelix family of transcription factors—light, stress and development. Trends Plant Sci. 17, 163–171 (2012).
42. Yang, N. et al. Two teosintes made modern maize. Science 382, eadg8940 (2023).
43. Sakamoto, H. et al. Arabidopsis Cys2/His2-type zinc-finger proteins function as transcription repressors under drought, cold, and high-salinity stress conditions. Plant Physiol. 136, 2734–2746 (2004).
44. Liu, S. et al. Genome-wide analysis of ZmDREB genes and their association with natural variation in drought tolerance at seedling stage of Zea mays L. PLoS Genet. 9, e1003790 (2013).
45. Wu, Q. et al. Transcription factor ZmEREB97 regulates nitrate uptake in maize (Zea mays) roots. Plant Physiol. 196, 535–550 (2024).
46. Huh, S. U. New function of hypoxia-responsive unknown protein in enhanced resistance to biotic stress. Plant Signal Behav. 16, 1868131 (2021).
47. Knizewski, L., Ginalski, K. & Jerzmanowski, A. Snf2 proteins in plants: gene silencing and beyond. Trends Plant Sci. 13, 557–565 (2008).
48. Deng, Y. et al. Epigenetic regulation of antagonistic receptors confers rice blast resistance with yield balance. Science 355, 962–965 (2017).
49. Gao, M. J. et al. Repression of seed maturation genes by a trihelix transcriptional repressor in Arabidopsis seedlings. Plant Cell 21, 54–71 (2009).
50. Liu, H. et al. Distant eQTLs and non-coding sequences play critical roles in regulating gene expression and quantitative trait variation in maize. Mol. Plant 10, 414–426 (2017).
51. Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575 (2007).
52. Felsenstein, J. PHYLIP—Phylogeny Inference Package (version 3.2). Cladistics 5, 164–166 (1989).
53. Letunic, I. & Bork, P. Interactive Tree Of Life (iTOL) v5: an online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 49, W293–W296 (2021).
54. Hu, J. et al. NextDenovo: an efficient error correction and accurate assembly tool for noisy long reads. Genome Biol. 25, 107 (2024).
55. Hu, J., Fan, J., Sun, Z. & Liu, S. NextPolish: a fast and efficient genome polishing tool for long-read assembly. Bioinformatics 36, 2253–2255 (2020).
56. Li, H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics 34, 3094–3100 (2018).
57. Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
58. Alonge, M. et al. Automated assembly scaffolding using RagTag elevates a new tomato system for high-throughput genome editing. Genome Biol. 23, 258 (2022).
59. Marcais, G. et al. MUMmer4: a fast and versatile genome alignment system. PLoS Comput. Biol. 14, e1005944 (2018).
60. Ou, S. & Jiang, N. LTR_retriever: a highly accurate and sensitive program for identification of long terminal repeat retrotransposons. Plant Physiol. 176, 1410–1422 (2018).
61. Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).
62. Grabherr, M. G. et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat. Biotechnol. 29, 644–652 (2011).
63. Pertea, M. et al. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat. Biotechnol. 33, 290–295 (2015).
64. Liu, R. & Dickerson, J. Strawberry: fast and accurate genome-guided transcript reconstruction and quantification from RNA-Seq. PLoS Comput. Biol. 13, e1005851 (2017).
65. Trapnell, C. et al. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat. Protoc. 7, 562–578 (2012).
66. Song, L., Sabunciyan, S. & Florea, L. CLASS2: accurate and efficient splice variant annotation from RNA-seq reads. Nucleic Acids Res. 44, e98 (2016).
67. Venturini, L., Caim, S., Kaithakottil, G. G., Mapleson, D. L. & Swarbreck, D. Leveraging multiple transcriptome assembly methods for improved gene structure annotation. Gigascience 7, giy093 (2018).
68. Mapleson, D., Venturini, L., Kaithakottil, G. & Swarbreck, D. Efficient and accurate detection of splice junctions from RNA-seq with Portcullis. Gigascience 7, giy131 (2018).
69. Haas, B. J. et al. De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis. Nat. Protoc. 8, 1494–1512 (2013).
70. Camacho, C. et al. BLAST+: architecture and applications. BMC Bioinform. 10, 421 (2009).
71. Bruna, T., Hoff, K. J., Lomsadze, A., Stanke, M. & Borodovsky, M. BRAKER2: automatic eukaryotic genome annotation with GeneMark-EP+ and AUGUSTUS supported by a protein database. NAR Genom. Bioinform. 3, lqaa108 (2021).
72. Tarailo-Graovac, M. & Chen, N. Using RepeatMasker to identify repetitive elements in genomic sequences. Curr. Protoc. Bioinform. 25, 4–10 (2009).
73. Dainat, J. & Pucholt, D. H. AGAT: another Gff analysis toolkit to handle annotations in any GTF. v.0.6.0. Zenodo https://doi.org/10.5281/zenodo.4637977 (2021).
74. Haas, B. J. et al. Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies. Nucleic Acids Res. 31, 5654–5666 (2003).
75. Soderlund, C. et al. Sequencing, mapping, and analysis of 27,455 maize full-length cDNAs. PLoS Genet. 5, e1000740 (2009).
76. Wang, B. et al. A comparative transcriptional landscape of maize and sorghum obtained by single-molecule sequencing. Genome Res. 28, 921–932 (2018).
77. Zhang, R. G. et al. TEsorter: an accurate and fast method to classify LTR-retrotransposons in plant genomes. Hortic. Res. 9, uhac017 (2022).
78. Campbell, M. S. et al. MAKER-P: a tool kit for the rapid creation, management, and quality control of plant genome annotations. Plant Physiol. 164, 513–524 (2014).
79. Edgar, R. C. Search and clustering orders of magnitude faster than BLAST. Bioinformatics 26, 2460–2461 (2010).
80. Chen, M. M., Lin, H., Chiang, L. M., Childers, C. P. & Poelchau, M. F. The GFF3toolkit: QC and merge pipeline for genome annotation. Methods Mol. Biol. 1858, 75–87 (2019).
81. Blum, M. et al. The InterPro protein families and domains database: 20 years on. Nucleic Acids Res. 49, D344–D354 (2021).
82. Olson, A. J. & Ware, D. Ranked choice voting for representative transcripts with TRaCE. Bioinformatics 38, 261–264 (2021).
83. Ou, S. et al. Benchmarking transposable element annotation methods for creation of a streamlined, comprehensive pipeline. Genome Biol. 20, 275 (2019).
84. Wenke, T. et al. Targeted identification of short interspersed nuclear element families shows their widespread existence and extreme heterogeneity in plant genomes. Plant Cell 23, 3117–3128 (2011).
85. Emms, D. M. & Kelly, S. OrthoFinder: phylogenetic orthology inference for comparative genomics. Genome Biol. 20, 238 (2019).
86. Goel, M., Sun, H., Jiao, W. B. & Schneeberger, K. SyRI: finding genomic rearrangements and local sequence differences from whole-genome assemblies. Genome Biol. 20, 277 (2019).
87. Kronenberg, Z. N. et al. High-resolution comparative analysis of great ape genomes. Science 360, eaar6343 (2018).
88. Sedlazeck, F. J. et al. Accurate detection of complex structural variations using single-molecule sequencing. Nat. Methods 15, 461–468 (2018).
89. Jiang, T. et al. Long-read-based human genomic structural variation detection with cuteSV. Genome Biol. 21, 189 (2020).
90. Hickey, G. et al. Genotyping structural variants in pangenome graphs using the vg toolkit. Genome Biol. 21, 35 (2020).
91. Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).
92. Chen, S. et al. Paragraph: a graph-based structural variant genotyper for short-read sequence data. Genome Biol. 20, 291 (2019).
93. Speed, D., Holmes, J. & Balding, D. J. Evaluating and improving heritability models using summary statistics. Nat. Genet. 52, 458–462 (2020).
94. Zhou, X. & Stephens, M. Genome-wide efficient mixed-model analysis for association studies. Nat. Genet. 44, 821–824 (2012).
95. Alexander, D. H., Novembre, J. & Lange, K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 19, 1655–1664 (2009).
96. Chen, S., Songkumarn, P., Liu, J. & Wang, G. L. A versatile zero background T-vector system for gene cloning and functional genomics. Plant Physiol. 150, 1111–1121 (2009).
97. Xing, H. L. et al. A CRISPR/Cas9 toolkit for multiplex genome editing in plants. BMC Plant Biol. 14, 327 (2014).
98. Byeon, B. et al. The ATP-dependent chromatin remodeling enzyme Fun30 represses transcription by sliding promoter-proximal nucleosomes. J. Biol. Chem. 288, 23182–23193 (2013).
99. Yang, S. Supplementary data of the maize pan-genome work (1.0). Zenodo https://doi.org/10.5281/zenodo.16576184 (2025).
100. Yang, S. GWAS results related to the maize pan-genome work (1.0). Zenodo https://doi.org/10.5281/zenodo.17138577 (2025).
101. Yang, S. Maize pan-genome related scripts and pipelines (1.0). Zenodo https://doi.org/10.5281/zenodo.16631382 (2025).

Acknowledgements

We thank T. Tian (University of Science and Technology Beijing), Z. Zhang (China Agricultural University) and H. He (Fujian Agriculture and Forestry University) for valuable discussions, and S. Wang and Y. Liu (Instrumental Platform of the State Key Laboratory of Plant Environmental Resilience, China Agricultural University, Beijing) for help with scanning electron microscopy and confocal imaging. This research was supported by the National Key Research and Development Program of China (grant number 2023YFF1001300), the National Natural Science Foundation of China (grant numbers 32430010 and 32272024), the Chinese Universities Scientific Fund (grant numbers 2025TC135 and 2025TC148), the Beijing Outstanding Young Scientist Program (grant number BJJWZYJH01201910019026) and the China Postdoctoral Science Foundation (grant number 2019M660867).

Author information

These authors contributed equally: Shiping Yang, Yijie Wang, Qin Huang, Mengyuan Wang.

Authors and Affiliations

Frontiers Science Center for Molecular Design Breeding (MOE), State Key Laboratory of Plant Environmental Resilience, Center for Crop Functional Genomics and Molecular Breeding, College of Biological Sciences, China Agricultural University, Beijing, China
Shiping Yang, Yijie Wang, Qin Huang, Mengyuan Wang, Xiaomeng Fu, Chaohui Zhu, Jinkui Cheng, Shengxue Liu, Zhirui Yang, Xiaohong Yang & Feng Qin
College of Life Sciences, Hebei University, Baoding, China
Shuhui Wang
National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, China
Ning Yang & Jianbing Yan
Hubei Hongshan Laboratory, Wuhan, China
Ning Yang & Jianbing Yan

Contributions

F.Q. and S.Y. designed and supervised the study and revised the paper. S.Y. and Y.W. performed the pangenome study and gene association analysis. Q.H., M.W., S.W., X.F., C.Z., S.L. and Z.Y. performed experiments for gene cloning and collected phenotypic data in the fields. J.C. contributed to transgenic maize generations. N.Y., J.Y. and X.Y. provided maize materials and valuable discussions and edited the paper. All the authors read and approved the final paper.

Corresponding author

Correspondence to Feng Qin.

Ethics declarations

Competing interests

Two patent applications related to this work have been submitted by F.Q., S.Y., Q.H. and M.W. The other authors declare no competing interests.

Peer review

Peer review information

Nature Genetics thanks Klaus Mayer and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 Analysis of drought resistance phenotypes at the pan-genome and population levels.

a, Representative plant photographs of the 23 NAM founders¹⁶, excluding B73 (presented in Tian et al.²⁵), CML247 and CML277 (lacking sufficient seeds), grown under WW and WS2 conditions in the field. The ears are harvested from plants grown under WW, WS1 and WS2 conditions. The germplasms are ordered according to their average yield under WS1 and WS2. The size of each bubble corresponds to the average yield and days of ASI under WS1 and WS2 conditions, and to the seedling survival rate (SR) after drought stress. b, Integrative illustration of multiple drought resistance phenotypes of 50 germplasms. The 50 germplasms comprise the 25 germplasms newly assembled in this study, the 26 NAM founders¹⁶, Mo17²⁷ and CIMBL55²⁵, excluding SY1032, CML247 and CML277 (lacking sufficient seeds). Plant drought resistance at the seedling stage is scored according to the survival rate after drought. All germplasms are classified into three categories, ‘Resistant’, ‘Intermediate’ and ‘Sensitive’, as in Fig. 1c. ‘WW yield’, grain yield under WW; ‘WS yield’, (yield_WS1+yield_WS2)/2; ‘dASI’, (ASI_WS1+ASI_WS2)/2 − (ASI_WW). The stacked bar value (left y-axis) represents WW yield (light blue) and WS yield (bright blue) for each germplasm. Scatter points (right y-axis) show the dASI for each germplasm. Red inverted triangles indicate six drought-resistant germplasms that meet three criteria: 1) resistance or intermediate resistance at the seedling stage; 2) yield > 25 g (above the cyan dotted line) under WS; 3) dASI < 2 days (below the red dotted line). c, Statistics of ASI under WW and WS conditions. Statistical significance was determined by a paired t-test. d, e, Correlation between ASI and yield per plant (d) and between seedling SR and yield per plant (e) under WW and WS conditions. The Pearson correlation coefficient (r) is used to evaluate the linear correlation between two traits. Statistical significance was determined by a two-sided t-test. In c–e, the phenotypes are obtained from 228 temperate germplasms⁹.
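The derived traits and the three selection criteria in panel b can be expressed compactly in code. The following is a minimal Python sketch, not the authors' script: the field names and the three example records are hypothetical and the values are made up for illustration; only the formulas for WS yield and dASI and the thresholds (25 g, 2 days) come from the caption.

```python
from dataclasses import dataclass


@dataclass
class Germplasm:
    # Field names are hypothetical; values used below are illustrative only.
    name: str
    seedling_class: str  # 'Resistant', 'Intermediate' or 'Sensitive'
    yield_ww: float      # grain yield under WW (g)
    yield_ws1: float     # grain yield under WS1 (g)
    yield_ws2: float     # grain yield under WS2 (g)
    asi_ww: float        # ASI under WW (days)
    asi_ws1: float       # ASI under WS1 (days)
    asi_ws2: float       # ASI under WS2 (days)


def ws_yield(g: Germplasm) -> float:
    """WS yield = (yield_WS1 + yield_WS2) / 2, as defined in the caption."""
    return (g.yield_ws1 + g.yield_ws2) / 2


def d_asi(g: Germplasm) -> float:
    """dASI = (ASI_WS1 + ASI_WS2) / 2 - ASI_WW, as defined in the caption."""
    return (g.asi_ws1 + g.asi_ws2) / 2 - g.asi_ww


def is_drought_resistant(g: Germplasm) -> bool:
    """Apply the three criteria: seedling class, WS yield > 25 g, dASI < 2 days."""
    return (
        g.seedling_class in {"Resistant", "Intermediate"}
        and ws_yield(g) > 25.0
        and d_asi(g) < 2.0
    )


# Made-up example records (not data from the study).
germplasms = [
    Germplasm("line_A", "Resistant", 60.0, 40.0, 30.0, 1.0, 2.0, 3.0),
    Germplasm("line_B", "Sensitive", 70.0, 45.0, 35.0, 1.0, 1.5, 2.0),
    Germplasm("line_C", "Intermediate", 55.0, 20.0, 10.0, 1.0, 4.0, 6.0),
]
selected = [g.name for g in germplasms if is_drought_resistant(g)]
print(selected)
```

In this toy input only `line_A` passes all three filters: `line_B` fails the seedling-stage criterion and `line_C` fails both the yield and dASI thresholds.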

Extended Data Fig. 2 Plant survival rate assay for drought resistance at seedling stage for NAM founders.

The seedling SR of each germplasm is compared with that of B73. Fifteen plants of each germplasm are compared in each pot, with at least two replicates. Representative photographs are taken before and after drought treatment.

Extended Data Fig. 3 Pipeline of de novo assembly for 25 genomes.

Approximately 98× genomic DNA long-read sequencing data generated by Oxford Nanopore Technologies (ONT) were initially corrected by NextDenovo⁵⁴. Then, the ONT reads and about 60× genomic DNA short-read sequencing data generated by Illumina NovaSeq 6000 were used to polish the draft contigs. RagTag⁵⁸ was used to address inter-chromosomal assembly errors and construct chromosome-scale pseudomolecules (see Methods).

Extended Data Fig. 4 Pipeline of gene and repeat sequence annotation.

a, Gene annotation pipeline. Both evidence-based and ab initio gene prediction methods were used for each genome. For the evidence-based method, transcript assembly programs were used for transcript prediction based on RNA-seq data from five tissues (see Methods), and Mikado⁶⁷ was then used to select the best transcripts. For ab initio gene prediction, the mapped RNA-seq reads and the protein sequences generated by the evidence-based method were used as inputs to BRAKER2⁷¹. The non-redundant gene models from both methods were further refined by PASA2⁷⁴ based on maize Iso-seq⁷⁶ data and expressed sequence tags (ESTs) from GenBank (https://www.ncbi.nlm.nih.gov/genbank/). The final gene models were generated after post-processing. b, Repeat sequence annotation pipeline. For long terminal repeat (LTR), terminal inverted repeat (TIR) and Helitron annotation, novel transposable elements (TEs) in each genome were identified using RepeatMasker⁷² and EDTA⁸³ by comparison with a known TE library (MTEC, https://github.com/oushujun/MTEC). After merging the novel TEs from the 25 newly assembled genomes and removing redundancy, a non-redundant novel TE library was created. This library was then combined with the MTEC library to form the final Pan-TE library, which was used to annotate LTRs, TIRs and Helitrons. In addition, short interspersed nuclear elements (SINEs) were identified by SineFinder⁸⁴, while long interspersed nuclear elements (LINEs) and non-TE repeats were annotated directly by RepeatMasker and EDTA.

Extended Data Fig. 5 Pan-gene and Pan-SV analyses of 56 maize genomes.

a, b, Modeling the size of maize pan-genes and core genes (a) and of pan-gene families and core-gene families (b) as additional genomes are incorporated into the maize pan-genome. Genomes were sampled as 56 random combinations of each given number of genomes. Mean values are displayed with error bars representing ± SD. c, Observed (arrow) and expected (density) distribution of genes belonging to core and softcore gene families. The expected distribution is the proportion of genes belonging to core and softcore gene families obtained by randomly sampling 1,344 protein-coding genes 10,000 times. Statistical significance is determined by a permutation test. d, Accumulation of different types of SVs, including insertion (INS), deletion (DEL), duplication (DUP), translocation (TRA) and inversion (INV), as the number of genomes increases. The height of the stacked chart represents the total number of SVs, while the colored sections show the number of each type of SV. e, UpSet plot showing the number of SVs identified by different methods. High-quality SVs refer to those identified by at least two methods. The horizontal bars represent the total number of SVs identified by each method. Vertical bars display the number of SVs identified by one or multiple methods, as indicated by the black dots below the x-axis, which mark the methods used for SV identification. f, Distribution of pan-SV positions relative to gene positions in B73 (v5). ‘UTR’, untranslated region. ‘CDS’, coding sequence. g, Proportion of pan-SVs of different lengths. h, Root gravitropism assay for B73 and CML333. The red arrow indicates the direction of gravity. The inclination angle of the root relative to the horizontal line was measured 12 hours after rotating the Petri dishes 90° to the right. Numbers above the x-axis indicate the sample size, and statistical significance is determined by a two-sided t-test.
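The pan- and core-gene curves in panels a and b follow a standard resampling scheme: for each number of genomes k, draw random k-genome combinations and record the size of the union (pan) and intersection (core) of their gene sets. The Python sketch below illustrates that idea under stated assumptions. The function names and the toy gene sets are hypothetical, combinations are drawn independently (so repeats are possible), and only the figure of 56 random combinations per k is taken from the caption.

```python
import random
from statistics import mean, stdev


def pan_core_sizes(gene_sets):
    """Return (pan, core) sizes: the union and intersection of the gene sets."""
    pan = set().union(*gene_sets)
    core = set.intersection(*gene_sets)
    return len(pan), len(core)


def accumulation_curve(genomes, n_samples=56, seed=0):
    """For each k, sample n_samples random k-genome combinations and report
    (mean pan, SD pan, mean core, SD core)."""
    rng = random.Random(seed)
    names = sorted(genomes)
    curve = {}
    for k in range(1, len(names) + 1):
        pans, cores = [], []
        for _ in range(n_samples):
            subset = rng.sample(names, k)  # k distinct genomes
            pan, core = pan_core_sizes([genomes[n] for n in subset])
            pans.append(pan)
            cores.append(core)
        curve[k] = (mean(pans), stdev(pans), mean(cores), stdev(cores))
    return curve


# Toy presence/absence data: genome name -> set of gene (or gene family) IDs.
toy_genomes = {
    "g1": {"a", "b", "c"},
    "g2": {"a", "b", "d"},
    "g3": {"a", "c", "e"},
}
curve = accumulation_curve(toy_genomes)
for k, (pan_mean, pan_sd, core_mean, core_sd) in curve.items():
    print(k, round(pan_mean, 2), round(core_mean, 2))
```

As more genomes are added, the pan size can only grow and the core size can only shrink, which is the shape such curves are expected to show.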

Source data

Extended Data Fig. 6 Pipeline for pangenome structural variant (pan-SV) identification.

Initial SV identification was based on a maize pan-genome comprising 56 high-quality assembled genomes. The Nanopore/PacBio long-read sequences of each germplasm were mapped to the B73 (v5) genome sequence and SVs were identified by Sniffles⁸⁸ and CuteSV⁸⁹. Meanwhile, SyRI⁸⁶ and Smartie-sv⁸⁷ were employed to identify SVs through sequence alignment of each assembled genome sequence to that of B73 (v5). SVs identified in NAM founders¹⁶ were merged into the final non-redundant SV dataset using Jasmine¹⁷.
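
The "identified by at least two methods" criterion for high-quality SVs (Extended Data Fig. 5e) can be sketched as a simple support count. The sketch assumes that calls from the different tools have already been reconciled to identical keys, here hypothetical `(chromosome, start, end, type)` tuples; real merging tools match breakpoints within a tolerance, which this example does not attempt.

```python
from collections import Counter


def high_quality_svs(calls_by_method, min_support=2):
    """Keep SVs reported by at least `min_support` methods.

    calls_by_method: dict mapping method name -> iterable of SV keys
                     (hypothetical, pre-harmonized tuples).
    """
    support = Counter()
    for calls in calls_by_method.values():
        # set() ensures each method contributes at most one vote per SV.
        support.update(set(calls))
    return {sv for sv, n in support.items() if n >= min_support}


# Toy calls; coordinates are made up for illustration.
toy_calls = {
    "Sniffles":   [("chr1", 100, 350, "DEL"), ("chr2", 500, 500, "INS")],
    "CuteSV":     [("chr1", 100, 350, "DEL")],
    "SyRI":       [("chr2", 500, 500, "INS"), ("chr3", 900, 1900, "INV")],
    "Smartie-sv": [("chr3", 40, 90, "DEL")],
}
print(sorted(high_quality_svs(toy_calls)))
```

Here the chr1 deletion and the chr2 insertion are each supported by two methods and are retained, while the two singleton calls are dropped.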

Extended Data Fig. 7 ZmUGE2 contributes to seedling drought resistance.

a–c, Manhattan plot of the GWAS identifying the genetic loci associated with ZmUGE2 expression levels under WW (a), WS1 (b) and WS2 (c) conditions. Variants located within ZmUGE2 and the 5-kb flanking sequence are indicated by red dots. d, Comparison of gene expression levels of ZmUGE2-Hap1 and ZmUGE2-Hap2 under WW and WS2 conditions. e, Comparison of gene expression levels of ZmUGE2 (left panel) and the survival rates (right panel) of the germplasms carrying different genotypes of ZmUGE2 based on the genotype of Indel no. 7434. ‘+’ indicates the presence of the 9-bp insertion; ‘−’ indicates its absence. f, Left panel: RT-qPCR analysis of relative ZmUGE2 transcript levels in WT and ZmUGE2-OE lines, normalized to the internal control gene ZmUBI. Mean values are displayed with error bars representing ± SD from three independent biological replicates. Right panel: Western blot analysis of the ZmUGE2-GFP protein in the ZmUGE2-OE plants, with actin used as a loading control. Molecular weight markers are shown on the right. g, Schematic diagram of the CRISPR-targeted knockout genotype of zmuge2-KO lines. The gRNA target sequences and edits are indicated below the gene diagram. h, Water loss rate (%) of detached leaves at the indicated time points. For each genotype, four detached leaves were placed on a clean bench to dehydrate, and their weights were recorded periodically over an 8-hour period. Data represent the mean ± SD of three replicate experiments. Asterisks (*p < 0.05) indicate a significant difference between the WT and transgenic plants. i, Comparison of plant height between WT and ZmUGE2-OE plants. Numbers above the x-axis indicate the number of plants for each genotype. In d and e, numbers above the x-axis represent the number of germplasms for each genotype. In d–f, h and i, statistical significance is determined by a two-sided t-test.
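
As a small worked example for panel h, water loss is commonly expressed as the percentage of initial fresh weight lost at each time point. The legend does not state the formula, so the definition below, the function name and the sample weights are assumptions for illustration only.

```python
def water_loss_percent(weights):
    """Percent of initial fresh weight lost at each time point.

    weights: leaf weights in time order; weights[0] is the initial
             fresh weight (assumed definition, not taken from the source).
    """
    initial = weights[0]
    if initial <= 0:
        raise ValueError("initial weight must be positive")
    return [100.0 * (initial - w) / initial for w in weights]


# Hypothetical weights (g) of detached leaves at successive time points.
print(water_loss_percent([2.0, 1.5, 1.0]))  # [0.0, 25.0, 50.0]
```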

Source data

Extended Data Fig. 8 ZmSIL2 plays a negative role in maize seedling drought resistance.

a–c, Manhattan plot of the GWAS identifying the genetic loci associated with ZmSIL2 expression levels under WW (a), WS1 (b) and WS2 (c) conditions. Variants located within ZmSIL2 and the 5-kb flanking sequence are indicated by red dots. d, Comparison of gene expression levels of ZmSIL2-Hap1 and ZmSIL2-Hap2 under WW and WS2 conditions. e, Comparison of gene expression levels of ZmSIL2 (left panel) and the survival rates (right panel) of the germplasms carrying different genotypes of ZmSIL2 based on SV no. 2606. ‘+’ indicates the presence of the 715-bp insertion; ‘−’ indicates its absence. f, Expression level of ZmSIL2 in WT and ZmSIL2-OE lines. Mean ± SD from three independent biological replicates. g, Schematic diagram of the CRISPR-targeted knockout genotype of ZmSIL2. The trihelical DNA-binding domain is highlighted, and the predicted α-helix regions are shown in cyan. h, Subcellular localization of ZmSIL2-GFP. Confocal microscopy images of the ZmSIL2-GFP protein expressed in maize leaf protoplasts. Scale bar, 5 μm. i, Western blot analysis of the samples used for the ZmSIL2-GFP ChIP-seq analysis. ‘WT’, WT samples prepared in parallel as negative controls. ‘Anti-H3’ indicates the nuclear fraction. j, Gene expression analysis of four ZmSIL2-regulated genes. For each gene, the gene structure is shown in the upper panel, and the RNA-seq read graphs of WT and zmsil2-KO1 samples under WW and WS conditions are shown in the four tracks in the middle panel. The bottom panel shows RT-qPCR confirmation of the altered gene expression in ZmSIL2-OE and zmsil2-KO plants. Data are presented as mean values ± SD. In d and e, numbers above the x-axis represent the number of germplasms for each genotype. In h and i, the micrographs and immunoblot are representative results from at least two independent biological replicates. In d–f and j, statistical significance is determined by a two-sided t-test.

Source data

Extended Data Fig. 9 ZmASI3 contributes to maize drought resistance at the flowering stage.

a, The putative SNF2 helicase domains (ZmASI3-1267 aa) encoded by T01 and the truncated one (ZmASI3-1151) encoded by T02 were each fused with glutathione S-transferase (GST) for protein expression and purification in E. coli. b, Protein gel electrophoresis of the purified GST, GST-ZmASI3-1267 and GST-ZmASI3-1151 proteins. Molecular weight markers are indicated on the right side. c, In vitro ATPase activity assay. GST, GST-ZmASI3-1267 and GST-ZmASI3-1151 proteins were used in the assay. ‘+’ indicates 0.2 μg of protein, while ‘++’ indicates 0.4 μg. Mean values are displayed with error bars representing ± SD from three independent biological replicates. d, Schematic illustration of the in vitro nucleosome remodeling assay. The previously inaccessible Dpn II site (yellow) becomes exposed due to localized nucleosome sliding. e, Nucleosome remodeling assay. The GST-ZmASI3-1267 and GST-ZmASI3-1151 proteins were incubated with the pre-assembled nucleosome for the indicated time periods. Bands near 225 bp represent intact nucleosomal DNA, while bands near 200 bp (red asterisk) represent fragments cleaved by Dpn II after nucleosome remodeling. GST protein was used as a negative control. The gel is a representative result from two independent biological replicates. f, Presence and absence of SV1-5 within intron 5 of ZmASI3 among different haplotypes. Cyan indicates the presence of an SV, while gray indicates its absence. g, Schematic diagram of the CRISPR-targeted knockout genotype of ZmASI3. h,i, Violin plots of days to anthesis (DTA) (h) and days to silking (DTS) (i) in WT and zmasi3-KO plants under WW and WS field conditions. In h and i, numbers above the x-axis represent the number of plants for each genotype, and statistical significance is determined by a two-sided t-test.

Source data

Supplementary information

Supplementary Information

Supplementary Note, Tables 1–16 and refs. 1–13.

Reporting Summary

Peer Review File

Supplementary Tables

Supplementary Tables 1–16.

Source data

Figs. 1–5 and Extended Data Figs. 1, 5 and 7–9

Statistical source data for Figs. 1–5 and Extended Data Figs. 1, 5 and 7–9.

Fig. 3g and Extended Data Figs. 7–9

Unprocessed western blots and gels for Fig. 3g and Extended Data Figs. 7f, 8i and 9b,e.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

About this article

Cite this article

Yang, S., Wang, Y., Huang, Q. et al. A pangenome of maize provides genetic insights into drought resistance. Nat Genet 57, 2831–2841 (2025). https://doi.org/10.1038/s41588-025-02378-w

Received: 18 January 2025
Accepted: 19 September 2025
Published: 27 October 2025
Version of record: 27 October 2025
Issue date: November 2025
DOI: https://doi.org/10.1038/s41588-025-02378-w
