Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Article
  • Published:

A pangenome of maize provides genetic insights into drought resistance

Abstract

Drought poses a severe threat to the stability of crop yields. It is crucial to identify genetic resources and decipher the molecular mechanisms underlying drought resistance in crops. Here we generated high-quality genome assemblies of 25 maize germplasms exhibiting substantial variation in drought resistance. Combined with 31 additional maize genome sequences, a comprehensive pangenome analysis was performed. Rare allelic variations and extensive regulatory diversity were revealed in abscisic acid-related or drought-related genes, which may contribute to the diversity in drought resistance among germplasms. Furthermore, we identified three genes, ZmUGE2, ZmSIL2 and ZmASI3, that enhance maize drought resistance by strengthening mechanical support of the cell wall, regulating stress-responsive gene expression and coordinating male and female inflorescence development, respectively. Thus, this study provides valuable insight into the genetic control of drought resistance in maize at different growing stages. The expanded maize pangenome information serves as a valuable resource for maize genomic research.

This is a preview of subscription content, access via your institution

Access options

Buy this article

USD 39.95

Prices may be subject to local taxes which are calculated during checkout

Fig. 1: Phylogenetic analysis of 25 selected maize germplasms and characterization of their drought-related phenotypes.
Fig. 2: Maize pangenome analysis and variations in ABA-related and drought-related genes.
Fig. 3: ZmUGE2 positively regulates drought resistance.
Fig. 4: ZmSIL2 negatively regulates drought resistance at the seedling stage.
Fig. 5: Natural variation in ZmASI3 contributes to drought resistance at the flowering stage.

Similar content being viewed by others

Data availability

All raw sequencing data referenced in this study are available at the National Center for Biotechnology Information (NCBI) Sequence Read Archive with BioProject accession number PRJNA1103102, including the ONT, DNA and/or RNA Illumina sequencing, as well as RNA-seq and ChIP–seq data for transgenic plants. Genome sequences and annotations for 25 newly assembled maize germplasms and SV information are available via Zenodo at https://doi.org/10.5281/zenodo.16576184 (ref. 99). GWAS results related to ZmUGE2, ZmSIL2 and ZmASI3 are available via Zenodo at https://doi.org/10.5281/zenodo.17138577 (ref. 100). Long-read sequencing data of the previously published genomes were obtained from the NCBI database, which includes NAM founders16 (accession number PRJEB31061), Mo1727 (accession number PRJNA358298), SK31 (accession number PRJNA531547), CIMBL5525 (accession number PRJNA765111), K0326Y32 (accession number PRJNA539996) and A18833 (accession number PRJNA635654). Other public maize, teosinte and T. dactyloides genome sequences used in this work are available at https://download.maizegdb.org/. Source data are provided with this paper.

Code availability

All customized scripts used in this study are available via GitHub at https://github.com/YangBioinformatics/Maize-Pan-genome and via Zenodo at https://doi.org/10.5281/zenodo.16631382 (ref. 101).

References

  1. The Impact of Disasters and Crises on Agriculture and Food Security (FAO, 2021).

  2. Erenstein, O., Chamberlin, J. & Sonder, K. Estimating the global number and distribution of maize and wheat farms. Glob. Food Secur. 30, 100558 (2021).

    Article  Google Scholar 

  3. Harrison, M. T., Tardieu, F., Dong, Z., Messina, C. D. & Hammer, G. L. Characterizing drought stress and trait influence on maize yield under current and future conditions. Glob. Chang. Biol. 20, 867–878 (2014).

    Article  PubMed  Google Scholar 

  4. Lobell, D. B., Deines, J. M. & Tommaso, S. D. Changes in the drought sensitivity of US maize yields. Nat. Food 1, 729–735 (2020).

    Article  PubMed  Google Scholar 

  5. Lobell, D. B. et al. Greater sensitivity to drought accompanies maize yield increase in the U.S. Midwest. Science 344, 516–519 (2014).

    Article  CAS  PubMed  Google Scholar 

  6. Chen, L. et al. Genome sequencing reveals evidence of adaptive variation in the genus Zea. Nat. Genet. 54, 1736–1745 (2022).

    Article  CAS  PubMed  Google Scholar 

  7. Wang, X. et al. Genetic variation in ZmVPP1 contributes to drought tolerance in maize seedlings. Nat. Genet. 48, 1233–1241 (2016).

    Article  CAS  PubMed  Google Scholar 

  8. Bolaños, J. & Edmeades, G. O. The importance of the anthesis-silking interval in breeding for drought tolerance in tropical maize. Field Crops Res. 48, 65–80 (1996).

    Article  Google Scholar 

  9. Liu, B. et al. Manipulating ZmEXPA4 expression ameliorates the drought-induced prolonged anthesis and silking interval in maize. Plant Cell 33, 2058–2071 (2021).

    Article  PubMed  PubMed Central  Google Scholar 

  10. Danilevskaya, O. N. et al. Developmental and transcriptional responses of maize to drought stress under field conditions. Plant Direct 3, e00129 (2019).

    Article  PubMed  PubMed Central  Google Scholar 

  11. Fuad-Hassan, A., Tardieu, F. & Turc, O. Drought-induced changes in anthesis-silking interval are related to silk expansion: a spatio-temporal growth analysis in maize plants subjected to soil water deficit. Plant Cell Environ. 31, 1349–1360 (2008).

    Article  PubMed  Google Scholar 

  12. Liu, S. et al. Mapping regulatory variants controlling gene expression in drought response and tolerance in maize. Genome Biol. 21, 163 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  13. Mao, H. et al. A transposable element in a NAC gene is associated with drought tolerance in maize seedlings. Nat. Commun. 6, 8326 (2015).

    Article  CAS  PubMed  Google Scholar 

  14. Ricci, W. A. et al. Widespread long-range cis-regulatory elements in the maize genome. Nat. Plants 5, 1237–1249 (2019).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  15. Jiao, Y. et al. Improved maize reference genome with single-molecule technologies. Nature 546, 524–527 (2017).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  16. Hufford, M. B. et al. De novo assembly, annotation, and comparative analysis of 26 diverse maize genomes. Science 373, 655–662 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. Alonge, M. et al. Major impacts of widespread structural variation on gene expression and crop improvement in tomato. Cell 182, 145–161 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  18. Liu, Y. C. et al. Pan-genome of wild and cultivated soybeans. Cell 182, 162–176 (2020).

    Article  CAS  PubMed  Google Scholar 

  19. Qin, P. et al. Pan-genome analysis of 33 genetically diverse rice accessions reveals hidden genomic variations. Cell 184, 3542–3558 (2021).

    Article  CAS  PubMed  Google Scholar 

  20. Wang, B. et al. De novo genome assembly and analyses of 12 founder inbred lines provide insights into maize heterosis. Nat. Genet. 55, 312–323 (2023).

    Article  CAS  PubMed  Google Scholar 

  21. Jayakodi, M. et al. Structural variation in the pangenome of wild and domesticated barley. Nature 636, 654–662 (2024).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  22. Jiao, C. et al. Pan-genome bridges wheat structural variations with habitat and breeding. Nature 637, 384–393 (2024).

    Article  PubMed  Google Scholar 

  23. Gui, S. et al. A pan-Zea genome map for enhancing maize improvement. Genome Biol. 23, 178 (2022).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  24. Zhou, Y. et al. Graph pangenome captures missing heritability and empowers tomato breeding. Nature 606, 527–534 (2022).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  25. Tian, T. et al. Genome assembly and genetic dissection of a prominent drought-resistant maize germplasm. Nat. Genet. 55, 496–506 (2023).

    Article  CAS  PubMed  Google Scholar 

  26. Yang, X. et al. Characterization of a global germplasm collection and its potential utilization for analysis of complex quantitative traits in maize. Mol. Breed. 28, 511–526 (2010).

    Article  Google Scholar 

  27. Sun, S. et al. Extensive intraspecific gene order and gene structural variations between Mo17 and other maize genomes. Nat. Genet. 50, 1289–1295 (2018).

    Article  CAS  PubMed  Google Scholar 

  28. Chen, J. et al. A complete telomere-to-telomere assembly of the maize genome. Nat. Genet. 55, 1221–1231 (2023).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  29. Simao, F. A., Waterhouse, R. M., Ioannidis, P., Kriventseva, E. V. & Zdobnov, E. M. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31, 3210–3212 (2015).

    Article  CAS  PubMed  Google Scholar 

  30. Ou, S., Chen, J. & Jiang, N. Assessing genome assembly quality using the LTR Assembly Index (LAI). Nucleic Acids Res. 46, e126 (2018).

    PubMed  PubMed Central  Google Scholar 

  31. Yang, N. et al. Genome assembly of a tropical maize inbred line provides insights into structural variation and crop improvement. Nat. Genet. 51, 1052–1059 (2019).

    Article  CAS  PubMed  Google Scholar 

  32. Li, C. et al. Long-read sequencing reveals genomic structural variations that underlie creation of quality protein maize. Nat. Commun. 11, 17 (2020).

    Article  PubMed  PubMed Central  Google Scholar 

  33. Lin, G. et al. Chromosome-level genome assembly of a regenerable maize inbred line A188. Genome Biol. 22, 175 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  34. Huang, Y. et al. OsNCED5, a 9-cis-epoxycarotenoid dioxygenase gene, regulates salt and water stress tolerance and leaf senescence in rice. Plant Sci. 287, 110188 (2019).

    Article  CAS  PubMed  Google Scholar 

  35. Uga, Y. et al. Control of root system architecture by DEEPER ROOTING 1 increases rice yield under drought conditions. Nat. Genet. 45, 1097–1102 (2013).

    Article  CAS  PubMed  Google Scholar 

  36. Zhang, F. et al. Genomic basis underlying the metabolome-mediated drought adaptation of maize. Genome Biol. 22, 260 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  37. Wu, X. et al. Using high-throughput multiple optical phenotyping to decipher the genetic architecture of maize drought tolerance. Genome Biol. 22, 185 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  38. Zhang, Y. et al. BRITTLE PLANT1 is required for normal cell wall composition and mechanical strength in rice. J. Integr. Plant Biol. 63, 865–877 (2021).

    Article  CAS  PubMed  Google Scholar 

  39. Liu, S. et al. The rice BZ1 locus is required for glycosylation of arabinogalactan proteins and galactolipid and plays a role in both mechanical strength and leaf color. Rice 13, 41 (2020).

    Article  PubMed  PubMed Central  Google Scholar 

  40. Rosti, J. et al. UDP-glucose 4-epimerase isoforms UGE2 and UGE4 cooperate in providing UDP-galactose for cell wall biosynthesis and growth of Arabidopsis thaliana. Plant Cell 19, 1565–1579 (2007).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  41. Kaplan-Levy, R. N., Brewer, P. B., Quon, T. & Smyth, D. R. The trihelix family of transcription factors—light, stress and development. Trends Plant Sci. 17, 163–171 (2012).

    Article  CAS  PubMed  Google Scholar 

  42. Yang, N. et al. Two teosintes made modern maize. Science 382, eadg8940 (2023).

    Article  CAS  PubMed  Google Scholar 

  43. Sakamoto, H. et al. Arabidopsis Cys2/His2-type zinc-finger proteins function as transcription repressors under drought, cold, and high-salinity stress conditions. Plant Physiol. 136, 2734–2746 (2004).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  44. Liu, S. et al. Genome-wide analysis of ZmDREB genes and their association with natural variation in drought tolerance at seedling stage of Zea mays L. PLoS Genet. 9, e1003790 (2013).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  45. Wu, Q. et al. Transcription factor ZmEREB97 regulates nitrate uptake in maize (Zea mays) roots. Plant Physiol. 196, 535–550 (2024).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  46. Huh, S. U. New function of hypoxia-responsive unknown protein in enhanced resistance to biotic stress. Plant Signal Behav. 16, 1868131 (2021).

    Article  PubMed  Google Scholar 

  47. Knizewski, L., Ginalski, K. & Jerzmanowski, A. Snf2 proteins in plants: gene silencing and beyond. Trends Plant Sci. 13, 557–565 (2008).

    Article  CAS  PubMed  Google Scholar 

  48. Deng, Y. et al. Epigenetic regulation of antagonistic receptors confers rice blast resistance with yield balance. Science 355, 962–965 (2017).

    Article  CAS  PubMed  Google Scholar 

  49. Gao, M. J. et al. Repression of seed maturation genes by a trihelix transcriptional repressor in Arabidopsis seedlings. Plant Cell 21, 54–71 (2009).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  50. Liu, H. et al. Distant eQTLs and non-coding sequences play critical roles in regulating gene expression and quantitative trait variation in maize. Mol. Plant 10, 414–426 (2017).

    Article  CAS  PubMed  Google Scholar 

  51. Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575 (2007).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  52. Felsenstein, J. PHYLIP—Phylogeny Inference Package (version 3.2). Cladistics 5, 164–166 (1989).

    Google Scholar 

  53. Letunic, I. & Bork, P. Interactive Tree Of Life (iTOL) v5: an online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 49, W293–W296 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  54. Hu, J. et al. NextDenovo: an efficient error correction and accurate assembly tool for noisy long reads. Genome Biol. 25, 107 (2024).

    Article  PubMed  PubMed Central  Google Scholar 

  55. Hu, J., Fan, J., Sun, Z. & Liu, S. NextPolish: a fast and efficient genome polishing tool for long-read assembly. Bioinformatics 36, 2253–2255 (2020).

    Article  CAS  PubMed  Google Scholar 

  56. Li, H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics 34, 3094–3100 (2018).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  57. Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 25, 1754–1760 (2009).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  58. Alonge, M. et al. Automated assembly scaffolding using RagTag elevates a new tomato system for high-throughput genome editing. Genome Biol. 23, 258 (2022).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  59. Marcais, G. et al. MUMmer4: a fast and versatile genome alignment system. PLoS Comput. Biol. 14, e1005944 (2018).

    Article  PubMed  PubMed Central  Google Scholar 

  60. Ou, S. & Jiang, N. LTR_retriever: a highly accurate and sensitive program for identification of long terminal repeat retrotransposons. Plant Physiol. 176, 1410–1422 (2018).

    Article  CAS  PubMed  Google Scholar 

  61. Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).

    Article  CAS  PubMed  Google Scholar 

  62. Grabherr, M. G. et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat. Biotechnol. 29, 644–652 (2011).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  63. Pertea, M. et al. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat. Biotechnol. 33, 290–295 (2015).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  64. Liu, R. & Dickerson, J. Strawberry: fast and accurate genome-guided transcript reconstruction and quantification from RNA-Seq. PLoS Comput. Biol. 13, e1005851 (2017).

    Article  PubMed  PubMed Central  Google Scholar 

  65. Trapnell, C. et al. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat. Protoc. 7, 562–578 (2012).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  66. Song, L., Sabunciyan, S. & Florea, L. CLASS2: accurate and efficient splice variant annotation from RNA-seq reads. Nucleic Acids Res. 44, e98 (2016).

    Article  PubMed  PubMed Central  Google Scholar 

  67. Venturini, L., Caim, S., Kaithakottil, G. G., Mapleson, D. L. & Swarbreck, D. Leveraging multiple transcriptome assembly methods for improved gene structure annotation. Gigascience 7, giy093 (2018).

    Article  PubMed  PubMed Central  Google Scholar 

  68. Mapleson, D., Venturini, L., Kaithakottil, G. & Swarbreck, D. Efficient and accurate detection of splice junctions from RNA-seq with Portcullis. Gigascience 7, giy131 (2018).

    Article  PubMed  PubMed Central  Google Scholar 

  69. Haas, B. J. et al. De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis. Nat. Protoc. 8, 1494–1512 (2013).

    Article  CAS  PubMed  Google Scholar 

  70. Camacho, C. et al. BLAST+: architecture and applications. BMC Bioinform. 10, 421 (2009).

    Article  Google Scholar 

  71. Bruna, T., Hoff, K. J., Lomsadze, A., Stanke, M. & Borodovsky, M. BRAKER2: automatic eukaryotic genome annotation with GeneMark-EP+ and AUGUSTUS supported by a protein database. NAR Genom. Bioinform. 3, lqaa108 (2021).

    Article  PubMed  PubMed Central  Google Scholar 

  72. Tarailo-Graovac, M. & Chen, N. Using RepeatMasker to identify repetitive elements in genomic sequences. Curr. Protoc. Bioinform. 25, 4–10 (2009).

    Article  Google Scholar 

  73. Dainat, J. & Pucholt, D. H. AGAT: another Gff analysis toolkit to handle annotations in any GTF. v.0.6.0. Zenodo https://doi.org/10.5281/zenodo.4637977 (2021).

  74. Haas, B. J. et al. Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies. Nucleic Acids Res. 31, 5654–5666 (2003).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  75. Soderlund, C. et al. Sequencing, mapping, and analysis of 27,455 maize full-length cDNAs. PLoS Genet. 5, e1000740 (2009).

    Article  PubMed  PubMed Central  Google Scholar 

  76. Wang, B. et al. A comparative transcriptional landscape of maize and sorghum obtained by single-molecule sequencing. Genome Res. 28, 921–932 (2018).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  77. Zhang, R. G. et al. TEsorter: an accurate and fast method to classify LTR-retrotransposons in plant genomes. Hortic. Res. 9, uhac017 (2022).

    Article  PubMed  PubMed Central  Google Scholar 

  78. Campbell, M. S. et al. MAKER-P: a tool kit for the rapid creation, management, and quality control of plant genome annotations. Plant Physiol. 164, 513–524 (2014).

    Article  CAS  PubMed  Google Scholar 

  79. Edgar, R. C. Search and clustering orders of magnitude faster than BLAST. Bioinformatics 26, 2460–2461 (2010).

    Article  CAS  PubMed  Google Scholar 

  80. Chen, M. M., Lin, H., Chiang, L. M., Childers, C. P. & Poelchau, M. F. The GFF3toolkit: QC and merge pipeline for genome annotation. Methods Mol. Biol. 1858, 75–87 (2019).

    Article  CAS  PubMed  Google Scholar 

  81. Blum, M. et al. The InterPro protein families and domains database: 20 years on. Nucleic Acids Res. 49, D344–D354 (2021).

    Article  CAS  PubMed  Google Scholar 

  82. Olson, A. J. & Ware, D. Ranked choice voting for representative transcripts with TRaCE. Bioinformatics 38, 261–264 (2021).

    Article  PubMed  PubMed Central  Google Scholar 

  83. Ou, S. et al. Benchmarking transposable element annotation methods for creation of a streamlined, comprehensive pipeline. Genome Biol. 20, 275 (2019).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  84. Wenke, T. et al. Targeted identification of short interspersed nuclear element families shows their widespread existence and extreme heterogeneity in plant genomes. Plant Cell 23, 3117–3128 (2011).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  85. Emms, D. M. & Kelly, S. OrthoFinder: phylogenetic orthology inference for comparative genomics. Genome Biol. 20, 238 (2019).

    Article  PubMed  PubMed Central  Google Scholar 

  86. Goel, M., Sun, H., Jiao, W. B. & Schneeberger, K. SyRI: finding genomic rearrangements and local sequence differences from whole-genome assemblies. Genome Biol. 20, 277 (2019).

    Article  PubMed  PubMed Central  Google Scholar 

  87. Kronenberg, Z. N. et al. High-resolution comparative analysis of great ape genomes. Science 360, eaar6343 (2018).

    Article  PubMed  PubMed Central  Google Scholar 

  88. Sedlazeck, F. J. et al. Accurate detection of complex structural variations using single-molecule sequencing. Nat. Methods 15, 461–468 (2018).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  89. Jiang, T. et al. Long-read-based human genomic structural variation detection with cuteSV. Genome Biol. 21, 189 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  90. Hickey, G. et al. Genotyping structural variants in pangenome graphs using the vg toolkit. Genome Biol. 21, 35 (2020).

    Article  PubMed  PubMed Central  Google Scholar 

  91. Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  92. Chen, S. et al. Paragraph: a graph-based structural variant genotyper for short-read sequence data. Genome Biol. 20, 291 (2019).

    Article  PubMed  PubMed Central  Google Scholar 

  93. Speed, D., Holmes, J. & Balding, D. J. Evaluating and improving heritability models using summary statistics. Nat. Genet. 52, 458–462 (2020).

    Article  CAS  PubMed  Google Scholar 

  94. Zhou, X. & Stephens, M. Genome-wide efficient mixed-model analysis for association studies. Nat. Genet. 44, 821–824 (2012).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  95. Alexander, D. H., Novembre, J. & Lange, K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 19, 1655–1664 (2009).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  96. Chen, S., Songkumarn, P., Liu, J. & Wang, G. L. A versatile zero background T-vector system for gene cloning and functional genomics. Plant Physiol. 150, 1111–1121 (2009).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  97. Xing, H. L. et al. A CRISPR/Cas9 toolkit for multiplex genome editing in plants. BMC Plant Biol. 14, 327 (2014).

    Article  PubMed  PubMed Central  Google Scholar 

  98. Byeon, B. et al. The ATP-dependent chromatin remodeling enzyme Fun30 represses transcription by sliding promoter-proximal nucleosomes. J. Biol. Chem. 288, 23182–23193 (2013).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  99. Yang, S. Supplementary data of the maize pan-genome work (1.0). Zenodo https://doi.org/10.5281/zenodo.16576184 (2025).

  100. Yang, S. GWAS results related to the maize pan-genome work (1.0). Zenodo https://doi.org/10.5281/zenodo.17138577 (2025).

  101. Yang, S. Maize pan-genome related scripts and pipelines (1.0). Zenodo https://doi.org/10.5281/zenodo.16631382 (2025).

Download references

Acknowledgements

We thank T. Tian (University of Science and Technology Beijing), Z. Zhang (China Agricultural University) and H. He (Fujian Agriculture and Forestry University) for valuable discussions, and S. Wang and Y. Liu (Instrumental platform of state key laboratory of plant environmental resilience, China Agricultural University, Beijing) for the help on scanning electron microscopy and confocal imaging. This research was supported by the National Key Research and Development Program of China (grant number 2023YFF1001300), the National Natural Science Foundation of China (grant numbers 32430010 and 32272024), the Chinese Universities Scientific Fund (grant numbers 2025TC135 and 2025TC148), Beijing Outstanding Young Scientist Program (grant number BJJWZYJH01201910019026) and the China Postdoctoral Science Foundation (grant number 2019M660867).

Author information

Authors and Affiliations

Authors

Contributions

F.Q. and S.Y. designed and supervised the study and revised the paper. S.Y. and Y.W. performed the pangenome study and gene association analysis. Q.H., M.W., S.W., X.F., C.Z., S.L. and Z.Y. performed experiments for gene cloning and collected phenotypic data in the fields. J.C. contributed to transgenic maize generations. N.Y., J.Y. and X.Y. provided maize materials and valuable discussions and edited the paper. All the authors read and approved the final paper.

Corresponding author

Correspondence to Feng Qin.

Ethics declarations

Competing interests

Two patent applications related to this work have been submitted by F.Q., S.Y., Q.H. and M.W. The other authors declare no competing interests.

Peer review

Peer review information

Nature Genetics thanks Klaus Mayer and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 Drought resistance phenotypes analysis in pan-genome and population level.

a, Representative plant photographs of the 23 NAM founders16, excluding B73 (presented in Tian et al.25), CML247 and CML277 (lacking sufficient seeds), grown under WW and WS2 conditions in fields. The ears are harvested from the plants grown under WW, WS1 and WS2 conditions. They are ordered according to the average yield under WS1 and WS2. The size of bubble corresponds to the average yield and days of ASI under WS1 and WS2 conditions, and seedling survival rate (SR) after drought stress. b, Integrative illustration of multiple drought resistance phenotypes of 50 germplasms. The 50 germplasms include 25 newly assembled germplasms in this study, 26 NAM founders16, Mo1727 and CIMBL5525, excluding SY1032, CML247 and CML277 (lacking sufficient seeds). Plant drought resistance at the seedling stage is scored according to the survival rate after drought. All the germplasms are classified into three categories: ‘Resistant’, ‘Intermediate’ and ‘Sensitive’ as Fig. 1c. ‘WW yield’, grain yield under WW; ‘WS yield’, (yield_WS1+yield_WS2)/2; ‘dASI’, (ASI_WS1+ASI_WS2)/2 − (ASI_WW). The stacked bar value (left y-axis) represents WW yield (light blue) and WS yield (bright blue) for each germplasm. Scatter points (right y-axis) show the dASI for each germplasm. Red inverted triangles indicate six drought-resistant germplasms referring to three criteria: 1) resistance or intermediate resistance at the seedling stage; 2) yield > 25g (above the cyan dotted line) under WS; 3) dASI < 2 days (below the red dotted line). c, Statistic of ASI under WW and WS conditions. Statistical significance was determined by paired t-test. d, e, Correlation between ASI and yield per plant (d), seedling SR and yield per plant (e) under WW and WS conditions. Pearson correlation coefficient (r) is used to evaluate a linear correlation between two traits. Statistical significance was determined by a two-sided t-test. In c–e, the phenotypes are obtained from 228 temperate germplasms9.

Source data

Extended Data Fig. 2 Plant survival rate assay for drought resistance at seedling stage for NAM founders.

The seedling SR of each germplasm is compared with B73, respectively. Fifteen plants of each germplasm are compared in each pot, with at least two replicates. Representative photographs are taken before and after drought treatment.

Extended Data Fig. 3 Pipeline of de novo assembly for 25 genomes.

Approximately 98× genomic DNA long-read sequencing data generated by Oxford Nanopore Technology (ONT) were initially corrected by NextDenovo54. Then, the ONT reads and about 60× genomic DNA short-read sequencing data generated by Illumina NovaSeq 6000 were employed to polish draft contigs. RagTag58 was utilized to address inter-chromosomal assembly errors and construct chromosome-scale pseudomolecules (See Methods).

Extended Data Fig. 4 Pipeline of gene and repeat sequence annotation.

a, Gene annotation pipeline. Both evidence-based and ab initio gene prediction methods were used for each genome. For the evidence-based method, transcript assembly programs were used for transcript prediction based on RNA-seq data of five tissues (See Methods) and Mikado67 was further employed to select the best transcripts. For ab initio gene prediction, the mapped RNA-seq reads and protein sequences generated by evidence-based method were used as inputs of BRAKER271 for gene predictions. The non-redundant gene models from both methods were further refined by PASA274 based on maize Iso-seq76 and Expressed Sequence Tags (ESTs) from GenBank (https://www.ncbi.nlm.nih.gov/genbank/). The final gene models were generated after post-processing. b, Repeat sequence annotation pipeline. For long terminal repeat (LTR), terminal inverted repeat (TIR), and Helitron annotation, novel transposable elements (TEs) for each genome were identified using RepeatMasker72 and EDTA83 by comparing with a known TE library (METC, https://github.com/oushujun/MTEC). After merging and removing redundant novel TEs from 25 novel assembly genomes, a non-redundant novel library was created. This novel TE library was further aggregated with the MTEC library to form the final Pan-TE library. LTRs, TIRs, and helitrons were finally annotated based on the Pan-TE library. Besides, short interspersed nuclear elements (SINEs) were identified by SineFinder84, while long interspersed nuclear elements (LINEs) and non-TE repeats were directly annotated by RepeatMasker and EDTA.

Extended Data Fig. 5 Pan-gene and Pan-SV analyses of 56 maize genomes.

a, b Modeling the size of maize pan-genes and core-genes (a), pan-gene families and core-gene families (b) when additional genomes are incorporated into the maize pan-genome. Genomes were sampled as 56 random combinations of each given number of genomes. Mean values are displayed with error bars representing ± SD. c, Observed (arrow) and expected (density) distribution of genes belonging to core and softcore gene families. The expected distribution is the proportion of genes belonging to core and softcore gene families by randomly sampling 1,344 protein-coding genes for 10,000 times. The statistical significance is determined by permutation test. d, Accumulation of different types of SVs, including insertion (INS), deletion (DEL), duplication (DUP), translocation (TRA), and inversion (INV), with the increase of genome numbers. The height of the stacked chart represents the total number of SVs, while the colored sections show the number of different types of SV. e, Upset plot showing the number of SVs identified by different methods. High-quality SVs refer to those identified by at least two methods. The horizontal bars represent the total number of SVs identified by each method. Vertical bars display the number of SVs identified by one or multiple methods, as indicated by the black dots below the x-axis. Black dots indicate the methods for the SVs identification. f, Distribution of pan-SV position, referring to the gene position in B73 (v5). ‘UTR’, untranslated region. ‘CDS’, coding region sequence. g, Proportion of pan-SVs of different lengths. h, Root gravitropism assay for B73 and CML333. Red arrow indicates the direction of gravity. The inclination angle of the root with the horizontal line was measured, 12 hours after rotation the petri dishes 90° to the right. Numbers above the x-axis indicate the sample size, and the statistical significance is determined by a two-sided t-test.

Source data

Extended Data Fig. 6 Pipeline for pangenome structural variant (pan-SV) identification.

Initial SV identification was based on a maize pan-genome comprising 56 high-quality assembled genomes. The Nanopore/PacBio long-read sequences of each germplasm were mapped to the B73 (v5) genome sequence and SVs were identified by Sniffles88 and CuteSV89. Meanwhile, SyRI86 and Smartie-sv87 were employed to identify SVs through sequence alignment of each assembled genome sequence to that of B73 (v5). SVs identified in NAM founders16 were merged into the final non-redundant SV dataset using Jasmine17.

Extended Data Fig. 7 ZmUGE2 contributes to seedling drought resistance.

a–c, Manhattan plot of the GWAS identifying the genetic loci associated with the ZmUGE2 expression levels under WW (a), WS1 (b) and WS2 (c) conditions. Variants located within ZmUGE2 and the 5-kb flanking sequence are indicated by red dots. d, Comparison of gene expression levels of ZmUGE2-Hap1 and ZmUGE2-Hap2 under WW and WS2 conditions. e, Comparison of gene expression levels of ZmUGE2 (left panel) and the survival rates (right panel) of the germplasms carrying different genotypes of ZmUGE2 based on the genotype of Indel no. 7434. ‘+’ indicates the presence of the 9-bp insertion; ‘−’ indicates its absence. f, Left panel: RT-qPCR analysis of relative ZmUGE2 transcript levels in WT and ZmUGE2-OE lines, normalized to the internal control gene ZmUBI. Mean values are displayed with error bars representing ± SD from three independent biological replicates. Right panel: Western blot analysis of the ZmUGE2-GFP protein in the ZmUGE2-OE plants, with actin used as a loading control. Molecular weight markers are shown on the right. g, Schematic diagram of the CRISPR-targeted knockout genotype of zmuge2-KO lines. The gRNA target sequences and edits are indicated below the gene diagram. h, Water loss rate (%) of detached leaves at the indicated time points. For each genotype, four detached leaves are placed on a clean bench to dehydrate, and their weights are recorded periodically over an 8-hour period. Data represent the mean ± SD, based on three replicated experiments. Asterisks (*p < 0.05) indicate a significant difference between the WT and transgenic plants. i, Comparison of plant height between WT and ZmUGE2-OE plants. Numbers above the x-axis indicate the number of plants for each genotype. Numbers above the x-axis represent the number of germplasms for each genotype in d and e. In d–f, h and i, statistical significance is determined by a two-sided t-test.

Source data

Extended Data Fig. 8 ZmSIL2 plays a negative role in maize seedling drought resistance.

a-c, Manhattan plot of the GWAS identifying the genetic loci associated with ZmSIL2 expression levels under WW (a), WS1 (b) and WS2 (c) conditions. Variants located within ZmSIL2 and the 5-kb flanking sequence are indicated by red dots. d, Comparison of gene expression levels of ZmSIL2-Hap1 and ZmSIL2-Hap2 under WW and WS2 conditions. e, Comparison of gene expression levels of ZmSIL2 (left panel) and the survival rates (right panel) of the germplasms carrying different genotypes of ZmSIL2 based on SV no. 2606. ‘+’ indicates the presence of the 715-bp insertion; ‘−’ indicates its absence. f, Expression level of ZmSIL2 in WT and ZmSIL2-OE lines. Mean ± SD from three independent biological replicates. g, Schematic diagram of the CRISPR-targeted knockout genotype of ZmSIL2. The trihelical DNA-binding domain is highlighted, and the predicted α-helix regions are shown in cyan. h, Subcellular localization of ZmSIL2-GFP. Confocal microscopy images the expressed ZmSIL2-GFP protein in maize leaf protoplasts. Scale bar, 5 μm. i, Western blot analysis of the samples used for the ZmSIL2-GFP ChIP-seq analysis. ‘WT’, the samples of WT are prepared parallelly as negative controls. ‘Anti-H3’ indicates the nuclear fraction. j, Gene expression analysis of four ZmSIL2-regulated genes. For each gene, the gene structure is shown in the upper panel, and the RNA-seq reads graphs of WT and zmsil2-KO1 samples under WW and WS conditions are shown in the four tracks in the middle panel. The bottom panel shows RT-qPCR confirmation of the altered gene expression in ZmSIL2-OE and zmsil2-KO plants. Data are presented as mean values ± SD. In d and e, numbers above the x-axis represent the number of germplasms for each genotype. In h and i, the micrographs and immunoblot are the representative results from at least two independent biological replicates. In d–f and j, statistical significance is determined by a two-sided t-test.

Source data

Extended Data Fig. 9 ZmASI3 contributes to maize drought resistance at the flowering stage.

a, The putative SNF2 helicase domains (ZmASI3-1267 aa) encoded by T01 and the truncated one (ZmASI3-1151) encoded by T02 are fused with Glutathione S-transferase (GST) for protein expression and purification in E. coli, respectively. b, Protein gel electrophoresis for the purified GST, GST-ZmASI3-1267, and GST-ZmASI3-1151 proteins. Molecular weight markers are indicated on the right side. c, In vitro ATPase activity assay. GST, GST-ZmASI3-1267, and GST-ZmASI3-1151 proteins were used in the assay. ‘+’ indicates 0.2 μg of protein, while ‘++’ indicates 0.4 μg. Mean values are displayed with error bars representing ± SD from three independent biological replicates. d, Schematic illustration for the in vitro nucleosome remodeling assay. The previously inaccessible Dpn II site (yellow) became exposed due to localized nucleosome sliding. e, Nucleosome remodeling assay. The GST-ZmASI3-1267 and GST-ZmASI3-1151 proteins are incubated with the pre-assembled nucleosome over the indicated time periods. Bands near 225 bp represent intact nucleosomal DNA, while bands near 200 bp indicate fragment mobility cleaved by Dpn II after nucleosome remodeling, indicated by the red asterisk. GST protein was used as a negative control. The gel electrophoresis is the representative result from two independent biological replicates. f, Presence and absence of SV1-5 within intron 5 of ZmASI3 among different haplotypes. Cyan indicates the presence of an SV, while gray indicates its absence. g, Schematic diagram of the CRISPR-targeted knockout genotype of ZmASI3. Violin plots of days to anthesis (DTA) (h) and days to silking (DTS) (i) in WT and zmasi3-KO plants under WW and WS field conditions. In h and i, Numbers above the x-axis represent the number of plants for each genotype, and statistical significance is determined by a two-sided t-test.

Source data

Supplementary information

Supplementary Information

Supplementary Note, Tables 1–16 and refs. 1–13.

Reporting Summary

Peer Review File

Supplementary Tables

Supplementary Tables 1–16.

Source data

Figs. 1–5 and Extended Data Figs. 1, 5 and 7–9

Statistical source data for Figs. 1–5 and Extended Data Figs. 1, 5 and 7–9.

Fig. 3g and Extended Data Figs. 7–9

Unprocessed western blots and gels for Fig. 3g and Extended Data Figs. 7f, 8i and 9b,e.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Yang, S., Wang, Y., Huang, Q. et al. A pangenome of maize provides genetic insights into drought resistance. Nat Genet 57, 2831–2841 (2025). https://doi.org/10.1038/s41588-025-02378-w

Download citation

  • Received:

  • Accepted:

  • Published:

  • Version of record:

  • Issue date:

  • DOI: https://doi.org/10.1038/s41588-025-02378-w

Search

Quick links

Nature Briefing: Translational Research

Sign up for the Nature Briefing: Translational Research newsletter — top stories in biotechnology, drug discovery and pharma.

Get what matters in translational research, free to your inbox weekly. Sign up for Nature Briefing: Translational Research