Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Resource
  • Published:

Transcription factor binding divergence drives transcriptional and phenotypic variation in maize

Abstract

Regulatory elements are essential components of plant genomes that have shaped the domestication and improvement of modern crops. However, their identity, function and diversity remain poorly characterized, limiting our ability to harness their full power for agricultural advances using induced or natural variation. Here we mapped transcription factor (TF) binding for 200 TFs from 30 families in two distinct maize inbred lines historically used in maize breeding. TF binding comparison revealed widespread differences between inbreds, driven largely by structural variation, that correlated with gene expression changes and explained complex quantitative trait loci such as Vgt1, an important determinant of flowering time, and DICE, an herbivore resistance enhancer. CRISPR–Cas9 editing of TF binding regions validated the function and structure of regulatory regions at various loci controlling plant architecture and biotic resistance. Our maize TF binding catalogue identifies functional regulatory regions and enables collective and comparative analysis, highlighting its value for agricultural improvement.

This is a preview of subscription content, access via your institution

Access options

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Fig. 1: Large-scale DAP-seq profiling of maize TFs provides high-quality genome-wide binding site data for B73v5 and Mo17.
Fig. 2: Maize TF diversity panel reveals functional cis-regulatory modules.
Fig. 3: Genotype-specific peaks are prevalent in B73 and Mo17 and explain genetically defined QTLs.
Fig. 4: DoubleDAP-seq analysis of Gln-Ala-Arg-type bHLHs.
Fig. 5: CRISPR-induced cis-regulatory variation drives expression and phenotypic differences.

Similar content being viewed by others

Data availability

The raw and processed data for the DAP-seq datasets generated in this study are available from NCBI GEO under accession number GSE275897. The processed data are available via Zenodo at https://doi.org/10.5281/zenodo.14991915 (ref. 97). Comparative peak data tracks from the different inbred lines can be viewed on JBrowse2 at http://hlab.bio.nyu.edu/projects/zm_crm/.

Code availability

The analysis pipelines are available via GitHub at https://github.com/hlab1/Maize_B73_Mo17_DAP.

References

  1. Kim, S. & Wysocka, J. Deciphering the multi-scale, quantitative cis-regulatory code. Mol. Cell 83, 373–392 (2023).

    Article  CAS  PubMed  Google Scholar 

  2. Rodriguez-Leal, D., Lemmon, Z. H., Man, J., Bartlett, M. E. & Lippman, Z. B. Engineering quantitative trait variation for crop improvement by genome editing. Cell 171, 470–480.e8 (2017).

    Article  CAS  PubMed  Google Scholar 

  3. Liang, Y., Liu, H. J., Yan, J. & Tian, F. Natural variation in crops: realized understanding, continuing promise. Annu. Rev. Plant Biol. 72, 357–385 (2021).

    Article  CAS  PubMed  Google Scholar 

  4. Wang, X. et al. Dissecting cis-regulatory control of quantitative trait variation in a plant stem cell circuit. Nat. Plants 7, 419–427 (2021).

    Article  CAS  PubMed  Google Scholar 

  5. Meyer, R. S. & Purugganan, M. D. Evolution of crop species: genetics of domestication and diversification. Nat. Rev. Genet. 14, 840–852 (2013).

    Article  CAS  PubMed  Google Scholar 

  6. Liu, L. et al. Enhancing grain-yield-related traits by CRISPR–Cas9 promoter editing of maize CLE genes. Nat. Plants 7, 287–294 (2021).

    Article  CAS  PubMed  Google Scholar 

  7. Sun, Y. et al. Divergence in the ABA gene regulatory network underlies differential growth control. Nat. Plants 8, 549–560 (2022).

    Article  CAS  PubMed  Google Scholar 

  8. Wallace, J. G., Rodgers-Melnick, E. & Buckler, E. S. On the road to breeding 4.0: unraveling the good, the bad, and the boring of crop quantitative genomics. Annu. Rev. Genet. 52, 421–444 (2018).

    Article  CAS  PubMed  Google Scholar 

  9. Kremling, K. A. G. et al. Dysregulation of expression correlates with rare-allele burden and fitness loss in maize. Nature 555, 520–523 (2018).

    Article  CAS  PubMed  Google Scholar 

  10. Hake, S. & Ross-Ibarra, J. Genetic, evolutionary and plant breeding insights from the domestication of maize. eLife 4, e05861 (2015).

    Article  PubMed  PubMed Central  Google Scholar 

  11. Rodgers-Melnick, E., Vera, D. L., Bass, H. W. & Buckler, E. S. Open chromatin reveals the functional maize genome. Proc. Natl Acad. Sci. USA 113, E3177–E3184 (2016).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  12. O’Malley, R. C. et al. Cistrome and epicistrome features shape the regulatory DNA landscape. Cell 165, 1280–1292 (2016); erratum 166, 1598 (2016).

  13. Zhang, Y. et al. Evolutionary rewiring of the wheat transcriptional regulatory network by lineage-specific transposable elements. Genome Res. 31, 2276–2289 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  14. Baumgart, L. A. et al. Recruitment, rewiring, and deep conservation in flowering plant gene regulation. Preprint at bioRxiv https://doi.org/10.1101/2024.10.08.617089 (2025).

  15. Zhan, J. et al. Opaque-2 regulates a complex gene network associated with cell differentiation and storage functions of maize endosperm. Plant Cell 30, 2425–2446 (2018).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  16. Wallace, J. G. et al. Association mapping across numerous traits reveals patterns of functional variation in maize. PLoS Genet. 10, e1004845 (2014).

    Article  PubMed  PubMed Central  Google Scholar 

  17. Mural, R. V. et al. Association mapping across a multitude of traits collected in diverse environments in maize. GigaScience 11, giac080 (2022).

    Article  PubMed  PubMed Central  Google Scholar 

  18. Marand, A. P., Eveland, A. L., Kaufmann, K. & Springer, N. M. cis-Regulatory elements in plant development, adaptation, and evolution. Annu. Rev. Plant Biol. 74, 111–137 (2023).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  19. Tu, X. et al. Reconstructing the maize leaf regulatory network using ChIP-seq data of 104 transcription factors. Nat. Commun. 11, 5089 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  20. Jores, T. et al. Plant enhancers exhibit both cooperative and additive interactions among their functional elements. Plant Cell 36, 2570–2586 (2024).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  21. Hufford, M. B. et al. De novo assembly, annotation, and comparative analysis of 26 diverse maize genomes. Science 373, 655–662 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  22. Ricci, W. A. et al. Widespread long-range cis-regulatory elements in the maize genome. Nat. Plants 5, 1237–1249 (2019).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  23. Crisp, P. A. et al. Stable unmethylated DNA demarcates expressed genes and their cis-regulatory space in plant genomes. Proc. Natl Acad. Sci. USA 117, 23991–24000 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  24. Song, B. et al. Conserved noncoding sequences provide insights into regulatory sequence and loss of gene expression in maize. Genome Res. 31, 1245–1257 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  25. Wills, D. M. et al. From many, one: genetic control of prolificacy during maize domestication. PLoS Genet. 9, e1003604 (2013).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  26. Salvi, S. et al. Conserved noncoding genomic sequences associated with a flowering-time quantitative trait locus in maize. Proc. Natl Acad. Sci. USA 104, 11376–11381 (2007).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  27. Iotchkova, V. et al. GARFIELD classifies disease-relevant genomic features through integration of functional annotations with association signals. Nat. Genet. 51, 343–353 (2019).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  28. Eichten, S. R. et al. B73-Mo17 near-isogenic lines demonstrate dispersed structural variation in maize. Plant Physiol. 156, 1679–1690 (2011).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  29. Buckler, E. S. et al. The genetic architecture of maize flowering time. Science 325, 714–718 (2009).

    Article  CAS  PubMed  Google Scholar 

  30. Mickelson, S. M., Stuber, C. S., Senior, L. & Kaeppler, S. M. Quantitative trait loci controlling leaf and tassel traits in a B73 × Mo17 population of maize. Crop Sci. 42, 1902–1909 (2002).

    Article  CAS  Google Scholar 

  31. Ordas, B. et al. Mapping of QTL for resistance to the Mediterranean corn borer attack using the intermated B73 × Mo17 (IBM) population of maize. Theor. Appl. Genet. 119, 1451–1459 (2009).

    Article  CAS  PubMed  Google Scholar 

  32. Goering, R., Larsen, S., Tan, J., Whelan, J. & Makarevitch, I. QTL mapping of seedling tolerance to exposure to low temperature in the maize IBM RIL population. PLoS ONE 16, e0254437 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  33. Sun, S. et al. Extensive intraspecific gene order and gene structural variations between Mo17 and other maize genomes. Nat. Genet. 50, 1289–1295 (2018).

    Article  CAS  PubMed  Google Scholar 

  34. Song, B. et al. AnchorWave: sensitive alignment of genomes with high sequence diversity, extensive structural polymorphism, and whole-genome duplication. Proc. Natl Acad. Sci. USA 119, e2113075119 (2022).

    Article  CAS  PubMed  Google Scholar 

  35. Zhao, H. et al. CrossMap: a versatile tool for coordinate conversion between genome assemblies. Bioinformatics 30, 1006–1007 (2014).

    Article  PubMed  Google Scholar 

  36. Liang, Y. et al. ZmMADS69 functions as a flowering activator through the ZmRap2.7-ZCN8 regulatory module and contributes to maize flowering time adaptation. New Phytol. 221, 2335–2347 (2019).

    Article  CAS  PubMed  Google Scholar 

  37. Tian, D. et al. GWAS Atlas: a curated resource of genome-wide variant–trait associations in plants and animals. Nucleic Acids Res. 48, D927–D932 (2020).

    Article  CAS  PubMed  Google Scholar 

  38. Galli, M. et al. The DNA binding landscape of the maize AUXIN RESPONSE FACTOR family. Nat. Commun. 9, 4526 (2018).

    Article  PubMed  PubMed Central  Google Scholar 

  39. Maekawa, S. et al. The Arabidopsis ubiquitin ligases ATL31 and ATL6 control the defense response as well as the carbon/nitrogen response. Plant Mol. Biol. 79, 217–227 (2012).

    Article  CAS  PubMed  Google Scholar 

  40. Zhou, P., Hirsch, C. N., Briggs, S. P. & Springer, N. M. Dynamic patterns of gene expression additivity and regulatory variation throughout maize development. Mol. Plant 12, 410–425 (2019).

    Article  CAS  PubMed  Google Scholar 

  41. Guillotin, B. et al. A pan-grass transcriptome reveals patterns of cellular divergence in crops. Nature 617, 785–791 (2023).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  42. Li, M. et al. Double DAP-seq uncovered synergistic DNA binding of interacting bZIP transcription factors. Nat. Commun. 14, 2600 (2023).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  43. Heim, M. A. et al. The basic helix–loop–helix transcription factor family in plants: a genome-wide study of protein structure and functional diversity. Mol. Biol. Evol. 20, 735–747 (2003).

    Article  CAS  PubMed  Google Scholar 

  44. de Martin, X., Sodaei, R. & Santpere, G. Mechanisms of binding specificity among bHLH transcription factors. Int. J. Mol. Sci. 22, 9150 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  45. Gaillochet, C. et al. Control of plant cell fate transitions by transcriptional and hormonal signals. eLife 6, e30135 (2017).

    Article  PubMed  PubMed Central  Google Scholar 

  46. Crawford, B. C. & Yanofsky, M. F. HALF FILLED promotes reproductive tract development and fertilization efficiency in Arabidopsis thaliana. Development 138, 2999–3009 (2011).

    Article  CAS  PubMed  Google Scholar 

  47. Gallavotti, A. et al. The role of barren stalk1 in the architecture of maize. Nature 432, 630–635 (2004).

    Article  CAS  PubMed  Google Scholar 

  48. Jumper, J. et al. Applying and improving AlphaFold at CASP14. Proteins 89, 1711–1721 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  49. Gremski, K., Ditta, G. & Yanofsky, M. F. The HECATE genes regulate female reproductive tract development in Arabidopsis thaliana. Development 134, 3593–3601 (2007).

    Article  CAS  PubMed  Google Scholar 

  50. Aguirre, L., Hendelman, A., Hutton, S. F., McCandlish, D. M. & Lippman, Z. B. Idiosyncratic and dose-dependent epistasis drives variation in tomato fruit size. Science 382, 315–320 (2023).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  51. Whipple, C. J. et al. A conserved mechanism of bract suppression in the grass family. Plant Cell 22, 565–578 (2010).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  52. McSteen, P. et al. barren inflorescence2 encodes a co-ortholog of the PINOID serine/threonine kinase and is required for organogenesis during inflorescence and vegetative development in maize. Plant Physiol. 144, 1000–1011 (2007).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  53. Zheng, L. et al. Prolonged expression of the BX1 signature enzyme is associated with a recombination hotspot in the benzoxazinoid gene cluster in Zea mays. J. Exp. Bot. 66, 3917–3930 (2015).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  54. Frey, M. et al. Analysis of a chemical plant defense mechanism in grasses. Science 277, 696–699 (1997).

    Article  CAS  PubMed  Google Scholar 

  55. Engelhorn, J. et al. Genetic variation at transcription factor binding sites largely explains phenotypic heritability in maize. Preprint at bioRxiv https://doi.org/10.1101/2023.08.08.551183 (2024).

  56. Parvathaneni, R. K. et al. The regulatory landscape of early maize inflorescence development. Genome Biol. 21, 165 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  57. Cahn, J. et al. MaizeCODE reveals bi-directionally expressed enhancers that harbor molecular signatures of maize domestication. Nat. Commun. 15, 10854 (2024).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  58. Sun, Y. et al. 3D genome architecture coordinates trans and cis regulation of differentially expressed ear and tassel genes in maize. Genome Biol. 21, 143 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  59. Savadel, S. D. et al. The native cistrome and sequence motif families of the maize ear. PLoS Genet. 17, e1009689 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  60. Oka, R. et al. Genome-wide mapping of transcriptional enhancer candidates using DNA and chromatin features in maize. Genome Biol. 18, 137 (2017).

    Article  PubMed  PubMed Central  Google Scholar 

  61. Marand, A. P. et al. The genetic architecture of cell type–specific cis regulation in maize. Science 388, eads6601 (2025).

    Article  CAS  PubMed  Google Scholar 

  62. Hendelman, A. et al. Conserved pleiotropy of an ancient plant homeobox gene uncovered by cis-regulatory dissection. Cell 184, 1724–1739.e16 (2021).

  63. Bartlett, A. et al. Mapping genome-wide transcription-factor binding sites using DAP-seq. Nat. Protoc. 12, 1659–1672 (2017).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  64. Burdo, B. et al. The Maize TFome—development of a transcription factor open reading frame collection for functional genomics. Plant J. 80, 356–366 (2014).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  65. Yilmaz, A. et al. GRASSIUS: a platform for comparative regulatory genomics across the grasses. Plant Physiol. 149, 171–180 (2009).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  66. Mirdita, M. et al. ColabFold: making protein folding accessible to all. Nat. Methods 19, 679–682 (2022).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  67. Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  68. Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  69. Guo, Y., Mahony, S. & Gifford, D. K. High resolution genome wide binding event finding and motif discovery reveals transcription factor spatial binding constraints. PLoS Comput. Biol. 8, e1002638 (2012).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  70. Yu, G., Wang, L. G. & He, Q. Y. ChIPseeker: an R/Bioconductor package for ChIP peak annotation, comparison and visualization. Bioinformatics 31, 2382–2383 (2015).

    Article  CAS  PubMed  Google Scholar 

  71. Machanick, P. & Bailey, T. L. MEME-ChIP: motif analysis of large DNA datasets. Bioinformatics 27, 1696–1697 (2011).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  72. Shen, Z., Hoeksema, M. A., Ouyang, Z., Benner, C. & Glass, C. K. MAGGIE: leveraging genetic variation to identify DNA sequence motifs mediating transcription factor binding and function. Bioinformatics 36, i84–i92 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  73. Grzybowski, M. W. et al. A common resequencing-based genetic marker data set for global maize diversity. Plant J. 113, 1109–1121 (2023).

    Article  CAS  PubMed  Google Scholar 

  74. Liu, X., Huang, M., Fan, B., Buckler, E. S. & Zhang, Z. Iterative usage of fixed and random effect models for powerful and efficient genome-wide association studies. PLoS Genet. 12, e1005767 (2016).

    Article  PubMed  PubMed Central  Google Scholar 

  75. Yin, L. et al. rMVP: a memory-efficient, visualization-enhanced, and parallel-accelerated tool for genome-wide association study. Genomics Proteomics Bioinformatics 19, 619–628 (2021).

    Article  PubMed  PubMed Central  Google Scholar 

  76. Dong, Z. et al. Necrotic upper tips1 mimics heat and drought stress and encodes a protoxylem-specific transcription factor in maize. Proc. Natl Acad. Sci. USA 117, 20908–20919 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  77. Dai, D. et al. Paternal imprinting of dosage-effect defective1 contributes to seed weight xenia in maize. Nat. Commun. 13, 5366 (2022).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  78. Wu, H. et al. NAKED ENDOSPERM1, NAKED ENDOSPERM2, and OPAQUE2 interact to regulate gene networks in maize endosperm development. Plant Cell 36, 19–39 (2023).

    Article  PubMed  PubMed Central  Google Scholar 

  79. Bang, S. et al. WUSCHEL-dependent chromatin regulation in maize inflorescence development at single-cell resolution. Preprint at bioRxiv https://doi.org/10.1101/2024.05.13.593957 (2024).

  80. Kielbasa, S. M., Wan, R., Sato, K., Horton, P. & Frith, M. C. Adaptive seeds tame genomic sequence comparison. Genome Res. 21, 487–493 (2011).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  81. Robinson, J. T. et al. Integrative genomics viewer. Nat. Biotechnol. 29, 24–26 (2011).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  82. Diesh, C. et al. JBrowse 2: a modular genome browser with views of synteny and structural variation. Genome Biol. 24, 74 (2023).

    Article  PubMed  PubMed Central  Google Scholar 

  83. Ramirez, F., Dundar, F., Diehl, S., Gruning, B. A. & Manke, T. deepTools: a flexible platform for exploring deep-sequencing data. Nucleic Acids Res. 42, W187–W191 (2014).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  84. Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  85. Shumate, A. & Salzberg, S. L. Liftoff: accurate mapping of gene annotations. Bioinformatics 37, 1639–1643 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  86. Goel, M., Sun, H., Jiao, W. B. & Schneeberger, K. SyRI: finding genomic rearrangements and local sequence differences from whole-genome assemblies. Genome Biol. 20, 277 (2019).

    Article  PubMed  PubMed Central  Google Scholar 

  87. Li, B. & Dewey, C. N. RSEM: accurate transcript quantification from RNA-seq data with or without a reference genome. BMC Bioinformatics 12, 323 (2011).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  88. Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).

    Article  CAS  PubMed  Google Scholar 

  89. R Core Team R: A Language and Environment for Statistical Computing (R Foundation for Statistical Computing, 2023); https://www.R-project.org/

  90. Soneson, C., Love, M. I. & Robinson, M. D. Differential analyses for RNA-seq: transcript-level estimates improve gene-level inferences. F1000Res. 4, 1521 (2015).

    Article  PubMed  Google Scholar 

  91. Gu, Z., Eils, R. & Schlesner, M. Complex heatmaps reveal patterns and correlations in multidimensional genomic data. Bioinformatics 32, 2847–2849 (2016).

    Article  CAS  PubMed  Google Scholar 

  92. Gu, Z. Complex heatmap visualization. iMeta 1, e43 (2022).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  93. Noshay, J. M. et al. Stability of DNA methylation and chromatin accessibility in structurally diverse maize genomes. G3 11, jkab190 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  94. Ge, S. X., Jung, D. & Yao, R. ShinyGO: a graphical gene-set enrichment tool for animals and plants. Bioinformatics 36, 2628–2629 (2020).

    Article  CAS  PubMed  Google Scholar 

  95. Xing, H. L. et al. A CRISPR/Cas9 toolkit for multiplex genome editing in plants. BMC Plant Biol. 14, 327 (2014).

    Article  PubMed  PubMed Central  Google Scholar 

  96. Chen, Z., Debernardi, J. M., Dubcovsky, J. & Gallavotti, A. The combination of morphogenic regulators BABY BOOM and GRF-GIF improves maize transformation efficiency. Preprint at bioRxiv https://doi.org/10.1101/2022.09.02.506370 (2022).

  97. Galli, M., Huang, S.-s. C. & Gallavotti, A. Data from: Transcription factor binding divergence drives transcriptional and phenotypic variation in maize. Zenodo https://doi.org/10.5281/zenodo.14991915 (2025).

Download references

Acknowledgements

We thank R. Macrae for helpful comments and suggestions on the manuscript and maizeGDB for data curation of informatics resources. We acknowledge the Office of Advanced Research Computing at Rutgers, the State University of New Jersey, for providing access to the Amarel cluster. This work was supported in part through the NYU IT High Performance Computing resources, services and staff expertise. This work was supported by NSF Plant Genome Research Project grant no. IOS-1916804 to A.G. and S.-s.C.H. and NIH award no. R35GM138143 to S.-s.C.H. A.C. is a Fulbright Scholar and recipient of a Meta Prevorsek Turner Fellowship. This material was also in part based on work supported by the Center for Bioenergy Innovation, US Department of Energy, Office of Science, Biological and Environmental Research Program under award no. ERKP886 and the Oak Ridge National Laboratory Director’s R&D Program (M.L.). Oak Ridge National Laboratory is managed by UT-Battelle, LLC, for the Office of Science of the US Department of Energy under contract no. DE-AC05-00OR22725.

Author information

Authors and Affiliations

Authors

Contributions

M.G., S.-s.C.H. and A.G. conceptualized the project, analysed the data and prepared the paper. M.G., Z.C., T.G., M.L. and S.-s.C.H. conducted the TF binding analysis and visualization. S.-s.C.H. conducted the GWAS analysis. M.G., Z.C., A.C., J.G., F.F. and A.G. conducted the CRISPR experiments and analysis. M.G., N.S., X.Z., Y.D., G.S., J.W.W., G.C., C.W. and H.F.K. provided reagents.

Corresponding authors

Correspondence to Shao-shan Carol Huang or Andrea Gallavotti.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Plants thanks James Schnable, Feng Tian and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 DAP-seq data samples broadly across maize TF families.

a, Stacked bar graph showing 36 TF families in which at least one family member was tested in DAP-seq. The number of members tested for each is shown with a colored bar; the number of members for which high quality datasets were obtained is shown in parentheses next to the family name. b, Scatterplot showing the number of B73 and Mo17 peaks for each TF. c, Boxplots showing the distribution of the number of target genes obtained for each TF, n = 200 d, Heatmap showing Pearson correlation of TF binding profiles genome wide (10-bp bins). Side annotation colors correspond to TF structural families. TFs in the diversity panel are indicated with an asterisk. e, Average number of reads in peaks and number of peaks with GEM motif are indicated. Each datapoint corresponds to a DAP-seq dataset from either B73 or Mo17. Whiskers in boxplots in panels c and e indicate the minimum and maximum values; central lines correspond to medians and box boundaries denote the upper (25th percentile) and lower (75th percentile) quartiles with n = 200 TFs.

Extended Data Fig. 2 Enrichment analysis of selected TFs and GWAS traits.

a, GO enrichment analysis of putative target genes assigned to peaks of selected TFs. b, Bubble plot showing enrichment for NAM panel GWAS SNPs associated with branch zone, cob length, and leaf angle that overlapped SBP30/UB3 DAP-seq peaks (right panel; red boxed areas) and bubble plot showing MADS73 DAP-seq peaks were significantly enriched for SNPs associated with several flowering time-related traits (left panel; black boxed areas). c, Bubble plot showing enrichment for Wisconsin Diversity panel GWAS SNPs associated with branching phenotypes that overlapped SBP30/UB3 DAP-seq peaks (right panel; red boxed areas) and bubble plot showing MADS73 DAP-seq peaks were significantly enriched for SNPs associated with flowering time-related traits (left panel; black boxed areas).

Extended Data Fig. 3 B73 TF diversity panel and DAP-CRM analysis.

a, Heatmap showing Pearson correlation of genome-wide binding profiles for 66 TFs in the B73 TF diversity panel. b, Phylogeny, tissue-specific RNA-expression (TPM; transcripts per million), and binding motifs of selected SBP TFs tested in maize DAP-seq. Expression patterns are often different for family members with similar motifs. c, Genome browser screenshot of SBP TFs shown in b at the TSH1 locus. d, Histogram showing the size distribution of DAP-CRMs. The dotted line indicates the average size of all DAP-CRMs. e, Histogram of the number of TFs per DAP-CRM. The black dotted line indicates the average number of TFs per DAP-CRM. The inset shows a close-up on the number of TFs from 20-63 TFs per DAP-CRM. f, Frequency of DAP-CRMs versus randomly shuffled CRMs of the same size within a 3 kb window surrounding the TSS.

Extended Data Fig. 4 Genome browser views of composite DAP-seq peaks.

a, B73v5 IGV genome browser view of the Vgt1 and RAP2.7 locus showing binding by DAP-seq TFs. Lower panel shows close-up of DAP-CRMs with binding by TFs not displayed as main tracks due to space constraints. b, Genome browser view of the UNBRANCHED2 (UB2) locus showing binding by DAP-seq TFs. c, Genome browser view of the INDETERMINATE GAMETOPHYTE1 (IG1) locus showing binding site location of DAP-seq TFs. In each panel the number below the DAP-CRM track indicates the number of TF peaks present in the DAP-CRM. Please visit website genome browser for a display of all TF tracks http://hlab.bio.nyu.edu/projects/zm_crm/.

Extended Data Fig. 5 Quality assessment of B73 and Mo17 DAP-seq datasets.

a, Pearson correlation of TF binding profiles for B73 and Mo17 genomes. Most B73 peak datasets (rows annotated in yellow) clustered with Mo17 peak datasets for the same TFs lifted to B73 (rows annotated in teal), as indicated by the magenta boxes. b, Percentage of non-crossmappable peaks (coordinates could not be lifted) that overlapped with “Non-Alignable” region (NotAligned), duplicated regions (DUP), or structural variants (SV, insertions or deletions greater than 50 bp). Each datapoint corresponds to the percentage of peaks from an individual TF overlapping the indicated category, n = 200. Whiskers in boxplots in panels b-d indicate the minimum and maximum values; central lines correspond to medians and box boundaries denote the upper (25th percentile) and lower (75th percentile) quartiles with n = 200 TFs. c, Upper: scatterplots showing no difference in average reads per peak or percentage of peaks with motif for shared and specific datasets. Each dot corresponds to a single TF. Lower: distribution of average reads per peak and percentage of peaks with motif for shared and specific datasets. n = 66 d, Percentage of peaks within each category that overlapped SVs and transposable elements (TEs), n = 200.

Extended Data Fig. 6 Shared and genotype-specific peaks.

a, Percentage of TF diversity panel peaks also called as peaks in Mo17 (shared, grey) and those called only in B73v5 but not Mo17 (B73-specific, blue). b, Percentage of TF diversity panel peaks also called as peaks in B73 (shared, grey) and those called only in Mo17 but not B73 (Mo17-specific, green). Mo17 datasets show a similar percentage of Mo17-specific and Mo17-B73 shared peaks relative to those seen for B73-specific and B73-Mo17 shared peaks (TF order is same as in a). c, JBrowse2 genome browser view showing similar binding intensity of MADS69 in the 5’UTR of CYCLIN13 (Zm00001eb193830) in both genotypes. d, scatterplot showing good correlation of MADS69 reads in B73v5 compared to Mo17 MADS69 reads converted to B73v5 coordinates. e, JBrowse2 view of the MATE18 locus showing B73-specific peaks residing near a GWAS hit. Note that the B73-specific sequence insertion is not annotated as a TE. f, B73-specific peaks more often contain more than four SNPs per peak (top plot) or more SNPs per peak center (20 bp region surrounding the peak summit) (bottom plot). Shared peaks more often contain less than four SNPs per peak (top plot) or per the 20 bp region surrounding the peak summit (bottom plot). Data points represent individual TFs, n = 65. For f and g, whiskers indicate the minimum and maximum values; central lines correspond to medians and box boundaries denote the upper (25th percentile) and lower (75th percentile) quartiles. g, Overlap of B73-specific peaks and B73-Mo17 shared peaks with orthogonal datasets or DAP-CRMs. Both B73-specific peaks and shared peaks show statistically significant enrichment (p value < 0.0001, two-tailed t-test) relative to randomly shuffled peaks, providing support that both B73-specific peaks and shared peaks could be functional. Data points represent individual TFs, n = 200.

Extended Data Fig. 7 Peak positional variation among genotypes.

a, Comparative JBrowse2 genome browser view showing shared peaks of B73 and Mo17 at the MADS67 locus. b, JBrowse2 genome browser view showing TF binding site positional variants at the ZmATL34 locus. Lower heatmap shows ZmATL34 RNA expression levels (transcripts per million) from several tissues40. A 13-fold increase in Mo17 RNA expression was observed in radicle tissue compared to B73. c, Top: percentage of shared TF diversity panel promoter peaks (top 20%) that correspond to positional variants (PosV) greater than 500 bp or less than 500 bp. Bottom: percentage of top (shared and specific) promoter peaks that corresponded to shared PosVs or B73-specific peaks. Each datapoint corresponds to an individual TF, n = 65. Whiskers indicate the minimum and maximum values; central lines correspond to medians and box boundaries denote the upper (25th percentile) and lower (75th percentile) quartiles. d, Plot showing percentage of B73-Mo17 shared promoter peaks (<-10kb) that were associated with differentially expressed genes in various tissues from40. Genes associated with PosV<500 bp peaks are shown in dark pink, while those associated with PosV>500 peaks are shown in light pink. Fisher’s exact tests were performed to assess the enrichment of DEGs in PosV>500 bp genes relative to PosV<500 bp genes for each TF and tissue combination, and the resulting p-values were adjusted for multiple testing. e, Heatmap of -log10 FDR-adjusted p-values from one-sided Fisher’s exact tests, evaluating whether root cell type marker genes determined by single cell RNA-seq data41 are enriched for putative target genes associated with DAP-seq peaks shared between B73 and Mo17, compared to putative targets of B73-specific peaks.

Extended Data Fig. 8 DoubleDAP-seq analysis.

a, Schematic showing the doubleDAP-seq assay in which putative heterodimeric TF-DNA complexes can be pulled down and compared to results from a single protein DAP-seq assay to assess binding site specificity differences. b, Neighbor-joining phylogeny based on amino acid similarity of group VII (blue) and group VIII (green) bHLHs from maize and Arabidopsis. Members that were tested in DAP-seq are shown in bold. Three members that were tested in DAP-seq that did not yield any peaks are shown in italics. c, AlphaFold prediction of BHLH85 homodimer. d, Normalized RNA-seq expression (TPM; transcripts per million) of group VIII and VII bHLHs showing heterodimer pairs are co-expressed in many tissues. Normalized expression data from Walley et al.98. e, Venn diagram showing low degree of overlap between Q-A-R (subclade VIII) homodimers and Q-A-R:ZmSPT (subclade VII) heterodimers as exemplified by HALO-BHLH113 (Q-A-R bHLH) in single DAP and HALO-BHLH113:SBPTag-BHLH125/ZmSPT1 in doubleDAP. Only 2% of HALO-BHLH113 peaks were captured in the doubleDAP dataset indicating that the heterodimer configuration is preferred relative to the homodimer.

Extended Data Fig. 9 CRISPR editing of maize cis-regulatory regions influences phenotype.

a, Genome browser screenshot of TSH1 locus showing binding in upstream promoter region by many TFs. Grey shaded areas upstream of and within CRM169681 indicate regions that were deleted in alleles shown in b. b, Schematic showing two independent CRISPR alleles with deletions and inversions that eliminate at least five TF binding sites (colored bars; colors match TFs shown in genome browser screenshot in a). Lightly shaded grey areas indicate regions deleted in both alleles. c, Images of mature tassels for WT, two tsh1 promoter CRISPR alleles, and tsh1-ref mutant (coding region mutation51). Both CRISPR promoter alleles show outgrowth of the tassel sheath leaf (white arrows) that is not present in the WT (black arrow). Tassel branching is reduced in the tsh1-ref, and the CRISPR promoter alleles (white brackets) relative to WT (black bracket). d, SEMs of immature tassels of the CRISPR promoter alleles showing bract outgrowth (white arrowheads). e, qRT-PCR analysis of CRISPR promoter alleles showing reduced expression in immature tassels relative to WT immature tassels. Data are presented as mean values. Error bars represent standard deviation of three biological replicates for each edited allele (two biological replicates for wildtype). Data points correspond to individual biological replicates. Statistical significance determined by two-sided t-test. f, CRISPR editing of BIF2 3’UTR ARF binding sites. A cis-regulatory module (CRM18765) is situated downstream of the BIF2 gene. Within this CRM region, there is a strong ARF peak containing five ARF binding motifs (three TGTCs and two GACAs)38. Three single guide RNAs (gRNAs) were designed for CRISPR-Cas9 editing that specifically targeted the ARF motifs. Three deletion alleles were obtained. Homozygous plants of these alleles exhibited a weak bif2 phenotype with various degrees of severity (BIF2-crm1cr1 and BIF2-crm1cr3 are more severe than BIF2-crm1cr2) during the early ear development stage, characterized by partial barren patches (white arrows) on the ear primordia as seen by SEM.

Extended Data Fig. 10 Sequence alignment of Mo17 tandem-duplicated CRMs.

Nucleotide alignment of B73v3_DICE (single copy), Mo17_CRM119798 (tandem copy 1), and Mo17_CRM119799 (tandem copy 2) showing conservation of individual TF binding motifs.

Supplementary information

Supplementary Information

Supplementary Figs. 1–7 and Tables 1, 2 and 4–8.

Reporting Summary

Supplementary Table 3

Combinations of three TFs that bound DAP-CRM regions.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Galli, M., Chen, Z., Ghandour, T. et al. Transcription factor binding divergence drives transcriptional and phenotypic variation in maize. Nat. Plants 11, 1205–1219 (2025). https://doi.org/10.1038/s41477-025-02007-8

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue date:

  • DOI: https://doi.org/10.1038/s41477-025-02007-8

This article is cited by

Search

Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing