Abstract
Regulatory elements are essential components of plant genomes that have shaped the domestication and improvement of modern crops. However, their identity, function and diversity remain poorly characterized, limiting our ability to harness their full power for agricultural advances using induced or natural variation. Here we mapped transcription factor (TF) binding for 200 TFs from 30 families in two distinct maize inbred lines historically used in maize breeding. TF binding comparison revealed widespread differences between inbreds, driven largely by structural variation, that correlated with gene expression changes and explained complex quantitative trait loci such as Vgt1, an important determinant of flowering time, and DICE, an herbivore resistance enhancer. CRISPR–Cas9 editing of TF binding regions validated the function and structure of regulatory regions at various loci controlling plant architecture and biotic resistance. Our maize TF binding catalogue identifies functional regulatory regions and enables collective and comparative analysis, highlighting its value for agricultural improvement.
This is a preview of subscription content, access via your institution
Access options
Access Nature and 54 other Nature Portfolio journals
Get Nature+, our best-value online-access subscription
$32.99 / 30 days
cancel any time
Subscribe to this journal
Receive 12 digital issues and online access to articles
$119.00 per year
only $9.92 per issue
Buy this article
- Purchase on SpringerLink
- Instant access to full article PDF
Prices may be subject to local taxes which are calculated during checkout





Similar content being viewed by others
Data availability
The raw and processed data for the DAP-seq datasets generated in this study are available from NCBI GEO under accession number GSE275897. The processed data are available via Zenodo at https://doi.org/10.5281/zenodo.14991915 (ref. 97). Comparative peak data tracks from the different inbred lines can be viewed on JBrowse2 at http://hlab.bio.nyu.edu/projects/zm_crm/.
Code availability
The analysis pipelines are available via GitHub at https://github.com/hlab1/Maize_B73_Mo17_DAP.
References
Kim, S. & Wysocka, J. Deciphering the multi-scale, quantitative cis-regulatory code. Mol. Cell 83, 373–392 (2023).
Rodriguez-Leal, D., Lemmon, Z. H., Man, J., Bartlett, M. E. & Lippman, Z. B. Engineering quantitative trait variation for crop improvement by genome editing. Cell 171, 470–480.e8 (2017).
Liang, Y., Liu, H. J., Yan, J. & Tian, F. Natural variation in crops: realized understanding, continuing promise. Annu. Rev. Plant Biol. 72, 357–385 (2021).
Wang, X. et al. Dissecting cis-regulatory control of quantitative trait variation in a plant stem cell circuit. Nat. Plants 7, 419–427 (2021).
Meyer, R. S. & Purugganan, M. D. Evolution of crop species: genetics of domestication and diversification. Nat. Rev. Genet. 14, 840–852 (2013).
Liu, L. et al. Enhancing grain-yield-related traits by CRISPR–Cas9 promoter editing of maize CLE genes. Nat. Plants 7, 287–294 (2021).
Sun, Y. et al. Divergence in the ABA gene regulatory network underlies differential growth control. Nat. Plants 8, 549–560 (2022).
Wallace, J. G., Rodgers-Melnick, E. & Buckler, E. S. On the road to breeding 4.0: unraveling the good, the bad, and the boring of crop quantitative genomics. Annu. Rev. Genet. 52, 421–444 (2018).
Kremling, K. A. G. et al. Dysregulation of expression correlates with rare-allele burden and fitness loss in maize. Nature 555, 520–523 (2018).
Hake, S. & Ross-Ibarra, J. Genetic, evolutionary and plant breeding insights from the domestication of maize. eLife 4, e05861 (2015).
Rodgers-Melnick, E., Vera, D. L., Bass, H. W. & Buckler, E. S. Open chromatin reveals the functional maize genome. Proc. Natl Acad. Sci. USA 113, E3177–E3184 (2016).
O’Malley, R. C. et al. Cistrome and epicistrome features shape the regulatory DNA landscape. Cell 165, 1280–1292 (2016); erratum 166, 1598 (2016).
Zhang, Y. et al. Evolutionary rewiring of the wheat transcriptional regulatory network by lineage-specific transposable elements. Genome Res. 31, 2276–2289 (2021).
Baumgart, L. A. et al. Recruitment, rewiring, and deep conservation in flowering plant gene regulation. Preprint at bioRxiv https://doi.org/10.1101/2024.10.08.617089 (2025).
Zhan, J. et al. Opaque-2 regulates a complex gene network associated with cell differentiation and storage functions of maize endosperm. Plant Cell 30, 2425–2446 (2018).
Wallace, J. G. et al. Association mapping across numerous traits reveals patterns of functional variation in maize. PLoS Genet. 10, e1004845 (2014).
Mural, R. V. et al. Association mapping across a multitude of traits collected in diverse environments in maize. GigaScience 11, giac080 (2022).
Marand, A. P., Eveland, A. L., Kaufmann, K. & Springer, N. M. cis-Regulatory elements in plant development, adaptation, and evolution. Annu. Rev. Plant Biol. 74, 111–137 (2023).
Tu, X. et al. Reconstructing the maize leaf regulatory network using ChIP-seq data of 104 transcription factors. Nat. Commun. 11, 5089 (2020).
Jores, T. et al. Plant enhancers exhibit both cooperative and additive interactions among their functional elements. Plant Cell 36, 2570–2586 (2024).
Hufford, M. B. et al. De novo assembly, annotation, and comparative analysis of 26 diverse maize genomes. Science 373, 655–662 (2021).
Ricci, W. A. et al. Widespread long-range cis-regulatory elements in the maize genome. Nat. Plants 5, 1237–1249 (2019).
Crisp, P. A. et al. Stable unmethylated DNA demarcates expressed genes and their cis-regulatory space in plant genomes. Proc. Natl Acad. Sci. USA 117, 23991–24000 (2020).
Song, B. et al. Conserved noncoding sequences provide insights into regulatory sequence and loss of gene expression in maize. Genome Res. 31, 1245–1257 (2021).
Wills, D. M. et al. From many, one: genetic control of prolificacy during maize domestication. PLoS Genet. 9, e1003604 (2013).
Salvi, S. et al. Conserved noncoding genomic sequences associated with a flowering-time quantitative trait locus in maize. Proc. Natl Acad. Sci. USA 104, 11376–11381 (2007).
Iotchkova, V. et al. GARFIELD classifies disease-relevant genomic features through integration of functional annotations with association signals. Nat. Genet. 51, 343–353 (2019).
Eichten, S. R. et al. B73-Mo17 near-isogenic lines demonstrate dispersed structural variation in maize. Plant Physiol. 156, 1679–1690 (2011).
Buckler, E. S. et al. The genetic architecture of maize flowering time. Science 325, 714–718 (2009).
Mickelson, S. M., Stuber, C. S., Senior, L. & Kaeppler, S. M. Quantitative trait loci controlling leaf and tassel traits in a B73 × Mo17 population of maize. Crop Sci. 42, 1902–1909 (2002).
Ordas, B. et al. Mapping of QTL for resistance to the Mediterranean corn borer attack using the intermated B73 × Mo17 (IBM) population of maize. Theor. Appl. Genet. 119, 1451–1459 (2009).
Goering, R., Larsen, S., Tan, J., Whelan, J. & Makarevitch, I. QTL mapping of seedling tolerance to exposure to low temperature in the maize IBM RIL population. PLoS ONE 16, e0254437 (2021).
Sun, S. et al. Extensive intraspecific gene order and gene structural variations between Mo17 and other maize genomes. Nat. Genet. 50, 1289–1295 (2018).
Song, B. et al. AnchorWave: sensitive alignment of genomes with high sequence diversity, extensive structural polymorphism, and whole-genome duplication. Proc. Natl Acad. Sci. USA 119, e2113075119 (2022).
Zhao, H. et al. CrossMap: a versatile tool for coordinate conversion between genome assemblies. Bioinformatics 30, 1006–1007 (2014).
Liang, Y. et al. ZmMADS69 functions as a flowering activator through the ZmRap2.7-ZCN8 regulatory module and contributes to maize flowering time adaptation. New Phytol. 221, 2335–2347 (2019).
Tian, D. et al. GWAS Atlas: a curated resource of genome-wide variant–trait associations in plants and animals. Nucleic Acids Res. 48, D927–D932 (2020).
Galli, M. et al. The DNA binding landscape of the maize AUXIN RESPONSE FACTOR family. Nat. Commun. 9, 4526 (2018).
Maekawa, S. et al. The Arabidopsis ubiquitin ligases ATL31 and ATL6 control the defense response as well as the carbon/nitrogen response. Plant Mol. Biol. 79, 217–227 (2012).
Zhou, P., Hirsch, C. N., Briggs, S. P. & Springer, N. M. Dynamic patterns of gene expression additivity and regulatory variation throughout maize development. Mol. Plant 12, 410–425 (2019).
Guillotin, B. et al. A pan-grass transcriptome reveals patterns of cellular divergence in crops. Nature 617, 785–791 (2023).
Li, M. et al. Double DAP-seq uncovered synergistic DNA binding of interacting bZIP transcription factors. Nat. Commun. 14, 2600 (2023).
Heim, M. A. et al. The basic helix–loop–helix transcription factor family in plants: a genome-wide study of protein structure and functional diversity. Mol. Biol. Evol. 20, 735–747 (2003).
de Martin, X., Sodaei, R. & Santpere, G. Mechanisms of binding specificity among bHLH transcription factors. Int. J. Mol. Sci. 22, 9150 (2021).
Gaillochet, C. et al. Control of plant cell fate transitions by transcriptional and hormonal signals. eLife 6, e30135 (2017).
Crawford, B. C. & Yanofsky, M. F. HALF FILLED promotes reproductive tract development and fertilization efficiency in Arabidopsis thaliana. Development 138, 2999–3009 (2011).
Gallavotti, A. et al. The role of barren stalk1 in the architecture of maize. Nature 432, 630–635 (2004).
Jumper, J. et al. Applying and improving AlphaFold at CASP14. Proteins 89, 1711–1721 (2021).
Gremski, K., Ditta, G. & Yanofsky, M. F. The HECATE genes regulate female reproductive tract development in Arabidopsis thaliana. Development 134, 3593–3601 (2007).
Aguirre, L., Hendelman, A., Hutton, S. F., McCandlish, D. M. & Lippman, Z. B. Idiosyncratic and dose-dependent epistasis drives variation in tomato fruit size. Science 382, 315–320 (2023).
Whipple, C. J. et al. A conserved mechanism of bract suppression in the grass family. Plant Cell 22, 565–578 (2010).
McSteen, P. et al. barren inflorescence2 encodes a co-ortholog of the PINOID serine/threonine kinase and is required for organogenesis during inflorescence and vegetative development in maize. Plant Physiol. 144, 1000–1011 (2007).
Zheng, L. et al. Prolonged expression of the BX1 signature enzyme is associated with a recombination hotspot in the benzoxazinoid gene cluster in Zea mays. J. Exp. Bot. 66, 3917–3930 (2015).
Frey, M. et al. Analysis of a chemical plant defense mechanism in grasses. Science 277, 696–699 (1997).
Engelhorn, J. et al. Genetic variation at transcription factor binding sites largely explains phenotypic heritability in maize. Preprint at bioRxiv https://doi.org/10.1101/2023.08.08.551183 (2024).
Parvathaneni, R. K. et al. The regulatory landscape of early maize inflorescence development. Genome Biol. 21, 165 (2020).
Cahn, J. et al. MaizeCODE reveals bi-directionally expressed enhancers that harbor molecular signatures of maize domestication. Nat. Commun. 15, 10854 (2024).
Sun, Y. et al. 3D genome architecture coordinates trans and cis regulation of differentially expressed ear and tassel genes in maize. Genome Biol. 21, 143 (2020).
Savadel, S. D. et al. The native cistrome and sequence motif families of the maize ear. PLoS Genet. 17, e1009689 (2021).
Oka, R. et al. Genome-wide mapping of transcriptional enhancer candidates using DNA and chromatin features in maize. Genome Biol. 18, 137 (2017).
Marand, A. P. et al. The genetic architecture of cell type–specific cis regulation in maize. Science 388, eads6601 (2025).
Hendelman, A. et al. Conserved pleiotropy of an ancient plant homeobox gene uncovered by cis-regulatory dissection. Cell 184, 1724–1739.e16 (2021).
Bartlett, A. et al. Mapping genome-wide transcription-factor binding sites using DAP-seq. Nat. Protoc. 12, 1659–1672 (2017).
Burdo, B. et al. The Maize TFome—development of a transcription factor open reading frame collection for functional genomics. Plant J. 80, 356–366 (2014).
Yilmaz, A. et al. GRASSIUS: a platform for comparative regulatory genomics across the grasses. Plant Physiol. 149, 171–180 (2009).
Mirdita, M. et al. ColabFold: making protein folding accessible to all. Nat. Methods 19, 679–682 (2022).
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012).
Guo, Y., Mahony, S. & Gifford, D. K. High resolution genome wide binding event finding and motif discovery reveals transcription factor spatial binding constraints. PLoS Comput. Biol. 8, e1002638 (2012).
Yu, G., Wang, L. G. & He, Q. Y. ChIPseeker: an R/Bioconductor package for ChIP peak annotation, comparison and visualization. Bioinformatics 31, 2382–2383 (2015).
Machanick, P. & Bailey, T. L. MEME-ChIP: motif analysis of large DNA datasets. Bioinformatics 27, 1696–1697 (2011).
Shen, Z., Hoeksema, M. A., Ouyang, Z., Benner, C. & Glass, C. K. MAGGIE: leveraging genetic variation to identify DNA sequence motifs mediating transcription factor binding and function. Bioinformatics 36, i84–i92 (2020).
Grzybowski, M. W. et al. A common resequencing-based genetic marker data set for global maize diversity. Plant J. 113, 1109–1121 (2023).
Liu, X., Huang, M., Fan, B., Buckler, E. S. & Zhang, Z. Iterative usage of fixed and random effect models for powerful and efficient genome-wide association studies. PLoS Genet. 12, e1005767 (2016).
Yin, L. et al. rMVP: a memory-efficient, visualization-enhanced, and parallel-accelerated tool for genome-wide association study. Genomics Proteomics Bioinformatics 19, 619–628 (2021).
Dong, Z. et al. Necrotic upper tips1 mimics heat and drought stress and encodes a protoxylem-specific transcription factor in maize. Proc. Natl Acad. Sci. USA 117, 20908–20919 (2020).
Dai, D. et al. Paternal imprinting of dosage-effect defective1 contributes to seed weight xenia in maize. Nat. Commun. 13, 5366 (2022).
Wu, H. et al. NAKED ENDOSPERM1, NAKED ENDOSPERM2, and OPAQUE2 interact to regulate gene networks in maize endosperm development. Plant Cell 36, 19–39 (2023).
Bang, S. et al. WUSCHEL-dependent chromatin regulation in maize inflorescence development at single-cell resolution. Preprint at bioRxiv https://doi.org/10.1101/2024.05.13.593957 (2024).
Kielbasa, S. M., Wan, R., Sato, K., Horton, P. & Frith, M. C. Adaptive seeds tame genomic sequence comparison. Genome Res. 21, 487–493 (2011).
Robinson, J. T. et al. Integrative genomics viewer. Nat. Biotechnol. 29, 24–26 (2011).
Diesh, C. et al. JBrowse 2: a modular genome browser with views of synteny and structural variation. Genome Biol. 24, 74 (2023).
Ramirez, F., Dundar, F., Diehl, S., Gruning, B. A. & Manke, T. deepTools: a flexible platform for exploring deep-sequencing data. Nucleic Acids Res. 42, W187–W191 (2014).
Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).
Shumate, A. & Salzberg, S. L. Liftoff: accurate mapping of gene annotations. Bioinformatics 37, 1639–1643 (2021).
Goel, M., Sun, H., Jiao, W. B. & Schneeberger, K. SyRI: finding genomic rearrangements and local sequence differences from whole-genome assemblies. Genome Biol. 20, 277 (2019).
Li, B. & Dewey, C. N. RSEM: accurate transcript quantification from RNA-seq data with or without a reference genome. BMC Bioinformatics 12, 323 (2011).
Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).
R Core Team R: A Language and Environment for Statistical Computing (R Foundation for Statistical Computing, 2023); https://www.R-project.org/
Soneson, C., Love, M. I. & Robinson, M. D. Differential analyses for RNA-seq: transcript-level estimates improve gene-level inferences. F1000Res. 4, 1521 (2015).
Gu, Z., Eils, R. & Schlesner, M. Complex heatmaps reveal patterns and correlations in multidimensional genomic data. Bioinformatics 32, 2847–2849 (2016).
Gu, Z. Complex heatmap visualization. iMeta 1, e43 (2022).
Noshay, J. M. et al. Stability of DNA methylation and chromatin accessibility in structurally diverse maize genomes. G3 11, jkab190 (2021).
Ge, S. X., Jung, D. & Yao, R. ShinyGO: a graphical gene-set enrichment tool for animals and plants. Bioinformatics 36, 2628–2629 (2020).
Xing, H. L. et al. A CRISPR/Cas9 toolkit for multiplex genome editing in plants. BMC Plant Biol. 14, 327 (2014).
Chen, Z., Debernardi, J. M., Dubcovsky, J. & Gallavotti, A. The combination of morphogenic regulators BABY BOOM and GRF-GIF improves maize transformation efficiency. Preprint at bioRxiv https://doi.org/10.1101/2022.09.02.506370 (2022).
Galli, M., Huang, S.-s. C. & Gallavotti, A. Data from: Transcription factor binding divergence drives transcriptional and phenotypic variation in maize. Zenodo https://doi.org/10.5281/zenodo.14991915 (2025).
Acknowledgements
We thank R. Macrae for helpful comments and suggestions on the manuscript and maizeGDB for data curation of informatics resources. We acknowledge the Office of Advanced Research Computing at Rutgers, the State University of New Jersey, for providing access to the Amarel cluster. This work was supported in part through the NYU IT High Performance Computing resources, services and staff expertise. This work was supported by NSF Plant Genome Research Project grant no. IOS-1916804 to A.G. and S.-s.C.H. and NIH award no. R35GM138143 to S.-s.C.H. A.C. is a Fulbright Scholar and recipient of a Meta Prevorsek Turner Fellowship. This material was also in part based on work supported by the Center for Bioenergy Innovation, US Department of Energy, Office of Science, Biological and Environmental Research Program under award no. ERKP886 and the Oak Ridge National Laboratory Director’s R&D Program (M.L.). Oak Ridge National Laboratory is managed by UT-Battelle, LLC, for the Office of Science of the US Department of Energy under contract no. DE-AC05-00OR22725.
Author information
Authors and Affiliations
Contributions
M.G., S.-s.C.H. and A.G. conceptualized the project, analysed the data and prepared the paper. M.G., Z.C., T.G., M.L. and S.-s.C.H. conducted the TF binding analysis and visualization. S.-s.C.H. conducted the GWAS analysis. M.G., Z.C., A.C., J.G., F.F. and A.G. conducted the CRISPR experiments and analysis. M.G., N.S., X.Z., Y.D., G.S., J.W.W., G.C., C.W. and H.F.K. provided reagents.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Plants thanks James Schnable, Feng Tian and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Extended data
Extended Data Fig. 1 DAP-seq data samples broadly across maize TF families.
a, Stacked bar graph showing 36 TF families in which at least one family member was tested in DAP-seq. The number of members tested for each is shown with a colored bar; the number of members for which high quality datasets were obtained is shown in parentheses next to the family name. b, Scatterplot showing the number of B73 and Mo17 peaks for each TF. c, Boxplots showing the distribution of the number of target genes obtained for each TF, n = 200 d, Heatmap showing Pearson correlation of TF binding profiles genome wide (10-bp bins). Side annotation colors correspond to TF structural families. TFs in the diversity panel are indicated with an asterisk. e, Average number of reads in peaks and number of peaks with GEM motif are indicated. Each datapoint corresponds to a DAP-seq dataset from either B73 or Mo17. Whiskers in boxplots in panels c and e indicate the minimum and maximum values; central lines correspond to medians and box boundaries denote the upper (25th percentile) and lower (75th percentile) quartiles with n = 200 TFs.
Extended Data Fig. 2 Enrichment analysis of selected TFs and GWAS traits.
a, GO enrichment analysis of putative target genes assigned to peaks of selected TFs. b, Bubble plot showing enrichment for NAM panel GWAS SNPs associated with branch zone, cob length, and leaf angle that overlapped SBP30/UB3 DAP-seq peaks (right panel; red boxed areas) and bubble plot showing MADS73 DAP-seq peaks were significantly enriched for SNPs associated with several flowering time-related traits (left panel; black boxed areas). c, Bubble plot showing enrichment for Wisconsin Diversity panel GWAS SNPs associated with branching phenotypes that overlapped SBP30/UB3 DAP-seq peaks (right panel; red boxed areas) and bubble plot showing MADS73 DAP-seq peaks were significantly enriched for SNPs associated with flowering time-related traits (left panel; black boxed areas).
Extended Data Fig. 3 B73 TF diversity panel and DAP-CRM analysis.
a, Heatmap showing Pearson correlation of genome-wide binding profiles for 66 TFs in the B73 TF diversity panel. b, Phylogeny, tissue-specific RNA-expression (TPM; transcripts per million), and binding motifs of selected SBP TFs tested in maize DAP-seq. Expression patterns are often different for family members with similar motifs. c, Genome browser screenshot of SBP TFs shown in b at the TSH1 locus. d, Histogram showing the size distribution of DAP-CRMs. The dotted line indicates the average size of all DAP-CRMs. e, Histogram of the number of TFs per DAP-CRM. The black dotted line indicates the average number of TFs per DAP-CRM. The inset shows a close-up on the number of TFs from 20-63 TFs per DAP-CRM. f, Frequency of DAP-CRMs versus randomly shuffled CRMs of the same size within a 3 kb window surrounding the TSS.
Extended Data Fig. 4 Genome browser views of composite DAP-seq peaks.
a, B73v5 IGV genome browser view of the Vgt1 and RAP2.7 locus showing binding by DAP-seq TFs. Lower panel shows close-up of DAP-CRMs with binding by TFs not displayed as main tracks due to space constraints. b, Genome browser view of the UNBRANCHED2 (UB2) locus showing binding by DAP-seq TFs. c, Genome browser view of the INDETERMINATE GAMETOPHYTE1 (IG1) locus showing binding site location of DAP-seq TFs. In each panel the number below the DAP-CRM track indicates the number of TF peaks present in the DAP-CRM. Please visit website genome browser for a display of all TF tracks http://hlab.bio.nyu.edu/projects/zm_crm/.
Extended Data Fig. 5 Quality assessment of B73 and Mo17 DAP-seq datasets.
a, Pearson correlation of TF binding profiles for B73 and Mo17 genomes. Most B73 peak datasets (rows annotated in yellow) clustered with Mo17 peak datasets for the same TFs lifted to B73 (rows annotated in teal), as indicated by the magenta boxes. b, Percentage of non-crossmappable peaks (coordinates could not be lifted) that overlapped with “Non-Alignable” region (NotAligned), duplicated regions (DUP), or structural variants (SV, insertions or deletions greater than 50 bp). Each datapoint corresponds to the percentage of peaks from an individual TF overlapping the indicated category, n = 200. Whiskers in boxplots in panels b-d indicate the minimum and maximum values; central lines correspond to medians and box boundaries denote the upper (25th percentile) and lower (75th percentile) quartiles with n = 200 TFs. c, Upper: scatterplots showing no difference in average reads per peak or percentage of peaks with motif for shared and specific datasets. Each dot corresponds to a single TF. Lower: distribution of average reads per peak and percentage of peaks with motif for shared and specific datasets. n = 66 d, Percentage of peaks within each category that overlapped SVs and transposable elements (TEs), n = 200.
Extended Data Fig. 6 Shared and genotype-specific peaks.
a, Percentage of TF diversity panel peaks also called as peaks in Mo17 (shared, grey) and those called only in B73v5 but not Mo17 (B73-specific, blue). b, Percentage of TF diversity panel peaks also called as peaks in B73 (shared, grey) and those called only in Mo17 but not B73 (Mo17-specific, green). Mo17 datasets show a similar percentage of Mo17-specific and Mo17-B73 shared peaks relative to those seen for B73-specific and B73-Mo17 shared peaks (TF order is same as in a). c, JBrowse2 genome browser view showing similar binding intensity of MADS69 in the 5’UTR of CYCLIN13 (Zm00001eb193830) in both genotypes. d, scatterplot showing good correlation of MADS69 reads in B73v5 compared to Mo17 MADS69 reads converted to B73v5 coordinates. e, JBrowse2 view of the MATE18 locus showing B73-specific peaks residing near a GWAS hit. Note that the B73-specific sequence insertion is not annotated as a TE. f, B73-specific peaks more often contain more than four SNPs per peak (top plot) or more SNPs per peak center (20 bp region surrounding the peak summit) (bottom plot). Shared peaks more often contain less than four SNPs per peak (top plot) or per the 20 bp region surrounding the peak summit (bottom plot). Data points represent individual TFs, n = 65. For f and g, whiskers indicate the minimum and maximum values; central lines correspond to medians and box boundaries denote the upper (25th percentile) and lower (75th percentile) quartiles. g, Overlap of B73-specific peaks and B73-Mo17 shared peaks with orthogonal datasets or DAP-CRMs. Both B73-specific peaks and shared peaks show statistically significant enrichment (p value < 0.0001, two-tailed t-test) relative to randomly shuffled peaks, providing support that both B73-specific peaks and shared peaks could be functional. Data points represent individual TFs, n = 200.
Extended Data Fig. 7 Peak positional variation among genotypes.
a, Comparative JBrowse2 genome browser view showing shared peaks of B73 and Mo17 at the MADS67 locus. b, JBrowse2 genome browser view showing TF binding site positional variants at the ZmATL34 locus. Lower heatmap shows ZmATL34 RNA expression levels (transcripts per million) from several tissues40. A 13-fold increase in Mo17 RNA expression was observed in radicle tissue compared to B73. c, Top: percentage of shared TF diversity panel promoter peaks (top 20%) that correspond to positional variants (PosV) greater than 500 bp or less than 500 bp. Bottom: percentage of top (shared and specific) promoter peaks that corresponded to shared PosVs or B73-specific peaks. Each datapoint corresponds to an individual TF, n = 65. Whiskers indicate the minimum and maximum values; central lines correspond to medians and box boundaries denote the upper (25th percentile) and lower (75th percentile) quartiles. d, Plot showing percentage of B73-Mo17 shared promoter peaks (<-10kb) that were associated with differentially expressed genes in various tissues from40. Genes associated with PosV<500 bp peaks are shown in dark pink, while those associated with PosV>500 peaks are shown in light pink. Fisher’s exact tests were performed to assess the enrichment of DEGs in PosV>500 bp genes relative to PosV<500 bp genes for each TF and tissue combination, and the resulting p-values were adjusted for multiple testing. e, Heatmap of -log10 FDR-adjusted p-values from one-sided Fisher’s exact tests, evaluating whether root cell type marker genes determined by single cell RNA-seq data41 are enriched for putative target genes associated with DAP-seq peaks shared between B73 and Mo17, compared to putative targets of B73-specific peaks.
Extended Data Fig. 8 DoubleDAP-seq analysis.
a, Schematic showing the doubleDAP-seq assay in which putative heterodimeric TF-DNA complexes can be pulled down and compared to results from a single protein DAP-seq assay to assess binding site specificity differences. b, Neighbor-joining phylogeny based on amino acid similarity of group VII (blue) and group VIII (green) bHLHs from maize and Arabidopsis. Members that were tested in DAP-seq are shown in bold. Three members that were tested in DAP-seq that did not yield any peaks are shown in italics. c, AlphaFold prediction of BHLH85 homodimer. d, Normalized RNA-seq expression (TPM; transcripts per million) of group VIII and VII bHLHs showing heterodimer pairs are co-expressed in many tissues. Normalized expression data from Walley et al.98. e, Venn diagram showing low degree of overlap between Q-A-R (subclade VIII) homodimers and Q-A-R:ZmSPT (subclade VII) heterodimers as exemplified by HALO-BHLH113 (Q-A-R bHLH) in single DAP and HALO-BHLH113:SBPTag-BHLH125/ZmSPT1 in doubleDAP. Only 2% of HALO-BHLH113 peaks were captured in the doubleDAP dataset indicating that the heterodimer configuration is preferred relative to the homodimer.
Extended Data Fig. 9 CRISPR editing of maize cis-regulatory regions influences phenotype.
a, Genome browser screenshot of TSH1 locus showing binding in upstream promoter region by many TFs. Grey shaded areas upstream of and within CRM169681 indicate regions that were deleted in alleles shown in b. b, Schematic showing two independent CRISPR alleles with deletions and inversions that eliminate at least five TF binding sites (colored bars; colors match TFs shown in genome browser screenshot in a). Lightly shaded grey areas indicate regions deleted in both alleles. c, Images of mature tassels for WT, two tsh1 promoter CRISPR alleles, and tsh1-ref mutant (coding region mutation51). Both CRISPR promoter alleles show outgrowth of the tassel sheath leaf (white arrows) that is not present in the WT (black arrow). Tassel branching is reduced in the tsh1-ref, and the CRISPR promoter alleles (white brackets) relative to WT (black bracket). d, SEMs of immature tassels of the CRISPR promoter alleles showing bract outgrowth (white arrowheads). e, qRT-PCR analysis of CRISPR promoter alleles showing reduced expression in immature tassels relative to WT immature tassels. Data are presented as mean values. Error bars represent standard deviation of three biological replicates for each edited allele (two biological replicates for wildtype). Data points correspond to individual biological replicates. Statistical significance determined by two-sided t-test. f, CRISPR editing of BIF2 3’UTR ARF binding sites. A cis-regulatory module (CRM18765) is situated downstream of the BIF2 gene. Within this CRM region, there is a strong ARF peak containing five ARF binding motifs (three TGTCs and two GACAs)38. Three single guide RNAs (gRNAs) were designed for CRISPR-Cas9 editing that specifically targeted the ARF motifs. Three deletion alleles were obtained. Homozygous plants of these alleles exhibited a weak bif2 phenotype with various degrees of severity (BIF2-crm1cr1 and BIF2-crm1cr3 are more severe than BIF2-crm1cr2) during the early ear development stage, characterized by partial barren patches (white arrows) on the ear primordia as seen by SEM.
Extended Data Fig. 10 Sequence alignment of Mo17 tandem-duplicated CRMs.
Nucleotide alignment of B73v3_DICE (single copy), Mo17_CRM119798 (tandem copy 1), and Mo17_CRM119799 (tandem copy 2) showing conservation of individual TF binding motifs.
Supplementary information
Supplementary Information
Supplementary Figs. 1–7 and Tables 1, 2 and 4–8.
Supplementary Table 3
Combinations of three TFs that bound DAP-CRM regions.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Galli, M., Chen, Z., Ghandour, T. et al. Transcription factor binding divergence drives transcriptional and phenotypic variation in maize. Nat. Plants 11, 1205–1219 (2025). https://doi.org/10.1038/s41477-025-02007-8
Received:
Accepted:
Published:
Issue date:
DOI: https://doi.org/10.1038/s41477-025-02007-8
This article is cited by
-
Recruitment, rewiring and deep conservation in flowering plant gene regulation
Nature Plants (2025)