Deep learning-based high-resolution time inference for deciphering dynamic gene regulation from fixed embryos

Bao, Huihan; Zhang, Shihe; Yu, Zhiyang; Xu, Heng

doi:10.1038/s41467-025-61907-7

Download PDF

Article
Open access
Published: 16 July 2025

Deep learning-based high-resolution time inference for deciphering dynamic gene regulation from fixed embryos

Huihan Bao^1,2,
Shihe Zhang ORCID: orcid.org/0000-0003-3151-0882^1,2,
Zhiyang Yu^1,2 &
…
Heng Xu ORCID: orcid.org/0000-0002-3717-0412^1,2

Nature Communications volume 16, Article number: 6565 (2025) Cite this article

2998 Accesses
11 Altmetric
Metrics details

Subjects

Abstract

Embryo development is driven by the spatiotemporal dynamics of complex gene regulatory networks. Uncovering these dynamics requires simultaneous tracking of multiple fluctuating molecular species over time, which exceeds the capabilities of traditional live-imaging approaches. Fixed-embryo imaging offers the necessary sensitivity and capacity but lacks temporal resolution. Here, we present a multi-scale ensemble deep learning approach to precisely infer absolute developmental time with 1-minute resolution from nuclear morphology in fixed Drosophila embryo images. Applying this approach to quantitative imaging of fixed wild-type embryos, we resolve the spatiotemporal regulation of the endogenous segmentation gene Krüppel (Kr) by multiple transcription factors (TFs) during early development without genetic modification. Integrating a time-resolved theoretical model of single-molecule mRNA statistics, we further uncover the unsteady-state bursty kinetics of the endogenous segmentation gene, hunchback (hb), driven by dynamic TF binding. Our method provides a versatile framework for deciphering complex gene network dynamics in genetically unmodified organisms.

From complex datasets to predictive models of embryonic development

Article 20 August 2021

Uncovering developmental time and tempo using deep learning

Article Open access 23 November 2023

A sensitive mNeonGreen reporter system to measure transcriptional dynamics in Drosophila development

Article Open access 12 November 2020

Introduction

Embryo development is orchestrated by complex gene regulatory networks in individual cells, whose dynamic spatiotemporal expressions dictate cell fate determination and body plan formation^1,2,3. For example, in early Drosophila embryos (nuclear cycles (nc) 11 to early 14), cell nuclei undergo rapid and synchronous divisions, with no visible body structures yet formed. Yet, during this stage, future body segments along the anterior-posterior (AP) axis are pre-defined by the dynamic expression patterns of the segmentation gene network^4,5,6. To unravel these gene regulation dynamics, a key challenge is to simultaneously track both the spatial and temporal information with high resolution. A common approach involves real-time imaging of live embryos genetically engineered to express fluorescently labeled mRNAs or proteins of interest^7,8,9,10. However, due to genetic and optical challenges, this approach inherently struggles to monitor multiple dynamic molecular species simultaneously (typically <4)¹¹, limiting its effectiveness in analyzing complex regulatory interactions. Additionally, genetic modifications can disrupt endogenous gene expression, potentially distorting the natural biological system¹¹.

Fixed-embryo imaging offers a compelling alternative that does not require genetic modifications and is scalable to high-throughput applications^12,13,14. Without the trade-offs associated with live tracking, it provides higher sensitivity and spatial resolution¹⁵. The challenge, however, is that each fixed-embryo image represents only a snapshot in time, necessitating temporal alignment of multiple fixed embryos to reconstruct the developmental process. Traditionally, this alignment relied on manually timing embryos before fixation^16,17 or comparing their morphology to a pre-established developmental atlas after fixation^18,19. Yet, due to uncertainty in developmental start times and the limited sample density of the atlas, these methods provide only coarse-grained developmental stages, inadequate for resolving detailed gene regulation dynamics at minute or sub-minute timescales.

One way to improve temporal resolution is by leveraging fast-changing morphological features at the cellular level^{5,20,21,22,23}. For example, during the cellularization stage of the Drosophila embryo (early-mid nc14), the progressively elongating furrows of invaginating cell membranes have been used to estimate the developmental time with ~2–4-minute resolution^5,20. However, in most other cases, the relationship between time and cell morphology is complex and stage-dependent^21,22,23. Explicitly identifying these subtle features and reliably mapping them to absolute developmental time remains technically challenging. Deep learning methods, which can objectively extract comprehensive image features and model complex relationships, have recently been applied as a powerful tool to study embryo development^24,25. However, while most existing applications focus on classifying embryo phenotypes and coarse-grained developmental stages^26,27,28, a versatile and high-resolution inference of absolute developmental time is still lacking.

Here, we present a deep learning-based regression approach to infer the absolute developmental time of early Drosophila embryos during nc11–early 14 with 1-minute resolution. Using time-lapse nuclear histone images of transgenic embryos as the initial dataset, we employed ensemble learning with three independent convolutional neural network (CNN) models to capture morphological features across multiple spatial scales^29,30,31. By calibrating fixation-induced and strain-specific variations in embryo size, and implementing a relayed learning strategy, our method precisely inferred developmental time from standard DNA images of fixed embryos, regardless of strain. Using this approach, we quantified the spatiotemporal regulation of the segmentation gene Krüppel (Kr) by two transcription factors (TFs), Bicoid (Bcd) and Hunchback (Hb), from single-molecule imaging of multiple fixed wild-type (WT) embryos during nc11–13. We found that Kr transcription is dynamically governed by a multiplicative combination of cooperative Bcd activation and Hb repression. Moreover, by incorporating single-molecule mRNA statistics, we developed a time-resolved theoretical pipeline to uncover the microscopic transcription kinetics of the segmentation gene hunchback (hb) during nc11–13. We found that hb transcription follows unsteady-state bursty kinetics driven by dynamic Bcd binding within a specific time window of each nuclear cycle. Our method offers a versatile framework for investigating dynamic gene regulation in genetically unmodified organisms and facilitates spatiotemporal multi-omics for deciphering complex gene regulatory networks.

Results

Predicting developmental time from nuclear histone images with 1-minute accuracy

To relate developmental time with dynamic nuclear signal, we obtained time-lapse images of histone H2A-RFP from >30 transgenic Drosophila embryos (his2av-mrfp1) during nc10–early 14 with 1-minute time-resolution using confocal microscopy (Fig. 1a and Supplementary Fig. 1a–c, see Methods). The timing of different embryos was aligned and scaled based on cell division times to compensate for variation in developmental tempo among embryos. Besides a significant difference in the number of nuclei between nuclear cycles, we observed a continuous evolution of nuclear morphology (including size, shape, etc.) across multiple spatial scales within each nuclear cycle (Fig. 1a and Supplementary Fig. 1d).

**Fig. 1: CNN models predict developmental time from nuclear histone images of live embryos.**

To comprehensively extract all time-dependent features from embryo images, we constructed a multi-scale ensemble deep learning framework consisting of three independent VGG-like CNN models, each with a regression output layer for continuous-time prediction^30,31 (Fig. 1b, Supplementary Table 1, see Methods). By dividing every embryo image into multiple small windows of three different sizes covering single, few (~6), and multiple (~20) nuclei, respectively, we trained each CNN model independently to capture time-dependent features at different spatial scales (Fig. 1c and Supplementary Fig. 1e). At any time point during nc11–early 14, predictions from these models were always similar and closely matched the ground truth, mutually validating each other (Supplementary Fig. 1f). Further combining the three models through a median filter outperformed any individual predictions (Fig. 1d and Supplementary Fig. 1f).

Compared with ground truth, our time prediction is both accurate and precise across all nuclear cycles, with little bias and variability (mean residual ± s.d., nc11: 0.01 ± 0.35 min, nc12: 0.02 ± 0.34 min, nc13: 0.05 ± 0.34 min, nc14: −0.2 ± 0.68 min, Fig. 1e). Notably, almost all predictions for nc11–early 14 are within 1-minute accuracy (nc11: 100%, nc12: 98%, nc13: 100%, nc14: 87%), catching the imaging time resolution. In comparison, a baseline predictor relying solely on nuclear size — a key time-dependent feature — performed significantly worse (1-minute accuracy: nc11: 68%, nc12: 50%, nc13: 50%, nc14: 12%, Supplementary Fig. 1g, see Methods). This underscores the value of our CNN-based method in extracting comprehensive image features for precise time inference. Moreover, our predictions for early nc14 significantly outperformed the traditional time inference method based on membrane invagination^5,20 (~2–4 min accuracy), further demonstrating the power of our deep learning method. Given this enhanced accuracy, we focus on inference for nc11–13 in the following study, as no other methods exist for this period.

Size rescaling enables time inference from fixed-embryo imaging

To apply the trained time inference models to fixed embryos, we compared images from fixed and live embryos of the same strain (his2av-mrfp1) during nc11–13. Consistent with previous reports of fixation-induced embryo shrinkage^20,32, we observed a decrease in nuclear size of a similar proportion (~80%–90%, Fig. 2a). Since nuclear size is one of the major time-dependent features captured by our models, such shrinkage severely disrupted time inference accuracy (Supplementary Fig. 2a, b).

**Fig. 2: Time inference from histone images of fixed embryos with image size rescaling.**

To correct this effect, we preceded our time inference framework with an image-rescaling step for fixed embryos (see Methods). To determine the magnitude of rescaling, we used the trend of nuclear size over time as a reference (Fig. 2b). For each nuclear cycle, by directly measuring this trend from live imaging, we tried to reconstruct it from a set of fixed embryos through time inference with various rescaling magnitude (Fig. 2b, Supplementary Fig. 2c, d, see Methods). The best reconstructions matched well with live measurement results (Supplementary Fig. 2d), indicating an optimal rescaling magnitude of ~1.20, consistent with previous reports³².

By applying this time inference to individual nuclei in different regions of fixed embryos at mitotic interphase, we observed temporal asynchrony along the AP axis (Fig. 2c). This is likely a residue of the mitotic wave, where nuclear divisions occur in a wave-like pattern that moves from the poles to the equator of the Drosophila embryo within ~0.5–1 min^33,34. Specifically for embryos in late mitotic interphase of nc11–12, the inferred time for the medial region (~0.5 embryo length (EL)) exhibited a significant delay of ~0.3–0.6 min compared to the anterior and posterior regions (~0.2 EL and ~0.8 EL). In nc13, this time delay increased to ~0.6–1.0 min, consistent with previous reports of wave slow-down during this cycle^33,34 (Fig. 2d, Supplementary Fig. 2e). These results demonstrate the 1-minute resolution of our time inference method for fixed embryos. In subsequent applications, to mitigate the influence of the mitotic wave, we averaged the inference results from these three regions to generate an overall time prediction for each fixed embryo (Fig. 2c).

Relayed learning enables time inference from the nuclear DNA signal of WT embryos

Beyond histone, which requires genetic modification of fly strains or specific antibodies for labeling, nuclear DNA, which can be easily marked by organic dyes^35,36, is a rather common target for fixed-embryo imaging. However, directly training our framework for the DNA signal is challenging due to the difficulty of labeling and imaging nuclear DNA in live embryos. We thus used the histone signal in fixed embryos as a relay to link the nuclear DNA signal with developmental time for training (see Methods). Specifically, by simultaneously imaging histone and DNA signals from 160 fixed his2av-mrfp1 embryos during nc11–13 (Fig. 3a), we inferred embryonic times from histone signals and used them as input to train and test three DNA-based models at different spatial scales (Fig. 3b). With proper calibration of image contrast and saturation (Supplementary Fig. 3a–c), the combined predictions from these DNA-based models closely matched histone-based time inference, achieving 1-minute accuracy and high precision in all cycles (mean residual ± s.d., nc11: 0.20 ± 0.46 min, nc12: 0.14 ± 0.47 min, nc13: 0.19 ± 0.42 min, Fig. 3c and Supplementary Fig. 3d).

**Fig. 3: Time inference from DNA images of fixed embryos using relayed learning.**

To generalize this DNA-based time inference for other fly strains, particularly the WT, we noticed systematic differences in embryo size between different strains (Fig. 3d)^23,32. Thus, DNA images of WT embryos need to be rescaled accordingly before applying the his2av-mrfp1-trained framework (see Methods). To validate this approach, we simultaneously imaged nuclear DNA and Cyclin B protein, a key cell-cycle marker of interphase progression³⁷, in 23 fixed WT embryos across nc11–13 (see Methods). By inferring the developmental time and determining the average Cyclin B level for each embryo, we reconstructed the Cyclin B dynamics from fixed WT embryos (Supplementary Fig. 3e). The resulting profile closely reproduced previously reported live imaging measurements³⁷, with minimal temporal variability (s.d.: nc11: 0.32 min, nc12: 0.19 min, nc13: 0.47 min), confirming the 1-minute accuracy of our DNA-based framework across different fly strains.

To assess the applicability of this approach for studying dynamic gene expression, we simultaneously imaged nuclear DNA and nascent mRNA of the anteriorly expressed hb gene in 124 fixed WT embryos during nc11–13 (Fig. 3e, see Methods). By inferring the developmental time and quantifying the anterior hb mRNA level (0.2–0.4 EL) for each embryo, we reconstructed the endogenous hb transcription dynamics from fixed WT embryos without requiring genetic modification (Fig. 3f, Supplementary Fig. 3f, g). This reconstruction quantitatively matched live imaging results of hb-MS2 in transgenic embryos with minor deviations (mean residual ± s.d., nc11: 0.27 ± 0.19 min, nc12: 0.06 ± 0.28 min, nc13: −0.19 ± 0.55 min, see Methods), demonstrating the high resolution and broad versatility of our DNA-based framework for gene expression studies.

Further analyzing hb transcription along the entire AP axis obtained a complete spatiotemporal profile of endogenous hb transcription in WT embryos (Fig. 3g, Supplementary Fig. 3h, see Methods). This is a critical complement to previous live imaging studies, which had limited field-of-view and often omitted the posterior hb expression band^7,38. Here, we observed that the posterior expression band emerged later than the anterior one, with significantly delayed initiation and peak times (Fig. 3h). This finding aligns with previous reports that anterior hb transcription is activated by early-produced maternal Bcd, while posterior hb transcription is regulated by zygotic transcription factors, e.g., Tll³⁹, which are produced much later^4,40 (Supplementary Fig. 3i). In contrast, both bands vanished simultaneously, suggesting a shared mechanism of transcription termination.

Time inference of fixed embryos resolves the dynamic regulation of Kr by multiple TFs

Transcriptional regulation involves complex and dynamic interactions between multiple factors over time. Our time inference method, combined with multiplex fluorescence imaging of mRNA and regulatory factors in fixed embryos, enables direct measurement of such regulatory dynamics. Here, we used a multi-factor-regulated gene, Kr, as an example to demonstrate this approach. According to genetic studies, the transcriptional regulation of Kr in early development (nc11–13) primarily involves two maternal TFs, i.e., an activator, Bcd, and a repressor, Hb⁴¹. However, the coarse temporal resolution of previous fixed-embryo imaging studies failed to verify this two-factor model, prompting speculation that additional, unidentified regulators might be required^4,42. To resolve this issue, we applied smFISH^35,36,43,44, immunofluorescence^5,20,43, and a DNA dye to simultaneously label Kr mRNA, Bcd and Hb proteins, and nuclear DNA in >130 fixed WT embryos during nc11–13 (Fig. 4a, see Methods).

**Fig. 4: Resolving the dynamic regulation of Kr by multiple TFs from fixed embryos.**

By inferring developmental time from DNA signals, we quantified the spatiotemporal profiles of nuclear Kr transcription and Bcd and Hb concentrations in individual embryos over nc11–13 (Fig. 4b and Supplementary Fig. 4a–c, see Methods). Within each mitotic interphase, Kr formed a medial expression stripe (~0.45–0.60 EL), while Bcd and Hb displayed exponential and reverse sigmoidal gradients along the AP axis, respectively (Fig. 4a, b and Supplementary Fig. 4d), consistent with previous reports⁴. Temporally, all three species varied significantly within each nuclear cycle, with Bcd and Hb levels rising ahead of Kr (Fig. 4c). Notably, during nc13, Kr exhibited two distinct expression peaks that were previously indistinguishable⁴², highlighting the sensitivity of our method. The duration of the earlier peak was significantly shorter the later one, explaining the previously reported increase in Kr expression during late nc13⁴².

To understand the dynamic regulation of Kr by Bcd and Hb, we further analyzed the quantitative relationship between the three profiles. Given the presence of multiple Bcd and Hb binding sites on Kr enhancers (Supplementary Fig. 4e)^41,45,46, we constructed a time-dependent thermodynamic model to describe the nuclear mRNA level of Kr (R) in response to Bcd and Hb concentrations (C_Bcd and C_Hb) (Fig. 4d, see Methods). Applying the model to fit the spatiotemporal relationship between Kr, Bcd, and Hb over nc11–13 yielded (Fig. 4e)

$$\frac{dR}{dt}=k\frac{{C}_{{{{\rm{Bcd}}}}}^{{n}_{{{{\rm{B}}}}}}}{{C}_{{{{\rm{Bcd}}}}}^{{n}_{{{{\rm{B}}}}}}+{C}_{{{{\rm{B0}}}}}^{{n}_{{{{\rm{B}}}}}}}\frac{{C}_{{{{\rm{H0}}}}}^{{n}_{{{{\rm{H}}}}}}}{{C}_{{{{\rm{Hb}}}}}^{{n}_{{{{\rm{H}}}}}}+{C}_{{{{\rm{H0}}}}}^{{n}_{{{{\rm{H}}}}}}}-\gamma R,$$

(1)

where n_B and n_H are Hill coefficients describing the cooperativities of Bcd and Hb bindings, respectively; C_B0 and C_H0 are concentration thresholds for Bcd and Hb regulations, respectively; k and γ represent mRNA production and degradation rates, respectively (see Methods). This result suggests a synergistic regulatory mechanism, where cooperative Bcd and Hb bindings combine their regulatory effects multiplicatively.

From the fitting, we found that Kr was only active (k > 0) in a fraction of each nuclear cycle (Fig. 4f). Notably, in nc13, k exhibited two pulses, suggesting gene activation in G1 and G2 phases, respectively. This phenomenon was not observed in nc11–12, possibly due to the transiency of G1 in early cycles. Throughout all cycles, n_B and n_H remained stable at ~4 and ~6, respectively (Fig. 4g), revealing higher-order cooperativity of Hb than Bcd, consistent with bioinformatic studies (Supplementary Fig. 4e)^41,45,46. C_B0 and C_H0 were estimated to be ~2.4 nM and ~7.2 nM, respectively (Fig. 4h), suggesting that Hb repression and Bcd activation determine the anterior and posterior boundaries of Kr expression, respectively (Fig. 4i). Further measuring Kr regulation in transgenic embryos with reduced Bcd and Hb dosage (1×bcd strain) confirmed these findings (Supplementary Fig. 4f–i), revealing that maternal Bcd and Hb are sufficient for early Kr patterning.

Time-resolved single-molecule mRNA statistics reveal unsteady-state kinetics of hb transcription

Beyond population-level studies, fixed-embryo imaging can achieve single-molecule sensitivity, enabling statistical analysis to uncover the microscopic mechanisms of gene regulation. Following this strategy, previous smFISH studies of nascent mRNA copy-number distribution at individual gene loci have identified the bursty nature of stochastic gene transcription in various biological systems^35,47,48. However, most of these studies assumed gene activity at steady state^35,47,48, which is inadequate for capturing the highly dynamic nature of developmental gene regulation. Here, with accurate time inference, we used the hb gene as an example to establish a time-resolved theoretical pipeline for deciphering unsteady-state transcription kinetics from multiple smFISH-labeled embryos covering nc11–13 (Fig. 5a, see Supplementary Note 1).

**Fig. 5: Uncovering hb transcription kinetics through time-resolved single-molecule mRNA statistics.**

Based on previous steady-state analyses, hb transcription satisfies two-state stochastic kinetics^36,43,49, with random transitions between active and inactive gene states occuring at Poissonian rates k_ON and k_OFF. Nascent mRNA molecules are only initiated in the active state at a rate k_INI, followed by elongation at a speed V_EL and a rapid release. Cooperative Bcd binding activates anterior hb transcription by modulating k_ON. However, the validity and temporal evolution of this picture within each nuclear cycle remain unclear. To address this issue, we treated the kinetic parameters of the two-state model as time-dependent variables (Fig. 5a, see Methods). Particularly, to describe the shutdown and re-expression of hb during each mitosis (Fig. 3f), we assumed that the percentage of activatable gene loci, P_active, varies with nuclear cycle phases⁵⁰. Notably, since each nascent mRNA signal reflects transcription events within several minutes, signal statistics at successive time points are correlated through inherited kinetic parameters. Retaining this correlation requires simultaneous analysis of the entire time sequence.

Following this extension, we solved the unsteady-state distribution of nascent mRNA number per gene loci over time in different parts of the embryo and compared it with experimental data to extract kinetic parameters (Fig. 5b, see Supplementary Note 1). Unlike steady-state analyses that treated V_EL as a predetermined scaling factor^35,36,43,49, our approach allowed a direct estimation of V_EL at ~40 bp/s (Supplementary Fig. 5a), aligning with live-imaging measurements^51,52. In each nuclear cycle, we found a consistent trapezoidal P_active profile across the anterior hb expression domain (Fig. 5c), agreeing with live-imaging results for synthetic genes⁵⁰. The onset of this trapezoidal profile advanced with the nuclear cycle, while its duration increased markedly, indicating a rise in gene activity throughout development (Fig. 5d).

During each nuclear cycle, we found that k_ON varied significantly with AP position and time (Fig. 5e), while k_OFF and k_INI remained stable (Supplementary Fig. 5b, c). These results corroborate previous steady-state analyses, confirming that k_ON is the primary target of Bcd modulation^35,43. Specifically, at each AP position, the temporal profiles of k_ON and Bcd concentration both showed rising, plateau, and falling phases (Fig. 5f). During the plateau phase, the steady-state Bcd dependence of k_ON followed a Hill function with a Hill coefficient of ~5, consistent with the canonical picture of cooperative Bcd binding-induced gene activation⁴³ (Fig. 5g, h). In contrast, the rapid rising phase exhibited unsteady-state behaviors. Notably, we observed a continued increase in k_ON for ~1.5 min after the Bcd concentration peaked (Fig. 5f). This relaxation behavior supports a causal relationship between Bcd binding and hb activation, which was not captured in previous steady-state studies. Further analyzing this relaxation behavior using an unsteady-state Bcd binding model enabled a direct estimation of Bcd binding and unbinding rates (Fig. 5i, see Supplementary Note 1), which were previously unknown. During the falling phase, k_ON decreased ahead of Bcd (Fig. 5f), suggesting a TF-independent mechanism of transcription termination. Collectively, these results demonstrate the power of our method in elucidating the dynamic nature of single-cell gene regulation mechanisms.

Discussion

Direct spatiotemporal tracking of complex developmental gene regulation through live imaging has long been limited to monitoring only a few factors simultaneously¹¹. Although recent advances in fluorescence lifetime imaging offer potential solutions⁵³, the genetic perturbations from multiple fluorescent reporters complicate this approach. Conversely, fixed-embryo imaging could overcome these limitations but lacks precise temporal resolution. In this paper, we present a CNN-based deep learning approach to objectively infer the absolute development time from the nuclear signals in fixed-embryo images with 1-minute resolution. Using early Drosophila embryos as a case study, we demonstrate its power in unraveling transcriptional regulation of key developmental genes at the single-cell and single-molecule level.

Compared to existing CNN-based embryo staging, our method offers several key improvements: (1) Instead of analyzing macroscopic embryo morphology that varies slowly, we focus on fast-changing nuclear signals, enabling finer temporal resolution. (2) Unlike traditional CNN-based staging that produces classification outputs^26,27, our method uses regression outputs, allowing for continuous time prediction. (3) Rather than using a single neural network, whose learning may be limited to a specific spatial scale^26,27, we employ three independent networks to cover multiple spatial scale. Compared to other multi-scale approaches, such as feature fusion^54,55, our combination of three independent predictions serves as an ensemble validation, enhancing robustness and providing greater scalability for future adaptation to other systems. (4) To apply our method to fixed embryos of arbitrary strains, we rescale images to compensate for fixation-induced and strain-specific variations in nuclear size — a key time-dependent feature captured by our models. (5) To extend the applicability of our method to DNA images, we use fixed embryos colabeled with histone and DNA as intermediates to train DNA-based models. Overall, these improvements ensure precise and reliable time inference for fixed embryos of any strain.

Besides morphology-based embryo staging, gene expression and epigenetic features have also been used for staging^6,17. However, these approaches typically require transcriptomic and/or epigenomic data collected at multiple time points, incompatible with regular imaging studies. Moreover, because gene expression is highly influenced by genetic modifications, the versatility and reliability of these approaches across different strains are uncertain. Conversely, our morphology-based method is more robust against genetic manipulation.

Previous fixed-imaging studies of gene regulation are typically limited to resolving static or quasi-static regulatory relationships^6,35,42,43. Our method enables, for the first time, high-resolution reconstruction of dynamic gene regulation from fixed WT embryos, capturing dynamic details comparable to live imaging. Without genetic modifications and maturation delays of fluorescent proteins, it offers more accurate quantification^11,56,57. This allowed us to resolve the dynamic regulatory relationship between Kr, Bcd, and Hb, extending the static thermodynamic model of transcriptional regulation⁵⁸. By showing that maternal Bcd and Hb are sufficient to drive early Kr patterning, our results dispel the prior speculation about additional, unidentified regulators^4,42. This not only advances our understanding of early Kr regulation but also provides a framework for determining key regulators for other gap genes. Moreover, our results quantitatively verify a previously uncertain hypothesis of combinatorial Kr regulation: Bcd and Hb each perform cooperative binding independently, while their regulatory effects combine synergistically through multiplication⁵⁹. This finding enhances the mechanistic understanding of early Kr regulation and underscores the potential of our approach to unravel complex gene regulatory mechanisms in other systems.

Single-molecule statistical analysis of fixed-imaging data can uncover microscopic mechanisms of gene regulation^35,47,48. However, the lack of temporal resolution hinders its application to unsteady-state process^60,61. With precise time inference, we extended this approach to reconstruct the time-dependent transcription kinetics of individual hb gene loci in WT embryos. Our analysis reveals the variation in the percentage of activatable gene loci within the nuclear cycle, a feature previously reported only in live imaging of synthetic genes⁵⁰. Comparing the extracted hb kinetics with the Bcd profile further extended our understanding of TF binding-induced gene activation to realistic unsteady-state scenario. These results were essential for establishing a dynamic view of how Bcd regulates the endogenous hb gene, highlighting the power of our approach to unravel the microscopic kinetics of dynamic gene regulation.

Although our method requires imaging a large number of embryos, it offers an effective and universal solution for accurately quantifying developmental dynamics without genetic modifications. By imaging other features and retraining the models, it can be easily adapted to other developmental stages, other organisms, or even non-developmental processes. Future integration of this method with high-throughput imaging of RNAs, proteins, and gene loci will further enhance our understanding of complex gene regulatory networks^12,13,14.

Methods

Fly strains

Oregon-R (OreR) strain was used as the wild type. Histone H2A-RFP strain (his2av-mrfp1) was obtained from the Bloomington Drosophila Stock Center (stock number 23650, 23651). Two strains for live imaging of hb activity (yw; histone-rfp; mcp-nonls-gfp and yw; hb BAC>ms2) were developed previously⁸ and were obtained as gifts from Dr. Hernan H. Garcia (University of California at Berkeley). 1×bcd strain (+/CyO-bcd + ; E1s) was developed previously⁶² and was obtained as a gift from Dr. Jun Ma (Zhejiang University). The sex of embryos from these strains was not considered in this study, as it is not expected to affect early developmental processes. No ethical approval was required for research involving Drosophila melanogaster, in accordance with institutional (Shanghai Jiao Tong University) and national policies.

smFISH probe design

Sets of DNA oligonucleotides complementary to the target transcripts (48 probes for hb, 33 probes for Kr) were designed and synthesized as previously reported³⁵ (Supplementary Table 2). Kr probes were conjugated with tetramethylrhodamine (TAMRA; Thermo Fisher Scientific, C6123). hb probes were conjugated with either Alexa Fluor™ 647 (Invitrogen, A20106) or TAMRA.

Live imaging sample preparation and data acquisition

his2av-mrfp1 embryos were collected directly. Female virgins from line yw; histone-rfp; mcp-nonls-gfp were crossed with males from line yw; hb BAC>ms2 for embryo collection. Embryos collected at 25 °C were dechorionated using bleach, mounted between a semipermeable membrane (Biofolie; In Vitro Systems & Services) and a coverslip, and embedded in Halocarbon 27 oil (Sigma, 9002-83-9), following previous literature⁷.

Live embryos were imaged using a Zeiss LSM 710 and 880 confocal microscopes equipped with a 63×/1.4 NA oil immersion objective in confocal or fast airyscan mode. For his2av-mrfp1 embryos, 16-bit image sequences were acquired at 1 AU pinhole aperture with a pixel size of 132 nm and a z-step size of 1 μm. Most confocal-mode images consisted of 1024 × 1024 × 15 pixels. An exception of five image sequences was captured at 512 × 512 × 15 pixels. Airyscan-mode images were captured at 1012 × 1012 × 19 pixels. The standard temporal resolution was one frame per minute (fpm). An exception of a short image sequence for the cell division process had a temporal resolution of 12 fpm.

For live-imaging of hb-MS2 signal, the pixel size remained at 132 nm, with z-step sizes adjusted to either 0.5 or 0.6 μm. These image sequences were acquired at a resolution of 512 × 512 pixels, with a temporal resolution set to 2 fpm. The excitation wavelengths used were 488 nm for MCP-GFP and 561 nm for Histone-RFP.

Fixed imaging sample preparation and data acquisition

Data of 82 OreR embryos were from previous studies^35,43. All other embryos were collected, fixed, and labeled according to a previously published protocol^35,43. Briefly, his2av-mrfp1 embryos collected at 25 °C were fixed in 8% (v/v) paraformaldehyde solution for 30 min, hand devitellinized, and stored in 1× PBS with 0.1% (w/v) BSA and 0.1% (v/v) Triton X-100 at 4 °C. OreR and 1×bcd embryos collected at 25 °C were fixed in 4% (v/v) paraformaldehyde solution for 15 min, vortexed in 100% methanol for devitellinization, and stored in 100% methanol at −20 °C.

For smFISH, fixed embryos were rehydrated (4 × 10 min) in PBTx (1× PBS, 0.1% (v/v) Triton X-100), washed (3 × 10 min) in hybridization wash buffer (2× SSC, 20% (w/v) formamide, 0.1% (v/v) Triton X-100) at room temperature, and incubated with probe-containing hybridization buffer (10% Dextran sulfate, 0.1% E.coli tRNA, 20% (w/v) formamide, 2×SSC, 0.1% (v/v) Triton X-100, 2 mM Vanadyl ribonucleoside complex (NEB, S1402S), 0.02% (w/v) BSA) at 30 °C overnight. After hybridization, embryos were washed in hybridization wash buffer at 30 °C (3 × 30 min) and in 2× SSC at room temperature (2 × 10 min). For immunofluorescence, embryos were washed (4 × 10 min) in PBTx and blocked in PBT-B (1× PBS, 20% (v/v) western blocking reagent (Roche, 11921673001), 2 mM ribonucleoside vanadyl complex (NEB, S1402S), 0.1% (v/v) Triton X-100) at room temperature for 1 h. Embryos were then incubated with preabsorbed primary antibodies diluted in PBT-B at 4 °C for 20 h, followed by washes (4 × 10 min) in PBTx, another 1 h block in PBT-B, and 1 h incubation with secondary antibodies diluted in PBT-B at room temperature.

For smFISH and immunofluorescence colabeling of OreR and 1×bcd embryos, hb or Kr mRNAs were labeled using smFISH, followed by immunostaining for Bcd and Hb proteins using rabbit anti-Bcd (Santa Cruz Biotechnology, cat: SC-66818, lot: A0108, 1:50 (v/v)) and rat anti-Hb antibody (obtained from Asian Distribution Center for Segmentation Antibodies, 576, 1:300 (v/v)) primary antibodies. Secondary antibodies were Alexa Fluor™ 647-conjugated goat anti-rabbit IgG (Invitrogen, cat: A21245, lot: 1922848, 1:500 (v/v)) and Alexa Fluor™ 488-conjugated goat anti-rat IgG (Invitrogen, cat: A11006, lot: 1921310, 1:500 (v/v)). For a small set of OreR embryos, Cyclin B protein was individually labeled using immunofluorescence with mouse anti-Cyclin B antibody (Santa Cruz Biotechnology, cat: SC-166210, lot: E1016, 1:50 (v/v)), followed by goat anti-mouse IgG secondary antibody conjugated with Alexa Fluor™ 488 (Invitrogen, cat: A11001, lot: 1890503A, 1:500 (v/v)).

Following smFISH and/or immunofluorescence labeling, embryos were washed (4 × 10 min) in PBTx and incubated with Hoechst 33342 (Thermo Fisher Scientific, 62249, 0.1% (v/v) diluted in PBTx) at room temperature for 10 min to stain nuclear DNA. his2av-mrfp1 embryos were only stained for nuclear DNA. Following staining, embryos were washed (4 × 10 min) in PBTx (1× PBS, 0.1% (v/v) Triton X-100) and mounted in Aqua-Poly/Mount (Polysciences, 18606) for imaging.

Fixed embryos were imaged using a Zeiss LSM 880 confocal microscope equipped with a GaAsP detector and a 63×/1.4 NA oil immersion objective. Imaging of fixed his2av-mrfp1 embryos used the same parameter setting as in live imaging. Multiple adjacent image stacks, with a typical size of ~1700 × 1400 × 10 pixels each, were acquired to cover the cortex layer of each embryo. Imaging of fixed 1×bcd and OreR embryos was performed at 16-bit and 1 AU pinhole aperture with a pixel size of 66 nm and a z-step size of 0.32 μm. Multiple adjacent image stacks, with a typical size of ~3200 × 3000 × 21 pixels each, were acquired to cover the cortex layer of each embryo.

Image preprocessing and nuclear segmentation

Image processing followed a previously developed pipeline^35,43 with updated algorithms. Briefly, raw images were converted to TIFF format and flat-field corrected. For both live and fixed images of Histone-RFP signals, two-dimensional (2D) nuclear segmentation was conducted on maximum intensity projection of the z-stack using a combination of local threshold and watershed. For fixed images of OreR and 1×bcd embryos, three-dimensional (3D) segmentation of nuclei was performed on Hoechst image stacks using the Cellpose algorithm⁶³. Segmentation results were manually refined using a custom MATLAB graphical user interface (GUI). For each fixed embryo, the nuclear cycle was determined based on the number of recognized nuclei, using a criterium established from a study of 167 OreR embryos (Supplementary Fig. 2c). The embryo boundary was identified by thresholding the Hoechst image and was manually refined using custom MATLAB GUI. This boundary was used to determine the AP position of each nucleus.

mRNA quantification in live images

In each frame of an image sequence, nascent mRNA foci candidates were identified as 3D local maxima in the nuclear region. Following a double-check using a custom MATLAB GUI, each focus was assigned to its closest nucleus. The local intensity profile of each focus was fitted to a 2D Gaussian function with a uniform background to extract the peak height (I_peak) and radius (σ₀), from which the fluorescence intensity of the focus was calculated as I = 2πI_peakσ₀². By averaging foci intensities over all nuclei for each frame, the nuclear expression dynamics was extracted and normalized against its maximum value throughout the sequence. Results from multiple image sequences were further averaged and normalized.

mRNA and protein quantification in fixed images

mRNA and protein quantification in fixed images followed established protocols^35,43. Briefly, mRNA spot identification and intensity extraction were identical to those in live images. By comparing the joint distribution of peak height (I_peak) and radius (σ₀) between spots from the high-expression region (hb: anterior; Kr: medial) and those from a low-expression region (posterior), we determined a 2D threshold to distinguish real mRNA spots from background noise. The typical intensity, I₀, of a single mRNA molecule was extracted from the primary peak of the spot intensity distribution based on a multi-Gaussian fitting. A threshold (3I₀ for hb and I₀ for Kr) was applied to identify active transcription sites within the recognized mRNA spots. The intensity of each transcription site was divided by I₀ to determine the equivalent number of nascent transcripts at that site.

For Bcd and Hb protein signals, the average immunofluorescence intensity of each nucleus was estimated from the central z-slice of the nucleus. A background fluorescence level was estimated from the posterior part of the embryo and subtracted from the results. The typical intensity of a single protein molecule was extracted from cytoplasmic protein spots, similar to mRNA spot quantification. This value converted the background-subtracted nuclear fluorescence into absolute protein concentration. Quantification of the Cyclin B signal followed a similar protocol without background subtraction.

Determining the developmental time for live images

Each image sequence of a developing embryo covered multiple nuclear cleavage cycles, whose timing was determined based on nuclear division events. Specifically, each nuclear division event typically occurred in ~1–2 min (nc10–11, nc11–12, and nc12–13: ~1 min, nc13–14: ~2 min). The last frame of each division event was defined as the start time of a cycle (Supplementary Fig. 1a). Since nuclear division event can be easily identified by eye, these mitotic frames were excluded from model training and prediction.

To align data from different embryos and compensate for the fluctuation in developmental tempo, we rescaled the nuclear cycle durations (excluding the mitotic frames) of each embryo to standard values. Specifically, in most embryos, nc11 interphase lasted ~7 min, nc12 interphase lasted ~8–10 min, and nc13 interphase lasted ~12–15 min. We thus determined the standard durations of nc11–13 interphases to be 7, 9, and 13 min, respectively. In contrast, for the extended nc14 (lasting >1 h), we focused on its early interphase (~30 min), during which the existing time-prediction methods based on membrane invagination or nuclear length show limited accuracy^20,23. We noticed that, during this period, nuclear size experienced a rapid increase followed by a gradual decline, stabilizing after ~26 min (Supplementary Fig. 1d). Therefore, we rescaled the duration of early nc14 interphase based on this trend. To minimize the impact of imaging-induced phototoxicity, which could disrupt developmental timing, we excluded embryos with nuclear aberrations or significantly prolonged early nc14 interphase ( ≥30 min) from analysis.

Beyond embryo-to-embryo differences in cycle duration, alignment of different image sequences is further limited by the 1-minute resolution of time-lapse imaging, which can result in sub-minute time offsets between sequences. This offset is particularly critical in nc11, as its short duration means that even small timing differences can significantly impact nuclear morphology. To improve the accuracy of nc11 dataset labeling, we tried to extract additional temporal information from the first frame of nc11 in each image sequence. During this 1-minute interval, dividing nuclei undergo significant elongation, with the axial ratio changing systematically over time. By comparing the median nuclear axial ratio ($\tilde{\varepsilon }$; major vs. minor axes) measured for each embryo, we classified all embryos into two groups with $\tilde{\varepsilon }$ around 1.36 and 1.67, respectively (Supplementary Fig. 1b). Using higher temporal resolution imaging (12 fpm) of nuclear division, we estimated the time offset between these two groups to be ~25 s (~0.4 min, Supplementary Fig. 1c). Consequently, nc11 time labels were adjusted to 0.6–6.6 min and 1–7 min for the two groups of embryos, accordingly.

Generating datasets from live images for histone-based learning

For each nuclear cycle, live-imaging embryos were divided into two groups for training and testing, respectively (nc11: n_training = 13, n_test = 4; nc12: n_training = 12, n_test = 6; nc13: n_training = 18, n_test = 4; nc14: n_training = 5, n_test = 3). Within either group, the focal plane of each nucleus in every embryo was identified by comparing the nuclear histone signal in different z-slices. Histone images around each nucleus were cropped and normalized at the nuclear focal plane to generate three datasets of different spatial scales for training or testing, depending on the embryo group. For each nuclear cycle, the three spatial scales were adjusted to cover ~1, 6, and 20 nuclei, i.e., nc11: 81 × 81, 301 × 301, and 601 × 601 pixels; nc12: 75 × 75, 251 × 251, and 401 × 401 pixels; nc13: 71 × 71, 201 × 201, and 401 × 401 pixels; nc14: 55 × 55, 151 × 151, and 251 × 251 pixels. All datasets were converted to 8-bit.

To test the impact of size reduction on time prediction, the image crop size of each dataset was increased (nc11: 90 × 90, 334 × 334, and 668 × 668 pixels; nc12: 83 × 83, 279 × 279, and 446 × 446 pixels; nc13: 79 × 79, 223 × 223, and 446 × 446 pixels) to mimic a 10% reduction of nuclear size.

Generating datasets from fixed his2av-mrfp1 images for relayed learning

160 fixed-imaging embryos covering every minute in nc11−13 (~2 embryos/min) were used to perform relayed learning for DNA-based models. Firstly, histone images from each embryo were cropped and normalized around individual nuclei to generate nine datasets (corresponding to three spatial scales across three regions of the embryo) for histone-based time inference. Specifically, cropped images from the anterior (0.2 EL), medial (0.5 EL), and posterior (0.8 EL) regions of the fixed embryo (region sizes: 1201 × 1201 pixels for nc11, 1001 × 1001 pixels for nc12, and 801 × 801 pixels for nc13) were split into different datasets, with each region generating their own three-scale datasets. Due to fixation-induced embryo shrinkage, the image sizes of these datasets were rescaled to be: nc11: 66 × 66, 244 × 244, and 487 × 487 pixels; nc12: 62 × 62, 208 × 208, and 333 × 333 pixels; nc13: 59 × 59, 167 × 167, and 333 × 333 pixels (see Image rescaling for time inference from fixed embryos).

Secondly, the same set of embryos were divided into two groups for DNA-based training and testing, respectively (nc11: n_training = 16, n_test = 34; nc12: n_training = 20, n_test = 40; nc13: n_training = 22, n_test = 28). Within the training group, DNA images around each nucleus in every embryo were cropped and normalized to generate three datasets of different spatial scales, respectively, similar to the processing of live images. In contrast, DNA images from each embryo within the testing group were cropped and normalized around individual nuclei to generate nine datasets (corresponding to three spatial scales across three regions of the embryo), similar to the processing of fixed histone images. Due to fixation-induced embryo shrinkage, the image sizes of DNA datasets were adjusted to be: nc11: 70 × 70, 259 × 259, and 517 × 517 pixels; nc12: 66 × 66, 221 × 221, and 353 × 353 pixels; nc13: 63 × 63, 181 × 181, and 361 × 361 pixels. All datasets were converted to 8-bit.

Generating datasets from fixed OreR and 1×bcd images for time prediction

Images and segmentation results of fixed OreR and 1×bcd embryos were resized to adjust the pixel dimensions to 132 × 132 nm². The focal plane of each nucleus was identified based on the DNA signal. DNA images from each embryo were cropped and normalized around individual nuclei to generate nine datasets (corresponding to three spatial scales across three regions of the embryo) for DNA-based time prediction, similar to the processing of fixed histone images. Due to the difference in embryo size between strains, the image sizes of these datasets were rescaled. Specifically, the image sizes of OreR datasets were adjusted to be: nc11: 80 × 80, 296 × 296, and 590 × 590 pixels; nc12: 74 × 74, 248 × 248, and 397 × 397 pixels; nc13: 71 × 71, 203 × 203, and 406 × 406 pixels. Similarly, those of 1×bcd datasets were adjusted to be: nc11: 72 × 72, 267 × 267, and 533 × 533 pixels; nc12: 67 × 67, 226 × 226, and 360 × 360 pixels; nc13: 64 × 64, 185 × 185; and 368 × 368 pixels. All datasets were converted to 8-bit.

Measuring the trend of nuclear diameter change over time

For every live or fixed images of his2av-mrfp1 embryos, the diameter of each nucleus was measured, from which the median value was extracted. Results from different images were plotted against the measured or inferred developmental time and averaged with a bin size of 1 min (Supplementary Fig. 1d).

Image rescaling for time inference from fixed embryos

To predict time from fixed his2av-mrfp1 embryos using live-imaging-based models, the crop sizes for generating time inference datasets were rescaled to correct fixation-induced embryo shrinkage. The rescaling magnitude, r, for each nuclear cycle was determined through a recursion. Briefly, an initial guess of r = 1.18 was set based on the size comparison of fixed and live embryos. The crop sizes for three spatial scales were adjusted accordingly to generate tentative datasets for each embryo. Applying the trained histone model to these datasets predicted the developmental time of each embryo, which enabled reconstructing a trend of nuclear diameter change over time. Least-squares fitting of this reconstructed trend to live imaging result provided a new estimation of r for the next round of recursion. The convergence values of r (nc11: 1.23; nc12: 1.20; nc13: 1.20) were used to finalize the crop sizes for fixed his2av-mrfp1 embryos.

Applying DNA-based models established from fixed his2av-mrfp1 embryos on fixed OreR and 1×bcd embryos also requires rescaling the image crop sizes to account for embryo size difference between strains. Here, the rescaling magnitude, r, was estimated from the total area of the embryo (A). Specifically, the mean area of fixed his2av-mrfp1 embryos was used as a standard (nc11: 5.7 × 10⁴ μm², nc12−13: 5.8 × 10⁴ μm²). The mean area of fixed OreR and 1×bcd embryos (OreR: 7.4 × 10⁴ μm², 1×bcd: 6.1 × 10⁴ μm²) were measured as comparisons. For embryos whose areas fall within ±30% of the mean area, r was determined to be $\sqrt{{\bar{A}}_{{{{\rm{OreR}}}}}/{\bar{A}}_{{{{\rm{His}}}}}}$ or $\sqrt{{\bar{A}}_{1\times {{{\rm{Bcd}}}}}/{\bar{A}}_{{{{\rm{His}}}}}}$, with ${\bar{A}}_{{{{\rm{His}}}}}$, ${\bar{A}}_{{{{\rm{OreR}}}}}$, and ${\bar{A}}_{1\times {{{\rm{Bcd}}}}}$ representing the mean areas of his2av-mrfp1, OreR, and 1×bcd embryos, respectively. For embryos whose areas fall between ±30% and ±50% of the mean area, r was determined based on the nearest ±30% boundary, i.e., $\sqrt{A/(1\pm 0.3){\bar{A}}_{{{{\rm{His}}}}}}$. Embryos outside the ±50% mean area range were excluded as outliers.

Calibrating image contrast and saturation for DNA images

DNA signals from fixed embryos were affected by embryo-to-embryo fluctuation in labeling and imaging. Impacting the background fluorescence level, this extra noise may affect the accuracy of our DNA-based models. To identify and solve this issue in each cropped DNA image, we first fitted the intensity histogram of the image to a multi-Gaussian distribution to extract the peak intensity (I_bk) and width (σ_bk) of background fluorescence. Measuring the median values of these quantities for each embryo revealed their upper limits, i.e., $\sup ({\tilde{I}}_{{{{\rm{bk}}}}})=60$ and $\sup ({\tilde{\sigma }}_{{{{\rm{bk}}}}})=20$ (Supplementary Fig. 3a, b). We thus increased the noise levels of each cropped image, i.e., lowered the image contrast, by adding a Gaussian white noise with $\mu={(60-{I}_{{{{\rm{bk}}}}})}^{+}$ and $\sigma=\sqrt{{({20}^{2}-{\sigma }_{{{{\rm{bk}}}}}^{2})}^{+}}$.

Besides being affected by the background level, our DNA-based models were also sensitive to image saturation, which varied depending on the researcher’s imaging setting. Properly compensating for this effect is essential for integrating data from multiple experiments. To create a general framework for this task, we first quantified the saturation level (r_sat) of each background-calibrated DNA image crop as the percentage of pixels at maximum intensity (255 for 8-bit images). By comparing the average r_sat of different training embryos, we classified all embryos into low-saturation and high-saturation groups with a threshold of ${\bar{r}}_{{{{\rm{sat}}}}}=2\%$ (Supplementary Fig. 3c). For each low-saturation image crop, we increased its saturation level to 2% via linear stretching of pixel intensities.

CNN model construction, training, and testing

For each nuclear cycle, three independent CNN models for different image scales were constructed based on a modified VGGNet architecture³⁰, with linear regression output (instead of softmax) for continuous-time inference. Besides the output form, the model was customized in the following ways to fit the small sample size of our study (thousands of images per dataset): (1) The number of convolutional layers was lowered from 8−16 to 3−6 to reduce overfitting. (2) Each convolutional layer was followed by a max-pooling layer to help reduce the parameter number. (3) The size of the convolutional kernel was increased from 3 × 3 to 5 × 5 to enhance feature mining with fewer layers. (4) The number of fully connected layers was increased from three to six to improve feature cross and classification, while the channel number for each layer was reduced to limit the parameter number. (5) In addition to dropout in the first two fully connected layers (dropout ratio: 50%), we added a dropout layer between the second and third fully connected layers (dropout ratio: ~50%) in some CNN models to reduce overfitting. (6) A batch normalization layer was inserted before the flatten layer to speed up model training and reduce overfitting. (7) Based on the characteristics of fly embryo images, we chose to perform random rotation (0−30°), horizontal flip, and a slight shear transformation (0−0.1) in data augmentation. Specifically, the shear transformation was applied to account for slight nuclear deformation that may happen during the experiment. The detailed information of all models is listed in Supplementary Table 1.

For model training, thousands of cropped images from the corresponding training dataset were resized and converted to normalized float format for input. Mean Square Error (MSE) and Adaptive Moment Estimation (ADAM) were selected as the loss function and the optimization function, respectively, with a learning rate of 0.001. With k-fold cross-validation (k = 5 or 10) and Mean Absolute Error (MAE)-based evaluation metric, each model was trained for ~400 epochs to find the best result. Following the above protocol, we adjusted the image batch size, the depth and width of convolutional layers, and the number and rate of dropout layers for each model to further optimize model parameters.

To test the performance of a model on an embryo (or its anterior, medial, or posterior region), cropped images from the corresponding test dataset were resized and converted to normalized float format for input. Typically, the raw image stack of an embryo can generate hundreds of small- and median-scale images and >40 large-scale images, each of which can provide a time prediction. To balance prediction speed and accuracy, we randomly selected 40 images of the corresponding scale and recorded the median value of individual images’ prediction results as the model’s time inference for the embryo (or its anterior, medial, or posterior region).

Model training and testing were performed on a workstation equipped with an NVIDIA GeForce RTX 3090 GPU. For each nuclear cycle, training the small-, medium-, and large-scale models took ~6, 12, and 24 h, respectively. Training a histone-based multi-scale time predictor required ~42 h. Training a DNA-based multi-scale time predictor added another ~42 h. The total training time required to obtain a fully functional DNA-based estimator across all nuclear cycles was ~2 weeks.

Integration of time prediction results

For each embryo in the test group, time prediction results from three independent CNN models for different image scales were compared and integrated. Specifically, for a live-imaging embryo, whose images only covered a small region of the embryo, the median value of three independent predictions from this imaged region was output as the final inference result. Conversely, for a fully imaged fixed embryo, the anterior, medial, and posterior regions each provided three independent predictions. Each region’s median prediction was extracted and compared with each other. An average over the three regions was output as the inference result for the embryo. In addition, for DNA-based inference, the standard deviation of all nine predictions (σ_infer) served as a confidence criterion (Supplementary Fig. 3d). Embryos with σ_infer > 1.5 min were discarded for further analysis. Such confidence assessment is a key advantage of ensemble-based inference, directly improving the trustworthiness of our results.

Construction and application of nuclear-size-based time predictor

To establish a baseline predictor relying solely on nuclear size, we binned the diameters (d) of all nuclei in live-embryo images at each time point (t) in the training dataset to estimate the conditional probability distribution $P(d|t)$. Within each nuclear cycle, this probability is proportional to the likelihood of t given d under a flat prior assumption, i.e., $P(t|d)\propto P(d|t)$. We then performed spline interpolation of $P(t|d)$ using the MATLAB function “spline” with smoothing parameter λ = 0.7 to prevent overfitting. For each nucleus with a given diameter, the time prediction was determined by identifying the global maximum of $P(t|d)$. The overall time estimation for an embryo was calculated as the median of all individual nuclei predictions.

Maximum likelihood estimation (MLE) of time inference residuals for fixed WT embryos

We developed an MLE approach to estimate the residuals of our DNA-based time prediction framework on fixed WT embryos based on the per-embryo fluctuation of hb transcription dynamics. Specifically, by assuming that the hb transcription level (y) across embryos follows a normal distribution at any given time t, i.e., $y \sim N({y}_{0}(t),{\sigma }_{0}^{2}(t))$, we estimated ${y}_{0}(t)$ and ${\sigma }_{0}^{2}(t)$ from live imaging data and smoothly interpolated them using the MATLAB function “spline” (smoothing parameter λ = 0.99). Further assuming that the residuals of our time prediction also follow a normal distribution with fixed mean and variance, i.e., ${t}_{{{{\rm{fixed}}}}} \sim {t}_{{{{\rm{actual}}}}}+N({\mu }_{{{{\rm{res}}}}},{\sigma }_{{{{\rm{res}}}}}^{2})$, the probability of a fixed WT embryo at time ${t}_{{{{\rm{actual}}}}}$ with measured hb transcription ${y}_{{{{\rm{fixed}}}}}$ and predicted time ${t}_{{{{\rm{fixed}}}}}$ can be written as:

$$ P({y}_{{{{\rm{fixed}}}}},{t}_{{{{\rm{fixed}}}}}|{t}_{{{{\rm{actual}}}}})=P({y}_{{{{\rm{fixed}}}}}|{t}_{{{{\rm{actual}}}}})P({t}_{{{{\rm{fixed}}}}}|{t}_{{{{\rm{actual}}}}}) \\ =\frac{1}{\sqrt{2\pi {\sigma }_{0}^{2}({t}_{{{{\rm{actual}}}}})}}{e}^{-\frac{{({y}_{{{{\rm{fixed}}}}}-{y}_{0}({t}_{{{{\rm{actual}}}}}))}^{2}}{2{\sigma }_{0}^{2}({t}_{{{{\rm{actual}}}}})}}\frac{1}{\sqrt{2\pi {\sigma }_{{{{\rm{res}}}}}^{2}}}{e}^{-\frac{{({t}_{{{{\rm{fixed}}}}}-{t}_{{{{\rm{actual}}}}}-{\mu }_{{{{\rm{res}}}}})}^{2}}{2{\sigma }_{{{{\rm{res}}}}}^{2}}}.$$

(2)

Under a flat prior assumption, the likelihood of observing a fixed WT embryo with ${y}_{i}$ and ${t}_{i}$ is proportional to the marginalization of this probability, i.e.:

$$ L(y_i,t_i \mid \mu_{{\mathrm{res}}},\sigma_{{\mathrm{res}}}^2) \propto \int P(y_i,t_i \mid t_{{\mathrm{actual}}}) P(t_{{\mathrm{actual}}}) \, dt_{{\mathrm{actual}}} \\ =\displaystyle \int \frac{1}{\sqrt{2\pi \sigma_0^2(t_{{\mathrm{actual}}})}}e^{ -\frac{(y_i - y_0(t_{{\mathrm{actual}}}))^2}{2\sigma_0^2(t_{{\mathrm{actual}}})} }\frac{1}{\sqrt{2\pi \sigma_{{\mathrm{res}}}^2}}e^{ -\frac{(t_i - t_{{\mathrm{actual}}} - \mu_{{\mathrm{res}}})^2}{2\sigma_{{\mathrm{res}}}^2} }\, dt_{{\mathrm{actual}}}.$$

(3)

Therefore, the total log-likelihood across a dataset of fixed WT embryos is:

$$\log L(\{{y}_{i},{t}_{i}\}|{\mu }_{{{{\rm{res}}}}},{\sigma }_{{{{\rm{res}}}}}^{2})={\sum}_{i}\log L({y}_{i},{t}_{i}|{\mu }_{{{{\rm{res}}}}},{\sigma }_{{{{\rm{res}}}}}^{2}).$$

(4)

For each nuclear cycle, we numerically computed the integral in Eq. (3) using the MATLAB function “integral” and maximized Eq. (4) (or minimized its negative) using the MATLAB function “fmincon” to obtain ${\mu }_{{{{\rm{res}}}}}$ and ${\sigma }_{{{{\rm{res}}}}}$.

Measuring the spatial expression patterns of Bcd, Hb, and Kr

For each OreR and 1×bcd embryo labeled with Kr mRNA and Bcd/Hb proteins, we plotted the nascent mRNA signal (r, in units of the number of molecules) and absolute protein concentrations against nuclear position (x) for all nuclei, respectively. For each data species, we binned individual data points by x. Within position ranges 0.10−0.75 EL (for Bcd), 0.15−0.85 EL (for Hb), 0.20−0.85 EL (for Kr in OreR), and 0.10−0.80 EL (for Kr in 1×bcd), we used a least-squares algorithm (the “fit” function in MATLAB) to fit the binned data from each species to the following functions:

$$[{{{\rm{Bcd}}}}]={c}_{1,{{{\rm{Bcd}}}}}{e}^{-x/\lambda }+{c}_{0,{{{\rm{Bcd}}}}},$$

(5)

$$[{{{\rm{Hb}}}}]={c}_{1,{{{\rm{Hb}}}}}\frac{{e}^{-(x-{x}_{0})/d}}{{e}^{-(x-{x}_{0})/d}+1}+{c}_{0,{{{\rm{Hb}}}}},$$

(6)

$$R={r}_{1}\frac{{e}^{(x-{x}_{{{{\rm{A}}}}})/{d}_{{{{\rm{A}}}}}}}{{e}^{(x-{x}_{{{{\rm{A}}}}})/{d}_{{{{\rm{A}}}}}}+1}\cdot \frac{{e}^{-(x-{x}_{{{{\rm{P}}}}})/{d}_{{{{\rm{P}}}}}}}{{e}^{-(x-{x}_{{{{\rm{P}}}}})/{d}_{{{{\rm{P}}}}}}+1}+{r}_{0},$$

(7)

where c_1,·, c_0,·, r₁, and r₀ denote the peak and background levels of the corresponding species, respectively; λ, d, d_A, and d_P indicate the decay lengths of Bcd, Hb, and Kr patterns, respectively; x₀ represents the boundary position of Hb expression pattern; x_A and x_P denote the anterior and posterior boundary positions of Kr expression domain, respectively. For Kr-boundary analysis (Supplementary Fig. 4d), low-expression embryos with <3 nascent Kr mRNAs per nucleus within the expression band (OreR: 0.4−0.6 EL, 1×bcd: 0.35−0.55 EL) were excluded.

Thermodynamic modeling of unsteady-state gene regulation

To analyze the dynamic regulation of Kr by Bcd and Hb, we related the average nuclear expression of Kr to Bcd and Hb concentrations at each position within 0.20–0.70 EL for each embryo. By predicting the developmental time of each embryo, we binned the Kr-Bcd-Hb relationship at each spatial position along the time axis with a bin width of 1 min. To understand this spatiotemporal relationship, we constructed a time-dependent thermodynamic model, with Kr transcription determined by four possible states of cooperative Bcd and Hb bindings, i.e.,

$$\frac{dR}{dt}= k_{11} \frac{C_{{\mathrm{Bcd}}}^{n_{{\mathrm{B}}}}}{C_{{\mathrm{Bcd}}}^{n_{{\mathrm{B}}}}+C_{{\mathrm{B0}}}^{n_{{\mathrm{B}}}}} \cdot \frac{C_{{\mathrm{Hb}}}^{n_{{\mathrm{H}}}}}{C_{{\mathrm{Hb}}}^{n_{{\mathrm{H}}}}+C_{{\mathrm{H0}}}^{n_{{\mathrm{H}}}}}+k_{10} \frac{C_{{\mathrm{Bcd}}}^{n_{{\mathrm{B}}}}}{C_{{\mathrm{Bcd}}}^{n_{{\mathrm{B}}}}+C_{{\mathrm{B0}}}^{n_{{\mathrm{B}}}}} \cdot \frac{C_{{\mathrm{H0}}}^{n_{{\mathrm{H}}}}}{C_{{\mathrm{Hb}}}^{n_{{\mathrm{H}}}}+C_{{\mathrm{H0}}}^{n_{{\mathrm{H}}}}} \\ +k_{01} \frac{C_{{\mathrm{B0}}}^{n_{{\mathrm{B}}}}}{C_{{\mathrm{Bcd}}}^{n_{{\mathrm{B}}}}+C_{{\mathrm{B0}}}^{n_{{\mathrm{B}}}}} \cdot \frac{C_{{\mathrm{Hb}}}^{n_{{\mathrm{H}}}}}{C_{{\mathrm{Hb}}}^{n_{{\mathrm{H}}}}+C_{{\mathrm{H0}}}^{n_{{\mathrm{H}}}}}+k_{00} \frac{C_{{\mathrm{B0}}}^{n_{{\mathrm{B}}}}}{C_{{\mathrm{Bcd}}}^{n_{{\mathrm{B}}}}+C_{{\mathrm{B0}}}^{n_{{\mathrm{B}}}}} \cdot \frac{C_{{\mathrm{H0}}}^{n_{{\mathrm{H}}}}}{C_{{\mathrm{Hb}}}^{n_{{\mathrm{H}}}}+C_{{\mathrm{H0}}}^{n_{{\mathrm{H}}}}} - \gamma R.$$

(8)

Here, the cooperativity of either Bcd or Hb is described by a Hill function of protein concentration, with n. and C.₀ representing the Hill coefficient and concentration threshold of the corresponding species, respectively. The mRNA production rate for each binding state is denoted by k_ij, with i, j = 0 or 1 corresponding to the unbound or bound states of Bcd and Hb, respectively. Following production, mRNA is thought to degrade at a constant rate γ. By comparing the quantitative relationship between Kr, Bcd, and Hb to Eq. (8), we determined that only k₁₀ is non-zero, leading to Eq. (1).

Based on the observed Kr expression dynamics, we assumed that the gene is only activatable within a specific period in each nuclear cycle. For nc11−12, we used two parameters, t_on and t_off, to represent the start and end moment of this period, respectively. For nc13, we incorporated two additional parameters, t_low,1 and t_low,2, to delineate the start and end moment of the observed low-expression phase within the activatable period, respectively. The transcription rates for the low- and high-expression phases were treated as separate variables.

Following the above settings, we can numerically solve Eq. (1) at each position and time point of a nuclear cycle based on the observed Bcd and Hb concentration profiles. Experimental data from different positions and time points were fitted altogether to theoretical results using a least-squares algorithm (the “lsqcurvefit” function in MATLAB) to extract kinetic parameters. The fitted time parameters (t_on, t_off, t_low,1, t_low,2) were used to extract five expression periods (k > 0) in nc11–13 (4.6–5.5 min in nc11; 4.8–6.6 min in nc12; 3.8–5 min, 5–8.5 min, and 8.5–11.5 min for nc13).

PWM motif analysis

Sequences of the two Kr enhancers were provided in the reference⁴⁶. Bcd and Hb binding sites on Kr enhancers were predicted using a PWM motif analysis following previous literature⁵⁸. PWMs for Bcd and Hb were retrieved from the OnTheFly database⁶⁴ (http://bhapp.c2b2.columbia.edu/OnTheFly/index.php). By computing the binding probabilities of Bcd and Hb at each location on the sequences, four strong Bcd sites and seven strong Hb sites were identified on Kr enhancers with the criterion: sites with a log-odds of above 5.0 (natural logarithm) (Supplementary Fig. 4e).

Mathematical modeling of unsteady-state transcriptional kinetics

Stochastic modeling and inference of unsteady-state transcriptional kinetics are described in detail in Supplementary Note.

Statistics and reproducibility

No statistical method was used to predetermine sample size. For model training and testing, sample sizes were determined based on the requirements of deep learning models, which typically demand datasets comprising several thousand images for effective performance. To ensure sufficient temporal resolution, statistical robustness, and reproducibility, images for each nuclear cycle in either the training or test dataset were obtained from ≥3 biologically independent embryos collected across ≥3 independent experiments. For the fixed-embryo data used in the DNA-based model, since each embryo corresponds to a single time point, more embryos were imaged to ensure coverage of every minute within the nuclear cycle (~2 embryos per minute). Moreover, fixed embryos in the training dataset were approximately evenly distributed across developmental time to prevent temporal bias.

Most gene regulation data (live imaging of hb-MS2 and fixed imaging of Kr and hb) for each nuclear cycle were obtained from ≥3 biologically independent embryos collected across ≥3 independent experiments to ensure sufficient accuracy, statistical robustness, and reproducibility. As above, fixed-imaging data were obtained from more embryos to ensure coverage of every minute within the nuclear cycle (~2 embryos per minute). Two exceptions are the control experiments on 1×bcd embryos for Kr analysis and WT embryos for Tll analysis, each of which performed in two independent experiments, but still including a sufficient number of biologically independent embryos (n ≥ 7).

Cyclin B data for each nuclear cycle were obtained from ≥7 biologically independent embryos (~1 embryo per minute) collected across three independent experiments to ensure sufficient accuracy, statistical robustness, and reproducibility when fitting to previously reported live imaging results.

Embryos undergoing nuclear division were excluded from time inference and gene regulation analyses, as mitotic events are easily identifiable by eye and do not require computational time assignment. During CNN training, data were randomly shuffled with a fixed seed to ensure reproducibility. Otherwise, the experiments were not randomized. No blinding was performed, as all measurements and analyses were fully automated to prevent bias.

Each embryo is biologically independent. Each live embryo was imaged in an independent experiment. For fixed imaging experiments, the numbers of biologically independent replicates are provided as follows: Fig. 2, Fig. 3a–d, Supplementary Fig. 2d, e, and Supplementary Fig. 3b–d: n = 160 fixed his2av-mrfp1 embryos collected from 21 independent experiments. Figure 3d: n = 100 fixed WT embryos collected from three independent experiments. Figure 3e–h, Fig. 5, Supplementary Fig. 3h, and Supplementary Fig. 5: n = 124 fixed WT embryos for hb analysis collected from nine independent experiments. Figure 4 and Supplementary Fig. 4: n = 137 fixed WT embryos for Kr analysis collected from six independent experiments. Supplementary Fig. 2c: n = 167 fixed WT embryos for cycle determine collected from nine independent experiments. Supplementary Fig. 3e: n = 23 fixed WT embryos for Cyclin B analysis collected from three independent experiments. Supplementary Fig. 3i: n = 112 fixed WT embryos for Bcd analysis collected from six independent experiments; n = 7 fixed WT embryos for Tll analysis from two independent experiments. Supplementary Fig. 4f–i: n = 11 fixed 1×bcd embryos for Kr analysis collected from two independent experiments.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The raw image data reported in this paper are available in a private server under accession code http://gofile.me/4yuzx/XhyORolMN. A smaller representative image dataset has been deposited in the Zenodo database under accession code https://doi.org/10.5281/zenodo.15680493⁶⁵. Source data are provided with this paper.

Code availability

Custom scripts for this paper were written in multiple programming languages and are available on https://github.com/Xulab-biophysics/FISHIF-Time2024.git and https://doi.org/10.5281/zenodo.15702436⁶⁶. Specifically, scripts for image preprocessing, gene regulation analysis, and theoretical modeling were developed in MATLAB 2023a (MathWorks). CNN models for time inference were implemented using a Python-based deep learning API, Keras (https://keras.io/). A ready-to-use DNA-based time inference framework was encapsulated into a MATLAB App for customer usage. PWM motif analysis was conducted using C++.

References

Levine, M. & Davidson, E. H. Gene regulatory networks for development. Proc. Natl Acad. Sci. USA 102, 4936–4942 (2005).
Article ADS CAS PubMed PubMed Central Google Scholar
Peter, I. S. & Davidson, E. H. Evolution of gene regulatory networks controlling body plan development. Cell 144, 970–985 (2011).
Article CAS PubMed PubMed Central Google Scholar
Pan, X. & Zhang, X. Studying temporal dynamics of single cells: Expression, lineage and regulatory networks. Biophys. Rev. 16, 57–67 (2024).
Article PubMed Google Scholar
Jaeger, J. The gap gene network. Cell. Mol. Life Sci. 68, 243–274 (2011).
Article CAS PubMed Google Scholar
Haroush, N., Levo, M., Wieschaus, E. F. & Gregor, T. Functional analysis of the Drosophila eve locus in response to non-canonical combinations of gap gene expression levels. Dev. Cell 58, 2789–2801.e5 (2023).
Article CAS PubMed PubMed Central Google Scholar
Jaeger, J. et al. Dynamic control of positional information in the early Drosophila embryo. Nature 430, 368–371 (2004).
Article ADS CAS PubMed Google Scholar
Garcia, H. G., Tikhonov, M., Lin, A. & Gregor, T. Quantitative imaging of transcription in living Drosophila embryos links polymerase activity to patterning. Curr. Biol. 23, 2140–2145 (2013).
Article CAS PubMed Google Scholar
Bothma, J. P. et al. Enhancer additivity and non-additivity are determined by enhancer strength in the Drosophila embryo. ELife 4, e07956 (2015).
Article PubMed PubMed Central Google Scholar
Kawasaki, K. & Fukaya, T. Functional coordination between transcription factor clustering and gene activity. Mol. Cell 83, 1605–1622.e9 (2023).
Article CAS PubMed Google Scholar
Bothma, J. P., Norstad, M. R., Alamos, S. & Garcia, H. G. LlamaTags: A versatile tool to image transcription factor dynamics in live embryos. Cell 173, 1810–1822.e16 (2018).
Article CAS PubMed PubMed Central Google Scholar
Hwang, D.-W., Maekiniemi, A., Singer, R. H. & Sato, H. Real-time single-molecule imaging of transcriptional regulatory networks in living cells. Nat. Rev. Genet. 25, 272–285 (2024).
Article CAS PubMed Google Scholar
Lu, T., Ang, C. E. & Zhuang, X. Spatially resolved epigenomic profiling of single cells in complex tissues. Cell 185, 4448–4464.e17 (2022).
Article CAS PubMed PubMed Central Google Scholar
Takei, Y. et al. Integrated spatial genomics reveals global architecture of single nuclei. Nature 590, 344–350 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Goltsev, Y. et al. Deep profiling of mouse splenic architecture with CODEX multiplexed imaging. Cell 174, 968–981.e15 (2018).
Article CAS PubMed PubMed Central Google Scholar
Pichon, X., Lagha, M., Mueller, F. & Bertrand, E. A growing toolbox to image gene expression in single cells: Sensitive approaches for demanding challenges. Mol. Cell 71, 468–480 (2018).
Article CAS PubMed Google Scholar
Blythe, S. A. & Wieschaus, E. F. Establishment and maintenance of heritable chromatin structure during early Drosophila embryogenesis. Elife 5, e20148 (2016).
Calderon, D. et al. The continuum of Drosophila embryonic development at single-cell resolution. Science 377, eabn5800 (2022).
Article CAS PubMed PubMed Central Google Scholar
Kimmel, C. B., Ballard, W. W., Kimmel, S. R., Ullmann, B. & Schilling, T. F. Stages of embryonic development of the zebrafish. Dev. Dyn. 203, 253–310 (1995).
Article CAS PubMed Google Scholar
Campos-Ortega, J. A. & Hartenstein, V. The embryonic development of Drosophila melanogaster 2nd edn (Springer, 1997).
Dubuis, J. O., Samanta, R. & Gregor, T. Accurate measurements of dynamics and reproducibility in small genetic networks. Mol. Syst. Biol. 9, 639 (2013).
Article PubMed PubMed Central Google Scholar
Boettiger, A. N. & Levine, M. Rapid transcription fosters coordinate snail expression in the Drosophila embryo. Cell Rep. 3, 8–15 (2013).
Article CAS PubMed PubMed Central Google Scholar
Huang, S. K., Whitney, P. H., Dutta, S., Shvartsman, S. Y. & Rushlow, C. A. Spatial organization of transcribing loci during early genome activation in Drosophila. Curr. Biol. 31, 5102–5110.e5 (2021).
Article CAS PubMed PubMed Central Google Scholar
Wu, H., Manu, Jiao, R. & Ma, J. Temporal and spatial dynamics of scaling-specific features of a gene regulatory network in Drosophila. Nat. Commun. 6, 10031 (2015).
Article ADS CAS PubMed Google Scholar
Hallou, A., Yevick, H. G., Dumitrascu, B. & Uhlmann, V. Deep learning for bioimage analysis in developmental biology. Development 148, dev199616 (2021).
Moen, E. et al. Deep learning for cellular image analysis. Nat. Methods 16, 1233–1246 (2019).
Article CAS PubMed PubMed Central Google Scholar
Eulenberg, P. et al. Reconstructing cell cycle and disease progression using deep learning. Nat. Commun. 8, 463 (2017).
Article ADS PubMed PubMed Central Google Scholar
Čapek, D. et al. EmbryoNet: Using deep learning to link embryonic phenotypes to signaling pathways. Nat. Methods 20, 815–823 (2023).
Article PubMed PubMed Central Google Scholar
Toulany, N. et al. Uncovering developmental time and tempo using deep learning. Nat. Methods 20, 2000–2010 (2023).
Article CAS PubMed PubMed Central Google Scholar
Krizhevsky, A., Sutskever, I. & Hinton, G. E. ImageNet classification with deep convolutional neural networks. Commun. ACM 60, 84–90 (2017).
Article Google Scholar
Simonyan, K. & Zisserman, A. Very deep convolutional networks for large-scale image recognition. Proc. Int. Conf. Learn. Representat., 1–14 (2014).
Cao, Y., Geddes, T. A., Yang, J. Y. H. & Yang, P. Ensemble deep learning in bioinformatics. Nat. Mach. Intell. 2, 500–508 (2020).
Article Google Scholar
Shen, J., Liu, F. & Tang, C. Scaling dictates the decoder structure. Sci. Bull. 67, 1486–1495 (2022).
Article Google Scholar
Di Talia, S. & Vergassola, M. Waves in embryonic development. Annu. Rev. Biophys. 51, 327–353 (2022).
Article PubMed PubMed Central Google Scholar
Vergassola, M., Deneke, V. E. & Di Talia, S. Mitotic waves in the early embryogenesis of Drosophila: Bistability traded for speed. Proc. Natl Acad. Sci. USA 115, E2165–e2174 (2018).
Article CAS PubMed PubMed Central Google Scholar
Wang, J., Zhang, S., Lu, H. & Xu, H. Differential regulation of alternative promoters emerges from unified kinetics of enhancer-promoter interaction. Nat. Commun. 13, 2714 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Zoller, B., Little, S. C. & Gregor, T. Diverse spatial expression patterns emerge from unified kinetics of transcriptional bursting. Cell 175, 835–847.e25 (2018).
Article CAS PubMed PubMed Central Google Scholar
Deneke, V. E. et al. Self-organized nuclear positioning synchronizes the cell cycle in Drosophila embryos. Cell 177, 925–941.e17 (2019).
Article CAS PubMed PubMed Central Google Scholar
Lucas, T. et al. Live imaging of bicoid-dependent transcription in Drosophila embryos. Curr. Biol. 23, 2135–2139 (2013).
Article CAS PubMed Google Scholar
Margolis, J. S. et al. Posterior stripe expression of hunchback is driven from two promoters by a common enhancer element. Development 121, 3067–3077 (1995).
Article CAS PubMed Google Scholar
Surkova, S. et al. Characterization of the Drosophila segment determination morphome. Dev. Biol. 313, 844–862 (2008).
Article CAS PubMed Google Scholar
Hoch, M., Seifert, E. & Jäckle, H. Gene expression mediated by cis-acting sequences of the Krüppel gene in response to the Drosophila morphogens bicoid and hunchback. EMBO J. 10, 2267–2278 (1991).
Article CAS PubMed PubMed Central Google Scholar
Jaeger, J., Sharp, D. H. & Reinitz, J. Known maternal gradients are not sufficient for the establishment of gap domains in Drosophila melanogaster. Mech. Dev. 124, 108–128 (2007).
Article CAS PubMed Google Scholar
Xu, H., Sepúlveda, L. A., Figard, L., Sokac, A. M. & Golding, I. Combining protein and mRNA quantification to decipher transcriptional regulation. Nat. Methods 12, 739–742 (2015).
Article CAS PubMed PubMed Central Google Scholar
Raj, A., van den Bogaard, P., Rifkin, S. A., van Oudenaarden, A. & Tyagi, S. Imaging individual mRNA molecules using multiple singly labeled probes. Nat. Methods 5, 877–879 (2008).
Article CAS PubMed PubMed Central Google Scholar
Wunderlich, Z. et al. Krüppel expression levels are maintained through compensatory evolution of shadow enhancers. Cell Rep. 12, 1740–1747 (2015).
Article CAS PubMed PubMed Central Google Scholar
Scholes, C., Biette, K. M., Harden, T. T. & DePace, A. H. Signal integration by shadow enhancers and enhancer duplications varies across the Drosophila embryo. Cell Rep. 26, 2407–2418.e5 (2019).
Article CAS PubMed PubMed Central Google Scholar
Sanchez, A. & Golding, I. Genetic determinants and cellular constraints in noisy gene expression. Science 342, 1188–1193 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Raj, A., Peskin, C. S., Tranchina, D., Vargas, D. Y. & Tyagi, S. Stochastic mRNA synthesis in mammalian cells. PLoS Biol. 4, e309 (2006).
Article PubMed PubMed Central Google Scholar
Xu, H., Skinner, S. O., Sokac, A. M. & Golding, I. Stochastic kinetics of nascent RNA. Phys. Rev. Lett. 117, 128101 (2016).
Lammers, N. C. et al. Multimodal transcriptional control of pattern formation in embryonic development. Proc. Natl Acad. Sci. USA 117, 836–847 (2020).
Article ADS CAS PubMed Google Scholar
Fukaya, T., Lim, B. & Levine, M. Rapid rates of Pol II elongation in the Drosophila embryo. Curr. Biol. 27, 1387–1391 (2017).
Article CAS PubMed PubMed Central Google Scholar
Keller, S. H., Deng, H. & Lim, B. Regulation of the dynamic RNA Pol II elongation rate in Drosophila embryos. Cell Rep. 42, 113225 (2023).
Article CAS PubMed PubMed Central Google Scholar
Qian, Y., Celiker, O. T., Wang, Z., Guner-Ataman, B. & Boyden, E. S. Temporally multiplexed imaging of dynamic signaling networks in living cells. Cell 186, 5656–5672.e21 (2023).
Article CAS PubMed PubMed Central Google Scholar
Chaib, S., Liu, H., Gu, Y. & Yao, H. Deep feature fusion for VHR remote sensing scene classification. IEEE Trans. Geosci. Remote Sens. 55, 4775–4784 (2017).
Article ADS Google Scholar
Haghighat, M., Abdel-Mottaleb, M. & Alhalabi, W. Discriminant correlation analysis: Real-time feature level fusion for multimodal biometric recognition. IEEE Trans. Inf. Forensics Secur. 11, 1984–1996 (2016).
Article Google Scholar
Liu, F., Morrison, A. H. & Gregor, T. Dynamic interpretation of maternal inputs by the Drosophila segmentation gene network. Proc. Natl Acad. Sci. USA 110, 6724–6729 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Balleza, E., Kim, J. M. & Cluzel, P. Systematic characterization of maturation time of fluorescent proteins in living cells. Nat. Methods 15, 47–51 (2018).
Article CAS PubMed Google Scholar
Segal, E., Raveh-Sadka, T., Schroeder, M., Unnerstall, U. & Gaul, U. Predicting expression patterns from regulatory sequence in Drosophila segmentation. Nature 451, 535–540 (2008).
Article ADS CAS PubMed Google Scholar
Scholes, C., DePace, A. H. & Sánchez, Á Combinatorial gene regulation through kinetic control of the transcription cycle. Cell Syst. 4, 97–108.e9 (2017).
Article CAS PubMed Google Scholar
Wang, M., Zhang, J., Xu, H. & Golding, I. Measuring transcription at a single gene copy reveals hidden drivers of bacterial individuality. Nat. Microbiol. 4, 2118–2127 (2019).
Article PubMed PubMed Central Google Scholar
Kilic, Z., Schweiger, M., Moyer, C., Shepherd, D. & Pressé, S. Gene expression model inference from snapshot RNA data using Bayesian non-parametrics. Nat. Comput. Sci. 3, 174–183 (2023).
Article CAS PubMed PubMed Central Google Scholar
Liu, J. & Ma, J. Dampened regulates the activating potency of Bicoid and the embryonic patterning outcome in Drosophila. Nat. Commun. 4, 2968 (2013).
Article ADS PubMed Google Scholar
Stringer, C., Wang, T., Michaelos, M. & Pachitariu, M. Cellpose: A generalist algorithm for cellular segmentation. Nat. Methods 18, 100–106 (2021).
Article CAS PubMed Google Scholar
Shazman, S., Lee, H., Socol, Y., Mann, R. S. & Honig, B. OnTheFly: A database of Drosophila melanogaster transcription factors and their binding sites. Nucleic Acids Res 42, D167–D171 (2014).
Article CAS PubMed Google Scholar
Bao, H., Zhang, S., Yu, Z. & Xu, H. Deep learning-based high-resolution time inference for deciphering dynamic gene regulation from fixed embryos. Zenodo https://doi.org/10.5281/zenodo.15680493 (2025).
Bao, H., Zhang, S., Yu, Z. & Xu, H. Deep learning-based high-resolution time inference for deciphering dynamic gene regulation from fixed embryos. Github https://doi.org/10.5281/zenodo.15702436 (2025).

Download references

Acknowledgements

We thank Hernan H. Garcia and Jun Ma for the generous gift of fly lines. We thank Asian Distribution Center for Segmentation Antibodies for providing the anti-Hb antibody. We thank Jingyao Wang for providing previously published imaging data. This work was supported by the National Key R&D Program of China (grant no. 2021YFA0910702 to H.X.), the National Natural Science Foundation of China (grant no. 12474194, 11774225, 41921006 to H.X.), and the Natural Science Foundation of Shanghai (grant no. 22ZR1434000 to H.X.). We gratefully acknowledge the imaging and computing resources provided by the Student Innovation Center at Shanghai Jiao Tong University, and we sincerely thank Liuyin Fan for the dedicated management and support of these resources.

Author information

Authors and Affiliations

School of Physics and Astronomy, Shanghai Jiao Tong University, Shanghai, China
Huihan Bao, Shihe Zhang, Zhiyang Yu & Heng Xu
Institute of Natural Sciences, Shanghai Jiao Tong University, Shanghai, China
Huihan Bao, Shihe Zhang, Zhiyang Yu & Heng Xu

Authors

Huihan Bao
View author publications
Search author on:PubMed Google Scholar
Shihe Zhang
View author publications
Search author on:PubMed Google Scholar
Zhiyang Yu
View author publications
Search author on:PubMed Google Scholar
Heng Xu
View author publications
Search author on:PubMed Google Scholar

Contributions

Conceptualization by H.B. and H.X.; Methodology by H.B., S.Z., and H.X.; Software by H.B., S.Z., and Z.Y.; Formal Analysis by H.B. and S.Z.; Investigation by H.B., S.Z., and H.X.; Writing – Original Draft by H.B., S.Z., and H.X.; Writing – Revised Draft by H.B., S.Z., and H.X.; Funding acquisition by H.X.; Resources by H.X.; and Supervision by H.X.

Corresponding author

Correspondence to Heng Xu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Reporting Summary

Transparent Peer Review file

Source data

Source Data file

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Bao, H., Zhang, S., Yu, Z. et al. Deep learning-based high-resolution time inference for deciphering dynamic gene regulation from fixed embryos. Nat Commun 16, 6565 (2025). https://doi.org/10.1038/s41467-025-61907-7

Download citation

Received: 15 December 2024
Accepted: 04 July 2025
Published: 16 July 2025
DOI: https://doi.org/10.1038/s41467-025-61907-7