Deep-prior ODEs augment fluorescence imaging with chemical sensors

Pham, Thanh-an; Boquet-Pujadas, Aleix; Mondal, Sandip; Unser, Michael; Barbastathis, George

doi:10.1038/s41467-024-53232-2


Article
Open access
Published: 24 October 2024


Nature Communications volume 15, Article number: 9172 (2024)


Abstract

To study biological signalling, great effort goes into designing sensors whose fluorescence follows the concentration of chemical messengers as closely as possible. However, the binding kinetics of the sensors are often overlooked when interpreting cell signals from the resulting fluorescence measurements. We propose a method to reconstruct the spatiotemporal concentration of the underlying chemical messengers in consideration of the binding process. Our method fits fluorescence data under the constraint of the corresponding chemical reactions and with the help of a deep-neural-network prior. We test it on several GCaMP calcium sensors. The recovered concentrations concur in a common temporal waveform regardless of the sensor kinetics, whereas assuming equilibrium introduces artifacts. We also show that our method can reveal distinct spatiotemporal events in the calcium distribution of single neurons. Our work augments current chemical sensors and highlights the importance of incorporating physical constraints in computational imaging.


Introduction

Biological organisms transmit information by altering the spatiotemporal concentration of certain chemical species¹. For instance, calcium ions (Ca²⁺) act as messengers in a wide range of physiological processes such as cell motility and differentiation, cardiac contraction, wound response, or brain signaling^2,3. In particular, the concentration of calcium exhibits multiple temporal profiles within and across brain cells, each with a unique waveform that encodes a potentially different message^4,5,6; and this is independent of whether the cells are electrically excitable^4,7. For neurons, this diversity is known to play a role in the cellular mechanisms of synaptic plasticity^4,5 and brain memory⁶. For astrocytes, it is a sign of specialization to the heterogeneity of synaptic inputs^7,8,9,10. Many chemical species other than Ca²⁺ are involved in biological signaling. For example, hydrogen-peroxide (H₂O₂) waves play a key role in transmitting information within plants^11,12.

To study cell signaling in living organisms, researchers use chemical sensors¹³. More precisely, an important subset of chemical sensors is designed to report on the presence of certain chemical species of interest (CSI) such as Ca²⁺¹⁴. Their fluorescence changes upon binding to the CSI^15,16, indirectly measuring its concentration. This makes it possible to follow the evolution of dynamic processes through space and time with little invasiveness¹³.

The ability of chemical sensors to report on cell signaling has been key to many studies^17,18,19,20. However, the timescale of the binding kinetics limits the resolution of the observations and, therefore, the range of dynamic processes that can be studied^21,22,23. If the concentration of the CSI varies at a similar timescale—as is often the case in neuroscience—the fluorescence might not reflect the underlying concentration accurately^23,24. This has led researchers into a quest for sensors that are faster and less obscured by the chemical reaction behind the binding process. The example par excellence is the GCaMP family of calcium sensors, which has recently reached its eighth iteration (jGCaMP8²¹) since 2001¹⁶.

Overall, new generations of sensors often translate into new discoveries^3,25,26,27, but experimental design still requires great care in considering the binding kinetics. The non-linearities in the binding process and the speed of its kinetics might offer a deformed picture of the underlying CSI waves and introduce time lags^24,28,29. In spite of this, fluorescent signals are sometimes interpreted in place of the CSI^23,24. In the few cases where the underlying chemical reaction is considered, it is presumed at equilibrium^30,31,32, which is not always justified. The noise inherent to the chemical reaction, as well as that introduced by the acquisition process, are additional challenges to the interpretation of the fluorescence signals.

In this work, we aim to augment chemical sensors by separating the behavior of the CSI from that of the sensors. The final goal is to paint a more accurate picture of the behavior of the underlying CSI by attempting to recover its concentration. This entails accounting for the different kinetics of the sensors, as well as for the non-linearity and time dependency of their interactions.

We first show that overlooking the binding process (e.g., considering the reaction at equilibrium) can deform the shape of the CSI wave considerably and introduce other artifacts such as time lags. Our main contribution is then the formulation of an inverse problem that recovers the spatiotemporal concentration of the CSI from images of the fluorescence emitted by its corresponding chemical sensor. The resulting variational framework accounts explicitly for the non-linear binding reaction outside of equilibrium and imposes a deep-neural-network prior on the spatiotemporal distribution of the CSI. The prior is based on a reparameterization of the concentration using our proposed adaptable latent space that does not require training. In part, this is possible because we inform the inverse problem with the ordinary differential equations (ODEs) that model the binding phenomenologically.

We first validate the accuracy of our reconstructions using simulations of calcium sensors with realistic parameters and in vitro mixing experiments. We then apply our method to real calcium-imaging data of mouse neurons. The calcium concentration that we recover shows a common waveform regardless of the sensor used when the stimuli are similar. By contrast, the corresponding fluorescence signals are amorphous and vary considerably across the sensors. Moreover, our method recovers a CSI-concentration map that is regular in space and time, enabling the observation of distinct spatiotemporal events within single cells without the need for averaging. This effect translates into a denoised fluorescence signal too. Preliminary experiments on simulated hydrogen-peroxide sensors in plant leaves were also promising and can be found in our recent conference abstract³³. In contrast to calcium sensors, the fluorescence of these H₂O₂ sensors decreases upon binding.

We would like to note that our framework is not meant to substitute for careful experimental design, but to augment the information provided by chemical sensors. One should still ask whether the kinetics of the sensors are adequate for the signals under study, whether the binding model is accurate, and whether the mere presence of the sensor affects the natural physiology of the process under study. These caveats are discussed throughout the main text and the Supplementary.

Results

Principles of our framework

Physical model

We start by modeling the binding process. Our aim is to relate the concentration of the CSI with the fluorescence that is emitted from the sample. We consider a CSI such as Ca²⁺ or H₂O₂ with a concentration of c(x, t) inside the sample. The CSI binds to the chemical sensor to form a fluorescent compound. Let s(x, t) and s_b(x, t) denote the concentrations of the sensor and of the fluorescent compound. The binding process can then be modeled with the reversible reaction

$$\mathrm{s} + n_{\mathrm{H}}\,\mathrm{c} \;\underset{k_b}{\overset{k_f}{\rightleftharpoons}}\; \mathrm{s}_b, \tag{1}$$

where n_H > 0 denotes the Hill coefficient, and k_f, k_b are the kinetic rates of the binding and unbinding processes, respectively. We understand (1) as a phenomenological model that can potentially hide multiple reactions³⁴ or dependent binding sites³¹ behind fractional Hill coefficients (see “Methods” and the Supplementary). This is a common interpretation among experimentalists and requires a measurement of n_H, k_f, and k_b. The chemical reaction in (1) can be modeled with an ordinary differential equation (ODE) that interrelates the temporal variations in the concentrations of the three species. While the fluorescence emitted by the sensors depends mainly on the concentration of the fluorescent compound s_b, we can link it to the concentration c of CSI via the ODE. Therefore, we model the predicted fluorescence as $\mathcal{H}(\mathbf{x}, t; c, g_0, q_e)$, where (1) is an implicit constraint, and g₀(x), q_e(x) are the fluorescent background and a concentration-to-fluorescence factor³¹ (see “Methods”). These two additional variables account for several unknown factors such as the quantum yield or the small emissions of the unbound sensor, which can vary over the field of view due to multiple factors such as the length of the optical path to each pixel (ref. ³¹, Section 10.3.1). They could also act as low-order corrections for small kinetic changes, for example those originating from surface-to-volume variations within a cell. While we chose (1) for its generality as a phenomenological model, other ODE systems tailored to specific sensors can be plugged seamlessly into the rest of our framework. Relatedly, the recovered concentration will be unitless unless the experiments are calibrated (see “Methods”).
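To make the link between reaction (1) and the fluorescence concrete, the following minimal sketch integrates one possible mass-action reading of (1) for the bound fraction b = s_b / s_total, namely db/dt = k_f (1 − b) c^{n_H} − k_b b, and maps it to fluorescence as g₀ + q_e b. The exact ODE and fluorescence map are specified in the “Methods” and are not reproduced here, so this particular form, the explicit Euler integrator, and every numerical value below are assumptions chosen only for illustration.

```python
def simulate_bound_fraction(c_of_t, dt, k_f, k_b, n_h, b0=0.0):
    """Explicit-Euler integration of db/dt = k_f*(1-b)*c**n_h - k_b*b.

    b is the bound fraction s_b / s_total (assumed mass-action form).
    Returns the value of b at the start of every time step.
    """
    b = b0
    trace = []
    for c in c_of_t:
        trace.append(b)
        b += dt * (k_f * (1.0 - b) * c ** n_h - k_b * b)
    return trace


def fluorescence(bound_fraction, g0, q_e):
    """Assumed affine map from bound fraction to fluorescence."""
    return [g0 + q_e * b for b in bound_fraction]


# Hypothetical parameters (not taken from any real sensor).
dt = 1e-4                       # time step in seconds
k_f, k_b, n_h = 50.0, 10.0, 2.0
# A 50 ms square pulse of concentration inside a 0.5 s recording.
c = [1.0 if 0.05 <= i * dt < 0.10 else 0.0 for i in range(5000)]

b = simulate_bound_fraction(c, dt, k_f, k_b, n_h)
g = fluorescence(b, g0=0.1, q_e=1.0)
peak_index = max(range(len(g)), key=g.__getitem__)
print(f"fluorescence peaks at t = {peak_index * dt:.3f} s")
print(f"fluorescence above background at the end: {g[-1] - 0.1:.4f}")
```

In this toy setting the fluorescence keeps rising for the whole duration of the pulse and decays slowly afterwards, which is the kind of lag and deformation that the text attributes to the binding kinetics.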

Inverse problem

Equipped with the physical model, we now present our variational framework to recover the concentration distribution from fluorescence images (Fig. 1). To this end, we formulate the inverse problem

$$\left(c^{\star}, g_0^{\star}, q_e^{\star}\right) \in \arg\min_{c,\, g_0,\, q_e} \; \mathcal{D}\left(\mathcal{H}(c, g_0, q_e),\, g_m\right) + \mathcal{R}(c, g_0, q_e), \tag{2}$$

where one searches for the concentration c(x, t), background g₀(x), and scaling q_e(x) that best fit the fluorescence measurements g_m(x, t). In (2), $\mathcal{D}$ is a data-fidelity term. It enforces that the predicted fluorescence $\mathcal{H}(c, g_0, q_e)$ (Fig. 1B) is close to the fluorescence measurements g_m (Fig. 1A). The binding model in $\mathcal{H}$ ensures that the predicted fluorescence is consistent with the chemical equations. Mathematically, however, data fidelity is not enough to single out a solution: many concentration distributions can give rise to similar measurements. The addition of a regularization term $\mathcal{R}$ takes care of this so-called ill-posedness by enforcing additional properties that one expects from a realistic distribution of the concentration and of the background. For similar reasons, it is convenient that g₀(x) and q_e(x) only vary spatially so that one can extract the static information from the many images in a video.
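The structure of (2) can be sketched as a function that adds a data-fidelity term to a regularization term. The specific choices below are assumptions made for illustration and are not the ones stated in the “Methods”: a squared-error data term, a quadratic roughness penalty on g₀ and q_e along a single row of pixels, and a placeholder forward model `toy_predict` that stands in for the ODE-constrained operator.

```python
def objective(predict, c, g0, q_e, g_m, lam=1.0):
    """Data fidelity plus regularization for a 1-D row of pixels.

    predict(c, g0, q_e) must return fluorescence indexed as [t][x].
    The squared error and the roughness penalty are illustrative choices.
    """
    g_pred = predict(c, g0, q_e)
    data = sum((p - m) ** 2
               for row_p, row_m in zip(g_pred, g_m)
               for p, m in zip(row_p, row_m))

    def roughness(u):
        # Sum of squared differences between neighbouring pixels.
        return sum((a - b) ** 2 for a, b in zip(u[1:], u[:-1]))

    return data + lam * (roughness(g0) + roughness(q_e))


def toy_predict(c, g0, q_e):
    """Placeholder forward model (NOT the binding ODE): g = g0 + q_e * c."""
    return [[g0[x] + q_e[x] * c_t[x] for x in range(len(g0))] for c_t in c]


# Two frames, three pixels; the measurements are generated without noise.
c = [[0.0, 0.0, 0.0], [1.0, 2.0, 1.0]]
g0 = [0.1, 0.1, 0.1]
q_e = [1.0, 1.0, 1.0]
g_m = toy_predict(c, g0, q_e)

print(objective(toy_predict, c, g0, q_e, g_m))              # exact fit, smooth maps
print(objective(toy_predict, c, [0.1, 0.3, 0.1], q_e, g_m)) # worse fit, rougher g0
```

Swapping `toy_predict` for a model that integrates the binding ODE is what would tie the predicted fluorescence to the chemical reaction.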

Deep spatiotemporal prior

In the majority of problems that are computationally similar to (2), the term ${{\mathcal{R}}}$ only enforces spatial regularity^35,36,37. Imposing temporal regularity realistically with such regularization terms is complicated; it might require an accurate model of any underlying motion^38,39.

To mitigate the ill-posedness of problem (2) in both space and time, we propose to use the framework of deep spatiotemporal priors instead^40,41,42 (see Supplementary Notes 1 and 2 for a thorough comparison of the methods). In our case, we express the distribution of the concentration as the output of a neural network c(x, t) = f_θ(x, z(t)) parameterized by θ and driven by a latent vector z(t) that is time-dependent (Fig. 1B). This results in a model $\mathcal{H}\left(f_{\boldsymbol{\theta}}(\mathbf{x}, \mathbf{z}(t)), g_0, q_e\right)$ where the concentration is regularized implicitly (in space–time) by the restriction of the latent variables to a manifold, whereas g₀, q_e are regularized explicitly in space (see “Methods”). Note that the network is never trained.
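As a rough illustration of the parameterization c(x, t) = f_θ(x, z(t)), the sketch below evaluates a tiny, randomly initialized multilayer perceptron at one spatial coordinate and two nearby latent vectors. The architecture, the softplus output (used here only to keep the concentration positive), and the straight-line latent curve `latent` are all hypothetical; the actual network and latent space are described in the “Methods”.

```python
import math
import random


def init_mlp(sizes, seed=0):
    """Random weights for a small fully connected network (hypothetical)."""
    rng = random.Random(seed)
    return [([[rng.gauss(0.0, 1.0 / math.sqrt(n_in)) for _ in range(n_in)]
              for _ in range(n_out)],
             [0.0] * n_out)
            for n_in, n_out in zip(sizes[:-1], sizes[1:])]


def f_theta(theta, x, z):
    """Concentration at position x for latent vector z."""
    h = list(x) + list(z)
    for k, (weights, bias) in enumerate(theta):
        h = [sum(w * v for w, v in zip(row, h)) + b
             for row, b in zip(weights, bias)]
        if k < len(theta) - 1:
            h = [math.tanh(v) for v in h]        # hidden nonlinearity
    return math.log1p(math.exp(h[0]))            # softplus keeps c > 0


def latent(t):
    """Straight-line latent curve in 2-D, for illustration only."""
    return (t, 1.0 - t)


theta = init_mlp([3, 8, 8, 1], seed=0)           # 1 spatial + 2 latent inputs
c0 = f_theta(theta, (0.5,), latent(0.500))
c1 = f_theta(theta, (0.5,), latent(0.501))
print(c0, c1)
```

Because the network is a smooth function of its inputs, latent vectors that are close in time yield similar concentrations, which is one way to picture the implicit temporal regularity mentioned above. In the framework, θ and the latent variables would be optimized against the measurements through (2) rather than left at their random initial values.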

In summary, the framework that we propose combines the information contributed by the physical model of the chemical reaction with the regularity of the neural-network parameterization. We call this approach deep-prior ODEs.

Parametric latent space

In addition, we propose a modification of deep priors so that they are better adapted to the dynamics of chemical sensors. In previous work⁴¹, the latent space of the deep prior in a Fourier-ptychography method was a straight line with equidistant samples for lack of structured motion. In ref. ⁴⁰, the latent space of an MRI algorithm was a helicoidal curve to reproduce the periodicity of the sample movements. In biological signaling, however, samples may alternate between fast and slow dynamics in an unknown manner. To capture such heterogeneity, we propose to represent our latent space with a flexible parametric curve (see “Methods”).

We provide the mathematical description and implementation of our framework in the “Methods”. An extended description can be found in Supplementary Notes 1 and 2, where we also present a more technical comparison to the state of the art. We refer to our framework as DUSK for “Deep-prior odes for Uncoupling Sensor Kinetics” (Fig. 1).

Baseline method: reaction at equilibrium

To study the effects of overlooking the sensor kinetics, we consider an alternative method where we assume that the ODE that models (1) reaches the steady state instantly (i.e., $\frac{\mathrm{d}s_b}{\mathrm{d}t}=0$). This is standard practice in calcium imaging^30,31. It leads to the nonlinear pointwise function

$$\mathcal{H}(\mathbf{x}, t; c, g_0, q_e) = g_0(\mathbf{x}) + q_e(\mathbf{x})\,\frac{c(\mathbf{x}, t)^{n_{\mathrm{H}}}}{\frac{k_b}{k_f} + c(\mathbf{x}, t)^{n_{\mathrm{H}}}} \tag{3}$$

for the fluorescence of the sample.

In other words, the binding process is assumed to be much faster than the temporal variations of the CSI concentration. In our experiments, we will compare this baseline method to DUSK. For maximum fairness, we always equip the baseline method with the same deep spatiotemporal prior. We remark, however, that this already constitutes an improvement over considering (3) alone.
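Because (3) is a pointwise function, it can be inverted in closed form: writing r = (g − g₀)/q_e, one obtains c = ((k_b/k_f) · r/(1 − r))^{1/n_H}. The sketch below implements (3) and this inverse for a single pixel and time point. The function names and the parameter values are hypothetical, and the sketch omits the deep spatiotemporal prior that the baseline method is equipped with.

```python
def equilibrium_fluorescence(c, g0, q_e, k_f, k_b, n_h):
    """Pointwise fluorescence predicted by Eq. (3)."""
    cn = c ** n_h
    return g0 + q_e * cn / (k_b / k_f + cn)


def equilibrium_concentration(g, g0, q_e, k_f, k_b, n_h):
    """Closed-form inverse of Eq. (3) for one fluorescence value."""
    r = (g - g0) / q_e                      # normalized fluorescence in [0, 1)
    if not 0.0 <= r < 1.0:
        raise ValueError("fluorescence outside the range reachable by Eq. (3)")
    return ((k_b / k_f) * r / (1.0 - r)) ** (1.0 / n_h)


# Round trip with hypothetical parameters.
params = dict(g0=0.1, q_e=2.0, k_f=50.0, k_b=10.0, n_h=2.0)
g = equilibrium_fluorescence(0.7, **params)
print(equilibrium_concentration(g, **params))
```

The inverse is exact only when the sensor has actually reached steady state; applied to a fluorescence trace that lags behind the concentration, it simply inherits that lag.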

Calcium imaging in the brain

We developed our framework with the intention of augmenting any fluorescent chemical sensor. While other experimental models might stand more to gain from our framework, in this article, our case study is neuronal calcium imaging with GCaMP. More than for its extensive practical importance, we chose this modality because of the availability of rich datasets with accompanying electrophysiological measurements. These are useful as a pseudo ground truth because of the “reproducibility” of action potential (AP) signals.

In principle, the study of neuronal activity is less vulnerable to misinterpreting fluorescent signals because it focuses exclusively on action potentials. Since APs are spikes, they are usually estimated directly from the fluorescence using spike-deconvolution algorithms^43,44,45,46. For fast sensors, simplified spike-to-fluorescence models are sometimes enough to recover the presence of APs^21,44, but the shape of the measurements is better recovered when sensor non-linearities are accounted for^28,47. This is especially true in rapid successions of APs because the fluorescence does not add linearly in time. On the other hand, measurement noise is usually tackled by averaging the fluorescence signal over an entire cell²¹. Only recently have there been attempts to denoise the fluorescence signal, notably with the help of deep learning^48,49.

While APs are admittedly the final goal of many studies in neural circuits, none of these methods is applicable to cells that are not electrically excitable, such as astrocytes or plant cells in general. Moreover, they do not consider the concentration of the CSI, which can carry information in its waveforms, even in cells that do generate APs^4,6. For these reasons, some works do tackle the problem of recovering quantitative calcium signals by developing new protocols or sensors^30,50,51,52,53,54,55 while being mindful of potential physiological alterations^56,57. Some of these approaches estimate the concentration of calcium by experimental calibration; they often directly assume that the spatiotemporal distribution of the CSI is similar to that of the fluorescence signal or, more rarely, presume that the chemical reactions are at equilibrium^30,31, which is still only adequate for certain combinations of sensor and signal speed. We explore this with our framework.

DUSK recovers the CSI in simulations

To assess the accuracy of DUSK in realistic conditions, we developed a simulation pipeline. We used it to simulate the spatiotemporal evolution of CSI concentration in an astrocyte-like sample (Fig. 2B, Branches). The CSI binds with the sensors as it propagates through the branches of the sample by diffusion (Fig. 2A, Ground truth). We model this with a set of reaction-diffusion PDEs. The constants are taken from experimental values for the jGCaMP8s sensor. The fluorescent measurements were computed according to our physical model. We corrupted them with Poisson noise to model the emission and acquisition process (Fig. 2A, Measured). While the simulations evolve via diffusion, we remark that our reconstruction method does not assume so; DUSK is completely agnostic to the underlying transport mechanism. See “Methods” for a more detailed description of the simulations.

**Fig. 2: DUSK recovers the concentration of CSI accurately in simulations.**

In Fig. 2A, we show how DUSK is able to recover the CSI concentration accurately over both time and space from the measured fluorescence. The accuracy of DUSK over time is especially noticeable in the maximum intensity projection (MIP). There, we can observe how the DUSK concentration propagates through the branches similarly to how the ground-truth one does. As seen in Frame 29, DUSK also captures the heterogeneous spatial distribution of the ground-truth CSI. An additional by-product of DUSK is that it denoises the image measurements (Predicted (DUSK), Supplementary Movie 1). We also evaluated the accuracy of the reconstruction quantitatively using the regressed signal-to-noise ratio (RSNR) (see “Methods”). We found that the recovered CSI had an RSNR of 12.52 dB. To illustrate the importance of considering the binding kinetics, we then applied the baseline method to the same measured fluorescence for comparison. As seen in the MIP, the assumption of equilibrium introduces temporal artifacts that hide the astrocyte branches. Another consequence is that the CSI appears spatially homogeneous (Frame 29). The much lower RSNR of 1.31 dB achieved by the baseline method is in agreement with our visual assessment.
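For readers unfamiliar with the metric, the sketch below computes a regressed SNR under a commonly used definition, RSNR = max over (a, b) of 20 log₁₀(‖x‖ / ‖x − a x̂ − b‖), where x is the ground truth and x̂ the estimate. This definition is an assumption here, since the authoritative one is given in the “Methods”. The affine fit makes the metric insensitive to the global scale and offset of the estimate, which matters when the recovered concentration is unitless.

```python
import math


def rsnr_db(estimate, truth):
    """Regressed SNR in dB after the best affine fit of estimate to truth.

    Assumed definition: max over (a, b) of
    20*log10(||truth|| / ||truth - a*estimate - b||).
    """
    n = len(truth)
    mean_e = sum(estimate) / n
    mean_t = sum(truth) / n
    var_e = sum((e - mean_e) ** 2 for e in estimate)
    cov = sum((e - mean_e) * (t - mean_t) for e, t in zip(estimate, truth))
    a = cov / var_e if var_e > 0 else 0.0    # least-squares slope
    b = mean_t - a * mean_e                  # least-squares offset
    err = sum((t - a * e - b) ** 2 for e, t in zip(estimate, truth))
    sig = sum(t ** 2 for t in truth)
    if err == 0.0:
        return math.inf
    return 10.0 * math.log10(sig / err)      # equals 20*log10 of the norm ratio


truth = [0.0, 1.0, 0.0, 1.0]
print(rsnr_db([0.0, 1.0, 1.0, 0.0], truth))  # uncorrelated estimate, low RSNR
print(rsnr_db([3.0, 5.0, 3.0, 5.0], truth))  # affine copy of the truth, very high RSNR
```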

As the CSI propagates through the sample, it creates a traveling wave. At different parts of the cell, the CSI concentration reaches its maximum value at different time points. The spatial regularity of DUSK allowed us to compute the time lag of this wave at each point in space (Fig. 2B). This kind of information is generally more difficult to obtain because most methods require averaging over the cell body. We used the computed time lags to align the CSI concentration temporally, creating a median waveform that could be representative of a stimulus-response (Fig. 2C). To avoid interferences, we split the signal into the cell and the background. We compared the waveforms resulting from the concentration recovered by DUSK and by the baseline method (with the same deep image prior). By design, both methods recover the fluorescence wave accurately. However, the assumption of equilibrium in the baseline method not only introduces a time lag in the CSI concentration but also deforms the wave significantly. Conversely, DUSK captures the behavior accurately, even for the low signal in the background. Together with Fig. 2A, these experiments suggest that, in some cases, it is important to uncouple the behavior of the CSI from that of the sensor. This is especially relevant because our simulations follow the time scale of chemical sensors that are used in practice. We reached similar conclusions with other simulations (see Supplementary Note 1), as well as with our preliminary analysis of H₂O₂ signaling in simulated plant leaves³³. In that case, the sensors had very different kinetic coefficients and, contrary to most sensors, their fluorescence decreased upon binding.
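The time-lag map and the aligned median waveform can be sketched as follows. The text does not state how the lag is computed at each pixel, so this example assumes it is the frame index at which the concentration peaks; the function names and the toy traces are hypothetical.

```python
from statistics import median


def time_lag_map(traces):
    """Per-pixel lag, taken here as the frame index of the maximum (assumption)."""
    return [max(range(len(tr)), key=tr.__getitem__) for tr in traces]


def aligned_median_waveform(traces, half_window):
    """Shift every trace so that its peak is centred, then take the median."""
    aligned = []
    for tr, lag in zip(traces, time_lag_map(traces)):
        # Keep only the traces whose window fits inside the recording.
        if lag - half_window >= 0 and lag + half_window < len(tr):
            aligned.append(tr[lag - half_window: lag + half_window + 1])
    return [median(column) for column in zip(*aligned)]


# Three pixels that see the same pulse one frame apart (a toy traveling wave).
traces = [
    [0, 0, 1, 3, 1, 0, 0, 0],
    [0, 0, 0, 1, 3, 1, 0, 0],
    [0, 0, 0, 0, 1, 3, 1, 0],
]
lags = time_lag_map(traces)
waveform = aligned_median_waveform(traces, half_window=2)
print(lags)
print(waveform)
```

Without the alignment step, a plain median across pixels would smear the pulse over several frames, which is one reason why spatial averaging can hide the waveform.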

To further assess the influence of the kinetics on the accuracy of the reconstruction, in the Supplementary Note 4 we perform simulations with different pairs of binding rates. The results show that DUSK remains accurate across a wide range of values, while the assumption of equilibrium would require very high rates to recover a signal as fast as that in Fig. 2.

Finally, we also studied the robustness of DUSK with respect to the imaging rate. To do so, we recovered the CSI concentration using all the fluorescence measurements available (D = 1), every other image (D = 2), and every 4th image (D = 4). This D stands for the downsampling factor in the forward model (see “Methods”). We then computed the median waveform of the reconstructed CSI for each D (Fig. 2D). For reference, D = 1 is equivalent to an imaging rate with a time step that is half as small as the half-rise of the sensor. Therefore, we assessed whether for D > 1 the imaging rate would be sufficient to capture the behavior. Remarkably, we found that the waveforms recovered by DUSK remained qualitatively close to the ground truth. The RSNRs over the spatiotemporal distribution corroborated our observation quantitatively, as they only decreased slightly with D (D = 1: 13.31 dB, D = 2: 12.31 dB, and D = 4: 10.91 dB). From another perspective, this reflects the ability of DUSK to interpolate between measurements.

DUSK recovers neuronal calcium activity from fluorescence measurements

Having validated our framework, we applied DUSK to the calcium imaging of neurons. We used an extensive dataset where several types of GCaMP reporters were imaged under similar conditions using a two-photon microscope²¹. We remark that our work does not aim at deciphering neuronal spikes, but at uncoupling the behavior of the sensors from the one of calcium. Therefore, we leveraged the richness of this neuronal dataset to compare the underlying calcium signals obtained by applying DUSK to different sensors. In particular, we considered three GCaMP sensors (jGCaMP8s, jGCaMP7f, and jGCaMP8m), each with different kinetics and sensitivity (see Extended Data Table 3 from Zhang et al.²¹).

Each fluorescence sequence in the dataset is paired with electrophysiological measurements that monitor the membrane voltage inside certain regions of interest (ROI). AP spikes are the main source of calcium rises in this dataset. Since our interests are not APs, we do not aim at imposing sparsity on the signal for spike deconvolution⁴⁵. Instead, we have used the electrophysiological measurements as indicators of expected increases in calcium.

Before diving into the reconstruction of calcium in vivo, we tested our method under controlled conditions. In particular, DUSK was able to recover the concentration of calcium accurately in mixing and unmixing experiments with the different sensors (see Supplementary Note 3).

We then proceeded with in vivo experiments. In Fig. 3, we present an example reconstruction for each of the sensor types. Similarly to the simulations, we observe that DUSK has a strong denoising effect on the fluorescence images (first two rows in Fig. 3A–C). The fluorescent decay after a burst of action potentials is clearly observable in the MIPs. On the other hand, the concentration profiles consistently exhibit bursts that precede and are shorter than their fluorescent counterparts (Supplementary Movies 2–4).

**Fig. 3: DUSK recovers the calcium concentration from real fluorescence measurements of different chemical sensors.**

We also compared the concentration with the electrophysiological measurements. To this end, we computed the temporal trace (median and interquartile range) over a biological ROI (Fig. 3D–F, see AP marks in gold). The interquartile range in the measured fluorescence (first row) is larger than in the predicted fluorescence (second row), which corroborates the denoising effect of DUSK. Both the fluorescence and the concentration profiles are well aligned with the APs (golden rods) detected with the electrophysiological measurements. Not only are the bursts of calcium concentration shorter than their fluorescent counterparts, but we also observe a sharper rise in most cases. Spike-related concentration peaks are thus more distinguishable. For example, the concentration in the first AP burst of Fig. 3D has a higher sensitivity index ($d'_C = 2.578$, see “Methods”) than that of the corresponding measured fluorescence ($d'_M = 1.141$), or of the predicted fluorescence ($d'_P = 1.407$).
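The sensitivity index can be illustrated with the standard definition from signal-detection theory, d′ = |μ₁ − μ₀| / √((σ₁² + σ₀²)/2), where the two groups are samples taken during an event and during baseline. Both this definition and the way samples are split into the two groups are assumptions here; the exact procedure is described in the “Methods”, and the numbers below are made up.

```python
import math
from statistics import mean, variance


def sensitivity_index(event_samples, baseline_samples):
    """d' between two groups of samples (standard definition, assumed here)."""
    separation = abs(mean(event_samples) - mean(baseline_samples))
    # Root of the average of the two sample variances.
    pooled_sd = math.sqrt(
        0.5 * (variance(event_samples) + variance(baseline_samples)))
    return separation / pooled_sd


# Made-up samples: values during an AP burst versus values at rest.
d_prime = sensitivity_index([2.0, 4.0], [0.0, 2.0])
print(round(d_prime, 4))
```

A larger d′ means that event and baseline samples overlap less, which is the sense in which the concentration peaks are described as more distinguishable than the fluorescence peaks.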

We found that the scaling q_e(x) recovered by DUSK was largely homogeneous, with the typical example having a very small standard deviation across the image, e.g., 1.155 ± 0.004. This suggests that the spatial effects listed in “Physical model”, such as the difference in optical paths, play a minor role.

The spatiotemporal detail of DUSK provides rich insights

We first assessed whether DUSK does address the temporal deformation induced by the binding process. We applied DUSK to multiple samples expressing the jGCaMP8s, jGCaMP8m, or jGCaMP7f sensors. We then identified events where a single AP was recorded by the electrophysiological measurements. For each of the sensors, we computed the median temporal profile of the (normalized) concentration and of the measured fluorescence over the ROIs of the samples (Fig. 4A). We observe that the resulting fluorescence profiles of the three sensors are qualitatively different (right plot). This is in agreement with the diversity of sensor kinetics. The half-decay times reported in ref. ²¹ for jGCaMP8s (${t}_{1/2}^{8{{\rm{s}}}}=188$ ms) and jGCaMP8m (${t}_{1/2}^{8{{\rm{m}}}}=38$ ms) match well with our estimated fluorescence profiles. By contrast, the median concentration profiles that DUSK estimated for the three sensors exhibit a short, transient calcium response that is very similar across the sensors (left plot). This is in spite of the large sample variability and the three different kinetics. It is also relative to the 8.2 ms frame period of the image data. The CSI responses recovered by DUSK for a burst of two and three APs were also similar across the three sensors (see Supplementary Note 9). Overall, this experiment is further validation of the capability of DUSK to uncouple the behavior of the sensors from that of the CSI.

Next, we evaluated the behavior of the transient response of calcium to the underlying bursts of APs. In general, if each AP elicits an increase in calcium, the last AP in a burst should occur before the calcium concentration reaches its maximum. We thus computed the difference between the rise time of the CSI response and the duration of the burst. We did this over multiple spatial ROIs and burst types (between 2 and 7 APs per burst). In Fig. 4B, we display a box plot of this time difference for each sensor. The median time difference is positive for all sensors. This confirms that calcium normally reaches its maximum concentration after the last AP. Note that this evaluation is rather conservative because a burst of APs may saturate the sensor signal, or the calcium response itself⁵⁸, before the last AP occurs.

In our framework, the concentration and predicted fluorescence are denoised without averaging over a spatial area. This property allowed us to analyze the spatial behavior of the CSI in more detail. In Fig. 4C, we display time-lapse images of a burst of APs. As indicated by the arrows, spatiotemporal patterns are more evident in the concentration than in the fluorescence. In this example, an early and a late calcium rise occurs at the top and bottom of the ROI, respectively, showing spatial delays within the cell body. This type of pattern may be related to the calcium waves and sparks that are known to occur in some neurons^53,59,60.

Figure 4A is a good example of the non-linear time-dependent relation between the CSI and the fluorescence: three different and rather flat fluorescence profiles lead to a very similar CSI profile with a peaked waveform. Similarly, the non-linearity and time dependency hide a lag not only over time but also over space in Fig. 4C. In Supplementary Note 5, we present three additional examples of how these effects make the fluorescence significantly different from the CSI. In one of them, two synchronized and very similar fluorescence profiles give rise to two desynchronized CSI waveforms of different magnitudes.

To visualize the interaction between the spatial and temporal heterogeneity of the concentration, we computed spatial maps of the time lag as in Fig. 2B. These maps make spatiotemporal events more evident. For the experiment in Fig. 4D, the map outlines a distinct area with a longer time lag (red, orange). This is in line with our observations of the delays in Fig. 4C. We then leveraged the continuous representation (10) that underlies the CSI in DUSK. By sampling this function at four times the experimental acquisition rate (Δt = 2.05 ms instead of Δt = 8.20 ms), we were able to explore the reconstructions in more detail. In particular, the right area of the ROI appeared constant in Fig. 4D (left), but the transition between the different high-lag regions became clearer when sampled at a higher rate in Fig. 4D (right).

We believe that the versatility of DUSK is, in part, due to the adaptability of this continuous latent space. The latent vectors seem to display a recurring behavior once optimized. In Fig. 4E, we present the latent curve (10) of some illustrative examples. There, we can see that the AP events (golden dots) tend to cluster in the latent space as much as the parameterization of the curve allows. This indicates that the latent vectors are optimized in a way that helps capture the different dynamics within the signal.

Discussion

We have proposed a variational formulation with a deep spatiotemporal prior to recover the concentration of a CSI from noisy fluorescence images of chemical sensors.

DUSK was accurate at recovering the spatiotemporal concentration of CSI in reaction–diffusion simulations. Conversely, the assumption of equilibrium deformed the CSI and introduced artifacts such as time lags. In real data, the rise time of the CSI profiles recovered by DUSK was in agreement with the duration of the underlying AP bursts, which were measured independently. DUSK was also accurate in the mixing and unmixing experiments. Moreover, the CSI profiles recovered in response to similar stimuli were consistent across the different sensors in spite of their different fluorescence profiles. This doubles as a validation of our framework. It also highlights the importance of uncoupling the CSI from the sensor. Distinguishing whether different fluorescent responses correspond to the same underlying CSI profile could be key to deciphering the function behind the different calcium waves. Since the chemical reactions are non-linear, this uncoupling becomes even more relevant when the stimuli come in bursts. Indeed, on top of correcting for the time lags, DUSK also corrected the deformations induced by the nonlinearity of the binding process. This is perhaps most clear when comparing the fluorescence to the concentration in Fig. 4A, but is also notable in the rest of the figures (see Supplementary Notes 4, 5, 7, too). Taking into account that APs lead to CSI concentrations of similar magnitude, we expect other experimental settings might benefit even more from DUSK.

The temporal scale of the biological signals that can be studied is limited by the sensor and by the acquisition setup⁶¹. When the time scale of the signal is much faster than the binding kinetics, information is lost because any change in fluorescence becomes negligible compared to the compounding of the detector and shot noise. In principle, this loss is irreversible up to the prior. Nonetheless, it appears that accounting for the binding process might help establish these limits more clearly and perhaps push them (see Supplementary Note 4). That DUSK could recover similar information for sensors of different speeds is promising from this perspective. The rise time is usually the main consideration when choosing a sensor. However, a smaller decay time leads to less saturation, and a stronger signal leads to less noise. We found that these two factors were key to the well-posedness of the inversion from fluorescence to concentration, while the final temporal resolution of the CSI profiles was less affected by slower rise times upon uncoupling. Such insight might help guide the design of new sensors and suggests incorporating the inversion of the binding process as an additional consideration.

One byproduct of DUSK is a denoising effect on the measurements. This might be of interest by itself and is the aim of recent works that denoise fluorescence images before further processing^48,49,62,63. (See Supplementary Notes 2 and 6 for a comparison and a more thorough discussion.) In DUSK, however, this effect happens as a result of jointly considering the binding process and estimating a CSI with spatiotemporal priors. As a consequence, averaging over the cell body is not necessary to achieve a signal that is interpretable. Indeed, DUSK yields heterogeneous spatiotemporal maps of the CSI in addition to the standard temporal profiles of the fluorescence. We have found that our time-lag maps highlight how different regions in the cells might be delayed. The ability to distinguish these spatial patterns could be helpful in the study of presynaptic regions by outlining the local accumulation of calcium⁵. It may even be more relevant in astrocytes, where the onset of calcium signaling is less synchronized, and thus averaging is more detrimental²⁴. Our framework also has the potential to mitigate “the oversimplification of slow Ca²⁺ waves” in astrocytes^23,24, for example, by accounting for the long decay times.
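
The remark that a smaller decay time leads to less saturation can be illustrated with a short sketch under strong assumptions. It uses a hypothetical one-site binding model for the fraction θ of bound sensor, dθ/dt = k_on·C·(1 − θ) − k_off·θ, with made-up rate constants and a made-up burst of ten concentration pulses. It is not the sensor model used in this work. Two sensors with the same k_on but different k_off are driven by the same burst, and the remaining headroom 1 − max(θ) is compared.

```python
import numpy as np

def occupancy(c, dt, k_on, k_off):
    """Bound fraction theta from forward Euler on
    d(theta)/dt = k_on*C*(1 - theta) - k_off*theta."""
    theta = np.empty_like(c)
    theta[0] = c[0] / (c[0] + k_off / k_on)  # equilibrium with the baseline
    for n in range(len(c) - 1):
        rate = k_on * c[n] * (1.0 - theta[n]) - k_off * theta[n]
        theta[n + 1] = theta[n] + dt * rate
    return theta

# Hypothetical stimulus: baseline plus a burst of 10 pulses at 20 Hz.
dt = 1e-3                                   # time step in s
t = np.arange(0.0, 2.0, dt)
c = np.full_like(t, 0.05)                   # baseline concentration in µM
for k in range(10):
    c += np.exp(-0.5 * ((t - (0.5 + 0.05 * k)) / 0.005) ** 2)

k_on = 20.0                                 # 1/(µM s), assumed equal for both sensors
theta_slow = occupancy(c, dt, k_on, k_off=1.0)   # slow decay
theta_fast = occupancy(c, dt, k_on, k_off=10.0)  # fast decay

print(f"headroom, slow decay: {1.0 - theta_slow.max():.2f}")
print(f"headroom, fast decay: {1.0 - theta_fast.max():.2f}")
```

In this toy example the slowly decaying sensor accumulates bound molecules over the burst and approaches full occupancy, which leaves little dynamic range for later pulses. The faster-decaying sensor keeps more headroom, at the cost of a lower occupancy and hence a weaker signal.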

We found that the latent space of DUSK may act similarly to a classifier by constructing a subspace where similar events, such as APs, become clustered. This could be leveraged for spike detection. It could even be more relevant for classifying the signature of different calcium waves in cells that are not electrically excitable. Exploring these subspaces could yield a succinct, low-dimensional interpretation of the distinguishing features of the waveforms.

The accuracy of DUSK is limited by the suitability of the underlying model (see Supplementary Notes 3, 7, and 8). Fortunately, the characterization of the kinetics is already a key part of the design and testing of sensors⁶⁴. We did not account for the diffusion of the sensors because they are expressed constitutively, which should promote a more constant and homogeneous distribution. As in standard setups, experimental calibration is also required by DUSK if the exact concentration of CSI is of interest^30,50,51,52,53. However, the spatiotemporal distribution itself does not require calibration in DUSK. We remark that DUSK should be regarded as a way to augment chemical sensors and is not meant to substitute careful experimental design. For example, one important question is whether the presence of the sensor itself perturbs the dynamics under study (other than through the chemical interaction modeled in this work).

We set out to recover the concentration of a CSI from fluorescent chemical sensors. However, the same principles of DUSK readily translate to other sensors whose response signal can be modeled with ODEs. One example is H₂O₂ sensors for plant imaging^32,33. Another example is genetically encoded voltage indicators (GEVIs)⁶⁵, which are designed to report on membrane potential via fluorescent emission. Similarly to GCaMP, the design of GEVIs wrangles over slow rise times, long decay, and low sensitivity because they complicate the interpretability of the voltage signal⁶⁶. Accounting for the underlying kinetics could decrease the sensitivity to these parameters, facilitate signal analysis, and help guide sensor design.