Finite integration time can shift optimal sensitivity away from criticality

Azizpour, Sahel; Priesemann, Viola; Zierenberg, Johannes; Levina, Anna

doi:10.1038/s42005-026-02584-w

Download PDF

Article
Open access
Published: 28 March 2026

Finite integration time can shift optimal sensitivity away from criticality

Communications Physics volume 9, Article number: 119 (2026) Cite this article

1459 Accesses
Metrics details

Subjects

Abstract

Sensitivity to small changes in the environment is crucial for many real-world tasks, enabling living and artificial systems to make correct behavioral decisions. It has been shown that such sensitivity is maximized when a system operates near the critical point of a phase transition. However, proximity to criticality introduces large fluctuations and diverging timescales. Hence, to leverage the maximal sensitivity, it could require impractically long integration periods. Here, we analytically and computationally demonstrate how the optimal tuning of a recurrent neural network is determined given a finite integration time. Rather than maximizing the theoretically available sensitivity, we find networks attain different sensitivities depending on the available time. Consequently, the optimal dynamic regime can shift away from criticality when integration times are finite, highlighting the necessity of incorporating finite-time considerations into studies of information processing.

A mean-field approach to criticality in spiking neural networks for reservoir computing

Article Open access 06 October 2025

Path sampling of recurrent neural networks by incorporating known physics

Article Open access 24 November 2022

Timeliness criticality in complex systems

Article 19 June 2024

Introduction

Living systems must efficiently encode relevant environmental information while being sensitive to small changes. Increasing evidence suggests that many natural systems tackle this challenge by operating near a critical phase transition¹. Signatures of near-critical dynamics have been observed across different scales, from collective behaviors in flocks of birds² to cellular diversity in stem cell populations³, and in the brain^4,5,6,7,8. The proposed advantage of operating near a critical point is that phase transitions endow systems with computational benefits, including elevated sensitivity and correlation^9,10, maximized dynamic range¹¹, enhanced information flow^12,13,14, optimal input representation^15,16, and a diverse spectrum of dynamical responses¹⁷.

Operating in the vicinity of a critical phase transition offers significant advantages but comes with inherent challenges. While enhanced sensitivity of critical systems makes them ideal for some tasks, it also increases their vulnerability to noise, further amplified by critical slowing down^18,19. A recent example of this is decision-making by integrated Ising models, where operating at a distance from a phase transition allows control of the trade-off between reaction time and error rate²⁰. More generally, such a trade-off can be formulated as an optimization problem with a control parameter λ (in our case, changing the distance to criticality) that regulates both beneficial gain G(λ) and detrimental loss L(λ) with some weighting factor γ, i.e.,

$${\lambda }^{* }=\mathop{\arg \max }\limits_{\lambda }\,\left\{G(\lambda )-\gamma L(\lambda )\right\}.$$

(1)

Both gains and losses depend on the particularities of the system. Thus, the optimal tuning of λ, and thereby the optimal distance to criticality, will have to depend on the specific system and requirements of each task²¹. For example, fish schools balance reaction time and energy cost in their alarmed state²², while neuromorphic computing and artificial networks adjust their state to match memory requirements for optimal functioning^23,24. Despite these observations, it remains a challenge to quantitatively assess the trade-off between gain and loss that would determine an optimal distance from criticality.

A famous example of how criticality can assist encoding in the brain is the dynamic range. The dynamic range quantifies the range of continuous input features that can be encoded by the nonlinear firing-rate response of a neuron. It is commonly defined as the logarithmic range of inputs h for which the output is between the 10th and 90th percentile of all outputs¹¹, i.e., $\Delta =10\,{\log }_{10}({h}_{0.9}/{h}_{0.1})$, selected to exclude responses that would not be distinguishable from the noise floor at low activity and saturation regime at high activity. Examples include encoding of correlations in the visual field²⁵, odor concentration²⁶, and sound level^27,28.

Unfortunately, the dynamic range of a single cell is usually much smaller than the dynamic range of perception. This dynamic-range problem can be solved with the emergent properties from recurrent interactions, which were shown to drastically increase the dynamic range as the network approaches criticality^11,29,30. Exploiting close-to-critical emergence was also observed in structures with heterogeneous³¹, modular³², or hierarchical³³ organization. However, previous work neglected the emerging close-to-criticality population activity fluctuations that can hinder confidence in discrimination.

In this work, we combine analytical calculations, numerical simulations, and machine-learning approximations to quantify the optimal balance between input discrimination confidence and the sensitivity of a recurrent neural network, controlled by its recurrent interaction strength λ and the timescale T of a leaky readout (Fig. 1). To formalize this optimization problem, we introduce two generalized measures of dynamic range derived from the discriminability of inputs and provide analytical results for the limiting cases of instantaneous readout and infinite integration time. We find that the optimal state, λ^*, of the network depends on the required confidence and integration time, with a safety margin from the precise critical point for all finite integration times.

**Fig. 1: Fluctuations in network activity can lead to unreliable input reconstruction.**

Results

We consider a random network of probabilistic spiking neurons that can be activated externally and recurrently (see “Methods” section for details). To mimic processing and transmission, only a random subset of neurons receives input, while another random subset of neurons serves as output (Fig. 1a). Input neurons receive uncorrelated, independent Poisson spike trains with a rate h, which represents the input strength. The recurrent interactions are defined by a random sparse connectivity matrix W for which we can control the largest eigenvalue λ and thereby the fluctuations of recurrent activity. A leaky readout integrates output neurons’ activity with timescale T, which can be expressed as a sum of exponential kernels

$${o}^{T}(t)=\frac{1}{{N}^{{{\rm{out}}}}}\mathop{\sum }\limits_{i\in {N}^{{{\rm{out}}}}}\mathop{\sum }\limits_{\{k| \,{t}_{i}^{k} < t\}}{e}^{-(t-{t}_{i}^{k})/T}+\eta ,$$

(2)

where ${t}_{i}^{k}$ is the timing of the k-th spike of neuron i. Here, we added a small Gaussian noise $\eta \sim {{\mathcal{N}}}(0,{\sigma }^{2})$ to be able to technically treat δ-distributed outputs from absorbing states or mean-field solutions in our later analysis, with only a minor effect on typical output distributions. Depending on the parameters of the recurrent interactions, the input ordering would be more or less observable from the output activity, for an external observer or for neurons further up in the processing hierarchy.

For the extreme cases of T → ∞ or T → 0, we can solve the model analytically and obtain closed-form solutions for P(o^T∣h). For T → ∞ this can be achieved using a mean-field approach, and for T → 0 we solve a Fokker–Planck equation (see the “Methods” section). For intermediate integration times, 0 < T < ∞, we have to rely on simulations to obtain the output distributions. This intermediate regime is particularly relevant for biological systems, as the intrinsic timescales of the cortical neurons were found to be in the range of about 50–500 ms^34,35,36. Comparable timescales can also arise from the slow accumulation of activity signals that inform or trigger synaptic plasticity, for instance, through calcium decay following neural spiking, typically lasting around 500 ms³⁷.

The sensitivity of the system is controlled by the largest eigenvalue λ of the connectivity matrix³⁸. The recurrent network we consider has a critical non-equilibrium phase transition for h → 0 at λ_c = 1^32,38, where both sensitivity and correlated fluctuations are strongest. Reducing λ reduces recurrent network fluctuations (Fig. 1b). This generates a trade-off: close to criticality, we expect optimized information processing properties for infinite integration time but simultaneously increased fluctuations in the finite-time output o^T(t). In the following, we explore how to quantify this trade-off depending on the available integration time.

As a first step, we illustrate how the recurrent network’s largest eigenvalue and the readout integration time affect the representation of inputs (Fig. 1b). Each panel shows the outputs from two identical copies of the network with given λ and T in response to the two stimuli with rates h₁ < h₂, which are chosen such that the mean-field outputs 〈o^T〉(h_2/1) = 0.5 ± Δo/2 are easily distinguishable with 〈o^T〉(h₂) − 〈o^T〉(h₁) = Δo = 5σ. Assume the observer’s task is to find which network received the stronger input. This can be solved by deciding that a stronger output indicates a stronger input. We shade gray the times where, following this strategy, the observer will make a mistake. We observe only small fluctuations when λ is far from the critical point (λ = 0.9, left column), irrespective of the integration time (vertical axis), and the inputs can be perfectly assigned from the output. When shifting λ closer to the critical point, fluctuations increase such that with insufficient integration time, the errors appear but can be remedied when the integration time is increased.

To formalize this intuition, we consider the input-output distribution P(o^T(t)∣h) to observe an output o^T(t) in response to a specific input h at time t. In the following, we will omit the time argument for brevity. Now, if two inputs h₁ and h₂ were equally likely to be presented, then the overlap between P(o^T∣h₁) and P(o^T∣h₂) quantifies the minimal discrimination error³⁹ of an ideal observer:

$${{\mathcal{E}}}({h}_{1},{h}_{2})=\frac{1}{2}\int \min \left\{P({o}^{T}| {h}_{1}),P({o}^{T}| {h}_{2})\right\}d{o}^{T}.$$

(3)

Computing this error for the stimuli in our example (Fig. 1c), we find that ${{\mathcal{E}}}$ increases with λ and decreases with T, which matches our just-gained intuition. In our example, variability in o^T comes from observing stochastic, correlated dynamics (λ) with a finite integration time T plus noise. However, our logic remains the same for other causes of variability.

As a next step, we define a set of discriminable inputs that can be sufficiently well distinguished from observing only the output. We call two inputs ε-discriminable if the overlap of the response distributions generated by the inputs is smaller than an error threshold ε. Formally speaking, a set of ε-discriminable inputs ${{\mathcal{H}}}=\{{h}_{1},{h}_{2},\ldots ,{h}_{{n}_{d}}\}$, with h₀ = 0 and ${h}_{{n}_{d}+1}=:{h}_{\infty }$, is a set for which ${{\mathcal{E}}}({h}_{i},{h}_{j})\le \varepsilon$ for all i ≠ j, i, j ∈ [0, n_d + 1], where h_∞ is an input that generated saturated output 〈o^T〉 = 1. Finding the maximal (in the sense of cardinality) set of discriminable inputs is a close-packing problem without a unique solution. To circumvent this complication, we propose the following algorithm: start by finding ${h}_{1}^{{{\rm{left}}}}=\min \{h > {h}_{0}=0:{{\mathcal{E}}}({h}_{0},h)\le \varepsilon \}$, and then proceed by induction to ${h}_{i+1}^{{{\rm{left}}}}=\min \{h > {{h}}_{i}^{{{\rm{left}}}}:{{\mathcal{E}}}({h}_{i}^{{{\rm{left}}}},h )\le \varepsilon \}$, (Fig. 1, see the “Methods” section for more details). We stop at the first i such that ${{\mathcal{E}}}({h}_{i+1}^{{{\rm{left}}}},{h}_{\infty }) > \varepsilon$ and get this way ${n}_{d}^{{{\rm{left}}}}=i$. We repeat the same procedure starting from the right with ${h}_{1}^{{{\rm{right}}}}=\max \{h < {h}_{\infty }:{{\mathcal{E}}}(h,{h}_{\infty })\le \varepsilon \}$ and iterate until ${{\mathcal{E}}}({h}_{i+1}^{{{\rm{right}}}},0) > \varepsilon$ to find ${n}_{d}^{{{\rm{right}}}}$. Our final estimate of the discriminable inputs cardinality is the average ${n}_{d}=1/2({n}_{d}^{{{\rm{left}}}}+{n}_{d}^{{{\rm{right}}}})$.

While this algorithm is numerically straightforward, it comes with a technical challenge: it requires iterative evaluation of P(o^T∣h) for continuous values of h. This is not a problem for our analytical solutions, but it becomes intractable for the actual numerical model because each P(o^T∣h) is a result of a long simulation. To tackle this, we measure the distribution P(a^T∣h) of pure network activity a^T(t) for a broad range of h, T, and λ values, notice that they can be well approximated by a Beta distribution Beta(α,β), and train a neural network to learn the parameters (α, β) as a function of (T, h, λ) to interpolate between them (see the “Methods” section).

From the set of ε-discriminable inputs, we can now construct measures for information processing capabilities using finite integration time (Fig. 2). Let us start with the number of ε-discriminable inputs, n_d (Fig. 2a). Because the task requires sufficient coupling between the input and output population, n_d is very small with λ = 0 and first increases with λ. However, n_d exhibits a maximum at a T-dependent subcritical λ < 1, above which it decays for λ → 1. Our numerical results interpolate between the analytical predictions for T → ∞ (solid line) and T → 0, indicating that every finite integration time will have an optimal 0 < $\lambda \ast$ < 1, while for infinite integration time, n_d is bound by the Gaussian noise of the readout.

**Fig. 2: Dynamical regime for optimal information transmission depends on the readout integration timescale.**

Let us now turn to the dynamic range, which can be naturally generalized to account for fluctuations by choosing as bounds the first and last inputs that can be discriminated from the boundaries. We thus define our dynamic range as

$$\Delta =\Delta (\varepsilon )=10\,{\log }_{10}({h}_{1}^{{{\rm{right}}}}/{h}_{1}^{{{\rm{left}}}}),$$

(4)

where the ${h}_{1}^{{{\rm{left}}}}$ and ${h}_{1}^{{{\rm{right}}}}$ are the minimal and maximal inputs that can be discriminated from h₀ = 0 and h_∞, respectively, with error not surpassing ε. Δ depends on the specific choices of the discrimination error ε and the variance of the Gaussian noise σ, which we can be tuned to recover the typical 10–90% bounds of the established dynamic range¹¹ for T → ∞. For finite T, our numerical estimates interpolate well between the analytical bounds (see Supplementary Note 2 for comparison with full readout). Importantly, a finite T results in a substantial reduction of the dynamic range in the vicinity of the critical point, i.e., for λ ≈ 1, but only a slight reduction at small λ. As a result, the dynamic range develops a T-dependent maximum, which is, however, different from the maximum of n_d (insets in Fig. 2).

Discussion

Our results establish a connection between sensitivity (governed by the distance to criticality), confidence (capturing the probability of wrong classification), and integration time in a recurrent network of excitatory stochastic neurons. While we primarily focused on discriminability as a function of the distance to criticality for a given integration time, we can also make statements about how T and ε affect the discriminability. For any fixed λ (vertical slice in Fig. 2), we find that both measures of discriminability increase monotonically with T. Also, by construction, the discriminability has to increase monotonically with the discrimination error ε. Still, both T and ε affect the peaks in our measures of discriminability and thereby define the optimal state. For finite T, the optimal balance is achieved by subcritical networks.

We expect that our insight about optimal sensitivity away from criticality will occur similarly in other stochastic systems, where emergent properties near criticality are beneficial for solving tasks in the presence of increasing stochastic fluctuations. On which side of the transition this optimum lies will, however, depend on both the task and the type of phase transition, e.g., absorbing-to-active, transition to chaos, or a bifurcation—see ref. ¹ for an overview. For example, in the case of transition to chaos, it was shown that deviations toward the supercritical side, deeper into the chaotic regime, allow for slower integration times and are thus beneficial in the presence of noise⁴⁰. Additionally, emergent critical fluctuations do not necessarily have to align with the neural population activity; examples include a large dispersion of correlations⁴¹ or low-dimensional subspaces⁸.

Since near-critical dynamics imply a finite autocorrelation time^9,10, our results align well with the observation of finite timescales in neurophysiological data^42,43,44. While there is clear evidence for sensory integration, many perceptual tasks are solved in short times of less than a second⁴². This limited temporal integration can be due to non-stationary information rates, temporal correlations (such as in our recurrent dynamics), or leaky integrators (such as in our readout). In our model, the recurrent autocorrelation timescale can be estimated assuming a linear autoregressive representation^21,43, yielding $\tau \approx -\Delta t/ln(\lambda )$. The confidence-dependent optima in Fig. 2 thus correspond to τ ≈ 10 − 100 ms, assuming a timestep Δt = 1 ms, or to τ ≈ 50–500 ms for a timestep Δt = 5 ms that could comprise various propagation delays and raise times. This is consistent with empirical evidence of cortical timescales in the range of 50 ms–1 s^34,35,36. Also, it is consistent with the recently observed adaptation to task requirements^23,44, which in our case would correspond to a change in ε and T.

On the side of artificial networks, we believe that our results provide a new perspective on the reservoir-computing paradigm with typically memory-less readout signals⁴⁵. While noise-free continuous formulations, like the echo-state network⁴⁶, allow for reading out information about the past from standard nonlinear dynamic considerations, any system that comes with noise could benefit from integrating the readout over time, as was demonstrated recently for active particles⁴⁷. In light of our results, an instantaneous readout would require a larger distance to criticality for optimal discriminability. However, leaky readout units could allow the reservoir to be tuned closer to criticality, thereby benefiting from the edge-of-chaos sensitivity.

To summarize, given a readout with finite integration time, we find maximal discriminability for close-to-critical dynamics. The intuitive reason for our finding is that emergent temporal fluctuations close to a critical phase transition can smear out the signal if they are aligned with the readout, and thereby hinder discrimination. Since the network sensitivity is maximal at criticality, this implies a trade-off between sensitivity and discriminability. Our results thereby add to the hypothesis that living systems need to adjust their state to optimally balance opposing demands depending on the specific processing tasks at hand^21,44,48,49.

Methods

Neural network model

We consider a network of N = 10⁴ binary spiking neurons, each described by a state variable s_i that can be active (s_i(t) = 1) or inactive (s_i(t) = 0). Time evolves in discrete steps of Δt. Neurons can be activated by recurrent input from other neurons with probability p^rec[s_i(t + Δt) = 1∣s(t)] = f(∑_jw_ijs_j(t)), where w_ij are directed coupling weights (not symmetric) and f(x) is a rectified linear function with f(x) = 0 for x < 0, f(x) = x for 0 < x < 1, and f(x) = 1 for x > 1. The connectivity matrix $W=({w}_{ij})$ is a sparse matrix with mean degree K = 10², where non-zero edges are selected with probability K/N and diagonal entries are removed. Non-zero weights are set to w_ij = λ/K_i, where K_i is the indegree of neuron i corresponding to the number of non-zero weights in row i. Thereby, each neuron has the same maximal input of ∑_jw_ij = λ, and λ is the largest eigenvalue of the connectivity matrix W.

In addition to the recurrent activation, a random subset of Nⁱⁿ = μN neurons receives external input. The external input is modeled as a Poisson process with rate h or equivalently as an activation probability p^ext[s_i(t + Δt) = 1] = 1 − e^−hΔt, which causes neurons to fire independently and irregularly.

The output is defined by an exponential smoothing of the spikes from a random subset of N^out = νN neurons plus Gaussian noise, cf. Eq. (2). Let us denote the pure network output as ${a}^{T}(t)=1/{N}^{{{\rm{out}}}}{\sum }_{i\in {N}^{{{\rm{out}}}}}{\sum }_{\{k| {t}_{i}^{k} < t\}}{e}^{(t-{t}_{i}^{k})/T}$, where the sum goes over all i in the subset of output neurons and all spike times k with ${t}_{i}^{k} < t$. For discrete time steps, these exponential kernels can be implemented as standard exponential smoothing

$${a}^{T}(t)=(1-{c}^{T}){a}^{T}(t-\Delta t)+{c}^{T}\frac{1}{{N}^{{{\rm{out}}}}}\mathop{\sum }\limits_{i\in {N}^{{{\rm{out}}}}}{s}_{i}(t),$$

(5)

with c^T = 1 − e^−Δt/T. Iterative substitution yields a geometric sequence as the discrete realization of the desired exponential function.

Neural-network approximation of output distribution

The stochastic simulations yield $P\left({a}^{T}| h\right)$ by aggregating all measurements a^T(t) after proper equilibration for specific values of h. However, our estimation of discriminable inputs requires a distribution of $P\left({a}^{T}| h\right)$ for any h that can be achieved using interpolation. To solve this, we first notice that $P\left({a}^{T}| h\right)$ can be well approximated by a Beta distribution Beta(α, β). We obtain (α, β) as a maximum-likelihood estimate from simulations with parameters θ = (T, h, λ). We scan the parameter space logarithmically in the ranges $1-\lambda \in \left[1{0}^{-4},1\right]$, $h\in \left[1{0}^{-6},1{0}^{2}\right]$ and $T\in \left[1,1{0}^{4}\right]$ and train a dense 3-layer neural network to approximate the functions α(θ) and β(θ), cf. Fig. 3. This exploits the fact that neural networks can act as general function approximators⁵⁰. Here, we choose three layers with 60 neurons each and a hyperbolic tangent ($\tanh$) activation function, following previous approaches to fitting scaling functions⁵¹. We found good fits when scaling input and output parameters into the domain [−1, 1]. To ensure that the distribution mean increases monotonously with h (relevant for the discriminable inputs), we further added a regularization term that penalizes deviations of the mean 〈a^T〉 = α/(α + β) from the mean-field solution Eq. (9), essentially implementing a physics-informed regularization.

**Fig. 3: Workflow to calculate ε-discriminable inputs.**

Mean-field solution for the limit T → ∞

For simplicity, we perform mean-field computation for the case of the read-out population coinciding with the whole network. For T → ∞, we can neglect fluctuations such that the P(a^T∣h) becomes a delta distribution at the mean-field activity ${a}^{\infty }=a={lim}_{T\to \infty }1/N{\sum }_{i=1}^{N}1/T{\sum }_{t=1}^{T}{a}_{i}(t)$, cf. Fig. 3a. To estimate the mean activity, we need to separate the network into the part that receives input with Nⁱⁿ = μN neurons and mean activity aⁱⁿ, and the rest of N^rest = (1 − μ)N neurons with mean activity a^rest, such that the mean activity is a = μaⁱⁿ + (1 − μ)a^rest.

Since each neuron is randomly connected to any other neuron in the network with the same total weight λ = ∑_ijw_ij/N, we can approximate the probability of recurrent activation

$$\overline{{p}^{{{\rm{rec}}}}}=\overline{\mathop{\sum }\limits_{j}{w}_{ij}{s}_{j}(t)}\approx \lambda \left[\mu {a}^{{{\rm{in}}}}+(1-\mu ){a}^{{{\rm{rest}}}}\left.\right)\right].$$

(6)

After averaging out temporal fluctuations, we find that the mean activity equals the activation probability. For those neurons that can only be excited recurrently, we thus obtain ${a}^{{{\rm{rest}}}}=\overline{{p}^{{{\rm{rec}}}}}$. For those neurons that receive external input, we need to take coalescence into account⁵² and find ${a}^{{{\rm{in}}}}=1-(1-\overline{{p}^{{{\rm{rec}}}}})(1-{p}^{{{\rm{ext}}}})$. This leaves us with a system of self-consistent equations

$${a}^{{{\rm{rest}}}}=\lambda \left[\mu {a}^{{{\rm{in}}}}+(1-\mu ){a}^{{{\rm{rest}}}}\right],$$

(7)

$${a}^{{{\rm{in}}}}=1-\left(1-\lambda \left[\mu {a}^{{{\rm{in}}}}+(1-\mu ){a}^{{{\rm{rest}}}}\right]\right)\left(1-{p}^{{{\rm{ext}}}}\right),$$

(8)

that can be solved to yield

$$a=\frac{\mu {p}^{{{\rm{ext}}}}}{1-\lambda \left(1-\mu \right)-\lambda \mu \left(1-{p}^{{{\rm{ext}}}}\right)}.$$

(9)

Mean-field solution for T → 0

In the limit to continuous time, we can model the probability of neural activation and deactivation as a birth-death process with birth rate Ω₊(A) and death rate Ω₋(A), where A is the number of active neurons. The time evolution of the probability distribution P(A, t) is then described by the master equation

$$\frac{d}{dt}P(A,t)={\Omega }_{+}(A-1)\,P(A-1,t)$$

(10)

$$\qquad\qquad\quad+{\Omega }_{-}(A+1)\,P(A+1,t)$$

(11)

$$\qquad\qquad\quad\quad-\left({\Omega }_{+}(A)+{\Omega }_{-}(A)\right)\,P(A,t).$$

(12)

Using a Kramers–Moyal expansion up to second order⁵³, we obtain the Fokker–Planck equation (see Supplementary Note 1)

$$\frac{d}{dt}P(A,t)=-\frac{d}{dA}\left[f(A)P(A,t)\right]+\frac{1}{2}\frac{{d}^{2}}{d{A}^{2}}\left[g(A)P(A,t)\right],$$

with a “drift” term f(A) = Ω₊(A) − Ω₋(A) and a “diffusion” term g(A) = Ω₊(A) + Ω₋(A). The solution of the stationary Fokker–Planck equation, $\frac{d}{dt}P(A,t)=0$, is

$$P(A)\propto \frac{1}{g(A)}\exp \left\{2{\int }_{\!\!\!\!0}^{A}\frac{f(x)}{g(x)}dx\right\},$$

(13)

which can be solved numerically once birth and death rates are specified.

To specify the birth and death rates, we assume that inactive neurons can create activity by becoming active, while active neurons destroy activity by becoming inactive. If p^a is the probability to activate any neuron in the next time step and there are A out of N neurons currently active, then we find

$$\begin{array}{rcl}{\Omega }_{+}(A) & = & (N-A)\,{p}^{{{\rm{a}}}},\\ {\Omega }_{-}(A) & = & A\,(1-{p}^{{{\rm{a}}}}).\end{array}$$

(14)

Since the activation probability depends on whether the neuron receives external input or not, we need to distinguish between those neurons that receive input, Nⁱⁿ, and those that can only be activated recurrently, N^rest. While for the latter, we can identify p^a = p^rec, we need to account for coalescence in the former case and obtain p^a = 1 − (1 − p^rec)(1 − p^ext).

To obtain an expression for p^rec, we assume a mean-field setting where each connected neuron is described by its mean activity. Then Eq. (6) yields the probability $\overline{{p}^{{{\rm{rec}}}}}({A}^{{{\rm{in}}}},{A}^{{{\rm{rest}}}})=\lambda \left[{A}^{{{\rm{in}}}}+{A}^{{{\rm{rest}}}}\right]/N$, which depends on the activity in both subsets and thereby couples their Fokker–Planck equations. To solve our Fokker–Planck equations for Nⁱⁿ and N^rest, we decouple this probability by replacing one variable via its mean-field equation as a function of the other. Specifically, we start from Eq. (8) to rewrite ${A}^{{{\rm{in}}}}=\mu \frac{N{p}^{{{\rm{ext}}}}+\lambda (1-{p}^{{{\rm{ext}}}}){A}^{{{\rm{rest}}}}}{1-\mu \lambda (1-{p}^{{{\rm{ext}}}})}$ and get

$${p}^{{{\rm{rec}}}}({A}^{{{\rm{rest}}}})=\lambda \frac{{A}^{{{\rm{rest}}}}/N+\mu {p}^{{{\rm{ext}}}}}{1-\mu \lambda (1-{p}^{{{\rm{ext}}}})}.$$

(15)

Similarly, we start from Eq. (7) to rewrite ${A}^{{{\rm{rest}}}}=\frac{(1-\mu )\lambda {A}^{{{\rm{in}}}}}{1-(1-\mu )\lambda }$ and get

$${p}^{{{\rm{rec}}}}({A}^{{{\rm{in}}}})=\lambda \frac{{A}^{{{\rm{in}}}}/N}{1-(1-\mu \lambda )}.$$

(16)

We can then independently solve the Fokker–Planck equations for P(A^rest) with

$${\Omega }_{+}({A}^{{{\rm{rest}}}})=(N-{A}^{{{\rm{rest}}}})\,\lambda \frac{{A}^{{{\rm{rest}}}}/N+\mu {p}^{{{\rm{ext}}}}}{1-\mu \lambda (1-{p}^{{{\rm{ext}}}})},$$

(17)

$${\Omega }_{-}({A}^{{{\rm{rest}}}})={A}^{{{\rm{rest}}}}\,\left(1-\lambda \frac{{A}^{{{\rm{rest}}}}/N+\mu {p}^{{{\rm{ext}}}}}{1-\mu \lambda (1-{p}^{{{\rm{ext}}}})}\right),$$

(18)

as well as P(Aⁱⁿ) with

$${\Omega }_{+}({A}^{{{\rm{in}}}})=(N-{A}^{{{\rm{in}}}})\,\lambda \frac{{A}^{{{\rm{in}}}}/N}{1-(1-\mu \lambda )},$$

(19)

$${\Omega }_{-}({A}^{{{\rm{in}}}})={A}^{{{\rm{in}}}}\,\left(1-\lambda \frac{{A}^{{{\rm{in}}}}/N}{1-(1-\mu \lambda )}\right).$$

(20)

The solution for the total network activity A = Aⁱⁿ + A^rest is obtained by the convolution P(A) = P(Aⁱⁿ)*P(A^rest), cf. Fig. 3c.

Data availability

The processed data supporting the findings of this study are available on GitHub at sahelazizpour/Finite-Observation-Dynamic-Range.

Code availability

The simulation code, analysis pipeline, results, and scripts to produce the figures that support the findings of this study are available from GitHub at sahelazizpour/Finite-Observation-Dynamic-Range.

References

Muñoz, M. A. Colloquium: criticality and dynamical scaling in living systems. Rev. Mod. Phys. 90, 031001 (2018).
Article ADS MathSciNet Google Scholar
Cavagna, A. et al. Scale-free correlations in starling flocks. Proc. Natl. Acad. Sci. USA 107, 11865 (2010).
Article ADS Google Scholar
Ridden, S. J., Chang, H. H., Zygalakis, K. C. & MacArthur, B. D. Entropy, ergodicity, and stem cell multipotency. Phys. Rev. Lett. 115, 208103 (2015).
Article ADS Google Scholar
Beggs, J. M. The criticality hypothesis: how local cortical networks might optimize information processing. Philos. Trans. R. Soc. Lond. Math. Phys. Eng. Sci. 366, 329 (2008).
ADS MathSciNet Google Scholar
Priesemann, V., Munk, M. H. & Wibral, M. Subsampling effects in neuronal avalanche distributions recorded in vivo. BMC Neurosci. 10, 40 (2009).
Article Google Scholar
Palva, J. M. et al. Neuronal long-range temporal correlations and avalanche dynamics are correlated with behavioral scaling laws. Proc. Natl. Acad. Sci. USA 110, 3585 (2013).
Article ADS Google Scholar
Wilting, J. & Priesemann, V. 25 years of criticality in neuroscience—established results, open controversies, novel concepts. Curr. Opin. Neurobiol. 58, 105 (2019).
Article Google Scholar
Fontenele, A. J., Sooter, J. S., Norman, V. K., Gautam, S. H. & Shew, W. L. Low-dimensional criticality embedded in high-dimensional awake brain dynamics. Sci. Adv. 10, eadj9303 (2024).
Article Google Scholar
Henkel, M., Hinrichsen, H. & Lübeck, S. Non-Equilibrium Phase Transitions Volume 1: Absorbing Phase Transitions (Springer Science & Business Media, 2008)
Täuber, U. C. Critical Dynamics: A Field Theory Approach to Equilibrium and Non-Equilibrium Scaling Behavior (Cambridge University Press, 2014)
Kinouchi, O. & Copelli, M. Optimal dynamical range of excitable networks at criticality. Nat. Phys. 2, 348 (2006).
Article Google Scholar
Boedecker, J., Obst, O., Lizier, J. T., Mayer, N. M. & Asada, M. Information processing in echo state networks at the edge of chaos. Theory Biosci. 131, 205 (2012).
Article Google Scholar
Barnett, L., Lizier, J. T., Harré, M., Seth, A. K. & Bossomaier, T. Information flow in a kinetic ising model peaks in the disordered phase. Phys. Rev. Lett. 111, 177203 (2013).
Article ADS Google Scholar
Meijers, M., Ito, S. & ten Wolde, P. R. Behavior of information flow near criticality. Phys. Rev. E 103, L010102 (2021).
Article ADS Google Scholar
Morales, G. B. & Muñoz, M. A. Optimal input representation in neural systems at the edge of chaos. Biology 10, 702 (2021).
Article Google Scholar
Yang, Z., Liang, J. & Zhou, C. Critical avalanches in excitation-inhibition balanced networks reconcile response reliability with sensitivity for optimal neural representation. Phys. Rev. Lett. 134, 028401 (2025).
Article ADS Google Scholar
Nykter, M. et al. Critical networks exhibit maximal information diversity in structure-dynamics relationships. Phys. Rev. Lett. 100, 058702 (2008).
Article ADS Google Scholar
Dakos, V. et al. Slowing down as an early warning signal for abrupt climate change. Proc. Natl. Acad. Sci. USA 105, 14308 (2008).
Article ADS Google Scholar
Maturana, M. I. et al. Critical slowing down as a biomarker for seizure susceptibility. Nat. Commun. 11, 2172 (2020).
Article ADS Google Scholar
Tapinova, O. et al. Integrated Ising model with global inhibition for decision-making. Proc. Natl. Acad. Sci. USA 122, e2423557122 (2025).
Article Google Scholar
Wilting, J. et al. Operating in a reverberating regime enables rapid tuning of network states to task requirements. Front. Syst. Neurosci. 12, 55 (2018).
Article Google Scholar
Poel, W. et al. Subcritical escape waves in schooling fish. Sci. Adv. 8, eabm6385 (2022).
Article Google Scholar
Cramer, B. et al. Control of criticality and computation in spiking neuromorphic networks with plasticity. Nat. Commun. 11, 2853 (2020).
Article ADS Google Scholar
Khajehabdollahi, S. et al. Emergent mechanisms for long timescales depend on training curriculum and affect performance in memory tasks. in The Twelfth International Conference on Learning Representations (ICLR, 2024)
Britten, K., Shadlen, M., Newsome, W. & Movshon, J. The analysis of visual motion: a comparison of neuronal and psychophysical performance. J. Neurosci. 12, 4745 (1992).
Article Google Scholar
Wachowiak, M. & Cohen, L. B. Representation of odorants by receptor neuron input to the mouse olfactory bulb. Neuron 32, 723 (2001).
Article Google Scholar
Evans, E. F. The dynamic range problem: place and time coding at the level of cochlear nerve and nucleus. in Neuronal Mechanisms of Hearing (eds Syka, J. & Aitkin, L.) 69–85 (Springer, 1981).
Dean, I., Harper, N. S. & McAlpine, D. Neural population coding of sound level adapts to stimulus statistics. Nat. Neurosci. 8, 1684 (2005).
Article Google Scholar
Shew, W. L., Yang, H., Petermann, T., Roy, R. & Plenz, D. Neuronal avalanches imply maximum dynamic range in cortical networks at criticality. J. Neurosci. 29, 15595 (2009).
Article Google Scholar
Gautam, S. H., Hoang, T. T., McClanahan, K., Grady, S. K. & Shew, W. L. Maximizing sensory dynamic range by tuning the cortical state to criticality. PLoS Comput. Biol. 11, e1004576 (2015).
Article ADS Google Scholar
Gollo, L. L. Coexistence of critical sensitivity and subcritical specificity can yield optimal population coding. J. R. Soc. Interface 14, 20170207 (2017).
Article Google Scholar
Zierenberg, J., Wilting, J., Priesemann, V. & Levina, A. Tailored ensembles of neural networks optimize sensitivity to stimulus statistics. Phys. Rev. Res. 2, 013115 (2020).
Article Google Scholar
Galera, E. F. & Kinouchi, O. Physics of psychophysics: large dynamic range in critical square lattices of spiking neurons. Phys. Rev. Res. 2, 033057 (2020).
Article Google Scholar
Murray, J. D. et al. A hierarchy of intrinsic timescales across primate cortex. Nat. Neurosci. 17, 1661 (2014).
Article Google Scholar
Rudelt, L. et al. Signatures of hierarchical temporal processing in the mouse visual system. PLOS Comput. Biol. 20, e1012355 (2024).
Article Google Scholar
Shi, Y.-L., Zeraati, R., Laboratory, I. B., Levina, A. & Engel, T. A. Brain-wide organization of intrinsic timescales at single-neuron resolution. Preprint at https://doi.org/10.1101/2025.08.30.673281 (2025).
Vogelstein, J. T. et al. Spike inference from calcium imaging using sequential Monte Carlo methods. Biophys. J. 97, 636 (2009).
Article ADS Google Scholar
Larremore, D. B., Shew, W. L. & Restrepo, J. G. Predicting criticality and dynamic range in complex networks: effects of topology. Phys. Rev. Lett. 106, 058101 (2011).
Article ADS Google Scholar
Berens, P., Gerwinn, S., Ecker, A. & Bethge, M., Neurometric function analysis of population codes. in Advances in Neural Information Processing Systems, Vol. 22 (eds Bengio, Y., Schuurmans, D., Lafferty, J., Williams, C. & Culotta, A.) (Curran Associates, Inc., 2009)
Toyoizumi, T. & Abbott, L. F. Beyond the edge of chaos: amplification and temporal integration by recurrent networks in the chaotic regime. Phys. Rev. E 84, 051908 (2011).
Article ADS Google Scholar
Dahmen, D., Grün, S., Diesmann, M. & Helias, M. Second type of criticality in the brain uncovers rich multiple-neuron dynamics. Proc. Natl. Acad. Sci. USA 116, 13051 (2019).
Article ADS Google Scholar
Uchida, N., Kepecs, A. & Mainen, Z. F. Seeing at a glance, smelling in a whiff: rapid forms of perceptual decision making. Nat. Rev. Neurosci. 7, 485 (2006).
Article Google Scholar
Wilting, J. & Priesemann, V. Inferring collective dynamical states from widely unobserved systems. Nat. Commun. 9, 2325 (2018).
Article ADS Google Scholar
Zeraati, R. et al. Intrinsic timescales in the visual cortex change with selective attention and reflect spatial connectivity. Nat. Commun. 14, 1858 (2023).
Article Google Scholar
Tanaka, G. et al. Recent advances in physical reservoir computing: a review. Neural Netw. 115, 100 (2019).
Article ADS Google Scholar
Jaeger, H. The “Echo State” Approach to Analysing and Training Recurrent Neural Networks—with an Erratum Note. Technical Report 148 (German National Research Institute for Computer Science, 2001)
Wang, X. & Cichos, F. Harnessing synthetic active particles for physical reservoir computing. Nat. Commun. 15, 774 (2024).
Article ADS Google Scholar
Dahmen, D. et al. Strong and localized recurrence controls dimensionality of neural activity across brain areas. Preprint at https://doi.org/10.1101/2020.11.02.365072 (2022).
Khajehabdollahi, S., Prosi, J., Giannakakis, E., Martius, G. & Levina, A. When to be critical? Performance and evolvability in different regimes of neural ising agents. Artif. Life 28, 458 (2022).
Article Google Scholar
Hornik, K., Stinchcombe, M. & White, H. Multilayer feedforward networks are universal approximators. Neural Netw. 2, 359 (1989).
Article Google Scholar
Dornheim, T. et al. The static local field correction of the warm dense electron gas: an ab initio path integral Monte Carlo study and machine learning representation. J. Chem. Phys. 151, 194104 (2019).
Article ADS Google Scholar
Zierenberg, J., Wilting, J., Priesemann, V. & Levina, A. Description of spreading dynamics by microscopic network models and macroscopic branching processes can differ due to coalescence. Phys. Rev. E 101, 022301 (2020).
Article ADS MathSciNet Google Scholar
Risken, H. and Frank, T. The Fokker–Planck Equation: Methods of Solution and Applications (Springer Science & Business Media, 2012)

Download references

Acknowledgements

J.Z. was supported by the Joachim Herz Stiftung. J.Z. and V.P were funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation)—Project-ID 454648639 - SFB 1528 “Cognition of Interaction”. A.L was supported by the Sofja Kovalevskaja Award from the Alexander von Humboldt Foundation. All authors gratefully acknowledge support from the Max Planck Society.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Sahel Azizpour
Present address: Donders Institute for Brain, Cognition, and Behaviour, Radboud University, Nijmegen, Netherlands
These authors contributed equally: Johannes Zierenberg, Anna Levina.

Authors and Affiliations

Department of Computer Science, University of Tübingen, Tübingen, Germany
Sahel Azizpour & Anna Levina
Max Planck Institute for Biological Cybernetics, Tübingen, Germany
Sahel Azizpour & Anna Levina
Max Planck Institute for Dynamics and Self-Organization, Göttingen, Germany
Viola Priesemann & Johannes Zierenberg
Institute for the Dynamics of Complex Systems, University of Göttingen, Göttingen, Germany
Viola Priesemann & Johannes Zierenberg

Authors

Sahel Azizpour
View author publications
Search author on:PubMed Google Scholar
Viola Priesemann
View author publications
Search author on:PubMed Google Scholar
Johannes Zierenberg
View author publications
Search author on:PubMed Google Scholar
Anna Levina
View author publications
Search author on:PubMed Google Scholar

Contributions

J.Z. and A.L. designed the project. S.A. and J.Z. wrote the code and performed the simulations. A.L. developed the measure, S.A and J.Z. analyzed the data, and S.A., A.L., and J.Z. calculated the mean-field solution. V.P., J.Z, and A.L discussed the results and wrote the manuscript. All authors contributed to reviewing the manuscript.

Corresponding authors

Correspondence to Johannes Zierenberg or Anna Levina.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Communications Physics thanks John M. Beggs and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information (download PDF )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Azizpour, S., Priesemann, V., Zierenberg, J. et al. Finite integration time can shift optimal sensitivity away from criticality. Commun Phys 9, 119 (2026). https://doi.org/10.1038/s42005-026-02584-w

Download citation

Received: 06 May 2025
Accepted: 05 March 2026
Published: 28 March 2026
Version of record: 02 April 2026
DOI: https://doi.org/10.1038/s42005-026-02584-w