Introduction

To survive in an ever-changing environment, animals must flexibly adapt their behavior based on previously encoded and novel information. This adaptation is reflected in the information processing of neural networks underlying context-dependent behavior. For instance, when walking down an unfamiliar staircase in a well-lit basement, your brain may rely almost entirely on feedforward (bottom-up) sensory input (Fig. 1A, left), gradually forming a model of the step sizes. As the step sizes become more predictable, the brain can increasingly rely on this model. However, if the step sizes suddenly change, it will need to revert to the sensory input for guidance. Later, when walking down the same staircase and the lights suddenly turn off, your brain may rely entirely on feedback (top-down) signals derived from the staircase model you previously formed (Fig. 1A, middle), as the sensory information becomes too noisy to trust. But how do neural networks switch between a feedforward-dominated and a feedback-dominated processing mode in an ever-changing environment? For instance, if you hike down an unexplored mountain in very foggy conditions, your brain receives unreliable visual information. In addition, it can only draw on a shaky prediction about what to expect (Fig. 1A, right).

Fig. 1: Neural network model to track both the uncertainty of sensory inputs and predictions.
figure 1

A Example illustration for context-dependent integration of information. Left: when walking down an unfamiliar staircase that is visible, the brain might rely solely on external sensory information. Middle: when walking down the same stairs without visual information, the brain might rely on predictions formed by previous experience. Right: when climbing down an unexplored mountain in foggy conditions, the brain might need to integrate sensory information and predictions simultaneously. B Top: illustration of a prediction-error (PE) circuit with both negative and positive PE (nPE/pPE) neurons that receive inhibition from three different inhibitory interneuron types: parvalbumin-expressing (PV), somatostatin-expressing (SOM), and vasoactive intestinal peptide-expressing (VIP) interneurons. Local excitatory connections are not shown for clarity. Bottom: Responses of an nPE and pPE neuron. The nPE neuron only increases its activity relative to a baseline when the sensory input is weaker than predicted, while the pPE neuron only increases its activity relative to a baseline when the sensory input is stronger than predicted. C Illustration of the network model that estimates the mean and variance of the external sensory stimuli. The core of this network model is the PE circuit shown in (B). The lower-level V neuron encodes the variance, while the lower-level M neuron encodes the mean of the sensory input. D Same as in (C) but the feedforward input is the activity of the lower-level M neuron.

A common hypothesis is that the brain weights different inputs according to their reliabilities. A prominent example of this hypothesis is Bayesian multisensory integration (for example, ref. 1). According to this theory, neural networks represent information from multiple modalities by a linear combination of the uncertainty-weighted single-modality estimates. Multisensory integration is supported by several observations showing that animals can combine information from different modalities in a fashion that minimizes the variance of the final estimate2,3,4,5,6,7,8. Here, we propose that the same concepts could be employed for the weighting of sensory inputs and predictions thereof4,9. A central point in the weighting of inputs is the estimation of their variances as a measure of uncertainty. However, how the variance of both the sensory input and the prediction can be computed at the circuit level has not yet been resolved.

We hypothesized that prediction-error (PE) neurons provide the basis for the neural computation of variances. PEs are an integral part of the theory of predictive processing, which states that the brain constantly compares incoming sensory information with predictions. If those predictions are wrong, the resulting PEs allow the network to revise the model of the world, thereby ensuring that the predictions become more accurate10. Experimental evidence suggests that these PEs may be represented in the activity of distinct groups of neurons, termed PE neurons11,12,13,14. Moreover, these neurons may come in two types when excitatory neurons exhibit near-zero spontaneous firing rates10,15: negative PE (nPE) neurons only increase their activity when the sensory input is weaker than the prediction, while positive PE (pPE) neurons only increase their activity when the sensory input is stronger than the prediction. Indeed, it has been shown that excitatory neurons in rodent primary sensory areas can encode negative or positive PEs14,16,17,18.
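At steady state, these two response patterns can be summarized as rectified differences between sensory input and prediction. The following minimal sketch is an illustration only, not the circuit model studied below; the function names `npe` and `ppe` are ours:

```python
def npe(sensory, prediction):
    """Negative PE neuron: above baseline only when the input is weaker than predicted."""
    return max(0.0, prediction - sensory)

def ppe(sensory, prediction):
    """Positive PE neuron: above baseline only when the input is stronger than predicted."""
    return max(0.0, sensory - prediction)

# Fully predicted stimulus: both neurons stay at their zero baseline.
assert npe(1.0, 1.0) == 0.0 and ppe(1.0, 1.0) == 0.0
# Under-predicted stimulus (sensory > prediction): only the pPE neuron responds.
assert npe(2.0, 1.0) == 0.0 and ppe(2.0, 1.0) == 1.0
```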

Here, we show that the unique response patterns of nPE and pPE neurons may provide the backbone for computing both the mean and the variance of sensory stimuli. Furthermore, we suggest a network model with a hierarchy of PE circuits to estimate the variance of the prediction, in addition to the variance of the sensory inputs. We show that in line with the ideas of multisensory integration, predictions are weighted more strongly than the sensory stimuli when the environment is stable (that is, predictable) and the sensory inputs are noisy. Moreover, we find that predictions are taken into account more at the beginning of a new trial than at the end, especially when the new sensory stimulus is reliable. In addition, we unravel the mechanisms underlying a neuromodulator-induced shift in the weighting of sensory inputs and predictions. In our model, these neuromodulators activate groups of inhibitory neurons such as parvalbumin-expressing (PV), somatostatin-expressing (SOM), and vasoactive intestinal peptide-expressing (VIP) interneurons19,20,21,22,23,24. These interneurons have been suggested to establish a multi-pathway balance of excitation and inhibition that is the basis for nPE and pPE neurons25,26. By perturbing this balance, the PE neurons change their baseline firing rate and gain, leading to a biased variance estimation. Finally, we show that this weighting can be understood as a neural manifestation of the contraction bias, that is, the magnitude of the represented sensory input is biased towards the mean of the past stimuli experienced27,28,29,30,31,32.

Results

A circuit model for uncertainty estimation

We hypothesize that the distinct response patterns of negative and positive prediction-error (nPE/pPE) neurons can act as a backbone for estimating the mean and the variance of sensory stimuli. An nPE neuron only increases its activity relative to a baseline when the sensory input is weaker than predicted, while a pPE neuron only increases its activity relative to a baseline when the sensory input is stronger than predicted. Moreover, both nPE and pPE neurons remain at their baseline activities when the sensory input is fully predicted (Fig. 1B).

To test our hypothesis, we study a rate-based mean-field network: the core network contains two excitatory neurons, two inhibitory PV interneurons, one inhibitory SOM interneuron, and one inhibitory VIP interneuron (Fig. 1B, also see Supplementary Fig. 1). While the excitatory neurons are simulated as two coupled point compartments to emulate the soma and dendrites of elongated pyramidal cells, respectively, all inhibitory cell types are modeled as point neurons. In line with experimental findings23, we assume that the PV neurons target the somatic compartment, while the SOM neuron targets the dendritic compartment of the excitatory cells. Moreover, the SOM neuron inhibits both the PV and VIP neurons, while the VIP neuron inhibits both the PV and the SOM neurons23 (Fig. 1B, also see Supplementary Fig. 1). In addition, all neurons receive local connections from the excitatory neurons (Supplementary Fig. 1).

We chose the connection strengths in line with our previous work on prediction-error neurons (for example, see refs. 25,26). In that work, we showed that response patterns of excitatory cells resemble those of PE neurons when a number of excitatory (E) and inhibitory (I) pathways onto the pyramidal cells are balanced. This multi-pathway E/I balance results in an E/I balance of the inputs to excitatory neurons when the stimulus is perfectly predicted. Depending on the network connectivity, for some excitatory cells, this input E/I balance was maintained for over-predicted stimuli (sensory input < prediction), but temporarily shifted toward excitation for under-predicted stimuli (sensory input > prediction). In contrast, other excitatory cells exhibited the opposite pattern, with responses for over- and under-predicted stimuli reversed. The former group corresponds to pPE neurons, while the latter represents nPE neurons.

The multi-pathway E/I balance required for PE neurons to emerge was established through the different interneurons. These interneurons provide compartment-specific inhibition to balance the feedforward sensory inputs and the feedback predictions, respectively (for a more detailed discussion on the role of these interneurons in PE circuits, please see Supplementary Discussion). In the present work, we use a PE circuit in which the soma of the excitatory cells, the SOM neuron, and one of the PV neurons receive the feedforward sensory input, while the other cells/compartments receive the prediction thereof. This is in line with experimental work showing that feedback connections hypothesized to carry information about expectations or predictions33,34,35 target the apical dendrites of pyramidal cells34 and interneurons located in superficial layers of the cortex (for example, ref. 23).

We reasoned that if a prediction of a stimulus is the mean of the previously experienced stimuli, it can be modeled through a perfect integrator (here denoted memory neuron) that receives connections from the PE neurons (Fig. 1C). More precisely, following Keller and Mrsic-Flogel10, we assume that the pPE neuron excites the memory neuron, while the nPE neuron inhibits this neuron (for instance, through lateral inhibition, here not modeled explicitly). If the activity of the memory neuron is below the sensory input, the pPE neuron is active while the nPE neuron is silent (Supplementary Fig. 2). Hence, the memory neuron receives more excitation. If the activity of the memory neuron is above the sensory input, the nPE neuron is active while the pPE neuron is silent (Supplementary Fig. 2). As a consequence, the memory neuron receives more inhibition. When the memory neuron is roughly at the mean of the sensory inputs, occasionally being below or above, the effects of the nPE and pPE neurons cancel. Hence, the PE neurons ensure that the memory neuron’s activity does not drift too far from the mean (see Box 1).
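This push-pull arrangement amounts to a stochastic approximation of the stimulus mean. A minimal numerical sketch (our own illustration; the update rate `eta` is an assumed stand-in for the PE-to-memory connection strengths, not a fitted model parameter):

```python
import numpy as np

rng = np.random.default_rng(0)
stimuli = rng.uniform(3.0, 7.0, size=20_000)  # true mean: 5.0

eta = 0.01  # assumed update rate of the memory neuron
m = 0.0     # activity of the memory (M) neuron, a perfect integrator
for s in stimuli:
    ppe = max(0.0, s - m)  # pPE neuron excites the memory neuron
    npe = max(0.0, m - s)  # nPE neuron inhibits it (e.g., via lateral inhibition)
    m += eta * (ppe - npe)

# m drifts toward the stimulus mean and then fluctuates around it
```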

The memory neuron in our network projects back to the PE neurons it receives inputs from. We, therefore, call the input from the memory neuron to all other neurons in the PE circuit feedback input. While we consider the activity of the memory neuron as a prediction of the current sensory input, it could also be interpreted as a prior of the sensory mean at the next time step.

If the prediction equals the mean of the sensory stimulus, the activities of the nPE and pPE neurons encode the deviation from the mean. Thus, the time-averaged squared sum of nPE and pPE neuron activity represents the variance of the feedforward input (provided that the PE neurons are silent without sensory stimulation). We, therefore, simulate a downstream neuron (termed V neuron), modeled as a leaky integrator with a quadratic activation function, that receives excitatory synapses from the PE neurons (see the lower-level subnetwork in Fig. 1C, the higher-level circuit is described later).

Estimating the mean and variance of sensory stimuli with prediction-error neurons

To show that this network can indeed represent the mean and the variance in the respective neurons, we stimulate it with a sequence of step-wise constant inputs drawn from a uniform distribution (Fig. 2A). We, hence, assume that the sensory stimulus varies over time. In line with the distinct response patterns for nPE and pPE neurons, these neurons change only slightly with increasing stimulus mean but increase strongly with input variance (Fig. 2B). In contrast, the three interneurons strongly increase with stimulus mean and only moderately increase with stimulus variance (Fig. 2C). The activity of the memory neuron gradually approaches the mean of the sensory inputs (Fig. 2D, middle), while the activity of the V neuron approaches the variance of those inputs (Fig. 2E, middle). We show that this holds for a wide range of input statistics (Fig. 2D, E, right) and input distributions (Supplementary Fig. 3). Small deviations from the true mean occur mainly for large input variances, while the estimated variance is fairly independent of the input statistics tested. Moreover, using a continuously changing signal instead of a piecewise constant stimulus yields similar results, where small deviations can be attributed to the PE neurons not reaching their steady state (Supplementary Fig. 4).
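The essence of this simulation can be reproduced in miniature. The following toy version (our illustration with assumed integration rates, not the full interneuron circuit) exploits that at most one PE neuron is active at a time, so the squared sum of their activities equals the squared deviation of the stimulus from the prediction:

```python
import numpy as np

rng = np.random.default_rng(1)
stimuli = rng.uniform(3.0, 7.0, size=50_000)  # mean 5.0, variance (7-3)^2/12 ≈ 1.33

eta_m, eta_v = 0.02, 0.005  # assumed integration rates
m = v = 0.0
for s in stimuli:
    ppe = max(0.0, s - m)
    npe = max(0.0, m - s)
    m += eta_m * (ppe - npe)             # M neuron: perfect integrator of the PE drive
    v += eta_v * ((npe + ppe) ** 2 - v)  # V neuron: leaky integrator, quadratic activation

# Once m has converged to the mean, (npe + ppe)^2 is the squared deviation,
# so v settles near the stimulus variance.
```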

Fig. 2: Prediction-error neurons as the basis for estimating mean and variance of sensory stimuli.
figure 2

A Illustration of the inputs with which the network (Fig. 1C) is stimulated. The network is exposed to a sequence of constant stimuli drawn from a uniform distribution. The gray shaded boxes symbolize different values from the distribution. B PE neuron activity hardly changes with stimulus strength (left) but strongly increases with stimulus variability (right). C Interneuron activity strongly changes with stimulus strength (left) but hardly changes with stimulus variability (right). D M neuron correctly encodes the mean of the sensory stimuli. Left: Illustration of the input synapses onto the M neuron. Middle: Activity of the M neuron (dark green line) over time for one example distribution (black star in right panel). Right: Normalized absolute difference between the averaged mean and the activity of the M neuron in the steady state for different parametrizations of the stimulus distribution. E V neuron correctly encodes the variance of the sensory stimuli. Left: Illustration of the input synapses onto the V neuron. Middle: Activity of the V neuron (dark brown line) over time for one example distribution (black star in right panel). Right: Normalized absolute difference between the averaged variance and the activity of the V neuron in the steady state for different parametrizations of the stimulus distribution.

While the results do not strongly depend on the stimulus statistics and distribution, they are affected by the baseline activities of the PE neurons that were assumed to be zero in our network, in line with the low baseline firing rates reported for neurons in primary visual cortex of rodents36,37. When the baseline rate of the nPE neuron is increased, the memory neuron underestimates the mean of the sensory input (Supplementary Fig. 5A). In contrast, when the baseline rate of the pPE neuron is increased, the memory neuron overestimates the mean of the sensory input (Supplementary Fig. 5A). However, increasing the baseline for both PE neurons by the same amount does not affect the estimation of the stimulus mean (Supplementary Fig. 5A). In contrast, a non-zero baseline in any of the PE neurons yields an overestimation of the stimulus variance (Supplementary Fig. 5B). This suggests that inhibitory interneurons must cancel the baseline activity to ensure an unbiased uncertainty estimation in networks with high-baseline PE neurons. While the baseline activity of PE neurons can bias the estimation of mean and variance, other neuron properties and network connection strengths play a less pivotal role (see Supplementary Fig. 6, discussed in the Supplementary Discussion).
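These baseline effects can be reproduced in a toy version of the estimator (our own illustration with assumed update rates; `b_npe` and `b_ppe` model the added PE-neuron baseline activity):

```python
import numpy as np

rng = np.random.default_rng(2)
stimuli = rng.uniform(3.0, 7.0, size=50_000)  # mean 5.0, variance ≈ 1.33

def estimate(stimuli, b_npe=0.0, b_ppe=0.0, eta_m=0.02, eta_v=0.005):
    """Toy mean/variance estimator with PE-neuron baselines added on top."""
    m = v = 0.0
    for s in stimuli:
        npe = max(0.0, m - s) + b_npe
        ppe = max(0.0, s - m) + b_ppe
        m += eta_m * (ppe - npe)
        v += eta_v * ((npe + ppe) ** 2 - v)
    return m, v

m0, v0 = estimate(stimuli)                # zero baselines: unbiased estimates
m_under, _ = estimate(stimuli, b_npe=0.5) # nPE baseline only: mean underestimated
m_eq, v_eq = estimate(stimuli, b_npe=0.5, b_ppe=0.5)
# Equal baselines cancel in the M-neuron drive (m_eq ≈ m0),
# but the quadratic V neuron still overestimates the variance (v_eq > v0).
```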

We verified the main results in a heterogeneous network, where each neuron type of the PE circuit was represented by a distinct population of neurons, and the synaptic connection strengths from each PE neuron onto the M and V neurons were different (see SI Methods, Supplementary Fig. 7A). As before, the network can correctly estimate the mean and the variance of the sensory stimuli (Supplementary Fig. 7B). Furthermore, we show that the errors with which the M and V neurons encode the stimulus statistics are independent of uncorrelated modulations of the connection strengths (Supplementary Fig. 7C) and the sparsity of the network (Supplementary Fig. 7E). When all connection strengths are collectively shifted to higher values, the error increases for the variance neuron, while it remains unaffected for the memory neuron.

While our mean-field network was designed to track the mean and the variance of stimuli that vary in time, we reasoned that the same principles apply to stimuli that vary across space. To show that, we simulated a population network that consists of unconnected replicates of the mean-field network described above (Supplementary Fig. 8A). Each mean-field network receives a short, constant input from a different part of the receptive field. If the connection strengths from the PE neurons to the M and V neurons are adjusted accordingly (see Methods), the network correctly estimates the stimulus average and spatial uncertainty (Supplementary Fig. 8B, C).

In summary, nPE and pPE neurons can serve as a basis to estimate the mean and the variance of sensory stimuli which vary over time and space.

Estimating the uncertainty of both the sensory input and the prediction requires a hierarchy of PE circuits

Following the ideas of Bayesian multisensory integration, the weighting of sensory stimuli and predictions would require knowledge about their uncertainties. As we have shown in the previous section, the variance of the sensory stimulus can be estimated using PE neurons. We hypothesize that the same principles apply to computing the variance of the prediction. To show this, we augment the network with a higher PE circuit that receives feedforward synapses from the memory (M) neuron of the lower PE circuit (Fig. 1D, and a more detailed network diagram in Supplementary Fig. 1). Both subnetworks are identical except for the M neuron in the higher PE circuit which is modeled with slower dynamics than the one in the lower PE circuit.

To evaluate the network’s ability to accurately estimate variances, we conducted tests using a sequence of inputs varying on two different timescales. Specifically, in each trial, the network receives a stimulus consisting of N_in constant values. Each value is drawn from a normal distribution and presented over N_step consecutive time steps. The variance of this normal distribution indicates the level of stimulus noise. Additionally, to simulate changes in the environment, the stimulus mean is re-drawn from a uniform distribution (Fig. 3A) after N_in × N_step time steps (that is, in each trial). This setup aligns with a change detection task and has been previously studied (for example, see refs. 38,39).
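The stimulation protocol can be sketched as follows (our illustration; the parameter names are ours, and the concrete values are assumptions):

```python
import numpy as np

rng = np.random.default_rng(3)

def make_stimuli(n_trials, n_in, n_step, sigma_stim, mean_lo, mean_hi):
    """Per trial: draw a mean from a uniform distribution (environmental change),
    then n_in values from N(mean, sigma_stim), each held for n_step time steps."""
    trials = []
    for _ in range(n_trials):
        mu = rng.uniform(mean_lo, mean_hi)         # trial-to-trial variability
        values = rng.normal(mu, sigma_stim, n_in)  # stimulus noise within a trial
        trials.append(np.repeat(values, n_step))   # hold each value n_step steps
    return np.concatenate(trials)

seq = make_stimuli(n_trials=100, n_in=10, n_step=5,
                   sigma_stim=0.5, mean_lo=3.0, mean_hi=7.0)
assert seq.shape == (100 * 10 * 5,)
```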

Fig. 3: Estimating the uncertainty of both the sensory input and the prediction.
figure 3

A Illustration of the stimulation protocol. The network is exposed to a sequence of stimuli (one stimulus per trial). To account for stimulus variability, each stimulus is represented by 10 stimulus values drawn from a normal distribution. To account for the volatility of the environment, in each trial, the stimulus mean is drawn from a uniform distribution (denoted trial-to-trial variability). B Illustration of how the weighted output is calculated. The sensory weight α lies between zero (system relies perfectly on prediction) and one (system relies solely on the sensory input). C Limit case example in which the stimulus variability is zero but the trial-to-trial variability is high. Left: Illustration of the stimulation protocol. Middle: Weighted output follows closely the sensory stimuli. Right: Sensory weight (function of the variances, see B) close to 1, indicating that the network ignores the prediction. Input statistics shown in (E). D Limit case example in which the stimulus variability is high but the trial-to-trial variability is zero. Left: Illustration of the stimulation protocol. Middle: Weighted output pushed towards the mean of the sensory stimuli. Right: Sensory weight close to zero, indicating that the network ignores the sensory stimuli. Input statistics shown in (E). E Sensory weight for different input statistics. Predictions are weighted more strongly when the stimulus variability is larger than the trial-to-trial variability. F Sensory weight, averaged over many trials, for two different trial durations (circles: trial duration = 5s, squares: trial duration = 1s). Gray shading denotes the SEM. Predictions are weighted more strongly at the beginning of a new trial.

Following the formalism of multisensory integration (for example, see ref. 40), we assume that the network’s output is a weighted sum of the feedforward sensory input and the feedback prediction. The weights assigned to each input stream are functions of the uncertainties, that is, the activities of the V neurons. The sensory weight captures how much the network relies on the sensory input (Fig. 3B). For the sake of simplicity, we assume that the weighted output is encoded in a separate class of neurons not explicitly modeled here and only compute the sensory weight arithmetically.
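Under this formalism, the sensory weight is the standard reliability (inverse-variance) weighting, computed arithmetically from the two V-neuron activities. A sketch (variable names are ours; `eps` guards against division by zero):

```python
def sensory_weight(v_sensory, v_prediction, eps=1e-12):
    """alpha -> 1: rely on the sensory input; alpha -> 0: rely on the prediction."""
    return v_prediction / (v_sensory + v_prediction + eps)

def weighted_output(stimulus, prediction, v_sensory, v_prediction):
    alpha = sensory_weight(v_sensory, v_prediction)
    return alpha * stimulus + (1.0 - alpha) * prediction

# Reliable stimulus, uncertain prediction: the output tracks the stimulus.
assert sensory_weight(0.1, 10.0) > 0.95
# Noisy stimulus, stable environment: the output is pulled toward the prediction.
assert sensory_weight(10.0, 0.1) < 0.05
```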

To test our network, we first consider two limit cases. In the first limit case, we show a low-variance stimulus that differs in each trial (low stimulus uncertainty, high trial-to-trial uncertainty, see Fig. 3C, left). According to the theory, the network should follow the sensory inputs closely and ignore the predictions. When we arithmetically calculate the weighted output (Fig. 3C, middle) and the sensory weight (Fig. 3C, right), the network shows a clear preference for the sensory input, indicating that the network estimated the uncertainty of the sensory input to be lower than that of the prediction. In the second limit case, we show a high-variance stimulus, the mean of which does not change from trial to trial (high stimulus uncertainty, low trial-to-trial uncertainty, see Fig. 3D, left). According to the theory, the network should downscale the sensory feedforward input and weight the prediction more strongly. As expected, the weighted output of the network shows a clear tendency to the mean of the stimuli (Fig. 3D, middle), also reflected in the low sensory weight (Fig. 3D, right). Hence, the network estimated the uncertainty of the sensory input to be higher than that of the prediction.

To validate the network responses fully, we systematically varied the trial and stimulus variability independently. If both variances are similar, the sensory weight approaches 0.5, reflecting the equal contribution of the sensory input and the prediction to the weighted output. Only if both variances are zero does the network represent the sensory input perfectly. In line with the limit case examples above, if the stimulus variance is larger than the trial variance, the network weights the prediction more strongly than the sensory input (Fig. 3E). Because the network dynamically estimates the sensory and prediction uncertainty, the sensory weight changes when the input statistics shift (Supplementary Fig. 9).

Closely inspecting the dynamics of our network, we noticed that the prediction is typically weighted more strongly at the beginning of a new trial than in the steady state. This is particularly pronounced in a sensory-driven input regime (see Fig. 3C), and further confirmed in simulations in which the trial duration was shortened from 5s to 1s (Fig. 3F). This observation highlights that the approach is suboptimal immediately after a change point, as predictions based on previous sensory inputs become incorrect following an environmental change. In such cases, the system should promptly adapt to the new sensory input. Alternative approaches that detect potential change points and allow the system to prioritize sensory input after a change are possible and have been discussed (for example, see ref. 39). However, identifying change points can be challenging, especially when the level of sensory noise varies. While it is common to focus on changes in the environment (i.e., μ), changes in sensory noise levels (i.e., σ) can also occur. In this work, we considered a spectrum of scenarios encompassing both environmental changes and fluctuations in noise levels. Although the weighting strategy used here is less accurate immediately after a change point, it performs well in steady-state conditions (Supplementary Fig. 10A). Furthermore, the reliability-weighted input approach effectively handles scenarios where sensory noise undergoes abrupt changes (Supplementary Fig. 10A). In contrast, simpler methods designed to minimize inaccuracies after a change point may struggle with high-noise scenarios. For example, while approaches based solely on sensory input variance (Supplementary Fig. 10B) or the disparity between the lower and the higher memory neuron (Supplementary Fig. 10C) improve the output estimate in low-noise conditions, they struggle in high-noise conditions.

It has been hypothesized that some symptoms in psychiatric disorders like autism and schizophrenia can be ascribed to a pathological weighting of sensory inputs and predictions9. We thus wondered which network properties might bias the estimation of the variances, and, consequently, the weighting of different input streams. We identified the effective timescales at which the memory neurons incorporate new information as a decisive factor in the integration of inputs. To show this, we varied the weights from the PE neurons onto the lower-level memory neurons. If the weights are too small (the memory neuron updates too slowly), the system relies too much on feedback predictions. In contrast, if the weights are too large (the memory neuron updates too fast), the system relies too much on the feedforward sensory information (Supplementary Fig. 11A). However, scaling the respective weights onto both the lower-level and the higher-level memory neuron by the same amount only has a minor effect on the sensory weight (Supplementary Fig. 6B). While the speeds at which the activity of the memory neurons evolve influence the weighting of inputs, the precise activation function of the variance neurons is less pivotal. When we replaced the quadratic activation function with a linear, rectified function, the V neurons did not encode the variance but the average absolute deviation of the sensory stimuli. However, the sensory weight is only slightly shifted to larger values for low trial/high stimulus variability (Supplementary Fig. 11B).
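The effect of swapping the quadratic activation for a rectified-linear one can be checked directly: with the M neuron at the stimulus mean, the quadratic readout averages squared deviations (the variance), while the linear one averages absolute deviations. A sketch with assumed stimulus statistics:

```python
import numpy as np

rng = np.random.default_rng(4)
stimuli = rng.uniform(3.0, 7.0, size=50_000)
m = stimuli.mean()  # assume the M neuron has converged to the stimulus mean

npe = np.maximum(0.0, m - stimuli)  # at most one PE activity is nonzero per sample
ppe = np.maximum(0.0, stimuli - m)

v_quadratic = np.mean((npe + ppe) ** 2)  # variance: (7-3)^2/12 ≈ 1.33
v_rectified = np.mean(npe + ppe)         # mean absolute deviation: (7-3)/4 = 1.0
```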

We have shown that the baseline activity of PE neurons can affect the ability of the M and V neurons to encode the mean and the variance of the feedforward input, respectively. In the full network, these biases manifest in a sensory weight that is slightly pushed towards 0.5 (Supplementary Fig. 5C). That is, in an initially sensory-driven regime, the dependence on the sensory inputs is slightly weakened. In contrast, in an initially prediction-driven regime, the dependence on the sensory inputs is slightly strengthened. Similarly, while other properties like the connectivity between the PE neurons and the M/V neurons can affect the estimation of the mean and the variance, the sensory weight is only slightly affected if the changes occur in both the lower- and the higher-level circuit (Supplementary Fig. 6).

In summary, we show that the variances of both the sensory inputs and predictions thereof can be dynamically computed in networks comprising a lower and higher PE circuit. In such a network, predictions are given more weight at the beginning of a new stimulus, and if the sensory inputs are noisy while the environment is stable.

Biasing the weighting of sensory inputs and predictions by neuromodulators

The brain’s flexibility and adaptability are supported by a plethora of neuromodulators that influence the activity of neurons in a variety of ways41. A prominent target of neuromodulatory inputs is inhibitory neurons42,43,44. Moreover, distinct interneuron types are differently (in-)activated by those neuromodulators43,44,45. Given that the interneurons in our network play a crucial role in establishing the PE neurons that, in turn, are the backbone for computing the uncertainties, we wondered if and how the weighting of sensory inputs and predictions may be biased when neuromodulators activate distinct interneuron types.

To this end, we modeled the presence of a neuromodulator by injecting an additional excitatory input into an interneuron type (while a neuromodulator can also suppress neuronal activity, we focus on the more common excitatory effects that have been described). Given that the interneurons are embedded in a network and establish an E/I balance in the PE neurons through multiple pathways, we reasoned that not only the interneuron type but also the connections it receives/makes determine the effect of a neuromodulator. Hence, testing only one instantiation of our network may yield effects that do not generalize to other parameterizations of the network. To avoid that, we tested three different mean-field networks derived in ref. 26. These networks differ in the distribution of sensory inputs and predictions onto the interneurons, and, hence, the underlying connectivity. They cover a broad range of possible PE circuits. The only commonality across those networks is that they exhibit an E/I balance of excitatory and inhibitory pathways onto the PE neurons.

Across the different mean-field networks tested, increasing the activity of PV neurons biases the network’s output toward predictions (Fig. 4A, B, light blue line). In contrast, increasing VIP activity forces the networks to weigh both inputs more equally. As a consequence, predictions are overrated in a sensory-driven input regime (Fig. 4A, green line), and sensory inputs are overrated in a prediction-driven input regime (Fig. 4B, green line). The effect of increasing SOM neuron activity, while qualitatively similar to that of increasing VIP neuron activity, depends on the mean-field network tested and the strength of activation (Fig. 4A, B, dark blue line).

Fig. 4: Neuromodulator-based shifts in the weighting of sensory inputs and predictions.
figure 4

A, B Neuromodulators acting on the interneurons can shift the weighting of sensory inputs and predictions. The changes depend on the type of interneuron/s targeted, the modulation strength (here simulated through an additional excitatory input), and the network’s connectivity. Three mean-field networks were tested that differed in terms of the inputs to the SOM and VIP neuron, and, hence, the underlying PE circuit connectivity (see Methods). The first mean-field network is the one used in the other figures: The SOM neuron receives the feedforward input (depicted by an orange eye), while the VIP neuron receives the prediction thereof (depicted by an eye in a crystal ball). In the second MFN, the inputs onto the SOM and VIP neurons were swapped. In the third MFN, both the SOM and the VIP neuron receive the feedforward input. Two input regimes were tested: a sensory-driven (A) and a prediction-driven (B) regime before modulation. C The take-home messages from the simulation results shown in A and B are summarized. A neuromodulator that acts mainly on the VIP neuron pushes the sensory weight towards 0.5. When SOM and VIP neurons are equally modulated, the sensory weight remains unaffected. A neuromodulator that acts mainly on the PV neurons reduces the impact of the sensory stimuli. D The V neuron activity, and hence the sensory weight, changes as a result of the modulated PE neuron activity. The PE neuron activity, on the other hand, changes as a result of the interneurons being modulated. The interneurons change the baseline (left) and the gain (right) of the PE neurons. Whether an interneuron increases or decreases the estimated variance depends on both factors. To estimate changes in baseline and gain of nPE and pPE neurons, we fitted a linear function to the PE neuron activity for the input range [0, 2.5].

Neuromodulators most likely increase the activity of more than one interneuron type. To account for the co-activation of interneurons, we injected an excitatory input into two interneuron types at the same time and varied the strength with which each interneuron was modulated. If SOM and VIP neurons are equally stimulated, the weighting of sensory inputs and predictions remains largely unaffected (Fig. 4A, B, Supplementary Figs. 15 and 16, dashed beige line), suggesting that the individual effects cancel out. If PV neurons are the major target of a neuromodulator, the network is still biased toward predictions (Supplementary Figs. 15 and 16). While some results depend not only on the interneuron type targeted but also on the connections it makes and receives, as well as on the strength of the neuromodulation, there are consistent effects across the mean-field networks tested (illustrated in Fig. 4C): neuromodulators increasing the PV neuron activity bias the weighting towards predictions. If, however, VIP neurons are the major target of a neuromodulator, the sensory weight is pushed towards 0.5. This effect is reversed when SOM neurons are equally targeted (see Discussion for more details and a comparison with experiments).

What are the network mechanisms underlying these observations? The sensory weight is a function of the activities of the lower and higher variance (V) neurons. Hence, any changes to the sensory weight result from changes to the neurons encoding the variances. In our network, the V neurons only receive excitatory synapses from PE neurons. As a consequence, any changes in the sensory weight upon activation of interneurons must be due to changes in the PE neurons. This suggests that to understand the effect of neuromodulators on the sensory weight, we need to unravel the effect of interneuron activation on PE neurons. Increasing interneuron activity leads to changes in the baseline and gain of PE neurons (Supplementary Fig. 12, see Methods). In all three networks tested, activating PV neurons decreases both the baseline and gain of the PE neurons, leading to a decrease in the estimated variance (Fig. 4D and Supplementary Fig. 13). Stimulating the SOM or VIP neuron decreases the gain of either the nPE or the pPE neuron. However, the baseline of those neurons can either decrease or increase depending on the connectivity with other neurons in the network. The summed effect over nPE and pPE neurons (Supplementary Fig. 13) suggests that whether the activity of the V neuron increases or decreases depends on the input statistics: for low-mean stimuli, the elevated baseline activity dominates the changes in the variance, while for high-mean stimuli, the changes in the gain dominate. Furthermore, we note that the presence of a neuromodulator can also affect the predictive state of the network. That is, the M neuron activity can also be modulated by changes in the baseline and gain of the PE neurons.
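Estimating the baseline and gain of a PE neuron from its input-output relation amounts to a linear fit over the tested input range (here [0, 2.5], as in Fig. 4D). A minimal sketch, with a toy, hypothetical PE response standing in for the simulated network activity:

```python
import numpy as np

def baseline_and_gain(inputs, responses):
    """Fit r ~= gain * input + baseline over the tested input range."""
    gain, baseline = np.polyfit(inputs, responses, deg=1)
    return baseline, gain

# Toy PE response with baseline 0.5 and gain 1.2 (illustrative values):
inputs = np.linspace(0.0, 2.5, 26)
responses = 0.5 + 1.2 * inputs
b, g = baseline_and_gain(inputs, responses)   # b ~ 0.5, g ~ 1.2
```

Comparing the fitted baseline and gain before and after an interneuron is stimulated then quantifies how the modulation reshapes the PE response.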

Altogether, we show that neuromodulators increasing the activity of interneurons bias the weighting of sensory inputs and predictions by changing the gain and baseline of PE neurons. Whether the sensory weight increases or decreases depends not only on the interneuron it targets but also on the network it is embedded in and the input regime.

Explaining the contraction bias with the weighting of sensory inputs and predictions

We hypothesized that the weighted integration of sensory inputs and predictions manifests in everyday behavior as contraction bias. This phenomenon describes the tendency to overestimate sensory stimuli from the lower end of a distribution and underestimate those from the upper end, reflecting a bias toward the mean observed across species and modalities27,28,29,30,31,32.

First, we investigated whether the network’s output can be interpreted as a neuronal manifestation of the contraction bias (see Methods for an illustrative analysis). We define contraction bias as the trial-averaged difference between the weighted output and the sensory stimulus. The bias is positive for stimuli below the mean of the input distribution and negative for stimuli above the mean (Fig. 5A), consistent with a bias toward the mean. We quantify the bias using the slope of the linear fit between bias and trial stimulus; a larger absolute slope indicates a larger bias.
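As a minimal sketch of this quantification (with a toy output contracted toward the stimulus mean standing in for the network output; the contraction factor 0.3 and the stimulus range are illustrative assumptions, not the model's values):

```python
import numpy as np

def contraction_bias_slope(stimuli, outputs):
    """Slope of the linear fit between bias and stimulus, where
    bias = weighted output - presented stimulus. A larger absolute
    slope indicates a stronger contraction toward the mean."""
    bias = np.asarray(outputs) - np.asarray(stimuli)
    slope, _ = np.polyfit(stimuli, bias, deg=1)
    return slope

# Toy outputs pulled toward the stimulus mean (here 5.0) with a
# fixed contraction factor of 0.3:
rng = np.random.default_rng(0)
stimuli = rng.uniform(0.0, 10.0, size=500)
outputs = 0.7 * stimuli + 0.3 * 5.0
m = contraction_bias_slope(stimuli, outputs)   # slope ~ -0.3
```

The negative slope reflects a positive bias below the mean and a negative bias above it, as in Fig. 5A.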

Fig. 5: Mechanisms underlying the contraction bias.
figure 5

A Contraction bias in the model for two different stimulus uncertainties depicted in the inset. Bias is defined as the weighted output minus the mean stimulus of the trial. The absolute value of the slope of the linear fit, m, is a measure of the bias: the larger the slope, the larger the bias. B As a consequence of the sensory weight, the slope increases with stimulus variability (bias increases) and decreases with trial-to-trial variability (bias decreases). C Bias is independent of the trial-to-trial variability when the stimulus variability is zero. D Bias is independent of the stimulus variability when the trial-to-trial variability is zero. E The slope depends on the trial duration.

What network factors contribute to the neuronal contraction bias? When we increase the stimulus uncertainty, the bias increases as well (Fig. 5B). In contrast, when we increase the trial-to-trial uncertainty, the bias decreases (Fig. 5B). To further disentangle the different sources of the bias, we simulated a network without stimulus uncertainty (variance set to zero) under two trial-to-trial variances (environmental volatility). In this case, the emerging contraction bias is independent of the volatility of the environment (Fig. 5C). We show mathematically that the bias results from the network output not reaching its new steady state within the trial duration (see SI Methods). How fast the new steady state is reached depends only on the time constants in the network and not on the trial-to-trial variability. Next, we considered high stimulus uncertainty with zero trial-to-trial uncertainty. In this case, the contraction bias is largely independent of the stimulus variance (Fig. 5D). Our mathematical analysis reveals that the bias is well described by the difference between the prediction (that is, the mean over all previously shown stimuli) and the current stimulus, weighted by a function of the trial duration.

The analysis of both limit cases suggests that the bias also depends on the trial duration. To confirm this, we extended the trial duration for either limit case. As expected from the analysis, the bias decreases steadily in the simulations (Fig. 5E). We, therefore, predict that the contraction bias can be reduced for sufficiently long trials.

We assumed that the stimulus variance is independent of the stimulus mean. Consequently, the bias at both ends of the input distribution is similar but reversed in sign. However, behavioral data (for example, ref. 46) shows that the bias increases for stimuli from the upper end of the distribution, a phenomenon usually attributed to scalar variability. Modeling the stimulus standard deviation as linearly increasing with the stimulus mean, we also observed an increased bias for higher trial means (Supplementary Fig. 14).

In summary, we show that the weighted integration of sensory inputs and predictions can be interpreted as a neural manifestation of the contraction bias. While the stimulus and trial-to-trial variability shape the contraction bias, their contributions differ. Moreover, we reveal that the trial duration contributes to the bias.

Discussion

Our work has been driven by the puzzling question of how the brain integrates top-down feedback predictions with the sensory feedforward inputs it constantly receives during behavior. This task is particularly challenging when predictions and sensory information differ47. Conflicting information may arise from noise in the sensory inputs or from unpredictable changes in the environment. A prominent hypothesis suggests that the degree to which we rely on predictions versus new sensory evidence is determined by an intricate balance based on the reliability of each source (for example, see refs. 4,9).

This idea aligns with Bayesian theories on the optimal integration of multiple sensory cues (multisensory integration). Ernst and Banks2 demonstrated that humans estimate the height of a bar by combining visual and haptic information in a manner that minimizes the variance of the final estimate. Similar studies have confirmed that animals can optimally combine multiple sensory information by taking into account their uncertainties3,4,5,6,7,8. These behavioral findings were accompanied by neural recordings identifying populations of neurons that can form the basis of multisensory integration7,8,48.

Here, we show that PE neurons can serve as the backbone for estimating the uncertainty of both the feedforward sensory inputs and the feedback predictions (Figs. 2 and 3). Our model proposes a hierarchy of PE circuits connected through the lower-order memory neuron, whose activity encodes the mean of the sensory bottom-up inputs. This local prediction is fed back to the lower-order circuit and simultaneously fed forward to the higher-order subnetwork (Fig. 1). With this architecture, we show that we rely more strongly on our internal signals when the perceived sensory cues are noisier than the predictions. Moreover, our work suggests that predictions modulate neural activity more at the onset of a new sensory input, even if the stimulus is not noisy.

Relying more on predictions at the onset of a new trial, immediately after a change point, can be suboptimal. In a sound-localization task in which subjects were asked to predict the next stimulus after observing a series of stimuli, it was found that subjects tended to discard their predictions immediately following a change point38,39. However, as noted in the study, participants were informed about the nature of the task, which could have influenced their responses. In situations where the underlying task is not explicitly known, the strategy may be less clear. This is particularly true when changes occur in the sensory noise rather than in the environment (that is, in σ rather than in μ). In such cases, a reliability-based weighting of sensory inputs and predictions might offer an advantage. Nevertheless, our model could be extended to include a change-point detection mechanism (see, for example, ref. 39), which could help reduce the observed discrepancy immediately after a change point.

We show that the weighting of sensory inputs and predictions can be biased by neuromodulators, as previously suggested (for example, see ref. 9). In our model, these modulatory signals act through interneurons42 whose activities increase in the presence of neuromodulators. When PV neuron activity increases, the network weighs predictions more strongly than without modulation. In contrast, when VIP neuron activity increases, the network underestimates the uncertainty of the prediction in a sensory-driven regime, and it underestimates the uncertainty of the sensory input in a prediction-driven regime. This results in the system weighting sensory inputs and predictions more equally (Fig. 4A). When SOM and VIP neuron activities are modulated to the same degree, the weighting remains unaffected, suggesting that the individual contributions cancel (Fig. 4B). These findings can be explained by changes in the baseline and gain of PE neurons arising through the modulation of interneuron activity (Fig. 4D). The results can be tested experimentally by optogenetically or pharmacologically stimulating specific interneuron types. Finally, we illustrate that the weighted integration of feedforward and feedback inputs can be interpreted as a neural manifestation of the contraction bias.

What could be the biological basis for our network model? Sensory information is commonly believed to be channeled through the thalamus and initially arrives in layer 4 of the neocortex49,50. Neurons in layer 4 subsequently relay the information to layer 2/349, where it is further integrated with inputs from higher-order cortical areas entering layer 151. From layer 2/3, the information is subsequently forwarded to layer 5 neurons49,52, which integrate it with direct inputs from the thalamus53.

The core hypothesis of our model is the presence of sensory PE neurons, which have been predominantly found in layer 2/3, in different brain areas of various species11,12,14,16,17,18. While we assume these neurons encode PEs in their activity, it remains an active research area whether PEs are encoded in the (spiking) activity of separate neurons and/or in the local voltage dynamics of dendrites54. Recent findings by Gillon et al.55 indicate that pattern-violation signals are processed differently at the soma and dendrites over time, suggesting a more complex role for excitatory neuron compartments in predictive processing than our simplified model accounts for.

In our network, memory neurons could correspond to a subset of excitatory L2/3 neurons. Some L2/3 neurons have been shown to develop predictive responses to expected visual stimuli56. Additionally, a group of L2/3 neurons has been shown to integrate both negative and positive prediction errors57, which aligns with our assumption. The weighted output of our network aligns with the concept of internal representation neurons in predictive processing theories10,58, hypothesized to be deep-layer 5 (L5) neurons58,59. These large pyramidal cells in L5 are ideally situated to integrate top-down information (e.g., predictions) arriving at their apical dendrites in layer 1 with bottom-up information (e.g., sensory inputs) arriving at their basal dendrites in deeper layers34,60.

The existence of neurons encoding variance in primary sensory areas remains a prediction of our model that requires validation. However, it has been shown that stimulus uncertainty can be encoded in the gain variability of individual neurons in V1 and V2 of macaques61. Evidence also indicates that populations of neurons can encode uncertainty62. For instance, neurons in the parietal cortex of monkeys encode confidence in perceptual decisions63, and neurons in the orbitofrontal cortex encode confidence regardless of sensory modality64. Neural signatures of uncertainty have also been found in regions of the prefrontal cortex65, the rat insular and orbitofrontal cortex66, and the dorsal striatum in monkeys67. Additionally, the accuracy of memory recalls is encoded in single neurons of the human parietal and temporal lobes68,69.

In our computation, the relative weights with which the sensory input and the prediction are integrated depend on the activities of the lower-level and higher-level variance neurons. While it is unlikely that these variance neurons can directly modulate the weights, they might trigger the release of neuromodulators that, in turn, affect the synaptic plasticity of those weights70,71. For example, deep L5 neurons, which have been hypothesized to act as internal representation neurons58,59, could be modulated in this manner. Depending on the receptor types present in the apical and basal dendrites, neuromodulators could either decrease or increase the synaptic weights connecting the sensory input and the prediction to these neurons.

Alternatively, the integration of sensory inputs and predictions could occur without changes in synaptic weights, implemented through a network of neurons encoding different aspects of the computation via their activities. For instance, an inhibitory neuron could encode the sum of the variances and exert divisive inhibition on another neuron, which is driven by the sensory input and the higher-order variance neuron in a multiplicative manner. Interneurons such as PV or SOM neurons have been shown to exert divisive inhibition72,73. Moreover, these computations could also be carried out by different compartments within the same neuron. For example, a deep L5 pyramidal cell may receive the sensory input at its basal dendrites and the activity of the higher-order variance neuron at its apical dendrites.

Neuromodulators correlate with uncertainty and influence the weighting of sensory inputs and their predictions9. Theoretical work by Yu and Dayan74 suggests that acetylcholine (ACh) correlates with expected uncertainty, while noradrenaline (NA) correlates with unexpected uncertainty. Expected uncertainty is usually interpreted as known cue-outcome unreliabilities, whereas unexpected uncertainty relates to the changes in the environment that produce large PEs outside the expected range of uncertainties74. While in our network, the stimulus and trial-to-trial variability can only be loosely interpreted as ’expected’ and ’unexpected’ uncertainty, we discuss the conditions under which our network aligns with findings on ACh and NA.

NA is believed to increase in more volatile environments and enhance bottom-up processes9,75. In line with this idea, NA blockade impairs cognitive flexibility76,77. Recent work by Lawson et al.78 shows that humans receiving propranolol (blocking NA) rely more on expectations and are slower to update their predictions despite new sensory evidence9. A main target for noradrenergic inputs is SOM neurons, whose activity increases in the presence of NA (reviewed in refs. 43,44,79). In our model, activating SOM neurons does not enhance sensory bottom-up input. In a volatile environment, that is, a sensory-driven regime, the system takes into account predictions slightly more than without SOM modulation (Fig. 4A and Supplementary Fig. 15).

However, we assumed that neuromodulators act globally, that is, on the interneurons in both the lower and the higher PE circuit. While this agrees with the view that neuromodulators can control network states globally, there is also evidence that they can have a more local, finely adjusted impact on neural circuits80. In our model, increasing SOM activity only in the lower-order circuit slightly enhances the sensory weight (Supplementary Fig. 16), that is, the bottom-up inputs.

Similarly to NA, ACh has also been shown to enhance bottom-up, feedforward inputs (reviewed in refs. 74,81). For instance, subjects relied more on prior beliefs when given cholinergic receptor antagonists81. A major target for cholinergic inputs is VIP neurons, whose activity increases in the presence of ACh (reviewed in refs. 43,44,45). In our model, globally activating VIP neurons enhances bottom-up input in stable environments for noisy stimuli. However, increasing VIP activity only in the higher-order PE circuit generally enhances sensory bottom-up inputs (Supplementary Fig. 16). This suggests that whether a neuromodulator biases the network toward feedforward bottom-up or feedback top-down inputs depends on its spatial and temporal scale of influence.

Our model suggests one potential neuronal circuit mechanism for the uncertainty estimation of sensory inputs and predictions. Modeling specific neurons that encode the variance of feedforward sensory inputs and predictions aligns with the concept that neurons can explicitly represent parameters of a probability distribution, such as the mean or variance (see also refs. 82,83,84). However, the representation of variances in the brain is still not comprehensively understood, and several alternative models have been proposed. For instance, uncertainty might be decoded from the collective activity of neuron populations85,86,87,88. Some theories suggest that uncertainty is represented by the amplitude87, the width89, or the variability of a neuron’s response87,90. Another prominent theory is the neural sampling hypothesis, which suggests that neural circuits encode probability distributions rather than precise values. In this framework, the variability in neural responses is interpreted as samples drawn from these distributions91,92,93.

Many normative models have been proposed for state estimation and prediction under uncertainty62, ranging from the classical Kalman filter to more recent models like Bayes Factor Surprise94. For instance, the Bayes factor surprise formalizes the trade-off between integrating new observations into an existing belief system and resetting this belief system with novel evidence. The surprise factor captures how much an animal’s current belief deviates from the new observation.

In recent years, normative models have also been squared with biological constraints. For instance, Kutschireiter et al.95 showed that a Bayesian ring attractor model can encode uncertainty in the amplitude of the network activity and match the performance of a circular Kalman filter when the recurrent connections are tuned appropriately. In other seminal work, it has been proposed that Bayesian inference in time can be linked to the dynamics of leaky integrate-and-fire neurons with spike-dependent adaptation96.

Furthermore, in our model, we assume that the lower-level prediction is not only fed back to neurons in the lower-level PE circuit but is also forwarded to the higher-level subnetwork. Unlike classical hierarchical predictive coding frameworks, where PEs are sent up the hierarchy, we propose that PEs are processed locally. However, we are not the first to consider an alternative model. For instance, Spratling97 reformulated the predictive coding model by Rao and Ballard15 within the context of a biased competition model. In Spratling’s model, cortical borders are also redefined so that prediction neurons in the lower-level cortical area connect to error neurons in the higher-level cortical area. However, we note that these views are not mutually exclusive and might differ between brain areas or species. We discuss in more detail alternative network models in the Supplementary Discussion.

The notion that neural implementations for the integration of inputs may vary across species and modalities is supported by work on multisensory integration. Wong et al.98 showed that in Drosophila larvae the chosen cue-combination strategy varies depending on the type of sensory information available. Also, humans typically put more weight on visual than auditory cues3,5, but trust vestibular information more than visual information about head direction99, a finding also observed for monkeys100. Moreover, Summerfield et al.101 showed that humans diverge from an optimal Bayesian strategy in very volatile environments and act according to their experience in the last trial. It has been suggested that the brain may use different strategies to combine signals depending on the task demands84.

Here, we propose a view in which PE neurons serve as the backbone for estimating both the uncertainty of the feedforward sensory stimuli arising from the external world and the feedback signals carrying predictions about the same feedforward inputs our brains receive. Our work is an important step toward a better understanding of the brain’s ability to integrate these unreliable feedforward and feedback signals that often do not match perfectly.

Methods

Network model

The mean-field network model consists of a lower and higher PE circuit (Fig. 1C, D). Each PE circuit contains an excitatory nPE neuron and pPE neuron (NnPE = NpPE = 1), as well as inhibitory neurons. The inhibitory neurons comprise PV, SOM, and VIP neurons (NSOM = NVIP = 1, NPV = 2), further explained in ref. 26. In addition to the core PE circuit, each subnetwork also includes one memory neuron M and one variance neuron V.

The excitatory neurons in the PE circuit are simulated as two coupled point compartments, representing the soma and the dendrites of elongated pyramidal cells. All other neurons are modeled as point neurons. The activities of all neurons are represented by a set of differential equations describing the network dynamics.

The dynamics of the neurons in the lower and higher PE circuits (\({\underline{r}}_{{{\rm{PE}}}}^{{{\rm{low}}}}\) and \({\underline{r}}_{{{\rm{PE}}}}^{{{\rm{high}}}}\)) are given by

$$\begin{array}{rc}{\underline{r}}_{{{\rm{PE}}}}^{{{\rm{low}}}}=&{\left[{\underline{h}}_{{{\rm{PE}}}}^{{{\rm{low}}}}\right]}_{+}\\ {\underline{r}}_{{{\rm{PE}}}}^{{{\rm{high}}}}=&{\left[{\underline{h}}_{{{\rm{PE}}}}^{{{\rm{high}}}}\right]}_{+}\end{array}$$
(1)

with

$${T}_{c}\cdot {\underline{\dot{h}}}_{{{\rm{PE}}}}^{{{\rm{low}}}}=-{\underline{h}}_{{{\rm{PE}}}}^{{{\rm{low}}}}+{W}_{{{\rm{PE\leftarrow PE}}}}\cdot {\underline{r}}_{{{\rm{PE}}}}^{{{\rm{low}}}}+{\underline{w}}_{{{\rm{PE\leftarrow M}}}}\cdot {r}_{{{\rm{M}}}}^{{{\rm{low}}}}+{\underline{w}}_{{{\rm{PE\leftarrow FF}}}}\cdot s+{\underline{I}}_{{{\rm{PE}}}}\\ {T}_{c}\cdot {\underline{\dot{h}}}_{{{\rm{PE}}}}^{{{\rm{high}}}}=-{\underline{h}}_{{{\rm{PE}}}}^{{{\rm{high}}}}+{W}_{{{\rm{PE\leftarrow PE}}}}\cdot {\underline{r}}_{{{\rm{PE}}}}^{{{\rm{high}}}}+{\underline{w}}_{{{\rm{PE\leftarrow M}}}}\cdot {r}_{{{\rm{M}}}}^{{{\rm{high}}}}+{\underline{w}}_{{{\rm{PE\leftarrow FF}}}}\cdot {r}_{{{\rm{M}}}}^{{{\rm{low}}}}+{\underline{I}}_{{{\rm{PE}}}}.$$
(2)

We follow the notation that column and row vectors are indicated by letters with an underscore \(\underline{\bullet }\), matrices are denoted by capital letters, and scalars are given by small letters without an underscore. Furthermore, a time derivative (e.g., \(\frac{dx}{dt}\)) is denoted by a dot above the letter (e.g., \(\dot{x}\)). The rate vector \({\underline{r}}_{{{\rm{PE}}}}^{{{\rm{loc}}}}=\left[{r}_{{{\rm{nE}}}}^{{{\rm{loc}}}},\,{r}_{{{\rm{pE}}}}^{{{\rm{loc}}}},\,{r}_{{{\rm{nD}}}}^{{{\rm{loc}}}},\,{r}_{{{\rm{pD}}}}^{{{\rm{loc}}}},\,{r}_{{{{\rm{PV}}}}_{{{\rm{1}}}}}^{{{\rm{loc}}}},{r}_{{{{\rm{PV}}}}_{{{\rm{2}}}}}^{{{\rm{loc}}}},\,{r}_{{{\rm{SOM}}}}^{{{\rm{loc}}}},{r}_{{{\rm{VIP}}}}^{{{\rm{loc}}}}\right]\) with loc ∈ {low, high} contains the activities of all neurons or compartments in the PE circuit (soma of nPE/pPE neurons: nE/pE, dendrites of nPE/pPE neurons: nD/pD). The network receives time-dependent stimuli s and neuron/compartment-specific external background input \({\underline{I}}_{{{\rm{PE}}}}\). The connection strengths between a pre-synaptic population and the neurons of the PE circuit are denoted by \({W}_{{{\rm{PE\leftarrow pre}}}}\) (if the pre-synaptic activity is a vector) or \({\underline{w}}_{{{\rm{PE}}}\leftarrow pre}\) (if it is a scalar). The activities of the neurons evolve with time constants summarized in the diagonal matrix Tc (entries that correspond to an excitatory cell are set to τE = 60 ms, while entries that correspond to the inhibitory neurons are set to τI = 2 ms, in line with ref. 26).
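Equations (1) and (2) can be sketched numerically as follows. This is a minimal toy example: the two-unit mutual-inhibition matrix, the weight values, and the forward-Euler scheme are illustrative assumptions, not the fitted PE-circuit connectivity from ref. 26 or the integration scheme of the simulations.

```python
import numpy as np

def simulate_pe_circuit(W, w_M, w_FF, I_ext, s, r_M, tau, dt=1e-3, T=1.0):
    """Forward-Euler integration of Eq. (2) with rectified rates, Eq. (1):
    tau * dh/dt = -h + W [h]_+ + w_M * r_M + w_FF * s + I_ext."""
    h = np.zeros(W.shape[0])
    for _ in range(int(T / dt)):
        r = np.maximum(h, 0.0)                    # r = [h]_+
        h += dt / tau * (-h + W @ r + w_M * r_M + w_FF * s + I_ext)
    return np.maximum(h, 0.0)

# Toy two-unit circuit with hypothetical mutual inhibition:
W = np.array([[0.0, -0.5],
              [-0.5, 0.0]])
r = simulate_pe_circuit(W, w_M=np.array([1.0, 0.0]),
                        w_FF=np.array([0.0, 1.0]),
                        I_ext=np.zeros(2), s=1.0, r_M=1.0,
                        tau=np.array([0.06, 0.06]))
# The steady state solves r = [W r + inputs]_+; here r1 = r2 = 2/3.
```

The same routine applies to the lower and higher PE circuits: for the lower circuit the feedforward drive is the stimulus s, for the higher circuit it is the lower-level M neuron activity.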

The activities of the lower and higher memory (M) neuron evolve according to a perfect integrator. The M neurons receive synapses from both nPE and pPE neurons of the same subnetwork,

$${\tau }_{E}\cdot {\dot{r}}_{{{\rm{M}}}}^{{{\rm{low}}}}={\underline{w}}_{{{\rm{M\leftarrow PE}}}}^{{{\rm{low}}}}\cdot {\underline{r}}_{{{\rm{PE}}}}^{{{\rm{low}}}}={w}_{{{\rm{M\leftarrow pPE}}}}^{{{\rm{low}}}}\cdot {r}_{{{\rm{pPE}}}}^{{{\rm{low}}}}-{w}_{{{\rm{M\leftarrow nPE}}}}^{{{\rm{low}}}}\cdot {r}_{{{\rm{nPE}}}}^{{{\rm{low}}}}\\ {\tau }_{E}\cdot {\dot{r}}_{{{\rm{M}}}}^{{{\rm{high}}}}={\underline{w}}_{{{\rm{M\leftarrow PE}}}}^{{{\rm{high}}}}\cdot {\underline{r}}_{{{\rm{PE}}}}^{{{\rm{high}}}}={w}_{{{\rm{M\leftarrow pPE}}}}^{{{\rm{high}}}}\cdot {r}_{{{\rm{pPE}}}}^{{{\rm{high}}}}-{w}_{{{\rm{M\leftarrow nPE}}}}^{{{\rm{high}}}}\cdot {r}_{{{\rm{nPE}}}}^{{{\rm{high}}}}.$$
(3)

Please note that although the time constants for the lower and higher M neurons are identical, their effective time constants differ due to variations in the weights connecting the PE neurons with the M neurons (the effective time constant of the higher subnetwork is between 4 and 64 times larger than that of the lower subnetwork, see ’Connectivity’).
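For intuition, consider the idealization that the combined PE drive to the M neuron approximates the mismatch between stimulus and prediction, \({\underline{w}}_{{{\rm{M\leftarrow PE}}}}\cdot {\underline{r}}_{{{\rm{PE}}}}\approx \lambda \cdot (s-{r}_{{{\rm{M}}}})\) (a sketch under this assumption, using the λ values given in ’Connectivity’). Equation (3) then behaves as

$${\tau }_{E}\cdot {\dot{r}}_{{{\rm{M}}}}\approx \lambda \cdot \left(s-{r}_{{{\rm{M}}}}\right)\quad \Rightarrow \quad {\tau }_{{{\rm{eff}}}}=\frac{{\tau }_{E}}{\lambda },\qquad \frac{{\tau }_{{{\rm{eff}}}}^{{{\rm{high}}}}}{{\tau }_{{{\rm{eff}}}}^{{{\rm{low}}}}}=\frac{{\lambda }^{{{\rm{low}}}}}{{\lambda }^{{{\rm{high}}}}},$$

which evaluates to 3 × 10−3/(7 × 10−4) ≈ 4.3 or 4.5 × 10−2/(7 × 10−4) ≈ 64, consistent with the factor of 4 to 64 stated above.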

The activities of the lower and higher V neuron evolve according to a leaky integrator with quadratic activation function (τV = 5 s). The variance neurons receive synapses from both nPE and pPE neurons of the same subnetwork,

$${\tau }_{{{\rm{V}}}}\cdot {\dot{r}}_{{{\rm{V}}}}^{{{\rm{low}}}}=-{r}_{{{\rm{V}}}}^{{{\rm{low}}}}+{\left({\underline{w}}_{{{\rm{V\leftarrow PE}}}}\cdot {\underline{r}}_{{{\rm{PE}}}}^{{{\rm{low}}}}\right)}^{2}=-{r}_{{{\rm{V}}}}^{{{\rm{low}}}}+{\left({w}_{{{\rm{V\leftarrow pPE}}}}\cdot {r}_{{{\rm{pPE}}}}^{{{\rm{low}}}}+{w}_{{{\rm{V\leftarrow nPE}}}}\cdot {r}_{{{\rm{nPE}}}}^{{{\rm{low}}}}\right)}^{2}\\ {\tau }_{{{\rm{V}}}}\cdot {\dot{r}}_{{{\rm{V}}}}^{{{\rm{high}}}}=-{r}_{{{\rm{V}}}}^{{{\rm{high}}}}+{\left({\underline{w}}_{{{\rm{V\leftarrow PE}}}}\cdot {\underline{r}}_{{{\rm{PE}}}}^{{{\rm{high}}}}\right)}^{2}=-{r}_{{{\rm{V}}}}^{{{\rm{high}}}}+{\left({w}_{{{\rm{V\leftarrow pPE}}}}\cdot {r}_{{{\rm{pPE}}}}^{{{\rm{high}}}}+{w}_{{{\rm{V\leftarrow nPE}}}}\cdot {r}_{{{\rm{nPE}}}}^{{{\rm{high}}}}\right)}^{2}.$$
(4)
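Equations (3) and (4) together implement a running estimate of the input mean and variance. The sketch below is an idealized demo: it assumes the PE circuit outputs perfectly rectified mismatches between the stimulus and the current prediction, and the weight values are illustrative, not the normalized λ/g weights defined in ’Connectivity’.

```python
import numpy as np

def update_m(r_M, r_nPE, r_pPE, w=0.045, tau_E=0.06, dt=1e-3):
    """One Euler step of the perfect integrator, Eq. (3)."""
    return r_M + dt / tau_E * w * (r_pPE - r_nPE)

def update_v(r_V, r_nPE, r_pPE, w=1.0, tau_V=5.0, dt=1e-3):
    """One Euler step of the leaky integrator with quadratic
    activation, Eq. (4)."""
    return r_V + dt / tau_V * (-r_V + (w * (r_pPE + r_nPE)) ** 2)

# Idealized PE responses: nPE = [prediction - s]_+, pPE = [s - prediction]_+.
rng = np.random.default_rng(1)
r_M, r_V = 0.0, 0.0
mu, sigma = 5.0, 1.0
for _ in range(200_000):                       # 200 s of 1-ms steps
    s = rng.normal(mu, sigma)
    nPE, pPE = max(r_M - s, 0.0), max(s - r_M, 0.0)
    r_M = update_m(r_M, nPE, pPE)
    r_V = update_v(r_V, nPE, pPE)
# r_M settles near mu and r_V near sigma**2.
```

Because the V neuron squares the summed PE activity, its steady-state activity approximates the variance of the feedforward input, while the M neuron tracks its mean.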

Details on the model equations for the mean-field and the multi-cell population network, as well as supporting analyses can be found in the supplementary material.

Connectivity

The connectivity within a PE circuit, WPE←PE, can be found in ref. 26, in the simulation code provided (see below), and the Supplementary Data 13. The vector \({\underline{w}}_{{{\rm{PE\leftarrow M}}}}\) contains the connection strengths between the memory neuron M and the post-synaptic neurons X in the PE circuit, wX←M. If a connection exists, wX←M = 1; otherwise, wX←M = 0. In all mean-field networks tested, the dendrites of nPE and pPE neurons and one of the two PV neurons receive connections from the memory neuron. Whether the SOM or VIP neurons are the target of the feedback projections depends on the specific mean-field network tested (see Supplementary Table 2 and ref. 26).

The vector \({\underline{w}}_{{{\rm{PE\leftarrow FF}}}}\) contains the connection strengths between the feedforward input and the post-synaptic neurons X in the PE circuit, wX←FF. The feedforward input is either the direct sensory input s for the lower PE circuit, or the activity of the lower-level M neuron, \({r}_{{{\rm{M}}}}^{{{\rm{low}}}}\), for the higher PE circuit. In general, for the three mean-field networks tested, we chose wX←FF = 1 − wX←M.

The connection strength between the nPE/pPE neuron and the memory neuron M in the mean-field network is \({w}_{{{\rm{M\leftarrow nE}}}}^{{{\rm{loc}}}}\,=\,\frac{{\lambda }^{{{\rm{loc}}}}}{{g}_{{{\rm{nPE}}}}}\) and \({w}_{{{\rm{M\leftarrow pE}}}}^{{{\rm{loc}}}}\,=\,\frac{{\lambda }^{{{\rm{loc}}}}}{{g}_{{{\rm{pPE}}}}}\), respectively, where λloc denotes the non-normalized weight for the lower or higher-order PE circuit, loc ∈ {low, high}. In the lower PE circuit, λlow = 3 × 10−3 for the mean-field model in Fig. 2 and λlow = 4.5 × 10−2 for Figs. 3–5. In the higher PE circuit, λhigh = 7 × 10−4. The gain factors gnPE and gpPE depend on the mean-field network tested and can be found in ref. 26 or the supplementary material. Similarly, the connection strength between the nPE/pPE neuron and a V neuron is given by \({w}_{{{\rm{V\leftarrow nPE/pPE}}}}\,=\,\frac{1}{{g}_{{{\rm{nPE/pPE}}}}}\).

Details on the multi-cell population networks can be found in the supplementary material.

Inputs

All neurons of the core PE circuit receive an external background input (summarized in the vector \({\underline{I}}_{{{\rm{PE}}}}\)) that ensures reasonable baseline firing rates in the absence of sensory inputs and predictions thereof. In line with ref. 26, these inputs were set such that the baseline firing rates are rpE = rpD = rnE = rnD = 0 s−1 and rP = rS = rV = 4 s−1. In addition, the network receives feedforward stimuli s that may vary between trials. To account for noise, each stimulus is composed of Nin constant values drawn from a normal distribution with mean μin and standard deviation σin. Each value is presented for Nstep consecutive time steps (each time step is 1 ms). To account for changes in the environment, μin is drawn from a uniform distribution U(ab) with mean \({\mu }_{{{\rm{trial}}}}=\frac{a+b}{2}\) and standard deviation \({\sigma }_{{{\rm{trial}}}}=\frac{b-a}{\sqrt{12}}\). To test different input statistics, the parameterization of both distributions varies across the experiments (see Table 1). All stimulus/input parameters can also be found in the supplementary material, in addition to the parameters for the Supplementary Figs.
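The two-level stimulus generation described above can be sketched as follows; the parameter values are illustrative placeholders, not the entries of Table 1.

```python
import numpy as np

def make_stimulus(mu_trial, sigma_trial, sigma_in, n_in, n_step, rng):
    """Generate one trial: draw mu_in from U(a, b), then n_in constant
    values from N(mu_in, sigma_in), each held for n_step 1-ms steps."""
    # Recover the uniform bounds from mu_trial = (a + b)/2 and
    # sigma_trial = (b - a)/sqrt(12):
    half_range = np.sqrt(12.0) * sigma_trial / 2.0
    mu_in = rng.uniform(mu_trial - half_range, mu_trial + half_range)
    values = rng.normal(mu_in, sigma_in, size=n_in)
    return np.repeat(values, n_step)              # stimulus trace s(t)

rng = np.random.default_rng(0)
s = make_stimulus(mu_trial=5.0, sigma_trial=1.0, sigma_in=0.5,
                  n_in=10, n_step=100, rng=rng)   # 1-s trial in 1-ms bins
```

Here σin controls the within-trial (stimulus) uncertainty, while σtrial controls the trial-to-trial uncertainty (environmental volatility).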

Table 1 Parameters used to stimulate the network

Simulations

All simulations were performed in customized Python code written by LH. Differential equations were numerically integrated using a 2nd-order Runge-Kutta method. Neurons were initialized with r = 0 s−1.
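A 2nd-order Runge-Kutta step can be sketched as below; Heun’s method is one common 2nd-order variant and is an assumption here, as the exact scheme used in the simulation code is not specified in the text.

```python
def rk2_step(f, y, t, dt):
    """One 2nd-order Runge-Kutta (Heun) step for dy/dt = f(t, y)."""
    k1 = f(t, y)
    k2 = f(t + dt, y + dt * k1)
    return y + 0.5 * dt * (k1 + k2)

# Sanity check on dy/dt = -y with y(0) = 1, whose exact solution
# is y(t) = exp(-t):
y, t, dt = 1.0, 0.0, 0.01
for _ in range(100):
    y = rk2_step(lambda t, y: -y, y, t, dt)
    t += dt
# After t = 1, y is close to exp(-1) ~ 0.3679.
```

The per-step error is third order in dt, giving a second-order global error, which is sufficient for the smooth rate dynamics of Eqs. (2)-(4).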

Weighting of sensory inputs and predictions

We arithmetically calculated the weighted output of sensory inputs and predictions, rout, based on ideas of Bayesian multisensory integration (for example, see ref. 40),

$${r}_{{{\rm{out}}}}=\alpha \cdot s+(1-\alpha )\cdot {r}_{{{\rm{M}}}}^{{{\rm{low}}}},$$
(5)

where α denotes the sensory weight (that is, the relative reliability of the sensory input) and is given by

$$\alpha={\left(1+\frac{{r}_{{{\rm{V}}}}^{{{\rm{low}}}}}{{r}_{{{\rm{V}}}}^{{{\rm{high}}}}}\right)}^{-1}.$$
(6)
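Equations (5) and (6) amount to the following one-liner (the function name and the example values are illustrative):

```python
def weighted_output(s, r_M_low, r_V_low, r_V_high):
    """Eqs. (5)-(6): combine the sensory input s and the prediction
    r_M_low, weighted by the lower- and higher-level variance estimates."""
    alpha = 1.0 / (1.0 + r_V_low / r_V_high)   # sensory weight, Eq. (6)
    return alpha * s + (1.0 - alpha) * r_M_low

# Reliable sensory input (small lower-level variance): the output
# follows s. Here alpha = 0.9, so the output is 0.9*2 + 0.1*1 = 1.9.
out = weighted_output(s=2.0, r_M_low=1.0, r_V_low=0.1, r_V_high=0.9)
```

Note that α → 1 when the lower-level variance vanishes (trust the stimulus) and α → 0 when it dominates (trust the prediction), reproducing the sensory-driven and prediction-driven regimes.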

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.