Flexible value coding in the mesolimbic dopamine system depending on internal water and sodium balance

Ozawa, Takaaki; Nakagawa, Issei; Uchida, Yuuki; Abe, Mayuka; Macpherson, Tom; Yamashita, Yuichi; Hikida, Takatoshi

doi:10.1038/s41538-025-00558-w

Download PDF

Article
Open access
Published: 30 September 2025

Flexible value coding in the mesolimbic dopamine system depending on internal water and sodium balance

Takaaki Ozawa^1,2^na1,
Issei Nakagawa^1,2^na1,
Yuuki Uchida^3,4^na1,
Mayuka Abe^1,2,
Tom Macpherson^1,2,5,
Yuichi Yamashita³ &
…
Takatoshi Hikida^1,2

npj Science of Food volume 9, Article number: 197 (2025) Cite this article

328 Accesses
1 Altmetric
Metrics details

Subjects

Abstract

Homeostatic imbalances elicit strong cravings, such as thirst and salt appetite, to restore equilibrium. Although midbrain dopaminergic neurons are known to encode the value of foods, their nutritional state-dependency remains unknown. Here, we show that the activity of the dopaminergic mesolimbic pathway flexibly expresses the positive and negative values of water and salt depending on the internal state in mice. Mice showed behavioral preference and aversion to water and salt depending on their internal water and sodium balance. Fiber photometry recordings revealed that dopamine neurons in the ventral tegmental area and dopamine release in the nucleus accumbens core flexibly showed bidirectional excitatory and inhibitory responses to water and salt intake in a state-dependent manner. Furthermore, these dopaminergic and behavioral responses were recapitulated by a homeostatic reinforcement learning model that formalizes reward as reductions in homeostatic drive and punishment as its escalation. Our results demonstrate the nutritional state-dependency of value coding in mesolimbic dopamine systems, providing new insights into neural circuits underlying homeostatic regulation of appetitive and avoidance behaviors.

Dopamine subsystems that track internal states

Article Open access 13 July 2022

Enhanced cognitive flexibility and phasic striatal dopamine dynamics in a mouse model of low striatal tonic dopamine

Article 02 May 2024

Individual differences in decision-making shape how mesolimbic dopamine regulates choice confidence and change-of-mind

Article 30 July 2025

Introduction

Homeostasis refers to the body’s ability to maintain a constant internal environment¹, and its imbalance can lead to various appetitive behaviors that facilitate its restoration. Typical examples of such cravings include thirst and salt appetite, each of which is triggered by a reduction in the amount of water or sodium concentration in the body, respectively^2,3,4,5. Both water and sodium deficiencies activate the endocrine system, including the renin-angiotensin-aldosterone system (RAAS), which inhibits water and salt excretion and further stimulates appetitive behaviors aimed at restoring balance⁵. On the other hand, it is known that thirst and salt appetite are controlled by distinct neural circuits. Thirst is strongly regulated by the activities of circumventricular organs (CVOs), such as the subfornical organ (SFO) and organum vasculosum of lamina terminalis (OVLT), which are located around the third ventricle and lack a blood-brain barrier^6,7,8,9. The signals from CVOs converge in the hypothalamic median preoptic nucleus (MnPO) to trigger water seeking/taking behavior through a distributed neural network^7,10. In contrast, salt appetite is controlled by the excitatory activities of specific neural populations in hindbrain and midbrain regions, including the nucleus of the solitary tract (NTS)^11,12, the pre-locus coeruleus (pre-LC)¹², and the parabrachial nucleus (PBN)¹³, as well as CVOs⁶.

Dopamine, a key neurotransmitter in the brain’s reward system, is released from dopaminergic neurons centrally located in midbrain regions, including the ventral tegmental area (VTA) of the midbrain, which strongly projects to the striatum. These neurons are well-known to play a crucial role in feeding behavior¹⁴ and are activated by rewarding stimuli, such as food intake under a hunger state¹⁵. Conversely, they are largely suppressed by aversive stimuli, such as pain^16,17. This bidirectional response to reward and punishment forms the basis of a prominent theory that dopaminergic activity encodes the value of the stimulus¹⁸. Moreover, dopamine neuron activity in the midbrain is suggested to represent information analogous to that of reward prediction error, the difference between the predicted and resulting outcome, in reinforcement learning models^18,19,20. Notably, dopamine release in the ventral striatum, including the nucleus accumbens (NAc), in response to reward²¹ and punishment¹⁶, as well as during the encoding of prediction error-like information²², closely mirrors the activity of midbrain dopamine neurons.

Dopamine neuron activity in response to water and salt ingestion is highly dependent on the internal balance of water and sodium in animals. For example, thirst increases the dopaminergic response to water intake^23,24, while low body sodium levels result in an elevated dopaminergic response to salt intake^23,25,26. However, it remains unsolved whether excessive water or salt intake suppresses dopaminergic activity when the body already contains sufficient water and salt, and whether dopamine neurons in the same animal flexibly show bidirectional responses, such as an excitatory response to rewarding water- or salt intake and an inhibitory response to aversive water- or salt intake, in an internal state-dependent manner. Interestingly, previous experiments have shown that some dopamine neurons in the VTA exhibit excitatory responses to aversive or salient external stimuli, and that dopamine release in specific subregions of the NAc also increases in response to such stimuli^{16,17,18,27,28,29}. Moreover, optogenetic activation of midbrain dopamine neurons reduces salt consumption in desalted mice³⁰, challenging the idea of purely state-dependent value coding in dopamine neurons.

The objective of this study was to investigate the influence of water and salt imbalance on the activity of the mesolimbic dopaminergic system, and to identify whether bidirectional regulation of dopaminergic responses occurs during water or salt consumption depending on the internal balance. To tackle this, we developed a novel single-drop brief access test to show that mice subjected to water- or salt-restricted conditions exhibit preferences to and avoidances from water or salt depending on their homeostatic needs. Firstly, we demonstrated that our behavioral task was suitable for measuring hedonic and aversive behavioral responses to sweet and bitter outcomes, respectively, as well as for quantifying excitatory and inhibitory responses of dopamine neurons in the VTA. Next, through recording of dopamine neuron activity in the VTA and dopamine release in the NAc core using fiber photometry, we found that mesolimbic dopaminergic responses were bidirectionally regulated in response to water or salt intake, depending on internal water or salt levels. Furthermore, we found that this state-dependent and bidirectional activity of dopaminergic circuits could be qualitatively simulated by a homeostatic reinforcement learning (HRL) model^31,32. Our results strongly suggest that dopaminergic activity reflects the value of food in relation to the water and mineral balance in the body.

Results

Behavioral and dopaminergic responses to appetitive and aversive outcomes

In this study, we first developed a single-drop brief access test to measure liquid food palatability and dopaminergic reactivity in mice. Specifically, we investigated licking responses to sucrose (sweet) and quinine (bitter) solutions in water-restricted mice. During the test, we recorded the activity of midbrain dopamine neurons in response to liquid consumption in freely-moving animals. This was achieved by virally expressing the fluorescent calcium sensor jGCaMP8s in a cre-dependent manner in VTA dopamine neurons of DAT-cre mice, and observing fluorescence through an implanted optical fiber (Fig. 1a, b). In the single-drop brief access test, mice could earn either one drop (10 μL) of sucrose (300 mM) or quinine (1 mM) by licking a drinking spout (Fig. 1c, d).

**Fig. 1: Licking responses and neural responses of midbrain dopamine neurons to sweet and bitter tastants.**

Analysis of licking behavior revealed that mice showed preference or aversion to sucrose or quinine, respectively, in the single-drop brief access test. In peri-event-time histograms (PETHs) of licking responses during liquid consumption (Fig. 1e), it was revealed that mice showed sustained licking responses to sucrose but immediate cessation of licking when quinine was delivered (Fig. 1e). Area-under-the-curve (AUC) analyses of licking responses (Fig. 1f) demonstrated that mice showed greater licking responses to sucrose than to quinine. Furthermore, the analysis of the licking microstructure^33,34 revealed that mice showed longer bout durations (Fig. 1g), a larger number of licks per bout (Fig. 1h), and a smaller lick rate in each bout (Fig. 1i) during sucrose consumption compared to those of quinine. Collectively, these results suggest that sucrose has a higher palatability than quinine and that the palatability of a liquid can be accurately measured using our task.

Fiber photometry recording revealed a bidirectional change in VTA dopamine neuron activity in response to sucrose and quinine (Fig. 1j, k, Fig. S1, S2). PETH analysis revealed that the activity of VTA dopamine neurons was increased in response to sucrose intake, and transiently increased but immediately reduced to below zero in response to the ingestion of quinine (Fig. 1j). This multiphasic response, an initial excitation followed by suppression, of midbrain dopamine neurons to aversive stimuli is in line with that reported in a previous electrophysiology study³⁵. Due to this multiphasic response pattern of dopamine neurons after liquid consumption, we separately analyzed the positive and negative components of dopamine neuron activity AUCs³⁶ (Fig. 1k). We found that the positive component of dopamine neuron activity change during sucrose intake was significantly higher than that of quinine intake, whereas the negative component was larger during quinine consumption. We also analyzed the covariance between licking responses and dopamine neuron activity change by correlation and linear regression (Fig. 1l), and found that licking was positively correlated with dopamine neuron activity. Taken together, these findings indicate that our behavioral task in combination with fiber photometry in vivo imaging was suitable to assess the palatability of liquid stimuli and the resulting dopaminergic response.

State-dependent, bidirectional responses of dopamine neurons in the VTA to water and salt

Next, we tested state-dependent preference or aversion to water or salt in the same animal by manipulating the internal water or sodium level in mice through dehydration or salt restriction (Fig. 2a–c, Fig. S3). As with the previous experiment, DAT-cre mice were infused with a cre-dependent jGCaMP8s into the VTA, above which an optic fiber was implanted for recording of fluorescence (neural activity) (Fig. 2b). Mice then underwent a period of water-restriction (water-restriction phase 1), followed by a period of salt restriction (salt restriction phase), followed by a second period of water-restriction (water-restriction phase 2) to evaluate the effect of different internal states on motivated behavior and dopamine neuron activity (Fig. 2a). During these water- or salt-restricted states, mice could earn either one drop (10 μl) of fresh water (water) or salt-water (salt; 300 mM or 750 mM) by licking the drinking spout (Fig. 2c).

**Fig. 2: State-dependent preference/aversion and bidirectional responses of midbrain dopamine neurons to water or salt intake.**

Analysis of licking behavior revealed that mice showed preference or aversion to water or salt depending on the water or salt-restricted state (Fig. 2d, f, g). In water-restriction phase 1, mice showed sustained licking responses to water (Fig. 2d, left). Conversely, they started but immediately stopped licking when salt was delivered (Fig. 2d, left). In the salt restriction phase, mice showed more licking in response to salt delivery than to water delivery (Fig. 2d, middle). In water-restriction phase 2, we again observed sustained licking to water delivery and immediate cessation of licking to salt delivery (Fig. 2d, right). Next, we analyzed the AUCs of licking behavior (Fig. 2e) and the licking microstructure (Figs.2f and S4). This analysis revealed that mice showed more licking responses to water compared to salt under water restriction and more licking responses to salt when desalted. Furthermore, we also confirmed that increased licking responses to salt under salt restriction disappeared when the epithelial sodium channel (ENaC) was blocked by amiloride¹² (Fig. S5).

Fiber photometry recording revealed a state-dependent and bidirectional change in VTA dopamine neuron activity in response to water or salt, suggesting flexible value coding based on homeostatic needs (Figs. 2g, h, i and S6). In water-restriction phase 1, the activity of dopamine neurons was increased in response to water intake and was transiently increased but immediately reduced to below zero in response to the ingestion of high salt (Fig. 2g, left). Interestingly, this relationship between liquid intake and dopamine neuron activity was reversed in the salt restriction phase, where activity increased during salt intake but was inhibited during water intake (Fig. 2g, middle). Finally, in water-restriction phase 2, we observed an increase in dopamine neuron activity during water consumption and a transient increase followed by strong inhibition of activity during salt consumption, similar to that observed during water-restriction 1 (Fig. 2g, right). Analysis of the AUCs of dopamine neuron responses revealed that, under both water-restriction phases, the positive component of responses to water intake was significantly higher than that to salt intake. Conversely, under salt-restricted conditions, dopamine neuron responses were higher during salt intake than during water intake (Fig. 2h). Notably, the negative component of dopamine neuron activity during salt ingestion was larger than that during water under both water-restricted conditions, and was larger during water consumption than that during salt consumption when animals were salt-restricted (Fig. 2h). Importantly, we also found a significant positive correlation between licking and dopamine neuron responses (Fig. 2j) in each phase suggesting that VTA dopamine neurons encode state-dependent value signals linked to consummatory behavior.

State-dependent, bidirectional dopamine release in the nucleus accumbens in response to water and salt

Next, we measured changes of dopamine release levels in the NAc core²⁵ by taking advantage of the dopamine sensor, GRAB-DA2m (Figs. 3a, b and S7). Analysis of licking behavior confirmed the replication of state-dependent preference and aversion to water and salt in mice, consistent with findings from the previous experiment. PETH analysis revealed that mice showed sustained licking behavior in response to the consumption of water, while their licking ceased immediately when salt was delivered under water-restriction phases (Fig. 3c, left, right). Conversely, under salt restriction, mice showed higher licking responses to salt compared to water (Fig. 3c, middle). Analysis of AUCs and the licking microstructure revealed that mice showed greater licking responses to water compared to salt under water restriction, and more licking responses to salt under salt restriction (Figs. 3d, e and S8).

**Fig. 3: State-dependent bidirectional change of dopaminergic responses to water or salt ingestion in the NAc.**

Fiber photometry recording revealed nutritional state-dependent, bidirectional changes in NAc dopamine in response to water or salt intake, suggesting state-dependent value coding similar to that observed in VTA dopamine neurons (Figs. 3f, g and S9). Under water restriction, dopamine increased in response to water intake but was transiently increased and then immediately reduced to below baseline during high-concentration salt consumption (Fig. 3f, left, right). However, when desalted, dopamine increased in response to salt intake but was reduced to below baseline during water intake (Fig. 3f, middle). Analysis of AUCs revealed that under both water-restriction conditions, the positive component of dopamine was significantly higher during water intake than salt intake, whereas under salt restriction, it was higher during salt intake than water intake (Fig. 3g). In contrast, under both water-restricted conditions, the negative component of dopamine change was larger during salt intake than water intake, while under salt restriction, it was larger during water intake than salt intake (Fig. 3g). We also assessed the covariance between licking responses and dopamine change using correlation and linear-regression analyses (Fig. 3h). As a result, we found that increased licking was positively associated with higher dopamine levels in each water/salt-deprivation condition, suggesting a significant relationship between state-dependent value coding by NAc dopamine and consummatory behavior.

A homeostatic learning model explains state-dependent activity in the dopaminergic system

To better understand the computational principles underlying the dopaminergic responses observed in our animal experiments, we utilized an HRL model. This framework enabled us to simulate how internal physiological states, such as water or sodium deprivation, influence value-based decision-making and dopaminergic prediction error signaling. By assessing whether the model could qualitatively reproduce both the behavioral and neural outcomes across deprivation conditions, we aimed to determine whether homeostatic control mechanisms could explain the observed patterns of intake and NAc dopamine activity.

We conducted simulations of the animal experiments under the six deprivation/testing conditions described above and examined the outcomes during water or sodium intake tests (Fig. 4a, b). Notably, under each condition, the HRL model’s temporal difference (TD) errors (Fig. 4c) and estimated number of licks (Fig. 4d) qualitatively paralleled dopaminergic activity (Figs. 2h and 3g) and licking responses (Figs. 2e and 3d), respectively, seen in our animal experiments. This correspondence suggests that the HRL framework can capture key trends in dopaminergic signaling associated with changes in internal states and consummatory behavior.

**Fig. 4: Simulations of state-dependent changes in dopamine and licking responses using the homeostatic reinforcement learning model.**

We describe the model’s behavior in detail for each condition as follows. Under the water-deprived condition tested with water (WD-W), the model successfully reproduced the observed behavioral patterns (Fig. 4e). During training, water intake reduced the internal drive (D) as the hydration state moved towards its ideal setpoint, yielding positive reward (R). This increased the state-action value (Q), reinforcing the intake behavior. At the start of the test phase, the model retained this Q-value despite being re-initialized to a deprived state, resulting in positive TD errors upon water consumption. On the contrary, in the WD-300 condition (water-deprived, tested with 300 mM NaCl), while training similarly reinforced water intake (Fig. 4f), during the test phase, the beneficial effect of partial hydration was counteracted by the aversive effect of consuming excessive salt, resulting in smaller rewards and smaller positive TD errors relative to WD-W. Finally, in the WD-750 condition (water-deprived, tested with 750 mM NaCl), the high salt content had a stronger negative impact, leading to negative rewards and negative TD errors (Fig. 4g).

In contrast to the water-deprived condition, a different pattern of behavior emerged in the model under sodium-deprived conditions. When mice were tested with water (SD-W), it further diluted sodium levels, producing negative rewards and negative TD errors (Fig. 4h). In the SD-300 condition (sodium-deprived, tested with 300 mM NaCl), the positive effect of sodium replenishment partially offset the negative effects of excess water intake, resulting in modest positive rewards and TD errors (Fig. 4i). Finally, in the SD-750 condition (sodium-deprived, tested with 750 mM NaCl), the strongly positive effect of sodium replenishment outweighed the negative effects of excess water intake, resulting in high rewards and large positive TD errors (Fig. 4j).

Together, these simulation results demonstrate that the HRL model qualitatively captures the dopaminergic dynamics observed in our animal experiments and offers a computational framework for understanding state-dependent value coding.

Discussion

Although value coding in the mesolimbic dopamine pathway has been reported in previous studies, its nutritional state-dependency has not been fully substantiated by experimental evidence. In the current study, we developed a single-drop brief access test and successfully revealed both excitatory and inhibitory dopaminergic activity in response to water or salt ingestion, depending on the homeostatic needs of the animal. Furthermore, we found that this bidirectional modulation of dopamine signaling could be recapitulated by an HRL model. Together, our experimental and computational results strongly support the idea that value coding in the mesolimbic dopamine pathway is critically shaped by the internal nutritional state.

Previous studies have reported the representation of value information in dopamine neuron activity. Specifically, the majority of midbrain dopamine neurons are activated in response to reward but inhibited in response to punishment^18,20. It has been found that the increase in dopaminergic response to reward is dependent on the internal state of the animal. For example, water intake during dehydration^23,24, food intake during fasting^15,21, protein intake during protein restriction^37,38,39, and salt intake under sodium restriction^25,40,41,42 all induce stronger behavioral consummatory responses as well as excitatory responses in dopamine neurons or accumbal dopamine release, indicating that consumption to satisfy the homeostatic needs acts as a powerful reward. In the present study, mice exhibited strong preferences for water during water restriction and for salt intake during salt restriction. Furthermore, we also found that strong activation of VTA dopamine neurons and dopamine release in the NAc core were elicited during these reward intakes. These results support the state-dependent nature of reward-induced dopamine neuron activation that was reported in previous studies. However, it has also been suggested that distinct populations of dopamine neurons can encode the saliency of stimuli (i.e., the absolute magnitude of value), and that these neurons are activated not only in response to rewarding but also to novel or aversive stimuli^17,18,27,28. Furthermore, it is also plausible that pauses in dopaminergic activities reflect precise stimulus encoding, not the negative value of the stimulus. Therefore, it is necessary to investigate whether dopamine neurons exhibit opposite responses to reward and punishment in order to examine the representation of value in these neurons. In the present study, we demonstrated that salt intake during water restriction and water intake under salt restriction suppressed consumption behavior in mice, and that intake of these aversive stimuli caused inhibition in VTA dopamine neurons and reduced dopamine release in the NAc core. Our results suggest that both reward-induced excitatory responses and punishment-induced inhibitory responses in the mesolimbic dopamine pathway are strongly dependent on the water and salt balance in the body. Overall, our results provide the first direct evidence that dopaminergic encoding of value is dependent on internal physiological state.

The specific neural circuits that drive the excitatory dopaminergic responses to the ingestion of water or salt reward, depending on the homeostatic needs, remain unidentified. According to previous studies, water ingestion during dehydration has been shown to increase dopaminergic activity through both oral sensation⁴³ and gastrointestinal signaling⁴⁴. It was reported that the immediate oral sensation of water triggers a rapid activation of dopamine neurons, while the slower absorption of water in the digestive system causes a gradual increase in midbrain dopamine activity, which can take several minutes to occur⁴⁴. Given that the dopaminergic response observed in our study emerged within a few seconds, it is plausible that this excitatory dopaminergic response was driven by rapid sensory signals triggered by the detection of water in the oral cavity and/or the throat, and then transmitted through the trigeminal nerve to the central nervous system⁴⁵. It has been shown that activity in lamina terminalis (LT) regions, (SFO, MnPO, and OVLT) cause thirst, and that the activation of excitatory neurons in the SFO is especially critical for dopamine release induced by water intake during dehydration^24,^44,46. Importantly, however, inhibition of excitatory neurons in the SFO does not elicit dopaminergic neuron activity or dopamine release in the NAc⁴⁴. Likewise, activation of inhibitory water-satiation neurons expressing GLP1 receptors in the SFO does not directly affect dopamine activity⁴⁶. These previous findings suggest that while activation of the thirst-driving system is a prerequisite for water intake to be rewarding and engage dopaminergic neurons, a mere reduction in the activity in this thirst system may not be sufficient to trigger increased activity in midbrain dopamine neurons. It has been reported that dopaminergic activation induced by water absorption from the gastrointestinal tract during dehydration requires the activity of inhibitory neurons in the lateral hypothalamus⁴⁶. However, it is unclear whether this mechanism could also account for the rapid dopaminergic response that occurs within seconds during spontaneous drinking behavior. Another study suggests that neurons in various brain regions projecting directly to VTA dopaminergic neurons respond to reward in the same way as dopaminergic neurons⁴⁷, suggesting that the rewarding effect of water may be induced through the activity of distributed brain circuits rather than a single brain region.

The acute rewarding effect of salt depends primarily on the peripheral tongue, and particularly requires an influx of sodium ions into specific taste receptor cells (TRCs) through the ENaC^12,25. In accordance with this, we showed that hedonic responses to salt reward in salt-depleted mice disappeared after amiloride treatment, an antagonist of ENaCs. Although the peripheral mechanism has been revealed, the neural circuitry responsible for its rewarding effect largely remains unknown. Salt craving has been demonstrated to be initiated by the activation of specific cells within distinct brain regions. Notably, the SFO, the pre-locus coeruleus, and the PBN have been found to play significant roles in salt craving^6,11,12,48. Furthermore, previous studies have revealed that input from the SFO to the bed nucleus of the stria terminalis (BNST) promotes salt craving, while input from the PBN to the BNST inhibits salt craving by activating inhibitory neurons in the BNST ^6,48. The activity of specific projections from the BSNT to the VTA has been reported to control both reward and aversive behavior⁴⁹, likely through the stimulation of distinct neural populations in the striatum^50,51. Taken together, these studies suggest that the BNST may play a role in translating salt appetite into approach behavior.

In the present study, high salt intake in the dehydrated state and water intake in the desalted state each functioned as a punishment, suppressing consummatory licking and dopamine neuron activity. Previous research indicates that the perception of water is mediated by the activity of specific TRCs, suggesting that these TRCs play important roles in the aversive effect of excessive water ingestion⁴³. In contrast, the aversive effects of high salt intake may depend on bitter and acid receptors⁵² as well as chloride channels⁵³ rather than ENaCs in TRCs. Regarding the central mechanism, there exist several specific neuronal populations expressing specific genes such as calcitonin-related polypeptide alpha (Calca), oxytocin receptor, and serotonin 2c receptor, within the PBN that exert an inhibitory effect on water- and salt-seeking behaviors^13,54,55. Therefore, it is plausible that these cell populations in the PBN are important for the aversion that arises from excessive salt and water intake. Furthermore, based on previous findings that neural input from the PBN to the VTA decreases feeding behavior^56,57, it is possible that the reduction in VTA dopamine neuron activity and dopamine levels in the striatum induced by excessive salt or water intake may be mediated by the PBN-VTA pathway.

Prior research has developed the HRL model to explain state-dependent learning and reward-seeking behaviors³². Importantly, our previous study showed that this HRL model could explain salt-seeking behavior as well³¹. Furthermore, another study revealed that the HRL model can also explain enhanced dopamine release during salt intake under salt-deprived conditions by assuming a one-dimensional state space representing the internal sodium level⁴⁰. However, it remains unclear whether the HRL can fully explain the state-dependent and flexible neural responses that either activate or suppress dopamine neurons. Indeed, a previous simulation study excluded testing dopaminergic responses to stimuli that exacerbate physiological needs, deeming such conditions as unsimulatable⁴⁰. In the present study, we assumed a two-dimensional state space of water and salt in the HRL model to tackle this question. As a result, we found that (1) state-dependent behavioral preference and aversion to water or salt, (2) enhanced excitatory dopaminergic responses during water intake under water deprivation and salt intake under salt-deprivation, and (3) the suppression of dopaminergic activity during salt intake under water deprivation and water intake under salt-deprivation, can all be qualitatively well-simulated by our HRL model. The results of our study suggest that the dopaminergic response to food intake is strongly dependent on homeostatic needs.

Both current and prior studies have demonstrated a significant correlation between dopaminergic response and food palatability, suggesting a potential causal relationship between them. However, it is important to acknowledge that dopaminergic activity may also reflect the intensity of behavioral responses such as licking. Moreover, it should be noted that the neuropsychological mechanism by which the dopaminergic system regulates eating behavior remains under discussion. Berridge and colleagues showed that manipulation of mesolimbic dopamine does not alter hedonic taste reactivity but significantly impacts appetitive food-seeking behavior^58,59,60. Supporting this, a more recent study indicated that optogenetic inhibition of dopamine neurons in the VTA during the consummatory period did not affect consummatory licking itself, although it impaired the maintenance of cue-guided appetitive food seeking^61,62. These findings suggest that dopaminergic responses during food consumption function as motivational or learning signals to sustain seeking behavior without influencing hedonic reactions, which serve as an index of palatability. Conversely, dopamine receptor blockade is known to decrease licking bout length, which is another index of palatability ³³. Furthermore, another recent study demonstrated that optimized optogenetic activation and inhibition of dopamine neurons in the VTA bidirectionally control licking bout length⁶³, suggesting a role for dopamine in the regulation of food palatability. Thus, dopamine may contribute to eating behavior by regulating its specific aspects (i.e., hedonicity, motivation, learning) or multiplexed processes. Importantly, in any case, our findings regarding nutritional state-dependent value coding in the mesolimbic pathway further contribute to elucidation of the dopaminergic mechanisms controlling food consumption.

Imbalances in sodium and water appetite are implicated in various pathophysiological conditions. For example, psychogenic polydipsia, often linked to psychiatric disorders such as schizophrenia, involves excessive water intake that suppresses plasma osmolality and dilutes serum sodium, leading to hyponatremia⁶⁴. This chronic overhydration blunts hypothalamic osmoreceptor responsiveness and alters hormonal regulation. Sodium appetite typically diminishes in the hyponatremic state, but abrupt fluid restriction may cause rapid sodium concentration increases and hypernatremia. Our results facilitate a deeper understanding of the disruption in the neural mechanism that adaptively integrates the monitoring of the internal nutritional state and executes consummatory behavior.

In summary, the current study revealed bidirectional excitatory and inhibitory responses of dopamine neurons in the VTA and dopamine release in the NAc during food intake, depending on the internal salt-water balance in mice. Our results suggest that dopamine neurons control flexible approach and avoidance behaviors, as well as their learning, by encoding the value of food in a nutritionally state-dependent manner. These findings contribute to the elucidation of neural circuits controlling appetitive behaviors and their computational mechanisms, as well as the deeper understanding of the pathophysiology underlying psychiatric disorders associated with dysregulated appetite.

Methods

Subjects

Sixteen DAT-cre mice (Slc6a3, The Jackson Laboratory; 9 males and 7 females) that were 8–12 weeks old and on a C57BL/6JJcl background, and fourteen 8-week-old male C57BL/6JJcl mice (CLEA, Tokyo, Japan) were used. Mice weighed 20–25 g and were provided food and water ad libitum until behavioral experiments were started. Mice were singly housed on a 12-h light/dark cycle (0800/2000), and experiments were carried out during the light cycle. All animal experiments were approved by the Animal Experimental Committee of the Institute for Protein Research, University of Osaka (29-02-1 and R04-01-2).

Stereotaxic cannula implantation and virus injection

All surgeries were conducted under isoflurane gas anesthesia (1.5–3%) and using a stereotaxic frame (RWD Life Science, Shenzhen, China). For recording of dopamine neuron activity, 400 nl of AAV9-hSyn-FLEX-jGCaMP8s (2.7 × 10¹³GC/ml, Addgene, #162377, MA, USA) mixed with 100 nl of AAV9-CAG-FLEX-tdTomato (3.8 × 10¹³GC/ml, Addgene, #28306, MA, USA) was unilaterally injected into the VTA (coordinates in mm: AP –3.3, ML ± 0.4 from bregma, and DV − 4.5 from the brain surface). For recording of dopamine release, 400 nl of AAV9-hSyn-GRAB-DA2m (2.5 × 10¹³GC/ml, Addgene, #140553, MA, USA; 2.5 × 10¹³GC/ml, WZ Bioscience Inc, MD, USA) mixed with 100 nl of AAV5-hSyn-mCherry (2.3 × 10¹³GC/ml, Addgene, #114472, MA, USA) was unilaterally injected into the nucleus accumbens core (AP + 1.4, ML ± 1.1 from bregma, and DV − 3.4 from the brain surface). In both experiments, injections were performed using a Nanoliter 2020 Injector (WPI, FL, USA) at a rate of 60 nl/min, and the injection glass capillary remained in place for 5–10 min to reduce backflow. Immediately after viral infusions, a 200 µm diameter (NA 0.37) optic fiber (RWD Life Science, Shenzhen, China) was implanted into the VTA (coordinates in mm: AP –3.3, ML ± 0.4 from bregma, and DV − 4.3 from the brain surface) or nucleus accumbens core (AP + 1.4, ML ± 1.1 from bregma, and DV − 3.2 from the brain surface), and fixed in place with Superbond (Sunmedical, Shiga, Japan) and the dental cement (UNIFAST; GC, Tokyo, Japan).

Apparatus

Water- or salt-ingestive behavior was assessed in a metal operant chamber with a grid-floor and a drinking spout protruding from one sidewall (Med Associates, Inc., VT, USA). The chamber was illuminated by a house light. Our custom-made drinking spout had three holes through which three different liquid stimuli could be delivered. Liquid delivery through the spout was controlled by pinch valves (#HYN-3, CKD, Aichi, Japan). Licking responses to the feeding spout were counted with a contact lickometer (Med Associates, Inc., VT, USA). MED-PC software (Med Associates, Inc., VT, USA) was used for the quantification of spout-licking behavior and the control of valve opening for liquid delivery.

Test of sucrose and quinine

Water restriction started at least two days before pre-training and continued throughout the pre-training and test phases. Animals were allowed limited daily access to the water for 10 min in the home cage after daily pre-training or test sessions. During the pre-training, mice were trained on a fixed ratio (FR) 10 schedule for 3 days where they had to lick the drinking spout 10 times to earn one drop of deionized water reward (10 μl). The water rewards were delivered randomly through one of three holes on the spout. This 3-day FR10 training was followed by variable interval schedule (VI) training in which the water reward was delivered through the drinking spout as soon as mice exhibited spout-licking behavior after the inter-trial-interval (ITI) had passed. Each ITI was randomized with a uniform distribution between 10 s and 30 s, with a mean of 20 s (i.e., VI20 schedule). FR10 and VI20 trainings were finished when mice consumed 30 or 60 rewards, respectively, or after 30 min had passed. Once animals showed frequent licking for the water reward in the VI20 training, a single-drop brief access test was conducted. In this test, one of two different tastants, sucrose (300 mM) or quinine (1 mM) solution, was randomly delivered through different holes in the feeding spout. This test was conducted under a VI20 schedule and finished when mice consumed each tastant 20 times or after 60-min had passed.

Test of water and salt: water-restriction phase 1

Animals were water-restricted and trained in the operant-licking task to get deionized water for several days in the pre-training. They were then finally tested for taste reactivity to deionized water, 300 mM, or 750 mM NaCl solution delivered through three different holes in the drinking spout. Procedures for the water restriction, pre-training, and testing were the same as those of the aforementioned test of sucrose and quinine.

Test of water and salt: salt restriction phase

After finishing water-restriction phase 1, food and water in the home cage were replaced with a sodium-free diet (Oriental Yeast Co., LTD, Tokyo, Japan) and deionized water (Milli-Q water), respectively. One day before pre-training in the salt restriction phase, mice were injected with furosemide (Nichi-iko, Toyama, Japan; 50 mg/kg, i.p.) to facilitate the excretion of sodium from their body. In pre-training, mice were trained in the operant-licking task under a VI20 schedule for at least 3 days to earn a 300 mM NaCl solution. Once animals showed frequent operant-licking behavior, the test was conducted. Similar to the test under water restriction, licking responses to Milli-Q water, 300 mM, or 750 mM NaCl solution were tested under salt restriction.

Test of water and salt: water-restriction phase 2

After finishing the salt restriction phase, animals were subjected to water restriction again. In this phase, animals were trained in the operant-licking task under a VI20 schedule to get deionized water for several days and then finally tested in the single-drop brief access test. The procedures for the water restriction and testing were the same as those of the water restriction phase 1.

Behavioral measurement

During testing, each individual lick was measured. For the analysis of taste-evoked licking responses, we drew peri-event-time histograms (PETH) of the averaged licking frequency and calculated the area under the curve (AUC). Furthermore, we also analyzed the microstructure of consummatory licks. A lick bout was defined as the first unit consisting of at least three consecutive licks and separated by a pause of 250 ms after the reward delivery. We analyzed bout duration, licks per bout, and lick rate in bout^33,34.

The effect of ENaC blockade

The ENaC was blocked by mixing its antagonist, amiloride hydrochloride (Tokyo Chemical Industry Co., Ltd., Tokyo, JAPAN; 0.1 mM), into each liquid delivered (i.e., water, NaCl 300 mM and 750 mM). In this test, mice were pretrained and tested under water restriction and salt restriction with or without amiloride.

Fiber photometry

Fluorescent signals were measured using a CMOS camera-based fiber photometry system (Doric Lenses Inc., QC, Canada) as described previously⁶⁵. Briefly, fluorescence signals were obtained by exciting cells expressing jGCaMP8s or GRAB-DA2m with a 470 nm LED (30 µW at fiber tip), while dopamine-independent signals were obtained by exciting cells expressing tdTomato or mCherry with a 560 nm LED (30 µW at fiber tip). 470 nm and 560 nm LED lights were alternately pulsed at 12 Hz, and emitted lights were recorded using a CMOS camera, which acquired video frames of fluorescence from the end of the patch cord (NA = 0.37, 200 mm core; RWD Life Science, Shenzhen, China) at the same frequency. The timestamps between the fiber photometry and the behavioral chamber were synchronized using a TTL pulse, which was generated from the behavioral chamber and sent to the fiber photometry system.

Video frames (fluorescent signals) were analyzed using Doric Neuroscience Studio version. 6.2.3.0. (Doric Lenses Inc., QC, Canada). 470 nm-derived and 560 nm-derived signals were used for analysis. To compute the dF/F for signals, the raw signal was first baseline-corrected by subtracting its moving median, calculated using a 60-second sliding window. The resulting difference was then normalized by the same moving median, yielding dF/F = (raw signal – moving median)/moving median. This dF/F was z-scored by using the z score function in MATLAB and used for further analysis. For the analysis of tastant-evoked signal change, we drew PETHs of the averaged z-scored signal among animals, and also calculated positive and negative components of the AUC from 0 to 8 s after the onset of the first consummatory licking. Positive and negative AUCs were defined as the AUC above or below zero, respectively.

Histological verification

Animals were deeply anesthetized with ketamine (100 mg/kg) and xylazine (20 mg/kg), and transcardially perfused with 0.01 M PBS followed by 4% paraformaldehyde (PFA) in 0.1 M PB (pH 7.4). Brains were removed and post-fixed with 4% PFA at 4 °C for 2 days. After cryoprotection, brains were embedded in O.C.T compound (Sakura Finetek Japan CO., Ltd, Tokyo, Japan) and cryosectioned at 40 µm. Sections were mounted with antifade mounting medium with DAPI (#ab104139, ABCAM, Cambridge, England). Images were acquired using a Keyence BZ-X800 microscope (Keyence, Osaka, Japan).

Statistics

All averaged data were expressed as the mean ± SEM. Raster plots of licking, heatmaps of dopamine change, and PETHs of reward-related licking and dopamine change were created using Matlab software (MathWorks, CA, USA). All statistical analyses were performed using Prism (Graphpad, CA, USA) software. Normality of data distribution was checked by the Shapiro-Wilk test. Statistical methods are shown in each figure legend for each experiment. All ANOVAs were followed by post-hoc Tukey’s multiple comparisons test. For the analysis of the correlation between the number of licks and the dopaminergic response, Pearson’s correlation coefficient and linear regression analysis were performed. The F-test was used to assess whether the slope of the regression line was significantly different from zero. Scatter among replicates (i.e., trials) was taken into account. For all statistical tests, significance was assessed using an α value of 0.05. All statistical results are shown in the Supplementary Data.

Computational modeling

To elucidate how variations in dopaminergic neuronal activity, which depends on variations in internal states of hydration and sodium levels, influence reward prediction and choice behavior, we employed a decision-making computational model. Specifically, we utilized the Homeostatic Reinforcement Learning (HRL, HRRL) model, a robust computational framework for representing internal state-dependent decision-making processes such as those influenced by hydration and sodium levels. Previous research has demonstrated that the HRL model could effectively explain how organisms maintain internal states such as adaptive levels of sodium and water through decision-making^31,32. By simulating the dopamine changes and behavior of mice during the operant-licking task using the HRL model, we aimed to interpret the computational processes underlying changes in dopaminergic neuronal activity, which are greatly influenced by differences in the internal state, and to provide a functional explanation for these dynamics.

Computational model: HRL model

The HRL model employed in this study is based on the assumption that homeostasis maintenance can be understood as a reinforcement learning process. It frames the minimization of deviations of internal states from their optimal levels as the calculation of action values that maximize the total reward. In the HRL model, a multidimensional metric space, where each dimension represents an internal state (e.g., hydration level or sodium concentration), is defined as the “homeostatic space” (a schematic image of the structure of the homeostasis space is shown in Fig. 4b). Within this homeostatic space, the drive function D(H_t) is defined as the distance between the internal state of the i-th component at time t (e.g., hydration level or sodium concentration), H^i_t, and the ideal internal state H^*_i:

$$D\left({{\boldsymbol{H}}}_{{\boldsymbol{t}}}\right)=\root{m}\of{\mathop{\sum }\limits_{i=1}^{N}{\left|{H}^{* i}-{H}_{t}^{i}\right|}^{n}}$$

(1)

Here, m and n are free parameters that define the distance. As the internal state approaches the ideal state, the value of the drive function decreases. Based on this drive function, the reward r_t is determined as the change in the value of the drive function from time t to t + 1. For example, if H_t represents the internal state of sodium, and K_t is defined as the amount of sodium ingested at time t, the relationship between the drive function and reward, derived from the internal state H_{t + 1} at time t + 1, is expressed as follows:

$$r\left({{\boldsymbol{H}}}_{{\boldsymbol{t}}},\,{{\boldsymbol{K}}}_{{\boldsymbol{t}}}\right)=D\left({{\boldsymbol{H}}}_{{\boldsymbol{t}}}\right)-D\left({{\boldsymbol{H}}}_{{\boldsymbol{t}}+{\bf{1}}}\right)$$

$$=D\left({{\boldsymbol{H}}}_{{\boldsymbol{t}}}\right)-D\left({{\boldsymbol{H}}}_{{\boldsymbol{t}}}+{{\boldsymbol{K}}}_{{\boldsymbol{t}}}\right)\,$$

(2)

Furthermore, the natural decline in sodium balance is implemented using a time decay constant tau, as expressed in the following equation. The value of tau can be adjusted based on the internal state. When the internal states are set to represent water and sodium, the model is formulated as follows:

$${H}_{t+1}^{{water}},\,{H}_{t+1}^{{sodium}}=\left(1-\frac{1}{{\tau }^{{water}}}\right){H}_{\,t}^{{water}},\,\left(1-\frac{1}{{\tau }^{{sodium}}}\right){H}_{\,t}^{{sodium}}$$

(3)

Combining these elements, the reward is calculated using the following equation:

$$r\left({{\boldsymbol{H}}}_{{\boldsymbol{t}}},\,{{\boldsymbol{K}}}_{{\boldsymbol{t}}}\right)=D\left({{\boldsymbol{H}}}_{{\boldsymbol{t}}}\right)-D({{\boldsymbol{H}}}_{{\boldsymbol{t}}+{\bf{1}}})$$

$$=D\left({H}_{t}^{{water}},\,{H}_{t}^{{sodium}}\right)-D\left(\left(1-\frac{1}{{\tau }^{{water}}}\right){H}_{t}^{{water}}+{K}_{t}^{{water}},\,\left(1-\frac{1}{{\tau }^{{sodium}}}\right){H}_{t}^{{sodium}}+{K}_{t}^{{sodium}}\right)$$

(4)

The update of action values based on reward prediction errors was modeled as a RL process using the Q-learning framework. In this model, the value Q_t(a) of an action a_t (e.g., water intake or taking no action) (Fig. 4a) is updated based on temporal difference (TD) errors through the Q-learning ^31,32,66 (Eq. 5):

$${Q}_{t}\left(a\right)\leftarrow {Q}_{t}\left(a\right)+\alpha {\delta }_{t}$$

$${\delta }_{t}={{r}_{t}+\mathop{{\rm{\gamma \; max}}}\limits_{{a}^{{\prime} }}{Q}_{t+1}\left({a}^{{\prime} }\right)-Q}_{t}(a)$$

(5)

Here, alpha represents the learning rate for Q_t(a). Action selection depends on the relative magnitudes of the values (Q-values) for each action and follows a soft-max function:

$${P}_{t}\left({a}_{t}^{k}\right)=\frac{\exp (\beta \cdot {Q}_{t}\left({a}_{t}^{k}\right))}{{\sum }_{j}\exp (\beta \cdot {Q}_{t}\left({a}_{t}^{j}\right))}$$

(6)

To facilitate a straightforward understanding of the HRL model algorithm, Fig. S10 illustrates a simulation of homeostasis maintenance for a one-dimensional internal state (e.g., water).

Simulation conditions and data recording

Simulations were conducted under six experimental conditions, defined by the initial internal state (water- or salt-deprived) and the solutions used during training and testing. Specifically, mice began in a water-deprived state (WD-W) and were trained to lick for water, followed by a test in which only water was available. Under the next two water-deprived conditions, mice were similarly trained with water but tested either with 300 mM NaCl (WD-300) or with 750 mM NaCl (WD-750). The remaining three conditions involved mice that were initially salt-deprived and trained with 300 mM NaCl. These mice were then tested with either water (SD-W), 300 mM NaCl (SD-300), or 750 mM NaCl (SD-750).

For all six conditions, we recorded the time series data on internal states (H), behaviors (a), rewards (R), and reward prediction errors (TD error).

Training, testing, and measurement procedures

In the three water-deprived conditions (WD-W, 300, 750), simulations began from a water-deprived state (H_0 = 100) and followed the same training protocol, in which subjects were trained to lick for water. Similarly, in the three salt-deprived conditions (SD-W, 300, 750), simulations started from a salt-deprived state (H_0 = 100), employed the same training regimen across all three conditions, this time training subjects to lick a 300 mM NaCl.

Each condition was followed by two tests, with only one solution offered during each test. We evaluated Q-values, the probability of intake behavior, actual behavior (a), intake volume (K), internal state (H), drive (D), reward (R), temporal difference errors (δ), and lick frequency. TD errors were quantified by pooling data from the two tests for each condition, as shown in Fig. 3c. Licking behavior in the model was measured as the total number of “intake” behaviors in a single training or test trial (Fig. 4d) or the number of intake behaviors in a certain period of time (Fig. 4e, f), regardless of whether the liquid was acquired. Detailed simulation parameters are available in Supplementary Table 2.

Data availability

The codes used in this study can be found at: https://github.com/YuukiUchida/WaterSaltDopamine_hRL.

References

Cannon, W. B. Organization for physiological homeostasis. Physiol. Rev. 9, 399–431 (1929).
Article Google Scholar
Geerling, J. C. & Loewy, A. D. Central regulation of sodium appetite. Exp. Physiol. 93, 177–209 (2008).
Article CAS PubMed Google Scholar
Gizowski, C. & Bourque, C. W. The neural basis of homeostatic and anticipatory thirst. Nat. Rev. Nephrol. 14, 11–25 (2018).
Article CAS PubMed Google Scholar
Augustine, V., Lee, S. & Oka, Y. Neural control and modulation of thirst, sodium appetite, and hunger. Cell 180, 25–32 (2020).
Article CAS PubMed PubMed Central Google Scholar
NODA, M. & MATSUDA, T. Central regulation of body fluid homeostasis. Proceedings of the Japan Academy, Series B 98, PJA9807B-01 (2022).
Matsuda, T. et al. Distinct neural mechanisms for the control of thirst and salt appetite in the subfornical organ. Nat. Neurosci. 20, 230–241 (2017).
Article CAS PubMed Google Scholar
Zimmerman, C. A. et al. A gut-to-brain signal of fluid osmolarity controls thirst satiation. Nature 568, 98–102 (2019).
Article CAS PubMed PubMed Central Google Scholar
Matsuda, T., Hiyama, T. Y., Kobayashi, K., Kobayashi, K. & Noda, M. Distinct CCK-positive SFO neurons are involved in persistent or transient suppression of water intake. Nat. Commun. 11, 5692 (2020).
Augustine, V. et al. Hierarchical neural architecture underlying thirst regulation. Nature 555, 204–209 (2018).
Article CAS PubMed PubMed Central Google Scholar
Allen, W. E. et al. Thirst-associated preoptic neurons encode an aversive motivational drive. Science 357, 1149–1155 (2017).
Article CAS PubMed PubMed Central Google Scholar
Jarvie, B. C. & Palmiter, R. D. HSD2 neurons in the hindbrain drive sodium appetite. Nat. Neurosci. 20, 167–169 (2017).
Article CAS PubMed Google Scholar
Lee, S. et al. Chemosensory modulation of neural circuits for sodium appetite. Nature 568, 93–97 (2019).
Article CAS PubMed PubMed Central Google Scholar
Park, S., Williams, K. W., Liu, C. & Sohn, J. W. A neural basis for tonic suppression of sodium appetite. Nat. Neurosci. 23, 423–432 (2020).
Article CAS PubMed PubMed Central Google Scholar
Zhou, Q. Y. & Palmiter, R. D. Dopamine-deficient mice are severely hypoactive, adipsic, and aphagic. Cell 83, 1197–1209 (1995).
Article CAS PubMed Google Scholar
Branch, S. Y. et al. Food restriction increases glutamate receptor-mediated burst firing of dopamine neurons. J. Neurosci. 33, 13861–13872 (2013).
Article CAS PubMed PubMed Central Google Scholar
de Jong, J. W. et al. A neural circuit mechanism for encoding aversive stimuli in the mesolimbic dopamine system. Neuron 101, 133–151.e7 (2019).
Article PubMed Google Scholar
Brischoux, F., Chakraborty, S., Brierley, D. I. & Ungless, M. A. Phasic excitation of dopamine neurons in ventral VTA by noxious stimuli. Proc. Natl. Acad. Sci. USA 106, 4894–4899 (2009).
Article CAS PubMed PubMed Central Google Scholar
Matsumoto, M. & Hikosaka, O. Two types of dopamine neuron distinctly convey positive and negative motivational signals. Nature 459, 837–841 (2009).
Article CAS PubMed PubMed Central Google Scholar
Schultz, W., Dayan, P. & Montague, P. R. A neural substrate of prediction and reward. Science 275, 1593–1599 (1997).
Article CAS PubMed Google Scholar
Cohen, J. Y., Haesler, S., Vong, L., Lowell, B. B. & Uchida, N. Neuron-type-specific signals for reward and punishment in the ventral tegmental area. Nature 482, 85–88 (2012).
Article CAS PubMed PubMed Central Google Scholar
Cone, J. J., McCutcheon, J. E. & Roitman, M. F. Ghrelin acts as an interface between physiological state and phasic dopamine signaling. J. Neurosci. 34, 4905–4913 (2014).
Article PubMed PubMed Central Google Scholar
Amo, R. et al. A gradual temporal shift of dopamine responses mirrors the progression of temporal difference error in machine learning. Nat. Neurosci. 25, 1082–1092 (2022).
Article CAS PubMed PubMed Central Google Scholar
Fortin, S. M. & Roitman, M. F. Challenges to body fluid homeostasis differentially recruit phasic dopamine signaling in a taste-selective manner. J. Neurosci. 38, 6841–6853 (2018).
Article CAS PubMed PubMed Central Google Scholar
Hsu, T. M. et al. Thirst recruits phasic dopamine signaling through subfornical organ neurons. Proc. Natl. Acad. Sci. USA 117, 30744–30754 (2020).
Cone, J. J. et al. Physiological state gates acquisition and expression of mesolimbic reward prediction signals. Proc. Natl. Acad. Sci. 113, 1943–1948 (2016).
Verharen, J. P. H. et al. Limbic control over the homeostatic need for sodium. Sci. Rep. 9, 1–11 (2019).
Article CAS Google Scholar
Menegas, W., Babayan, B. M., Uchida, N. & Watabe-Uchida, M. Opposite initialization to novel cues in dopamine signaling in ventral and posterior striatum in mice. Elife 6, 1–26 (2017).
Article Google Scholar
Menegas, W., Akiti, K., Amo, R., Uchida, N. & Watabe-Uchida, M. Dopamine neurons projecting to the posterior striatum reinforce avoidance of threatening stimuli. Nat. Neurosci. 21, 1421–1430 (2018).
Article CAS PubMed PubMed Central Google Scholar
Kutlu, M. G. et al. Dopamine release in the nucleus accumbens core signals perceived saliency. Curr. Biol. 31, 4748–4761.e8 (2021).
Article CAS PubMed PubMed Central Google Scholar
Sandhu, E. C. et al. Phasic stimulation of midbrain dopamine neuron activity reduces salt consumption. eNeuro 5, ENEURO.0064-18.2018 (2018).
Uchida, Y., Hikida, T. & Yamashita, Y. Computational mechanisms of osmoregulation: a reinforcement learning model for sodium appetite. Front. Neurosci. 16, 857009 (2022).
Keramati, M. & Gutkin, B. Homeostatic reinforcement learning for integrating reward collection and physiological stability. Elife 3, 1–26 (2014).
Article Google Scholar
D’Aquila, P. S. Dopamine on D2-like receptors ‘reboosts’ dopamine D1-like receptor-mediated behavioural activation in rats licking for sucrose. Neuropharmacology 58, 1085–1096 (2010).
Article PubMed Google Scholar
Davis, J. D. & Perez, M. C. Food deprivation- and palatability-induced microstructural changes in ingestive behavior. Am. J. Physiol. 264, R97–R103 (1993).
CAS PubMed Google Scholar
Fiorillo, C. D., Song, M. R. & Yun, S. R. Multiphasic temporal dynamics in responses of midbrain dopamine neurons to appetitive and aversive stimuli. J. Neurosci. 33, 4710–4725 (2013).
Article CAS PubMed PubMed Central Google Scholar
Loriaux, A. L., Roitman, J. D. & Roitman, M. F. Nucleus accumbens shell, but not core, tracks motivational value of salt. J. Neurophysiol. 106, 1537–1544 (2011).
Article CAS PubMed PubMed Central Google Scholar
Chiacchierini, G. et al. Protein appetite drives macronutrient-related differences in ventral tegmental area neural activity. J. Neurosci. 41, 5080–5092 (2021).
Article CAS PubMed PubMed Central Google Scholar
Chaumontet, C. et al. The protein status of rats affects the rewarding value of meals due to their protein content. J. Nutr. 148, 989–998 (2018).
Article PubMed Google Scholar
Murphy, M. et al. Restriction of dietary protein leads to conditioned protein preference and elevated palatability of protein-containing food in rats. Physiol. Behav. 184, 235–241 (2018).
Article CAS PubMed PubMed Central Google Scholar
Duriez, A., Bergerot, C., Cone, J. J., Roitman, M. F. & Gutkin, B. Homeostatic reinforcement theory accounts for sodium appetitive state- and taste-dependent dopamine responding. Nutrients 15, 1015 (2023).
Wagman, W. Sodium chloride deprivation: development of sodium chloride as a reinforcement. Science 140, 1403–1404 (1963).
Article CAS PubMed Google Scholar
Berridge, K. C., Flynn, F. W., Schulkin, J. & Grill, H. J. Sodium depletion enhances salt palatability in rats. Behav. Neurosci. 98, 652–660 (1984).
Article CAS PubMed Google Scholar
Zocchi, D., Wennemuth, G. & Oka, Y. The cellular mechanism for water detection in the mammalian taste system. Nat. Neurosci. 20, 927–933 (2017).
Article CAS PubMed Google Scholar
Grove, J. C. R. et al. Dopamine subsystems that track internal states. Nature 608, 374–380 (2022).
Article CAS PubMed PubMed Central Google Scholar
Krashes, M. J. Forecast for water balance. Nature 537, 626–627 (2016).
Article CAS PubMed Google Scholar
Augustine, V. et al. Temporally and spatially distinct thirst satiation signals. Neuron 103, 242–249.e4 (2019).
Article CAS PubMed PubMed Central Google Scholar
Tian, J. et al. Distributed and mixed information in monosynaptic inputs to dopamine neurons. Neuron 91, 1374–1389 (2016).
Article CAS PubMed PubMed Central Google Scholar
Matsuda, T., Kobayashi, K., Kobayashi, K. & Noda, M. Two parabrachial Cck neurons involved in the feedback control of thirst or salt appetite. Cell Rep. 43, 113619 (2024).
Jennings, J. H. et al. Distinct extended amygdala circuits for divergent motivational states. Nature 496, 224–228 (2013).
Article CAS PubMed PubMed Central Google Scholar
Hikida, T., Kimura, K., Wada, N., Funabiki, K. & Nakanishi Shigetada, S. Distinct roles of synaptic transmission in direct and indirect striatal pathways to reward and aversive behavior. Neuron 66, 896–907 (2010).
Article CAS PubMed Google Scholar
Macpherson, T. & Hikida, T. Role of basal ganglia neurocircuitry in the pathology of psychiatric disorders. Psychiatry Clin. Neurosci. 73, 289–301 (2019).
Oka, Y., Butnaru, M., Von Buchholtz, L., Ryba, N. J. P. & Zuker, C. S. High salt recruits aversive taste pathways. Nature 494, 472–475 (2013).
Article CAS PubMed PubMed Central Google Scholar
Kasahara, Y. et al. TMC4 is a novel chloride channel involved in high-concentration salt taste sensation. J. Physiol. Sci. 71, 23 (2021).
Carter, M. E., Soden, M. E., Zweifel, L. S. & Palmiter, R. D. Genetic identification of a neural circuit that suppresses appetite. Nature 503, 111–114 (2013).
Article CAS PubMed PubMed Central Google Scholar
Ryan, P. J., Ross, S. I., Campos, C. A., Derkach, V. A. & Palmiter, R. D. Oxytocin-receptor-expressing neurons in the parabrachial nucleus regulate fluid intake. Nat. Neurosci.20, 1722–1733 (2017).
Article CAS PubMed PubMed Central Google Scholar
Tsou, J. H. et al. Negative emotions recruit the parabrachial nucleus efferent to the VTA to disengage instrumental food seeking. J. Neurosci. 43, 7276–7293 (2023).
Article CAS PubMed PubMed Central Google Scholar
Nagashima, T. et al. State-dependent modulation of positive and negative affective valences by a parabrachial nucleus-to-ventral tegmental area pathway in mice. Front. Neural Circuits 17, 1273322 (2023).
Berridge, K. C., Venier, I. L. & Robinson, T. E. Taste reactivity analysis of 6-hydroxydopamine-induced aphagia: implications for arousal and anhedonia hypotheses of dopamine function. Behav. Neurosci. 103, 36–45 (1989).
Article CAS PubMed Google Scholar
Berridge, K. C. & Valenstein, E. S. What psychological process mediates feeding evoked by electrical stimulation of the lateral hypothalamus?. Behav. Neurosci. 105, 3–14 (1991).
Article CAS PubMed Google Scholar
Peciña, S., Cagniard, B., Berridge, K. C., Aldridge, J. W. & Zhuang, X. Hyperdopaminergic mutant mice have higher ‘wanting’ but not ‘liking’ for sweet rewards. J. Neurosci. 23, 9395–9402 (2003).
Article PubMed PubMed Central Google Scholar
Zessen, et al. Cue and reward evoked dopamine activity is necessary for maintaining learned Pavlovian associations. J. Neurosci. 41, JN-RM-2744-20 (2021).
Lee, K. et al. Temporally restricted dopaminergic control of reward-conditioned movements. Nat. Neurosci. 23, 209–216 (2020).
Article CAS PubMed PubMed Central Google Scholar
Zhu, Z. et al. Hedonic eating is controlled by dopamine neurons that oppose GLP-1R satiety. Science 387, eadt0773 (2025).
Rangan, G. K. et al. Clinical characteristics and outcomes of hyponatraemia associated with oral water intake in adults: a systematic review. BMJ Open 11, 46539 (2021).
Article Google Scholar
Kim, C. K. et al. Simultaneous fast measurement of circuit dynamics at multiple sites across the mammalian brain. Nat. Methods 13, 325–328 (2016).
Uchida, Y., Hikida, T., Honda, M. & Yamashita, Y. Heterogeneous appetite patterns in depression: computational modeling of nutritional interoception, reward processing, and decision-making. Front. Hum. Neurosci. 18, 1502508 (2024).

Download references

Acknowledgements

This study was supported by JSPS KAKENHI (JP21K15210 to T.M.; JP22H01105 to T.O.; JP20H00625 to Y.Y; JP23K24205, JP23K18163, and JP25K02547 to T.H.), AMED (JP22gm6510012 and JP24wm0625111 to T.O.; JP21wm0425010 and JP21gm1510006 to T.H.), JST CREST (JPMJCR21P4 to Y.Y.), JST SPRING (JPMJSP2138 to I.N., JPMJSP2120 to Y.U.), Salt Science Research Foundation Grants (2341 to T.O.; 2438 to T.H.), HOKUTO Foundation for the Promotion of Biological Science (to T.O.), LOTTE Foundation (to T.O.), Takeda Science Foundation (to T.O. and T.H.), Inamori Foundation (to T.O.), SR Foundation (to T.O.). Intramural Research Grant for Neurological and Psychiatric Disorders of NCNP (4–6, 6–9 to Y.Y.), the Collaborative Research Program of Institute for Protein Research, the University of Osaka (CRa-25-03) (to T.H.). The funders played no role in study design, data collection, analysis and interpretation of data, or the writing of this manuscript. Illustrations of animals in Figs. 1d, 2c, 3b were created in BioRender.

Author information

These authors contributed equally: Takaaki Ozawa, Issei Nakagawa, Yuuki Uchida.

Authors and Affiliations

Laboratory for Advanced Brain Functions, Institute for Protein Research, The University of Osaka, Osaka, Japan
Takaaki Ozawa, Issei Nakagawa, Mayuka Abe, Tom Macpherson & Takatoshi Hikida
Department of Biological Sciences, Graduate School of Science, The University of Osaka, Osaka, Japan
Takaaki Ozawa, Issei Nakagawa, Mayuka Abe, Tom Macpherson & Takatoshi Hikida
Department of Information Medicine, National Institute of Neuroscience, National Center of Neurology and Psychiatry, Tokyo, Japan
Yuuki Uchida & Yuichi Yamashita
Department of Neural Computation for Decision-Making, Brain Information Communication Research Laboratory Group, Advanced Telecommunications Research Institute International, Kyoto, Japan
Yuuki Uchida
Graduate School and Faculty of Pharmaceutical Sciences, Kyoto University, Kyoto, Japan
Tom Macpherson

Authors

Takaaki Ozawa
View author publications
Search author on:PubMed Google Scholar
Issei Nakagawa
View author publications
Search author on:PubMed Google Scholar
Yuuki Uchida
View author publications
Search author on:PubMed Google Scholar
Mayuka Abe
View author publications
Search author on:PubMed Google Scholar
Tom Macpherson
View author publications
Search author on:PubMed Google Scholar
Yuichi Yamashita
View author publications
Search author on:PubMed Google Scholar
Takatoshi Hikida
View author publications
Search author on:PubMed Google Scholar

Contributions

These authors contributed equally: T.O., I.N., Y.U.; conceptualization: T.O, Y.Y, T.H; data curation: T.O., I.N., Y.U.; formal analysis: T.O., I.N., Y.U., M.A.; funding acquisition: T.O., I.N., Y.U., T.M., Y.Y., T.H.; investigation: T.O., I.N., Y.U.; methodology: T.O., Y.U., Y.Y., T.H.; project administration: T.O., Y.Y., T.H.; supervision: T.O., Y.Y., T.H.; writing—original draft: T.O., I.N., Y.U., T.M., Y.Y., T.H.; writing—review and editing: T.O., I.N., Y.U., T.M., Y.Y., T.H.

Corresponding authors

Correspondence to Takaaki Ozawa, Yuichi Yamashita or Takatoshi Hikida.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementaly_Information

Supplementaly_Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Ozawa, T., Nakagawa, I., Uchida, Y. et al. Flexible value coding in the mesolimbic dopamine system depending on internal water and sodium balance. npj Sci Food 9, 197 (2025). https://doi.org/10.1038/s41538-025-00558-w

Download citation

Received: 17 April 2025
Accepted: 21 August 2025
Published: 30 September 2025
DOI: https://doi.org/10.1038/s41538-025-00558-w

Subjects

Abstract

Similar content being viewed by others

Dopamine subsystems that track internal states

Enhanced cognitive flexibility and phasic striatal dopamine dynamics in a mouse model of low striatal tonic dopamine

Individual differences in decision-making shape how mesolimbic dopamine regulates choice confidence and change-of-mind

Introduction

Results

Behavioral and dopaminergic responses to appetitive and aversive outcomes

State-dependent, bidirectional responses of dopamine neurons in the VTA to water and salt

State-dependent, bidirectional dopamine release in the nucleus accumbens in response to water and salt

A homeostatic learning model explains state-dependent activity in the dopaminergic system

Discussion

Methods

Subjects

Stereotaxic cannula implantation and virus injection

Apparatus

Test of sucrose and quinine

Test of water and salt: water-restriction phase 1

Test of water and salt: salt restriction phase

Test of water and salt: water-restriction phase 2

Behavioral measurement

The effect of ENaC blockade

Fiber photometry

Histological verification

Statistics

Computational modeling

Computational model: HRL model

Simulation conditions and data recording

Training, testing, and measurement procedures

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Supplementary information

Supplementaly_Information

Supplementaly_Data

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links