Introduction

The habenula is a small bilateral nucleus located at the dorsal end of the diencephalon. In mammals, it comprises two distinct nuclear complexes: the lateral habenula (LHb) and the medial habenula (MHb). These two subregions greatly differ in their neuroanatomical connectivity [1] and gene expression profiles [2]. They are thus believed to be functionally distinct [3]. While there exists a large body of animal research devoted to studying the LHb, relatively little is known about the MHb [2]. Over the past 2 decades, the LHb has garnered growing attention [4], largely for its involvement in dopaminergic reward circuitry and its proposed role in psychiatric conditions such as addiction and mood disorders (MDs) [5]. The idea that the LHb plays a role in MDs arose from two major streams of research in animal models.

One central line of work hinted at an involvement of the LHb in the etiology of MDs by bringing attention to its role in negative reward processing. This function of the LHb was first suggested by a 2003 fMRI study in humans [6], and further supported in 2007 by the work of Matsumoto and Hikosaka in nonhuman primates [7]. In the latter study, habenular neuron activity was recorded, revealing that the LHb is activated by cues predicting the absence of reward and inhibited by cues predicting reward. These findings suggested that the LHb encodes negative reward prediction errors (RPEs). This idea was later bolstered by the results of a follow-up experiment from the same group, revealing activation of the LHb in response to punishment and inhibition following reward [8]. Subsequent work in fish and rodents yielded analogous findings [9, 10], indicating that the LHb’s role in the coding of negative RPEs is evolutionarily ancient and widely conserved across species. Owing to its function in negative reward processing, the LHb is well-positioned to contribute to mood disorders. Indeed, reward processing is altered in major depressive disorder (MDD) [11]. More precisely, not only is reward learning impaired, but neural processing of RPEs is also altered [12, 13]. Thus, because the LHb is involved in the processing of RPEs, it stands out as a structure that, when dysfunctional, may contribute to reward-related deficits observed in depression.

More direct evidence for the involvement of the LHb in mood disorders was provided by a second body of research investigating the LHb in animal models of depression(for reviews see [5, 14, 15]). Overall, findings from this literature imply that elevated baseline activity of the LHb, which has been observed in multiple animal models of depression (e.g., [16,17,18]), plays an instrumental role in depression. Baseline hyperactivity of the LHb in depression was first discovered in the seminal work of Caldecott-Hazard et al. [16], who showed that glucose metabolism in the LHb was elevated across three different rodent models of depression. This elevation was found bilaterally. The LHb was the only region across the brain to be consistently hyperactive in the three models. More specifically, it is the increase in LHb burst firing that appears to be depressogenic [19]. It is thought that the heightened bursting activity of the LHb results in a more pronounced inhibition of monoaminergic nuclei, thereby promoting depression [20]. Indeed, via direct and indirect efferent pathways, the LHb has the unique capacity to inhibit both serotonin release via the raphe nuclei [21] and dopamine release via the ventral tegmental area (VTA) and the substantia nigra [22].

Various sources of evidence underscore LHb hyperactivity as a key mechanism in the etiology of depression. Proulx et al. [23] showed that optogenetic activation of a projection from the LHb to the rostromedial tegmental nucleus (RMTg) is sufficient to induce depressive-like symptoms in rats. In contrast, pharmacological inhibition of the LHb was shown to exert antidepressant effects in a rat model of treatment-resistant depression (TRD) [24]. Moreover, lesions of the LHb reduce depressive-like behavior if performed once animals have been exposed to chronic mild stress [25] and prevent the development of depressive-like symptoms if performed before exposing animals to repeated inescapable electric shocks [26]. Lastly, there is evidence that the rapid antidepressant effect of ketamine is elicited by reversing the increased burst firing of LHb neurons [27]. In essence, the above literature highlights LHb hyperactivity as a core neural mechanism driving depression.

Historically, progress in understanding the functions of the human habenula has been hampered by its small size and deep subcortical location, which makes precise in vivo imaging challenging. Indeed, imaging the habenula is challenging in many ways. First, its small size makes it highly susceptible to partial volume effects, where the signal from the habenula may be contaminated by signals from adjacent thalamic regions. Additionally, its proximity to the CSF-filled third ventricle increases the habenula’s vulnerability to cardiorespiratory physiological noise [28, 29]. Furthermore, the habenula is difficult to delineate due to its low anatomical MRI contrast with neighboring structures [30, 31] and conventional normalization algorithms struggle to align it accurately [32]. Despite these challenges, studying the habenula has become more feasible over the past decade due to significant advancements in both structural and functional imaging techniques, including the development of myelin-sensitive imaging and the advent of ultra-high-field (UHF) MRI. Myelin-sensitive imaging facilitates habenula delineation by increasing its contrast relative to the adjacent thalamus, while UHF MRI provides improved resolution, as well as superior signal-to-noise ratio (SNR) and contrast-to-noise ratio (CNR) [28, 33]. Thanks to these technical advancements, there is now a burgeoning field investigating the human habenula both in healthy individuals and in relation to MDs, particularly depression. In healthy individuals, numerous findings have aligned with key results from animal studies. For instance, animal tract-tracing studies have demonstrated anatomical connections between the habenula and several key brain regions, including the VTA [1, 34], median raphe nuclei (MRN) [1, 34], dorsal raphe nuclei (DRN) [1, 34], and periaqueductal gray (PAG) [34]. Correspondingly, human functional magnetic resonance imaging (fMRI) studies have revealed resting-state functional connectivity (RSFC) between the habenula and these same regions with quite good consistency [35,36,37,38], even though there are also some inconsistencies. Although functional connectivity cannot demonstrate anatomical connectivity, these patterns are nevertheless consistent with animal findings. Similarly, as discussed above, evidence from primates indicates that the LHb has a role in the coding of negative RPEs, and results from human studies are consistent with this notion. Indeed, fMRI studies have shown that the habenula is activated by the reception of negative feedback [39], by punishment [40, 41], by a cue predicting an aversive stimulus [42], and by the omission of an expected reward [43].

Multiple reviews about the habenula and its role in MDs have been published recently (e.g., [15, 20, 44,45,46]). However, these reviews focused on animal studies. To our knowledge, human studies have not been reviewed yet. As animal findings may not directly apply to humans, a review of human studies on the habenula and its relationship with MDs is greatly needed. Therefore, this systematic review aims to synthesize findings from human studies concerning the habenula and its relationship with MDs. In the paper, when referring to animal studies, we will differentiate between the LHb and the MHb. Because of the small size of the habenula, in vivo human studies do not usually differentiate between these two nuclei and instead study the habenula as a whole [33]. Thus, when discussing human studies, we will refer to the habenula altogether without distinguishing the LHb from the MHb.

Methods

This systematic review was conducted following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines [47] (see Supplementary Information for the PRISMA checklist). The methodological quality of the studies was assessed with the Quality Assessment Tool for Observational Cohort and Cross-Sectional Studies [48]. This review does not have a registered protocol.

An initial literature search was conducted using PubMed in April 2023. The search was later updated in January 2024. Inclusion criteria were a) human sample; b) structural or functional brain measurement of the habenula. Exclusion criteria were a) published in a language other than English; c) articles reporting results from deep brain stimulation (DBS) of the habenula without results pertaining to habenular structure or function; c) review papers; d) preprints. Lastly, because this review pertains to MDs, papers studying suicide or bipolar disorder (BD) were included, while those concerning post-traumatic stress disorder (PTSD) were excluded, as PTSD is rather classified as an anxiety disorder [49]. Detailed information on the search strategy, study selection, data extraction, and quality assessment is provided in the Supplementary Information.

Results

Description of studies

The search yielded 93 articles, of which 58 were retained for full-text screening. Moreover, one article was identified through citation searching. In total, 50 articles were included in the review (Fig. 1 [50]). These 50 articles comprised a total of 53 individual samples. Sample sizes varied between one and 423. There were 21 articles wherein at least one group had a sample size of 20 or less. Demographic and clinical characteristics of participants are presented in Table 1. Papers were published between 1999 and 2024 (Fig. 2). The majority of the samples (64%) were collected either in China (20 samples) or in the United States (14 samples) (Fig. 2). According to our assessment of methodological quality, out of the 53 samples included in our review, six were classified as having good quality, 38 as fair, and nine as poor (for details, see Supplementary Information). The articles included in the review can be grouped into three distinct classes. One category encompasses articles that reported results about the activity of the habenula in the context of MDs (n = 12). By activity, we refer to any direct or indirect measure aiming at quantifying neuronal activity. We chose to group these studies because they are all relevant to the matter of habenular hyperactivity in depression. Another category comprises the papers that studied the connectivity of the habenula during resting-state fMRI in patients with MDs (n = 24), while the last category includes papers that presented structural findings (n = 19). Because some studies were included in more than one category [40, 51,52,53,54], the sum of studies in each category exceeds the total number of studies included in the review.

Fig. 1
figure 1

PRISMA flow diagram of the study selection process.

Table 1 Study characteristics.
Fig. 2: Left graph: number of studies included in the review by year of publication.
figure 2

Right graph: number of samples included in the review by country where the sample was collected.

Activation studies

This section will synthesize findings from studies examining the activity of the habenula during a task or at rest. A total of twelve studies fit into this category (see Table S1). In 10 studies, the habenula was a region-of-interest (ROI), while in two others [55, 56] the habenula came up as a cluster in whole-brain analyses. Among the 10 studies that used the habenula as an ROI, seven separated the left from the right habenula for the analyses [40, 53, 57,58,59,60,61], while three combined both habenulas [62,63,64]. Six studies used fMRI [40, 53, 56, 58, 59, 63], two used positron emission tomography (PET) scan [55, 57], and four employed intracerebral electrodes to record local field potentials (LFPs) within the habenula [60,61,62, 64]. Four studies reported resting-state results [57, 59, 61, 64], while six studies reported task-related results [40, 55, 56, 58, 62, 63]. Two other studies reported both resting-state and task-related results [53, 60].

Comparison of resting-state habenular activity between MDD patients and HCs

The study conducted by Lawson et al. [53] is the only study that directly compared resting-state habenular activity between MDD patients and healthy controls (HCs). In this study, both MDD patients and HCs underwent a resting-state fMRI session. In order to estimate the habenular activity, authors used arterial spin labeling (ASL), an MRI technique allowing to quantify cerebral blood flow (CBF) [65]. No between-group difference was found in habenular CBF.

Induction and treatment of depression

Three studies provided data relevant to the matter of baseline habenular hyperactivity in human depression by either administering a treatment for depression (one study) or by employing a pharmacological procedure aimed at inducing a depressive-like state in participants (two studies). Carlson et al. [57] reported findings consistent with the idea that a treatment for depression can reduce baseline habenular activity. They observed that in a sample of 20 patients receiving ketamine treatment for TRD, glucose metabolism in a region encompassing the right habenula decreased following ketamine injections. However, there was no significant difference for the left habenula. The two studies seeking to induce a state resembling depression did so by exposing their participants to acute tryptophan depletion (ATD). ATD is a method aimed at inducing a transient depressive-like state that consists of lowering brain levels of tryptophan, the precursor of serotonin. This method is particularly effective in inducing depressive symptoms in people already vulnerable to depression [66]. Accordingly, the two studies that employed ATD enrolled participants currently in remission from MDD—participants assumed to be susceptible to depression. The first study used PET scan to measure neural activity during cognitive tasks under ATD [55]. Strikingly, across the whole brain, a voxel in a region encompassing the habenula was where the activity was the most strongly negatively correlated with tryptophan levels. In other words, lower levels of tryptophan were associated with more activity in the habenula. Moreover, following ATD, a voxel in a region encompassing the habenula was where activity was most strongly correlated with depressive symptoms across the brain, consistent with the baseline hyperactivity of the habenula present in animal models of depression. In the second study, participants underwent a resting-state fMRI session following the administration of ATD. Patients with remitted major depressive disorder (rMDD) showed significantly higher left habenular blood flow when compared to HCs [59]. However, there was no significant difference for the right habenula.

Intracranial recording

Rather than employing ex vivo methods such as fMRI and PET scan, four studies have used intracerebral recordings to study the relationship between habenular activity and MDs. These studies measured LFPs within the habenula. The LFP is the electrical potential measured in the extracellular environment surrounding neurons [67]. When measuring LFPs within habenula, the assumption is that the signal measured reflects neural activity from habenular neurons (but see [68, 69]).

Sonkusare et al. [60] used the aperiodic component, which reflects non-oscillatory brain activity. It is thought that the greater the aperiodic component, the greater the excitation-inhibition ratio [70]. Therefore, in this study, the aperiodic component was used as a proxy for the excitation-inhibition ratio of the habenula. Their results revealed a high correlation (r = 0.88) between the aperiodic component of the left habenula and depression scores in a sample of six TRD patients. This positive correlation between the aperiodic component and depressive symptoms could indicate that higher excitation within the left habenula is associated with higher levels of depression, in line with the increased burst firing of the LHb described in rodent models of depression. It is worth noting, however, that four out of their six patients had a diagnosis of BD, which may affect the habenula in a different way than unipolar depression. Furthermore, no significant correlation was found for the right habenula. Another paper investigated the association between the power spectral density (PSD) of oscillations recorded in the habenula and depressive symptoms. Lower PSD of habenular oscillations ranging from theta to gamma frequency bands (except for low betas) were generally associated with heightened depression in TRD patients [61]. This was the case for both the left and the right habenula. PSD of habenular oscillations does not directly inform us about activity in the habenula. However, lower PSD of habenular oscillations suggests lower neuronal synchronization within the LHb. Because burst firing is thought to enhance the synchronization of neural networks [71, 72], one possibility is that lower PSD reflects lower burst firing. These results appear consistent with the results of a case report where one patient underwent DBS for TRD. In this study [64], the PSD in the delta, theta, alpha, and beta frequency bands increased along with the clinical improvements of this patient over time. However, results from these two studies conflict with a study conducted in a rodent model of depression. In this study, LFPs in the LHb were measured before and after a DBS protocol aimed at treating depression. PSD was lower after the DBS from theta to low gamma frequencies [73]. Because DBS reduced depressive symptoms, this suggests that lower PSD in these frequencies was associated with a decrease in depressive symptoms. Lastly, another study reported that, in response to negative emotional stimuli, neither alpha nor theta frequencies recorded in the habenula correlated with depression scores in patients with psychiatric disorders [62]. On the contrary, one study in rodents found that depression was associated with higher theta-band synchronization [27]. However, an important distinction between these studies may explain the apparent inconsistency. Indeed, in the human study, LFPs were measured in response to negative emotional stimuli, which was not the case in the animal study.

Task-based fMRI

Five task-based fMRI studies reported results about the habenula and its relationship with MDs. Three of these studies investigated whether there is a difference in habenular activation between MDD patients and HCs when processing stimuli with negative motivational value. One study [40] found that MDD patients exhibited a trend toward heightened activation in the right habenula when processing punishment. In contrast, the same study also found a trend for reduced activity in the left habenula. Another study observed a reduction in activity in the left habenula of MDD patients in response to losses in a task involving monetary rewards and losses [58]. Lastly, one study found that the activity of the whole habenula when processing stimuli with negative motivational value was reduced in MDD patients [53]. Additionally, Kumar et al. [63] reported no group difference in negative RPE signals in the habenula between MDD and HCs [63]. The fifth study found that the activity of the habenula during the processing of loss prediction error was negatively correlated with anhedonia in MDD [56].

Connectivity studies

This section will synthesize findings from studies that used fMRI to examine resting-state connectivity of the habenula in the context of MDs. There were 24 such studies (see Table S2). In all studies, the habenula was defined a priori as an ROI. Among the 24 studies, 19 separated the left from the right habenula seeds for the analyses, while five combined both seeds. Twenty-two papers were directly interested in the habenula and its relationship with MDs, while two others investigated the habenula in other conditions: chronic low-back pain [74] and anorexia [75]. These two studies nevertheless reported correlations between depressive symptoms and the resting-state functional connectivity (RSFC) of the habenula. Of the 22 studies directly interested in MDs, 16 conducted analyses comparing the connectivity of the habenula between groups (e.g., MDD versus HC), and six investigated how the connectivity of the habenula changes in response to depression treatments: ketamine injections [76,77,78], bright light therapy [79], DBS [51], and electroconvulsive therapy (ECT) [80]. From these same 22 studies, 15 conducted a seed-to-whole-brain analysis with the habenula as a seed. One study performed both ROI-to-ROI and seed-to-whole-brain analyses [76]. Another one used a seed-to-voxel analysis where voxels within specific ROIs were considered in addition to ROI-to-ROI analyses [81]. Three studies exclusively performed ROI-to-ROI analyses [52, 82, 83]. Lastly, two studies opted for network analysis [84, 85], which is a framework that envisions the brain as an intricate network. This network can be illustrated graphically by nodes and edges, with the latter symbolizing the relationships between the nodes [86].

Among connectivity studies (i.e., comparing habenular connectivity between groups with and without MDs or before and after a depression treatment), reports of findings about regions of the default mode network (DMN) stood out as particularly common (Fig. 3). The DMN regions reported in at least two studies were the precuneus (5 studies; 10 pairings), the medial prefrontal cortex (mPFC) (when combining results for its ventral and dorsal parts) (5 studies; 6 pairings), the middle temporal gyrus (MTG) (4 studies; 5 pairings), the parahippocampal gyrus (PHG) (3 studies; 3 pairings), and the posterior cingulate gyrus (PCC) (2 studies; 2 pairings). Among brain regions outside the DMN (Fig. 3), the regions reported in at least two studies were the inferior frontal gyrus (IFG) (5 studies; 6 pairings), the dorsolateral prefrontal cortex (dlPFC) (5 studies; 5 pairings), the anterior/middle cingulate cortex (5 studies; 5 pairings), the superior temporal gyrus (STG) (5 studies; 5 pairings), the precentral gyrus (4 studies; 4 pairings), the postcentral gyrus (4 studies; 4 pairings), the cerebellum (4 studies; 7 pairings), the inferior temporal gyrus (ITG) (3 studies; 4 pairings), the locus coeruleus (3 studies; 3 pairings), the occipital gyrus (2 studies; 3 pairings), the supplementary motor area (2 studies; 2 pairings), the amygdala (2 studies; 2 pairings), the insula (2 studies; 2 pairings), and the calcarine/lingual gyrus (2 studies; 2 pairings).

Fig. 3: Synthesis of connectivity studies.
figure 3

Left graph: number of pairings and regions reported for DMN regions. Right graph: number of pairings and regions reported for regions outside of the DMN.

Findings related to structures reported in more than one study are presented in Table 2 (DMN regions) and Table 3 (other regions). Overall, in most cases, no clear pattern emerged from the findings. Indeed, among the studies that were similar enough to be directly compared, the vast majority of findings were (1) not reported in more than one study or (2) reported in more than one study but contradicted by at least one other study. For a narrative description of the results, see the Supplementary Information. The only finding that was reported in at least three studies was that the RSFC between the left mPFC and the habenula was higher in depression [35, 54, 87].

Table 2 Regions of the DMN.
Table 3 Other Regions.

Structural studies

Overall, 19 studies reported structural results. In 16 studies, the habenula was an ROI, while in three others [88,89,90] the habenula came up as a cluster in whole-brain analyses. Among the 16 studies using the habenula as an ROI, 12 separated the left from the right habenula for the analyses, while four combined both habenulas [51, 91,92,93]. Fourteen studies presented volumetric results and three studies used diffusion tensor imaging (DTI) [52, 93, 94]. One study used MRI-based quantitative susceptibility mapping, a method used to estimate brain iron deposition [90]. One last study measured serotonin transporter (SERT) availability across the brain to predict response to antidepressants [89].

Volumetric studies

In total, 11 studies reported total habenular volume exclusively (see Table S3). One study reported total habenular volume in addition to white and gray matter volumes [91]. Two studies focused on gray matter volume but did not report whole habenular volume [88]. Among all volumetric studies, seven compared the volume of the habenula between depressed patients and HCs. Five out of the seven studies found no significant difference in habenular volume between depressed patients and HCs [53, 54, 91, 95, 96]. One study observed a smaller volume of the right habenula in MDD patients [97], while another reported a larger bilateral habenular volume in MDD patients [40]. Another study, which measured habenular gray matter volume exclusively, found that TRD patients had a smaller habenular gray matter volume [88]. Savitz et al. [98] compared the habenular volume between HCs and patients with either medicated BD, unmedicated BD, unmedicated MDD, or unmedicated rMDD. They did not find any difference between MDD patients and controls. Their main finding was that unmedicated BD patients had a smaller habenular volume than HCs for both left and right habenulas. Another study used post-mortem data to compare the volume of the habenula between HCs and a group containing both MDD and BD patients. They found a smaller volume of the right MHb and right LHb in MDD/BD patients versus HCs [99]. Lastly, two studies compared BD patients and HCs. Both studies reported no difference in habenular volume between groups [92, 100]. Overall, despite some conflicting volumetric findings, it does not seem like habenular volume is consistently altered in MDs.

Structural connectivity studies

Three studies investigated habenular structural connectivity and its relationship with MDs. To do so, they used diffusion tensor imaging (DTI), a technique used to map white matter tracts in the brain [101]. Cho et al. [94] investigated structural connectivity between the thalamus and the habenula in MDD patients and HCs. They observed that MDD patients had significantly more white matter tracts connecting the right habenula to the left mediodorsal thalamus. In depressed patients, a study utilizing DTI found that those who failed to respond to treatment had lower fractional anisotropy in the right habenular afferent fibers compared to treatment responders. This was interpreted as indicative of lower structural connectivity between the habenula and its upstream connections [52]. Lastly, another study found that higher fractional anisotropy between the habenula and the hypothalamus was related to lower depression scores [93]. Overall, due to the variety of research questions addressed, no clear pattern emerges from these studies.

Discussion

Is the habenula hyperactive in human depression?

Baseline hyperactivity of the LHb stands out as a central finding from animal models of depression [5]. Activation studies are the most pertinent to ascertain its existence in human depression. Four reviewed studies are particularly relevant to the matter of habenular hyperactivity in depression.

The study conducted by Lawson et al. [53] was the best designed to determine whether the habenula is hyperactive in human depression. Using ASL, the researchers compared habenular CBF between actual MDD patients and HCs during resting-state. This study offers three key strengths regarding the investigation of habenular hyperactivity in depression. First, the design allows for a more direct comparison with findings from animal models, where the activity of the habenula has been compared between animals exhibiting depressive-like behaviors and controls (e.g., [16, 102]). Second, this study measured resting-state rather than task-related habenular activity. This also facilitates comparisons with the animal literature. Indeed, animal findings suggest that depression is associated with higher baseline activity in the habenula [103], but task-related habenular activity has not been compared between depressed animals and controls, to our knowledge. Third, the use of ASL, rather than the more commonly employed BOLD fMRI, is particularly well-suited to addressing the research question. ASL is an MRI technique measuring tissue perfusion (i.e., blood flow) using magnetically labeled arterial blood water protons as an endogenous tracer [104]. It is ideal for assessing baseline brain activity, as it allows for quantification in absolute units [105]. Indeed, ASL allows measurement of CBF, which is a measure of the amount of blood delivered to brain tissue at the capillary level, expressed in milliliters of blood per 100 grams of tissue per minute (ml/100 g/min). It reflects the rate of blood supply to a specific area of the brain over a set time period [105].In contrast, the physiological meaning of the BOLD signal is more ambiguous [106], as BOLD is measured in arbitrary units and reflects changes relative to a baseline, rather than absolute activity [107, 108]. Therefore, BOLD fMRI is less suitable for measuring baseline brain activity. Additionally, the BOLD signal is prone to low-frequency drifts unrelated to neural activity [109], while ASL is not affected by these drifts [110]. Indeed, previous studies have demonstrated that ASL-derived resting CBF measurements are highly stable across time, with consistency observed across intervals ranging from minutes to days [111,112,113]. Importantly, in contradiction with the habenular hyperactivity reported in animal studies (e.g., [16,17,18, 114]), Lawson et al. found no significant difference in habenular CBF between the groups [53]. However, ASL has an inherently low SNR [115], which, combined with the study’s relatively small sample size (23 MD and 22 HCs), may explain the non-significant findings. Additionally, this study employed a pulsed ASL (PASL) sequence, which has considerably lower SNR than the commonly recommended pseudo-continuous ASL (pCASL) sequence [116, 117].

Other approaches used to investigate the activity of the habenula aimed at either inducing a depressive-like state in participants or at relieving depression. Using PET scan, Carlson et al. [57] found evidence of reduced glucose metabolism following ketamine treatment in a region consistent with the location of the right habenula. In contrast, no significant difference was found for the left habenula [57]. However, the fact that ketamine reduces habenular glucose metabolism does not necessarily imply that the habenula is hyperactive in MDD patients. Moreover, the conclusions of this study should be interpreted with great caution, as the image resolution was quite low, with a voxel size (6 mm isotropic) considerably greater than the habenula itself. Therefore, these findings cannot speak specifically to the habenula, but rather to a region that encompasses, but is not limited to, the habenula. Findings from this study can nevertheless be interpreted as consistent with the theory of hyperactivity, although by no means do they constitute strong evidence. There are also two studies that sought to induce depressive-like symptoms in rMDD patients by exposing them to ATD [55, 59]. One study acquired ASL images to measure CBF [59], while the other used PET to assess glucose metabolism [55]. Taken together, these two studies were both coherent with the idea of habenular hyperactivity in depression. One caveat is that, as is typical of PET and ASL imaging, the resolution in both studies was low, suggesting that the signal attributed to the habenula may have been contaminated by signals from surrounding regions. Another important consideration is the uncertainty surrounding ATD as an adequate model of depression. Although ATD appears to negatively impact mood in individuals with rMDD, it does not seem to reliably lower mood in HCs [118]. Furthermore, although Roiser et al. [59] did observe elevated activity in the left habenula in rMDD patients, ATD did not significantly affect mood in their sample. Overall, in light of the current evidence, we cannot confidently assert that habenular hyperactivity exists in human depression. On the one hand, the study that was best designed found no evidence for habenular hyperactivity in depression [53]. On the other hand, three studies did point to habenular hyperactivity in depression [55, 57, 59]. However, because the relationship between ATD and depression is unclear, and because of the low resolution of ASL and PET images, results from the two ATD studies [55, 59], although coherent with habenular hyperactivity, cannot be regarded as a strong source of evidence.

Indirect data relevant to the matter of habenular hyperactivity in human depression have also been provided by studies employing intracerebral recording methods to measure LFPs from the habenula in patients undergoing DBS surgery. Intracerebral recordings offer a promising avenue to address whether the habenula is hyperactive in human depression. Because habenular intracerebral recordings have also been used in animal models, this method permits direct cross-species comparisons. This is critically important, as findings derived from invasive animal research may sometimes be difficult to relate to results from human non-invasive neuroimaging, thus impeding the translation of findings. Using the same tools across species has been proposed as a way to bridge the methodological gap between human and animal neuroscience in order to foster translation [119].

So far, four studies have used intracerebral recordings to study the habenula and its relationship with MDs in humans [60,61,62, 64]. While the results from one study could be coherent with the increased burst firing of the LHb described in rodent models [60], three studies reported results incoherent with animal findings and seemingly at odds with the theory of habenular hyperactivity in human depression [61, 62, 64]. Overall, no coherent picture emerges from these early studies using intracerebral recordings. Samples are too small to draw definitive conclusions, as each of the four studies included fewer than 10 participants. Moreover, the samples are heterogeneous, containing not only patients with MDD but also those with BD [60, 62] and schizophrenia [62]. Another thing to consider is that it is possible that LFPs recorded in the habenula also reflect activity from more distant sources [68, 69]. Indeed, volume conduction, which refers to the propagation of electrical potential to a location away from the source [120], may contaminate the signal [121].

Several activation studies compared the blood-oxygen-level-dependent (BOLD) activity of the habenula between MDD patients and HCs while they completed a task in an MRI scanner [40, 53, 56, 58, 63]. Regrettably, because these studies measure the activity of the habenula during a task, they cannot directly answer the question of whether the baseline activity of the habenula is enhanced in human depression, as is the case in animal models. One finding, that of lower habenular activation in MDD when processing stimuli of negative motivational value, was reported in three different studies and hence merits further attention. So far, to our knowledge, animal studies have not investigated if there is a difference between depressed and non-depressed animals in LHb firing when encoding stimuli of negative motivational value. Therefore, the reviewed studies cannot be directly compared to the animal literature.

The habenula as a potential hub connecting the reward system to the DMN

Overall, results from connectivity studies were mostly unreplicated, conflicting, or could not be compared between studies because of methodological heterogeneity. Even the sole finding that was reported in at least three individual studies is not entirely convincing. Indeed, three studies reported higher RSFC between the left mPFC and the habenula in depression [35, 54, 87]. However, three similar studies did not report any significant difference in RSFC between the habenula and the mPFC [122,123,124]. Moreover, although the three studies [35, 54, 87] agree in suggesting higher RSFC between the habenula and the left mPFC in MDD, there is a caveat. Indeed, while one study observed this higher RSFC for the whole habenula [35], another detected it in the left habenula exclusively [87], and yet another observed it in the right habenula exclusively [54].

Variability between the studies may be a factor explaining some of the conflicting findings. For instance, the clinical populations within the samples examined were quite heterogeneous, which renders between-study comparisons difficult. Some samples included only a specific type of depressed patients (e.g., unmedicated MDD, TRD, subclinical depression (SD)). Other studies assigned patients to different groups based on specific characteristics (e.g., MDD with versus without suicidal ideation, psychiatric inpatients with versus without a history of past suicide attempts). Research questions also varied across studies (e.g., response to treatment, prediction of treatment response, comparing patients with MDs versus HCs) and the analyses performed were diverse (e.g., static functional connectivity, dynamic functional connectivity, effective connectivity, network analysis). Another source of variability is that some studies used seed-to-whole-brain analysis using the habenula as a seed, while other studies used seed-to-ROIs analysis. In addition, some studies used the whole habenula as a seed, while others separated the left and the right habenula.

Although connectivity results were challenging to interpret, a recurring pattern was the frequent report of DMN regions. Work on non-clinical populations has shown that the habenula is functionally connected to areas of the DMN such as the precuneus, PCC, mPFC, and PHG [35, 37, 38]. The DMN is thought to play a role in rumination [125, 126] and is one of the main networks disrupted in depression [127]. Importantly, a study in a rat model of depression has demonstrated a causal relationship between the LHb and the DMN [128]. Indeed, the study showed that inhibition of the LHb, which alleviates depressive symptoms in rodents [24], induces a decrease in DMN connectivity. This suggests that the habenula and the DMN may interact together in their involvement in depression. More specifically, there could be a relationship between habenular hyperactivity and the hyperconnectivity of the DMN observed in human depression [129, 130] (although hypoconnectivity has also been reported [131]).

In addition to the DMN, another core network involved in depression is the reward network [127], which encompasses the habenula [132]. Neuroscientific investigations about the DMN and the reward system have largely been pursued in isolation from each other [133]. Interestingly, efforts to integrate findings from these two fields have recently been undertaken. Indeed, it has been proposed that the DMN can be conceptualized as a reinforcement learning agent [133]. As such, the DMN’s role would be to optimize behavior, which involves the minimization of prediction errors to maximize reward over time. One possibility is that the habenula may act as hub linking the subcortical reward system with the cortical DMN. The ventromedial prefrontal cortex (vmPFC), which is connected to the habenula and is part of the DMN, could act as the messenger allowing communication between the DMN and the habenula. According to Rolls [134], although the LHb’s activity correlates with RPEs, it is unlikely that the LHb is the structure computing these errors. Indeed, Rolls [134] proposes that the bulk of the computation is done by cortical areas, particularly the orbitofrontal cortex (encompassed in the vmPFC). The signal is then proposed to be sent to the LHb, which then relays it to structures such as the midbrain dopaminergic neurons [135]. Evidence from rodents suggests that the projection from the mPFC to the LHb [136] might be of critical importance to the etiology of depression. For instance, chemogenetic activation of this projection was shown to produce depressive-like symptoms in mice, whereas its inhibition has antidepressant effects [137]. In this review, we identified three studies reporting higher RSFC between the left mPFC and the habenula in depression [35, 54, 87]. This finding and results from preclinical studies suggest that the glutamatergic projection from mPFC neurons to the habenula may be hyperactive in depressed patients, thereby contributing to habenular hyperactivity (provided that habenular hyperactivity is indeed present in human depression, which, as stated above, has not been demonstrated yet).

Future research is needed to elucidate the connections between the reward system and the DMN as well as how this potentially relates to depression. Because of its extensive connections with the vmPFC, the habenula may be a key node allowing communication between these two networks and is thus a target for further investigations in the emergent research endeavor attempting to provide an integrative account of these two important brain networks.

Is the volume of the habenula altered in human depression?

Structural studies have mostly concentrated on habenular volume (14 studies). Five out of the seven studies that compared the volume of the habenula between depressed patients and HCs found no significant difference. One study observed a smaller volume of the right habenula in MDD patients (Cho, Park et al., [97]), while another found a larger bilateral habenular volume in MDD patients (Liu, Valton et al., [40]). Of the two studies examining differences in habenular volume between HCs and BD patients, neither found any significant group differences [92, 100].

Although habenular morphology has not been a great object of interest in animal models of depression, we know of two studies that addressed the subject. One study noted lower bilateral MHb volume in a mice model of MDs, but no difference for the LHb [138]. The other study observed lower volumes of both the bilateral MHb and LHb in rats exposed to chronic stress [139]. More research is needed to establish whether there exists a reduction in habenular volume in animal models of depression.

Overall, the reviewed data do not provide evidence for an alteration of habenular volume in MDs. An important consideration is that the methodological quality of the studies was quite varied. First, some studies [40, 53, 95] assessed habenular volume using Lawson’s method to identify the habenula [29], although this method was not designed for this purpose, as it relies on geometric delineation [140]. Moreover, the resolution of some of the studies was likely insufficient to clearly delineate the habenula. Indeed, Kim and colleagues [140] demonstrated that when high-resolution anatomical images from a 7 T scanner were down-sampled to 0.8 mm isotropic, subtle details aiding in habenular delineation were lost, though these details remained visible at 0.5 mm isotropic. Among volumetric studies, only four had a resolution of 0.7 mm or less in all planes [95,96,97,98], which one may argue is a bare minimum to be able to delineate the habenula with relatively good precision. Among these studies, three found no difference in habenular volume between MDD patients and HCs [95, 96, 98], while one study found that MDD patients had a significantly smaller right habenular volume than HCs [97]. Therefore, even among higher-quality studies, the evidence remains conflicting. Volumetric studies depend more heavily on precise delineation of habenular boundaries than functional studies, indicating that even the highest-quality research to date may lack the resolution and contrast needed to produce replicable results or detect subtle group differences. Although methods to improve volumetric study quality are available, researchers have yet to fully capitalize on recent advancements in imaging techniques for studying the habenula in MDs. Moving forward, volumetric studies specifically designed to assess habenular volume will be essential. Key improvements will include utilizing images with enhanced contrast over standard MPRAGE (e.g., MP2RAGE [141] or other myelin-sensitive imaging techniques), and leveraging UHF MRI to acquire higher-resolution images (e.g., 0.5 mm isotropic [140]).

As for DTI studies, they will also benefit from higher image resolution. For instance, using high resolution (i.e., 0.7 mm isotropic) DTI on a 7 T scanner, Strotman and colleagues were able to distinguish fiber from the LHb and MHb [142], which was not possible at lower resolutions [30].

Limitations and future directions

One noteworthy limitation of this review is that the protocol was not preregistered. Our review was also limited by the methodological shortcomings of individual studies. Despite methodological improvements over the last decades [30], precisely imaging the habenula remains a technical challenge. Some of the conflicting findings among reviewed studies may be explained by the small size of the habenula and its low contrast with neighboring structures, as this may have led to heterogeneity in the definition of the habenula. Variability in the methods used to segment the habenula may also have been at play. While most papers used a manual method or a semi-automated method involving manual steps (26 studies), other papers defined the habenula based on Montreal Neurological Institute (MNI) coordinates thought to represent the location of the habenula (10 studies), which likely resulted in delineation inaccuracies, as the location of the habenula in normalized space varies between individuals [32]. Some papers also used brain atlases (3 studies) or fully automated segmentation methods (2 studies). Minor inaccuracies in defining the habenula might have led to signal corruption by adjacent structures or CSF. Another important shortcoming of many functional studies is the use of a lower image resolution (e.g., 3.5 mm isotropic), often explained by the use of a 3 Tesla MRI scanner. Given that the size of each habenula is 30 mm [3] at most [143], which corresponds to one 3 mm isotropic voxel, such a resolution may not be sufficient. This is why an increasing number of studies investigating the habenula have been using 7 Tesla MRI scanners [144]. Limited sample size in certain studies (e.g., there were 21 articles wherein at least one group had a sample size of 20 or less) also casts doubts on the reliability of some of the results. For instance, some inconsistencies in results might stem from underpowered studies unable to detect effects reported by others. Lastly, a limpid integration of the results is somewhat impeded by the diversity of the populations studied, which precluded a meta-analysis. Even within the depression continuum, there were significant differences between the samples regarding disease stage (e.g., first episode MDD, SD, rMDD).

As briefly noted in the introduction, research in healthy humans seems quite consistent with findings from animal studies, —with the caveat that we have not conducted a systematic review on this matter— indicating that the habenula is involved in coding negative RPEs [39,40,41,42,43] and is functionally connected to the VTA, MRN, DRN, and PAG [35,36,37,38]. These results indicate that neuroimaging measures of the habenula can be both robust and reliable in human studies. One potential reason for the discrepancy in the reliability of findings on the habenula in mood disorders (MDs) compared to healthy individuals may be that, due to the habenula’s low signal-to-noise ratio (SNR), the statistical power of studies has been adequate to detect within-subject differences but not strong enough to identify between-subject differences. Another thing to consider is that resting-state studies in healthy humans have used on average longer resting-state sessions (5 min [37], 10 min [38], 60 min [35], and 60 min [36]), while out of the 26 resting-state samples reviewed here, only 7 samples had a scanning duration over 8 min, and only 3 samples over 9 min. Reliability of RSFC estimates could likely be enhanced in future studies by using longer resting-state sessions (i.e., at least 9 min [145]). Moreover, resting-state studies in healthy humans have used on average a much higher resolution (1.3 [38], 1.5 [37], and 2 [35, 36] mm isotropic).

To conduct better studies on the human habenula and enhance the replicability of findings, we offer various recommendations. First, when available, an MRI scanner with a higher field (e.g., 7 T rather than 3 T) should be used. UHF MRI will improve the resolution of both structural and functional MRI images, and enhance SNR [28]. Activity studies will also benefit, as task-related activation at 7 T is markedly enhanced, exhibiting higher percent signal changes [146, 147]. Additionally, resting-state studies will benefit, as at UHF, RSFC exhibits considerable improvements in test-retest reliability [148, 149]. Higher resolution also increases tissue contrast [28, 150], enabling better segmentation of structures with low contrast to neighboring structures, such as the habenula [142]. Optimizing anatomical image acquisition to accurately identify the habenula is paramount. Although the habenula has low anatomical contrast to the surrounding thalamus on MPRAGE images [30], new approaches have shown better contrast at 3 T (e.g., T1w over T2w, STAGE-enhanced T1W (T1WE) [30], or MP2RAGE [141] images). At higher resolution (e.g., 7 T scanner), MP2RAGE with 0.5 mm isotropic resolution has demonstrated clear habenula contrast [140].

Preprocessing should also be conducted with a tailored approach depending on the study’s purpose. For normalization, it is important to consider that the location of the habenula varies in spatially transformed images [32]. Accordingly, it has been suggested to omit spatial normalization and instead conduct functional analyses using a participant-specific habenula ROI [29]. For the same reason, atlas-based approaches, such as probability maps or using MNI coordinates, should be avoided for habenula identification [32]. A promising approach is to use automatic segmentation techniques to identify the habenula, which will save time and remove the subjectivity associated with manual delineation [30]. Similarly, it has been recommended that volumetric analyses be conducted in native space, as normalization may introduce errors in habenular volume measurements [32]. Special attention should also be given to physiological noise. Indeed, the habenula resides in close proximity to the third ventricule, and thus may be prone to physiological artifacts of cardiorespiratory origin (e.g., pulsations from blood flow and breathing). Thus, acquiring cardiac and respiratory data and correcting for physiological noise with tools such as the PhysIO Toolbox [151] may improve SNR.

Moving forward, transcending disciplinary boundaries between human and animal research will be essential to the translation of findings. Our results underscore how the current lack of permeability between these fields can hinder translation. For instance, very few studies included in this review directly tested the hypothesis of habenular hyperactivity (at rest) in human depression, although this is arguably the most important finding about the LHb derived from animal models of depression. Moreover, it is surprising that the volume of the habenula has been the subject of so much interest in humans, while this has not been of major interest in the animal literature. This may be due to an early report of reduced habenular volume in depressed patients [99].

In conclusion, our main findings can be summarized in three points. First, we found that the available evidence is not sufficient to either conclude the presence or the absence of habenular hyperactivity in human depression. In contrast, the finding of baseline habenular hyperactivity is widely replicated in animal models of depression (e.g., [16,17,18, 114]). Second, results from connectivity studies were mainly inconsistent across studies. However, one interesting observation is that regions of the DMN appeared to be overrepresented in the results. Findings from connectivity studies were difficult to compare to the animal literature, as fMRI has rarely been used in rodents to study the LHb in models of MDs, except for a handful of studies (e.g., [128, 152, 153]). Lastly, we found no evidence for habenular volumetric alterations in MDs. In rodents, two studies have investigated the matter. Both detected a decrease in MHb volume in depressed animals. One study found a decrease in LHb volume, while the other did not.