Introduction

Women are disproportionately affected by psychiatric and neurodegenerative disorders, including depression, anxiety, and Alzheimer’s disease (AD)1,2,3. One potential explanation for this female preponderance is an aetiological role of sex hormones such as oestrogens4,5,6,7. Oestradiol, the most abundant and potent oestrogen in the female body, exerts dynamic effects on brain morphology, neurochemistry, and function8,9,10,11,12, and is generally considered neuroprotective13,14,15,16. During the menopause transition, menopausal hormone therapy (MHT) is often prescribed to alleviate symptoms17,18 by partially replenishing sex hormones19,20. However, the effects of MHT use on mental health, AD risk, and markers of neurodegeneration are widely debated4,21,22,23,24,25, and the risks and benefits remain unclear.

MHT might help to mitigate neurological symptoms during the menopause transition26,27,28,29 with potential long-term implications for brain health28,30. However, findings from observational studies on MHT use are mixed. Some studies report an increased risk of AD or dementia31,32,33, and adverse effects on brain structure and neurodegenerative markers34,35,36. Other studies indicate a decreased AD risk37,38,39,40,41 and beneficial effects on brain health42,43,44, while some research finds no clinically relevant associations with neurological outcomes45,46,47. A meta-analysis of four randomised controlled trials (RCTs), based on data from the Women’s Health Initiative Memory Study (WHIMS), found an increased risk of dementia with MHT use compared to placebo30, though potential limitations of this dataset have been discussed48. The role of MHT in psychiatric disorders is also unclear. Some RCTs report a reduction in depressive symptoms with MHT use in menopausal females with depression49,50, as well as prevention of menopausal depression and anxiety51,52,53. Other RCTs reported no impact of MHT on depression and anxiety27,54,55, while some observational studies have shown an increased risk of depression in MHT users56,57.

Discrepancies in findings across studies may partially arise from reverse causation or confounding factors, particularly in observational studies. For example, individuals with severe menopausal symptoms including cognitive difficulties or mood changes58 may be more likely to be prescribed MHT36,45 or receive it following surgery (i.e., hysterectomy or oophorectomy59,60). Differences in factors such as MHT formulation, dosage, timing of onset, and duration of use are also likely to influence study results4,7,30,36. Furthermore, although RCTs are considered the gold standard in establishing causal effects61, their limited duration may preclude examination of long-term outcomes like AD30,62.

Mendelian randomisation (MR) is an epidemiological method that utilises genetics in a quasi-experimental approach to estimate causal effects63. Random allocation of genetic variants predicting a given phenotype at conception is analogous to random allocation to intervention in RCTs64. Genetic variants cannot be influenced by environmental processes, which minimises concerns about confounding and reverse causation63.

MR can be extended to investigate the causal effects of intervening on drug target proteins65,66,67 (Fig. 1). This includes the two main oestrogen receptors (ERα and ERβ); these proteins are encoded by genes ESR1 and ESR2, respectively, and are targeted by oestrogen therapies such as MHT13,68,69. Genetic variants such as single-nucelotide polymorphisms (SNPs) that are located in the gene encoding the drug targets of interest (in this case, ESR1 and ESR2) can be used to proxy the effects of pharmacological intervention at a specific receptor if they are associated with downstream effects similar to the desired drug response65,70.

Fig. 1: Principles of Mendelian Randomisation for studying drug effects.
Fig. 1: Principles of Mendelian Randomisation for studying drug effects.
Full size image

Genetic variants in ESR1 and ESR2, which encode oestrogen receptors α and β, and that influence downstream biomarkers similar to the desired drug target effect (e.g., higher bone mineral density), are used as instrumental variables to proxy target perturbation. The causal effects of this genetically proxied perturbation on clinical outcomes (e.g., Alzheimer’s disease) are then estimated using Mendelian randomisation (MR).

These SNPs can be used as instrumental variables in MR analyses to study the effect of these drug target perturbations on psychiatric and neurological outcomes65. With increasing availability of large-scale genetic data71,72, such drug-target MR studies present unprecedented opportunities to assess these relationships while avoiding resource constraints of RCTs and potential confounding in observational studies73.

Here, we performed the first drug-target MR study examining the causal effects of MHT on AD risk, key brain structural outcomes, depression, and anxiety, all of which are more common in females compared to males. To do this, we used genetic variants that mimic the effects of MHT as instrumental variables. Specifically, we selected SNPs in the ESR1 and ESR2 genes that had been associated in genome-wide association studies (GWAS) with relevant biomarkers representing downstream effects of drugs targeting these receptors (bone mineral density, sex hormone-binding globulin (SHBG) levels, and haemoglobin levels) at genome-wide significance (p < 5 × 10−8). We tested associations of these variants with AD risk, cortical grey matter (GM) volume, hippocampal volume, white matter hyperintensity (WMH) volume, depression, and anxiety.

Results

Main findings

We identified three instruments to proxy pharmacological perturbation of the oestrogen receptors: seven SNPs in ESR1 associated with bone mineral density, one SNP in ESR1 associated with SHBG levels, and one SNP in ESR2 associated with haemoglobin levels (Table 1).

Table 1 Main SNPs used to proxy ERα and ERβ perturbation

No significant associations were found between genetically predicted ERα and ERβ perturbation and AD or WMH volume (Fig. 2, Tables 2, 3, and 4).

Fig. 2: Two-sample Mendelian randomisation estimates for ERα/ERβ perturbation.
Fig. 2: Two-sample Mendelian randomisation estimates for ERα/ERβ perturbation.
Full size image

Forest plot showing beta estimates with 95% confidence intervals for genetically proxied perturbation of the oestrogen receptor α (ERα; proxied by bone mineral density [BMD] and sex hormone-binding globulin [SHBG]) and oestrogen receptor β (ERβ; proxied by haemoglobin [HMG]) on Alzheimer’s disease (AD), grey matter volume (GM vol), hippocampal volume (HC vol), white matter hyperintensity volume (WMH vol), depression, and anxiety. For ERβ associations, all estimated signs have been reversed for illustration. Only inverse-variance weighted (IVW) or Wald ratio estimates are shown. Findings significant at a false discovery rate of 5% are marked with an asterisk.

Table 2 Two-sample MR estimates for ERα perturbation proxied by bone mineral density
Table 3 Two-sample MR estimates for ERα perturbation proxied by sex hormone-binding globulin levels
Table 4 Two-sample MR estimates for ERβ perturbation proxied by haemoglobin levels

Genetically predicted ERα perturbation, proxied by bone mineral density, showed associations with higher cortical GM volume (IVW β = 0.171, 95% CI [0.025, 0.317], p = 0.021), as well as lower hippocampal volume (IVW β = −0.192, 95% CI [−0.338, −0.046], p = 0.010), though neither association remained significant after FDR correction (p = 0.160 and p = 0.148, respectively) (Table 2).

Genetically predicted ERβ perturbation, proxied by haemoglobin levels, showed significant associations with higher risk of depression (Wald Ratio β = −0.656, 95% CI [−0.992, −0.319], p < 0.001, FDR-corrected p = 0.002) (Fig. 2, Table 4). Genetically predicted ERα perturbation, proxied by bone mineral density, showed associations with higher anxiety risk (IVW β = 0.655, 95% CI [0.107, 0.319], p = 0.019), but this did not remain significant after FDR correction (p = 0.160) (Table 2).

Discussion

This MR study examined the effects of genetically predicted oestrogen receptor perturbation on neuropsychiatric outcomes and brain structure. In brief, we did not find significant effects on AD, brain structure, and anxiety, but genetically predicted ERβ perturbation was significantly associated with a higher risk of depression.

There was no evidence for significant associations between genetically proxied perturbation of ERα or ERβ and AD, indicating that activity of these receptors, which are targeted by MHT, does not influence AD risk. This finding contrasts with WHIMS74, which reported a doubled risk of developing dementia among postmenopausal women age 65 and older, though primarily for orally administered conjugated equine oestrogen therapy with progesterone30. Other trials enroling younger females or administering MHT closer to menopause have demonstrated mostly neutral effects of MHT on cognition35,52,75,76. This discrepancy could potentially be explained by the critical window hypothesis, which suggests that the benefits of MHT are contingent on early initiation77,78,79. Although MR estimates can provide insights into lifelong effects of genetic variants, they do not equate to the impacts of pharmacological interventions initiated at specific times65. Therefore, despite the lack of evidence for a link between oestrogen receptor perturbation and AD, potential risks and benefits associated with the timing of MHT initiation remain possible.

Furthermore, no evidence was found for associations between genetically predicted ERα or ERβ perturbation and brain structural outcomes, indicating that drugs targeting these receptors do not seem to impact cortical GM volume, hippocampal volume, or WMH volume. Although ERα was associated with higher cortical GM volume and lower hippocampal volume, these findings did not remain significant after FDR correction. In a previous RCT study, recently postmenopausal females receiving conjugated equine oestrogens for four years had greater ventricular volume increases compared to placebo35, but the increase was not different from placebo three years after discontinuation76. In a separate trial, there were no significant differences in total hippocampal volume between multiple doses of oestradiol and placebo following short-term administration, though higher bilateral posterior hippocampal volume was increased after three months at the highest dose80. Overall, the reported impact of MHT on GM changes ranges from volume increases to decreases or null-effects (for a full review, see ref. 6), and comparability of studies is complicated by confounders such as age, mental health status, formulation, and duration of MHT use6,7.

While relationships between MHT use and brain health might depend on formulation, dosage, timing of onset, and duration of use, the previously published GWAS, whose data we relied on, may also be subject to the healthy volunteer effect81 and survivor bias82. These biases, which result from the tendency of healthier individuals to participate and the exclusion of those who have died or experienced severe illness, may affect the generalisability of our findings.

Our analyses used GWAS summary statistics for AD, brain volumes, and depression derived from large meta-analyses. Phenotype definitions may differ across contributing cohorts, particularly for clinical and psychiatric outcomes, and this heterogeneity could introduce measurement error and attenuate causal estimates83. We restricted our analyses to European-ancestry samples to reduce confounding by population stratification, which improves internal validity. However, we primarily relied on variant-biomarker and variant-outcome associations reported in both males and females, mainly due to the low availability of sex-stratified GWAS. While our results are therefore generalisable to both sexes (i.e., no associations between oestrogen receptor perturbation and AD risk in both males and females), mixed-sex summary statistics may limit the precision of female-specific estimates. While these limitations highlight the need for future GWAS with harmonised outcome definitions and sex-stratified analyses, our cis-MR design using variants proximal to drug targets provides a conservative and biologically informed approach to causal inference.

Genetically predicted ERβ perturbation, proxied by haemoglobin levels, was significantly associated with higher depression risk, supporting the role of oestrogens in mental health. Oestrogens are known to influence mood regulation through various mechanisms in the brain, such as modulating neurotransmitter systems and neuroplasticity84,85,86, but the role of both endogenous and exogenous oestrogens in mood disorders is complicated and multifaceted. For instance, studies suggest that sensitivity to fluctuations in sex hormone levels may drive depressive symptoms in premenstrual dysphoric disorder (PMDD) or perimenopause87,88,89. Hormonal contraceptive use has been associated with increased risk of depression in observational studies90,91,92,93, especially in adolescents92. In contrast, a recent network meta-analysis including 14 RCTs concluded that hormonal contraceptive use did not lead to increased depressive symptoms in adult females94. Similarly, MHT use has been shown to be generally neutral54,55 or even protective49,50,51,52 for depressive symptoms in observational studies and RCTs.

Our study indicated an increased risk of depression associated with ERβ perturbation; however, as noted above, while our methods can provide evidence of the presence and direction of causal effects, they are not equivalent to pharmacological intervention65. Additionally, MR studies work under the assumption of constant genetic exposure effects over a lifetime, which may not capture the reality of time-varying effects95. As the relationship between oestradiol and psychiatric outcomes may change across different life stages84, future MR studies could employ methods such as multivariable MR to examine time-varying effects96, though this would require several GWAS on relevant traits across different age groups or timepoints in an individual’s life. Accordingly, we could not assess age- or menopause-specific genetic effects in this study because suitable age-or menopause-stratified GWAS were unavailable.

We did not find evidence supporting a role of ERα in the risk of depression. ERα perturbation was associated with higher anxiety risk, though this was not significant following FDR correction. Both receptor subtypes are located in brain regions associated with cognitive function and emotion97,98, but there is higher expression of ERβ in the thalamus and hippocampus99, areas involved in mood disorders100,101,102, which may explain our findings. Following future research confirming a differential role of ERα and ERβ in depression, oestrogen receptor modulators that selectively act on ERα could be explored to avoid inducing depressive symptoms with hormone therapy use.

There are some limitations in the use of haemoglobin as a proxy for ERβ perturbation. While higher oestrogen has been correlated with lower haemoglobin levels103,104,105,106, we could not identify any RCTs that linked use of exogenous oestrogens to lower haemoglobin, making this proxy less robustly validated than, for example, bone mineral density for ERα perturbation. Additionally, as about two-thirds of the participants in the haemoglobin level GWAS107 were from the UK Biobank, we expected substantial sample overlap between the phenotype and outcome samples, which could introduce population-specific effects affecting the robustness of the results. Finally, using haemoglobin to proxy ERβ assumes it does not influence depression independently of ERβ activity (exclusion restriction). However, plausible pathways exist from low haemoglobin to fatigue and depressive symptoms. Our follow-up standard MR analysis (Supplementary Note 1) showed an association between haemoglobin and depression by IVW, though not across sensitivity methods, suggesting potential partial violation of this assumption. We therefore interpret the ERβ perturbation and depression result cautiously, emphasising that further research with alternative ERβ proxies is needed to confirm this finding.

This study represents an important initial step in using drug target MR studies to identify risks and benefits associated with MHT use. Although these studies cannot fully substitute for the precision and specific temporal dynamics captured in clinical pharmacological trials, this method can validate drug targets, identify relevant side-effects and outcomes, and aid in drug repurposing by identifying new therapeutic uses for existing drugs, without the high cost and long duration associated with RCTs65,66. Triangulation of findings across these different methodological approaches is essential for a robust understanding of the causal effects of MHT.

Although our current results should be interpreted in the context of the discussed limitations, they provide a foundation for future research elucidating biological pathways and evaluating long-term safety and efficacy of MHT use to optimise women’s healthcare oucomes. For example, well-powered sex-stratified GWAS are mostly lacking, and in addition to AD or psychiatric outcomes, GWAS on depression subtypes such as postpartum depression or PMDD would improve our understanding of female-specific findings related to hormonal treatments. Future analyses could also aim to examine the contribution of progesterone receptor perturbation on neuropsychiatric and brain structural outcomes. Exogenous oestradiol, via MHT or hormonal contraception, is commonly combined with progesterone to balance its effects108, and various formulations might differentially affect risk of mental disorders92,109 as well as AD24,30,110. At the time of the current study, genetic variants within the PGR gene had not been associated with relevant biomarkers representing downstream drug effects, and we were thus unable to explore its role in these neuropsychiatric outcomes. Given the variety of MHT combinations, it is vital to conduct future studies that can provide a basis for clarifying the causal effects of these on health outcomes.

In conclusion, this study found no evidence that genetically predicted ER perturbation, as targeted by MHT, significantly affects AD risk or associated brain structural measures, including cortical GM volume, hippocampal volume, and WMH volume. However, ERβ perturbation may be associated with higher depression risk, indicating a potential causal role of oestrogen in mood regulation. Future drug target MR studies can complement observational studies and RCTs by offering critical insights into the causal effects of MHT on various health outcomes. This approach is essential for clarifying inconsistent findings and guiding pharmaceutical research, ultimately optimising patient care and ensuring long-term health benefits for women.

Methods

Study design and instrument selection

Figure 1 illustrates the principles of MR studies used to investigate drug effects. By utilising genetic variants that proxy receptor activity, this approach can help us understand whether the binding of oestrogens to oestrogen receptors, the primary aim of MHT, has a direct impact on AD risk, brain structural outcomes, and mental health.

To proxy oestrogen receptor perturbation, we gathered all reported associations between SNPs located in the ESR1 and ESR2 genes and various biomarkers, based on the NHGRI-EBI GWAS Catalog (www.ebi.ac.uk/gwas/home)111. We selected only SNPs located within 200kb of the gene start and end (genome build 38p14; location for ESR1 = chromosome 6, 151,656,691 - 152,129,619, and for ESR2 = chromosome 14, 64,084,232 - 64,338,112) and associated with biomarkers at genome-wide significance (p < 5 × 10−8). We removed any associations observed in male-only samples to ensure we captured female-specific effects, or non-European samples, given that different genetic ancestries can influence allele frequencies and linkage disequilibrium (LD) patterns112, and the lack of available ancestry-specific outcome GWAS. Table 5 provides an overview of exclusions for instrumental variable selection.

Table 5 Exclusions applied to SNP associations for ESR1 and ESR2

Biomarkers were chosen based on their biological plausibility to mimic drug target protein effects. For ESR1, we selected “bone mineral density" and “sex hormone-binding globulin (SHBG) levels". Increased bone mineral density or reduced fracture risk has been observed in participants taking MHT compared to placebo in several RCTs113,114,115,116. SHBG concentrations are significantly higher during use of hormonal contraception (which targets ERα) in a dose-response manner, as highlighted by a meta-analysis of experimental studies117. As there were two plausible biomarkers available, we selected both, as consistent results across these would strengthen our results. For ESR2, we chose the biomarker “haemoglobin levels". Females have lower haemoglobin levels compared to males103, with times marked by increased levels of oestrogens (i.e., pregnancy) resulting in reductions in haemoglobin104. In transgender participants receiving oestradiol therapy, haemoglobin decreased significantly as oestradiol levels increased105,106.

Following biomarker selections, there were 30 SNP associations for bone mineral density, 5 for SHBG levels, and 4 for haemoglobin levels (Table 5); some SNPs were associated with a biomarker in several studies. Where SNPs associated with the same biomarker were in LD (r2 < 0.1), we selected a main SNP and its associated statistics based firstly on whether it had been identified in more than one sample, secondly, the sample size of the study reporting the association, and thirdly, the availability of summary statistics. Where the same SNP association was reported in several studies, summary statistics were obtained from the study with the largest sample size. All SNPs not in LD with any other SNPs were also selected as main SNPs. Summary statistics for the main SNPs were downloaded from the GWAS Catalog or retrieved manually from the publication. Supplementary Tables 13 provide the full list of studies reporting associations for SNPs in ESR1 and ESR2 with the biomarkers, including information on LD and availability of summary statistics. Table 1 displays the final selection of SNP associations comprising the three instruments reflecting ERα and ERβ perturbation. We obtained summary statistics from the largest available study for each biomarker rather than meta-analysing across multiple GWAS due to substantial sample overlap between studies (e.g., UK Biobank participants appear in both ref. 118 and ref. 119), differing measurement platforms and phenotype definitions across cohorts, and because most additional studies were considerably smaller and would contribute minimal power while introducing heterogeneity.

Outcome selection

Our primary outcomes of interest were AD, depression, and anxiety, given the female preponderance of these conditions and the conflicting evidence surrounding associations with exogenous oestrogens1,4. We further wanted to investigate associations with relevant brain structural outcomes; GM and hippocampal volume changes have been closely associated with the onset and progression of dementia120, as well as depression121,122 and anxiety123,124. Finally, as recent evidence highlights menopause as a risk factor for WMHs125,126, which have also been associated with increased risk of AD127,128, WMH volume was included as an outcome.

Variant-outcome associations were derived from large previously published GWAS. For AD, we selected a meta-analysis of GWAS on clinically diagnosed late-onset AD with 94,437 individuals129. Genetic associations with brain structural outcomes (cortical GM, hippocampal, and WMH volume), were obtained from the largest GWAS of brain imaging phenotypes from the UK Biobank with 22,138 individuals130. For depression, we selected a meta-analysis of three GWAS on a spectrum of depression phenotypes with 807,553 individuals (246,363 cases and 561,190 controls)131, and for anxiety, we chose a meta-analysis of nine GWAS on anxiety disorders with 17,310 individuals132. All genetic associations were based on GWAS of European ancestry samples. Variant-outcome associations for the genetic instruments are provided in Supplementary Tables 49.

Mendelian randomisation

Two-sample MR analyses were used to obtain estimates for the association between genetically predicted oestrogen receptor perturbation, proxied by relevant biomarkers, and AD, cortical GM volume, hippocampal volume, WMH volume, depression, and anxiety. Analyses were conducted using the TwoSampleMR (version 0.5.6) package in R (version 4.2.1). Variants were harmonised between datasets, ensuring that the associations between SNPs and exposure and between SNPs and the outcome reflected the same allele. For instruments with more than two variants, the Inverse Variance Weighted (IVW)70 method was performed as the primary approach, which regresses the effect sizes of the variant-biomarker associations against the effect sizes of the variant-outcome associations. Several other methods, such as weighted median133, weighted and simple mode134, and MR Egger135 were performed to assess the robustness of results, as broadly consistent results across these methods strengthen the causal inference. For instruments with a single variant, the Wald ratio method was employed. To adjust for multiple testing, false discovery rate (FDR; 5%)136 corrected p-values were calculated across all instances of IVW and Wald ratio methods. To assess potential violation of the exclusion restriction where a proxy biomarker might itself causally influence an outcome (e.g., haemoglobin - depression), we conducted standard follow-up MR analyses of the biomarker on the outcome for any significant results (see Supplementary Note 1).

Ethical considerations

This study used publicly available summary statistics from previously published GWAS. No new data involving human participants were collected or analysed by the authors. Ethical approval for all contributing studies was obtained from the respective institutional review boards or ethics committees, and all participants provided written informed consent in accordance with the Declaration of Helsinki. Studies utilising UK Biobank data107,118,119,130,137 operated under ethical approval from the North West Multi-Centre Research Ethics Committee. For ref. 129, all contributing cohorts obtained approval from local Institutional Review Boards; the AGES Reykjavik Study was approved by the Icelandic National Bioethics Committee (VSN 00-063), the Icelandic Data Protection Authority, and the U.S. NIA IRB; and CHARGE consortium studies received approval from respective local IRBs. For ref. 107, the Icelandic component received approval from the Icelandic National Bioethics Committee (VSNb2015010033-03.12). For ref. 131, UK Biobank data were used under approval from the NHS National Research Ethics Service (11/NW/0382), with additional approvals for Generation Scotland (NHS Tayside Committee on Research Ethics, Ref 15/ES/0040), BiDirect and Münster cohorts (Ethics Committee of the University of Münster and Westphalian Chamber of Physicians), 23andMe (external AAHRPP-accredited institutional review board), and 35 Psychiatric Genomics Consortium cohorts as detailed in the original publication. For ref. 132, contributing cohorts received approvals from MGS (institutional review boards at participating US institutions), PsyCoLaus (local institutional review board, Switzerland), SHIP (Ethics Committee of the University of Greifswald, Germany), QIMR (Human Research Ethics Council of QIMR, Australia), TRAILS (Dutch Central Committee on Research Involving Human Subjects), Rotterdam Study (appropriate ethical approvals, Netherlands), and NESDA/NTR (Central Ethics Committee on Research Involving Human Subjects of the VU University Medical Center, Amsterdam). To the best of our knowledge, all other contributing studies complied with relevant ethical regulations and obtained informed consent, as detailed in the original publications.