Depression symptom-specific genetic associations in clinically diagnosed and proxy case Alzheimer’s disease

Gilchrist, Lachlan; Spargo, Thomas P.; Green, Rebecca E.; Coleman, Jonathan R. I.; Howard, David M.; Thorp, Jackson G.; Adey, Brett N.; Lord, Jodie; Davies, Helena L.; Mundy, Jessica; ter Kuile, Abigail R.; Davies, Molly R.; Hübel, Christopher; Bristow, Shannon; Lee, Sang Hyuck; Rogers, Henry; Curtis, Charles; Kakar, Saakshi; Malouf, Chelsea M.; Kalsi, Gursharan; Arathimos, Ryan; Corbett, Anne; Ballard, Clive; Brooker, Helen; Creese, Byron; Aarsland, Dag; Hampshire, Adam; Velayudhan, Latha; Eley, Thalia C.; Breen, Gerome; Iacoangeli, Alfredo; Kõks, Sulev; Lewis, Cathryn M.; Proitsi, Petroula

doi:10.1038/s44220-024-00369-0

Download PDF

Article
Open access
Published: 15 January 2025

Depression symptom-specific genetic associations in clinically diagnosed and proxy case Alzheimer’s disease

Nature Mental Health volume 3, pages 212–228 (2025)Cite this article

10k Accesses
10 Citations
15 Altmetric
Metrics details

Subjects

Abstract

Depression is a risk factor for the later development of Alzheimer’s disease (AD), but evidence for the genetic relationship is mixed. Assessing depression symptom-specific genetic associations may better clarify this relationship. To address this, we conducted genome-wide meta-analysis (a genome-wide association study, GWAS) of the nine depression symptom items, plus their sum score, on the Patient Health Questionnaire (PHQ-9) (GWAS-equivalent N: 224,535–308,421) using data from UK Biobank, the GLAD study and PROTECT, identifying 37 genomic risk loci. Using six AD GWASs with varying proportions of clinical and proxy (family history) case ascertainment, we identified 20 significant genetic correlations with depression/depression symptoms. However, only one of these was identified with a clinical AD GWAS. Local genetic correlations were detected in 14 regions. No statistical colocalization was identified in these regions. However, the region of the transmembrane protein 106B gene (TMEM106B) showed colocalization between multiple depression phenotypes and both clinical-only and clinical + proxy AD. Mendelian randomization and polygenic risk score analyses did not yield significant results after multiple testing correction in either direction. Our findings do not demonstrate a causal role of depression/depression symptoms on AD and suggest that previous evidence of genetic overlap between depression and AD may be driven by the inclusion of family history-based proxy cases/controls. However, colocalization at TMEM106B warrants further investigation.

Pervasive biases in proxy genome-wide association studies based on parental history of Alzheimer’s disease

Article 04 November 2024

Genome-wide association of polygenic risk extremes for Alzheimer's disease in the UK Biobank

Article Open access 19 May 2022

Deep post-GWAS analysis identifies potential risk genes and risk variants for Alzheimer’s disease, providing new insights into its disease mechanisms

Article Open access 15 October 2021

Main

Epidemiological studies suggest that a diagnosis of depression is a risk factor for the later development of dementia^1,2,3,4, of which Alzheimer’s disease (AD) is the most common form, accounting for ~80% of the over 40 million global cases⁵. Establishing the underlying mechanisms by which depression confers increased risk for AD offers a pathway by which new interventions might be implemented and the global dementia burden reduced⁶.

As twin studies have demonstrated, both depression and AD are substantially heritable—approximately 40% and 80%, respectively^7,8. Furthermore, large-scale genome-wide association studies (GWASs) have demonstrated high polygenicity, identifying over 70 genomic risk loci for AD and nearly 200 for depression^{9,10,11,12,13,14,15}. It is therefore possible that their phenotypic association is partially due to a shared genetic architecture. However, results from previous investigations into the genetic overlap between the two disorders have been mixed. For example, some findings indicate non-significant genetic overlap^16,17, others a significant—if modest—genetic correlation of ~16–17% and a risk-increasing causal effect of depression on AD^18,19,20.

According to the Diagnostic and Statistical Manual of Mental Disorders (DSM-5), diagnosis of a major depressive episode requires the presence of at least five of a possible nine symptoms for ≥2 weeks, including one of the two cardinal symptoms—depressed mood or anhedonia²¹. Potentially hundreds of symptom combinations are possible to meet these diagnosis criteria²². As such, heterogeneity poses challenges to researchers seeking to better understand differences in the genetic contribution to depression and its subtypes²³. However, the decomposition of depression into individual symptoms has provided insight into unique patterns of genome-wide significant loci and cross-trait genetic associations, as demonstrated in a recent GWAS of depression symptoms on the Patient Health Questionnaire (PHQ-9) by Thorp and colleagues²⁴.

A number of studies suggest that anhedonia may be a better predictor of dementia than depressed mood^25,26. Furthermore, several depression symptoms, including appetite changes, psychomotor dysfunction and sleep disruption, are commonly observed in non-depressed patients with dementia^27,28,29. Taking this into account alongside the mixed nature of previous findings examining the genetic overlap between depression and AD, it is possible that leveraging depression symptom-level genetic information may offer greater insight into the disorders’ shared genetic architecture.

However, any association between depression and AD must also consider the potential influence of differences in case/control ascertainment in AD GWASs. A review by Escott-Price and colleagues³⁰ notes that recent large-scale AD GWASs contain a relatively small proportion of clinically ascertained cases/controls, with a large percentage of cases ascertained by proxy, that is, cases and controls are defined as individuals with and without a self-reported parental history of AD/dementia, respectively. The combination of clinical and proxy samples in AD GWAS meta-analyses has proved an effective way of boosting sample size and variant discovery^13,14,15,31. However, evidence suggests that this has come at the expense of specificity in regard to genomic risk loci and an apparent stagnation in the percentage of variance explained by common variants³⁰. Most importantly for cross-trait analysis, recent studies indicate that the direction of Mendelian randomization (MR) causal estimates for AD risk factors on AD can be in the opposite direction depending on whether the AD outcome GWAS contains both clinical and proxy cases/controls or is more strictly clinically ascertained^32,33.

To address these points, here we report a large genome-wide meta-analysis of PHQ-9 depression symptom items using data from the Genetic Links to Anxiety and Depression (GLAD) Study³⁴, the PROTECT Study³⁵ and two questionnaires from UK Biobank (UKB)³⁶. We obtained summary statistics from previous large-scale GWAS for clinical⁹ and broad¹⁰ depression, and six AD GWASs (three with clinical + proxy case/control ascertainment^13,14,15, one with proxy-only³¹ and two with clinical-only^12,37). We used these GWASs to assess the presence, strength and differences in genetic overlap between depression, depression symptoms and AD, with the additional aim of better understanding the influence of different AD case ascertainment strategies on associations.

Results

For a flowchart of this study, see Fig. 1. For details on depression and AD GWAS summary statistics obtained for this study, see Methods.

**Fig. 1: Analysis flowchart for the present study.**

PHQ-9 genome-wide meta-analyses

The final genome-wide meta-analysis—conducted using multi-trait analysis of GWAS (MTAG)³⁸—identified a total of 40 genomic risk loci between the 10 PHQ-9 phenotypes (GWAS-equivalent N range: 224,535–308,421). Only one depression symptom—suicidal thoughts—identified no genome-wide significant variants. Three lead single nucleotide polymorphisms (SNPs) were shared with more than one PHQ-9 phenotype, leaving a total of 37 unique genomic risk loci (Table 1). The significance of each of the lead variants in each of the samples contributing to the meta-analysis is provided in Supplementary Table 1. Expression quantitative trait loci (eQTL) mapping in functional mapping and annotation (FUMA) mapped lead variants at genomic risk loci to 76 genes (Supplementary Table 2). The SNP heritability (h²_SNP) value for the MTAG-PHQ-9 GWAS ranged from 1.12% for suicidal thoughts to 6.78% for the PHQ-9 sum score. h²_SNP z-scores were all >4 (range 6.59–18.50) (Supplementary Table 3), indicating sufficient heritability to obtain reliable genetic correlation estimates in downstream analyses³⁹. The genomic inflation factors (λ_GC) ranged from 1.0638 to 1.2156, with linkage disequilibrium score regression (LDSC) intercepts ranging from 0.9997 to 1.0007, indicating that inflation was due to the polygenic signal as opposed to confounding due to population stratification⁴⁰. Manhattan and quantile–quantile (QQ) plots are presented in Fig. 2.

Table 1 Genomic risk loci from the ten MTAG-PHQ-9 genome-wide association meta-analyses

Full size table

**Fig. 2: Manhattan and QQ plots for each of the ten MTAG-PHQ-9 GWAS meta-analyses.**

Genetic correlations

Of the 72 bivariate genetic correlations (r_g) calculated between the 12 depression phenotypes and the six AD GWASs, 24 were nominally significant and 20 remained significant after false discovery rate (FDR) correction (P_FDR ≤ 0.05) (r_g range −0.25–0.35; P-value range 1.25 × 10⁻²–4.01 × 10⁻⁵; P_FDR range 4.5 × 10⁻²–1.9 × 10⁻³). Of these, 19 were identified when the AD GWAS in the pair contained either clinical + proxy cases and controls, or proxy-only cases and controls (Fig. 3 and Supplementary Table 4). Only one P_FDR significant association was found when using a clinical AD GWAS—between suicidal thoughts and Wightman et al. (excl. UKB) (r_g = −0.25, P = 6.78 × 10⁻³, P_FDR = 3.48 × 10⁻²). All depression phenotypes were significantly genetically correlated with each other (r_g range, 0.57–0.98; P ≤ 3.71 × 10⁻²³) (Supplementary Table 5 and Supplementary Material 1). Only one PHQ-9 symptom pair—concentration problems and psychomotor changes—showed a genetic correlation that was not statistically different from one (95% confidence interval (CI) included one), indicating genetic heterogeneity across depression symptoms.

Local genetic correlations

After univariate testing, a total of 4,271 bivariate local genetic correlation tests were conducted in local analysis of [co]variant association (LAVA)⁴¹ across 324 genomic loci. Of these, 716 were nominally significant and 15 remained significant at P_FDR ≤ 0.05 across 14 unique genomic loci (local r_g range −0.81–0.82; P-value range 1.48 × 10⁻⁴–4.2 × 10⁻⁶; P_FDR range 4.22 × 10⁻²–1.38 × 10⁻²) (Supplementary Table 6). Of the 15 statistically significant tests, ten were identified when using clinical + proxy/proxy-only AD GWASs. No depression phenotype showed a statistically significant association at the same genomic locus with more than one AD GWAS. However, for 10 of the 15 statistically significant tests, nominally significant local genetic correlation was observed between the depression phenotype and at least one additional AD GWAS at the same locus (Supplementary Table 7). Only locus 1790 (chr12: 51769420–53039987) showed a significant P_FDR local genetic correlation with more than one depression phenotype—concentration and sleep problems—both with the clinical-only Wightman et al. GWAS. The numbers of positively and negatively correlated loci identified between each phenotype pair are presented in Supplementary Table 8.

Colocalization

Following LAVA, 14 P_FDR-significant regions of local genetic correlation were passed to the COLOC-reporter pipeline⁴² across 15 depression–AD phenotype pairs. A further 14 colocalization tests were conducted where a nominally significant local genetic correlation was observed at a P_FDR-significant locus between the same depression phenotype and a different AD GWAS. As such, a total of 29 statistical colocalization tests were conducted to follow up the LAVA results. No 95% credible sets were identified by sum of single effects (SuSiE) for any phenotype pairs in these regions. All analyses were therefore conducted under the single causal variant assumption of coloc.abf. No colocalization was identified at any of these loci (mean posterior probability for hypothesis 4 (PP.H₄ ) = 0.59%) (Supplementary Table 7). All but two of these tests indicated no causal variant present in either phenotype (PP.H₀ > 0.8). The two tests in locus 319 (chr2: 126754028–127895644) indicated a strong probability of a causal variant for the Kunkle et al.¹² and Wightman et al.³⁷ clinical AD GWASs (PP.H₂ > 0.9). This locus contains BIN1, a known risk gene for AD that is involved in tau regulation^43,44.

An additional 762 colocalization tests were conducted with the six AD GWASs using regions ±250 kb (r² > 0.1) of lead variants from the MTAG-PHQ-9, broad and clinical depression GWAS. SuSiE identified evidence of colocalization in regions ±250 kb of lead variants at genomic risk loci 14 (depressed mood), 15 (appetite change) and 16 (PHQ-9 sum score) and for broad depression at chr7: 12000402–12500402 (PP.H₄ range 0.79–0.85), all with the same three AD GWASs—Bellenguez et al.¹⁵, Wightman et al.¹⁴ and Wightman et al. (excluding the UKB)³⁷ (Supplementary Table 9). These colocalizations were all in the region of the transmembrane protein 106B gene (TMEM106B), which is visualized using LocusZoom⁴⁵ in Fig. 4. Colocalization was also identified for the same phenotype pairs using coloc.abf (Supplementary Table 10). The same depression phenotypes and loci were suggestive of colocalization with Jansen et al.¹³ (PP.H₄ > 0.6).

Fig. 4: LocusZoom plots of the transmembrane protein 106B (TMEM106B) gene region, containing evidence of colocalization (PP.H4 ≤ 0.8). — **Fig. 4: LocusZoom plots of the transmembrane protein 106B (*TMEM106B*) gene region, containing evidence of colocalization (PP.H₄ ≤ 0.8).**

In a follow-up analysis, we assessed statistical colocalization for ±250 kb TMEM106B (chr7: 12000920–12532993) between these four AD GWASs and all remaining depression phenotypes. Additional colocalization was identified at TMEM106B between both fatigue and psychomotor changes with the Bellenguez et al.¹⁵, Wightman et al.¹⁴ and Wightman et al. (excluding the UKB)³⁷ AD GWASs (Supplementary Table 11), and was suggestive for fatigue with Jansen et al.¹³ (Supplementary Table 21).

SMR analysis using gene expression in the TMEM106B region

We further followed-up colocalizing regions using summary-based Mendelian randomization (SMR)⁴⁶ to integrate eQTLs. In total, 50 tests were conducted for prefrontal cortex and peripheral blood eQTL probes within chr7: 12000920–12532993 (genes TMEM1106B and VWDE) and the ten AD/depression phenotypes implicated in cross-trait colocalization. Of these, 11 associations remained significant after Bonferroni correction (P ≤ 0.001), all with expression levels of TMEM106B (Supplementary Table 13). Peripheral blood TMEM106B expression was positively associated with broad depression (b_SMR [s.e.] = 0.029 [0.004], P = 2.30 × 10⁻⁷) and showed evidence of colocalization (P_HEIDI = 0.108). Prefrontal cortex TMEM106B expression was significantly associated with all ten of the AD/depression phenotypes. Significant associations with AD were consistently positive (b_SMR range 0.029–0.15; P-value range 1.042 × 10⁻⁵–3.395 × 10⁻⁴). Conversely, significant associations with depression phenotypes were consistently negative (b_SMR range −0.097 to −0.033; P-value range 2.302 × 10⁻⁷–4.08 × 10⁻⁵). All brain-based associations showed evidence of colocalization (P_HEIDI ≥ 0.05).

Mendelian randomization

We conducted 144 MR tests to assess bidirectional causal effects between the depression phenotypes and AD (72 in each direction). In our primary MR method, CAUSE⁴⁷, no significant causal effects were identified between any of the depression items and AD in either direction, even at nominal significance (Supplementary Table 14).

F-statistics indicated that instrument strength was sufficient (F_Mean range 22.43–63.36; F_Min range 20.84–31.56; F_Max range 26.37–402.86). Measurement error, as indicated by the I_GX² statistics, was low, indicating instrument suitability for MR-Egger (I_GX² range 0.91–0.98). P_FDR ≤ 0.05 was applied in each of the other MR methods to correct for the 144 tests conducted, after which no statistically significant associations were observed for any method (Supplementary Table 15).

No evidence of colocalization was observed within the APOE region between any depression phenotype and any AD GWAS, with a maximum PP.H₄ of 16.58% observed in the region (Supplementary Table 16).

Polygenic risk scores

No statistically significant associations were detected between any depression phenotype polygenic risk score (PRS) and AD case/control status in any of the three AD target samples (P_FDR ≤ 0.05, corrected within each target sample). Exclusion of the APOE region had no effect on results. (Supplementary Table 17 and Supplementary Material 2).

Similarly, no significant associations were observed between any AD-PRS and PHQ-9 depression items within the GLAD (Supplementary Table 18) or PROTECT (Supplementary Table 19) samples after FDR correction, with or without the APOE region.

Discussion

This study presents a genome-wide meta-analysis of PHQ-9 depression symptom items (GWAS-equivalent N range: 224,535–308,421), identifying 37 genomic risk loci. Subsequent genetic correlation analysis identified 20 significant global correlations and 15 significant local correlations at 14 loci with AD, across six AD GWASs with varying proportions of clinical case/control ascertainment. Significant global genetic correlations were primarily found with AD GWASs containing proxy cases and controls. Although no colocalization was identified at any of the regions of local genetic correlation, strong evidence of colocalization was observed between several depression phenotypes and AD in the region of TMEM106B. MR and PRS analyses did not yield significant results, and no evidence of colocalization was observed between depression phenotypes and any AD GWAS in the region of APOE.

The increased power of our PHQ-9 GWAS allowed for the identification of 28 more genomic risk loci than the previous PHQ-9 GWAS²⁴. Several loci identified in this study have shown previous associations with related phenotypes. For example, SHISA4—identified in association with fatigue symptoms—was implicated as playing a role in disrupted sleep⁴⁸ and daytime napping⁴⁹. The top variant for sleep problems at genomic risk loci 6 (MEIS1)—rs113851554 (chr2: 66750564)—was also the top variant in a GWAS of insomnia and restless leg syndrome⁵⁰. Additionally, the obesity gene FTO⁵¹ was identified as a genomic risk locus for appetite changes. Although the role of FTO in depression is inconclusive⁵², it has been linked to anxiety and depression symptoms in individuals with anorexia nervosa (AN)⁵³. Its identification in association with appetite change symptoms—a phenotype relevant to eating behaviors—suggests that symptom-based genetic analysis can help identify the phenotype-relevant biology of individual depression symptoms.

Our findings also highlight cross-symptom genetic similarities. For example, TMEM106B—a gene identified in previous depression GWASs^9,10—was the nearest gene to lead variants for three PHQ-9 items—appetite changes (rs13234970), depressed mood (rs3807866) and the PHQ-9 sum score (rs12699338). TMEM106B was strongly suggested as a causal gene in a recent multi-ancestry depression GWAS⁵⁴. Furthermore, dysregulation of TMEM106B expression has been implicated in association with major depressive disorder (MDD)⁵⁵ as well as with the anxious and weight gain MDD subtypes, both of which are associated with treatment resistance⁵⁶. TMEM106B has also been implicated in self-reported diagnosis of anxiety disorder⁵⁷, neuroticism⁵⁸ and in a latent factor GWAS of depressive, manic and psychotic symptoms/disorders⁵⁹, suggesting a link to psychiatric risk more generally.

The observed colocalization at TMEM106B between multiple depression phenotypes and both proxy + clinical and clinical-only AD is therefore of particular interest. Two previous studies have identified TMEM106B as playing a role in both depression and AD^18,19. TMEM106B is involved in lysosomal function—particularly in motor neurons⁶⁰—and is classically considered a frontotemporal dementia risk gene⁶¹. As well as being identified in recent AD GWASs^14,15, it is also associated with brain aging, cognitive decline and neurodegeneration across other brain disorders, including amyotrophic lateral sclerosis, multiple sclerosis and Parkinson’s disease^62,63,64,65. TMEM106B is also linked to higher levels of cerebrospinal fluid (CSF) neurofilament light (NfL) chain⁶⁶—itself predictive of cognitive decline, brain atrophy and cortical amyloid burden in individuals with AD and mild cognitive impairment⁶⁷. Higher levels of plasma NfL are also observed in individuals with depression⁶⁸. Accordingly, colocalization between depression phenotypes and AD at TMEM106B indicates that depression may be genetically linked to overall brain health and the resulting general dementia risk. Our study suggests that this overlap may be driven by the genetic architecture of specific depression symptoms, highlighting the benefits of symptom-level genetic analysis. However, SMR analysis indicates that levels of TMEM106B expression as measured in brain have directionally opposite effects for depression phenotypes and AD. As such, further work is required to better understand the role of TMEM106B in brain disorders.

Depression/depression symptom PRSs were not predictive of AD case/control status in three clinical samples, and we did not find evidence of any MR causal associations. Although in contradiction to the study by Harerimana et al.²⁰, these MR findings are consistent with previous studies^17,69. The overall lack of evidence in our analyses versus the relationship observed in previous epidemiological studies suggests the relationship is subject to unidentified confounding. The investigation of this is an important step for future research.

Previous studies have shown changes in the direction of MR effects depending on whether the outcome AD GWAS contains proxy or clinical cases/controls³², but this study differs in that it demonstrates a similar effect with genetic correlations. Of the significant genetic correlations we identified, 95% were identified in proxy + clinical or proxy-only AD GWASs. Where two previous studies^18,20 identified a genetic correlation between depression and AD, it is noticeable that they used the Jansen et al. proxy + clinical AD GWAS as their primary outcome.

Exactly why depression/depression symptoms show differences in genetic correlation between proxy and clinical AD is a matter of interest, particularly as no genetic correlations were identified with the Bellenguez et al.¹⁵ GWAS, despite this also containing proxy + clinical phenotyping. As mentioned, the Bellenguez et al.¹⁵ GWAS defines proxy cases/controls as a binary phenotype, whereas Wightman et al.¹⁴ and Jansen et al.¹³ define proxy cases/controls as a continuous phenotype. These phenotyping differences probably partially explain the differences in the genetic correlation results, given that all three GWASs use the same proxy cases/controls from UKB. However, genetic correlations were also observed with the proxy-only Marioni et al.³¹ GWAS—a meta-analysis of maternal and paternal AD status where parental age-at-diagnosis/age-of-death is controlled for in the GWAS model, instead of being used for weighting the proxy phenotype before analysis. Considering that depression is itself associated with all-cause mortality⁷⁰, it is plausible that including age-at-diagnosis/age-of-death in proxy AD phenotyping induces a form of bias in later cross-trait analyses when the other trait is itself associated with longevity. Further investigation of this issue is required.

Nonetheless, conflicting results such as these pose a problem to researchers seeking to identify genetic relationships between AD and its risk factors. Large differences in the presence or direction of effects depending on which AD GWAS is used to assess associations increases the difficulty in discerning true associations. As such, a sensible approach for future cross-trait genetic studies of AD would be to conduct primary analyses using a clinically ascertained AD phenotype, with different proxy/clinical ascertainment GWASs used to examine consistency. Although this approach would limit researchers to AD GWASs with smaller sample sizes for primary analyses, it would also ensure that the results are driven not by AD proxy phenotyping alone.

This study has several limitations. Despite it being such a large meta-analysis of PHQ-9 items, the ability to detect genome-wide significant variants was probably limited by small sample sizes relative to other psychiatric conditions. Additionally, our analyses were restricted to individuals of European ancestry. The GWAS results may therefore have poor transferability to other ancestry groups. Furthermore, our study uses data from UKB, which is known to be affected by healthy volunteer bias and as a consequence is not fully representative of the wider population⁷¹. We also note that recent work by Huang and colleagues⁷² has suggested that the PHQ-9 capture does not capture the symptom-level genetic heterogeneity underlying depression as accurately as the Composite International Diagnostic Interview Short-Form (CIDI-SF). Although the present study is better powered due to a larger overall sample size, we suggest that future GWAS meta-analyses of individual depression symptoms would benefit from utilizing multiple rating scales.

This study focused on depression as a risk factor for AD. However, there is evidence that some late-life depression represents a prodromal phase of dementia onset^6,73, possibly related to dementia biomarker levels⁷⁴. Therefore, dementia-related depression may be biologically distinct from depression as a mental health disorder. Future genomic studies of dementia-related depression—as undertaken with psychosis in AD⁷⁵—could prove illuminating.

In conclusion, this study describes a genome-wide meta-analysis of PHQ-9 depression symptom items (GWAS-equivalent N range: 224,535–308,421), identifying 37 unique genomic risk loci. Genetic correlations between depression/depression symptoms and AD were primarily observed when the AD GWAS contained clinical + proxy or proxy-only AD case/control ascertainment. Despite null results in MR and PRS, colocalization in the TMEM106B region between four depression phenotypes and AD across both proxy and clinical AD GWASs suggests that future research is warranted into the shared biological mechanisms underlying the role of this locus in depression and AD.

Methods

GWAS in the UK Biobank, GLAD and PROTECT

Patient Health Questionnaire-9 phenotypes

The PHQ-9 is a well-validated clinical screening questionnaire used to assess depression symptom severity on nine individual symptoms in the Diagnostic and Statistical Manual of Mental Disorders, fourth edition (DSM-IV)⁷⁶. The severity of each symptom is measured by the self-reported persistence of that symptom over the preceding two weeks, on a scale of 0 to 3. Scores of 3 indicate an individual experienced that symptom nearly every day, 2 indicates an individual experienced that symptom on more than half the days, 1 indicates an individual experienced that symptom for several days, and 0 indicates no experience of that symptom at all. The sum of an individual’s scores over all nine items (sum score) ranges from 0 to 27. For an overview of the PHQ-9 items and response distribution for each sample, see Supplementary Table 20. Supplementary Table 21 provides sum-score distributions.

Study population

In each GWAS sample, individuals were only retained if they had reported European ancestry and provided a valid response to all PHQ-9 items. Individuals were excluded if they had reported a previous professional diagnosis of schizophrenia, psychosis, mania, hypomania, bipolar or manic depression (UKB field ID 20544) or a previous prescription of medication for a psychotic experience (UKB field ID 20466).

GWAS software

GWAS analyses were conducted using REGENIE v3.1.3⁷⁷. In step one of REGENIE, ridge regression is applied to a subset of quality-controlled variants to fit, combine and decompose a set of leave-one-chromosome-out (LOCO) predictions. Here, quality control for step one was undertaken using PLINK v1.9⁷⁸. In step two, imputed variants are tested for association with the phenotype. LOCO predictions from step one are included as covariates to control for proximal contamination. For all GWASs, genotyping batch, sex, age and age-squared were included as covariates, as were the maximum available genetic principal components (PCs) for GLAD (10 PCs) and PROTECT (20 PCs) to control for population stratification. For the UKB analyses, 16 PCs were included, as recommended by Privé and colleagues⁷⁹. Assessment center was also included as a covariate for UKB analyses.

A total of 40 GWASs were conducted for the meta-analyses—one for each of the nine PHQ-9 depression symptom phenotypes as well as the sum score across all nine items in each of the four samples. To maximize the statistical power, PHQ-9 phenotypes were treated as continuous (range of 0–3 for individual items and 0–27 for the sum score) and analyzed using linear regression. Analyses were restricted to the autosomes.

GWAS with UK Biobank

UKB is a large-scale biomedical database and research resource consisting of ~500,000 individuals with data across a broad range of phenotypes, including mental health outcomes³⁶. Individuals in UKB have been genotyped on the custom UK Biobank Axiom or UKBiLEVE arrays, with imputed data available for ~90 million variants imputed with IMPUTE2 using the Haplotype Reference Consortium (HRC)⁸⁰ and combined UK10K + 1000 Genomes Phase 3 reference panels⁸¹.

UKB participants completed the PHQ-9 in two online surveys. In total, 157,345 individuals provided responses as part of the Mental Health Questionnaire (UKB-MHQ) (category 136) between 2016 and 2017, and 167,199 individuals provided responses as part of the Experience of Pain Questionnaire (UKB-EoP) (category 154) between 2019 and 2020.

After filtering for self-reported European ancestry, valid PHQ-9 responses and previous diagnosis/prescription exclusions, 144,630 (UKB-MHQ) and 155,027 (UKB-EoP) individuals remained before genetic quality control for REGENIE. In step one, SNPs with a call rate of >98%, minor allele frequency (MAF) > 1% and Hardy–Weinberg equilibrium test P > 1 × 10⁻⁸ were retained, as were individuals with variant missingness < 2%, no unusual levels of heterozygosity and not mismatched on sex. Individuals were retained if they were determined to be of European ancestry based on 4-means clustering on the first PCs.

For the final GWAS analyses, 143,171 (mean age [s.d.] = 63.70 [7.68]; % female = 56.38%) and 152,932 (mean age [s.d.] = 65.95 [7.63]; % female = 56.57%) individuals proceeded from the MHQ and EoP questionnaires, respectively. Of these, 108,601 individuals had provided responses on both questionnaires. In step two, a total of 9,746,698 imputed variants were retained with MAF ≥ 0.01 and imputation quality (INFO) score ≥ 0.7.

GWAS with the Genetic Links to Anxiety and Depression study

The GLAD study has the specific goal of recruiting a large cohort of recontactable individuals with anxiety or depression into the National Institute for Health and Care Research (NIHR) Mental Health BioResource, with genetic, environmental and phenotypic data collected³⁴. Genotyping for GLAD was conducted using the UKB v2 Axiom array and imputed using the TopMed imputation pipeline⁸².

After filtering for self-reported European ancestry, valid PHQ-9 responses and previous diagnosis/prescription exclusions, 15,472 individuals remained before genetic quality control for REGENIE step one. Genotype data were provided by the study team and had been filtered to retain SNPs with a genotype call rate > 95%, MAF > 1%, Hardy–Weinberg equilibrium test P > 1 × 10⁻¹⁰, and individuals with genotype missingness < 5%. Individuals were also excluded if they had unusual levels of heterozygosity, were mismatched on sex and were of non-European ancestry based on 4-means clustering. A total of 15,171 individuals (mean age [s.d.] = 39.27 [14.61]; % female = 78.30%) were retained for the final analysis. In step two, a total of 13,979,187 imputed variants with MAF ≥ 0.001 and INFO ≥ 0.7 were analyzed.

GWAS with the PROTECT study

PROTECT is an online registry of ~25,000 UK-based individuals that aims to track cognitive health in older adults. Individuals were only considered eligible for inclusion in PROTECT if they were older than 50 years, had no previous dementia diagnosis and had internet access. Genetic data are available alongside phenotypic data for ~10,000 of the participants. These individuals were genotyped on the Illumina Infinium Global Screening Array and imputed on the 1000 Genomes reference panel⁸³ using the Michigan imputation server and genotype phasing using Eagle.

After filtering for self-reported European ancestry, valid PHQ-9 responses and previous diagnosis/prescription exclusions, 7,589 individuals remained for genetic quality control for step one of REGENIE. Genetic data in PROTECT had been quality-controlled previously before imputation to only retain individuals and variants with a call rate of >98%, Hardy–Weinberg equilibrium test P > 0.00001 and excluding unusual heterozygosity³⁵. Variants used in step one were down-sampled from the imputed data using a snplist from the Illumina Infinium Global Screening Array provided by the PROTECT investigators. Variants were retained if they had MAF > 1%. After mismatched sex and 4-means clustering ancestry exclusions, a total of 7,589 individuals (mean age [s.d.] = 61.96 [7.07]; % female = 75.13%) proceeded to step two. In step two, 9,388,534 imputed variants with MAF ≥ 0.001 and imputation INFO score ≥ 0.7 were analyzed.

GWAS summary statistics

An overview of additional summary statistics obtained for this study is provided in Table 2.

Table 2 An overview of previously conducted GWASs for depression and AD used in this study

Full size table

Clinical and broad depression

To examine potential differences in genetic overlap with AD between depression as a disorder compared to individual depression symptoms, summary statistics for two previously conducted GWASs of clinical and broad depression were obtained from the Psychiatric Genomics Consortium (PGC; https://pgc.unc.edu/for-researchers/download-results/). For clinical depression, we used a subsample of the MDD GWAS by Wray et al.⁹ that excluded samples from the UKB and 23andMe, and contained only individuals for whom case ascertainment was defined through structured diagnostic interview or electronic health records. For the broad definition depression GWAS, we used a subsample of the depression GWAS by Howard et al.¹⁰, which also excluded samples from 23andMe. In addition to clinical cases and controls used by Wray et al.⁹, this broad depression GWAS included individuals in the UKB for whom case–control ascertainment was based on self-reported responses to the questions ‘Have you ever seen a general practitioner for nerves, anxiety tension or depression?’ and ‘Have you ever seen a psychiatrist for nerves anxiety, tension or depression?’

Alzheimer’s disease

Summary statistics were obtained from six previously conducted AD GWASs: three with proxy + clinical, one with proxy-only and two with clinical-only case ascertainment. All three of the proxy + clinical AD GWASs (Bellenguez et al.¹⁵, Wightman et al.¹⁴ and Jansen et al.¹³) and the proxy-only AD GWAS (Marioni et al.³¹) used data from the UKB for proxy AD samples.

There are some key differences in the way these AD GWASs define proxy cases and controls. Bellenguez et al.¹⁵ define proxy cases/controls as a binary phenotype, whereby individuals reporting a parent with AD or dementia are considered cases and those reporting no parental history are considered controls. Wightman et al.¹⁴ and Jansen et al.¹³ instead define proxy cases/control as a continuous phenotype, summing the number of parents an individual has reported with dementia and down-weighting unaffected parents by their age (or age of death).

For the proxy-only Marioni et al.³¹ GWAS, summary statistics were obtained from a meta-analysis of paternal and maternal AD. Here, proxy phenotyping was based on the self-report of either maternal or paternal AD, including the parent’s age at the time of reporting/age of death as a covariate.

Summary statistics from a clinical-only subsample of the GWAS by Wightman et al. that excluded proxy cases/controls from the UKB³⁷ were obtained from the authors. Summary statistics for a final clinical-only AD GWAS were obtained from stage 1 of the GWAS by Kunkle and colleagues¹².

Summary statistic standardization

Summary statistics from all 40 depression symptom GWASs, the two depression GWASs and the six AD GWASs were standardized using MungeSumstats⁸⁴ in R version 4.2.1. Using dbSNP 141 and the BSgenome.Hsapiens.1000genomes.hs37d5 reference genome, missing rsIDs were corrected, duplicates and multi-allelic variants removed, effect alleles and the direction of their effects aligned to the reference genome, and variants filtered at an INFO score of ≥0.7 and MAF ≥ 0.01. The GLAD Study and Bellenguez et al.¹⁵ summary statistics were lifted over from GRCh38 to GRCh37.

SNP heritability

SNP heritability (h²_SNP) estimates were calculated using LDSC³⁹. Briefly, LDSC calculates h²_SNP by regressing the effect sizes from GWAS summary statistics on their LD score as computed in a reference panel—in this case HapMap3 variants contained within the European sample of 1000 Genomes Phase 3. Liability scale h²_SNP was calculated naïvely from the standardized depression GWAS and AD GWAS using a 15% and 5% population prevalence, respectively^9,14. Heritability z-scores were calculated for all phenotypes by dividing the h²_SNP estimates by their standard error.

GWAS meta-analysis of depression symptoms

To leverage the maximum genetic information available controlling for the sample overlap between the UKB-MHQ and UKB-EoP samples, the REGENIE output for each PHQ-9 phenotype from the UKB-EoP, GLAD Study and PROTECT were first subject to inverse variance weighted (IVW) meta-analysis using METAL⁸⁵. All available variants were included, for a total of 8,425,618 (N = 175,692). Multi-trait analysis of GWAS (MTAG)³⁸ v1.0.8 was then used to meta-analyze the METAL output with the UKB-MHQ sample. Although MTAG is commonly used for the joint genetic analysis of multiple traits or multiple measurements of the same trait, by assuming the heritability of included phenotypes are equal (--equal-h2) and their genetic correlation is one (--perfect-gencov), MTAG performs an IVW meta-analysis of the same measures of the same trait, accounting for sample overlap using the cross-trait intercept from LDSC^39,40. Heritability estimates for all samples, plus the METAL meta-analysis, are shown in Supplementary Table 22. Genetic correlations between the UKB-MHQ and METAL GWAS are provided in Supplementary Table 23. For greater detail on the IVW function of MTAG, see the online methods of the original MTAG paper³⁸. A total of 8,196,874 SNPs with MAF > 0.01 were available for MTAG analysis.

This MTAG function provides one set of summary statistics and two GWAS-equivalent sample sizes—one for each original sample included. A single, weighted GWAS-equivalent N was obtained for each PHQ-9-MTAG GWAS, using the following formula:

$${N\left({\rm{final}}\right)=\tfrac{\left({N}_{1}\left({\rm{pre}}\right)\times {N}_{1}\left({\rm{post}}\right)\right)+\left({N}_{2}\left({\rm{pre}}\right)\times {N}_{2}\left({\rm{post}}\right)\right)}{{N}_{1}\left({\rm{pre}}\right)+{N}_{2}({\rm{pre}})}}$$

where N₁ and N₂ represents the UKB-MHQ and METAL GWASs, respectively, pre is the mean sample size prior to inclusion in MTAG, and post is the GWAS-equivalent sample estimated by MTAG following analysis.

Genomic risk loci and gene annotation

The GWAS meta-analysis results were annotated using FUMA GWAS⁸⁶ v3.1.6a. Genome-wide significance was set at P ≤ 5 × 10⁻⁸. Lead variants at genomic risk loci were defined by clumping all variants correlated at r² > 0.1, 250 kb either side, performed using the European sample of the 1000 Genomes Phase 3 reference panel. Lead variants were mapped to genes within 10 kb using positional mapping and eQTLs from four brain (BrainSeq⁸⁷, PsychENCODE⁸⁸, CommonMind⁸⁹ and BRAINEAC⁹⁰) and five blood (BloodeQTL⁹¹, BIOS⁹², eQTLGen cis and trans⁹³, Twins UK⁹⁴ and xQTLServer⁹⁵) eQTL datasets, alongside all 54 tissue-type eQTLs from GTEx v8 (https://gtexportal.org/home/tissueSummaryPage).