Introduction

Type 2 diabetes mellitus (T2DM) and major depressive disorder (MDD) are two of the most prevalent chronic health conditions, both contributing significantly to the global disease burden [1, 2]. T2DM primarily affects glucose metabolism [3], whereas MDD is characterized by persistent mood disturbances and emotional dysregulation [4]. Despite these distinct clinical manifestations, accumulating evidence has revealed a robust bidirectional relationship between the two disorders. A recent large-scale epidemiological study has reported significant comorbidity [4], with depression increasing the risk of diabetes-related hospitalization by fivefold in a major prospective cohort [5]. Moreover, depression is associated with worse outcomes among individuals with T2DM, including more complications and higher mortality [6]. However, the underlying mechanisms linking T2DM and MDD remain poorly understood, particularly at the molecular and biological level. Elucidating these shared pathophysiological pathways is crucial, as the co-occurrence of T2DM and MDD not only exacerbates disease burden but also complicates clinical management and impairs treatment outcomes.

Exploring the structural brain changes associated with T2DM and MDD is key to uncovering the neurobiological links between these two diseases [7, 8]. Among the various neuroimaging markers available, gray matter volume (GMV) is particularly relevant due to its stable measurement ability [9] and critical role in supporting cognitive functions [10], which is typically impaired in these disease [11, 12]. Identifying the shared GMV changes between T2DM and MDD could provide important insights into their common neurobiological mechanisms. However, neuroimaging studies within each disease have reported mixed findings regarding GMV changes. For example, in T2DM, while some studies have observed significant GMV reductions in regions such as the temporal cortex [13, 14], others have reported no significant changes in any region [15, 16]. Similarly, in MDD, GMV reductions are frequently reported in the inferior temporal and fusiform regions [17, 18], yet other research has failed to detect significant changes in whole brain [19] or identified GMV abnormalities in other brain regions [20]. These inconsistencies likely arise due to variations in sample size, methodologies, and population characteristics. To resolve these discrepancies, neuroimaging meta-analysis serves as a powerful approach to synthesize findings across multiple studies, providing more robust and reliable results [21,22,23,24,25]. By focusing on GMV alterations, neuroimaging meta-analyses can offer a clearer understanding of the common brain structural changes associated with T2DM and MDD, laying the foundation for identifying shared neurobiological mechanisms.

Both T2DM and MDD are highly heritable conditions, with heritability estimates of over 25% for T2DM [26] and 30–50% for MDD [27], indicating that genetic factors play a significant role in their development and progression. Genome-wide association studies (GWAS) have identified multiple genetic variants associated with each disease [28, 29], and recent research has suggested a genetic correlation between T2DM and MDD [30]. This implies that shared genetic factors may contribute to the susceptibility of both diseases, potentially explaining their frequent co-occurrence. Despite these findings, the exact genetic mechanisms driving GMV changes in both diseases are still not well understood. Investigating the genetic factors underlying GMV alterations is critical for understanding the neurobiological pathways that connect T2DM and MDD. Transcriptome-neuroimaging association analysis integrates gene expression data with neuroimaging phenotypes [31,32,33], providing an effective method for exploring shared genetic mechanisms and identifying specific genes linked to GMV changes. By utilizing resources like the Allen Human Brain Atlas (AHBA) [34], researchers can map gene expression profiles in regions where GMV alterations are observed in T2DM and MDD, uncovering the genetic drivers that contribute to these structural brain changes and advancing our understanding of the common pathways involved in both diseases.

In this study, we aimed to investigate the shared molecular mechanisms underlying GMV changes in T2DM and MDD. We first conducted neuroimaging meta-analyses to identify GMV alterations specific to each disease and pinpoint the common changes occurring in both conditions. Following this, we utilized conjunctional false discovery rate (conjFDR) methods [35] to identify genes shared between the two diseases based on recent GWAS summary data for T2DM and MDD [28, 29]. Next, we performed transcriptome-neuroimaging association analyses with these shared genes, employing the AHBA to uncover the genetic drivers contributing to these structural brain changes. The identified shared genes were subsequently prioritized through expression-trait association analyses. Finally, knockout mouse models were employed to investigate the functional roles of the prioritized genes. The findings from this study not only enhance our understanding of the neurobiological mechanisms linking T2DM and MDD but also highlight potential genetic targets for future therapeutic interventions aimed at alleviating the cognitive and emotional impairments commonly associated with these diseases. An outline of the analytical framework is illustrated in Fig. 1.

Fig. 1: Workflow illustrating the integrative analysis pipeline.
Fig. 1: Workflow illustrating the integrative analysis pipeline.
Full size image

A Neuroimaging meta-analyses were conducted to identify GMV alterations in T2DM and MDD, and to determine the overlapping regions of GMV reduction between the two conditions. B Shared genes were identified using conjFDR analysis applied to large-scale GWAS summary statistics for T2DM and MDD. Gene expression profiles were extracted from the AHBA database to construct the gene expression matrix. C Transcriptome-neuroimaging association analysis combined voxel-wise Z-values of GMV changes with the expression matrix of shared genes to identify genes associated with GMV alterations in each disorder and their overlap. D Expression-trait association analyses were used to prioritize shared genes, and functional validation was conducted through phenotypic profiling of knockout mouse models. AHBA allen human brain atlas, conjFDR conjunctional false discovery rate, GMV gray matter volume, HC healthy control, MDD major depression disorder, T2DM type 2 diabetes mellitus.

Materials and methods

Search strategy and selection criteria

A systematic search was conducted in the PubMed, Web of Science, and Embase databases to retrieve studies published prior to September 2024 using the following search terms: “voxel-based morphometry” or “VBM”, and “gray matter volume” or “grey matter volume” or “GMV”, and “major depressive disorder” or “major depression” or “MDD” or “type 2 diabetes mellitus” and “T2DM”. To identify additional studies, we also manually searched the reference lists of the included articles. Studies were included if they met all of the following criteria: (1) published in peer-reviewed English journals; (2) employed the voxel-based morphometry (VBM) approach to analyze whole-brain GMV in patients with MDD or T2DM compared to healthy controls; (3) provided peak coordinates of GMV alterations along with the corresponding statistical values in Montreal Neurological Institute (MNI) or Talairach space, or reported null findings; and (4) included participants aged between 18 and 65 years. Studies were excluded if they met any of the following criteria: (1) non-original research, such as meta-analyses, editorials, or review articles; or (2) included participants with comorbidities involving other mental illnesses. In cases where patient groups overlapped with those of another study, the study with the larger sample size was retained. If an article reported multiple independent patient samples, these were treated as separate datasets. Our meta-analysis was conducted in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines [36], and the detailed study selection procedures are summarized in Fig. 2.

Fig. 2: Flowchart of study selection for the neuroimaging meta-analyses of T2DM and MDD.
Fig. 2: Flowchart of study selection for the neuroimaging meta-analyses of T2DM and MDD.
Full size image

A Study selection process for T2DM. B Study selection process for MDD. GMV gray matter volume, HC healthy control, MDD major depressive disorder, N number of studies, T2DM type 2 diabetes mellitus.

The quality of the included studies was evaluated using a 10-point checklist, which assessed several key factors, including the demographic and clinical characteristics of the subjects, the methods used for image acquisition and analysis, and the quality of the reported results and conclusions (Supplementary Table 1). Each criterion was scored as 1, 0.5, or 0, depending on whether it was fully, partially, or not met (Supplementary Tables 2, 3). Studies scoring above 9 were considered eligible and were included in subsequent analyses. Additionally, the following data were extracted from the included studies: demographic information (e.g., sample size, mean age, gender), clinical characteristics (e.g., illness duration, medication status), and methodological details (e.g., field strength, statistical threshold, scanning parameters). The peak coordinates and relevant statistics (e.g., t-values or their equivalents) were also recorded.

TMUGH T2DM dataset

A T2DM dataset collected at Tianjin Medical University General Hospital (TMUGH) was included, comprising 156 right-handed individuals with T2DM and 148 healthy controls matched for age, sex, and handedness. Its inclusion in the meta-analysis increased the sample size and statistical power for detecting GMV alterations associated with T2DM. Detailed preprocessing and statistical procedures are provided in the Supplementary Materials.

Voxel-wise meta-analysis

A meta-analysis was performed using the Seed-based d Mapping with Permutation of Subject Images software (SDM-PSI, version 6.22; https://www.sdmproject.com/) to examine abnormal GMV changes in patients with T2DM and MDD. For each study, peak coordinates and corresponding statistical values (t-values, or P- and z-values converted to t-values via the SDM online converter) were utilized alongside the t-map from the TMUGH dataset to reconstruct full-brain effect-size images using default settings (full anisotropy = 1, full-width at half-maximum = 20 mm, voxel size = 2 mm, gray matter mask). Key steps included (1) generating multiple imputations for study and subject images, (2) performing group analyses on the imputed subject images within each study, and (3) conducting meta-analyses of the imputed study images using a random-effects model, with Rubin’s rules applied to integrate the final meta-analysis results [37]. Age was included as a covariate to control for potential confounding effects across both patient and control groups in the meta-analysis, as T2DM patients are typically older than those with MDD. To control for family-wise error, a subject-based nonparametric permutation test (1000 permutations) was conducted with threshold-free cluster enhancement (TFCE), setting significance at FWE-TFCE P < 0.05 with a minimum cluster extent of 100 voxels. Finally, heterogeneity across studies was assessed using Cochran’s Q test and I² statistics, and Egger’s test was employed to evaluate publication bias in significant findings [38].

To uncover shared neurobiological mechanisms, we performed SDM-PSI’s multimodal analysis [39] to identify common GMV changes across T2DM and MDD. Following the multimodal analysis, we conducted meta-regression analyses to examine how demographic and clinical factors moderated GMV changes within each condition. Specifically, for T2DM, we investigated the impact of variables such as the percentage of female participants, BMI, HbA1c levels, illness duration, and the proportion of patients on medication. For MDD, we assessed the percentage of female patients, onset age, illness duration, Hamilton Depression Rating Scale (HDRS) score, and the proportion of patients on medication. To assess the robustness of meta-analytic findings and explore potential sources of heterogeneity in MDD, we conducted subgroup analyses based on four consistently reported study-level variables: MRI field strength (1.5 vs. 3.0 T), scanner manufacturer (GE, Siemens, Philips), smoothing kernel size (8 mm vs. others), and medication status (medicated vs. unmedicated). These parameters were selected due to their frequent reporting and limited variability across included MDD datasets, enabling meaningful stratification. In contrast, subgroup analysis was not feasible for T2DM, given the smaller number of eligible studies (n = 15) and the limited availability of stratified metadata such as medication information.

Global genetic correlation between T2DM and MDD

GWAS summary statistics for T2DM were obtained from the DIAGRAM consortium, based on a large multi-ancestry meta-analysis involving 180,834 individuals with T2DM and 1,159,055 controls, partially derived from the UK Biobank [28]. For this study, we used the European ancestry subset, as it provided the largest sample size, comprising 80,154 cases and 853,816 controls. Depression summary statistics were obtained from a meta-analysis of six datasets, involving 371,184 individuals with depression and 978,703 controls, all of European ancestry [29]. To prevent sample overlap, which could introduce bias in the conjFDR analysis, participants from the UK Biobank were excluded from the depression dataset, and samples from 23andMe were removed due to access restrictions, resulting in a refined GWAS subset with 166,773 cases and 507,679 controls. Genome-wide genetic correlation between T2DM and MDD was estimated using linkage disequilibrium score regression (LDSC; https://github.com/bulik/ldsc). Analyses were restricted to HapMap3 SNPs, and LD scores were derived from the European reference panel of the 1000 Genomes Project.

Shared genes between T2DM and MDD

Conditional FDR (condFDR) and conjFDR analyses were used to enhance discovery for each trait and identify shared genetic variants between T2DM and MDD, respectively. CondFDR, based on an empirical Bayesian framework, increases the power of GWAS by identifying significant loci and quantifying genetic overlap across complex traits. ConjFDR extends condFDR by prioritizing SNPs associated with a primary trait (e.g., T2DM) through conditional adjustment for a secondary trait (e.g., MDD), thereby leveraging the combined power of two GWAS to enhance variant identification [40]. For more detailed methodological descriptions, please refer to the Supplementary Materials. We set the significance threshold for condFDR analysis at 0.01 and conjFDR analyses at 0.05, following prior studies [41,42,43]. To minimize bias from complex linkage disequilibrium (LD) patterns, SNPs within the MHC (chr6: 25,119,106-33,854,733), 8p23.1 (chr8: 7,200,000-12,500,000), and MAPT (chr17: 40,000,000-47,000,000) were excluded before fitting the model [44]. To visualize the cross-trait genetic enrichment of T2DM and MDD, quantile-quantile (Q-Q) plots were generated at four increasingly P-value thresholds (P < 1, P < 0.1, P < 0.01, and P < 0.001), where leftward deflection from the expected line indicated cross-trait enrichment as the strength of association in the secondary trait increased.

To identify genes shared between T2DM and MDD, we used FUMA (https://fuma.ctglab.nl/) with the following steps. Independent significant SNPs were selected based on the conjFDR < 0.05 and were required to be independent of each other, with r2 < 0.6 [40, 45, 46]. Candidate SNPs were defined as those with conjFDR < 0.10 and LD r2 ≥ 0.6 with at least one independent significant SNP. These candidate SNPs were then mapped to protein-coding genes using three complementary mapping approaches: positional mapping, linking SNPs to genes within a 10 kb physical distance; eQTL mapping, associating SNPs with genes affected by allelic variation in expression; and chromatin interaction mapping, which uses 3D genome structure to link SNPs to both nearby and distant genes. Genes identified by any of these mapping methods were considered shared genes between T2DM and MDD.

To further investigate the genomic loci shared between T2DM and MDD and better understand the biological roles of candidate SNPs, we performed the following steps. First, we identified lead SNPs among the independent significant SNPs as those in LD with other significant SNPs at r2 < 0.1. For each lead SNP, a locus was established by including its associated independent SNPs and their related candidate SNP. Loci within 250 kb of each other were merged to form a single genomic locus, and LD information for this analysis was obtained from the European ancestry reference panel of the 1000 Genomes Project. Second, all candidate SNPs were annotated with combined annotation-dependent depletion (CADD) scores, RegulomeDB scores, and chromatin states. CADD scores assess the potential deleteriousness of each variant by integrating multiple annotations, such as evolutionary conservation and predicted functional impacts, with higher scores suggesting a greater likelihood of pathogenicity [47]. A CADD score threshold of 12.37 was applied to identify variants with potential pathogenic effects. RegulomeDB scores measure the regulatory potential of SNPs, with values ranging from 1–7; lower scores (e.g., 1a, 1b) represent stronger regulatory evidence, helping to pinpoint SNPs likely involved in gene regulation [48]. Chromatin states classify genomic regions based on epigenetic markers, identifying functional elements such as promoters, enhancers, and heterochromatic regions [49, 50].

Transcriptome-neuroimaging association analysis

Gene expression data were sourced from the AHBA database, a comprehensive repository that provides spatially mapped gene expression data across the human brain, derived from postmortem samples of six neurotypical adult donors [34] (Supplementary Table 4). This dataset encompasses over 20,000 genes, measured with approximately 58,000 probes across 3702 tissue samples from various brain regions. The abagen toolbox (https://www.github.com/netneurolab/abagen) [51] was used to extract and preprocess the gene expression data, following several key steps [52, 53]. First, gene probes were reannotated, and those that were unreliable or unmatched to genes were discarded [54]. An intensity-based filter was then applied, removing probes whose intensity was below background noise in at least 50% of tissue samples. The MNI coordinates of the tissue samples were aligned with the anatomical images using the alleninf package (https://github.com/chrisgorgo/alleninf). For each gene, the probe exhibiting the highest differential stability was selected as its representative. Gene expression values were subsequently normalized using a scaled robust sigmoid function, which was applied across genes for each sample and donor and across samples for each gene and donor. Additional normalization was performed within each distinct brain structure (cortex, subcortical/brain stem, and cerebellum) to adjust for differences in gene expression patterns among these regions. After preprocessing, the shared genes identified through conjFDR analysis for T2DM and MDD were extracted from the resulting gene expression matrix, and used for subsequent transcriptome-neuroimaging association analysis.

Using the unthresholded voxel-wise meta-analytic z-map representing GMV differences between patients with T2DM/MDD and healthy controls, we calculated the mean z-value within a 6 mm radius sphere centered at each tissue sample’s MNI coordinate to represent the GMV difference for that sample. Within each disease group, cross-sample correlations between GMV differences and gene expression profiles were assessed using Pearson’s correlation to identify genes whose expression levels correlated with GMV alterations in T2DM and MDD, respectively. To control for multiple comparisons, the Benjamini-Hochberg FDR method was applied (FDR P < 0.05). Genes associated with GMV alterations in both T2DM and MDD were identified by intersecting the results from the two separate cross-spatial correlation analyses, highlighting potential shared genetic factors underlying GMV changes in both conditions. To assess the significance of these shared genes, we conducted a permutation test (1000 iterations) using the Brain Surrogate Maps with Autocorrelated Spatial Heterogeneity (BrainSMASH) method [55], with significance set at Pperm < 0.05.

Prioritizing shared genes using expression-trait association analyses

To further refine the shared genes identified through the transcriptome-neuroimaging association analysis, we employed expression-trait association analyses to prioritize genes potentially relevant to GMV alterations in both T2DM and MDD. We used S-PrediXcan [56], a method that estimates genetically regulated gene expression in specific tissues and correlates it with complex traits, to conduct single-tissue expression-trait association analyses. This approach integrates GWAS summary data with pre-computed eQTL prediction models derived from individual brain tissues available in the Genotype-Tissue Expression (GTEx v8) dataset [57], including the amygdala, anterior cingulate cortex (BA24), caudate, cerebellar hemisphere, cerebellum, cortex, frontal cortex (BA9), hippocampus, hypothalamus, nucleus accumbens, putamen, cervical spinal cord C1, and substantia nigra. Next, we employed S-MultiXcan [58], an extension of S-PrediXcan, to combine results across tissues, increasing statistical power and identifying shared genetic effects across brain regions. Genes lacking significant brain eQTL associations were excluded, and significant expression-trait associations were identified using a Benjamini-Hochberg FDR-corrected threshold of P < 0.05.

Abnormal phenotypes in knockout mouse models of prioritized shared genes

To explore the functional relevance of shared genes between T2DM and MDD, we leveraged the Mouse Genome Informatics (MGI) database (https://www.informatics.jax.org/), a comprehensive international resource dedicated to laboratory mice. MGI database provides valuable information on gene mutations, knockout models, and associated phenotypic markers, enabling us to investigate the phenotypic consequences of gene loss in mice. Genes retrieved from in the MGI database were subsequently queried for human homologs using the Ensembl BioMart platform (https://mart.ensembl.org/) to ensure cross-species validity and translational relevance of our findings. Our analysis primarily focused on phenotypic changes observed in the nervous system and metabolic system, particularly those related to brain structure, function, or fasting circulating glucose level. By examining the abnormal behavioral and neurobiological phenotypes of the shared genes in knockout or mutant mouse model, we aimed to explore how the loss of these genes may affect behavior, such as activity levels, anxiety, or cognitive function, which could be relevant to both T2DM and MDD.

Results

Included studies in meta-analysis

Our systematic search and selection process, as illustrated in Fig. 2, initially identified 211 records for T2DM and 1348 records for MDD through database searches. After removing duplicates, 157 records for T2DM and 868 for MDD were screened by title and abstract, which led to the exclusion of 107 T2DM and 693 MDD records, leaving 50 T2DM and 175 MDD articles for eligibility assessment. After applying rigorous inclusion and exclusion criteria for full-text screening, 15 studies on T2DM and 50 on MDD were finally selected for meta-analysis, with all selected studies receiving a quality score above 9 (Supplementary Tables 2, 3). Among these, 2 T2DM studies [59, 60] and 6 MDD studies [18, 20, 61,62,63,64] included different patient subgroups, which we treated as independent datasets. Additionally, we incorporated our own TMUGH T2DM dataset, comprising 156 T2DM patients and 148 healthy controls. In total, our meta-analysis included 18 datasets for T2DM (860 patients and 725 controls) and 57 datasets for MDD (3867 cases and 3810 controls). Detailed demographic information, clinical characteristics, and methodological specifics for each dataset are provided in Supplementary Tables 5, 6.

GMV differences in meta-analysis

Compared to healthy controls, T2DM patients exhibited significant GMV reductions, primarily in the bilateral insula, bilateral median cingulate/paracingulate gyri (MCC), bilateral medial orbital part of superior frontal gyrus (SFGmO), and right superior temporal gyrus (STG). Additional reductions were observed in the crus I of left cerebellum, lobule IV/V of right cerebellum, left inferior occipital gyrus (IOG), inferior parietal gyrus (IPG), and left triangular and opercular parts of inferior frontal gyrus (IFGtriang/IFGoperc) (FWE-TFCE P < 0.05, Fig. 3A and Supplementary Table 7). In individuals with MDD, significant GMV decreases were observed predominantly in the bilateral SFGmO, bilateral IFGtriang, left IFGoperc, bilateral insula, left middle temporal gyrus (MTG), right middle frontal gyrus (MFG), right STG, bilateral MCC and the lobule VIII of right cerebellar hemisphere (FWE-TFCE P < 0.05, Fig. 3B and Supplementary Table 8). No regions with increased GMV were identified in either T2DM or MDD groups. The consistency of these findings was evaluated using Cochran’s Q test and I² statistics, both of which indicated no significant between-study heterogeneity, suggesting consistent GMV alterations across the studies. Furthermore, Egger’s test confirmed the absence of publication bias for the significant clusters (Supplementary Tables 7, 8).

Fig. 3: GMV reductions identified through meta-analyses of T2DM and MDD, as well as their overlap.
Fig. 3: GMV reductions identified through meta-analyses of T2DM and MDD, as well as their overlap.
Full size image

A GMV reductions in T2DM patients compared to healthy controls. B GMV reductions in MDD patients compared to healthy controls. C Shared GMV reductions observed in both T2DM and MDD. For overlapping regions, the SDM-Z value with the smaller absolute magnitude was used as the representative value. The color bar indicates SDM-Z values, where darker colors correspond to larger decreases in GMV. Statistical maps are thresholded using FWE-TFCE-corrected P < 0.05 and cluster extent ≥ 100 voxels. GMV gray matter volume, IFG inferior frontal gyrus, INS insula, IOG inferior occipital gyrus, L left, MCC median cingulate/paracingulate gyri, MDD major depressive disorder, MFG median frontal gyrus, MTG middle temporal gyrus, ORBsup orbital part of the superior frontal gyrus, R right, SDM seed-based d mapping, SFG superior frontal gyrus, STG superior temporal gyrus.

The multimodal analysis revealed shared GMV reductions in T2DM and MDD patients, specifically in the bilateral MCC/SFGmO, left IFGtriang/IFGoperc, and right insula/STG (Fig. 3C, Supplementary Table 9). These results point to potential common neurobiological mechanisms underlying both conditions. In T2DM group, meta-regression analysis showed that GMV in the left IFGtriang was negatively correlated with BMI, GMV in the bilateral MCC was negatively correlated with HbA1c levels, and GMV in the lobule IV-V of left cerebellum was negatively associated with disease duration. In contrast, for MDD, meta-regression analysis indicated that GMV in the left MCC was negatively associated with HDRS scores. Detailed meta-regression results are provided in the Supplementary Table 10. Subgroup analyses in MDD revealed that the core regions exhibiting gray matter volume reductions—such as the bilateral IFG, left MTG, and right insula—were broadly consistent across strata defined by field strength, scanner manufacturer, smoothing kernel size, and medication status. These convergent patterns support the stability of the primary meta-analytic findings across varying methodological conditions.

Genetic correlation and shared genes between T2DM and MDD

LDSC revealed a significant positive genetic correlation between T2DM and MDD, with an rg of 0.219 and a P value of 2.05 × 10−22, indicating substantial polygenic overlap. As shown in Fig. 4A, the conditional Q-Q plot for T2DM exhibited a leftward shift with decreasing P-values for MDD, and conversely, the Q-Q plot for MDD displayed a leftward shift with decreasing P-values for T2DM, confirming the polygenic overlap between the two traits. Using conjFDR, we identified 128 independent significant SNPs and 3635 candidate SNPs. Among the candidate SNPs, the majority were located in intronic (1830, 50.3%) and intergenic (1158, 31.9%) regions, with 26 (0.7%) SNPs located within ncRNA exons and 449 (12.3%) SNPs in intronic region. Additionally, approximately 4.68% (170/3635) of the candidate SNPs were indicated as potentially pathogenic, based on their CADD scores exceeding 12.37. Around 3.49% (127/3635) of the candidate SNPs exhibited a higher likelihood of regulatory functionality, as indicated by RegulomeDB scores ranging from 1b–2c, based on evidence from eQTLs and chromatin marks. Furthermore, 83.99% (3054/3635) of the candidate SNPs were located in regions with a predominantly open chromatin configuration, characterized by a chromatin state score of less than 8. The information of candidate SNPs is shown in Fig. 5A. Through three mapping methods (positional mapping, eQTL mapping, and chromatin interaction mapping), we identified 664 protein-coding genes associated with candidate SNPs within loci shared between T2DM and MDD, as shown in Supplementary Table 11.

Fig. 4: Results of the conjFDR analysis.
Fig. 4: Results of the conjFDR analysis.
Full size image

A Conditional Q-Q plots showing the polygenic overlap between T2DM and MDD. The left panel illustrates the nominal -log10(P) versus empirical -log10(q) values for T2DM conditioned on MDD, while the right panel shows the reverse (MDD conditioned on T2DM). B Manhattan plot depicting -log10(conjFDR) values for SNPs across chromosomes. The dotted horizontal line represents the threshold for significant shared associations (conjFDR < 0.05). Independent lead SNPs are highlighted in red. conjFDR conjunctional false discovery rate, CHR chromosome, Q-Q quantile-quantile, T2DM type 2 diabetes mellitus, MDD major depression disorder.

Fig. 5: Functional annotation of shared SNPs and expression-trait associations of prioritized genes between T2DM and MDD.
Fig. 5: Functional annotation of shared SNPs and expression-trait associations of prioritized genes between T2DM and MDD.
Full size image

A Distribution of functional categories, RegulomeDB scores, and minimum chromatin states for candidate SNPs in shared loci associated with T2DM and MDD. B Distribution of functional categories, RegulomeDB scores, and minimum chromatin states for lead SNPs in shared loci associated with T2DM and MDD. In panels A and B, the y-axis represents the functional categories used to classify candidate SNPs and lead SNPs separately, while the x-axis indicates log10-transformed number of candidate SNPs and lead SNPs within each category. C Heatmap summarizing the expression-trait association results of nine shared genes. Colors represent log10-transformed P-values, with darker shades indicating significant associations. ACC anterior cingulate cortex, FDR false discovery rate, CSC cervical spinal cord C1, MDD major depression disorder, NA not available, NAc nucleus accumbens, ncRNA_in intron-derived non-coding RNA, ncRNA_ex exon-derived non-coding RNA, T2DM type 2 diabetes mellitus, UTR3 3′ untranslated region, UTR5 5′ untranslated region.

Among the 128 independent SNPs, 80 lead SNPs were identified (Fig. 4B). Of these, 55% (44/80) were found in intronic regions, and 15% (12/80) were intronic SNPs identified as non-synonymous mutations. Four of 80 lead SNPs (rs325485, rs6953951, rs10837055, rs813043) had CADD greater than 12.37, indicating potential pathogenicity. Two SNPs (rs11629241 and rs8066513) exhibited a higher likelihood of regulatory function, as indicated by a RegulomeDB score of 2b. Additionally, 90% (72/80) of lead SNPs were located in regions characterized by a predominantly open chromatin configuration, with chromatin state scores of less than 8. Supplementary Table 12 and Fig. 5B provide detailed information on lead SNPs. Additionally, we discovered 73 distinct genomic loci, with the most significant locus at chr17:65788157-66098979 (lead SNP: rs8072225, conjFDR = 6.355 × 10−7). Detailed information on these loci is provided in Supplementary Table 13.

Identification of genes associated with GMV alterations in T2DM and MDD

After preprocessing the gene expression data from AHBA, 3235 samples and 15,633 genes were retained. The shared genes identified through conjFDR analysis were then extracted, resulting in a final gene expression matrix containing 3235 samples and 528 genes. Subsequently, transcriptome-neuroimaging association analysis was conducted to explore the relationship between gene expression and GMV alterations in both MDD and T2DM. These analyses identified spatial correlations between gene expression and GMV alterations, with 189 genes linked to T2DM, 286 genes associated with MDD, and 154 genes shared between both diseases (Pperm < 0.05). The details of result are shown in Supplementary Tables 1416. Based on these associations, the prioritizing expression-trait association analysis identified nine genes (FADS3, GIGYF2, KPNA2, LONRF2, MON1A, NEGR1, NICN1, PLXNB1, and SPPL3) as high-confidence genes associated with GMV alterations in both T2DM and MDD (Supplementary Table 17 and Fig. 5C).

Mouse knockout models of identified novel genes

To investigate the functional relevance of the prioritized shared genes between T2DM and MDD, we utilized the MGI resource to explore knockout mouse models and their associated phenotypic modifications. For example, among the prioritized genes, GIGYF2 and NEGR1 exhibited various abnormal phenotypes related to the neural system, including abnormal neuron number, motor neuron degeneration, and decreased brain weight, highlighting the potential roles these genes play in maintaining neural integrity and function. Additionally, three of the prioritized genes (GIGYF2, LONRF2, and MON1A) were linked to metabolic abnormalities such as increased fasting circulating glucose levels and impaired glucose tolerance, which align with known disturbances in glucose metabolism observed in T2DM. These metabolic alterations suggest that these genes may regulate both neural functions and glucose homeostasis. Furthermore, all these four genes were found to be direct homologs in humans, suggesting their potential importance in shared neurobiological and metabolic pathways underlying both T2DM and MDD. Detailed information of all prioritized shared genes, as well as their phenotypic effects in knockout mouse models, is provided in Supplementary Table 18.

Discussion

To our knowledge, this study is the first to investigate shared neuroimaging and molecular mechanisms between T2DM and MDD. We identified significant commonalities in brain structural changes, with GMV reductions observed in regions such as the bilateral MCC/ORBsup, left IFGtriang/IFGoperc, and right insula/STG, which are associated with emotional regulation, cognitive functions, and metabolic processes. Complementing these neuroimaging findings, genetic analyses identified shared genes between T2DM and MDD. Transcriptome analysis linked these genetic factors to brain structural alterations, offering insights into pathways through which genetic risk may drive both neural and metabolic dysfunction. Together, these findings deepen our understanding of the shared biological basis of T2DM and MDD and provide a foundation for future research on comorbidity mechanisms.

In this study, shared reductions in GMV were observed in T2DM and MDD patients, highlighting potential neurobiological links between these conditions. The insula plays a critical role in interoceptive awareness, integrating physiological signals to create a sense of bodily state, and in emotional regulation [65, 66], where it supports adaptive responses to stress and emotional stimuli. Dysfunction in this region leads to poor emotional regulation, disrupted physiological states, and destroyed working memory, manifesting as heightened affective symptoms and disruptions in somatic perception commonly seen in T2DM and MDD [67, 68]. Similarly, the STG, involved in auditory processing and emotional perception [69, 70], contributes to interpreting social and emotional cues [71]. Structural changes in this region could lead to impaired auditory processing and difficulties in social interactions, with MDD patients often exhibiting reduced ability in sensory processing and integration [72] and T2DM patients experiencing related deficits in communication and perception [73]. The IFG, particularly the triangular and opercular parts, supports higher-order cognitive functions, such as language processing, inhibitory control, and goal-directed behavior [74, 75]. Thus, reductions in IFG GMV may lead to cognitive rigidity and impairments in verbal reasoning. These effects may be further exacerbated by metabolic dysfunction in T2DM [76]. The MCC and SFGmO, which are involved in decision-making, emotional regulation, and stress processing, also show reductions in GMV [77, 78], indicating deficits in managing cognitive and affective responses. The MCC supports attentional control and conflict monitoring [79], crucial for coping with stress and maintaining emotional resilience [80], while the SFGmO facilitates decision-making and emotional evaluation [78], with dysfunction in these regions linked to maladaptive stress responses and impaired behavioral regulation in both conditions. Together, these findings suggest shared structural changes that underpin the cognitive, emotional, and physiological disruptions characteristic of T2DM and MDD.

The conjFDR analysis was used to detect the shared genetic architecture between MDD and T2DM. Stratified conditional Q-Q plots revealed SNP enrichment between MDD and T2DM, highlighting substantial polygenic overlap. Additionally, a previous study revealed there exists genetic correlation between T2DM and MDD [30]. In the present study, we revealed significant genetic overlap between T2DM and MDD, identifying 73 shared loci. Among these, the most significant shared locus, (chr17:65788157-66098979, lead SNP: rs8072225) was located within the intronic region of the BPTF gene, which encodes a transcription factor involved in chromatin remodeling and has been linked to neurodevelopmental disorders [81, 82] and linked with depression [29]. This finding highlights the potential role of transcriptional regulation in common genetic basis of T2DM and MDD. Additionally, most of the shared candidate SNPs were found in intronic (50.3%) and intergenic (31.9%) regions, with a smaller proportion (4.7%) classified as potentially pathogenic based on high CADD scores. Many of these SNPs exhibited regulatory functionality, supported by RegulomeDB scores and chromatin state annotations, indicating their likely influence on gene expression [48, 50]. By integrating positional, eQTL, and chromatin interaction mapping, we identified 664 shared protein-coding genes, further underscoring the molecular connections between these two conditions. These results suggest that regulatory regions and transcriptional control play crucial roles in mediating the genetic links between T2DM and MDD, offering new insights into the molecular mechanisms underlying their comorbidity.

Building upon these findings, transcriptome–neuroimaging and expression–trait association analyses were conducted to identify and prioritize shared genes contributing to GMV changes in T2DM and MDD. Among the nine prioritized genes, several are involved in both neural and metabolic pathways. NEGR1 is associated with adult hippocampal neurogenesis, affective regulation [83], and systemic metabolism improvement [84]. FADS3 is linked to lipid metabolism and colocalizes with genetic signals for psychiatric disorders [85]. GIGYF2, a synapse-enriched transcript with specific neuronal localization [86], may regulate insulin-related signaling [87]. LONRF2 and MON1A are involved in neuronal energy metabolism; LONRF2 contributes to motor neuron survival and is implicated in neurodegeneration with metabolic disruption [88], while MON1A plays a role in autophagosome maturation, with dysfunction implicated in both neurodegeneration and altered energy regulation [89]. PLXNB1 has been associated with mood disorders through its effects on neurite outgrowth and brain connectivity [90], although its metabolic role remains to be clarified. NICN1 is involved in HbA1c-related metabolic signaling [91], while SPPL3 is functionally implicated in protein modification, cellular metabolism, and immune-inflammatory responses, and has been linked to MDD risk [92].

The MGI database suggests that mouse gene knockout phenotypes resemble the clinical manifestations of human diseases. The mouse model exhibited abnormality in the nervous system, including neurodegeneration and reduced brain weight, which align with neuropathological changes reported in human MDD [93, 94] and T2DM cases [95, 96]. Additionally, the mice exhibited abnormal fasting circulating glucose levels, which is a hallmark feature of T2DM patients and also a recognized risk factor for MDD [97]. The shared genes represent promising targets for multi-mechanistic interventions addressing T2DM-MDD comorbidity. Their dual involvement in neurobiological and metabolic processes suggests that interventions simultaneously enhancing neuroplasticity while improving metabolic function may yield synergistic benefits. Notably, targeting convergent pathways such as neuroinflammation (SPPL3), autophagy (MON1A), and neuronal energy metabolism (LONRF2) may vulnerabilities shared by both disorders. Future research should explore combinatorial approaches, while considering patient stratification based on these molecular signatures.

Despite the findings of this study, several limitations should be acknowledged. First, the datasets used in the meta-analysis were cross-sectional, limiting the ability to infer causal relationships between GMV alterations and disease phenotypes in T2DM and MDD. Second, while the subgroup analyses provided additional insight into potential sources of variability, some subgroup comparisons were based on a limited number of datasets. Therefore, these findings should be interpreted with caution and considered exploratory in nature. Third, the meta-analysis included an unbalanced number of datasets for T2DM (18 datasets) and MDD (57 datasets), which may impact the statistical power of the results. The disparity in sample sizes could lead to less reliable comparisons of GMV alterations between the two conditions. Fourth, while age was included as a covariate to control for age differences between T2DM and MDD patients, this adjustment may not entirely remove the impact of age on GMV alterations. Fifth, although most genes have identifiable orthologs in the MGI database, caution should be exercised when interpreting these results due to inherent interspecies differences between mice and humans. Future studies utilizing parallel experiments in non-human primates or organoid models are needed to address this gap. Finally, the inclusion of studies with varying methodologies, imaging protocols, and quality could introduce heterogeneity in the results, potentially affecting the consistency and reliability of the findings. Although heterogeneity tests indicated no significant differences, subtle methodological variations may still influence the results and their interpretation.

This study reveals shared neuroimaging and molecular signatures between T2DM and MDD, highlighting potential common pathways contributing to both disorders. By integrating genetic, transcriptomic, and neuroimaging data, we identified significant brain structural changes and underlying genetic mechanisms shared between T2DM and MDD. The prioritization of shared genes and their functional relevance in knockout mouse models suggests that both metabolic and neurobiological mechanisms may contribute to these conditions. These findings not only enhance our understanding of the shared genetic markers and brain structural alterations associated with T2DM and MDD but also provide a foundation for future research into shared therapeutic targets.