Abstract
Antidepressant efficacy for major depressive disorder (MDD) remains limited, with the neural mechanisms underlying treatment response poorly understood. The default mode network (DMN), particularly the connectivity between the medial prefrontal cortex (mPFC) and posterior cingulate cortex (PCC), has been implicated in MDD pathophysiology and may be linked to treatment outcomes. However, its potential as a biomarker for antidepressant response has not been validated. Here, we investigate the relationship between DMN connectivity and antidepressant treatment response in MDD. Resting-state fMRI data from four large MDD cohorts (n = 4271) were analyzed using Granger causality to examine directional effective connectivity (EC) within the DMN. Linear mixed-effects models compared EC between recurrent MDD patients, first-episode drug-naïve patients, and healthy controls. We also examined associations between EC, medication use, illness duration, depressive symptoms, and treatment outcomes. Additionally, Support Vector Machine (SVM) classifiers and support vector regression (SVR) were trained using EC from mPFC to PCC to predict treatment response. Our results revealed that recurrent MDD patients exhibited significantly reduced EC from mPFC to PCC compared to healthy controls and first-episode patients, with this reduction correlating with antidepressant medication use and illness duration. Importantly, DMN connectivity was associated with treatment improvement rather than core depressive symptoms, including suicide, anhedonia, or emotional blunting. Crucially, EC from mPFC to PCC predicted antidepressant treatment response, and SVM classifiers demonstrated high predictive accuracy for therapeutic outcomes. In conclusion, reduced EC from mPFC to PCC may serve as a biomarker for antidepressant treatment response in MDD, offering insights into MDD neurobiology and supporting the clinical potential of DMN connectivity measures for guiding treatment decisions. The SAINT, Xijing_QG, and Xijing_KG datasets were approved by the Ethics Committee of the First Affiliated Hospital, Fourth Military Medical University (approval numbers: KY20202066-F-1, XJLL-KY20222111, and KY20222165-F-1, respectively) and registered with clinicaltrials.gov (identifiers: NCT 04653337, NCT 05577481, and NCT 05544071, respectively).
Similar content being viewed by others
Introduction
Major depressive disorder (MDD) is a devastating psychiatric affliction, ranking as the second leading cause of global disability, boasting a point prevalence that surpasses 4%1. Despite the widespread use of antidepressant medication2, its therapeutic efficacy is not entirely satisfactory with only a modest superiority over placebo (Cohen’s d of ~0.3)3. Moreover, ~50% of patients with MDD could be treatment-resistant depression4. However, the neural mechanisms that underlie antidepressant treatment remain elusive, and the lack of reliable biomarkers poses a hindrance to the effective prediction of treatment responses.
Default mode network (DMN) is thought to play a pivotal role in the pathophysiology of MDD. A plethora of evidence suggests a correlation between the Default Mode Network and the primary clinical symptom of rumination in major depression5,6. A recent study revealed a decrease in functional connectivity (FC) within the DMN when comparing 848 patients with MDD to 794 normal controls (NCs)7. Importantly, this effect was only statistically significant in individuals with recurrent MDD, as opposed to those with first-episode drug-naïve (FEDN) MDD. Additionally, fMRI-characterization of DMN could serve as a complement to the existing symptom-based diagnoses for MDD. By incorporating increased functional connectivity in the DMN, a support vector machine (SVM) classifier was able to effectively distinguish between patients with MDD and NCs, achieving an AUC of 90%8. Moreover, abnormal dynamic functional network connectivity estimated from the DMN has been shown to be predictive of symptom severity in MDD9.
Accumulating evidence shows that DMN might be closely associated with treatment response for MDD. The identified hyperconnectivity within the DMN served as a distinguishing factor between patients diagnosed with treatment-resistant MDD and those exhibiting treatment sensitivity10. Moreover, research findings have suggested that the observed hyperconnectivity within the DMN and the connections between the DMN and the executive control network (ECN) offered potential as indicators of successful treatment11. However, the validation of the DMN as a biomarker for antidepressant treatment is yet to be established.
The medial prefrontal–limbic circuitry is thought to be a key neural pathway underlying the therapeutic effects of antidepressant treatment12,13,14. Previous studies have demonstrated that antidepressants could modulate emotional processing, exerting neural effects within the prefrontal and limbic circuitry12. In addition, both patients and healthy controls exhibited increased activity in the medial prefrontal and core limbic regions in response to positive emotions, while these regions showed decreased activity to negative emotions following repeated administration of antidepressants13. The therapeutic effects of antidepressant treatment are mediated through “top-down” influences from the prefrontal cortex to the limbic regions15. Thus, we further focused on two key regions of the DMN, the medial prefrontal cortex (mPFC) and the posterior cingulate cortex (PCC)—one located in the prefrontal cortex and the other within the limbic system—to investigate the relationship between information flow from the mPFC to the PCC and antidepressant treatment, and to evaluate its potential as a biomarker.
In this study, we used an unprecedentedly large sample of 4133 (2142 MDD patients and 1991 NCs) to investigate abnormal effective connectivity (EC) patterns within DMN and the validation of these patterns as potential biomarkers for predicting responses to both pharmacological and rTMS treatments was carried out using four datasets. Finally, in this study, we found a new neuroimage biomarker (EC from the mPFC to the PCC within DMN) and demonstrate that (1) significantly reduced EC from mPFC to PCC was observed in recurrent MDD patients compared with NCs and FEDN MDD patients (Fig. 1D, E); (2) reduced EC from mPFC to PCC in recurrent MDD patients was associated with illness duration and medication effect (Fig. 2); (3) EC from mPFC to PCC is anticipated to be significantly reduced in patients with MDD who demonstrate improvement following pharmacological treatment (Fig. 3). (4) EC from mPFC to PCC is associated with antidepressant treatment improvement rather core depressive symptoms including suicide, anhedonia, or emotional blunting (Figs. 4 and 5); (5) EC from mPFC to PCC could reliably discriminate between patients who responded positively to antidepressant treatment and those who did not (Fig. 6); (6) EC from mPFC to PCC could effectively predict therapeutic outcomes following antidepressant treatment (Fig. 7). This new biomarker provides a way forward in treatment evaluation, relapse prediction and understanding the pathogenesis of MDD.
The average EC from mPFC to PCC was computed across 132 connections, as illustrated in (A). Violin plots depict the distribution of mean EC from mPFC to PCC differences among various groups: B MDD vs. NC; C First Episode Drug Naïve (FEDN) MDD vs. NC; D Recurrent MDD vs. NC; and E FEDN MDD vs. Recurrent MDD. It is noteworthy that, for each comparison, only sites with a sample size larger than 10 in each group were considered. The t values represent the statistics for these comparisons in Linear Mixed-Effects Model (LMM) analyses. *p < 0.05; **p < 0.01.
The violin figures show the distribution of mean EC from mPFC to PCC within DMN for FEDN MDD patients with long vs. short illness duration (A), for all MDD patients with long vs. short illness duration (B), and for FEDN MDD patients with vs. without medication usage (C). The t values represent the statistics for these comparisons in Linear Mixed-Effects Model (LMM) analyses. *p < 0.05.
The violin figures show the distribution of mean EC from mPFC to PCC within DMN for MDD patients with a positive medication response: pre- versus post-treatment (A), for MDD patients with a negative medication response: pre- versus post-treatment (B), for MDD patients with a positive rTMS response: pre- versus post-treatment (C), and for MDD patients with a negative rTMS response: pre- versus post-treatment (D). Here, responders are identified as individuals exhibiting a reduction of 50% or greater on the HAMD-17 scale following the specific antidepressant treatment (positive response), while non-responders are those who do not meet this threshold (negative response). *p < 0.05.
Sensitivity-specificity curves based on model predictions for predicting the response to all treatment (213 responders versus 102 non-responders), medication treatment (173 responders versus 56 non-responders) and rTMS treatment (40 responders versus 46 non-responders). The calculation of area under the curve (AUC) values was performed for each sensitivity-specificity curve.
Methods
Participants
This study included four data sets for major depression (mddrest, SAINT, Xijing_KG and Xijing_QG). All participants provided written informed consent at their local institution. Further study details are provided in the online supplement.
The mddrest data set collected by the REST-meta-MDD project7 from the Depression Imaging REsearch ConsorTium (DIRECT)16 in 2017, is currently the largest MDD dataset. Resting-state fMRI data from this data set were shared from twenty-five study cohorts in China. In this study, we used a sample of (n = 4133, 2142 MDD/1991 NCs) from both the first and second releases.
The SAINT dataset was approved by the Ethics Committee of the First Affiliated Hospital, Fourth Military Medical University (KY20202066-F-1) on October 21, 2020, and was registered with clinicaltrial.gov (identifier: NCT 04653337). All participants were recruited from the Department of Psychiatry at the First Affiliated Hospital, Air Force Military University, from January 2021 to October 2021. Thirty-two subjects received the Stanford Accelerated Intelligent Neuromodulation Therapy (SAINT)17. All treatments were delivered with a Black Dolphin Navigation Robot system (SmarPhin S-50, Solide Brain Control Medical Technology Co., Ltd, Xi’an, China). In a precise experimental protocol, three consecutive sessions of intermittent theta-burst stimulation (iTBS) were administered at 90% of the resting motor threshold (RMT) within a span of 9 min and 52 s. The subjects underwent a total of ten iTBS sessions, comprising 18,000 pulses, with a 50-min interval between each session. This daily regimen was sustained for five consecutive days, resulting in each patient receiving a cumulative total of 90,000 pulses throughout the entire treatment duration. According to exclusion and inclusion criteria, twenty-six treatment-resistant patients with major depression were enrolled in this study. Suicidal ideation severity was measured using the Chinese Version of the Beck Scale for Suicide Ideation (BSI-CV).
The Xijing_KG dataset was collected through Randomized Clinical Trials (RCTs). The dataset was approved by the Ethics Committee of the First Affiliated Hospital, Fourth Military Medical University (KY20222165-F-1) on May 24, 2023, and was registered with clinicaltrial.gov (identifier: NCT 05544071). All patients were recruited from the Psychosomatic Department of Xijing Hospital between March 2023 and April 2024. Inclusion criteria included (1) Diagnosis of MDD according to the criteria outlined in the Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition (DSM-V), and currently experiencing an episode; (2) A total score greater than 17 on the Hamilton Depression Rating Scale (HAMD-17) at screening and baseline; (3) A total score greater than 20 on the Snaith-Hamilton Pleasure Scale (SHAPS) at screening and baseline; (4) No use of any antidepressant medications for at least 2 weeks prior to screening. Exclusion criteria included (1) Substance use disorders, excluding nicotine and caffeine; (2) Current diagnosis of any psychiatric disorders other than MDD, Generalized Anxiety Disorder, Social Anxiety Disorder, Panic Disorder, Agoraphobia, or Specific Phobias; (3) History of schizophrenia or schizoaffective disorder, or current or past depressive episodes with psychotic symptoms; (4) Other psychiatric disorders with depressive symptoms as the primary manifestation, personality disorders, or intellectual disabilities; (5) Significant clinical abnormalities identified during screening that may affect participation in the study or compromise safety, or that could interfere with the interpretation of study results; (6) Any history deemed by the investigator to pose a risk to the subject or interfere with the interpretation of study results; (7) Participation in other drug clinical trials or device clinical trials within 1 month prior to enrollment; (8) Presence of metal or electronic devices in the head or skull; (9) History of epilepsy; (10) History of cardiovascular disease or cardiovascular events; (11) History of obsessive-compulsive disorder; (12) History of autism spectrum disorder; (13) Prior exposure to repetitive transcranial magnetic stimulation (rTMS); (14) Any other conditions deemed by the investigator as unsuitable for participation in the study.
Patients in Xijing_KG dataset enrolled in the study will receive a selective serotonin reuptake inhibitor (SSRI), sertraline hydrochloride, at a daily dose of 100–150 mg. During the initial treatment phase (days 1–3), patients will receive 25 mg/day, with an increase to 50 mg/day on days 4–7. In the absence of dose-limiting adverse events, the dosage will be titrated to 100 mg/day during the second week and subsequently adjusted within the range of 100–150 mg/day to achieve optimal clinical response. Thereafter, the dosage will be maintained as consistently as possible. The medication is administered once daily, at a fixed time in the morning, either on an empty stomach or following a meal. Clinical evaluations will be conducted at baseline, immediately following the final session of repetitive transcranial magnetic stimulation (rTMS), and at 15 and 30 days post-rTMS. Anhedonia severity in patients with major depressive disorder (MDD) will be assessed using the Snaith-Hamilton Pleasure Scale (SHAPS) and the Chinese Version of the Temporal Experience of Pleasure Scale (CV-TEPS). Changes in depressive symptoms will be evaluated using the Montgomery-Asberg Depression Rating Scale (MADRS) and the 17-item Hamilton Depression Rating Scale (HAMD-17). Suicidal ideation severity will be measured using the Chinese Version of the Beck Scale for Suicide Ideation (BSI-CV). Additionally, MRI scans will be performed at baseline and on the 15th day after treatment.
The Xijing_QG dataset was collected through Randomized Clinical Trials (RCTs). The dataset was approved by the Ethics Committee of the First Affiliated Hospital, Fourth Military Medical University (XJLL-KY20222111) on May 24, 2023, and was registered with clinicaltrial.gov (identifier: NCT 05577481). Patients were recruited from the Department of Psychosomatic Medicine, Xijing Hospital, between March 2023 and January 2024. Inclusion Criteria included (1) Adults aged 18–60 years, right-handed; (2) Diagnosis of MDD based on DSM-V criteria; (3) A total score >17 on the 17-item Hamilton Depression Rating Scale (HAMD-17); (4) A score ≥6 on the Chinese Version of the Beck Scale for Suicide Ideation (BSI-CV); (5) Normal findings in physical examination, medical history, vital signs, and routine laboratory tests, including blood, urine, and stool analyzes, liver and kidney function, electrolytes, electrocardiogram, and electroencephalogram; (6) Informed of the safety profile of repetitive transcranial magnetic stimulation (rTMS), expressed willingness to comply with the treatment regimen, and provided signed informed consent. Exclusion Criteria included (1) Age <18 or >60 years; (2) History of severe physical illness or depression secondary to psychoactive substances or non-dependent substances; (3) Presence of metallic or electronic implants, such as intracranial metallic objects, cochlear implants, pacemakers, stents, or other metallic foreign bodies; (4) Risk of seizures, including a history of brain disease, head trauma, alcohol abuse, abnormal electroencephalogram findings, MRI evidence of structural brain abnormalities, or a family history of epilepsy; (5) Requiring immediate intervention due to suicidal behavior or severe self-injury; (6) Presence of psychotic symptoms requiring the use of antipsychotic medications; (7) History of electroconvulsive therapy (ECT) within 2 months prior to enrollment; (8) Pregnant, breastfeeding, or planning to conceive during the study period. Participants were required to have received at least 6 weeks of treatment with SSRIs or SNRIs prior to enrollment. During the study, the type of antidepressant was maintained, and the dosage was kept as close as possible to the pre-enrollment regimen. Concomitant use of medications for somatic diseases was permitted, provided the types and doses of these medications remained unchanged throughout the study. The treatment was administered continuously for 15 days, with clinical scales and cognitive function assessments conducted at baseline, day 7, and day 15 (end of treatment) following treatment. Emotional blunting in patients with MDD was assessed using the Oxford Depression Questionnaire (ODQ). Depressive symptom changes were measured using the HAMD-17. MRI scans were performed at baseline and at the conclusion of the 15-day treatment period.
fMRI Preprocessing
Standard preprocessing of the mddrest was done at each site using the Data Processing Assistant for Resting-State fMRI (DPARSF, http://rfmri.org/DPARSF). Functional data of SAINT were preprocessed using the Graph Theoretical Network Analysis (GRETNA, https://www.nitrc.org/projects/gretna). Specifically, the initial ten volumes of each subject were discarded to account for potential instability in the signal. Slice-timing was performed to correct for temporal differences between slices. Then, realignment was conducted to correct for head motion during the acquisition by applying translation and rotation corrections to the fMRI images at different time points. The high-resolution T1-weighted images were co-registered with the functional images, and a segmentation process was performed to classify the brain into gray matter, white matter, and cerebrospinal fluid (CSF) compartments. The resulting deformation parameters from the T1-weighted images to the Montreal Neurological Institute (MNI) template was utilized to normalize the resting-state fMRI data to a standard space. Additionally, a Gaussian filter with a half maximum width of 6 mm was used to smooth the functional images. Each participant’s time series was band-pass filtered in the range of 0.01-0.1HZ. Finally, we regressed out the effects of head motion, white matter and CSF signals.
Effective connectivity analysis
After preprocessing, we extracted time series for Dosenbach 160 functional regions of interest (ROIs)18. We then defined 33 DMN ROIs as those overlapping with the DMN delineated by Yeo et al.19 and calculated effective connectivity within DMN ROIs using granger causality analysis (GCA)20. In accordance with prior research findings21,22, we have delineated the Default Mode Network (DMN) into four principal components (Fig. 1A). These encompass 12 ROIs situated in the medial prefrontal cortex (mPFC), 11 ROIs located in the posterior cingulate cortex (PCC), 6 ROIs in the left parietal cortex (LPC), and 4 ROIs in the right parietal cortex (RPC). The analysis was performed using the REST-GCA toolbox (https://rfmri.org/REST-GCA), the order is set to 1 according to the previous study23. Prior to performing Granger Causality Analysis (GCA), the stationarity of each regional time series was assessed using the Augmented Dickey-Fuller (ADF) test. Time series that failed to meet the stationarity requirement (p < 0.05) were considered non-stationary. These non-stationary series were then differenced (first-order differentiation) iteratively until stationarity was achieved. All subsequent GCA analyzes were conducted exclusively on the preprocessed, stationary time series. Mean effective connectivity (EC) from mPFC to PCC (averaged across 12 × 11 = 132 connections) was computed (Fig. 1A). Granger causality analysis was executed using MATLAB scripts.
Harmonization of site differences and covariates
To address site-specific artifacts and confounding effects, we employed the ComBat harmonization method24,25 to control for site differences and covariates in DMN EC. This approach enabled us to maintain biological variability while minimizing site-related variation. For each EC value, Combat model can be formulated as follows:
in which \({const}\) represents the constant term; \({X}^{T}\) is a design matrix for the covariates of interest (age, sex, head motion); \(\beta\) is the vector of coefficients associated with \(X\); \(\gamma\) represents additive site effects (location parameter), while \(\delta\) denotes multiplicative site effects (scale parameter).
Accordingly, the harmonized EC values were defined as:
where \(\hat{\delta }\) and \(\hat{\gamma }\) represent the empirical Bayesian estimates of \(\delta\) and \(\gamma\), respectively.
Group-level analysis
Linear mixed models (LMMs)26 were employed for between-group inference regarding EC from mPFC to PCC. This approach effectively controlled for confounding variables, including diagnosis, age, sex, education, head motion effects, and site-specific variations. The model was implemented using MATLAB’s fitlme command (https://www.mathworks.com/help/stats/fitlme.html), formulated as y ∼1 + Diagnosis + Age + Sex + Education + Motion + (1 | Site) + (Diagnosis | Site). This model provided t and p values for the fixed effect of Diagnosis. Cohen’s d effect size was subsequently computed as \(d=\,\frac{T({n}_{1}+{n}_{2})}{\sqrt{{df}}\sqrt{{n}_{1}{n}_{2}}}\)27.
In addition, we utilized LMM to compare 2142 MDD patients with 1991 NCs. Considering that several sites reported whether patients with MDD were in their first episode or recurrent, we compared 828 first-episode drug-naïve (FEDN) MDD patients with 812 corresponding normal controls (NCs) from nine sites and 310 recurrent MDD patients with 928 corresponding NCs from ten sites. Furthermore, we conducted a comparison between 516 FEDN MDD patients and 281 recurrent MDD patients from nine sites. In this particular analysis, the LMM model incorporated the replacement of the diagnosis variable with the FEDN or recurrent status to investigate specific distinctions between these subgroups. Due to the skewed distribution of illness duration, with the majority of cases being brief, we opted to contrast the terciles with the longest and shortest illness durations instead of using Diagnosis in the LMM. Additionally, to assess the impact of medication, we substituted Diagnosis with medication status (on/off, assessed at the time of the scan) in the LMM model. Finally, we used paired t test to compare the changes in EC between pre-treatment and post-treatment MDD patients.
Prediction of antidepressant response using EC form mPFC to PCC
Support Vector Machine (SVM)28 classifiers were employed to predict the response to antidepressant treatment using EC from mPFC to PCC. In other words, the aim was to distinguish between patients who responded positively to antidepressant treatment and those who did not. In this study, we used a total of 315 MDD patients including 213 patients who exhibited a positive response to antidepressant treatment (Medication: 173 responders; rTMS: 40 responders) and 102 patients who showed no response to antidepressant treatment (Medication: 56 non-responders; rTMS: 46 non-responders). Specifically, 161 MDD patients from mddrest dataset (Medication: 145 responders versus 16 non-responders), 26 MDD patients from SAINT dataset (rTMS: 20 responders versus 6 non-responders), 72 MDD patients from Xijing_KG dataset (Medication: 9 responders versus 24 non-responders; rTMS: 12 responders versus 27 non-responders) and 40 MDD patients from Xijing_QG dataset (Medication: 3 responders versus 16 non-responders; rTMS: 8 responders versus 13 non-responders). In the context of the EC from mPFC to PCC, comprising \(12\times 11=132\) connections, a meticulous approach was adopted to mitigate overfitting to noise in the data and prevent inflation of prediction performance. Thus, we employed the recursive feature elimination method29 to identify salient connections by recursively eliminating the least important ones based on model performance. This meticulous process yielded a selection of 38 connections for further analysis. Additionally, we applied regularization techniques (L1 regularization), during the feature selection process. Subsequently, the 38 identified connections were employed to train a Support Vector Machine (SVM) model utilizing the scikit-learn machine learning library30. In addition, we used the grid search function to generate candidate hyper-parameters, spanning from \({10}^{-5}\) to \({10}^{5}\) times the default values of C. Then C was set to 5. In addition, we utilized Support Vector Machine (SVM) with the ‘class_weight=balanced’ parameter, which adjusts the model to account for class imbalances during training. Finally, we implemented five-fold cross validation to assess performance and accuracy, area under the curve (AUC), sensitivity, specificity, f1-score, Matthews Correlation Coefficient (MCC)31 were used as evaluating indicators. To further ensure the reliability of the results, the entire feature selection and model training process was iterated 100 times, with the average performance serving as the final evaluation metric. To assess the statistical significance of the machine learning-based treatment prediction results, a nonparametric permutation test was conducted. Specifically, the treatment response labels were randomly shuffled across participants 1000 times, followed by the repeated application of fivefold cross-validation to generate a distribution of classification accuracies. The p-value was calculated as the proportion of the cross-validated accuracies from the permuted data that exceeded those obtained without permutation.
Prediction of relative symptom improvement for rTMS treatment using EC from mPFC to PCC
We developed an end-to-end machine-learning algorithm for predicting the medication and rTMS treatment outcome using EC from mPFC to PCC features. Specifically, support vector regression (SVR) model was used to predict clinical improvements after medication or rTMS treatment for 138 patients with MDD (Medication: 52 participants; rTMS: 86 participants). Clinical improvements were quantified by subtracting the post-treatment scores from the pre-treatment scores on the 17-item Hamilton Depression Rating Scale (HAMD-17)32. Recursive feature elimination method was used to reduce the dimension of mPFC->PCC EC features and 38 salient connections were retained according to performance of the model. In addition, we used the grid search function to generate candidate hyper-parameters, spanning from \({10}^{-5}\) to \({10}^{5}\) times the default values of C. Then C was set to 9. The performance of five-out cross validation was quantified using the Mean Absolute Error (mae), Root Mean Squared Error (rmse) and Pearson’s correlation coefficient.
Results
Participant characteristics
In this study, 4133 participants in the mddrest dataset, 26 MDD patients in the SAINT dataset, 72 MDD patients in the Xijing_KG dataset and 40 MDD patients in the Xijing_QG were enrolled. Tables 1–4 demonstrate the demographic and clinical characteristics of the participants.
Reduced EC from mPFC to PCC within DMN in recurrent MDD patients
Mean EC from mPFC to PCC was compared with 2930 MDD patients and 2445 NCs, but no significant change (t = 0.051, p = 0.960, d = 0.0015; Fig. 1B) was observed. On subgroup analyzes, no significant change in FEDN MDD patients compared with NCs (t = 0.371, p = 0.711, d = 0.0063; Fig. 1C), while reduced EC from mPFC to PCC was observed in recurrent MDD patients compared with NCs (t = −2.969, p = 0.003, d = −0.1416; Fig. 1D). In addition, recurrent MDD patients showed significantly lower EC than FEDN MDD patients (t = −2.248, p = 0.025, d = −0.1690; Fig. 1D).
DMN connectivity was associated with illness duration and medication status effect
The reduced EC observed from mPFC to PCC in recurrent MDD patients, as opposed to its absence in FEDN MDD patients, may result from the influence of illness duration or medication history. Significantly decreased EC from mPFC to PCC (t = −2.295, p = 0.022, d = −0.1809; Fig. 2A) was observed in the tercile with longest illness duration (≥24 mo, 1199 MDD patients from 29 sites) compared with the tercile with shortest illness duration (≤3 mo, 301 MDD patients from 15 sites). To reduce the impact of medication, we further examined the effect of illness duration in FEDN MDD patients. The tercile with longest illness duration (≥24 mo, 478 MDD patients from 23 sites) showed lower EC (t = −1.970, p = 0.049, d = −0.1998; Fig. 2B) than the tercile with shortest illness duration (≤3 mo, 272 MDD patients from 15 sites). Moreover, we conducted additional examinations to scrutinize the impact of medication. In particular, we compared individuals with FEDN MDDs (534 MDDs from 15 sites) with first-episode MDDs on medication (700 MDDs from 18 sites) and observed a significantly enhanced EC mPFC to PCC (t = 2.221, p = 0.027, d = 0.1434; Fig. 2C).
Antidepressant treatment effect
Considering the association between medication status and EC from mPFC to PCC, we further investigated the changes of EC following the antidepressant treatment including medication treatment and rTMS treatment on three external prospective treatment datasets. To gain a more accurate understanding of the mechanisms, we divided the dataset based on treatment response. Here, responders are identified as individuals exhibiting a reduction of 50% or greater on the HAMD-17 scale following the specific antidepressant treatment (positive response), while non-responders are those who do not meet this threshold (negative response). Significantly enhanced EC from mPFC to PCC (t = 2.128, p = 0.044, d = 0.9425; Fig. 3A) was observed in the pre-treatment patients with MDD compared with post-treatment patients with MDD, when patients with MDD showed positive response following medication treatment. Additionally, we further tested an interaction between time and responder to more directly assess whether EC changes are specific to responders. Specifically, we employed a two-sample t-test to compare the change in EC from mPFC to PCC, calculated by subtracting pre-treatment values from post-treatment values, between responders and non-responders. This analysis revealed no significant difference (t = −1.09), suggesting that the observed changes in EC are not significantly different between responders and non-responders.
Association of DMN connectivity with depressive symptom
The investigation delving into the relationship between EC from mPFC to PCC and HAMD scores underwent rigorous testing on REST-meta-MDD dataset (2369 MDD patients). Surprisingly, the analysis did not unveil a statistically significant correlation (Pearson’s r = 0.0069, p = 0.7359). When assessing the impact of symptom severity in FEDN MDD patients (n = 1022), the correlation failed to attain statistical significance (Pearson’s r = 0.0496, p = 0.1134). Conversely, among patients with recurrent MDD (n = 283), a significant correlation emerged (Pearson’s r = 0.1180, p = 0.0473).
Furthermore, we further investigated the relationship between pre- and post-treatment DMN connectivity (EC from mPFC to PCC) and depressive symptoms (as measured by BCI-CV, ODQ, SHAPS, and HAMD), as well as the association between pre-treatment DMN connectivity and treatment improvement (Fig. 5). Note that BCI-CV assesses suicidal ideation, ODQ evaluates emotional blunting, SHAPS measures anhedonia, and HAMD evaluates the severity of depression. We found that DMN connectivity, both pre- and post-treatment, was unrelated to depressive symptoms. However, as treatment progressed, pre-treatment DMN connectivity was able to predict changes in SHAPS (r = −0.31, p = 0.008) and ODQ scores (r = 0.34, p = 0.036).
DMN Connectivity characterizes medication response
We further used DMN EC and EC from mPFC to PCC for predicting the response to all antidepressant treatment (213 responders versus 102 non-responders), medication treatment (173 responders versus 56 non-responders) and rTMS treatment (40 responders versus 46 non-responders). Briefly, we first trained SVM classifiers during the fivefold cross-validation using the EC from mPFC to PCC features as input. To ensure result stability, the process was iterated 100 times, with the average outcome serving as the ultimate performance metric for the model. The classifier achieved an accuracy of 78.6% (AUC, 76.5%; sensitivity, 81.2%; specificity, 71.8%; f1, 84.8%; MCC, 49.3%; permutation test-validated using 1000 permutations, p < 0.001) for the all treatment, an accuracy of 87.8% (AUC, 86.9%; sensitivity, 88.5%; specificity, 85.2%; f1, 92.2%; MCC, 65.0%; permutation test-validated using 1000 permutations, p < 0.001) for the medication treatment and an accuracy of 83.8% (AUC, 84.3%; sensitivity, 85.3%; specificity, 83.3%; f1, 81.3%; MCC, 68.0%; permutation test-validated using 1000 permutations, p < 0.001) for the rTMS treatment. Figure 6 shows sensitivity-specificity curves based on model predictions using EC from mPFC to PCC. Confusion Matrices and calibration plots are provided in Supplementary Fig. S4.
Furthermore, we further compared our proposed biomarker with several established biomarkers, including: (1) whole-brain functional connectivity (FC), (2) default mode network (DMN) FC, (3) medial prefrontal cortex - posterior cingulate cortex (mPFC-PCC) FC, (4) graph-theoretical measures such as betweenness, degree, clustering coefficient, efficiency, and local efficiency across five network nodes, and (5) clinical predictors including age, sex, and pre-treatment Hamilton Depression Rating Scale (HAMD) scores. To ensure robustness, we employed a Support Vector Machine (SVM) for model training across all biomarkers. The comparison results, summarized in Table 5, demonstrate that the effective connectivity (EC) from the mPFC to the PCC yielded the highest classification accuracy, emphasizing the superior performance of our biomarker and its promising potential for clinical application.
Prediction of relative symptom improvement for rTMS using DMN connectivity
In addition to forecasting responses to medication treatment, we further employed EC from mPFC to PCC for predicting the therapeutic outcomes associated with medication and rTMS. We built prediction model using support vector regression method and model performance was tested using five-fold cross-validation (Fig. 7). The findings revealed a significant predictive capability of EC from mPFC to PCC for the observed changes in treatment scores during fivefold cross-validation (All treatment: Fig. 5A, Pearson’s r = 0.56, mae = 4.84, rmse = 6.36, p < 0.0001; Medication: Fig. 5B, Pearson’s r = 0.75, mae = 2.72, rmse = 4.24, p < 0.0001; rTMS: Fig. 5C, Pearson’s r = 0.72, mae = 4.07, rmse = 5.53, p < 0.0001).
Discussion
In this study, employing an exceptionally large sample size (n = 4133), we identified a substantial reduction in top-down EC within the DMN, specifically from mPFC to PCC, in patients with recurrent MDD compared to both NCs and those with FEDN MDDs. Additionally, the diminished EC from mPFC to PCC in recurrent MDD patients showed associations with being scanned while on antidepressant medication and the duration of illness. Importantly, EC from mPFC to PCC demonstrated a remarkable capacity to predict treatment outcomes for both antidepressant medication and rTMS treatment.
Generally, the DMN is frequently partitioned into an anterior sub-network, which is focused on the mPFC, and a posterior sub-network, centered around the PCC33. Along the cortical midline, mPFC and PCC demonstrated robust functional coherence with both sub-networks, suggesting a potential role as functional hubs facilitating information transfer between sub-networks34. EC stands apart from conventional functional connectivity as it goes beyond simply computing the correlation among time courses of interacting regions. Instead, it possesses the capability to infer causal influences from one region to another, illustrating the directional flow of signals, including top-down and bottom-up processes, within a brain network. Significantly reduced effective connection from mPFC to PCC observed in the recurrent MDD group may imply a diminished flow of information from the anterior sub-network to the posterior sub-network within the DMN. Due to both sub-networks contributing to specific processes associated with self-generated thought34, this top-down regulation within DMN could be linked to the predominant clinical symptom of rumination in major depression5,6. Furthermore, the anatomical connection between the PCC and mPFC through the cingulum bundle in brain structure35 prompts the inquiry into whether structural abnormalities are present in patients with recurrent MDD compared to NCs, necessitating further investigation.
Significantly reduced EC from mPFC to PCC in recurrent MDD was associated with illness duration and antidepressant medication treatment. Specifically, we verified that the tercile with the longest illness duration (≥24 months) exhibited lower EC from mPFC to PCC compared to the tercile with the shortest illness duration (≤3 months), and individuals with FEDN MDD who were scanned while on medication displayed decreased EC from mPFC to PCC compared with those without medication. These results are consistent with a previous study, where the EC from the right parietal cortex (RPC) to the PCC within DMN exhibited a significant decrease in patients with post-treatment MDD compared with NCs, after an eight-week treatment period36. Our identification of a medication-induced decrease in EC from mPFC to PCC implies that antidepressant medications could mitigate depressive symptoms by reducing the EC from mPFC to PCC. To minimize the influence of medication, we conducted additional analyzes to explore the impact of illness duration in FEDN MDD patients. Among FEDN MDD patients, those in the tercile with the longest illness duration (≥24 months) displayed decreased EC from mPFC to PCC in comparison to those in the tercile with the shortest illness duration (≤3 months). This result suggests that the prolonged impact of the illness may lead to a lower EC from mPFC to PCC. However, the identified medication effect and illness duration effect were observed in a retrospective cross-sectional sample, underscoring the need for validation through longitudinal designs integrating medication and disease follow-up.
Another significant finding of the present study is the capacity of EC from mPFC to PCC within DMN to reliably predict treatment responses to both medication and rTMS. Despite receiving optimal treatment for MDD, only 30% of patients achieve full recovery or remission. The remaining 70% either respond without achieving remission (around 20%) or show no response at all (50%)37, commonly termed as treatment-resistant depression (TRD). Individuals with Treatment-Resistant Depression (TRD) bear the highest direct and indirect medical costs among those with MDD38. Specifically, TRD individuals have twice the probability of hospitalization, incurring a cost exceeding six times the mean total cost for non-treatment-resistant depressed patients39. Hence, there is a pressing need for objective biomarkers to identify TRD and inform individualized treatment decisions. In this study, EC from mPFC to PCC reliably differentiated patients with positive responses to medication treatment from those who did not respond and predicted therapeutic outcomes following rTMS treatment among individuals with TRD. This suggests that the identified fMRI signature holds promise as a neuroimaging biomarker for guiding treatment choices in the context of antidepressant response. Furthermore, this might lead to avoidable morbidity and economic costs if individuals are transitioned to another intervention prematurely based on evidence indicating minimal expected benefit from antidepressant treatment using our fMRI signature.
Our findings contribute to the growing body of evidence surrounding the role of the default mode network (DMN) in brain function and its potential as a therapeutic target. While repetitive transcranial magnetic stimulation (rTMS) has been widely utilized in clinical practice, it has traditionally focused on the emotional network (sgACC) and cognitive control network (DLPFC). Despite the known pivotal role of the DMN in various cognitive and emotional processes, it has not yet been targeted for therapeutic intervention. Our study reveals a significant association between the DMN and treatment outcomes, providing strong evidence for the feasibility of DMN-targeted interventions. This novel approach could offer new avenues for therapeutic interventions, particularly for individuals with disorders involving DMN dysfunction. Moving forward, targeted rTMS stimulation within the DMN holds promise as a viable strategy for treating a range of neuropsychiatric conditions, warranting further investigation into its therapeutic potential.
The limitations of this study should be noted. First, our findings and biomarkers need validation in independent datasets. In the future, inclusion of UK Biobank MDD data and data from the ENIGMA-MDD consortium could be considered for analysis. Second, we only considered pharmacological treatment and rTMS, while other treatments with potentially distinct mechanisms of action, such as electroconvulsive therapy (ECT) or psychotherapy, should also be taken into consideration. Third, we employed fMRI signatures to distinguish patients exhibiting positive responses to medication treatment from non-responders, but we did not conduct a detailed investigation based on the specific types of medication. Finally, despite our efforts to harmonize these datasets, residual site-specific differences may persist, and some of these variations may not be fully accounted for or corrected by ComBat.
Data availability
Deidentified and anonymized data were contributed from studies approved by local Institutional Review Boards. All study subjects provided written informed consent at their local institution. Data of REST-meta-MDD I are available at: http://rfmri.org/REST-meta-MDD. Data of REST-meta-MDD II and SAINT used during the current study are available from the corresponding author upon reasonable request.
References
Ferrari, A. J. et al. Burden of depressive disorders by country, sex, age, and year: findings from the global burden of disease study 2010. PLoS Med. 10, e1001547 (2013).
Moore, T. J. & Mattison, D. R. Adult utilization of psychiatric drugs and differences by sex, age, and race. JAMA Intern. Med. 177, 274–275 (2017).
Khan, A. & Brown, W. A. Antidepressants versus placebo in major depression: an overview. World Psychiatry 14, 294–300 (2015).
Thomas, L. et al. Prevalence of treatment-resistant depression in primary care: cross-sectional data. Br. J. Gen. Pract. 63, e852–e858 (2013).
Hamilton, J. P., Farmer, M., Fogelman, P. & Gotlib, I. H. Depressive rumination, the default-mode network, and the dark matter of clinical neuroscience. Biol. psychiatry 78, 224–230 (2015).
Zhou, H.-X. et al. Rumination and the default mode network: meta-analysis of brain imaging studies and implications for depression. NeuroImage 206, 116287 (2020).
Yan, C.-G. et al. Reduced default mode network functional connectivity in patients with recurrent major depressive disorder. Proc. Natl Acad. Sci. USA 116, 9078–9083 (2019).
Scalabrini, A. et al. All roads lead to the default-mode network—global source of DMN abnormalities in major depressive disorder. Neuropsychopharmacology 45, 2058–2069 (2020).
Sendi, M. S. et al. Abnormal dynamic functional network connectivity estimated from default mode network predicts symptom severity in major depressive disorder. Brain Connect. 11, 838–849 (2021).
Dichter, G. S., Gibbs, D. & Smoski, M. J. A systematic review of relations between resting-state functional-MRI and treatment response in major depressive disorder. J. Affect. Disord. 172, 8–17 (2015).
Gerlach, A. R. et al. MRI predictors of pharmacotherapy response in major depressive disorder. NeuroImage 36, 103157 (2022).
Harmer, C. J., Goodwin, G. M. & Cowen, P. J. Why do antidepressants take so long to work? A cognitive neuropsychological model of antidepressant drug action. Br. J. Psychiatry 195, 102–108 (2009).
Ma, Y. Neuropsychological mechanism underlying antidepressant effect: a systematic meta-analysis. Mol. Psychiatry 20, 311–319 (2015).
Fu, C. H. et al. Neuroanatomical dimensions in medication-free individuals with major depressive disorder and treatment response to SSRI antidepressant medications or placebo. Nat. Ment. Health 2, 164–176 (2024).
Zheng, K. Z. et al. Incapacity to control emotion in major depression may arise from disrupted white matter integrity and OFC-amygdala inhibition. CNS Neurosci. Ther. 24, 1053–1062 (2018).
Chen, X. et al. The DIRECT consortium and the REST-meta-MDD project: towards neuroimaging biomarkers of major depressive disorder. Psychoradiology 2, 32–42 (2022).
Cole, E. J. et al. Stanford neuromodulation therapy (SNT): a double-blind randomized controlled trial. Am. J. Psychiatry 179, 132–141 (2022).
Dosenbach, N. U. et al. Prediction of individual brain maturity using fMRI. Science 329, 1358–1361 (2010).
Yeo, B.T. et al. The organization of the human cerebral cortex estimated by intrinsic functional connectivity. J Neurophysiol. 106, 1125-1165 (2011).
Deshpande, G., LaConte, S., James, G. A., Peltier, S. & Hu, X. Multivariate Granger causality analysis of fMRI data. Hum. Brain Mapp. 30, 1361–1373 (2009).
Buckner, R. L., Andrews-Hanna, J. R. & Schacter, D. L. The brain’s default network: anatomy, function, and relevance to disease. Ann. N. Y. Acad. Sci. 1124, 1–38 (2008).
Wirth, M. et al. Semantic memory involvement in the default mode network: a functional neuroimaging study using independent component analysis. NeuroImage 54, 3057–3066 (2011).
Zang, Z.-X., Yan, C.-G., Dong, Z.-Y., Huang, J. & Zang, Y.-F. Granger causality analysis implementation on MATLAB: a graphic user interface toolkit for fMRI data processing. J. Neurosci. Methods 203, 418–426 (2012).
Fortin, J.-P. et al. Harmonization of cortical thickness measurements across scanners and sites. NeuroImage 167, 104–120 (2018).
Fortin, J.-P. et al. Harmonization of multi-site diffusion tensor imaging data. NeuroImage 161, 149–170 (2017).
West, B. T., Welch, K. B. & Galecki, A. T. Linear Mixed Models: A Practical Guide Using Statistical Software (CRC Press, 2022).
Rosenthal, R. & Rosnow, R. L. Essentials of Behavioral Research: Methods and Data Analysis (Beth Mejia, 2008).
Suthaharan, S. & Suthaharan, S. Support Vector Machine. Machine Learning Models and Algorithms for Big Data Classification: Thinking with Examples for Effective Learning, 207–235 (Boston, MA: Springer US, 2016).
Lin, X. et al. A support vector machine-recursive feature elimination feature selection method based on artificial contrast variables and mutual information. J. Chromatogr. B 910, 149–155 (2012).
Abraham, A. et al. Machine learning for neuroimaging with scikit-learn. Front. Neuroinformatics 8, 14 (2014).
Chicco, D., Tötsch, N. & Jurman, G. The Matthews correlation coefficient (MCC) is more reliable than balanced accuracy, bookmaker informedness, and markedness in two-class confusion matrix evaluation. BioData Min. 14, 1–22 (2021).
Ballesteros, J. et al. Sensitivity to change, discriminative performance, and cutoff criteria to define remission for embedded short scales of the Hamilton depression rating scale (HAMD). J. Affect. Disord. 102, 93–99 (2007).
Andrews-Hanna, J. R., Reidler, J. S., Sepulcre, J., Poulin, R. & Buckner, R. L. Functional-anatomic fractionation of the brain’s default network. Neuron 65, 550–562 (2010).
Andrews-Hanna, J. R., Smallwood, J. & Spreng, R. N. The default network and self-generated thought: Component processes, dynamic control, and clinical relevance. Ann. N. Y. Acad. Sci. 1316, 29–52 (2014).
Schmahmann, J. D. et al. Association fibre pathways of the brain: parallel observations from diffusion spectrum imaging and autoradiography. Brain 130, 630–653 (2007).
Li, L. et al. Abnormal resting state effective connectivity within the default mode network in major depressive disorder: a spectral dynamic causal modeling study. Brain Behav. 7, e00732 (2017).
Trivedi, M. H. et al. Evaluation of outcomes with citalopram for depression using measurement-based care in STAR* D: implications for clinical practice. Am. J. Psychiatry 163, 28–40 (2006).
Kim, E., Bagalman, J. & Goetzel, R. Cost burden of treatment resistance in patients with depression. Am. J. Manag. Care 16, 370–377 (2010).
Crown, W. H. et al. The impact of treatment-resistant depression on health care utilization and costs. J. Clin. Psychiatry 63, 963–971 (2002).
Acknowledgements
This work was supported by the National Natural Science Foundation of China (62436005, 32541016, 62473303), the Noncommunicable Chronic Diseases–National Science and Technology Major Project (2024ZD0527800), and the Key Research and Development Program of Shaanxi (2023-YBGY-011).
Author information
Authors and Affiliations
Consortia
Contributions
K.Z.: Conceptualization, Software, Methodology, Writing (original draft). L.C.: Conceptualization, Writing (review). H.W.: Data curation, Writing (review). B.L.: Conceptualization, Writing (review & editing), Supervision. B.C.: Conceptualization, Writing (review & editing), Supervision.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Zheng, K., Chen, L., Wang, H. et al. Beyond depression symptoms: the default mode network as a predictor of antidepressant response. npj Mental Health Res 5, 2 (2026). https://doi.org/10.1038/s44184-025-00182-2
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s44184-025-00182-2









