Introduction

Schizophrenia (SCZ) is a neurological disorder that frequently manifests as both positive and negative symptoms in addition to aberrant cognitive functioning. SCZ usually develops in late adolescence or early adulthood, and males are more likely to develop SCZ than females1,2,3. For patients with SCZ, early diagnosis and treatment can help improve clinical symptoms, reduce illness suffering, improve quality of life, and extend life expectancy4. However, the current diagnosis of SCZ relies primarily on clinical assessment, which is subjective and may lead to misdiagnosis or delayed intervention. Given these limitations, there is an urgent need to identify objective biomarkers to improve diagnostic accuracy and facilitate treatment.

Although the precise cause of SCZ remains unknown, abnormalities in neural signaling pathways, such as abnormal dopamine system, have been accepted as a major contribution to SCZ symptoms5,6,7. Mood, cognition, sleep and other physiological and behavioral features are influenced by the 5-hydroxytryptamine (5-HT) regulation in the central nervous system8,9,10. The glutamate hypothesis proposes that SCZ may be linked to abnormal function of the glutamate system, a key neurotransmitter involved in learning, emotion regulation and cognitive processes11,12,13.

SCZ is a central nervous system disorder and brain tissue samples are not easily available. Therefore, peripheral blood mononuclear cells (PBMCs) can be a useful research vehicle for SCZ because they are easily available and can reveal some molecular changes in the central nervous system14,15. Approximately 19–22% of the transcriptome was coexpressed between peripheral blood and brain tissue16. Another study also found that some genes showed differential expression in SCZ patients’ peripheral blood cells and brain tissue17. The expression patterns of pathways linked to phospholipid metabolism, ribosomal signaling and energy metabolism are similar in SCZ patients’ brain tissue and PBMCs18. In addition, study in Parkinson’s disease has demonstrated that transcriptomic profiles in postmortem brains can also be detected in peripheral blood, with significant correlations to clinical symptoms19. These findings suggest that molecular alterations in peripheral blood may partially reflect central nervous system changes, and that identifying the association between peripheral gene expression profiles and SCZ progression could help elucidate underlying mechanisms and potential clinical applications.

In this study, we validated the association of differentially expressed mRNAs of neural signaling pathways in peripheral blood leukocytes screened by RNA-Sequencing analysis and evaluated the diagnostic values as biomarkers for SCZ.

Methods

Subjects

This case-control study for validation included a total of 217 cases with SCZ and 217 controls. All patients with SCZ were recruited from the Huai’an No. 3 People’s Hospital, Jiangsu Province. The controls were selected from the cohort which employed a multistage cluster random sampling method to select residents aged ≥18 years old in Siyang county adjacent to Huai’an city, and matched by age (difference ≤2 years) and gender. All SCZ patients were experiencing an acute episode and were not receiving antipsychotic medication. Two independent clinicians used the International Classification of Disease, Tenth Revision (ICD-10) as the reference for the SCZ’s diagnostic criteria.

The study complies completely with the Declaration of Helsinki. The Institutional Review Board of Nanjing Medical University in China gave its approval to the protocols and consent forms (NJMUER201600334, NJMUER2019929). The informed consent form was signed by each participant or their legal guardians.

Data collection

The data collected from the study population included general information: age, gender, diagnosis type, smoking and drinking status. SCZ family history data were collected through interviews with participants or their first-degree relatives.

Trained psychiatrists assessed the patients based on the Positive and Negative Syndrome Scale (PANSS), including its three subscales: the positive subscale, the negative subscale and the general psychopathology subscale, after admission and before treatment.

RNA sequencing and mRNAs selection

There were 9 SCZ patients and 20 controls in the discovery set. We performed whole transcriptome RNA sequencing by Illumina Novaseq6000 platform. Differentially expressed mRNAs at |log2FC | > 1 and P < 0.05 were used as the basis for an enrichment analysis using the Kyoto Encyclopedia of Genes and Genomes (KEGG). The primers were amplified by primer gradient experiments and verified using Sanger sequencing. The specific primer validation process is shown in Supplemental Materials.

RNA extraction and mRNAs expression detection

Over 8 hours of fasting of participants, venous blood was collected using an anticoagulant tube with ethylene diamine tetraacetic acid (EDTA)-k2, and white blood cells were rapidly separated and mixed with human peripheral blood RNA preservation solution (Eaglink, EGEN2026, China). The experimental procedures and the 2−ΔΔCT calculation method are provided in Supplemental Materials. Relative expression levels of genes were determined using this method.

Statistical analysis

All variables had descriptive statistics generated. The data were reported using the median and interquartile range for continuous variables that did not fit the normal distribution requirements and the mean ± standard deviation for those that did. Continuous variables were compared between cases and controls using the Mann-Whitney U test for non-normally distributed data and the t-test for normally distributed data. The Chi-square (χ2) test was used to compare differences between cases and controls for categorical variables. Fold change (FC) is the ratio of gene expression in the cases to gene expression in the controls, and is used to measure the difference in gene expression between cases and controls. Stratified analyses were performed for age and gender, and heterogeneity tests were performed using generalized linear models. Subgroups were analyzed by diagnosis type and the median PANSS total score. Spearman rank correlation was used to examine the relationships between gene expression levels and the PANSS total score, the positive scale score (P Score), the negative scale score (N Score) and the general psychopathology scale score (G Score).

The dose-response relationship between gene expression levels and the risk of SCZ was investigated using restricted cubic spline (RCS) regression, corrected for extremes by median ± 3-fold interquartile range, adjusted for age and gender, and using the median of gene expression as the reference point, with the number of nodes at four. Genes that were differentially expressed between cases and controls or had a non-linear association with the SCZ were included in the gene expression score (GES). Gene expression was grouped into quartiles (Q1, Q2, Q3, Q4) and included in logistic regression models, and odds ratios (ORs) and 95% confidence intervals (CIs) were calculated for gene expression and SCZ. The GES was calculated for each subject, its formula is: \({\rm{GES}}={\sum }_{i=1}^{n}(\beta i* {xi})\), βi is the logistic regression coefficient for each gene and \({xi}\) is the expression of each gene. Receiver operating characteristic (ROC) curve were constructed to assess the area under curve (AUC) of gene expression. Traditional risk factors (TRF) include age, gender and SCZ family history. Gene expression level enhancement was assessed by the Delong test, net reclassification index (NRI) and integrated discrimination improvement (IDI).

To verify the robustness of the findings, two sensitivity analyses were performed in this study: (1) excluding acute schizophrenia-like psychotic disorder, and (2) excluding both acute schizophrenia-like psychotic disorder and their matched controls. The stability of the diagnostic value of SCZ was assessed by comparing the changes in differentially expressed genes in the two sensitivity analyses.

All statistical analyses were performed in R software (version 4.3.2). A two-tailed P < 0.05 was considered significant.

Ethics approval and consent to participate

The study complies completely with the Declaration of Helsinki. The Institutional Review Board of Nanjing Medical University in China gave its approval to the protocol and consent forms (NJMUER201600334, NJMUER2019929).

Results

Bioinformatics analysis

A total of 1205 mRNAs showed differential expression (P < 0.05, |log2FC | >1), of which 623 genes showed up-regulation and 582 genes showed down-regulation. Based on the results of differentially expressed mRNAs, KEGG enrichment analysis screened 20 mRNAs in three neural signaling pathways, including glutamatergic synapses, dopaminergic synapses and 5-hydroxytryptaminergic synapses. A total of 8 genes, DUSP1, GNG10, GNG7, PRKACA, CREB5, PPP3R1, PPP1CB and MAPK13, were validated by primer gradient experiments and Sanger sequencing. Primer sequences are shown in Table S1.

Characteristics of study population

The characteristics of the study participants are summarized in Table 1. Briefly, cases and controls were well-matched on age and gender. The SCZ case group had lower proportions of smoking and drinking. Four clinical diagnosis types including paranoid SCZ (59.5%), undifferentiated SCZ (22.1%), acute schizophrenia-like psychotic disorder (17.5%) and simple SCZ (0.9%) were enrolled in this study.

Table 1 Demographic and clinical characteristics between SCZ cases and controls.

Comparison of gene expression levels between SCZ cases and controls

Compared with the controls, SCZ cases displayed elevated expression levels of CREB5 (FC = 1.260, P = 0.001), PPP3R1 (FC = 1.434, P < 0.001) and PPP1CB (FC = 1.319, P < 0.001) (Table 2), whereas the expression levels of DUSP1, GNG10, GNG7, PRKACA and MAPK13 did not differ between SCZ cases and controls (P > 0.05; Table 2).

Table 2 Comparison of mRNAs expression level between SCZ cases and controls.

Subgroups analysis of comparing mRNAs expression level between SCZ cases and controls by age, gender, diagnosis type and PANSS score

PPP3R1 expression level of SCZ cases was higher than that of controls both in subgroups aged ≤35 years (FC = 1.456, P = 0.001) and aged >35 years (FC = 1.404, P < 0.001). CREB5 (FC = 1.289, P = 0.001) and PPP1CB (FC = 1.344, P < 0.001) expression levels were higher only in SCZ cases aged >35 years, while MAPK13 expression level of SCZ cases was lower than controls (FC = 0.776, P = 0.020) in the subgroup aged ≤35 years (Table S2).

Both in male and female SCZ cases had higher expression levels of PPP3R1 and PPP1CB (FC > 1, P < 0.05; Table S3) than the controls, while in male SCZ cases had higher levels of CREB5 expression level than controls (FC = 1.335, P = 0.002). There is no difference in other mRNAs expression levels between SCZ cases and controls, regardless of gender (P > 0.05).

Compared with the controls, paranoid SCZ cases displayed increased expression levels of PPP3R1 (FC = 1.503, P < 0.001), PPP1CB (FC = 1.457, P < 0.001) and CREB5 (FC = 1.287, P = 0.001), undifferentiated SCZ cases displayed increased expression levels of PPP3R1 (FC = 1.275, P < 0.01) and PPP1CB (FC = 1.321, P < 0.01) but decreased expression level of DUSP1 (FC = 0.702, P < 0.05), and acute schizophrenia-like psychotic disorder cases displayed a increased PPP3R1 expression level (FC = 1.431, P < 0.01) but decreased expression level of MAPK13 (FC = 0.748, P < 0.05), respectively (Table 3).

Table 3 Comparison of mRNAs expression level between SCZ cases of different diagnosis type and controls.

Both in the PANSS < 85 and the PANSS ≥ 85 subgroups, PPP3R1 and PPP1CB expression levels were higher than controls (FC > 1, P < 0.01; Table S4). SCZ cases displayed increased expression levels of PRKACA and CREB5 but decreased expression level of GNG10 than controls only in the PANSS ≥ 85 group (PRKACA: FC = 1.140, P = 0.018; CREB5: FC = 1.755, P < 0.001; GNG10: FC = 0.682, P = 0.041; Table S4).

None of the above differences in gene expression was heterogeneous between subgroups of age and gender (P > 0.05, Table S2, S3).

Association of gene expression with SCZ risk

The expression levels of seven genes DUSP1, GNG10, GNG7, PRKACA, CREB5, PPP3R1 and PPP1CB showed “U” shaped dose-response relationships with SCZ (Poverall < 0.05, Pnonlinear < 0.05; Fig. 1).

Fig. 1: Expression level of mRNAs and the risk of SCZ.
figure 1

Restrict cubic spline regression analysis of the association of mRNAs expression level and the risk of SCZ with odds ratio (OR) and 95% confidence interval (CI) after adjustment for age and gender. A DUSP1 expression showed nonlinear relationships with SCZ, P for overall = 0.020, P for nonlinear = 0.011. B GNG10 expression showed nonlinear relationships with SCZ, P for overall < 0.001, P for nonlinear < 0.001. C GNG7 expression showed nonlinear relationships with SCZ, P for overall = 0.002, P for nonlinear = 0.001. D PRKACA expression showed nonlinear relationships with SCZ, P for overall = 0.024, P for nonlinear = 0.018. E CREB5 expression showed nonlinear relationships with SCZ, P for overall < 0.001, P for nonlinear = 0.008. F PPP3R1 expression showed nonlinear relationships with SCZ, P for overall < 0.001, P for nonlinear < 0.001. G PPP1CB expression showed nonlinear relationships with SCZ, P for overall < 0.001, P for nonlinear = 0.005. H MAPK13 expression didn’t show linear or nonlinear relationships with SCZ, P for overall = 0.247, P for nonlinear = 0.761.

Diagnostic value of mRNAs for SCZ

The AUC (95% CI) was 0.602 (0.549-0.655) for the TRF model (Fig. 2 and Table S5). Incorporating GES including DUSP1, GNG10, GNG7, PRKACA, CREB5, PPP3R1 and PPP1CB to the TRF, TRF + GES model significantly improved the AUC to 0.743 (95% CI: 0.697-0.789; Table S5) with P less than 0.001 by Delong test. The NRI was 0.212 (95% CI: 0.116–0.309; P < 0.001) and the IDI was 0.083 (95% CI: 0.058–0.107; P < 0.001).

Fig. 2: The ROC curve of GES and TRF for SCZ diagnosis.
figure 2

Traditional risk factor (TRF) model including age, gender and family history of SCZ. Gene expression score (GES) model including the expression levels of DUSP1, GNG10, GNG7, PRKACA, CREB5, PPP3R1 and PPP1CB. The area under curve (AUC) of TRF was 0.602. The AUC of TRF+GES was 0.743.

Correlation between gene expression and PANSS score

GNG10 expression showed weak negative correlations with P score (r = -0.219, P = 0.009), G score (r = -0.188, P = 0.027) and PANSS total score (r = -0.200, P = 0.018), and PPP1CB expression showed weak positive correlations with P score (r = 0.269, P = 0.001) and PANSS total score (r = 0.167, P = 0.048) (Table S6).

Sensitivity analysis

The expression levels of CREB5, PPP3R1, and PPP1CB were still higher in SCZ cases than in controls (FC > 1, P < 0.05), after excluding acute schizophrenia-like psychotic disorder cases (Table S7) and further excluding the paired controls (Table S8).

The nonlinear associations of mRNAs expression levels of DUSP1, GNG10, GNG7, CREB5, PPP3R1 and PPP1CB except PRKACA persisted (Poverall < 0.05, Pnonlinear < 0.05) whether excluding acute schizophrenia-like psychotic disorder cases alone and further excluding their paired controls (Figures S1 and S2).

Furthermore, the TRF + GES model maintained stable discriminatory power (AUC > 0.7, P < 0.001) for SCZ after excluding acute schizophrenia-like psychotic disorder cases and further excluding the corresponding paired controls (Tables S9 and S10).

Discussion

Recently, altered mRNAs expression levels in peripheral blood were observed in patients with SCZ20,21, while the role of neural signaling pathways in the pathophysiologic mechanisms of SCZ is still being explored. This study investigated eight differentially expressed mRNAs in neural signaling pathways identified by RNA-sequencing screening and validated CREB5, PPP3R1 and PPP1CB for SCZ and particularly, higher DUSP1 and MAPK13 expression levels in patients with undifferentiated SCZ and acute schizophrenia-like psychotic disorder respectively. In addition, CREB5, PPP3R1, PPP1CB, DUSP1, GNG10, GNG7 and PRKACA, showed “U” shaped dose-response relationships with SCZ.

The cAMP-response element-binding (CREB) protein family, which includes CREB5, is crucial for learning memory, neuronal cell growth, synaptic plasticity, and gene expression downstream of cAMP signaling22,23. Numerous investigations have demonstrated that SCZ patients exhibit reduced CREB protein and mRNAs expression level24,25. Even though CREB5 expression differed significantly between cases and controls, our research revealed a more complex nonlinear relationship between CREB5 expression level and SCZ rather than a straight forward linear one, which may indicate a more intricate mechanism between this gene and SCZ.

PPP3R1, a signaling pathway member of calmodulin phosphatase. The lack of calmodulin phosphatase may cause changes in synapse function, attention problems and cognitive change26,27, as well as a range of behavioral impairments linked to SCZ, including cognitive dysfunction, working memory loss, and decreased social interaction with evidence of animal study28.

PPP1CB encodes protein as one of the three catalytic subunits of protein phosphatase 1 (PP1), which is involved in the regulation of synaptic plasticity and neurological disorders29,30. PP1 deficiency is associated with the development of learning and memory impairments, and thus, PPP1CB may be associated with the onset of SCZ by regulating PP1.

Our findings suggested that the expression levels of DUSP1 and MAPK13 decreased in patients with undifferentiated SCZ and acute schizophrenia-like psychotic disorder. The mitogen-activated protein kinase (MAPK) phosphatase family, of which DUSP1 is an important member, regulates the MAPK signaling pathway to contribute to learning, memory, neuronal stress response and synaptic plasticity31,32. And MAPK13 encodes a member of the MAPK family33. Thus, DUSP1 and MAPK13 may be linked to the pathophysiological mechanisms of SCZ by regulating the MAPK signaling pathway.

G protein-coupled receptors (GPCR) are central mediators of neurotransmitter signaling, and GNG10 may affect neurotransmission, emotional and cognitive functions by modulating the GPCR signaling pathway34,35. A previous study observed that GNG10 expression level in peripheral blood was up-regulated in patients with first-episode SCZ36. However, our study found a nonlinear association of GNG10 expression level with SCZ but not a single linear association. Furthermore, the correlation analysis observed a negative correlation between GNG10 expression level and PANSS total score and its subscale scores, which suggested that a lower expression level of GNG10 may be associated with more severe positive, general psychopathological and overall symptoms of SCZ.

GNG7 binds to G protein α and β subunits and participates in D1 dopamine receptor-mediated neuroprotective responses37. Behavioral alterations result from the loss of the G protein gamma (7)-subunit, which is involved in signaling pathways that regulate the stability or formation of heterotrimers of particular G proteins38,39.

The PRKACA encodes the protein kinase A (PKA) catalytic subunit Cα, which is considered the predominant isoform, is expressed in most tissues, and is a core kinase of the cyclic adenosine monophosphate (cAMP) pathway40,41. The dysregulation of cAMP signaling pathway in patients with SCZ42, and our findings support the association of PRKACA and SCZ by affecting the cAMP signaling pathway.

The strengths of this study mainly include, firstly, analyzing the expression changes of specific pathways (glutamatergic synapses, dopaminergic synapses and 5-hydroxytryptaminergic synapses), which helps to deeply understand the pathogenesis of SCZ. Secondly, the GES can significantly improve the differentiation and prediction ability of the model. Thirdly, peripheral blood samples are easier to obtain than brain tissue samples, which greatly reduces the difficulty and cost of obtaining samples and is less traumatic to the human body. Lastly, we use of RCS regression models to investigate the association between gene expression levels and SCZ risk. This approach sensitively identifies and characterizes potential dose-response relationships, and provides more new evidence for elucidating the biological effects of differentially expressed mRNAs involved in the complex mechanisms of SCZ.

The limitations of this study include, firstly, that some subgroups had too few patients. Secondly, this was a single-center study; future multicenter studies could enhance the generalizability of the findings. Thirdly, fewer covariates were adjusted in this study, and further relevant data could be collected for future analyses. Lastly, the sample size of screening set is constrained by using the current threshold (P < 0.05 and |log₂FC | > 1) to identify differentially expressed mRNA and further study with sufficient conditions would be warranted in the future.

Conclusion

The expression levels of CREB5, PPP3R1, and PPP1CB were elevated in the neural signaling pathway of peripheral blood leukocytes of patients with SCZ; there was a significant dose-response relationship between the expression levels of DUSP1, GNG10, GNG7, PRKACA, CREB5, PPP3R1, and PPP1CB and the risk of SCZ; the gene expression scores by integrating these genes could significantly improve the diagnostic ability of SCZ.