Introduction

The cognitive faculty of language supports interpersonal communication and thinking1, both of which are disrupted in psychotic disorders such as schizophrenia. The thought and communication disorders observed in individuals with schizophrenia appear to stem from a structural disruption in language, i.e., grammatical impairment due to a divergence of syntax from healthy speakers2,3,4,5,6. However, despite the substantial body of work, the existing literature presents a fragmented understanding of the precise nature and extent of syntactic deficits.

Disorganised speech, a diagnostic feature of schizophrenia in DSM-57, is assessed on the basis of incoherence that leads to a failure of effective communication. Syntax production, if impaired, can generate conversational incoherence. Similarly, impaired comprehension of syntax (i.e., who did what to whom?) may contribute to impaired meaning and misinterpretations that typify positive psychotic symptoms such as persecutory delusions, as well as uncooperativeness, and lack of insight. Estimating the relative impairments in syntax production and comprehension is important because these processes rely on distinct cognitive mechanisms, despite sharing the common structural substrate (representation) of language8,9. Production involves generating grammatically correct and contextually appropriate sentences, while comprehension requires decoding and interpreting syntactic structures in real-time. Understanding the nature of the relationship between deficits in syntactic comprehension and production can clarify the level (shared structural vs. distinct cognitive) at which the mechanisms of language disturbances operate in schizophrenia. In the current study, we systematically review the literature published to date on both syntactic production and comprehension in schizophrenia.

Producing and inferring meaning via language is not based on isolated lexical concepts (semantic categories), but involves the interactional basis offered by grammatical constructions. Grammar enables the signifiers and the signified to be put together. Thus, there is a strong case to be made for syntax-level deficits, i.e., an aberration in the way words are composed in an order, to have primacy in the language disorder of schizophrenia10,11,12,13. Several thoughtful reviews in recent times have hinted at the critical importance of syntactic deficits in schizophrenia4,14,15,16,17. Bora and colleagues highlighted a role for impaired syntactic comprehension when analyzing the linguistic correlates of the burden of formal thought disorder18. Nonetheless, to our knowledge, a comprehensive meta-analytic quantification of the overall magnitude of grammatical impairment in both comprehension and production in schizophrenia is still lacking.

Quantifying the degree of grammatical impairment in schizophrenia is critical for two reasons. Firstly, the use of the various linguistic markers in speech to predict clinically important outcomes is an emerging pursuit in the field (e.g., onset of first episode19,20,21, relapses22). Despite the many studies carried out to date, one major obstacle in bringing such predictive analytics to routine clinical use is the lack of empirical guidance on feature selection in these models. As a result, many automatically derived linguistic variables are being tested in clinical prediction models, with minimal overlap among different studies, impeding interpretability and successful external validation (e.g., not a single linguistic feature overlapped across the 18 prediction analysis studies identified in a recent review14). This can be addressed via evidence-based preselection of variables that most proximally relate to the clinical construct of interest i.e., the presence of schizophrenia in our case [see Meehan and colleagues23 for a state-of-the-art review]. Meta-analytic estimation of the effect size of syntax production/comprehension variables will provide evidence for their utility in speech-based predictive analytics.

Secondly, given the relevance of social interaction for functional recovery24, interventions that ameliorate communication deficits in schizophrenia are steadily growing in recent times25,26,27. Yield from these trials can be improved by identifying the most affected syntactic markers as treatment targets and establishing if distinct subgroups with varying degrees of deficits are likely to occur in schizophrenia. In the presence of a high degree of interindividual variability in syntactic deficits, stratified RCTs for communicative remediation are likely to have a better yield. Thus, meta-analytic estimation of the effect size and variability of syntactic deficits will inform forthcoming intervention trials.

Our primary goal of this review is to provide a quantitative synthesis of the degree and interindividual variability of syntactic language deficit across the domains of syntactic comprehension, anomaly/error detection, and various levels of complexity and integrity of syntactic production in schizophrenia. We also aim to investigate the relationship between syntactic production, comprehension, and symptom severity and identify potential research gaps and opportunities in this area of work.

In this meta-analysis, we find robust evidence for grammatical impairments in schizophrenia across all domains examined, with particularly strong effects for syntax comprehension. People with schizophrenia show increased variability for some of the indices of syntax processing, suggesting the existence of potential subgroups with differing degrees of grammatical impairment.

Methods

Search strategy and selection criteria

The original protocol was registered on the Open Science Framework registry (May 2024), with an update after the initial search but before undertaking statistical analysis (October 2024)28. This update included missing information on meta-analytic methods and bias assessment framework, adding specifications (grouping of syntactic domains, metaregression variables) and planned deviations (reporting pronoun aberrations separately from the current report, dropping reaction time and parts-of-speech measures to reduce bias from reporting inconsistencies). Any further deviations that occurred after the data analysis (the use of a multivariate approach to meta-analysis) are explicitly reported as such. Institutional review board approval was not required as this study involved analysis of previously published data. This review adheres to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines29 and recent recommendations to protect against researcher bias in meta-analysis30. We performed a literature search across multiple electronic databases, with PubMed (MEDLINE) and Scopus, Web of Science (Core Collection) as primary sources, followed by non-MEDLINE-indexed studies identified using PsycINFO up to May 1, 2024. Search terms included a combination of keywords and Medical Subject Headings (MeSH) terms related to schizophrenia (schizophrenia OR schizo* OR psychos* OR psychot*), language (language OR verbal OR linguistic OR speech OR communicat* OR thought), syntax (syntax OR syntactic OR gramma*) with the ‘explode’ option for non-MeSH variations of language when appropriate (e.g., language, verbal, linguistic, speech, communicat*, pronoun* with ‘exp’ in PubMed; See Supplementary Note 1). Two reviewers (DE and LP) independently screened titles and abstracts against the inclusion criteria using Rayyan software31 after removing duplicates. Full texts of relevant studies were assessed for eligibility. We then added further studies to the pool by screening the bibliography and hand-searching all citations received by the identified studies via Google Scholar. Imported databases with all studies retrieved via primary search are provided as links in Supplement (Supplementary Data 1) and PRISMA checklist as Supplementary Data 2.

We included English language publications describing studies that (1) enrolled adults (aged 18 or above) diagnosed with schizophrenia spectrum disorders (schizophrenia, schizoaffective, or schizophreniform psychosis) and a control group of healthy adults without known psychiatric disorders (2) assessed speech production and/or comprehension, focusing on grammar and syntax. This includes evaluating either grammatical comprehension (by quantifying a person’s ability to understand complex sentences or detect errors in the syntactic formation) and/or production (by assessing the degree of global [narrative level] or local [clausal/phrasal level] complexity, length and integrity in the utterances or sentences). This grouping of domains of interest was based on Morice and Ingram’s original work32 that separated complexity and integrity in syntax production in schizophrenia, with phrasal/clausal level complexity (coordination) later included by Thomas and colleagues33. This set was further extended as per Lu’s Syntactic Complexity Analyzer approach34 to distinguish production length from other complexity measures.

Only empirical studies with quantitative measures derived in the same manner from both groups were included. Studies focused on subjects <18 years of age35,36,37, case reports/case series38, and those without a healthy control group13,39,40,41,42,43 were excluded. Additionally, studies focused on high-risk subjects without a diagnosed schizophrenia spectrum disorder44, studies reporting verbal outputs that were either restricted (e.g., scripted conversations45) or likely to have been edited after production (e.g., written reports and social media texts46,47,48,49,50), non-naturalistic speech (e.g., word list generation, repetition, monitoring or recall of memorized text51,52,53), analysis restricted to parts-of-speech tagging (with no sentential syntax)54,55,56,57 or providing only second order derivatives (e.g., speech graph metrics58 or factor scores59) without direct indices of syntax production/comprehension were not eligible. One study with a retraction notice was also excluded60. Studies with unconventional criteria for syntactic complexity61,62,63 and those without quantitative measures or plots that allowed effect size estimation were also excluded51,64,65. For a list of articles excluded at the stage of data extraction with the reasons for exclusion and main results, see Supplementary Table 1 embedded in Supplementary Information.

Data extraction

We extracted the available clinical/demographic data (author(s), publication year, country, sample size, mean age, and symptom severity based on standardized scales [e.g., PANSS, SANS/SAPS, BPRS/BPRS-E, with the reported total scores in each patient sample converted to a scale of 0 to 1 via min-max transformation (See Supplementary Note 2)], sex distribution, chlorpromazine equivalent of antipsychotic dose (conversions from other drug equivalents or Defined Daily Doses as per ref. 66), mode [free speech, visual/verbal stimulus such as picture/proverb elaboration, sentence to picture matching] and the language of task administration). When overlapping samples were published in more than one paper, we extracted data from the largest reported sample. The instances where two studies reported data from overlapping participant samples are cited here32,67,68,69,70,71,72,73,74,75. In each case, to avoid duplication and ensure accurate effect size calculations, we extracted data from the largest reported sample. We reached out to selected authors (12.2%) when quantitative measures were unclear for clarifications. For studies where numerical values were not provided33,59,76,77,78,79 we extracted these values from published plots using a visual data extraction tool (plotdigitizer.com). When more than one mean was reported on the same measurement from the same sample (e.g., on/off medications as in ref. 80), we included the average as the summary measure. Some of the studies reported median and range values instead of mean and SD required for Cohen’s d estimation53,77,81. In such instances, we used the five-number summary approach82, available at https://www.math.hkbu.edu.hk/~tongt/papers/median2mean.html.

Quality assessment

The quality of the studies was assessed using a purposively modified Newcastle–Ottawa Scale83, widely used in psychiatry, where rating scale use for exposure assignment is a common practice84. The following indicators were evaluated: case definition, representativeness, selection of control group, comparability of groups, ascertainment of ‘exposure’ (i.e., measurement of syntactic variables of interest), and quality of data reporting. Items in the Newcastle-Ottawa framework are known to have low reliability among raters85 (e.g., demonstrating the timing of measurements) and lack of clarity86 (e.g., emphasis on independent validation of the case status, response proportions, the practice of higher scores for population-based controls, statistical adjustment and blinding which are often unsatisfactory in case-control designs) were replaced these with items specific to psychiatric diagnoses and linguistic variable assessment (see Supplementary Table 2). Furthermore, we defined likely confounders a priori for bias assessment (age, sex, education, and native language being different from the language of assessment). Each study was independently rated by two authors (DE and LP), with disagreements resolved by discussion.

Statistics and reproducibility

Statistical analyses were conducted using the JASP 0.19.0.0 package87. Effect sizes were calculated from available means and standard deviations (Cohen’s d = (M2 − M1)/SDpooled) from each set of analyses. As some studies reported error rates while others reported accuracy rates, all effect sizes were sign-adjusted to read as controls > schizophrenia when producing summary values (See Supplementary Note 3).

We pooled the d values using Bayesian model-averaged (BMA) meta-analysis via metaBMA R package implemented in JASP88, with default priors for heterogeneity (Inverse-Gamma [1, 0.15] and effect size (Cauchy [0, 0.707]). BMA evaluates the likelihood of the data under a combination of models regarding the meta-analytic effect and heterogeneity, reporting model-averaged effects. Evidence in favor of a group difference was categorized as weak (for BF10 1 to <3), moderate (BF10 3 to <10), strong (BF10 10 to <30), very strong (BF10 30 to <100), and extreme (BF10 > 100).

Meta-regression analyses were performed when sufficient evidence for heterogeneity between studies was uncovered in any domain. We included the mean age of People with schizophrenia, proportion of females with schizophrenia, mean chlorpromazine equivalent dose, language of the study assessment (English vs. non-English), and study quality scores as potential moderators.

Robust Bayesian meta-analysis89 was used to assess the sensitivity of the results to the potential presence of publication bias and heterogeneity.

Log Coefficient of Variation Ratio90 (lnCVR: natural log of ratio of the estimated total coefficient of variation between the patient and the control group) was used to quantify the difference in variability after scaling to the mean of each group [lnCVR = 0 indicates equal variability; >0 greater variability, while <0 indicates lower variability in schizophrenia vs. controls].

Given the between-domain heterogeneity, we used a random-effects model to pool the 6 lnCVR measures and the 6 Cohen’s d estimates across the domains and assessed the overall effect.

The 6 domains of interest are outcomes that are likely to be correlated with each other, though the participant-level correlations were seldom reported in the individual studies. Based on the assumption that individual-level correlation will lead to population (study) level correlation91, we employed multivariate meta-analysis with a missing at random assumption to analyze all correlated outcomes jointly. This enabled increased efficiency of the meta-analysis, allowing us to synthesize across domain-specific effect sizes using a random-effects approach91 to estimate the overall meta-effect of grammatical impairment using BMA. We used the mvmeta R package92. Due to the large amount of missing values in the aggregated data (only n = 18 had measurements for more than two domain indicators), all the data were subjected to multiple imputation by chained equations using the mice93 R package. We then estimated the within-study correlation using the metavcov R package94. All effect sizes were transformed into Hedge’s g for this analysis. A random-effect model was used because of the significant heterogeneity between individual studies. Borrowing of strength (BoS)95 was calculated to compare the results of multivariate meta-analyses to separate univariate methods. Note that this was not a pre-registered analysis and should only be considered as a supplemental analysis, given the amount of missing data.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Results

Study selection

A total of 820 studies were identified through the initial database search. After removing duplicates, 509 unique studies remained. Following title and abstract screening, and adding hand-searched references, 86 articles were retrieved as relevant, of which 45 studies met the inclusion criteria for numerical synthesis for the meta-analysis11,32,33,53,67,70,74,76,77,78,80,81,96,97,98,99,100,101,102,103,104,105,106,107,108,109,110,111,112,113,114,115,116,117,118,119,120,121,122,123,124,125,126 (see Fig. 1).

Fig. 1: PRISMA 2020 flow diagram for the systematic review of syntax and schizophrenia.
figure 1

Preferred reporting items for systematic reviews and meta-analyses (PRISMA) flowchart showing the screening process and various reasons for inclusions and exclusions. n number of studies.

Study characteristics

The final list included studies published between 1982 and 2024, with summary data from a total of n = 1679 people with schizophrenia and n = 1281 controls available from 79 comparisons across 6 domains of interest. The weighted mean age across studies was 32.31 (SD = 5.6) years, with no difference in distribution among people with schizophrenia and control cohorts (paired t = 0.85, p = 0.4). Only 29.2% of participants were women, with 5 studies recruiting only men80,110,117,127,128; only 8 studies had >40% women. The studies predominantly included individuals diagnosed with established schizophrenia spectrum disorders (n = 1292), with first-episode samples forming 33.59% of the total sample (n = 564). A great majority of studies (64.4%) recruited English-speaking participants. Some studies reported separate contrasts based on the presence of Formal Thought Disorder (FTD/no-FTD104,105,129) or stage of illness (FEP/established schizophrenia11,67,103). Quality scores are presented in Supplementary Table 3. See Supplementary Table 4 for a description of included studies.

There is no single accepted index to measure grammatical impairment in mental health conditions. As a result, we found a notable variation in the method used to quantify the variables of interest, and in some cases, more than one variable for the same domain was reported. As a general principle, we chose the measures with the closest theoretical alignment to the 6 domains of interest for this meta-analysis. Within each domain, we chose tasks and variables that were most commonly used across studies. Other study-specific decisions in variable choices are discussed in the Supplementary Note 3.

Information availability

While mean age (93.3% of studies), language of testing (100%), and sex distribution (93.3%) were available for most studies, an estimate of antipsychotic dose exposure (48.9%) and overall symptom severity (40%) were less often reported. Most studies only provided the overall proportion of antipsychotic use and domain-specific symptom scores (generally positive symptoms: See Supplementary Tables 5 and 6 embedded in Supplementary Information). As a result, we included age, assessment language, sex, and the study quality scores in the meta-regression analyses, but only reported moderator/effect-size bivariate correlations for antipsychotic dose and total symptom severity index.

Meta-analytical results

The results of Bayesian Meta-Analysis for each group of studies are shown in Table 1 along with the data on between-studies heterogeneity, log coefficient of variation ratios, and publication bias. BMA showed extreme evidence for reduced syntactic comprehension, error detection, production length, phrasal complexity, production integrity, and global complexity in people with schizophrenia (all BF10 > 100; Fig. 2). Random effects analysis across the 6 domain-specific effects indicated extreme evidence (BF10 = 3173; estimated d = 0.87) for an overall grammatical impairment in schizophrenia. See the Supplementary Results 1 for multivariate meta-analysis of correlated outcomes.

Table 1 Summary of Bayesian model-averaged meta-analysis of grammatical impairment in schizophrenia spectrum disorders
Fig. 2: Forest plots for domain-specific meta-analyses of syntactic production and comprehension in schizophrenia.
figure 2

a Forest plots for Bayesian Model-Averaged estimates of group differences in syntactic comprehension (n = 16 studies). b Error detection (n = 6). c Production length (n = 17). d Phrasal complexity (n = 16). e Production integrity (n = 11). f Global complexity (n = 13). Estimated Cohen’s d values (with 95% credible intervals) reflect patient-control differences. FTD Formal Thought Disorder, nFTD no-FTD, FEP First Episode Psychosis, SCZ Established schizophrenia. Circles represent the effect of individual studies, with their size weighted by sample size; diamond lozenges represent pooled effects.

Between-study heterogeneity (tau) was strong for global complexity, production length, and integrity. Of these domains, the meta-regression analysis revealed age as a significant moderator for global complexity while study quality was the most significant known source of heterogeneity for production integrity (Table 1; Fig. 3). The moderator/effect-size bivariate correlations were not significant for antipsychotic dose (r31 = 0.27, p = 0.14) or total symptom severity index (r33 = 0.06, p = 0.74) across all domains. While the number of studies on clinically detectable FTD was insufficient for a meta-regression, visual inspection of the forest plots revealed that all FTD contrasts had above-average Cohen’s d values for syntactic comprehension and phrasal complexity but not for production integrity.

Fig. 3: Observation from meta-regression analysis.
figure 3

Studies with relatively older patient cohorts demonstrated larger effect size differences for global syntactic complexity (left panel). Studies with higher quality scores reported greater effect sizes for production integrity (right panel).

Meta-analysis of within-group variations indicated higher inter-individual variability in people with schizophrenia for syntactic comprehension, phrasal complexity, and production length (lnCVR = 0.13–0.41; medium to large variation effect130) but not for other measures (Fig. 4). Random effects analysis across the 6 domain-specific variation estimates indicated moderate evidence (BF10 = 5.27; estimated lnCVR = 0.21) for excess variability among people with schizophrenia compared to healthy controls.

Fig. 4: Forest plots for domain-specific meta-analyses of variation in syntactic production and comprehension in schizophrenia.
figure 4

a Forest plots for Bayesian Model-Averaged estimates of patient-control differences in variability (log coefficient of variation ratio, lnCVR) for syntactic comprehension (n = 16). b Error detection (n = 6). c Production length (n = 17). d Phrasal complexity (n = 16). e Production integrity (n = 11). f Global complexity (n = 13). Positive lnCVR values (95% credible intervals) indicate greater variability in people with schizophrenia. FTD Formal Thought Disorder, nFTD no-FTD, FEP First Episode Psychosis, SCZ Established schizophrenia. Circles represent the effect of individual studies, with their size weighted by sample size; diamond lozenges represent pooled effects.

Using Robust BMA, we found no or weak evidence for publication bias in all of the individual meta-analyses, with moderate to extreme evidence for group differences retained for domain-specific impairments in syntax (Table 1).

The unregistered multivariate meta-analysis improved the precision of estimates (Fig. 5), again indicating error detection and syntactic comprehension to be the most affected domains, followed by all 4 production domains. Correlations were not robust, but hinted that an impairment in error detection (comprehension) may co-occur with reduced production integrity and lower global complexity (production). But these results were affected by the randomness of imputation due to the large amount of missing data. As shown by Borrowing of Strength analysis, all precision estimates were boosted (median of 75%) by the adjustment for correlations among the domains.

Fig. 5: Unregistered multivariate meta-analysis.
figure 5

Top panel: Left: Imputed study-level correlations (heatmap) indicated that an impairment in error detection (comprehension) may co-occur with reduced production integrity and lower global complexity (production), but the strength of the imputed relationships was unstable due to a large volume of missing data. Right: Borrowing of Strength analysis indicating the gain in domain-specific estimates with the unregistered multi-variate approach. Bottom panel: Forest plot of random effects multivariate analysis across domains. Circles represent the effect of individual studies, with their size weighted by sample size; diamond lozenges represent pooled effects.

Discussion

To our knowledge, this is the first meta-analysis on the association between schizophrenia and the use of grammar/syntax. BMA reveals extreme evidence in support of a global grammatical impairment across the domains of interest in schizophrenia, with the most robust effects being noted for comprehension of complex syntax and detection of errors, followed by production length and integrity. The evidence favoring illness-related differences was moderately strong for global and phrasal complexity, even after taking between-studies heterogeneity and publication bias into account. This implies that people with schizophrenia understand simpler sentences better, ignore syntactical errors, and speak in less sophisticated, shorter sentences that may not have a complete syntactic structure. Within the patient group, variability in grammar production/comprehension was higher than that of the healthy control group; this may occur in the presence of subgroups with varying degrees of impairment in schizophrenia. Taken together, a broad spectrum of grammatical impairment appears to be a key feature of schizophrenia.

Given the relatively modest sample sizes in individual studies (median patient n = 32), our meta-analytic synthesis offers a more robust and representative effect size of grammatical impairment in people with schizophrenia. Nevertheless, one limitation is our reliance on summary measures reported by authors instead of individual participant data. As 40% of case-control contrasts came from studies completed 20 years ago, we assessed (a priori) the likelihood of data availability to be low. Notable variation in study quality was noted, with representativeness across sexes and assessment languages being poor. Our synthesis is also limited by the diversity of variables used to define the domain-specific divergence; this likely accounts for the high heterogeneity observed in certain domains. Individual studies seldom reported the subject-level correlations among the various domains (especially between production and comprehension divergence), precluding our ability to test one of our pre-registered aims (but see the unregistered multivariate meta-analysis).

We also record notable variations in clinical sampling, with some studies focusing exclusively on those with FTD110. We found insufficient data to estimate the effect of FTD across all domains and excluded studies that only compared FTD and non-FTD patient groups131. But our results indicate that grammatical impairment occurs irrespective of the presence of FTD. It is important to note that at an individual level, the degree of grammatical impairment is likely to be much higher among people with schizophrenia as it is influenced by comorbid developmental disorders and poor proficiency in a non-native language, both of which led to participant exclusion in the studies we identified. Furthermore, people with schizophrenia with more severe linguistic deficits often lack the capacity to provide written informed consent; given the fluctuating nature of clinical symptoms (including thought disorder), it is likely that cross-sectional assessments reported in primary studies fail to capture the most symptomatic phases of the illness, wherein syntactic deficits may be more prominent. Thus, the effect size reported here should be considered as a conservative estimate of the real-world complexities of grammatical impairment in schizophrenia.

One of the strengths of our review is the depth of our literature search - covering 50 years of work. In contrast to Ehlen and colleagues4 who recently “identified no studies evaluating syntax production in individuals with schizophrenia”, our search strategy located n = 29 studies on syntax production. Furthermore, our robust BMA analytical approach accounts for the uncertainty in heterogeneity and publication bias estimates and offers a comprehensive meta-analytic quantification of the overall magnitude of grammatical impairment in schizophrenia. The robust medium-to-large deficit in syntax production makes a strong case for including speech-based predictive analytics for early detection of schizophrenia, reinforcing prior37,132 and ongoing studies in this regard133. We offer specific recommendations for future studies of grammatical impairment in schizophrenia that can refine the effect size estimates reported in the current meta-analysis (see Box 1).

Several domains of language function, such as pronoun use134, semantic coherence55, and fluency135, are affected in schizophrenia. Compared to the other reported impairments, deficits in syntax, being a rule-based feature of language, are potentially remediable across the lifespan. Syntactic improvement may also affect other levels of linguistic processing (see Box 2). Therapeutic gain has been shown in aphasic disorders with structured rehabilitation/education approaches (e.g., mapping therapy, syntax stimulation136,137) or via targeted cognitive training (e.g., working memory138). In schizophrenia, studies investigating the causal relationship between syntactic deficits and other linguistic domains are needed, along with those investigating the neural basis of these deficits. By demonstrating evidence for a small-to-medium-sized increase in inter-individual variability in syntactic deficit (especially for phrasal complexity and syntactic comprehension), our synthesis encourages pre-trial selection of patients for communicative remediation. In particular, for syntactic comprehension, the combination of a large effect-size deficit, low between-studies heterogeneity, and the possibility of finding highly impaired subgroups indicates its suitability as an outcome measure for linguistic intervention trials.

The neural and social interactional basis of the observed syntactic deficits warrants attention in future studies. Emerging arguments against the presence of specific neural substrates for syntax/combinatorial processing in human language139,140, indicate that the syntactic aberrations in schizophrenia may reflect deficits at multiple levels of language processing, especially semantic cognition; this remains to be empirically studied . Our observation of a generalized syntactic deficit across patient samples argues against focusing exclusively on those with clinically detectable FTD in mechanistic studies of linguistic divergence in psychosis (see refs. 67,141,142 for a similar argument).

Our estimate of overall syntactic impairment (d = 0.87) can be considered as a large effect by convention, but smaller than the generalized cognitive impairment reported in schizophrenia (d = 1.2143) and comparable to mechanistic observations relevant to its pathophysiology (e.g., presynaptic dopamine excess in neuroimaging studies d = 0.79144). Studies included in our meta-analysis either excluded participants with notably low IQ or matched IQ between groups; thus, we cannot attribute the observed syntactic divergence to a generalized cognitive impairment. Unlike the constrained neuropsychological tests used to assess cognitive deficits, syntactic deficits (especially in production) reported here has been observed on the basis of narratives/conversations that occur in more natural contexts. Thus, grammatical impairments, often carried by patients without much self-awareness, are likely to have intrusive effects on one’s everyday social functions.

In conclusion, our meta-analysis substantiates the long-suspected role of grammatical aberrations in schizophrenia. The question of whether these deficits occur independently of lexico-semantic abnormalities or are part of a broader linguistic impairment remains unresolved. Nonetheless, the findings underscore the need for targeted interventions to address these linguistic differences. More general implications include the importance of adjusting verbal exchanges in therapeutic and other settings (e.g., inpatient units, educational, vocational and legal institutions) for schizophrenia.