Abstract
The Positive and Negative Syndrome Scale (PANSS) is widely used to assess schizophrenia symptoms. Initially designed with three subscales, Marder et al.´s 5-factor-Model (M5M) first proposed in 1997 has been frequently used in treatment trials, but it has never been systematically reviewed for its measurement properties. We utilized the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) guideline for systematic reviews and meta-analytical procedures to assess the psychometric properties of the M5M-PANSS. COSMIN comprises several steps: literature search, risk-of-bias assessments, assessing the updated criteria for good measurement properties, feasibility aspects and grading the quality of the evidence. We further assessed the goodness of fit of other PANSS factor models. We included 95 publications. The M5M-PANSS showed good construct validity, but “insufficient” structural validity. Evidence of other COSMIN domains is largely lacking. Among the multiple (73) factor solutions examined with confirmatory methods, several other 5-factor solutions had better model fit. According to COSMIN rules the M5M should not be recommended for use. Other five-factor models such as the one proposed by Wallwork et al.1 warrant further evaluation. Nevertheless, the factor composition of the M5M and these other models was relatively similar, so previously published results should not be disregarded.
Similar content being viewed by others
Introduction
The Positive and Negative Syndrome Scale (PANSS)2 is the standard instrument for assessing symptoms of schizophrenia. Initially, the PANSS was developed by Kay et al.2 as a three-subscale instrument consisting of a positive subscale with 7 items, a negative subscale with 7 items and a third general psychopathology subscale with 16 items. However, Kay and Sevy3 later hypothesized a pyramidical structure within the scale and Peralta and Cuesta4 proposed a five-factor structure in 1994. Another five-factor model was published by Marder et al.5 in 1997. This Marder-5-factor-Model (M5M), comprises the following factors: (1) positive factor, (2) negative factor, (3) disorganized thought factor, (4) uncontrolled hostility/excitement factor, (5) anxiety/ depression factor5. Multiple other factor analyses have been conducted since and most found similar, but not identical 5-factor models. The M5M is especially important as it has established itself especially in antipsychotic drug trials. Le Moigne et al.6 for example, quote 15 such trials. However, the measurement properties of the M5M-PANSS5 have never been systematically reviewed.
We have already conducted a COSMIN systematic review on the measurement properties of the PANSS in its original three-subscale structure7.
However, it must be considered that, even though the same items are used, according to e-mail correspondence with COSMIN8,9,10, the scale needs to be treated as a separate instrument. Findings of the original subscales cannot be simply transferred to the M5M factors, because certain measurement properties of a scale are dependent on its structure.
Our first objective is to comprehensively review and provide insight into the measurement properties of the M5M-PANSS5 by applying the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) guideline for systematic reviews of patient-reported outcome measures8,9,10 including meta-analytic methods when appropriate.
Our second objective is to assess the goodness of fit of further PANSS factor models and compare it with the M5M.
Methods
The methods used in this review are based on the COSMIN guideline for systematic reviews of patient-reported outcome-measures8,9,10. COSMIN was initially created for patient-reported outcome-measures (PROMs) but can also be applied to clinician-reported outcome-measures (ClinROMs)11,12, such as the PANSS. There are several steps: literature search, quality assessment of the individual analyses with the COSMIN Risk-of-Bias checklist, applying updated criteria for good measurement properties, grading the overall quality of evidence with a modified Grading of Recommendations assessment, development and evaluation (GRADE) approach, assessing feasibility aspects and formulating a recommendation. An overview is presented in Fig. 1.
Refer to the methods part and appendix of our publication for further information. For the unmodified, full version of the COSMIN methodology, refer to the COSMIN manual. Step 1: Risk of bias checklist. Step 2:Criteria for content validity and Updated criteria for good measurement properties.aFor the detailed criteria refer to the COSMIN manual for content validity. Step 3: Summarize the evidence and give a final rating. bThe same criteria like in Step 2 apply. cThese are the individual ratings on each result obtained in Step 2. Step 4: Grade approach. Step 5: Formulate recommendations. Step 6: Describe feasibility aspects.
Data extraction sheets as recommended by COSMIN (compare COSMIN manual, pp. 66–73) were used8,9,10.
All steps of the COSMIN methodology were conducted independently by two reviewers (SG and MR). Any disagreements were resolved by consensus, with the involvement of a third, professor-level reviewer (SL) if necessary. We followed the same procedure as in our COSMIN review of the PANSS in its original three-subscale structure7.
In COSMINs terminology the term “studies” is used for individual analyses8,9,10. We referred to individual analyses as “analyses” or “assessments” and publications as “studies” to improve comprehensibility and readability.
We investigated the M5M5 for its importance in antipsychotic drug trials, but we also examined other 5-factor solutions found in the literature and their fit.
A protocol was registered on the Open-Science-Foundation website where more details can be found (https://doi.org/10.17605/OSF.IO/5EGMD; compare Appendix 1)13.
Search strategy and selection criteria
PhD librarian Farhad Shokraneh searched PubMed and EMBASE using a modified COSMIN filter, see Appendix 2 and Terwee et al.14. The search term used according to COSMIN was: ‘PANSS’OR ‘Positive and Negative Syndrome Scale‘ AND ‘Schizophrenia‘ AND [COSMIN inclusion-filter] NOT [COSMIN exclusion-filter]. We screened the identified titles and abstracts, and reference lists of included publications for evaluation studies of the M5M-PANSS5 in which at least 80% patients had a psychotic disorder in any stage (e.g. acute or in remission). Articles that addressed the model fit of other PANSS-models (e.g. 7-factor solutions or 5-factor solutions other than M5M-PANSS) were also included.
There were no language or country restrictions. Following COSMIN, only full-text articles were included, thus conference abstracts were excluded. Reasons for the exclusion of articles as well as the process of screening the literature search results can be found in a PRISMA diagram in Appendix 3. If uncertainties arose regarding results or methods of relevant articles, their authors were contacted and asked to revalidate the results or processes. This applied to 29 articles.
Assessing the risk of bias (step 1 in Fig. 1)
The reporting quality of the analyses included was assessed with the COSMIN risk of bias checklist which consists of 10 domains (called “boxes” by COSMIN): (1) ClinROM development, (2) content validity, (3) structural validity, (4) internal consistency, (5) cross-cultural validity/ measurement invariance, (6) reliability, (7) measurement error, (8) criterion validity, (9) hypothesis testing for construct validity, (10) responsiveness8,9,10. Each box consists of two to eight items rated as either “very good”, “adequate”, “doubtful”, “inadequate” or “not applicable”, a worst score counts principle is applied for each domain.
Assessing the criteria for content validity and the updated criteria for good measurement properties (step 2 in Fig. 1)
The results of each analysis are evaluated following COSMIN’s criteria for I. content validity (Fig. 1, step 2, domains 1 and 2) or II. the updated criteria for good measurement properties8,9,10. (Fig. 1, step 2, domains 3–10) depending on the applicable domain. An analysis’ evidence for a respective measurement property is rated either “sufficient” (+) or “insufficient” (−), with a third option for an “indeterminate” rating (?), if neither “sufficient” nor “insufficient” ratings are applicable. Only for content validity (domains 1 and 2) a fourth rating option of “inconsistent” (±) is available.
The overall rating for I. Content validity is generally made up of three individual ratings for the ClinROM development study, content validity analyses and a third rating based on the reviewers (SG,MR,SL) own evaluation of the scales content validity (compare pp. 53–59 COSMIN content validity manual8,9,10).
The II. Updated criteria address domains 3–10. Concerning domain 3 structural validity the COSMIN guideline does not provide criteria for factor models resulting from exploratory methods8,9,10. We therefore used the same criteria we created for our COSMIN review of the Kay three-subscale PANSS7, adapted for the M5M5 for the purpose of comparing the exploratory derived models to the M5M. Our approach followed Elsman et al.15 who processed exploratory factor analyses (EFAs) and principal component analyses (PCAs) in the same way. The criteria are: (1) ≥50% explained variance, (2) factor loadings >0.30, (3) ≤10% cross loading items, (4) five-factor model, and (5) ≥80% of the items distributed as in the M5M5. A “sufficient” rating was only given if all the criteria regarding the exploratory derived models were met. Otherwise, the respective rating was “insufficient”. For the assessment of confirmatory factor analyses, the Updated criteria8,9,10 provide the criterion of comparative fit index (CFI)16 >0.95 or root mean square error of approximation (RMSEA)17,18 <0.06 for fit indices. We processed normed fit indices (NFI)19 as we did with CFIs20.
For assessing domains 9 construct validity and 10 responsiveness COSMIN requires the formulation of hypotheses. The hypotheses are: 1. Between instruments measuring the same or a very similar construct, we expected a correlation of ≥0.5, 2. Between instruments measuring related but different constructs, we expected a correlation of ≤0.6 and 3. Between instruments measuring dissimilar or contrary constructs, we expected a correlation of ≤0.4.
We considered correlations of the M5M total score and its factor-scores with other scales, correlations of single PANSS items were not considered. For details about the assessment of all other domains suggested by COSMIN see pp. 28–29 of the COSMIN manual8,9,10.
Statistical method
In situations where few analyses are available, a simple analysis-count approach is sufficient in COSMIN reviews, see e.g. Weigel et al.11, Wehr et al.12 or Zúñiga Le-Bert et al.21. We summarized measures of construct validity with standard random-effects meta-analyses using Comprehensive Meta-Analysis (CMA) Version 222. In situations where it was not possible to aggregate all data for a measurement property of the M5M-PANSS5 or its factors, we used the same hybrid approach as in our Kay PANSS COSMIN review7: the updated criteria rating of the pooled result was multiplied by the number of analyses included in the pooling to give them the appropriate weight, and summed up with ratings of analyses that could not be aggregated and were therefore rated individually.
Grading the quality of evidence (step 4 in Fig. 1)
The COSMIN modified GRADE approach applies to the accumulated evidence on each measurement property, spanning 4 domains: (1) risk of bias, (2) inconsistency (3) imprecision and (4) indirectness8,9,10.
The final quality of the evidence is graded as high, moderate, low, or very low (Table 1a). Initially, evidence is assumed to be of high quality. If the criteria of the domains are not fulfilled, the evidence is then downgraded by one to three levels (see Table 1b).
Recommendations (step 5 in Fig. 1)
According to COSMIN8,9,10, an overall judgment of (A) (can be recommended for use and results obtained with the scale can be trusted), (B) (potential for recommendation but needs further research into the measurement properties) or (C) (should not be recommended for use) was made.
Recommendations are mainly dependent on the evidence for content validity and internal consistency. (compare Fig. 1, step 5).
Assessment of feasibility aspects (step 6 in Fig. 1)
To supplement the recommendations, we compiled information on the feasibility of the PANSS. Examples for the type of information assessed by COSMIN are presented in Fig. 1.
Assessment of other five-factor models
We assessed structural validity of all other factor structures validated by confirmatory factor analyses in terms of goodness of fit. We compared the item distribution of 5-factor models which reached fit indices defined as “sufficient” by COSMIN8,9,10, i.e. CFI > 0.95 or RMSEA < 0.06, in at least one analysis. As the COSMIN criteria are very stringent in this regard, we additionally described the item distribution of 5-factor models reaching a more relaxed criterion of CFI ≥ 0.90 or RMSEA ≤ 0.08 which are frequently used in the literature23,24. We counted how often “sufficient” goodness of fit was reached by each model. We compared the item distribution of these models with one another and with the M5M5 to understand similarities and differences. As this final step was exploratory and descriptive, following Elsman15 we did not take cross-loadings into equation in our comparison and just assorted the items to the factors they loaded the highest on.
We restricted our assessment of other models to goodness of fit because in 23 included studies confirmatory assessments for 73 other factor models were identified (see results). Thus, a full COSMIN approach would not have been feasible. Moreover, all these examinations were conducted to 5-factor models only because models with fewer or more factors were comparably rare.
Results
Literature search
The literature search yielded 8070 results. Their title-abstract screening identified 508 articles as relevant for full-text screening. Through full-text screening 90 publications were included in the review. The additional screening of references resulted in 5 further inclusions. Therefore, the total of included publications was 95 (compare Appendix 3 PRISMA Diagram of the Search).
The included studies comprised populations of patients with schizophrenia-spectrum disorders in various disease states, with a median patient count per study of N = 270, a median mean age of 35.6 years and a median percentage of female participants of 34.1%. The median average disease duration was 11.6 years. Our review includes studies involving both inpatients and outpatients (quantitative distribution not reported by most authors). PANSS assessments were conducted in over 15 different languages. Most assessments were conducted in English (58%), French (12%) or Spanish (9%). For comprehensive characteristics of the study populations, please refer to Appendix 4.
Risk of bias
Risk-of-bias evaluates the reporting quality of the analyses included (not the actual results) and is presented in Table 2.
ClinROM development (domain (D1) and content validity (D2) of the PANSS in its original three-subscale structure has been evaluated by us in another publication7. As the M5M emerged from the original three-subscale model2, no further ClinROM development and content validity studies were available.
In the 90 publications 128 suitable assessments (called ‘studies’ by COSMIN) of the structural validity (D3) for evaluating the M5M, were identified24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74,75,76,77,78,79,80,81,82,83,84,85,86,87,88,89,90,91,92,93,94,95,96,97. Most of them (64.84%) received “adequate” or “very good” ratings. Additionally, for each of the five factors, three analyses assessed the factors dimensionality71. All those dimensionality assessments received “adequate” ratings. Two assessments conducted confirmatory factor analyses on the negative factor98 and received “very good” ratings.
No analyses on the measurement properties internal consistency (D4), cross-cultural validity/ measurement invariance (D5), reliability (D6), measurement error (D7) and criterion validity (D8) could be identified for the M5M-PANSS5 and its factors.
Only five analyses for hypothesis testing for construct validity (D9) of the M5M total score were identified99 and all of them were rated “doubtful”. More analyses were available for the individual factors99,100,101, but most received “doubtful” ratings (70.59–100%), with a few analyses for the positive and the negative factor receiving “adequate” or “very good” ratings.
For the M5M total score and for each of the five factors, five analyses on responsiveness (D10) were identified87. All analyses on responsiveness received “adequate” ratings.
Assessing the criteria for content validity and the updated criteria for good measurement properties
Assessing the criteria for content validity
No study matching COSMINs definition of a ClinROM development study was available for the M5M. Furthermore, as for the original three-subscale version7, there were no specific content validity assessments of the M5M available. Nevertheless, the domain content validity is independent of scale structure and therefore the content validity rating of the original three-subscale PANSS-30 was applicable to the M5M as well. Based on the original PANSS’ ClinROM development study2 and reviewer rating, the content validity rating was “inconsistent”7.
Assessing the updated criteria for good measurement properties
All results on the updated criteria for good measurement properties are reported in Table 2.
Structural validity (D3)
95 exploratory factor analyses,92,93,94,95 and 10 confirmatory factor analyses specifically of the M5M,24,45,55,72,74 for which fit indices were presented were identified. Only 125,28,31,43,46,49,58,63,68,70 of the 95 exploratory derived models received “sufficient” ratings, 80 models69,70,72,74,76,77,92,93,94 received “insufficient” ratings, and 3 models73,75,95 received an “indeterminate” rating. CFI-values ranged between 0.511 and 0.887 and none surpassed COSMINs cut-off of CFI > 0.95 (or of ≥0.90 in the more relaxed sensitivity analysis), therefore those analyses were rated “insufficient”, as well. 67 of the 95 (70.5%) exploratory analyses resulted in five-factor solutions5,54,56,58,59,61,79,82,83,85,88,94,95. Furthermore, 9 exploratory analyses3,34,57,58,62,72,92,93 found four-factor models, 653,63,70,81,94 found six-factor models, 746,81,87 found seven-factor models, 280,84 found bifactorial models (containing 1 general factor and 5 specific factors), 233,86 found three-factor models, one4 found an eight-factor model, one27 found a nine-factor model.
23 further IRT-analytical analyses were identified60,66,67,71,78,80,96,97. 17 of the 23 analyses assessed the scalability of the PANSS78,96, of which 15 concluded that the PANSS is not scalable78,96, thus supporting the evidence indicating the insufficiency of the structural validity of the PANSS as a 30-item instrument, as reported in our previous publication7. Due to not fulfilling all updated criteria for IRT analyses8,9,10, all 23 analyses were rated “indeterminate”. Overall, the structural validity of the M5M5 was rated “insufficient”.
All principal component analyses assessing the dimensionality of the factors71 concluded that the M5M factors are unidimensional but were rated “indeterminate” due to not fulfilling all necessary updated criteria. Confirmatory factor analyses were only available for the negative factor. In 2 CFAs of the negative factor CFIs were 0.973 and 0.98298 surpassing COSMINs criterion of CFI > 0.958,9,10 and were thus rated as “sufficient”.
Thus, we concluded that the structure of the M5M5 with 30 items is not satisfactory, but previous studies have still established that the factors are unidimensional and that the negative factor has a good model fit.
No analyses on the measurement properties internal consistency (D4), cross-cultural validity/ measurement invariance (D5), reliability (D6), measurement error (D7) and criterion validity (D8) of the M5M total score or the five factors could be identified.
Construct validity (D9)
5 analyses for hypothesis testing for construct validity with correlations between the M5M total score and its factors were identified99. The correlations ranged between 0.64 and 0.89, while we hypothesized that they would be lower, resulting in “insufficient” ratings.
12 to 17 analyses for hypothesis testing for construct validity of the M5M factors were identified99,100,101 with correlations of the factors with each other, the M5M total score, the CGI-S102 and Andreasen’s SAPS103 and SANS104. 70.59–83.33% of the assigned hypotheses could be confirmed for the respective factors, which were consequently rated “sufficient” for construct validity. An overview of the correlations and the respective hypotheses is presented in Appendix 5.
Responsiveness (D10)
5 analyses for the assessment of responsiveness were identified for the M5M total score and all 5 factors87. 40–60% of the hypotheses could be confirmed for the factors resulting in “inconsistent” ratings. None of the hypotheses for the M5M total scores could be confirmed hence why the responsiveness of it was rated “insufficient”. Nevertheless, the M5M has frequently and successfully been used in antipsychotic drug trials which we do not address here. Those trials indicate satisfactory responsiveness.
GRADE approach
The quality of evidence regarding the construct validity of the M5M total score and of all factors but the negative factor was downgraded by one level to “moderate” quality of evidence due to a lack of well-executed analyses or inconsistent results.
The quality of evidence was not determined for the responsiveness of the five factors because they received inconsistent ratings in the updated criteria and were therefore not graded8,9,10.
The quality of all other measurement properties of the scale or the factors was not downgraded (compare Table 3).
Meta-analysis
Refer to Appendix 6 for pooled results and a more comprehensive overview of the data of individual analyses entering meta-analysis, all pooled results, as well as a forest plot of all correlations for construct validity and their 95% confidence interval. Publication bias could not be assessed because for no measurement property at least 10 analyses that could be pooled were available.
Recommendation
Due to the high-quality evidence for “insufficient” structural validity of the M5M-PANSS it is categorized as (C), not recommendable, following the COSMIN guideline8,9,10.
Feasibility
We gathered information on the feasibility of the PANSS in our publication on the original three-subscale PANSS7, which is applicable independent of the used model. Strengths of the PANSS are its availability in over 40 languages, simple score calculation by summing up item scores, the availability of the PANSS manual with its precise definitions, the Structured Clinicial Interview for PANSS assessments (SCI-PANSS)105 and the Informant Questionnaire (IQ-PANSS)106. Weaknesses are the scale’s long completion time of 30–50 min2, requirements for rater training to achieve acceptable interrater reliability making the standardization process laborious107, and the high cost of the instrument ($96.40 for the PANSS Technical Manual, $90.80 for 25 PANSS Rating and Profile Forms)108 due to copyright reasons. The European Medicines Agency (EMA)109 requires the use of the PANSS for drug approval. All the information gathered on feasibility as required by COSMIN can be found in Appendix 7.
Assessment of fit indices of other five-factor models
As the M5M never met COSMIN’s criteria for fit indices, we examined other factor solutions, as well. 23 studies1,24,41,45,53,55,56,72,74,80,84,85,97,110,111,112,113,114,115,116,117,118,119 presented goodness of fit indices for other models than the M5M5 and the original Kay et al. structure2. A total of 73 models (52 five-factor, 6 six-factor, 4 four-factor, 3 three-factor, 2 two-factor, 2 unidimensional, 2 specific bifactorial models (containing 1 general factor and 5 specific factors), one seven-factor model, one eight-factor model) were assessed by 237 confirmatory analyses (the higher number of analyses than studies stems from the fact that most studies tested several models and several tested them in more than one sample).
The vast majority of these analyses (199)1,24,41,45,55,56,72,74,80,84,85,97,111,112,113,114,115,116,117,118,119 were on 5-factor models other than the M5M. The remaining analyses were 1624,45,72,85,110,112,115 on 4-factor models, 645,53,72,84,115 on 6-factor models, 4110,111,112 on 3-factor models, 4110,111,112 on 2-factor models, 380,84 on specific bifactorial-models (containing 1 general factor and 5 specific factors), 3110,112 on unidimensional models, one41 on a 7-factor model and one72 on an 8-factor model.
Only 19 of the 237 fit indices surpassed the COSMIN updated criteria criterion of CFI > 0.95/ RMSEA < 0.06. The respective fit indices belonged to an overall number of 9 models: the Wallwork1 five-factor model85,119, the Lindenmayer49 five-factor model85, 1 of 2 van der Gaag56 five-factor models85, the White120 five-factor model85, the Jiang24 five-factor model24, Freitas’ 201985 five-factor model85, Kay and Sevys´3 pyramidical four-factor model85, Andersons´80 bifactorial model80, the Reichenberg53 six-factor model53. It should be noted that most of the CFIs with values > 0.95 were presented in Freitas et al. 201985 (73.68%).
29 further fit indices on 22 models surpassed the criterion of CFI > 0.9 or RMSEA ≤ 0.08. In addition to the models just mentioned there were, the Strauss-Peralta 1974/1992121,122 four-factor model described by Cuesta and Peralta110, the Peralta122 three-factor model described by Cuesta and Peralta110, the Peralta and Cuesta4 four-factor model110, the Mass42 five-factor model24, the Lancon 200040 five-factor model24, the Hayashi113 five-factor model113, the Fitzgerald114 five-factor model24, the Drake45 (I) five-factor model45, the Drake45 (II) five-factor model45, the Reichenberg53 six-factor model53, the van den Oord115 five-factor model115, the van den Oord115 six-factor model115 and Andersons´80 five-factor model80.
Due to the predominance of five-factor models in this research (identified in 675,54,56,58,59,61,79,82,83,85,88,94,95/9592,93,94,95=71% of exploratory factor analyses, 61,24,49,56,85,120/91,3,24,49,53,56,80,85,120 =67% of those with COSMIN CFI > 0.95 and 141,24,40,42,45,49,56,80,85,120/221,3,4,24,40,42,45,49,53,56,80,85,110,120,121,122=64% of those with CFI ≥ 0.90) we further focused on those. They are presented in Table 4. Details on all models and fit indices are presented in Appendix 8.
There were differences in frequency as to how often the 13 models had a CFI > 0.95/ ≥ 0.90: Wallwork (2012) had most (485,119/91,24,85,117,119); followed by White (1997) (385/524,85); Van der Gaag (2006I) (385/624,56,85); Lindenmayer (2004) (385/385); Freitas (2019) (285/385); Jiang (2013) (124/324). While the Wallwork1 model had the most satisfying fit indices and van der Gaags ranked second, they were as well among the most assessed models.
Several items seem to reliably load onto the same dimension in all five factor models which surpassed CFI > 0.95 at least once: P1 (Delusions), P3 (Hallucinatory behavior) and G9 (Unusual thought content) on the positive factor, N2 (Emotional withdrawal), N3 (Poor rapport), N4 (Passive social withdrawal) and N6(Lack of spontaneity) on the negative factor, G2 (Anxiety), G3 (Guilt) and G6(Depression) on the anxiety/depression factor, P4 (Excitement), P7 (Hostility) and G14 (Poor impulse control) on the hostility/excitement factor.
Several items seem to reliably load onto the same dimension in all five-factor models which surpassed CFI ≥ 0.90 at least once: P1 and G9 on the positive factor, N3 on the negative factor, G2 and G6 on the anxiety/ depression factor, G14 on the hostility/ excitement factor. Notably, the same consistencies in item distribution also apply to the M5M model. No items could be identified which consistently loaded on the cognitive factor in all of these models, and the composition of this factor was least homogeneous.
Discussion
The M5M-PANSS5 is very frequently used in antipsychotic drug trials. This systematic review is, to the best of our knowledge, the first to assess the measurement properties of the M5M following COSMIN8,9,10. We included 95 publications and were only able to assess three COSMIN measurement properties, namely structural validity, construct validity and responsiveness, for the other seven domains no data were available. Good construct validity is a strength, because the domains correlated strongly with other rating scales on the same constructs. We already determined that the content validity of the PANSS has short-comings in our COSMIN review of the original three-subscale PANSS7. However, the main limitation of the M5M is structural validity, because goodness of fit measures did not reach COSMIN’s criterion of CFI > 0.95 and the more relaxed criterion of CFI ≥ 0.90. Some other 5-factor solutions had better fit indices.
Most psychometric analyses assessing PANSS factor solutions focused almost entirely on structural validity. It was found out relatively early that the original 3 subscale version is not appropriate3, hence researchers tried to identify factor structures with better fit. In doing so they overlooked that any new factor structure means that we are essentially talking about a new instrument with new subscales which require a full assessment of other measurement properties, as well8,9,10. One reason as to why this has so far not been properly done may be that to date no consistently fitting factor solution has been identified (see below). Nevertheless, such analyses would be important if the original PANSS were to be replaced by other factor solutions.
We could not confirm “sufficient” goodness of fit for the M5M solution. We, therefore, examined other solutions. In 66 publications 95 different factor-models were found by exploratory factor analyses and 73 were examined with confirmatory methods. Most of these yielded 5-factor solutions (67/95 models) so that we focused on them. Overall, their factor content was usually similar. Nevertheless, there occurred a lot of item changes between the factors which are probably caused by differences in populations (e.g. acutely ill versus stable state of disease or different diagnoses), differences in fit indices applied or sample size (leading to more or less uncertainty). Therefore, it was impossible for us to identify the best factor solution.
Our assessment of the PANSS factor structure shows that it is difficult to identify a uniform factor solution of the PANSS which provides good fit in different and heterogeneous populations. However, we also identified consistencies among the models. The description of which items went to which factor in the various models revealed a certain pattern (see Table 4). Some items always loaded on the same factors. Therefore, the names given to the identified factors were relatively consistent across models. We emphasize that the cognitive factor is the least robust factor in terms of item to factor assignment.
Two models stood out. Wallwork et al.1 identified a model solution by counting “votes” from published factor analyses which was subsequently tested and validated in several independent publications. Van der Gaag et al.56 used ten-fold cross-validation as a particularly stringent method. In our work both Wallwork’s and van der Gaag’s models most frequently had good fit indices. It should also be noted that the M5M, although it showed no good model fit, was relatively similar in item composition to the other models summarized in Table 4. Shafer et al. 2019123 conducted a meta-analysis on the structure of the PANSS and BPRS-E scales, identifying “core items” among them. Shafer’s’ findings are largely consistent with our results and should also be considered in the development process of scales based on the PANSS items.
Limitations
Several limitations of this review, some of which are similar to those of our previous publication7, must be acknowledged. The first two limitations relate purely to the shortcomings of the COSMIN methodology, while the subsequent ones also contain matters relating to the psychometric literature on the PANSS.
First, our “vote count approach” in the comparison of models is certainly far from ideal, e.g. the COSMIN cutoff 0.95 for confirmatory factor analyses is to some extent arbitrary. However, our analysis with the frequently used cutoff in the literature 0.90 which is more relaxed yielded comparable results.
Second, while the COSMIN framework offers a structured approach to evaluate the measurement properties of instruments like the PANSS, it does not accommodate certain methodologies that might provide a more precise analysis. This required us to use certain workarounds to accommodate for the variety of approaches used and prevented us from assessing a few analyses like Network analyses124, orthonormal projective non-negative matrix factorization (OPNMF)125 leading to their exclusion and certain structural equation modeling (SEM) analyses73,75 which could not be qualitatively evaluated and were rated “indeterminate”7. Due to the shortcomings of COSMIN, several methodologically different procedures with different levels of validity were assessed based on the same criteria, e.g. no consideration of different rotation methods or eigenvalue criteria7. Nevertheless, the results of non-included or “indeterminate” rated assessments support our findings.
Third, most measurement properties of the M5M-PANSS have not been addressed yet, therefore, we cannot make a statement regarding them. Thus, it is possible that if future studies assessed them, our recommendation would become better.
Fourth, we focussed on the M5M, because it is frequently used in antipsychotic drug trials. Ideally, we would also have addressed all other solutions, but there are so many (95!) that this would not have been feasible. Moreover, as there is so little evidence on the M5M (we identified 4 validation studies), it is unlikely that the other solutions have been assessed more thoroughly.
Fifth, the PANSS is an instrument with ordinal scaling but, following the original authors of the included studies, we assumed that it is continuous7.
Sixth, differences in the PANSS assessments between analyses must be considered. The SCI-PANSS105, a structured clinical interview, and the IQ-PANSS106, a structured informant questionnaire, are available for PANSS assessments. Additionally, the PANSS Institute provides training for raters. However, the use of these instruments and the rater level are often not specified or may vary.
Seventh, 74% of the “sufficient” fit indices were extracted from the same publication85, in which 78% of all calculated fit indices for different models surpassed CFI ≥ 0.95. Models with reportedly good fit in the Freitas publication had worse fit in some other analyses (e.g. Wallwork-model’s CFI in Freitas et al. (2019)85=0.964, Wallwork-model’s CFI in Langeveld et al.116=0.812). Therefore, and because Freitas et al.’s analyses were conducted in samples of patients in a stable disease state85, these results must be treated with caution. The good model fit might not apply to other populations such as acutely ill patients or as differences might be due to differences in methodological approaches, clinical heterogeneity and interviewer variability across multiple centers. Only 4 of the previous 9 models achieve the COSMIN cutoff of CFI > 0.95, if the publication by Freitas (in the sense of a sensitivity analysis) is disregarded. These are the Jiang24 five-factor model, the Anderson80 bifactorial model, the Reichenberg53 six-factor model and the Wallwork1 five-factor model. The Wallwork1 five-factor model is the only one of these four that achieves CFI > 0.95 in more than 1 analysis. This suggests a certain independence of the positive results of the Wallwork1 model from the possible bias of the Freitas 201985 publication.
Eight, the COSMIN criteria do not address the situation if items loaded on various factors which the original authors resolved by assigning such items to the factors they most strongly loaded on.
Conclusion
We conclude that the Marder five-factor PANSS has “insufficient” structural validity, although most of the 5 factor solutions (including M5M) have core items in common across all factors except the cognitive factor. Therefore, future validation studies may focus on other models such as the 5-factor model proposed by Wallwork and colleagues. Any such model requires a full exploration of its measurement properties before it can be generally recommended.
Data availability
The publications from which the data were analyzed in the current study are available in the PubMed and Embase databases. PubMed can be accessed via [https://pubmed.ncbi.nlm.nih.gov], Embase is a subscription-based database that requires institutional or individual paid access. References to all included publications can be found in the reference section of this publication. The meta-analytical data generated during this study are available within the paper and its supplementary files. No specific data repository was used; therefore, accession codes are not needed.
References
Wallwork, R. S., Fortgang, R., Hashimoto, R., Weinberger, D. R. & Dickinson, D. Searching for a consensus five-factor model of the Positive and Negative Syndrome Scale for schizophrenia. Schizophr. Res. 137, 246–250 (2012).
Kay, S. R., Fiszbein, A. & Opler, L. A. The positive and negative syndrome scale (PANSS) for schizophrenia. Schizophr. Bull. 13, 261–276 (1987).
Kay, S. R. & Sevy, S. Pyramidical model of schizophrenia. Schizophr. Bull. 16, 537–545 (1990).
Peralta, V. Psychometric properties of the positive and negative syndrome scale (PANSS) in schizophrenia. Psychiatry Res. 53, 31–40 (1994).
Marder, S. R., Davis, J. M. & Chouinard, G. The effects of risperidone on the five dimensions of schizophrenia derived by factor analysis: combined results of the North American trials. J. Clin. Psychiatry 58, 538–546 (1997).
Le Moigne, A. et al. PANSS individual item and marder dimension analyses from a pivotal trial of RBP-7000 (monthly extended-release risperidone) in schizophrenia patients. J. Clin. Psychiatry 82, 21m13906 (2021).
Geck, S. A. et al. COSMIN Systematic Review and Meta-analysis of the Measurement Properties of the PANSS. EClinicalMedicine. (in the press).
Mokkink, L. B. et al. COSMIN Risk of Bias checklist for systematic reviews of Patient-Reported Outcome Measures. Qual. Life Res. 27, 1171–1179 (2018).
Prinsen, C. A. C. et al. COSMIN guideline for systematic reviews of patient-reported outcome measures. Qual. Life Res. 27, 1147–1157 (2018).
Terwee, C. B. et al. COSMIN methodology for evaluating the content validity of patient-reported outcome measures: a Delphi study. Qual. Life Res. 27, 1159–1170 (2018).
Weigel, L. et al. The Brief negative Symptom Scale (BNSS): a systematic review of measurement properties. Schizophrenia (Heidelburg) 9, 45 (2023).
Wehr, S. et al. Clinical Assessment Interview for Negative Symptoms (CAINS): a systematic review of measurement properties. Schizophr. Bull. 50, 747–756 (2024).
Roithmeier M. A. et al. Systematic review of the Positive and Negative Syndrome Scale following the COSMIN standard. https://doi.org/10.17605/OSF.IO/5EGMD (2024).
Terwee, C. B., Jansma, E. P., Riphagen, I. I. & de Vet, H. C. Development of a methodological PubMed search filter for finding studies on measurement properties of measurement instruments. Qual. Life Res. 18, 1115–1123 (2009).
Elsman, E. B. M. et al. Systematic review on the measurement properties of diabetes-specific patient-reported outcome measures (PROMs) for measuring physical functioning in people with type 2 diabetes. BMJ Open Diab. Res. Care 10, e002729 (2022).
Bentler, P. M. Comparative fit indexes in structural models. Psychol. Bull. 107, 238–246 (1990).
Steiger, J. H. & Lind, J. C. Statistically based tests for the number of common factors. Paper presented at the Psychometric Society Annual Meeting, Iowa City, IA (1980).
Browne, M. W. & Cudeck, R. Alternative ways of assessing model fit. Sociol. Methods Res. 21, 230–258 (1992).
Bentler, P. & Bonett, D. Significance tests and goodness-of-fit in analysis of covariance structures. Psychol. Bull. 88, 588–606 (1980).
Newsom J. T. Some Clarifications and Recommendations on Fit Indices 123–133 (USP, 655, 2012).
Zúñiga Le-Bert, M., Wiessner, M., Wehr, S., Weigel, L. & Leucht, S. Schizophrenia Quality of Life Scale and Schizophrenia Quality of Life Scale Revision 4: a systematic review of measurement properties. Schizophr. Bull. https://doi.org/10.1093/schbul/sbae119 (2024).
Borenstein, M., Hedges, L., Higgins, J. & Rothstein, H. An introduction to meta-analysis. In Introduction to Meta-Analysis Vol. 19 (John Wiley & Sons Ltd, West Sussex, UK, 2009).
Kline, R. B. Principles and Practice of Structural Equation Modeling (The Guilford Press, Guilford, NY, 2010).
Jiang, J., Sim, K. & Lee, J. Validated five-factor model of positive and negative syndrome scale for schizophrenia in Chinese population. Schizophr. Res. 143, 38–43 (2013).
Dollfus, S., Petit, M., Lesieur, P. & Menard, J. F. Principal-component analysis of PANSS and SANS-SAPS global ratings in schizophrenic patients. Eur. Psychiatry 6, 251–259 (1991).
Lepine, J. Dimensions positives et négatives dans les schizophrénies. Les. Cah. Prism. 1, 23–29 (1991).
Lindström, E. & von Knorring, L. Principal component analysis of the Swedish version of the Positive and Negative Syndrome Scale for schizophrenia. Nord. J. Psychiatry 47, 257–263 (1993).
Bell, M. D. Five-Component model of schizophrenia: assessing the factorial invariance of the positive and negative syndrome scale. Psychiatry Res. 52, 295–303 (1994).
Kawasaki, Y. et al. Evaluation and interpretation of symptom structures in patients with schizophrenia. Acta Psychiatr. Scand. 89, 399–404 (1994).
Lindenmayer, J. P., Bernstein-Hyman, R. & Grochowski, S. Five-factor model of schizophrenia. Initial validation. J. Nerv. Ment. Dis. 182, 631–638 (1994).
Lindenmayer, J. P., Bernstein-Hyman, R. & Grochowski, S. A new five factor model of schizophrenia. Psychiatr. Q 65, 299–322 (1994).
Dollfus, S. & Petit, M. Principal-component analyses of PANSS and SANS-SAPS in schizophrenia: their stability in an acute phase. Eur. Psychiatry 10, 97–106 (1995).
Lancon, C. et al. Psychometric properties of the PANSS in a sample of French schizophrenic patients. Encephale 23, 1–9 (1997).
Loas, G., Noisette, C., Legrand, A. & Delahousse, J. A four-syndrome model of chronic schizophrenia: principal components analysis of the Postive and Negative Syndrome Scale (PANSS) in 153 chronic schizophrenics. Encephale 23, 10–18 (1997).
Higashima, M. et al. P300 and the thought disorder factor extracted by factor-analytic procedures in schizophrenia. Biol. Psychiatry 44, 115–120 (1998).
Klapal, M., Eggers, C., Bunk, D. & Koriath, H. The 5 factor model of childhood schizophrenia]. Nervenarzt 69, 238–242 (1998).
Lançon, C., Aghababian, V., Llorca, P. M. & Auquier, P. Factorial structure of the Positive and Negative Syndrome Scale (PANSS): a forced five-dimensional factor analysis. Acta Psychiatr. Scand. 98, 369–376 (1998).
Bunk, D, Eggers, C & Klapal, M Symptom dimensions in the course of childhood-onset schizophrenia. Eur. Child Adolesc. Psychiatry 8(Suppl. 1), I29–I35 (1999).
Lançon, C., Reine, G., Llorca, P. M. & Auquier, P. Validity and reliability of the French-language version of the Positive and Negative Syndrome Scale (PANSS). Acta Psychiatr. Scand. 100, 237–243 (1999).
Lançon, C., Auquier, P., Nayt, G. & Reine, G. Stability of the five-factor structure of the Positive and Negative Syndrome Scale (PANSS). Schizophr. Res 42, 231–239 (2000).
Lykouras, L. et al. Five-factor model of schizophrenic psychopathology: how valid is it? Eur. Arch. Psychiatry Clin. Neurosci. 250, 93–100 (2000).
Mass, R., Schoemig, T., Hitschfeld, K., Wall, E. & Haasen, C. Psychopathological syndromes of schizophrenia: evaluation of the dimensional structure of the positive and negative syndrome scale. Schizophr. Bull. 26, 167–177 (2000).
Wolthaus, J. E. D. et al. Component structure of the Positive And Negative Syndrome Scale (PANSS) in patients with recent-onset schizophrenia and spectrum disorders. Psychopharmacology 150, 399–403 (2000).
El Yazaji, M. et al. Validity of the depressive dimension extracted from principal component analysis of the PANSS in drug-free patients with schizophrenia. Schizophr. Res. 56, 121–127 (2002).
Drake, R. J. et al. The evolution of symptoms in the early course of non-affective psychosis. Schizophr. Res. 63, 171–179 (2003).
Emsley, R. et al. The factor structure for the Positive and Negative Syndrome Scale (PANSS) in recent-onset psychosis. Schizophr. Res. 61, 47–57 (2003).
Honey, G. D. et al. The functional neuroanatomy of schizophrenic subsyndromes. Psychol. Med. 33, 1007–1018 (2003).
Lee, K. H., Harris, A. W., Loughland, C. M. & Williams, L. M. The five symptom dimensions and depression in schizophrenia. Psychopathology 36, 226–233 (2003).
Lindenmayer, J. P. et al. Effects of atypical antipsychotics on the syndromal profile in treatment-resistant schizophrenia. J. Clin. Psychiatry 65, 551–556 (2004).
Masiak, M. & Loza, B. Core factors of schizophrenia structure based on PANSS and SAPS/SANS results. Discerning and head-to-head comparison of PANSS and SASPS/SANS validity. Psychiatr. Pol. 38, 795–808 (2004).
Mohr, P. E. et al. The heterogeneity of schizophrenia in disease states. Schizophr. Res. 71, 83–95 (2004).
Fresán, A. et al. A forced five-dimensional factor analysis and concurrent validity of the Positive and Negative Syndrome Scale in Mexican schizophrenic patients. Schizophr. Res. 72, 123–129 (2005).
Reichenberg, A., Rieckmann, N. & Harvey, P. D. Stability in schizophrenia symptoms over time: findings from the Mount Sinai Pilgrim Psychiatric Center Longitudinal Study. J. Abnorm. Psychol. 114, 363–372 (2005).
Tirupati, S. N., Padmavati, R., Thara, R. & McCreadie, R. G. Psychopathology in never-treated schizophrenia. Compr. Psychiatry 47, 1–6 (2006).
van der Gaag, M. et al. The five-factor model of the Positive and Negative Syndrome Scale I: confirmatory factor analysis fails to confirm 25 published five-factor solutions. Schizophr. Res. 85, 273–279 (2006).
van der Gaag, M. et al. The five-factor model of the Positive and Negative Syndrome Scale II: a ten-fold cross-validation of a revised model. Schizophr. Res. 85, 280–287 (2006).
Villalta-Gil, V. et al. Four symptom dimensions in outpatients with schizophrenia. Compr. Psychiatry 47, 384–388 (2006).
Levine, S. Z. & Rabinowitz, J. Revisiting the 5 dimensions of the Positive and Negative Syndrome Scale. J. Clin. Psychopharmacol. 27, 431–436 (2007).
Ruhrmann, S. et al. Efficacy of flupentixol and risperidone in chronic schizophrenia with predominantly negative symptoms. Prog. Neuro-Psychopharmacol. Biol. Psychiatry 31, 1012–1022 (2007).
Santor, D. A., Ascher-Svanum, H., Lindenmayer, J.-P. & Obenchain, R. L. Item response analysis of the Positive and Negative Syndrome Scale. BMC Psychiatry 7, 66 (2007).
Lindenmayer, J. P., Bossie, C. A., Kujawa, M., Zhu, Y. & Canuso, C. M. Dimensions of psychosis in patients with bipolar mania as measured by the positive and negative syndrome scale. Psychopathology 41, 264–270 (2008).
Ruiz-Veguilla, M. et al. Neurodevelopmental markers in different psychopathological dimensions of first episode psychosis: the ESPIGAS study. Eur. Psychiatry 23, 533–540 (2008).
Gil, D. et al. Validity of the cognitive factor of the Positive and Negative Syndrome Scale as a measure of cognitive functioning in schizophrenia. Rev. Psiquiatr. Salud Ment. 2, 160–168 (2009).
Hwang, S. S. H., Chang, J. S., Lee, K. Y., Ahn, Y. M. & Kim, Y. S. The causal model of insight in schizophrenia based on the positive and negative syndrome scale factors and the structural equation modeling. J. Nerv. Ment. Dis. 197, 79–84 (2009).
Citrome, L., Meng, X. & Hochfeld, M. Efficacy of iloperidone in schizophrenia: a PANSS five-factor analysis. Schizophr. Res. 131, 75–81 (2011).
Khan, A., Lewis, C. & Lindenmayer, J.-P. Use of non-parametric item response theory to develop a shortened version of the Positive and Negative Syndrome Scale (PANSS). BMC Psychiatry 11, 178 (2011).
Levine, S. Z., Rabinowitz, J. & Rizopoulos, D. Recommendations to improve the positive and negative syndrome scale (PANSS) based on item response theory. Psychiatry Res. 188, 446–452 (2011).
Kim, J. H. et al. Evaluation of the factor structure of symptoms in patients with schizophrenia. Psychiatry Res. 197, 285–289 (2012).
Kumar, A. & Khess, C. R. J. Factor analysis of positive and negative syndrome scale in schizophrenia: an exploratory study. Indian J. Psychiatry 54, 233–238 (2012).
Higuchi, C. H. et al. Factor structure of the Positive and Negative Syndrome Scale (PANSS) in Brazil: convergent validation of the Brazilian version. Braz. J. Psychiatry 36, 336–339 (2014).
Khan, A. et al. The evolution of illness phases in schizophrenia: a non-parametric item response analysis of the Positive and Negative Syndrome Scale. Schizophr. Res.: Cogn. 1, 53–89 (2014).
Stochl, J. et al. Multilevel ordinal factor analysis of the positive and negative syndrome scale (PANSS). Int. J. Methods Psychiatr. Res. 23, 25–35 (2014).
Woodward, T. S. et al. Symptom changes in five dimensions of the Positive and Negative Syndrome Scale in refractory psychosis. Eur. Arch. Psychiatry Clin. Neurosci. 264, 673–682 (2014).
Anderson, A. et al. Sparse factors for the positive and negative syndrome scale: which symptoms and stage of illness? Psychiatry Res. 225, 283–290 (2015).
Fong, T. C. T., Ho, R. T. H., Wan, A. H. Y., Siu, P. J. C. Y. & Au-Yeung, F. S. W. Psychometric validation of the consensus five-factor model of the Positive and Negative Syndrome Scale. Compr. Psychiatry 62, 204–208 (2015).
Xu, K. et al. Preliminary analysis of positive and negative syndrome scale in ketamine-associated psychosis in comparison with schizophrenia. J. Psychiatr. Res 61, 64–72 (2015).
Best, M. W., Grossman, M., Oyewumi, L. K. & Bowie, C. R. Examination of the Positive and Negative Syndrome Scale factor structure and longitudinal relationships with functioning in early psychosis. Early Interv. Psychiatry 10, 165–170 (2016).
Ostergaard, S. D., Lemming, O. M., Mors, O., Correll, C. U. & Bech, P. PANSS-6: a brief rating scale for the measurement of severity in schizophrenia. Acta Psychiatr. Scand. 133, 436–444 (2016).
Thokagevistk, K., et al. Validation of disease states in schizophrenia: comparison of cluster analysis between US and European populations. J. Market Access Health Policy 4, https://doi.org/10.3402/jmahp.v4.30725 (2016).
Anderson, A. E. et al. Disparity between General Symptom Relief and Remission Criteria in the Positive and Negative Syndrome Scale (PANSS): a post-treatment bifactor item response theory model. Innov. Clin. Neurosci. 14, 41–53 (2017).
Dragioti, E., Wiklund, T., Siamouli, M., Moutou, K. & Fountoulakis, K. N. Could PANSS be a useful tool in the determining of the stages of schizophrenia? A clinically operational approach. J. Psychiatr. Res. 86, 66–72 (2017).
Peitl, V., Štefanović, M. & Karlović, D. Depressive symptoms in schizophrenia and dopamine and serotonin gene polymorphisms. Prog. Neuropsychopharmacol. Biol. Psychiatry 77, 209–215 (2017).
Yehya, A. et al. Validation of the five-factor model of the Arabic version of the Positive and Negative Syndrome Scale in Schizophrenia. Psychopathology 50, 211–218 (2017).
Anderson, A. E. et al. Bifactor modeling of the Positive and Negative Syndrome Scale: generalized psychosis spans schizoaffective, bipolar, and schizophrenia diagnoses. Schizophr. Bull. 44, 1204–1216 (2018).
De Freitas, R. et al. Can the positive and negative syndrome scale (PANSS) differentiate refractory from non-refractory schizophrenia? A factor analytic investigation based on data from the pattern cohort study. Schizophr. Bull. 44, S371 (2018).
Grover, S., Dua, D., Chakrabarti, S. & Avasthi, A. Factor analysis of symptom dimensions (psychotic, affective and obsessive compulsive symptoms) in schizophrenia. Asian J. Psychiatry 38, 72–77 (2018).
Hopkins, S. C., Ogirala, A., Loebel, A. & Koblan, K. S. Transformed PANSS factors intended to reduce pseudospecificity among symptom domains and enhance understanding of symptom change in antipsychotic-treated patients with schizophrenia. Schizophr. Bull. 44, 593–602 (2018).
Lefort-Besnard, J. et al. Patterns of schizophrenia symptoms: hidden structure in the PANSS questionnaire. Transl. Psychiatry 8, 237 (2018).
Lin, C. H. et al. Early improvement in PANSS-30, PANSS-8, and PANSS-6 scores predicts ultimate response and remission during acute treatment of schizophrenia. Acta Psychiatr. Scand.137, 98–108 (2018).
Ostergaard, S. D., Foldager, L., Mors, O., Bech, P. & Correll, C. U. The validity and sensitivity of PANSS-6 in treatment-resistant schizophrenia. Acta Psychiatr. Scand.138, 420–431 (2018).
Ostergaard, S. D., Foldager, L., Mors, O., Bech, P. & Correll, C. U. The validity and sensitivity of PANSS-6 in the Clinical Antipsychotic Trials of Intervention Effectiveness (CATIE) study. Schizophr. Bull. 44, 453–462 (2018).
Petruzzelli, M. G. et al. Early onset first episode psychosis: dimensional structure of symptoms, clinical subtypes and related neurodevelopmental markers. Eur. Child Adolesc. Psychiatry 27, 171–179 (2018).
Walsh-Messinger, J. et al. Factor structure of the Positive and Negative Syndrome Scale (PANSS) differs by sex. Clin. Schizophr. Relat. Psychoses 11, 207–213 (2018).
Fountoulakis, K. N. et al. Staging of schizophrenia with the use of PANSS: an International Multi-Center Study. Int. J. Neuropsychopharmacol. 22, 681–697 (2019).
Lim, K. et al. Large-scale evaluation of the Positive and Negative Syndrome Scale (PANSS) symptom architecture in schizophrenia. Asian J. Psychiatr. 62, 102732 (2021).
Baandrup, L. et al. Scalability of the Positive and Negative Syndrome Scale in first-episode schizophrenia assessed by Rasch models. Acta Psychiatr. Scand. 146, 21–35 (2022).
Findling, R. L. et al. An optimized version of the Positive and Negative Symptoms Scale (PANSS) for pediatric trials. J. Am. Acad. Child Adolesc. Psychiatry 62, 427–434 (2023).
Kagan, S. et al. Longitudinal invariance of the positive and negative syndrome scale negative dimension in antipsychotic naïve first-episode schizophrenia. Early Interv. Psychiatry 16, 581–586 (2022).
Hopkins, S. C., Ogirala, A., Loebel, A. & Koblan, K. S. Understanding antipsychotic drug treatment effects: a novel method to reduce pseudospecificity of the Positive and Negative Syndrome Scale (PANSS) factors. Innov. Clin. Neurosci. 14, 54–58 (2017).
Rabinowitz, J., Mehnert, A. & Eerdekens, M. To what extent do the PANSS and CGI-S overlap? J. Clin. Psychopharmacol. 26, 303–307 (2006).
Van Erp, T. G. M. et al. Converting positive and negative symptom scores between PANSS and SAPS/SANS. Schizophr. Res. 152, 289–294 (2014).
Busner, J. & Targum, S. D. The clinical global impressions scale: applying a research tool in clinical practice. Psychiatry (Edgmont) 4, 28–37 (2007).
Andreasen, N. C. The Scale for the Assessment of Positive Symptoms (SAPS) (University of Iowa, Iowa City, IA, 1984).
Andreasen, N. C. The Scale for the Assessment of Negative Symptoms (SANS) (University of Iowa, Iowa City, IA, 1983).
Kay, S. R., Opler, L. A. & Lindenmayer, J. Structured Clinical Interview for the PANSS (SCI-PANSS) (Multi-Health Systems, 1992).
Kay, S. R., Opler, L. A. & Fiszbein, A. Positive and Negative Syndrome Scale (PANSS): Manual (Multi-Health Systems, Inc., 2006).
Müller, M. J. et al. Evaluation of standardized rater training for the Positive and Negative Syndrome Scale (PANSS). Schizophr. Res. 32, 151–160 (1998).
Pearson Assessments. Positive and Negative Syndrome Scale https://www.pearsonassessments.com/store/usassessments/en/Store/Professional-Assessments/Personality-%26-Biopsychosocial/Positive-and-Negative-Syndrome-Scale/p/P100025000.html (2023).
European Medicines Agency. Guideline on Clinical Investigation oOf Medicinal Products, Including Depot Preparations, in the Treatment of Schizophrenia—Revision 1 (European Medicines Agency, 2012).
Cuesta, M. J. & Peralta, V. Psychopathological dimensions in schizophrenia. Schizophr. Bull. 21, 473–482 (1995).
Dollfus, S. & Everitt, B. Symptom structure in schizophrenia: two-, three- or four-factor models? Psychopathology 31, 120–130 (1998).
Nakaya, M., Suwa, H. & Ohmori, K. Latent structures underlying schizophrenic symptoms: a five-dimensional model. Schizophr. Res. 39, 39–50 (1999).
Hayashi, N., Igarashi, Y., Yamashina, M. & Suda, K. Is there a gender difference in a factorial structure of the positive and negative syndrome scale? A test by structural equation modeling. Psychopathology 35, 28–35 (2002).
Fitzgerald, P. B. et al. A confirmatory factor analytic evaluation of the pentagonal PANSS model. Schizophr. Res. 61, 97–104 (2003).
Van den Oord, E. J. et al. Factor structure and external validity of the PANSS revisited. Schizophr. Res. 82, 213–223 (2006).
Langeveld, J. et al. Is there an optimal factor structure of the Positive and Negative Syndrome Scale in patients with first-episode psychosis? Scand. J. Psychol. 54, 160–165 (2013).
Rodriguez-Jimenez, R. et al. Cognition and the five-factor model of the positive and negative syndrome scale in schizophrenia. Schizophr. Res. 143, 77–83 (2013).
Stefanovics, E. A., Elkis, H., Zhening, L., Zhang, X. Y. & Rosenheck, R. A. A cross-national factor analytic comparison of three models of PANSS symptoms in schizophrenia. Psychiatry Res. 219, 283–289 (2014).
Higuchi, C. H. et al. Identifying strategies to improve PANSS based dimensional models in schizophrenia: Accounting for multilevel structure, Bayesian model and clinical staging. Schizophr. Res. 243, 424–430 (2022).
White, L., Harvey, P. D., Opler, L. & Lindenmayer, J. P. Empirical assessment of the factorial structure of clinical symptoms in schizophrenia. A multisite, multimodel evaluation of the factorial structure of the Positive and Negative Syndrome Scale. The PANSS Study Group. Psychopathology 30, 263–274 (1997).
Strauss, J. S., Carpenter, W. T., Jr. & Bartko, J. J. The diagnosis and understanding of schizophrenia. Part III. Speculations on the processes that underlie schizophrenic symptoms and signs. Schizophr. Bull. 61–69 https://doi.org/10.1093/schbul/1.11.61 (1974).
Peralta, V., de Leon, J. & Cuesta, M. J. Are there more than two syndromes in schizophrenia? A critique of the positive-negative dichotomy. Br. J. Psychiatry 161, 335–343 (1992).
Shafer, A. & Dazzi, F. Meta-analytic exploration of the joint factors of the Brief Psychiatric Rating Scale-Expanded (BPRS-E) and the Positive and Negative Symptoms Scales (PANSS). J. Psychiatr. Res. 138, 519–527 (2021).
Esfahlani, F. Z., Sayama, H., Visser, K. F. & Strauss, G. P. Sensitivity of the Positive and Negative Syndrome Scale (PANSS) in detecting treatment effects via network analysis. Innov. Clin. Neurosci. 14, 59–67 (2017).
Chen, J. et al. Neurobiological divergence of the positive and negative schizophrenia subtypes identified on a new factor structure of psychopathology using non-negative factorization: an International Machine Learning Study. Biol. Psychiatry 87, 282–293 (2020).
Acknowledgements
We are grateful to the authors who replied to our mails/inquiries: Jimmy Lee Chee Keong, Jan Stochl, Leslie Citrome, Danilo Bzdok, Dalibor Karlović, Ji Chen, Helio Elkis, Eric Youngstrom, Stephen Marder, Jean-Pierre Lindenmayer, Daniel Purnine, Morris D. Bell, Kenneth Koblan, Wieneke Mokkink, Alessandro Rossi, Kim Good. We thank Farhad Shokraneh, Ph.D., for conducting the literature search.
Funding
Open Access funding enabled and organized by Projekt DEAL.
Author information
Authors and Affiliations
Contributions
The authors confirm contribution to the paper as follows: study conception and design: M.R., S.G., S.L.; data collection: M.R. (first rater), S.G. (second rater); analysis and interpretation of results: M.R. (first rater), S.G. (second rater), S.L. (third rater); statistical consultation: M.B.; draft manuscript preparation: M.R. and S.G.; revision for important intellectual content: S.L., M.B., S.W., L.W., J.P., and J.M.D.; The work will be part of the doctoral thesis of M.R.; All authors reviewed the results and approved the final version of the manuscript. All authors have agreed to be personally accountable for the author’s own contributions and to ensure that questions related to the accuracy or integrity of any part of the work, are appropriately investigated, resolved, and the resolution documented in the literature.
Corresponding author
Ethics declarations
Competing interests
In the last 3 years, S.L. has received honoraria for advising, consulting, and for lectures and/or for educational material from Angelini, Apsen, Boehringer Ingelheim, Eisai, Ekademia, Gedeon Richter, Janssen, Karuna, Kynexis, Lundbeck, Medichem, Medscape, Mitsubishi, Neurotorium, Otsuka, Novo Nordisk, Recordati, Rovi, and Teva. All other authors declare that they have no competing interests relevant to this research.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Roithmeier, M., Geck, S., Bühner, M. et al. COSMIN review of the PANSS Marder factor solution and other factor models in people with schizophrenia. Schizophr 11, 51 (2025). https://doi.org/10.1038/s41537-025-00600-6
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41537-025-00600-6