Table 1 Fluency measures and POS tags in the two corpora, with statistical comparison.
| Analysis | Variable | PR narratives: M ± SD | PR narratives: Range | PR narratives: N | PR narratives: % | Reference corpus: M ± SD | Reference corpus: Range | Reference corpus: N | Reference corpus: % | W | p-value |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Fluency | WPN | 84.82 ± 99.61 | 1–391 | – | – | 91.76 ± 101.86 | 2–787 | – | – | 8,936 | 0.008 |
| | SPN | 8.43 ± 7.76 | 1–32 | – | – | 12.84 ± 11.83 | 1–92 | – | – | 9,682 | < 0.001 |
| | WPS | 13.55 ± 8.93 | 1–53 | – | – | 16.17 ± 7.47 | 2–46 | – | – | 9,146.5 | 0.003 |
| | TTR | 0.71 ± 0.12 | 0.35–1 | – | – | 0.90 ± 0.10 | 0.60–1 | – | – | 1,790.5 | < 0.001 |
| | Word frequency (log) | 4.47 ± 0.59 | 1.71–5.43 | – | – | 4.59 ± 0.26 | 2.96–5.04 | – | – | 7,140 | 0.770 |
| POS tagging | ADJ | 6.53 ± 7.62 | 0–50 | 435 | 4.63 | 8.07 ± 5.54 | 0–33.33 | 1,002 | 6.40 | 9,312.0 | < 0.001 |
| | ADP | 8.92 ± 5.16 | 0–28.57 | 910 | 9.68 | 12.26 ± 4.26 | 0–22.45 | 1,959 | 12.51 | 10,383.5 | < 0.001 |
| | ADV | 9.25 ± 6.30 | 0–33.33 | 930 | 9.90 | 6.68 ± 4.24 | 0–25 | 1,044 | 6.67 | 5,167.5 | < 0.001 |
| | AUX | 9.56 ± 7.45 | 0–50 | 871 | 9.27 | 5.90 ± 3.54 | 0–22.22 | 904 | 5.77 | 4,593.0 | < 0.001 |
| | CCONJ | 3.44 ± 3.12 | 0–16.67 | 439 | 4.67 | 4.21 ± 2.60 | 0–13.33 | 641 | 4.09 | 8,418.0 | < 0.001 |
| | DET | 7.74 ± 5.38 | 0–28.57 | 841 | 8.95 | 9.64 ± 3.72 | 0–18.75 | 1,565 | 10.00 | 9,307.0 | < 0.001 |
| | NOUN | 15.29 ± 12.66 | 0–100 | 1,266 | 13.47 | 18.11 ± 5.64 | 0–33.33 | 2,646 | 16.90 | 10,418.0 | < 0.001 |
| | PRON | 7.48 ± 5.37 | 0–25 | 928 | 9.88 | 4.94 ± 3.24 | 0–18.33 | 851 | 5.44 | 5,069.0 | < 0.001 |
| | PROPN | 2.95 ± 12.86 | 0–100 | 104 | 1.11 | 5.20 ± 7.85 | 0–52.56 | 1,164 | 7.44 | 11,718.5 | < 0.001 |
| | PUNCT | 10.74 ± 7.46 | 0–40 | 863 | 9.18 | 13.41 ± 4.31 | 3.03–33.33 | 1,938 | 12.38 | 9,906.0 | < 0.001 |
| | SCONJ | 2.66 ± 3.01 | 0–14.29 | 301 | 3.20 | 0.99 ± 1.17 | 0–4.30 | 188 | 1.20 | 4,931.5 | < 0.001 |
| | VERB | 14.80 ± 6.95 | 0–50 | 1,470 | 15.64 | 9.32 ± 3.92 | 0–18.60 | 1,528 | 9.76 | 2,932.5 | < 0.001 |