Table 9 Correlations between writing proficiency scoring by human raters and GPT 4-based AES.
From: Applying large language models for automated essay scoring for non-native Japanese
| | Measures | Correlation |
|---|---|---|
| Human scoring - GPT 4 scoring | Lexical richness | 0.708 |
| | Syntactic complexity | 0.672 |
| | Cohesion | 0.751 |
| | Content elaboration | 0.722 |
| | Grammatical accuracy | 0.734 |