Table 9 Correlations between writing proficiency scoring by human raters and GPT 4-based AES.

From: Applying large language models for automated essay scoring for non-native Japanese

 

Measures

Correlation

Human scoring -GPT 4 scoring

Lexical richness

0.708

Syntactic complexity

0.672

Cohesion

0.751

Content elaboration

0.722

Grammatical accuracy

0.734