Table 2 Inter-human-rater reliability results by categories
From: Evaluating large language models in analysing classroom dialogue
Code | Cohen's kappa (κ) |
|---|---|
ELI | 0.807 |
EL | 0.739 |
REI | 0.821 |
RE | 0.853 |
CI | 0.724 |
SC | 0.716 |
RC | 0.628 |
A | 0.853 |
Q | 0.745 |
RB | 0.697 |
RW | 0.646 |
SU | 0.782 |
SA | 0.830 |
OI | 0.761 |
O | 0.742 |
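
For readers unfamiliar with the statistic, the sketch below shows how a single Cohen's kappa value of the kind reported in the table can be computed from two raters' labels, using κ = (p_o − p_e) / (1 − p_e), where p_o is the observed agreement and p_e the agreement expected by chance. It assumes each code is scored separately as present (1) or absent (0) per dialogue turn, which the table does not state. The function name `cohen_kappa` and the toy labels are hypothetical and are not taken from the study's data.

```python
from collections import Counter


def cohen_kappa(rater_a, rater_b):
    """Cohen's kappa for two equal-length sequences of categorical labels."""
    if len(rater_a) != len(rater_b) or len(rater_a) == 0:
        raise ValueError("both raters must label the same non-empty set of items")

    n = len(rater_a)

    # Observed agreement: share of items given the same label by both raters.
    p_o = sum(a == b for a, b in zip(rater_a, rater_b)) / n

    # Chance agreement: for each label, the product of the two raters'
    # marginal proportions, summed over labels.
    counts_a = Counter(rater_a)
    counts_b = Counter(rater_b)
    p_e = sum(counts_a[label] * counts_b[label] for label in counts_a) / (n * n)

    if p_e == 1.0:
        # Both raters used one identical label throughout; kappa is undefined.
        return float("nan")

    return (p_o - p_e) / (1 - p_e)


# Hypothetical toy data: 1 = code present in the turn, 0 = code absent.
toy_rater_a = [1, 1, 0, 0, 1, 0, 1, 0, 0, 0]
toy_rater_b = [1, 1, 0, 0, 1, 0, 0, 0, 0, 1]

print(round(cohen_kappa(toy_rater_a, toy_rater_b), 3))  # prints 0.583
```

In the toy data the raters agree on 8 of 10 turns (p_o = 0.8) and chance agreement is p_e = 0.52, giving κ ≈ 0.583. Repeating this calculation once per code would yield one value per row, as in the table.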