Table 4 Annotation categories for the human evaluation of the meaning of the generated text.
From: Generation and evaluation of artificial mental health records for Natural Language Processing
Category | Group | |
|---|---|---|
1 | Fully preserved | SAME |
2 | Preserved, details omitted | GOOD |
3 | Modified, does not contradict the diagnosis | GOOD |
4 | Modified, contradicts the diagnosis | BAD/IRR |
5 | Modified, irrelevant | BAD/IRR |
6 | No clinical sense | NO SENSE |
7 | Incomprehensible | NO SENSE |