Table 4 Annotation categories for the human evaluation of the meaning of the generated text.

	Category	Group
1	Fully preserved	SAME
2	Preserved, details omitted	GOOD
3	Modified, does not contradict the diagnosis	GOOD
4	Modified, contradicts the diagnosis	BAD/IRR
5	Modified, irrelevant	BAD/IRR
6	No clinical sense	NO SENSE
7	Incomprehensible	NO SENSE

Quick links

Search