Fig. 1
From: Automated generation of discharge summaries: leveraging large language models with clinical data

Qualitative evaluation per category; The width of each violin at a given rating value reflects the density of responses. Although most ratings clustered at 4 and 5, a minority of ratings at 2–3 created a visible distribution spread. This helps visualize variability beyond just reporting averages.