Fig. 5: Results of clustering on symptom profile at 12 weeks.
From: Persistent COVID-19 symptoms in a community study of 606,434 people in England

Clustering was conducted using the CLARA (partitioning around medoids) algorithm. Two stable clusters were identified at 12 weeks. Cluster L1 (“tiredness cluster”) had a high prevalence of tiredness. Cluster L2 (“respiratory cluster”) was a smaller subset of 4,441 participants who had a high prevalence of shortness of breath, tight chest and chest pain. Panel a shows symptom prevalence by cluster. Panel b shows the distribution of symptom counts by cluster (median 2 symptoms for cluster L1 [n = 15,799] and 3 symptoms for cluster L2 [n = 4,441]). Box plots in panel b show the median and the first and third quartiles; whiskers indicate 1.5 times the interquartile range; data beyond this range are plotted as points. Panel c shows self-reported symptom severity and medical treatment sought by cluster, with participants who were no longer symptomatic at 12 weeks shown for comparison.
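
For readers unfamiliar with CLARA, the following is a minimal sketch of the general idea, not the authors' code: run partitioning around medoids (PAM) on several random subsamples, score each candidate medoid set against the full data, and keep the cheapest. The Hamming distance on binary symptom vectors, the function names, the parameter defaults and the toy profiles are all assumptions made for illustration and do not come from the study.

```python
import random

def hamming(a, b):
    # Number of symptoms on which two binary profiles disagree (assumed metric).
    return sum(x != y for x, y in zip(a, b))

def total_cost(points, medoids):
    # Sum of each point's distance to its nearest medoid.
    return sum(min(hamming(p, m) for m in medoids) for p in points)

def pam(sample, k, rng):
    # Greedy swap-based PAM on a (small) sample.
    medoids = rng.sample(sample, k)
    best = total_cost(sample, medoids)
    improved = True
    while improved:
        improved = False
        for i in range(k):
            for candidate in sample:
                if candidate in medoids:
                    continue
                trial = medoids[:i] + [candidate] + medoids[i + 1:]
                cost = total_cost(sample, trial)
                if cost < best:
                    medoids, best, improved = trial, cost, True
    return medoids

def clara(points, k, n_samples=5, sample_size=40, seed=0):
    # Run PAM on several subsamples; keep the medoids that best fit ALL points.
    rng = random.Random(seed)
    best_medoids, best_cost = None, float("inf")
    for _ in range(n_samples):
        sample = rng.sample(points, min(sample_size, len(points)))
        medoids = pam(sample, k, rng)
        cost = total_cost(points, medoids)
        if cost < best_cost:
            best_medoids, best_cost = medoids, cost
    labels = [min(range(k), key=lambda j: hamming(p, best_medoids[j]))
              for p in points]
    return best_medoids, labels

# Hypothetical toy profiles: (tiredness, shortness of breath, tight chest, chest pain).
profiles = [(1, 0, 0, 0)] * 60 + [(0, 1, 1, 1)] * 20
medoids, labels = clara(profiles, k=2)
```

On the toy data, the two medoids correspond to a tiredness-only profile and a respiratory profile. Because each PAM run sees only a subsample, the cost of clustering stays manageable for large cohorts, which is the usual motivation for CLARA over plain PAM.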