Fig. 3: Evaluation metrics reported across included studies.
From: Evaluating the performance of wearable EEG sleep monitoring devices: a meta-analysis approach

a Reported evaluation metrics per study: Presence (green) or absence (red) of overall and sleep-stage-specific evaluation metrics reported in each study. b Frequency of reported metrics across studies. Total number of studies reporting each evaluation metric, grouped by overall vs. stage-specific metrics. For devices’ overall performance, κ (n = 37) and ACC (n = 24) were the most frequently reported metrics. For sleep-stage specific performance, SE (n = 33) and PPV (n = 18) were the most frequently reported metrics. Confusion matrices were provided in 34 studies. ACC: accuracy, CM: confusion matrix, κ: Cohen’s Kappa, MCC: Matthews correlation coefficient, NPV: negative predictive value, PPV: positive predictive value, SE: sensitivity, SP: specificity.