Table 5 Diagnostic performance of baseline and proposed methods in terms of AUC. We evaluated the patient-level mean diagnostic performance for detection of stridor or snoring. The mean, standard deviation, and 95% confidence interval were obtained from 10-fold Monte Carlo validation.
Metric | Method | No. training samples | ||
---|---|---|---|---|
4 | 6 | 8 | ||
Macro-avg AUC (95% CI) | Baseline | 0.7961 ± 0.08 (0.747, 0.846) | 0.8921 ± 0.04 (0.867, 0.917) | 0.9203 ± 0.03 (0.902, 0.939) |
Proposed (Ver. 1) | 0.8941 ± 0.04 (0.869, 0.919) | 0.9340 ± 0.04 (0.909, 0.959) | 0.9526 ± 0.02 (0.940, 0.965) | |
Proposed (Ver. 2) | 0.9151 ± 0.06 (0.878, 0.952) | 0.9541 ± 0.03 (0.935, 0.973) | 0.9597 ± 0.02 (0.947, 0.972) | |
Micro-avg AUC (95% CI) | Baseline | 0.8152 ± 0.07 (0.772, 0.859) | 0.8604 ± 0.08 (0.811, 0.910) | 0.9105 ± 0.06 (0.873, 0.948) |
Proposed (Ver. 1) | 0.8833 ± 0.04 (0.858, 0.908) | 0.9323 ± 0.04 (0.908, 0.957) | 0.9411 ± 0.03 (0.923, 0.960) | |
Proposed (Ver. 2) | 0.8974 ± 0.09 (0.842, 0.953) | 0.9541 ± 0.03 (0.935, 0.973) | 0.9578 ± 0.02 (0.945, 0.970) |