Table 3 Experimental results of an ensemble of 10 fused multi-label models trained on the SData set (\(n=22\)) and evaluated on the GData set (\(n=157\)) for the detection of PD. The AUC in the age corrected GData column refers to the mean AUC across 10 randomly generated, age-matched GData subsets (as such, we do not report performance metrics values at the two operating points for this experiment).
Operating point | Metric | GData | GData (age corrected) |
|---|---|---|---|
High sensitivity \((>0.9)\) | Sensitivity | 0.920 | |
Specificity | 0.689 | ||
High specificity \((>0.9)\) | Sensitivity | 0.600 | |
Specificity | 0.917 | ||
AUC | 0.868 | 0.834 |