Table 6 Performance comparison between stacking and soft voting ensembles over 10 runs. Results presented as mean ± standard deviation. Statistical significance was assessed using a paired t-test.
Metric | Stacking (Mean ± Std) | Soft Voting (Mean ± Std) | P-Value |
|---|---|---|---|
Acc | 97.43 ± 0.13% | 97.06 ± 0.10% | < 0.0010 |
Pre | 98.12 ± 0.15% | 97.70 ± 0.18% | < 0.0012 |
Rec | 96.50 ± 0.17% | 96.00 ± 0.16% | < 0.0010 |
F1-score | 97.42 ± 0.14% | 96.83 ± 0.15% | < 0.0010 |
AUROC | 98.85 ± 0.14% | 98.40 ± 0.15% | < 0.0011 |