Table 3 Mean (standard deviation) of AUC for the cancer vs. non-cancer classification problem across the four hold-one-site-out folds with \(F{S}_{sd}^{\theta ,\kappa }\) and \(F{S}_{d}^{\theta ,\kappa }\) \(\theta \in \{SFS,WLCX,mRMR,ROC\}\), \(\kappa \in \{LDA,QDA,SVM,RF\}\).

From: Stable and discriminating features are predictive of cancer presence and Gleason grade in radical prostatectomy specimens: a multi-site study

 

SFS

WLCX

FS sd

FS d

% Improvement

FS sd

FS d

% Improvement

LDA

0.99 (0.01)

0.99 (0.01)

−0.15

0.96 (0.04)

0.97 (0.04)

−0.22

QDA

0.98 (0.02)

0.99 (0.02)

−0.11

0.95 (0.04)

0.96 (0.05)

−1.18

SVM

0.98 (0.01)

0.99 (0.01)

−0.68

0.96 (0.03)

0.97 (0.04)

−0.36

RF

0.98 (0.02)

0.99 (0.01)

−0.84

0.95 (0.03)

0.96 (0.03)

−1.63

 

mRMR

ROC

FS sd

FS d

% Improvement

FS sd

FS d

% Improvement

LDA

0.99 (0.01)

0.99 (0.01)

−0.37

0.99 (0.01)

0.99 (0.01)

0.00

QDA

0.96 (0.04)

0.98 (0.02)

−2.32

0.98 (0.03)

0.98 (0.05)

0.00

SVM

0.97 (0.02)

0.99 (0.01)

−1.50

0.98 (0.02)

0.98 (0.01)

−0.03

RF

0.99 (0.01)

0.99 (0.01)

0.16

0.98 (0.00)

0.99 (0.00)

−0.66

  1. For each classifier model, the top 5 most stable and discriminating or most discriminating features were employed for constructing \(F{S}_{sd}\) and \(F{S}_{d}\) respectively. For every feature selection-classification pair four models were trained and validated, one model for every possible combination of three of the four sites. The three chosen sites were combined and used for training and the held out site was used for validation. The improvement between \(F{S}_{sd}\) and \(F{S}_{d}\) is shown. A positive improvement indicates that \(F{S}_{sd}\) outperformed \(F{S}_{d}\). Note that for this particular problem, the prediction AUC for all models were very high, nearly perfect in most cases.