Table 3 Performance comparisons between the proposed method method (DL_\(\hbox {DA}_s\)) against the baselines, using stratified data augmentation (m: the augmentation data for each category is m as many as the original sample).

From: Deep learning data augmentation for Raman spectroscopy cancer tissue classification

Methods

All Data

Treated

UNTreated

A

F1

AUC

A

F1

AUC

A

F1

AUC

LR

0.685

0.628

0.848

0.800

0.550

0.833

0.626

0.601

0.830

LR_\({\hbox {DA}}_s\) (\(m=1)\)

0.671

0.563

0.850

0.840

0.569

0.866

0.586

0.521

0.832

LR_\({\hbox {DA}}_s\) (\(m=2)\)

0.678

0.589

0.861

0.820

0.556

0.897

0.606

0.557

0.841

SVM

0.745

0.716

0.895

0.840

0.582

0.908

0.697

0.692

0.880

SVM_\({\hbox {DA}}_s\) (\(m=1)\)

0.691

0.627

0.901

0.820

0.554

0.900

0.626

0.596

0.891

SVM_\({\hbox {DA}}_s\) (\(m=2)\)

0.725

0.670

0.903

0.820

0.556

0.900

0.677

0.651

0.901

MLP

0.745

0.721

0.898

0.820

0.562

0.896

0.707

0.703

0.886

MLP_\({\hbox {DA}}_s\) (\(m=1)\)

0.758

0.736

0.904

0.820

0.562

0.893

0.727

0.724

0.891

MLP_\({\hbox {DA}}_s\) (\(m=2)\)

0.758

0.739

0.910

0.820

0.562

0.905

0.727

0.726

0.896

LSTM

0.724

0.717

0.892

0.740

0.602

0.824

0.717

0.717

0.905

LSTM_\({\hbox {DA}}_s\) (\(m=1)\)

0.845

0.836

0.934

0.860

0.708

0.880

0.838

0.838

0.937

LSTM_\(\hbox {DA}_s\) (\(m=2)\)

0.818

0.805

0.952

0.880

0.763

0.963

0.788

0.786

0.940

CNN

0.772

0.757

0.912

0.860

0.589

0.881

0.727

0.728

0.902

DL_\({\hbox {DA}}_s\) (\(m=1)\)

0.772

0.751

0.922

0.860

0.590

0.861

0.727

0.726

0.910

DL_\({\hbox {DA}}_s\) (\(m=2)\)

0.785

0.763

0.926

0.840

0.577

0.868

0.758

0.754

0.917