Table 1 Review of previous work.

From: Metaheuristic integrated machine learning classification of colon cancer using STFT LASSO and EHO feature extraction from microarray gene expressions

Sl.No.

Author and year

Database

Feature extraction/dimensionality reduction technique

Classifiers used

Evaluation metrics

1

Liu et al. (2013)16

Colorectal cancer Dataset (Prof.Lindy Durant’s research)

Continuous Wavelet Transform (CWT)

Genetic Algorithm based on Bayes classifier

Accuracy = 78.6%

2

Islam et al. (2023)17

Ischemic sensitivity dataset

genomap

genomap + genoNet

Accuracy = 93%

3

Xiao et al. (2018)18

TCGA Database

Stacked sparse auto-encoder (SSAE)

Support Vector Machine (SVM), Neural Network (NN), Random Forest (RF)

Accuracy = 99.89%

4

You et.al. (2013)19

(Bio) Dataset

F-test & Partial Least Squares (PLS), ReliefF and PLS, Recursive Feature Elimination and PLS

LDA,SVM, NBC,KNN (K-Nearest Neighbourhood), INN

96.43%

5

Bonev et al. (2008)20

Datasets from Broad Institute, Stanford Genomic Resources, and Princeton University

Mutual Information

KNN, SVM

Accuracy = 89%

6

Xu et al. (2018)21

Simulated Genome Dataset Available in Bioinformatics online

Extreme phenotype sampling (EPS)

LASSO

Accuracy = 90%

7

Torkey et al. (2021)22

METABRIC, Nature 2012, and Nat Commun 2016, COX-PASnet, and TCGA

Principal Component Analysis (PCA) with Sparse Autoencoders

Cox Regression, Random Survival Forest (RSF)

Accuracy = 98%

8

Abdulla et al. (2020)23

Leukemia and DLBCL dataset

Genetic Algorithm (GA) and Binary Gray Wolf Optimization (BGWO)

Random Forest (RF) and KNN

Accuracy = 95%

9

Li et al. (2018)24

Colon cancer and Leukemia dataset

Variants of LDA, LASSO Spectral regression discriminant analysis (SRDA), locality preserving projections (LPP), kernel discriminant analysis with spectral graph analysis (SRKDA)

LDA, RLDA, HLDA,NDA, SRDA, LPP, SRKDA, and Lasso SRDA

Accuracy = 84%

10

Zhang et al. (2015)25

Leukemia-ALLAML, SRBCT

Projection Matrix

kNN, Locally Linear Discriminant Embedding (LLDE) , DNE (Discriminant Neighbourhood Embedding)

Accuracy = 90%

11

Subhajit et al. (2015)26

SRBCT, ALL_AML and MLL microarray datasets

Particle Swarm Optimization (PSO)—adaptive KNN

SVM (Linear, RBF, Polynomial, Quadratic)

Accuracy = 95%

12

Nursabillilah et al. (2022)27

Breast Cancer Gene Expression Database

Information Gain, Relief and Fisher Score (filter-based methods), LASSO (embedded-based method), GA, PSO, harmony search, ant colony, artificial bee colony, firefly algorithm, cuckoo search, gravitational search, grey wolf, whale optimization

RF, KNN, Naïve Bayes (NB), Logistic Regression (LR), Fuzzy Logic, Artificial NN (ANN)

Accuracy = 90% to 100%

13

Wang et al. (2003)28

Leukemia, Colon Cancer, Brain tumours and NCI60

Fuzzy c-means clustering, Weighted/Mean component plane

Fisher's linear discriminant

Accuracy = 95%

14

Aziz et al. (2016)29

colon cancer, acute leukemia, prostate cancer, lung cancer II, and high-grade glioma

Independent component analysis (ICA) and fuzzy backward feature elimination (FBFE)

SVM and NB

Accuracy = 90%

15

Yaqoob et al. (2024)30

Breast cancer dataset (Kent Ridge)

Sine Cosine and Cuckoo Search Algorithm

SVM

Accuracy = 99%

16

Joshi et al. (2024)31

Colon cancer

Cuckoo Search and Spider Monkey Optimization

SVM and NB

Accuracy = 92%