Table 1 Comparative analysis of existing and proposed methods.
From: Hybrid deep learning framework for accurate classification of high dimensional genomic data
Study | Approach | Data Type Used | Strengths | Limitations | Special Features |
|---|---|---|---|---|---|
El-Nabawy et al.13 | Feature-fusion for breast cancer classification | Clinical, genomic, histopathology data | Combines different data sources well | Suffers from data imbalance issues | Feature fusion |
Lu et al.14 | BrcaSeg – DL linking tissue images and genomics | Histopathology and genomic data | Maps image and genetic patterns | Needs broader validation studies | Tissue-genomic mapping |
Wei et al.15 | DeepTIS – two-stage DL for TIS prediction | Genomic sequences | Good at identifying initiation sites | Weak when sequences differ a lot | Two-stage design |
Huang et al.16 | ML applications for genomics-based therapy | Genomics for therapeutic tasks | Good coverage of therapy predictions | Interpretation challenges remain | Therapy-specific ML |
Ye et al.17 | Image-based DL for pan-cancer classification | Pan-cancer genomic data | High cancer detection rates | Hard to balance across cancer types | Image-genomics fusion |
Ahemad et al18. | ML methods for COVID-19 detection from genomics | Human genomic COVID-19 data | Good for early disease detection | Limited generalization | COVID-19 focus |
Erfanian et al19. | DL in single-cell genomics | Single-cell RNA and transcriptomics | Captures fine cellular variation | Black-box nature of models | Single-cell DL |
Bazgir and Lu20 | REFINED-CNN for survival prediction | High-dimensional genomic features | Handles complex feature spaces | Predictions are less explainable | Focus on survival analysis |
Khodaei et al21. | LPC and ML for viral genomic classification | Viral genomic sequences | Accurate virus classification | Sensitive to noise in sequences | LPC feature extraction |
Wang et al22. | DNNGP – DL for plant multi-omics data | Multi-omics plant datasets | Higher accuracy for genomic traits | Preprocessing is complex | Multi-omics fusion |
Zhu et al.23 | GSRNet with multi-scale CNN and BiGRU | Genomic signals and regions | Captures complex multi-scale features | Needs more computation and careful training | Adversarial training |
Abu-Doleh and Al Fahoum24 | XGBoost-CNN for cell type classification | Single-cell RNA-seq images | Good classification accuracy, useful for biological patterns | Hard to interpret results in large datasets | Image-based gene expression |
Mohammed et al.25 | U-Net based DNA sequence classifier | DNA sequence data | Strong feature extraction from sequences | Validation across multiple datasets is limited | Encoder-decoder structure |
Barber and Oueslati26 | Pre-trained ResNet-50, GoogleNet, and custom CNN | Exon and intron classification | Boosts classification with deep feature learning | Resource-heavy models | Use of multiple pre-trained networks |
Nawaz et al.27 | Sequential pattern discovery in genomic data | Sequential genomic data | Better sequence pattern recognition | Difficult to generalize across species | Sequential modeling |
Abbas et al.28 | Adaptive federated learning for cancer detection | Multi-source cancer data | Preserves data privacy | Model syncing challenges in real-world setups | Healthcare 5.0 integration |
Mora-Poblete et al29. | DL for phenomic-genomic modeling | Eucalyptus tree genomic data | Strong prediction by linking traits | Only for plants, limited species coverage | Phenomic-genomic integration |
Feng et al.30 | AI Breeder for genomic crop prediction | Crop genomic data | Genotype-phenotype integration | Needs careful tuning for diversity | AI-based breeding tool |
Batra et al.31 | AI pipeline for lung cancer screening | Radiology, clinical, genomic data | Combines multiple data types well | Managing privacy between modalities is complex | Early screening AI system |
Sangeetha et al.32 | Multimodal fusion DL for lung cancer | Multimodal lung cancer datasets | Boosts prediction through fusion | Handling missing data remains difficult | Enhanced multimodal fusion |
Raja et al.38 | Attention-guided CNN for liver tumor prediction | Genomic features | Focus on key genomic markers | Needs high-quality labeled data | Attention mechanism used |
Lin et al.39 | DL for fish phenotyping | Fish genomic and phenotypic data | Accurate prediction of physical traits | Focused only on tiger pufferfish | Genomic-based phenotyping |
Wang et al.40 | Cropformer – interpretable DL for crops | Crop genomic data | Combines performance and interpretability | Needs powerful computing resources | Transformer-based structure |
Wu et al.41 | AutoGP – AI for maize breeding programs | Maize genomic data | Good breeding predictions | Needs very large datasets | AI platform for breeding |
Proposed Method | TabNet & CNN with AFR | High-dimensional genomic data | Strong feature selection, robust learning, accurate classification | May need tuning for very sparse data | Combines TabNet’s feature selection with CNN’s DL power |