DeepStackVEGF a stacking ensemble deep learning framework for vascular endothelial growth factor prediction

Ali, Farman; Khalid, Majdi; Algarni, Abdulmohsen; Rajkhan, Naif Waheb; Asiry, Othman; Du, Meng-Ze

doi:10.1038/s41598-026-40134-0

Article
Open access
Published: 11 March 2026

Farman Ali¹, Majdi Khalid², Abdulmohsen Algarni³, Naif Waheb Rajkhan⁴, Othman Asiry⁵ & Meng-Ze Du⁶,⁷,⁸

Scientific Reports (2026)

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Abstract

Vascular endothelial growth factor (VEGF) plays a central role in angiogenesis, regulating both physiological processes such as wound healing, tissue repair, and bone formation, and pathological events including tumor progression, metastasis, and diabetic retinopathy. Because of this role in vascular biology, VEGF is an important therapeutic target in anti-angiogenic drug development and precision medicine. However, conventional experimental methods for VEGF identification are costly and time-consuming, which motivates efficient computational approaches. To address this challenge, we introduce DeepStack-VEGF, a deep learning framework designed for accurate and robust VEGF prediction. The model integrates diverse sequence-derived features, including physicochemical descriptors, sequential patterns, evolutionary information, and secondary structure motifs, further enhanced by pretrained embeddings from UniProt and ProtBert. Features were optimized using Support Vector Machine–Recursive Feature Elimination (SVM-RFE). DeepStack-VEGF employs a stacking ensemble of three architectures: a Feedback Generative Adversarial Network, a Gated Recurrent Unit, and a Capsule Convolutional Neural Network, each contributing distinct representational capabilities. Comprehensive evaluations demonstrate that the fused feature set and the stacking ensemble substantially outperform the individual models in accuracy, robustness, and generalization. By combining deep learning with biological insight, DeepStack-VEGF provides a reliable and scalable computational framework for VEGF identification, supporting rational drug discovery, anti-angiogenic therapy design, and precision medicine applications.
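
The two generic steps named in the abstract, recursive feature elimination followed by a stacking ensemble, can be illustrated with a minimal sketch. It is not the authors' implementation: plain logistic regression stands in for both the SVM ranker and the three deep base learners, the data are synthetic, the split of selected features into two "views" is hypothetical, and every function name below is illustrative. Only the Python standard library is used.

```python
import math
import random

def sigmoid(z):
    """Logistic function, written to avoid overflow for large |z|."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def fit_linear(X, y, epochs=300, lr=0.3):
    """Logistic regression by batch gradient descent; the bias is the last weight."""
    d = len(X[0])
    w = [0.0] * (d + 1)
    for _ in range(epochs):
        grad = [0.0] * (d + 1)
        for xi, yi in zip(X, y):
            err = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + w[-1]) - yi
            for j in range(d):
                grad[j] += err * xi[j]
            grad[-1] += err
        w = [wj - lr * gj / len(X) for wj, gj in zip(w, grad)]
    return w

def predict_proba(w, X):
    """Probability of the positive class for each row of X."""
    return [sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + w[-1]) for xi in X]

def take(X, cols):
    """Select a subset of feature columns."""
    return [[row[c] for c in cols] for row in X]

def rfe(X, y, n_keep):
    """Recursive feature elimination: refit, drop the smallest-|weight| feature, repeat."""
    keep = list(range(len(X[0])))
    while len(keep) > n_keep:
        w = fit_linear(take(X, keep), y)
        keep.pop(min(range(len(keep)), key=lambda i: abs(w[i])))
    return keep

def stack_fit(X, y, views, k=5):
    """Train one base model per view, then a meta-model on out-of-fold base outputs."""
    n = len(X)
    meta_X = [[0.0] * len(views) for _ in range(n)]
    for fold in range(k):
        train = [i for i in range(n) if i % k != fold]
        held = [i for i in range(n) if i % k == fold]
        for v, cols in enumerate(views):
            w = fit_linear(take([X[i] for i in train], cols), [y[i] for i in train])
            for i, p in zip(held, predict_proba(w, take([X[i] for i in held], cols))):
                meta_X[i][v] = p
    base = [fit_linear(take(X, cols), y) for cols in views]  # refit on all training rows
    return base, fit_linear(meta_X, y)

def stack_predict(base, meta, views, X):
    """Feed base-model probabilities into the meta-model."""
    cols_out = [predict_proba(w, take(X, cols)) for w, cols in zip(base, views)]
    return predict_proba(meta, list(zip(*cols_out)))

def make_data(rng, n):
    """Synthetic stand-in data: 4 informative features followed by 2 noise features."""
    X, y = [], []
    for i in range(n):
        label = i % 2
        X.append([rng.gauss(2.0 * label, 1.0) for _ in range(4)]
                 + [rng.gauss(0.0, 1.0) for _ in range(2)])
        y.append(label)
    return X, y

rng = random.Random(0)
X_train, y_train = make_data(rng, 120)
X_test, y_test = make_data(rng, 100)

selected = rfe(X_train, y_train, n_keep=4)
views = [selected[:2], selected[2:]]  # hypothetical split into two feature views
base, meta = stack_fit(X_train, y_train, views)
probs = stack_predict(base, meta, views, X_test)
accuracy = sum((p >= 0.5) == bool(t) for p, t in zip(probs, y_test)) / len(y_test)
print("selected features:", selected, "test accuracy:", round(accuracy, 3))
```

The meta-model is trained only on out-of-fold predictions, so it never sees base-model outputs for rows those base models were fitted on; this is the usual guard against leakage in stacking. In practice one would likely swap the hand-written pieces for library components and real sequence features.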

Data availability

We have made all datasets, feature extraction sets, and classifier codes freely available on GitHub at the following link: https://github.com/Farman335/DeepStack-VEGF

Acknowledgements

1. National Natural Science Foundation of China, Grant No. 62501110
2. China Postdoctoral Science Foundation, Grant No. 2020TQ0138

Author information

Authors and Affiliations

Department of Computer Science, Bahria University, Islamabad, 44000, Pakistan
Farman Ali
Department of Computer Science and Artificial Intelligence, College of Computing, Umm Al-Qura University, 21955, Makkah, Saudi Arabia
Majdi Khalid
Department of Computer Science, King Khalid University, 61421, Abha, Saudi Arabia
Abdulmohsen Algarni
Department of Computer Science, Faculty of Computing and Information Technology, King Abdul Aziz University, 21589, Jeddah, Saudi Arabia
Naif Waheb Rajkhan
Department of Information Technology, College of Computing and Information Technology at Khulais, University of Jeddah, Jeddah, Saudi Arabia
Othman Asiry
School of Health and Medical Technology, Chengdu Neusoft University, Chengdu, 611844, Sichuan Province, People’s Republic of China
Meng-Ze Du
School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, China
Meng-Ze Du
Qingyuan People’s Hospital, The Sixth Affiliated Hospital of Guangzhou Medical University, B24 Yinquan South Road, Qingyuan, 511518, Guang Dong Province, People’s Republic of China
Meng-Ze Du

Contributions

**Farman Ali:** Drafted and composed the manuscript and contributed to the interpretation of results. **Majdi Khalid:** Performed the experiments and assisted in data acquisition. **Abdulmohsen Algarni:** Conducted validation analyses and contributed to data verification. **Naif Waheb Rajkhan:** Substantively revised the manuscript and contributed to critical editing. **Othman Asiry:** Reviewed the manuscript and contributed critical feedback and interpretation. **Meng-Ze Du:** Supervision.

Corresponding authors

Correspondence to Farman Ali or Meng-Ze Du.

Ethics declarations

Competing interests

The authors declare no competing interests.

Ethics approval

No human or animal subjects are involved in this work.

Consent for publication

All authors agree on the publication of the paper.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

About this article

Cite this article

Ali, F., Khalid, M., Algarni, A. et al. DeepStackVEGF a stacking ensemble deep learning framework for vascular endothelial growth factor prediction. Sci Rep (2026). https://doi.org/10.1038/s41598-026-40134-0

Received: 17 October 2025
Accepted: 10 February 2026
Published: 11 March 2026
DOI: https://doi.org/10.1038/s41598-026-40134-0