Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Advertisement

Scientific Reports
  • View all journals
  • Search
  • My Account Login
  • Content Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • RSS feed
  1. nature
  2. scientific reports
  3. articles
  4. article
ONCOPLEX: an oncology-inspired hypergraph model integrating diverse biological knowledge for cancer driver gene prediction
Download PDF
Download PDF
  • Article
  • Open access
  • Published: 13 January 2026

ONCOPLEX: an oncology-inspired hypergraph model integrating diverse biological knowledge for cancer driver gene prediction

  • Etab Mohammed Alotaibi1,
  • Omer S. Alkhnbashi2,3 &
  • Van Dinh Tran1 

Scientific Reports , Article number:  (2026) Cite this article

  • 612 Accesses

  • Metrics details

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

  • Cancer
  • Computational biology and bioinformatics

Abstract

Cancer development is driven by a small subset of somatic mutations, known as driver mutations, that disrupt key regulatory processes in cells. These mutations occur in specific genes, called cancer driver genes, whose altered functions promote tumor initiation and progression. Accurately identifying driver genes remains a major challenge due to their rarity and the overwhelming presence of passenger mutations. Recent advances in graph-based deep learning have improved the modeling of gene interactions, but most approaches are limited to pairwise connections and fail to capture the higher-order complexity of biological systems. We introduce ONCOPLEX, a hypergraph-based neural network framework that models genes as nodes and curated cancer-related pathways as hyperedges, enabling the representation of multi-gene interactions. Unlike previous methods, ONCOPLEX integrates diverse molecular and phenotypic features, such as somatic mutations, gene expression, and DNA methylation, into a pathway-informed hypergraph structure to learn biologically meaningful gene representations. ONCOPLEX is trained in a supervised manner on labeled driver and non-driver genes, with unlabeled genes included as nodes during representation learning. Comprehensive evaluations across pan-cancer and cancer-type-specific settings show that ONCOPLEX consistently outperforms state-of-the-art methods in classification and ranking metrics. It accurately recovers known driver genes and highlights novel candidates supported by literature and enrichment analyses. These findings underscore the power of pathway-guided hypergraph modeling for advancing cancer driver gene discovery.

Data availability

Core features, including gene expression, mutation, and methylation profiles, were obtained from The Cancer Genome Atlas (TCGA) via the GDC portal (https://portal.gdc.cancer.gov/). The comprehensive feature set was downloaded from DORGE (https://doi.org/10.1126/sciadv.aba6784). Graphs were constructed using MSigDB pathway data (https://www.gsea-msigdb.org/). Cancer driver gene labels were collected from the Network of Cancer Genes v6.0, DigSee, COSMIC CGC v91, and IntOGen v2024.09.204. All processed data, including node features, graph structures, and gene labels, are available at: https://github.com/etab12/ONCOPLEX.

References

  1. The national cancer institute (nci). https://www.cancer.gov/.

  2. Hanahan, D. & Weinberg, R. A. Hallmarks of cancer: The next generation. Cell144, 646–674 (2011).

    Google Scholar 

  3. Hanahan, D. Hallmarks of cancer: New dimensions. Cancer Discov.12, 31–46 (2022).

    Google Scholar 

  4. Vogelstein, B. et al. Cancer genome landscapes. Science (New York, N.Y.) https://doi.org/10.1126/science.1235122 (2013).

    Google Scholar 

  5. Alexandrov, L. et al. Signatures of mutational processes in human cancer. Nature500, 415–421. https://doi.org/10.1038/nature12477 (2013).

    Google Scholar 

  6. Lawrence, M. et al. Mutational heterogeneity in cancer and the search for new cancer-associated genes. Nature https://doi.org/10.1038/nature12213 (2013).

    Google Scholar 

  7. Goh, K.-I. et al. The human disease network. Proc. Natl. Acad. Sci. U S A104, 8685–8690. https://doi.org/10.1073/pnas.0701361104 (2007).

    Google Scholar 

  8. Köhler, S., Bauer, S., Horn, D. & Robinson, P. N. Walking the interactome for prioritization of candidate disease genes. Am. J. Hum. Genet.82, 949–958. https://doi.org/10.1016/j.ajhg.2008.02.013 (2008).

    Google Scholar 

  9. Pearson, K. The problem of the random walk. Nature72, 342–342. https://doi.org/10.1038/072342a0 (1905).

    Google Scholar 

  10. Qi, Y., Suhail, Y., Lin, Y.-Y., Boeke, J. D. & Bader, J. S. Finding friends and enemies in an enemies-only network: A graph diffusion kernel for predicting novel genetic interactions and co-complex membership from yeast genetic interactions. Genome Res.18, 1991–2004. https://doi.org/10.1101/gr.077693.108 (2008).

    Google Scholar 

  11. Luo, P., Ding, Y., Lei, X. & Wu, F.-X. deepDriver: Predicting cancer driver genes based on somatic mutations using deep convolutional neural networks. Front. Genet. https://doi.org/10.3389/fgene.2019.00013 (2019).

    Google Scholar 

  12. Tokheim, C., Papadopoulos, N., Kinzler, K., Vogelstein, B. & Karchin, R. Evaluating the evaluation of cancer driver genes. Proc. Natl. Acad. Sci.113, 201616440. https://doi.org/10.1073/pnas.1616440113 (2016).

    Google Scholar 

  13. Collier, O., Stoven, V. & Vert, J.-P. LOTUS: A single- and multitask machine learning algorithm for the prediction of cancer driver genes. PLOS Comput. Biol.15, 1–27. https://doi.org/10.1371/journal.pcbi.1007381 (2019).

    Google Scholar 

  14. Schulte-Sasse, R., Budach, S., Hnisz, D. & Marsico, A. Integration of multiomics data with graph convolutional networks to identify new cancer genes and their associated molecular mechanisms. Nat. Mach. Intell.3, 1–14. https://doi.org/10.1038/s42256-021-00325-y (2021).

    Google Scholar 

  15. Peng, W., Tang, Q., Dai, W. & Chen, T. Improving cancer driver gene identification using multi-task learning on graph convolutional network. Brief. Bioinform.23, bbab432. https://doi.org/10.1093/bib/bbab432 (2021).

    Google Scholar 

  16. Zhang, T., Zhang, S.-W., Xie, M.-Y. & Li, Y. A novel heterophilic graph diffusion convolutional network for identifying cancer driver genes. Brief. Bioinform.24, bbad137. https://doi.org/10.1093/bib/bbad137 (2023).

    Google Scholar 

  17. Wang, C. et al. DriverRWH: discovering cancer driver genes by random walk on a gene mutation hypergraph. BMC Bioinform.23, 277. https://doi.org/10.1186/s12859-022-04788-7 (2022).

    Google Scholar 

  18. Zhang, N. et al. A novel hypergraph model for identifying and prioritizing personalized drivers in cancer. PLoS Comput. Biol.20, e1012068. https://doi.org/10.1371/journal.pcbi.1012068 (2024).

    Google Scholar 

  19. Wang, Y., Jin, S. & Zou, X. Pitch: A pathway-induced prioritization of personalized cancer driver genes based on higher-order interactions. IEEE J. Biomed. Health Inform. https://doi.org/10.1109/JBHI.2025.3538536 (2025).

    Google Scholar 

  20. Deng, C. et al. Identifying new cancer genes based on the integration of annotated gene sets via hypergraph neural networks. Bioinformatics40, i511–i520. https://doi.org/10.1093/bioinformatics/btae257 (2024).

    Google Scholar 

  21. Ye, Z. et al. GRB2 stabilizes RAD51 at reversed replication forks suppressing genomic instability and innate immunity against cancer. Nat. Commun.15, 2132. https://doi.org/10.1038/s41467-024-46283-y (2024).

    Google Scholar 

  22. Petrosino, M. et al. The complex impact of cancer-related missense mutations on the stability and on the biophysical and biochemical properties of MAPK1 and MAPK3 somatic variants. Hum. Genom.17, 95. https://doi.org/10.1186/s40246-023-00544-x (2023).

    Google Scholar 

  23. Liu, Y. et al. Pan-cancer analysis on the role of PIK3R1 and PIK3R2 in human tumors. Sci. Rep.12, 5924. https://doi.org/10.1038/s41598-022-09889-0 (2022).

    Google Scholar 

  24. Tharin, Z. et al. Pik3ca and pik3r1 tumor mutational landscape in a pan-cancer patient cohort and its association with pathway activation and treatment efficacy. Sci. Rep. https://doi.org/10.1038/s41598-023-31593-w (2023).

    Google Scholar 

  25. Deng, M. et al. Kras mutations upregulate runx1 to promote occurrence of head and neck squamous cell carcinoma. Mol. Carcinog.62, 1284–1294. https://doi.org/10.1002/mc.23563 (2023).

    Google Scholar 

  26. Yi, Q. et al. Spectrum of BRAF aberrations and its potential clinical implications: insights from integrative pan-cancer analysis. Front. Bioeng. Biotechnol. https://doi.org/10.3389/fbioe.2022.806851 (2022).

    Google Scholar 

  27. Yu, G., Wang, L.-G., Han, Y. & He, Q.-Y. clusterprofiler: an r package for comparing biological themes among gene clusters. OMICS: J. Integr. Biol.16, 284–287. https://doi.org/10.1089/omi.2011.0118 (2012) (PMID: 22455463).

    Google Scholar 

  28. Stoup, N., Liberelle, M., Lebègue, N. & Van Seuningen, I. Emerging paradigms and recent progress in targeting erbb in cancers. Trends Pharmacol. Sci.45, 552–576. https://doi.org/10.1016/j.tips.2024.04.009 (2024).

    Google Scholar 

  29. Gao, Y., Luo, Y., Ji, G. & Wu, T. Functional and pathological roles of adenylyl cyclases in various diseases. Int. J. Biol. Macromol.281, 136198. https://doi.org/10.1016/j.ijbiomac.2024.136198 (2024).

    Google Scholar 

  30. Collier, O., Stoven, V. & Vert, J.-P. Lotus: A single- and multitask machine learning algorithm for the prediction of cancer driver genes. PLOS Comput. Biol.15, 1–27. https://doi.org/10.1371/journal.pcbi.1007381 (2019).

    Google Scholar 

  31. Feng, Y., You, H., Zhang, Z., Ji, R. & Gao, Y. Hypergraph neural networks. AAAI 2019 (2018).

  32. Subramanian, A. et al. Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl. Acad. Sci.102, 15545–15550. https://doi.org/10.1073/pnas.0506580102 (2005).

    Google Scholar 

  33. Chang, K. et al. The cancer genome atlas pan-cancer analysis project. Nat. Genet45, 1113–1120. https://doi.org/10.1038/ng.2764 (2013).

    Google Scholar 

  34. Lyu, J. et al. Dorge: Discovery of oncogenes and tumor suppressor genes using genetic and epigenetic features. Sci. Adv.6, eaba6784. https://doi.org/10.1126/sciadv.aba6784 (2020).

    Google Scholar 

  35. Loshchilov, I. & Hutter, F. Fixing weight decay regularization in Adam. CoRRabs/1711.05101 (2017).

  36. Bai, S., Zhang, F. & Torr, P. H. S. Hypergraph convolution and hypergraph attention. Pattern Recognit.110, 107637. https://doi.org/10.1016/j.patcog.2020.107637 (2021).

    Google Scholar 

Download references

Funding

This study was supported by Dubai RDI Grant(2025/DRDI0406), the Center for Applied and Translational Genomics (CATG), Mohammed Bin Rashid University of Medicine and Health Sciences (MBRU), Dubai Health, Dubai, United Arab Emirates.

Author information

Authors and Affiliations

  1. Information and Computer Science Department, King Fahd University of Petroleum and Minerals (KFUPM), Dhahran, Saudi Arabia

    Etab Mohammed Alotaibi & Van Dinh Tran

  2. Center for Applied and Translational Genomics (CATG), Mohammed Bin Rashid University of Medicine and Health Sciences (MBRU), Dubai Health, Dubai, United Arab Emirates

    Omer S. Alkhnbashi

  3. College of Medicine, Mohammed Bin Rashid University of Medicine and Health Sciences (MBRU), Dubai Health, Dubai, United Arab Emirates

    Omer S. Alkhnbashi

Authors
  1. Etab Mohammed Alotaibi
    View author publications

    Search author on:PubMed Google Scholar

  2. Omer S. Alkhnbashi
    View author publications

    Search author on:PubMed Google Scholar

  3. Van Dinh Tran
    View author publications

    Search author on:PubMed Google Scholar

Contributions

V.D.T., E.M.A., and O.S.A. conceived the study. V.D.T. and O.S.A. supervised the project. E.M.A. collected and processed the data, implemented the model, performed the experiments, and prepared the initial draft of the manuscript. All authors contributed to the development of the methodology, analysis of the results, and writing and revision of the manuscript. All authors approved the final version of the manuscript.

Corresponding authors

Correspondence to Omer S. Alkhnbashi or Van Dinh Tran.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information 1.

Supplementary Information 2.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Alotaibi, E.M., Alkhnbashi, O.S. & Tran, V.D. ONCOPLEX: an oncology-inspired hypergraph model integrating diverse biological knowledge for cancer driver gene prediction. Sci Rep (2026). https://doi.org/10.1038/s41598-026-36127-8

Download citation

  • Received: 03 August 2025

  • Accepted: 09 January 2026

  • Published: 13 January 2026

  • DOI: https://doi.org/10.1038/s41598-026-36127-8

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Keywords

  • Cancer driver gene prediction
  • Hypergraph neural networks
  • Multi-omics integration
Download PDF

Advertisement

Explore content

  • Research articles
  • News & Comment
  • Collections
  • Subjects
  • Follow us on Facebook
  • Follow us on Twitter
  • Sign up for alerts
  • RSS feed

About the journal

  • About Scientific Reports
  • Contact
  • Journal policies
  • Guide to referees
  • Calls for Papers
  • Editor's Choice
  • Journal highlights
  • Open Access Fees and Funding

Publish with us

  • For authors
  • Language editing services
  • Open access funding
  • Submit manuscript

Search

Advanced search

Quick links

  • Explore articles by subject
  • Find a job
  • Guide to authors
  • Editorial policies

Scientific Reports (Sci Rep)

ISSN 2045-2322 (online)

nature.com sitemap

About Nature Portfolio

  • About us
  • Press releases
  • Press office
  • Contact us

Discover content

  • Journals A-Z
  • Articles by subject
  • protocols.io
  • Nature Index

Publishing policies

  • Nature portfolio policies
  • Open access

Author & Researcher services

  • Reprints & permissions
  • Research data
  • Language editing
  • Scientific editing
  • Nature Masterclasses
  • Research Solutions

Libraries & institutions

  • Librarian service & tools
  • Librarian portal
  • Open research
  • Recommend to library

Advertising & partnerships

  • Advertising
  • Partnerships & Services
  • Media kits
  • Branded content

Professional development

  • Nature Awards
  • Nature Careers
  • Nature Conferences

Regional websites

  • Nature Africa
  • Nature China
  • Nature India
  • Nature Japan
  • Nature Middle East
  • Privacy Policy
  • Use of cookies
  • Legal notice
  • Accessibility statement
  • Terms & Conditions
  • Your US state privacy rights
Springer Nature

© 2026 Springer Nature Limited

Nature Briefing: Cancer

Sign up for the Nature Briefing: Cancer newsletter — what matters in cancer research, free to your inbox weekly.

Get what matters in cancer research, free to your inbox weekly. Sign up for Nature Briefing: Cancer