Introduction

Cancer continues to be one of the leading health problems, with increasing morbidity and mortality worldwide1,2. The IARC projects that in 2022, there will be 19.9 million new cancer cases and 9.7 million cancer-related deaths3. Several irreconcilable theories exist that explain the complex pathophysiology of cancer. However, the uncontrolled proliferation of cells is one of the leading and most accepted mechanisms4. According to cancer statistics, there are more than 200 distinct types of cancers, which are substantially different in their behaviour and response to treatment. Different cancers are diagnosed using different approaches or a combination of approaches, such as cytology, cytogenetics, histology, pathology, imaging, and other relevant clinical data that can help in the process of diagnosis5.

Artificial intelligence (AI) is a branch of computer science that deals with developing smart machines that can perform tasks that normally require human input and intelligence6. AI is an emerging and interdisciplinary science that was first introduced in 1956 by John McCarthy7. It has significantly improved performance in basic biology, pharmacology, and clinical medicine (Fig. 1), sometimes matching or even outperforming human experts in particular tasks. Over the years, AI has emerged as a key tool in healthcare, with applications such as diagnosis, prognosis, screening, and drug discovery8,9.

Fig. 1
Fig. 1
Full size image

Demonstrates the various areas in which artificial intelligence (AI) aids in cancer research and treatment, from imaging, clinical decision support, and global data integration to mutation mapping, biomarker discovery, and treatment development.

Natural language processing (NLP), machine learning (ML), evolutionary algorithms, and deep learning (DL) are some of the subfields of artificial intelligence (AI)8,10. DL and ML are two of the most important since DL uses multilayered neural networks to identify patterns through hierarchical abstraction, while ML enables autonomous learning from datasets9,11,12. By finding patterns in existing datasets, supervised machine learning techniques like support vector machines and random forest algorithms have been used in oncology to classify tumours, predict prognoses, and produce data-driven details12. Similarly, to improve tumour detection in medical imaging, DL architectures such as convolutional neural networks (CNNs) have been used in radiology. Recurrent neural networks (RNNs), which recognise sequential patterns in speech or text, are increasingly being used in natural language processing (NLP) applications, particularly in the extraction of clinically relevant data from reports on cancer pathology11. All of these strategies have worked together to establish AI as a revolutionary tool for early cancer detection, treatment selection, and individualised care12,13.

Three major areas, early cancer diagnosis, therapy, and prediction, that span the domains of cancer incidence, recurrence, and survival, were highlighted by Sebastian and Peter as areas in oncology where AI holds particular promise14. AI is also crucial for cancer screening and clinical trials. Diagnostics, particularly radiology and pathology, account for 80% of AI applications in oncology. Artificial intelligence (AI) algorithms are often faster and more accurate than human experts at processing and interpreting complex imaging data from MRIs, CT scans, and X-rays. These tools enable earlier and more accurate diagnosis by spotting minute patterns and abnormalities that the human eye might miss15.

In recent years, electronic healthcare record (EHR) infrastructures have expanded globally, allowing large volumes of clinical data to be stored and accessed more efficiently16. Numerous digital collaborations are now using EHRs to advance research in the early diagnosis of cancer17. AI-based imaging algorithms are increasingly being applied in clinical practice to detect and monitor potentially cancerous lesions as well as to support management decisions18,19. For instance, an FDA-approved software platform enables comprehensive detection and longitudinal tracking of pulmonary nodules, prediction of lung malignancy in lesions identified on low-dose CT scans20, and integration of management guidelines21. Deep neural networks have also been developed for identifying enlarged lymph nodes and colonic polyps on CT images, and a real-time DL computer-aided detection system for colonoscopy has received FDA approval to enhance polyp detection22,23. Furthermore, AI-augmented interpretation of endoscopic images has been shown to consistently improve the accuracy of oesophageal cancer detection24,25.

With the rapid expansion of digital pathology, AI is being applied to pathological image analysis for diagnosis, grading, and prognostic biomarker assessment. Much of the progress has focused on automating labour-intensive tasks, thereby enhancing pathologists’ efficiency and allowing greater focus on complex, high-level decision-making associated with disease presentation26. One notable example is a DL system that uses whole-slide image analysis of radical prostatectomy specimens to assign Gleason scores, with an accuracy higher than that of general pathologists27. The identification of tumour-infiltrating lymphocytes in tissue slide images from The Cancer Genome Atlas, a prognostic indicator for clinical outcomes across 13 cancer subtypes, has also been automated using convolutional neural networks28. AI has also been effectively used to annotate skin lesions, including melanoma, and classify dermoscopy images with accuracy on par with that of skilled dermatologists29,30.

To fully characterise the tumour landscape and improve diagnostic accuracy, there is increasing emphasis on integrating multi-modal data, such as transcriptomic, radiomic, metabolomic, genomic, and clinical features, given the complexity of tumour biology. Cross-modality association studies are made easier, and model development is supported by large-scale resources like LinkedOmics, which includes multi-omic data from 11,158 patients with 32 different types of cancer31,32. When combined with clinically annotated cohorts, the application of AI to analyse large-scale multi-omics datasets has made it possible to predict RNA splice sites, enhance variant detection, uncover drug-susceptibility genes, and gain a better understanding of cancer biology33,34,35,36.

This review aims to evaluate the current literature on the role of AI in cancer with a focus on cancer diagnosis, prognosis, drug development, and effective treatment strategies. This will also analyse the emerging role of AI-based algorithms in radiology and imaging. The importance of several platforms, such as DL, ML, and RL, will also be evaluated in the context of cancer care. This review will also combine evidence from multiple omics with cancer genomics and personalised drug design to show how AI-driven approaches are transforming cancer research and clinical practice. Particular emphasis is placed on how these technologies improve diagnostic accuracy, make it easier to accurately identify therapeutic targets, and speed up the development of individualised treatment plans.

Methodological approaches

Literature search strategy

A range of electronic databases, including EMBASE, Web of Science, JSTOR, ScienceDirect, BioMed Central, Medline, PubMed, and Google Scholar, were searched to retrieve literature for this review article. Artificial intelligence (AI), cancer, oncology, cancer genomics, diagnosis, drug discovery, cancer epigenetics, machine learning, computational oncology, deep learning, biomarkers, big data, and precision medicine were among the search terms used. To generate various keyword combinations, logical operators (“AND,” “OR”) were used. Articles with clinical relevance to artificial intelligence and oncology, as well as those published in high-impact factor journals, were given preference. As shown in the flowchart (Fig. 2), out of the 641 articles that were retrieved, 310 were deemed suitable for inclusion.

Fig. 2
Fig. 2
Full size image

Shows how research articles were screened and selected for this review article. Initially, 641 articles were retrieved from different databases, of which 310 articles were selected.

Eligibility criteria

Original research articles published in English that looked at the use of AI in oncology, with a focus on machine learning and deep learning techniques, were the main focus of the inclusion criteria. There were also other types of articles that summarised the use of AI in oncology, like meta-analyses and systematic reviews. Letters, editorials, commentaries, articles lacking primary data, and publications published in languages other than English were all excluded based on the exclusion criteria. Inadequately detailed review articles were also omitted (Table 1).

Table 1 Inclusion and exclusion criteria

Overview of AI-driven technologies in healthcare

Over the last few years, AI technologies have been increasingly integrated into several sectors, including healthcare. These tools not only help in diagnosis, screening, treatment, and decision-making but also give a sense of hope to patients, clinicians, and caregivers with more accuracy and improved outcomes in the near future37. During the COVID-19 pandemic, several AI-assisted technologies were used to provide personalised assistance and recommendations to patients38. Similarly, several other AI-driven digital technologies, such as Google Assistant, Amazon Alexa, Apple Siri, and AI-driven robots and personal assistants, are now being used in a wide range of scenarios39.

In the last decade, we have effectively utilised machine learning techniques to analyse large volumes of medical data. This approach enables systems to learn independently, producing accurate predictions instead of depending on traditional rule-based techniques40,41,42. ML and DL algorithms powered by AI have been used to find intricate patterns in clinical and medical images. To enable meaningful decisions that are frequently difficult for humans, they try to translate images and support clinical judgments. AI improves the collection of diverse data streams into symptomatic systems that are dynamically integrated43,44,45,46. These techniques can help you extract important information from patient data found in electronic health records (EHRs)47,48,49,50 and can help diagnose diseases by uncovering risk factors and their underlying causes51,52,53.

As ML continues to evolve, DL has emerged as a significant subset that uses multiple layers of nonlinear modules54,55,56,57. These stacked modules work together to gradually convert raw data into more abstract representations56,57,58,59. DL offers intriguing possibilities for biomedical informatics and disease diagnosis, with numerous projects currently underway to apply these techniques in the medical field. For example, Google DeepMind has announced its plans to utilise its expertise for medical applications57. These days, convolutional neural networks (CNNs) are a popular deep learning technique for automatically detecting tumours in cancer patients’ medical images60,61,62,63,64.

The early onset of heart failure has been predicted using DL-based recurrent neural network (RNN) models65,66,67. Likewise, generative adversarial networks (GANs) are dynamic medical imaging tools that enhance training techniques, enhance image quality, and are crucial to the advancement of healthcare technologies68,69,70,71. Using ML and AI algorithms to efficiently interpret text data is known as natural language processing (NLP)72,73,74,75. Common NLP techniques include topic modelling, which creates topics of interest by identifying word frequencies and associations75,76; and determines whether a text accurately or negatively reflects attitudes, opinions, and feelings77,78.

Computer vision (CV) is another branch of artificial intelligence that enables computers to understand and interpret visual information from images and videos79,80,81,82,83,84. In the medical field, it supports a range of tasks and extracts clinically relevant information from medical images and videos, such as endoscopic recordings, CT scans, MRIs, and X-rays85,86. By examining CT and MRI scans, CV helps doctors pinpoint potentially life-threatening issues like internal bleeding and blocked blood vessels with greater accuracy. The use of automation, especially through robotic systems, enhances this precision by catching tiny details that might escape the human eye. This boost in diagnostic ability ultimately elevates the overall quality of patient care87,88,89. CV can be applied in a range of healthcare settings, from early disease detection to patient health monitoring during and after surgical procedures. Its integration into healthcare has developed into a ground-breaking tactic that is altering how medical data is viewed, assessed, and utilised90,91,92.

The role of artificial intelligence in cancer diagnosis

The AI-assisted technologies and algorithms have permeated the healthcare industry with great success. The use of these technologies in the diagnosis, screening, and treatment of different types of cancer is attracting enormous attention93. For instance, machine learning, a subset of AI that functions based on analysing large datasets, can be used in decision-making94. Similarly, deep learning, a subset of machine learning, is another key tool, based on an artificial neural network of interconnected layers called computing units95. Moreover, several deep learning computer vision (DLCV) models have been developed with important applications in healthcare96. These technologies are rapidly expanding and being applied in different phases of cancer diagnosis and treatment, with added benefits of accuracy, sensitivity, and improved outcomes.

AI in cancer imaging

AI has demonstrated a lot of promise in the modern era for tumour identification and description. AI is a first line of defence against omission errors and lowers observational errors, whereas detection involves locating objects of interest on radiographs14,97,98. Characterisation can include prognosticating diseases and forecasting results based on specific treatment modalities, in addition to staging and segmenting tumours (Fig. 3). This information is crucial for later diagnostic evaluations and for calculating radiation dosages during treatment planning99,100,101,102,103. Despite being widely used as the standard for evaluating automated algorithms, manually traced segmentations may overlook subclinical conditions and restrict analyses because of human biases98,104. Through automated segmentation, AI has the potential to significantly improve tumour measurement accuracy, efficiency, and reproducibility105,106,107.

Fig. 3
Fig. 3
Full size image

Role of AI in the diagnosis of different types of cancers, which includes imaging, detection, characterisation, staging, and segmentation of tumours.

Given that the normal clinical course of a cancer patient identifies multiple crucial phases where AI systems can offer significant advantages, ML and DL present a variety of opportunities to improve cancer imaging98,108. For instance, one of the effective uses of imaging technologies is to perform cancer risk assessment. In the United States, several states require breast density assessment as part of cancer risk evaluation. DL systems have shown high accuracy in classifying breast density, enabling more consistent patient notification in breast cancer screening109,110. Studies have shown that the DL model integrating mammographic features with established risk factors outperformed conventional breast cancer risk models in identifying individuals at the highest risk of malignancy111,112. More recently, strong agreement in breast cancer risk assessment was reported when mammographic breast density was evaluated by a senior radiologist, a junior radiologist, and AI software113.

AI algorithms are increasingly enhancing cancer screening by improving sensitivity and specificity through the analysis of large imaging and biomarker datasets114. In breast cancer screening, a study by McKinney et al. has revealed that AI models can match expert reader performance and, in some cases, exceed it by an absolute margin of 11.5%. These models can also support second readers during mammographic review, reducing their workload by up to 88%115. Additionally, AI-assisted triaging, which prioritises image interpretation, has been found acceptable by women undergoing mammographic screening116,117. In a recent study by Eisemann et al.118, 463,094 women underwent screening, of whom 260,739 were examined with AI support by 119 radiologists. The AI-supported group achieved a breast cancer detection rate of 6.7 per 1000, which was 17.6% higher and statistically superior to the rate of 5.7 per 1000 observed in the control group. In another study, Jiang et al.119 attempted to investigate whether the diagnostic performance of radiologists in differentiating cancer from noncancer on breast MRI improves when using an AI system compared with conventionally available software. The study concluded that the performance of radiologists to differentiate benign from malignant breast cancer significantly increased from 0.71 to 0.76 with the help of AI algorithms119.

Image quality and signal-to-noise ratio are inversely related to radiation dose, making it impossible to arbitrarily reduce the radiation exposure per examination. Deep neural networks can map data from one high-dimensional space to another120,121,122,123. For instance, transforming a low-dose, high-noise CT image into a high-dose, low-noise representation. Accordingly, novel approaches based on deep learning reconstruction (DLR) are being developed, with the potential to reduce radiation dose significantly. Two DLR solutions received FDA clearance and became clinically available in 2019124. DLR-based reconstruction methods have demonstrated lower radiation doses and/or improved image quality, while maintaining reasonably short reconstruction times. A pilot DLR study recently reported volumetric tomographic imaging generated from ultrasparse data sampling combined with a patient-specific prior125, which could further reduce radiation dose.

In DL applications, convolutional neural networks (CNNs) have been designed and validated for malignancy risk estimation. Using 16,077 lung nodules from the National Lung Screening Trial, one such algorithm by Venkadesh et al. demonstrated strong performance, achieving an area under the curve (AUC) of 0.93126. Zhang et al. developed an ML-based logistic regression model using CT images from 108 patients to predict and stratify pathological low and high-grade bladder cancer127. On a validation cohort of 37 patients, the model achieved an AUC of 0.86127. In another attempt, Fan et al. employed radiomics features from CT scans with 119 stage II colorectal cancer patients to train and test an ML model, which predicted microsatellite instability status with an AUC of 0.75128.

AI in cancer pathology

For many years, pathology, particularly histological evaluation, has been the gold standard for cancer diagnosis. Tissue-based analysis provides essential information on tumour type, grade, stage, and biomolecular characteristics to guide clinical decision-making129,130,131. However, there are several disadvantages to conventional pathology. Manual slide interpretation is time-consuming and often subject to inter-observer variability, particularly in complex situations132,133. In recent times, a variety of AI tools have emerged to assist with tumour evaluation. One notable example is Paige Prostate, which uses AI to identify prostate cancer by marking suspicious spots in biopsy samples. This support is invaluable for pathologists, ensuring that they don’t miss any potentially harmful areas134. Paige Lung is another AI tool for lung cancer detection that helps pathologists find early-stage cancers more precisely, including small nodules and subtle features that conventional analyses might miss135. A study by Eloy et al. found that Paige Prostate significantly reduced the need for immunohistochemistry (IHC) tests, second opinion requests, and reporting times while maintaining high diagnostic accuracy136.

In digital pathology, a wide range of AI technologies have been introduced in image processing and classification26,137,138. Several studies have reported the integration of AI in the identification, detection, and segmentation of tissues. Using a deep neural network (DNN), Chen and colleagues developed a model that aimed at automatic lymphocyte detection during H&E images. In another study, Ehteshami et al. designed a deep learning algorithm-based model to detect metastasis of lymph nodes in women with breast cancer139,140. Their model revealed comparable performance with an expert pathologist. Time is a critical factor in conventional pathology; however, several models in digital pathology have focused on automating the processes with reduced time consumption and better outcomes141,142,143.

PathAI assists in the identification and categorisation of various subtypes of breast cancer by precisely analysing histopathological images. Additionally, it has been used to ascertain if a patient has HER2144,145. AI-driven endoscopic lesion detection systems in gastrointestinal pathology improve diagnostic accuracy and reduce missed lesions by identifying and characterising lesions like polyps during colonoscopy through real-time image analysis146.

An important development in breast cancer grading is the application of DL models to improve the stratification of intermediate Nottingham Histological Grade (NHG) 2 cases, which have traditionally been challenging to diagnose due to their variability and unclear prognostic value. These models, which detect minute morphological patterns in whole-slide histopathology images, can be used to categorise NHG 2 tumours into lower- and higher-risk groups147. This method uses standard Haematoxylin and Eosin (H&E) slides and is quicker, less expensive, and yields prognostic data that is comparable to molecular assays. In a study, Mantrala et al.148 showed that AI could grade nuclear pleomorphism with performance on par with human experts, reducing pathologists’ inconsistencies148. Their research demonstrated that AI is capable of identifying important nuclear morphological characteristics that determine tumour grade and provide reliable survival stratification for a range of patient populations.

Without the need for labelled datasets, self-supervised learning (SSL) is an unsupervised method that aims to learn meaningful data representations. By comparing several representations of the same input, contrastive learning, a well-known SSL technique, trains models to distinguish between similar and dissimilar examples149,150,151,152,153,154. When Ciga et al. used contrastive learning on 57 different unlabelled breast cancer (BC) histopathology datasets, they discovered that their model performed better on datasets specific to BC than ImageNet-pretrained networks155,156. Similarly, Ashraf et al. classified BC histopathological images efficiently using contrastive learning and a new ResNet50-Inception hybrid architecture, reaching up to 98% accuracy with only 3.6 million parameters157.

Identifying small, occult tumour foci in lymph nodes remains challenging with conventional pathology and often necessitates additional confirmatory tests158,159. Promising approaches for more accurate lymph node metastasis detection are provided by recent developments in AI, especially DL, which may lessen the need for IHC and save time and money160,161,162,163,164. An algorithm created by Liu et al. can accurately detect metastatic cancer cells, including very tiny foci, in sentinel lymph node biopsies165. Their findings showed that even with typical variations in tissue samples, the algorithm continued to perform robustly. Steiner et al. assessed the function of DL support in pathologists’ examination of lymph nodes for metastases of breast cancer in a different study. Their results demonstrated that integrating AI decreased review time and error rates while increasing diagnostic accuracy, particularly for hard-to-detect micrometastases164. Bándi et al.166 investigated continual learning strategies for cancer-independent detection of lymph node metastases. This method eliminates the need for cancer-specific retraining by allowing AI models to generalise across a variety of cancer types, such as breast, colon, and head-and-neck cancers. Across a variety of datasets, their findings showed excellent accuracy and dependability166. Radiomics may be able to easily integrate into digital pathology workflows when paired with DL-based and cancer-independent AI models for lymph node metastasis detection.

Artificial intelligence in cancer genomics

Precision oncology is undergoing a dramatic change as a result of the integration of AI and genomics. By identifying somatic and germline pathogenic variants, advances in high-throughput sequencing technologies, especially next-generation sequencing (NGS), have completely changed our understanding of tumour biology167,168,169,170,171,172. Simultaneously, AI, particularly through ML, which finds patterns in data without explicit programming, and deep learning (DL), a more sophisticated method that uses artificial neural networks, are turning into a crucial tool for extracting clinically significant information from intricate, high-dimensional genomic datasets5,173.

Role of AI in mutation detection

Histopathological and genomic diagnostic techniques are used in precision oncology to help patients choose the best treatments for them. H&E staining is one of the most popular methods in histopathology, which analyses tumour morphology and phenotype and is still essential for cancer diagnosis and subtyping26,174. With their ability to predict patient survival and treatment response, genomic biomarkers are frequently used in advanced or metastatic cancers to supplement phenotypic assessment. Thus, more individualised treatment approaches have been made possible by genomics26. Over the past few decades, these developments have helped to significantly improve clinical results175,176.

Multiomics is the integrated analysis of multiple layers of biological information, including genomics, proteomics, and epigenetics, to achieve a more holistic understanding of biological systems177. Given that cancer is a complex, multifactorial disease with changes at several molecular levels, integrating these disparate datasets is especially crucial in precision oncology. Through the integration of knowledge from various omics layers, scientists and medical professionals can better describe tumour heterogeneity, find more accurate biomarkers, and establish customised therapeutic targets, ultimately leading to the development of more specialised cancer treatments178.

Structural variations (SVs) are genomic changes that involve large DNA segments (≥50 bp). These are referred to as copy-number variations (CNVs) and can be caused by a variety of mutational events, including duplications, insertions, and deletions, which alter the amount of genomic material179,180,181. Although CNVs have been analysed using a variety of techniques, whole-genome sequencing (WGS) is becoming more and more accepted as the preferred approach due to declining costs and ongoing advancements in variant detection technologies182.

On the other hand, small variants are defined as genomic changes that impact shorter DNA segments (<50 bp). These include small insertions or deletions (indels), which usually range from one to a few dozen base pairs, and single-nucleotide variants (SNVs), in which only one base is changed. Such variants, though smaller in scale, can have significant functional consequences, particularly when they occur within coding or regulatory regions183,184,185,186.

Cancer initiation and progression are greatly influenced by germline pathogenic variants (gPVs), which are inherited, and somatic pathogenic variants (sPVs), which develop within tumour cells187,188. The presence of gPVs in the BRCA1 and BRCA2 genes, which predispose carriers to breast and ovarian cancers, is one of the most well-characterised hereditary signatures in oncology189. High- and moderate-penetrance genes like TP53, PALB2, STK11, CHEK2, RAD51D, and ATM are among the other genes linked to hereditary breast cancer190. Cancer biology and treatment also heavily rely on somatic PVs in tumour suppressors and oncogenes191,192,193. For example, KRAS mutations are important in colorectal and pancreatic cancers because they promote uncontrolled cell growth, whereas TP53 mutations impair a vital tumour suppressor called the “guardian of the genome,” which leads to genomic instability, metastasis, and a poor prognosis194.

Recently, AI tools like ML, DL, CNNs, and MIL have shown promise in detecting genetic mutations in cancer195,196,197,198 (Fig. 4). MRI, PET, digital tomosynthesis, and mammography are among the imaging modalities for which researchers have created radiomic signatures that allow for the noninvasive prediction of genetic changes199,200,201. AI techniques for mutation prediction from image data are made possible by the growing digitisation of H&E slides in pathology. These methods are especially helpful when there is a shortage of tumour tissue and also cut down on the time and expense of genetic testing. Recent research using AI has demonstrated promising outcomes in the diagnosis, classification, and grading of various cancer types202,203,204,205,206,207 as well as in the detection of gene mutations, microsatellite instability, and treatment response208,209,210,211,212,213.

Fig. 4
Fig. 4
Full size image

This figure demonstrates how AI supports cancer genomics by analysing sequencing data to detect cancer-associated mutations and integrating multi-omics data to discover clinically relevant biomarkers for diagnosis and therapy.

One prevalent and important pathophysiological characteristic of cancer is mutations in genes that control cell division and apoptosis. These mutations affect the course of the disease, patient survival, and treatment approaches in addition to aiding in the development of cancer214,215. Numerous AI-based models have demonstrated efficacy in detecting mutations in a variety of cancer types, such as gastrointestinal, brain, lung, and breast cancers216. A pre-trained 3D CNN was employed by Shao et al.217 to identify EGFR mutations in lung adenocarcinoma. After entering patients’ PET/CT scans into the model, they discovered that combining PET/CT scans with clinical information produced the best diagnostic results (AUROC: 0.73)217. Similarly, using DL methods, Dammak et al.218 aimed to identify high tumour mutation burden in lung squamous cell carcinoma using histopathological images, with their models achieving an AUC ranging from 0.6 to 0.8.

Liang et al.219 proposed a CNN model to identify PDGFRA and KIT mutations in gastrointestinal stromal tumours (GIST) from histological images. Using pre-trained models on ImageNet, their approach predicted these drug-sensitive mutations with an accuracy of 70–85%. In another study, Cao et al.220 trained their model using colorectal cancer histological images from the TCGA-COAD database and applied transfer learning (TL) to adapt it to the Asian-CRC dataset. Initially, the model trained on TCGA-COAD achieved an AUROC of 0.6497 on Asian-CRC. After implementing TL, performance improved to an AUROC of 0.8504, and further increasing the number of Asian-CRC cases used in fine-tuning boosted the AUROC to 0.9264220. Similarly, Zeng et al.220 investigated the identification of IDH mutation types in glioma patients using multi-modal MRI data. They employed a pre-trained ImageNet model for feature extraction, achieving an AUROC of 0.86 in predicting IDH status, with a sensitivity of 77.78% and a specificity of 75%220.

In 2020, Yuhan Su et al.221 applied BERT to classify genetic mutations by analysing textual evidence from an annotated database, demonstrating improved capability in handling complex clinical text. Among the three BERT-based approaches tested, the BERT with the abstract shortness method performed best as a standalone model, achieving an F1 score of 0.705221. More recently, in 2023, Sanad Aburass and colleagues developed a hybrid ensemble model integrating GRU, LSTM, CNN BiLSTM, and GloVe embeddings. Using Kaggle’s Personalised Medicine dataset, this model achieved an F1 score of 0.831 for classifying gene mutations in cancer222. Sun et al.223 developed a deep neural network (DNN) model using genomic point mutations to classify tissues as either one of 12 TCGA cancer types or healthy controls from the 1000 Genomes Project. Trained on the most frequent cancer-specific point mutations derived from whole-exome sequencing data, the classifier achieved high accuracy in distinguishing healthy from tumour tissues (AUC = 0.94). However, its performance declined in the multi-class classification task of differentiating all 12 cancer types simultaneously, with an AUC of 0.70223.

In prostate cancer, one study combined ML techniques with features recommended by the National Institute for Health and Care Excellence to explore associations between prognosis and genetic mutation profiles224. ML has been used in other studies to predict the relationship between colorectal cancer and being overweight using healthy eating index scores225. Furthermore, by comparing the performance of various machine learning models, modelling techniques have shown the value of the exact binomial test for examining genome-wide somatic mutations226. Cho et al. used TCGA data from patients with lung adenocarcinoma to further demonstrate the value of machine learning in cancer genomics. These studies demonstrate how AI can enhance the detection of mutations in large cancer datasets and provide vital information on genetic alterations linked to the outcomes of patients with lung cancer227.

The role of AI in biomarker discovery

AI in oncology has revolutionised cancer diagnosis, prognosis, and treatment due to the increasing demand for precision medicine228. One of AI’s most important contributions has been the identification of biomarkers, which are crucial for precision oncology. Biomarkers, which are genes, proteins, or other measurable substances that indicate the occurrence or course of disease, have long been used in cancer diagnosis and treatment planning. If the complex nature of cancer necessitates finding novel therapeutic strategies in which important molecular pathways could be targeted, at the same time, it highlights the need for discovering biomarkers that can help in early detection and treatment selection229,230. Contrary to anatomic and qualitative-based markers, the focus has been shifted to find non-invasive and more specific biomarkers that could better characterise survival prediction, tumour phenotype, and prediction of treatment response231,232. The emergence of AI-assisted technologies and advanced deep learning approaches may mark a new era for non-invasive biomarkers in cancer care. AI improves diagnostic accuracy by evaluating intricate histopathological images233. These methods have proven effective in differentiating between cancer patients and healthy people, advancing precision medicine and the creation of tailored cancer treatments234.

Potential biomarkers and pathways in glioblastoma were investigated experimentally using blood-based liquid biopsies, with a focus on platelets235. This method found seven enriched pathways and 42 genes, including those linked to immunological regulation, neurodegenerative processes, and protein translation. A multi-step machine learning framework was used to relate the results to 12-month survival. Along with related proteins, the analysis also looked at CD133, a putative glioblastoma biomarker. Using a multi-step machine learning framework, it connected these results to 12-month survival. In a different investigation, Liu et al.236 developed a gene signature based on machine learning that included 12 T-cell exhaustion-related subtypes in glioblastoma. Using transcriptional profiling of T-cell immunity in the tumour microenvironment, the authors discovered two hitherto unknown subtypes, TEX-C1 and TEX-C2. These subtypes showed different immunological and clinical profiles, according to multi-omics analysis236.

Studies have suggested that AI-driven predictive frameworks can play a key role in the pre-treatment evaluation237,238. To create an ML-based predictive framework for pretreatment evaluation, Kawakami et al.239 divided 334 patients with ovarian epithelial carcinoma and 101 patients with benign ovarian tumours into training and test groups in 2019. The model incorporated 32 biomarkers and clinical parameters from standard blood tests using classifiers like RF. At 92.4%, the RF classifier successfully differentiated ovarian epithelial carcinoma from benign tumours; however, its accuracy in predicting clinical stage was lower at 69.0%. These studies confirm the importance of biomarkers in the early detection, diagnosis, and treatment of cancer.

Integration of AI into cancer drug discovery and development

Through ML and DL, AI improves compound optimisation, expedites target identification, enables rapid analysis of large biomedical datasets, and improves clinical outcome prediction240. AI assists with lead optimisation and de novo drug design in addition to screening by generating novel molecular structures and refining their properties to improve efficacy and safety241. Predictive modelling helps with preclinical development, especially when using in silico simulations to assess toxicity and pharmacokinetics. Additionally, AI speeds up the discovery of biomarkers, assisting in the identification of molecular signatures for patient stratification and individualised treatment plans. At the clinical stage, AI incorporates real-world data to improve patient recruitment, streamline trial design, and reduce turnaround times, all of which raise the success rate of developing cancer drugs241,242.

Advanced models that can learn from high-dimensional biological data are becoming more and more important in the integration of AI into cancer drug discovery and development. This change is best illustrated by transformer-based frameworks used in single-cell transcriptomics, which model tumour–immune interactions, reconstruct gene regulatory networks, and capture tumour heterogeneity243. By combining translational applications with basic cancer genomics, these methods speed up the discovery of new drug targets, forecast resistance mechanisms, and improve treatment approaches (Fig. 5).

Fig. 5: Application of generative pre-trained transformers to the transcriptomes of single-cell cancer.
Fig. 5: Application of generative pre-trained transformers to the transcriptomes of single-cell cancer.
Full size image

In silico drug response prediction, regulatory network inference, and precise tumour clustering are made possible by the method.

AI in drug target identification and validation

Since AI integrates and analyses a variety of biological datasets, it is becoming more and more significant in drug target identification. In contrast to conventional methods, AI-driven models combine data from proteomics, genomics, and other high-throughput data sources to more effectively identify new targets244,245. For example, it is possible to identify candidate genes and their protein products as possible therapeutic targets by mining genomic data to find genetic variations linked to disease246. Similarly, proteomic analyses allow AI to examine protein functions, structures, and interaction networks, providing details about disease pathways and target druggability247. For better systematic evaluation and target identification, several platforms could be integrated with AI248,249, such as DrugBank250, Antibiotic Adjuvant DataBase251, PubChem252, and the Antibiotic Combination DataBase240.

Fahimian et al.253 developed a novel algorithm called RepCOOL to identify potential candidates for the treatment of stage II breast cancer253. The discovery of novel GSK-3 inhibitors for the treatment of acute myeloid leukaemia is another significant advancement254. At the same time, ligand-based virtual screening methods have employed isolation forests, artificial neural networks, and Gaussian mixture models to expedite drug identification and improve prediction accuracy255.

In 2020, Yin et al.256 introduced DeepDrugw, a deep learning framework for predicting drug-target interactions (DTI), and drug-drug interactions (DDI), using graph-based techniques257. Similarly, by employing a graph autoencoder based on GCN to capture higher-order structural relationships within heterogeneous networks, Qu et al.258 demonstrated graph–drug–target interactions, enabling a more accurate representation of drugs, proteins, and their surrounding contexts259. Recently, Yu et al.260 proposed a heterogeneous GNN that combines Bi-LSTM with an attention mechanism to improve heterogeneous data aggregation and node feature extraction. To further improve model performance, a negative sampling strategy was added.

Several approaches have been used in recent studies to predict unknown drug targets, such as disease-related molecular networks261, drug–drug similarity networks, protein–protein similarity networks262, and drug side-effect networks263. To predict DTIs based on drug-target similarity networks, Tian et al.264 Sadaqat et al.265 successfully identified important target proteins by using molecular modelling, network pharmacology, and sequencing data to examine the effects of Bacopa monnieri on hepatocellular carcinoma. However, these models often have drawbacks, such as unbalanced or incomplete datasets, a lack of validated drug–target data, and a limited capacity to incorporate intricate relationships between various databases.

Following identification, a therapeutic target needs to be thoroughly validated to ensure its applicability. Since target identification and validation are inherently complex and multidisciplinary processes, it is necessary to employ a range of validation techniques to increase confidence in the observed results266. Careful execution of these steps remains crucial for reducing attrition rates and avoiding ineffectiveness in later stages of development267. A recent study demonstrated the effective use of AI in locating and validating targets for the creation of cancer medications. Researchers used an AI-driven screening method to find Z29077885, a novel anticancer compound that targets STK33268. Both in vitro and in vivo tests verified the efficacy of Z29077885 after it was identified. Mechanistic analyses revealed that the compound inhibits the STAT3 signalling pathway, which results in cell cycle arrest at the S phase and apoptosis269.

AI in drug screening and lead optimisation

High throughput screening (HTS) employs automated systems, advanced detection tools, and microtiter plates to effectively gather and analyse data in its cellular or molecular scale experiments. This method enables the rapid evaluation of thousands to millions of compounds during the discovery phase, which is an essential strategy for identifying active compounds early in the drug development process256. AI-driven virtual screening, in particular, expedites the selection process and drastically reduces the number of compounds that require experimental validation by making it simpler to predict how molecules will interact with target proteins258.

AI can help overcome resistance to current treatments by facilitating the transformation of genes and proteins that are known or may be associated with resistance into novel therapeutic targets. To predict drug sensitivity, for instance, Xiao et al.270 used ridge regression models that were trained on transcriptomic profiles and drug response data. This approach was verified across multiple colorectal cancer cohorts. In vitro pharmacodynamic assays revealed that two BCL-XL inhibitors, WEHI-539 and navitoclax, exhibited selective sensitivity towards colorectal cancer cells with high chromosomal instability, confirming CIN as a promising therapeutic target in colorectal cancer270. Similarly, Zhang et al.271 integrated multi-omics datasets to predict patient responses to immune checkpoint inhibitors using a Bayesian model. They found that anti-tumour immunity was inversely correlated with a stemness signature271. The end-to-end DL framework developed by Wen et al.272 combines a transformer architecture with a self-supervised graph neural network. They filtered 4,527,000 compounds and found 50 candidate clusters by optimising the model on the BindingDB database272.

Synthetic lethality (SL) has gained more attention recently as a useful approach for identifying potential targets for anticancer drugs in cancer pateints273. However, high costs, batch variability, and off-target effects are frequently limited by experimental SL screening. To address these issues, Wang et al.274 presented KG4SL, a brand-new model based on graph neural networks (GNNs) that integrates knowledge graph (KG) messaging into the prediction procedure. Their findings showed that the accuracy and efficacy of SL prediction were significantly improved by incorporating KG information into the GNN framework274.

Docking and scoring methods are used in structure-based virtual screening to find molecules that have a high binding affinity for a target protein275. Traditional docking procedures are frequently computationally demanding and restrict the scalability of large-scale screening, even though this approach is a useful tool in anticancer drug design. In a study, a DL-based study tool was designed to predict the docking score for cancer patients276. In another study, Yasuo et al. introduced SIEVE-Score, an AI-driven structure-based virtual screening method for hit compound identification that demonstrated notable improvements over the most advanced techniques currently in use277.

An important factor in drug development is the possibility of strong interactions, which can be evaluated by using AI models to estimate the binding affinity between target proteins and candidate drug molecules278. A predictive model called DrugnomeAI was created by Raies et al.279 to tackle the problem of targeted drug synthesis. Targets’ druggability throughout the human exome was predicted using DrugnomeAI, which is based on a stochastic semi-supervised machine learning framework. The study was noteworthy for its ability to accurately predict the druggability of targets linked to oncological diseases.

AI in preclinical and clinical trials

AI has a lot of potential to improve clinical trial design and execution. Improvements in trial development can be directly attributed to developments in cancer imaging and diagnostics280. AI’s potential to expedite and simplify patient recruitment is yet another significant benefit280. 50,000 patients from 19 datasets participated in a systematic review and meta-analysis that showed AI-assisted approaches to cancer clinical trial enrolment performed on par with, and in some cases better than, conventional manual screening techniques281.

By predicting patient behaviour and clinical outcomes, AI technologies can help with patient retention in addition to trial enrolment282. One AI model, for example, was created to forecast grade 3 or 4 adverse events in patients with breast cancer undergoing neoadjuvant treatment283. Similarly, using information from a publicly accessible cancer centre database, an ML platform created outside of the trial context was used to estimate 30-day unplanned readmission rates in cancer patients284. These tools can help investigators implement dose adjustments, early interventions, or customised symptom management strategies by identifying individual risk profiles for adverse events, trial attrition, or hospital readmission.

ML can optimise the choice of treatment regimens during the design phase of clinical trials by using simulation techniques to analyse large datasets from previous studies. For instance, in a clinical study of advanced non-small cell lung cancer patients who had never received systemic therapy before, Zhao et al. used a reinforcement learning technique called Q-learning285. This technique identified individualised, optimal strategies for particular patient subgroups using patient data from clinical reinforcement trials. Likewise, meticulous patient cohort selection and clear recruitment standards are important factors that determine the success of clinical trials. Remarkably, 86% of trials fall short of their recruitment goals within the allotted time286. AI models have been used to assess and improve trial eligibility requirements to overcome this difficulty. For example, Liu et al.287 showed that many commonly used criteria, especially laboratory test thresholds, significantly limited enrolment by simulating clinical trials in a cohort of 61,094 patients with advanced non-small cell lung cancer using a computational framework.

Tumour heterogeneity continues to pose a major obstacle in assessing treatment responses. AI-driven models that integrate baseline and follow-up CT imaging provide a powerful means of predicting therapeutic efficacy and enhancing patient stratification in clinical trials287,288,289. More accurate immunotherapy outcome prediction in advanced non-small cell lung cancer (NSCLC) has been made possible by modelling based on multi-temporal CT image features, which supports better trial design and forecasting287. Better patient selection, more precise monitoring of treatment effects, and the development of individualised therapeutic approaches in clinical trials are all made possible by these methods. By enhancing patient recruitment, streamlining trial design, and facilitating real-time treatment response prediction, these studies show how AI is changing cancer clinical trials290. AI boosts productivity, lowers attrition, and aids in the creation of customised treatments by integrating multi-modal data and sophisticated modelling. Accelerated patient delivery of safer, more effective cancer medications will depend on the ongoing validation and uptake of AI tools269,291.

Challenges and opportunities

AI has a wide range of uses in oncology, as this review article discusses, from drug discovery and personalised treatment to genomics and diagnosis. Large patient data sets can be integrated by AI-driven tools, which can improve chemotherapy, enable more accurate and timely prognosis, and possibly lower the risk of cancer progression15,292. While we have made significant advancements, challenges like patient data privacy and algorithmic bias have likely prevented the full-scale adoption of AI technologies in healthcare (Fig. 6). So, it is essential to find a middle ground between these exciting prospects and a careful consideration of the technological, ethical, and practical challenges.

Fig. 6: Shows various types of AI, such as ML, DL, NLP, TL, and RL, that could be used in cancer care.
Fig. 6: Shows various types of AI, such as ML, DL, NLP, TL, and RL, that could be used in cancer care.
Full size image

The figure also demonstrates the current challenges and limitations in these models.

Despite the potential benefits of AI systems in cancer, we cannot ignore the significant flaws and risks that come with the use of AI293,294,295. One major concern is racial bias in various medical studies. For instance, several AI tools have been known to reflect biases in how they assess the health of African Americans, like the incorrect assumption that they have thicker skin or lower lung capacity compared to white patients296,297,298,299.

There are important legal and regulatory issues with the use of AI in cancer treatment that require careful thought198,300,301. In a scoping review, conducted by Chamouni et al. concluded that the most frequent ethical issue found in 20 articles was data privacy, especially genomic information and sensitive imaging data302. The second crucial issue they reported was the principle of non-maleficence, or not hurting patients, because in certain settings, AI tools come with the risk of losing lives if the system is unable to differentiate between real and false-positive cancer lesions302. Another important question arises: who will take the responsibility, the AI tool developer, clinicians, or the hospital that used that system?

Future studies should focus on AI-assisted multivariable prediction models, which can minimise the overall error and may improve outcomes in cancer care303,304. Moreover, stakeholders, including AI-experts, clinicians, and organisations, must move towards multi-institutional collaborations, which may pave the way for establishing reliable networks, AI models, and databases305,306,307,308. Issues related to patients’ privacy underlined the need for collaborations with legislators to find solutions that put patients’ privacy and safety first to achieve better and long-lasting results309. Furthermore, large, high-quality datasets are necessary for AI models, particularly DL approaches, and institutions must collaborate to gather, curate, and compile these datasets310,311.

To minimise the patient’s privacy-related issues, leakage of data must be prevented by using fixed algorithms that keep the patient’s data locked. It is important to make sure that federated learning and safe data-sharing frameworks are implemented because they can promote inter-institutional cooperation without jeopardising patient confidentiality. While designing AI models, the large datasets must be properly validated, annotated, and tested to have an idea about the errors before applying them to the final models. Data generated as a result of pathology, imaging, or endoscopy is often not labelled properly, and therefore should not be considered suitable for AI-algorithm training. Additionally, to guarantee generalisability, AI models that exhibit strong performance on small, single-institution datasets need to be verified across a variety of populations and institutions. To conclude, the emergence of AI has revolutionised cancer care, but we are still short of successfully developed models with real-world applications. Collective efforts are needed among clinicians, AI model developers, and policymakers to assist in designing specialised models with robust policy and regulatory implementations. This will promote an atmosphere of trust accuracy and confidence in the use of AI tools in actual clinical settings.

Conclusion

Artificial intelligence and its associated algorithms, such as machine learning and deep learning, will continue to evolve in cancer care. Currently, AI algorithms such as ML, DL, GNN, NN, and NLP have already revolutionised the diagnosis, prognosis, screening, biomarker discovery, digital pathology, drug response, outcome prediction, and monitoring. The emergence of ML and DL-based AI algorithms has resulted in significant accuracy and improved outcomes in digital pathology. AI algorithms that combine genomics, radiomics, and other clinical parameters have yielded promising results. NN has been widely used in cancer imaging combined with other tools for better sensitivity, accuracy, and predictive values. AI-based CNN-CAD systems have demonstrated significantly improved efficacy and accuracy in clinical oncology312,313.

The studies we reviewed show that the use of AI has improved the radiologists’ performances, treatment response, diagnostic accuracy, and decision-making in handling complex cases. However, these AI algorithms have not yet fully evolved, so they have not been integrated into routine clinical practices. Most of the existing AI tools are not fully validated, and there is a space of mistrust between both patients and clinicians. To achieve greater outcomes, both patients and clinicians must know the credibility and capability of these systems. Furthermore, healthcare AI regulations are still developing, underscoring the critical need for robust governance frameworks that promote innovation while guaranteeing safety and efficacy for cancer patients.