Immunotherapy drug target identification using machine learning and patient-derived tumour explant validation

Augustine, Marcellus; Nene, Nuno Rocha; Fu, Hongchang; Pinder, Christopher L.; Ligammari, Lorena; Simpson, Alexander P.; Sanz-Fernández, Irene; Thakkar, Krupa; Qian, Danwen; Fitzsimons, Evelyn; Simpson, Benjamin S.; Vendramin, Roberto; Castro, Andrea; Niederer, Heather; Turajlic, Samra; Quezada, Sergio A.; McGranahan, Nicholas; Watkins, Chris; Swanton, Charles; Litchfield, Kevin

doi:10.1038/s42256-026-01201-3

Download PDF

Article
Open access
Published: 18 May 2026

Immunotherapy drug target identification using machine learning and patient-derived tumour explant validation

Nature Machine Intelligence (2026) Cite this article

Subjects

Abstract

Immunotherapy has revolutionized cancer treatment, yet only a minority of individuals respond clinically, necessitating alternative strategies that can benefit these patients. Novel immuno-oncology targets may achieve this through bypassing resistance mechanisms to standard therapies. We introduce Mining Immunotherapy Drug tArgetS (MIDAS), a multimodal graph neural network system for immuno-oncology target discovery. MIDAS leverages gene interactions, multi-omic patient profiles, immune cell biology, antigen processing, disease associations and phenotypic consequences of genetic perturbations. It generalizes to time-sliced data, outcompetes state-of-the-art baselines (including OpenTargets) and ranks approved targets above those in clinical development. Moreover, MIDAS recovers immunotherapy-response-associated genes in unseen patients, thereby capturing immunotherapy response determinants. Interpretability analyses reveal a reliance on autoimmunity, regulatory networks and immuno-oncology pathways. Functionally perturbing oncostatin M–oncostatin M receptor signalling, a proposed MIDAS target, in TRACERx melanoma-patient-derived explants yielded reduced dysfunctional CD8⁺ T cells, which associate with immunotherapy response, and reduced CCL4 levels. Furthermore, oncostatin M and oncostatin M receptor expression is associated with altered T cell and macrophage profiles in bulk transcriptomic data from patient samples. These data are consistent with a role for oncostatin M–oncostatin M in modulating the tumour microenvironment towards immunosuppressive, tumour-promoting phenotypes. Our results present a machine learning framework for analysing multimodal data for immuno-oncology target discovery.

Preclinical models for prediction of immunotherapy outcomes and immune evasion mechanisms in genetically heterogeneous multiple myeloma

Article Open access 16 March 2023

Decoding immunotherapy response through computational modeling

Article Open access 15 April 2026

Multimodal immunogenomic biomarker analysis of tumors from pediatric patients enrolled to a phase 1-2 study of single-agent atezolizumab

Article Open access 10 April 2023

Main

Checkpoint inhibitor (CPI) immunotherapy has revolutionized cancer treatment, with more than 50 FDA approvals and durable response observed in some patients¹. However, across cancer types, a minority of patients benefit clinically due to primary and secondary resistance^2,3. This highlights the urgent need for novel therapies to rescue patients failed by conventional immuno-oncology treatments. CPI response depends on host- and tumour-specific factors³, which traditional preclinical models struggle to fully recapitulate, limiting target discovery efforts⁴. The expanding wealth of high-dimensional complex data capturing different tumour–immune facets prompts interest in leveraging machine learning (ML) to derive biological insights⁵. Indeed, sophisticated ML technology capable of modelling disease complexity and inferring mechanistic insights are being increasingly applied to drug discovery across indications^6,7,8,9.

ML target discovery requires training data that richly profile disease mechanisms through different, complementary lenses to recapitulate the biological complexity and variation observed in human patients¹⁰. Sifting through ever-increasing datasets for highly informative subsets to understand disease mechanisms is a challenging feature selection problem¹¹. Integrating domain expertise in system development can address this and enrich for task-specific signals within the training corpus¹¹.

Multimodality permits comprehensive disease profiling by capturing complex interactions across omics levels that underlie phenotypes, rather than the mostly correlative insights from isolated omics analyses^8,12,13,14. This is particularly relevant for immuno-oncology, where target discovery systems must model host- and tumour-specific factors that influence anti-tumour immune responses. These are studied using diverse assays and data modalities. Multimodal integration facilitates incorporating tumour-intrinsic and tumour-extrinsic modulators of anti-tumour immunity. The former includes antigen presentation¹⁵ or immune-sensitizing pathway perturbations³, whereas the latter includes the tumour microenvironment (TME) composition^3,16,17. Additional data modalities include causal perturbations driving immune-mediated tumour elimination^18,19, population-scale gene–phenotype associations²⁰ and gene function within broader biological networks²¹. Integrating such multi-omic and functional evidence addresses the limitations of learning discovery engines from largely correlative relationships by providing causal information.

A natural approach to combine multimodal data for gene-centric inference that constrains possible solutions and is supported by previous successes involves encapsulating data within a multimodal biological graph^6,7,8,9,22. Graph neural networks (GNNs) are powerful ML techniques that can be deployed on such graph-structured data to compute node embeddings using message passing frameworks that aggregate input features along routes constrained by network topology.

Here we present a multimodal GNN system, Mining Immunotherapy Drug tArgetS (MIDAS), dedicated to immuno-oncology target discovery and supported by validation in the advanced, clinically relevant patient-derived explant (PDE) experimental platform. We evaluate diverse ML systems on a bespoke, integrated multimodal dataset of patient molecular profiles, preclinical evidence and population-scale genetic data. MIDAS outperforms existing computational and experimental target discovery methods, generalizes to time-sliced data to identify prospective targets, distinguishes approved versus targets in development, and recovers CPI response differentially expressed genes (DEGs). We perturb oncostatin M (OSM)–oncostatin M receptor (OSMR) signalling (a highly ranked MIDAS target) in the ex vivo PDE platform, demonstrating reduced dysfunctional T cells and CCL4 levels, supporting the validity of our approach.

Results

Multimodal graph ML system for novel immunotherapy drug target discovery

Therapies acting on targets supported by human genetic evidence are more likely to receive approval^23,24,25. To model anti-tumour immunity and harness genetic evidence alongside other omics and non-omics data, we developed MIDAS using a bespoke multimodal immuno-oncology dataset (Fig. 1a,b). This comprised (1) CPI-treated patient exomes and transcriptomes³; (2) tumour-infiltrating immune single-cell transcriptomics (single-cell RNA-sequencing (scRNA-seq)) atlases^26,27; (3) human leucocyte antigen-peptidomics (HLA-peptidomics)²⁸; (4) immuno-oncology-relevant gene–phenotype associations^20,21; and (5) causal gene perturbations from CRISPR tumour–T cell co-cultures^18,19. Each dataset was pre-processed into low-dimensional representations (Methods and Supplementary Table 1), permitting integration by augmenting the Hetionet gene–interaction–gene (GiG) network²¹.

**Fig. 1: Development of a multimodal graph ML system to identify novel immunotherapy drug targets.**

We developed a binary classification immuno-oncology target discovery framework, applying the ‘closed-world’ assumption in which unlabelled instances are considered negative^7,29. Positive instances (known targets) represented n = 260 non-antigenic targets in clinical development by 2019 (ref. ³⁰) (Methods), with remaining genes considered negative. Node classification GNNs were trained to predict the immuno-oncology target status (Fig. 1b).

We defined three in silico validation tasks for the trained model (Fig. 1c). The first assessed generalization to time-sliced data, checking whether the model assigned higher scores for targets entering trials after the target label collection cut-off (in 2019 (ref. ³⁰); Methods). Second, owing to high drug development costs³¹, we investigated the prediction translational potential, assessing if the approved targets scored higher than those under clinical development (phases I–III) and remaining genes. Crucially, the model was blinded to these annotations. Third, we assessed CPI response DEG recovery in unseen patients (Methods), probing whether the model captures factors influencing tumour–immune dynamics.

Graph ML system achieves robust performance across in silico immuno-oncology benchmarks

We investigated multiple GNN architectures for immunotherapy target discovery, varying the k-nearest neighbours sampled during message passing to enhance generalization³² and efficiency (Methods). All GNNs achieved an area under the receiver operating characteristic (ROC-AUC) > 0.8 on cross-validation (CV) and held-out test folds (Supplementary Fig. 1a and Supplementary Table 2). The highest-performing variants (ranked by the held-out ROC-AUC) were compared using the Bayesian information criterion. A simplified graph isomorphism network (GIN³³) proved optimal, featuring pre-/post-processing layers that utilise multilayer perceptrons (MLPs), but without MLP-based GIN layers or global concatenation (Supplementary Fig. 1b and Methods). All the following analyses used this model (referred to as MIDAS GIN; Supplementary Table 3).

To assess whether GNN complexity was necessary, we trained diverse non-geometric models using the same data (swapping GiG topology for node degree; Supplementary Methods; Supplementary Tables 4 and 5 list the performances and optimal hyperparameters, respectively). MIDAS GIN outperformed all variants (false-discovery rate (FDR) < 0.05; Supplementary Fig. 1c). We also implemented a meta-learning approach that sequentially integrated data (Supplementary Methods and Supplementary Fig. 2), combining gene importances from base immunotherapy response predictors (trained using sequencing data) with the remaining features via a stacked binary classifier ensemble. Again, MIDAS GIN proved superior (FDR = 5.99 × 10⁻⁴; Fig. 2a). Hence, leveraging GNN message passing to integrate multi-omics, preclinical, population-scale genetic data and biological contexts enables improved performance.

**Fig. 2: Graph ML system achieves robust performance across in silico immuno-oncology benchmarks.**

We then compared MIDAS GIN against existing target discovery methods (Fig. 2a), finding that it outperformed OpenTargets (https://www.opentargets.org/) direct (FDR = 3.28×10⁻³), indirect (FDR = 1.74×10⁻⁴) and combined evidence (FDR = 2.63×10⁻⁴). MIDAS GIN also showed improved performance compared to TargetDB³⁴, a random forest (RF) tractability predictor, tractability estimates (FDR = 2.71×10⁻⁴) and multiparameter optimization (MPO; equally weighing all evidence sources; FDR = 3.47×10⁻⁴). Additionally, MIDAS GIN performed better than DepMap (https://depmap.org/portal/) gene effect scores (FDR = 8.70×10⁻¹⁹), which reflect gene perturbation influences on cell viability, and the CRISPR CD8⁺ T cell co-cultures (FDR = 8.70×10⁻¹⁹ and FDR = 6.29×10⁻²⁰).

Next, we assessed the model using our in silico validation tasks. First, MIDAS GIN generalized to the time-sliced data by differentially ranking genes that were prospectively recognized as immuno-oncology targets (P = 1.1×10⁻¹⁰, Mann–Whitney U test; Fig. 2b). OpenTargets direct evidence, the next best performer, also ranked time-sliced targets above random (P = 8.7×10⁻⁹, Mann–Whitney U test), but differentiated between them less effectively (Supplementary Fig. 1d).

Second, despite being blinded to clinical phase annotations, MIDAS GIN, on average, predicts higher scores for approved targets compared to non-target genes (Fig. 2c). This result held when comparing approved versus targets in development (P = 0.00163, Mann–Whitney U test). Recategorizing time-sliced targets as in-development targets did not diminish this (P = 2.23×10⁻⁴, Mann–Whitney U test; Supplementary Fig. 1e).

Third, MIDAS GIN recovers CPI response DEGs in unseen trials, even after accounting for gene expression (randomly sampling genes, stratified by mean expression, for the null distribution). The top 200 predictions were enriched for more DEGs than size-matched random sets (empirical P < 0.05 in pan- and lung cancers; Fig. 2d). MIDAS GIN correlated with the DEG Wald statistic, with stronger correlations amongst higher-ranked genes observed in pan-, lung and renal cancer (P < 0.05, Spearman’s rank; Supplementary Fig. 1f). Hence, MIDAS GIN achieves robust performance, assigns higher scores to prospectively identified immuno-oncology targets as well as to approved targets compared to those in development, and captures patient tumour–immune dynamics, supporting the biological relevance of its predictions.

Global interpretability analysis reveals features informing immuno-oncology target prediction

We next investigated potential biological drivers of model predictions. Gene Set Enrichment Analysis (GSEA) revealed strong enrichment for immuno-oncology pathways (Fig. 3a, Supplementary Table 6 and Supplementary Note 1), including IL-2, CD3 and TCR signalling. In particular, PD-1 signalling (a canonical immuno-oncology target) was highly enriched, further supporting that MIDAS GIN captures signals relevant to immuno-oncology discovery.

Interpretable drug discovery can help to justify clinical development programme costs¹⁰ and build a mechanistic understanding. We applied permutation feature importance to assess which factors most influence model training (Methods and Fig. 3b), finding that the gene regulatory network (GRN) and autoimmunity features ranked the highest. Importance rankings were consistent across CV test and held-out sets (Supplementary Fig. 3a,b); the top 4 biological feature categories (14.3%) were stable across splits (Supplementary Fig. 3b). In particular, bulk transcriptomics (RNA-seq) and macrophage biology ranked highly in the CV train but not in the CV test and held-out sets. As a further sensitivity analysis, we examined how feature importance varies with specificity thresholds (measuring the absolute change in recall post-permutation). Again, the top ranked categories remained stable (Supplementary Table 7), highlighting consistent model behaviour. Differences between training and test sets probably reflect features driving minor overfitting, despite robust CV protocols. Since genes were ranked by output scores, and not assigned binary labels, variations across specificity thresholds likely have minimal impact.

Having assessed the influence of node features, we next examined the role of knowledge graph connectivity by permuting GiG edges whilst preserving node degrees, thereby corrupting edge biological information but maintaining node topology. By corrupting GNN message passing mechanics, network permutation caused the greatest training performance drop (Fig. 3c), underscoring the critical role of gene–gene interactions for immuno-oncology discovery.

MIDAS identifies candidate immunotherapy drug targets

To identify novel immuno-oncology targets, we removed positively labelled instances from the top 300 predictions, and reviewed the remaining for novelty and biological plausibility, shortlisting 43 targets (Methods and Supplementary Table 8). These were manually reviewed, in ranked order, for biological plausibility, safety, druggability, clinical and preclinical competition, and availability of appropriate compounds for ex vivo validation.

Proposed immunotherapy targets include OSM and OSMR, which score 0.966 and 0.965, respectively (Fig. 4a). OSM, an IL-6 cytokine family member produced by activated T cells and macrophages, acts on fibroblasts, cancer and myeloid cells. OSMR dimerizes with IL-31RA or IL-6ST (gp130) to mediate signalling³⁵. Another highly predicted candidate, PTPN22 (scoring 0.793), is a tyrosine phosphatase that negatively regulates TCR signalling (Fig. 4a). Preclinical evidence (disjoint from the MIDAS training data) supports both candidates, underscoring the effectiveness of MIDAS GIN^{36,37,38,39,40,41,42}.

**Fig. 4: Novel candidate immunotherapy drug targets.**

Next, we investigated the factors driving candidate prediction using a permutation approach (Supplementary Fig. 4a,b and Methods). Targets were mapped to pathways, which were independently permuted, defining importance as the normalized absolute prediction change. Key pathways for PTPN22 are functionally relevant and include those used in TCR signalling, specifically ZAP-70 translocation as well as CD3 and TCRζ phosphorylation (Fig. 4b). Downstream of the TCR, PTPN22 dephosphorylates TCRζ, CD3ε and ZAP-70 (ref. ⁴³). ZAP-70 phosphorylation correlates strongly with CD8⁺ T cell activation markers, including granzyme B (GZMB) and Ki-67 (ref. ³⁹). Consistently, inhibiting PTPN22 in mice promotes CD8⁺ T cell-mediated anti-tumour responses^37,39. Hence, MIDAS GIN likely scores PTPN22 highly by leveraging relevant functional biology.

Pathway importances were averaged across the interacting partners, OSM and OSMR. Interleukin and cytokine signalling were the most influential (Fig. 4c), showcasing a dependence on immune function. The preferential reliance on IL-4 and IL-13 signalling could reflect that OSM induces IL-4 and IL-13 (ref. ⁴⁴). In turn, IL-4 promotes IL-31RA expression, which heterodimerizes with OSMR^45,46, and myeloid-mediated immunosuppression^47,48. Thus, MIDAS likely deems OSM–OSMR signalling as an immunotherapy target based on relevant tumour immunology.

Functional validation of novel immunotherapy drug targets

To evaluate predicted targets, we independently blocked OSM–OSMR and PTPN22 signalling (with an anti-OSM antibody and PTPN22 inhibitor, respectively) in n = 8 stage 3 or 4 melanoma PDEs, assessing for an impact on T cell phenotypes and functionality (Fig. 5a and Supplementary Tables 9 and 10). PDEs are a sophisticated preclinical platform that preserves the endogenous TME, facilitating the investigation of functional perturbations in tumour-infiltrating immune cells⁴. They recapitulate patient CPI response⁴⁹ and have been used to probe new immunotherapies^{50,51,52,53,54,55}.

**Fig. 5: Functional validation of novel drug targets.**

Since PTPN22 acts downstream of the TCR, we expected its inhibition to directly affect T cells. By contrast, anti-OSM was expected to operate indirectly, blocking T cell/macrophage-produced OSM signalling in OSMR-expressing cells (fibroblasts, cancer and myeloid cells), which may subsequently influence T cell states. Using high-dimensional flow cytometry, we profiled immune phenotypes in CD4⁺ and CD8⁺ T cells, assessing markers of proliferation (Ki-67), cytotoxicity (GZMB) and activation (HLA-DR, OX40 (CD134), 4-1BB (CD137) and PD-1 (CD279)). We also investigated pre-dysfunctional (PD-1⁺TCF-7⁺CD39⁻) and dysfunctional (PD-1⁺TCF-7⁻CD39⁺) CD8⁺ T subsets, which correlate with the CPI response⁵⁶.

Following anti-OSM treatment, we observed reduced dysfunctional CD8⁺ T cells (PD-1⁺TCF-7⁻CD39⁺; P = 0.00781; Fig. 5b), no effect on pre-dysfunctional CD8⁺ T cells (P = 0.933; Fig. 5b) and a decrease in CD4⁺GZMB⁺ T effectors (P = 0.0156; Fig. 5c). Upon PTPN22 inhibition, we noted a decrease in dysfunctional CD8⁺ T cells (Fig. 5d) and an increase in CD4⁺ T effectors expressing Ki-67, HLA-DR, or OX40 (CD134) and CD8⁺ T cells expressing Ki-67, HLA-DR, or PD-1 (Fig. 5e), although none of these achieved statistical significance.

Cytokine bead array (CBA) analysis of culture medium from these experiments revealed reduced CCL4 following anti-OSM treatment (P = 0.0391; Fig. 5f). Additionally, the CCL4 expression correlates with OSM in lung (Spearman ρ = 0.368, P = 1.30×10⁻¹³), bladder (Spearman ρ = 0.518, P = 3.99×10⁻¹¹) and renal (Spearman ρ = 0.271, P = 0.00141) cancers, and with OSMR in bladder cancer (Spearman ρ = 0.494, P = 6.48×10⁻¹⁰), in the CPI2500 cohort (paper in preparation). Since CCL4 is associated with tumour-promoting macrophages⁵⁷, this is consistent with a potential role for OSM–OSMR signalling in polarizing macrophages towards M2-like phenotypes (associated with immunosuppressive, tumour-promoting effects).

To further explore its immune associations, we next examined whether OSM and OSMR gene expression associates with CIBERSORT T cell scores⁵⁸, which reflect the estimated relative proportions of T cell subtypes of interest, in CPI2500 tumour bulk transcriptomes (paper in preparation). High OSM and OSMR were associated with decreased CD4⁺ resting memory, but increased CD4⁺ activated memory, CD8⁺ and regulatory T cells (Supplementary Figs. 5 and 6). Signature resolution precluded CD8⁺ T subtype analysis, meaning it was not possible to confirm whether this reflected an increase in exhausted cells.

Given the link between OSM–OSMR and suppressive M2-like macrophage polarization^{40,41,42,59,60,61,62,63,64}, we investigated CIBERSORT macrophage quantifications⁵⁸. A trend towards increased M2 macrophages in bladder cancer was observed at a higher expression of OSMR (P = 0.0281, FDR = 0.112; Supplementary Fig. 7) and OSM (P = 0.0720, FDR = 0.144; Supplementary Fig. 8). Elevated M1 macrophages (a pro-inflammatory phenotype associated with increased anti-microbial and anti-tumour activity) associated with higher OSMR (FDR = 0.0467; Supplementary Fig. 7) and OSM (FDR = 0.00357; Supplementary Fig. 8) expression in bladder cancer. An increase in the M1/M2 ratio at higher OSM expression was found in lung cancer (FDR = 0.0215; Supplementary Fig. 8). M0 macrophages (naïve macrophages that undergo polarization towards M1- or M2-like phenotypes) were elevated at higher levels of OSMR (bladder: FDR = 0.0467; Supplementary Fig. 7) and OSM (bladder: FDR = 0.00357; lung: FDR = 9.5×10⁻⁵; renal: P = 0.050, FDR = 0.121; Supplementary Fig. 8). These data suggest that OSM–OSMR signalling may contribute to influencing TME macrophage phenotypes.

Although PTPN22 inhibition did not significantly alter the tumour-infiltrating T lymphocyte compartments, a small decrease in CD4⁺ T cell cytotoxicity and a reduction in dysfunctional CD8⁺ T cells (a population associated with immunotherapy response) were observed following OSM-OSMR perturbation. These findings support OSM–OSMR blockade as a candidate immunotherapy strategy, underscoring the utility of MIDAS for novel immuno-oncology target discovery.

Discussion

We propose MIDAS, a multimodal GNN system, as a dedicated and effective cancer immunotherapy target discovery engine. It achieves robust performance at recovering immuno-oncology targets, outperforming existing tested computational and experimental methods (including OpenTargets and CRISPR co-cultures). MIDAS generalizes to time-sliced data, differentially scoring targets that entered clinical trials prospectively as well as targets that are approved compared to those in development, which suggests important ramifications for derisking highly expensive, failure-prone drug discovery programmes^31,65. Our experiments highlight the advantage of using GNNs over alternative ML frameworks, as they effectively leverage the underlying gene interaction network, which contains information central to target discovery. Furthermore, MIDAS GIN enriches for fundamental tumour immunology (Supplementary Note 1) and its predictions can be influenced by relevant biological pathways, showing that it can extract pertinent information from the input multimodal dataset and lending credence to its predictions.

MIDAS GIN scores the OSM and OSMR genes highly. They were previously implicated in tumour-promoting TMEs in mouse pancreatic and breast cancer models^40,41,42. Extending this, we observe significantly reduced dysfunctional CD8⁺ T cells and CCL4 levels (linked to tumour-promoting macrophages⁵⁷) following OSM-OSMR perturbation in a clinically relevant experimental platform that retains the TME, recapitulates patient response and has facilitated immunotherapy exploration across cancers^{49,50,51,52,53,54,55}.

OSM and OSMR expressions are associated with altered T cell profiles. In particular, the increased proportions of regulatory T cells are consistent with prior reports, showing less exhausted intratumoural T cells in OSM-deficient tumours⁴¹. Although both M1 and M2 macrophages were elevated in bladder cancer, the elevated M0 macrophage scores noted with higher expressions of OSM and OSMR genes could indicate polarization away from anti-tumour M1-like tumour-associated macrophages, consistent with a role for OSM–OSMR in promoting tumour-associated macrophage infiltrates^{40,41,42,59,60,61,62,63,64}. These data imply that OSM–OSMR may act in a functional capacity within the TME, thereby underscoring the efficacy of MIDAS GIN.

Anti-OSM agents have been trialled in autoimmune and rheumatological indications, but none have achieved FDA approval. Despite some favourable safety data in phase I trials (ClinicalTrials.gov ID NCT04138043 (ref. ⁶⁶)), poor safety profiles and lack of efficacy were observed during phase II (refs. ^67,68). We propose that future work should focus on oncology indications and aim to address the existing issues regarding binding affinity⁶⁸ and safety. The multiple agents in preclinical development⁶⁹ indicate continued interest in therapeutically manipulating this axis.

GNNs^70,71 and other graph-based approaches⁷ have advanced the integration of multi-omic datasets in biology, attracting considerable interest, particularly regarding their explainability. However, no algorithm achieves superiority across all evaluation dimensions, invariably necessitating trade-offs⁷². Due to the size and density of the underlying graph used in the MIDAS GIN model, along with its complex, multilayered structure, executing a GNN-specific interpretability routine would require benchmarking several models, which is computationally prohibitive for our main objectives. Moreover, current interpretability methods lack an extensively validated and accepted framework regarding appropriate performance metrics⁷³. We, therefore, utilized a permutation strategy to explore how node features and GiG network topology influence MIDAS GIN.

Transcriptional dynamics (GRN) were the most important, possibly echoing that immuno-oncology targets modulate immune cell transcriptional states⁷⁴ towards less exhausted, more anti-inflammatory phenotypes. The reliance on autoimmunity coincides with the link between it and immuno-oncology^75,76,77, as gene perturbations driving immune hyperactivity to cause disease could feasibly trigger anti-tumour immunity.

Another key feature alludes to the role genes play within Treg and functional dendritic cell (DC) interactomes. Regulatory T cells suppress cytotoxic CD8⁺ T cells and restrain anti-tumour immunity, rendering them an immunotherapy target^78,79. Functional DCs present tumour-derived antigen to naïve T and B cells, connecting innate and adaptive immunities. They cross-link CD4⁺ and CD8⁺ T cells, which is critical for maximal CD8⁺ cytotoxicity, with these triads linked to the CPI response⁸⁰. Together, these data imply that MIDAS GIN is probably driven by relevant tumour immunology.

In future, MIDAS GIN embeddings or output scores could be augmented with new information (for example, additional scRNA-seq or CRISPR screen data) to enrich for targets with specific traits. This could facilitate flexible use of model insights without retraining the full model, reducing resource demands.

Despite these successes, MIDAS has limitations. First, posing target discovery as a binary classification problem assumes all genes addressed in clinical trials are true, and equal, immunotherapy targets. This expands the positive set for training, but identifying targets that will probably receive regulatory approval is more efficient due to clinical trial attrition and development costs^31,65.

Second, unlabelled genes were treated as negatives, a standard assumption in target discovery methods⁷, as proving a gene has no relevance to the domain of interest is not trivial. This introduces noise as undiscovered immunotherapy targets are mislabelled. Yet, using undersampling of negatively labelled nodes, repeated CV and fold-model bagging ensured that MIDAS aligned with robust ensemble-based positive-unlabelled learning methods^29,81,82. Additionally, MIDAS GIN leverages biological context and network connectivity, which acts as prior knowledge for each class, to identify candidate immunotherapy targets, following methodologies proposed for advanced negative sampling in positive-unlabelled learning frameworks for knowledge graphs⁸³.

Third, manual review was required to triage targets and decide whether they should be perturbed via agonistic or antagonistic strategies, which can introduce bias. Future work should focus on these areas to create true end-to-end ML target discovery pipelines. Since current approaches output predictions for thousands of genes, a systematic functional assessment of high-scoring candidates would incur prohibitively large time and financial costs. This could be addressed through, for example, encoding druggability predictions, protein structure information or literature-derived features within the input features. Regarding perturbation direction, strategies could include incorporating target directionality in the classification task, or explainability approaches that infer whether a candidate is considered stimulatory or inhibitory.

The success of MIDAS across in silico and functional validation tasks staunchly supports applying sophisticated ML for immuno-oncology target discovery. It underscores the value of multimodal, multi-omics datasets to accurately model factors influencing tumour–immune dynamics. Leveraging tools such as MIDAS to produce data-driven candidates could ameliorate the financial burden, long timescales and high attrition that plague clinical trials and traditional target discovery.

Methods

Datasets

CPI1000+ cohort

The CPI1000+ cohort comprised 15 studies and 9 tumour types (melanoma, lung, bladder, renal, breast, gastric, colorectal cancer, head and neck, with the remaining tumour types grouped under the “other” category). Whole exome and bulk transcriptomics (RNA-seq) sequencing data were processed as previously described³. Clinical end-points were defined by radiological response according to the RECIST criteria. Complete response (CR) or partial response (PR) was classified as a responder (R), whereas stable disease (SD) or progressive disease (PD) was classed as a non-responder (NR). Sample numbers were n = 941 (R = 259, NR = 682; mutation), n = 1,061 (R = 276, NR = 785; copy number alteration) and n = 933 (R = 249, NR = 684; RNA-seq) patients.

Bulk RNA-seq transcripts per million data were quantile normalized using the preprocessCore R package (v.1.56.0)⁸⁴, after excluding genes unexpressed in at least 65% of patients. The effects of the source study and tissue source (whether fresh frozen or fresh frozen and paraffin embedded) were regressed out using linear regression.

The processed exome data were categorized for modelling as follows (Supplementary Table 1): mutation data were classified into loss of function, missense or otherwise (non-mutated/synonymous mutation), with ties broken according to this hierarchy. Copy number data were represented using log[R] (defined as $\log [{\rm{R}}]=\frac{{\log }_{2}[\mathrm{total}\,\mathrm{copy}\,\mathrm{number}]}{2}$).

scRNA-seq datasets

We utilized previously constructed and internal scRNA-seq atlases specific to B cells²⁶, T cells and DCs, together with a version of a published macrophage atlas²⁷. Processed data in the form of raw counts were obtained from the Gene Expression Omnibus under the following accession numbers: GSE123813, GSE121638, GSE131907, GSE123139, GSE114727, GSE127465, GSE178341, GSE148071 and GSE169246. Additionally, raw count data were downloaded from the Sequence Read Archive with the accession number SRZ190804 (ref. ⁸⁵) from https://github.com/czbiohub-sf/scell_lung_adenocarcinoma (ref. ⁸⁶), and from http://blueprint.lambrechtslab.org (ref. ⁸⁷). Further, additional information was sourced from ref. ⁸⁸. Processed scRNA-seq data from two additional cohorts were retrieved from the Single Cell Portal (https://singlecell.broadinstitute.org/single_cell/study/SCP1288/tumor-and-immune-reprogramming-during-immunotherapy-in-advanced-renal-cell-carcinoma#study-summary) and the Human Tumor Atlas Network data portal at https://data.humantumoratlas.org/ (ref. ⁸⁹). An additional scRNA-seq dataset was requested from the corresponding author of ref. ⁹⁰.

For each atlas, cells of the specific lineage were extracted from each scRNA-seq study based on the annotations provided in the source publications. The raw count matrices were merged and analysed using Seurat (v.4.0.6)⁹¹. Quality control was rigorously applied to the cells based on several quality control metrics, including total unique molecular identifier count, total number of expressed genes and the percentage of mitochondrial gene expression. Cells that did not meet the following criteria were filtered out: (1) 300 < unique molecular identifiers < 5,000,000, (2) 200 < expressed genes < 6,000 and (3) mitochondrial gene expression percentage < 20%.

The remaining cells were normalized using the SCTransform function from Seurat (v.4)⁹². Dimensional reduction via canonical correlation analysis was used to identify anchors using 3,000 genes, which were then used by the IntegrateData function to eliminate batch effects. Principal component analysis was performed on the integration-transformed expression matrix, and the top 30 principal components were used for graph-based clustering and further dimensionality reduction using uniform manifold approximation and projection (UMAP).

Clusters were identified using the FindClusters function⁹², with the resolution parameter varied from 0 to 1. Final resolutions were determined to be 0.5 for macrophages, 0.4 for DCs and T cells and 0.2 for B cells, selected based on the elbow plot method. The FindAllMarkers function from Seurat⁹² was used for intercluster differential expression analyses. The top cluster-specific DEGs were subsequently utilized to assign cell-type labels to the clusters.

Cell-type-specific interactomes

SCINET (v. 1.0)⁹³ was applied to scRNA-seq data normalized using SCTransform (from the Seurat package^91,92) to construct cell-type-specific protein–protein interaction networks (Supplementary Table 1). SCINET was not used for batch correction. The resulting interactomes were validated by assessing whether cell-type marker genes, comprising the top 50 DEGs (or all DEGs for cell types with <50 DEGs), had higher topological specificity scores than non-marker genes (Supplementary Fig. 9).

EDGE immunopeptidomics cohort

Publicly available matched HLA-peptidomics and bulk transcriptomics data (n = 74 samples, where n = 60 were from patients diagnosed with cancer) were downloaded²⁸ and subsequently processed to yield Pearson correlation coefficients describing associations between gene expression and peptide presentation across all sample HLA molecules (Supplementary Table 1). A q-value threshold of 0.05 was used to identify peptides that were detected on the HLA molecules from a given sample.

GWAS catalogue

Germline single-nucleotide polymorphism (SNP) data were downloaded from the genome-wide association studies (GWAS) catalogue²⁰. Intergenic SNPs were excluded. Significant SNP–phenotype associations (using the Bonferroni-corrected threshold: 5×10⁻⁸) for immuno-oncology-relevant phenotypes were extracted⁷⁷. Next, they were categorized, using domain expertise, into autoimmune/rheumatic/allergic (including diseases from ref. ⁹⁴), blood counts and cytokine/chemokine levels. For each gene, the number of SNPs associated with each of these categories was counted and used as features for downstream immuno-oncology target discovery (Supplementary Table 1).

Genome-wide CRISPR co-culture screens

Genome-wide CRISPR screens derived from publicly available tumour–T cell co-culture screens^18,19 were downloaded and processed using MAGeCK with default values⁹⁵. These results were represented as –log₁₀(MAGeCK scores) and indicate whether gene knockouts in cancer cells led to proliferation (immune evasion) or death (immune sensitization) when co-cultured with T cells (Supplementary Table 1). Features corresponded to immune-evading and -sensitized cells, for each cell line separately.

Pathway analyses

Over-representation and GSEA analyses were performed using the WebGestaltR package (v.0.4.6)⁹⁶, with the Reactome pathway database⁹⁷ and adjusting for multiplicity using the FDR (Benjamini–Hochberg) method, with significance defined as FDR < 0.05. Over-representation analyses were performed to identify processes that were upregulated in a pre-specified list of interest, whereas GSEA was used when there was no prior filtering of genes.

Statistical analysis

All statistical tests performed were two sided, unless otherwise stated. Multiple testing correction was performed using the FDR. The threshold used for significance was either P = 0.05 or FDR = 0.05, if multiple testing adjustment was performed. Normality was assessed using either the Shapiro–Wilk test or the Anderson–Darling test (if sample size was more than 5,000). Unless stated otherwise, parametric tests were used only if the data were normally distributed based on the above tests, with non-parametric methods used otherwise. Comparisons of ROC-AUC were performed using the DeLong’s test or bootstrapping, in the case of curves with different directions, using the pROC (v.1.18.5) R package⁹⁸.

In certain cases, empirical P values were calculated from permutation tests using the definition of a P value, the probability of obtaining a value at least as extreme as that observed: empirical P $=\frac{{\rm{number\; instances}}\ge {\rm{|observed\; value|}}}{{\rm{total\; number\; of\; instances}}}$. P values obtained in this manner are explicitly stated in text.

No a priori power calculations were performed, and blinding was not used. Held-out sets were not used for model training or hyperparameter optimization. Unless otherwise stated, all statistical analyses were performed in R (v.4.1.3).

CIBERSORT analysis

Bulk RNA-seq data from the CPI2500 cohort, an extension of our previous CPI1000+ cohort³, were processed using a standardized in-house bioinformatics pipeline developed within the CPI2500 working group. Gene expression quantification was performed using RSEM⁹⁹, producing transcripts per million values for all annotated genes. For immune cell composition analysis, CIBERSORT⁵⁸ was applied to the expression data to estimate the relative proportions of 22 immune cell types. Only samples with successful deconvolution results (P < 0.05) were retained for downstream analysis. All correlation analyses of the CPI2500 data were assessed via the Spearman’s rank correlation coefficient using transcripts per million values within each cancer type. Comparisons of estimated proportions of specific immune cell types between samples with high and low OSM and OSMR expressions (defined as the lower quartile and upper quartile of expression, respectively) were assessed using a two-sided Mann–Whitney test.

Data visualization was performed in R (v.4.1.3) using the ggplot2 package (v.3.4.1).

MIDAS GNN models

Multimodal biomedical data integration

We encapsulated our rich multimodal database within the Hetionet biomedical graph to support geometric deep learning GNN model development²¹ (Fig. 1a,b and Supplementary Table 1). Briefly, Hetionet represents biomedical information as a heterogeneous network compiled from 29 public datasets. This comprehensive network includes 47,031 nodes, classified into 11 distinct entities: genes, compounds, anatomy, diseases, symptoms, side effects, biological processes, cellular components, molecular functions, pathways and pharmacological classes. These entities are interconnected by 2,250,197 edges of 24 types, representing various relationships between the nodes. Gene–disease associations, GiG and GRN were extracted from the Hetionet biomedical knowledge graph²¹. Both networks are represented as unweighted networks, where an edge connects two gene nodes only if their products interact (GiG) or influence the expression of the target nodes (GRN), the latter also having directionality.

Graph nodes (n = 11,919) comprised genes with edges linking genes involved in n = 99,275 interactions (Fig. 1b and Supplementary Table 1). Genes were also labelled with n = 6,387 gene-autoimmune or rheumatic disease associations and their respective node degree within the GRN. To reduce input omics data dimensionality, bulk data were first pre-processed to describe the association between each data type and CPI response. Bulk transcriptomics and copy number data were represented as the median expression and log[R] values, respectively, across patient samples, categorized by response to CPI therapy. Mutation data were summarized as the mean weighted sum of mutation types (loss of function = 2, missense = 1, otherwise = 0) for R and NR patients.

The scRNA-seq atlas data, following the use of the SCTransform function, were pre-processed to create cell-type-specific interactomes that describe the role of different intratumoural immune subsets using the SCINET package (v.1.0)⁹³. These interactome networks were subsequently summarized using the SCINET topological specificity score⁹³. Briefly, this is a measure of gene influence within a specific cell-type interactome that accounts for its role in a global cell-type-agnostic reference. HLA-peptidomics, CRISPR co-culture screens and GWAS catalogue data were pre-processed as described above and appended to the node feature matrix. This effectively reduced data dimensionality and allowed the creation of meaningful features for integration along the GiG skeleton.

Missingness was classified into pseudo-missing and true-missing features. In pseudo-missing features, the missing values were imputed as 0: Shapley importance scores from CPI response predictors, genetic SNP–phenotype associations and GiG network degrees. In true-missing features, certain genes were absent and could not reliably be imputed with 0: gwCRISPR co-cultures (excluded missing genes), HLA-peptidomics (median imputation) and GRN node degrees (median imputation). HLA-peptidomics missing training data and held-out sets were imputed using the median value computed on the training set. Missing GRN degrees in the training data were imputed using the median value computed on the training set, and those in the held-out dataset were imputed using that computed on the held-out set. Missing gene topological specificity scores were imputed with 0.

Positive class labels for model development

For all the target discovery models, we defined an immuno-oncology target as genes that have been addressed in at least phase I clinical trials. Immuno-oncology targets (from 2017 to 2019) were downloaded from the Cancer Research Institute iAtlas (https://isb-cgc.shinyapps.io/iatlas/). After excluding targets identified by the terms ‘Pathway’ = ‘Antigen’ and ‘Description’ containing ‘vir’, ‘vaccine’, ‘cell therapy’ and ‘cellular therapy’, n = 332 true-positive immuno-oncology targets were obtained. After exclusions due to missing data, there were n = 260 positively and n = 11,659 negatively labelled genes for GNN development.

MIDAS GNN design space

We systematically studied the architectural design space for GNNs that could integrate rich multimodal data and leverage Hetionet for immuno-oncology target discovery²¹. Specifically, three key components were considered in the implemented approach¹⁰⁰: (1) a stacked set of pre-processing MLP layers (allowing for feature engineering through combining the input features and the injection of nonlinearity), including dropout layers and batch normalization; (2) a module covering stacked GNN designs, for example, convolutional-based, attention-based, or sample and aggregate-based algorithm specifically designed for inductive node embedding that generalizes to unseen nodes^{32,33,101,102}; and (3) a final post-processing module also comprising stacked MLPs (to refine the node embeddings and inject further nonlinearity, thereby increasing the overall network depth and expressiveness) and leading to a probability output¹⁰⁰. We expanded the number of nodes from the pre-processing MLP module to the GNN (by a factor of 2) and contracted the GNN output to the post-processing MLP (by the same factor). We used a factor of 2 to reduce computational complexity that would otherwise arise from further optimizing this value. This design choice was inspired by similar approaches in CNNs (inverted bottleneck¹⁰³) and transformers (feed-forward networks for expansion and compression¹⁰⁴).

Hyperparameter search was done in Optuna (v.3.0)¹⁰⁵ by maximizing the ROC-AUC in test folds during CV (see the ‘Resampling optimization strategies for target prediction models’ section) using the default tree-structured Parzen estimator. In addition to the message passing model-specific hyperparameters, the general hyperparameters optimized were embedding dimension, number of layers in the pre-processing and post-processing modules, learning rate and dropout rate (Supplementary Table 11). For each algorithm, the domains for hyperparameter search were meticulously crafted by extending the recommended search regions specified in the original papers of each algorithm.

MIDAS GIN architecture

The underlying graph for the GNNs was the GiG network extracted from Hetionet²¹. To evaluate the performance of various GNNs (Supplementary Methods describes their general structure), several architectures available through the PyTorch Geometric package (v.2.3.1)¹⁰⁶ were used with an inductive learning paradigm. These comprised models such as graph sample and aggregate³², graph convolutional networks¹⁰², graph attention networks¹⁰¹ and variants of the GIN³³. These variants comprised a GIN using MLPs (GIN MLP); a GIN MLP that did not implement any pre- or post-processing layers (GIN MLP no proc); a GIN MLP no proc variant that implemented global concatenation (GIN MLP no proc concat); and a GIN that uses a linear layer instead of an MLP, pre- and post-processing layers, and does not implement a global concatenation. The last variant had the highest performance and is referred to as MIDAS GIN. Therefore, the GIN message passing model and the characteristics of the optimal model are described below:

$${{\mathbf{x}}}_{\mathrm{i}}^{({\mathrm {k}})}={\mathrm{MLP}}((1+\epsilon )\cdot {{{\mathbf{x}}}_{\mathrm{ i}}}^{\left({\mathrm {k}}-1\right)}+\mathop{\sum }\limits_{\mathrm{j}\in {\mathscr{N}}\left({\mathrm {i}}\right)}{{{\mathbf{x}}}_{\mathrm{j}}}^{\left({\mathrm {k}}-1\right)}),$$

(1)

where ${{\mathbf{x}}}_{{\mathrm i}}^{({\mathrm k}-1)}$ and ${{\mathbf{x}}}_{{\mathrm j}}^{({\mathrm k}-1)}$ are the vector of features for each node i and all nodes j belonging to the neighbourhood of i (${\mathscr{N}}({i})$) in the (k – 1) stacked layer. The MLP is built from piling several modules of ${\rm{ReLU}}\left({\rm{BN}}\left({\rm{Linear}}\left(\,\right)\right)\right)$, where ${\rm{ReLU}}$ refers to a rectified linear unit, ${\rm{BN}}$ is a batch normalization layer and ${\rm{Linear}}$ denotes a linear layer. The output of equation (1) was then passed through a ${\rm{DROPOUT}}\left(\,\right)$ layer to avoid overfitting. This is close to the original architecture³³. The parameter $\epsilon$ weighs the contribution of each node to its own embedding.

In our modified GIN (referred to as MIDAS GIN), we used a ${\mathrm{Linear}}\left(\,\right)$ layer instead of an MLP:

$${{\mathbf{x}}}_{\mathrm{ i}}^{({\mathrm {k}})}={\mathrm{DROPOUT}}({\mathrm{ReLU}}({\mathrm{BN}}({\mathrm{LINEAR}}(\left(1+\epsilon \right)\cdot {{{\mathbf{x}}}_{\mathrm{i}}}^{\left({\mathrm {k}}-1\right)}+\mathop{\sum }\limits_{\mathrm{j}\in {\mathscr{N}}\left({\mathrm {i}}\right)}{{{\mathbf{x}}}_{\mathrm{j}}}^{\left({\mathrm {k}}-1\right)})))),$$

(2)

The use of the dropout, rectified linear unit (ReLU) and batch normalization layers are justified in the Supplementary Methods. The sum was used in $\mathop{\sum }\limits_{\mathrm{j}\in {\mathscr{N}}\left({\mathrm {i}}\right)}{{{\mathbf{x}}}_{\mathrm{j}}}^{\left({\mathrm {k}}-1\right)}$ as this was proven to confer the largest expressivity to the GIN model³³. As mentioned above, each of these message passing layers was combined with the pre-processing (equation (3)) and post-processing (equation (4)) layers:

$${{\mathbf{x}}}_{\mathrm{i}}^{\mathrm{m}}={\mathrm{DROPOUT}}^{\left({\mathrm {m}}-1\right)}({\mathrm{ReLU}}^{\left({\mathrm{m}}-1\right)}({\mathrm{BN}}^{\left({\mathrm{m}}-1\right)}({\mathrm{LINEAR}}^{\left({\mathrm{m}}-1\right)}({{\mathbf{x}}}_{\mathrm{i}}^{\left({\mathrm {m}}-1\right)})))),$$

(3)

$${{\mathbf{x}}}_{\mathrm{i}}^{\mathrm{n}}={\mathrm{DROPOUT}}^{\left({\mathrm {n}}-1\right)}({\mathrm{ReLU}}^{\left({\mathrm{n}}-1\right)}({\mathrm{BN}}^{\left({\mathrm{n}}-1\right)}({\mathrm{LINEAR}}^{\left({\mathrm{n}}-1\right)}({{\mathbf{x}}}_{\mathrm{i}}^{\left({\mathrm {n}}-1\right)})))),$$

(4)

where m and n represent the number of stacked layers for each module, that is, pre-processing (equation (3)) and post-processing (equation (4)), respectively.

All models were assessed using a rigorous CV resampling strategy, as detailed later.

To improve the generalization ability of the models to external datasets under the inductive paradigm and to enhance training speed, all models were optimized for the number of sampling k-nearest neighbours, with the search being over k-nearest neighbour values of 50, 100 and 200 (ref. ³²). This affects the percentage of nodes in ${\mathscr{N}}{\mathscr{(}}i)$ that is taken into account during message passing. The top performance was achieved with the GIN model (k-nearest neighbours = 50) represented in equation (2) with pre-processing and post-processing layers (Supplementary Table 11 lists the optimal hyperparameters).

Interpretability of MIDAS graph models by permutation feature importance and degree-preserving graph null models

We quantified the importance of each feature using permutation analysis. The influence of a feature was assessed by examining the change in the prediction performance of the model, measured by the difference between the ROC-AUC on the CV training set and the mean ROC-AUC following 500 permutations of that feature. Each feature was permuted individually and then grouped into biological categories, meaning that some biological categories (which map to multiple features) will have >500 permutations for each fold. For data visualization purposes, the standard error was computed using the standard deviation across the differences between the original performance and that following each permutation. This provided a concise insight into the attributes of the model without necessitating further training.

We repeated this permutation analysis on the CV and held-out test sets to investigate the consistency across data splits. Here, the permutation feature importance scores are measured as the absolute difference in ROC-AUC between that computed on the original dataset and the mean across the corresponding permuted versions. Therefore, the original performance across the CV training folds (CV train performance) was compared against the performance across the permuted CV training folds, the original performance across the CV test folds (CV test performance) against the permuted CV test folds, and the original held-out test performance against the permuted version. As a further sensitivity analysis, we investigated how feature importances vary across specificity thresholds (0.7, 0.8 and 0.9). Model predictions were categorized to achieve these specificity values and the permutation feature importance was measured as the absolute change in recall post-permutation.

Additionally, we evaluated whether biological information encapsulated within the connectivity of the underlying network provided value to the model compared with a null distribution obtained by local edge swapping (500 times) whilst preserving the node degree distribution. Edge permutations were performed using the Python package Xswap (v.0.0.2 (ref. ¹⁰⁷)), associated with the Hetionet project²¹. The permutation importance was defined as the difference between the ROC-AUC of the training set and that following degree-preserving edge permutation.

We also used an additional interpretability strategy to determine the contribution of specific pathways to the output scores of nodes identified as potential targets. The graph structure reflects gene–gene interactions; therefore, gene nodes are connected if their products interact with each other. Consequently, it is naturally organized into biological pathways (since proteins involved in the same pathway interact to propagate signals, leading to the functional effect of that pathway^108,109). GNN message passing is constrained by these GiG edges and determines the model output. Therefore, information flow within the network occurs along interacting genes and, thus, the biological pathways to which they correspond.

Pathway permutation importance was calculated by permuting only edges that span gene nodes involved in a given pathway. To calculate pathway permutation importance for a specific predicted target, the target was first mapped to its manually curated pathways. For each pathway in turn, only the edges connecting genes belonging to that pathway were shuffled (in a degree-preserving manner), keeping all other edges constant. In this manner, only information flow within that specific biological pathway was corrupted, whereas flow through any other pathway remained unchanged.

This was repeated and the predicted score for the candidate target of interest was obtained for each of the 100°-preserving permutations for each pathway in which the target was involved. The importance of a pathway for target prediction was quantified as the median absolute normalized (as shown below) change in the model output score. Following the permutation of a given pathway, if there is a large change in the corresponding model prediction for the candidate target of interest, it suggests that this pathway highly influences the output prediction for that candidate target. On the other hand, if the output score was robust to the permutation of GiG edges within a specific pathway, it follows that this pathway is unlikely to be critical for predicting the candidate target of interest.

To account for the influence of pathway size on importance (Supplementary Fig. 4a), we normalized the results by the square root of the edge count (Supplementary Fig. 4b). This method assessed feature importance for the trained model and, therefore, did not retrain the model post-permutation. In this manner, the issue of redundant information contained in potentially collinear features is bypassed.

Train–test split for target prediction models

For the prediction of immuno-oncology targets, a dataset comprising a total of 15,261 genes was stratified into distinct subsets for training and validation purposes. Specifically, 75% of the genes (n = 11,442) were randomly allocated to the training set in which CV was performed. The held-out dataset, which included the remainder of the genes (n = 3,819), was excluded from all training and optimization processes. This data split was stratified by the immuno-oncology target status and missingness for all input variables, utilizing the MultilabelStratifiedShuffleSplit() function from the iterative-stratification package (v.0.1.7)¹¹⁰ to ensure balanced representation.

Since not all genes were present in the GiG network, a total of n = 11,919/15,261 (78.1% of the maximum possible) genes in the integrated dataset were used to develop the MIDAS variants. Within this subset of genes present in the GiG network, the training set comprised n = 8,933/11,442 (78.1%), whereas the held-out test set contained n = 2,986/3,819 (78.2%).

Resampling optimization strategies for target prediction models

Within each training fold, the allocation of genes to CV subsets was further stratified by the immuno-oncology target status. To address class imbalance, random undersampling of the majority class was implemented for each training fold using techniques from the imbalanced-learn package. Hyperparameter optimization for the ensemble meta-learners and GNN models was conducted under a robust framework of ten-times-repeated tenfold CV. Consistency was maintained across the meta-learner (Supplementary Methods) and GNN techniques by using the same random seed in the samplers, thereby facilitating a fair comparison of their performances.

For consistency, the ROC-AUC metric was used to evaluate model performance across validation folds and in the held-out set, as well as to compare against existing target discovery methods. Since the output probability scores were not categorized into discrete predicted labels (which would require optimizing the threshold value), metrics requiring distinct categorical class label predictions would be less suited to capturing model behaviour. Furthermore, the ROC-AUC measure is a standard metric in binary classification, including for approaches that do not rely on GNNs or deep learning^{5,22,70,111,112}. Additionally, it is independent of prevalence, unlike other metrics such as Matthew’s correlation coefficient and the precision–recall curve. Finally, we randomly undersample the majority set to balance the classes, justifying the use of the ROC-AUC.

Immuno-oncology candidate targets were subsequently ranked based on their probability of being a target, as predicted by both meta-learner (Supplementary Methods) and GNN models. Each model derived from the CV training folds was utilized to propose candidates from both the corresponding CV test folds and held-out dataset. This process generated probability distributions for each gene, from which the mean score was calculated (that is, bagging across CV folds) and used to rank the candidates. Therefore, individual predictions for a specific gene come only from CV fold models in which this gene was not included in any CV training folds. This methodology ensured a comprehensive evaluation and ranking of potential immuno-oncology targets based on their predicted likelihood of being true targets.

Benchmarking against alternative methods

We benchmarked our proposed immuno-oncology target discovery system against multiple alternative methods using the held-out test set. OpenTargets (https://www.opentargets.org/) scores were filtered to those for the indication ‘cancer’ and downloaded. Both direct evidence scores (which directly links a gene to a disease) and indirect evidence scores (leveraging disease hierarchical classifications to postulate target–disease links) were separately tested, as was a combination method that aggregated both scores by the median value. TargetDB is an RF model that predicts the tractability probability for all genes³⁴. It has an MPO tool for differentially weighting evidence sources (for example, structural data or genetic links). Both tractability estimates and MPO scores (with equal weights) were tested. DepMap (https://depmap.org/portal/) gene effect scores (which reflect the influence of gene knockout/knockdown on cell line viability) were downloaded for genes within the held-out test set and assessed for the ability to identify known immuno-oncology targets. We removed data for all carcinomas in situ or non-solid tumours, as our focus with MIDAS was to identify immunotherapy drug targets for established solid tumours. Finally, MAGeCK scores from the CRISPR cancer–CD8⁺ T cells co-cultures were investigated for immuno-oncology target identification. These ability of these alternative methods to identify immunotherapy drug targets was assessed, by the ROC-AUC, in a binary immuno-oncology target classification task.

In silico model validation

To robustly validate the target discovery models, we devised multiple external and orthogonal in silico validation exercises (Fig. 1c). First, we assessed model generalization by investigating predictions for genes that had entered stage I immuno-oncology clinical trials after we froze our dataset for model development in 2019. ClinicalTrials.gov (https://clinicaltrials.gov/) was searched (in November 2023) using the following parameters: condition/disease = cancer; intervention/treatment = immunotherapy; sex = all; age = all; study phase = early phase I or phase I; study type = interventional; earliest date: 01/01/2020. This retrieved n = 453 results.

We extracted n = 48 new, time-sliced targets (n = 36 in the graph system after accounting for gene exclusions due to missingness) that were not annotated as such during model training (Fig. 1c). MIDAS predictions for these time-sliced targets were compared with a null distribution generated by the mean prediction across 1,000 size-matched randomly sampled gene sets. To avoid artificially decreasing the values in the null distribution, we did not exclude the time-sliced targets when performing the random sampling. Empirical P values were used to assess statistical significance.

Next, we examined the model using orthogonal tasks. The first such task was whether models could discriminate targets that were clinically approved from those undergoing clinical trials (Fig. 1c). We extracted clinical phase information for targets from the Cancer Research Institute (https://isb-cgc.shinyapps.io/iatlas/), aggregating by the maximum phase for all trials addressing the same target. We did not include the time-sliced targets mentioned above in this analysis. Model predictions for targets in each group were then compared using the Kruskal–Wallis and Mann–Whitney U tests.

We then investigated whether our modelling systems could identify genes that were differentially expressed between patients who did and did not respond to CPI therapy (Fig. 1c). We used unseen bulk RNA count data from the CPI2500 cohort (an extension of our previous CPI1000+ cohort³), comprising n = 658 new patients (n = 129 had disease that responded to CPI, whereas n = 529 had disease that did not) from three studies that were not used during the development of either model.

DEGs were identified for pan-, lung (n = 59 R and n = 321 NR), bladder (n = 34 R and n = 108 NR) and renal (n = 36 R and n = 100 NR) cancers using the DESeq2 (v.1.34.0) R package¹¹³, adjusting for the effects of study and patient sex. All bladder cancer samples originated from the same study and, therefore, this variable was dropped from the DESeq2 analysis. For the pan-cancer setting, we further adjusted for tumour type. The intersections between these DEGs and the top 200 predicted targets from MIDAS GIN were computed and compared with that generated from a null distribution of 1,000 randomly sampled (stratified by expression), size-matched gene sets. The Spearman correlation between MIDAS predictions and the Wald test statistic for the top 1,000-ranked genes was also assessed in bins of n = 200 genes.

Candidate target triage

The top 300-ranked predictions were manually reviewed to exclude genes labelled as known immuno-oncology targets for training (excluding n = 101), as well as targets with obvious immune functions or targets unlikely to be relevant to immuno-oncology, to prioritize n = 43 potentially novel immunotherapy targets. The exclusion of highly obvious immune targets was confirmed through comparing the significant biological processes that were overrepresented between the initial ranked list and the resulting shortlist, observing that the vast majority of immune pathways were depleted (Supplementary Fig. 10).

From this shortlist, targets were excluded if they were a named target of a clinical candidate drug at phase I or above in an oncology indication (Cortellis, https://access.clarivate.com/login?app=cortellis; OpenTargets, https://www.opentargets.org/), were common essential genes or were not expressed in immune cells (Human Protein Atlas, https://www.proteinatlas.org/).

Targets were prioritized for literature review if immune cell expression was high globally or differentially in subsets. Reviewed targets were prioritized if there was literature supporting a plausible link to an immuno-oncology mechanism (PubMed, https://pubmed.ncbi.nlm.nih.gov/), and deprioritized if existing structure and druggability scores indicated a difficult-to-drug target (TargetDB³⁴, Cansar, https://cansar.ai/). Target selection for the ex vivo assays required tool compounds to be commercially available.

Functional validation of candidate immunotherapy targets

Study ethics and research compliances

The study involved materials from TRAcking Cancer Evolution through therapy (Rx) (TRACERx) melanoma: exploratory analysis of genomic signatures of progression in melanoma (TRACERx Melanoma) (IRAS:68421). The study was reviewed and approved by both the Royal Marsden Committee for Clinical Research (CCR) (CCR:3569) and the London-Chelsea Research Ethics Committee (REC) (REC: 11/LO/0003), and was performed in compliance with all relevant ethical regulations.

PDEs

Patient characteristics, patient-derived tumour material procurement, processing and cryopreservation

All patients provided written consent for tissue samples that were not required for diagnosis to be used for research purposes. Patient-derived materials were collected from patients diagnosed with melanoma at the Royal Marsden Hospital. Patient characteristics are described in Supplementary Tables 9 and 10.

Tumour tissues were obtained from surgical resections and were macroscopically selected by a pathologist. Parts of the tumour were collected in an ice-cold collection medium (University of Wisconsin Solution (UW Solution, Bridge to Life) supplemented with 100 μg ml⁻¹ of Primocin (InvivoGen)) for subsequent tumour processing and cryopreservation. Tumour materials were immediately processed by manual sectioning into small fragments of 1–2-mm³ size on a CoolBox XT Workstation (Corning).

After processing, fragments from different spatial regions were mixed together (to minimize heterogeneity across PDEs derived from the same patient sample) and were frozen in cryovials containing 1 ml of 90% fetal bovine serum (FBS; Gibco) and 10% dimethyl sulfoxide (DMSO; Sigma-Aldrich) with 15 fragments per vial. All samples were cryopreserved in liquid-phase nitrogen until future usage.

Human PDE cultures

For each well of a 96-well plate, a single tumour fragment was embedded in the extracellular matrix (ECM) containing sodium bicarbonate (7.5% (Gibco)), Collagen I (final concentration, 1 mg ml⁻¹ (Corning)), Matrigel (final concentration, 4 mg ml⁻¹ (Corning)) and tumour medium (Dulbecco’s modified Eagle’s medium (Gibco) supplemented with 1 mM of sodium pyruvate (Sigma-Aldrich), 1× Minimum Essential Medium non-essential amino acids (Sigma-Aldrich), 1× GlutaMax (Gibco), 10% FBS and 1% penicillin–streptomycin). The ECM was prepared on ice by the slow mixing of the components in the order listed above. To each well of the plate, 40 µl of ECM was added and the plate transferred to a 37 °C incubator for ≥30 min to solidify.

To thaw the cryopreserved PDEs, vials were thawed in a 37 °C water bath until only a small amount of ice remained. Tumour fragments were then transferred to a 50 ml centrifuge tube and a prewarmed wash medium (Dulbecco’s modified Eagle’s medium (Gibco), 10% FBS and 1% penicillin–streptomycin) was slowly added up to 10 ml. The tumour fragments were then washed by transferring to a cell strainer and sequentially lowering the contained fragments into three wells of a six-well plate each containing 7 ml of wash medium.

After thawing and washing, one fragment was placed on top of the solidified ECM in each well and an additional 40 µl of the ECM was added on top, and then incubated for at least 30 min before subsequent treatment. After ECM solidification, 120 µl of the tumour medium was added to each well on top of the ECM.

For untreated wells, the tumour medium was supplemented with DMSO (1:1,000, Sigma-Aldrich) as a negative control for all experiments. For treated wells, the medium was supplemented with the PTPN22 inhibitor (11 µM, MedChem Express) or anti-OSM antibody (5 µg ml⁻¹, Bio-Techne) to perturb PTPN22 and OSM-OSMR, respectively.

Flow cytometry analysis of PDEs

For the analysis of T cell phenotype and activation states (Fig. 5a), PDEs were analysed by high-dimensional flow cytometry after culture using the following antibodies: BUV395 Mouse Anti-Human Ki-67 (RRID: AB_2738577, Clone: B56, BD Biosciences, 1:40), BUV496 Mouse Anti-Human CD8 (RRID: AB_2870223, Clone RPA-T8, BD Biosciences, 1:80), BUV563 Mouse Anti-Human CD45RA (RRID: AB_2870211, Clone: HI100, BD Biosciences, 1:80), BUV737 Mouse Anti-Human CD39 (RRID: AB_2738919, Clone: TU66, BD Biosciences, 1:20), BUV805 Mouse Anti-Human CD3 (RRID: AB_2870181, Clone: SK7, BD Biosciences, 1:40), BV480 Mouse Anti-Human CD103 (RRID: AB_2743774, Clone: Ber-ACT8, BD Biosciences, 1:40), BV711 Mouse Anti-Human HLA-DR (RRID: AB_2738378, Clone: G46-6, BD Biosciences, 1:40), BB790-P Anti-Human CD4 (RRID: N/A, Clone: SK3, BD Custom Conjugates, 1:160), Alexa Fluor 700 Mouse Anti-Human GZMB (RRID AB_1645453, Clone: GB11, BD Biosciences, 1:80), Brilliant Violet 421 Mouse Anti-Human CD279 (PD-1) (RRID: AB_10960742, Clone: SK3, BioLegend, 1:20), Brilliant Violet 650 Mouse Anti-Human CD197 (CCR7) (RRID: AB_2563867, Clone: G043H7, BioLegend, 1:10), Brilliant Violet 785 Mouse Anti-Human CD45 (RRID: AB_2563129, Clone: HI30, BioLegend, 1:20), FITC Anti-Human HLA-A, HLA-B, HLA-C (RRID: AB_314873, Clone: W6/32, BioLegend, 1:40), PE Mouse Anti-Human TCF1 (TCF-7) Antibody (RRID: AB_2728492, Clone: 7F11A10, BioLegend, 1:10), PE/Dazzle 594 Mouse Anti-Human CD137 (4-1BB) (RRID: AB_2566260, Clone: 4B4-1, BioLegend, 1:20), PE/Cyanine7 Mouse Anti-Human CD134 (OX40) (RRID: AB_10901161, Clone: Ber-ACT35, BioLegend, 1:20), APC Rat Anti-Human Foxp3 (RRID: AB_1603280, Clone: PCH101, Invitrogen, 1:40).

For the analysis of treatment effects, PDEs were manually retrieved from the ECM after 48 h of culturing. Tumour fragments were pooled for each condition and processed into single-cell suspensions by enzymatic digestion on a rotator at 37 °C for 45 min with a digestion mixture (RPMI 1640 supplemented with 1 mg ml⁻¹ of collagenase type IV (Sigma-Aldrich) and 25.2 µg ml⁻¹ of DNAse I (Sigma-Aldrich)). Samples were subsequently washed with ice-cold PBS, followed by manual mashing through a 100-µm filter (Miltenyi Biotec).

For flow cytometry staining, cells were Fc-blocked with Human TruStain FcX Fc Receptor Blocking Solution (BioLegend) for 20 min at room temperature together with chemokine receptor staining (Brilliant Violet 650 Mouse Anti-Human CD197 (CCR7)). Antibodies for surface markers (BUV496 Mouse Anti-Human CD8, BUV563 Mouse Anti-Human CD45RA, BUV737 Mouse Anti-Human CD39, BUV805 Mouse Anti-Human CD3, BV480 Mouse Anti-Human CD103, BV711 Mouse Anti-Human HLA-DR, BB790-P Anti-Human CD4, BV785 Mouse Anti-Human CD45, BV421 Mouse Anti-Human CD279 (PD-1), FITC Anti-Human HLA-A, HLA-B, HLA-C, PE/Dazzle 594 Mouse Anti-Human CD137 (4-1BB), PE/Cyanine7 Mouse Anti-Human CD134 (OX40) and eBioscience Fixable Viability Dye eFluor 780) were prepared in a mixture of Brilliant Staining Buffer Plus (BD Biosciences) and FACS buffer (PBS containing 2% FBS and 5 mM of EDTA). Blocked cells were stained in the antibody mix for 30 min at 4 °C.

For intracellular staining, the cells were washed three times and then fixed and permeabilized using the FOXP3 Transcription Factor Staining Buffer set (Thermo Fisher Scientific) at room temperature in the dark for 30 min. Cells were again washed three times and then resuspended in 50 µl of staining mix containing antibodies for intracellular markers (BUV395 Mouse Anti-Human Ki-67, BUV496 Mouse Anti-Human CD8, BUV805 Mouse Anti-Human CD3, BB790-P Anti-Human CD4, Alexa Fluor 700 Mouse Anti-Human GZMB, PE Mouse Anti-Human TCF1 (TCF-7) and APC Rat Anti-Human Foxp3 antibodies) and incubated at room temperature for 30 min. The samples were washed three times before acquisition.

Samples were acquired on a BD FACSymphony A5 (BD Biosciences) and data collected using the BD FACS Diva software v.9.1 (BD Biosciences)¹¹⁴, with further analysis performed using FlowJo (v.10.9.0), Prism (v.10.0.2; GraphPad) and R (v.4.1.3). All data were analysed using paired two-sided Wilcoxon signed rank tests. An example of the gating strategy is shown in Supplementary Fig. 11.

Cytokine and chemokine analysis

For the analysis of chemokines and cytokines, the explant culture supernatants were collected after 48 h of culturing. The supernatants from each condition were pooled and centrifuged at 450g for 5 min twice to remove any debris. Supernatants were immediately frozen and stored at −80 °C until future usage. The concentration of the analytes was quantified using LEGENDplex cytometric bead array. Supernatants were thawed on ice, and the presence of the indicated cytokines and chemokines was detected using the LEGENDplex 14-plex customized Human Proinflammatory Chemokine and Human CD8/NK panels (BioLegend). The assay was performed, and the samples were acquired on the Novocyte Quanteon flow cytometer (Agilent Technologies) according to the manufacturer’s instructions. The concentrations of the indicated analytes were quantified using LEGENDplex cloud-based data analysis. All data were analysed using paired two-sided Wilcoxon signed rank tests in R (v.4.1.3).

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The following datasets are publicly available: single-cell transcriptomics (Methods provide full details on individual studies), HLA-peptidomics²⁸, CRISPR co-cultures^18,19, GWAS catalogue²⁰ and Hetionet²¹. Node permutation data for Fig. 3 are available via GitHub at http://github.com/augustgm/MIDAS. Bulk sequencing patient cohorts are described elsewhere: CPI1000+ (ref. ³) and CPI2500 (paper in preparation). Further information and requests for resources should be directed to C.S. PDEs were derived from metastatic melanoma samples obtained in the TRACERx Melanoma trial. Due to patient confidentiality and sample limitation restrictions, these biological materials are not available. Source data are provided with this paper.

Code availability

The source code to train the MIDAS GIN model and perform interpretability analyses using a demonstration dataset is available via GitHub at https://github.com/augustgm/MIDAS (ref. ¹¹⁵). The repository also contains code to recreate all the main figures and is publicly available as of the date of publication.

References

Sharma, P. et al. The next decade of immune checkpoint therapy. Cancer Discov. 838, 838–857 (2021).
Chowell, D. et al. Improved prediction of immune checkpoint blockade efficacy across multiple cancer types. Nat. Biotechnol. 40, 499–506 (2021).
Article Google Scholar
Litchfield, K. et al. Meta-analysis of tumor- and T cell-intrinsic mechanisms of sensitization to checkpoint inhibition. Cell 184, 596–614.e14 (2021).
Article Google Scholar
Powley, I. R. et al. Patient-derived explants (PDEs) as a powerful preclinical platform for anti-cancer drug and biomarker discovery. Br. J. Cancer 122, 735–744 (2020).
Elmarakeby, H. A. et al. Biologically informed deep neural network for prostate cancer discovery. Nature 598, 348–352 (2021).
Article Google Scholar
Gogleva, A. et al. Knowledge graph-based recommendation framework identifies drivers of resistance in EGFR mutant non-small cell lung cancer. Nat. Commun. 13, 1667 (2022).
Article Google Scholar
Paliwal, S., de Giorgio, A., Neil, D., Michel, J. B. & Lacoste, A. M. Preclinical validation of therapeutic targets predicted by tensor factorization on heterogeneous graphs. Sci. Rep. 10, 18250 (2020).
Article Google Scholar
Ren, F. et al. A small-molecule TNIK inhibitor targets fibrosis in preclinical and clinical models. Nat. Biotechnol. 43, 63–75 (2025).
Kamya, P. et al. PandaOmics: an AI-driven platform for therapeutic target and biomarker discovery. J. Chem. Inf. Model. 64, 3961–3969 (2023).
Article Google Scholar
Vamathevan, J. et al. Applications of machine learning in drug discovery and development. Nat. Rev. Drug Discov. 18, 463–477 (2019).
Article Google Scholar
Deng, C., Ji, X., Rainey, C., Zhang, J. & Lu, W. Integrating machine learning with human knowledge. iScience 23, 101656 (2020).
Article Google Scholar
Kreitmaier, P., Katsoula, G. & Zeggini, E. Insights from multi-omics integration in complex disease primary tissues. Trends Genet. 39, 46–58 (2023).
Article Google Scholar
Chen, C. et al. Applications of multi-omics analysis in human diseases. MedComm 4, e315 (2023).
Article Google Scholar
Chakraborty, S., Hosen, M. I., Ahmed, M. & Shekhar, H. U. Onco-Multi-OMICS approach: a new frontier in cancer research. Biomed Res. Int. 2018, 9836256 (2018).
Article Google Scholar
McGranahan, N. et al. Allele-specific HLA loss and immune escape in lung cancer evolution. Cell 171, 1259–1271.e11 (2017).
Article Google Scholar
Nixon, B. G. et al. Tumor-associated macrophages expressing the transcription factor IRF8 promote T cell exhaustion in cancer. Immunity 55, 2044–2058.e5 (2022).
Article Google Scholar
Xiong, D., Wang, Y. & You, M. A gene expression signature of TREM2^hi macrophages and γδ T cells predicts immunotherapy response. Nat. Commun. 11, 5084 (2020).
Article Google Scholar
Vredevoogd, D. W. et al. Augmenting immunotherapy impact by lowering tumor TNF cytotoxicity threshold. Cell 178, 585–599.e15 (2019).
Article Google Scholar
Lawson, K. A. et al. Functional genomic landscape of cancer-intrinsic evasion of killing by T cells. Nature 586, 120–126 (2020).
Article Google Scholar
Buniello, A. et al. The NHGRI-EBI GWAS catalog of published genome-wide association studies, targeted arrays and summary statistics 2019. Nucleic Acids Res. 47, D1005–D1012 (2019).
Article Google Scholar
Himmelstein, D. S. & Baranzini, S. E. Heterogeneous network edge prediction: a data integration approach to prioritize disease-associated genes. PLoS Comput. Biol. 11, e1004259 (2015).
Article Google Scholar
Zhang, Y. et al. An end-to-end method for predicting compound-protein interactions based on simplified homogeneous graph convolutional network and pre-trained language model. J. Cheminform. 16, 67 (2024).
Article Google Scholar
Hingorani, A. D. et al. Improving the odds of drug development success through human genomics: modelling study. Sci. Rep. 9, 18911 (2019).
Article Google Scholar
Nelson, M. R. et al. The support of human genetic evidence for approved drug indications. Nat. Genet. 47, 856–860 (2015).
Article Google Scholar
Minikel, E. V., Painter, J. L., Dong, C. C. & Nelson, M. R. Refining the impact of genetic evidence on clinical success. Nature 629, 624–629 (2024).
Article Google Scholar
Fitzsimons, E. et al. A pan-cancer single-cell RNA-seq atlas of intratumoral B cells. Cancer Cell 42, 1784–1797.e4 (2024).
Article Google Scholar
Coulton, A. et al. Using a pan-cancer atlas to investigate tumour associated macrophages as regulators of immunotherapy response. Nat. Commun. 15, 5665 (2024).
Article Google Scholar
Bulik-Sullivan, B. et al. Deep learning using tumor HLA peptide mass spectrometry datasets improves neoantigen identification. Nat. Biotechnol. 37, 55–63 (2018).
Article Google Scholar
Bekker, J. & Davis, J. Learning from positive and unlabeled data: a survey. Mach. Learn. 109, 719–760 (2020).
Article MathSciNet Google Scholar
Eddy, J. A. et al. CRI iAtlas: an interactive portal for immuno-oncology research. F1000Research 9, 1028 (2020).
Article Google Scholar
Wouters, O. J., McKee, M. & Luyten, J. Estimated research and development investment needed to bring a new medicine to market, 2009–2018. JAMA 323, 844–853 (2020).
Hamilton, W. L., Ying, R. & Leskovec, J. Inductive representation learning on large graphs. In Proc. Advances in Neural Information Processing Systems Vol. 30 (eds Guyon, I. et al.) 1024–1034 (Curran Associates, 2017).
Xu, K., Hu, W., Leskovec, J. & Jegelka, S. How powerful are graph neural networks? In Proc. 7th International Conference on Learning Representations 9104–9120 (OpenReview.net, 2019).
de Cesco, S., Davis, J. B. & Brennan, P. E. TargetDB: a target information aggregation tool and tractability predictor. PLoS ONE 15, e0232644 (2020).
Article Google Scholar
Cornelissen, C., Lüscher-Firzlaff, J., Baron, J. M. & Lüscher, B. Signaling by IL-31 and functional consequences. Eur. J. Cell Biol. 91, 552–566 (2012).
Article Google Scholar
Brownlie, R. J., Wright, D., Zamoyska, R. & Salmond, R. J. Deletion of PTPN22 improves effector and memory CD8+ T cell responses to tumors. JCI Insight 4, e127847 (2019).
Cubas, R. et al. Autoimmunity linked protein phosphatase PTPN22 as a target for cancer immunotherapy. J. Immunother. Cancer 8, e001439 (2020).
Article Google Scholar
Teagle, A. R. et al. Deletion of the protein tyrosine phosphatase PTPN22 for adoptive T cell therapy facilitates CTL effector function but promotes T cell exhaustion. J. Immunother. Cancer 11, e007614 (2023).
Article Google Scholar
Ho, W. J. et al. Systemic inhibition of PTPN22 augments anticancer immunity. J. Clin. Invest. 131, e146950 (2021).
Araujo, A. M. et al. Stromal oncostatin M cytokine promotes breast cancer progression by reprogramming the tumor microenvironment. J. Clin. Invest. 132, e148667 (2022).
Lee, B. Y. et al. Heterocellular OSM-OSMR signalling reprograms fibroblasts to promote pancreatic cancer growth and metastasis. Nat. Commun. 12, 7336 (2021).
Article Google Scholar
Shrivastava, R. et al. M2 polarization of macrophages by oncostatin M in hypoxic tumor microenvironment is mediated by mTORC2 and promotes tumor growth and metastasis. Cytokine 118, 130–143 (2019).
Article Google Scholar
Wu, J. et al. Identification of substrates of human protein-tyrosine phosphatase PTPN22. J. Biol. Chem. 281, 11002–11010 (2006).
Article Google Scholar
Fritz, D. K. et al. A mouse model of airway disease: oncostatin M-induced pulmonary eosinophilia, goblet cell hyperplasia, and airway hyperresponsiveness are STAT6 dependent, and interstitial pulmonary fibrosis is STAT6 independent. J. Immunol. 186, 1107–1118 (2011).
Article Google Scholar
Akkenapally, S. V. et al. IFNγ, IL-4, and IL-13 upregulate IL-31 receptor alpha in airway smooth muscle cells to induce airway hyperresponsiveness in asthma. J. Immunol. 210, 67.18 (2023).
Article Google Scholar
Fritz, D. K., Kerr, C., Botelho, F., Stampfli, M. & Richards, C. D. Oncostatin M (OSM) primes IL-13- and IL-4-induced eotaxin responses in fibroblasts: regulation of the type-II IL-4 receptor chains IL-4Rα and IL-13Rα1. Exp. Cell Res. 315, 3486–3499 (2009).
Article Google Scholar
Shirota, H. et al. IL4 from T follicular helper cells downregulates antitumor immunity. Cancer Immunol. Res. 5, 61–71 (2017).
Article Google Scholar
Parveen, S. et al. Therapeutic targeting with DABIL-4 depletes myeloid suppressor cells in 4T1 triple-negative breast cancer model. Mol. Oncol. 15, 1330–1344 (2021).
Article Google Scholar
Voabil, P. et al. An ex vivo tumor fragment platform to dissect response to PD-1 blockade in cancer. Nat. Med. 27, 1250–1261 (2021).
Article Google Scholar
Kaptein, P. et al. Addition of interleukin-2 overcomes resistance to neoadjuvant CTLA4 and PD1 blockade in ex vivo patient tumors. Sci. Transl. Med. 14, 9779 (2022).
Article Google Scholar
Váraljai, R. et al. Interleukin 17 signaling supports clinical benefit of dual CTLA-4 and PD-1 checkpoint inhibition in melanoma. Nat. Cancer 4, 1292–1308 (2023).
Article Google Scholar
Kaptein, P. et al. CD8-targeted IL2 unleashes tumor-specific immunity in human cancer tissue by reviving the dysfunctional T-cell pool. Cancer Discov. 14, 1226–1251 (2024).
Collins, A. et al. Development of a patient-derived explant model for prediction of drug responses in endometrial cancer. Gynecol. Oncol. 160, 557–567 (2021).
Article Google Scholar
Shekarian, T. et al. Immunotherapy of glioblastoma explants induces interferon-γ responses and spatial immune cell rearrangements in tumor center, but not periphery. Sci. Adv. 8, 9440 (2022).
Article Google Scholar
Vendramin, R. et al. Nonsense-mediated mRNA decay inhibition reshapes the cancer immunopeptidome. Immunity https://doi.org/10.1016/j.immuni.2026.02.005 (2026).
van der Leun, A. M., Thommen, D. S. & Schumacher, T. N. CD8+ T cell states in human cancer: insights from single-cell analysis. Nat. Rev. Cancer. 20, 218–232 (2020).
de la Fuente López, M. et al. The relationship between chemokines CCL2, CCL3, and CCL4 with the tumor microenvironment and tumor-associated macrophage markers in colorectal cancer. Tumour Biol. 40, 1010428318810059 (2018).
Newman, A. M. et al. Robust enumeration of cell subsets from tissue expression profiles. Nat. Methods 12, 453–457 (2015).
Article Google Scholar
Tripathi, C. et al. Macrophages are recruited to hypoxic tumor areas and acquire a pro-angiogenic M2-polarized phenotype via hypoxic cancer cell derived cytokines oncostatin M and eotaxin. Oncotarget 5, 5350–5368 (2014).
Article Google Scholar
Komori, T., Tanaka, M., Senba, E., Miyajima, A. & Morikawa, Y. Deficiency of oncostatin M receptor β (OSMRβ) exacerbates high-fat diet-induced obesity and related metabolic disorders in mice. J. Biol. Chem. 289, 13821–13837 (2014).
Article Google Scholar
Komori, T., Tanaka, M., Senba, E., Miyajima, A. & Morikawa, Y. Lack of oncostatin M receptor β leads to adipose tissue inflammation and insulin resistance by switching macrophage phenotype. J. Biol. Chem. 288, 21861–21875 (2013).
Article Google Scholar
Dubey, A. et al. Separate roles of IL-6 and oncostatin M in mouse macrophage polarization in vitro and in vivo. Immunol. Cell Biol. 96, 257–272 (2018).
Article Google Scholar
Yuan, Y. et al. Oncostatin M regulates macrophages polarization in osseointegration via yes-associated protein. Int. Immunopharmacol. 120, 110348 (2023).
Article Google Scholar
Ayaub, E. A. et al. Overexpression of OSM and IL-6 impacts the polarization of pro-fibrotic macrophages and the development of bleomycin-induced lung fibrosis. Sci. Rep. 7, 13281 (2017).
Article Google Scholar
Wong, C. H., Siah, K. W. & Lo, A. W. Estimation of clinical trial success rates and related parameters. Biostatistics 20, 273–286 (2019).
Article MathSciNet Google Scholar
Reid, J. et al. In vivo affinity and target engagement in skin and blood in a first-time-in-human study of an anti-oncostatin M monoclonal antibody. Br. J. Clin. Pharmacol. 84, 2280–2291 (2018).
Article Google Scholar
Denton, C. P. et al. Biological and clinical insights from a randomized phase 2 study of an anti-oncostatin M monoclonal antibody in systemic sclerosis. Rheumatology 62, 234–242 (2022).
Article Google Scholar
Choy, E. H. et al. Safety, tolerability, pharmacokinetics and pharmacodynamics of an anti- oncostatin M monoclonal antibody in rheumatoid arthritis: results from phase II randomized, placebo-controlled trials. Arthritis Res. Ther. 15, R132 (2013).
Article Google Scholar
Wolf, C. L., Pruett, C., Lighter, D. & Jorcyk, C. L. The clinical relevance of OSM in inflammatory diseases: a comprehensive review. Front. Immunol. 14, 1239732 (2023).
Article Google Scholar
Wang, T. et al. MOGONET integrates multi-omics data using graph convolutional networks allowing patient classification and biomarker identification. Nat. Commun. 12, 3445 (2021).
Schulte-Sasse, R. et al. Integration of multiomics data with graph convolutional networks to identify new cancer genes and their associated molecular mechanisms. Nat. Mach. Intell. 3, 513–526 (2021).
Amara, K. et al. GraphFramEx: towards systematic evaluation of explainability methods for graph neural networks. In Proc. 1st Learning on Graphs Conference Vol. 198 (eds Rieck, B. & Pascanu, R.) 44:1–44:23 (PMLR, 2022).
Agarwal, C., Queen, O., Lakkaraju, H. & Zitnik, M. Evaluating explainability for graph neural networks. Sci. Data 10, 144 (2023).
Article Google Scholar
Li, K. et al. Multi-omic analyses of changes in the tumor microenvironment of pancreatic adenocarcinoma following neoadjuvant treatment with anti-PD-1 therapy. Cancer Cell 40, 1374–1391.e7 (2022).
Article Google Scholar
Tivol, E. A. et al. Loss of CTLA-4 leads to massive lymphoproliferation and fatal multiorgan tissue destruction, revealing a critical negative regulatory role of CTLA-4. Immunity 3, 541–547 (1995).
Article Google Scholar
Nishimura, H., Nose, M., Hiai, H., Minato, N. & Honjo, T. Development of lupus-like autoimmune diseases by disruption of the PD-1 gene encoding an ITIM motif-carrying immunoreceptor. Immunity 11, 141–151 (1999).
Article Google Scholar
Mangani, D., Yang, D. & Anderson, A. C. Learning from the nexus of autoimmunity and cancer. Immunity 56, 256–271 (2023).
Article Google Scholar
Arce Vargas, F. et al. Fc-optimized anti-CD25 depletes tumor-infiltrating regulatory T cells and synergizes with PD-1 blockade to eradicate established tumors. Immunity 46, 577–586 (2017).
Article Google Scholar
Dannull, J. et al. Enhancement of vaccine-mediated antitumor immunity in cancer patients after depletion of regulatory T cells. J. Clin. Invest. 115, 3623–3633 (2005).
Article Google Scholar
Espinosa-Carrasco, G. et al. Intratumoral immune triads are required for immunotherapy-mediated elimination of solid tumors. Cancer Cell 42, 1202–1216.e8 (2024).
Mordelet, F. & Vert, J. P. A bagging SVM to learn from positive and unlabeled examples. Pattern Recognit. Lett. 37, 201–209 (2014).
Article Google Scholar
Claesen, M., De Smet, F., Suykens, J. A. K. & De Moor, B. A robust ensemble approach to learn from positive and unlabeled data using SVM base models. Neurocomputing 160, 73–84 (2015).
Article Google Scholar
Wang, P., Li, S. & Pan, R. Incorporating GAN for negative sampling in knowledge representation learning. In Proc. AAAI Conference on Artificial Intelligence Vol. 32 2005–2012 (AAAI Press, 2018).
Bolstad, B. preprocessCore: a collection of pre-processing functions. R package v.1.56.0. GitHub https://github.com/bmbolstad/preprocesscore (2021).
Krishna, C. et al. Single-cell sequencing links multiregional immune landscapes and tissue-resident T cells in ccRCC to tumor topology and therapy efficacy, Cancer Cell 39, 662–677.e6 (2021).
Maynard, A. et al. Therapy-induced evolution of human lung cancer revealed by single-cell RNA sequencing. Cell 182, 1232–1251.e22 (2020).
Qian, J. et al. A pan-cancer blueprint of the heterogeneous tumor microenvironment revealed by single-cell profiling. Cell. Res. 30, 745–762 (2020).
Braun, D. A. et al. Progressive immune dysfunction with advancing disease stage in renal cell carcinoma. Cancer Cell 39, 632–648.e8 (2021).
Article Google Scholar
Chan, J. M. et al. Signatures of plasticity, metastasis, and immunosuppression in an atlas of human small cell lung cancer. Cancer Cell 39, 1479–1496.e18 (2021).
Article Google Scholar
Cheng, S. et al. A pan-cancer single-cell transcriptional atlas of tumor infiltrating myeloid cells. Cell 184, 792–809.e23 (2021).
Article Google Scholar
Hafemeister, C. & Satija, R. Normalization and variance stabilization of single-cell RNA-seq data using regularized negative binomial regression. Genome Biol. 20, 296 (2019).
Article Google Scholar
Hao, Y. et al. Integrated analysis of multimodal single-cell data. Cell 184, 3573–3587.e29 (2021).
Article Google Scholar
Mohammadi, S., Davila-Velderrain, J. & Kellis, M. Reconstruction of cell-type-specific interactomes at single-cell resolution. Cell Syst. 9, 559–568.e4 (2019).
Article Google Scholar
Harpsøe, M. C. et al. Body mass index and risk of autoimmune diseases: a study within the Danish National Birth Cohort. Int. J. Epidemiol. 43, 843–855 (2014).
Article Google Scholar
Li, W. et al. MAGeCK enables robust identification of essential genes from genome-scale CRISPR/Cas9 knockout screens. Genome Biol. 15, 554 (2014).
Article Google Scholar
Wang, J. et al. WebGestaltR: gene set analysis toolkit. CRAN https://doi.org/10.32614/cran.package.webgestaltr (2023).
Wu, G. & Haw, R. Functional interaction network construction and analysis for disease discovery. Methods Mol. Biol. 1558, 235–253 (2017).
Article Google Scholar
Robin, X. et al. pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinform. 12, 77 (2011).
Article Google Scholar
Li, B. & Dewey, C. N. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinform. 12, 323 (2011).
You, J., Ying, Z. & Leskovec, J. Design space for graph neural networks. In Proc. Advances in Neural Information Processing Systems Vol. 33 (eds Larochelle, H. et al.) 17009–17021 (Curran Associates, 2020).
Veličković, P. et al. Graph attention networks. In Proc. 6th International Conference on Learning Representations (OpenReview.net, 2018).
Kipf, T. N. & Welling, M. Semi-supervised classification with graph convolutional networks. In Proc. 5th International Conference on Learning Representations (OpenReview.net, 2017).
Sandler, M., Howard, A., Zhu, M., Zhmoginov, A. & Chen, L. C. MobileNetV2: inverted residuals and linear bottlenecks. In Proc. IEEE Conference on Computer Vision and Pattern Recognition 4510–4520 (IEEE Computer Society, 2018).
Vaswani, A. et al. Attention is all you need. In Proc. Advances in Neural Information Processing Systems Vol. 30 (eds Guyon, I. et al.) 5998–6008 (Curran Associates, 2017).
Akiba, T., Sano, S., Yanase, T., Ohta, T. & Koyama, M. Optuna: a next-generation hyperparameter optimisation framework. In Proc. 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (eds Teredesai, A. et al.) 2623–2631 (ACM, 2019).
Fey, M. & Lenssen, J. E. Fast graph representation learning with PyTorch Geometric. In Proc. ICLR Workshop on Representation Learning on Graphs and Manifolds (OpenReview.net, 2019).
Hanhijärvi, S., Garriga, G. C. & Puolamäki, K. Randomization techniques for graphs. In Proc. SIAM International Conference on Data Mining (eds Apte, C. et al.) 780–791 (SIAM, 2009).
von Mering, C. et al. Comparative assessment of large-scale data sets of protein-protein interactions. Nature 417, 399–403 (2002).
Article Google Scholar
Batada, N. N., Shepp, L. A. & Siegmund, D. O. Stochastic model of protein–protein interaction: why signaling proteins need to be colocalized. Proc. Natl Acad. Sci. USA 101, 6445–6449 (2004).
Sechidis, K., Tsoumakas, G. & Vlahavas, I. On the stratification of multi-label data. In Proc. Machine Learning and Knowledge Discovery in Databases (eds Gunopulos, D. et al.) 145–158 (Springer, 2011).
Mamoshina, P. et al. Machine learning on human muscle transcriptomic data for biomarker discovery and tissue-specific drug target identification. Front. Genet. 9, 378508 (2018).
Ferrero, E., Dunham, I. & Sanseau, P. In silico prediction of novel therapeutic targets using gene-disease association data. J. Transl. Med. 15, 1–16 (2017).
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 550 (2014).
BD FACSDiva v.9.1 (BD Biosciences, 2026).
Augustine, M. & Nene, N. R. MIDAS: v.1.0.2. Zenodo https://doi.org/10.5281/zenodo.19495670 (2026).

Download references

Acknowledgements

We thank the patients and relatives who participated in the TRACERx Melanoma study. We thank B. Shum, J. Korteweg, A. Murra, L. Terry, D. Kelly and the Biospecimen Collection Team at the Royal Marsden Hospital for facilitating the collection of patient samples from TRACERx Melanoma and the Francis Crick Institute flow cytometry facility for assistance with the flow cytometric analyses. We acknowledge funding from a CRUK Therapeutic Catalyst award (SEBSTF-2022/100004). We thank J. Hlond for her artistic advice regarding figure style. M.A. is supported by the City of London Centre Clinical Academic Training Programme (Year 3, SEBSTF-2021\100007). H.F is funded by the National Institute for Health and Care Research (NIHR) Manchester Biomedical Research Centre (BRC) (NIHR203308), and receives funding from a CRUK Discovery Research Committee Renewing Programme (DRCRPG-Nov24/100006). C.L.P. was partly funded by The Tom Prince Trust and RNOH research funds. A.P.S. and I.S.-F. are funded by CRUK (CRUK Biotherapeutic Program grant (C36463/A20764)). E.F. is supported by a Cancer Research UK Non-Clinical Studentship (CANTAC721\100022). R.V. is funded by the UK Medical Research Council (MR/V033077/1), the Rosetrees Trust and Cotswold Trust (A2437), the Royal Marsden Cancer Charity (thanks to the Ross Russell family and Macfarlanes donations), Melanoma Research Alliance and Cancer Research UK (C69256/A30194). R.V. is funded by a NIHR UCLH BRC award (BRC1303CN/RV/101330). A.C. is a recipient of a Cancer Research Institute Irvington Postdoctoral Fellowship (#CRI4543). S.T. is funded by Cancer Research UK (A29911); the Francis Crick Institute, which receives its core funding from Cancer Research UK (FC10988), the UK Medical Research Council (FC10988) and the Wellcome Trust (FC10988); the National Institute for Health Research (NIHR) Biomedical Research Centre at the Royal Marsden Hospital and Institute of Cancer Research (grant reference number A109), the Royal Marsden Cancer Charity, The Rosetrees Trust (reference number A2204), Ventana Medical Systems Inc. (reference numbers 10467 and 10530), the National Institute of Health (U01 CA247439), Melanoma Research Alliance (reference number 686061), the US Department of Defense (award number W81XWH-22-1-0652) and VHL Alliance. S.A.Q. was funded by a Cancer Research UK (CRUK) Senior Cancer Research Fellowship (C36463/A22246), the CRUK Biotherapeutic Program grant (C36463/A20764) and was awarded a Medical Research Council grant (MR/W002337/1). N.M. receives funding from CRUK (DRCPFA-Nov23/100003) and has received funding from the Wellcome Trust and the Royal Society (211179/Z/18/Z) relevant to this work. N.M. also receives funding from CRUK Lung Cancer Centre of Excellence, Rosetrees and the NIHR BRC at University College London Hospitals. C.S. is a Royal Society Napier Research Professor (RSRP\R\210001). His work is supported by the Francis Crick Institute that receives its core funding from Cancer Research UK (CC2041), the UK Medical Research Council (CC2041), and the Wellcome Trust (CC2041). For the purpose of Open Access, the author has applied a CC BY public copyright license to any author accepted paper version arising from this submission. C.S. is funded by Cancer Research UK (TRACERx (C11496/A17786), PEACE (C416/A21999) and CRUK Cancer Immunotherapy Catalyst Network); Cancer Research UK Lung Cancer Centre of Excellence (C11496/A30025); the Rosetrees Trust, Butterfield and Stoneygate Trusts; NovoNordisk Foundation (ID16584); Royal Society Professorship Enhancement Award (RP/EA/180007 and RF\ERE\231118); National Institute for Health Research (NIHR) University College London Hospitals Biomedical Research Centre; the Cancer Research UK-University College London Centre; Experimental Cancer Medicine Centre; the Breast Cancer Research Foundation (US) (BCRF-23-157); Cancer Research UK Early Detection an Diagnosis Primer Award (Grant EDDPMA-Nov21/100034); and The Mark Foundation for Cancer Research Aspire Award (Grant 21-029-ASP) and ASPIRE Phase II award (Grant 23-034-ASP). CS is in receipt of an ERC Advanced Grant (PROTEUS) from the European Research Council under the European Union’s Horizon 2020 research and innovation programme (grant agreement number 835297). K.L. is funded by the UK Medical Research Council (MR/P014712/1 and MR/V033077/1), the Rosetrees Trust and Cotswold Trust (A2437) and CRUK (C69256/A30194).

Author information

Authors and Affiliations

Tumour Immunogenomics and Immunosurveillance (TIGI) Laboratory, University College London Cancer Institute, London, UK
Marcellus Augustine, Nuno Rocha Nene, Hongchang Fu, Christopher L. Pinder, Lorena Ligammari, Krupa Thakkar, Danwen Qian, Evelyn Fitzsimons, Benjamin S. Simpson, Roberto Vendramin, Andrea Castro & Kevin Litchfield
Cancer Evolution and Genome Instability Laboratory, The Francis Crick Institute, London, UK
Marcellus Augustine, Nuno Rocha Nene, Roberto Vendramin & Charles Swanton
Cancer Genome Evolution Research Group, University College London Cancer Institute, London, UK
Marcellus Augustine & Nicholas McGranahan
Cancer Research UK Lung Cancer Centre of Excellence, University College London Cancer Institute, London, UK
Marcellus Augustine, Nuno Rocha Nene, Hongchang Fu, Christopher L. Pinder, Lorena Ligammari, Alexander P. Simpson, Irene Sanz-Fernández, Krupa Thakkar, Danwen Qian, Evelyn Fitzsimons, Benjamin S. Simpson, Roberto Vendramin, Andrea Castro, Sergio A. Quezada, Nicholas McGranahan, Charles Swanton & Kevin Litchfield
Division of Medicine, University College London, London, UK
Marcellus Augustine
Department of Statistical Science, University College London, London, UK
Nuno Rocha Nene
Cancer Dynamics Laboratory, The Francis Crick Institute, London, UK
Hongchang Fu & Samra Turajlic
Skin and Renal Unit, Royal Marsden NHS Foundation Trust, London, UK
Hongchang Fu & Samra Turajlic
Cancer Immunology Unit, Research Department of Haematology, University College London Cancer Institute, London, UK
Alexander P. Simpson, Irene Sanz-Fernández & Sergio A. Quezada
Cancer Research Horizons, The Francis Crick Institute, London, UK
Heather Niederer
Cancer Dynamics Laboratory, Cancer Research UK Manchester Institute, The University of Manchester, Manchester, UK
Samra Turajlic
Cancer Research UK City of London Centre, University College London, London, UK
Sergio A. Quezada
Department of Computer Science, Royal Holloway, University of London, London, UK
Chris Watkins
Department of Oncology, University College London Hospitals, London, UK
Charles Swanton

Authors

Marcellus Augustine
View author publications
Search author on:PubMed Google Scholar
Nuno Rocha Nene
View author publications
Search author on:PubMed Google Scholar
Hongchang Fu
View author publications
Search author on:PubMed Google Scholar
Christopher L. Pinder
View author publications
Search author on:PubMed Google Scholar
Lorena Ligammari
View author publications
Search author on:PubMed Google Scholar
Alexander P. Simpson
View author publications
Search author on:PubMed Google Scholar
Irene Sanz-Fernández
View author publications
Search author on:PubMed Google Scholar
Krupa Thakkar
View author publications
Search author on:PubMed Google Scholar
Danwen Qian
View author publications
Search author on:PubMed Google Scholar
Evelyn Fitzsimons
View author publications
Search author on:PubMed Google Scholar
Benjamin S. Simpson
View author publications
Search author on:PubMed Google Scholar
Roberto Vendramin
View author publications
Search author on:PubMed Google Scholar
Andrea Castro
View author publications
Search author on:PubMed Google Scholar
Heather Niederer
View author publications
Search author on:PubMed Google Scholar
Samra Turajlic
View author publications
Search author on:PubMed Google Scholar
Sergio A. Quezada
View author publications
Search author on:PubMed Google Scholar
Nicholas McGranahan
View author publications
Search author on:PubMed Google Scholar
Chris Watkins
View author publications
Search author on:PubMed Google Scholar
Charles Swanton
View author publications
Search author on:PubMed Google Scholar
Kevin Litchfield
View author publications
Search author on:PubMed Google Scholar

Contributions

Original idea and conceptualization were done by M.A., C.S. and K.L. M.A. and N.R.N. implemented all the ML analyses. Data pre-processing and integration were performed by M.A. Processing of omics datasets was executed by M.A., K.T., D.Q., E.F., A.C. and B.S.S. Target prioritization and selection were conducted by M.A., C.L.P., H.N., R.V. and K.L. together with CRUK Cancer Research Horizons. Laboratory experiments were performed by H.F., A.P.S., I.S.-F. and L.L., supervised by C.L.P. and R.V., with data analysed and interpreted by M.A., H.F., C.L.P., H.N., A.P.S., I.S.-F. and R.V. The study was supervised by K.L., C.S., C.W., N.M., S.T. and S.A.Q. M.A. wrote the article and created the figures with contributions from all authors. All authors critically reviewed and approved the article.

Corresponding authors

Correspondence to Marcellus Augustine, Nuno Rocha Nene, Chris Watkins, Charles Swanton or Kevin Litchfield.

Ethics declarations

Competing interests

M.A. reports fees from Neuroute, FutureHouse and Edison Scientific, unrelated to this work. M.A., N.R.N. and C.S. are named as inventors of the patent PCT/EP2025/086701 relating to the use of plasma proteomics for risk prediction of lung cancer (unrelated to this paper). They are also listed as inventors on a patent application (GB) that has been filed but is not related to the method described in this paper. The application is currently unpublished and remains within the priority year. R.V. declares research funding from CRUK TDL–Ono–LifeArc alliance and Genesis Molecular AI. A.C. is a consultant for Tempus Labs. S.T. reports personal fees from Roche, Novartis, AstraZeneca and Ipsen outside the submitted work; and the following patents filed: indel mutations as a therapeutic target and predictive biomarker (PCTGB2018/051892 and PCTGB2018/051893) and clear-cell renal cell carcinoma biomarkers (P113326GB). N.M. has stock options in and has consulted for Achilles Therapeutics and holds a European patent in determining HLA LOH (PCT/GB2018/052004), a patent pending in determining HLA disruption (PCT/EP2023/059039) and is a co-inventor to a patent to identify responders to cancer treatment (PCT/GB2018/051912). C.S. acknowledges grant support from AstraZeneca, Boehringer-Ingelheim, Bristol Myers Squibb, Pfizer, Invitae (previously Archer Dx—collaboration in minimal residual disease sequencing technologies), Ono Pharmaceutical, and Personalis. He is also Co-Chief Investigator of the NHS Galleri trial funded by GRAIL and a paid member of GRAIL’s Scientific Advisory Board. He was Chief Investigator for the AZ MeRmaiD 1 and 2 clinical trials and the Steering Committee Chair. C.S is a paid board member for Novartis from March 2026. He is also a paid board member for Bicycle Therapeutics and is Chair of the Clinical Advisory Group. He receives consultant fees from Genentech, Medicxi, China Innovation Centre of Roche (CICoR) formerly Roche Innovation Centre – Shanghai, Relay Therapeutics (SAB member), Saga Diagnostics (SAB member), and Sarah Cannon Research Institute. He previously received consultant fees from Achilles Therapuetics. C.S has received honoraria from Amgen, AstraZeneca, Bristol Myers Squibb, GlaxoSmithKline, Illumina, MSD, Novartis and Pfizer. C.S. has equity in Bicycle Therapeutics. He has stock options in Novartis, Relay Therapeutics, Saga Diagnostics and Bicycle Therapeutics. He has previously held stock and was co-founder of Achilles Therapeutics. C.S declares a patent application for methods to lung cancer (PCT/US2017/028013); targeting neoantigens (PCT/EP2016/059401); identifying patent response to immune checkpoint blockade (PCT/EP2016/071471); methods for lung cancer detection (US20190106751A1); identifying patients who respond to cancer treatment (PCT/GB2018/051912); determining HLA LOH (PCT/GB2018/052004); predicting survival rates of patients with cancer (PCT/GB2020/050221); methods and systems for tumour monitoring (PCT/EP2022/077987); analysis of HLA alleles transcriptional deregulation (PCT/EP2023/059039); relating to the use of plasma proteomics for risk prediction of lung cancer (PCT/EP2025/086701). C.S. is an inventor on a European patent application (PCT/GB2017/053289) relating to assay technology to detect tumour recurrence. This patent has been licensed to a commercial entity and under their terms of employment C.S is due a revenue share of any revenue generated from such license(s). K.L. has the following disclosures (all unrelated to the current work): patent on indel burden and CPI response pending, patent on ctDNA minimal residual disease calling methods, patent pending on a lung cancer vaccine; speaker fees from Roche Tissue Diagnostics and Ellipses Pharma; research funding from CRUK TDL/Ono/LifeArc alliance and Genesis Therapeutics; and consulting roles with Monopteros Therapeutics, Saga Diagnostics, Kynos Therapeutics and Tempus Labs. Again unrelated to this work, K.L. is currently employed by Isomorphic Labs. The other authors declare no competing interests.

Peer review

Peer review information

Nature Machine Intelligence thanks Serkan Kir, Dong-Qing Wei and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information (download PDF )

Supplementary Note 1, Tables 1–11, Figs. 1–11 and Methods.

Reporting Summary (download PDF )

Supplementary Tables (download XLSX )

Supplementary Tables 1,6,7,10.

Supplementary Data (download XLSX )

Supplementary Data 1–10.

Source data

Source Data Fig. 2 (download XLSX )

Statistical source data.

Source Data Fig. 3 (download XLSX )

Statistical source data.

Source Data Fig. 4 (download XLSX )

Statistical source data.

Source Data Fig. 5 (download XLSX )

Statistical source data.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Augustine, M., Nene, N.R., Fu, H. et al. Immunotherapy drug target identification using machine learning and patient-derived tumour explant validation. Nat Mach Intell (2026). https://doi.org/10.1038/s42256-026-01201-3

Download citation

Received: 21 November 2024
Accepted: 04 February 2026
Published: 18 May 2026
Version of record: 18 May 2026
DOI: https://doi.org/10.1038/s42256-026-01201-3

Subjects

Abstract

Similar content being viewed by others

Main

Results

Multimodal graph ML system for novel immunotherapy drug target discovery

Graph ML system achieves robust performance across in silico immuno-oncology benchmarks

Global interpretability analysis reveals features informing immuno-oncology target prediction

MIDAS identifies candidate immunotherapy drug targets

Functional validation of novel immunotherapy drug targets

Discussion

Methods

Datasets

CPI1000+ cohort

scRNA-seq datasets

Cell-type-specific interactomes

EDGE immunopeptidomics cohort

GWAS catalogue

Genome-wide CRISPR co-culture screens

Pathway analyses

Statistical analysis

CIBERSORT analysis

MIDAS GNN models

Multimodal biomedical data integration

Positive class labels for model development

MIDAS GNN design space

MIDAS GIN architecture

Interpretability of MIDAS graph models by permutation feature importance and degree-preserving graph null models

Train–test split for target prediction models

Resampling optimization strategies for target prediction models

Benchmarking against alternative methods

In silico model validation

Candidate target triage

Functional validation of candidate immunotherapy targets

Study ethics and research compliances

PDEs

Patient characteristics, patient-derived tumour material procurement, processing and cryopreservation

Human PDE cultures

Flow cytometry analysis of PDEs

Cytokine and chemokine analysis

Reporting summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Supplementary information

Source data

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links