Explainable artificial intelligence of DNA methylation-based brain tumor diagnostics

Benfatto, Salvatore; Sill, Martin; Jones, David T. W.; Pfister, Stefan M.; Sahm, Felix; von Deimling, Andreas; Capper, David; Hovestadt, Volker

doi:10.1038/s41467-025-57078-0

Download PDF

Article
Open access
Published: 20 February 2025

Explainable artificial intelligence of DNA methylation-based brain tumor diagnostics

Nature Communications volume 16, Article number: 1787 (2025) Cite this article

13k Accesses
9 Citations
6 Altmetric
Metrics details

Subjects

Abstract

We have recently developed a machine learning classifier that enables fast, accurate, and affordable classification of brain tumors based on genome-wide DNA methylation profiles that is widely employed in the clinic. Neuro-oncology research would benefit greatly from understanding the underlying artificial intelligence decision process, which currently remains unclear. Here, we describe an interpretable framework to explain the classifier’s decisions. We show that functional genomic regions of various sizes are predominantly employed to distinguish between different tumor classes, ranging from enhancers and CpG islands to large-scale heterochromatic domains. We detect a high degree of genomic redundancy, with many genes distinguishing individual tumor classes, explaining the robustness of the classifier and revealing potential targets for further therapeutic investigation. We anticipate that our resource will build up trust in machine learning in clinical settings, foster biomarker discovery and development of compact point-of-care assays, and enable further epigenome research of brain tumors. Our interpretable framework is accessible to the research community via an interactive web application (https://hovestadtlab.shinyapps.io/shinyMNP/).

Artificial intelligence in neuro-oncology: advances and challenges in brain tumor diagnosis, prognosis, and precision treatment

Article Open access 29 March 2024

Rapid brain tumor classification from sparse epigenomic data

Article Open access 28 February 2025

Comparison of DNA methylation based classification models for precision diagnostics of central nervous system tumors

Article Open access 02 October 2024

Introduction

Cancer diagnosis has a pivotal role in clinical oncology. Precise tumor classification is required for making accurate prognoses and adequate treatment decisions. For many tumor types, histopathological evaluation represents the standard diagnostic method. However, over the last decades, characterization of genomic alterations has enabled more refined tumor classification and molecularly stratified treatments. Today, high throughput technologies generate genetic and epigenetic tumor landscapes at unprecedented resolution. In addition, the arrival of “big data” in medicine, and particularly in cancer diagnostics, has paved the way for the employment of machine learning (ML) algorithms in clinical routines^1,2. Trained on extensive reference datasets, these algorithms automate complex tasks while often surpassing human accuracy. Several recent ML applications have been successfully employed in the context of cancer diagnostics, for instance in the early detection of tumors from radiologic images³ or circulating cell-free tumor DNA⁴. Artificial intelligence (AI) will have a growing impact on diagnostic pathology in the future².

We and colleagues have recently developed a ML-based approach that utilizes array-based genome-wide DNA methylation profiles to enable fast, accurate, and affordable classification of central nervous system (CNS) tumors (known as the “Heidelberg brain tumor classifier”)⁵. Over 100 different molecular brain tumor classes have been recognized⁶ and their precise classification remains difficult even for experienced neuropathologists. Our widely employed ML-based approach supports clinicians in making precise diagnoses, and it is particularly helpful to solve challenging cases^{7,8,9,10,11,12,13}. Similar DNA methylation-based classifiers have been developed for other tumor types, including sarcoma, meningioma, sinonasal tumors, and intrahepatic bile duct cancer^14,15,16,17, demonstrating generalizability and reliability of the method¹⁸.

DNA methylation is a key epigenetic mark that plays an important role in regulating the phenotypic landscape during normal development and cancer^19,20,21. The etiology of CNS tumors is deeply connected to the development of the nervous system, and their high diversity mirrors the complexity of cellular phenotypes in the human brain. There is mounting evidence showing that the DNA methylation patterns of tumors reflect their respective cell of origin²², in addition to superimposed somatic epigenetic alterations that are specific to tumors²³. While genome-wide DNA methylation profiles are the basis for ML-based tumor classification, it remains largely unclear which specific patterns are being used for distinguishing classes²⁴.

Here, we present the development of an explainable artificial intelligence (XAI) framework to interpret the Heidelberg brain tumor classifier. Generally, XAI is motivated by the need for human interpretable models to better understand automated predictions, to gain insights into the underlying data, and to build up trust in AI^25,26. Our XAI framework enables the identification of tumor-specific epigenetic signatures, may reveal biomarkers, and provides insights into brain tumor biology. Moreover, we provide a publicly accessible web application for easy exploration between tumor classes, methylation sites, and associated genes to facilitate a detailed understanding of the models’ decisions.

Results

Development of an interpretable AI framework

The Heidelberg brain tumor classifier and other DNA methylation-based classifiers are based on the Random Forest (RF) algorithm which has been shown to be a very suitable ML algorithm to utilize high-dimensional genomic datasets^27,28,29. RF is an ensemble method that utilizes a multitude of binary decision trees trained on a subset of samples (“in-bag” samples) and features³⁰. For each splitting node of every decision tree, the algorithm selects the feature (i.e., a DNA methylation probe) that provides the best binary split. In this way RF naturally calculates feature importances, which has been shown to be a powerful metric to identify complex biological associations^31,32,33,34. We reasoned that quantifying the number of times a feature is selected during training provides us with a simple metric to create an interpretable framework that is capable of highlighting class-specific DNA methylation patterns. To this end, we retrieved the original dataset and the RF models of the Heidelberg brain tumor classifier. The dataset contains DNA methylation array profiles of 2801 samples corresponding to 82 tumor and 9 normal control classes (Fig. 1a; an overview of classes and their abbreviations is provided in Supplementary Fig. 1a and Table S1). For each sample, the DNA methylation status of 428,799 genomic sites (high-quality probes) is measured.

**Fig. 1: Overview of the explainable AI workflow.**

In the original work, a RF classifier was trained using all 428,799 probes (“outer” classifier) to select the 10,000 most informative probes for the final (“inner”) classifier (Fig. 1b, Supplementary Fig. 1b)⁵. We first investigated the outer RF model. Specifically, we analyzed each pairwise combination of tumor samples that were used for training a given tree (eight in-bag samples per class) and summarized the probe usage as the number of times a probe was selected to split their respective classes (see “Methods”). We used a positive sign if the probe was hypermethylated and a negative sign if the probe was hypomethylated (higher or lower than the threshold value, respectively) in the samples at the split. Probe usage was aggregated across all trees into a three-dimensional array of all 8281 (91 by 91) possible class combinations for each of the 428,799 probes (Fig. 1c). In total, the array is composed of 3.55 × 10⁹ data points. The absolute probe usage across all data points is 2.24 × 10¹¹, representing the total amount of splits between in-bag samples over all trees. This 3D array serves as the starting point for all the analyses and for the development of an interactive web application described below (Fig. 1d).

Global patterns of differential probe usage

We first investigated the global patterns of probe usage of the outer RF classifier. We found that a relatively small subset of probes contributed the majority of the total usage. The top 10,000 or 25,000 probes (2.3% or 5.8% of all probes) contributed to 61.2% or 78.1% of the probe usage across all class combinations, respectively (Fig. 2a). In contrast, the 250,000 least used probes (58.3%) contributed to only 1.3% of the total usage. Looking at probes that separated individual classes from all others, the top 10,000 probes contributed between 96.7% and 55.7% of the total usage (classes LIPN and LGG_PA_MID, respectively; see Table S1 for extended class annotations; Fig. 2b). There was high inequality in the probe usage for each class, as described by Gini indexes ranging from 0.99 for LIPN to 0.89 for LGG_PA/GG_ST (Supplementary Fig. 2a). These analyses indicate that the contribution of a majority of probes is negligible, while few probes are highly informative to distinguish between tumor classes. Among those, fewer probes are selected for the classification of biologically very distinct classes (e.g., LIPN or ETMR), while for more closely related classes the model makes use of a larger number of probes to discriminate between those classes (e.g., the members of the low grade glioma methylation class family; Supplementary Fig. 2b). Similar results were observed in an independent RF classifier trained on the same dataset, demonstrating a high stability of probe usage values across different models (Supplementary Fig. 2c–f). Moreover, probe usage values showed high concordance with feature importance values calculated using the SHAP approach³⁵, highlighting the robustness of our method in assessing features importances compared to established methods (Supplementary Fig. 3a–c).

**Fig. 2: Class-specific probe usage across functional genomic regions.**

We then asked if probes displayed differential usage according to their functional genomic localization and grouped probes according to the following annotations: CpG islands (regions of high CpG density), shores (regions within 2 kb from CpG island), shelves (regions 2–4 kb from CpG island), and open sea regions (the rest of the genome). Usage distributions showed major differences between classes. For instance, probes falling within CpG islands were frequently used to distinguish IDH-mutant gliomas (classes O_IDH, A_IDH, and A_IDH_HG) from other classes (52.0% of the average probe usage; Fig. 2c, Supplementary Fig. 4a). These probes were predominantly hypermethylated in IDH-mutant gliomas, in accordance with the previously reported CpG island methylator phenotype (CIMP)^36,37. Interestingly, similar patterns were observed for tumor classes LYMPHO and ETMR (80.6% and 56.3%, respectively). Conversely, probes that were used to distinguish PITAD and LIPN classes were mostly located in shelf and open sea regions and were frequently hypomethylated (78.6% and 57.7%, respectively). These classes are characterized by low overall DNA methylation levels (Supplementary Fig. 4b).

Other functional genomic regions that often show differential DNA methylation between cell types and disease states are enhancer regions and large-scale heterochromatic domains. Hypomethylation within enhancer regions is generally associated with enhancer activation and transcription factor binding²¹. Heterochromatic domains, which frequently are positioned at the nuclear lamina, are condensed and transcriptionally silent regions in which DNA methylation is believed to be lost in a passive process in many different types of cancer³⁸. To investigate probe usage in these genomic regions of the RF classifier, we overlapped probes with annotated enhancers (defined across ENCODE cell lines) and DNase I hypersensitive sites (DHS; Fig. 2d, Supplementary Fig. 4c, d). In addition, we overlapped probes with large-scale heterochromatic domains (H3K9me3-positive domains, defined in K562 lymphoblast cells and H1 embryonic stem cells) and lamina-associated domains (LADs, defined in Tig3 fibroblast cells; Fig. 2e, Supplementary Fig. 4e, f). We found that ETMR is often classified by hypermethylated probes located within DHS and CpG island enhancers. High usage of hypermethylated probes within enhancers that are not overlapping CpG islands was observed for different classes of ATRTs, supporting a previous study showing that H3K27ac enhancer landscapes distinguish between ATRT subgroups³⁹. While probe usage patterns in LADs largely mirrored those observed in heterochromatic domains defined in K562 cells, we observed more pronounced patterns in H1 cells. Hypomethylated probes in those regions were frequently employed to classify LGG_MYB, MELAN, EFT_CIC, and CNS_NB_FOXR2 classes. Hypermethylated probes within LADs and heterochromatin domains were used for multiple GBM classes. These results indicate that genomic regions of different sizes (sub-kilobase to megabases) are employed to distinguish between different tumor classes, with potential links to their biology.

Finally, we focused on promoter regions which are often described as the primary region of transcriptional regulation by DNA methylation. Promoter hypermethylation is generally associated with gene silencing. To this aim, we grouped probes in 250 bp windows within 10 kb regions centered around annotated transcription starting sites (TSS) of all annotated genes. We observed highest probe usage in close proximity of the TSS in the majority of classes (Fig. 2f). To test if this enrichment was due to the higher probe coverage of promoters on the Illumina DNA methylation array (32.5% of available probes are located within 1 kb of annotated TSS), we also considered the average probe usage within windows. While hypermethylated probes in proximity of TSS showed the highest usage for IDH-mutant tumors, this analysis showed that most other tumor classes relied on probes that were located further upstream or downstream of the TSS (Fig. 2f). This analysis suggests that probes located distal to promoter regions (e.g., in enhancers, gene bodies) are more informative for classification of most tumor classes.

High genomic redundancy of informative probes

For further analyses we focused on the inner RF model that is based on 10,000 probes selected from the outer RF model and is used for generating predictions with the Heidelberg brain tumor classifier⁵. Using this model, we extracted and aggregated pairwise probe usage values similarly as for the outer classier. We first asked if informative probes separated multiple classes from each other or were specific to individual classes (i.e., separated a single class from all others). We also asked to which degree there is redundancy between informative probes and if they map in close proximity to each other. To this aim, we performed unsupervised clustering and t-SNE dimensionality reduction of all 10,000 probes according to their usage (using Pearson’s correlation coefficient as the distance measure). Our analysis indicated clearly defined groups of probes that were separated into a total of 88 clusters (Fig. 3a, Supplementary Fig. 5a). The number of probes per cluster was highly variable and ranged from 2 to 211 (median of 117; Supplementary Fig. 5b). Interestingly, we found that the majority of clusters were associated with a single tumor class, indicating high class specificity of most selected probes (Fig. 3b). Specifically, 71 of 88 clusters (80.7%) were associated with a specific tumor class. For instance, to classify ETMR samples, the model predominantly employed probes belonging to cluster 27 (hypermethylated, n = 136 probes) and cluster 78 (hypomethylated, n = 44 probes; Fig. 3c, d). We also identified 17 clusters (19.3%) that were associated with 2 or 3 tumor classes. Among these were clusters associated with GBM, IDH-mutant glioma, and PITAD methylation class families of closely related tumor classes (Fig. 3b). For example, probes belonging to cluster 1 (hypermethylated, n = 211) were associated with classes O_IDH, A_IDH, and A_IDH_HG (Fig. 3e, f). When looking at probes that distinguish these three classes from each other, we found that a smaller set of probes within cluster 1 was specifically used to distinguish O_IDH from the other two, while few other probes scattered across different clusters were employed to split A_IDH and A_IDH_HG (Supplementary Fig. 5c).

**Fig. 3: Unsupervised clustering of probes by their usage.**

We next asked if probes belonging to a given cluster mapped in close proximity to each other (e.g., multiple probes associated with the same gene). We found that probes from each cluster were distributed over many regions across the genome and did not show a specific enrichment towards a particular chromosome or region (Supplementary Fig. 5d). The 44 probes belonging to cluster 78 (ETMR-specific probes) mapped to 32 distinct genomic regions (Fig. 3g). For cluster 1 (IDH-mutant glioma-specific probes, n = 211), we identified 106 genomic regions (Fig. 3h). These results indicate a high genomic redundancy of probes employed by the classifier that may mitigate potential intra-class variability between individual patient samples (e.g., due to stochastic copy-number changes). A main characteristic of the RF algorithm is that it derives predictions across a large number of decision trees. Making use of nearly redundant probes across the genome may explain the high robustness of DNA methylation array-based tumor classification.

Interpretable AI yields insights into tumor biology

To make our interpretable framework accessible to the research community, we developed a user-friendly web application utilizing the shiny R package (“shinyMNP”, https://hovestadtlab.shinyapps.io/shinyMNP/). Our application consists of four main panels that allow the user to query and explore the dataset in different ways. In the first panel, users can explore the top probes employed by the classifier to identify a given class (Supplementary Fig. 6a). The second panel of the app allows users to identify probes that distinguish any two classes (Supplementary Fig. 6b). The third panel visualizes the total probe usage for select genes of interest across all classes (Supplementary Fig. 6c). The interactive heatmap shows the usage of associated probes across all possible class combinations. The final panel visualizes the top genes associated with the most used probes for each class, as represented as a directed network in which arrows connect genes to associated tumor classes (Fig. 4a).

**Fig. 4: shinyMNP interface and illustrative class-specific genes.**

Using our shinyMNP web application, we investigated class-specific probes and their associated genes for potential biological discovery. We selected four illustrative examples that demonstrate the application of our interpretable framework. As a first example, we identified the promoter region of SHPRH that was specifically hypermethylated in ETMR, a tumor class that is characterized by chromosomal instability and high levels of R-loops^40,41 (Fig. 4b). In accordance, gene expression analysis highlighted pronounced SHPRH downregulation specifically in ETMR tumors (Supplementary Fig. 6d). The E3-ligase SHPRH poly-ubiquitinates PCNA to prevent genomic instability from stalled replication forks that may be caused by R-loops^42,43,44,45. Our observation suggests that silencing of SHPRH may contribute to the chromosomal instability and high levels of R-loops in ETMR. Second, we identified a number of probes that were specifically hypomethylated throughout the entire gene body of PWWP3A (also known as MUM1) specific to the recently defined tumor class HGNET_MN1 (Fig. 4c). PWWP3A is exclusively expressed in this tumor class (Supplementary Fig. 6e) and is involved in DNA damage repair and chromatin organization⁴⁶. The third example highlights the hypomethylation of the broader promoter region of the TBX19 transcription factor (also known as TPIT) in PITAD_ACTH and relates to the pituitary origin of this tumor class (Fig. 4d). Immunohistochemistry for TPIT has been established as a specific marker for the identification of PITAD_ACTH in routine diagnostics⁴⁷. Inactivating mutations of TBX19 are associated with isolated deficiency of adrenocorticotropic hormone (ACTH), indicating a crucial role of this transcription factor in the regulation of the pituitary corticotroph lineage^48,49. As a last example, we identified pronounced hypermethylation of a single probe near the fourth exon of the proto-oncogene RET specific to the recently defined tumor class HGNET_BCOR in which RET is highly expressed (Fig. 4e; Supplementary Fig. 6f)⁵⁰. RET encodes a transmembrane receptor of the tyrosine protein kinase family and is an important gene in CNS development^51,52,53,54. While it is unknown how the observed hypermethylation is associated with the high expression of RET in HGNET_BCOR, we postulate that overexpression may be due to hypermethylation of a regulatory element, as multiple distal enhancers have been identified in this genomic region. RET may represent a HGNET_BCOR-specific vulnerability as potent targeted drugs have recently been FDA-approved for the treatment of other RET-altered tumors⁵⁵.

In summary, our interpretable framework reveals associations between specific genes and tumor classes that may be employed for future studies in the context of biomarker discovery, therapeutic target identification, and cancer biology research of CNS tumors. Our publicly accessible web application of the interpretable framework enables researchers of the scientific community to perform similar analyses across all classes included in the Heidelberg brain tumor classifier.

Discussion

The ability to classify CNS tumors based on their DNA methylation profiles using artificial intelligence approaches has irrevocably changed brain tumor classification in clinical practice and in research. Despite the usefulness of the Heidelberg brain tumor classifier, a clear understanding of the classifier’s inner decision-making process has been missing. To this aim, we developed an interpretable framework to better explain its underlying working rules.

By calculating the pairwise probe usage between classes over all RF trees, we simultaneously enable both global and local interpretability. DNA methylation patterns that are globally associated with individual tumor classes are readily identified, such as CpG island hypermethylation or hypomethylation in broad heterochromatic regions. On the other hand, our framework allows us to pinpoint probes that are locally important to distinguish between pairs of select tumor classes. Results may sometimes overlap, as a probe can have both high global and local importance for the same class. For instance, for some methylation class families, such as the IDH-mutant glioma classes, a group of redundant probes are used to separate these classes from all the other classes at the global level. After this high-level separation, the algorithm selects some of the few class-specific probes, as identifiable with our pairwise analysis. In the research context, this approach can be applied to annotate DNA methylation patterns that are shared among different classes and families of brain tumors and at the same time to uncover genomic sites that are unique to a single class. In addition to their value as biomarkers for classification and diagnostic relevance, tumor-associated genes could reveal valuable targets for the development of precision therapies, such as the association between RET and HGNET_BCOR, for which targeted inhibitors have recently become available.

Importantly, our interpretable framework can be readily adapted to future versions of the Heidelberg brain tumor classifier, for example to discover relevant probes and DNA methylation patterns associated with additional tumor classes. Our approach can also be transferred to other tumor classifiers that are based on DNA methylation profiling and the RF algorithm^14,15. For example, applying our approach to a RF classifier of a recently published cohort sarcoma samples¹⁵ showed that the main principles unveiled from the CNS tumor classifier also hold true in other DNA methylation-based tumor classifiers (Supplementary Fig. 7). Furthermore, motivated by recent advances using nanopore sequencing technology^{56,57,58,59,60}, we foresee the implementation of point-of-care assays to allow for more affordable and rapid diagnosis that make use of reduced subsets of informative and non-redundant CpG sites based on our findings. Our results also have potential applications in the context of liquid biopsies for early cancer detection, classification, and disease monitoring over time from circulating tumor DNA⁶¹. Our study provides a unique reference for incorporating DNA methylation profiling into these studies.

Overall, our interpretable framework provides a better understanding of the underlying working rules of the Heidelberg brain tumor classifier. Our resource will facilitate the discovery of disease biomarkers and therapeutic targets, and support the development of bioinformatic pipelines, machine learning models and point-of-care assays for rapid diagnostics, early detection, and disease monitoring.

Methods

Datasets

DNA methylation profiles (from Illumina Infinium HumanMethylation450K BeadChip arrays) from 2,801 samples of the reference dataset of the Heidelberg brain tumor classifier⁵ were downloaded as IDAT files from GEO (GSE109381 [https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE109381]). Samples were associated with 82 molecular tumor classes and 9 control classes. Processed gene expression data (Affymetrix U133 Plus 2.0 Array) were retrieved from GEO (GSE73038 [https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE73038]) and annotations were obtained from the original publication⁵⁰. The following probes were selected: 211421_s_at (RET); 221290_s_at (MUM1/PWWP3A); 226366_at (SHPRH). All computational analyses were performed in R (v.4.1.3). The pre-processed DNA methylation array data of the sarcoma cohort¹⁵ were retrieved from GEO (GSE140686 [https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE140686]).

Random forest models

Random forests consist of many binary decision trees, which are simple ML algorithms that resemble the human cognitive process of decision-making. The algorithm constructs individual trees by selecting features that offer the best split between remaining samples, resulting in a flowchart-like structure that makes them easy to understand and interpret. Terminal nodes represent different classes that can be predicted. Decision trees are composed of a limited number of features and are readily interpretable. At the same time, individual trees suffer from low accuracy in complex scenarios and are prone to overfitting. In other words, their rigid, but transparent structure limits their ability to learn and generalize. To overcome those limitations, RF aggregates the prediction of multiple decision trees on a subset of samples and features.

In order to make the Heidelberg CNS tumor classifier interpretable, we used the trained RF models (“outer” and “inner” classifier) from the original publication (v.11b2)⁵. The inner classifier was trained using 10,000 probes that were selected from the outer classifier trained on all 428,799 high quality probes. Probes were selected based on their per-class importance (MeanDecreaseAcuarcy value). We first calculated the rank of each probe within every class (low rank = high MeanDecreaseAcuarcy), and subsequently selected the 10,000 probes with the lowest rank across classes⁵.

To re-train the outer RF model (Supplementary Fig. 2c–f), we used the randomForest R package (v.4.7-1.1) and the following parameters: ntree = 10000, mtry = 654, sampsize = rep(8, 91), importance = T. To train an outer RF model for the sarcoma cohort (Supplementary Fig. 6) we used the following parameters: ntree = 10,000, mtry = 654, sampsize = rep (7, 65). The inner RF sarcoma model was trained using the 10,000 most important features (overall MeanDecreaseAccuracy value from the outer classifier) and using mtry = 100. All the RF models were trained with the option keep.inbag = T, required for the next steps.

To calculate SHAP feature importances, we trained an independent random forest model in Python (v.3.12.5) with the scikit-learn library (v.1.5.1) using the same 10,000 selected features, achieving an out-of-bag error of 0.04. We calculated Shapley values using the shap library (v.0.45.1). To reduce computing time, Shapley values were calculated for a single sample in each of the 91 classes (similarly using one sample per class as the background).

Extraction of random forest probe usage

We assumed that the number of times a feature (i.e., a DNA methylation probe) is selected to perform a split between tumor classes reflects the importance of that particular feature in that context. Since this information is not directly accessible from the innate structure of a trained RF model object, we calculated the probe usage as follows: In an iterative process over each of the 10,000 trees, all paths from the root node to each terminal node were extracted. Next, those paths were compared in a pairwise manner, annotating the probe ID and its methylation status at the node that splits two terminal nodes (annotated as −1 for hypomethylated and 1 for hypermethylated). From these pre-computed comparisons, we retrieved the splitting nodes for all 529,984 possible combinations between in-bag samples (8 samples for each of the 91 classes). Subsequently, splitting nodes were aggregated at the tumor class level across all trees (representing the probe usage). Probe usage values were stored as a 3D array in which each layer (428,799 probes) is composed of a symmetric matrix representing the probe usage of all 91 by 91 (8281) possible class comparisons. The probe usage can be summarized as the total or average probe usage per class or across all classes. For these steps the data.table (v.1.14.2) and iterpc (v.0.4.2) R packages were used. The code for extracting probe usages from a trained RF model is accessible at https://github.com/hovestadt/shinyMNP.

Analysis of functional genomic regions

Probes were grouped by tumor classes and mean probe usage was calculated for each category of functional genomic regions. CpG island, shore, shelf, sea, DHS, and enhancers annotations were extracted from the Illumina array annotation file (HumanMethylation450 manifest file v1.2, downloaded from the Illumina website). Lamina associated domains (LADs, identified in human Tig3 human fibroblasts) and heterochromatin domains (identified in H1-hESC and K562 cell lines) positions were downloaded from the ENCODE portal. The human genome assembly hg19 and RefSeq gene annotation were retrieved from the UCSC genome browser website. Probes were mapped to the nearest TSS using functions from the GenomicRanges Bioconductor package (v.1.46.1). Probes with a distance greater than 5000 bp were removed. The total and average probe usage was plotted as a function of the distance to the nearest TSS for each tumor class in 250 bp windows.

Dimensionality reduction and clustering

Vectorized probe usages were employed to calculate the 10,000 by 10,000 Pearson’s distance matrix. This matrix was used as input for a t-SNE dimensionality reduction calculated using the Rtsne R package (v.0.16) and the following parameters: theta = 0.1, pca = F, num_threads = 0, is_distance = T, max_iter = 5000. For unsupervised clustering, we first calculated the k-nearest neighbors and constructed the SNN graph from the distance matrix. Then, clusters were identified using the Louvain algorithm to optimize the modularity function. The Seurat Bioconductor package (v.4.1.1) was employed for this step. To associate each cluster of probes to tumor classes we first calculated the fraction of probe usage over the total for each cluster and class. A tumor class was associated with a cluster if the fraction was greater than 0.1 (10%). The genomic location of probes within clusters was plotted using the karyoploteR Bioconductor package (v.1.20.3).

Development of the shinyMNP web application

To create an interactive web application that allows researchers to access our interpretable framework we used the same 3D probe usage array as described above and precomputed different summaries for rapid access. The app was developed using the shiny (v.1.7.1), tidyverse (v.1.3.1), ggplot2 (v.3.3.5), shinythemes (v.1.2.0), rhdf5 (v.2.38.1), plotly (v.4.10.0), heatmaply (v.1.3.0), igraph (v.1.3.1) and visNetwork (v.2.1.0) R/Bioconductor packages.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

No new datasets have been generated as part of this study. The DNA methylation reference dataset and the gene expression dataset are accessible from GEO (GSE109381 [https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE109381]) and GSE73038, respectively). Source data are provided with this paper.

Code availability

The code to calculate probe usages from the RF model and perform subsequent analyses is available at: https://github.com/hovestadt/shinyMNP.

References

Topol, E. J. High-performance medicine: the convergence of human and artificial intelligence. Nat. Med. 25, 44–56 (2019).
Article CAS PubMed MATH Google Scholar
Stenzinger, A. et al. Artificial intelligence and pathology: from principles to practice and future applications in histomorphology and molecular profiling. Semin. Cancer Biol. 84, 129–143 (2022).
Article CAS PubMed Google Scholar
Yala, A., Lehman, C., Schuster, T., Portnoi, T. & Barzilay, R. A deep learning mammography-based model for improved breast cancer risk prediction. Radiology 292, 60–66 (2019).
Article PubMed Google Scholar
Jamshidi, A. et al. Evaluation of cell-free DNA approaches for multi-cancer early detection. Cancer Cell 40, 1537–1549.e12 (2022).
Article CAS PubMed MATH Google Scholar
Capper, D. et al. DNA methylation-based classification of central nervous system tumours. Nature 555, 469–474 (2018).
Article ADS CAS PubMed PubMed Central MATH Google Scholar
Louis, D. N. et al. The 2021 WHO Classification of Tumors of the Central Nervous System: a summary. Neuro. Oncol. 23, 1231–1251 (2021).
Article CAS PubMed PubMed Central MATH Google Scholar
Jaunmuktane, Z. et al. Methylation array profiling of adult brain tumours: diagnostic outcomes in a large, single centre. Acta Neuropathol. Commun. 7, 1–18 (2019).
Article Google Scholar
Karimi, S. et al. The central nervous system tumor methylation classifier changes neuro-oncology practice for challenging brain tumor diagnoses and directly impacts patient care. Clin. Epigenetics 11, 1–10 (2019).
Article MATH Google Scholar
Priesterbach-Ackley, L. P. et al. Brain tumour diagnostics using a DNA methylation-based classifier as a diagnostic support tool. Neuropathol. Appl. Neurobiol. 46, 478–492 (2020).
Article CAS PubMed PubMed Central Google Scholar
White, C. L. et al. Implementation of DNA methylation array profiling in pediatric central nervous system tumors - the AIM BRAIN Project: an Australian and New Zealand Children’s Haematology and Oncology (ANZCHOG) Group study. J. Mol. Diagn. https://doi.org/10.1016/j.jmoldx.2023.06.013 (2023).
Article PubMed Google Scholar
Capper, D. et al. Practical implementation of DNA methylation and copy-number-based CNS tumor diagnostics: the Heidelberg experience. Acta Neuropathol. 136, 181–210 (2018).
Article CAS PubMed PubMed Central MATH Google Scholar
Sturm, D. et al. Multiomic neuropathology improves diagnostic accuracy in pediatric neuro-oncology. Nat. Med. 29, 917–926 (2023).
Article CAS PubMed PubMed Central MATH Google Scholar
Pickles, J. C. et al. DNA methylation-based profiling for paediatric CNS tumour diagnosis and treatment: a population-based study. Lancet Child Adolesc. Health 4, 121–130 (2020).
Article CAS PubMed MATH Google Scholar
Jurmeister, P. et al. DNA methylation-based classification of sinonasal tumors. Nat. Commun. 13, 7148 (2022).
Article ADS PubMed PubMed Central MATH Google Scholar
Koelsche, C. et al. Sarcoma classification by DNA methylation profiling. Nat. Commun. 12, 498 (2021).
Article ADS CAS PubMed PubMed Central MATH Google Scholar
Sahm, F. et al. DNA methylation-based classification and grading system for meningioma: a multicentre, retrospective analysis. Lancet Oncol. 18, 682–694 (2017).
Article CAS PubMed MATH Google Scholar
Dragomir, M. P. et al. DNA methylation-based classifier differentiates intrahepatic pancreato-biliary tumours. EBioMedicine 93, 104657 (2023).
Article CAS PubMed PubMed Central Google Scholar
Koelsche, C. & von Deimling, A. Methylation classifiers: brain tumors, sarcomas, and what’s next. Genes Chromosomes Cancer 61, 346–355 (2022).
Article CAS PubMed MATH Google Scholar
Mattei, A. L., Bailly, N. & Meissner, A. DNA methylation: a historical perspective. Trends Genet. 38, 676–707 (2022).
Article CAS PubMed MATH Google Scholar
Li, E. & Zhang, Y. DNA methylation in mammals. Cold Spring Harb. Perspect. Biol. 6, a019133 (2014).
Article PubMed PubMed Central Google Scholar
Greenberg, M. V. C. & Bourc’his, D. The diverse roles of DNA methylation in mammalian development and disease. Nat. Rev. Mol. Cell Biol. 20, 590–607 (2019).
Article CAS PubMed MATH Google Scholar
Smith, K. S. et al. Unified rhombic lip origins of group 3 and group 4 medulloblastoma. Nature 609, 1012–1020 (2022).
Article ADS CAS PubMed PubMed Central MATH Google Scholar
Hovestadt, V. et al. Decoding the regulatory landscape of medulloblastoma using DNA methylation sequencing. Nature 510, 537–541 (2014).
Article ADS CAS PubMed MATH Google Scholar
Klauschen, F. et al. Toward explainable artificial intelligence for precision pathology. Annu. Rev. Pathol. 19, 541–570 (2024).
Article CAS PubMed Google Scholar
Azodi, C. B., Tang, J. & Shiu, S.-H. Opening the Black Box: interpretable machine learning for geneticists. Trends Genet. 36, 442–455 (2020).
Article CAS PubMed MATH Google Scholar
Nussberger, A.-M., Luo, L., Celis, L. E. & Crockett, M. J. Public attitudes value interpretability but prioritize accuracy in Artificial Intelligence. Nat. Commun. 13, 5821 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Touw, W. G. et al. Data mining in the Life Sciences with Random Forest: a walk in the park or lost in the jungle? Brief. Bioinform. 14, 315–326 (2013).
Article PubMed MATH Google Scholar
Chen, X. & Ishwaran, H. Random forests for genomic data analysis. Genomics 99, 323–329 (2012).
Article CAS PubMed MATH Google Scholar
Maros, M. E. et al. Machine learning workflows to estimate class probabilities for precision cancer diagnostics on DNA methylation microarray data. Nat. Protoc. 15, 479–512 (2020).
Article CAS PubMed MATH Google Scholar
Breiman, L. Random Forests. Mach. Learn. 45, 5–32 (2001).
Article MATH Google Scholar
Díaz-Uriarte, R. & Alvarez de Andrés, S. Gene selection and classification of microarray data using random forest. BMC Bioinforma. 7, 3 (2006).
Article MATH Google Scholar
Petralia, F., Wang, P., Yang, J. & Tu, Z. Integrative random forest for gene regulatory network inference. Bioinformatics 31, i197–i205 (2015).
Article CAS PubMed PubMed Central Google Scholar
Basu, S., Kumbier, K., Brown, J. B. & Yu, B. Iterative random forests to discover predictive and stable high-order interactions. Proc. Natl Acad. Sci. USA 115, 1943–1948 (2018).
Article ADS MathSciNet CAS PubMed PubMed Central MATH Google Scholar
Benfatto, S. et al. Uncovering cancer vulnerabilities by machine learning prediction of synthetic lethality. Mol. Cancer 20, 111 (2021).
Article PubMed PubMed Central MATH Google Scholar
Lundberg, S. & Lee, S.-I. A Unified Approach to Interpreting Model Predictions. (2017).
Turcan, S. et al. IDH1 mutation is sufficient to establish the glioma hypermethylator phenotype. Nature 483, 479–483 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Noushmehr, H. et al. Identification of a CpG island methylator phenotype that defines a distinct subgroup of glioma. Cancer Cell 17, 510–522 (2010).
Article CAS PubMed PubMed Central Google Scholar
Zhou, W. et al. DNA methylation loss in late-replicating domains is linked to mitotic cell division. Nat. Genet. 50, 591–602 (2018).
Article CAS PubMed PubMed Central MATH Google Scholar
Johann, P. D. et al. Atypical teratoid/rhabdoid tumors are comprised of three epigenetic subgroups with distinct enhancer landscapes. Cancer Cell 29, 379–393 (2016).
Article CAS PubMed MATH Google Scholar
Lambo, S. et al. The molecular landscape of ETMR at diagnosis and relapse. Nature 576, 274–280 (2019).
Article PubMed PubMed Central MATH Google Scholar
Lambo, S., von Hoff, K., Korshunov, A., Pfister, S. M. & Kool, M. ETMR: a tumor entity in its infancy. Acta Neuropathol. 140, 249–266 (2020).
Article PubMed PubMed Central Google Scholar
Sood, R. et al. Cloning and characterization of a novel gene, SHPRH, encoding a conserved putative protein with SNF2/helicase and PHD-finger domains from the 6q24 region. Genomics 82, 153–161 (2003).
Article CAS PubMed Google Scholar
Motegi, A. et al. Human SHPRH suppresses genomic instability through proliferating cell nuclear antigen polyubiquitination. J. Cell Biol. 175, 703–708 (2006).
Article CAS PubMed PubMed Central Google Scholar
Gan, W. et al. R-loop-mediated genomic instability is caused by impairment of replication fork progression. Genes Dev. 25, 2041–2056 (2011).
Article CAS PubMed PubMed Central MATH Google Scholar
Motegi, A. et al. Polyubiquitination of proliferating cell nuclear antigen by HLTF and SHPRH prevents genomic instability from stalled replication forks. Proc. Natl Acad. Sci. USA 105, 12411–12416 (2008).
Article ADS CAS PubMed PubMed Central MATH Google Scholar
Huen, M. S. Y. et al. Regulation of chromatin architecture by the PWWP domain-containing DNA damage-responsive factor EXPAND1/MUM1. Mol. Cell 37, 854–864 (2010).
Article CAS PubMed PubMed Central MATH Google Scholar
Lloyd, R. V., Osamura, R. Y., Klöppel, G. & Rosai, J. WHO Classification of Tumours of Endocrine Organs. (WHO Classification of Tumours, 2017).
Metherell, L. A. et al. TPIT mutations are associated with early-onset, but not late-onset isolated ACTH deficiency. Eur. J. Endocrinol. https://doi.org/10.1530/eje.0.1510463 (2004).
Oh, J. Y. et al. Transcriptomic profiles of normal pituitary cells and pituitary neuroendocrine tumor cells. Cancers 15, 110 (2022).
Article PubMed PubMed Central MATH Google Scholar
Sturm, D. et al. New brain tumor entities emerge from molecular classification of CNS-PNETs. Cell 164, 1060–1072 (2016).
Article CAS PubMed PubMed Central MATH Google Scholar
Takahashi, M., Ritz, J. & Cooper, G. M. Activation of a novel human transforming gene, ret, by DNA rearrangement. Cell 42, 581–588 (1985).
Article CAS PubMed MATH Google Scholar
Takahashi, M. et al. Cloning and expression of the ret proto-oncogene encoding a tyrosine kinase with two potential transmembrane domains. Oncogene 3, 571–578 (1988).
CAS PubMed MATH Google Scholar
Treanor, J. J. et al. Characterization of a multicomponent receptor for GDNF. Nature 382, 80–83 (1996).
Article ADS CAS PubMed MATH Google Scholar
Trupp, M. et al. Functional receptor for GDNF encoded by the c-ret proto-oncogene. Nature 381, 785–789 (1996).
Article ADS CAS PubMed Google Scholar
Mehta, G. U. et al. US Food and Drug Administration regulatory updates in neuro-oncology. J. Neurooncol. 153, 375–381 (2021).
Article CAS PubMed PubMed Central MATH Google Scholar
Kuschel, L. P. et al. Robust methylation-based classification of brain tumours using nanopore sequencing. Neuropathol. Appl. Neurobiol. 49, e12856 (2022).
Patel, A. et al. Rapid-CNS: rapid comprehensive adaptive nanopore-sequencing of CNS tumors, a proof-of-concept study. Acta Neuropathol. 143, 609–612 (2022).
Article PubMed PubMed Central MATH Google Scholar
Vermeulen, C. et al. Ultra-fast deep-learned CNS tumour classification during surgery. Nature 622, 842–849 (2023).
Article ADS CAS PubMed PubMed Central MATH Google Scholar
Djirackor, L. et al. Intraoperative DNA methylation classification of brain tumors impacts neurosurgical strategy. Neurooncol. Adv. 3, vdab149 (2021).
PubMed PubMed Central Google Scholar
Euskirchen, P. et al. Same-day genomic and epigenomic diagnosis of brain tumors using real-time nanopore sequencing. Acta Neuropathol. 134, 691–703 (2017).
Article CAS PubMed PubMed Central Google Scholar
Liu, A. P. Y. et al. Serial assessment of measurable residual disease in medulloblastoma liquid biopsies. Cancer Cell 39, 1519–1530.e4 (2021).
Article CAS PubMed PubMed Central MATH Google Scholar

Download references

Acknowledgements

We thank the members of the Hovestadt lab for insightful discussions and critical feedback on the paper. This work was supported by the Charles H. Hood Foundation, the Children’s Cancer Research Fund, the V Foundation, and the Claudia Adams Barr Program in Innovative Cancer Research (V.H.).

Author information

Authors and Affiliations

Department of Pediatric Oncology, Dana-Farber Cancer Institute, Boston, MA, USA
Salvatore Benfatto & Volker Hovestadt
Division of Hematology/Oncology, Boston Children’s Hospital, Boston, MA, USA
Salvatore Benfatto & Volker Hovestadt
Broad Institute of MIT and Harvard, Cambridge, MA, USA
Salvatore Benfatto & Volker Hovestadt
Division of Pediatric Neurooncology, Hopp Children’s Cancer Center (KiTZ), Heidelberg, Germany
Martin Sill & Stefan M. Pfister
National Center for Tumor Diseases (NCT), NCT Heidelberg, a partnership between DKFZ and Heidelberg University Hospital, Heidelberg, Germany
Martin Sill, David T. W. Jones & Stefan M. Pfister
German Cancer Research Center (DKFZ) and German Cancer Consortium (DKTK), Heidelberg, Germany
Martin Sill, David T. W. Jones & Stefan M. Pfister
Division of Pediatric Glioma Research, Hopp Children’s Cancer Center (KiTZ), Heidelberg, Germany
David T. W. Jones
Department of Pediatric Oncology, Hematology & Immunology, Heidelberg University Hospital, Heidelberg, Germany
Stefan M. Pfister
Department of Neuropathology, Heidelberg University Hospital, Heidelberg, Germany
Felix Sahm & Andreas von Deimling
Clinical Cooperation Unit Neuropathology, German Cancer Research Center (DKFZ) and German Cancer Consortium (DKTK), Heidelberg, Germany
Felix Sahm & Andreas von Deimling
Department of Neuropathology, Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany
David Capper
German Cancer Consortium (DKTK), Partner Site Berlin, German Cancer Research Center (DKFZ), Heidelberg, Germany
David Capper

Authors

Salvatore Benfatto
View author publications
Search author on:PubMed Google Scholar
Martin Sill
View author publications
Search author on:PubMed Google Scholar
David T. W. Jones
View author publications
Search author on:PubMed Google Scholar
Stefan M. Pfister
View author publications
Search author on:PubMed Google Scholar
Felix Sahm
View author publications
Search author on:PubMed Google Scholar
Andreas von Deimling
View author publications
Search author on:PubMed Google Scholar
David Capper
View author publications
Search author on:PubMed Google Scholar
Volker Hovestadt
View author publications
Search author on:PubMed Google Scholar

Contributions

S.B. and V.H. developed the explainable AI approach and investigated results; S.B. created the web application; M.S. and V.H. developed the random forest classification algorithm; D.T.W.J., S.M.P., F.S., A.v.D., and D.C. generated the reference cohort and defined methylation classes; D.T.W.J., S.M.P., and V.H. conceptualized the study; S.B. and V.H. wrote the paper with comments from all authors. All authors approved the final version of the paper.

Corresponding author

Correspondence to Volker Hovestadt.

Ethics declarations

Competing interests

M.S., D.T.W.J., S.M.P., F.S., A.v.D., D.C., and V.H. are listed as inventors on the patent ‘DNA-methylation based method for classifying tumor species’ (EP3268492B1) filed by Deutsches Krebsforschungszentrum Stiftung des öffentlichen Rechts and Ruprecht-Karls-Universität Heidelberg. M.S., D.T.W.J., S.M.P., F.S., A.v.D., and D.C. are shareholders in and co-founders of Heidelberg Epignostix GmbH. The remaining authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Sylvain Cussat-Blanc, Xiao-Nan Li and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Reporting Summary

Transparent Peer Review file

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Benfatto, S., Sill, M., Jones, D.T.W. et al. Explainable artificial intelligence of DNA methylation-based brain tumor diagnostics. Nat Commun 16, 1787 (2025). https://doi.org/10.1038/s41467-025-57078-0

Download citation

Received: 08 January 2024
Accepted: 07 February 2025
Published: 20 February 2025
DOI: https://doi.org/10.1038/s41467-025-57078-0

This article is cited by

Current AI technologies in cancer diagnostics and treatment
- Ashutosh Tiwari
- Soumya Mishra
- Tsung-Rong Kuo
Molecular Cancer (2025)
Artificial Intelligence in cancer epigenomics: a review on advances in pan-cancer detection and precision medicine
- Karishma Sahoo
- Prakash Lingasamy
- Vijayachitra Modhukur
Epigenetics & Chromatin (2025)
Advances of artificial intelligence-enabled epigenetics
- Dima Abdelrahim Jamil Alsayaydeh
- Vigneswaran Narayanamurthy
- Suhaila Binti Mohd Najib
Health and Technology (2025)