Accurate low and high grade glioma classification using free water eliminated diffusion tensor metrics and ensemble machine learning

Vidyadharan, Sreejith; Rao, B. V. V. S. N. Prabhakar; Yogeeswari, P.; Kesavadas, C.; Rajagopalan, Venkateswaran

doi:10.1038/s41598-024-70627-9

Article
Open access
Published: 27 August 2024

Scientific Reports volume 14, Article number: 19844 (2024)

Abstract

Glioma, a predominant type of brain tumor, can be fatal. This necessitates an early diagnosis and effective treatment strategies. Current diagnosis is based on biopsy, prompting the need for non-invasive neuroimaging alternatives. Diffusion tensor imaging (DTI) is a promising method for studying the pathophysiological impact of tumors on white matter (WM) tissue. Single-shell DTI studies in brain glioma patients have not accounted for free water (FW) contamination due to tumors. This study aimed to (a) assess the efficacy of a two-compartment DTI model that accounts for FW contamination and (b) identify DTI-based biomarkers to classify low-grade glioma (LGG) and high-grade glioma (HGG) patients. DTI data from 86 patients (LGG n = 39, HGG n = 47) were obtained using a routine clinical imaging protocol. DTI metrics of tumorous regions and normal-appearing white matter (NAWM) were evaluated. Stacking-based ensemble learning was employed to classify LGG and HGG patients using both single- and two-compartment DTI model measures. The DTI metrics of the two-compartment model outperformed those of the standard single-compartment DTI model in terms of sensitivity, specificity, and area under the curve of receiver operating characteristic (AUC-ROC) score in classifying LGG and HGG patients. Four features (out of 16 features), namely fractional anisotropy (FA) of the edema and core region and FA and mean diffusivity of the NAWM region, showed superior performance (sensitivity = 92%, specificity = 90%, and AUC-ROC = 90%) in classifying LGG and HGG. This indicates that both tumorous and NAWM regions may be differentially affected in LGG and HGG patients. Our results demonstrate that a two-compartment DTI model that accounts for FW contamination improves diagnostic accuracy. This improvement may eventually aid in planning treatment strategies for glioma patients.

Introduction

Gliomas are the most prevalent type of brain tumors^1,2. As per Global Cancer Statistics, 2020³, 308,102 new cases of central nervous system cancer were diagnosed in 185 countries. Glioma is usually classified into low-grade glioma (LGG) and high-grade glioma (HGG)². Based on the molecular characterization of central nervous system tumors, the survival time of HGG patients is shorter than that of LGG patients¹. Therefore, it is important to diagnose brain tumors early to plan treatment strategies. Present clinical diagnosis is based on biopsy, an invasive procedure that is expensive and prone to complications. Therefore, a non-invasive quantitative approach using neuroimaging is essential to characterize the tumor. The magnetic resonance imaging (MRI) modalities currently employed in the qualitative radiological diagnosis of brain tumors are T1-weighted, T2-weighted, fluid-attenuated inversion recovery (FLAIR), and T1-weighted contrast-enhanced (T1-c) imaging. These conventional MRIs do not provide a quantitative assessment of the microstructural pathophysiological effects of gliomas on brain white matter (WM) tissue. Recent evidence^4,5,6 shows that the pathophysiological process of the tumor is not restricted locally but also spreads globally through the WM pathways to other brain regions.

Diffusion tensor imaging (DTI) is a quantitative approach that uses water diffusion as a probe to assess brain WM tissue. Studies^7,8 that measured fractional anisotropy (FA) and mean diffusivity (MD) values of the tumor region found that FA, but not MD, was able to distinguish LGG from HGG patients. In addition to FA and MD, others^9,10,11,12 have used Westin’s indices, axial diffusivity (AD), and radial diffusivity (RD) to study different tumor regions (core, enhancing, and edema) and the normal-appearing WM (NAWM, meaning WM excluding the tumor regions) of the contralateral hemisphere. They found that the tumor affects the structural integrity of NAWM. All of the above studies used a single-compartment model for estimating the diffusion tensor. Aggressive tumor growth/invasion into the surrounding tissues (especially in HGG patients) leads to the accumulation of vasogenic edema^13,14,15. This causes free water (FW) contamination in the WM tissue^16,17,18. The FW in turn corrupts the FA and MD values^19,20 and fiber tractography reconstruction^16,17.

Therefore, in this study, (a) we apply a bi-tensor model (a two-compartment model with one compartment for the brain tissue and one for FW), which is recommended when FW contamination is present^16,17,18 because it improves the accuracy and specificity of the estimated DTI metrics. We hypothesize that the DTI metrics estimated using the bi-tensor model will classify LGG and HGG patients with higher sensitivity and specificity than those of the single-compartment model; (b) we analyze DTI features of both the tumor and NAWM in LGG and HGG patients to identify non-invasive neuroimaging biomarkers; and (c) we use machine learning to classify glioma patients as opposed to the commonly used statistical inference methods, which are sometimes suboptimal²¹ because the relationship between predictor and dependent variables may vary across datasets, which makes such inferences unreliable.

Materials and methods

Data acquisition

Among different types of brain tumor patients, we selected only the glioma patients who were categorized into LGG and HGG. An experienced radiologist (one of the authors) from Sree Chitra Tirunal Institute of Medical Science and Technology (SCTIMST), Thiruvananthapuram, India recruited the patients for this study. The glioma grading was determined after biopsy and histopathological examination. Retrospective imaging data of these glioma patients were collected from the SCTIMST database. The imaging was performed at the time of the biopsy. MRI scans of 86 glioma patients (LGG, n = 39; HGG, n = 47) were acquired. The included MRI data were acquired between March 8, 2013, and September 30, 2019. Since our retrospective clinical data were collected before the implementation of the WHO 2021 glioma grade classification criteria, molecular diagnosis was not utilized for this cohort. Therefore, the classification relied solely on the histopathological criteria available at the time of data collection. The data for this study were collected as part of a technological project in collaboration with SCTIMST. Data from this hospital were shared with us via a secured server, with patient demographic details kept anonymous and only the pathological details shared. Hence, patient demographic details are unavailable in this study, since we did not have direct access to this information.

Imaging protocol

Since the data for this study came from a retrospective clinical repository, the patients were scanned either in a 1.5 T Siemens MRI scanner (Magnetom Avanto, Erlangen, Germany) or a 3 T General Electric Discovery MR750w scanner (Boston, United States). The MRI sequences include T1-weighted, T2-weighted, T1-c, and FLAIR, along with diffusion weighted images. Details about the imaging parameters are given in the Supplementary method section.

Data processing

Tumor segmentation

The image pre-processing steps include skull stripping and bias field correction on T1-weighted, T2-weighted, FLAIR, and T1-c images. The FSL BET tool (version 6.0.4, https://fsl.fmrib.ox.ac.uk/fsl/fslwiki/FslInstallation) was used for skull stripping. Bias field correction was performed using the FSL FAST tool. Tumor segmentation for both LGG and HGG patients was performed using nnU-Net, a self-adapting automated deep learning framework²². Briefly, the nnU-Net architecture consists of four configurations; for more details, refer to Isensee et al.²². nnU-Net takes four-channel inputs, namely T1-weighted, T2-weighted, FLAIR, and T1-c images, and outputs three segmented tumor images, namely core tumor, enhanced tumor, and edema, for each patient. For our dataset, the third configuration (3-D low resolution) of nnU-Net gave superior segmentation accuracy, but it failed to segment the resected tumor regions in a few of our patients. The missed resected tumor regions were manually segmented and included. All the above steps were verified and validated by one of the authors, who is an experienced radiologist. Figure 1 shows the segmented tumor regions in a typical brain tumor patient.

Diffusion tensor image processing

A. Standard DTI model (single-compartment model)

Diffusion weighted images were processed using DSI-Studio (30 December 2020 build, https://dsi-studio.labsolver.org/download.html). Affine registration was performed to correct for eddy current and motion distortion effects. The b-matrix was corrected for the gradient orientations. Brain extraction was performed on distortion-corrected images. A mono-exponential fit using the linear least-squares optimization algorithm²³ was used to estimate the diffusion tensor based on Eq. (1) given below:

$$S_j = S_0 \exp\left( -b_j\, x_j^{T} D\, x_j \right) \tag{1}$$

where S_j is the diffusion weighted image in the gradient direction x_j and a b-value of b_j, S₀ is the image with no diffusion weighting (b = 0), and D is the 3 × 3 diffusion tensor to be estimated. The DTI maps were then generated from the fitted diffusion tensor model. Average FA, AD, RD, and MD values for the core, enhanced, and edema regions for each patient were obtained by multiplying the binarized tumor segmented images with the DTI maps. Figure 2 shows the workflow details of DTI feature extraction from core, enhanced, and edema tumor regions.
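The linear least-squares fit of Eq. (1) can be illustrated with a minimal single-voxel sketch. This is not the DSI-Studio implementation; it assumes unweighted ordinary least squares on the log-signal, and the function names (`design_matrix`, `fit_tensor`, `tensor_metrics`) are hypothetical. FA, MD, AD, and RD are computed from the tensor eigenvalues using their standard definitions.

```python
import numpy as np

def design_matrix(bvals, bvecs):
    """One row per measurement: log(S_j) = X_j . [Dxx, Dyy, Dzz, Dxy, Dxz, Dyz, log(S_0)]."""
    b = np.asarray(bvals, dtype=float)
    g = np.asarray(bvecs, dtype=float)
    gx, gy, gz = g[:, 0], g[:, 1], g[:, 2]
    return np.column_stack([
        -b * gx * gx, -b * gy * gy, -b * gz * gz,
        -2 * b * gx * gy, -2 * b * gx * gz, -2 * b * gy * gz,
        np.ones_like(b),
    ])

def fit_tensor(signal, bvals, bvecs):
    """Ordinary least-squares fit of Eq. (1) for a single voxel (assumed variant)."""
    X = design_matrix(bvals, bvecs)
    y = np.log(np.clip(np.asarray(signal, dtype=float), 1e-12, None))
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    dxx, dyy, dzz, dxy, dxz, dyz, log_s0 = coef
    D = np.array([[dxx, dxy, dxz],
                  [dxy, dyy, dyz],
                  [dxz, dyz, dzz]])
    return D, float(np.exp(log_s0))

def tensor_metrics(D):
    """FA, MD, AD, and RD from the eigenvalues of a 3 x 3 diffusion tensor."""
    evals = np.sort(np.linalg.eigvalsh(D))[::-1]  # descending order
    md = evals.mean()
    fa = np.sqrt(1.5 * np.sum((evals - md) ** 2) / np.sum(evals ** 2))
    return {"FA": float(fa), "MD": float(md),
            "AD": float(evals[0]), "RD": float(evals[1:].mean())}
```

In practice the fit is run for every voxel inside the brain mask, and the resulting maps are then averaged within each segmented region.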

B. Free water eliminated (FWE) DTI model (two-compartment model)

Single-shell diffusion weighted imaging is commonly employed in routine clinical imaging protocols like ours. We fitted the two-compartment model in Eq. (2) given below to the single-shell data, following Golub et al.²⁴, who used a regularized gradient descent algorithm for optimization:

$$A = f \exp\left( -b_j\, x_j^{T} D_t\, x_j \right) + \left( 1 - f \right) \exp\left( -b_j D_w \right), \tag{2}$$

where A = S_j/S₀, S_j is the image obtained after applying the gradient in the direction of x_j and b-value of b_j, S₀ is the image with no diffusion weighting (b = 0), f is effective tissue-water fraction, (1 − f) is the effective FW fraction, D_t (diffusion tensor of tissue) and D_w (diffusion tensor for FW) are the bi-tensors to be estimated.
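To make the roles of f, D_t, and D_w in Eq. (2) concrete, the forward model can be sketched as follows. This is only the signal equation, not the regularized fit of Golub et al.; the function name is hypothetical, and the default free-water diffusivity of 3.0 × 10^–3 mm²/s is an assumption (the value commonly used for water at body temperature), since D_w enters Eq. (2) as a scalar.

```python
import numpy as np

# Assumed isotropic free-water diffusivity in mm^2/s (not stated in the text).
D_WATER = 3.0e-3

def two_compartment_attenuation(f, D_t, bvals, bvecs, d_w=D_WATER):
    """Normalized signal A = S_j / S_0 predicted by Eq. (2).

    f     : effective tissue-water fraction, between 0 and 1
    D_t   : 3 x 3 tissue diffusion tensor
    bvals : b-values b_j, shape (n,)
    bvecs : unit gradient directions x_j, shape (n, 3)
    """
    b = np.asarray(bvals, dtype=float)
    g = np.asarray(bvecs, dtype=float)
    quad = np.einsum("ij,jk,ik->i", g, np.asarray(D_t, dtype=float), g)  # x_j^T D_t x_j
    tissue = f * np.exp(-b * quad)
    free_water = (1.0 - f) * np.exp(-b * d_w)
    return tissue + free_water
```

Setting f = 1 recovers the single-compartment model of Eq. (1) divided by S₀, which is why both models can be evaluated on the same set of features.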

The bias-corrected diffusion weighted images (as processed for standard DTI model), binary brain mask, and corrected b-matrix files were used as input to the FW estimation code obtained from https://github.com/mvgolub/FW-DTI-Beltrami, with the pre-installed open source library DIPY²⁴. After tensor fitting, FW eliminated FA, AD, RD, and MD maps were obtained. A FW map was also generated for each patient. The FW map characterizes water molecules that move freely and are not confined by their environment. Therefore, it represents the proportion of FW content present within each voxel. A visual illustration of the DTI maps obtained from both standard and FWE DTI models in this study is shown in Supplementary Fig. S1. Average FA, AD, RD, and MD values of the core, enhanced, and edema regions for each patient were obtained by multiplying the binarized tumor segmented images with the FA, AD, RD, and MD maps. The mean FW features of both tumorous and NAWM regions were discarded from this study to evaluate the performance of the standard and FWE DTI models using the same set of features.

C. Normal-appearing white matter

In the context of gliomas, NAWM is defined as the WM that is not visibly affected by the pathophysiological process of the tumor as it appears normal in conventional MRI sequences^{25,26,27,28,29,30,31}. However, glioma studies have analyzed NAWM using different approaches. One approach is to analyze a specific region of interest (ROI) in the WM surrounding the tumor region (core + enhanced + edema region), known as ipsilateral NAWM, and in the WM region of the contralateral hemisphere where the tumor is absent, known as contralateral NAWM. Other researchers^29,30,31, instead of analyzing a specific ROI in the contralateral/ipsilateral NAWM, have analyzed the entire WM region excluding the whole tumor region (i.e., core + enhanced + edema region) and termed it NAWM, because they found that the tumor infiltration process can diffuse beyond the vicinity of the tumor region, thereby causing global microstructural changes to the WM tissue. They demonstrated that globally assessing WM excluding the whole tumor region can provide valuable information on overall WM integrity and health^29,30,31. All the above-mentioned studies used the term “contralateral NAWM/ipsilateral NAWM” when they focused on region-specific analysis and “NAWM” when they aimed to globally evaluate the effect of the tumor on WM structural integrity. Similar to the previous studies^29,30,31, we analyzed the entire WM region excluding the tumor and considered it as NAWM because the tumor can affect not only the neighboring/adjacent WM tissue (locally) but can also produce a diffuse effect in other parts of the brain (globally). More importantly, 42% of the patients (36 out of 86) in our study had tumors located in both brain hemispheres; hence, identifying contralateral and ipsilateral NAWM is not possible in these patients.

To obtain NAWM images, the following processing steps were performed: (i) a WM mask image was extracted by thresholding the FA map (FA > 0.2) of each patient, which is a standard method^25,32,33,34, and (ii) the segmented whole tumor region (which includes the core, enhanced, and edema regions) was subtracted from this WM mask image to obtain the NAWM mask. This NAWM mask was then multiplied with the FA, AD, RD, and MD maps. The whole brain average values for the above maps were obtained for each patient. We used the FSL (https://fsl.fmrib.ox.ac.uk/fsl/fslwiki) functions fslmaths and fslstats for the above calculations. The workflow diagram is shown in Fig. 3.
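The masking and averaging steps can be sketched with NumPy arrays in place of the fslmaths and fslstats calls used in the study. The function names are hypothetical, and the sketch assumes that the FA map, the metric maps, and the whole-tumor mask are already co-registered arrays of the same shape.

```python
import numpy as np

def nawm_mask(fa_map, tumor_mask, fa_threshold=0.2):
    """WM mask (FA > threshold) with the whole tumor region removed."""
    white_matter = np.asarray(fa_map) > fa_threshold
    return white_matter & ~np.asarray(tumor_mask).astype(bool)

def masked_mean(metric_map, mask):
    """Mean of a DTI metric over the voxels selected by a binary mask."""
    mask = np.asarray(mask).astype(bool)
    if not mask.any():
        return float("nan")  # empty region, e.g. no enhancing tumor
    return float(np.asarray(metric_map)[mask].mean())
```

The same `masked_mean` helper applies to the core, enhanced, and edema masks, so the 12 tumorous features and the 4 NAWM features can be produced by one routine.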

Machine learning

All the DTI features/measures obtained from the tumorous and NAWM regions using the standard DTI model and FWE DTI model (described above in "Diffusion tensor image processing") are given in Table 1. Different machine learning algorithms were used to classify LGG and HGG patients based on the above-mentioned features. These algorithms include support vector machine (linear and radial basis function kernels), random forest, naïve Bayes, AdaBoost, and gradient boosting (these results are given in Supplementary Table S1). After assessing their performance (based on the sensitivity, specificity, and area under the curve of the receiver operating characteristic (AUC-ROC) score), we devised a stacked ensemble learning architecture that combines the predictive power of the individual models by providing each model’s prediction as an input to a meta learner. The ensemble machine learning model (EMLM) is shown in Fig. 4. Feature selection was performed using Weka software (version 3.8.6, https://www.cs.waikato.ac.nz/ml/weka/) with the CfsSubsetEval attribute evaluator and the BestFirst search method. Features that were less correlated with each other and highly correlated with the target class (LGG, target value 0; HGG, target value 1) were selected. We adopted two approaches: (1) all the features were used for classification and (2) only the Weka-selected features were used. We have presented only the performance metrics of the EMLM using Weka-selected features in this article. The results obtained from the pre-selected features are provided in Supplementary Tables S2 and S3. Custom Python (version 3.9.7, Jupyter Notebook) code was written for the EMLM using the scikit-learn (sklearn), seaborn, NumPy, pandas, and matplotlib packages. The model parameters include n_estimators = 200 (for the random forest, AdaBoost, and gradient boosting classifiers) and random_state = 42; the models were evaluated using five-fold cross-validation.
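A stacked ensemble of the listed base classifiers could be assembled in scikit-learn roughly as follows. This is a sketch under stated assumptions rather than the authors' code: the meta learner is not named in the text, so logistic regression is assumed; standardizing the inputs of the support vector machines is likewise an assumption; and the helper names are hypothetical.

```python
import numpy as np
from sklearn.ensemble import (AdaBoostClassifier, GradientBoostingClassifier,
                              RandomForestClassifier, StackingClassifier)
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import StratifiedKFold, cross_val_predict
from sklearn.naive_bayes import GaussianNB
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

def build_stacked_ensemble(n_estimators=200, random_state=42):
    """Base learners named in the text, stacked under an assumed meta learner."""
    base_learners = [
        ("svm_linear", make_pipeline(
            StandardScaler(),
            SVC(kernel="linear", probability=True, random_state=random_state))),
        ("svm_rbf", make_pipeline(
            StandardScaler(),
            SVC(kernel="rbf", probability=True, random_state=random_state))),
        ("random_forest", RandomForestClassifier(
            n_estimators=n_estimators, random_state=random_state)),
        ("naive_bayes", GaussianNB()),
        ("adaboost", AdaBoostClassifier(
            n_estimators=n_estimators, random_state=random_state)),
        ("gradient_boosting", GradientBoostingClassifier(
            n_estimators=n_estimators, random_state=random_state)),
    ]
    # Each base model's out-of-fold prediction becomes an input to the meta learner.
    return StackingClassifier(
        estimators=base_learners,
        final_estimator=LogisticRegression(max_iter=1000),  # assumed meta learner
        cv=5,
    )

def cross_validated_predictions(X, y, n_estimators=200, random_state=42):
    """Out-of-fold class predictions from five-fold stratified cross-validation."""
    model = build_stacked_ensemble(n_estimators, random_state)
    folds = StratifiedKFold(n_splits=5, shuffle=True, random_state=random_state)
    return cross_val_predict(model, np.asarray(X), np.asarray(y), cv=folds)
```

Here `X` would hold one row per patient with the selected DTI features as columns, and `y` the grade labels (0 for LGG, 1 for HGG).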

Table 1 Shows the list of features extracted from the standard and the FWE DTI model in this study.

Ethical approval and consent to participate

The study was conducted according to the Declaration of Helsinki. The Institutional Ethics Committee (IEC Regn No. ECR/189/Inst/KL/2013/RR-16) at Sree Chitra Tirunal Institute of Medical Science and Technology, Thiruvananthapuram, India approved this study and waived patient informed consent because it is a retrospective study. The approval number is IEC/1177. All procedures were performed in accordance with relevant guidelines.

Results

Machine learning results on DTI metrics derived from the standard DTI model

A total of 12 DTI features were measured from the tumorous region, including FA, AD, RD, and MD values of core, enhanced, and edema regions. For the NAWM region, four DTI features, namely FA, AD, RD, and MD, were measured. The above 12 features of the tumorous region + 4 NAWM region features were considered together as whole brain features. Weka attribute selection was performed to remove the redundant features for each of the ROIs mentioned in Table 1 and these selected features were used to train the EMLM. The results for EMLM evaluated on Weka selected features are shown in Table 2. Refer to Supplementary Table S2 for the results evaluated on pre-selected features. Note that the performance metrics reported in Table 2 are the average sensitivity, specificity, and AUC-ROC scores across all five folds.
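For reference, the fold-averaged sensitivity and specificity can be computed as in the following sketch, which treats HGG (target value 1) as the positive class. The function names are hypothetical and the snippet is independent of any particular classifier.

```python
import numpy as np

def sensitivity_specificity(y_true, y_pred, positive=1):
    """Sensitivity = TP / (TP + FN); specificity = TN / (TN + FP)."""
    y_true = np.asarray(y_true)
    y_pred = np.asarray(y_pred)
    tp = np.sum((y_true == positive) & (y_pred == positive))
    fn = np.sum((y_true == positive) & (y_pred != positive))
    tn = np.sum((y_true != positive) & (y_pred != positive))
    fp = np.sum((y_true != positive) & (y_pred == positive))
    sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
    specificity = tn / (tn + fp) if (tn + fp) else float("nan")
    return float(sensitivity), float(specificity)

def mean_over_folds(folds):
    """Average (sensitivity, specificity) over (y_true, y_pred) pairs, one per fold."""
    scores = np.array([sensitivity_specificity(t, p) for t, p in folds])
    return tuple(float(v) for v in scores.mean(axis=0))
```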

Table 2 Shows the overall performance metrics of the EMLM when trained using the Weka selected DTI features of the tumorous region, NAWM region, and tumorous + NAWM region separately for the standard DTI model.

Machine learning results for DTI metrics derived from the FWE DTI model

A total of 12 DTI features were measured from the tumorous region, including FA, AD, RD, and MD values of core, enhanced, and edema regions. A total of four DTI features, i.e., FA, AD, RD, and MD were measured from the NAWM region. The above 12 tumorous region features + 4 NAWM region features were considered together as whole brain features. Weka attribute selection was performed to remove the redundant features for each of the ROIs mentioned in Table 1 and these selected features were used to train the EMLM. The results for EMLM evaluated on Weka selected features are shown in Table 3. Refer to Supplementary Table S3 for the results evaluated on pre-selected features.
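The idea behind correlation-based feature selection (CFS), which favors subsets whose features correlate with the class but not with each other, can be illustrated through its merit score, k·r_cf / sqrt(k + k(k − 1)·r_ff). The sketch below is not Weka's implementation: it assumes absolute Pearson correlation as the correlation measure, and it scores a given subset only, without reproducing the BestFirst search.

```python
import numpy as np

def cfs_merit(X, y, subset):
    """CFS merit of a feature subset (column indices of X).

    r_cf: mean absolute feature-class correlation
    r_ff: mean absolute pairwise feature-feature correlation
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    subset = list(subset)
    k = len(subset)
    r_cf = np.mean([abs(np.corrcoef(X[:, j], y)[0, 1]) for j in subset])
    if k == 1:
        return float(r_cf)
    r_ff = np.mean([abs(np.corrcoef(X[:, a], X[:, b])[0, 1])
                    for i, a in enumerate(subset) for b in subset[i + 1:]])
    return float(k * r_cf / np.sqrt(k + k * (k - 1) * r_ff))
```

Under this score, adding a feature that merely duplicates one already selected leaves the merit unchanged, whereas adding an informative and weakly correlated feature raises it, which is consistent with a small subset being retained from the 16 candidates.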

Table 3 Shows the overall performance metrics of the EMLM when trained with the Weka selected DTI features of the tumorous region, NAWM region, and tumorous + NAWM region for the FWE DTI model.

Discussion

The important findings of this study are as follows: (a) DTI metrics derived from the FWE DTI model classified LGG and HGG with higher sensitivity, specificity, and AUC-ROC score than those derived from the standard DTI model; (b) FW-corrected DTI is a robust approach to assess the WM structural integrity in glioma patients; (c) when the EMLM was trained with the Weka-selected features, i.e., FA of the edema and core regions and FA and MD of the NAWM region, chosen from the combined tumor + NAWM region features (16 pre-selected features in total from the FWE DTI model), it classified LGG and HGG with superior performance metrics (refer to Table 3). This demonstrates that (i) both tumorous and NAWM regions may be differentially affected between LGG and HGG patients, and (ii) only 4 out of 16 features are required to classify LGG and HGG patients with good sensitivity, specificity, and AUC-ROC score, indicating their potential role as neuroimaging biomarkers; and (d) the EMLM gives better classification accuracy than the single-classifier models routinely used in brain tumor studies.

Comparison between Tables 2 and 3 shows that the DTI features derived from FWE DTI gave superior sensitivity, specificity, and AUC-ROC scores in classifying LGG and HGG patients when compared with the DTI metrics derived from the standard DTI model. We found that the values of FA, AD, and RD, which reflect WM integrity, were abnormally low when using the standard DTI model as opposed to the FWE DTI model. For instance, when using the standard DTI model, the mean FA value for the core tumor region was 0.15 in LGG patients (n = 39) and 0.21 in HGG patients (n = 47). Similarly, for the enhanced region, the mean FA was 0.19 for LGG and 0.21 for HGG. On the other hand, when the FWE DTI model was used, these values were 0.44 in LGG and 0.50 in HGG for the tumor core region, and 0.46 in LGG and 0.47 in HGG for the enhanced region. The same phenomenon was observed for the AD and RD measures. The reliability of the FWE DTI model has been established by other researchers^17,24,35, who, similar to our study, observed an increase in the FA values after correcting for FW contamination. Starck et al.³⁵ observed an increase in the FA value of tumorous regions after FW elimination. Golub et al.²⁴ observed an increase in FA values in the artificially simulated lesion region after FW elimination. Pasternak et al.¹⁷ found an increase in FA values in the region near the ventricles and edema after FW elimination, which eventually improved the accuracy of fiber tractography in those regions. The poorer performance of the EMLM in classifying LGG and HGG patients when using the metrics from the standard DTI model can be attributed to FW contamination. This is evident from Tables 2 and 3: when using the FWE DTI model, an improvement in performance was observed in the tumorous, NAWM, and tumorous + NAWM ROIs.

To understand the effect of FW contamination, we performed an analysis by placing ROIs on the tumorous region and the corresponding region in the contralateral hemisphere (see Fig. 5). ROI 1 is located in the FW map at the high-water content region with mean FW value = 0.99 and ROI 2 is located in the contralateral hemisphere with a mean FW value = 0.68 (see Fig. 5b, g). From these figures and the mean FW values of ROIs 1 and 2, we can see that the contralateral hemisphere (ROI 2) is not contaminated with FW. We placed the same ROIs in the MD and FA maps of both the standard DTI (see Fig. 5c, h for the MD map and Fig. 5e, j for the FA map) and FWE DTI (see Fig. 5d, i for the MD map and Fig. 5f, k for the FA map) models. For the standard DTI model, the mean MD values in ROI 1 and ROI 2 were 1.245 × 10^–3 mm²/s and 0.894 × 10^–3 mm²/s, whereas for the FWE DTI model they were 0.685 × 10^–3 mm²/s and 0.873 × 10^–3 mm²/s, respectively. Similarly, the mean FA values in ROI 1 and ROI 2 were 0.145 and 0.784 for the standard DTI model, and 0.59 and 0.701 for the FWE DTI model. In comparison with healthy tissue (i.e., ROI 2) in the contralateral hemisphere, we can observe that (a) in ROI 1 (FW contaminated) of the standard DTI model, the FA value was very low (0.145) and the MD value was high (1.245 × 10^–3 mm²/s), and (b) after FW elimination, i.e., using the FWE DTI model, the FA (0.59) and MD (0.685 × 10^–3 mm²/s) measures moved toward those of healthy tissue. This analysis reveals that correction for FW contamination is required to obtain a robust estimate of the DTI measures.

The features of the FWE DTI model were able to classify LGG and HGG patients with superior sensitivity and specificity values when compared with the standard DTI model (see Tables 2 and 3). Now, if we look at Table 3 that shows the performance of features estimated from the FWE DTI model for the three different ROIs (i.e., tumorous region, NAWM region, and tumorous + NAWM region) we can infer that the tumorous + NAWM region gives superior classification accuracy when compared to the other two ROIs. This demonstrates that the features of both tumorous and NAWM regions play a critical role in classifying LGG and HGG patients. We believe that the tumorous and NAWM regions may be differentially affected in LGG and HGG patients. We attribute the subpar performance of the standard DTI model for the same ROI, i.e., tumorous + NAWM (Table 2) to the potential water contamination as reported in previous studies^17,24,35. Our results show that the 4 features (FA of edema and core for tumorous region, FA and MD of NAWM) out of 16 features from the tumorous + NAWM region can provide superior classification accuracy between LGG and HGG when compared with using all 16 features. Because no other FWE DTI model-based brain tumor classification studies were available, it is not clear why the EMLM performed better on the above four features. However, similar to our observation, studies using the standard DTI model reported FA of the core tumor region^8,36 as a valuable measure in classifying LGG and HGG. Similarly, other standard DTI model studies^37,38 have reported that the tumor edema region can aid in glioma classification. In addition to the tumor region DTI measures, our analysis identified FA and MD of NAWM as important features in classifying LGG and HGG patients. Our results concur with those of other studies that have also reported FA of NAWM^39,40,41 and MD of NAWM^25,29,31 to be important parameters in glioma classification.

Another important finding in this study is that the EMLM model performed better than individual classifiers (refer to Supplementary Table S1). Previous studies^{42,43,44,45,46} have employed various individual machine learning algorithms (single classifiers) to classify LGG and HGG based on radiomic features extracted from the tumorous region. However, none of these studies have specifically focused on training a machine learning model with features that capture the pathophysiological process of the tumor and its effects on WM structural integrity. Our findings align with those of other studies^47,48,49 that reported good accuracy, sensitivity, and specificity when they employed ensemble models compared to individual classifiers in glioma classification. This is because ensemble learning leverages multiple machine learning classifiers to achieve improved predictive capability compared with single classifiers⁵⁰. Another reason given by a study⁵¹ is that the diversity in data enhances the machine learning model’s discriminative ability. Our DTI features were diverse (both tumorous and NAWM regions), and this diverse data may have enabled the training process to capture discriminative information for robust classification. In addition, ensemble learning is more adaptable than single classifiers⁵².

Conclusion

Our results showed that the features derived using the FWE DTI model were able to classify LGG and HGG patients with superior accuracy when compared to those of the standard DTI model. Our results also demonstrate that FW contamination correction is crucial for obtaining reliable DTI measures. Remarkably, the best classification results were achieved when utilizing only the FA of core and edema and the FA and MD of NAWM among 16 diverse DTI features (tumorous + NAWM region) extracted from the FWE DTI model. These measures can serve as non-invasive neuroimaging biomarkers and can be confirmed in histopathological and longitudinal imaging studies. Furthermore, these findings suggest that assessing both the tumor and NAWM region can provide valuable insights into glioma classification. Our study also demonstrated that EMLM may be a more suitable machine learning approach than individual classifiers for features like ours.

Data availability

Due to the proprietary nature of the datasets obtained from medical institutions and in accordance with standard protocols governing patient data confidentiality, regrettably, we are unable to grant public access to the data. Requests for data access should be formally directed to venkateswaran@hyderabad.bits-pilani.ac.in.

References

Louis, D. N. et al. The 2016 World Health Organization classification of tumors of the central nervous system: A summary. Acta Neuropathol. 131, 803–820. https://doi.org/10.1007/s00401-016-1545-1 (2016).
Article PubMed Google Scholar
Louis, D. N. et al. The 2021 WHO classification of tumors of the central nervous system: A summary. Neuro Oncol. 23, 1231–1251. https://doi.org/10.1093/neuonc/noab106 (2021).
Article PubMed PubMed Central CAS Google Scholar
Sung, H. et al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 71, 209–249. https://doi.org/10.3322/caac.21660 (2021).
Article PubMed CAS Google Scholar
Lv, K. et al. Neuroplasticity of glioma patients: Brain structure and topological network. Front. Neurol. 13, 871613. https://doi.org/10.3389/fneur.2022.871613 (2022).
Article PubMed PubMed Central Google Scholar
Ji, M. et al. Detection of human brain tumor infiltration with quantitative stimulated Raman scattering microscopy. Sci Transl Med 7, 309ra163. https://doi.org/10.1126/scitranslmed.aab0195 (2015).
Article PubMed PubMed Central CAS Google Scholar
Kut, C. et al. Detection of human brain cancer infiltration ex vivo and in vivo using quantitative optical coherence tomography. Sci. Transl. Med. 7, 292ra100. https://doi.org/10.1126/scitranslmed.3010611 (2015).
Article PubMed PubMed Central Google Scholar
Stadlbauer, A. et al. Gliomas: Histopathologic evaluation of changes in directionality and magnitude of water diffusion at diffusion-tensor MR imaging. Radiology 240, 803–810. https://doi.org/10.1148/radiol.2403050937 (2006).
Article PubMed Google Scholar
Inoue, T., Ogasawara, K., Beppu, T., Ogawa, A. & Kabasawa, H. Diffusion tensor imaging for preoperative evaluation of tumor grade in gliomas. Clin. Neurol. Neurosurg. 107, 174–180. https://doi.org/10.1016/j.clineuro.2004.06.011 (2005).
Article PubMed Google Scholar
Jiang, L. et al. Analysis of DTI-derived tensor metrics in differential diagnosis between low-grade and high-grade gliomas. Front. Aging Neurosci. 9, 271. https://doi.org/10.3389/fnagi.2017.00271 (2017).
Article PubMed PubMed Central Google Scholar
Wang, S. et al. Differentiation between glioblastomas and solitary brain metastases using diffusion tensor imaging. Neuroimage 44, 653–660. https://doi.org/10.1016/j.neuroimage.2008.09.027 (2009).
Article PubMed Google Scholar
Duy Hung, N., Minh Duc, N., Van Anh, N. T., Thanh Dung, L. & He, D. V. Diagnostic performance of diffusion tensor imaging for preoperative glioma grading. Clin. Ter. 172, 315–321. https://doi.org/10.7417/CT.2021.2335 (2021).
Seow, P. et al. Neural fiber integrity in high- versus low-grade glioma using probabilistic fiber tracking. Acad. Radiol. 28, 1721–1732. https://doi.org/10.1016/j.acra.2020.09.007 (2021).
Betz, A. L., Iannotti, F. & Hoff, J. T. Brain edema: A classification based on blood-brain barrier integrity. Cerebrovasc. Brain Metab. Rev. 1, 133–154 (1989).
Papadopoulos, M. C. et al. Molecular mechanisms of brain tumor edema. Neuroscience 129, 1011–1020. https://doi.org/10.1016/j.neuroscience.2004.05.044 (2004).
Unterberg, A. W., Stover, J., Kress, B. & Kiening, K. L. Edema and brain trauma. Neuroscience 129, 1021–1029. https://doi.org/10.1016/j.neuroscience.2004.06.046 (2004).
Schonberg, T., Pianka, P., Hendler, T., Pasternak, O. & Assaf, Y. Characterization of displaced white matter by brain tumors using combined DTI and fMRI. Neuroimage 30, 1100–1111. https://doi.org/10.1016/j.neuroimage.2005.11.015 (2006).
Pasternak, O., Sochen, N., Gur, Y., Intrator, N. & Assaf, Y. Free water elimination and mapping from diffusion MRI. Magn. Reson. Med. 62, 717–730. https://doi.org/10.1002/mrm.22055 (2009).
Pierpaoli, C. & Jones, D. K. Removing CSF contamination in brain DT-MRIs by using a two-compartment tensor model. In International Society for Magnetic Resonance in Medicine Meeting 1215 (2004).
Papadakis, N. G. et al. Study of the effect of CSF suppression on white matter diffusion anisotropy mapping of healthy human brain. Magn. Reson. Med. 48, 394–398. https://doi.org/10.1002/mrm.10204 (2002).
Alexander, A. L., Hasan, K. M., Lazar, M., Tsuruda, J. S. & Parker, D. L. Analysis of partial volume effects in diffusion-tensor MRI. Magn. Reson. Med. 45, 770–780. https://doi.org/10.1002/mrm.1105 (2001).
Breiman, L. Random forests. Mach. Learn. 45, 5–32. https://doi.org/10.1023/A:1010933404324 (2001).
Isensee, F., Jaeger, P. F., Kohl, S. A. A., Petersen, J. & Maier-Hein, K. H. nnU-Net: A self-configuring method for deep learning-based biomedical image segmentation. Nat. Methods 18, 203–211. https://doi.org/10.1038/s41592-020-01008-z (2021).
Basser, P. J., Mattiello, J. & LeBihan, D. MR diffusion tensor spectroscopy and imaging. Biophys. J. 66, 259–267. https://doi.org/10.1016/S0006-3495(94)80775-1 (1994).
Golub, M., Neto Henriques, R. & Gouveia Nunes, R. Free-water DTI estimates from single b-value data might seem plausible but must be interpreted with care. Magn. Reson. Med. 85, 2537–2551. https://doi.org/10.1002/mrm.28599 (2021).
Sahin, S., Ertekin, E., Sahin, T. & Ozsunar, Y. Evaluation of normal-appearing white matter with perfusion and diffusion MRI in patients with treated glioblastoma. MAGMA 35, 153–162. https://doi.org/10.1007/s10334-021-00990-5 (2022).
Mehrabian, H., Lam, W. W., Myrehaug, S., Sahgal, A. & Stanisz, G. J. Glioblastoma (GBM) effects on quantitative MRI of contralateral normal appearing white matter. J. Neurooncol. 139, 97–106. https://doi.org/10.1007/s11060-018-2846-0 (2018).
Horvath, A. et al. Increased diffusion in the normal appearing white matter of brain tumor patients: Is this just tumor infiltration? J. Neurooncol. 127, 83–90. https://doi.org/10.1007/s11060-015-2011-y (2016).
Inglese, M. et al. Whole-brain N-acetylaspartate spectroscopy and diffusion tensor imaging in patients with newly diagnosed gliomas: A preliminary study. AJNR Am. J. Neuroradiol. 27, 2137–2140 (2006).
Jutten, K. et al. Diffusion tensor imaging reveals microstructural heterogeneity of normal-appearing white matter and related cognitive dysfunction in glioma patients. Front. Oncol. 9, 536. https://doi.org/10.3389/fonc.2019.00536 (2019).
Horvath, A. et al. Biexponential diffusion alterations in the normal-appearing white matter of glioma patients might indicate the presence of global vasogenic edema. J. Magn. Reson. Imaging 44, 633–641. https://doi.org/10.1002/jmri.25202 (2016).
Hope, T. R. et al. Serial diffusion tensor imaging for early detection of radiation-induced injuries to normal-appearing white matter in high-grade glioma patients. J. Magn. Reson. Imaging 41, 414–423. https://doi.org/10.1002/jmri.24533 (2015).
Taki, Y. et al. Linear and curvilinear correlations of brain white matter volume, fractional anisotropy, and mean diffusivity with age using voxel-based and region-of-interest analyses in 246 healthy children. Hum. Brain Mapp. 34, 1842–1856. https://doi.org/10.1002/hbm.22027 (2013).
Connor, M. et al. Regional susceptibility to dose-dependent white matter damage after brain radiotherapy. Radiother. Oncol. 123, 209–217. https://doi.org/10.1016/j.radonc.2017.04.006 (2017).
Vernooij, M. W. et al. White matter atrophy and lesion formation explain the loss of structural integrity of white matter in aging. Neuroimage 43, 470–477. https://doi.org/10.1016/j.neuroimage.2008.07.052 (2008).
Starck, L. et al. Effects of multi-shell free water correction on glioma characterization. Diagnostics (Basel) https://doi.org/10.3390/diagnostics11122385 (2021).
Liu, X. et al. MR diffusion tensor and perfusion-weighted imaging in preoperative grading of supratentorial nonenhancing gliomas. Neuro Oncol. 13, 447–455. https://doi.org/10.1093/neuonc/noq197 (2011).
Ma, L. & Song, Z. J. Differentiation between low-grade and high-grade glioma using combined diffusion tensor imaging metrics. Clin. Neurol. Neurosurg. 115, 2489–2495. https://doi.org/10.1016/j.clineuro.2013.10.003 (2013).
Min, Z. G., Niu, C., Rana, N., Ji, H. M. & Zhang, M. Differentiation of pure vasogenic edema and tumor-infiltrated edema in patients with peritumoral edema by analyzing the relationship of axial and radial diffusivities on 3.0T MRI. Clin. Neurol. Neurosurg. 115, 1366–1370. https://doi.org/10.1016/j.clineuro.2012.12.031 (2013).
Beppu, T. et al. Fractional anisotropy value by diffusion tensor magnetic resonance imaging as a predictor of cell density and proliferation activity of glioblastomas. Surg. Neurol. 63, 56–61. https://doi.org/10.1016/j.surneu.2004.02.034 (2005).
Kinoshita, M. et al. Fractional anisotropy and tumor cell density of the tumor core show positive correlation in diffusion tensor magnetic resonance imaging of malignant brain tumors. Neuroimage 43, 29–35. https://doi.org/10.1016/j.neuroimage.2008.06.041 (2008).
Lu, S., Ahn, D., Johnson, G. & Cha, S. Peritumoral diffusion tensor imaging of high-grade gliomas and metastatic brain tumors. AJNR Am. J. Neuroradiol. 24, 937–941 (2003).
Cui, G. et al. Machine-learning-based classification of lower-grade gliomas and high-grade gliomas using radiomic features in multi-parametric MRI. arXiv preprint arXiv:1911.10145 (2019).
Kumar, A. et al. Machine-learning-based radiomics for classifying glioma grade from magnetic resonance images of the brain. J. Pers. Med. https://doi.org/10.3390/jpm13060920 (2023).
Polly, F. P., Shil, S. K., Hossain, M. A., Ayman, A. & Jang, Y. M. 2018 International Conference on Information Networking (ICOIN). 813–817 (2018).
Vamvakas, A. et al. Imaging biomarker analysis of advanced multiparametric MRI for glioma grading. Phys. Med. 60, 188–198. https://doi.org/10.1016/j.ejmp.2019.03.014 (2019).
Lin, K., Cidan, W., Qi, Y. & Wang, X. Glioma grading prediction using multiparametric magnetic resonance imaging-based radiomics combined with proton magnetic resonance spectroscopy and diffusion tensor imaging. Med. Phys. 49, 4419–4429. https://doi.org/10.1002/mp.15648 (2022).
Gupta, N., Bhatele, P. & Khanna, P. Glioma detection on brain MRIs using texture and morphological features with ensemble learning. Biomed. Signal Process. Control 47, 115–125. https://doi.org/10.1016/j.bspc.2018.06.003 (2019).
Brunese, L., Mercaldo, F., Reginelli, A. & Santone, A. An ensemble learning approach for brain cancer detection exploiting radiomic features. Comput. Methods Prog. Biomed. 185, 105134. https://doi.org/10.1016/j.cmpb.2019.105134 (2020).
Chandra Joshi, R. et al. Ensemble based machine learning approach for prediction of glioma and multi-grade classification. Comput. Biol. Med. 137, 104829. https://doi.org/10.1016/j.compbiomed.2021.104829 (2021).
Ryu, J. & Walgampaya, C. Ensemble Classifier based on Misclassified Streaming Data. (2010).
Gong, Z., Zhong, P. & Hu, W. Diversity in machine learning. IEEE Access 7, 64323–64350 (2019).
Hansen, L. K. & Salamon, P. Neural network ensembles. IEEE Trans. Pattern Anal. Mach. Intell. 12, 993–1001. https://doi.org/10.1109/34.58871 (1990).

Acknowledgements

The authors thank all the patients who participated in this study as well as their caregivers who supported their participation.

Funding

Open access funding provided by Birla Institute of Technology and Science.

Author information

Authors and Affiliations

Department of Electrical and Electronics Engineering, Birla Institute of Technology and Science Pilani, Hyderabad Campus, Hyderabad, 500078, India
Sreejith Vidyadharan, B. V. V. S. N. Prabhakar Rao & Venkateswaran Rajagopalan
Department of Pharmacy, Birla Institute of Technology and Science Pilani, Hyderabad Campus, Hyderabad, 500078, India
P. Yogeeswari
Department of Imaging Sciences and Interventional Radiology, Sree Chitra Tirunal Institute for Medical Sciences and Technology, Trivandrum, 695011, India
C. Kesavadas

Contributions

S.V.—processed the data, performed analysis, and wrote the manuscript; B.V.V.S.N.P.R.—supervised the study and made extensive revisions to the manuscript; P.Y.—made extensive revisions to the manuscript; C.K.—acquired the data, analyzed the results, and made extensive revisions to the manuscript; V.R.—supervised data processing, performed data analysis, directed the entire study, and wrote the manuscript.

Corresponding author

Correspondence to Venkateswaran Rajagopalan.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Vidyadharan, S., Rao, B.V.V.S.N.P., Yogeeswari, P. et al. Accurate low and high grade glioma classification using free water eliminated diffusion tensor metrics and ensemble machine learning. Sci Rep 14, 19844 (2024). https://doi.org/10.1038/s41598-024-70627-9

Received: 23 May 2024
Accepted: 19 August 2024
Published: 27 August 2024
DOI: https://doi.org/10.1038/s41598-024-70627-9
