Abstract
This study aims to enhance the accuracy and efficiency of MRI-based brain tumor diagnosis by leveraging deep learning (DL) techniques applied to multichannel MRI inputs. MRI data were collected from 203 subjects, including 100 normal cases and 103 cases with 13 distinct brain tumor types. Non-contrast T1-weighted (T1w) and T2-weighted (T2w) images were combined with their average to form RGB three-channel inputs, enriching the representation for model training. Several convolutional neural network (CNN) architectures were evaluated for tumor classification, while fully convolutional networks (FCNs) were employed for tumor segmentation. Standard preprocessing, normalization, and training procedures were rigorously followed. The RGB fusion of T1w, T2w, and their average significantly enhanced model performance. The classification task achieved a top accuracy of 98.3% using the Darknet53 model, and segmentation attained a mean Dice score of 0.937 with ResNet50. These results demonstrate the effectiveness of multichannel input fusion and model selection in improving brain tumor analysis. While not yet integrated into clinical workflows, this approach holds promise for future development of DL-assisted decision-support tools in radiological practice.
Introduction
Magnetic Resonance Imaging (MRI) remains a cornerstone in the diagnosis and management of brain tumors due to its superior soft tissue contrast and non-invasive nature. However, traditional diagnostic workflows rely heavily on radiologists’ expertise, making the process labor-intensive and subject to inter-observer variability. While contrast-enhanced MRI is commonly used to improve lesion visibility, it may be contraindicated in certain patient populations—such as elderly individuals, those with renal impairment, or patients allergic to contrast agents. These limitations highlight the need for accurate and reliable diagnostic approaches based solely on non-contrast MRI sequences, particularly T1-weighted (T1w) and T2-weighted (T2w) imaging.
In recent years, artificial intelligence (AI), especially deep learning (DL), has significantly advanced medical image analysis by improving diagnostic accuracy and operational efficiency. Convolutional Neural Networks (CNNs) have demonstrated excellent performance in tumor classification, while Fully Convolutional Networks (FCNs) have proven effective for tumor boundary segmentation. These AI-based methods enable automated interpretation of complex medical images and offer substantial potential to assist radiologists in clinical decision-making while reducing diagnostic workload.
Despite these technological advances, key challenges persist. Non-contrast MRI inherently provides lower lesion-to-background contrast, complicating precise tumor boundary delineation. Additionally, deep learning models typically require large, well-annotated datasets and often lack interpretability—factors that are crucial for clinical adoption. Overcoming these barriers calls for the development of robust, generalizable AI frameworks capable of extracting meaningful features from non-contrast images while maintaining clinical relevance and transparency.
To address these challenges, this study proposes a deep learning-based approach that fuses T1w, T2w, and their linear average—(T1w + T2w)/2—into a three-channel RGB format to enrich image representation. This multichannel fusion supports the development of a unified DL pipeline for simultaneous tumor classification and segmentation. CNN architectures are applied for classification tasks, while FCN models are used for precise boundary delineation. The proposed framework presents a non-invasive, efficient, and clinically viable solution for patients who cannot undergo contrast-enhanced imaging. The main contributions of this study are as follows:
- A comparative evaluation of multiple CNN and FCN architectures using both internal hospital datasets and external benchmarks (BraTS) under a unified multichannel MRI fusion strategy.
- Integration of comprehensive performance metrics—including Dice coefficient, Intersection over Union (IoU), boundary F1-score, and Kappa index—to rigorously assess model performance in the presence of class imbalance.
- Delivery of practical insights into the impact of RGB fusion schemes and the comparative effectiveness of lightweight versus deep model architectures—an area that remains underexplored in previous literature.
Related works
In recent years, transfer learning has been widely adopted in medical imaging, allowing models pre-trained on large, diverse datasets to be fine-tuned for specific tasks with limited data. This approach has shown promise in improving diagnostic accuracy across various medical conditions, including brain tumors1,2. Additionally, federated learning, which enables model training on distributed datasets without sharing patient data, has emerged as a solution to privacy concerns in medical imaging. This technique not only enhances data security but also improves model generalizability across diverse clinical settings3,4. These advancements align with the goals of our study, which seek to enhance the reliability and applicability of DL in brain tumor diagnostics.
Tumor detection using MRI and deep learning
Magnetic Resonance Imaging (MRI) plays a pivotal role in the diagnosis of brain diseases, particularly tumors, due to its high-resolution anatomical detail and non-invasive nature. Recent studies have demonstrated that integrating machine learning, particularly deep learning (DL), into MRI analysis significantly improves diagnostic efficiency and accuracy5,6. However, reliance solely on automated systems may risk diminishing the value of clinical judgment. Accurate diagnosis not only depends on image interpretation but also on the integration of patient-specific clinical context, which may be overlooked if technology is overly prioritized.
A range of recent works has explored advanced imaging and machine learning methods to enhance brain tumor detection, highlighting both technical advancements and the practical challenges of clinical implementation7,8,9,10. The use of computer-aided diagnosis (CAD) and DL techniques in MRI has been shown to improve early tumor detection, classification, and overall treatment outcomes11,12,13,14,15,16. These approaches enhance precision and speed, but extremely high reported accuracies—approaching 100%—raise concerns about potential overfitting and limited generalizability in real-world settings.
Several studies have reported the effectiveness of 2D CNN models in brain tumor classification using MRI, reflecting substantial progress in neuro-oncology diagnostics17,18,19. Nonetheless, challenges related to generalizing these models across heterogeneous clinical environments and patient populations remain significant. Further, emerging machine learning techniques using MRI and fMRI have enabled tumor detection and disease progression prediction without labeled data, supporting semi-supervised and unsupervised learning frameworks.
CAD systems also demonstrate the ability to detect subtle brain metastases that are frequently missed during routine assessments by leveraging CNNs trained on unique and diverse datasets. To improve multiclass classification performance, many studies have adopted transfer learning strategies, which enhance diagnostic reliability and accelerate model convergence. However, achieving consistent performance across diverse healthcare environments with minimal recalibration remains a critical hurdle20,21,22,23.
CNN-based classification of brain tumors
Convolutional Neural Networks (CNNs), as a subset of deep learning (DL), have demonstrated exceptional performance in brain tumor classification using MRI. A dual CNN architecture based on VGG-16 has been reported to classify three major tumor types—meningioma, glioma, and pituitary—with near-perfect accuracy24. Similarly, a weighted ensemble model that integrates features from VGG19 and other CNN variants has shown superior performance on the Cancer Genome Atlas dataset25. In parallel, 3D residual networks have been developed to effectively identify primary brain metastasis sites, leveraging volumetric data to improve spatial context26.
Advanced CNN-based models increasingly integrate complementary technologies to enhance diagnostic precision. Examples include visual feature extraction pipelines, the use of blockchain for data security, and stacked ensemble frameworks that combine multiple network outputs27,28,29,30,31. While these innovations have shown strong potential, integrating such systems into real-world clinical workflows remains a complex challenge due to issues such as interoperability, computational demands, and clinician acceptance.
Transfer learning techniques continue to play a crucial role in improving CNN performance on limited medical datasets. Pretrained networks fine-tuned for brain tumor classification have shown high diagnostic accuracy32,33. Furthermore, optimized residual learning frameworks incorporating metaheuristic algorithms have further enhanced classification performance34,35. Nevertheless, ensuring consistent effectiveness across diverse imaging protocols and patient demographics remains an ongoing challenge.
Beyond classification, DL models are now being employed for segmentation and tumor grading. Hybrid models combining U-Net and DenseNet architectures have demonstrated improved accuracy in segmenting tumor boundaries36. Federated learning approaches, which preserve data privacy while enabling cross-institutional training, have also shown promise in enhancing model generalizability for automated segmentation tasks26,37. Additionally, ensemble DL systems have proven effective in non-invasive glioma grading using FLAIR MRI sequences38,39,40.
Recent developments also include fully integrated diagnostic pipelines capable of classifying diffuse gliomas from whole-slide images, thus potentially eliminating the need for histological examination41,42. Despite these advancements, widespread clinical adoption still requires rigorous validation to ensure the reliability, robustness, and interpretability of these models across varied clinical settings43,44,45,46.
The current state-of-the-art research
Recent advancements in artificial intelligence (AI) and deep learning (DL) have led to innovative approaches in both medical and industrial domains. Gangopadhyay et al. (2025) introduced a lightweight self-attention-based multi-task deep learning model that demonstrates high efficiency for industrial solar panel monitoring and environmental surveillance, showcasing the broader applicability of multi-task DL frameworks beyond healthcare47.
In the realm of medical AI, Roy et al. emphasized the importance of explainable AI (XAI) for enhancing transparency and trust in healthcare ecosystems, advocating for models that provide interpretable outputs suitable for clinical integration43. Extending this theme, Kabiraj et al. developed a weakly supervised model capable of multi-disease detection and localization in thoracic X-rays, offering both diagnostic accuracy and interpretability without requiring dense annotations31.
Roy et al. (2024) also proposed a forward attention-based deep network tailored for breast histopathology image classification, demonstrating how attention mechanisms can improve performance in high-resolution medical imaging tasks30. Earlier, Roy and Shoghi (2019) presented a pioneering computer-aided tumor segmentation technique using T2-weighted MR images from patient-derived tumor xenografts, laying the groundwork for precision tumor delineation in MRI15.
Together, these studies highlight key trends in state-of-the-art research: the integration of attention mechanisms, the move toward explainable and weakly supervised models, and the adoption of multi-task architectures—all aiming to enhance performance, interpretability, and clinical relevance. These innovations provide critical insights and inspiration for the present study’s focus on RGB-fused MRI-based brain tumor classification and segmentation using CNN and FCN models.
Methods and materials
Ethics approval
The study was conducted in accordance with the Declaration of Helsinki and approved by the Institutional Review Board of E-DA Hospital, Kaohsiung, Taiwan (Approval Number: EMRP-110-084, Date of Approval: 5 August 2021). Verbal and written information detailing all experimental procedures was provided to all participants, and written informed consent was obtained prior to their participation and the collection of experimental data.
MR imaging and protocols
The MRI scanners used in this study include a range of GE Signa Excite models operating at 1.5 Tesla (Table 1). The acquired images have a resolution of 512 × 512 pixels and are stored in DICOM format. The imaging parameters include a 90-degree flip angle, a specific absorption rate (SAR) ranging from 0.73 to 1.33 W/kg, and a slice thickness of 5 mm. The repetition time (TR) varies between 380 and 600 milliseconds, while the echo time (TE) ranges from 9 to 23 milliseconds. The scans were acquired with 6.5 mm slice spacing, with series descriptions for axial T1w and non-contrast sequences, on the GE Signa Excite GEMR1 1.5T scanner.
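For reference, the protocol values listed above correspond to standard DICOM header fields. The following sketch, which is not part of the study's MATLAB pipeline, shows how such parameters could be read with the pydicom library; the file name is hypothetical, and the expected values in the comments simply restate the ranges reported in the text.

```python
# Illustrative sketch: reading the reported acquisition parameters from a DICOM header.
import pydicom

ds = pydicom.dcmread("axial_t1w_slice.dcm")   # hypothetical file path

print("TR (ms):", ds.RepetitionTime)              # expected 380-600 ms
print("TE (ms):", ds.EchoTime)                    # expected 9-23 ms
print("Flip angle (deg):", ds.FlipAngle)          # expected 90
print("Slice thickness (mm):", ds.SliceThickness) # expected 5 mm
print("Pixel spacing (mm):", ds.PixelSpacing)     # expected [0.4688, 0.4688]
print("Matrix size:", ds.Rows, "x", ds.Columns)   # expected 512 x 512
```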
Tumor volume was calculated by manually delineating the tumor boundaries on each 2D MRI slice, a process performed by expert radiologists to ensure accuracy. The total pixel count within the tumor boundary across all slices was then accumulated to represent the tumor's total volume in 3D space. To convert this pixel-based volume into physical units (mm³), specific imaging parameters were applied. The slice thickness was 5 mm, and the pixel spacing in the X and Y dimensions was 0.4688 mm × 0.4688 mm. The voxel volume, representing the 3D unit of measurement, was determined as the product of slice thickness and pixel spacing: 5 mm × 0.4688 mm × 0.4688 mm ≈ 1.0988 mm³ per voxel. The final tumor volume in milliliters (mL) was obtained by multiplying the total number of tumor voxels by the voxel volume and converting from mm³ to mL (1 mL = 1000 mm³). This approach provides a precise estimation of tumor size that accounts for the slice thickness and pixel resolution of the MRI images.
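A minimal sketch of this volume calculation is shown below, assuming the manual delineations are available as per-slice binary masks; the mask arrays are placeholders, and only the voxel dimensions are taken from the text.

```python
import numpy as np

slice_thickness_mm = 5.0       # mm, from the acquisition protocol
pixel_spacing_mm = 0.4688      # mm, in both X and Y
voxel_volume_mm3 = slice_thickness_mm * pixel_spacing_mm ** 2   # ~1.0988 mm^3 per voxel

# One boolean mask per 512 x 512 slice covering the tumor extent (placeholder data).
masks = [np.zeros((512, 512), dtype=bool) for _ in range(20)]

tumor_voxels = sum(int(m.sum()) for m in masks)        # accumulate tumor pixels over slices
tumor_volume_mm3 = tumor_voxels * voxel_volume_mm3
tumor_volume_ml = tumor_volume_mm3 / 1000.0            # 1 mL = 1000 mm^3

print(f"Tumor volume: {tumor_volume_mm3:.1f} mm^3 = {tumor_volume_ml:.3f} mL")
```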
The enrolled samples
Table 2 presents a summary of brain tumor case demographics, categorizing participants by gender, case counts, and average age. The study includes 203 individuals who underwent MRI brain imaging using T1-weighted (T1w) and T2-weighted (T2w) sequences, with 100 individuals having no tumors and 103 diagnosed with brain tumors. The dataset used in this study consists of 3,038 2D MRI images for each T1w and T2w sequence, comprising 2,238 images without brain tumors and 800 images with tumors for each modality.
The average age of individuals without tumors was 53.2 years, with females averaging 54.9 years and males 52.3 years. In contrast, the average age of individuals with tumors was 57.7 years, with females averaging 55.9 years and males 59.5 years. Among the 13 different tumor types, meningioma (31 cases, avg. age 57.4 years) and metastasis (49 cases, avg. age 63.2 years) were the most common, with a higher prevalence among females. Glioblastoma and astrocytoma, on the other hand, were more frequently observed in males. Some rare tumor types, such as central neurocytoma and neurofibroma, had very few reported cases. Additionally, the age distribution varied among tumor types; glioblastoma cases had an overall average age of 65.5 years, with females averaging 61.0 years and males 79.0 years, while metastasis cases showed an older age distribution with an average of 63.2 years. This table provides a comprehensive demographic overview, offering valuable insights into the distribution of brain tumors across different age groups and genders, which supports further analysis in this study.
This data design provides a comprehensive understanding of the landscape of brain tumors diagnosed through MRI, emphasizing not only the prevalence of tumors but also the diversity of tumor types, their demographic distribution, and the implications for advancing medical imaging and diagnostic technologies. The rarity of certain tumors poses unique challenges, as the limited number of cases can hinder the ability of AI models to learn effectively, potentially affecting diagnostic accuracy. The variability in the frequency of different brain tumor types underscores the need to carefully account for this disparity when developing and training robust AI models for diagnostic imaging.
The fusion of T1w and T2w
Both T1w and T2w images are non-contrast. The images are normalized to the 0–1 range in double precision and then fused by assigning T1w to the red channel, T2w to the green channel, and either their average or a repetition of one modality to the blue channel (Fig. 1). Additionally, the image resolution for training was set to 300 × 300 pixels, although the original images are 512 × 512 pixels, to optimize computational efficiency.
Table 3 summarizes the eight different combinations used to generate 2D three-channel RGB images from T1-weighted (T1w), T2-weighted (T2w), and their linear average (T1w + T2w)/2. Each combination assigns T1w, T2w, or their average to the red, green, and blue channels, respectively, creating diverse RGB configurations for comparative analysis.
Figure 2 visualizes these combinations, demonstrating how variations in channel assignments influence the appearance of fused RGB images. This provides insight into the spatial and textural information encoded in each combination.
Example visualizations of T1w, T2w, and eight RGB combinations based on T1w, T2w, or their linear average (refer to Table 3).
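As an illustration of the fusion procedure (the original work was implemented in MATLAB), the following Python sketch builds one channel assignment of the kind listed in Table 3, assuming co-registered T1w and T2w slices are already loaded as 2D arrays; the use of scikit-image for resizing and the specific channel order shown are assumptions for illustration only.

```python
import numpy as np
from skimage.transform import resize

def normalize01(img):
    """Rescale an image to the [0, 1] range in double precision."""
    img = img.astype(np.float64)
    return (img - img.min()) / (img.max() - img.min() + 1e-12)

def fuse_rgb(t1w, t2w, order=("avg", "t2", "t1"), size=(300, 300)):
    """Build a three-channel image from T1w, T2w, and their linear average."""
    t1n, t2n = normalize01(t1w), normalize01(t2w)
    channels = {"t1": t1n, "t2": t2n, "avg": (t1n + t2n) / 2.0}
    rgb = np.stack([channels[c] for c in order], axis=-1)
    # Downsample from 512 x 512 to the 300 x 300 training resolution.
    return resize(rgb, (size[0], size[1], rgb.shape[-1]), anti_aliasing=True)

# Example usage: fused = fuse_rgb(t1w_slice, t2w_slice, order=("avg", "t2", "t1"))
```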
The CNN models for classification of MRI images
This study employs Convolutional Neural Networks (CNNs) for the classification of MRI brain images using non-contrast T1-weighted (T1w) and T2-weighted (T2w) sequences. The dataset consists of MRI scans categorized into two groups: Tumor and No Tumor. To enhance feature extraction, the images undergo preprocessing steps, including resizing to 300 × 300 pixels, intensity normalization to the (0,1) range, and fusion of T1w, T2w, and (T1w + T2w)/2 into an RGB channel representation.
Several CNN architectures are utilized for classification, including VGG19, VGG16, Darknet19, Darknet53, ResNet50, and ResNet101. These models are trained to distinguish between tumor and non-tumor images, leveraging deep learning techniques for automated feature extraction and decision-making. The training process follows a well-defined set of hyperparameters. The CNN models are trained for 30 epochs with a batch size of 30, using a learning rate of 0.001. To ensure robust validation, a 5-fold cross-validation strategy is implemented. The ADAM optimizer is employed for model optimization, while cross-entropy loss is used as the loss function to enhance classification accuracy. The dataset was split into training, validation, and testing sets with a ratio of 70%, 10%, and 20%, respectively. The performance of the CNN models is evaluated using key classification metrics, including accuracy, sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV).
The final classification results are based on the CNN models’ predictive performance, providing insights into their effectiveness in distinguishing brain tumor presence from non-tumor cases. The workflow for MRI brain image classification using CNN models is summarized in Fig. 3, detailing the step-by-step process from image loading and preprocessing to training, evaluation, and result interpretation.
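To make the classification setup concrete, the sketch below is an analogous PyTorch illustration rather than the authors' MATLAB Deep Learning Toolbox implementation. The hyperparameters follow the text (300 × 300 inputs, batch size 30, learning rate 0.001, ADAM, cross-entropy, 30 epochs); the dataset folder layout and the choice of ResNet50 as the example backbone are assumptions.

```python
import torch
import torch.nn as nn
from torch.utils.data import DataLoader
from torchvision import datasets, models, transforms

transform = transforms.Compose([
    transforms.Resize((300, 300)),   # match the 300 x 300 training resolution
    transforms.ToTensor(),           # scales intensities to [0, 1]
])

# Hypothetical layout: fused_rgb/train/Tumor and fused_rgb/train/NoTumor subfolders.
train_set = datasets.ImageFolder("fused_rgb/train", transform=transform)
train_loader = DataLoader(train_set, batch_size=30, shuffle=True)

device = "cuda" if torch.cuda.is_available() else "cpu"
model = models.resnet50(weights=None)              # one of the evaluated backbone families
model.fc = nn.Linear(model.fc.in_features, 2)      # binary head: tumor vs. no tumor
model = model.to(device)

criterion = nn.CrossEntropyLoss()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

for epoch in range(30):
    model.train()
    for images, labels in train_loader:
        images, labels = images.to(device), labels.to(device)
        optimizer.zero_grad()
        loss = criterion(model(images), labels)
        loss.backward()
        optimizer.step()
```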
The FCN models for MRI image segmentation
This study employs Fully Convolutional Networks (FCNs) for MRI brain tumor segmentation using non-contrast T1-weighted (T1w) and T2-weighted (T2w) sequences. The dataset includes MRI scans with boundary-labeled tumor regions, covering 13 different brain tumor types. The preprocessing steps involve resizing images to 300 × 300 pixels, intensity normalization to the (0,1) range, and fusing T1w and T2w images into an RGB representation. To optimize performance, hyperparameters are set with a batch size of 20, 100 epochs, and a learning rate of 0.001, utilizing ADAM as the optimizer. The dataset was split into training, validation, and testing sets with a ratio of 70%, 10%, and 20%, respectively. The FCN models are trained using multiple backbone architectures, including Xception, ResNet50, ResNet18, MobileNetV2, and InceptionResNetV2, to achieve precise tumor boundary segmentation.
The segmentation performance is evaluated using key metrics such as global accuracy, mean accuracy, mean intersection over union (Mean IoU), weighted IoU, boundary F1-score (BF), and Dice score. The results obtained from the FCN models provide insights into their effectiveness in segmenting brain tumors from MRI scans. The overall workflow for MRI brain tumor segmentation using FCN models is summarized in Fig. 4, illustrating the step-by-step process from image loading and preprocessing to model training, evaluation, and final segmentation results.
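A comparable sketch for the segmentation side is given below, again as a hedged PyTorch illustration rather than the original MATLAB implementation; torchvision's fcn_resnet50 stands in for the ResNet50-backbone FCN, the learning rate and optimizer follow the text, and the data pipeline is omitted.

```python
import torch
import torch.nn as nn
from torchvision.models.segmentation import fcn_resnet50

device = "cuda" if torch.cuda.is_available() else "cpu"
# Two classes (background vs. tumor); no pretrained weights are downloaded here.
model = fcn_resnet50(weights=None, weights_backbone=None, num_classes=2).to(device)

criterion = nn.CrossEntropyLoss()                          # per-pixel loss over the two classes
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)  # learning rate from the text

def train_step(images, masks):
    """One optimization step. images: (N, 3, 300, 300) floats; masks: (N, 300, 300) int64 labels."""
    model.train()
    images, masks = images.to(device), masks.to(device)
    optimizer.zero_grad()
    logits = model(images)["out"]          # (N, 2, 300, 300) per-pixel class scores
    loss = criterion(logits, masks)
    loss.backward()
    optimizer.step()
    return loss.item()
```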
A detailed summary of the selected CNN and FCN architectures is presented in Table 4. These architectures were deliberately chosen to represent a diverse set of network designs, including:
- Lightweight architectures (e.g., MobileNetV2, DarkNet19) optimized for fast inference and low-resource environments,
- Deep residual networks (e.g., ResNet18, ResNet50, ResNet101) known for their strong feature extraction and gradient stability,
- Hybrid or advanced architectures (e.g., InceptionResNetV2, Xception) that leverage inception modules and depthwise separable convolutions for enhanced representational power.
This architectural diversity enables a comprehensive evaluation of how different model complexities and design philosophies impact classification and segmentation performance in brain MRI analysis.
Evaluated performance of CNN and FCN models
The performance of the classification model is evaluated using a confusion matrix, which provides metrics such as accuracy, sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV), as defined in Eqs. (1)–(5). These metrics are derived from true positives (TP), true negatives (TN), false positives (FP), and false negatives (FN) (Fig. 5). The Kappa consistency statistic is defined in Eq. (6), where Po represents the observed agreement and Pe denotes the expected agreement by chance. Additionally, an ANOVA (Analysis of Variance) test was conducted to examine significant differences in performance metrics across image (RGB) types and CNN models. To visually analyze model performance, a radar plot and a bar chart were employed to illustrate the distribution of classification accuracy across CNN models. Collectively, these metrics provide a comprehensive assessment of the classification precision and reliability of the presented models for MRI image analysis. A P-value smaller than 0.05 was considered statistically significant. All experiments were conducted on a workstation with an NVIDIA RTX A6000 GPU (48 GB VRAM) and 256 GB RAM, running 64-bit Windows 11, using MATLAB R2023b with the Deep Learning Toolbox. To further assess the generalizability of the trained models, external validation was conducted using the BraTS benchmark dataset, as described in Sect. 4.3.
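For concreteness, the classification metrics and the Kappa statistic of Eqs. (1)–(6) can be computed directly from the confusion-matrix counts, as in the following sketch; the counts in the usage example are arbitrary and are not study results.

```python
def classification_metrics(tp, tn, fp, fn):
    """Standard confusion-matrix metrics plus Cohen's kappa for a binary task."""
    n = tp + tn + fp + fn
    accuracy    = (tp + tn) / n
    sensitivity = tp / (tp + fn)      # recall for the tumor class
    specificity = tn / (tn + fp)
    ppv         = tp / (tp + fp)      # positive predictive value
    npv         = tn / (tn + fn)      # negative predictive value

    # Cohen's kappa: observed agreement Po vs. chance agreement Pe (Eq. 6).
    po = accuracy
    pe = ((tp + fp) * (tp + fn) + (fn + tn) * (fp + tn)) / n ** 2
    kappa = (po - pe) / (1 - pe)
    return dict(accuracy=accuracy, sensitivity=sensitivity, specificity=specificity,
                ppv=ppv, npv=npv, kappa=kappa)

# Example with arbitrary counts (not from the study):
print(classification_metrics(tp=780, tn=2200, fp=38, fn=20))
```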
Segmentation performance was evaluated using a comprehensive set of metrics, including global accuracy, mean accuracy, mean intersection over union (Mean IoU), weighted IoU, boundary F1 score (BF), and the Dice score, as defined in Eqs. (7)–(14). Additionally, the mean absolute error (MAE) between the true tumor volume (mm³) and the volume estimated by the FCN models was calculated to assess segmentation accuracy, as shown in Eq. (15). These metrics provide an in-depth evaluation of the model’s capability to accurately segment MRI images, offering valuable insights into both overall segmentation precision and the delineation of tumor boundaries.
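A corresponding sketch for the segmentation metrics is given below, limited to Dice, IoU, and the volume MAE of Eq. (15); the boundary F1 score is omitted for brevity, and the voxel dimensions are those reported in the acquisition section.

```python
import numpy as np

VOXEL_MM3 = 5.0 * 0.4688 * 0.4688   # slice thickness x pixel spacing^2, ~1.0988 mm^3

def dice_iou(pred, truth):
    """Dice and IoU for boolean masks of identical shape (slice or volume)."""
    inter = np.logical_and(pred, truth).sum()
    union = np.logical_or(pred, truth).sum()
    dice = 2.0 * inter / max(pred.sum() + truth.sum(), 1)
    iou = inter / max(union, 1)
    return dice, iou

def volume_mae_ml(pred_voxels, true_voxels):
    """MAE between predicted and true tumor volumes (inputs are per-case voxel counts), in mL."""
    pred = np.asarray(pred_voxels, dtype=float) * VOXEL_MM3 / 1000.0
    true = np.asarray(true_voxels, dtype=float) * VOXEL_MM3 / 1000.0
    return float(np.mean(np.abs(pred - true)))
```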
Results
Evaluation of RGB fusion and CNN models for tumor classification
Table 5 presents the results of an ANOVA test conducted to evaluate the influence of the six CNN architectures and the ten image types (T1w, T2w, and eight RGB channel combinations) on the performance metrics Sensitivity, Specificity, NPV, PPV, Accuracy, and Kappa. Both the image type and the CNN model exhibited statistically significant effects, as indicated by the F-values and P-values. Specifically, the CNN models had a greater impact on Sensitivity (F = 7.91, P < 0.001), Specificity (F = 4.95, P < 0.05), NPV (F = 8.71, P < 0.001), PPV (F = 5.69, P < 0.05), Accuracy (F = 28.74, P < 0.001), and Kappa (F = 29.26, P < 0.001), reflecting their critical role in classification performance. The RGB combinations, in turn, significantly influenced Specificity (F = 2.11, P = 0.049) and PPV (F = 2.38, P = 0.027), demonstrating the importance of appropriate channel selection in enhancing model performance. These findings, detailed in the supplementary material, underscore the combined impact of CNN architecture and RGB fusion strategy in achieving superior tumor classification outcomes.
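A two-way ANOVA of this kind can be reproduced with standard statistical tooling. The sketch below assumes the per-configuration results have been collected in a long-format table; the column names and file name are illustrative and are not those used in the study.

```python
import pandas as pd
from statsmodels.formula.api import ols
from statsmodels.stats.anova import anova_lm

# Assumed columns: 'image_type' (T1w, T2w, RGB1..RGB8), 'cnn' (VGG16, ..., Darknet53),
# and one column per metric, e.g. 'accuracy'.
df = pd.read_csv("classification_results.csv")   # hypothetical results file

model = ols("accuracy ~ C(image_type) + C(cnn)", data=df).fit()
print(anova_lm(model, typ=2))   # F and P values for the two factors
```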
A comprehensive overview of the performance metrics for single-modality inputs (T1w and T2w) and eight RGB channel combinations derived from T1w, T2w, and their linear average ((T1w + T2w)/2) is provided in the supplementary material. Figure 6 presents a radar plot illustrating the performance metrics (Accuracy, Sensitivity, Specificity, Positive Predictive Value (PPV), Negative Predictive Value (NPV), and Kappa index) for 60 model configurations (6 CNN models × 10 image types). Among these, two standout configurations—M30 (ResNet101 with RGB channels (T1 + T2)/2, T2, T1) and M46 (Darknet53 with RGB channels T2, T1, (T1 + T2)/2)—achieved exceptional performance, with all metrics exceeding 0.95. These results highlight the superior ability of these specific RGB configurations and CNN architectures to leverage multi-modality imaging features effectively. The radar plot visually emphasizes the high performance of these models relative to the other configurations, demonstrating the critical role of appropriate architecture and channel selection in achieving robust classification results for tumor detection.
Figure 7 presents the Kappa index performance across T1w, T2w, and the eight RGB combinations evaluated with the six CNN architectures. The bar chart highlights that certain configurations, such as the eighth combination (average in red, T2w in green, and T1w in blue), consistently achieved higher Kappa scores (e.g., 0.960 with Darknet53), demonstrating their superiority in retaining relevant features for classification. This comparative analysis underscores the importance of selecting appropriate channel assignments to maximize model performance, showcasing the potential of RGB fusion for enhancing multi-modality image analysis in brain tumor detection.
The performance of individual MRI modalities (T1w and T2w) and eight RGB channel combinations derived from these modalities, evaluated across six CNN architectures (VGG19, VGG16, Darknet19, Darknet53, ResNet50, and ResNet101), is detailed in supplementary material. Metrics reported include Accuracy, Sensitivity, Specificity, Positive Predictive Value (PPV), Negative Predictive Value (NPV), and the Kappa index. Among all configurations, Darknet53 achieved the highest Kappa index (0.960) using the sixth RGB combination, which assigns the linear average of T1w and T2w to the red channel, T2w to the green channel, and T1w to the blue channel. This combination also demonstrated outstanding performance, with an Accuracy of 0.983, Sensitivity of 0.992, Specificity of 0.963, PPV of 0.984, and NPV of 0.981, showcasing its superior capability to extract and leverage complementary information from the fused modalities.
Notably, while T1w and T2w performed well individually, achieving Kappa indices of 0.905 and 0.906 with ResNet101 and ResNet50, respectively, the RGB combinations consistently outperformed single modalities across most models. The findings underscore the advantage of fusing T1w and T2w into RGB formats, particularly the sixth combination, which maximizes diagnostic performance by enhancing feature representation. These results validate the importance of exploring RGB fusion strategies for improving CNN-based brain tumor classification.
Table 6 presents the performance metrics for ten image types, including single-modality images (T1w and T2w) and eight RGB fusion combinations, evaluated using the CNN models selected for their maximum Kappa index. Among the single-modality inputs, ResNet101 achieved the highest Kappa (0.906) for T1w (Model M6), and ResNet50 achieved the same Kappa (0.906) for T2w (Model M11). For the RGB fusion combinations, Darknet53 consistently demonstrated superior performance across multiple configurations. Notably, the sixth RGB combination (the average of T1w and T2w as red, T2w as green, and T1w as blue) yielded the highest Kappa index of 0.960 with Darknet53 (Model M46), along with outstanding accuracy (0.983), sensitivity (0.992), and specificity (0.963). Similarly, ResNet101 with the third RGB combination achieved a high Kappa index of 0.959 (Model M30). These findings emphasize the effectiveness of RGB fusion in enhancing classification performance compared to single-modality images, and highlight the importance of testing various CNN architectures and RGB configurations to identify setups that maximize diagnostic performance in brain tumor classification.
The ResNet101 and Darknet53 models, trained with the ADAM optimizer, displayed strong and consistent performance across all metrics, with Kappa indices above 0.95. These results affirm the models’ robustness and accuracy in detecting the presence of tumors on MRI, making them reliable tools for medical diagnostics. A Kappa statistic of 0.96 indicates an excellent level of agreement. The Darknet53 model, using the sixth RGB combination, proves particularly effective for the following reasons:
1. Enhanced Feature Representation with RGB Channels: The sixth combination assigns the linear average of T1w and T2w to the red channel, which helps to balance the intensity differences and retain complementary spatial information present in both modalities. This fusion enhances the model’s ability to capture subtle differences that may not be evident when using individual modalities.
2. Preservation of Modality-Specific Strengths: Assigning T2w to the green channel leverages its higher sensitivity for detecting water content and fluid-related abnormalities, while assigning T1w to the blue channel highlights anatomical details. This channel distribution preserves the strengths of each modality, making them more accessible for the model to process.
3. Model Compatibility with RGB Input: ResNet101 and Darknet53, as deeper CNN architectures, are designed to efficiently process RGB inputs. The sixth combination aligns well with each architecture’s capacity to learn complex patterns from multi-channel data, leading to improved classification performance.
4. Complementary Information Fusion: By integrating T1w, T2w, and their linear average, the sixth combination provides a richer input space, allowing the model to extract and leverage complementary information. This reduces ambiguity in tumor classification and improves overall diagnostic accuracy.
These factors collectively explain why ResNet101 and Darknet53, when paired with the sixth RGB combination, achieve the highest Kappa indices and excellent performance across the other metrics. This combination effectively maximizes the synergy between the imaging modalities and the model architecture.
Analysis of segmentation performance across brain tumor types and FCN models
Table 7 presents the results of an ANOVA test evaluating the effects of tumor type and Fully Convolutional Networks (FCNs) on various segmentation performance metrics. Five FCN architectures were assessed for segmenting 13 brain tumor types. The metrics include Global Accuracy, Mean Accuracy, Mean Intersection over Union (IoU), Weighted IoU, Mean Boundary F1 (BF) Score, Dice coefficient, True Volume, Estimated Volume, and MAE. The ANOVA results reveal that certain performance metrics did not show significant differences across brain tumor types. Specifically, Global Accuracy (P = 0.856), Mean Accuracy (P = 0.055), Weighted IoU (P = 0.660), and Estimated Volume (P = 0.091) were not significantly influenced by the type of tumor. These findings suggest that these metrics are less sensitive to tumor-specific variations and are more likely to reflect the general performance of the segmentation models across different tumor categories. This highlights the robustness of these indices in maintaining consistent performance irrespective of tumor type.
The results indicate that the type of tumor had a statistically significant impact on several metrics, including Mean IoU (F = 3.323, P < 0.001), BF Score (F = 2.455, P = 0.005), and Dice (F = 3.073, P < 0.001). In contrast, FCN models demonstrated a highly significant influence on all metrics, including Global Accuracy (F = 88.377, P < 0.001), Mean Accuracy (F = 8.059, P < 0.001), Mean IoU (F = 71.049, P < 0.001), Weighted IoU (F = 71.227, P < 0.001), BF Score (F = 212.725, P < 0.001), and Dice (F = 65.755, P < 0.001). Additionally, FCN models significantly affected volume-related metrics, such as Estimated Volume (F = 5.469, P < 0.001) and MAE (F = 59.281, P < 0.001). These findings highlight the critical role of FCN architecture in determining segmentation performance while also acknowledging the influence of tumor type on certain metrics. This underscores the importance of model selection and tumor-specific considerations in achieving accurate segmentation outcomes.
Table 8 presents the performance metrics of five Fully Convolutional Network (FCN) architectures for brain tumor segmentation tasks. Metrics evaluated include global accuracy, mean accuracy, weighted IoU, boundary F1 (BF) score, mean IoU, Dice coefficient, and mean absolute error (MAE). ResNet50 demonstrated the highest global accuracy (0.996), mean accuracy (0.936), weighted IoU (0.882), BF score (0.992), and mean IoU (0.902), with an impressive Dice score of 0.937 and a relatively low MAE of 3.691. MobileNetV2 and ResNet18 also achieved competitive results, particularly excelling in Dice scores (0.928 and 0.932, respectively) and maintaining low MAE values (3.592 and 4.504, respectively). In contrast, InceptionResNetV2 displayed the highest MAE of 26.597, indicating challenges in volume estimation despite relatively strong global accuracy (0.986). Xception performed well in terms of Dice score (0.917) but exhibited the lowest weighted IoU (0.849) and a relatively higher MAE (9.513). These findings highlight ResNet50 as the most robust architecture overall, with consistent performance across all metrics, while the other architectures showed varying strengths and weaknesses in segmentation tasks.
The analysis highlights ResNet50’s effectiveness in segmenting brain tumors, achieving exceptional performance in most metrics across tumor types (Table 9). However, the findings also emphasize the need for further refinement in handling rare tumor types and morphologically complex cases, such as metastasis. The inclusion of sample sizes strengthens the reliability of these evaluations, facilitating a deeper understanding of the model’s clinical applicability and areas for improvement.
Table 10 provides a detailed analysis of estimated tumor volumes and the associated MAE for various FCN models, including InceptionResNetV2, MobileNetV2, ResNet18, ResNet50, and Xception, across 13 tumor types. The true tumor volume serves as the baseline for comparison. ResNet50 consistently demonstrated the lowest MAE across most tumor types, indicating its superior accuracy in volume estimation. For instance, ResNet50 achieved an MAE of 1.4 for astrocytoma, 0.3 for central neurocytoma, and 1.0 for glioblastoma. In contrast, InceptionResNetV2 exhibited higher MAE values, such as 28.5 for astrocytoma and 54.1 for ependymoma, suggesting challenges in precise volume estimation. MobileNetV2 and ResNet18 also performed well, with moderate MAE values for various tumor types. Xception, while achieving competitive performance for some tumor types such as gangliocytoma (MAE = 0.1), displayed higher errors for larger and more heterogeneous tumors, such as ependymoma (MAE = 15.2). These findings highlight ResNet50 as the most robust model for accurate tumor volume estimation, with other models showing varying degrees of effectiveness depending on tumor type and complexity.
This study demonstrates the effectiveness of five FCN models (InceptionResNetV2, MobileNetV2, ResNet18, ResNet50, and Xception) in brain tumor segmentation using a diverse dataset of 13 tumor types. ANOVA results reveal that the FCN model architecture significantly impacts all performance metrics, including global accuracy, mean IoU, weighted IoU, boundary F1 score, and Dice coefficient, while tumor type influences only select metrics such as Dice and mean IoU. ResNet50 stands out as the most robust model, achieving the highest Dice score and demonstrating consistent performance across all metrics. Furthermore, the model demonstrates superior volume estimation capabilities, with the lowest MAE values across most tumor types. In this study, ResNet50 outperformed other FCN architectures in segmentation tasks due to several key factors.
1. Deep Architecture with Residual Connections: ResNet50’s deep architecture incorporates residual connections, which help mitigate the vanishing gradient problem in deep networks. This allows the model to learn more complex patterns and capture finer details in tumor boundaries during segmentation.
2. Balanced Depth and Complexity: The 50-layer architecture strikes a balance between depth and computational efficiency, providing sufficient capacity for feature extraction without overfitting or excessive computational demands.
3. Effective Feature Representation: ResNet50 is particularly adept at extracting hierarchical features, making it well-suited for segmentation tasks that require distinguishing between subtle differences in intensity and texture, such as those in T1w and T2w MRI sequences.
4. Optimization and Regularization: The training process, which included advanced optimization techniques like ADAM, coupled with data augmentation (rotation, flipping, intensity adjustments), helped ResNet50 generalize better to the test data.
These factors collectively contributed to ResNet50’s superior performance in FCN-based segmentation tasks in this study. These findings underline the importance of model selection and the need for tailored approaches to tackle tumor-specific challenges, particularly for heterogeneous and rare tumor types. This research provides a strong foundation for advancing AI-based segmentation in clinical applications, offering insights for further model optimization and data integration.
External validation on the BraTS benchmark dataset
External validation was conducted using the publicly available BraTS dataset (https://www.kaggle.com/datasets/awsaf49/brats20-dataset-training-validation, accessed on 09/06/2025). The dataset consisted of 24,422 MRI slices with tumor presence and 26,425 slices without tumors, used for segmentation and classification tasks. Given this large sample size, the evaluation demonstrates the models’ generalization capability and aligns with competitive benchmarks reported in the BraTS challenge literature. To enhance tumor visualization, eight RGB fusion types (Type 1–Type 8) were generated by combining T1-weighted (T1w), T2-weighted (T2w), and the average of T1w and T2w images (Fig. 8). The leftmost image in the figure illustrates the ground truth label of the edema region, which served as the reference for both CNN-based classification and FCN-based segmentation. Each RGB type applies a unique channel assignment strategy, enabling a comparative assessment of how different image combinations influence model performance and feature representation.
Table 11 compares the classification performance of eight different RGB fusion types combining T1-weighted (T1w), T2-weighted (T2w), and their average. Each fusion type was paired with a specific CNN model and optimizer. The results show that Type 7 (Avg, T1w, T2w) using VGG19 + SGDM achieved the highest classification accuracy (0.979) and Kappa (0.956), indicating the most reliable and consistent performance. Models using DarkNet19 and ResNet101 also demonstrated strong performance in selected configurations.
Table 12 presents the Dice coefficients of five FCN models evaluated across eight RGB fusion types. ResNet18 and ResNet50 consistently achieved high Dice scores, with ResNet18 scoring 0.959 on Type 8 and 0.948 on Type 3, indicating robust tumor segmentation performance. Xception performed strongly on Types 2, 3, and 4 (Dice > 0.90), while InceptionResNetV2 showed the most stable performance across all types, albeit at a moderate level. MobileNetV2 showed good results for Types 2–4, but lower performance on Types 5–8. The findings highlight that both model architecture and RGB channel composition significantly influence segmentation accuracy. The fully convolutional network (FCN) model achieved a Dice coefficient of 0.839 for the Whole Tumor (WT) region, indicating strong segmentation performance.
Figure 9 compares segmentation results from five Fully Convolutional Networks (FCNs), namely InceptionResNetV2 (FCN 1), MobileNetV2 (FCN 2), ResNet18 (FCN 3), ResNet50 (FCN 4), and Xception (FCN 5), across the eight RGB types from the BraTS2020 dataset. Each row represents a specific RGB type (Type 1 to Type 8), and each column corresponds to the output from one of the FCNs. The results indicate that ResNet50 performs well for Type 3, and both ResNet18 and ResNet50 perform well for Type 8.
Discussion
Misclassification using CNN models
Table 13 presents a detailed summary of misclassification counts across various brain tumor categories and normal cases, using six CNN architectures: VGG19, VGG16, ResNet50, ResNet101, Darknet19, and Darknet53. Among all categories, meningioma and metastasis exhibited the highest number of misclassifications, with 109 and 144 errors respectively. This suggests that these tumor types pose particular challenges for CNN-based classification, likely due to their heterogeneous appearance and overlapping features with other classes.
A total of 43 false positives were recorded for the “absence of tumor” category, highlighting the difficulty models face in confidently distinguishing non-tumorous cases, which could lead to over-diagnosis in clinical settings. Model-wise, VGG16 demonstrated notably high misclassification rates, especially for meningioma (59) and metastasis (61), which may indicate issues with model sensitivity or a tendency to overfit. In contrast, ResNet50 and ResNet101 achieved more balanced results, though they too struggled with the most challenging tumor types.
Darknet53 showed the lowest overall misclassification count (16) and particularly strong performance on metastasis classification (only 10 misclassified out of 413). Its deeper 53-layer architecture likely enables superior feature extraction, especially when trained on RGB-fused inputs derived from T1w and T2w images. This highlights its suitability for handling complex tumor classification tasks.
Misclassification rates were lower for rare tumor types such as neurofibroma, neuroganglioblastoma, and medulloblastoma, likely due to the limited number of cases, although this also limits statistical conclusions for those classes. Although some CNN architectures—particularly Darknet53—performed well, the results emphasize the need for further model optimization. Incorporating additional MRI modalities such as FLAIR, DWI, or proton density imaging may enhance model sensitivity, especially for challenging tumor subtypes. Future work may also benefit from attention-based mechanisms or ensemble learning strategies to further reduce misclassification rates.
Figure 10 illustrates representative misclassification cases for four brain tumor types—Neurofibroma, Gangliocytoma, Meningioma, and Metastasis—using T1-weighted (T1w), T2-weighted (T2w), and fused RGB modalities. Each row displays the original T1w and T2w images along with the corresponding RGB fusion image for a specific tumor type or normal case.
Each tumor exhibits unique imaging characteristics that contribute to misclassification:
- Neurofibroma: Low contrast and subtle signal differences reduce tumor conspicuity, making detection difficult across modalities.
- Gangliocytoma: Exhibits heterogeneous texture and poorly defined boundaries, especially in T2w and fused images.
- Meningioma: Often misclassified due to intensity overlap between tumor and adjacent hard/soft tissues, particularly near the skull base.
- Metastasis: Presents as small, multifocal lesions with indistinct borders, leading to confusion in both single and fused modalities.
Although RGB fusion enhances feature representation by combining modality-specific contrasts, it may also introduce visual artifacts, texture homogenization, or signal dilution, which can obscure critical diagnostic cues. These findings underscore the importance of tailored preprocessing, robust model design, and modality-aware learning strategies to improve diagnostic performance, particularly in challenging tumor types.
Table 14 presents a summary of misclassification factors for five categories—Neurofibroma, Gangliocytoma, Meningioma, Metastasis, and Absence of Tumor—across three imaging modalities: T1-weighted (T1w), T2-weighted (T2w), and Fused RGB images. T1w images commonly struggled with low contrast, poor lesion visibility, and subtle anatomical features, particularly in rare or small tumor types. T2w images offered improved soft tissue contrast and better fluid sensitivity, but often suffered from blurry boundaries, overestimation of lesion size, or misinterpretation of normal hyperintensities.
Although the RGB fusion modality aims to combine complementary features from both T1w and T2w, it sometimes resulted in texture homogenization, loss of boundary sharpness, or overemphasis on irrelevant features, which contributed to misclassification. For example, neurofibroma misclassification was attributed to subtle signals being masked by fusion blending, while gangliocytoma exhibited reduced clarity due to heterogeneous textures. Meningioma was affected by overlapping signals in skull base regions, and metastasis by poorly defined, multifocal lesions. In non-tumor cases, normal anatomical structures were sometimes mistaken for lesions due to blended textures in the fused image. These observations highlight the need for modality-aware model design, adaptive preprocessing, and enhanced feature extraction techniques that account for the inherent limitations and artifacts specific to each modality.
Table 15 presents a comparative analysis of recent studies (2022–2024) focused on brain tumor classification using deep learning (DL) techniques. The reviewed studies address different classification tasks, including multi-class differentiation among glioma, meningioma, and pituitary tumors, as well as binary classification between tumor and non-tumor cases. Sample sizes vary across studies, ranging from 3,038 to 5,712 MRI images. A variety of deep learning approaches have been employed, including transfer learning-based CNNs, vision transformer architectures, and hybrid models combining convolutional features. Reported classification accuracies are consistently high across studies. Notably, B. Vimala et al. achieved the highest reported accuracy of 0.991 using a transfer learning-based model. The current study also demonstrates competitive performance, achieving an accuracy of 0.983 using a transfer learning-based DL approach for binary classification between tumor and non-tumor cases.
The results highlight the effectiveness of deep learning—particularly transfer learning methods—in brain tumor classification. Minor differences in accuracy across studies can be attributed to factors such as dataset composition, task complexity, model architecture, and modality inclusion. This comparison reinforces the relevance of the proposed method in line with state-of-the-art research.
Segmentation for brain tumors
This study utilized non-contrast T1-weighted (T1w) and T2-weighted (T2w) MRI scans to segment brain tumors using fully convolutional network (FCN) models. However, both modalities exhibit intrinsic limitations in tumor boundary delineation. T1w images often lack sufficient contrast to clearly separate tumors from surrounding brain tissues—unless specific features like hemorrhage are present—while T2w images may struggle to differentiate tumors from peritumoral edema, as both appear similarly hyperintense. Among the evaluated models, ResNet50, ResNet18, and MobileNetV2 demonstrated strong performance, though subtle differences were observed in accuracy and error magnitude. Notably, MobileNetV2 produced the most accurate results in several cases, showing both high Dice similarity and low Mean Absolute Error (MAE). On the other hand, ResNet50 performed well but exhibited slightly higher MAE in some cases.
Tumor types such as ependymoma and oligodendroglioma consistently showed high segmentation errors across all models, likely due to their irregular shapes, diffuse margins, or limited training data. In contrast, astrocytoma and medulloblastoma yielded relatively lower errors, suggesting that some tumor types with more well-defined features are more amenable to accurate segmentation.
These findings offer valuable insights into the effectiveness of various FCN models for estimating tumor volumes—a critical factor in treatment planning, surgical guidance, and follow-up monitoring. In particular, MobileNetV2 appears to be a promising candidate for clinical deployment where precision in volumetric estimation is essential. However, the performance variability across tumor types underscores the need for tumor-specific model optimization and adaptive learning strategies.
Figure 11 presents a comparative visual analysis of segmentation accuracy and MAE for five FCN models—InceptionResNetV2, MobileNetV2, ResNet18, ResNet50, and Xception—applied to meningioma segmentation. Manual tumor boundaries (white contours) are compared with model-predicted segmentations (yellow contours). The Dice coefficient reflects the spatial overlap between these boundaries, while the MAE quantifies the difference in volumetric estimation.
The models MobileNetV2 (Dice = 0.948, MAE = 0.8), ResNet18 (Dice = 0.939, MAE = 2.6), and ResNet50 (Dice = 0.932, MAE = 3.1) achieved the most accurate and consistent results. These scores indicate high segmentation fidelity, particularly in capturing the true extent and shape of meningiomas. In contrast, InceptionResNetV2 and Xception exhibited lower performance, suggesting reduced reliability in boundary localization or volume estimation for this tumor type.
Figure 12 presents a visual comparison of segmentation results from five FCN models—InceptionResNetV2, MobileNetV2, ResNet18, ResNet50, and Xception—on a challenging axial MRI slice containing multiple small metastatic tumors. Ground truth annotations are shown in white, and model-predicted boundaries are overlaid in yellow. Among the models, ResNet18 and ResNet50 demonstrate the most accurate boundary alignment with the ground truth, effectively capturing the small lesion contours with minimal false positives. In contrast, InceptionResNetV2 and Xception tend to produce over-segmented, fragmented, or diffuse boundaries, indicating reduced localization precision. MobileNetV2 offers balanced performance, with reasonable coverage and minimal boundary distortion.
This comparative visualization underscores the challenges of segmenting small and scattered metastatic lesions and highlights significant variability in model performance. While Dice scores provide a general measure of overlap, these results emphasize the importance of incorporating boundary-specific metrics—such as Mean Boundary F1-score or Hausdorff Distance—to better assess segmentation quality, especially in clinically critical cases involving subtle or multifocal tumor presentations.
Table 16 presents a comparative analysis of recent studies on brain tumor segmentation performance using deep learning (DL) models, with Dice score serving as the primary evaluation metric. The studies vary in terms of dataset size, model architecture, and segmentation methodology. For instance, S. Rohilla and S. Jain (2023) utilized ResNet50 on a dataset of 110 samples and achieved a Dice score of 0.893. Y. Chen et al. (2023) applied an ensemble-based deep learning approach, resulting in a Dice score of 0.854 on 369 images. R. Li et al. (2023), using a general deep network, reported a lower Dice score of 0.810 with 649 samples.
Among all studies, X. Yan et al. (2023) achieved the highest Dice score of 0.951, though it was based on a relatively small dataset of 133 images, which may limit generalizability. In contrast, J. Yuan (2024) used a more advanced segmentation framework—Mask R-CNN with a channel attention (CA) module—achieving a Dice score of 0.897 across a larger dataset of 1,066 samples. In comparison, the current study, using ResNet50 on 800 samples, achieved a high Dice score of 0.937, demonstrating both strong segmentation accuracy and scalability. These results highlight the effectiveness of ResNet50-based architectures and support the model’s applicability in real-world clinical segmentation tasks.
Generalization assessment of CNN and FCN models using BraTS data
To validate model generalizability, both the trained CNN classification models and FCN segmentation models were applied to the external BraTS2020 dataset, which contains a large number of diverse brain tumor cases. For classification, eight RGB fusion types—created by combining T1w, T2w, and their average—were used as input across multiple CNN architectures. DarkNet19, ResNet101, and VGG19 models demonstrated high accuracy, with the highest recorded accuracy of 0.958 and Kappa value of 0.913 (Type 1 using DarkNet19 + ADAM). This confirms that CNNs trained on hospital datasets can still effectively distinguish tumor from non-tumor cases in external benchmark data.
For segmentation, five FCN models (InceptionResNetV2, MobileNetV2, ResNet18, ResNet50, and Xception) were evaluated using Dice coefficients across the same eight RGB fusion types. ResNet50 and ResNet18 achieved the best overall performance, with Dice scores up to 0.959 on Type 8 fusion (ResNet50) and 0.948 on Type 3 (ResNet18), highlighting their strong tumor localization capabilities. Xception also performed well on selected types, though with more variability across fusion schemes. Notably, fusion type and model architecture both significantly impacted performance.
Collectively, these results demonstrate that the proposed deep learning models maintain strong classification and segmentation performance when applied to a large, independent dataset, further supporting their robustness and clinical applicability.
Conclusions
This study presents a unified deep learning framework for brain tumor classification and segmentation using non-contrast MRI, specifically T1-weighted (T1w), T2-weighted (T2w), and their linear average fused into a three-channel RGB format. By applying CNN architectures for classification and FCN models for segmentation, the proposed approach addresses key clinical challenges—particularly in patients who cannot undergo contrast-enhanced imaging.
Through a comprehensive comparative analysis involving both internal hospital datasets and the external BraTS benchmark (as detailed in Sect. 4.3 and 5.3), the study achieved high classification accuracy (98.3%) and a Dice score of 0.937 for tumor boundary delineation. These results confirm the generalizability and robustness of the models across datasets of varying characteristics.
Limitations and future works
While the proposed deep learning (DL) framework demonstrates strong performance for brain tumor classification and segmentation using non-contrast MRI, several limitations remain that inform future directions.
First, although external validation using the BraTS dataset has been conducted in this revision, further evaluation on larger, multi-institutional datasets—particularly those encompassing diverse scanner types, protocols, and patient demographics—is needed to robustly assess generalizability and clinical applicability.
Second, this study focuses on established CNN and FCN architectures. While these models provide a stable and interpretable foundation, emerging approaches such as Vision Transformers, UNet++, attention-based networks, and 3D UNet architectures offer improved feature learning and spatial modeling capabilities. These modern architectures were not included in the current study to maintain architectural consistency, but will be explored in future work to enhance performance and robustness.
Third, the current model outputs are primarily quantitative (e.g., class labels, segmentation masks), with limited direct integration into clinical workflows. To bridge this gap, future versions of the framework will incorporate confidence maps, gradient-weighted class activation maps (Grad-CAM), and other explainable AI techniques to assist clinicians in understanding the rationale behind predictions. These visual tools will allow experts to assess the spatial relevance and reliability of DL decisions. Moreover, we plan to collect expert radiologist feedback to refine model outputs and user interfaces, ensuring practical usability in clinical decision support systems (CDSS).
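As an illustration of the planned Grad-CAM visualizations, the following minimal PyTorch sketch derives a class-discriminative heat map from the last convolutional block of a stand-in ResNet-18; the model, hooks, and random input are assumptions for demonstration and do not correspond to the trained models in this study.

```python
import torch
import torch.nn.functional as F
from torchvision import models

model = models.resnet18(weights=None)   # stand-in classifier, not the study's model
model.eval()

features, grads = {}, {}

def fwd_hook(_, __, output):            # capture activations of the last conv block
    features["act"] = output

def bwd_hook(_, grad_in, grad_out):     # capture gradients flowing back into it
    grads["grad"] = grad_out[0]

model.layer4.register_forward_hook(fwd_hook)
model.layer4.register_full_backward_hook(bwd_hook)

x = torch.randn(1, 3, 224, 224)         # a fused RGB slice would go here
logits = model(x)
logits[0, logits.argmax()].backward()   # gradient of the predicted class score

weights = grads["grad"].mean(dim=(2, 3), keepdim=True)    # global-average-pooled gradients
cam = F.relu((weights * features["act"]).sum(dim=1, keepdim=True))
cam = F.interpolate(cam, size=x.shape[2:], mode="bilinear", align_corners=False)
cam = (cam - cam.min()) / (cam.max() - cam.min() + 1e-7)  # normalized [0, 1] heat map
print(cam.shape)  # torch.Size([1, 1, 224, 224])
```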
Fourth, while the study addresses class imbalance using techniques such as SMOTE, future work will evaluate the use of cost-sensitive learning and advanced sampling strategies to minimize synthetic noise and enhance learning in rare tumor classes.
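The sketch below contrasts SMOTE-style oversampling with a simple cost-sensitive weighting scheme on synthetic data; the imbalanced-learn and scikit-learn calls, the feature matrix, and the class counts are illustrative assumptions rather than the study’s actual training pipeline.

```python
import numpy as np
from sklearn.utils.class_weight import compute_class_weight
from imblearn.over_sampling import SMOTE

# Hypothetical feature matrix and imbalanced labels (class 2 stands in for a rare tumor type)
X = np.random.rand(100, 16)
y = np.array([0] * 80 + [1] * 15 + [2] * 5)

# SMOTE-style oversampling (parameters illustrative)
X_res, y_res = SMOTE(k_neighbors=3, random_state=0).fit_resample(X, y)
print(np.bincount(y_res))  # classes balanced by synthetic samples

# Cost-sensitive alternative: weight the loss inversely to class frequency
weights = compute_class_weight(class_weight="balanced", classes=np.unique(y), y=y)
print(dict(zip(np.unique(y), np.round(weights, 2))))  # rare classes receive larger weights
```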
Planned future enhancements include:
- Fusion Strategy Analysis: Comparison of performance across single-modality input, RGB-based fusion, and full multi-modality fusion.
- Hyperparameter Optimization: A comprehensive ablation study to assess the effects of learning rate, batch size, resolution, and regularization techniques.
- Interpretability Tools: Deployment of visualization tools (e.g., CAMs, saliency maps) for enhanced model transparency and radiologist trust.
- Ethical AI Practices: Implementation of federated learning to protect patient privacy and bias auditing across demographic subgroups to ensure fairness.
- Clinical Integration: Design of an interactive dashboard or viewer interface that integrates model outputs, confidence scores, and visual cues to support real-time radiological decision-making.
In summary, these directions aim to build a more accurate, explainable, and ethically sound AI system that supports clinician interpretation, enhances diagnostic workflows, and facilitates safe deployment in real-world neuroimaging applications.
Data availability
The data presented in this study are available upon request from the corresponding author. The data are not publicly available due to restrictions, e.g., privacy and ethical concerns.
References
Rashidi, G. et al. The potential of federated learning for self-configuring medical object detection in heterogeneous data distributions. Sci. Rep. 14 (1), 23844. https://doi.org/10.1038/s41598-024-74577-0 (2024).
Mu, J. et al. Explainable federated medical image analysis through causal learning and blockchain. IEEE J. Biomed. Health Inform. 28(6), 3206–3218. https://doi.org/10.1109/JBHI.2024.3375894 (2024).
Manthe, M., Duffner, S. & Lartizien, C. Federated brain tumor segmentation: an extensive benchmark. Med. Image Anal. 97, 103270. https://doi.org/10.1016/j.media.2024.103270 (2024).
Ahamed, M. F. et al. A review on brain tumor segmentation based on deep learning methods with federated learning techniques. Comput. Med. Imaging Graph. 110, 102313. https://doi.org/10.1016/j.compmedimag.2023.102313 (2023).
Soomro, T. A. et al. Image segmentation for MR brain tumor detection using machine learning: A review. IEEE Rev. Biomed. Eng. 16, 70–90. https://doi.org/10.1109/RBME.2022.3185292 (2023).
Sahoo, D. K., Mishra, S., Mohanty, M. N., Behera, R. K. & Dhar, S. K. Brain tumor detection using DL approach. Neurol. India 71(4), 647–654. https://doi.org/10.4103/0028-3886.383858 (2023).
Vimala, B. et al. Detection and classification of brain tumor using hybrid deep learning models. Sci. Rep. 13 (1), 23029. https://doi.org/10.1038/s41598-023-50505-6 (2023).
Robinet, L., Siegfried, A., Roques, M., Berjaoui, A. & Cohen-Jonathan Moyal, E. MRI-based DL tools for MGMT promoter methylation detection: A thorough evaluation. Cancers (Basel) 15(8), 2253. https://doi.org/10.3390/cancers15082253 (2023).
Rohilla, S. & Jain, S. Detection of brain tumor employing residual network-based optimized deep learning. Curr. Comput. Aided Drug Des. https://doi.org/10.2174/1573409920666230816090626 (2023).
Roy, S., Bhattacharyya, D., Bandyopadhyay, S. K. & Das, A. K. An improved brain MR image binarization method as a preprocessing for abnormality detection and features extraction. Front. Comput. Sci. 11 (4), 717–727. https://doi.org/10.1007/s11704-016-5129-y (2017).
Asiri, A. A. et al. Exploring the power of DL: Fine-tuned vision transformer for accurate and efficient brain tumor detection in MRI scans. Diagnostics (Basel) 13(12), 2094. https://doi.org/10.3390/diagnostics13122094 (2023).
Ullah, N. et al. TumorDetNet: A unified DL model for brain tumor detection and classification. PLoS One 18(9), e0291200. https://doi.org/10.1371/journal.pone.0291200 (2023).
Mahajan, A. et al. DL based clinico-radiological model for paediatric brain tumor detection and subtype prediction. Explor. Target. Antitumor Ther. 4(4), 669–684. https://doi.org/10.37349/etat.2023.00159 (2023).
Roy, S., Bhattacharyya, D., Bandyopadhyay, S. K. & Kim, T. H. An effective method for computerized prediction and segmentation of multiple sclerosis lesions in brain MRI. Comput. Methods Programs Biomed. 140, 307–320. https://doi.org/10.1016/j.cmpb.2017.01.003 (2017).
Roy, S. & Shoghi, K. I. Computer-aided tumor segmentation from T2-weighted MR images of patient-derived tumor xenografts. In Image Analysis and Recognition. ICIAR 2019, (eds Karray, F., Campilho, A. & Yu, A.) Lecture Notes in Computer Science, vol. 11663, Cham: Springer, 125–132. https://doi.org/10.1007/978-3-030-27272-2_14 (2019).
Pal, D., Meena, T., Mahapatra, D. & Roy, S. SAC UW-Net: A self-attention-based network for multimodal medical image segmentation. In Proc. 2024 IEEE Int. Symp. Biomed. Imaging (ISBI), Athens, Greece, pp. 1–5. https://doi.org/10.1109/ISBI56570.2024.10635611 (2024).
Samee, N. A. et al. Classification framework for medical diagnosis of brain tumor with an effective hybrid transfer learning model. Diagnostics (Basel) 12(10), 2541. https://doi.org/10.3390/diagnostics12102541 (2022).
Li, R. et al. MRI-based two-stage DL model for automatic detection and segmentation of brain metastases. Eur. Radiol. 33 (5), 3521–3531. https://doi.org/10.1007/s00330-023-09420-7 (2023).
Saeedi, S., Rezayi, S., Keshavarz, H. & Niakan Kalhori, S. R. MRI-based brain tumor detection using convolutional DL methods and chosen machine learning techniques. BMC Med. Inform. Decis. Mak. 23(1), 16. https://doi.org/10.1186/s12911-023-02114-6 (2023).
Wang, X. et al. Unsupervised contrastive graph learning for resting-state functional MRI analysis and brain disorder detection. Hum. Brain Mapp. 44(17), 5672–5692. https://doi.org/10.1002/hbm.26469 (2023).
Hossain, R., Ibrahim, R. B. & Hashim, H. B. Automated brain tumor detection using machine learning: A bibliometric review. World Neurosurg. 175, 57–68. https://doi.org/10.1016/j.wneu.2023.03.115 (2023).
Fairchild, A. T. et al. A DL-based computer-aided detection (CAD) system for difficult-to-detect brain metastases. Int. J. Radiat. Oncol. Biol. Phys. 115(3), 779–793. https://doi.org/10.1016/j.ijrobp.2022.09.068 (2023).
Hossain, S., Chakrabarty, A., Gadekallu, T. R., Alazab, M. & Piran, M. J. Vision transformers, ensemble model, and transfer learning leveraging explainable AI for brain tumor detection and classification. IEEE J. Biomed. Health Inform. 28(3), 1261–1272. https://doi.org/10.1109/JBHI.2023.3266614 (2024).
Al-Zoghby, A. M., Al-Awadly, E. M. K., Moawad, A., Yehia, N. & Ebada, A. I. Dual deep CNN for tumor brain classification. Diagnostics (Basel) 13(12), 2050. https://doi.org/10.3390/diagnostics13122050 (2023).
Anand, V. et al. Weighted average ensemble DL model for stratification of brain tumor in MRI images. Diagnostics (Basel) 13(7), 1320. https://doi.org/10.3390/diagnostics13071320 (2023).
Jiao, T. et al. DL with an attention mechanism for differentiating the origin of brain metastasis using MR images. J. Magn. Reson. Imaging 58(5), 1624–1635. https://doi.org/10.1002/jmri.28695 (2023).
Khan, F. et al. MRI-based effective ensemble frameworks for predicting human brain tumor. J. Imaging. 9, 163. https://doi.org/10.3390/jimaging9080163 (2023).
Mohammad, F., Al Ahmadi, S. & Al Muhtadi, J. Blockchain-based deep CNN for brain tumor prediction using MRI scans. Diagnostics (Basel) 13(7), Art. no. 1229. https://doi.org/10.3390/diagnostics13071229 (2023).
Al-Azzwi, Z. H. N. & Nazarov, A. N. Brain tumor classification based on improved stacked ensemble DL methods. Asian Pac. J. Cancer Prev. 24(6), 2141–2148. https://doi.org/10.31557/APJCP.2023.24.6.2141 (2023).
Roy, S. et al. Forward attention-based deep network for classification of breast histopathology image. Multimed Tools Appl. 83 (62), 88039–88068. https://doi.org/10.1007/s11042-024-18947-w (2024).
Kabiraj, A., Meena, T., Tadepalli, K. & Roy, S. An explainable weakly supervised model for multi-disease detection and localization from thoracic X-rays. Appl. Soft Comput. 166, 112139. https://doi.org/10.1016/j.asoc.2024.112139 (2024).
Kumar, S. et al. Brain tumor classification using deep neural network and transfer learning. Brain Topogr. 36(3), 305–318. https://doi.org/10.1007/s10548-023-00953-0 (2023).
van der Voort, S. R. et al. Combined molecular subtyping, grading, and segmentation of glioma using multi-task DL. Neuro Oncol. 25 (2), 279–289. https://doi.org/10.1093/neuonc/noac166 (2023).
Emam, M. M., Samee, N. A., Jamjoom, M. M. & Houssein, E. H. Optimized DL architecture for brain tumor classification using improved hunger games search algorithm. Comput. Biol. Med. 160, Art. no. 106966. https://doi.org/10.1016/j.compbiomed.2023.106966 (2023).
Chen, Y. et al. A radiomics-incorporated deep ensemble learning model for multi-parametric MRI-based glioma segmentation. Phys. Med. Biol. 68(18). https://doi.org/10.1088/1361-6560/acf10d (2023).
Raju, S. & Veera, V. R. P. Classification of brain tumors from MRI images using DL-enabled hybrid optimization algorithm. Network 34(4), 408–437. https://doi.org/10.1080/0954898X.2023.2275045 (2023).
Ahamed, M. F. et al. A review on brain tumor segmentation based on DL methods with federated learning techniques. Comput. Med. Imaging Graph. 110, Art. no. 102313. https://doi.org/10.1016/j.compmedimag.2023.102313 (2023).
Tandel, G. S. et al. Role of ensemble DL for brain tumor classification in multiple magnetic resonance imaging sequence data. Diagnostics (Basel) 13(3), Art. no. 481. https://doi.org/10.3390/diagnostics13030481 (2023).
Tajima, T. et al. Usefulness of DL-based noise reduction for 1.5 T MRI brain images. Clin. Radiol. 78(1), e13–e21. https://doi.org/10.1016/j.crad.2022.08.127 (2023).
Yan, X. et al. Deep-learning-based automatic segmentation and classification for craniopharyngiomas. Front. Oncol. 13, Art. no. 1048841. https://doi.org/10.3389/fonc.2023.1048841 (2023).
Wang, W. et al. Neuropathologist-level integrated classification of adult-type diffuse gliomas using DL from whole-slide pathological images. Nat. Commun. 14, Art. no. 6359. https://doi.org/10.1038/s41467-023-41195-9 (2023).
Jalalifar, S. A., Soliman, H., Sahgal, A. & Sadeghi-Naini, A. A self-attention-guided 3D deep residual network with big transfer to predict local failure in brain metastasis after radiotherapy using multi-channel MRI. IEEE J. Transl. Eng. Health Med. 11, 13–22. https://doi.org/10.1109/JTEHM.2022.3219625 (2022).
Roy, S., Pal, D. & Meena, T. Explainable artificial intelligence to increase transparency for revolutionizing healthcare ecosystem and the road ahead. Netw. Model. Anal. Health Inf. Bioinform. 13(1), 4. https://doi.org/10.1007/s13721-023-00437-y (2024).
Näslund, O. et al. Incidence, management, and outcome of incidental meningioma: what has happened in 10 years? J. Neurooncol. 165(2), 291–299. https://doi.org/10.1007/s11060-023-04482-5 (2023).
Jiang, K. et al. Epidemiology and survival outcomes of synchronous and metachronous brain metastases: a retrospective population-based study. Neurosurg. Focus. 55(2). https://doi.org/10.3171/2023.5.FOCUS23212 (2023).
Achrol, A. S. et al. Brain metastases. Nat. Rev. Dis. Primers. 5, 5. https://doi.org/10.1038/s41572-018-0055-y (2019).
Gangopadhyay, T., Meena, T., Pal, D. & Roy, S. A lightweight self-attention based multi-task deep learning model for industrial solar panel and environmental monitoring. IEEE Trans. Emerg. Top. Comput. Intell. 9(2), 1502–1513. https://doi.org/10.1109/TETCI.2024.3444590 (2025).
Yuan, J. Brain tumor image segmentation method using hybrid attention module and improved mask RCNN. Sci. Rep. 14(1), 20615. https://doi.org/10.1038/s41598-024-71250-4 (2024).
Acknowledgements
This research was partially supported by grants from the EDA Hospital and National Science and Technology Council, Taiwan (EDCHP113010 and NSTC 113-2221-E-214-007).
Funding
The authors would like to express their gratitude to EDA Hospital and the National Science and Technology Council in Taiwan for their partial financial support under contract No. EDCHP113010 and NSTC 113-2221-E-214-007.
Author information
Contributions
Conceptualization, N.-H.L. and T.-B.C.; methodology, N.-H.L. and T.-B.C.; software, T.-B.C. and K.-Y.L.; validation, Y.-H.H.; formal analysis, T.-B.C. and K.-Y.L.; investigation, N.-H.L.; resources, N.-H.L.; data curation, Y.-H.H.; writing—original draft preparation, N.-H.L.; writing—review and editing, Y.-H.H., N.-H.L. and T.-B.C.; visualization, T.-B.C.; supervision, T.-B.C.; project administration, N.-H.L.; funding acquisition, N.-H.L. and Y.-H.H.
Ethics declarations
Competing interests
The authors declare no competing interests.
Ethics approval and consent to participate & consent for publication
The study was conducted in accordance with the Declaration of Helsinki and approved by the Institutional Review Board of E-DA Hospital, Kaohsiung, Taiwan (Approval Number: EMRP-110-084, Date of Approval: 5 August 2021). Verbal and written information detailing all experimental procedures was provided to all participants, and written informed consent was obtained prior to their participation and the collection of experimental data.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Lu, NH., Huang, YH., Liu, KY. et al. Deep learning-driven brain tumor classification and segmentation using non-contrast MRI. Sci Rep 15, 27831 (2025). https://doi.org/10.1038/s41598-025-13591-2