Introduction

Alzheimer’s disease (AD) is a common form of dementia that causes memory loss and a general decline in cognitive function over time due to the death of brain cells. At first, the disease manifests itself only through simple forgetfulness, but over time it begins to present in a more pronounced manner. In the advanced stages of the illness, meeting the patient’s basic needs and providing the necessary care can become an increasingly challenging and complex task.

Early detection of Alzheimer’s disease is valuable for preventing rapid progression of the disease. Diagnosing the disease at an early stage allows the patient to carry out activities of daily living for longer. Diverse imaging modalities play a crucial role in the diagnostic process.

The disease is studied in different stages depending on its progression1. These stages generally range from healthy individuals to early mild cognitive impairment to advanced Alzheimer’s disease. In the Early Mild Cognitive Impairment (EMCI) stage, individuals may experience mild memory problems or other cognitive difficulties. In the Mild Cognitive Impairment (MCI) stage, they may experience cognitive difficulties such as forgetfulness or distraction. In the Late Mild Cognitive Impairment (LMCI) stage, they experience more severe memory problems, such as forgetting important events or information. They have more pronounced difficulties with activities of daily living. In the Alzheimer’s stage, there are severe cognitive problems such as memory loss, impaired decision-making, language and communication problems. Individuals become unable to carry out activities of daily living and may require full-time care. Radiological approaches used to diagnose AD include magnetic resonance imaging (MRI), computed tomography (CT), positron emission tomography (PET), functional MRI (fMRI), and single photon emission computed tomography (SPECT). Within the realm of MRI, various imaging techniques such as T1-weighted, T2-weighted, and proton-weighted images are utilized2.

Gaps in the existing literature had a major impact on the design of this research. Studies show that advanced-stage AD is relatively easy to detect, while mild-stage AD is more difficult to detect3. At the same time, there is a lack of studies in the literature on how to select slices for the detection of Alzheimer’s disease in MR images. Methodologically, when determining the reference image for MRI slice selection, the MRI slices of each patient were analyzed and the slice with the highest number of edge segments was selected. Considering the techniques and methods used in this study, our motivation was the idea that the Feature Pyramid Network (FPN) structure integrated into the proposed model can improve diagnostic accuracy by extracting details more precisely. Integrating the methods and techniques used with an innovative approach is thought to fill an important gap in the literature. Another main motivation for our work is to develop a model that can be used in clinical applications. Deep learning models such as Vision Transformers (ViT) and EfficientNet have been shown in the literature to help clinicians make earlier and more accurate decisions by producing precise and reliable predictions from MRI images. This is thought to contribute directly to patients’ quality of life by facilitating early diagnosis in clinical practice. Based on this aim, the main objectives of the study are as follows:

  • To select only the necessary and meaningful slices instead of analyzing all slices, thereby improving data quality and leading to more accurate analysis and results.

  • To ensure that the disease is detected at the EMCI stage so that necessary precautions can be taken before progression to AD.

  • To innovate on existing deep learning models and provide a new model by integrating them with the Vision Transformer structure.

The article is structured as follows: We briefly introduce Alzheimer’s disease. Then, a literature review on AD is presented. The dataset is described, followed by the methodology. In the methodology section, definitions of the models and evaluation metrics are given. Finally, the results and discussion, along with future work, are presented.

Related work

Numerous studies on the detection of Alzheimer’s disease are available in the literature. The following is a summary of these studies.

A recent study4 ran deep learning models for the diagnosis of Alzheimer’s disease with pre-trained networks and transfer learning using the ADNI dataset. The results were obtained by dividing the dataset into training and test sets, and data augmentation was performed by rotating the images in the dataset. The study ran VGG-19, ResNet-50, and InceptionV3 models, yielding average accuracies of 97.54%, 97.16%, and 98.70%, respectively.

A study5 proposed a model called Aux-ViT as an image transformer network architecture and addressed some shallow-feature problems with this proposal. Specifically, they added auxiliary multilayer perceptrons and chose ViT as the base network to reduce prediction errors. They utilized the ADNI-3 dataset and a random synthetic mask based on pixel-weighted fusion for data augmentation. They used T1-weighted, two-class data, split the training and test sets in an 8:2 ratio, and reserved 20% of the training set for validation. They proposed online randomized synthetic mask augmentation and multi-information fusion enhancement to improve the MRI data. Compared to the baseline ViT model, the Aux-ViT model achieved an accuracy of 89.58%. Their study presented a practical approach for early diagnosis of Alzheimer’s disease using MRI data.

A study6 investigated and evaluated the application of different CNN and transformer models to early detection of Alzheimer’s disease. They also presented a multimodal method for Alzheimer’s disease detection based on MRI and PET modalities, combining EfficientNetV2 with a novel data augmentation scheme and an enhanced image transformer based on self-attention generative adversarial networks (SAGAN). They validated the proposed method using the Alzheimer’s Disease Neuroimaging Initiative (ADNI) and the Open Access Series of Imaging Studies (OASIS). The proposed method achieved 96% accuracy by combining the key advantages of the image transformer and EfficientNetV2.

Another study7 proposed TriFormer, a new transformer-based framework for classification using the ADNI-1 and ADNI-2 datasets. They divided the dataset into 80% training and 20% testing and obtained the results with 50 epochs. They extracted multi-view image features from MRI using ViT. The results were obtained with a modality fusion transformer that combines the extracted multimodal features to perform more accurate predictions by combining image slices with a clinical class marker. They obtained an accuracy of 77.31% for the ADNI-1 dataset and 84.10% for the ADNI-2 dataset.

In a research8, conducted a classification analysis of T1-weighted MRI images utilizing the ADNI dataset. They proposed a new model which is a hybrid three-dimensional CNN and transformer design. In addition to the ADNI dataset, they also tested the same model on OASIS and AIBL datasets. They compared this model with eight basic algorithms. The dataset was partitioned such that 80% was allocated for training purposes, while the remaining 20% was designated for testing. Their proposed LongFormer model achieved 93.43% accuracy on the ADNI dataset.

In a research9, conducted a study to develop a new model for computer-aided diagnosis (CAD). In their study, they performed data alignment and merging using the ADNI dataset. They applied a method called AliFuse for aligning and merging data from different modalities. This model aims to integrate information from different modalities by processing data from different modalities. The data sets were partitioned into three segments: 70% for training, 20% for testing, and 10% for validation purposes. The proposed model achieved an average accuracy of 87.93% for the three classes.

In a different study10, a classification task was conducted using the ADNI dataset. They augmented the dataset with rotation operations, used 1.5 T and 3 T three-dimensional images, and obtained the results through 5-fold cross validation. A hybrid (ensemble) model combining ViT and CNN was utilized, yielding 89.46% accuracy for CN vs. AD, 78.60% for MCI vs. AD, and 78.86% for CN vs. MCI.

A study11 used the ADNI-3 dataset for the classification of Alzheimer’s disease and divided it into 70% training and 30% test sets. The dataset contains three classes. They obtained an accuracy of 98.94% for CN vs. AD, 97.95% for MCI vs. AD, and 98.42% for CN vs. MCI classification with the EfficientNetB0 model.

Table 1 Summary of literature Studies.

An outline of the related studies is given in Table 1. To summarize briefly, the studies reviewed here achieved their results by using deep learning methods and Vision Transformer structures. In the subsequent sections, we introduce advanced deep learning models that utilize a more contemporary ADNI dataset, alongside our novel model and the methodology employed for slice selection.

Materials & methods

In this study, images including axial brain slices and T1-weighted structural MRI data were used to diagnose Alzheimer’s disease. EfficientNetB2, InceptionV3, RegNetx006 and the proposed new models were employed on 224 × 224 rescaled MRI images for early-stage diagnosis of Alzheimer’s disease. While preparing the dataset, 10-fold cross validation was utilized. K-fold cross validation involves partitioning the data into K different subsets. At each iteration, one subset is used for testing, while the remaining K-1 subsets are used for training. The validity of the model is determined by the average accuracy obtained over these K iterations12. The hyperparameters used in the model are given in Table 2.

Table 2 Properties of the hyperparameters used in the model.
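As a minimal illustration of this cross-validation procedure, the sketch below shows a stratified 10-fold loop with scikit-learn; the feature matrix, labels, and the dummy classifier are placeholders standing in for the actual MRI slices and the deep learning models.

```python
import numpy as np
from sklearn.dummy import DummyClassifier
from sklearn.model_selection import StratifiedKFold

rng = np.random.default_rng(0)
X = rng.random((200, 64))                 # placeholder features standing in for slices
y = np.repeat(np.arange(5), 40)           # placeholder labels: 40 samples per class

skf = StratifiedKFold(n_splits=10, shuffle=True, random_state=42)
fold_accuracies = []
for train_idx, test_idx in skf.split(X, y):
    # Each fold keeps the class ratios of the full dataset, which is how the
    # class imbalance mentioned in the text is handled during splitting.
    clf = DummyClassifier(strategy="most_frequent")   # stand-in for the CNN/ViT models
    clf.fit(X[train_idx], y[train_idx])
    fold_accuracies.append(clf.score(X[test_idx], y[test_idx]))

# The reported validity of a model is the mean accuracy over the 10 folds.
print(f"Mean accuracy over 10 folds: {np.mean(fold_accuracies):.4f}")
```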

Deep learning models were applied for early detection of Alzheimer’s disease on a total of 24,661 neuroimaging (MRI) scans in the dataset. To facilitate the processing of DICOM image files in the preprocessing stage, the images with the .dcm extension were converted into PNG format. Google Colab was used as the computing infrastructure. The training of the models was conducted within the Google Colab environment, utilizing TensorFlow, Python version 3.7, and the Scikit-learn library. An NVIDIA A100 graphics processing unit (GPU) was used in the experiments, and the results were generated through the Google Colab platform. The A100 GPU offers a choice of 40 GB or 80 GB HBM2e (High Bandwidth Memory) capacity; results were obtained using the 80 GB configuration. This large memory is sufficient for processing large AI models or datasets. The A100 GPU contains a total of 6,912 Compute Unified Device Architecture (CUDA) cores, which accelerate graphics processing as well as general-purpose computing and provide high performance for parallel processing. Following the conversion of the images to PNG format, they were matched with the corresponding class labels present in the CSV file. After the labeling process was completed, the selected models were run. Stratified 10-fold cross validation was used to separate the training and test sets and to address the imbalance that arises when the ratio of data between classes is not equal. Figure 1 shows the structure of a general CNN architecture.

Fig. 1
figure 1

The structure of a general CNN architecture.

Dataset

The ADNI-3 dataset was obtained from the ADNI database (available at http://adni.loni.usc.edu). Launched in 2016, the ADNI-3 study aims to describe in detail the associations between genetic, clinical, cognitive, imaging and biochemical biomarkers across the spectrum of Alzheimer’s disease. ADNI-3 also includes scans that detect tau protein tangles (tau PET), a key indicator of the disease13. We used the ADNI-3 dataset for the analyses because it contains the most up-to-date data.

In this study, we used MRI images from the ADNI-3 dataset of 627 individuals, including 50 AD, 37 EMCI, 160 MCI, 15 LMCI and 365 CN patients. The dataset includes 341 female and 286 male subjects. Each individual has 54 slices of MRI images, and there are also additional MRI images of patients taken in subsequent years. The dataset contains the last updated data added in April 2024. The Python programming language was used to perform the analyses.

AD refers to individuals with Alzheimer’s disease, CN refers to healthy individuals, and MCI refers to individuals with mild cognitive impairment. EMCI refers to an earlier stage of cognitive impairment, while LMCI refers to a more advanced stage of cognitive impairment. EMCI is usually characterized by less prominent symptoms and therefore early diagnosis can be more difficult. LMCI is recognized as a stage with a higher risk of developing into Alzheimer’s disease. Table 3 presents the classes, the number of subjects and the total number of scans per class.

Table 3 Individuals and total number of scans.

The images in the dataset have the .dcm file extension, and portable network graphics (PNG) images were obtained by converting these DICOM images. During the data preprocessing phase, each MRI image fed into our convolutional neural network (CNN) model was resized to 224 × 224 pixels, in accordance with the requirements of the model architectures that necessitate input images of this specific size.
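A minimal sketch of this preprocessing step is given below, assuming pydicom and Pillow are available; the file paths and the min-max intensity normalization are illustrative choices rather than the exact pipeline used in the study.

```python
import numpy as np
import pydicom
from PIL import Image

def dicom_to_png(dcm_path: str, png_path: str, size: int = 224) -> None:
    """Read one .dcm slice, rescale intensities to 8 bits, resize, save as PNG."""
    ds = pydicom.dcmread(dcm_path)
    pixels = ds.pixel_array.astype(np.float32)
    pixels -= pixels.min()                      # min-max normalisation to 0-255
    if pixels.max() > 0:
        pixels = pixels / pixels.max() * 255.0
    img = Image.fromarray(pixels.astype(np.uint8))
    img = img.resize((size, size), Image.BILINEAR)
    img.save(png_path)

# Hypothetical paths for illustration only:
# dicom_to_png("ADNI/subject_0001/slice_030.dcm", "png/subject_0001_slice_030.png")
```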

Structural similarity index measure (SSIM)

It is a metric used to measure the similarity between two images. SSIM is used to evaluate the quality of images by trying to mimic human visual perception. It is widely used in image compression, noise removal, image reconstruction and image classification. SSIM takes into account three main components: the average brightness values, brightness variation and structural similarity of the two images. The SSIM value varies between − 1 and 1. 1 indicates that the two images are exactly the same, while 0 indicates that they are completely different. When SSIM is −1, this means that the two images have completely opposite structural properties. Since SSIM is a purely mathematical calculation, the results are always consistent and not affected by subjective judgments. This eliminates the possibility of different people making different judgments on the same image. The manual selection process can be time-consuming and tedious when working with large datasets. Using SSIM, similarity can be measured automatically, speeding up the process. SSIM is able to accurately detect even small structural differences between images. This ensures that the small details that need to be analyzed in MRI images are not missed. Furthermore, SSIM’s proximity to human visual perception helps to achieve more meaningful and accurate results in the evaluation of MRI images14.

In the present study, we used SSIM to select the appropriate slices for Alzheimer’s disease classification. While obtaining the results, 5 slices were manually selected for comparison (slices 25, 26, 27, 28 and 29). Since a previous study15 emphasized the importance of the middle slices of MRI volumes for Alzheimer’s disease, the middle slices of the 54-slice MRI volumes were preferred for manual selection. Given a reference image, to verify that SSIM selection works correctly, we obtained SSIM scores for both informative (closer to the center slice) and non-informative (far from the center slice) MRI images, and observed consistent results. For slice selection based on SSIM, the first step was to determine a reference image for each class. To this end, the MRI slices of each patient were analyzed and the slice with the largest number of edge segments was selected as the subject-level reference image. Then, among the subject-level reference images of all patients in the same class, the image with the largest number of edge segments was selected as the class-level reference image. To clarify how the edge segments were identified, we utilized a Canny edge detector rather than simple size-based criteria; the Canny method was chosen for its robustness in detecting meaningful anatomical boundaries while suppressing noise-related artifacts. This procedure allowed us to objectively identify the slices with the richest anatomical information, which is critical for accurate SSIM-based similarity scoring and reliable slice selection. For each slice, a binary edge map E ∈ {0, 1}^{H × W} was generated, where E(i, j) = 1 indicates the presence of an edge at pixel location (i, j). The edge density score of a slice is calculated as given in Eq. (1):

$$S = \sum_{i=1}^{H} \sum_{j=1}^{W} E(i, j)$$
(1)

where S represents the total number of edge pixels in the slice. The slice with the maximum edge pixel count was selected as the subject-level reference image, and the class-level reference image was then selected as the one with the highest edge pixel count among all subject-level reference slices.
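The following sketch illustrates the edge-density criterion of Eq. (1) with scikit-image’s Canny implementation; the random volume and the default Canny parameters are placeholder assumptions.

```python
import numpy as np
from skimage.feature import canny

rng = np.random.default_rng(0)
volume = rng.random((54, 224, 224))      # placeholder for one subject's 54 axial slices

def edge_density(slice_2d: np.ndarray) -> int:
    """Edge density score S of Eq. (1): total number of edge pixels in the Canny map."""
    edge_map = canny(slice_2d)           # binary H x W edge map E
    return int(edge_map.sum())

scores = [edge_density(s) for s in volume]
subject_reference_idx = int(np.argmax(scores))
print("Subject-level reference slice index:", subject_reference_idx)
# Repeating this over all subjects of a class and keeping the slice with the
# maximum edge count yields the class-level reference image.
```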

The Structural Similarity Index (SSIM) is a perceptual metric that quantifies the similarity between two images. Unlike traditional metrics like Mean Squared Error (MSE), which focus only on pixel differences, SSIM evaluates the structural information in an image by considering three main components16:

  • Luminance (L) – Measures the difference in brightness between the two images.

  • Contrast (C) – Evaluates the difference in contrast between the images.

  • Structure (S) – Analyzes the correlation of pixel patterns between the images.

The SSIM formula is given by Eq. (2):

$$SSIM(x, y) = \frac{(2\mu_x \mu_y + C_1)(2\sigma_{xy} + C_2)}{(\mu_x^2 + \mu_y^2 + C_1)(\sigma_x^2 + \sigma_y^2 + C_2)}$$
(2)

where x and y are the two images being compared. The terms \(\mu_x\) and \(\mu_y\) represent the mean intensity values of images x and y, respectively. The variances \(\sigma_x^2\) and \(\sigma_y^2\) quantify their contrast, and \(\sigma_{xy}\) represents the covariance between the two images, capturing their structural similarity. To prevent division by zero and stabilize the computation, small constants \(C_1\) and \(C_2\) are introduced in the formula.

As mentioned above, Fig. 2 illustrates the SSIM-based slice selection process. In summary, the slice with the highest number of edge segments among the 54 MRI slices was selected as the patient-level reference image for each patient. Then, the reference image with the most edge segments across all patients in the same class was chosen as the class-level reference image. SSIM was then used to compare each slice with the reference image, allowing the identification of slices most structurally similar to the reference. This ensured that the selected slices were informative and representative, focusing on anatomical consistency relevant to Alzheimer’s disease.

Fig. 2
figure 2

Reference image selection with SSIM.

In this selection process, we again prioritized the images with the largest number of edge segments to choose the final reference image. The reference slices selected with SSIM for each class were as follows: slice #30 for the CN and EMCI classes, #31 for MCI, and #33 for the LMCI and AD classes, as illustrated in Fig. 3. These reference images selected for each class were used to select 5 images per subject with SSIM. This SSIM-based selection is expected to improve data quality and lead to more accurate analysis and results.

Fig. 3
figure 3

Selected reference images for AD, EMCI, MCI, LMCI and CN classes, respectively.
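A minimal sketch of the SSIM-based selection step is shown below, using scikit-image’s structural_similarity; the arrays stand in for a subject’s 54 slices and the class-level reference image, and the top-5 ranking mirrors the selection described above.

```python
import numpy as np
from skimage.metrics import structural_similarity as ssim

rng = np.random.default_rng(1)
volume = rng.random((54, 224, 224))      # placeholder subject volume (54 slices)
reference = rng.random((224, 224))       # placeholder class-level reference image

# Score every slice against the reference and keep the five most similar ones.
scores = np.array([ssim(s, reference, data_range=1.0) for s in volume])
top5_idx = np.argsort(scores)[-5:][::-1]
selected_slices = volume[top5_idx]
print("Selected slice indices:", top5_idx.tolist())
```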

Performance metrics

Different metrics are used to evaluate model performance. The accuracy metric shows the proportion of correctly classified samples and gives an overview of the prediction quality. Precision measures how many of the instances that the model classifies as positive are actually positive. Recall measures the proportion of actual positive cases that the model classifies as positive. The F1-score measures the performance of the model in terms of both precision and recall, balancing these two metrics. For this reason, it is often the metric of choice for performance evaluations17. Area Under the Curve (AUC) usually refers to the area under the ROC (Receiver Operating Characteristic) curve and is a metric used to evaluate the performance of classification models. The ROC curve graphically shows the trade-off between the sensitivity and specificity of the model at different thresholds. AUC measures the performance of a classification model with a single numerical value. The AUC value ranges from 0 to 1, with values closer to 1 indicating better performance18. The Matthews Correlation Coefficient (MCC) is a metric that ranges from −1 to +1, reflecting the quality of binary classification predictions. A value of +1 signifies a perfect prediction where the classifier correctly identifies all positive and negative cases. An MCC of 0 indicates that the prediction performance is no better than random guessing, meaning the classifier has no real predictive power. Conversely, a value of −1 represents a complete disagreement between the predicted and actual classes, indicating that the model is consistently wrong in its predictions.

These metrics are calculated using Eqs. (3)-(7). The components of the equations are true positive (TP), true negative (TN), false positive (FP) and false negative (FN).

$$Accuracy = \frac{TP + TN}{TP + TN + FP + FN}$$
(3)
$$Precision = \frac{TP}{TP + FP}$$
(4)
$$Recall = \frac{TP}{TP + FN}$$
(5)
$$F1\text{-}score = 2 \times \frac{Precision \times Recall}{Precision + Recall}$$
(6)
$$MCC = \frac{(TP \times TN) - (FP \times FN)}{\sqrt{(TP + FP)(TP + FN)(TN + FP)(TN + FN)}}$$
(7)
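For illustration, the metrics defined in Eqs. (3)-(7), together with AUC, can be computed with scikit-learn as sketched below; the label vectors are placeholders for the per-fold predictions.

```python
from sklearn.metrics import (accuracy_score, precision_score, recall_score,
                             f1_score, matthews_corrcoef, roc_auc_score)

y_true = [0, 0, 1, 1, 1, 0, 1, 0]                       # placeholder ground truth
y_pred = [0, 1, 1, 1, 0, 0, 1, 0]                       # placeholder predictions
y_prob = [0.2, 0.6, 0.8, 0.9, 0.4, 0.1, 0.7, 0.3]       # placeholder probabilities

print("Accuracy :", accuracy_score(y_true, y_pred))
print("Precision:", precision_score(y_true, y_pred))
print("Recall   :", recall_score(y_true, y_pred))
print("F1-score :", f1_score(y_true, y_pred))
print("MCC      :", matthews_corrcoef(y_true, y_pred))
print("AUC      :", roc_auc_score(y_true, y_prob))
```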

The experiments conducted in this research utilized the A100 GPU, with results generated through the Google Colab platform. These graphics processing units provide a significant benefit in computationally intensive tasks, such as deep learning, which fundamentally enhances performance and efficiency.

McNemar’s test

McNemar’s Test allows the difference between two models to be evaluated without the assumption of independence. It is especially used in binary classification problems. McNemar’s Test is a reliable method for determining whether one model has a significant advantage over the other. As a test of statistical significance, it is used to determine whether the difference between models is due to chance19.

Models

EfficientNetB2, InceptionV3, RegNetx006, and a basic Vision Transformer structure (ViT) were used as models. In addition, a new model was tested by adding a Feature Pyramid Network (FPN) structure to EfficientNetB2. The FPN combines feature maps of different resolutions extracted by deep neural networks, enabling better recognition of objects at various scales. The main goal of FPN is to effectively combine multi-scale information from different layers, merging high- and low-level features. It involves a top-down pathway starting from the top-level feature map; at each step, the coarser feature map is upsampled to a higher resolution20. In this process, the upsampled map and the feature map of the lower level are merged. In the proposed model, feature maps of different resolutions are obtained by taking the outputs of certain layers of the EfficientNetB2 model. These layers provide feature maps extracted from the image at different depths of the model. In the FPN, these multi-resolution feature maps from EfficientNetB2 are combined by upsampling and addition. FPN was added to EfficientNet because it is often preferred when working with complex and multi-layered data structures such as medical image analysis. FPN is considered to have the following advantages for Alzheimer’s detection:

  • Brain images often contain both large-scale and small-scale changes. By extracting these features at different resolutions, FPN helps the model learn both global and local information.

  • Small changes in brain tissue can be important in the early diagnosis of Alzheimer’s disease. By using high-resolution feature maps, FPN can help capture these fine details and minimize errors in diagnosis.

  • FPN provides benefits in classification such as better feature representation, resolution independence and rich feature hierarchy.

  • FPN can better capture and detect anomalies or changes at various resolutions, which can help catch early signs of Alzheimer’s disease.

Figure 4 shows a visualization of this EfficientNetB2 + FPN structure.

Fig. 4
figure 4

EfficientNetB2 + FPN architecture.
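To make the fusion step concrete, a minimal Keras sketch of the EfficientNetB2 + FPN idea is given below. The tap-point layer names and the 128-channel FPN width are our assumptions for illustration and do not necessarily match the exact configuration used in the study.

```python
from tensorflow.keras import layers, models
from tensorflow.keras.applications import EfficientNetB2

def build_efficientnet_fpn(num_classes: int = 5, fpn_channels: int = 128):
    backbone = EfficientNetB2(include_top=False, weights="imagenet",
                              input_shape=(224, 224, 3))
    # Assumed Keras tap points standing in for the paper's Block 4 / 6 / 8 outputs.
    c3 = backbone.get_layer("block4a_expand_activation").output   # ~28 x 28
    c4 = backbone.get_layer("block6a_expand_activation").output   # ~14 x 14
    c5 = backbone.get_layer("top_activation").output              # ~7 x 7

    # Lateral 1x1 convolutions bring every level to the same channel width.
    p5 = layers.Conv2D(fpn_channels, 1, padding="same")(c5)
    p4 = layers.Conv2D(fpn_channels, 1, padding="same")(c4)
    p3 = layers.Conv2D(fpn_channels, 1, padding="same")(c3)

    # Top-down pathway: upsample the coarser map and add it to the finer one.
    p4 = layers.Add()([p4, layers.UpSampling2D(2)(p5)])
    p3 = layers.Add()([p3, layers.UpSampling2D(2)(p4)])

    # Pool each pyramid level and classify on the concatenated descriptor.
    pooled = [layers.GlobalAveragePooling2D()(p) for p in (p3, p4, p5)]
    feats = layers.Dropout(0.3)(layers.Concatenate()(pooled))
    outputs = layers.Dense(num_classes, activation="softmax")(feats)
    return models.Model(backbone.input, outputs)

model = build_efficientnet_fpn()
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
```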

Apart from the new model described above, a new model was created by integrating the ViT structure with the EfficientNetB2 + FPN structure. The architecture of this model is shown in Fig. 5. The image is divided into squares of a given size (e.g. 16 × 16) or small pieces called “patches”. Each patch is converted into a flat vector. In our Vision Transformer architecture, we employ a transformer encoder consisting of 8 layers, each with eight attention heads and an embedding dimension of 768. The model follows the standard ViT formulation, with multi-head self-attention and feed-forward sublayers in each encoder block. To integrate multi-scale features from the FPN into the Vision Transformer, each FPN output (P3–P5) is first projected to a standard embedding dimension via 1 × 1 convolutions, resized to a uniform spatial resolution, and then concatenated along the channel axis before being flattened into patch embeddings compatible with the ViT input format. After that it is sent to the EfficientNetB2 + FPN structure. The feature maps coming from the EfficientNetB2 + FPN structure are adapted to ViT and then processed within ViT. In this way, the power of CNN-based rich feature maps is combined with the global context learning capabilities of ViT.

Fig. 5
figure 5

The proposed ViT + EfficientNetB2 + FPN architecture.

The multi-scale information extraction capability of EfficientNetB2 + FPN, combined with ViT’s capacity to learn strong spatial relationships, improved the classification performance. The parameter efficiency of EfficientNetB2, the multi-scale integration of FPN and the overall model complexity of ViT can reduce the risk of overfitting. The proposed new model is expected to provide a faster and more efficient training process, allowing researchers to obtain results in a shorter time. In our model, intermediate feature maps from different stages of EfficientNetB2 are used to feed the Feature Pyramid Network (FPN). Specifically, the outputs of Block 4 (P3 level), Block 6 (P4 level), and Block 8 (P5 level) are utilized. These multi-scale features enable the FPN to learn more meaningful and detailed representations.
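The sketch below illustrates, under the same caveats, how the FPN outputs could be projected, resized, concatenated, and flattened into patch embeddings for the 8-layer, 8-head, 768-dimensional transformer encoder described above; the 14 × 14 token grid and layer sizes are illustrative assumptions rather than the exact implementation.

```python
from tensorflow.keras import layers

def fpn_to_vit_tokens(p3, p4, p5, embed_dim=768, grid=14):
    """Project FPN maps with 1x1 convs, resize to one grid, concatenate, flatten."""
    proj = [layers.Conv2D(embed_dim // 3, 1)(p) for p in (p3, p4, p5)]
    resized = [layers.Resizing(grid, grid)(p) for p in proj]
    fused = layers.Concatenate(axis=-1)(resized)                 # (grid, grid, 768)
    tokens = layers.Reshape((grid * grid, fused.shape[-1]))(fused)
    return layers.Dense(embed_dim)(tokens)                       # patch embeddings

def transformer_encoder(tokens, num_layers=8, num_heads=8, embed_dim=768):
    """Standard ViT encoder blocks: self-attention + feed-forward, with residuals."""
    x = tokens
    for _ in range(num_layers):
        attn = layers.MultiHeadAttention(num_heads=num_heads,
                                         key_dim=embed_dim // num_heads)(x, x)
        x = layers.LayerNormalization()(x + attn)
        ffn = layers.Dense(embed_dim * 4, activation="gelu")(x)
        ffn = layers.Dense(embed_dim)(ffn)
        x = layers.LayerNormalization()(x + ffn)
    return x

# Hypothetical wiring with the p3, p4, p5 maps from the FPN sketch above:
# tokens  = fpn_to_vit_tokens(p3, p4, p5)
# encoded = transformer_encoder(tokens)
# logits  = layers.Dense(5, activation="softmax")(layers.GlobalAveragePooling1D()(encoded))
```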

Results and discussions

In this study, neuroimaging (MRI) data consisting of five classes (CN, EMCI, MCI, LMCI and AD) were used. Three different CNN models and a simple ViT model were tested for early detection of AD. In addition, two new models were proposed. The training of the models was performed on Google Colab.

Once the studied images were converted to PNG, the images in the dataset were matched with the labels of each class in the CSV file. After the labeling process was completed, the selected models (EfficientNetB2, InceptionV3, RegNetx006) were run with 10 epochs and 10-fold cross validation. To avoid overfitting and improve the performance of the models, optimizing the CNN models with a dropout process is an important step. The results were first obtained based on 5 manually selected slices and then on 5 slices automatically selected with SSIM. A reference brain MRI image was given for each class when selecting slices with SSIM. The results are given in Tables 4 and 5.

Table 4 Classification results obtained when the models are run (5 slices manually).
Table 5 Classification results obtained when the models are run (5 slices with SSIM).

Looking at the average accuracy values obtained in Tables 4 and 5, it can be said that the best results are obtained with the RegNetx006 model in both manual and SSIM-based scenarios. It is also observed that the slices selected with SSIM obtained better results compared to the slices selected manually. At this point, we can see the importance of the proposed slice selection mechanism with SSIM. It is seen that our proposed EfficientNetB2 + FPN model gives quite satisfactory results, coming after RegNetx006. In the second scenario with automated slice selection, RegNetx006 and EfficientNet + FPN models yield 98.89% and 98.74% accuracies, respectively.

The graphs showing the average accuracy values for each model and the accuracy values at each epoch are given in Figs. 6 and 7.

Fig. 6
figure 6

Manually selected results (A) EfficientNetB2, (B) InceptionV3, (C) RegNetx006, (D) Basic ViT (E) EfficientNetB2 + FPN.

Fig. 7
figure 7

With SSIM selected results (A) EfficientNetB2, (B) InceptionV3, (C) RegNetx006, (D) Basic ViT (E) EfficientNetB2 + FPN.

The models were also run with 1 vs. 1 classification, where each class was compared with another. The purpose of 1 vs. 1 classification is to make direct pairwise comparisons and clearly identify which methods perform better for each class pair. Table 6 (5 slices manually) and Table 7 (5 slices with SSIM) show these results.

When we examine Table 6, the best accuracy of 99.12% is obtained with the EfficientNetB2 + FPN model in the AD vs. LMCI comparison. This is followed by 98.94% with the RegNetx006 model in the MCI vs. LMCI comparison. Achieving good results in the AD vs. LMCI comparison is critical for detecting the patient’s progression from the LMCI stage to the AD stage. In the MCI vs. LMCI comparison, it is important to detect the patient before progression to advanced MCI and AD.

When we look at Table 7, the best accuracy result of 99.45% is obtained with the EfficientNetB2 + FPN model in the AD vs. LMCI comparison. This is followed by 99.42% for the RegNetx006 model in the AD vs. LMCI comparison. Achieving good results in the AD vs. LMCI comparison is also important, as mentioned for Table 6. In the AD vs. EMCI classification, EfficientNetB2 + FPN achieved 99.19% accuracy. Likewise, in the AD vs. EMCI comparison, detecting the disease while the patient is still at the early MCI stage is thought to have a significant impact on the ability to diagnose AD before it reaches the AD stage. Apart from these, when manually selected MRI images are compared with the images chosen with SSIM, there are considerable improvements in the results obtained from the images chosen with SSIM, which underlines the importance of the automated selection mechanism.

The Matthews Correlation Coefficient (MCC) values presented in Table 7 demonstrate the robustness and reliability of the proposed models across all pairwise classification tasks. Notably, the models incorporating the EfficientNetB2 backbone combined with the Feature Pyramid Network (FPN), as well as the integrated ViT + EfficientNetB2 + FPN model, consistently achieve higher MCC scores compared to baseline architectures such as EfficientNetB2, InceptionV3, RegNetx006, and Basic ViT. For instance, in challenging classifications such as CN/AD and CN/MCI, the MCC values for the combined models reach above 0.94, indicating strong agreement between predicted and true labels. Furthermore, even in subtler distinctions like MCI/EMCI and LMCI/EMCI, these models maintain MCC scores above 0.83, reflecting their effectiveness in capturing nuanced patterns within the data. Overall, the elevated MCC metrics across diverse classification pairs highlight the superiority of the proposed multi-scale and transformer-enhanced architectures in delivering balanced and reliable predictions, addressing potential class imbalance and enhancing overall model performance.

In this study, McNemar’s test, the nonparametric counterpart of the χ2 test, was used to assess the statistical significance of the differences between the classifiers’ performances. Comparing two classifiers results in four possible outcomes, shown in Table 8.

As outlined in Table 8, Nff, Nsf, Nfs, and Nss correspond to the cases where both classifiers failed, only classifier A was correct, only classifier B was correct, and both classifiers were correct, respectively. However, only Nsf and Nfs were considered to identify a significant difference, as they reflect the instances where one classifier succeeded while the other did not. These values were used to calculate the z-score, which determines whether the two classifiers perform similarly, as shown in Eq. (8).

$$Z = \frac{\left|N_{sf} - N_{fs}\right| - 1}{\sqrt{N_{sf} + N_{fs}}}$$
(8)

When the z-score is 0, it indicates that the two classifiers have similar performance. As the z-score moves further from 0, the difference in performance between the two classifiers becomes more pronounced. Moreover, z-scores can be analyzed based on confidence levels for both one-tailed and two-tailed tests. The confidence levels associated with various z-scores are shown in Table 9.
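A minimal sketch of this computation is given below; the disagreement counts are hypothetical and serve only to illustrate Eq. (8).

```python
import math

def mcnemar_z(n_sf: int, n_fs: int) -> float:
    """z-score of Eq. (8) computed from the two disagreement counts."""
    return (abs(n_sf - n_fs) - 1) / math.sqrt(n_sf + n_fs)

# Hypothetical counts: classifier A alone correct on 40 samples, B alone on 15.
z = mcnemar_z(40, 15)
print(f"z = {z:.2f}")   # compared against the confidence thresholds in Table 9
```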

Table 6 Classification results obtained when the models are run 1 vs. 1 (5 slices manually).
Table 7 Classification results obtained when the models are run 1 vs. 1 (5 slices with SSIM).

The z-scores for the architectures used in predicting Alzheimer’s disease are presented in Tables 10 and 11.

Table 8 The potential results of the two classifiers.
Table 9 z scores and confidence levels.
Table 10 z-scores of architectures for manually selected slices in Alzheimer’s prediction.
Table 11 z-scores of architectures for with SSIM selected slices in Alzheimer’s prediction.

Tables 10 and 11 display a performance comparison of various classifiers on the dataset, using arrowheads (←, ↑) to highlight which classifier achieved superior results regarding true predictions (both true positives and negatives). Accompanying the arrowheads are z-scores, indicating the statistical significance of these differences. Bold font is used in the tables to emphasize statistically significant outcomes. If significance is found, the confidence levels for one-tailed and two-tailed predictions are listed below.

Table 10 shows the classification results of manually selected slices for 1 vs. 1 classifications of 6 deep learning architectures. ViT + EfficientNetB2 + FPN and EfficientNetB2 + FPN are the best performing models for Alzheimer’s disease classification. The z-score values in the tables reveal that this model shows very high statistical significance for each classification task. The success of this model can be explained by combining the powerful feature extraction capacity of EfficientNetB2 with the attention mechanism of Vision Transformer (ViT). Moreover, its efficient processing of multilayer features with FPN provides a great advantage in complex classification problems such as Alzheimer’s disease.

Table 11 shows the classification results of slices selected with SSIM for 1 vs. 1 classifications of the 6 deep learning architectures. The ViT + EfficientNetB2 + FPN model is the best performing model for Alzheimer’s disease classification. It shows top performance on the challenging classification tasks, while the EfficientNetB2 + FPN model is a highly competitive alternative. In terms of statistical significance, both models are far above the other models and can be considered the two most reliable models for accurately classifying the different stages of Alzheimer’s disease.

In addition, training time, model size, and inference speed information for the models run are given in Tables 12, 13, 14 and 15. Tables 14 and 15 include the training times in the 1 vs. 1 classification.

Table 12 Training time, model size, and inference speed of the models (5 slices selected manually).
Table 13 Training time, model size, and inference speed of the models (5 slices selected with SSIM).
Table 14 Training times with 1 vs. 1 (5 slices selected manually).
Table 15 Training times with 1 vs. 1 (5 slices selected with SSIM).

When comparing Tables 12 and 13, it is observed that training with manually selected slices (Table 12) results in longer epoch durations and total training times for all models compared to training with SSIM-selected slices (Table 13). Notably, models trained with manual selection exhibit higher epoch and total training durations than those trained with SSIM-based selection, with the most significant differences observed in the Basic ViT and ViT + EfficientNetB2 + FPN models. For instance, the total training time for the Basic ViT model is 3.0 h with manually selected slices, whereas it decreases to 2.7 h with SSIM-selected slices. Similarly, the ViT + EfficientNetB2 + FPN model’s training time decreases from 3.3 to 3.0 h. This suggests that the SSIM-based slice selection method optimizes data representation, leading to a more efficient training process.

Tables 14 and 15 present training times for pairwise (1 vs. 1) classification tasks using five slices, selected either manually (Table 14) or via the SSIM-based method (Table 15). Both tables show that training time increases with model complexity, especially for models incorporating FPN and ViT components. However, training times in Table 15, where SSIM-selected slices were used, are generally shorter or comparable to those in Table 14. This suggests the SSIM method may lead to more informative and homogeneous slice selection, enabling more efficient learning. For example, in the CN vs. AD classification, the ViT + EfficientNetB2 + FPN model takes 16.3 min to train with manually selected slices, whereas it only takes 15.1 min with SSIM-selected slices. Similar reductions in training time can be observed in other class pairs, such as AD vs. MCI and MCI vs. LMCI. These findings indicate that SSIM-based slice selection may also improve model performance and contribute to more efficient training processes.

To justify the use of top-5 slice selection without applying a fixed threshold, the distribution of SSIM scores across all slices was analyzed. The blue curve and bars represent the SSIM score distribution for all slices (54 × number of patients), with a mean of approximately 0.76. The orange curve and bars show the SSIM scores of only the top-5 slices, with a higher mean of roughly 0.91. As shown in Fig. 8, while SSIM scores exhibit a wide distribution across all slices, the top-5 slices demonstrate the highest similarity. Therefore, selection based on ranking rather than a fixed threshold is more meaningful.

Fig. 8
figure 8

Distribution of SSIM Scores Across All Slices and Top-5 Selected Slices.

The primary goal here is to show the importance of slice selection in AI-based Alzheimer’s diagnosis. When Alzheimer’s studies are analyzed, it is seen that slice selection has not been emphasized much. Generally, either all slices were used, or these processes were carried out by expert selection, so it is thought that the study has an important place in slice selection. One of the key points of the study is that instead of analyzing all slices, selecting only the necessary and meaningful slices reduced the data processing time and made the analysis more efficient. This also reduced the computational burden and the need for storage.

Apart from the slice selection described, the two different proposed models are believed to be an important contribution to the literature. Compared to other studies, Duan et al.5 obtained an accuracy of 89.58% for AD vs. CN, while we reached an accuracy of 98.81% with the EfficientNetB2 + FPN model on SSIM-selected slices. Gamal et al.10 obtained 78.60% for AD vs. MCI with CNN + ViT, while in another recent study, Şener et al.11 obtained 97.95% for AD vs. MCI with the EfficientNetB0 model. In our current study, we outperformed both studies with a 99.00% accuracy rate using EfficientNetB2 + FPN and a 98.90% accuracy rate using ViT + EfficientNetB2 + FPN for AD vs. MCI. For CN vs. MCI, Gamal et al.10 achieved an accuracy of 78.86%, and an accuracy of 98.42% was reported with the DenseNet121 model, while a more recent model such as RegNetx006 achieved an accuracy of 98.91%. In AD vs. EMCI classification, 99.19% accuracy was achieved with EfficientNetB2 + FPN; detecting the disease at this early stage, before the patient progresses to AD, was among the goals we set.

The results were also significant according to McNemar’s statistical test. Both the ViT + EfficientNetB2 + FPN and EfficientNetB2 + FPN models can be said to be the best models for Alzheimer’s classification. In terms of statistical significance, both models are above the other models and can be considered the two most reliable models for accurately classifying different stages of Alzheimer’s disease. Compared to the literature, we obtained quite promising results in the 1 vs. 1 classifications. It is believed that these satisfactory results come from both the use of improved models and the novel slice selection approach.

For generalizability, the proposed model was also tested on the OASIS dataset. MRI images in the AD and CN classes were obtained from the OASIS dataset. There are 150 subjects in this dataset, of which 72 belong to the CN class and 78 to the AD class, and each subject has 128 T1-weighted slices. As in the main study, 5 slices were selected manually and 5 slices were selected with SSIM. The proposed models were then run and the results were obtained. The results are presented in Table 16, where we observe that the slices selected with SSIM provide a slight performance improvement over the manually selected slices. This finding shows that our model can provide very robust and stable results.

Table 16 Classification results obtained when the models are run with OASIS dataset.

Conclusion

Alzheimer’s disease has recently become an increasing health problem. Diagnosing the disease at an early stage can help preserve cognitive function, enabling patients to carry out activities of daily living independently for longer. From this point of view, early diagnosis becomes crucial. This study focuses on deep learning models, ViT, and newly proposed models for the early detection of Alzheimer’s disease. The models used were selected from among current architectures. The main justification for choosing these models is that they provide a significant advantage, especially in rapidly changing fields such as medicine and technology. In addition, the newly proposed models bring an innovation over the current models. The importance of slice selection is also emphasized; SSIM was used to select the best possible slices. For comparison, the slices were first selected manually and then selected with SSIM. For the manual selection, 5 slices were taken from the mid-region of the MRI sequence. In the SSIM-based automatic selection, the first step was to determine a reference image for each class: the MRI slices of each patient were analyzed, the slice with the largest number of edge segments was selected as the patient-level reference image, and a generic class-level reference image with the largest number of edge segments was then selected from these reference images. Since our study obtained results on a dataset containing T1-weighted axial MRI images, it is limited to this type of image.

The findings highlight the time-saving advantage of the SSIM-based slice selection method. Across all models and classification tasks, training durations—per epoch and total—were consistently shorter when slices were selected using SSIM rather than manual methods. For example, in the CN vs. AD classification using the ViT + EfficientNetB2 + FPN model, SSIM-based selection reduced training time by 1.2 min compared to manual selection. While the absolute reductions in time may appear modest, they become increasingly significant when scaled across multiple models, folds, and experiments. These results suggest that SSIM selection maintains informative content and supports a more time-efficient and scalable training process.

To evaluate the generalizability of the proposed model, additional experiments were conducted on the OASIS dataset, which includes 150 subjects (72 CN and 78 AD), each with 128 T1-weighted MRI slices. In alignment with the main study, five slices per subject were selected both manually and automatically using the SSIM-based method. The proposed models were then applied to these selections, and the results are presented in Table 16. The results show that the SSIM-selected slices provide a slight performance improvement over the manually selected slices. These results emphasize the robustness and stability of the proposed approach on different datasets and strengthen its potential for reliable use in real-world clinical scenarios.

These efforts provide important contributions to the early detection of Alzheimer’s disease and are considered to provide strong support for clinical diagnosis. In future studies, we aim to exceed these results with new model proposals, keeping in mind the importance of slice selection. The model will be validated extensively on different datasets with similar characteristics, and specialized methods are planned to handle the imbalanced dataset.