Automated multi-model framework for malaria detection using deep learning and feature fusion

Shahin, Osama R.; Alshammari, Hamoud H.; Alabdali, Raed N.; Salaheldin, Ahmed M.; Saleh, Neven

doi:10.1038/s41598-025-04784-w

Download PDF

Article
Open access
Published: 16 July 2025

Automated multi-model framework for malaria detection using deep learning and feature fusion

Osama R. Shahin¹,
Hamoud H. Alshammari²,
Raed N. Alabdali¹,
Ahmed M. Salaheldin³ &
…
Neven Saleh⁴

Scientific Reports volume 15, Article number: 25672 (2025) Cite this article

2338 Accesses
1 Citations
11 Altmetric
Metrics details

Subjects

Abstract

Malaria remains a critical global health challenge, particularly in tropical and subtropical regions. While traditional methods for diagnosis are effective, they face some limitations related to accuracy, time consumption, and manual effort. This study proposes an advanced, automated diagnostic framework for malaria detection using a multi-model architecture integrating deep learning and machine learning techniques. The framework employs a transfer learning approach that incorporates ResNet 50, VGG16, and DenseNet-201 for feature extraction. This is followed by feature fusion and dimensionality reduction via principal component analysis. A hybrid scheme that combines support vector machine and long short-term memory networks is used for classification. A majority voting mechanism aggregates outputs from all models to enhance prediction robustness. The approach was validated on a publicly available dataset comprising 27,558 microscopic thin blood smear images. The results demonstrated superior performance, achieving an accuracy of 96.47%, sensitivity of 96.03%, specificity of 96.90%, precision of 96.88%, and F1-score of 96.45% using the majority voting ensemble. Comparative analysis highlights the framework’s advancements over existing methods in diagnostic reliability and computational efficiency. This work underscores the potential of AI-driven solutions in advancing malaria diagnostics and lays the foundation for applications in other blood-borne diseases.

Efficient deep learning-based approach for malaria detection using red blood cell smears

Article Open access 10 June 2024

Deep learning-based malaria parasite detection: convolutional neural networks model for accurate species identification of Plasmodium falciparum and Plasmodium vivax

Article Open access 30 January 2025

Multiclass malaria parasite recognition based on transformer models and a generative adversarial network

Article Open access 10 October 2023

Introduction

Red blood cell (RBC) assessment is a significant diagnostic tool for a wide range of diseases, including anaemia, thalassemia, and malaria^1,2,3. When morphological characteristics of the RBC are altered, this change may indicate an infection with a parasite, such as malaria. Usually, blood diseases are diagnosed based on physical examinations of the blood. However, microscopic images play a pivotal role in diagnosing such illnesses. Incorporating the counting of RBCs and examining the morphological change of the cell shape may result in a more comprehensive diagnosis.

Malaria is a global health challenge that leads to deaths, particularly among young children and pregnant women. It is a mosquito-borne disease that is widely spread in tropical and subtropical regions. The common cause of the disease is a Plasmodium parasite that infects humans through female Anopheles mosquitoes^4,5,6. In 2022, the World Health Organization (WHO) reported that 96% of malaria cases were recorded in twenty-nine countries globally. It was estimated that there were 249 million malaria cases, with a frequency of cases that was 58 out of 1000 individuals at high risk⁷. In Africa, malaria cases contributed to 90% of global cases and 92% of malaria-related deaths⁸.

Typically, malaria symptoms are similar to many diseases, like the flu, causing a strong fever, sweating, and headache. Additionally, it manifests with nausea, vomiting, and muscular consequences⁹. If the disease is left without appropriate therapy or ineffective drugs, it leads to grave consequences, specifically for children and pregnant women. The severity includes damage to the RBCs, which leads to severe anaemia and dysfunction of the respiratory system¹. Besides, it impacts the musculoskeletal functions and cardiac muscles, which might result in serious complications⁹.

Microscopic medical image analysis has significantly contributed to the identification and classification of numerous blood-type illnesses. Examining these images manually can lead to an inaccurate diagnosis. Globally, almost all microscopic images are prepared as thin blood smears and digitally as thick films^1,10,11. Typically, malaria is one of the blood-borne illnesses that is commonly diagnosed via microscopic images. Generally, the peripheral blood smears, known as the Giemsa-stained blood films, are the gold standard for diagnosing malaria. This technique involves the visual examination of thick and thin blood films. The aim of inspecting stained, thick blood smear slides under a microscope light is to detect and quantify parasites. Thin blood smear slides are used for identifying species of parasites^1,11. In diagnosis, the degree of morphological changes of the RBC indicates the severity of malaria. The changes entail size variation, distribution variation, and shape variation^4,11. Infected RBCs appear in light red, while parasites appear in blue and dark red colors¹.

However, this manual technique presents several significant challenges that motivate the exploration of automated Artificial Intelligence (AI)-driven solutions. Firstly, accurate diagnosis demands a high level of expertise and extensive training, as the morphological changes in RBCs must be carefully assessed to determine the severity of infection. Furthermore, the visual distinction between malaria parasites and other cellular artifacts or parasitaemia can be highly ambiguous, increasing the risk of misdiagnosis. Another major limitation lies in the labor-intensive nature of manual slide examination^1,2. To confidently declare a negative result, a specialist must thoroughly inspect at least 200 high-powered fields without detecting any parasites⁶. This process is time-consuming and susceptible to human error, particularly in high-volume or resource-limited clinical settings. Additionally, the consistency and reproducibility of manual diagnosis can vary significantly among different operators and laboratories. Such variability can impact the timeliness and effectiveness of treatment, especially in endemic regions where rapid decision-making is crucial.

Given these challenges, there is a clear need for more efficient, accurate, and scalable diagnostic solutions. In recent years, AI techniques – particularly machine learning (ML) and deep learning (DL) – have had a significant impact in this domain¹². These methods address several limitations, such as inaccuracy, time consumption, and the extensive manual effort required². Moreover, AI offers the capability to analyze a larger number of images in a much shorter time.

One major advantage of AI is its ability to automate the detection of malaria, including identifying the presence, type, and severity of parasites⁴. Since AI-based detection and classification are generally faster and more consistent than manual methods, they contribute to more reliable diagnosis and standardized treatment protocols. As a result, the application of various AI techniques in malaria diagnosis has seen considerable growth. In this context, both ML and DL algorithms are being widely applied to support a range of tasks related to malaria detection and analysis³. The key contributions of this study are as follows:

Development of a hybrid classification framework: A novel diagnostic framework is proposed by integrating multiple DL architectures – ResNet-50, VGG-16, and DenseNet-201 – with traditional classifiers, such as Support Vector Machine (SVM) and Long Short-Term Memory (LSTM) networks. This approach leads to improved diagnostic accuracy of malaria detection.
Implementation of feature fusion for optimal representation: The study introduces a feature fusion strategy that effectively combines and reduces high-dimensional features extracted from different DL models. This process eliminates redundant or irrelevant features while preserving the most informative ones, resulting in a discriminative feature vector.
Application of majority voting mechanism for robust decision-making: A majority voting ensemble method is employed to integrate predictions from the various classifiers. This strategy ensures a more stable and reliable final decision by reducing the likelihood of misclassification due to individual model bias or overfitting.
Comprehensive performance evaluation using multiple metrics: The proposed approach is rigorously evaluated using a variety of performance metrics, including accuracy, precision, recall, F1-score, and AUC. This ensures the generalization of performance across different scenarios.
Comparative analysis with existing methods: The results obtained from the proposed system are compared with those of other state-of-the-art methods in the literature. The hybrid framework demonstrates superior performance, achieving higher classification accuracy and greater diagnostic reliability, proving the advantage of integrating DL with ML classifiers.
Contribution to AI-driven healthcare diagnostics: This study not only advances the field of malaria detection but also contributes to the broader domain of AI applications in healthcare. By benchmarking and validating a robust, automated diagnostic pipeline, the research lays the groundwork for future developments in intelligent disease diagnosis systems.

The rest of this article is organized as follows. Related studies on malaria detection using AI methods are covered in the Related Works section. Before presenting the adopted methodology, we provide a Background section to describe the techniques utilized. The Materials and Methods section introduces the proposed pipelines using machine learning and deep learning algorithms. The results of the adopted methods are addressed in the Results section. The Discussion section explains the significance of the results and compares them with those of other related works. Finally, the Conclusion section summarizes the study and outlines expectations for future work.

Related works

Numerous studies have been conducted for malaria diagnosis using a range of AI algorithms. Despite malaria still being diagnosed with traditional techniques, including rapid diagnostic (RD) and the polymerase chain reaction (PCR) tests^3,13, AI algorithms remain promising. Convolutional neural networks (CNNs), particularly with diverse architectures and integration with various techniques, demonstrate meticulous detection of malaria. Moreover, microscopic images are widely used for investigation, either in thin or thick smear film forms.

A series of ML models were employed to forecast malaria, including random forest (RF), artificial neural network (ANN), multiple linear regression, and adaptive neuro-fuzzy inference system³. This study used haematological dataset belonging to 2207 Ghanaian patients with only twenty features. The results were assessed by statistical metrics, named R, R², MSE, and RMSE. The ANN model demonstrated the best performance among the other models, while the RF was the worst model. A set of RBCs was used by Sunarko et al.¹ to detect malaria. The work proposed Otsu’s method to segment the cells and distinguish the background and cell boundary from malaria parasites. Statistical features and skewness were used to detect the healthy cells from those infected. A total of 2000 thin blood smear images were used for training, and another 2000 microscopic images were used for testing. Both the grayscale images and RGB images were analyzed to extract the intensity distributions. Additionally, a K-means clustering method was employed to categorize the different colors within the cell, including identifying the parasites. The results revealed that there was 94.60% accuracy in detecting the infected cells.

Muhammed et al.¹⁰ study focused on identifying the rouleaux formation morphology of the RBCs caused by malaria parasites. Two CNN models were used with different sizes of input images. The study revealed the optimal image size was 300 × 300, which yielded an accuracy of 90.95% for detecting abnormal rouleaux formation. An extension of that work was presented in¹¹, where five CNN models were employed for a dataset obtained from 100 infected patients. This includes XceptionNet, ResNet 50, DenseNet, EfficientNetB4, customized CNN, achieving accuracies of 96.40%, 98.50%, 99.00%, 96.20%, and 98.40%, respectively. Kassim et al.¹⁴ have conducted a study based on real curated data in Bangladesh. They used a total of 965 microscopic thin images belonging to 193 patients. Additionally, the dataset was divided into a polygon set and a point set to discriminate training and testing images, respectively. The authors presented RBCNet by using U-Net to segment the RBCs and then applying faster R-CNN to distinguish infected and non-infected RBCs. The precision for the tested images (point set of data) was 97.54 ± 1.44.

A comprehensive framework was developed in² to identify the RBCs in microscopic images. To accurately identify areas of interest, RBCs were efficiently characterized using a region-based CNN model. Hoyos and Hoyos developed a deep learning-based model using YOLOv8 to detect malaria in addition to leukocytes, based on thick smear microscopic images. The study concluded that the developed model presented promising results compared to other related work, achieving an accuracy of 98.00% and 95.00% in detecting leukocytes and malaria parasites, respectively⁶. Another relevant study focused on using various machine learning models to diagnose malaria³. The applied algorithms were ANN, multi-linear regression, adaptive neuro-fuzzy inference system, and random forest. All models were trained based on fifteen criteria related to hematological parameters of blood tests, such as RBC count, hemoglobin level, etc. The mean square error, root mean square error, R² coefficient, and R coefficient were used to characterize the malaria predicting results.

In 2023, automated CNN models were developed to diagnose malaria¹³. The authors employed a customized CNN, ResNet50, and MobileNetV2. A dataset consisting of 27,558 smear images with 50% infected images and 50% free parasitized images was investigated. The CNN model detected malaria with an accuracy of 95.50%, the MobileNetV2 has achieved an accuracy of 97.06%, and the ResNet50 yielded an accuracy of 96.70%. In this way, the MobileNetV2 consistently outperformed in diagnosing malaria. The question of whether thin smear images or thick smear images are most appropriate for malaria detection was answered by Ozsahin et al.¹⁵. In this context, a customized CNN model was applied to a dataset containing infected images with P. falciparum and P. vivax in addition to uninfected images. Moreover, a transfer learning approach has been adopted to validate the model by using the VGG16, ResNet50, and Inception V3. The evaluation metrics, including accuracy, precision, sensitivity, and F1-score, demonstrated strong performance for thick smear images. Therefore, the study revealed that the thick smear images outperformed the thin smear images in malaria diagnosis.

Previous studies have shown that automated malaria detection relies on the use of machine learning and deep learning algorithms. Given the robustness of DL algorithms in handling large-scale datasets of microscopic images, we adopted a DL-based framework for malaria detection using thin smear images. Among DL algorithms, VGG 16, ResNet50, and DenseNet-201 are dominant in disease diagnosis, such as malaria. Table 1 summarizes related works by publication year, dataset, results, pros, and cons.

Background

This study was developed based on using ML and DL algorithms to diagnose malaria. The DL algorithms were employed to extract features of the utilized microscopic images, while the ML algorithms were used for classification into normal or abnormal images. In DL-based techniques, the CNN was the core of the network architecture with a modification in terms of the number of layers and the order of layers. In implementation, we adopted the ResNet50, the VGG 16, and DenseNet 201. Besides, the support vector machine (SVM), and Long Short-Term Memory (LSTM) networks were covered in Materials and Methods.

ResNet 50

In 2016, the Residual Neural Network (ResNet), introduced by He et al., aims to enhance performance by incorporating residual connections between layers. These connections reduce losses, increase knowledge acquisition, improve training efficiency, and make the model more robust against overfitting. ResNet models consist of multiple layers, ranging from 34 to 1202, with variations in the number of residual blocks and fundamental operations. Among these, ResNet-50 is the most widely used version, comprising 49 convolutional layers and one fully connected (FC) layer¹⁶.

The VGG16

In 2014, Oxford University scholars created the VGG16¹⁵. It consists of 13 convolutional layers divided into 5 segments, in addition to one segment that contains three fully connected layers^15,17. Therefore, the basic architecture of the network, leading to its name, is “the VGG16”. Usually, each layer uses a 3 × 3 kernel filter to reduce the number of parameters and nonlinear effects. The VGG 16 is characterized by a deeper architecture that leads to a progressive learning feature from low level to high level. Its enhanced nonlinear expressive power enables it to capture more features and handle more complex input data efficiently¹⁷. These features make the VGG16 particularly useful for tasks such as image recognition and classification.

DenseNet − 201

The DenseNet (Dense Convolutional Network) is a DL method that was designed to enhance propagation of features and reuse extracted features. It was originated by Huang et al. in 2017 [Ref. 17]. It connects each layer directly to every other layer in a feed-forward pattern. This configuration promotes information flow, which impacts the redundancy of parameters and minimizes the overfitting¹⁸. It resolves the issue of the vanishing gradient, allowing gradients to propagate more efficiently during backpropagation. Consequently, the DenseNet often achieves higher accuracy with a low number of parameters compared to other traditional DL models. This makes it more suitable for image classification and segmentation¹⁹. DenseNet-201’s densely connected layers enable it to capture intricate relationships between features, making it well-suited for tasks that demand fine-grained discrimination²⁰.

Materials and methods

The primary objective of this article is to develop and implement a robust AI-driven model for the accurate diagnosis of malaria using blood film microscopic images. The proposed methodology is designed as a multi-phase approach, with each phase contributing to the precision and reliability of the diagnostic process. The comprehensive workflow, as illustrated in Fig. 1, encompasses various stages of model development and implementation, including data preprocessing, feature extraction, model training, validation, and testing. This structured pipeline aims to ensure a high degree of diagnostic accuracy and efficiency, addressing the critical need for effective malaria detection in clinical and field settings.

1.
Dataset.

The “Cell Images for Detecting Malaria” dataset²¹, hosted on Kaggle, is a well-curated collection of microscopic images of blood smears designed to facilitate research and development in malaria detection using machine learning and deep learning models. It comprises 27,558 images, equally divided into two categories: Parasitized (malaria-infected cells) and Uninfected (healthy cells). The images were captured using light microscopy at a consistent magnification of 100x, ensuring high-resolution visual clarity ideal for machine learning and deep learning applications. The original input image has a size of 150 × 150. The dataset was undergoing some preprocessing, such as resizing to be convenient with the input layer of the proposed AI model.

The dataset is systematically organized into two directories. The Parasitized directory contains images of red blood cells infected with the Plasmodium parasite, marked by distinct visual features such as irregular shapes, dark spots, and uneven internal textures. The Uninfected directory includes images of healthy red blood cells, characterized by their smooth texture, uniform shape, and absence of parasitic artifacts. This structure simplifies data loading and preprocessing tasks for researchers and developers.

One of the standout features of this dataset is its balanced distribution, with an equal number of images in each category. This balance minimizes class imbalance issues, ensuring that machine learning models can train effectively without bias toward a particular class. Additionally, the dataset exhibits biological diversity, encompassing a wide range of cell morphologies, staining variations, and patterns that mirror the complexities encountered in real-world clinical settings.

Overall, the dataset is an invaluable resource for advancing AI-driven healthcare solutions. It is particularly suited for training, validating, and testing classification models as well as for transfer learning, where pre-trained models are fine-tuned for malaria detection tasks. The high-quality, diverse, and well-labelled nature of this dataset ensures that models trained on it are robust, reliable, and capable of performing effectively in real-world diagnostic scenarios. The dataset was split into 70% training, 15% validation, and 15% testing during the implementation of the proposed AI model. Examples of the utilized dataset are provided in Fig. 2. The distribution of the image count among the targeted classes is shown in Table 2.

Table 1 A comparison of related works concerning malaria diagnosis using AI techniques.

Full size table

Table 2 Distribution of the dataset utilized.

Full size table

2.
Phase (a): Implementation of End-to-End Deep Learning Models.

In this phase, the methodology combines the power of transfer learning and end-to-end deep learning approaches to process microscopic blood film images for malaria diagnosis. Three state-of-the-art CNN architectures—ResNet-50²², VGG-16²³, and DenseNet-201²⁴—are employed as feature extractors. These models utilize pre-trained weights from large-scale datasets, enabling them to recognize intricate patterns in blood cell images. In these models, we performed transfer learning by retraining the end head of the CNN, including a fully connected layer, a dropout layer, a SoftMax layer, and a classification layer. Each of the proposed models keep its main parameters in the convolution layer including the number of filters, stride length, and its weights.

For ResNet-50, it captures 2048 deep hierarchical features through its residual connections, addressing vanishing gradient problems for effective learning. Meanwhile, VGG-16 identifies 4096 spatial features using its uniform convolutional architecture, ensuring consistent extraction of visual details. In addition, DenseNet-201 enhances feature propagation by creating densely connected layers, improving the richness and relevance of extracted 1920 features.

Beyond feature extraction, these architectures are also trained as end-to-end models. This allows the entire network—from input to classification layers—to be fine-tuned specifically for malaria diagnosis. By adapting pre-trained models to the task at hand, the networks learn to optimize both low-level and high-level features relevant to identifying infected and healthy cells. This dual approach not only harnesses the strengths of transfer learning but also capitalizes on the adaptability of deep learning models, creating a robust foundation for accurate malaria diagnosis.

In this phase, the methodology combines the strengths of transfer learning and end-to-end deep learning approaches to process microscopic blood film images for malaria diagnosis. Three advanced convolutional neural network (CNN) architectures—ResNet-50, VGG-16, and DenseNet-201—serve as the backbone for feature extraction and classification. These models leverage pre-trained weights from large-scale datasets like ImageNet, enabling them to detect intricate and subtle patterns in blood cell images with remarkable precision and efficiency. The process of feature extraction from CNN models can be mathematically represented as shown in Eq. (1).

$$\:F={f}_{CNN}\left(X;{\theta\:}_{pre-trained}\right)\:\:\:\:\:\:\:\:\:\:$$

(1)

Where $\:X\:$is the input image, $\:{f}_{CNN}$ represents the feature extraction function of the CNN model, $\:{\theta\:}_{pre-trained}$ Are the pretrained weights, $\:F$ Is the resulting feature vector.

Each architecture contributes uniquely to feature extraction. ResNet-50 captures 2048 deep hierarchical features through its residual learning framework, which introduces shortcut connections to bypass layers. This residual connection can be described as shown in Eq. (2).

$$\:y=f\left(x\right)+x$$

(2)

Where $\:x$ is the input for the residual block, $\:f\left(x\right)$ is the learned transformation, $\:y$ is the output of the residual block

VGG-16, known for its simple and sequential structure, extracts 4096 spatial features using uniform convolutional layers with small $\:3\times\:3$ Filters. Equation (3) presents the output of these layers.

$$\:{z}_{i,j,k}=\sum\limits_{m=0}^{M-1}\sum\limits_{n=0}^{N-1}{x}_{i+m,j+n}.\:{w}_{m,n,k}+{b}_{k}$$

(3)

Where $\:{z}_{i,j,k}$ is the output feature map, $\:{x}_{i+m,j+n}$ is the input feature map, $\:{w}_{m,n,k}$ are the weights of the filter, $\:{b}_{k}\:$is the bias term, and $\:M$ and $\:N$ are the filter dimensions. This approach ensures precise detection of spatial characteristics such as cell morphology and texture.

DenseNet-201 enhances feature propagation and reuse through densely connected layers, where each layer is connected to all previous layers. This can be expressed as presented in Eq. (4).

$$\:{x}_{l}={H}_{l}\left(\right[{x}_{0},\:{x}_{1},\:\dots\:..,{x}_{l-1}\:\left]\right)$$

(4)

where $\:{x}_{l}$ is the output of the $\:{l}^{th}$ layer, $\:{H}_{l}$ is the learned transformation, and $\:\left[{x}_{0},\:{x}_{1},\:\dots\:..,{x}_{l-1}\:\right]\:$represents the concatenation of feature maps from preceding layers. This connectivity results in the extraction of 1920 highly relevant features, contributing to a rich feature set.

Beyond feature extraction, these CNN architectures are fine-tuned as end-to-end deep learning models. Fine-tuning optimizes the parameters. $\:{\theta\:}_{fine-tuned}$ for the specific task of malaria diagnosis by minimizing a loss function as shown in Eq. (5).

$$\:{\theta\:}_{fine-tuned}=\text{arg}\text{min}L(f\left(X;\theta\:\right),y)$$

(5)

where $\:f\left(X;\theta\:\right)$ is the predicted output of the model, $\:y$ is the ground truth label, and $\:L$ is the cross-entropy loss function. This process ensures that the models learn both low-level features (such as textures and shapes) and high-level features (such as parasitic patterns) unique to blood smear microscopy.

For more information, Table 3 summarizes the training hyperparameters used for the training of the proposed models, reflecting a carefully tuned configuration that balances learning stability, efficiency, and generalization performance through mini-batch optimization, learning rate scheduling, and periodic validation. As the selection of training hyperparameters may directly influence the proposed models²⁵. Moreover, Fig. 3 provides a flow chart of the proposed algorithm, including the training and testing phases.

Table 3 Different training hyperparameters for the proposed CNN models.

Full size table

3.
Phase (b): Feature Fusion and dimensionality reduction.

Following the independent extraction of features using ResNet-50, VGG-16, and DenseNet-201, the next step in the methodology involves feature fusion. This process combines the diverse feature sets generated by these models into a unified, comprehensive feature vector. The fusion operation is performed through concatenation, which aggregates the feature vectors from the three models according to Eq. (6).

$$\:{F}_{fused}=[{F}_{ResNet}\:,{F}_{VGG\:16}\:,\:{F}_{DenseNet}]$$

(6)

Where $\:{F}_{ResNet}\:,{F}_{VGG\:16}\:,\:{F}_{DenseNet}$ are the feature vectors extracted from ResNet-50, VGG-16, and DenseNet-201, respectively, $\:{F}_{fused}$ Represents the fused feature vector. The total number of features after concatenation is 8064.

This fused feature vector provides a richer representation by combining both global patterns (captured by ResNet-50) and local spatial details (captured by VGG-16 and DenseNet-201). While the combined features significantly enhance the model’s capability to distinguish between malaria-infected and healthy cells, their high dimensionality poses challenges in terms of computational complexity and overfitting.

To address these challenges, Principal Component Analysis (PCA) is employed for dimensionality reduction. PCA transforms the high-dimensional fused feature vector into a lower-dimensional space by finding the principal components that capture the maximum variance in the data. The transformation can be represented as shown in Eq. (7).

$$\:{F}_{PCA}=\:{F}_{fused}\:W$$

(7)

Where $\:{F}_{fused}$ is the original fused feature matrix of size $\:N\times\:8064,\:$N is the number of samples. The projection matrix W is computed by solving the eigenvalue decomposition of the covariance matrix of $\:{F}_{fused}$.

$$\:C=\frac{1}{N}{f}_{fused}^{T}{F}_{fused}$$

(8)

Equation 8 introduces the covariance matrix (C) of the fused feature matrix. It is a square matrix that captures the relationships between the features in the fused feature space. By selecting only the top $\:k$ Principal components that capture 95% of the variance, PCA ensures that the most discriminative features are preserved while reducing the feature space. This reduction minimizes computational complexity, mitigates overfitting, and accelerates subsequent classification tasks, all while maintaining high diagnostic accuracy.

4.
Phase (c): Hybrid Classification Framework.

After dimensionality reduction, the refined feature vector is processed through a hybrid classification framework that combines the strengths of traditional machine learning and deep learning models. This framework consists of two components: Support Vector Machine (SVM) and Long Short-Term Memory Networks (LSTM). Each component plays a vital role in ensuring robust and accurate classification, complementing each other’s capabilities.

a.
Support Vector Machine (SVM).

SVM is a powerful supervised learning algorithm that excels in identifying optimal decision boundaries between classes, even in high-dimensional spaces²⁶. In this framework, the reduced feature vector is provided as input to the SVM classifier, which works by finding a hyperplane that maximally separates the two classes: malaria-infected and healthy samples. The SVM classifier aims to maximize the margin, which is the distance between the hyperplane and the nearest data points on either side, known as support vectors. This approach ensures that the classifier generalizes well to unseen data, making it particularly effective for binary classification tasks like malaria diagnosis. The robustness and reliability of SVM make it an essential component of this hybrid framework. The core objective of SVM is to maximize the margin between the separating hyperplane and the closest data points from each class, known as the support vectors. The optimal hyperplane can be mathematically expressed in Eqs. (9–11)

$$\:f\left(x\right)={w}^{T}x+b=0$$

(9)

Where $\:x$ is the input feature vector, $\:w$ is the weight vector orthogonal to the hyperplane, $\:b$ is the bias term.

To find the optimal hyperplane, SVM solves the following convex optimization problem:

$$\:\underset{a,{\:b}}{\text{min}}\frac{1}{2}{\parallel w\parallel}^{2}\:\:\:\:$$

(10)

$$\:subject\:to:\:{y}_{i}\left({w}^{T}{x}_{i}+b\right)\ge\:1,\:\:\:{\forall\:}_{i}$$

(11)

Where $\:{y}_{i}\in\:\left\{1,-1\right\}$ are the class labels, $\:{x}_{i}$ are the training instances.

By maximizing the margin $\:\frac{2}{\parallel w\parallel}$, SVM minimizes overfitting and improves generalization performance on unseen test data. For non-linearly separable data, SVM incorporates kernel functions $\:K({x}_{i},\:{x}_{j})$to map data into a higher-dimensional space, where linear separation becomes feasible. In the context of this study, the SVM classifier processes the transformed PCA feature set to learn the best separating hyperplane between infected and uninfected cells. Its mathematical rigor, geometric interpretation, and generalization capability make SVM an indispensable component of the proposed hybrid classification system.

b.
Long Short-Term Memory Networks (LSTM).

LSTM, a type of recurrent neural network (RNN), is used in this framework to capture patterns and contextual relationships in the data. Although the feature vector itself is not sequential, LSTM networks excel at modelling dependencies among features, allowing the system to recognize intricate patterns indicative of malaria-infected cells^12,19. LSTMs are designed with specialized mechanisms, such as gates that regulate the flow of information, ensuring the model retains only the most relevant patterns and discards unnecessary details. This makes LSTMs particularly adept at learning complex and nuanced representations from data, adding an extra layer of interpretability and robustness to the classification process. In the context of malaria diagnosis, LSTM adds an interpretive layer capable of capturing complex feature interactions that may not be linearly separable²⁷.

LSTM achieves this through a gated memory cell architecture, which consists of the forget gate, input gate, cell state update, and output gate. These gates control the flow of information, selectively remembering or forgetting parts of the input and prior hidden states, thereby enabling the network to focus on the most informative patterns while discarding irrelevant ones. The internal operations of an LSTM cell at time step $\:t$ are described by the following Eqs. (12–17):

Forgot gate.

$$\:{f}_{t}=\sigma\:\left({W}_{f}\cdot\:\left[{h}_{t-1},{x}_{t}\right]+{b}_{f}\right)$$

(12)

Input gate.
$$\:{i}_{t}=\sigma\:\left({W}_{i}\cdot\:\left[{h}_{t-1},{x}_{t}\right]+{b}_{i}\right)\:\:\:\:$$
(13)

$$\:{\stackrel{\sim}{C}}_{t}=tanh\left({W}_{c}\cdot\:\left[{h}_{t-1},{x}_{t}\right]+{b}_{c}\right)\:\:$$

(14)

Cell state update.
$$\:{C}_{t}={f}_{t}\cdot\:{C}_{t-1}+\:{i}_{t}\cdot\:\:{\stackrel{\sim}{C}}_{t-1}\:\:\:$$
(15)

Output gate.
$$\:{o}_{t}=\sigma\:\left({W}_{o}\cdot\:\left[{h}_{t-1},{x}_{t}\right]+{b}_{o}\right)\:\:$$
(16)

$$\:{h}_{t}=\:{o}_{t}\cdot\:\text{tanh}\left({C}_{t}\right)\:\:$$

(17)

Where $\:{x}_{t}\:$is the input at time step $\:t$, $\:{h}_{t-1}$ is the previous hidden state, $\:{C}_{t}$ is the current cell state, $\:\sigma\:$ is the sigmoid activation function, $\:W\:and\:b$ are the trainable weights and biases, $\:tanh$ is the hyperbolic tangent activation. These mechanisms enable LSTM to retain long-term dependencies and filter out noise, which is critical when learning from complex biological patterns such as variations in red blood cell morphology in malaria.

The designed RNN architecture incorporates an LSTM-based deep learning structure tailored for classifying malaria-infected versus uninfected samples using the reduced feature set obtained from PCA. The network begins with a sequence input layer configured for an input dimension of 3135 features, representing the principal components derived from the original high-dimensional fused feature vector. This is followed by an LSTM layer with 128 hidden units, allowing the network to capture temporal or contextual dependencies across the input features, even though they are not time-series data in the classical sense.

The LSTM layer is succeeded by a fully connected layer that maps the learned feature representations to the desired number of output classes (in this case, two). A SoftMax layer follows, which converts the raw class scores into normalized probabilities, enabling probabilistic interpretation of the classification outputs. Finally, the classification layer computes the loss during training and evaluates the model’s prediction performance. This architecture effectively leverages the LSTM’s capacity to model complex inter-feature relationships, improving the system’s ability to differentiate between malaria-infected and uninfected blood samples. Figure 4 depicts the detailed structure of the proposed network.

5.
Decision Aggregation.

To further improve the reliability and accuracy of the classification process, the methodology incorporates a majority voting mechanism. This mechanism is used to aggregate the predictions from multiple classifiers, including the three end-to-end deep learning models (ResNet-50, VGG-16, and DenseNet-201), as well as the SVM and LSTM classifiers. By combining the outputs of these diverse models, majority voting ensures a consensus-based decision, reducing the likelihood of errors arising from the limitations of any single model²⁸.

In the majority voting process, each classifier independently predicts whether the input blood film image corresponds to a malaria-infected or non-infected sample. These predictions are treated as votes, with each model contributing one vote to the final decision. The class label with the majority of votes is selected as the final classification outcome. For instance, if the five models produce predictions as follows:

ResNet-50: Malaria-infected.
VGG-16: Malaria-infected.
DenseNet-201: Non-infected.
SVM: Malaria-infected.
LSTM: Non-infected.

Hence, the majority class (Malaria-infected in this case) is chosen as the final decision.

This approach is particularly effective in leveraging the complementary strengths of the models involved. The deep learning models provide robust feature extraction and pattern recognition capabilities, while the SVM contributes precise decision boundary optimization, and the LSTM captures complex relationships within the data. By aggregating these diverse perspectives, majority voting minimizes the influence of outlier predictions and improves overall classification robustness.

Moreover, majority voting is inherently adaptable and scalable. Additional classifiers can be integrated into the voting process without significant alterations to the system, further enhancing its versatility. This mechanism also reduces the impact of noisy data or model-specific biases, as the final decision reflects a consensus rather than relying on a single classifier’s output. The outcome of the majority voting process provides the final classification, reliably determining whether the input blood film image is malaria-infected or non-infected. This consensus-based strategy ensures that the diagnostic system delivers high levels of accuracy and confidence, making it suitable for real-world clinical applications where reliability is critical.

Results

The main objective of the study was to implement an automated framework to accurately classify the blood smear image for the diagnosis of Malaria. The performance of the proposed malaria detection methodology was evaluated across different models, including ResNet-50, VGG-16, DenseNet-201, Support Vector Machine (SVM), Long Short-Term Memory (LSTM) networks, and Majority Voting. The evaluation was conducted using multiple metrics, including Accuracy, Sensitivity (SEN), Specificity (SPE), Precision (PRE), Error Rate, False Positive Rate (FPR), False Negative Rate (FNR), Negative Predictive Value (NPV), F1-Score, and Matthews Correlation Coefficient (MCC) using the following equations¹² in terms of TP: True Positive, TN: True Negative, FP: False Positive, and FN: False Negative using the obtained confusion matrices in Fig. 5.

$$\:\text{A}\text{c}\text{c}\text{u}\text{r}\text{a}\text{c}\text{y}\:=\:(\text{T}\text{P}\hspace{0.17em}+\hspace{0.17em}\text{T}\text{N})\:/\:(\text{T}\text{P}\hspace{0.17em}+\hspace{0.17em}\text{F}\text{N}\hspace{0.17em}+\hspace{0.17em}\text{F}\text{P}\hspace{0.17em}+\hspace{0.17em}\text{T}\text{N})$$

(18)

$$\:\text{S}\text{e}\text{n}\text{s}\text{i}\text{t}\text{i}\text{v}\text{i}\text{t}\text{y}\hspace{0.17em}=\hspace{0.17em}\text{T}\text{P}\:/\:(\text{T}\text{P}\hspace{0.17em}+\hspace{0.17em}\text{F}\text{N})$$

(19)

$$\:\text{E}\text{r}\text{r}\text{o}\text{r}\:\text{R}\text{a}\text{t}\text{e}\:=\:(\text{F}\text{P}\hspace{0.17em}+\hspace{0.17em}\text{F}\text{N})\:/\:(\text{T}\text{P}\hspace{0.17em}+\hspace{0.17em}\text{F}\text{N}\hspace{0.17em}+\hspace{0.17em}\text{F}\text{P}\hspace{0.17em}+\hspace{0.17em}\text{T}\text{N})$$

(20)

$$\:\text{Sp}\text{e}\text{c}\text{i}\text{f}\text{i}\text{c}\text{i}\text{t}\text{y}\hspace{0.17em}=\hspace{0.17em}\text{T}\text{N}\:/\:(\text{T}\text{N}\hspace{0.17em}+\hspace{0.17em}\text{F}\text{P})$$

(21)

$$\:\text{P}\text{r}\text{e}\text{c}\text{i}\text{s}\text{i}\text{o}\text{n}\hspace{0.17em}=\hspace{0.17em}\text{T}\text{P}\:/\:(\text{T}\text{P}\hspace{0.17em}+\hspace{0.17em}\text{F}\text{P})$$

(22)

$$\:\text{F}\text{a}\text{l}\text{s}\text{e}\:\text{P}\text{o}\text{s}\text{i}\text{t}\text{i}\text{v}\text{e}\:\text{R}\text{a}\text{t}\text{e}\hspace{0.17em}=\hspace{0.17em}\text{F}\text{P}\:/\:(\text{F}\text{P}\hspace{0.17em}+\hspace{0.17em}\text{T}\text{N})$$

(23)

$$\:\text{F}\text{a}\text{l}\text{s}\text{e}\:\text{N}\text{e}\text{g}\text{a}\text{t}\text{i}\text{v}\text{e}\:\text{R}\text{a}\text{t}\text{e}\hspace{0.17em}=\hspace{0.17em}\text{F}\text{N}\:/\:(\text{F}\text{N}\hspace{0.17em}+\hspace{0.17em}\text{T}\text{P})$$

(24)

$$\:\text{N}\text{e}\text{g}\text{a}\text{t}\text{i}\text{v}\text{e}\:\text{P}\text{r}\text{e}\text{d}\text{i}\text{c}\text{t}\text{i}\text{v}\text{e}\:\text{V}\text{a}\text{l}\text{u}\text{e}\hspace{0.17em}=\hspace{0.17em}\text{T}\text{N}\:/\:(\text{T}\text{N}\hspace{0.17em}+\hspace{0.17em}\text{F}\text{N})$$

(25)

$$\:\text{F}1-\text{S}\text{c}\text{o}\text{r}\text{e}\:=\text{F}1-\text{S}\text{c}\text{o}\text{r}\text{e}\:=\:(2\:\times\:\:(\text{S}\text{e}\text{n}\text{s}\text{i}\text{t}\text{i}\text{v}\text{i}\text{t}\text{y}\:\times\:\:\text{P}\text{r}\text{e}\text{c}\text{i}\text{s}\text{i}\text{o}\text{n}\left)\right)/\left(\text{S}\text{e}\text{n}\text{s}\text{i}\text{t}\text{i}\text{v}\text{i}\text{t}\text{y}\hspace{0.17em}+\hspace{0.17em}\text{P}\text{r}\text{e}\text{c}\text{i}\text{s}\text{i}\text{o}\text{n}\right)$$

(26)

$$\:\text{M}\text{C}\text{C}\:=\:\:(\text{T}\text{P}\times\:\text{T}\text{N}\:-\:\text{F}\text{P}\times\:\text{F}\text{N})\:/\sqrt{(\text{T}\text{P}+\text{F}\text{P})(\text{T}\text{P}+\text{F}\text{N})(\text{T}\text{N}+\text{F}\text{P})(\text{T}\text{N}+\text{F}\text{N})}\:$$

(27)

In the context of the current study on malaria classification using microscopic blood smear images, the cases in the confusion matrix are interpreted as follows, based on the two main classes: ‘Parasitized’ (malaria-infected) and ‘Uninfected’ (non-malaria), as mentioned in Table 4. Receiver Operating Characteristic (ROC) curves for the proposed models are shown in Fig. 6.

Table 4 Description of the targeted classes of malaria detection.

Full size table

a.
Performance of transfer learning models.

The transfer learning models, ResNet-50, VGG-16, and DenseNet-201, demonstrated robust performance in malaria detection, leveraging their pre-trained architectures for feature extraction and classification. ResNet-50 achieved an accuracy of 95.77%, with a sensitivity of 95.31% and specificity of 96.23%, effectively capturing hierarchical patterns in the data. VGG-16 obtained an accuracy of 96.32%, showing a well-balanced performance with sensitivity (95.79%) and specificity (96.86%). DenseNet-201 achieved an accuracy of 96.25% and the highest sensitivity (96.32%), indicating its exceptional ability to detect malaria-infected samples. All three models delivered competitive results in terms of F1-Score (ranging from 95.75 to 96.30%) and Matthews Correlation Coefficient (MCC) (91.96–93.15%), showcasing their reliability and effectiveness in automated malaria diagnosis. For monitoring the performance of the proposed models, training progress curves including training accuracy, validation accuracy, training loss, and validation loss has been recorded in Figs. 7, 8 and 9 for ResNet-50, VGG-16, and DenseNet-201, respectively.

b.
PCA Results for Dimensionality Reduction.

PCA was applied to the high-dimensional fused feature set of 8064 features to reduce its dimensionality while preserving the most significant information. As depicted in Fig. 10, the cumulative explained variance is plotted against the number of principal components. The curve shows a steep initial rise, indicating that a substantial proportion of the variance in the dataset is captured by the first few principal components.

From the graph, it can be observed that the explained variance plateaus after a certain number of components, with diminishing returns from including additional components. Based on the analysis, 3135 principal components were selected, ensuring that the majority of the variance in the original data is retained. This reduction represents a significant compression of the feature space, approximately 61.2% of the original size, without a substantial loss of critical information.

c.
Performance of the SVM model.

SVM achieved an accuracy of 96.40%. It showed strong specificity (96.71%) and sensitivity (96.08%), which indicated its capability to distinguish between infected and uninfected samples effectively. The F1-Score (96.38%) and MCC (93.09%) also indicated excellent balance and robustness.

d.
Performance of the LSTM model.

LSTM achieved an accuracy of 96.11% and, sensitivity of 95.69%, which indicated that LSTM might miss some infected samples. However, it still showed a strong specificity of 96.52%, making it effective at identifying uninfected samples. The precision (96.49%) and error rate (3.89%) were similar to those of ResNet-50, indicating that LSTM performed well, but with a slight trade-off in sensitivity.

e.
Overall majority voting performance.

The Majority Voting method, which integrates the predictions from ResNet-50, VGG-16, DenseNet-201, SVM, and LSTM, achieved the highest accuracy of 96.47%, outperforming all individual models. This approach also achieved the highest specificity (96.90%) and precision (96.88%), indicating that it was the most reliable method for identifying both malaria-infected and uninfected samples. The sensitivity (96.03%) and F1-Score (96.45%) were also high, contributing to the overall strength of the Majority Voting model. The detailed results for the proposed models are presented in the Table 5.

To evaluate the individual and collective impact of the core components in the proposed framework, an ablation study was conducted using ResNet-50, VGG-16, DenseNet-201, SVM, LSTM, and the final majority voting ensemble, as shown in Table 6. The results demonstrate the standalone effectiveness of each deep learning model, with VGG-16 achieving the highest individual accuracy (96.32%) among the CNNs. When the fused and PCA-reduced features were classified independently using SVM and LSTM, the performance either matched or exceeded that of the CNNs, indicating that the hybrid classifiers effectively capture discriminative patterns not fully exploited by the end-to-end deep networks. The final ensemble, integrating outputs from all models through a majority voting mechanism, yielded the highest performance across all metrics, with an accuracy of 96.47%, an F1-score of 96.45%, and a Matthews Correlation Coefficient of 93.35%. These results validate that each module—CNN feature extraction, SVM-LSTM hybrid classification, and majority voting—contributes meaningfully to the overall system, with the ensemble leveraging their complementary strengths to achieve optimal diagnostic accuracy.

Table 5 Detailed results for the proposed models for malaria diagnosis.

Full size table

Table 6 Ablation study showing the individual and combined performance of CNN-based classifiers.

Full size table

Discussion

This study presents an experimental setting to detect malaria by examining a set of thin smear blood images using a combination of AI-based techniques. As an end-to-end DL model, the transfer learning methodology exemplified by ResNet-50, VGG 16, and DenseNet-201 was employed. We investigated both the SVM and LSTM models as ML algorithms. Moreover, we proposed majority voting as an aggregate decision for malaria detection. Each technique was evaluated through the given evaluation metrics as described in Results. In this study, we employed three widely recognized convolutional neural network architectures—ResNet-50, VGG-16, and DenseNet-201—as feature extractors and end-to-end classifiers. These models remain highly relevant and effective in the context of medical image analysis due to their proven robustness, interpretability, and transfer learning capabilities. VGG-16 offers a straightforward and uniform architecture that simplifies model interpretation and tuning. ResNet-50 introduces residual connections, which effectively address vanishing gradient problems and enable the training of deeper, more expressive networks. DenseNet-201 leverages dense connectivity to enhance feature reuse and gradient flow, which is particularly beneficial when working with fine-grained features in microscopic blood smear images. These architectures have been extensively validated in numerous biomedical imaging studies, providing a strong foundation for benchmarking and performance comparison. While newer models such as ConvNeXt²⁹, EfficientNet-V2, Vision Transformers (ViT)^30,31, and MobileNet V3 offer promising results on large-scale datasets, they often require more complex tuning, higher computational resources, and substantial training data—conditions that are not always ideal in medical imaging contexts. Therefore, the selected architectures strike a practical balance between accuracy, efficiency, and generalizability, making them well-suited for the task of automated malaria diagnosis^32,33.

Notably, Table 5 shows that the voting majority technique outperformed the other techniques in terms of accuracy, specificity, precision, error rate, false positive rate, F1-Score, and Mattews correlation coefficient. Obviously, the DenseNet-201 has recorded the best results in sensitivity, false negative rate, and negative predictive value. Using PCA to optimize the used features has proven its ability to enhance the results, demonstrating significance in detection with less computing time. In the realm of AI, the majority voting techniques permit a collaboration between all involved algorithms to vote according to the dominant decision. This entails taking advantage of a transition from traditional AI algorithms to leveraging the use of groups in decision-making. This fact is proven through our results as illustrated in Table 5. The DenseNet-201 performed efficiently compared to the other algorithms because of its ability to reuse features map through the dense layers leading to minimizing overfitting and vanishing gradients. Furthermore, as noted in training and losses performance curves, DenseNet-201 had the best one.

Notably, a comparison with the studies in Table 1 reveals that none have proposed the use of majority voting for malaria detection. While many existing studies are hampered by limited datasets, this research builds upon the foundation laid by earlier works, particularly those referenced as^2,13, which demonstrate the potential of larger data pools. Furthermore, our study utilized the same dataset as¹³. However, we expanded the analysis by incorporating additional algorithms alongside the majority voting technique, achieving approximately the same level of accuracy.

The study’s limitations include its failure to account for morphological changes in the examined cells. Additionally, different datasets should be used to validate the majority voting method. To show how well the suggested methodology performs over them, additional thick smear blood images should be evaluated.

Conclusions

The study presented an AI-based multi-model framework for malaria detection. In this context, the majority voting represents a significant advancement, setting this study apart from previous research that has not ventured into this innovative way. This approach signifies a meaningful advancement in the field of medical image analysis, as it enhances the robustness and reliability of diagnostic outcomes. Therefore, this ensemble strategy reduces the likelihood of misclassification and strengthens the overall decision-making process.

A key contribution of this study lies in the utilization of a large and diverse dataset, which plays a critical role in improving the generalization capabilities of the model. In other words, a large dataset pool not only improves the results but also paves the way for further methods to detect malaria. Moreover, using thin smear images provides reliable results, demonstrating that malaria detection is unaffected by the microscopic images utilized. This finding confirms that the effectiveness of the detection process remains constant, regardless of variations in microscopic images’ characteristics. Ultimately, this study highlights the importance of evolving methodologies and leveraging existing data more effectively to combat one of the world’s health challenges. The proposed framework not only enhances the current state of malaria detection but also opens new avenues for the application of similar techniques to other infectious diseases. Future work could be summarized as using alternate datasets to validate the proposed methodology and using other algorithms with fine-tuning mechanisms instead of transfer learning. Besides, this methodology can be generalized to other diseases such as leukemia, thereby contributing to a more scalable and effective approach to disease diagnosis through AI.

Data availability

The datasets analyzed during the current study are available in “Malaria Cell Images Dataset”, 2018. https://www.kaggle.com/datasets/iarunava/cell-images-for-detecting-malaria/data.

References

Sunarko, B., Bottema, M., Iksan, N., Hudaya, K. A. & Hanif, M. S. Red blood cell classification on thin blood smear images for malaria diagnosis. In Journal of Physics: Conference Series. 1444(1), 012036. (2020).
Khan, R. U. et al. An intelligent neural network model to detect red blood cells for various blood structure classification in microscopic medical images. Heliyon 10(4), e26149. (2024).
Uzun Ozsahin, D., Duwa, B. B., Ozsahin, I. & Uzun, B. Quantitative forecasting of malaria parasite using machine learning models: MLR, ANN, ANFIS and random forest. Diagnostics 14 (4), 385 (2024).
Article PubMed PubMed Central Google Scholar
Sukumarran, D. et al. Machine and deep learning methods in identifying malaria through microscopic blood smear: A systematic review. Engineering Appl. Artif. Intelligence 133108529. (2024).
Hemachandran, K. et al. Performance analysis of deep learning algorithms in diagnosis of malaria disease. Diagnostics 13 (3), 534–550 (2023).
Article PubMed PubMed Central Google Scholar
Hoyos, K. & Hoyos, W. Supporting malaria diagnosis using deep learning and data augmentation. Diagnostics, 2024, 14(7), 690–709. (2024).
World Health Organization. World Malaria Report. (2024). Available at https://www.who.int/teams/global-malaria-programme/reports/world-malaria-report-2024
Brenas, J. H., Al-Manir, M. S., Baker, C. J. & Shaban-Nejad, A. A malaria analytics framework to support evolution and interoperability of global health surveillance systems. IEEE Access. 5, 21605–21619 (2017).
Article Google Scholar
Marrelli, M. T. & Brotto, M. The effect of malaria and anti-malarial drugs on skeletal and cardiac muscles. Malar. J. 15, 1–6 (2016).
Article Google Scholar
Muhammad, F. A., Sudirman, R., Zakaria, N. A. & Mahmood, N. H. Classification of Red Blood Cell Abnormality in Thin Blood Smear Images using Convolutional Neural Networks. In Journal of Physics: Conference Series. 2622(1), 012011. (2023).
Muhammad, F. A., Sudirman, R., Zakaria, N. A. & Daud, S. N. S. S. Morphology classification of malaria-infected red blood cells using deep learning techniques. Biomed. Signal Process. Control. 99, 106869 (2025).
Article CAS Google Scholar
Salaheldin, A. M., Wahed, M. A., Talaat, M. & Saleh, N. An evaluation of AI-based methods for papilledema detection in retinal fundus images. Biomedical Signal Processing and Control. 92,106120. (2024).
Hemachandran, K. et al. Performance analysis of deep learning algorithms in diagnosis of malaria disease. Diagnostics 13 (3), 534 (2023).
Article PubMed PubMed Central Google Scholar
Kassim, Y. M. et al. Clustering-based dual deep learning architecture for detecting red blood cells in malaria diagnostic smears. IEEE J. Biomedical Health Inf. 25 (5), 1735–1746 (2020).
Article Google Scholar
Ozsahin, D. U., Mustapha, M. T., Duwa, B. B. & Ozsahin, I. Evaluating the performance of deep learning frameworks for malaria parasite detection using microscopic images of peripheral blood smears. Diagnostics 12 (11), 2702. https://doi.org/10.3390/diagnostics12112702 (2022).
Article PubMed PubMed Central Google Scholar
Ozdemir, B. & Pacal, I. A robust deep learning framework for multiclass skin cancer classification. Sci. Rep. 2025 (15(1)), 1–19. https://doi.org/10.1038/s41598-025-89230-7 (2024).
Article CAS Google Scholar
Chen, Y. et al. VGG16-based intelligent image analysis in the pathological diagnosis of IgA nephropathy. J. Radiation Res. Appl. Sci. 16 (3), 100626 (2023).
CAS Google Scholar
Pang, B. et al. Fire-image-DenseNet (FIDN) for predicting wildfire burnt area using remote sensing data. Comput. Geosci. 195, 105783 (2025).
Article Google Scholar
Ji, L., Tian, X., Wei, Z. & Zhu, D. Intelligent fault diagnosis in power distribution networks using LSTM-DenseNet network. Electr. Power Syst. Res. 239, 111202 (2025).
Article Google Scholar
Salim, F., Saeed, F., Basurra, S., Qasem, S. N. & Al-Hadhrami, T. DenseNet-201 and Xception pre-trained deep learning models for fruit recognition. Electronics 12 (14), 3132 (2023).
Article Google Scholar
Malaria Cell Images Dataset, https://www.kaggle.com/datasets/iarunava/cell-images-for-detecting-malaria/data, 2018.
Salaheldin, A. M., Abdel Wahed, M., Talaat, M. & Saleh, N. Deep learning-based automated detection and grading of papilledema from OCT images: A promising approach for improved clinical diagnosis and management. Int. J. Imaging Syst. Technol. 34(4), e23133. (2024).
Akkasaligar, P. T., Pattar, S., Gupta, S., Barker, D. & Gunayyanavarmath, B. Classification of blood smear images using CNN and pretrained VGG16: computer aided diagnosis of malaria disease. In 2024 First International Conference on Technological Innovations and Advance Computing (TIACOMP). 2024, 349–354. (2024), June.
Sukumarran, D. et al. Automated identification of malaria-infected cells and classification of human malaria parasites using a two-stage deep learning technique. IEEE Access. 12, 135746–135763 (2024).
Article Google Scholar
Srinivasu, P. N. et al. Using recurrent neural networks for predicting type-2 diabetes from genomic and tabular data. Diagnostics 12 (12), 3067 (2022).
Article CAS PubMed PubMed Central Google Scholar
Jdey, I., Hcini, G. & Ltifi, H. Deep learning and machine learning for malaria detection: overview, challenges and future directions. Int. J. Inform. Technol. Decis. Mak. 23 (05), 1745–1776 (2024).
Article Google Scholar
Srinivasu, P. N. et al. F. XAI-driven catboost multi-layer perceptron neural network for analyzing breast cancer. Sci. Rep. 14 (1), 1–19. https://doi.org/10.1038/s41598-024-79620-8 (2024).
Article CAS Google Scholar
Gu, H. et al. Majority voting of Doctors improves appropriateness of AI reliance in pathology. Int. J. Hum. Comput. Stud. 190, 103315 (2024).
Article Google Scholar
Ince, S., Kunduracioglu, I., Algarni, A., Bayram, B. & Pacal, I. Deep learning for cerebral vascular occlusion segmentation: A novel ConvNeXtV2 and GRN-integrated U-Net framework for diffusion-weighted imaging. Neuroscience 574, 42–53. https://doi.org/10.1016/j.neuroscience.2025.04.010 (2025).
Article CAS PubMed Google Scholar
Ozdemir, B., Aslan, E. & Pacal, I. Attention enhanced inceptionnext based hybrid deep learning model for lung cancer detection. IEEE Access. 13, 27050–27069. https://doi.org/10.1109/ACCESS.2025.3539122 (2025).
Article Google Scholar
Pacal, I., Ozdemir, B., Zeynalov, J., Gasimov, H. & Pacal, N. A novel CNN-ViT-based deep learning model for early skin cancer diagnosis. Biomed. Signal Process. Control. 104, 107627. https://doi.org/10.1016/j.bspc.2025.107627 (2025).
Article CAS Google Scholar
Lubbad, M. et al. Machine learning applications in detection and diagnosis of urology cancers: a systematic literature review. Neural Comput. Appl. 36 (12), 6355–6379. https://doi.org/10.1007/s00521-023-09375-2 (2024).
Article Google Scholar
İnce, S., Kunduracioglu, I., Bayram, B. & Pacal, I. U-Net-Based models for precise brain stroke segmentation. Chaos Theory Appl. 7 (1), 50–60. https://doi.org/10.51537/chaos.1605529 (2025).
Article Google Scholar

Download references

Acknowledgements

This work was funded by the Deanship of Graduate Studies and Scientific Research at Jouf University under grant No. DGSSR-2024-02-01081. The authors sincerely thank Future University in Egypt for its generous support, which has significantly contributed to the advancement of this research project.

Author information

Authors and Affiliations

Department of Computer Science, College of Computer and Information Sciences, Jouf University, Sakaka, Saudi Arabia
Osama R. Shahin & Raed N. Alabdali
Department of Information Systems, College of Computer and Information Sciences, Jouf University, Sakaka, Saudi Arabia
Hamoud H. Alshammari
Biomedical Engineering and Systems Department, Faculty of Engineering, Cairo University, Giza, Egypt
Ahmed M. Salaheldin
Biomedical Engineering Department, Faculty of Engineering and Technology, Future University in Egypt, New Cairo, Egypt
Neven Saleh

Authors

Osama R. Shahin
View author publications
Search author on:PubMed Google Scholar
Hamoud H. Alshammari
View author publications
Search author on:PubMed Google Scholar
Raed N. Alabdali
View author publications
Search author on:PubMed Google Scholar
Ahmed M. Salaheldin
View author publications
Search author on:PubMed Google Scholar
Neven Saleh
View author publications
Search author on:PubMed Google Scholar

Contributions

All authors participated in the conceptualization of the study. O.R. Shahine, H. Alshammari, R.N. Alabdali, and A. M. Salaheldin designed the framework of the methodology. Ahmed M. Salaheldin conducted the methodology. N. Saleh conducted the literature review and analyzed the results of the study. N. Saleh and A. M. Salaheldin have written the initial draft of the study. O.R. Shahin and N. Saleh have reviewed the final version of the manuscript.

Corresponding authors

Correspondence to Osama R. Shahin or Neven Saleh.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Shahin, O.R., Alshammari, H.H., Alabdali, R.N. et al. Automated multi-model framework for malaria detection using deep learning and feature fusion. Sci Rep 15, 25672 (2025). https://doi.org/10.1038/s41598-025-04784-w

Download citation

Received: 02 February 2025
Accepted: 28 May 2025
Published: 16 July 2025
DOI: https://doi.org/10.1038/s41598-025-04784-w

Automated multi-model framework for malaria detection using deep learning and feature fusion

Subjects

Abstract

Similar content being viewed by others

Efficient deep learning-based approach for malaria detection using red blood cell smears

Deep learning-based malaria parasite detection: convolutional neural networks model for accurate species identification of Plasmodium falciparum and Plasmodium vivax

Multiclass malaria parasite recognition based on transformer models and a generative adversarial network

Introduction

Related works

Background

ResNet 50

The VGG16

DenseNet − 201

Materials and methods

Results

Discussion

Conclusions

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Keywords

Search

Quick links

Subjects

Abstract

Similar content being viewed by others

Efficient deep learning-based approach for malaria detection using red blood cell smears

Deep learning-based malaria parasite detection: convolutional neural networks model for accurate species identification of Plasmodium falciparum and Plasmodium vivax

Multiclass malaria parasite recognition based on transformer models and a generative adversarial network

Introduction

Related works

Background

ResNet 50

The VGG16

DenseNet − 201

Materials and methods

Results

Discussion

Conclusions

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Quick links