Introduction

Bone marrow, a soft, versatile tissue found in the cavities of bones, is the site of hematopoiesis, producing millions of blood cells every day1. Hematopoiesis promotes the formation of blood, one of the essential components of the human body, which is composed of 80 percent water and 20 percent solids2. Blood has four components: red blood cells (RBCs), white blood cells (WBCs), platelets, and plasma3. White blood cells make up roughly 1% of blood, i.e., one WBC is present for every 100 red blood cells. WBCs comprise neutrophils, lymphocytes, eosinophils, basophils, and monocytes, which on average account for 60%, 30%, 5%, 4%, and under 1% of the total WBC count, respectively4. Blood cell cancer refers to the condition in which the bone marrow contains leukemia cells, which are abnormal WBCs5.

The current prognosis for leukemia is not encouraging, and the disease continues to pose a significant risk to human health. Leukemia was estimated to be the 15th most common cause of cancer incidence and the 11th most common cause of cancer-related mortality worldwide in 2020, responsible for 474,519 incident cases and 311,594 deaths. In addition, leukemia is the most common cancer in children younger than five and is responsible for the highest percentage of deaths, imposing substantial costs on individuals, families, and countries6.

Acute lymphoblastic leukemia (ALL), acute myeloid leukemia (AML), chronic lymphocytic leukemia (CLL), and chronic myeloid leukemia (CML) are the most common types of leukemia identified7. Acute leukemia causes rapid deterioration of the patient, while chronic leukemia is characterized by gradual progression; both may be lymphocytic or myelogenous. Two schemes are widely used to classify leukemia: the French-American-British (FAB) classification and the World Health Organization (WHO) proposal.

Early identification of this disease is critical for successful treatment. Pathological testing, full blood count, aspiration biopsy, and bone marrow aspiration, involving the creation of microscopic blood smear images from the potential patient, are the methods used to diagnose leukemia8. Laboratory testing for leukemia is time-consuming and inconvenient, requiring extra time and effort9. Manual analysis for leukemia diagnosis can result in diagnostic variability and inaccuracies in blast cell counting, leading to discrepancies in diagnostic outcomes10. The significant challenges with manual leukemia diagnosis are non-standardized, conflicting, and subjective findings due to the possibility of human error or differing expert opinions11.

Manual study of blood cells based on morphological features is less accurate than automated techniques12. When an extensive dataset is available, a machine learning (ML) algorithm can help differentiate blood cells with leukemia from healthy cells.

Various studies have shown that machine learning (ML) techniques are helpful in distinguishing blast cells from healthy cells and are gaining popularity, as they are faster and more accurate than traditional diagnostic methods13. Leukemia detection can be formulated as an image classification task, since cytomorphological analysis focuses on evaluating microscopic cell images14. In the fields of natural image classification and visual question answering, deep convolutional neural networks (CNNs) have proven very effective15,16. CNNs have recently been successfully applied to various medical imaging tasks, including the identification of skin cancer17,18,19, the assessment of retinal disorders20, and the analysis of histological sections21,22, e.g. by mitosis detection23, the detection and analysis of regions of interest24, or the segmentation of tissue types25. This motivates us to apply CNNs to the cytomorphological characterization of blood cells, specifically those significant in AML. Past work on leukocyte classification has predominantly centered on feature extraction from cytological images26,27.

More focus has been given to lymphoblastic leukemia, where the cytomorphology is less diverse than in the myeloid case28,29. In medical image analysis, supplying sufficient numbers of labelled images for deep learning models has proven challenging due to restrictions on availability and the cost of expert time to provide ground-truth annotations30,31. Therefore, numerous studies have focused on datasets restricted in the number of patients included or on the classification of individual cytological images32,33. So far, applications of CNNs to white blood cell classification have concentrated on differentiating subtypes such as erythroid and myeloid precursors32.

Matek et al.34 used the ResNeXt model to classify leukemia cells, increasing the dataset size with augmentation techniques. The augmented dataset contains 15,000 images, which took approximately 96 h to train and test the model; in addition, the method's sensitivity toward cells with fewer images in the dataset is low. Boldu et al.35 proposed ALNet by combining two modules derived from VGG16 and VGG19; one module performs four-class classification and the other two-class classification. They reported a classification accuracy of 92% on cells and 100% on smears. Eckardt et al.36 used a multi-step deep CNN model with transfer learning to segment and classify bone marrow cells, distinguishing AML from healthy controls with an accuracy of 87%. Khandekar et al.37 applied the You Only Look Once (YOLOv4) deep CNN model to classify blood smears: images are first preprocessed to resize them and maintain orientation, the object of interest is then detected using a segmentation technique, and finally features are extracted using a deep CNN model. Their method reported an F1-score of 92% and a recall of 96%. The rest of the recent methods are summarized in Table 1.

Table 1 Summary of the recent work using machine learning and deep learning.

In short, all these methods have a high potential for classifying blood smears. However, cell classes with fewer images in the dataset need to be explored for better classification. The morphological characteristics of leukemia cells are very similar, which makes them difficult to differentiate; in addition, a key challenge is that some classes have fewer than 100 images in the dataset, which demands a highly sensitive model. Therefore, in the proposed approach, we developed a multi-scale feature fusion-based model, 3SNet, for leukemia cell classification.

The paper's significant contributions are as follows.

1. We introduced 3SNet, a novel multi-scale feature fusion-based deep learning model with depth-wise convolution blocks that efficiently differentiates leukemia cells using fewer computational resources.

2. The scarcity of images for some leukemia cell classes and their morphologically similar characteristics make the problem challenging. Hence, the leukemia cell image and its corresponding LBP and HOG images are used at three scales to extract spatial features, and a fusion technique generates an enhanced feature pool, making the system more sensitive toward leukemia cells with fewer images in the dataset.

3. We experimentally demonstrated that the proposed model outperforms state-of-the-art methods on the AML-Cytomorphology_LMU dataset.

The rest of the paper is organized as follows.

In "Proposed method" section, the algorithm and model architecture of the proposed method are elaborated. The results of 3SNet are discussed in "Results" section, whereas in "Discussion" section the results are compared with the state-of-the-art method. Finally, "Conclusion" section concludes the paper.

Proposed method

In this study, we developed a deep convolutional neural network model called 3SNet, which incorporates a multi-scale feature fusion approach; its architecture is depicted in Fig. 1. The feature fusion model is designed to extract features from the grey image as well as the corresponding histogram of oriented gradients (HOG) and local binary pattern (LBP) images. These features are then integrated to enhance their effectiveness, after which a classification module performs the classification of leukemia cells.

Figure 1. Proposed 3SNet for leukemia diagnosis.

The convolution blocks are designed using depth-wise convolution techniques to reduce computation costs. Several past methods have significantly improved leukemia cell classification; however, their limitations motivated us to design a robust and efficient model. A detailed summary of these models is given in Table 2.

Table 2 The detailed summary of the previous models used for leukemia classification.

Local binary pattern (LBP)

The texture of leukemia cells is heterogeneous, which can be exploited to categorize them. Hence, in the proposed work, we have used the powerful feature descriptor developed by Ojala et al.52. This descriptor combines occurrence statistics with local structure analysis by assigning a binary pattern to each pixel \({p}_{c}\): the grey-level value of \({p}_{c}\) is compared with the Q sampling points on a circle of radius R centred at \({p}_{c}\). The LBP of the central pixel \({p}_{c}\) is calculated as follows.

$$LBP_{Q,R} \left( {p_{c} } \right) = \mathop \sum \limits_{q = 0}^{Q - 1} s\left( {q_{c} - p_{c} } \right)2^{q}$$
(1)

Here, \(s\left( x \right) = 1\) if \(x \ge 0\) and 0 otherwise, so Eq. (1) assigns bit 1 to every neighbour \({q}_{c}\) whose grey level is at least that of the centre pixel. Finally, the LBP image is created by combining the texture descriptor and the LBP distribution pattern, as illustrated in Fig. 2. The histogram vector H of the LBP for image representation is given as follows, where the image is of size W × D, \(\delta\) is the Kronecker delta, and the k-th bin counts the pixels carrying pattern k.

$$H_{k} = \mathop \sum \limits_{i = 1}^{W} \mathop \sum \limits_{j = 1}^{D} \delta \left( {LBP_{Q,R} \left( {i,j} \right) - k} \right)$$
(2)
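As an illustration, Eqs. (1) and (2) can be sketched in NumPy as below. This is a minimal implementation assuming an 8-bit grey image, nearest-neighbour sampling of the circular neighbours, and wrap-around borders; the function names are ours, and scikit-image's `local_binary_pattern` provides a production-ready equivalent.

```python
import numpy as np

def lbp_image(img, Q=8, R=1):
    """LBP code of Eq. (1) for every pixel: compare the Q circular
    neighbours q_c with the centre pixel p_c and pack the sign bits."""
    out = np.zeros(img.shape, dtype=np.uint16)
    angles = 2 * np.pi * np.arange(Q) / Q
    for q, a in enumerate(angles):
        dr = int(round(-R * np.sin(a)))  # row offset (y axis points down)
        dc = int(round(R * np.cos(a)))   # column offset
        # neigh[r, c] == img[r + dr, c + dc] (borders wrap around)
        neigh = np.roll(img, shift=(-dr, -dc), axis=(0, 1))
        # s(q_c - p_c): 1 where the neighbour is >= the centre pixel
        bit = ((neigh.astype(int) - img.astype(int)) >= 0).astype(np.uint16)
        out += bit << q
    return out

def lbp_histogram(codes, Q=8):
    """Histogram H_k of Eq. (2): count of pixels with LBP code k."""
    return np.bincount(codes.ravel().astype(np.int64), minlength=2 ** Q)
```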
Figure 2. Here, (a–c) are sample images used in the experiment.

The LBP image and its feature descriptor calculation are shown in Figs. 3 and 4, respectively.

Figure 3. LBP image calculation.

Figure 4. LBP image feature descriptor calculation.

Histogram of oriented gradient (HOG)

Dalal and Triggs first used the HOG descriptor for object detection53. It focuses on the local shape and structure of an object. For each region of the image, a histogram is generated by calculating the magnitude and direction of the gradient. In the proposed work, images are resized to 256 × 256. After that, a sliding window of size 3 × 3 is used to calculate the gradient \(Grad_{x}\) in the X-direction and \(Grad_{y}\) in the Y-direction as follows.

$$Grad_{x} = Im(r,c + 1) - Im(r,c - 1)$$
(3)
$$Grad_{y} = Im(r - 1,c) - Im(r + 1,c)$$
(4)

where r and c refer to the row and column of the image. Finally, magnitude and direction are calculated using the following formulae.

$$Magnitude\left( M \right) = \sqrt {Grad_{x}^{2} + Grad_{y}^{2} }$$
(5)
$$Direction\left( D \right) = \arctan \left( {\frac{{Grad_{y} }}{{Grad_{x} }}} \right)$$
(6)
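A minimal NumPy sketch of Eqs. (3)–(6) follows; the function name is ours, borders are left at zero, and `arctan2` is used as a numerically safe form of Eq. (6). A full HOG descriptor additionally requires cell-wise orientation binning and block normalization, as provided, e.g., by scikit-image's `hog`.

```python
import numpy as np

def hog_gradients(im):
    """Central-difference gradients of Eqs. (3)-(4) and the
    magnitude/direction of Eqs. (5)-(6) for a grey image."""
    im = im.astype(np.float64)
    grad_x = np.zeros_like(im)
    grad_y = np.zeros_like(im)
    # Grad_x = Im(r, c+1) - Im(r, c-1): horizontal (X) direction
    grad_x[:, 1:-1] = im[:, 2:] - im[:, :-2]
    # Grad_y = Im(r-1, c) - Im(r+1, c): vertical (Y) direction
    grad_y[1:-1, :] = im[:-2, :] - im[2:, :]
    magnitude = np.sqrt(grad_x ** 2 + grad_y ** 2)
    direction = np.arctan2(grad_y, grad_x)  # robust form of Eq. (6)
    return magnitude, direction
```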

Novel 3-scale deep CNN model (3SNet)

We have designed a novel 3-scale deep CNN model in the present study. The grey image and its corresponding LBP and HOG images are fed as inputs to the model. Each scale is seven layers deep and contains convolution blocks Conv1, Conv2, Conv3, Conv4, Conv5, and Conv6 with 3 × 3 filters and 16, 32, 64, 128, 256, and 512 channels, respectively. After each convolution block, batch normalization (BN) and a rectified linear unit (ReLU) are applied; the ReLU activation adds non-linearity to the model by thresholding the activations obtained from the BN layers. The model has \(9 \times 10^{9}\) trainable parameters, and it can avoid degradation, saturation of the model, and gradient problems54. The ReLU activation is defined as follows.

$$F\left( x \right) = \left\{ {\begin{array}{*{20}l} {0,} & {x < 0} \\ {x,} & {x \ge 0} \\ \end{array} } \right.$$
(7)

where x is the input to the layer. After each convolution layer, a max-pooling layer of size 3 × 3 with a stride of 2 × 2 is incorporated. Finally, global average pooling is applied at each scale, generating channel descriptors that are combined for feature fusion. The fused feature acts as input to a fully connected layer with 1024 units, followed by BN and ReLU activation. In the end, a dense layer of 15 neurons was added for the AML-Cytomorphology_LMU dataset. Multiclass classification is performed using the Softmax activation function, which converts logits into probabilities; the probability value, computed from the input weights and bias, is then mapped to a particular class of leukemia cells. The Softmax value can be calculated using Eqs. (8) and (9).

$$P\left( {x = k|\Phi^{\left( i \right)} } \right) = \frac{{e^{{\Phi_{k}^{\left( i \right)} }} }}{{\mathop \sum \nolimits_{j = 0}^{N - 1} e^{{\Phi_{j}^{\left( i \right)} }} }}$$
(8)
$$\Phi = w_{0} y_{0} + w_{1} y_{1} + \ldots + w_{N} y_{N}$$
(9)

where N = 15 is the number of classes, \({w}_{0}{y}_{0}\) is the bias term, \(\Phi\) is the input vector, and k = 0–14 indexes the 15 classes of leukemia cells.
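The following Keras sketch shows one scale of the network as we read the description above: six depth-wise separable convolution blocks (Conv1–Conv6, 16–512 filters), each followed by batch normalization, ReLU, and 3 × 3 max pooling with stride 2, ending in global average pooling. Padding, layer ordering, and other details not stated in the text are our assumptions, not the authors' exact implementation.

```python
import tensorflow as tf
from tensorflow.keras import layers

def scale_branch(input_shape=(256, 256, 1), name="branch"):
    """One scale of 3SNet: Conv1-Conv6 depth-wise separable blocks
    (16 to 512 filters), each with BN + ReLU and 3x3/stride-2 max
    pooling, followed by global average pooling (512-d descriptor)."""
    inp = layers.Input(shape=input_shape)
    x = inp
    for i, filters in enumerate([16, 32, 64, 128, 256, 512], start=1):
        x = layers.SeparableConv2D(filters, 3, padding="same",
                                   name=f"{name}_conv{i}")(x)
        x = layers.BatchNormalization()(x)
        x = layers.ReLU()(x)
        x = layers.MaxPooling2D(pool_size=3, strides=2, padding="same")(x)
    x = layers.GlobalAveragePooling2D()(x)  # 512-d channel descriptor
    return tf.keras.Model(inp, x, name=name)
```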

Feature fusion

Feature fusion improves the performance of the deep CNN. We have used three deep CNN branches for feature extraction in the proposed method. The features extracted from the HOG, leukemia cell, and LBP images are fused as follows.

$$X = \left\{ {x_{1} ,x_{2} \ldots x_{n} } \right\}$$
(10)
$$Y = \left\{ {y_{1} ,y_{2} \ldots y_{n} } \right\}$$
(11)
$$Z = \left\{ {z_{1} ,z_{2} \ldots z_{n} } \right\}$$
(12)

where X, Y, and Z are the feature vectors extracted from the HOG, leukemia cell, and LBP images, respectively, and \(n = 512\). An enhanced feature pool is generated by concatenation as follows.

$$F_{con} = X \oplus Y \oplus Z = \left( {x_{1} ,x_{2} , \ldots x_{n} ,y_{1} ,y_{2} , \ldots y_{n} ,z_{1} ,z_{2} , \ldots z_{n} } \right)$$
(13)

where \({F}_{con}\) is the final feature vector, a bag of 1536 features. The original images and their LBP and HOG images are shown in Fig. 5.
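Building on the branch sketch above, the concatenation of Eq. (13) and the classification head described earlier can be assembled roughly as follows (again a sketch under the same assumptions).

```python
from tensorflow.keras import layers, Model

# one branch per modality: grey image, LBP image, HOG image
grey_b = scale_branch(name="grey")
lbp_b = scale_branch(name="lbp")
hog_b = scale_branch(name="hog")

# Eq. (13): concatenate the three 512-d descriptors into F_con (1536-d)
fused = layers.Concatenate()([grey_b.output, lbp_b.output, hog_b.output])
x = layers.Dense(1024)(fused)      # fully connected layer, 1024 units
x = layers.BatchNormalization()(x)
x = layers.ReLU()(x)
out = layers.Dense(15, activation="softmax")(x)  # Eqs. (8)-(9), 15 classes

model = Model([grey_b.input, lbp_b.input, hog_b.input], out,
              name="three_scale_net")
```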

Figure 5. Here, (a–c) are the original images and (d–i) their corresponding LBP and HOG images, respectively.


Consent to participate

The authors declare their consent to participate in this article.

Results

Dataset

The images used in this research are taken from the publicly available Munich AML Morphology Dataset, containing 18,365 expert-labelled single-cell images55. These single-cell images were produced with an M8 digital microscope/scanner from peripheral blood smears of 100 people in each of two groups: the first comprising patients diagnosed with acute myeloid leukemia at Munich University Hospital between 2014 and 2017, and the second comprising patients without signs of hematological malignancy.

Training and validation

The training and validation of the proposed method are performed in Python 3.6 and TensorFlow 2.0 on Windows 10, using an Nvidia GeForce GTX TITAN X GPU and 128 GB RAM. Leukemia cells such as lymphocytes and promyelocytes have very similar morphological characteristics. Moreover, several classes in the dataset, namely lymphocyte, basophil, promyelocyte, promyelocyte (bilobed), myelocyte, metamyelocyte, monoblast, erythroblast, and smudge cells, have fewer than 100 images. Due to this, high classification accuracy is difficult to achieve. Considering these challenges, a multimodal feature fusion-based model has been proposed to discriminate 15 classes of leukemia cells. The 3SNet model is trained with an image size of 256 × 256 pixels and a batch size of 32 for 50 epochs; the initial learning rate was set to 0.0001. Since the dataset is imbalanced, we applied fivefold cross-validation to avoid biased performance estimates: in each fold, one set (20% of the images) is used for validation and the remaining four sets (80%) for training. In Fig. 6, we depict the confusion matrix of each fold, from which the average performance measures precision, recall, F1-score, and accuracy are calculated.
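A sketch of this fivefold protocol under the stated settings (256 × 256 inputs, batch size 32, 50 epochs, initial learning rate 0.0001) follows; the Adam optimizer, the stratified split, and the `build_3snet` helper wrapping the earlier fusion sketch are our assumptions, since the text does not specify them.

```python
import tensorflow as tf
from sklearn.model_selection import StratifiedKFold

# X_grey, X_lbp, X_hog: assumed preloaded (N, 256, 256, 1) arrays;
# y: assumed (N, 15) one-hot labels for the 15 cell classes.
skf = StratifiedKFold(n_splits=5, shuffle=True, random_state=42)
for fold, (tr, va) in enumerate(skf.split(X_grey, y.argmax(axis=1)), 1):
    model = build_3snet()  # hypothetical helper: fresh fused model per fold
    model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=1e-4),
                  loss="categorical_crossentropy", metrics=["accuracy"])
    model.fit([X_grey[tr], X_lbp[tr], X_hog[tr]], y[tr],
              validation_data=([X_grey[va], X_lbp[va], X_hog[va]], y[va]),
              batch_size=32, epochs=50)
```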

Figure 6. The confusion matrices of fold 1 to fold 5 are shown in (a–e), respectively.

The categorical_crossentropy loss function is used to calculate the training and validation loss of the proposed method, shown in Fig. 7. We can see in Fig. 7a that the validation accuracy initially fluctuates, but after 40 epochs the changes are negligible. Similarly, in Fig. 7b, the training loss approaches zero, and the validation loss, after early fluctuation, stabilizes after 40 epochs. This shows that the 3SNet model can differentiate leukemia cells with high accuracy and low training and validation loss.

Figure 7. The training and validation accuracy and loss of 3SNet are shown in (a) and (b), respectively.

The performance measures of the model are calculated for each fold, as shown in Table 3, which lists the precision, recall, F1-score, and accuracy values. It can be observed that in fold 1 the model's performance is below 50%, after which it gradually increases in subsequent folds. Overall, the proposed model achieved an average of 87.93% precision, 88.65% recall, 88.11% F1-score, and 98.16% accuracy.

Table 3 The performance measures of the 3SNet model.
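For reference, the macro-averaged measures of Table 3 can be derived from each fold's confusion matrix with the standard formulas, sketched below (our own illustration, not the authors' code).

```python
import numpy as np

def metrics_from_confusion(cm):
    """Macro-averaged precision/recall/F1 and overall accuracy from a
    KxK confusion matrix (rows: true class, columns: predicted class)."""
    cm = cm.astype(np.float64)
    tp = np.diag(cm)
    precision = tp / np.maximum(cm.sum(axis=0), 1)  # TP / (TP + FP)
    recall = tp / np.maximum(cm.sum(axis=1), 1)     # TP / (TP + FN)
    f1 = 2 * precision * recall / np.maximum(precision + recall, 1e-12)
    accuracy = tp.sum() / cm.sum()
    return precision.mean(), recall.mean(), f1.mean(), accuracy
```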

Discussion

Microscopic image analysis of blood smears provides essential data for diagnosing and predicting diseases in hematological assessment. Blood comprises three major cellular components: red blood cells (RBCs), white blood cells (WBCs), and platelets. Of these, WBCs are part of the immune system and play an important role in the body's defence. Leukemia, a blood malignancy that affects the bone marrow and lymphatic system, is generally caused by abnormalities in these WBCs. Morphological differences between the lymphocytes in blood and bone marrow of patients with chronic lymphocytic leukemia and those of healthy individuals have been noticed in various studies. These morphological differences can potentially be used to diagnose the malignancy at various stages, from the primary to the acute stage. Nevertheless, the manual detection of these morphological differences requires expertise, effort, and time; it is therefore necessary to automate this diagnosis with the help of CNNs. In this study, we used a dataset of 18,365 leukemia cells divided into 15 classes. The dataset is expert-annotated and unbalanced due to the unequal distribution of images across classes; out of 15 classes, nine contain fewer than 100 images. In Table 4, we present a summary of several methods using different CNN models on different datasets.

Table 4 Comparison of 3SNet with the recent deep learning methods.

In the past, much research on leukemia cell classification has been reported, as shown in Table 4. In this regard, Thanh et al.35 developed a CNN model for normal and abnormal cell classification; they applied data augmentation to increase the dataset's size, and the model's classification accuracy is 96.6%. In similar research, Shafique et al.56 classified blood smears and their three subtypes using AlexNet, avoiding overfitting through data augmentation and achieving 96.06% classification accuracy. Pansombut et al.57 utilized machine and deep learning to classify leukemia cells: features are first extracted using a ConvNet, then optimized using a genetic algorithm, and finally a classification accuracy of 81.74% is obtained using a support vector machine (SVM). Ahmed et al.59 reported a comparative study of several machine-learning algorithms and the effect of data augmentation on training; they also proposed a deep CNN model that classifies leukemia cells with an accuracy of 88% and their subtypes with an accuracy of 81%.

Prellberg and Kramer60 classified leukemia cells using ResNeXt50 with a Squeeze-and-Excitation block; training on original and augmented images, they achieved a weighted F1-score of 89.91%. Much research on leukemia cell classification has also applied a transfer learning-based approach. Loey et al.61 compared the performance of AlexNet before and after fine-tuning, claiming that the fine-tuned AlexNet performed better and achieved an accuracy of 100%. In similar research, Vogado et al.62 applied three deep learning models, AlexNet, CaffeNet, and VGG-f, to extract features from leukemia cells, with SVM and KNN classifiers applied for classification; they reported that the SVM classifier outperformed, achieving an accuracy of 99.76%. Ruberto et al.63 also extracted features from a pre-trained AlexNet; before extracting features, however, they applied preprocessing, blob detection, and segmentation to extract the objects of interest. Their method achieves 94.1% classification accuracy.

Rehman et al.64 extracted features using a deep CNN model and performed a comparative analysis of three classifiers on the deep features: Naïve Bayes achieved 78.34%, KNN 80.42%, SVM 90.91%, and their proposed deep classifier 97.78% accuracy. Huang et al.65 also applied a transfer learning approach to extract features from leukemia cells; Inception-V3, ResNet50, and DenseNet121 classified with notable accuracies of 74.8%, 84.9%, and 95.3%, respectively.

In short, all these methods have a high potential for the classification of blood smears. However, many researchers experimented on small datasets and used data augmentation techniques to increase the dataset size. Image augmentation can avoid overfitting, but many images of the same type can bias the model's performance. In addition, blood smear classes with smaller numbers of images in the dataset need to be explored for better classification. Therefore, we have not applied data augmentation in the proposed method and instead focused on the blood smear classes with fewer images. Features extracted from the HOG, leukemia cell, and LBP images are aggregated to form a fused feature vector that improves the classification performance. 3SNet is a three-scale sequential model used for feature extraction and classification; each scale is trained with 256 × 256 pixel inputs with a batch size of 32 for 50 epochs, and a fivefold cross-validation scheme is applied to evaluate bias-free performance. The multi-scale fusion-based CNN model outperforms on most blood smear classes, and outstanding performance is obtained for cells with fewer than 100 images in the dataset. The average sensitivity and precision obtained from fivefold cross-validation are more than 95% for cells with more than 1000 images in the dataset and about 70% for cells with fewer than 100 images. The class-wise performance of each cell class has been compared with the method proposed by Matek et al.34.

Table 5 shows that the neutrophil (segmented) cells have 8484 images, the highest number in the dataset. For this cell, the precision of the model is close to 99%, and the sensitivity is 99.4%, better than the 96% of Matek et al.34. For other leukemia cells with more than 1000 images in the dataset, the fusion-based model outperforms the available method. Furthermore, 3SNet is highly sensitive toward cells with fewer than 100 images in the dataset: for such cells, except for the myelocyte class with 76.2% precision, the model achieved more than 80% precision and 80% sensitivity. This notable precision and sensitivity confirm that the proposed 3SNet model can be used for real-time diagnosis.

Table 5 Class-wise performance of 3SNet and Matek et al.34 method.

Further, a receiver operating characteristic (ROC) curve is plotted for performance visualization, taking the true positive rate on the Y-axis and the false positive rate on the X-axis67,68, as shown in Fig. 8. We can see in Fig. 8 that the ROC curve area is 1 for most leukemia cell classes, while EBO shows 98% and MON 99%. This confirms that our model is highly sensitive toward leukemia identification. The class-wise performance can also be observed in the bar chart shown in Fig. 9, where the proposed 3SNet model's sensitivity and specificity are better than those of the state-of-the-art method.

Figure 8. The ROC plot for the proposed method.
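One-vs-rest ROC curves like those in Fig. 8 can be produced along these lines with scikit-learn; `y_true` (one-hot ground truth) and `y_prob` (softmax outputs) are assumed to come from the validation folds.

```python
import matplotlib.pyplot as plt
from sklearn.metrics import roc_curve, auc

# y_true: (N, 15) one-hot labels; y_prob: (N, 15) softmax probabilities
for k in range(15):
    fpr, tpr, _ = roc_curve(y_true[:, k], y_prob[:, k])
    plt.plot(fpr, tpr, label=f"class {k} (AUC = {auc(fpr, tpr):.2f})")
plt.plot([0, 1], [0, 1], "k--")  # chance line
plt.xlabel("False positive rate")
plt.ylabel("True positive rate")
plt.legend(fontsize=6)
plt.show()
```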

Figure 9. The bar plot comparing precision and sensitivity with the method of Matek et al.34.

Ablation study of the proposed model

We conducted two experiments under the settings discussed in "Training and validation" section, changing the configuration of the proposed model as follows. In the first experiment, we removed the HOG feature and trained the model for 50 epochs with a batch size of 32. After training, the precision, recall, F1-score, and accuracy of the model were calculated, as shown in Table 6: 3SNet achieved an average precision of 86.60% and an average F1-score of 85.10%.

Table 6 The performance measures of the 3SNet model using grey and LBP features.

In the second experiment, we removed the LBP feature and trained the model using grey and HOG features for 50 epochs with a batch size of 32. The average performance measures are shown in Table 7, where the model achieved an accuracy of 96.13% and a recall of 84.61%.

Table 7 The performance measures of the 3SNet model using grey and HOG features.

The dataset used in the study is divided into training and validation sets; the proposed method used the same training and validation split as Matek et al.34. However, we also conducted an ablation study in which the dataset was divided into 80%, 10%, and 10% for training, validation, and testing, respectively. The class-wise sensitivity and precision of each cell on the test dataset are shown in Table 8: cells with large numbers of images achieve more than 90% sensitivity and precision, and cells with fewer images also achieve notable performance measure values.

Table 8 Class-wise performance of the proposed 3SNet on the test dataset.

Conclusion

This research proposes 3SNet, a novel deep CNN model for leukemia cell classification. Leukemia is a major blood cancer, and the morphological characteristics of blood smears are very similar across several classes, which makes classification difficult. To tackle this problem, our method implicitly extracts features from leukemia cell images and their corresponding HOG and LBP images using 3SNet. The HOG feature captures local shape, and the LBP feature describes the texture pattern of leukemia cells, which helps discriminate the morphological characteristics of blood smears. The features extracted at three scales are fused and refined to enhance the feature pool, after which the feature vector is passed to the classification module. The classification performance depicted in Table 5 confirms that the proposed method classifies with high accuracy not only cell classes with a large number of images in the dataset but also those with a smaller number. Further, the depth-wise separable convolution blocks reduce computation cost and resources. Hence, this method can be used to design computer-aided diagnostic (CAD) tools that provide a second opinion to a doctor. A limitation of the model is that images must be fed at three scales for training; in addition, the computation cost of the algorithm can be further reduced. In future work, we will add other texture features alongside the grayscale image to the deep CNN model for further performance improvement. Feature optimization techniques can also be applied to the feature pool to enhance the fused features, and other lightweight deep CNN models with attention mechanisms can be explored to improve classification performance. Finally, the 2D convolutional layers of the proposed model can be replaced with 3D convolutional layers to analyse 3D images, which will improve the model's capability to diagnose disease more accurately.