Hybrid convolutional neural network and bi-LSTM model with EfficientNet-B0 for high-accuracy breast cancer detection and classification

Lilhore, Umesh Kumar; Sharma, Yogesh Kumar; Shukla, Brajesh Kumar; Vadlamudi, Muniraju Naidu; Simaiya, Sarita; Alroobaea, Roobaea; Alsafyani, Majed; Baqasah, Abdullah M.

doi:10.1038/s41598-025-95311-4

Download PDF

Article
Open access
Published: 09 April 2025

Hybrid convolutional neural network and bi-LSTM model with EfficientNet-B0 for high-accuracy breast cancer detection and classification

Umesh Kumar Lilhore¹,
Yogesh Kumar Sharma²,
Brajesh Kumar Shukla³,
Muniraju Naidu Vadlamudi⁴,
Sarita Simaiya^1,5,
Roobaea Alroobaea⁶,
Majed Alsafyani⁶ &
…
Abdullah M. Baqasah⁷

Scientific Reports volume 15, Article number: 12082 (2025) Cite this article

7515 Accesses
16 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Breast cancer detection remains one of the most challenging problems in medical imaging. We propose a novel hybrid model that integrates Convolutional Neural Networks (CNNs), Bidirectional Long Short-Term Memory (Bi-LSTM) networks, and EfficientNet-B0, a pre-trained model. By leveraging EfficientNet-B0, which has been trained on the large and diverse ImageNet dataset, our approach benefits from transfer learning, enabling more efficient feature extraction from mammographic images compared to traditional methods that require CNNs to be trained from scratch. The model further enhances performance by incorporating Bi-LSTM, which allows for processing temporal dependencies in the data, which is crucial for accurately detecting complex patterns in breast cancer images. We fine-tuned the model using the Adam optimizer to optimize performance, significantly improving accuracy and processing speed. Extensive evaluation of well-established datasets such as CBIS-DDSM and MIAS resulted in an outstanding 99.2% accuracy in distinguishing between benign and malignant tumors. We also compared our hybrid model to other well-known architectures, including VGG-16, ResNet-50, and DenseNet169, using three optimizers: Adam, RMSProp, and SGD. The Adam optimizer consistently achieved the highest accuracy and lowest loss across the training and validation phases. Additionally, feature visualization techniques were applied to enhance the model’s interpretability, providing deeper insight into the decision-making process. The Proposed hybrid model sets a new standard in breast cancer detection, offering exceptional accuracy and improved transparency, making it a valuable tool for clinicians in the fight against breast cancer.

Breast cancer classification based on hybrid CNN with LSTM model

Article Open access 05 February 2025

Integrative hybrid deep learning for enhanced breast cancer diagnosis: leveraging the Wisconsin Breast Cancer Database and the CBIS-DDSM dataset

Article Open access 01 November 2024

An integrated framework for breast mass classification and diagnosis using stacked ensemble of residual neural networks

Article Open access 18 July 2022

Introduction

Breast cancer is still one of the most common and deadly diseases in the world, accounting for a significant proportion of cancer-related deaths in women. Breast cancer kills over 670,000 people worldwide each year, according to current statistics, and the number of new cases increases year after year¹. Early detection of breast cancer is critical for lowering mortality rates because it allows for more effective treatment options and improves survival. Early detection of tumors enables healthcare professionals to provide more targeted and personalized treatment, significantly improving prognosis. Breast cancer often appears asymptomatic in its early stages, necessitating the development of dependable and accurate predictive models to detect subtle signs of malignancy^2,3.

A variety of traditional methods have been used over the years to analyze and classify breast cancer, with a focus on imaging techniques such as mammography, ultrasound, and biopsy⁴. Mammography is the most reliable method for detecting early breast cancer; however, it has limitations, particularly in patients with dense breast tissue, where tumors may be concealed. Ultrasound and magnetic resonance imaging (MRI) are frequently used as adjuncts to mammography; however, these techniques necessitate specialized evaluation and can result in subjective interpretations^5,6.

Recent years have demonstrated the ability of statistical and machine learning models to improve the accuracy of breast cancer diagnosis. Support Vector Machines (SVM), Random Forests, and k-nearest Neighbours (k-NN) algorithms have been used to predict the likelihood of malignancy in various datasets, including mammography images and clinical information. However, these models frequently fail to capture the complex relationships in breast cancer data, especially in large and multidimensional datasets. Traditional machine learning methods may not fully exploit the spatial and temporal patterns inherent in medical imaging data^7,8.

Role of machine learning and deep learning in breast cancer diagnosis

Machine learning (ML) and deep learning (DL) are crucial in breast cancer detection, offering significant improvements over traditional methods. Machine learning algorithms, such as decision trees, support vector machines, and logistic regression, have effectively classified breast cancer from mammographic images and diverse clinical data. Nevertheless, their capacity to derive significant patterns from intricate and high-dimensional data is frequently constrained^9,10.

Deep learning models, particularly CNNs, have revolutionized the approach to medical image analysis. CNNs can independently learn features from raw medical images, significantly reducing the need for manual feature extraction. These models have exhibited significant effectiveness in breast cancer diagnosis, especially in mammogram analysis, where CNNs can detect anomalies that may be overlooked by human specialists¹¹. Despite their proficiency in recognizing spatial features, CNNs are not particularly adept at capturing temporal patterns, such as tumor growth or morphological changes over time, which are crucial for accurate predictions.

The amalgamation of spatial and temporal data has improved the effectiveness of deep learning models in breast cancer diagnosis. Models like Recurrent Neural Networks (RNNs) and, more recently, Bi-LSTM networks have demonstrated remarkable efficacy in tasks requiring temporal data processing. These models are particularly beneficial in situations where the progression of the disease is monitored over time^12,13. Despite these advancements, existing models still face challenges, including the requirement for large labeled datasets, hyperparameter optimization difficulties, and overfitting issues.

Challenges in existing research

ML and DL are essential in breast cancer detection, providing substantial advancements compared to conventional techniques. Machine learning algorithms, including decision trees, support vector machines, and logistic regression, have effectively classified breast cancer using mammographic images and clinical data. Nonetheless, their ability to extract meaningful patterns from complex and high-dimensional data is often limited¹⁰. Deep learning models, especially CNNs, have transformed the methodology of medical image analysis. CNNs can autonomously extract features from unprocessed medical images, greatly diminishing the necessity for manual feature extraction. These models have demonstrated considerable efficacy in breast cancer diagnosis, particularly in mammogram analysis, where CNNs can identify anomalies that may be missed by human experts^8,14. Although CNNs excel at identifying spatial features, they are not exceptionally skilled at capturing temporal patterns, such as tumor growth or morphological changes over time, which are essential for precise predictions.

Integrating temporal and spatial information has enhanced the efficacy of deep learning models in breast cancer diagnosis. Models such as RNNs and, more recently, Bi-LSTM networks have exhibited exceptional efficacy in tasks necessitating temporal data processing. These models are especially advantageous when the disease’s progression is tracked over time¹⁵. Notwithstanding these advancements, current models continue to encounter challenges, such as the necessity for extensive labeled datasets, hyperparameter optimization complications, and overfitting problems.

Motivation for the research

The detection and classification of breast cancer through mammographic images is a critical but challenging task, mainly due to the limitations of existing deep-learning models. While state-of-the-art models, such as CNNs, VGG-16, and ResNet, have succeeded in image classification, they often struggle with complex features in mammogram images, such as subtle differences between benign and malignant tumors. Moreover, these models typically do not capture the temporal or contextual dependencies in medical imaging, which are essential for accurate diagnosis. Additionally, the performance of these models can be limited by the need for large amounts of labeled data and the computational cost of training deep networks from scratch.

Our proposed hybrid model addresses these deficiencies by effectively combining advanced CNNs with Bi-LSTM and EfficientNet-B0¹⁶, using transfer learning to extract features from the pre-trained EfficientNet-B0 model. This allows us to overcome the need for large labeled datasets while improving accuracy. The Bi-LSTM component enhances the model’s ability to capture temporal dependencies in the images, further improving classification performance. By fine-tuning the model’s hyperparameters and leveraging advanced optimization techniques, we improve the model’s speed and accuracy. This novel approach significantly outperforms existing models in breast cancer detection, providing a more reliable, interpretable, and efficient solution for clinical use.

Key contributions of the work

Our research develops an Improved Adam optimization-optimized hybrid CNN-LSTM model to address these challenges. We aim to build a model that improves breast cancer image prediction and integrates complex spatial and temporal features. We want to improve the model’s computational efficiency and interpretability. The key contribution of the article is as follows:

Improved CNN architecture By adding more convolutional layers and sophisticated feature extraction techniques, the aim is to capture intricate spatial patterns more effectively in breast cancer images. This adjustment improves the overall performance of the model and represents features more precisely.
Enhanced Bi-LSTM and Transfer Learning: Improve the Bi-LSTM structure to represent sequential relationships in data better. The LSTM is optimized to handle temporal aspects of the data more effectively, resulting in higher prediction accuracy and model stability. Similarly, a Transfer learning method uses pre-trained CNN EfficientNet-B0, which is trained on ImageNet.
Optimize hyperparameter tuning This is performed by Adam optimization, which addresses issues such as overfitting and underfitting, resulting in faster, more reliable predictions and improved model efficiency.
Improved prediction accuracy across the popular breast cancer datasets CBIS-DDSM and MIAS, the proposed approach outperforms existing deep learning models, i.e., VGG-16, VGG-19, DenseNet169, ResNet-50, and DenseNet201, with better accuracy.

The complete article is organized as follows: section two covers related breast cancer detection and analysis work using machine and deep learning methods. Section three covers materials and techniques related to the research. This section covers the functioning of the proposed model and details the dataset. Section four covers the simulation results and analysis of existing and proposed methods; section five covers the conclusion and future direction of the research.

Related works

Breast cancer remains a prominent issue in worldwide healthcare, necessitating the development of sophisticated and accurate diagnostic technologies. Recently, there has been a significant focus on utilizing deep learning methodologies in medical image processing, specifically in breast cancer forecasting and categorization.

Deep learning applications in breast cancer diagnosis

Deep learning models have recently shown significant promise in breast cancer diagnosis. Diverse methodologies have been proposed to improve the accuracy of breast cancer detection and classification using medical imaging techniques. A study used CNNs to detect breast cancer, with a classification accuracy of 89% using mammographic images. The study highlighted the importance of integrating deep learning models to improve model robustness and applicability across diverse populations, implying that future research should prioritize data collection from multiple research institutions.

A recent study introduced a hybrid model that uses MRI scans to predict the treatment response of breast cancer patients by combining radiomic features with convolutional neural networks. The model achieved an accuracy rate of 88%. The authors emphasized the importance of rigorous validation across various imaging protocols to ensure the model’s relevance in clinical settings. An alternative method, described in¹¹, used intra- and inter-modality attention mechanisms for prognostic prediction in breast cancer and achieved a sample accuracy of 91%. This model highlighted the need for more extensive and diverse datasets to address data imbalances and improve predictive accuracy. Many ancillary studies have focused on histopathological images and cytopathology about breast cancer classification. An ensemble learning method in¹² used annotated histopathological slides from various sources to improve diagnostic accuracy, achieving a precision of 90%. Another study in¹⁷ used CNNs to classify cytopathology images and achieved an accuracy of 85%. These studies’ findings emphasize the importance of feature extraction and the challenges of interpretability in complex models. They propose that future initiatives prioritize the development of explainable AI to assist healthcare professionals in clinical decision-making.

Integrating multimodal data for improved diagnosis

Recent advances in multimodal data fusion approaches have improved the efficacy of machine learning models for detecting breast cancer. A study cited in¹⁸ investigated using HER-2 and ER biomarkers with deep neural networks to detect breast cancer. The study combined biological markers and imaging data, demonstrating a high potential for accurate breast cancer segmentation and classification. Furthermore³ looked into using deep neural networks to classify breast cancer using mammographic images, with a pre-processed dataset to improve clinical relevancy. The study cited in¹⁹ demonstrated a significant improvement, as the authors used the XGBoost algorithm to identify the most relevant features for breast cancer prediction, achieving accuracy comparable to all features while significantly shortening training time. The study found that feature selection significantly improves model efficiency.

Studies show that deep learning methods are effective for early detection of breast cancer in a variety of settings. A study by¹⁰ investigated using machine learning algorithms and Artificial Neural Networks (ANNs) to predict breast cancer recurrence. This method showed promise in providing personalized treatment recommendations and increasing patient survival rates. A one-of-a-kind research initiative developed a classification system for breast cancer detection using IoT-enabled imaging data that achieved an accuracy of 89.2%. This study emphasized the importance of real-time data processing in shortening diagnostic timelines while recognizing potential privacy and security concerns¹⁴.

Emerging trends and future directions

Numerous studies have illustrated the efficacy of deep learning models in breast cancer detection; however, several domains remain for future research to enhance model performance. For instance²⁰, recognized the necessity for enhanced model generalisability across varied patient demographics and imaging methodologies. This constraint underscores the need to employ more extensive and diverse datasets in model training. Furthermore, research including^8,21 has indicated that despite the remarkable accuracy of deep learning models, issues concerning data imbalance, feature extraction, and overfitting remain prevalent.

An additional critical focus is the advancement of explainable AI (XAI) methodologies to improve model transparency. Research^8,22 indicates that offering interpretable results to healthcare professionals will enhance their confidence in machine learning tools and facilitate clinical decision-making. Moreover, integrating diverse datasets, including clinical, biological, and imaging data, will enable the development of more comprehensive models that yield more precise and holistic predictions. Furthermore, the incorporation of emerging technologies like the Internet of Medical Things (IoMT) can significantly augment the efficacy of breast cancer prediction models through real-time data collection and analysis. Nonetheless, the imperative of safeguarding data privacy and security must be confronted to protect patient information while preserving model efficacy, as indicated in¹³.

Table 1 Comparative analysis of various existing research in the field of breast cancer research.

Subjects

Abstract

Similar content being viewed by others

Breast cancer classification based on hybrid CNN with LSTM model

Integrative hybrid deep learning for enhanced breast cancer diagnosis: leveraging the Wisconsin Breast Cancer Database and the CBIS-DDSM dataset

An integrated framework for breast mass classification and diagnosis using stacked ensemble of residual neural networks

Introduction

Role of machine learning and deep learning in breast cancer diagnosis

Challenges in existing research

Motivation for the research

Key contributions of the work

Related works

Deep learning applications in breast cancer diagnosis

Integrating multimodal data for improved diagnosis

Emerging trends and future directions

Materials and methods

Proposed model for breast cancer

Working of the proposed hybrid model

EfficientNet-B0

Improved CNN-based feature extraction

Temporal dependencies by improved Bi-LSTM model

Transfer learning (pre-trained CNN)

Role of Adam optimization

Algorithm for the proposed hybrid model

Mathematical modelling

Datasets and data preprocessing

Cancer imaging archive - digital database for screening mammography (CBIS-DDSM)

Data pre-processing on CBIS_DDSM

Mammographic image analysis society (MIAS)

Data pre-processing on MIAS

Performance metric

Simulation results and discussion

Simulation configurations and parameters

Simulation results

Simulation results for CBIS-DDSM

Simulation results for MIAS

Results for different optimizers and impact of data pre-processing

Ablation analysis

Results and discussion

Comparative analysis with state-of-the-art methods

Conclusion and future directions

Conclusion

Future directions

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

This article is cited by

Breast lesion classification via colorized mammograms and transfer learning in a novel CAD framework

Optimizing YOLOv11 for automated classification of breast cancer in medical images

Multimodal Breast Cancer Classification Using Fractional-Order Three-Triangle Multi-delayed Neural Network Optimized with Hunger Games Search

An Innovative and Effective Deep Learning Architecture for Risk and Survival Rate Prediction of Triple Negative Breast Cancer Using Modified Optimization Strategy

Search

Quick links