Abstract
Lung cancer remains the leading cause of cancer-related deaths worldwide, largely because of late diagnosis and the limitations of manual interpretation of imaging data. Low-dose computed tomography (LDCT), which uses a lower X-ray dose than a standard CT scan, has been widely adopted for early screening. However, noise and subtle nodular patterns often impair diagnostic accuracy. In this study, the authors propose a novel hybrid deep learning model that uses BM3D filtering for pre-processing and YOLOv8 for segmentation, and that integrates Convolutional Neural Networks (CNNs) with Transformer encoders to enhance the early detection of lung cancer in LDCT scan images. The model combines the spatial feature extraction of CNNs with the contextual reasoning capability of Transformers to achieve superior classification performance. During model training, BM3D filtering, an advanced image pre-processing technique, is applied to reduce noise and enhance structural details, after which YOLOv8 is used for segmentation. The proposed hybrid model achieved 93.8% sensitivity, 95.1% accuracy, 94.4% F1-score, 96.2% specificity, a Dice coefficient of 0.92, and an AUC of 0.97 for classification. Experimental results demonstrate that the proposed model outperforms existing models in terms of accuracy, precision, recall, AUC, and Dice coefficient. These findings suggest that the hybrid model holds strong potential as a robust tool for early lung cancer screening and clinical decision support.
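The evaluation metrics named in the abstract can be made concrete with a minimal sketch of how they are commonly computed from binary predictions. This is not the authors' evaluation code: the function names (`confusion_counts`, `classification_metrics`, `dice_coefficient`) and the label convention (1 = malignant, 0 = benign) are assumptions made for illustration, and AUC is omitted because it requires continuous scores rather than hard labels.

```python
from typing import Dict, Sequence, Tuple


def confusion_counts(y_true: Sequence[int], y_pred: Sequence[int]) -> Tuple[int, int, int, int]:
    """Return (tp, fp, tn, fn) for binary labels, assuming 1 = positive class."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp, fp, tn, fn


def _ratio(num: float, den: float) -> float:
    """Divide safely, returning 0.0 when the denominator is zero."""
    return num / den if den else 0.0


def classification_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Compute sensitivity, specificity, accuracy, precision, and F1-score."""
    tp, fp, tn, fn = confusion_counts(y_true, y_pred)
    sensitivity = _ratio(tp, tp + fn)   # recall on the positive class
    specificity = _ratio(tn, tn + fp)   # recall on the negative class
    precision = _ratio(tp, tp + fp)
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "accuracy": _ratio(tp + tn, tp + fp + tn + fn),
        "precision": precision,
        "f1": _ratio(2 * precision * sensitivity, precision + sensitivity),
    }


def dice_coefficient(mask_a: Sequence[int], mask_b: Sequence[int]) -> float:
    """Dice overlap between two flattened binary masks of equal length."""
    intersection = sum(1 for a, b in zip(mask_a, mask_b) if a == 1 and b == 1)
    total = sum(mask_a) + sum(mask_b)
    # Two empty masks are treated as a perfect match by convention.
    return 2 * intersection / total if total else 1.0
```

For binary labels the F1-score and the Dice coefficient are algebraically identical, so the separate Dice value reported in the abstract presumably describes overlap between predicted and reference segmentation masks; that reading is an assumption, not something the abstract states.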
Data availability
The LIDC-IDRI dataset used in this study is publicly available. The code is available on request from the corresponding author (Gagan Thakral).
Author information
Contributions
Gagan Thakral conceptualized and developed the model; implemented the BM3D pre-processing, the YOLOv8 segmentation, and the CNN–Transformer architecture; carried out the experiments and analysis; and drafted the initial manuscript. Dr. Umesh Kumar provided technical guidance, methodological validation, and critical revisions to enhance the scientific quality of the work. Dr. Sapna Gambhir supported the research design, validated the results, and comprehensively reviewed and refined the manuscript. All authors read and approved the final version of the manuscript.
Ethics declarations
Competing interests
The authors declare no competing interests.
Ethical approval
This study was conducted according to ethical standards.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Thakral, G., Kumar, U. & Gambhir, S. Hybrid CNN–transformer model with BM3D and YOLOv8 for early detection of lung cancer in low-dose CT scans. Sci Rep (2026). https://doi.org/10.1038/s41598-026-43517-5