Abstract
By carefully analyzing latent image properties, content-based image retrieval (CBIR) systems are able to recover pertinent images without relying on text descriptions, natural language tags, or keywords related to the image. This search procedure makes it straightforward to retrieve images automatically from large, well-balanced datasets. In the medical field, however, such datasets are usually not available. This study proposes an advanced deep learning (DL) technique to enhance the accuracy of image retrieval in complex medical datasets. The proposed model comprises five stages: pre-processing, image decomposition, feature extraction, dimensionality reduction, and classification with an image retrieval mechanism. The Hybridized Wavelet-Hadamard Transform (HWHT) was utilized to obtain both low- and high-frequency detail for analysis. The Gray Level Co-occurrence Matrix (GLCM) was employed to extract the main characteristics, and Sine chaos based Artificial Rabbit Optimization (SCARO) was utilized to minimize feature complexity. By employing the Bhattacharyya Coefficient for improved similarity matching, the Bhattacharya Context performance aware global attention-based Transformer (BCGAT) improves classification accuracy. The experimental results show that on the COVID-19 Chest X-ray image dataset the model attained accuracy, precision, recall, and F1-score of 99.5%, 97.1%, 97.1%, and 97.1%, respectively, while on the chest X-ray image (pneumonia) dataset it attained 98.60%, 98.49%, 97.40%, and 98.50%, respectively. For the NIH chest X-ray dataset, the accuracy is 99.67%.
Introduction
The usage of digital technologies for medical imaging, including X-rays, Computed Tomography (CT), and Magnetic Resonance Imaging (MRI), has increased in the current generation due to the Internet’s rapid expansion. These modalities provide functional and essential anatomical details of various body parts and support classification, monitoring, diagnosis, and treatment planning1,2. The CBIR field is defined by the abundance of images produced by different image capturing devices3. In the realm of medical image processing, content-based medical image retrieval (CBMIR) has been instrumental due to its extensive picture collections and powerful image retrieval capabilities4. Every CBMIR technique includes two fundamental steps: feature extraction and similarity measure computation5. In CBIR, features such as intensity, texture, color, and shape are extracted from query images over large image sets.
After feature extraction, a feature vector is created during similarity measure calculation to compare each retrieved image, evaluate it against the corresponding medical database, and display the most relevant images to the user6,7. Traditional text-based image retrieval (TBIR) suffers from subjective ambiguity, imprecise interpretation, and extensive annotation effort8. Medical image processing technologies such as MRI still involve high-dimensional data9. The time and effort needed to retrieve images using TBIR is higher than that of other methods, and its retrieval accuracy has been lacking10,11. Radiologists analyze medical imaging, such as CT scans, to diagnose broken bones, but such scans do not show the details of muscle infections12. Convolutional Neural Networks in CBIR are irreplaceable in encrypted image analysis13.
Medical image retrieval is aided by the Picture Archival and Communication System (PACS), although time is limited and manual effort is constrained14. As a tool in medical imaging technologies, computer-aided diagnosis (CAD) efficiently analyzes medical images for patient diagnosis15. Traditional image storage systems are less effective, require more human intervention, and offer limited feature extraction for image retrieval16. Deep learning-based CBMIR has also faced issues with transparency17. By using latent image properties to retrieve relevant images, CBMIR creates a feature vector that preserves high-level image representations of the query image18. Despite breakthroughs in CBIR, there has been little progress in clinical image retrieval algorithms that can access large-scale stores of landscape photos and medical images without relying on tags or metadata19,20.
Motivation
Using sophisticated feature extraction, classification, and similarity matching approaches, an effective content-based image retrieval system for X-ray pictures is provided in this research. Existing techniques like CNN, AOADL-CBIRH, DLECNN, and others have certain drawbacks, such as offering only a tiny dataset for visualization21 and not supplying patient details like name, gender, and age20. Additionally, there is low picture retrieval accuracy22, inability to correctly tune the hyperparameters23, and inability to recognize shape in image retrieval24,25. Moreover, some methods do not use an optimization-based feature selection algorithm, which degrades CBIR results26, handle only small datasets for the testing process27, are too complex for real-time applications28, have low efficiency in feature extraction29, or produce more noise in retrieval30. To overcome these limitations of existing methods, this study introduces a novel Bhattacharya context performance aware global attention-based transformer (BCGAT). Here, adding HWHT captures both low- and high-frequency components, which supports detailed content analysis. Adding the Bhattacharya coefficient (BC) to measure the closeness of feature distributions ensures that more relevant images are retrieved. Moreover, adding a context performance aware global attention-based transformer ensures high precision for an effective result. The main objectives are mentioned below:
-
To collect input images from the COVID-19 Chest X-ray image dataset and the chest X-ray image (pneumonia) dataset, which are passed into pre-processing.
-
To enhance contrast, reduce noise, and normalize the collected X-ray images, and to decompose them using the hybrid wavelet-Hadamard transform to support detailed content analysis.
-
To extract important textural features such as contrast, correlation, energy, and homogeneity using the Gray Level Co-occurrence Matrix (GLCM).
-
To optimize the performance, dimensionality reduction is performed using Sine chaos-based Artificial Rabbit Optimization (SCARO).
-
To effectively support diagnosis through medical image retrieval by integrating Bhattacharya Context performance aware global attention-based Transformer (BCGAT) classification.
The remainder of the paper is organized as follows. Section “Related work” presents an analysis of recent research connected to CBMIR. Section “Proposed methodology” defines the proposed CBMIR methodology for X-ray images. Section “Results and discussion” presents the performance analysis of existing and proposed models, and Section “Conclusion” concludes the paper.
Related work
Li et al.21 recommended an important 3D volume visualization method for CBIR systems. The suggested method applied the positron emission tomography and computed tomography (PET-CT) non-small cell lung cancer (NSCLC) dataset to improve the visualization of recovered volumetric images, and it has also been applied in various other fields of volumetric imaging. Rendering parameters are designed to make key structures as visible as possible: the recovery process finds a key structure first and gives it priority in a medical volume graph depicting the full PET-CT scan data. However, this method relies on a small visualization dataset.
Tuyet et al.22 suggested improving key-region optimization and deep learning for two-level content-based medical image retrieval. Local object attributes from medical images, including shape and texture, were extracted at the first level of analysis, while the second level involved an offline task and an online task for content-based picture retrieval in the database. Local feature extraction encodes the supplied user query image as code words; the collection of code words acquired in the initial phase is then used to obtain the n most similar photos through similarity comparison. The suggested method achieves an accuracy of 91.61%, which is low for image retrieval.
The scarcity of extensive and reliable datasets has constrained the application of intelligent systems for effective medical image management. Karthik et al.24 recommended a multi-view classification deep neural network model for effective content-based retrieval of medical images. This strategy addresses the issues of inadequately capturing the intrinsic features of images and the restricted accuracy associated with medical imaging. The method aims to reduce variability across different types of scans using visual classification labels for body-part orientation, such as in X-ray images where different body orientations were observed among similar retrieved images. However, this method does not identify shape in image retrieval.
This research employs a deep learning methodology for content-based medical picture retrieval, referencing the survey by Dubey et al.25. The author examines various retrieval types, networks, descriptor types, and supervision methods for building content-based picture retrieval, employing cutting-edge techniques such as chronological summarization to evaluate picture retrieval efficacy. This line of work uses class-specific features that preserve the latest trends in image representation. Various data augmentation, layer manipulation, and feature normalization based objective functions are investigated to maintain robust properties for future analysis. This method cannot properly tune the hyperparameters.
To improve image retrieval performance more accurately, a deep learning-based enhanced convolutional neural network (DLECNN) was recommended by Sivakumar et al.23. With the recommended method, irrelevant image features were removed from the noisy Corel database using histogram equalization. For a user-supplied query image, similarity measures are computed using the fuzzy c-means based HFCM algorithm, which calculates the similarity index of the query image vector against the database images. The suggested method achieves an accuracy of 90.23%. This method does not use an optimization-based feature selection algorithm, which degrades the CBIR results.
Agrawa et al.26 established a deep neural model-based content-based clinical image retrieval system for the early diagnosis and categorization of lung illnesses. The authors train transfer learning-based models for specific disease features on a standard COVID-19 Chest X-ray image dataset collection. This approach minimizes computing burden and enhances the accuracy of the CBMIR system utilizing COVID-19 Chest X-ray image datasets. The work outlines various distance-based metrics for evaluating the influence of categorization on retrieval performance, including chi-square, cosine, and Euclidean distances. The trained model was found to achieve an accuracy of 81% in fivefold cross-validation. This method handles only a small dataset for the testing process.
Rashad et al.27 described an automated method for expanding user queries in medical image retrieval using an effective RbQE methodology. The work proposes a content-based medical image retrieval (CBMIR) method for the precise retrieval of computed tomography (CT) and magnetic resonance (MR) images. Advanced characteristics are extracted from medical images using the AlexNet and VGG-19 models. The method employs two search procedures in image retrieval: a rapid search and a final search. The original query for each class is broadened to obtain the highest-ranked images for a more efficient search procedure, and in the concluding search, a query identical to the original is employed to extract data from the database. This method is too complex for real-time application.
CBIR systems use CNNs and other advanced deep learning algorithms to automatically extract complicated information from medical images. Issaoui et al.28 proposed the Archimedes Optimization Algorithm with Deep Learning Assisted Content-Based Image Retrieval (AOADL-CBIRH) approach to enable accurate retrieval of pertinent patient and diagnostic notes. Here, image quality is enhanced using adaptive bilateral filtering. The proposed method uses the ECSM-ResNet50 model with AOA-based hyperparameter tuning to improve retrieval performance, and the Manhattan distance metric determines the similarity between images for retrieval. This method suffers from low efficiency in feature extraction.
For a content-based medical image retrieval framework with query expansion based on top-ranking images, Ahmed et al.29 suggested a new expansion method that does not require interaction from users due to its fully automatic process. The suggested method enhances the efficient retrieval of related medical images and improves the retrieval model’s precision in two ways: only the most important features of the top-ranked photos are chosen, and the original expanded query image is rebuilt using the top-rated image features, with average values acting as the foundation. This method achieves a precision of 95.8% but introduces more noise in retrieval.
An enormous amount of image data has resulted from the widespread use of medical imaging in clinical diagnostics, making it difficult to organize, control, and retrieve images effectively. Cui et al.30 suggested a deep hash coding based CBMIR framework, which combined a CNN layer with hash encoding for efficient and accurate retrieval. To improve feature extraction specific to medical imaging, the suggested model combines a spatial attention block, a dense block, and a hash learning block in its feature learning network. The suggested model was evaluated on the TCIA-CT dataset, achieving an accuracy of 91.2%, and it significantly improves medical image retrieval for clinical decision making.
Padate et al.31 suggested a content-based image retrieval (CBIR) system that leverages the Harris hawks optimization (HHO) algorithm to improve feature selection and retrieval accuracy. The suggested HHO algorithm effectively strikes a balance between exploration and exploitation, optimizing the feature extraction process to improve retrieval results. The suggested system was evaluated on various datasets and compared with traditional optimization algorithms, such as particle swarm optimization (PSO) and the genetic algorithm (GA). HHO outperformed them, achieving precision, recall, and F1-score values of 90%, 88%, and 88%, respectively. Therefore, the combination of HHO optimization and image captioning crucially improves both retrieval performance and user satisfaction, making it a powerful solution for large-scale image retrieval.
CBMIR is one of the popular methods for finding similar images by comparing the input image’s inherent features with those in the database. However, the absence of extensive research in this field was the fundamental cause of the significant difficulties facing multiclass medical image CBMIR. Suresh Kumar and Celestin Vigila32 suggested a CNN-based autoencoder that enhances the accuracy of feature extraction and improves the retrieval result. The proposed model was evaluated using the ImageNet dataset and achieved a better accuracy of 95.87%, highlighting its potential as a transformative tool in advancing medical image retrieval. Table 1 represents the analysis of various existing models.
Research gap and novelty
Several existing models demonstrate limitations such as handling small datasets21,26, low accuracy in image retrieval22, inability to extract or utilize shape information24, poor hyperparameter tuning25, lack of optimization-based feature selection23, and low efficiency in feature extraction28. Traditional models also incur high computational cost30, overfitting issues31, and limited global context32. In addition, the surveyed models suffer from generalization issues, which lead to less robustness. At the same time, these approaches apply various DL models that often fail to balance computational efficiency, accurate feature representation, and retrieval relevance in real-world clinical settings. To overcome these issues, the novelty of the proposed model is achieved through the synergistic integration of multiple well-established methods: HWHT for capturing multi-resolution frequency content, GLCM for extracting an essential set of features, and intelligent dimensionality reduction using the chaotic, behaviour-based SCARO optimizer. Finally, BCGAT integrates global attention with a transformer approach for contextual performance awareness, and the BC drives the similarity computation. This combined approach leads to a highly accurate model for the X-ray image retrieval process. Furthermore, the use of the Bhattacharyya Coefficient for similarity computation introduces a more accurate retrieval mechanism that is particularly suited for distinguishing clinically similar but diagnostically different cases. The proposed model has been tested on multiple datasets, resulting in better generalization.
Proposed methodology
In order to effectively retrieve X-ray pictures of the head, chest, palm, and foot, this work proposes a CBIR framework that employs advanced feature extraction, classification, and similarity matching. Images are pre-processed by the system using contrast enhancement, scaling, normalization, and noise reduction. Images are broken down into subbands for detailed analysis by employing an HWHT, and significant textural characteristics are extracted using a GLCM.
In order to maximize efficiency, parallel processing simplifies retrieval, and SCARO minimizes feature complexity. By including the BCGAT and utilizing the BC, the framework enhances assistance for medical diagnostics by boosting similarity computation and classification accuracy. The block diagram of the proposed methodology is shown in Fig. 1. The major novelty of this research lies in the advanced combination of image decomposition, feature optimization, and advanced classification techniques. The HWHT plays a crucial role in decomposing images into multiple subbands, effectively capturing both low- and high-frequency components for a more detailed content analysis. To further optimize performance, SCARO is employed for dimensionality reduction; this method efficiently minimizes feature complexity while preserving essential information, leading to faster and more accurate computations. The use of the BCGAT for classification is a significant improvement in this approach. These operations are performed for both the input image and the query image. For the input images, the feature vectors and image labels are stored in a database. The user’s query image goes through the same process: its feature vector and class label are produced and checked against the database using the BC mechanism. By reducing false positives and false negatives, the attention mechanism in BCGAT guarantees accurate emphasis on pertinent characteristics, greatly increasing classification accuracy. The input image refers to the new image presented by a user for classification; the query image is essentially the same as the input image, representing the problem to be solved. The CBIR system compares this query image to a database of stored images (cases) to retrieve similar cases and, based on the retrieved cases, suggests solutions or insights for the query image.
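To make this flow concrete, the minimal sketch below wires the stages together in Python. The helpers `preprocess`, `hwht_decompose`, `glcm_features`, and `bhattacharyya_coefficient` are illustrative functions sketched in the following sections (not the authors’ released code), `mask` stands in for the SCARO-selected feature subset, and a plain in-memory list stands in for the feature database.

```python
import numpy as np

# Minimal sketch of the offline indexing and online retrieval flow described
# above; helper functions are sketched in the corresponding sections below.

def to_uint8(x):
    """Rescale real-valued subband coefficients to 8-bit range for the GLCM."""
    x = (x - x.min()) / (x.max() - x.min() + 1e-8)
    return (255 * x).astype(np.uint8)

def index_image(img, label, database, mask):
    """Offline phase: store the optimized feature vector and class label."""
    coeffs, _ = hwht_decompose(preprocess(img))        # decomposition
    features = glcm_features(to_uint8(coeffs))[mask]   # extraction + selection
    database.append((features, label))

def retrieve(query_img, database, mask, top_k=2):
    """Online phase: rank stored images by Bhattacharyya similarity to the query."""
    coeffs, _ = hwht_decompose(preprocess(query_img))
    q = glcm_features(to_uint8(coeffs))[mask]
    return sorted(database,
                  key=lambda entry: bhattacharyya_coefficient(q, entry[0]),
                  reverse=True)[:top_k]
```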
Pre-processing
Noise reduction
The accuracy of medical diagnosis may be negatively affected by noise, which degrades the quality of images. Several algorithms have been used to enhance the quality of medical images. A new hybrid technique is proposed to decrease noisy speckles in medical images. This technique utilizes parallel computing. The approach effectively uses both local and non-local information to integrate the benefits of several denoising filters.
Normalization
Image normalization is the technique of adjusting an image’s pixel values so that a machine learning (ML) model can process them more effectively. This is achieved by scaling pixel values to a common range or by standardizing them through mean subtraction and division by the standard deviation.
Resizing and contrast enhancement
The procedure of resizing an image involves changing its original dimensions. Contrast enhancement improves object visibility in a scene by increasing the brightness difference between objects and their backgrounds.
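A compact sketch of these pre-processing steps, assuming OpenCV and a grayscale uint8 input; the 224 × 224 target size, CLAHE settings, and denoising strength are illustrative choices, not parameters reported in this work.

```python
import cv2
import numpy as np

def preprocess(img, size=(224, 224)):
    """Pre-processing sketch: denoise, resize, enhance contrast, normalize.
    Expects a grayscale uint8 image; all parameter values are illustrative."""
    img = cv2.fastNlMeansDenoising(img, h=10)                   # noise reduction
    img = cv2.resize(img, size)                                 # resizing
    clahe = cv2.createCLAHE(clipLimit=2.0, tileGridSize=(8, 8))
    img = clahe.apply(img)                                      # contrast enhancement
    img = img.astype(np.float32)
    return (img - img.mean()) / (img.std() + 1e-8)              # zero-mean, unit-variance
```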
Decompose the images by using hybridized Wavelet-Hadamard transform (HWHT)
Using the HWHT, images are broken down into many subbands, capturing both high- and low-frequency components to aid in deep content analysis33. A spatial frequency decomposition technique and a multi-resolution technique, the discrete wavelet transform (DWT), are used to divide a picture into sets of adjacent wavelet sub-bands. A DWT transformation splits an image into four groups of different sizes, called low-low (LL), high-high (HH), low–high (LH), and high-low (HL). Less computationally intensive watermarking approaches remain possible through the DWT process, which presents a hierarchical representation of the deconstructed image. When employing DWT for data hiding, three aspects were taken into consideration regarding the regulation and reliability of watermarked images. First, several filter banks, including Symlet-8, Daubechies, and Haar functions, can be employed to hide data. Second, a variety of watermarking techniques employ multiple levels of decomposition, ranging from one to five layers. Third, to avoid quality deterioration, the watermark can be embedded using any one of the four available sub-bands, utilizing various depth factors.
Similarly, in image decomposition, the Hadamard transform (HT) is employed for its simplicity and efficiency, which is particularly helpful for applications such as denoising, pattern identification, and picture compression. The transform operates on images whose dimensions are powers of two, such as \(2 \times 2,\,4 \times 4,\,8 \times 8,\, \ldots ,\,256 \times 256\), because Hadamard matrices exist only for powers of two. These matrices are orthogonal and consist of + 1 and − 1 elements, which makes them computationally efficient; their operation involves only additions and subtractions. In the decomposition process, the image is split into non-overlapping blocks, and each block is transformed using the Hadamard matrix. Most of the energy is compacted into a small number of coefficients as the spatial-domain data is converted to the frequency domain.
Meanwhile, the image is expressed on a new basis by the transformed coefficients, which reduce redundancy and highlight significant features. In reconstruction, the inverse Hadamard transform is applied to the compressed or modified coefficients of each block to return the image to its spatial domain. Because the Hadamard matrix is orthogonal, its inverse transform is simply its transpose. This allows the original image to be accurately or exactly reconstructed, depending on the amount of information retained during decomposition.
A generalized variant of the Fourier transform, the WHT is a linear orthogonal transformation. The Hadamard transform’s row and column order is not frequently interpreted. An image’s Hadamard transformation involves only real additions and subtractions. The WHT elementary matrix is formed of [1, −1] entries, as illustrated in Eq. (1).
Here, the Hadamard transform was denoted as \(P\).
The standard formulation of the Hadamard transform is recursive; higher-order Hadamard matrices of size \(2^{M}\) are built as \(P_{2M} = \begin{bmatrix} P_{M} & P_{M} \\ P_{M} & - P_{M} \end{bmatrix}\).
Here, the block specific weight matrices are denoted as \(W_{1}\), \(W_{2}\), \(W_{3}\) and \(W_{4}\).
Compared to embedding a watermark in other well-known transform domains, less image information is changed when the watermark is placed in the Hadamard region. The combination of HT and WT improves noise reduction. HT’s simplified operation minimizes the amount of noise amplification that occurs during transformations.
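The hybrid decomposition can be sketched as a single-level Haar DWT followed by a blockwise Hadamard transform of the LL subband. The block size, the wavelet choice, and the decision to transform only the LL subband are assumptions for illustration, since the section does not fix these details.

```python
import numpy as np
import pywt
from scipy.linalg import hadamard

def hwht_decompose(img, block=8):
    """HWHT sketch: a Haar DWT yields LL/LH/HL/HH subbands, then each
    non-overlapping block of the LL subband is projected onto the Hadamard
    basis via H @ B @ H.T. Because H is orthonormal, H.T @ C @ H inverts
    the block transform exactly."""
    LL, (LH, HL, HH) = pywt.dwt2(img, 'haar')        # wavelet subbands
    H = hadamard(block) / np.sqrt(block)             # orthonormal +1/-1 basis
    h = (LL.shape[0] // block) * block
    w = (LL.shape[1] // block) * block
    coeffs = LL[:h, :w].copy()
    for i in range(0, h, block):
        for j in range(0, w, block):
            coeffs[i:i+block, j:j+block] = (
                H @ coeffs[i:i+block, j:j+block] @ H.T
            )
    return coeffs, (LH, HL, HH)
```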
Feature extraction by gray level co-occurrence matrix
Features are the pieces of data that images need to solve specific problems, and they represent key parts of the images. Feature extraction is the approach that collects an image’s visual components in order to decrease the number of resources needed. To extract statistical texture qualities, the GLCM was utilized in this study34. Texture features are important low-level properties used to define and measure the apparent texture of a picture. The GLCM method used in image processing extracts texture features by analyzing the spatial relationship between pixel intensities in a grayscale image. In the GLCM, an element \(\left( {i,j} \right)\) represents the number of times a pixel with gray level \(i\) occurs adjacent to a pixel with gray level \(j\) in a specific direction and at a specific distance. If the GLCM value at \(\left( {i,j} \right)\) is high, the intensity pair \(\left( {i,j} \right)\) appears frequently in that spatial arrangement, indicating a recurring texture or strong pattern; a low GLCM value indicates that the \(\left( {i,j} \right)\) pair is uncommon at that distance and direction. The GLCM is calculated by choosing a sliding window, direction, and distance; its common features are contrast, homogeneity, energy, and correlation. Using statistical distributions of intensity-value combinations at different relative locations, this method is commonly used to extract second-order statistical texture properties from images. The number of intensity points considered determines the order of the statistics, from first order to higher order; higher-order statistics are not implemented here due to their computational complexity. The resulting texture-based properties include inverse difference moment (IDM), energy, correlation, entropy, sum variance, homogeneity, IDM normalized, contrast, maximum probability, dissimilarity, autocorrelation, etc. A handful of these are expressed as follows:
Energy
It gives the square root of the sum of the squared GLCM entries and distinguishes homogeneous areas from non-homogeneous ones, as given in Eq. (4): \(Energy = \sqrt{\sum_{a,b} P_{ab}^{2}}\).
Here, the normalized likelihood of co-occurrence within grey levels \(a\) and \(b\) is denoted as \(P_{ab}\). The total amount of distinct grey levels present in the image is represented as \(M\).
Contrast
It calculates the strength of the intensity difference between a pixel and its neighbor over the entire image, as illustrated in Eq. (5): \(Contrast = \sum_{a,b} \left( {a - b} \right)^{2} P_{ab}\).
Here, the grey level present in the image is represented as \(a,b\).
Correlation
It measures the linear relationship between gray tones in an image, representing the correlation between a pixel and its neighbors, as demonstrated in Eq. (6): \(Correlation = \sum_{a,b} \frac{\left( {a - \mu } \right)\left( {b - \mu } \right)P_{ab} }{\sigma^{2} }\).
Here, the mean of the GLCM matrix is represented as \(\mu\), and the variance of the grey levels is denoted as \(\sigma^{2}\).
Homogeneity
It describes how similar neighboring pixels are to one another, as described in Eq. (7): \(Homogeneity = \sum_{a,b} \frac{P_{ab} }{1 + \left( {a - b} \right)^{2} }\).
Here, the normalized GLCM matrix is denoted as \(P_{ab}\). The robust descriptors provided by GLCM features, including energy, entropy, contrast, and correlation, help increase the accuracy of image retrieval systems by discriminating between healthy and diseased tissues.
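These descriptors can be computed with scikit-image’s GLCM utilities; the distance/angle grid below is an illustrative choice, and `graycoprops` implements the standard definitions corresponding to Eqs. (4)–(7).

```python
import numpy as np
from skimage.feature import graycomatrix, graycoprops

def glcm_features(img_u8, distances=(1,),
                  angles=(0, np.pi / 4, np.pi / 2, 3 * np.pi / 4)):
    """Second-order texture descriptors from the normalized GLCM of a uint8 image."""
    glcm = graycomatrix(img_u8, distances=distances, angles=angles,
                        levels=256, symmetric=True, normed=True)
    props = ('contrast', 'correlation', 'energy', 'homogeneity', 'dissimilarity')
    return np.concatenate([graycoprops(glcm, p).ravel() for p in props])
```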
Dimensionality reduction by employing sine chaos based artificial rabbit optimization
SCARO was selected over traditional models due to its higher success in solving tasks that involve complicated, high-dimensional spaces, which are typical in medical image retrieval. Unlike PSO and GA, SCARO combines rabbit behaviours (detour foraging and random hiding) with chaos by including a sine map. This leaves less chance for the algorithm to converge prematurely, a common drawback of PSO and GA when searching for multiple solutions. SCARO also adapts its approach via an energy component that helps it choose the best set of features and achieve a better classification rate. The underlying ARO is an efficient optimization model that mathematically represents the survival strategies actually used by rabbits (search agents)35. This approach comprises two imitation techniques: detour foraging and random hiding. To avoid attackers, the search agent initially eats grass around neighboring nests; this strategy simulates the detour foraging technique. Second, the possibility of predators capturing a search agent can be decreased by employing the randomized hiding technique. Finally, as rabbits lose energy, they may switch from detour foraging to hiding more frequently.
The features extracted in this work primarily include texture based attributes such as energy, correlation, contrast, maximum probability, auto-correlation, dissimilarity, sum variance, and homogeneity derived from GLCM, which represent second order statistical properties essential for differentiating medical image patterns. SCARO mechanism is used to select the best subset of features, which balances exploration and exploitation by simulating rabbit foraging behaviour and chaotic dynamics. This method evaluates candidate feature subsets based on their contribution to classification accuracy, minimizing redundancy while preserving discriminatory power. The result is an optimal, compact feature set that enhances retrieval performance and reduces computational overhead. The population of the search agent’s position is updated in each iteration based on the given method’s criteria, and it is then evaluated by the objective function.
In this model, the search for new food locations (while escaping from threats) in ARO simulates exploration, which promotes a global search for solutions and delays premature gathering at a convergence point. Rabbits hide at random in nearby burrows, which focuses the search on the area where good solutions were found. Based on the number of iterations performed, the algorithm adjusts the balance between these stages using an energy factor. The algorithm stops after 300 iterations, once the objective function (classification accuracy) has stabilized, as indicated by generation-to-generation fitness improvements. According to Eq. (8), each position in the initial population is assigned to a randomized site inside the search space.
Here, the position of every search agent is represented as \(Rs_{i}\). The upper and lower limits of the design variables are represented as \(UB\) and \(LB\). The total numbers of variables and search agents are denoted as \(\dim\) and \(M_{Rs}\). In accordance with ARO’s detour foraging strategy, each search agent moves toward a region occupied by another agent chosen at random from the swarm, engaging in distraction. Equations (9)–(12) illustrate the mathematical representation of detour foraging.
Here, the current iteration is represented as \(iter\). The existing and new positions of the kth search agent are indicated as \(Rs_{k}\) and \(MRs_{k}\). The three random quantities that lie in the range \(\left[ {0,1} \right]\) are denoted as \(r_{1} ,r_{2}\), and \(r_{3}\). The maximum number of iterations is represented as \(Iter_{\max }\).
To avoid predators, the search agent may often dig many tunnels close to its nest for protection. It is illustrated in Eqs. (13)–(16).
Here, the value of hiding is represented as \(H\). The jth burrow of kth search agent is represented as \(r_{k,j}\). The mathematical representation for the randomized hiding technique is expressed in Eq. (17).
The shift from the exploration stage, which is linked to detour foraging, to the exploitation phase, which is characterized by randomized hiding, is modeled with an energy component, as explained in Eq. (18).
Here, the energy factor is denoted as \(AF\). A parameter from the original equation is indicated as \(r\). The Sine map for a chaotic sequence was represented as \(x_{m + 1} = r\sin \left( {\pi x_{m} } \right)\).
The dynamic disturbances created by the sine chaos allow SCARO to effectively tackle high-dimensional, complicated, and non-linear optimization issues. Table 2 represents the algorithm for SCARO.
Chaotic updates introduce unfamiliar but regulated fluctuations in the search process, preventing the population from being kept in local optima. The sine chaotic function is updated to the population initialization approach to enhance exploration in binary search spaces. By introducing this function, it provides non-repetitive ergodic and highly diversified starting solutions, which help to avoid premature convergence and local optima. The sine map produces a chaotic sequence that ensures a wide and uniform coverage of the search space, enhancing the algorithm’s ability to explore feature subsets thoroughly. This randomness prevents early convergence, a common issue in binary optimization problems. This approach helps to maintain a balance between intensification and diversification during the search process. It increases the chance of finding globally optimal or near optimal feature subsets in complex binary domains.
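The sketch below illustrates the SCARO idea for binary feature selection: a sine-map chaotic warm-up of the population, detour-foraging moves toward random peers while the energy factor is high, and random hiding near the best agent once it drops. The update rules are deliberately simplified stand-ins for Eqs. (9)–(18), and `fitness` is any user-supplied function (e.g., validation accuracy of a classifier on the masked features).

```python
import numpy as np

def sine_map(x, r=0.9):
    """Sine chaotic map x_{m+1} = r * sin(pi * x_m) used for initialization."""
    return r * np.sin(np.pi * x)

def scaro_select(fitness, n_features, pop=20, iters=300, seed=0):
    """Simplified SCARO sketch for binary feature selection (threshold 0.5)."""
    rng = np.random.default_rng(seed)
    x = rng.random((pop, n_features))
    for _ in range(5):                      # chaotic warm-up of the population
        x = sine_map(x)
    scores = np.array([fitness(v > 0.5) for v in x])
    best, best_score = x[scores.argmax()].copy(), scores.max()
    for t in range(iters):
        # energy factor shrinks over iterations: exploration -> exploitation
        energy = 4 * (1 - t / iters) * np.log(1 / (rng.random() + 1e-12))
        for k in range(pop):
            if energy > 1:                  # detour foraging toward a random peer
                peer = x[rng.integers(pop)]
                cand = x[k] + rng.random() * (peer - x[k])
            else:                           # random hiding near the best agent
                cand = best + 0.1 * rng.standard_normal(n_features)
            cand = np.clip(cand, 0, 1)
            f = fitness(cand > 0.5)
            if f >= scores[k]:              # greedy replacement
                x[k], scores[k] = cand, f
                if f >= best_score:
                    best, best_score = cand.copy(), f
    return best > 0.5                       # boolean mask of selected features
```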
Classification by using Bhattacharya context performance aware global attention-based transformer
Originally designed to extract global sequence information, local convolution features, and the spectral dimension, the GAT module consists of two embedding tokens and an attention transformer. Similar token embedding blocks and encoders utilized in the global–local spatial convolutional transformer (GACT) module can then improve the capacity to extract global–local spatial characteristics. Both local spatial linkages and long-range dependencies between pixels are obtained in the spatial dimension using transformers and convolution kernels. The spectral and spatial branches are used to determine the global–local spectral and spatial properties, which are then adaptively combined during model training. The merging is completed by multiplying a collection of learnable weights, applied after the features have been convolved by various kernels, which generates the resulting multiscale features. The structural diagram of BCGAT is shown in Fig. 2.
In the 3-dimensional (3D) space, the BCGAT additionally limits information loss and excessive computation by avoiding pooling and linear flattening procedures. Given the input feature map \(F_{input} \in R^{A \times B \times C}\), where \(A\) and \(B\) represent the spatial dimensions, the intermediate feature map \(F_{inter}\) and output feature map \(F_{output}\) can be expressed as in Eqs. (19)–(20).
Here, the spatial and spectral feature maps were represented as \(P_{spa}\) and \(P_{spe}\). The element-wise multiplication is denoted as \(\otimes\).
The vanilla ViT is unable to extract local spectrum characteristics and instead focuses on the long-range dependencies across all spectral bands. In contrast, convolution kernels with inductive bias successfully pay more attention to the local information contained in these close bands. The GAT aims to obtain both local and global HSI information by utilizing both a transformer and a convolution.
The activation at position \(\left( {a,b,c} \right)\) on the pth feature map in the qth 3D convolutional layer can be represented as in Eq. (21).
Here, the activation function is characterized as \(f\left( . \right)\). The width and height of the 3-D convolution kernel were represented as \(R_{p}\) and \(H_{p}\).
The convolutional token embedding block uses three 3D convolutional layers, one residual connection layer, and the spectral dimension; the convolution tokens \(P_{spe}\) may be expressed as in Eq. (22).
The size of the channel in convolution kernels is denoted as \(k_{1} ,k_{2}\) and \(k_{3}\).
The correlation between two probability distributions \(P\) and \(Q\) is measured statistically by the Bhattacharyya coefficient (BC)35. It is mathematically expressed in Eq. (23) as \(BC\left( {P,Q} \right) = \sum_{i = 1}^{m} \sqrt {P_{i} Q_{i} }\).
The overall number of unique bins involved in the distributions is denoted as \(m\).
BC and performance-aware global attention allow the model to collect global patterns as well as local characteristics (necessary for identifying small abnormalities). In order to accurately classify and retrieve images that are similar but clinically distinct, the BC is essential for medical diagnosis. BCGAT can help radiologists and doctors find pertinent instances more rapidly, boosting diagnostic efficiency and decision-making by increasing retrieval accuracy and robustness.
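A direct implementation of this similarity measure, assuming the feature vectors are non-negative and normalized so that they behave like discrete distributions over m bins:

```python
import numpy as np

def bhattacharyya_coefficient(p, q, eps=1e-12):
    """BC(P, Q) = sum_i sqrt(P_i * Q_i); 1 for identical distributions,
    0 for disjoint ones. Inputs are clipped and renormalized to act like
    histograms before the coefficient is computed."""
    p = np.clip(np.asarray(p, dtype=float), 0, None)
    q = np.clip(np.asarray(q, dtype=float), 0, None)
    p /= p.sum() + eps
    q /= q.sum() + eps
    return float(np.sum(np.sqrt(p * q)))
```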
Results and discussion
In this study, the CBIR framework for X-ray images is proposed and classified by using advanced techniques. The proposed method is executed by employing the COVID-19 chest x-ray dataset. The details of the hyperparameters present in the proposed model are depicted in Table 3.
Dataset description
COVID-19 Chest X-ray image dataset description36
The dataset is organized into 2 folders (train, test), and both contain 3 subfolders (COVID19, PNEUMONIA, NORMAL). The dataset contains a total of 6432 X-ray images, and the test data comprises 20% of the total. The test split contains 116, 317, and 855 samples for COVID-19, normal, and pneumonia, respectively; the training split contains 460 COVID-19, 1266 normal, and 3418 pneumonia samples. Table 4 describes the number of samples along with the training and testing splits. The visualization of sample images is shown below in Fig. 3.
Visualization of sample images COVID-19 Chest X-ray image dataset (https://www.kaggle.com/datasets/prashant268/chest-xray-covid19-pneumonia).
Chest X-ray image (pneumonia) dataset37
The chest X-ray (pneumonia) dataset is organized into three categories: training, testing, and validation. Each category contains subfolders, namely pneumonia and normal, for a total of 5863 images. For analysis, all chest radiographs were initially screened for quality control, removing all unreadable or low-quality scans. Two experienced physicians verified the images’ diagnoses before the performance review. The visualization of sample images is shown in Fig. 4, and the number of training and testing samples for the Chest X-ray image (Pneumonia) dataset is given in Table 5.
Visualization of sample images from the Chest X-ray (Pneumonia) dataset. (https://www.kaggle.com/datasets/paultimothymooney/chest-xray-pneumonia/data).
The National Institute of Health (NIH Chest X-ray-8) dataset39
The National Institute of Health (NIH Chest X-ray-8) dataset consists of about 112,000 frontal-view X-ray images from over 30,000 people, taken between 1992 and 2015. Each X-ray image may carry multiple positive diagnoses. The collection includes 44,810 AP-view images and 67,310 PA-view images from 16,630 male and 14,175 female patients. Figure 5 shows the visualization of sample images from the NIH Chest X-ray-8 dataset.
Visualization of sample images from the NIH Chest X-ray-8 dataset. https://www.kaggle.com/datasets/nih-chest-xrays/data.
Performance evaluation for COVID-19 chest X-ray image
In order to classify the X-ray images, the performance of the proposed model is evaluated and compared with other existing models, such as Bi-LSTM, RNN, GRU, DenseNet 169, and MobileNet. The comparisons of accuracy, precision, recall, F1-score, MAE, MSE, and RMSE are demonstrated in the figures below.
The quantitative analysis of model performance based on accuracy and precision is illustrated in Fig. 6a–b. The proposed model achieved high accuracy and precision values of 99.54% and 97.13%, respectively, whereas the existing models attained lower values. Some limitations occurred in the existing models: Bi-LSTM had an unstable training process that consumed more time, while the GRU model showed less scalability and lacks interpretability. Moreover, the proposed model consumed less training time and used fewer computational resources.
The analysis of classification performance based on recall and F1-score highlights the model’s ability, as graphically represented in Fig. 7a and b. The proposed model achieved high recall and F1-score values of 97.13% each, whereas the existing models attained lower values. Among the existing models, RNN is not suitable for high-dimensional data and faces the vanishing gradient problem, which makes parameter updates and training difficult. The proposed system can quickly classify and display the relevant image with high quality and reduced complexity.
Figure 8a–b illustrates the MAE and MSE performance; these metrics provide a straightforward interpretation, with MSE heavily penalizing large errors. The proposed model’s MAE was lower at 0.0046, whereas the existing models showed higher values; the existing models do not fit the data well and are less sensitive to outliers. The MSE reached a low value of 0.0093 for the proposed model, while the existing models obtained higher values and showed difficulty in interpreting and capturing the error.
Figure 9 demonstrates the RMSE-based error analysis, representing the model’s average prediction error magnitude. The RMSE performance is compared between the existing and proposed models: the existing models attained high values, whereas the proposed model attained a low value of 0.09635. The existing models’ RMSE does not accurately reflect model performance, and large errors might go undetected; the proposed model minimized the large errors.
The ROC curve in Fig. 10 provides a visual and quantitative measure of how well the model distinguishes between different classes in the X-ray image data. It graphically demonstrates the performance of a classifier model at various threshold conditions: the ROC is visualized by plotting the true positive rate (TPR) against the false positive rate (FPR) for each threshold setting.
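For reference, a ROC curve and its area can be obtained in a few lines with scikit-learn; the labels and scores below are made-up values purely for illustration.

```python
import numpy as np
from sklearn.metrics import roc_curve, auc

# Illustrative data: y_true holds binary class labels and y_score the
# classifier's predicted probabilities for the positive class.
y_true = np.array([0, 0, 1, 1, 1, 0, 1, 0])
y_score = np.array([0.1, 0.4, 0.8, 0.7, 0.9, 0.3, 0.6, 0.2])

fpr, tpr, thresholds = roc_curve(y_true, y_score)  # one (FPR, TPR) point per threshold
print(f"AUC = {auc(fpr, tpr):.3f}")                # area under the TPR-vs-FPR curve
```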
Figure 11a–b illustrates the training performance in terms of accuracy and loss. The accuracy curve tracks the proportion of correctly predicted cases over training epochs; the comparison of existing and proposed models for training accuracy is given in Fig. 11a. The proposed model’s accuracy rises from 0.65 to 0.90 over epochs 0 to 300, while the existing models attain lower values at the respective epochs. Figure 11b depicts the training loss for the proposed and existing models: the proposed model’s loss decreases from 0.38 to 0.15 between epochs 0 and 300, whereas the existing models attain higher values.
Figure 12a–b presents the validation accuracy and loss across epochs, highlighting the proposed model’s improvements over the existing models. The validation accuracy comparison is shown in Fig. 12a: the suggested model’s validation accuracy rises from 0.65 to 0.95 over epochs 0 to 300, whereas the existing models remain at lower values at the corresponding epochs. The validation loss comparison is illustrated in Fig. 12b: the proposed model’s loss decreases from 0.39 to 0.05 over epochs 0 to 300, whereas the existing models attain higher loss values.
Performance evaluation for chest X-ray image (pneumonia)
Figure 13a–b specifies the model performance based on accuracy and precision, which describe the quality and correctness of positive predictions. The proposed model achieved high values of 98.60% and 98.49%, respectively, whereas the existing models obtained lower values. The existing DenseNet169 required heavy computational resources and consumed more memory, while the existing MobileNet model was highly complex and difficult to interpret.
Figure 14a–b shows the comparative analysis of recall and F1-score; the proposed model improved overall performance over the existing models, achieving high recall and F1-score values of 97.40% and 98.50%, respectively. The existing RNN and BiLSTM models have scalability issues, their parameter tuning can be complex, and they may overfit large datasets.
The comparison of MAE and MSE in Fig. 15a–b emphasizes the proposed model’s robustness and significantly reduced sensitivity to large errors. The proposed model achieved low MAE and MSE values of 0.9153 and 0.9035, respectively, whereas the existing models obtained higher values. The existing models’ MAE reflects a high loss and does not convey confidence in the classification process; moreover, imbalanced data makes the existing models more complex.
The RMSE-based model performance assessment is shown in Fig. 16. This analysis signifies the typical size of prediction errors and the model’s sensitivity to outliers. The proposed model attains a low value of 1.1223, whereas the existing models show high error values along with a lack of interpretability and scalability. The proposed model handles large errors more effectively, and its optimization produces a clear training process in less time.
The ROC curve illustrating the performance of the chest X-ray image retrieval system is shown in Fig. 17. The performance of the proposed and existing models at various threshold conditions is specified graphically: the receiver operating characteristic (ROC) curve plots the true positive rate (TPR) against the false positive rate (FPR) at each threshold setting.
The training accuracy and loss curves over 300 epochs represent the model’s learning progression; their comparison across the existing and proposed models is shown in Fig. 18a–b.
The proposed model’s training accuracy rises from 0.63 to 0.93 over epochs 0 to 300, while the existing models show lower values at the respective epochs. The training loss of the existing and proposed models is illustrated in Fig. 18b: the proposed model’s loss decreases from 0.39 to 0.07 over the corresponding epochs, whereas the existing models’ training loss remains higher.
The validation accuracy and loss curves over 300 epochs represent the model’s learning progression; their comparison for the existing and proposed models is shown in Fig. 19a and b. The validation accuracy comparison in Fig. 19a shows the proposed model rising from 0.6 to 0.95, whereas the existing models attain much lower values at the corresponding epochs. Figure 19b illustrates the validation loss: the proposed model decreases from 0.4 to 0.05 between epochs 0 and 300, while the existing models remain at higher values.
Performance analysis of the NIH Chest X-ray-8 dataset
The performance evaluation on the NIH Chest X-ray-8 dataset, comparing the existing and proposed models, is shown in Table 6.
Comparative analysis
Table 7 represents the comparison of existing and proposed models; Table 8 shows the existing and proposed models for the COVID-19 chest X-ray image dataset38. Table 9 represents the class-wise mAP with various measures26. Table 10 represents the ablation study of the proposed model. Table 11 represents the statistical analysis of the existing and proposed models using the Wilcoxon test, a non-parametric test also known as the rank-sum test or signed-rank test, which compares two paired groups. The mean, standard deviation, P-value, and test statistic determined by this test are described in Table 11.
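As a reference for this statistical comparison, the paired Wilcoxon signed-rank test is available in SciPy; the per-fold accuracy values below are invented purely to show the call, not results from this paper.

```python
import numpy as np
from scipy.stats import wilcoxon

# Hypothetical paired per-fold accuracies of an existing model vs. the proposed one.
existing = np.array([0.910, 0.920, 0.900, 0.930, 0.910])
proposed = np.array([0.985, 0.990, 0.987, 0.992, 0.988])

stat, p_value = wilcoxon(existing, proposed)  # non-parametric, paired signed-rank test
print(f"statistic={stat:.3f}, p-value={p_value:.4f}")  # p < 0.05 -> significant difference
```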
Qualitative analysis for the retrieval of images from the proposed model
Table 12 presents the visual comparison between the query input images and the top-2 ranked images retrieved by the system. The retrieved images are finalized based on the BC mechanism, which checks them against the image feature database.
Computational efficiency of the proposed model
Table 13 represents the computational efficiency analysis in terms of accuracy, training time, inference time, model size, and scalability of the model.
The proposed model outperforms all existing models across three different datasets, achieving the highest accuracies of 99.54%, 98.60%, and 99.67%. It demonstrates strong scalability and significantly reduced training and inference times, making it well-suited for real-time clinical applications. Additionally, the smallest model size (20 MB) and lowest FLOPs (0.7) offer an efficient balance between performance and computational demands. In terms of computational cost, the proposed model requires fewer floating-point operations (FLOPs) and less memory than deep models like DenseNet 201 and MobileNetV2, resulting in lower energy consumption and faster deployment on edge devices.
Scalability of the proposed model
The proposed model demonstrates strong scalability and robustness to noise, making it well-suited for real-world medical imaging environments. Scalability is achieved through the integration of HWHT and SCARO, which efficiently decompose images and reduce feature dimensionality. This approach allows for handling a large amount of data with minimal computational overhead. These parallel and optimized architectures enable faster and consistent performance even as the dataset size increases.
Robustness against noisy and low quality inputs
To assess the robustness of the proposed model under realistic imaging challenges, its performance was evaluated on modified datasets containing artificially induced noise, low resolution, and spatial misalignment. To explicitly evaluate the model’s robustness when pre-processing fails, controlled experiments were carried out with and without standard enhancement procedures such as normalization, denoising, and contrast adjustment.
Three test scenarios were created using the COVID-19 chest X-ray dataset: raw grayscale images without any normalization, images corrupted with additive Gaussian noise (\(\sigma = 0.03\)), and images with reduced resolution (downscaled by 50% and upscaled back using bilinear interpolation). The proposed model was then tested without altering any internal hyperparameters. The results indicated that the model retained high retrieval accuracy, achieving 97.46% on normalized images, 96.85% under noisy conditions, and 96.12% on resolution-degraded inputs. These results suggest that the hybrid HWHT-GLCM feature pipeline combined with the attention-enhanced BCGAT classifier allows the model to preserve essential semantic and spatial relationships even when typical pre-processing pipelines are absent. In real-world clinical situations, where image quality and consistency cannot always be guaranteed, such robustness is extremely important.
Clinical significance of accurate image retrieval in diagnostic scenarios
Accurate medical image retrieval plays a critical role in clinical diagnostics, especially in high-stakes scenarios such as differentiating between viral and bacterial pneumonia, or early-stage versus advanced COVID-19. By retrieving clinically relevant and similar historical cases, radiologists can compare disease progression patterns, validate ambiguous findings, and make more informed decisions. This is particularly valuable in situations with limited access to expert opinions or rare pathological presentations. The proposed system can retrieve diagnostically meaningful images based on content rather than metadata, ensuring higher diagnostic precision, reducing inter-observer variability, and supporting faster evidence-based treatment planning, thereby significantly enhancing the quality and efficiency of clinical workflows.
Real world challenges
The proposed system is designed with scalability and modular integration in mind, making it well suited for deployment within hospital PACS environments. The lightweight architecture, with a compact model size of 20 MB and low computational demands, enables smooth integration without overburdening existing infrastructure. The retrieval and classification components can be containerized and deployed as microservices, allowing the system to function alongside existing DICOM workflows. Real-time case-based comparison during radiological reviews is supported by the transformer-based retrieval mechanism, integrated through a standard API. Future work will include extending DICOM compatibility to ensure seamless integration with electronic health records. Patient privacy and data security are paramount in medical image analysis systems. The proposed system can be deployed on premises within hospital firewalls, eliminating the need to transmit sensitive patient images to external servers. All images will be anonymized before retrieval processing, and secure encryption protocols will be applied to protect data at rest and in transit. Future enhancements will consider federated learning for secure collaborative training across institutions without sharing raw data, further bolstering privacy and reducing data centralization risks.
The model was benchmarked on a standard workstation configuration (Intel Core i7 CPU, 16 GB RAM, NVIDIA RTX 3060 GPU), and critical operational metrics were evaluated to assess the feasibility of the suggested CBIR framework in real-world clinical contexts. The model maintains a small footprint, with a quantized size of just 20 MB; at run time, the maximum memory footprint is less than 220 MB, so it can be integrated into hospital PACS. The average inference time for a single image was 43 ms, and the total retrieval time including feature comparison with BC was 97 ms per query. Under batch-processing load, the system showed a throughput of 10–12 queries per second. These deployment figures confirm that the suggested model combines high diagnostic accuracy with viable applicability in real-time conditions. Its lightweight design does not require substantial computing services, which is essential in resource-limited healthcare facilities where rapid and precise decision support is important.
Some other comparative analysis
Table 14 shows the comparative analysis for the COVID-19 chest X-ray image dataset40. Table 15 illustrates a comparison on the National Institute of Health (NIH Chest X-ray-8) dataset41. Table 16 presents the comparative analysis of various existing similarity measures against BC. Table 17 presents the performance evaluation of existing advanced transformer models alongside the proposed approach.
Discussion
In this section, the efficiency of the classification model based on the CBIR framework for X-ray images is thoroughly discussed, and the performance of the existing models is compared with the advanced proposed model. In existing global attention approaches, complexity makes parameter tuning difficult, performance may show overfitting or underfitting issues, and larger datasets may reduce scalability; the classification models’ performance degraded due to less flexibility. The metric comparison between existing and proposed models reveals limitations in the existing models: the accuracy of the DenseNet model is low due to memory inefficiency and the vanishing gradient problem, and its training and optimization can be difficult.
Furthermore, as the number of layers increases, so does the number of feature maps, leading to high computational costs. The low accuracy and recall of the MobileNet model stem from the dataset size and its deep convolutional layers, which also carry a high computational cost; the model cannot extract high-level features and is not suitable for real-time environments. The accuracy, precision, and error metrics of the Bi-LSTM, GRU, and RNN models were likewise lower. The GRU model yields reduced accuracy because of noisy data sequences and fitting problems during optimization, and its limited interpretability has degraded image-categorization performance. In contrast, the proposed model improves accuracy and model efficacy by attending to the contextual sequence. The deep architectural layers of conventional models consume more memory, which lowers accuracy and hinders real-time retrieval performance; the proposed approach effectively overcomes these issues, and the conventional DL models consequently compare much less favorably. Among the comparison models, DenseNet-129 is better suited to extracting spatial features from images, whereas the sequence-oriented BiLSTM, RNN, and GRU find it challenging to capture spatial information from X-ray images.
In contrast, the classic DenseNet-129 model performs well in feature extraction, but its dense connection layers can be computationally burdensome and cause performance degradation. Similarly, the lightweight design of the MobileNet model may hamper its ability to extract subtle, discriminative medical information from X-rays, impairing retrieval accuracy. BiLSTM, RNN, and GRU suffer from vanishing and exploding gradient problems and provide essentially no support for spatial feature extraction; they also lose the capacity to model long-term dependencies, which makes them inefficient for the CBMIR task. By combining context awareness, statistical distance measurement, and a sophisticated attention mechanism, the proposed BCGAT model increases classification accuracy. The Bhattacharyya distance is incorporated to improve class separability and reduce overlap, and the context performance-aware mechanism allows the model to focus on the most discriminative features by adjusting attention weights according to the relevance of contextual information and performance.
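For reference, the Bhattacharyya coefficient between two normalized feature histograms p and q is BC(p, q) = Σ√(p_i q_i), and the corresponding Bhattacharyya distance is D_B = −ln BC. The minimal NumPy sketch below illustrates the computation on toy histograms; the 32-bin representation is an arbitrary choice for illustration, not the feature dimensionality used in this work.

```python
import numpy as np

def bhattacharyya(p: np.ndarray, q: np.ndarray) -> tuple[float, float]:
    """Return (coefficient, distance) for two non-negative histograms."""
    p = p / p.sum()  # normalize to probability distributions
    q = q / q.sum()
    bc = float(np.sum(np.sqrt(p * q)))  # 1.0 for identical histograms
    dist = -np.log(max(bc, 1e-12))      # guard against log(0)
    return bc, dist

# Toy example: a query histogram and a near-duplicate "stored image" histogram
rng = np.random.default_rng(0)
p = rng.random(32)
q = p + 0.1 * rng.random(32)
print(bhattacharyya(p, q))  # coefficient close to 1, distance close to 0
```

A coefficient near 1 (distance near 0) marks a stored image as highly similar to the query, which is how the ranking for retrieval is obtained.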
Additionally, the global attention module captures long-range dependencies and relationships across the entire image. Consequently, the incorporation of these innovative techniques enables more substantial, reliable, and precise classification. Table 18 presents this performance comparison.
Application of image retrieval based on X-ray images
Chest X-ray (CXR) images are acquired from a variety of hardware platforms, and radiographs spanning adult to pediatric anatomy can be digitally reconstructed. Most commercial medical-imaging products are applicable to CXR; however, image-based commercial products must cope with many abnormalities and heterogeneous labels with numerous possible reporting results. Commercial CXR pipelines have additionally produced two further product types: one for identifying and reporting healthy images, and another for visualizing interval changes. In CXR, image registration determines the geometric transformation that statistically specifies the shape of the image, and, depending on the clinical objective, registration changes how images are retrieved. Table 19 summarizes the key findings of the proposed framework.
Conclusion
In this paper, a novel Bhattacharya Context performance aware global attention-based transformer (BCGAT) is presented for more accurate similarity computation. The input images are collected from the COVID-19 Chest X-ray image dataset and the Chest X-ray (pneumonia) dataset. The collected X-ray images are pre-processed before being stored in a medical database for analysis and retrieval. The hybridized Wavelet-Hadamard Transform (HWHT) was utilized to obtain both low- and high-frequency detail for analysis, the Gray Level Co-occurrence Matrix (GLCM) was used to extract important features, and Sine chaos based artificial rabbit optimization (SCARO) was utilized to minimize feature complexity. The Bhattacharyya Coefficient (BC) measures the similarity between the query image and the stored images. With the proposed method, the COVID-19 chest X-ray dataset obtained an F1-Score of 97.1%, accuracy of 99.5%, precision of 97.1%, recall of 97.1%, and MSE of 0.0963. The chest X-ray (pneumonia) dataset achieved an accuracy of 98.60%, a precision of 98.49%, a recall of 97.40%, and an F1-score of 98.50%. For the NIH chest X-ray dataset, the accuracy value is 99.67%.
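As a concrete illustration of the GLCM feature-extraction stage summarized above, the following sketch computes a small texture descriptor with scikit-image; the distances, angles, and property set are generic choices and not necessarily the exact configuration used in this work.

```python
# Generic GLCM texture-feature extraction with scikit-image (parameter
# choices are illustrative, not the paper's exact configuration).
import numpy as np
from skimage.feature import graycomatrix, graycoprops

def glcm_features(image: np.ndarray) -> np.ndarray:
    """Compute a compact GLCM texture descriptor for an 8-bit grayscale image."""
    glcm = graycomatrix(
        image,
        distances=[1],                                   # 1-pixel offsets
        angles=[0, np.pi / 4, np.pi / 2, 3 * np.pi / 4], # 4 directions
        levels=256,
        symmetric=True,
        normed=True,
    )
    props = ("contrast", "homogeneity", "energy", "correlation")
    return np.hstack([graycoprops(glcm, p).ravel() for p in props])

# Toy 8-bit "X-ray" patch
patch = (np.random.default_rng(1).random((64, 64)) * 255).astype(np.uint8)
print(glcm_features(patch).shape)  # 4 properties x 4 angles = (16,)
```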
Limitations and future work
The proposed system is efficient and accurate in image retrieval tasks. However, it is designed only for 2D radiographic images and does not handle modalities such as CT and MRI scans, which contain volumetric, multi-slice data. Additionally, the model relies on hand-crafted texture features and does not fully exploit multimodal data sources such as clinical notes. Future work will focus on extending the framework to 3D imaging modalities and integrating AI models that combine vision transformers with natural language processing (NLP) for multimodal diagnosis. Enhancements such as federated learning and secure edge deployment will also be explored to ensure patient privacy. Moreover, real-time integration with hospital PACS systems and support for the DICOM standard will be prioritized for clinical translation.
Data availability
Data supporting this study are publicly available on the following sites. https://www.kaggle.com/datasets/prashant268/chest-xray-covid19-pneumonia; https://www.kaggle.com/datasets/paultimothymooney/chest-xray-pneumonia/data.
References
Mohammed, M. A., Oraibi, Z. A. & Hussain, M. A. Content-based image retrieval using hard voting ensemble method of Inception, Xception, and MobileNet architectures. Iraqi J. Electr. Electron. Eng. 19(2) (2023).
Sweeta, J. A. & Sivagami, B. Review on topical content-based image retrieval systems in the medical realm. In 7th International Conference on Recent Innovations in Computer and Communication (ICRICC 23). https://doi.org/10.59544/ODBB5333/ICRICC23P16 (2023).
Shetty, R., Bhat, V. S. & Pujari, J. Content-based medical image retrieval using deep learning-based features and hybrid meta-heuristic optimization. Biomed. Signal Process. Control 92, 106069 (2024).
Kapadia, M. R. & Paunwala, C. N. Content-based image retrieval techniques and their applications in medical science. In Biomedical Signal and Image Processing with Artificial Intelligence 123–151 (2022).
Srivastava, D. et al. Content-based image retrieval: A survey on local and global features selection, extraction, representation and evaluation parameters. IEEE Access 11, 95410–95431. https://doi.org/10.1109/ACCESS.2023.3308911 (2023).
Rout, N. K., Ahirwal, M. K. & Atulkar, M. Content-based medical image retrieval system for skin melanoma diagnosis based on optimized pair-wise comparison approach. J. Digit. Imaging 36(1), 45–58 (2023).
Tabatabaei, Z. et al. WWFedCBMIR: World-wide federated content-based medical image retrieval. Bioengineering 10(10), 1144 (2023).
Muraki, H., Nishimaki, K., Tobari, S., Oishi, K. & Iyatomi, H. Isometric feature embedding for content-based image retrieval. In 2024 58th Annual Conference on Information Sciences and Systems (CISS) Vol. 13, 1–6 (2024).
Arora, N., Kakde, A. & Sharma, S. C. An optimal approach for content-based image retrieval using deep learning on COVID-19 and pneumonia X-ray Images. Int. J. Syst. Assur. Eng. Manag. 14, 246–255 (2023).
Tizhoosh, H. R. On validation of search & retrieval of tissue images in digital pathology. https://doi.org/10.48550/arXiv.2408.01570 (2024).
Chen, Y. et al. Performance evaluation of attention-deep hashing based medical image retrieval in brain MRI datasets. J. Radiat. Res. Appl. Sci. 17(3), 100968 (2024).
Sadik, M. J., Samsudin, N. A. & Bin Ahmad, E. F. Balancing privacy and performance: Exploring encryption and quantization in content-based image retrieval systems. Int. J. Adv. Comput. Sci. Appl. 15(10), 093 (2024).
Tian, M., Zhang, Y., Zhang, Y., Xiao, X. & Wen, W. A privacy-preserving image retrieval scheme with access control based on searchable encryption in media cloud. Cybersecurity 7(1), 22 (2024).
Mahbod, A., Saeidi, N., Hatamikia, S. & Woitek, R. Evaluating pre-trained convolutional neural networks and foundation models as feature extractors for content-based medical image retrieval. Eng. Appl. Artif. Intell. 150, 110571 (2025).
Ali, M. Content-based medical image retrieval: A deep learning approach. Doctoral dissertation, St. Mary’s University.
Battur, R. & Narayana, J. Classification of medical X-ray images using supervised and unsupervised learning approaches. Indones. J. Electr. Eng. Comput. Sci. 30(3), 1713–1721 (2023).
Wickstrøm, K. K. et al. A clinically motivated self-supervised approach for content-based image retrieval of CT liver images. Comput. Med. Imaging Graph. 107, 102239 (2023).
Manna, A., Sista, R. & Sheet, D. Deep neural hashing for content-based medical image retrieval: A survey. Comput. Biol. Med. 196, 110547. https://doi.org/10.1016/j.compbiomed.2025.110547 (2025).
Lee, H. H., Santamaria-Pang, A., Merkow, J., Oktay, O., Pérez-García, F., Alvarez-Valle, J. & Tarapov, I. Region-based contrastive pretraining for medical image retrieval with anatomic query. arXiv preprint arXiv:2305.05598 (2023).
Franken, G. Applications of content-based image retrieval in financial trading. Masters Thesis, Eindhoven University of Technology (2023).
Li, M., Jung, Y., Fulham, M. & Kim, J. Importance-aware 3D volume visualization for medical content-based image retrieval-a preliminary study. Virtual Real. Intell. Hardw. 6(1), 71–81 (2024).
Tuyet, V. T., Binh, N. T., Quoc, N. K. & Khare, A. Content based medical image retrieval based on salient regions combined with deep learning. Mob. Netw. Appl. 26(3), 1300–1310 (2021).
Sivakumar, M., Kumar, N. M. & Karthikeyan, N. An efficient deep learning-based content-based image retrieval framework. Comput. Syst. Sci. Eng. 43(2), 683–700 (2022).
Karthik, K. & Kamath, S. S. A deep neural network model for content-based medical image retrieval with multi-view classification. Vis. Comput. 37(7), 1837–1850 (2021).
Dubey, S. R. A decade survey of content based image retrieval using deep learning. IEEE Trans. Circuits Syst. Video Technol. 32(5), 2687–2704 (2021).
Agrawal, S., Chowdhary, A., Agarwala, S., Mayya, V. & Kamath, S. S. Content-based medical image retrieval system for lung diseases using deep CNNs. Int. J. Inf. Technol. 14(7), 3619–3627 (2022).
Rashad, M., Afifi, I. & Abdelfatah, M. RbQE: An efficient method for content-based medical image retrieval based on query expansion. J. Digit. Imaging 36(3), 1248–1261 (2023).
Issaoui, I. et al. Archimedes optimization algorithm with deep learning assisted content-based image retrieval in healthcare sector. IEEE Access 12, 29768–29777 (2024).
Ahmed, A. & Malebary, S. J. Query expansion based on top-ranked images for content-based medical image retrieval. IEEE Access 8, 194541–194550 (2020).
Cui, L. & Liu, M. An intelligent deep hash coding network for content-based medical image retrieval for healthcare applications. Egypt. Inform. J. 27, 100499 (2024).
Padate, R., Gupta, A., Chakrabarti, P. & Sharma, A. An enhanced approach to content-based image retrieval using Harris Hawks optimization and automated image captioning. In 2024 8th International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC) 1816–1824 (IEEE, 2024).
Suresh Kumar, J. S. & Celestin Vigila, S. M. Autoencoder and CNN for content-based retrieval of multimodal medical images. Int. J. Adv. Comput. Sci. Appl. 15(4), 281 (2024).
Helal, S. & Salem, N. A hybrid watermarking scheme using Walsh-Hadamard transform and SVD. Procedia Comput. Sci. 194, 246–254 (2021).
Kumar, D. Feature extraction and selection of kidney ultrasound images using GLCM and PCA. Procedia Comput. Sci. 167, 1722–1731 (2020).
Wang, L., Cao, Q., Zhang, Z., Mirjalili, S. & Zhao, W. Artificial rabbits optimization: A new bio-inspired meta-heuristic algorithm for solving engineering optimization problems. Eng. Appl. Artif. Intell. 114, 105082 (2022).
https://www.kaggle.com/datasets/prashant268/chest-xray-covid19-pneumonia.
https://www.kaggle.com/datasets/paultimothymooney/chest-xray-pneumonia/data.
AlMohimeed, A. et al. Diagnosis of COVID-19 using chest X-ray images and disease symptoms based on stacking ensemble deep learning. Diagnostics 13(11), 1968 (2023).
Dhiman, G., Vinoth Kumar, V., Kaur, A. & Sharma, A. DON: Deep learning and optimization-based framework for detection of novel coronavirus disease using X-ray images. Interdiscip. Sci. Comput. Life Sci. 13, 260–272 (2021).
Iqbal, S. et al. Fusion of textural and visual information for medical image modality retrieval using deep learning-based feature engineering. IEEE Access 11, 93238–93253 (2023).
Chowdary, G. J. & Yin, Z. Med-Former: A transformer-based architecture for medical image classification. In International Conference on Medical Image Computing and Computer-Assisted Intervention 448–457 (Springer, 2024).
Shao, Z. et al. TransMIL: Transformer based correlated multiple instance learning for whole slide image classification. Adv. Neural Inf. Process. Syst. 34, 2136–2147 (2021).
Author information
Contributions
All authors contributed to the study, conception, and design. All authors commented on the manuscript. All authors read and approved the final manuscript.
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.