Introduction

Osteoarthritis (OA) is one of the most common and debilitating chronic illnesses and the fourth leading cause of disability worldwide1, with the knee being the most frequently affected joint. Pain is the defining symptom of knee OA, driving patients to seek medical care and reducing their quality of life2. Knee osteoarthritis (KOA) is a prevalent chronic condition, recognized as degenerative arthritis of the knee joint, that results from 'wear and tear' of the ligaments connecting the femur and tibia3,4.

The disease is frequently associated with gradual structural degradation of articular cartilage, leaving patients with permanent physical impairment. According to a recent literature review on the epidemiology of OA, knee OA has a high global prevalence5.

Older age, obesity6, and prior knee injury7 are established risk factors for OA, which causes pain that impairs function and lowers quality of life. Total knee replacement (TKR), the definitive treatment for OA, is costly, and the implant has a limited lifespan, particularly in obese patients8. Consequently, early recognition of knee OA is essential for starting therapies such as weight loss and exercise, which can effectively slow the progression of knee OA and delay TKR6,9. Furthermore, several studies have emphasized the negative economic impact of knee osteoarthritis in terms of GDP loss10, direct healthcare costs11, and annual productivity losses from missed employment12,13.

KOA affects approximately one in every three individuals14,15, and more than half of people aged 65 and older show evidence of osteoarthritis in at least one joint. According to the World Health Organization's (WHO) 2016 osteoarthritis report, 9.6% of men and 18.0% of women over the age of sixty have symptomatic osteoarthritis; among them, 80% have mobility limitations and 25% find it difficult to carry out their everyday activities16. According to the United Nations, 130 million people will suffer from KOA by 2050, with 40 million severely disabled by the condition. KOA is one of the top five causes of disability and poses a growing financial burden on society, mainly because of missed work hours and healthcare costs17. Figure 1 depicts a healthy knee joint and a knee joint with osteoarthritis. Clinically, it is critical to diagnose the affected joint and localize the damaged regions accurately. X-ray, MRI, and CT modalities are used to scan these regions to detect wear and tear and to plan interventions such as implants and total knee replacement.

Fig. 1

The normal knee joint and knee joint with osteoarthritis27.

Radiography (X-ray) imaging is preferred for assessing OA18 because of its accessibility, cost-effectiveness, and superior spatial resolution and contrast for bone and soft tissue. Several OA-related segmentation and classification techniques exist for evaluating the knee, broadly classified into classical approaches and deep learning (DL) approaches19,20,21. In current clinical practice, OA severity is typically assessed visually from radiographs, which is prone to inter-rater variability and time-consuming for large datasets22.

Deep learning (DL), a sophisticated form of artificial intelligence, has been successfully applied to various medical imaging tasks23. DL can potentially provide a new way to design OA risk-estimation algorithms that predict pain progression by extracting meaningful prognostic information from imaging scans in a timely and automated manner. Convolutional neural networks (CNNs) and other deep learning approaches automatically extract visual features through a sequence of transformations within the model architecture, enabling the learning of complex representations24,25. The CNN is a deep learning technique within the machine learning field of artificial intelligence (AI). CNNs are flexible, relatively simple, and efficient to train, since the network learns during the tuning procedure with fewer parameters26. A CNN's overall design consists of an input layer, hidden layers built from a sequence of image filters, feed-forward layers that apply those filters to the input image, and an output layer where the features are retrieved20,25. Integrating CNNs with transfer learning frameworks significantly improves image recognition for knee osteoarthritis.
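As a concrete illustration of the layered design described above (an input layer, convolutional filter layers, feed-forward layers, and an output layer), the following is a minimal sketch in Python using tensorflow.keras; the number of layers, filter counts, and hyper-parameters are illustrative assumptions, not the architecture used in this study.

```python
# Minimal CNN sketch mirroring the layered design described above.
# Layer sizes and hyper-parameters are illustrative assumptions only.
from tensorflow.keras import layers, models

def build_simple_cnn(input_shape=(224, 224, 1), num_classes=2):
    model = models.Sequential([
        layers.Input(shape=input_shape),                          # input layer
        layers.Conv2D(16, 3, padding="same", activation="relu"),  # learned image filters
        layers.MaxPooling2D(),
        layers.Conv2D(32, 3, padding="same", activation="relu"),
        layers.MaxPooling2D(),
        layers.Flatten(),
        layers.Dense(64, activation="relu"),                      # feed-forward layer
        layers.Dense(num_classes, activation="softmax"),          # output layer
    ])
    model.compile(optimizer="adam",
                  loss="sparse_categorical_crossentropy",
                  metrics=["accuracy"])
    return model
```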

This research aimed to create and test DL risk-evaluation algorithms for forecasting the development of pain in individuals who have, or are susceptible to, knee osteoarthritis. DL approaches outperform conventional approaches based on clinical, demographic, and radiographic risk factors in predicting pain progression. In this paper, the images were processed, enhanced, and normalized. The suggested CNN and additional pre-trained models were used for feature extraction, and a metaheuristic optimizer was employed to choose the best features among them. Lastly, the proposed deep neural network (DNN) architecture was applied to categorize these features.

The rest of the article is structured as follows: Section "Related works" reviews recent research efforts in KOA diagnosis; Section "Material and methods" describes the methodology of the suggested procedure and the feature selection models; Section "Evaluation criteria" presents the evaluation metrics and the feature extraction and selection results; Section "Classification results and discussion" presents the classification results and discussion; and Section "Conclusion" presents the study's conclusions and suggestions.

Research contribution

This work addresses the challenge of automatically classifying osteoarthritis in the knee using X-rays. This study presents the following key contributions:

  1. A novel system is proposed to assist medical specialists in diagnosing KOA and classifying its severity as needed.

  2. The classification models' accuracy is boosted by a pre-processing step that filters the images with a high-pass filter in the frequency domain, highlighting the texture of trabecular bone.

  3. The impact of the dataset's imbalanced distribution is minimized, and a rebalancing process is presented that dramatically increases classification accuracy.

  4. A DL model with the fewest misclassifications is proposed.

  5. The proposed CNN model is applied to extract features from the images in the dataset.

  6. The significant features are selected by the bGGO optimizer.

  7. The selected features are classified by K-nearest neighbor (K-NN), decision tree (DT), Multi-layer Perceptron (MLP), and convolutional neural network (CNN) classifiers.

  8. The CNN hyper-parameters are tuned with GGO.

  9. A deep neural network (DNN) model is proposed for identifying the KOA features accurately.

  10. The KOA recognition performance measures are evaluated against contemporary studies and pre-trained algorithms.

Related works

Several studies have presented methods for classifying knee osteoarthritis using various techniques, although the results are far from optimal. New OA classification algorithms continue to emerge as deep neural network architectures evolve.

In 2016, Antony et al.28 suggested an innovative technique that uses a deep convolutional neural network (DCNN) to categorize the severity of knee OA from radiographs. The results on X-ray images with KL-grade labels demonstrate a notable advancement over the state of the art. In place of template matching, they proposed using horizontal image gradients to train a linear SVM, which is quicker and more precise than template matching. The resulting classification accuracy was 59.6%.

In 2017, Antony et al.29 presented a technique that automatically localizes knee joints using a fully convolutional network (FCN). By optimizing a weighted ratio of two loss functions, namely categorical cross-entropy and mean-squared loss, they trained convolutional neural networks (CNNs) to evaluate the severity of knee osteoarthritis. They achieved a mean squared error of 0.898 and a multi-class classification accuracy of 60.3%.

In 2018, Tiulpin et al.30 suggested an innovative approach to identifying and classifying knee OA using standard radiographs. They used a deep Siamese network structure to classify OA; this architecture was originally designed to learn a similarity measure between image pairs. The entire network comprises two branches, one for each input image. A probability distribution of grades across images was used to assess the graded CAD system. They also tested a fine-tuned ResNet-34 network. The average multiclass accuracy was 66.71%.

In 2018, Suresha et al.31 fine-tuned a network pre-trained on ImageNet with a training approach alternating between object-classification and region-proposal network fine-tuning, as the features shared by both tasks were expected to increase prediction reliability. Manually labeled knee regions served as ground truth for training the region-proposal network. Their multiclass classification accuracy was 88.2%.

In 2019, Abedin et al.32 employed Elastic Net (EN) and Random Forests (RF) to develop predictive models using patient assessment data, together with a CNN trained only on an X-ray dataset. The within-subject association between the two knees was modeled using linear mixed-effects models (LMMs). The CNN, EN, and RF algorithms had root mean squared errors of 0.77, 0.97, and 0.94, respectively.

In 2019, Tiulpin et al.33 introduced an approach based on multimodal machine learning to forecast osteoarthritis progression that uses clinical examination findings, raw radiography data, and the patient's previous health information. This approach was validated using an independent test set of 3,918 knee images from 2,129 participants. It produced an average precision (AP) of 0.68 (0.66–0.70) and an area under the ROC curve (AUC) of 0.79 (0.78–0.81).

In 2019, Chen et al.34 deployed two deep convolutional neural networks for automated prediction of KOA and its severity. The underlying X-ray scans for this approach were obtained from the OAI. The suggested method begins by detecting the knee joints in the images using a customized YOLOv2 network. After fine-tuning DenseNet, VGG, ResNet, and InceptionV3, they categorized knee X-ray images into severity classes using the KL grading system. Their knee joint detection approach had a recall of 92.2% and a mean Jaccard index of 0.858, while their fine-tuned VGG-19 model detected knee osteoarthritis severity with 69.7% accuracy.

In 2019, PU Patravali et al.35 developed an approach to calculate cartilage area/thickness utilizing several form descriptors. The generated descriptors achieved an accuracy of 99.81% for the KNN classifier and 95.09% for the DT classifier.

In 2019, PU Patravali et al.36 introduced an innovative method to investigate several segmentation strategies for the early identification of OA. The experiment employed various segmentation techniques, such as Sobel and Prewitt edge segmentation, Otsu’s method of segmentation, and texture-based segmentation. The various statistical features were calculated, analyzed, and categorized. The achieved accuracies were 91.16% for the Sobel approach, 96.80% for Otsu’s approach, 94.92% for the texture approach, and 97.55% for the Prewitt approach.

In 2020, Thomas et al.37 sought to develop an automated system for diagnosing the severity of KOA from radiographs. Using a large dataset, the approach's effectiveness was assessed by comparing its results with the assessments of radiologists specializing in musculoskeletal disorders. The radiograph images were enhanced automatically and then fed into a CNN model. They achieved an F1 score of 70% and overall accuracy of 71% over the whole test dataset.

In 2020, Leung et al.38 introduced a deep-learning KOA classification algorithm trained on knee images of patients who underwent total knee replacement surgery, contrasted with individuals who did not have KOA. To discriminate between KL-based grade classes, a ResNet-34 model with cross-validation was employed. The study used a dataset of 4796 images obtained from the OAI. The suggested model had an accuracy of 72.7%. The restricted dataset size and the reliance on transfer learning hampered the system's ability to achieve higher accuracy.

In 2021, Javed et al.39 developed ResNet-14, a pre-trained residual network, to predict KL grades from radiographic data. A multicenter dataset was employed to validate the network's performance. The network obtained 98% accuracy and 98% AUC.

In 2021, Shivanand S. Gornale et al.40 proposed a novel method for detecting osteoarthritis by identifying the region of interest. A database of 1,173 knee X-rays was collected and manually graded by two independent medical specialists using the Kellgren and Lawrence grading system. Features were computed using the histogram of oriented gradients (HOG) method and the local binary pattern (LBP). The calculated features were categorized with a decision tree classifier. The proposed approach reached accuracies of 97.86% and 97.61%.

In 2022, Ribas et al.41 suggested an innovative technique for detecting early knee OA based on complex network modeling and statistical data. The proposed network technique allowed the primary properties of the X-ray images to be modeled while also increasing the separation between the control and OA groups. The suggested technique's accuracy was 81.69%.

In 2022, Teo et al.42 used pre-trained InceptionV3 and DenseNet201 networks to extract features from the OAI dataset, which is divided into five categories based on osteoarthritis severity. An SVM classifier was employed to classify the deep-learning features. The accuracy rate for DenseNet201-SVM was 71.33%.

In 2023, C. Guida et al.53 suggested a fusion approach that blends three distinct modalities, MRI, X-ray, and the patient's clinical data, into a single structure, increasing accuracy over the methods used independently. The fusion architecture was constructed from two systems from previous studies trained on a limited dataset, blending a conventional CNN for X-rays and a unique 3D MRI model. The study's conclusions indicated that the approach achieved a performance accuracy of 76%, which was inadequate and had to be improved.

In 2024, Anandh Sam Chandra Bose et al.54 utilized a CNN approach to extract features from clinical imaging data. They used sophisticated approaches such as PSO and the Genetic Bee Colony (GBC) to uncover significant features for improving ML models. Comparing models trained with optimized features to those trained with direct CNN features reveals significant improvements in accuracy, sensitivity, specificity, PPV, and NPV across various ML techniques, such as SVM, KNN, RF, and Linear Discriminant Analysis (LDA). Features chosen by GBC achieved 99.15% accuracy in binary classification tasks. In multiclass classification, GBC features paired with RF achieved an accuracy of 98.91%.

In 2024, Muhammed Yildirim and Hursit Mutlu43 created a hybrid model by extracting features using Darknet53, Histogram of Oriented Gradients (HOG), Local Binary Pattern (LBP), and Neighborhood Component Analysis (NCA). The dataset included 1650 knee images divided into five categories: normal, doubtful, mild, moderate, and severe. The experimental investigations compared the suggested method's performance to eight distinct CNN models. The developed model had an accuracy of 83.6%.

Lately, deep learning algorithms have been used in medical imaging to increase the precision of disease diagnosis. CNNs have been used in several studies to reliably classify knee radiographs as normal or osteoarthritic.

The researchers achieved satisfactory outcomes with a variety of approaches and datasets. Every researcher aims to achieve the promised precision of X-ray image analysis for earlier KOA detection. Another consideration is that most existing studies were conducted on the Osteoarthritis Initiative (OAI) or MOST datasets, which have an imbalanced data distribution. This study differs from earlier work in that it combines a variety of approaches in a hybrid pipeline to achieve high accuracy and applies a data-balancing strategy. Because it is challenging to categorize KOA images correctly, this obstacle was overcome by extracting features with several deep neural models, selecting the best of them, and then classifying the selected features. Table 1 summarizes relevant studies concerning the diagnosis of knee osteoarthritis.

Table 1 An overview of relevant literature.

Material and methods

The proposed classification approach for knee OA diagnosis in this study consists of data gathering and preparation, feature extraction and selection, and recognition of image labels, as illustrated in Fig. 2. A dataset of knee X-ray images was downloaded and then subjected to the preprocessing procedures. Image enhancement techniques include frequency-domain filtering, histogram equalization, and sharpening. After the dataset was collected and preprocessed, four common deep-learning models, AlexNet44,45, VGG1946, ResNet-5047,48, and GoogleNet49,50, were trained, evaluated, and compared to choose the most effective one for detecting KOA cases. The chosen model received the processed images, and during training its parameters were adjusted to improve accuracy. Features are then extracted from the input images by the best-performing model. The optimal feature set is then found by processing the extracted features with the suggested feature selection procedure. Finally, an optimized CNN classifier is trained on the optimal feature set to determine the class of the input image. The following subsections explain the suggested framework's methodology on the KOA dataset in detail.

Fig. 2

The general scheme for the suggested framework.

Dataset description

The knee osteoarthritis graded dataset provided the knee X-ray images used in this study to train the proposed framework. The images are available on Kaggle27 and were collected through the Osteoarthritis Initiative (OAI). There are a total of 3835 knee images, separated into two grades. All images in the dataset were carefully assessed by competent clinicians as normal or osteoarthritic, with the distribution of each grade displayed in Table 2. The dataset's images were scaled down to a uniform size of 224 × 224 pixels for easier processing by the model.

Table 2 Distribution of knee osteoarthritis dataset.

Data preparation

The data preparation step is essential in image analysis owing to the increasing demand for high-quality, consistent data.

i. Data Augmentation and Balancing.

Figure 3 depicts the data augmentation methods applied to the images to increase the dataset's size while preventing overfitting. The expanded dataset enhanced the model's reliability and accuracy. A flipping approach was applied to the dataset. Data augmentation makes it possible to build a significantly more extensive and diverse dataset for training the deep learning algorithm, creating additional images with minimal changes to the originals. When these methods are applied, the model can learn the core characteristics of the images, since it is exposed to a broader range of variations. After data augmentation, the dataset contains 5132 images. Table 3 shows the final dataset used to train the network in this investigation, and Table 4 summarizes the distribution of all datasets. The knee joint images are divided into training, validation, and test sets.

Fig. 3

The implemented operation of data augmentation technique.

Table 3 Distribution of the Rebalanced Dataset.
Table 4 Distribution of all dataset.

The dataset has a highly uneven distribution: the normal-class data is significantly smaller than the osteoarthritis class in the training and validation sets, whereas the osteoarthritis-class data is much smaller than the normal class in the test set. To avoid biasing the training results, the suggested framework ensures data balance by randomly choosing an equal number of images for each category. This process is known as "data balancing": ideally, each class should have nearly the same detection rate. With a balanced dataset, a model can achieve higher detection rates, accuracy, and precision. To reduce the negative impact of imbalance on the results, the flipping technique was used to artificially rebalance the dataset.
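As a rough illustration of this flip-based rebalancing, the sketch below oversamples under-represented classes with horizontal flips until all classes reach the same count; the function name and the equal-count target are assumptions, not the authors' exact implementation.

```python
# Sketch of flip-based class rebalancing (illustrative, not the paper's exact code).
import numpy as np

def rebalance_with_flips(images, labels, seed=0):
    """Horizontally flip randomly chosen images of under-represented
    classes until every class reaches the size of the largest class."""
    rng = np.random.default_rng(seed)
    images, labels = list(images), list(labels)
    classes, counts = np.unique(labels, return_counts=True)
    target = counts.max()
    for cls, count in zip(classes, counts):
        idx = [i for i, y in enumerate(labels) if y == cls]
        while count < target:
            i = rng.choice(idx)
            images.append(np.fliplr(images[i]))  # horizontal flip of an existing image
            labels.append(cls)
            count += 1
    return np.stack(images), np.array(labels)
```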

ii. Data Pre‐processing.

First, a frequency-domain filter is applied to the images and the histogram is normalized to enhance the trabecular bone texture and improve recognition accuracy. Second, image sharpening is applied in a customizable function to reduce noise and equalize histograms. Figure 4 depicts the workflow for the three primary processes: histogram normalization, frequency-domain filtering, and image sharpening.

Fig. 4

Image pre‐processing process. (a) The input images. (b) The pre-processed images by histogram equalization. (c) The pre-processed images by sharpening filter. (d) The pre-processed images by a frequency domain high‐pass filter.

The non-linear histogram normalization technique improves the filtered image's contrast, since most X-ray images in the OAI dataset have poor contrast. The image's intensity range is then restored52. Figure 5 depicts the pre-processing results for the images. Equation (1) gives the formula for performing histogram equalization, where r represents the input pixel value, s represents the output pixel value, and L denotes the number of intensity levels (so L − 1 is the maximum pixel value). Equation (2) expresses the probability of occurrence of intensity level rj, where nj is the number of pixels with intensity rj and MN is the total number of pixels in the image.

$$s_{k} = T(r_{k}) = (L - 1)\sum_{j=0}^{k} p_{r}(r_{j})$$
(1)
Fig. 5

Image preprocessing steps.

$$p_{r}(r_{j}) = \frac{n_{j}}{MN}$$
(2)
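The sketch below shows one way to chain the three pre-processing steps in Python with OpenCV, following the panel order of Fig. 4 (histogram equalization, sharpening, then a frequency-domain high-pass filter); the sharpening kernel and the cutoff radius are assumed values, not the exact settings used in this study.

```python
# Sketch of the pre-processing chain; kernel and cutoff radius are assumptions.
import cv2
import numpy as np

def preprocess(gray_uint8):
    # 1) Histogram equalization (Eqs. 1-2) to improve the poor X-ray contrast.
    eq = cv2.equalizeHist(gray_uint8)
    # 2) Sharpening with a simple Laplacian-style kernel.
    kernel = np.array([[0, -1, 0], [-1, 5, -1], [0, -1, 0]], dtype=np.float32)
    sharp = cv2.filter2D(eq, -1, kernel)
    # 3) Frequency-domain high-pass filter to emphasize trabecular bone texture.
    f = np.fft.fftshift(np.fft.fft2(sharp.astype(np.float32)))
    rows, cols = sharp.shape
    cy, cx = rows // 2, cols // 2
    y, x = np.ogrid[:rows, :cols]
    mask = (y - cy) ** 2 + (x - cx) ** 2 > 30 ** 2      # assumed cutoff radius of 30 px
    high = np.fft.ifft2(np.fft.ifftshift(f * mask)).real
    return cv2.normalize(high, None, 0, 255, cv2.NORM_MINMAX).astype(np.uint8)
```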

Feature extraction

The main pre-trained models used for feature extraction were AlexNet, GoogleNet, VGGNet, and ResNet-50. These models contain layers that combine linear and nonlinear operations whose parameters are learned jointly. Extracting features from a deep learning framework such as ResNet-50 can be divided into multiple steps. The data was fed into the ResNet-50 model, and backpropagation was used to train the network, adjusting the neurons' weights and biases to reduce the loss function. Features were then extracted from the images with high accuracy thanks to the strength and efficacy of deep learning models such as ResNet-50.
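The following sketch shows a typical way to pull deep features from a pre-trained ResNet-50 in Python (tensorflow.keras with ImageNet weights); the fine-tuning stage mentioned above is omitted for brevity, so this illustrates the extraction step only.

```python
# Sketch of ResNet-50 deep-feature extraction (fine-tuning omitted).
import numpy as np
from tensorflow.keras.applications import ResNet50
from tensorflow.keras.applications.resnet50 import preprocess_input

extractor = ResNet50(weights="imagenet", include_top=False, pooling="avg")

def extract_features(batch_rgb_224):
    """batch_rgb_224: float array of shape (N, 224, 224, 3)."""
    return extractor.predict(preprocess_input(np.copy(batch_rgb_224)))  # -> (N, 2048)
```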

Feature selection

The X-ray image features are reduced by applying a feature selection technique; removing irrelevant and redundant features improves classification accuracy. This study applies the Greylag Goose Optimization (GGO) algorithm to perform the feature selection task.

GGO algorithm

The Greylag Goose Optimization (GGO) algorithm is used for optimization in the present study. The GGO optimizer has several advantages: the colony functions independently of any higher authority (modularity), the task is generally completed effectively even if multiple agents fail (robustness), and network adjustments spread quickly (speed). However, it is difficult to predict collective behavior from the rules alone (behavior), it is impossible to understand how the colony functions without knowing how an agent functions (knowledge), and any departure from these fundamental rules changes the collective behavior (sensitivity). The GGO algorithm starts by creating a random population of individuals, each representing a potential solution to the problem. This population, called a gaggle, has size n and is represented by Xi (i = 1, 2, …, n). Each individual is assessed using a chosen objective function, Fn. The best solution, or leader, is found by computing the objective function of every individual (agent). The population is then dynamically divided into two categories by the GGO algorithm: an exploration group (n1) and an exploitation group (n2). Considering the best solution obtained, each iteration assigns different solutions to each group. Initially, the GGO algorithm splits the population equally, with 50% in the exploration group and 50% in the exploitation group. The number of agents in the exploitation group (n2) rises while the number of agents in the exploration group (n1) falls as the iterations continue. If the objective value of the best solution remains constant for three successive iterations, the algorithm raises the number of agents in the exploration group (n1) to find a better solution and escape local optima.

Exploration operation

Exploration is responsible for finding promising regions of the search space and avoiding stagnation in local optima by moving toward the best solution. Moving towards the best solution: with this strategy, the explorer geese look for promising new places near their present position. Exploration is performed by continually evaluating several neighboring candidates to determine the one with the best fitness. The vectors A and C are computed as A = 2a·r1 − a and C = 2·r2 throughout the iterations, with the parameter a decreasing linearly from 2 to 0. The GGO algorithm employs the following formula:

$$\mathbf{X}(t + 1) = \mathbf{X}^{*}(t) - \mathbf{A} \cdot \left| \mathbf{C} \cdot \mathbf{X}^{*}(t) - \mathbf{X}(t) \right|$$
(3)

where X(t) is an agent at iteration t, X*(t) is the position of the optimal solution (leader), and X(t + 1) is the updated position of the agent. The values r1 and r2 vary randomly within the range [0, 1]. The following formula is used to select three random search agents (paddlings), termed XPaddle1, XPaddle2, and XPaddle3, so that agents are not influenced by a single leader position and achieve greater exploration. The current search agent's location is adjusted in this way when |A| ≥ 1.

$$\mathbf{X}(t + 1) = w_{1}\,\mathbf{X}_{Paddle1} + z\, w_{2}\,(\mathbf{X}_{Paddle2} - \mathbf{X}_{Paddle3}) + (1 - z)\, w_{3}\,(\mathbf{X} - \mathbf{X}_{Paddle1})$$
(4)

where the values of w1, w2, and w3 are updated within [0, 2]. The following formula is used to calculate the parameter z, which decreases exponentially.

$$z = 1 - \left(\frac{t}{t_{\max}}\right)^{2}$$
(5)

where t is the iteration number and tmax is the maximum number of iterations. For r3 ≥ 0.5, the second updating procedure, in which the values of the a and A vectors are reduced, is as follows.

$$\mathbf{X}(t + 1) = w_{4}\,\left| \mathbf{X}^{*}(t) - \mathbf{X}(t) \right| \cdot e^{bl} \cdot \cos(2\pi l) + \left[ 2 w_{1}(r_{4} + r_{5}) \right]\mathbf{X}^{*}(t)$$
(6)

where l is a random value in [−1, 1] and b is a constant, while r4 and r5 are updated in [0, 1] and the parameter w4 is updated in [0, 2].

Exploitation operation

The task of enhancing the current solutions falls to the exploitation group. At the end of each cycle, the GGO determines which individual is the fittest and rewards it accordingly. The GGO uses two distinct tactics to accomplish its exploitation goal, explained below. Moving in the direction of the best solution: the optimal solution is approached using the following formulas. Three solutions (sentries), XSentry1, XSentry2, and XSentry3, direct the other individuals (XNonSentry) to adjust their positions in anticipation of the predicted position of the prey. The following formulas illustrate the position-update procedure.

$$\begin{gathered} \mathbf{X}_{1} = \mathbf{X}_{Sentry1} - \mathbf{A}_{1} \cdot \left| \mathbf{C}_{1} \cdot \mathbf{X}_{Sentry1} - \mathbf{X} \right| \hfill \\ \mathbf{X}_{2} = \mathbf{X}_{Sentry2} - \mathbf{A}_{2} \cdot \left| \mathbf{C}_{2} \cdot \mathbf{X}_{Sentry2} - \mathbf{X} \right| \hfill \\ \mathbf{X}_{3} = \mathbf{X}_{Sentry3} - \mathbf{A}_{3} \cdot \left| \mathbf{C}_{3} \cdot \mathbf{X}_{Sentry3} - \mathbf{X} \right| \hfill \\ \end{gathered}$$
(7)

where A1, A2, and A3 are derived from A = 2a·r1 − a, and C1, C2, and C3 are derived from C = 2·r2.

Searching the area around the optimal solution

While flying, the most promising region is located near the best solution (leader). This leads certain individuals to look for improvements by exploring areas near the best solution, denoted XFlock1. The GGO uses the following equation to carry out this procedure.

$$\mathbf{X}(t + 1) = \mathbf{X}(t) + \mathbf{D}\,(1 + z)\, w\,(\mathbf{X} - \mathbf{X}_{Flock1})$$
(8)

Selection of the best solution

The GGO has outstanding exploration capabilities since it uses a mutation operator and scans the members of the exploration group. The GGO's powerful exploration capability allows it to defer convergence. The GGO pseudo-code is given in Algorithm 1. We first supply the population size, mutation rate, and number of iterations to GGO. The GGO then divides the individuals into two groups: those that perform exploitation and those that perform exploration. Throughout the iterative process of identifying the optimal solution, the GGO approach adjusts each group's size dynamically. Each group uses two methods to complete its duties. The GGO randomly rearranges the solutions between iterations to offer diversity and in-depth search; a solution may move from the exploration group to the exploitation group within a single iteration, as shown below. The GGO's elitism method ensures that the leader is preserved throughout the operation. Figure 6 depicts each stage of the GGO algorithm used to update the positions of the exploration group (n1) and the exploitation group (n2). The parameter r1 is adjusted throughout the iterations, as expressed in Eq. (9).

$$r_{1} = c\left(1 - \frac{t}{t_{\max}}\right)$$
(9)

where c represents a constant, t denotes the current iteration, and tmax represents the maximum number of iterations. At the end of each iteration, GGO updates the agents in the search space, and their assignments to the exploration and exploitation groups are switched at random. In the last stage, GGO returns the optimal solution.
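To make the flow of the algorithm concrete, the sketch below implements a simplified GGO loop in Python/NumPy: a random gaggle, a leader, and a dynamic split between exploration and exploitation agents, using only Eqs. (3) and (5). The paddle and sentry update variants and the mutation step are omitted, so this is an illustration of the structure rather than the full algorithm.

```python
# Simplified GGO loop (Eqs. 3 and 5 only); not the full algorithm.
import numpy as np

def ggo_minimize(objective, dim, n_agents=20, t_max=100, lb=-1.0, ub=1.0, seed=0):
    rng = np.random.default_rng(seed)
    X = rng.uniform(lb, ub, (n_agents, dim))            # the gaggle
    leader = X[np.apply_along_axis(objective, 1, X).argmin()].copy()
    n1, stall = n_agents // 2, 0                        # exploration-group size, stall counter
    for t in range(t_max):
        a = 2.0 * (1.0 - t / t_max)                     # decreases linearly from 2 to 0
        z = 1.0 - (t / t_max) ** 2                      # Eq. (5)
        for i in range(n_agents):
            r1, r2 = rng.random(2)
            A, C = 2 * a * r1 - a, 2 * r2
            if i < n1:                                  # exploration group: Eq. (3)
                X[i] = leader - A * np.abs(C * leader - X[i])
            else:                                       # exploitation group: refine near leader
                X[i] = leader + z * rng.uniform(-1, 1, dim) * np.abs(leader - X[i])
            X[i] = np.clip(X[i], lb, ub)
        fitness = np.apply_along_axis(objective, 1, X)
        best = fitness.argmin()
        if fitness[best] < objective(leader):           # elitism: keep the best leader found
            leader, stall = X[best].copy(), 0
        else:
            stall += 1
        # shrink the exploration group over time, but re-grow it after 3 stalled iterations
        n1 = min(n_agents - 2, n1 + 1) if stall >= 3 else max(2, n1 - 1)
    return leader

# Example usage: best = ggo_minimize(lambda x: np.sum(x ** 2), dim=10)
```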

Fig. 6

Algorithm’s steps: exploration, exploitation, and dynamic groups.

Binary GGO algorithm

Feature selection is one of the most important steps in data analysis, as it aims to reduce the data's high dimensionality by removing irrelevant or redundant information. It has therefore been applied in a wide range of fields, since the fundamental goal of feature selection optimization is to identify the important features that minimize classification errors. Feature selection can be described mathematically as a minimization problem. For feature selection problems, the GGO algorithm's outputs must be binary, with values of 0 or 1. To facilitate the process of selecting features from the dataset, the continuous values of the suggested GGO method are transformed into binary values {0, 1}, as shown in the phases of Algorithm 2.


Algorithm 1: GGO Algorithm


Algorithm 2: bGGO Algorithm

Equation (10), used in this study, is based on the sigmoid function and is represented as follows:

$$x_{d}^{t+1} = \begin{cases} 1 & \text{if } Sigmoid(m) \ge 0.5 \\ 0 & \text{otherwise} \end{cases}, \qquad Sigmoid(m) = \frac{1}{1 + e^{-10(m - 0.5)}}$$
(10)

where \({\text{x}}_{\text{ d }}^{\text{t}+1}\) denotes the binary solution at iteration t and dimension d. The sigmoid function scales the resulting solutions to binary ones: the value becomes 1 if Sigmoid(m) is at least 0.5; otherwise, it stays 0. The parameter m represents the continuous value produced by the algorithm for the corresponding feature.
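A minimal sketch of this sigmoid-based binarization step in Python/NumPy, directly following Eq. (10):

```python
# Sigmoid transfer of Eq. (10): continuous positions -> binary feature mask.
import numpy as np

def to_binary(x):
    sigmoid = 1.0 / (1.0 + np.exp(-10.0 * (x - 0.5)))
    return (sigmoid >= 0.5).astype(int)   # 1 = feature kept, 0 = feature dropped
```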

Algorithm 2 provides a full explanation of the binary GGO method. The GGO algorithm has a computational complexity of O(tmax × n), which becomes O(tmax × n × d) for d dimensions. The binary GGO algorithm uses the objective function Fn to evaluate the quality of a solution. Equation (11) expresses Fn in terms of the classifier's error rate, Err.

$$F_{n} = \alpha Err + \beta \frac{\left| s \right| }{{\left| S \right|}}$$
(11)

where s denotes the set of selected features, S represents the complete feature set, β = 1 − α, and α ∈ [0, 1] controls the relative importance of the classification error. The strategy is successful if it can offer a subset of features with a minimal classification error rate. The classifier used in this evaluation relies only on the distance between the training and query instances.
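A hedged sketch of the fitness evaluation of Eq. (11) with a K-NN wrapper in scikit-learn follows; the weight α = 0.99 and the 5-fold cross-validation are common choices in feature-selection studies and are assumptions here, not values reported in this paper.

```python
# Fitness of Eq. (11): weighted classifier error plus selected-feature ratio.
import numpy as np
from sklearn.neighbors import KNeighborsClassifier
from sklearn.model_selection import cross_val_score

def fitness(mask, X, y, alpha=0.99):
    if mask.sum() == 0:
        return 1.0                                     # penalize empty feature subsets
    knn = KNeighborsClassifier(n_neighbors=5)
    acc = cross_val_score(knn, X[:, mask.astype(bool)], y, cv=5).mean()
    err = 1.0 - acc                                    # Err in Eq. (11)
    return alpha * err + (1.0 - alpha) * mask.sum() / mask.size
```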

Image classification

Finally, a classification approach is applied to the extracted and refined collection of features. Machine learning and deep learning classifiers are used for sorting KOA images, specifically convolutional neural network (CNN), decision tree (DT), K-nearest neighbor (K-NN), and multi-layer perceptron (MLP) classifiers.
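The sketch below shows how the selected features could be passed to the classical classifiers with scikit-learn (the optimized CNN branch is not shown); the split ratio and hyper-parameters are illustrative assumptions.

```python
# Comparing K-NN, DT, and MLP on the selected features (illustrative settings).
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier
from sklearn.neural_network import MLPClassifier
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score

def compare_classifiers(features, labels, seed=0):
    X_tr, X_te, y_tr, y_te = train_test_split(
        features, labels, test_size=0.2, stratify=labels, random_state=seed)
    models = {
        "K-NN": KNeighborsClassifier(n_neighbors=5),
        "DT": DecisionTreeClassifier(random_state=seed),
        "MLP": MLPClassifier(hidden_layer_sizes=(128,), max_iter=500, random_state=seed),
    }
    return {name: accuracy_score(y_te, m.fit(X_tr, y_tr).predict(X_te))
            for name, m in models.items()}
```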

Evaluation criteria

Performance metrics to the pre-trained model and classifier

Using the confusion matrix, measurements such as accuracy, precision, recall, and F1-score can be calculated by comparing the predicted labels against the true ones. The confusion matrix consists of four categories: true positives (TP), true negatives (TN), false positives (FP), and false negatives (FN). A TP is a case correctly predicted as knee OA, where both the actual and predicted classes are knee OA. A TN is a case where neither the actual nor the predicted class is knee OA. An FP occurs when the predicted class is knee OA but the actual class is not. An FN is a case where knee OA is the actual class but the predicted class differs. The best-performing model was determined to be the most reliable for detecting and classifying cases of knee osteoarthritis. Table 5 describes the evaluation metrics.
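For reference, the sketch below shows how the reported metrics follow from the four confusion-matrix counts:

```python
# Metrics derived from the confusion-matrix counts TP, TN, FP, FN.
def metrics_from_counts(tp, tn, fp, fn):
    accuracy = (tp + tn) / (tp + tn + fp + fn)
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)               # sensitivity
    specificity = tn / (tn + fp)
    f1 = 2 * precision * recall / (precision + recall)
    return {"accuracy": accuracy, "precision": precision, "recall": recall,
            "specificity": specificity, "f1": f1}
```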

Table 5 The description of the utilized evaluation metrics.

Performance metrics to the optimizers

The following metrics are used in the experiments to assess how well the suggested algorithm selects features (see Table 6). M denotes the number of runs, g* denotes the best solution, and N indicates the total number of points. L represents a point's class label, C represents the classifier's output, and Match indicates the degree of agreement between the two. \({g}_{j}^{*}\) denotes the best solution vector of run j (whose size is used in the selection-size metric), while D represents the dataset size.

Table 6 The evaluation metrics used in experiments to evaluate how well the suggested optimizers select features for assessment.

Feature extraction results

Metrics including F1-score, N-value, P-value, sensitivity, and accuracy are employed to assess the extracted features' efficacy. If the extracted features exhibit superior precision, sensitivity, specificity, and F1-score and a low P-value, then the extraction method has succeeded in identifying the most significant features for the classification task (see Table 7). The study's feature extraction used the ResNet-50 deep learning model, which produced an accuracy of 88.60%. The findings in the table show that the features extracted with ResNet-50 outperform those of the other deep neural networks. As a result, the subsequent phases of the suggested methodology use this network. This level of performance indicates that ResNet-50 can select and encode the most valuable features from the provided dataset, an essential capacity for addressing the image classification problem.

Table 7 Assessing the characteristics that were derived with CNN deep neural networks.

This method’s feature extraction of ResNet-50 suggests that further optimization and extension into other domains could yield substantially greater success in the future. As a performance indicator, this shows how deep learning technology is developing and how well-suited it is to handle different challenging issues. Thus, future directions for technological progress in machine learning and artificial intelligence require that models like ResNet-50 be essential for obtaining improved outcomes across various domains.

Feature selection results

Feature selection strategies are employed to refine the gathered features after the feature extraction procedure. A range of metrics is used to assess the effectiveness of the selected features, including best fitness, worst fitness, average error, average fitness, average selection size, and standard deviation of fitness; together these gauge the quality, complexity, stability, and robustness of the selection and provide insight into the classification technique's efficiency. Table 8 shows the evaluation results for the suggested feature selection strategy alongside a comparison with the other approaches: binary Greylag Goose Optimization (bGGO), binary Firefly Algorithm (bFA), binary Satin Bowerbird Optimizer (bSBO), binary Grey Wolf Optimization (bGWO), binary Particle Swarm Optimization (bPSO), binary Bat Algorithm (bBA), binary Genetic Algorithm (bGA), binary Multi-verse Optimization (bMVO), and binary Whale Optimization Algorithm (bWOA). It is evident from the results that the suggested feature selection strategy is superior to the feature selection techniques found in related works, demonstrating the superior performance and efficacy of the suggested approach in identifying the feature set required to categorize KOA cases.

Table 8 Evaluation of the suggested optimization algorithm against alternative optimization algorithms for the chosen set of features.

Classification results and discussion

Various classifiers are used in this study, namely K-nearest neighbor (K-NN), decision tree (DT), Multi-layer Perceptron (MLP), and convolutional neural network (CNN) classifiers. Several metrics, including time, F1-score, N-value, P-value, sensitivity, and specificity, are employed to evaluate the effectiveness of the optimized classifiers. According to these measures, if the selected features are discriminative and can reliably differentiate between the different KOA image classes, then the optimized classifiers can achieve high classification performance. The classification results before and after feature selection are shown in Table 9. This table makes it clear that the classification results with the suggested feature selection outperform those obtained before feature selection.

Table 9 The classification outcomes that were attained both using and without using the suggested feature selection technique.

First, convolutional neural network (CNN) classifiers are used. Table 10 shows the results obtained using the suggested strategy and alternative ways of optimizing the CNN with various optimizers. The GGO-CNN model outperformed the other state-of-the-art classifier models built with the CNN approach, as evidenced by its accuracy of 0.988692. With an accuracy of 0.974479, the GWO-CNN-based approach yielded the second-best classification results. It was followed by the PSO-CNN-based approach, which scored 0.969067, and the WOA-CNN-based model, which achieved 0.96545; the BBO-CNN-based approach produced the least accurate outcomes, with an accuracy of 0.9425.

Table 10 The Classification outcomes for various optimization algorithms based on CNN.

Since the outcomes of applying the suggested feature selection approach are promising, the chosen features are fed into the optimized classifiers. Figure 7 depicts the outcomes of the optimized CNN-based model after being fed the selected features. The attained accuracy is evaluated and displayed in this figure. The suggested methodology achieves an accuracy of 98.8692%, which is higher than the results of optimizing the CNN with the other optimization approaches. Table 11 presents the suggested system's classification outcomes using the K-nearest neighbor (K-NN), decision tree (DT), Multi-layer Perceptron (MLP), and parameter-optimized convolutional neural network (CNN) models. Figure 8 depicts box plots of model metrics for the suggested and compared algorithms, and Fig. 9 depicts a pair plot of the metrics.

Fig. 7

The results obtained by the CNN-based classifier as compared to the other optimization techniques when optimized with the proposed bGGO algorithm (a) Accuracy, (b) Sensitivity, (c) Specificity, (d) N-value, (e) P-value, and (f) F1-score.

Table 11 The categorization findings for the suggested system using K-nearest neighbor (K-NN), a decision tree (DT), Multi-layer Perceptron (MLP) and parameter optimization for the convolutional neural network (CNN) model.
Fig. 8

Box plots for model metrics to the suggested and compared algorithms.

Fig. 9

Pair plot of metrics.

Table 12 illustrates the ANOVA test findings for the offered bGGO + CNN approach against the comparable procedures. The ANOVA tests confirmed the bGGO + CNN procedure’s efficacy.

Table 12 The findings of the ANOVA for the proposed bGGO technique to categorize KOA.

The suggested technique was compared to recent related studies, as seen in Table 13, and was found to outperform the current studies despite its complex structure and use of multiple approaches.

Table 13 Comparisons between the proposed solution in this paper and related works.

Conclusion

A deep learning technique has been presented in this paper to classify knee joint osteoarthritis automatically. KOA categorization was performed using a unique CNN-based framework with the bGGO optimization algorithm. The appropriate collection of features is obtained using deep learning and a transfer learning approach. The most relevant features are extracted from the dataset's images using various DL pre-trained models, particularly ResNet-50. The extracted features were then reduced in number by the binary Greylag Goose Optimization (bGGO) algorithm to increase accuracy and remove unnecessary features. After applying various classifiers and optimization algorithms to the features chosen by GGO, classification metrics were computed; the suggested methodology attained an accuracy of 0.988692, a sensitivity of 0.980156, and a specificity of 0.990089. The simulated outcomes outperform those of similar work. These findings support the suggested system's use as an effective diagnostic tool for the early detection of KOA. In addition, a statistical analysis was carried out to demonstrate the validity of the suggested framework.