Introduction

Knee osteoarthritis (KOA), a complex musculoskeletal disease characterized by joint pain and limited mobility, affects 37% of persons over 60 years old worldwide1. Depressive symptoms are a common comorbidity of KOA, with surveys indicating a prevalence of around 20%2,3,4. Mounting evidence suggests that KOA combined with depressive symptoms is associated with worse pain symptoms, greater functional decline, and poorer disease prognosis5,6. This situation complicates disease management for KOA patients and exacerbates the health-related burden3,5,7,8,9,10,11,12,13,14,15,16.

Despite their high incidence and risk, depressive symptoms in KOA patients are underdiagnosed and undertreated, with less than 10% of affected individuals receiving effective treatment17. Given the adverse health outcomes and increased financial burden that depressive symptoms impose on KOA patients, many studies consider strengthened screening for depressive symptoms in KOA patients to be beneficial18,19,20. However, the latest cost-benefit analysis found that screening all OA patients for depressive symptoms did not significantly reduce treatment costs, calling its cost-effectiveness into question21. Compared with large-scale universal screening, identifying the high-risk population with a risk prediction model and then targeting assessment and management of depressive symptoms at that population may be a more cost-effective option.

Recently, there has been growing attention to the issue of depressive symptoms in patients with KOA, and many studies have explored the risk factors for depressive symptoms in this population. Identifying these risk factors could enable earlier diagnosis and treatment targeting susceptible populations to improve clinical outcomes. Based on existing research, the risk factors for KOA-related depression can be broadly categorized into sociodemographic factors, KOA-related symptoms, and other health conditions. Sociodemographic factors form the foundation of predictive models, with variables including gender22,23,24, age23,25, marital status23, education level25,26, income23,25, and living alone status27,28 being closely linked to depressive symptoms. Among them, gender and age have been shown to influence the incidence of depressive symptoms22,23,24,25, while socioeconomic status and living conditions (such as education level and living alone status) are also recognized as significant influencing factors25,26,27,28. Furthermore, KOA-related symptom factors, including KOA duration18, pain23,29, walking speed30, and the time taken for the Five-Times-Sit-to-Stand Test (FTSST)31, which are thought to directly reflect disease severity and physical function, are also significantly associated with depressive symptoms. Studies have shown that severe pain and functional limitations often increase the risk of depressive symptoms23,29,32. Additionally, other health condition factors such as comorbidities23, ability to perform activities of daily living (ADL)25,32, self-reported health status33, history of falls33,34, sleep problems35, smoking and alcohol consumption status6,36, and body mass index (BMI)25 are also significantly correlated with depressive symptoms. The deterioration of these health conditions is often accompanied by an increase in depressive moods23,37,38.
Depressive symptoms result from the interaction of multiple factors; no single factor alone can fully explain their complexity. Therefore, constructing a comprehensive predictive model that incorporates multiple factors is essential.

However, a validated and reliable multi-factorial model is still lacking. While some studies19,20,39 have explored the predictive effect of Kellgren-Lawrence (KL) grading on future depressive symptom risk in KOA patients, the heterogeneity of the results suggests a need for more in-depth research. Additionally, one study attempted to develop a prediction model based on 122 KOA patients with depressive symptoms, but it failed to include key risk factors such as sociodemographics, raising concerns about its stability and reliability due to the insufficient sample size and lack of external validation18. Another recent study tried to predict depressive symptoms using machine learning (ML) methods, but the prevalence of depressive symptoms in the study sample was much lower than in previous literature, calling the representativeness of the model into question, and the patients in the study were predominantly from white ethnic backgrounds40.

By focusing on a representative cohort of middle-aged and elderly adults in China, this study aimed to develop a multi-factorial model for predicting depressive symptoms in patients with KOA. While the data originate from a Chinese population, the findings could contribute to earlier diagnosis and targeted treatment strategies for depressive symptoms in KOA patients, ultimately improving clinical outcomes for susceptible populations globally. Furthermore, the application of ML methods in developing such models provides a framework that can be adapted and validated across diverse settings, thereby enhancing their potential utility beyond China. Given the growing prevalence of KOA and associated comorbidities worldwide, addressing these gaps will facilitate a more comprehensive understanding of depressive symptoms in KOA patients on a global scale, fostering advancements in personalized healthcare.

Methods

Data sources and study population

This study used data from the China Health and Retirement Longitudinal Survey (CHARLS) database for model development and the Osteoarthritis Initiative (OAI) for external validation. Both CHARLS and OAI are multicenter, longitudinal, prospective cohorts. CHARLS, based in China, includes data from 17,000 middle-aged and elderly people, encompassing key predictors of depressive symptoms in KOA patients41. For external validation, data were drawn from the U.S.-based OAI cohort, which focuses on knee health and includes data from 4,796 middle-aged and older patients. We analyzed baseline to 4-year follow-up data from both cohorts. Participants aged ≥ 45 years, diagnosed with KOA, and without depressive symptoms at baseline were included, while those with incomplete KOA or depression data at baseline or follow-up, follow-up durations < 1 year, or missing > 50% of variables were excluded. Ultimately, 496 CHARLS participants contributed to model development and 1,115 OAI participants were included for validation (Fig. 1).

Fig. 1

Formation process of the modeling and validation cohorts.

Outcome variable and assessment tool

The primary outcome in our study was the development of depressive symptoms at the 4-year follow-up in each database. In the CHARLS cohort, depressive symptoms were assessed using the 10-item CES-D scale, with a score ≥ 12 indicating depressive symptoms42. The OAI cohort used the 20-item CES-D scale, with a score ≥ 16 as indicative of depressive symptoms43. Both the 10-item and 20-item CES-D versions have demonstrated good reliability and validity in numerous studies, showing similar effectiveness in identifying depressive symptoms44,45.
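The outcome definition above amounts to a simple threshold rule. The sketch below (in Python, although the study's analyses were run in R) encodes the two cutoffs; the function name and scale labels are illustrative.

```python
# Cutoffs from the study: CES-D-10 (CHARLS) >= 12, CES-D-20 (OAI) >= 16.
# Function name and scale labels are illustrative, not from the paper.
def has_depressive_symptoms(score, scale):
    cutoffs = {"cesd10": 12, "cesd20": 16}
    if scale not in cutoffs:
        raise ValueError("unknown scale: %s" % scale)
    return score >= cutoffs[scale]
```

For example, a CHARLS participant scoring 11 on the CES-D-10 would be classified as outcome-negative, while an OAI participant scoring 16 on the CES-D-20 would be outcome-positive.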

Predictor variables and assessment tools

During the selection of potential predictor variables, we first conducted a literature review to identify variables shown to be important in predicting the outcome and included these literature-supported variables as candidate predictors6,22,23,24,25,27,28,29,32,33,34,35,36,46,47. Eighteen variables from the CHARLS database were included as potential predictors in this study. Specifically, these comprise six sociodemographic factors (gender, age, education level, marital status, income, living alone or not), four KOA-related symptoms (duration of KOA, pain intensity, walking speed, FTSST), and eight other health condition factors (self-reported health status, difficulties with ADL/IADL, comorbidities, history of falls, frequency of sleep problems, smoking status, alcohol consumption status, BMI).

Comorbidities were stratified by number into no comorbidities, one comorbidity, and two or more comorbidities. Sleep problems were classified into three groups according to the frequency of self-reported sleep disturbances in the past week: rarely (< 1 day), sometimes (1–4 days), and always (5–7 days). BMI was categorized according to WHO criteria: normal or underweight (< 25.0 kg/m²) and overweight or obese (≥ 25.0 kg/m²)48. Pain severity was classified using the numeric pain rating scale into mild (≤ 3 points), moderate (4–6 points), and severe (≥ 7 points). Walking speed and the FTSST were both assessed using cutoffs from the 2019 Asian Working Group for Sarcopenia49: walking speed was dichotomized at ≥ 1.0 m/s versus < 1.0 m/s, while FTSST was categorized as ≤ 12 s versus > 12 s.
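These categorizations can be expressed as a small coding function. The following Python sketch applies the cutoffs above; the data preparation in the study was done in R, and the argument and key names here are assumptions.

```python
# Illustrative coding of the study's predictor categorizations;
# argument names and dictionary keys are assumptions.
def categorize(bmi, pain, walk_speed, ftsst_s, sleep_days, n_comorbid):
    return {
        "bmi": "overweight/obese" if bmi >= 25.0 else "normal/underweight",
        "pain": "mild" if pain <= 3 else ("moderate" if pain <= 6 else "severe"),
        "walk_speed": "slow (<1.0 m/s)" if walk_speed < 1.0 else "normal",
        "ftsst": "prolonged (>12 s)" if ftsst_s > 12 else "normal",
        "sleep": ("rarely" if sleep_days < 1
                  else "sometimes" if sleep_days <= 4 else "always"),
        "comorbidities": min(n_comorbid, 2),  # 0, 1, or 2 ("two or more")
    }
```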

Selection of predictive variables

Ideally, a model should predict the outcome with high accuracy using a minimal number of variables. In this study, the Least Absolute Shrinkage and Selection Operator (LASSO) regression algorithm was applied to identify the most efficient input variables. LASSO is a linear regression method that uses L1 regularization: it minimizes the residual sum of squares subject to the constraint that the sum of the absolute values of the regression coefficients is less than a constant, which shrinks some regression coefficients exactly to 0 and yields a more parsimonious model; it is a biased estimator well suited to data with complex collinearity. Compared with conventional regression, the LASSO algorithm helps prevent overfitting and produces a more interpretable, compact, and accurate model50. In this study, 10-fold cross-validation was used to determine the parameter λ. All variables were evaluated, and those with non-zero LASSO regression coefficients were selected as input variables. The final 11 variables were chosen as inputs to develop the model.
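The variable-screening step can be sketched as follows. The study's analyses were implemented in R (Supplementary Table S1), most likely with a glmnet-style binomial LASSO; this Python/scikit-learn sketch uses `LassoCV` with squared-error loss on synthetic data as a stand-in, so both the data and the loss function are illustrative assumptions.

```python
import numpy as np
from sklearn.linear_model import LassoCV

# Synthetic stand-in data: 500 subjects, 10 candidate predictors,
# of which only the first three truly drive the binary outcome.
rng = np.random.default_rng(0)
n, p = 500, 10
X = rng.normal(size=(n, p))
logit = 1.5 * X[:, 0] + 1.0 * X[:, 1] - 1.0 * X[:, 2]
y = (logit + rng.normal(size=n) > 0).astype(float)

# Choose lambda (sklearn's `alpha`) by 10-fold cross-validation;
# predictors whose coefficients are shrunk exactly to zero drop out.
lasso = LassoCV(cv=10, random_state=0).fit(X, y)
selected = np.flatnonzero(lasso.coef_)
```

Note that `LassoCV` returns the λ minimizing cross-validated error; the paper's more conservative λ1se choice would be applied on top of the cross-validation curve.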

Model development

To develop the model for depressive symptom prediction in KOA patients, four ML-based methods were investigated: logistic regression, decision tree, random forest, and artificial neural network (ANN). For details on their implementation and configurable parameters in the R language, refer to Supplementary Table S1. To train the supervised classifiers, the CHARLS dataset was randomly divided in a 70%:30% ratio into a training set (70%) and a testing set (30%).
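The 70/30 partition can be sketched in Python with scikit-learn (the study itself used R). With a 30% test fraction, a 496-patient sample reproduces the 347/149 split reported in the Results; the stratification and placeholder data below are assumptions for illustration.

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Placeholder data sized like the CHARLS modeling sample:
# 496 patients, 11 selected predictors, binary outcome label.
rng = np.random.default_rng(0)
X = rng.normal(size=(496, 11))
y = rng.integers(0, 2, size=496)

# 70% training / 30% testing; stratification (an assumption, not
# stated in the text) preserves the outcome balance in both sets.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.30, stratify=y, random_state=42)
```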

Logistic regression

Logistic regression is widely used in the construction of various risk prediction models due to its simplicity and efficiency51. It is a multiple regression method for analyzing the relationship between a dependent variable and its influencing factors52.

Decision tree

A decision tree is a predictive model expressed as a tree-like structure. Generally, a decision tree contains a root node, several internal nodes, and several leaf nodes; the root and internal nodes each represent a feature or attribute, and each leaf node represents a category53. This study uses the classification and regression tree (CART) algorithm to construct the decision tree.
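A CART-style classifier with Gini-based pruning can be sketched as follows. This Python/scikit-learn example stands in for the R implementation used in the study; the synthetic data and parameter values are illustrative.

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier

# Synthetic illustrative data: 300 samples, 5 features, outcome
# driven by the first two features.
rng = np.random.default_rng(1)
X = rng.normal(size=(300, 5))
y = (X[:, 0] + X[:, 1] > 0).astype(int)

# CART with Gini impurity; `ccp_alpha` applies cost-complexity
# pruning, analogous to the Gini-based pruning described above.
tree = DecisionTreeClassifier(criterion="gini", ccp_alpha=0.01,
                              random_state=1).fit(X, y)
acc = tree.score(X, y)
```

Increasing `ccp_alpha` prunes more aggressively, trading training accuracy for a simpler, less overfit tree.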

Random forest

Random forest is an ensemble algorithm composed of decision trees. Bootstrap samples are drawn from the original data by sampling with replacement, and a random subset of features is selected as input to grow each random decision tree; repeating this process many times produces the forest. When predicting, every decision tree in the forest casts a vote, and the final output category is determined by the mode of the individual trees' outputs54. Compared with a single decision tree, the random forest can learn interactions between features, has good noise resistance, and performs stably55.
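The bootstrap-and-vote scheme above can be sketched with scikit-learn (a Python stand-in for the study's R code); the interaction-driven synthetic data below illustrates the kind of structure a forest handles better than a single tree.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier

# Synthetic data with a feature interaction (x0 * x1 > 0) that a
# single axis-aligned tree captures poorly.
rng = np.random.default_rng(2)
X = rng.normal(size=(400, 11))
y = (X[:, 0] * X[:, 1] > 0).astype(int)

# Bootstrap-sampled trees voting by majority; n_estimators and
# max_features are the two tuning parameters discussed in the paper
# (the values here are illustrative).
forest = RandomForestClassifier(n_estimators=220, max_features=11,
                                random_state=2).fit(X, y)
```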

Artificial neural network

An artificial neural network is an information processing system that simulates the structure and function of biological neural networks56, comprising an input layer, hidden layers, and an output layer. Generally, the number of input-layer neurons corresponds to the number of features, and the number of output-layer neurons matches the number of categories. The number of hidden layers and their neurons is harder to determine and can be optimized through tuning57. The ANN model handles nonlinear data well and has the advantages of strong memory and self-learning ability, but it also has the disadvantages of a black-box nature and poor interpretability54.
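A feed-forward network of this shape can be sketched with scikit-learn's `MLPClassifier` (a Python stand-in for the R implementation); the single hidden layer of 12 neurons with L2 weight decay mirrors the structure ultimately tuned in this study, and the synthetic data are illustrative.

```python
import numpy as np
from sklearn.neural_network import MLPClassifier

# Illustrative data: 300 samples, 11 features (matching the model's
# 11 inputs), binary outcome.
rng = np.random.default_rng(3)
X = rng.normal(size=(300, 11))
y = (X[:, 0] - X[:, 1] > 0).astype(int)

# One hidden layer of 12 neurons; `alpha` is the L2 weight-decay
# regularization parameter.
ann = MLPClassifier(hidden_layer_sizes=(12,), alpha=0.1,
                    max_iter=2000, random_state=3).fit(X, y)
proba = ann.predict_proba(X)
```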

Model performance

The model’s overall performance was evaluated on an independent CHARLS testing set and externally validated using the OAI dataset to ensure generalizability.

The performance of the models was quantified in terms of discriminative performance, calibration, and clinical utility. Discriminative performance was estimated by the area under the receiver operating characteristic curve (AUC), with a higher AUC indicating better discrimination53. To evaluate calibration, a probabilistic calibration curve was used: the closer the curve is to the diagonal (intercept 0, slope 1), the better the calibration. Clinical utility was evaluated with decision curve analysis (DCA), which identifies the model with the greatest net benefit. DCA meets the practical needs of clinical decision-making and integrates the preferences of patients or decision-makers into the analysis, focusing on the benefits offered by models across different threshold probabilities58,59,60. Furthermore, to enhance model interpretability, the most important predictive features were identified from the best-performing model on external validation.
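The net-benefit quantity that DCA plots has a simple closed form. The Python sketch below (function name illustrative) computes it at one threshold probability, treating everyone with predicted risk at or above the threshold as "treat".

```python
def net_benefit(y_true, y_prob, pt):
    """Net benefit at threshold probability pt, as used in DCA:
    NB = TP/n - (FP/n) * pt / (1 - pt)."""
    n = len(y_true)
    tp = sum(1 for y, p in zip(y_true, y_prob) if p >= pt and y == 1)
    fp = sum(1 for y, p in zip(y_true, y_prob) if p >= pt and y == 0)
    return tp / n - (fp / n) * pt / (1 - pt)
```

Sweeping `pt` across a range of thresholds and plotting `net_benefit` for each model, alongside the "treat all" and "treat none" strategies, produces the decision curves compared in the Results.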

Ethics approval

Since the CHARLS and OAI cohorts are openly accessible, the Medical Ethics Board Committee of Peking University granted the study an exemption from review.

Results

Participant characteristics

The initial CHARLS dataset included 17,708 patients. After excluding 17,212 ineligible patients, 496 were retained for modeling, with 347 (70%) in the training set and 149 (30%) in the testing set. The OAI dataset began with 4,796 patients; after excluding 3,681 patients, a final sample of 1,115 was obtained for external validation (Fig. 1). The baseline characteristics of participants in the CHARLS and OAI datasets are detailed in Supplementary Table S2 and Table S3, respectively.

A comparison of baseline characteristics between the CHARLS modeling dataset and the OAI validation dataset (Table 1) revealed that KOA patients in the OAI cohort were older (P < 0.001), had higher educational attainment (high school or above, P < 0.001), and a lower proportion with spouses (P < 0.001). More OAI patients lived alone (P < 0.001), fewer reported poor health (P < 0.001) or ADLs/IADLs difficulties (P < 0.001), and fewer had comorbidities (P < 0.001), though a greater proportion reported falls (P < 0.001). OAI participants also had lower rates of frequent sleep problems (> 5 days/week, P < 0.001), higher rates of smoking and alcohol use (P < 0.001), lower prevalence of long-duration KOA (> 5 years, P < 0.001), and a greater proportion had a walking speed < 1.0 m/s (P < 0.001) and took > 12 s to complete the FTSST (P < 0.001). No significant differences in gender distribution or pain severity were observed between datasets (P > 0.05).

Table 1 Comparison of baseline characteristics between CHARLS and OAI.

Feature selection

Among 18 candidate variables (Table 1), 11 were selected by LASSO regression: gender, education level, income, self-reported health status, difficulties with ADLs/IADLs, history of falls, frequency of sleep problems, smoking status, BMI, pain intensity, and duration of FTSST. The dynamic process of LASSO variable screening is shown in Fig. 2. Each curve in Fig. 2a represents the coefficient trajectory of one variable. As Log(λ) increases, each coefficient gradually shrinks toward 0; the later a coefficient reaches 0, the more important the variable. In this study, variable 18 (frequency of sleep problems) and variable 9 (difficulties with ADLs/IADLs) were the last to be compressed to 0. For each number of variables/Log(λ), Fig. 2b shows the mean and 95% confidence interval of the regression model deviance under 10-fold cross-validation. Deviance measures the deviation of the developed model from the ideal (perfectly fitting) model; the smaller the deviance, the better the goodness of fit. The two dotted lines in Fig. 2b mark two special λ values (λmin and λ1se), and λ values between them are all considered appropriate. Specifically, λmin is the value with the smallest deviance, and λ1se is the value one standard error above λmin. The λ1se model was finally selected by jointly considering accuracy and simplicity. The LASSO regression coefficient of each variable at λ1se is shown in Supplementary Table S4.

Fig. 2

LASSO regression screening variable dynamic process diagram.

(a) trajectory (b) the deviation confidence interval.

Model development

Logistic regression

The parameters of the logistic regression model are shown in Table 2. Specifically, the model expression is:

$$\begin{aligned} \ln\left(\frac{p}{1-p}\right) ={} & -1.548 + 0.380 \times \text{gender} - 0.338 \times \text{education level} \\ & + 0.268 \times \text{difficulty with ADLs/IADLs} + 0.214 \times \text{frequency of sleep problems} \\ & + 0.118 \times \text{pain intensity} + 0.349 \times \text{smoking status} \\ & + 0.135 \times \text{self-reported health status} + 0.167 \times \text{duration of FTSST} \\ & + 0.223 \times \text{history of falls} - 0.169 \times \text{income} - 0.200 \times \text{BMI}. \end{aligned}$$
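A fitted logistic model of this form can be applied directly by converting the linear predictor (log-odds) to a probability. The Python sketch below transcribes the coefficients from the equation above; the numeric coding of each predictor (0/1 indicators or ordinal category levels) is an assumption following the categorizations in the Methods.

```python
import math

# Intercept and coefficients transcribed from the fitted model above;
# predictor coding (0/1 or ordinal levels) is an assumption.
INTERCEPT = -1.548
COEF = {
    "gender": 0.380, "education_level": -0.338,
    "adl_iadl_difficulty": 0.268, "sleep_problems": 0.214,
    "pain_intensity": 0.118, "smoking_status": 0.349,
    "self_reported_health": 0.135, "ftsst_duration": 0.167,
    "history_of_falls": 0.223, "income": -0.169, "bmi": -0.200,
}

def predicted_probability(x):
    """Convert the linear predictor (log-odds) to a risk probability."""
    logit = INTERCEPT + sum(COEF[k] * x.get(k, 0) for k in COEF)
    return 1.0 / (1.0 + math.exp(-logit))
```

For instance, with all predictors coded 0 the baseline risk is 1 / (1 + e^1.548), roughly 0.175, and raising any positive-coefficient predictor increases the predicted risk.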

In the training stage, the logistic regression model using all 11 variables showed a sensitivity of 0.924, a specificity of 0.163, an accuracy of 0.624 (95% CI 0.614–0.634), and an AUC of 0.607 (95% CI 0.595–0.619).

Table 2 Parameters of logistic regression model.

Decision tree

The decision tree model was developed from the selected variables and pruned according to the Gini coefficient to reduce complexity and overfitting. The decision tree formed at the minimum Gini coefficient is shown in Fig. 3. The first major split separates patients with sleep problems (sometimes or always) from those without. Fall history forms the second level: among patients who had fallen, those with ADLs/IADLs difficulty, per capita household income below or above the median level, mild to moderate pain, and poor self-rated health had a higher risk of depressive symptoms; among patients who had not fallen, those with BMI < 25.0 kg/m², female gender, smoking, or per capita household income below the median had a higher risk of depressive symptoms. Using 10-fold cross-validation on the training set to evaluate performance, the overall accuracy of the decision tree model was 0.656, and the sensitivity, specificity, and AUC were 0.245, 0.923, and 0.607 (95% CI 0.596–0.619), respectively.

Fig. 3

Decision tree model of risk of KOA with depressive symptoms.

Random forest model

An optimal random forest model was developed by tuning two important parameters: the number of decision trees and the number of features. As shown in Fig. 4, the overall misjudgment rate was lowest (0.135) when the number of decision trees was 220. Figure 5 shows the accuracy of the model for different numbers of extracted features with the number of trees fixed at 220; accuracy was highest with 11 features. 10-fold cross-validation on the training set yielded a high AUC of 0.939 (95% CI 0.934–0.945), with accuracy, sensitivity, and specificity of 0.874, 0.917, and 0.808, respectively.

Fig. 4

Misjudgment rate of random forest model with different numbers of decision trees.

Fig. 5

Accuracy of random forest model with different numbers of selected features.

Artificial neural network

An ANN model with an "11-12-1" structure was developed in this study: "11" is the number of input neurons (the 11 input variables), "12" is the number of hidden-layer neurons, and "1" is the number of output neurons, indicating whether depressive symptoms occur. Figure 6 shows the accuracy of the model for different numbers of hidden-layer neurons and weight decay parameters; accuracy was highest with 12 hidden neurons and a weight decay parameter of 0.1. In the training stage, the ANN model showed good discriminatory performance, with an AUC of 0.877 (95% CI 0.870–0.884) and accuracy, sensitivity, and specificity of 0.803, 0.877, and 0.689, respectively.

Fig. 6

Accuracy of the ANN model with different numbers of hidden-layer neurons and weight decay parameters.

Model performance

Discriminatory power

The performance of the four models was evaluated on the testing set. As shown in Fig. 7, the AUC of the logistic regression model is 0.622 (95% CI 0.603–0.641), showing a certain degree of discriminative power. The AUC of the decision tree is 0.611 (95% CI 0.593–0.630), slightly lower than that of logistic regression. The ANN model shows good discriminatory performance with an AUC of 0.868 (95% CI 0.857–0.879), higher than logistic regression and the decision tree (P < 0.001). The AUC of the random forest is 0.928 (95% CI 0.920–0.937), the best discriminatory performance (P < 0.001).

In addition, the accuracy, sensitivity, and specificity of the models were evaluated (Table 3). Among the four models, the random forest shows the highest accuracy (0.856), followed by the ANN (0.786), the decision tree (0.654), and the logistic regression (0.627). Logistic regression has the highest sensitivity (0.927), while the random forest's sensitivity also exceeds 0.9 (0.904). In terms of specificity, the decision tree is the highest (0.922), followed by the random forest (0.786).

Fig. 7

ROC curve of models.

Table 3 Model performance in CHARLS testing set.

Calibration & clinical utility

Calibration was evaluated with the probability calibration curve, which showed good calibration for all models (Fig. 8). The closer the curve is to the diagonal, the better the model's calibration, i.e., the closer the predicted probabilities are to the observed outcomes. The calibration curves of the ANN and decision tree were closest to the diagonal, followed by the random forest and the logistic regression.

Fig. 8

Probabilistic calibration curves of models.

In addition, DCA was used to evaluate clinical utility. As shown in Fig. 9, when the threshold probability was in the range 0.2–0.9, the clinical utility of the random forest was the highest; in the ranges 0–0.2 and 0.9–1.0, the ANN was the highest, while the logistic regression and decision tree were consistently lower. Considering real clinical practice, the threshold probability for intervening on depressive symptoms in KOA patients is most likely to fall within 0.2–0.9, in which range decision-making with the random forest model yields a higher net benefit than the others.

Fig. 9

Decision curve of models.

Through comprehensive evaluation of discriminative performance, calibration, and clinical utility among the four models, the random forest model was optimal on the internal testing set. The optimal model was then externally validated on the OAI dataset, initially showing limited discrimination with an AUC of 0.539 (95% CI 0.528–0.550). After adjusting the random forest parameters by setting the number of decision trees to 140 and the number of randomly selected features to 9, the AUC improved markedly to 0.877 (95% CI 0.864–0.889), demonstrating strong discrimination and calibration.

Feature importance ranking of predictive variables

The feature importance of the optimal model was ranked according to each variable’s impact on prediction accuracy. The results showed that pain severity was the most significant predictor, followed by the duration of FTSST and sleep problems. Other key features included smoking status, fall history, gender, ADL/IADL difficulty, income, BMI, and education level.

Discussion

Principal results

Based on a representative cohort in China, we developed four ML models for depressive symptoms in KOA patients from a variety of easily accessible potential predictors, including sociodemographics, KOA symptom-related data, and general health condition data. The developed ML models achieved clinically useful identification of individuals at high risk of depressive symptoms in KOA, with the random forest model performing best. To our knowledge, this is the first study in China to use ML methodologies to predict depressive symptoms in patients with KOA while comprehensively considering demographic, KOA symptom-related, and general health condition factors. It is also the first study to use an ANN to predict the risk of depressive symptoms in KOA.

In this study, we used routinely available demographic and clinical data to develop the model and identified only 11 key predictive features through the LASSO method, which increased the simplicity and practicability of the model compared with previous studies. LASSO was used to comprehensively screen variables across a broad range of sociodemographic, KOA symptom-related, and general health condition factors, retaining 11 predictive features: gender, education level, per capita household income, self-rated health status, ADLs/IADLs difficulty, fall history, sleep problems, smoking status, BMI, pain degree, and duration of FTSST. Compared with the least squares estimation of traditional regression, the LASSO method can effectively mitigate overfitting61. In addition, it yields a more parsimonious model with higher prediction accuracy at the cost of some estimation bias, which is especially suitable for machine learning models: feeding too many uninformative variables into an ML model greatly increases its complexity, hinders algorithm convergence, and increases computation time62,63. The excellent performance of the LASSO method in variable screening and model stability has been verified in many disease prediction fields, such as cancer, cardiovascular disease, and perinatal health64,65,66,67.

With the development of ML algorithms, ML methods are widely used in the field of disease prediction. Based on CHARLS data from 2011 to 2015, this study used logistic regression, decision tree, random forest, and ANN to construct the risk prediction model. The developed models achieved clinically acceptable discrimination between depressed and non-depressed individuals, with the random forest model demonstrating the highest predictive performance. In recent years, the random forest method has been widely used for classification problems in digital health technology and plays an important role in augmenting clinical diagnosis33. Our findings demonstrate the potential of this method for complex classification problems such as KOA combined with depressive symptoms.

Comparison with prior work

At the broadest level, while many studies have examined the association of KOA with depressive symptoms, few have applied ML methods to predict the risk. Sayre et al.18 applied logistic regression to develop a prediction model based on longitudinal cohort data, with clinically acceptable performance (AUC = 0.742). However, the model was developed with conventional statistical methods, had a small sample, and lacked external validation, so it is doubtful whether its predictive performance is stable and replicable in other KOA patients. Nowinka et al.40 applied six ML prediction models to predict depressive symptoms in patients with KOA, but the prevalence of depressive symptoms in their sample was much lower than in previous literature, calling the representativeness of the model into question, and their patients were predominantly from white ethnic backgrounds. To our knowledge, this is the first study in China to use ML methodologies to predict depressive symptoms in patients with KOA, and the first study to use an ANN to predict this risk.

There are several strengths in this study. First, a sizable number of participants from a random, nationwide sample were included, giving the models relatively high accuracy and representativeness. Second, the robustness and generalizability of the developed models were reinforced by both internal and external validation. Lastly, only 11 crucial features were included, all easily accessible, demonstrating the simplicity of our approach and the ease of widespread application in primary health care.

Limitations

As with all studies, the present study has some limitations. For the random forest model, one limitation is that interpretation is constrained by the ensemble, black-box nature of the algorithm. Another is that applying the model requires practitioners to have some programming ability, which limits its application to a certain extent. In future studies, the risk prediction model constructed here could be deployed as a web tool or embedded in medical-care information systems to improve convenience of use. In addition, although the utility of the model was analyzed by decision curve analysis, its cost-effectiveness needs further analysis through more clinical studies.

Conclusions

In conclusion, focusing on the identification of depressive symptoms in KOA, this study proposes a model for predicting the risk of depressive symptoms in KOA patients. The model is developed from various easily accessible potential predictors, such as demographic information, KOA symptom-related data, and general health status data, achieving an external-validation AUC of 0.877. Compared with previous methods, this model demonstrates outstanding performance. Notably, this is the first study employing an ANN to predict the risk of depressive symptoms in KOA patients. As the first multi-factor, externally validated ML model based on longitudinal cohort data in China, it can aid healthcare professionals in early identification of depressive symptom risk among KOA patients, thereby optimizing personalized preventive strategies in healthcare.