Introduction

Shoulder pain is a prevalent condition in therapeutic practice, with an incidence rate of approximately 10 per 1000 patients in medical care1,2 and a prevalence of 12% in physiotherapy settings3. It is a common musculoskeletal issue that can significantly impact daily activities and quality of life4. Non-specific shoulder pain, characterized by persistent discomfort without a clear underlying pathology, presents a complex challenge for clinicians. This condition often arises from a combination of factors such as poor posture, repetitive movements, muscle imbalances, and inadequate scapular stabilization, all of which contribute to functional limitations5,6.

Scapular stabilization exercises (SSE) have been widely recognized as an effective intervention for managing non-specific shoulder pain. These exercises aim to enhance shoulder function by improving the balance of the scapular muscles, correcting scapular dyskinesis, and restoring proper force coupling7,8. Scapular stabilization can result in significant improvements in pain reduction, muscle strength, and overall shoulder function9. However, despite their demonstrated efficacy, there remains a critical need to optimize the implementation of these exercises. Variations in patient responses and the challenges in individualizing exercise programs highlight the necessity for further investigation into the most effective strategies for different populations10,11.

While scapular stabilization exercises are generally effective, their impact can vary significantly across individuals and populations. This variability highlights a critical need for more precise, data-driven approaches that can objectively evaluate which patients are most likely to benefit from these interventions. In certain cases, these exercises may be less effective or even counterproductive, underscoring the importance of using machine learning to identify patient characteristics that predict positive outcomes. By tailoring treatment in this way, clinical practice can move toward a more personalized and efficient approach to rehabilitation12,13.

In this context, the integration of machine learning techniques offers a promising avenue for advancing the evaluation and optimization of scapular stabilization exercises. Machine learning has already demonstrated its potential in analyzing complex data sets and providing insights that can improve clinical decision-making. By applying these techniques to musculoskeletal disorders, researchers can uncover biomechanical patterns and relationships that may otherwise remain obscured, ultimately leading to more effective and personalized treatment strategies14,15.

Given the increasing incidence of musculoskeletal issues among college students—attributed to sedentary lifestyles and academic demands16—it is particularly important to investigate the role of scapular stabilization exercises in this population. The current study aims to explore the use of machine learning to enhance the implementation of these exercises for non-specific shoulder pain in college students. By integrating computational analytics with traditional rehabilitation approaches, the study seeks to improve patient outcomes and contribute to the broader understanding of musculoskeletal health management.

Problem statement

Non-specific shoulder pain is a common and debilitating condition, particularly prevalent among college students due to factors such as poor posture and prolonged sedentary behavior. While scapular stabilization exercises (SSE) are widely recognized as an effective intervention for improving scapular control and reducing shoulder pain, there is a significant challenge in optimizing their implementation to account for individual patient differences. Traditional approaches to prescribing these exercises often rely on subjective measures and general protocols, which may not yield consistent results across diverse populations. Furthermore, there is a lack of a systematic, data-driven approach to personalize SSE interventions to maximize their therapeutic outcomes. This research aims to address these gaps by leveraging advanced machine learning techniques to predict and optimize the effectiveness of scapular stabilization exercises in treating non-specific shoulder pain among college students.

Research question

How can machine learning techniques be applied to optimize scapular stabilization exercises for the treatment of non-specific shoulder pain, and what machine learning models and optimization techniques are most effective in predicting treatment outcomes among college students?

Research gap

While the effectiveness of scapular stabilization exercises in managing shoulder pain has been well-documented, the current literature lacks comprehensive studies that apply machine learning to enhance and individualize these interventions. Previous research has primarily focused on standardized exercise regimens, which may not account for the variability in patient responses. Furthermore, there is limited exploration of how different machine learning techniques, such as regression models and optimization algorithms, can be used to fine-tune exercise protocols for optimal outcomes. This study addresses this gap by integrating machine learning into the evaluation and optimization of scapular stabilization exercises, offering a more personalized and data-driven approach to shoulder pain management.

Contributions

The main contributions of this research can be summarized as follows:

  • Application of Machine Learning for SSE Optimization: This study utilizes a range of regression models, including Gamma Regressor, Tweedie Regressor, and Poisson Regressor, to predict the outcomes of scapular stabilization exercises based on patient-specific data. These models provide a more objective and precise way of evaluating the effectiveness of SSE interventions.

  • Optimization of Exercise Protocols: Through the use of hyperparameter optimization algorithms such as Hyperopt, Scikit-Optimize, and Optuna, this research identifies the optimal exercise parameters for individual patients, significantly improving the personalization of treatment plans.

  • Evaluation of Machine Learning Models: The study compares the performance of different machine learning and optimization techniques for predicting patient outcomes. Among these, Scikit-Optimize demonstrated the best performance, achieving a high R2 score of 0.8501, indicating its effectiveness in capturing the relationship between SSE interventions and pain reduction.

  • Advancing Personalized Rehabilitation: By integrating machine learning into the rehabilitation process, this study pioneers a personalized approach to treating non-specific shoulder pain in college students, offering a data-driven methodology that could enhance patient outcomes in clinical practice.

  • Contribution to Musculoskeletal Health Management: This research underscores the potential of machine learning in transforming the assessment and treatment of musculoskeletal disorders, providing a framework that could be extended to other therapeutic interventions beyond scapular stabilization.

Materials and methods

Trial design

This trial was designed as an observational, cross-sectional study. The study was approved by the Ethical Committee at Deraya University (No: 19/2023) and complies with the principles for human research set out in the Declaration of Helsinki17. Each patient signed a written consent form after receiving a thorough description of the trial. The study was conducted at the Deraya outpatient clinic from April 20, 2023, to July 25, 2023.

The sample size

To minimize the risk of Type II errors (failing to detect a statistically significant difference when one exists), an a priori sample size calculation was conducted using G*Power software, specifically tailored for the Wilcoxon-Mann-Whitney test18,19. This computation was informed by the following statistical parameters:

  • Effect Size (d): 0.5, indicating a moderate expected difference between groups.

  • Effect Size for Wilcoxon-Mann-Whitney (dz): 0.5, consistent with the chosen effect size.

  • Power (1-β): 0.95, aiming for high confidence in detecting significant differences.

  • Significance Level (α): 0.05 (two-tailed), maintaining a balanced approach to error control.

The calculation revealed that a minimum sample size of 40 participants was necessary to ensure that the confidence intervals for the agreement between any two measures would be sufficiently precise, approximating half a standard deviation of their discrepancies.

Notably, the actual dataset for this study comprises 85 patients. This substantial excess in sample size provides several assurances:

  1. Enhanced Statistical Power: The study is well-equipped to detect statistically significant differences between measures with a high degree of confidence.

  2. Reliability and Generalizability of Findings: The larger sample size contributes to more reliable and genuine results, enhancing the study’s external validity and the potential for generalizing the conclusions to similar populations.

  3. Robustness Against Type II Errors: The ample sample size effectively mitigates the risk of failing to identify significant effects, should they exist, thereby bolstering the study’s overall methodological rigor.

Participants

Eighty-five college students complaining of non-specific unilateral shoulder pain lasting more than 6 weeks were included in this study. Their ages ranged from 18 to 25 years, and the diagnosis was confirmed by a consultant orthopedic surgeon if the patient exhibited at least two of the following: (1) a painful arc during flexion or abduction between 60° and 120°; (2) a positive Neer test; (3) painful resisted external rotation or abduction; (4) type 1 or type 2 scapular dyskinesis.

Exclusion criteria

MRI or ultrasound confirmation of torn rotator cuff tendons (partial or full-thickness tear); inability to lift the arm to 90° of abduction; a cervical range of motion that reproduced shoulder pain; pain below the elbow indicative of cervical or nerve pathologies; past shoulder surgery; glenohumeral joint arthritis, as indicated in the report; acute trauma (fractures, traumatic dislocations); and inflammatory arthritis.

Outcome measures

Assessment of pain

The visual analog scale (VAS) is a one-dimensional measure of pain intensity, used here to compare patients’ pain severity before and after scapular stabilization treatment. The VAS has excellent test-retest reliability (r = 0.94) and correlates highly with other pain measurement tools20.

Assessment Acromion-humeral distance (AHD)

Two ultrasound images were collected, with the arm at rest and at 90° of active abduction, before and after the exercise sessions; the subject sat with the elbow flexed to 90° and supported on the thigh. The outlet of the sub-acromial space was measured on 2-D ultrasound images via the AHD, defined as the shortest distance between the acromion and the humerus. An ultrasound unit (Mindray DP-10) with a 10 MHz linear transducer was utilized. Transducer placement was standardized: posterior to the middle portion of the acromion in the coronal plane, parallel to the flat superior aspect of the acromion, so that both the acromion and the humerus were visualized21. Ultrasound is less costly and more practical than MRI as an imaging modality and has established concurrent validity with radiographic AHD measures (r = 0.77–0.85)22,23.

Interventions

Scapular stabilization exercise intervention

From the sitting position: shoulder shrugs at 0° and 30° of abduction, scapular retraction “W” exercises against a handheld Theraband, external rotation with the elbow at 90° of flexion, and external rotation with forward flexion against a handheld Theraband. From the prone lying position: external rotation at 90° of abduction with the elbow at 90°, using a dumbbell. All exercises were performed for 10 repetitions per set, 3 sets per day, 3 days per week, for 4 weeks8.

Related work

In this section, we review the related work. We examine studies that have investigated the impact of scapular exercise interventions on shoulder conditions, shedding light on the effectiveness of these exercises in managing ailments such as subacromial impingement syndrome (SIS) and non-specific shoulder pain; collectively, these studies have enriched our understanding of the pivotal role that scapular stabilization exercises play in shoulder dysfunction rehabilitation and the alleviation of related discomfort. Scapular dyskinesis and weakness are increasingly recognized as contributing factors to SIS, and several studies have investigated the effects of scapular-focused exercises in its treatment. Table 1 summarizes the key findings from several relevant articles on this topic.

Table 1 Summary of the key findings from several relevant articles on SIS.

Methodology

Regression techniques

Table 2 presents an overview of the diverse regression models and techniques, together with their associated regularizations and cross-validation strategies, commonly employed in machine learning and statistical analysis. These models serve as fundamental building blocks for evaluating and predicting complex relationships within datasets. By describing each model’s characteristics, suitable data types, advantages, and potential limitations, the table serves as a guide for selecting regression methodologies suited to specific research objectives and dataset requirements, from traditional linear regression techniques to advanced ensemble methods and robust regression algorithms.

Table 2 Summary of the regression techniques used in the study.

Dataset characteristics

The data form a table whose columns describe each individual: sex, age, weight, BMI (Body Mass Index), Pretreatment SAS (Self-Appraisal Scale) zero, Pretreatment SAS 90, VAS pre, VAS post, and Posttreatment SAS 90 scores. The dataset contains the following columns:

  • Sex: This variable indicates the gender or biological sex of the patient and can be categorized as male or female. In some cases, it might also include options for other gender identities.

  • Age: Age represents the patient’s chronological age, typically measured in years. It is a fundamental demographic variable used to understand how age might impact various health-related factors.

  • Weight: This variable denotes the patient’s body weight, often measured in kilograms (or pounds in some regions). Weight is crucial for assessing overall health and monitoring changes during treatment.

  • BMI (Body Mass Index): BMI is a calculated value that relates to a person’s weight and height. It is used to assess whether a person is underweight, normal weight, overweight, or obese. The formula for calculating BMI is (weight in kilograms) / (height in meters squared) or (weight in pounds) / (height in inches squared) multiplied by 703.

  • Pretreatment SAS Zero: The SAS (Self-Rating Anxiety Scale) is a psychological assessment tool used to measure a person’s level of anxiety. “Pretreatment SAS Zero” might refer to the patient’s anxiety score before any treatment or intervention, with “zero” indicating the baseline measurement.

  • Pretreatment SAS 90: Similar to “Pretreatment SAS Zero,” this variable represents the patient’s anxiety score before any treatment or intervention, but the “90” suggests it could be a specific time point or measurement related to the treatment process. The SAS scale is typically scored out of 100.

  • VAS pre: VAS (Visual Analog Scale) is a tool used to measure subjective experiences, such as pain or discomfort. “VAS pre” likely refers to the patient’s self-reported assessment on this scale before any treatment.

  • VAS post: Similar to “VAS pre,” this variable represents the patient’s self-reported assessment on the Visual Analog Scale after the treatment or intervention has been administered.

  • Posttreatment SAS 90: This variable denotes the patient’s anxiety score after receiving treatment or intervention, with “90” potentially indicating a specific time point or measurement related to the post-treatment phase. It assesses the level of anxiety following the treatment.

This dataset contains information on 85 individuals, including their demographics (sex and age), physical attributes (weight and BMI), and self-appraisal scores before and after a treatment or intervention. The specific nature of the treatment or intervention, and the meaning of the self-appraisal scores (SAS), require additional context to fully understand the dataset’s purpose and analysis.
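The BMI formula given above translates into a small helper; this is a minimal illustration (the function names are ours, not part of the study's code):

```python
def bmi_metric(weight_kg: float, height_m: float) -> float:
    """BMI from metric units: weight (kg) / height (m) squared."""
    return weight_kg / height_m ** 2

def bmi_imperial(weight_lb: float, height_in: float) -> float:
    """BMI from imperial units: 703 x weight (lb) / height (in) squared."""
    return 703 * weight_lb / height_in ** 2

print(round(bmi_metric(70, 1.75), 2))  # 22.86, in the "normal weight" range
```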

Table 3; Fig. 1 provide a comprehensive summary of various features, including descriptive statistics that offer insights into the central tendencies, variability, and distribution of each feature within the dataset. These features encompass a range of information, such as demographic attributes (sex, age), physical characteristics (weight (Kg), BMI), and self-appraisal scores both before and after a treatment or intervention (Pretreatment SAS zero, Pretreatment SAS 90). By examining the mean, median, standard deviation, and percentiles (25th, 50th, and 75th) for each feature, we can gain a better understanding of the dataset’s characteristics and the distribution of the variables. This table serves as a valuable reference point for analyzing and interpreting the dataset, shedding light on the central tendencies and variability within the dataset’s key attributes.
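Statistics of this kind (mean, standard deviation, and the 25th/50th/75th percentiles) can be produced for such a dataset with pandas; the records below are invented purely for illustration:

```python
import pandas as pd

# Hypothetical records mimicking the dataset's columns (values are illustrative).
df = pd.DataFrame({
    "Sex": [0, 1, 0, 1, 1],            # 0 = female, 1 = male
    "Age": [19, 21, 22, 20, 24],
    "Weight (Kg)": [62, 80, 71, 68, 90],
    "BMI": [21.4, 25.1, 23.0, 22.2, 27.8],
    "Pretreatment SAS zero": [9.1, 8.4, 8.9, 9.5, 8.0],
    "Pretreatment SAS 90": [7.2, 6.8, 7.0, 7.5, 6.1],
})

# Count, mean, std, min, 25th/50th/75th percentiles, and max per feature.
stats = df.describe()
print(stats.loc[["mean", "50%", "std"]])
```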

Table 3 Descriptive statistics of the dataset features.
Fig. 1
figure 1

Correlation between dataset features.

Table 4 shows the relationships between the numerical variables in the dataset. Each row and column of the matrix represents a continuous variable, and the Pearson r value at their intersection reflects the strength and direction of the correlation between the two variables: a coefficient close to 1 indicates a strong positive correlation, a coefficient close to -1 a strong negative correlation, and a coefficient close to 0 little or no association. We observe that most features are significantly correlated.

Table 4 The correlation heat map of the proposed framework.
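A Pearson correlation matrix of this kind can be computed with numpy's corrcoef; a minimal sketch on invented data (the variables and their relationships are ours):

```python
import numpy as np

rng = np.random.default_rng(42)
age = rng.uniform(18, 25, 85)
weight = 40 + 2.0 * age + rng.normal(0, 1, 85)  # weight loosely tied to age
bmi = weight / 1.7 ** 2                          # fixed height for the sketch

# Rows and columns correspond to the three variables, in order.
corr = np.corrcoef(np.vstack([age, weight, bmi]))
print(np.round(corr, 2))
```

The diagonal is always 1 (each variable correlates perfectly with itself), and since BMI here is a linear rescaling of weight, their off-diagonal coefficient is also 1.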

Preliminaries

Hyperparameter tuning involves the search for an optimal set of parameters that can significantly enhance the precision and accuracy of a model49,50,51. This process is known to be one of the most challenging aspects of developing machine learning models. Predicting the ideal hyperparameters during the initial model construction is exceedingly difficult. The fundamental objective of hyperparameter tuning is to identify the optimal configuration for a model’s parameters, thereby achieving a superior performance level, as illustrated in Fig. 2.

Fig. 2
figure 2

Hyperparameter Tuner52.

The depicted figure illustrates the separation of the “hyperparameter tuner” from the model, highlighting that the tuning occurs before the model training phase. The output of the tuning process is the identified optimal values of hyperparameters, which are subsequently utilized during model training.

Hyperparameter optimization constitutes a vital phase in the training of Machine Learning models. Given the numerous parameters that require optimization, coupled with extended training durations and the necessity of implementing multiple folds to prevent information leakage, the process can become a laborious undertaking.
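As a concrete illustration of the tuner-before-training workflow described above, the sketch below grid-searches a Ridge regularization strength with cross-validated folds to avoid information leakage; it uses scikit-learn on synthetic data and is not the study's actual protocol:

```python
import numpy as np
from sklearn.datasets import make_regression
from sklearn.linear_model import Ridge
from sklearn.model_selection import GridSearchCV, KFold

X, y = make_regression(n_samples=85, n_features=6, noise=10.0, random_state=0)

# The tuner runs before the final training phase: each candidate alpha is
# scored with 5-fold cross-validation so no test information leaks into tuning.
search = GridSearchCV(
    Ridge(),
    param_grid={"alpha": [0.01, 0.1, 1.0, 10.0, 100.0]},
    cv=KFold(n_splits=5, shuffle=True, random_state=0),
    scoring="neg_mean_squared_error",
)
search.fit(X, y)

# The identified optimal hyperparameter is then used for model training.
best_alpha = search.best_params_["alpha"]
final_model = Ridge(alpha=best_alpha).fit(X, y)
print(best_alpha)
```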

Hyperopt

Hyperopt is a popular open-source Python library designed for the optimization of hyperparameters in machine learning models. Created by James Bergstra, Brent Komer, and a team of contributors, Hyperopt is specifically tailored for hyperparameter optimization, employing a Bayesian optimization technique. Its primary objective is to identify the most optimal set of hyperparameters, thereby maximizing the overall performance of machine learning models, as depicted in Fig. 3.

Fig. 3
figure 3

A representative architecture of HyperOpt53.

Key features and concepts

  • Bayesian Optimization: Hyperopt employs Bayesian optimization, a probabilistic model-based optimization technique that efficiently explores and exploits the hyperparameter space. This approach uses a surrogate model to estimate the objective function and its uncertainty, enabling the optimizer to make informed decisions about which hyperparameters to explore next.

  • Tree-structured Parzen Estimator (TPE): Hyperopt utilizes the TPE algorithm to guide the search for optimal hyperparameters. TPE is known for its efficiency in finding the best configurations by effectively balancing exploration and exploitation.

  • Parallel and Distributed Optimization: Hyperopt supports parallel and distributed optimization, allowing users to harness the power of multi-core processors and distributed computing clusters. This can significantly speed up the hyperparameter search process.

  • Extensive Search Space: Hyperopt can handle a wide range of hyperparameters, making it suitable for optimizing a variety of machine learning models, including deep learning neural networks, support vector machines, and gradient boosting machines.

Use cases

Hyperopt is commonly applied to fine-tune hyperparameters in machine learning models, improving their performance and generalization. It is widely used in the fields of data science, deep learning, and machine learning research to automate and optimize the model selection process.

Scikit-optimize - forest_minimize

Description

Scikit-Optimize, often referred to as skopt, is a Python library designed for hyperparameter optimization. The forest_minimize function in Scikit-Optimize54 is a specific optimizer that utilizes Bayesian optimization techniques.

Key features and concepts

  • Surrogate Models: Scikit-Optimize approximates the objective function with surrogate models; gp_minimize uses Gaussian processes, while forest_minimize uses ensembles of regression trees (random forests or extremely randomized trees). These surrogate models are computationally efficient and enable the optimizer to make informed decisions about where to sample next.

  • Acquisition Functions: The optimizer relies on acquisition functions (e.g., Probability of Improvement or Expected Improvement) to guide the search. These functions help strike a balance between exploring uncharted regions of the hyperparameter space and exploiting regions that appear promising.

  • Customizable Search Space: Scikit-Optimize offers flexibility in defining the search space for hyperparameters. Users can specify both continuous and categorical hyperparameters and set their bounds.

Use cases

Scikit-Optimize, including forest_minimize, is a valuable tool for optimizing machine learning models, as well as other applications involving hyperparameter tuning. It applies to various domains, including natural language processing, computer vision, and scientific research.

Optunity

Description

Optunity55 is a Python library for hyperparameter optimization and model selection. It combines various optimization techniques, such as grid search, random search, and Bayesian optimization, to find the best hyperparameters for a given problem.

Key features and concepts

  • Fitness Function: Optunity is highly customizable and works with user-defined fitness functions. These fitness functions encapsulate the specific problem that needs to be optimized. Optunity aims to either maximize or minimize these functions.

  • Versatility: Optunity is not limited to machine learning model hyperparameter tuning. It can be applied to a wide range of optimization problems, making it a versatile tool for researchers and practitioners.

  • Hybrid optimization: Optunity can employ different optimization algorithms, including grid search, random search, and Bayesian optimization, to explore the hyperparameter space. This adaptability allows it to strike a balance between computational efficiency and accuracy.

Use cases

Optunity is used in applications that require optimization across various domains, including machine learning, scientific research, and engineering. It can be applied to optimize model hyperparameters, experimental setups, and more.

GPyOpt

Description

GPyOpt is a Python library designed for Bayesian optimization. It leverages Gaussian processes as surrogate models to approximate the objective function. It is particularly useful for optimizing complex and expensive-to-evaluate functions.

Key features and concepts

  • Gaussian processes: GPyOpt relies on Gaussian processes to model the objective function56. These probabilistic models provide not only point estimates but also uncertainty estimates, making them well-suited for black-box optimization problems.

  • Acquisition functions: Similar to other Bayesian optimization libraries, GPyOpt employs acquisition functions (e.g., Probability of Improvement) to determine which points to sample next. These functions consider the trade-off between exploration and exploitation.

  • Probabilistic optimization: GPyOpt excels in handling noisy and uncertain objective functions. It estimates both the mean and variance of the objective function, allowing for more informed decisions during optimization.

Use cases

GPyOpt is commonly used in scenarios where the objective function is expensive to evaluate, such as optimizing hyperparameters for deep learning models, engineering design, and scientific experiments.
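As a library-agnostic illustration, the loop that GPyOpt automates (a Gaussian-process surrogate plus an Expected Improvement acquisition function) can be sketched with scikit-learn and scipy; this is our own minimal reimplementation of the idea, not GPyOpt's API:

```python
import numpy as np
from scipy.stats import norm
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import Matern

def objective(x):
    return (x - 2.0) ** 2  # toy expensive function, minimum at x = 2

rng = np.random.default_rng(0)
X = rng.uniform(-5.0, 5.0, size=(5, 1))  # initial design points
y = objective(X).ravel()
grid = np.linspace(-5.0, 5.0, 201).reshape(-1, 1)

for _ in range(15):
    # Fit the GP surrogate to all evaluations so far (alpha adds jitter
    # so near-duplicate points do not make the kernel matrix singular).
    gp = GaussianProcessRegressor(kernel=Matern(nu=2.5),
                                  alpha=1e-6, normalize_y=True).fit(X, y)
    mu, sigma = gp.predict(grid, return_std=True)
    # Expected Improvement over the current best (we are minimizing).
    imp = y.min() - mu
    z = imp / np.maximum(sigma, 1e-9)
    ei = imp * norm.cdf(z) + sigma * norm.pdf(z)
    x_next = grid[np.argmax(ei)]          # sample where EI is largest
    X = np.vstack([X, x_next])
    y = np.append(y, objective(x_next[0]))

x_best = X[np.argmin(y), 0]
print(x_best)  # close to the true minimizer x = 2
```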

Optuna

Description

Optuna57 is a Python library known for its simplicity, flexibility, and efficiency in automated hyperparameter optimization. It was developed by the Preferred Networks team to facilitate the optimization of machine learning and deep learning models.

Key features and concepts

  • Tree-structured Parzen Estimator (TPE): Optuna uses TPE, a Bayesian optimization algorithm, to guide the hyperparameter search. TPE efficiently balances exploration and exploitation in the search process.

  • Study and Trial: In Optuna, optimization problems are organized into studies, each of which contains multiple trials. Each trial represents an attempt to optimize the objective function by sampling different hyperparameter configurations.

  • Pruning: Optuna employs pruning techniques to discard unpromising trials, saving computational resources. This is especially useful when optimizing over a large search space.

  • Parallel and Distributed Optimization: Optuna supports parallel and distributed optimization, enabling users to harness multiple processors or even entire computing clusters to accelerate the optimization process.

Use cases

Optuna is extensively used for automating the hyperparameter tuning process in machine learning and deep learning applications. Its simplicity and efficiency make it accessible to both beginners and experts in the field of artificial intelligence and data science.

The proposed framework

Figure 4 provides a visual representation of the proposed prediction model’s overall structure, encompassing the prediction process and performance evaluation metrics. Figure 5 presents the pseudocode for the implemented optimizers.

Fig. 4
figure 4

The general framework of the proposed prediction model.

The key components and processes involved in the model include data loading and preprocessing, data splitting, hyperparameter optimization, model training, model evaluation, and hyperparameter tuning. The objective function for optimization is typically defined as minimizing the mean squared error.
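These components can be sketched end to end; the synthetic data are invented, and GammaRegressor (one of the models used in the study) stands in for whichever model the tuner selects:

```python
import numpy as np
from sklearn.linear_model import GammaRegressor
from sklearn.metrics import mean_squared_error, r2_score
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler

# 1. Data loading and preprocessing: synthetic features, strictly positive target
#    (GammaRegressor requires y > 0).
rng = np.random.default_rng(0)
X = rng.normal(size=(85, 5))
y = np.exp(0.4 * X[:, 0] - 0.2 * X[:, 1] + rng.normal(0, 0.1, 85))

# 2. Data splitting.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0)

# 3. Model training (a hyperparameter tuner would choose alpha; fixed here).
scaler = StandardScaler().fit(X_train)
model = GammaRegressor(alpha=0.1).fit(scaler.transform(X_train), y_train)

# 4. Model evaluation: the optimization objective is typically the MSE.
pred = model.predict(scaler.transform(X_test))
print(mean_squared_error(y_test, pred), r2_score(y_test, pred))
```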

Fig. 5
figure 5

The pseudocode of the proposed optimizers regression model.

Evaluation metrics for regression and classification models

Evaluation metrics for regression models

The coefficient of determination, R-square, is one of the most common measures used to evaluate regression models, as shown in Eq. (1). The Mean Absolute Error (MAE) is shown in Eq. (2), and the Mean Squared Error (MSE) in Eq. (3)58,59,60.

$$\:{\text{R}}^{2}=1-\frac{\sum\:_{i=1}^{n}{\left({y}_{i}-\widehat{{y}_{i}}\right)}^{2}}{\sum\:_{i=1}^{n}{\left({y}_{i}-\overline{y}\right)}^{2}}$$
(1)
$$\:\text{M}\text{A}\text{E}=\frac{\sum\:_{i=1}^{n}\left|\widehat{{y}_{i}}-{y}_{i}\right|}{\text{n}}$$
(2)
$$\:\text{M}\text{S}\text{E}=\frac{\sum\:_{i=1}^{n}{\left(\widehat{{y}_{i}}-{y}_{i}\right)}^{2}}{\text{n}}$$
(3)

Where \(\:{y}_{i}\) is the actual value, \(\:\widehat{{y}_{i}}\) is the corresponding predicted value, \(\:\overline{y}\) is the mean of the actual values in the set, and n is the total number of test objects61.
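The standard definitions of R², MAE, and MSE take only a few lines of numpy; a quick, hand-checkable sketch:

```python
import numpy as np

def r2(y, y_hat):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    y, y_hat = np.asarray(y, float), np.asarray(y_hat, float)
    ss_res = np.sum((y - y_hat) ** 2)
    ss_tot = np.sum((y - y.mean()) ** 2)
    return 1.0 - ss_res / ss_tot

def mae(y, y_hat):
    """Mean absolute error."""
    return float(np.mean(np.abs(np.asarray(y_hat, float) - np.asarray(y, float))))

def mse(y, y_hat):
    """Mean squared error."""
    return float(np.mean((np.asarray(y_hat, float) - np.asarray(y, float)) ** 2))

y_true, y_pred = [1.0, 2.0, 3.0], [1.0, 2.0, 4.0]
print(r2(y_true, y_pred), mae(y_true, y_pred), mse(y_true, y_pred))
# r2 = 1 - 1/2 = 0.5; mae = mse = 1/3
```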

Results and analysis

To evaluate the effectiveness of our machine learning framework, we conducted the experiments described in this section. They were performed on a computer with a 3 GHz Core i5 processor, 8 GB of main memory, and a 64-bit Windows 10 operating system, using the Python programming language.

The results of the proposed regression machine learning technique

Tables 5 and 6; Fig. 6 depict the evaluation outcomes of various conventional regression models and optimized regressor models for predicting posttreatment SAS 90, along with their corresponding performance metrics. Table 5 outlines assessment criteria such as adjusted R-squared, R-squared, root mean squared error (RMSE), and execution time for the traditional regression models. In contrast, Table 6 provides the evaluation results of the optimized regressor models obtained with the different optimization techniques, focusing on mean squared error, mean absolute error, R-squared score, and execution time. The columns are as follows:

  • Model: This column shows the names of the machine learning models used in the regression task.

  • MSE (Mean Squared Error): This column represents the average of the squared differences between the predicted and actual values. A lower value of MSE indicates better performance.

  • MAE (Mean Absolute Error): This column represents the average of the absolute differences between the predicted and actual values. A lower value of MAE indicates better performance.

  • R2 Score: This column represents the coefficient of determination, which measures the proportion of variance in the target variable that can be explained by the independent variables. A higher value of the R2 Score indicates better performance.

  • Time Taken (Seconds): This column represents the amount of time taken by each model to complete the regression task.

Table 5 The evaluation of different traditional regression models for Posttreatment SAS 90 to assess their performance.
Table 6 The evaluation of optimized regressor models for Posttreatment SAS 90 to assess their performance.
Fig. 6
figure 6

The performance metrics of the optimized regression models of Posttreatment SAS 90.

As shown in Tables 5 and 6; Fig. 6:

  • Among the traditional regression models, the GammaRegressor, TweedieRegressor, and PoissonRegressor show the highest adjusted R-squared values, indicating a better fit to the data compared to other models. These models explain a small but significant percentage of the variance in the posttreatment SAS 90.

  • In contrast, models like PassiveAggressiveRegressor, OrthogonalMatchingPursuit, and HuberRegressor exhibit negative adjusted R-squared values, suggesting that they do not perform well in explaining the variability in the posttreatment SAS 90.

  • When considering the mean squared error (MSE) and mean absolute error (MAE), the LassoLarsCV model achieves the lowest values, indicating better accuracy in predicting the posttreatment SAS 90. However, it’s important to note that the MSE and MAE values are relatively high across all models, suggesting that there is still room for improvement in the predictive performance.

  • Among the optimization techniques, the scikit-optimize with the forest_minimize method achieves the lowest MSE and MAE values, indicating better accuracy in predicting the posttreatment SAS 90 compared to other optimization techniques. Additionally, it achieves the highest R-squared score, suggesting that it explains a larger portion of the variance in the posttreatment SAS 90.

  • However, it’s worth noting that the execution time for the scikit-optimize with the forest_minimize method is the longest, indicating that it takes more time to train and predict with this model compared to other optimization techniques.

  • The optimized regressor models produced by the different optimization algorithms outperformed the traditional regression models in predictive accuracy. Consistent with the metrics above, scikit-optimize (forest_minimize) performed best among the optimizers, achieving the highest R2 score and the lowest mean squared error and mean absolute error. However, the execution times of the optimized models were longer than those of the traditional models.
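The hyperparameter searches these optimizers perform share a common shape: sample a configuration, score it by cross-validation, and repeat within a budget. A minimal sketch using scikit-learn's RandomizedSearchCV as a stand-in (the study itself used scikit-optimize, Hyperopt, optunity, and GPyOpt; the search space and data below are illustrative):

```python
from sklearn.datasets import make_regression
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import RandomizedSearchCV

# Synthetic stand-in for the study's tabular data.
X, y = make_regression(n_samples=120, n_features=6, noise=10.0, random_state=0)

# Search space analogous to what skopt/Hyperopt would explore.
param_distributions = {
    "n_estimators": [50, 100, 200],
    "max_depth": [3, 5, None],
    "min_samples_leaf": [1, 2, 4],
}

search = RandomizedSearchCV(
    RandomForestRegressor(random_state=0),
    param_distributions,
    n_iter=5,        # budget: number of configurations tried
    cv=3,            # cross-validated R^2 guides the search
    random_state=0,
)
search.fit(X, y)
best = search.best_estimator_  # refit on the full data with the best configuration
```

The budget (`n_iter`) directly drives the execution-time trade-off noted above: smarter optimizers spend extra time modeling the search space in exchange for better configurations per evaluation.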

Feature correlations

Table 7 shows the correlation coefficients between pairs of features in the dataset. The table has been analyzed as follows:

  • First Feature: The name of the first feature being correlated is displayed in this column.

  • Second Feature: The name of the second feature being correlated is shown in this column.

  • Correlation: The Pearson correlation coefficient between the first and second features is represented in this column. A value of +1 represents a perfect positive correlation, a value of -1 represents a perfect negative correlation, and a value of 0 indicates no linear relationship.

Table 7 Pearson’s correlation of the features.

Based on Table 7:

  • SAS scores (pretreatment, posttreatment, pretreatment zero) are highly correlated with each other, as expected since they measure similar constructs.

  • Pain measures (VAS pre and post) are strongly negatively correlated with the SAS scores, indicating that higher pain is associated with lower SAS scores.

  • VAS pre- and post are positively correlated, showing consistency in pain levels before and after treatment.

  • BMI has a moderate positive correlation with weight, as one would expect.

  • Age has weaker correlations with other variables. It is negatively correlated with pretreatment SAS and VAS pre, but these are small effects.

  • Demographic factors like age, weight, and BMI have small correlations with clinical outcomes.

Overall, this analysis confirms:

  • Good internal consistency between the SAS and pain measures.

  • Expected relationships between BMI/weight.

  • Clinical factors are more strongly correlated than demographics.

  • Pain is inversely related to SAS scores.

  • Weaker influence of basic demographics on main outcomes.

The patterns are consistent with the literature and provide validity for using these variables in further modeling.
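For reference, Pearson coefficients like those in Table 7 can be obtained directly with NumPy. The columns below are synthetic stand-ins for the study's variables, constructed so that the two SAS scores are strongly positively correlated and pain is negatively correlated with SAS, mirroring the patterns above:

```python
import numpy as np

# Toy columns standing in for pretreatment SAS, posttreatment SAS, and VAS pre.
rng = np.random.default_rng(1)
sas_pre = rng.normal(50, 10, 100)
sas_post = sas_pre + rng.normal(0, 3, 100)            # highly correlated pair
vas_pre = 80 - 0.8 * sas_pre + rng.normal(0, 5, 100)  # negatively correlated with SAS

# np.corrcoef treats each row as one variable and returns the full
# Pearson correlation matrix (diagonal entries are 1 by definition).
corr = np.corrcoef(np.vstack([sas_pre, sas_post, vas_pre]))

r_sas = corr[0, 1]   # strong positive, as between the SAS scores
r_pain = corr[0, 2]  # negative, as between pain and SAS
```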

Feature selection

Table 8 provides the selected features for different feature selection methods. Each method aims to identify a subset of features that are considered important for predicting the target variable. The table has been analyzed and expanded as follows58,62,63,64:

  • Feature selection technique: The name of the feature selection method used to choose the features is displayed in this column.

  • Selected features: The names of the features chosen by the feature selection method are displayed in this column.

  • All methods consistently select the main SAS scores (pretreatment, posttreatment) as important features, in line with domain knowledge.

  • Clinical features like BMI, age, and weight are selected by many methods. These make intuitive sense as factors influencing outcomes.

  • Pain measures (VAS pre, post) also frequently emerge, underscoring their relationship to cognitive scores.

  • The SAS scores and clinical features tend to overlap across methods, demonstrating consensus.

  • RFE with random forests and feature importance from random forests are model-specific techniques tuned for random forest models.

  • Their selections emphasize the SAS scores most, followed by age and BMI - the most predictive features for random forests.

Table 8 Feature selection techniques and the most important features.

Based on the analysis:

  • The SAS scores, pain measures, and basic clinical factors can be reliably considered important features.

  • Feature selection with random forests (RFE, importance) is best suited since a random forest model will be used.

  • Its selections are consistent with domain knowledge and focus on the most predictive attributes for the chosen model type.

Therefore, we would recommend using the features selected by RFE with random forests or feature importance from random forests for the random forest classifier.
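A minimal sketch of the two recommended techniques, RFE with a random forest and impurity-based feature importances, on synthetic data (the feature names and signal structure here are illustrative, not the study's dataset):

```python
import numpy as np
from sklearn.datasets import make_regression
from sklearn.ensemble import RandomForestRegressor
from sklearn.feature_selection import RFE

# Synthetic data: only three of eight features carry signal, loosely
# mimicking informative columns such as the SAS scores, age, and BMI.
X, y = make_regression(n_samples=150, n_features=8, n_informative=3, random_state=0)
feature_names = [f"f{i}" for i in range(8)]

# RFE with a random forest: recursively refits the model and drops the
# least important feature until the requested number remains.
rfe = RFE(RandomForestRegressor(n_estimators=50, random_state=0),
          n_features_to_select=3).fit(X, y)
rfe_selected = [name for name, keep in zip(feature_names, rfe.support_) if keep]

# Impurity-based importances from a single fitted random forest,
# ranked from most to least important.
forest = RandomForestRegressor(n_estimators=50, random_state=0).fit(X, y)
importance_rank = np.argsort(forest.feature_importances_)[::-1][:3]
```

Because both selectors use the same model family as the final classifier, their rankings reflect exactly the splits a random forest can exploit, which is the rationale for preferring them here.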

Discussion and future directions

This study leverages machine learning techniques to investigate the complex relationships between patient characteristics and the efficacy of scapular stabilization exercises (SSE) in alleviating non-specific shoulder pain among college students. A crucial distinction must be made regarding the study’s primary objective: rather than comparing the impact of different exercise regimens on shoulder pain, our research focuses on identifying key patient factors that predict the greatest reduction in pain when utilizing SSE as a treatment approach. In essence, our analysis employs machine learning algorithms to uncover the most influential predictors (e.g., sex, weight, BMI, pre-treatment Self-Appraisal Scale [SAS] scores) that determine the likelihood of successful pain reduction with SSE. This approach enables the development of personalized treatment strategies, tailored to the specific profiles of patients who are most likely to benefit from scapular stabilization exercises.

Key findings and implications

  1. Predictive Modeling for Personalized Treatment: Our machine learning models successfully identified significant correlations between patient characteristics and the efficacy of SSE in reducing shoulder pain. These findings facilitate the creation of personalized treatment plans, enhancing the potential for positive outcomes.

  2. Patient Profiling for Optimal SSE Response: The study’s results provide valuable insights into the types of patients who are most suited for scapular stabilization exercises as a treatment for non-specific shoulder pain. For instance, [insert specific patient characteristics identified by the study, e.g., “patients with a lower BMI and higher pre-treatment SAS scores”] demonstrated a more significant reduction in pain when undergoing SSE.

  3. Clinical Utility and Future Directions: By integrating these predictive models into clinical practice, healthcare professionals can make more informed decisions when recommending scapular stabilization exercises for patients with non-specific shoulder pain. Future research should focus on validating these findings across broader populations and exploring the application of similar machine learning approaches to other musculoskeletal conditions.

Clarification on study objectives and interpretation

To reiterate, this manuscript does not aim to compare the effectiveness of different exercise regimens but rather to elucidate the patient factors that predict successful outcomes with scapular stabilization exercises. The discussion and interpretation of our results are grounded in this core objective, providing a nuanced understanding of how SSE can be optimized for specific patient populations suffering from non-specific shoulder pain.

Furthermore, the substantial execution times associated with some optimization techniques, such as optunity and GPyOpt, emphasize the practical considerations and trade-offs involved in implementing these methodologies in real-world clinical settings. Balancing computational efficiency with predictive accuracy remains a critical consideration for the seamless integration of machine learning-based interventions into routine musculoskeletal rehabilitation protocols.

In light of our findings on predicting successful outcomes with scapular stabilization exercises (SSE) for non-specific shoulder pain, two key avenues for future research emerge:

  • Validation and Generalizability: Validate the predictive models developed in this study across diverse populations to enhance their generalizability and applicability in various clinical settings.

  • Integration with Clinical Practice: Investigate the feasibility and effectiveness of integrating these predictive models into clinical decision-support systems to personalize SSE treatment plans for patients with non-specific shoulder pain.

Limitations

Despite the significant contributions and promising results of our study on predicting the outcomes of scapular stabilization exercises for non-specific shoulder pain, several limitations should be acknowledged:

  1. Limited Sample Size: The study utilized a comprehensive dataset; however, the sample size may still be relatively small. A larger sample size would enhance the statistical power and generalizability of the findings. The limited sample size may also restrict the exploration of potential subgroups or variations within the population.

  2. Potential Bias and Confounding Factors: The dataset used in the study may contain inherent biases or confounding factors that could influence the results. Unaccounted variables, such as age, sex, specific medical conditions, and concurrent treatments, may affect treatment outcomes and introduce bias into the predictive models.

  3. Retrospective Data Collection: The data in this study were collected retrospectively, which may introduce limitations and potential biases inherent in retrospective analyses. Prospective studies with standardized protocols and data collection methods would provide more robust and reliable evidence.

  4. Lack of Patient-reported Outcomes: The study primarily relied on the measured SAS and VAS scores and did not incorporate broader patient-reported outcomes, such as satisfaction, quality of life, or subjective perception of shoulder function. Including these outcomes would provide a more holistic assessment of the treatment effects.

Conclusions

This study utilized an extensive array of machine learning models and optimization techniques to predict the impact of scapular stabilization exercises on non-specific shoulder pain among college students. The results underscore the critical role of scapular stabilization exercises in improving shoulder function and reducing pain, highlighting the potential of machine learning in optimizing therapeutic strategies for musculoskeletal health management. Among the diverse regression models employed, the Gamma Regressor, Tweedie Regressor, and Poisson Regressor demonstrated the highest adjusted R-squared values, indicating their relatively stronger predictive performance in capturing the relationships between exercise protocols and pain reduction. Furthermore, the scikit-optimize approach exhibited the most promising results, yielding the lowest mean squared error and mean absolute error alongside the highest R-squared score, signifying its effectiveness in fine-tuning the model hyperparameters for optimal outcomes.

These findings suggest that a data-driven approach, facilitated by machine learning techniques, can significantly enhance the precision and efficacy of scapular stabilization exercise regimens, ultimately leading to improved shoulder health and functionality among college students. By leveraging advanced optimization methodologies such as scikit-optimize, the study emphasizes the importance of customizing exercise protocols based on individual patient profiles, thus addressing the diverse needs and concerns associated with non-specific shoulder pain in this demographic.