Introduction

Anterior shoulder instability (ASI), also called recurrent anterior shoulder dislocation, is a common problem in young athletic individuals1,2. ASI is a pathological condition in which the humeral head subluxes or dislocates from the glenoid fossa due to a soft-tissue or bone injury, causing shoulder joint symptoms and reducing function. Repeated dislocations damage the static structures involved in joint support, such as the labrum, capsule, ligaments, and bone, which further increases instability1. Bankart injury and Hill–Sachs deformity are well-known lesions related to ASI. The treatment of ASI has evolved owing to the advances in arthroscopic techniques, along with an improved understanding of the shoulder anatomy and its complex biomechanics3,4.

In the shoulder joint area, the degree of muscle atrophy and fatty degeneration of the muscles that make up the shoulder joint are often investigated as factors that indicate the functional prognosis after surgery. These parameters are highly versatile indicators of postoperative muscle strength and joint stability. Many studies have been conducted on rotator cuff repair after shoulder rotator cuff tears, which have shown the utility of specific parameters, such as the cross-sectional area and occupancy of shoulder muscles in oblique sagittal images, called the Y-view, tangent sign, and Goutallier classification5,6,7.

The rotator cuff muscles are dynamic stabilizers of the shoulder joint but not primarily involved in shoulder instability compared to static structures. Therefore, few reports have investigated shoulder muscle atrophy and fatty degeneration in cases of ASI. In a recent report by Mitterer et al.8. , a muscle volume analysis was performed in patients with posterior humeral head subluxation. They observed a significant volume imbalance in the transverse force couple of the shoulder muscle (subscapularis [Ssc] vs. infraspinatus and teres minor muscle [Isp + TM]) in those patients, indicating an unbalanced anterior-posterior muscle force that may contribute to shoulder joint instability. In previous studies, the volume ratio between Ssc and Isp + TM (VR Ssc/Isp+TM) was observed to be 1.02 ± 0.18 in healthy participants, regardless of age or sex9. In the aforementioned report, this ratio was 1.14 ± 0.13 for patients with posterior humeral head subluxation8, which was higher than that for healthy participants. The computed tomography (CT) density ratio between Ssc and Isp + TM (CT-DR Ssc/Isp+TM) could also be a useful indicator for the assessment of fatty degeneration of muscles. However, to the best of our knowledge, no study has examined this ratio in patients with ASI.

In fact, the mechanism of shoulder joint instability is more complicated, and a recent concept of multidirectional shoulder instability has been introduced that involves dislocation in multiple directions, rather than a simple anterior-posterior dislocation10,11. Therefore, research investigating VR Ssc/Isp+TM and CT-DR Ssc/Isp+TM in patients with shoulder instability may provide a new perspective on the instability factors of the shoulder joint; however, studies on this topic are scarce. One of the reasons is that on CT or MRI, the shoulder muscles are often imaged partially in cases of ASI because they are not points of priority in the evaluation compared to the bone, labrum, and cartilage. To accurately assess these ratios, the entire muscle area should ideally be examined; in cases where partial assessment is unavoidable, determining the extent to which the ratio obtained from the partial muscle volume reflects the overall ratio should be a prerequisite.

In the present study, we hypothesized that a CT-based sectional (partially cut) measurement approach has the potential to assess imbalance between the anterior and posterior muscle force in ASI. Therefore, our objectives were to (1) verify whether VR Ssc/Isp+TM and CT-DR Ssc/Isp+TM obtained by three-dimensional (3D) sectional muscle measurements could be substitutes for those obtained from the whole muscle using the CT data of non-pathologic shoulders (patients with clavicular fracture [CF]); and (2) examine VR Ssc/Isp+TM and CT-DR Ssc/Isp+TM in cases of ASI and assess the difference between ASI and non- pathologic shoulders by comparing the two experiments.

Methods

Experimental procedure

Step A

CT data of non-pathologic shoulders (patients with CF) were collected. In each image, the segmentation of Ssc and Isp + TM was performed. Subsequently, we obtained the VR Ssc/Isp+TM and CT-DR Ssc/Isp+TM using sectional or whole-muscle measurements. By comparing these results, we evaluated whether sectoral muscle evaluation was an effective technique.

Step B

Shoulder CT data were collected from patients with ASI. The segmentation and measurement of the muscles were performed in the same manner as in step A, except that they were all evaluated through sectional muscle measurement (as part of the muscle was always out of range in our protocol). We then assessed the differences between the ASI and CF groups by comparing the results of steps A and B.

Participants

This study was approved by the Research Ethics Committee of the Saitama Medical University Hospital (approval number 2022-072). All experiments were performed in accordance with the relevant guidelines and regulations. The requirement of this study for informed consent was waived by the Research Ethics Committee of Saitama Medical University Hospital.

Step A (CF group)

Patients with CF who underwent shoulder CT before surgery at our institution between January 2012 and December 2021, with an order from the Department of Orthopedics at our hospital, were identified and reviewed. Figure 1A presents the inclusion and exclusion criteria of this study. A total of 28 CT data from 19 patients were analyzed.

Fig. 1
figure 1

Flow chart of the inclusion and exclusion criteria for the study.

Step B (ASI group)

Patients diagnosed with ASI who underwent shoulder CT before arthroscopic surgery at our institution between January 2012 and December 2021, with an order from the Department of Orthopedics at our hospital, were identified and reviewed. Figure 1B presents the inclusion and exclusion criteria of this study. A total of 17 CT images from 17 patients were analyzed.

The preoperative clinical and demographic status of patients with ASI (Step B) was determined as follows: symptom duration, number of symptomatic dislocations, sports activity/work level, and preoperative range of motion. Additionally, preoperative morphological findings such as glenoid rim morphology, Hill–Sachs lesion, anterior inferior glenohumeral ligament hyperlaxity, and capsule tears were demonstrated. Glenoid rim morphology and Hill–Sachs lesion was determined as described by Nakagawa et al.12. Glenoid rim morphology was classified into erosion and fragment types, while glenoid defect size was designated as “large” if it exceeded 15 mm and “small” if less than 15 mm. Hill–Sachs lesion length and width were measured on 3D CT reconstructed images and designated as “large” (width > 15 mm or length > 30 mm) or small (width < 15 mm or length < 30 mm). Anterior inferior glenohumeral ligament hyperlaxity and capsule tears were determined by arthroscopy.

CT acquisition

CT was performed using a 128-multislice CT Siemens SOMATOM Definition Flash scanner (Siemens Healthcare, Erlangen, Germany). Patients were examined while in a supine position lying on their abdomen with their arms adducted and hands in a neutral position. CT was performed using the standard dedicated CT protocol for the shoulders used at our institution. The slice thickness was 2 mm with reformations in the axial, coronal, and sagittal planes. The field of view was 25 × 25 × 15 cm and 15 × 15 × 9 cm in step A and step B, respectively.

Measurement of the volume and CT density of the shoulder muscles

To measure the volume and CT density of the shoulder muscles, we employed the method described by Mitterer et al.8 Segmentation was performed using an open-source software (ITK-SNAP veASIon 3.8.0). After the CT scans were loaded into ITK-SNAP using the Digital Imaging and Communications in Medicine format, the Ssc and Isp + TM areas were delineated on every transverse slice using semi-automatic segmentation tools (Fig. 2A). From the delineated contour, a 3D model of each shoulder muscle was reconstructed and saved as a standard 3D model in the NIFTI file format (*. nii.gz).

To perform a 3D sectional (i.e., cut by cross-sections) muscle assessment, the technique proposed by Chung et al. was adopted13. We performed sectional measurements of the shoulder muscles at the following three specific planes: from the most lateral end of the muscles to the plane of the Y-view (Y section), to the plane 2.5 cm medial to the Y-view (Y + 2.5 cm section), and to the plane 5 cm medial to the Y-view (Y + 5 cm section). Additionally, we implemented a cut off at the plane 3 cm inferior to the lower end of the glenoid labrum to exclude the inferior margin of each muscle volume, as this area was usually out of range in our shoulder CT protocol (Fig. 2B). By implementing these adjustments, we created sectional muscle models and determined their volumes and CT densities. The mean CT density (i.e. mean scalar value) was calculated from the binary label map representation of the segmented muscle regions that overlap with the selected scalar volume. In step A, the whole muscle volume and CT density were calculated (Fig. 2C). Finally, we obtained the ratio of muscle volume to CT density between the Ssc and Isp + TM (VR Ssc/Isp+TM and CT-DR Ssc/Isp+TM) for each section. All procedures were conducted using an open-source software (3D Slicer 5.0.3), and the specific implementation schemes are detailed in the Supplementary Information.

Fig. 2
figure 2

(A) The areas of subscapular muscle (Ssc, red) and infraspinatus and teres minor muscles (Isp + TM, blue) in every axial image are delineated by applying semi-automatic segmentation tools. (B) The 3D models of Ssc and Isp + TM are separated by the Y-view plane (blue), Y + 2.5 cm plane (yellow), and Y + 5 cm plane (red) using the Dynamic Modeler module. Moreover, we implemented a cut off at the plane 3 cm inferior to the lower end of the glenoid labrum (green) to exclude the inferior margin of each muscle volume, because this area was usually out of range in our shoulder CT protocol. (C) For each 3D model of Ssc and Isp + TM muscles, three sections of 3D models were newly created (hence, we obtained six sectional muscle models): from the most lateral end of the muscles to the plane of Y-view (Y section), to the plane 2.5 cm medial to the Y-view (Y + 2.5 cm section), and to the plane 5 cm medial to the Y-view (Y + 5-cm section). For step A, we performed the whole muscle measurement as well.

Associations between the ratios obtained from each sectional measurement were evaluated using Pearson’s correlation coefficient and a paired t-test. To assess interobserver reproducibility in the segmentation and measurement process, two radiologists with 7 and 6 years of experience performed these steps independently. Both radiologists were blinded to the clinical information. The intraclass correlation coefficient (ICC) was measured to evaluate interobserver reproducibility.

Comparison and post hoc power analysis of step A and step B results

To compare the CF (Step A) and ASI (Step B) patients, the results of the sectional muscle volume, CT density, and ratios therein were statistically compared using an unpaired Student’s t-test. Furthermore, to determine the study’s ability to detect a significant effect, a post hoc power analysis was performed for each item of the comparison. The Cohen’s d value required to quantify the effect size was calculated as follows:

$$\:s=\:\sqrt{\frac{\left({n}_{1}-1\right)\times\:{s}_{1}+({n}_{2}-1)\times\:{s}_{2}}{{n}_{1}+{n}_{2}-2}}$$

where s is the pooled standard deviation, n1 and n2 are the sizes of the first and second samples, and s1 and s2 is the variance of the first and second samples.

$$\:d=\frac{{u}_{1}-{u}_{2}}{s}$$

where d is the Cohen’s d value, u1 is the mean of the first sample, u2 is the mean of the second sample, and s is the pooled standard deviation of both samples.

Finally, the estimated required sample size was calculated by a statistical power analysis using an unpaired Student t-test. The parameters for the analysis were set as follows: significance level (alpha value), 0.05; minimal statistical power (effect size), 0.80; and statistical power, 0.80. The expected ratio of the sample sizes from one sample to the other was set at 1.0.

Statistical analyses were performed using the open-source software package (Python scikit-learn 0.22.1). Statistical significance was set at p < 0.05.

Results

Step A

A total of 28 CT images from 19 patients (including both sides of the shoulders in nine patients) were included in this study. The mean age ± standard deviation was 56.7 ± 9.9 years. The sex ratio was 13/6 (male/female). The side scanned on CT was 14/14 (right/left). The CF side measures 7/12 (right/left). Interobserver reproducibility was good for all 3D volumes, with a mean ICC value of 0.821.

The observed ratios of the muscle volume (VR Ssc/Isp+TM) for Y, Y + 2.5 and Y + 5 sections and for the entire muscle were 1.09 ± 0.27, 1.09 ± 0.17, 1.06 ± 0.16, and 1.06 ± 0.10, respectively. A moderate correlation was observed between the VR Ssc/Isp+TM of the Y + 2.5 section and the entire muscle volume (r = 0.651).

The observed ratios of CT density (CT-DR Ssc/Isp+TM) for Y, Y + 2.5 and Y + 5 sections and for the entire muscle were 1.09 ± 0.16, 1.03 ± 0.11, 1.05 ± 0.09, and 1.06 ± 0.09, respectively. There was a strong correlation between the CT-DR Ssc/Isp+TM of the Y + 2.5 or Y + 5 sections and that of the entire muscle volume (r = 0.825 and 0.862, respectively).

Table 1 summarizes the results of step A.

Table 1 Observed sectional ratios of muscle volume and CT density for patients with clavicular fracture (step A).

Step B

A total of 17 CT images of 17 patients were included in this study. The patients’ clinical and morphological data are summarized in Table 2. Interobserver reproducibility was good for all 3D volumes, with a mean ICC value of 0.838.

Table 2 Clinical and morphological data of patients with anterior shoulder instability (step B).

The observed ratios of muscle volume (VR Ssc/Isp+TM) for Y, Y + 2.5 and Y + 5 sections were 1.12 ± 0.55, 1.21 ± 0.28, and 1.13 ± 0.16, respectively. The observed ratios of CT density (CT-DR Ssc/Isp+TM) for Y, Y + 2.5 and Y + 5 sections were 1.02 ± 0.11, 0.92 ± 0.07, and 0.92 ± 0.05, respectively.

Table 3 summarizes the results of step B.

Table 3 Observed sectional ratios of muscle volume and CT density for patients with anterior shoulder instability (step B).

Comparison and post hoc power analysis

All sectional muscle volume, CT density, and CT-DR Ssc/Isp+TM data differed significantly between the CF (Step A) and ASI (Step B) groups. However, the differences in VR Ssc/Isp+TM were unclear. Post hoc power analysis showed more variable estimated sample sizes for the muscle volume of Ssc and Isp + TM than for CT density. The estimated sample size for VR Ssc/Isp+TM was considerably larger. Table 4 summarizes the results of the comparison and post hoc power analysis.

Table 4 Comparison and post hoc power analysis of patients with clavicular fracture (step A) versus anterior shoulder instability (step B).

Discussion

In the present study, we observed that the ratios of shoulder muscle volume (VR Ssc/Isp+TM) and CT density (CT-DR Ssc/Isp+TM) were balanced in our patient cohort with CF. In addition, these ratios were constant to some extent, whether they were derived from the sectional or whole measurements. In patients with ASI, the VR Ssc/Isp+TM was relatively high, whereas the CT-DR Ssc/Isp+TM was relatively low, implying the possibility of an imbalance between the anterior and posterior shoulder muscles. For CT-DR Ssc/Isp+TM, a statistically significant intergroup difference was noted and the sample size was sufficient to account for the statistical significance. This was not the case for VR Ssc/Isp+TM.

Considering the balance of the antagonistic muscles is crucial because they may be associated with the causes and consequences of articular diseases. The balance of the shoulder muscles in the transverse force couple has been debated. Espinosa–Uribe et al. conducted a study on 304 shoulders from 152 patients of different ages and sexes and observed a ratio of the rotator cuff transverse force couple volume (corresponding to VR Ssc/Isp+TM) of 1.02 ± 0.18 without significant differences across all ages and sexes9. Piepers et al. reported similar results in non-pathologic shoulders, demonstrating that the muscle volume of the transverse force couple was kept balanced14. In our study, we confirmed a VR Ssc/Isp+TM of 1.06 ± 0.10 in the CF group (non-pathologic shoulder), which was consistent with the previous results9,14.

Our study also suggested that the CT density ratio in the shoulder muscle transverse force couple (CT-DR Ssc/Isp+TM) was almost balanced. In similar studies focusing on the osteoarticular areas, both muscle volume and CT density could be used as an indicator of the muscle atrophy and degeneration. For instance, the CT density of the gluteus medius muscle reflects muscle strength in patients with hip osteoarthritis15. In the shoulder area, van de Sande et al. examined shoulder CT from patients with rheumatoid arthritis and observed a strongly correlation between the CT density of the rotator cuff muscle volume and visual assessment of fatty degeneration16. However, no study specifically investigated the CT-DR Ssc/Isp+TM in pathologic or non-pathologic shoulder. In this regard, our study was the first to show that CT-DR Ssc/Isp+TM was balanced in non-pathological shoulders, indicating that the internal properties of muscles and their volumes might be balanced in the shoulder muscle transverse force couple.

Although the measurement of muscle density has been recognized as a feasible method for observing muscle degeneration, studies utilizing this approach are limited because exposure to radiation poses an inevitable problem for CT. In contrast, quantitative assessments of muscle degeneration using MRI, such as the fat fraction ratio, have been more widely used. In fact, CT is primarily used for the confirmation of fractures or other bone abnormalities; hence, they are more limited than MRI. Moreover, CT can be used to assess shoulder instability, but it is not the primary diagnostic tool. Therefore, we believe that MRI will be required in the future since it is more useful for this indication than current methods.

In this study, we evaluated the feasibility of VR Ssc/Isp+TM and CT-DR Ssc/Isp+TM derived from sectional muscle measurements. Sectional muscle assessment offered advantages in cases where the shoulder muscles were only partially imaged. It provided valuable information for evaluating morphological changes and the degree of medial retraction of the rotator cuff muscles. A recent study by Chung et al. described that 3D sectional volume analysis of the supraspinatus muscle can be useful for evaluating accurate serial changes of muscle atrophy after arthroscopic repair surgery of rotator cuff tears13. Although their research was noteworthy, one of the limitations was that they were unable to measure the entire supraspinatus muscle because of the practical difficulty in extending the field of the shoulder MRI. Thus, our study was the first to compare sectional muscle data with the whole muscle data in non-pathological shoulders. We observed a mild to moderate correlation between the VR Ssc/Isp+TM for each section and that for the total muscle. Conversely, the CT-DR Ssc/Isp+TM for each section and the whole also did not significantly differ, and a strong correlation was observed between the CT-DR Ssc/Isp+TM of Y + 2.5 cm or Y + 5 cm section and that of the total volume. This indicated that the muscle volume may be easily affected by factors, such as a slight difference in scan positioning, and is expected to be more susceptible to sectional evaluation. Muscle CT density, in contrast, would be less influenced by these factors and could be a more favorable parameter for evaluating muscle properties in a sectional manner.

In our study, we applied our sectional evaluation system of VR Ssc/Isp+TM and CT-DR Ssc/Isp+TM to ASI cases. Mitterer et al. recently evaluated VR Ssc/Isp+TM in patients with posterior humeral head subluxation8 obtaining a value of 1.14 ± 0.13 using the total muscle volume. An attempt to measure the sectional muscle volume and CT density in patients with ASI has not been previously performed. Therefore, our study was the first to analyze these parameters in the ASI and compare them with the whole muscle data. In our analysis, the observed VR Ssc/Isp+TM was 1.21 ± 0.28 and 1.13 ± 0.16, whereas CT-DR Ssc/Isp+TM was 0.92 ± 0.07 and 0.92 ± 0.05 for Y + 2.5 and Y + 5 sections, respectively. The VR Ssc/Isp+TM, ratios appeared to be higher than those of the non-pathologic shoulders, indicating hypertrophy of the anterior muscle (Ssc) and atrophy of the posterior muscle (Isp + TM). However, a comparatively lower CT-DR Ssc/Isp+TM in the anterior muscle (Ssc) was associated with fatty degeneration, leading to a discrepancy of our findings. In a previous study on posterior humeral head subluxation8, the posterior driving force was assumed to be created by a larger anterior muscle (Ssc). In this case, we observed that the anterior muscle (Ssc) underwent mild atrophy and degeneration, while, the comparatively larger posterior muscle (Isp + TM) generated an anterior directional force; hence, repeated anterior-dominant dislocation occurred. This hypothesis does not explain the large sectional volume of the anterior muscle (Ssc).

Finally, in the present study, a statistically significant intergroup difference was observed in CT-DR Ssc/Isp+TM but not in VR Ssc/Isp+TM. Furthermore, post hoc power analysis showed variable effect sizes for muscle volume, whereas those for CT density were relatively stable. This may originate from the variability in the muscle volumes of Ssc and Isp + TM, as indicated by the large standard deviations across all sectional volumes. As stated above, compared with muscle CT density, muscle volume may be more influenced by factors such as scan positioning and sectional variation and is likely sensitive to sectional measurements. In contrast, CT is thought to be less sensitive to these factors, resulting in less variable results and more stable effect sizes. Therefore, CT-DR Ssc/Isp+TM may be a reproducible and useful measurement tool for assessing muscle imbalance.

However, post hoc power calculations may pose a potential risk of misinterpretation. Because post hoc analysis tends to be applied especially when research findings are negative (p ≥ 0.05), this will produce a low post hoc power result17. In general, several possibilities should be considered when interpreting such negative results: (1) there is a true association, but with a smaller effect size than expected, which could not be detected with the given sample size; (2) there is a true association with an effect larger than hypothesized, but it was not detected by chance; (3) the variance of the sample is greater than expected, so excessive noise in the data prevented the detection of a true association; and (4) confounding factors were unaccounted for, so the results appear as if there was no association18. These are all type II errors and should be considered when interpreting the negative results of our study (i.e., Ssc and Isp + TM muscle volumes and VR Ssc/Isp + TM). Therefore, future studies would ideally require accurate pre-study power analyses and a careful study design.

Furthermore, it should be considered that CF is not an ideal control group for ASI since it represents a completely different background, especially in terms of age; as patients with CF were older than those with ASI, the differences in age may have influenced the results. On the other hand, these ratios were almost constant across ages and sexes; thus, the influence caused by differences in age could be minimal. To validate this finding, an age- and sex-matched control group must be considered in the future.

This study had several other limitations. First, although the muscle imbalance could be associated with shoulder instability, the instability is primarily related to the static structures such as labrum, capsule, ligaments, and bone. Since these structures were not addressed in the present study, they must be examined in a future analysis. Second, MRI is more frequently used as a diagnostic tool for shoulder instability than CT; thus, the implications of using CT in this study may be relatively limited. Therefore, it would be beneficial to develop an MRI-based assessment in the future. Third, the sample size was small, and the distribution of our cohort was unequal. Fourth, the absence of a true control group for ASI was another limitation because we had to select non-pathological shoulders with CF in this study, whose age and sex were different from those of the ASI cohort. Furthermore, we could not evaluate the entire muscle in the ASI cases. These issues should be addressed in future studies. Simultaneously, we tested the hypothesis that mild fatty degeneration occurs in the anterior muscles (Ssc). Another quantitative method, such as the MRI Dixon-based fat fraction ratio, should be introduced to resolve this issue.

Conclusion

Our study demonstrated that the ratio of volume (VR Ssc/Isp+TM) and CT density (CT-DR Ssc/Isp+TM) in the transverse force couple of shoulder muscles was balanced in non-pathologic shoulders with CF. Additionally, these ratios remained constant, regardless of whether they were derived from sectional or the whole muscle measurements. Furthermore, for ASI, VR Ssc/Isp+TM was relatively high, whereas CT-DR Ssc/Isp+TM was relatively low compared with those of the non-pathological shoulder. This suggested an imbalance between the anterior (Ssc) and posterior (Isp + TM) muscles of the shoulder.