Introduction

Pneumonia represents a significant public health burden, accounting for considerable morbidity and mortality globally. Despite advancements in therapeutic strategies and critical care interventions, severe pneumonia, which necessitates intensive care unit (ICU) admission, is still associated with high morbidity and mortality1,2. Conventional prognostic models such as CURB-65, the Pneumonia Severity Index (PSI), Quick Sequential Organ Failure Assessment (qSOFA), Simplified Acute Physiology Score 3 (SAPS-3), and Acute Physiology and Chronic Health Evaluation II (APACHE II) have shown limited predictive accuracy for patients with severe pneumonia upon their admission to the ICU or emergency department3,4,5,6. Recent studies have indicated that machine learning-based prediction models may offer improved predictive performances for clinical outcomes in ICU patients with severe pneumonia3,7.

Emerging evidence from diagnostic imaging techniques, notably computed tomography (CT), has illuminated the potential of CT-based muscle assessment as an innovative marker for evaluating muscle mass and identifying sarcopenia8,9. Sarcopenia, characterized by substantial muscle loss, has profound implications for patient recovery and survival10, suggesting that muscle mass evaluation could offer critical insights into patient resilience against severe infections, including pneumonia.

Despite known correlations between sarcopenia and increased mortality in various patient populations, a substantial gap persists in our understanding of how muscle mass specifically influences outcomes in severe pneumonia. This knowledge gap hinders optimal integration of muscle assessment in prognostic evaluation of pneumonia patients. The potential of muscle mass indicators derived from routine chest CT scans in ICU settings as prognostic tools necessitate further exploration11,12. Elucidating the relationship between muscle parameters and pneumonia outcomes may catalyze a shift in patient management paradigms, facilitating the adoption of personalized treatment strategies that consider physiological resilience in conjunction with infection severity.

Thus, this study aimed to investigate the potential of thoracic muscle mass as determined by chest CT scans to serve as a biomarker for predicting clinical outcomes of ICU patients with severe community-acquired pneumonia. By categorizing patients into clusters based on their thoracic muscle mass profiles from CT scans, this study aimed to determine if these distinct groups exhibited different risks for extubation failure or in-hospital mortality, thus providing insights into the prognostic value of muscle mass in critical care settings.

Methods

Study design and eligibility criteria

Electronic medical records of patients with severe community-acquired pneumonia who were admitted to the ICU of a teaching hospital from January 2016 to December 2021 were retrospectively analyzed. Inclusion criteria were: patients aged ≥ 18 years presenting with respiratory failure attributable to severe pneumonia necessitating ICU admission. Diagnosis of pneumonia relied on chest CT findings, blood C-reactive protein (CRP) levels ≥ 4 mg/dL13, and administration of pneumonia-specific antibiotics. Severe pneumonia was defined as meeting at least one major or three minor criteria outlined in the guidelines of the Infectious Disease Society of America/American Thoracic Society14. Individuals transferred to the general ward within 3 days of ICU admission and those with medical conditions deemed more critical than pneumonia were excluded. Eligible patients were observed during hospital stay for up to 30 days from the ICU admission date.

Clinical variables

Baseline information including age, sex, body mass index, smoking history, and respiratory and non-respiratory comorbidities was obtained. Clinical conditions included conventional severity-of-illness scores such as Acute Physiology and Chronic Health Evaluation (APACHE) II, Sequential Organ Failure Assessment (SOFA), and SAPS II, initial vital signs, laboratory findings, and treatments for pneumonia. Clinical outcomes included high-flow nasal cannula (HFNC), mechanical ventilation (MV), prolonged MV, Extracorporeal Membrane Oxygenation (ECMO), extubation, tracheostomy, and in-hospital mortality.

Thoracic muscle mass measurements

Thoracic muscle mass was evaluated using chest CT scans conducted upon ICU admission of included patients. The thoracic muscle mass measurements include a comprehensive set of 60 variables that describe various aspects of muscle quantity and density within the thoracic region. The variables are categorized into three main groups: mean muscle area, mean muscle attenuation, and muscle volume. Each group provides specific insights into muscle characteristics.

Mean muscle area (cm2/cm) represents the average cross-sectional area of the muscle normalized by the length of the thoracic region. The mean muscle area is calculated as the total muscle volume divided by the length of the region of interest (ROI). This normalization accounts for individual differences in body size.

$$Mean \;Muscle \;Area= \frac{Total \;Muscle \;Volume \;({cm}^{3})}{Length \;of \;ROI \;(cm)}$$

Mean muscle attenuation, measured in Hounsfield Units (HU), reflects the density and quality of muscle tissue. It is determined by averaging the HU values of all pixels within the segmented muscle area. The formula for mean muscle attenuation involves summing the HU values of all pixels within the muscle and dividing by the number of pixels.

$$Mean \;Muscle \;Attenuation= \frac{\sum \;HU \;values \;of \;all \;pixels}{Number \;of \;pixles}$$

Muscle volume (cm3) represents the total volume of muscle tissue within the specified region. It is calculated by summing the volumes of all slices within the ROI. For each slice, the volume is determined by multiplying the cross-sectional area by the slice thickness.

$$Muscle \;Volume= \sum (Cross-sectional \;Area \left({cm}^{2}\right) \times Slice \;Thickness \left(cm\right))$$

The total muscle area in the thoracic region is the sum of all muscle areas normalized by the length of the thoracic spine. Specific muscles, such as the right and left pectoralis major and minor, as well as the right and left erector spinae, are measured individually at different thoracic levels and along the entire thoracic spine. For measurements specific to the erector spinae muscle group at T1–T12 levels, attenuation values ranged from − 29 to 150 HU15. This specific range captures the density of muscle tissue, excluding fat and other non-muscular tissues, and is useful in assessing the quality and composition of the muscle.

For precise analysis, we utilized 3D segmentation on chest CT scans to assess muscle mass of the thoracic region, focusing on specific muscles such as the pectoralis major, pectoralis minor, and erector spinae. Initial segmentation was performed using a radiological workstation (MEDIP, MEDICALIP, Seoul, South Korea. https://medicalip.com/medip/), which combined 3D volume rendering and multiplanar image reformatting. This workstation employs a semi-automatic segmentation method using a graph cut algorithm16. It is known for having precision in defining muscle boundaries with the guidance of an experienced radiologist. To further enhance the accuracy and efficiency of muscle segmentation, we incorporated a custom-developed deep learning AI model tailored for 3D convolutional neural network-based segmentation. This model is specifically trained to identify and delineate targeted muscle groups. Detailed information about the muscle segmentation network, including the training data, training pipeline, and test performance, has been summarized in Supplementary Information S1.

Moreover, AI software (DeepCatch, MEDICALIP, Seoul, South Korea. https://medicalip.com/DeepCatch/) was used for measuring total muscle mass at various segments of the spine, specifically at T4, T12, and from T10 to T1217. A comprehensive measurement from T1 through T12 was also performed. This AI software excels in providing detailed visualizations and quantifications of muscle mass through advanced imaging analysis.

Clustering stability

To evaluate the reproducibility and stability of our clustering analysis technique, we conducted a sensitivity analysis using random subsampling. We generated 100 random subsets of the data, each containing 80% of the original sample size. For each subset, we standardized the data and performed k-means clustering with k = 3, maintaining the same parameters as the original analysis. We calculated the Adjusted Rand Index (ARI) for each pair of clustering results across the subsets to assess the similarity of the clustering structures.

Outcomes

The primary outcome was extubation. The secondary outcome was in-hospital mortality over a 30-day observation period.

Predictive model assessment

We employed a logistic regression model to predict successful extubation and in-hospital mortality based on thoracic muscle mass measurements. The entire dataset was analyzed without splitting it into training and testing sets. This approach was chosen to perform an exploratory analysis for initial model assessment and to maximize data utilization, particularly in evaluating the feasibility of using thoracic muscle mass data for predictive modeling.

Statistical analysis

K-means clustering was performed with 60 variables representing profiles of thoracic muscle mass. Clusters were visually depicted through uniform manifold approximation and projection (UMAP) and principal component analysis (PCA) plots. Categorical variables were analyzed using the chi-squared test, while continuous variables were analyzed using the independent t-test or Mann–Whitney U test. A chi-square test for linear trend was conducted to evaluate clinical outcomes according to clusters. Kaplan–Meier curves were employed to analyze time to extubation and in-hospital mortality events. Hazard ratios for extubation and in-hospital mortality were estimated using Cox proportional hazards models. We assessed the overall performance using the area under the receiver operating characteristic curve (AUC) and compared models using the Delong method18. Additionally, accuracy, specificity, sensitivity, precision, and recall were evaluated. Statistical significance was set at p < 0.05, alongside 95% confidence intervals. Statistical analyses were conducted using R statistical software version 4.1.2 (R Foundation, Vienna, Austria).

Ethics

This study followed ethical guidelines outlined in the Declaration of Helsinki of 1975. The Institutional Review Board (IRB) of Boramae Medical Center approved the study protocol and waived the requirement for informed consent from study participants to access their electronic medical records (IRB No. 10-2021-110).

Results

We identified a total of 795 patients admitted to the ICU with a primary diagnosis of pneumonia confirmed by chest CT scans (Supplementary Information S2). After excluding 17 patients who were transferred to the general ward within three days or presented with medical conditions more critical than pneumonia, our study included 778 patients requiring ICU admission for severe pneumonia. A comprehensive set of 60 variables related to thoracic muscle mass in the included patients is summarized in Supplementary Information S3. Using the profiles of thoracic muscle mass, three distinct clusters were derived, with 130, 351, and 297 patients allocated to these three clusters. They were visualized through UMAP and PCA plots (Fig. 1). Detailed results of PCA analysis of thoracic muscle mass profiles are provided in Supplementary Information S4. The mean ARI value obtained from 100 random subsets was 0.97, indicating a very high degree of reproducibility and stability in the clustering results.

Figure 1
Figure 1The alternative text for this image may have been generated using AI.
Full size image

UMAP and PCA clustering based on the profiles of thoracic muscle mass. UMAP uniform manifold approximation and projection, PCA principal component analysis.

We observed significant differences in thoracic muscle mass profiles across the three clusters, with Cluster 1 group exhibiting higher values than Clusters 2 and 3 for all indicators, including mean muscle area, musculoskeletal index, and muscle volume (Supplementary Information S5, all p-value < 0.001).

Demographic and clinical features

Baseline characteristics of each cluster are outlined in Table 1. Cluster 1 was characterized by middle-aged males with metabolic concerns. Cluster 2 consisted of elderly males with respiratory comorbidities. Cluster 3 consisted of elderly patients with sarcopenic conditions and cognitive impairment. Upon thorough examination of conventional severity-of-illness scores, APACHE II, SOFA, SAPS II, and PaO2/FiO2 (PF) ratio showed no significant differences among clusters (Table 2). Significant differences in several clinical parameters including initial vital signs, urine output, neutrophil count, lymphocyte count, creatinine levels, aspartate aminotransferase (AST), alanine aminotransferase, and total bilirubin were observed among clusters. Although no differences in the use of antibiotics and systemic corticosteroids were observed between clusters, vasopressors were more frequently used in Cluster 3.

Table 1 Baseline characteristics of ICU patients with severe pneumonia.
Table 2 Clinical conditions of ICU patients with severe pneumonia.

Clinical outcomes

MV was more frequently applied in Cluster 1, while ECMO was more commonly used in Cluster 2 than in other clusters (Table 3). However, there was no significant linear trend across clusters. Cluster 1 had higher rates of extubation, lower rates of prolonged MV, lower rates of tracheostomy, and lower rates of in-hospital mortality than Cluster 3. A significant linear trend across clusters was noted.

Table 3 Clinical outcomes during a 30-day observation period.

In the Kaplan–Meier curve, Cluster 3 had higher risks of extubation failure and in-hospital mortality (Fig. 2). In each comparison, Cluster 3 showed a prolonged time to extubation than Cluster 1 (p-value < 0.001) and Cluster 2 (p-value < 0.001). Meanwhile, Cluster 3 showed a shortened time to in-hospital mortality than Cluster 2 (p-value = 0.002), although there was no significant difference between Cluster 1 and Cluster 3.

Figure 2
Figure 2The alternative text for this image may have been generated using AI.
Full size image

Time to extubation and in-hospital mortality.

Clinical factors related to extubation and in-hospital mortality

In unadjusted Cox regression analyses for extubation, Cluster 3 was associated with a decreased probability of successful extubation compared to Cluster 1 and Cluster 2 (Table 4). Additionally, COPD, initial PR, low PF ratio, systemic corticosteroid usage, and vasopressor administration were linked to a decreased probability of successful extubation. Higher APACHE II or SAPS II scores among conventional severity-of-illness measures were also correlated with a reduced likelihood of extubation (Supplementary information S6). In adjusted Cox regression analyses, Cluster 3 demonstrated a lower hazard ratio (HR) of extubation compared to Cluster 1 (HR: 1.373 [95% confidence interval (CI) 1.050–1.796]) and Cluster 2 (HR: 1.406 [95% CI 1.136–1.741]).

Table 4 Hazard ratio for extubation.

In unadjusted Cox regression analyses for in-hospital mortality, Cluster 1 exhibited a lower HR of in-hospital mortality compared to Cluster 2 and Cluster 3 (Table 5). Among comorbidities, asthma, chronic kidney disease, and congestive heart failure were associated with a higher HR of in-hospital mortality. Initial mean blood pressure, initial respiratory rate, PF ratio, urine output, PaO2, HCO3, lactate, AST, troponin I, and vasopressor usage were correlated with the risk of in-hospital mortality. Moreover, higher APACHE II, SOFA, and SAPS II scores among conventional severity-of-illness measures were associated with an increased likelihood of in-hospital mortality (Supplementary Information S7). In adjusted Cox regression analyses, Cluster 1 demonstrated a lower HR of in-hospital mortality than Cluster 2 (HR: 0.616 [95% CI 0.433–0.876]) and Cluster 3 (HR: 0.601 [95% CI 0.419–0.861]).

Table 5 Hazard ratio for in-hospital mortality.

Predictive performances of extubation and in-hospital mortality

The prediction model for successful extubation using all the variables representing profiles of thoracic muscle mass showed a higher AUC (0.844 [0.796–0.891]) than APACHE II (Delong test p-value < 0.001), SOFA (Delong test p-value < 0.001), and SAPS II (Delong test p-value < 0.001) (Fig. 3). Sensitivity, specificity, positive predictive value, negative predictive value, and accuracy for predicting successful extubation are summarized in Supplementary Information S8.

Figure 3
Figure 3The alternative text for this image may have been generated using AI.
Full size image

Prediction of extubation and in-hospital mortality.

The prediction model for in-hospital mortality employing all the variables representing profiles of thoracic muscle mass demonstrated a higher AUC (0.696 [0.659–0.733]) than APACHE II (Delong test p-value < 0.001), SOFA (Delong test p-value < 0.001), and SAPS II (Delong test p-value < 0.001) (Fig. 3). Sensitivity, specificity, positive predictive value, negative predictive value, and accuracy for predicting in-hospital mortality are summarized in Supplementary Information S9.

Discussion

Our study underscores the crucial role of thoracic muscle mass in determining clinical outcomes for ICU patients with severe pneumonia. The categorization into distinct clusters based on muscle mass profiles highlights the potential for differential treatment approaches. Patients possessing greater muscle mass (Cluster 1) demonstrated superior clinical outcomes, including higher extubation success and lower mortality rates. Patients in Cluster 1 were mainly middle-aged males with metabolic concerns, while Cluster 3 with worse outcomes consisted of older patients with sarcopenic conditions and cognitive impairments. Severity-of-illness scores such as APACHE II, SOFA, and SAPS II showed no significant differences across clusters. Prediction models using thoracic muscle mass profiles showed higher accuracy in predicting extubation success and in-hospital mortality than conventional severity scores.

Unsupervised clustering analysis serves as a robust methodology for evaluating the prognostic relevance in ICU patients19,20. The strength of clustering analysis resides in its ability to manage and interpret complex datasets, thereby facilitating a nuanced understanding of patient characteristics that might be obscured in broader analyses. A previous study has evaluated the effectiveness of a clustering approach based on clinical and echocardiographic parameters, showing that novel cardiovascular phenotypes in septic shock patients are closely associated with prognosis21. A recent study has reported that in ICU settings, clustering analysis enables provision of individualized mobilization strategies for different patient groups, significantly improving discharge success rates22. In numerous studies, sarcopenia identified through CT-measured muscle mass cutoffs has been linked to outcomes of ICU patients23,24,25. However, a significant limitation of these studies was the lack of a universally accepted gold standard for these cutoff values, which could vary widely depending on factors such as ethnicity, specific disease conditions, and imaging modalities. Despite the absence of a validated cut-off value for reduced muscle volume, clustering analysis can provide a unique perspective for examining intricate relationships between muscle mass and clinical outcomes. Our study categorized patients based on comprehensive profiles of thoracic muscle mass and enhanced our understanding of impacts of muscle volume profiles on clinical outcomes in critically ill patients. Our findings advocate for a broader application of clustering analysis in further studies on ICU patients.

Reduced skeletal muscle mass detected via CT scans has been associated with prognosis of critically ill ICU patients26. However, the precise mechanisms through which reduced muscle mass or sarcopenia can worsen prognosis in ICU patients remain poorly understood. There have been several hypotheses suggesting the relationship between skeletal muscle and prognosis in ICU. First, diminished skeletal muscle mass might indicate depleted protein reserves essential for responding to stress and infections, leading to an altered immune response27,28. Second, this reduction in muscle mass may significantly correlate with respiratory muscle weakness and reduce cough strength and lung function, thus increasing risks of respiratory failure and mortality29. Thirdly, reduced skeletal muscle mass can contribute to reduced mobility linked to poorer outcomes of hospitalized patients30. Indeed, a decrease in muscle thickness of the quadriceps has been associated with prolonged ICU stays31. This decreased mobility can lead to a higher incidence of complications such as pressure ulcers, deep vein thrombosis, and falls.

Our clustering analysis identified the association between decreased skeletal muscle mass and impaired cognitive function. A higher prevalence of cognitive impairments was found in the group with the lowest thoracic skeletal muscle index. This finding can be explained by a bidirectional relationship between pneumonia and cognitive function32. A recent study has shown that lower muscle mass is correlated with decreased cognitive function in older patients33. A decreased cognitive function can hinder rehabilitation and slow return to daily activities in ICU patients. One current study has shown that a lower skeletal muscle index is correlated with reduced success in weaning from a nasogastric tube, potentially affecting long-term outcomes34.

Conventional sarcopenia assessment methods such as DEXA and BIA face practical challenges in the ICU, where they are often impractical due to the need for specialized equipment and limited mobility of patients35. This situation necessitates the development of alternative strategies that utilize existing clinical data. Routine performance of chest CT scans for pneumonia diagnosis and treatment monitoring in the ICU present an opportunity for opportunistic imaging36. This method allows for the assessment of muscle mass directly from these scans, offering a streamlined, non-invasive approach to evaluating muscle health. The integration of muscle mass analysis into standard patient care protocols can significantly improve assessment of sarcopenia, enhance patient management, and potentially improve outcomes of critically ill individuals37.

Prognostic models such as APACHE II, SOFA, and SAPS II commonly used in ICU settings have their limitations, especially in older patient cohorts who exhibit unique health characteristics and needs38. In our study, we employed 3D segmentation to individually analyze pectoralis major and minor muscles (known to play a critical role in respiratory mechanics) and erector spinae muscles essential for maintaining posture and stability, particularly in patients with prolonged bed rest due to pneumonia. Our integrated analysis of these specific muscle groups enhanced by deep-learning algorithms demonstrated a prognostic accuracy superior to traditional measures. This comprehensive muscle assessment offers valuable insights into the severity of respiratory conditions and guides personalized care, highlighting the functional importance of each muscle group in patient outcomes.

While our study highlights the prognostic superiority of integrating muscle mass analysis using advanced AI-driven 3D segmentation techniques over traditional severity scores such as APACHE II and SOFA, it has several limitations. First, the retrospective design of our research introduced inherent constraints such as potential selection bias and the inability to establish causality. Second, generalizability of our findings might be limited as this study was conducted in a single teaching hospital. Third, relying solely on chest CT scans to assess thoracic muscle mass might overlook subtle physiological changes associated with conditions such as sarcopenia and other critical illnesses. Fourth, our focus on short-term clinical outcomes did not allow for evaluation of long-term impacts of muscle mass variations on patient recovery and quality of life. Fifth, despite statistical adjustments for known confounders, there remained a risk of residual confounding from unmeasured or inadequately measured variables such as nutritional status and pre-existing functional impairments. These factors could significantly influence observed associations between muscle mass and patient outcomes.

In conclusion, ICU patients with severe community-acquired pneumonia can be stratified into three distinct groups based on different profiles of thoracic muscle mass, each exhibiting unique clinical characteristics. Our findings underscore the potential role of thoracic muscle mass as a predictor of extubation success and in-hospital mortality in this patient population. To confirm our results and investigate long-term impacts of muscle health on patient outcomes, further multicentric studies are needed.