Introduction

Uterine leiomyoma (UL) is the most common benign tumor in female reproductive system1, and it occurs frequently in women aged 30 to 50 years old. Although UL is usually asymptomatic, it can cause abnormal uterine bleeding, infertility, pelvic pain, and even miscarriage2,3. These symptoms can negatively impact the patient’s daily quality of life, as well as cause significantly direct and indirect economic costs.

Diagnosis-related groups (DRG), as a modern case-mixing scheme, is recognized as one of the most advanced payment methods globally4,5. It serves as a crucial tool for measuring the quality and efficiency of healthcare services, as well as for health insurance reimbursement6. Based on the principle of prioritizing clinical similarity while considering the similarity in resource utilization, DRGs classify patients into diagnosis groups based on primary clinical diagnosis and surgeries, taking into account factors such as age, length of stay (LOS), comorbidities and complications (CCs), and surgery type, etc. Each diagnosis group is then assigned a code, which determines the corresponding reimbursement standards.

DRG not only balances the interests of governments, healthcare institutions, and patients but also helps achieve an optimal balance between healthcare quality and cost. As a result, it is widely utilized in healthcare resource allocation, healthcare institution pricing, performance management, quality control, and more7. Whether in meeting DRG payment requirements or ensuring medical quality, safety, and efficiency, strengthening clinical pathway management and promoting the standardization of clinical pathways are essential. Refining DRG groupings further optimizes medical service processes, improves patient satisfaction, and supports the healthy development of the healthcare industry8.

The exhaustive chi-square automatic interaction detection (E-CHAID) algorithm is an advanced modification of the traditional CHAID algorithm and is widely used in analyzing classified quantitative data. A study indicates that E-CHAID demonstrates superior performance in identifying the most significant grouping factors compared to CHAID, while also exhibiting enhanced segmentation capabilities over the CART algorithm9.

This study is based on the hospitalization costs of patients with uterine fibroids to explore the DRG groups scheme for inpatients with uterine fibroids and to formulate a reference standard for the hospitalization costs of patients with uterine fibroids, providing a basis for the reasonable control of hospitalization costs for patients with uterine fibroids in local city (Chongqing).

Materials and methods

Data source and selection

According to the International Classification of Diseases (ICD-10) standard, all patients with UL (ICD-10: D25.-) hospitalized between 2017 and 2021 were enrolled from a tertiary hospital. The data of front sheet of the medical record of each UL patient were retrospectively extracted, including the patient’s personal information, diagnosis and treatment information, and cost information.

After sorting out and checking the data quality of all UL cases, a total of 3708 cases were identified. Subsequently, those with the following criteria were excluded, including cases with missing or leaked items, or logical errors (2 cases), cases with a length of stay (LOS) shorter than 2 days or longer than 60 days (671 cases), case with hospitalization costs over the range of X ± 3s (47 cases). After exclusion, a total of 3087 valid cases were ultimately included in the study, with an effective inclusion rate of 81.09%. This study was performed under the guidelines established by the Declaration of Helsinki. As this is a retrospective analysis of deidentified data, the ethical approval and informed consents of patients were waived by ethics committee of Chongqing Medical University.

Statistical analysis

Univariate analyses of factors affecting the hospitalization costs in UL patients were performed by Mann-Whitney U test or Kruskal-Wallis H test. Then, those factors with a P < 0.05 were included for further multivariate logistic regression analysis to identify the main influencing factors of hospitalization costs. A significance level of P < 0.05 was used to determine statistical significance.

The decision tree model was constructed using E-CHAID. Independent variables were selected based on their impact on hospitalization costs and statistical significance, while the dependent variable was the total hospitalization costs of patients with UL to group DRG cases.

Evaluation index

The heterogeneity between groups was evaluated using the non-parametric Kruskal-Wallis H test and reduction in variance (RIV), while the homogeneity within groups was evaluated using the coefficient of variation (CV)9,10. The rationality and validity of the grouping were tested based on the data distribution. RIV was calculated using the formula: RIV = sum of squared deviations of subsets/sum of squared deviations of populations, and CV was calculated using the formula: CV = standard deviation/mean. The larger RIV indicates greater heterogeneity between groups, resulting in better grouping. The smaller CV indicates smaller within-group variation, which leads to better grouping. The data was analyzed using SPSS 26.0 software, and a significance level of P < 0.05 was used.

Results

Basic information of patients with UL

A total of 3087 eligible UL cases were enrolled in this study, with an average age of 43.03 years old and a median age of 44.00 years old. The age distribution showed that 29.58% of patients were aged ≤ 39 years, 53.97% were aged 40–49 years, and 16.46% were aged ≥ 50 years. The vast majority of patients were of Han ethnicity (98.06%) and married (93.85%). Approximately 96.7% of patients were covered by Medicare, while 3.3% were self-paying. The average LOS was 6.66 days, and 62% of all had a LOS ≤ 7 Days. The most common surgery type was hysteroscopic surgery (42.47%), followed by high-intensity focused ultrasound (26.17%) and general operation (22.55%). The average total hospitalization cost was 22,436.11 yuan and the median was 22,825.74 yuan. Detailed demographic characteristics and clinical features were presented in (Table 1).

Table 1 Demographic characteristics and clinical features, and univariate analysis of significant factor influencing the hospitalization costs in patients with uterine leiomyoma.

Composition of hospitalization costs

In order to eliminate the effect of inflation, the 2021 hospitalization costs for patients with UL were used as the baseline, and the hospitalization costs for patients with UL were also adjusted before and after the DRG payment according to the consumer price index for healthcare residents in Chongqing, as published by the national bureau of statistics. The highest proportion of the total hospitalization cost was treatment cost (29.46%), followed by diagnostic cost (28.98%), consumable cost (26.01%), drugs cost (9.00%), service cost (5.92%) etc. (Table 2). The cost of treatment, diagnosis and consumables accounted for 84.45% of the total cost.

Table 2 Composition of hospitalization costs for patients with uterine leiomyoma.

Univariate analysis of the influencing factors on hospitalized costs

The dependent variable for this study was the hospitalization cost for UL. The independent variables included age, nationality, marital status, type of medical insurance, admission method, admission condition, medical advice discharge, 31-day readmission schedule, Therapeutic effects, LOS, surgery type, and CCs. The Mann-Whitney U test or Kruskal-Wallis H test were performed the analysis. The results turned out that, except nationality, all other factors had a significant impact on the hospitalization costs in patients with UL (Table 1).

Multiple regression analysis of the influencing factors on hospitalized costs

Since the hospitalization costs for UL did not follow a normal distribution, we employed binary logistic regression analysis for logistic regression analysis. Patients were stratified into two categories based on the median cost of hospitalization, which was determined to be 22,436.11 yuan. This categorization resulted in the identification of a high-cost group (costs more than 22,436.11 yuan, coded as 1) and a low-cost group (costs less than 22,436.11 yuan, coded as 0). A binary logistic regression model was constructed with the hospitalization costs group for patients with UL as the dependent variable and the results of univariate analysis as independent variables. For unordered multinomial categorical variables, dummy variables were created with the first category at each factor level serving as the reference. The variance inflation factor of all predictors used in the model was tested to be less than 3 indicating that there is no multicollinearity among the independent variables. The findings revealed that age, surgery type, LOS, and CCs significantly influenced the hospitalization costs in UL patients (Table 3).

Table 3 Results of logistic regression analysis of hospitalization costs for patients of uterine leiomyoma.

DRG grouping: a decision tree model to construct the case combination and the hospitalization cost of each group

The hospitalization costs was used as the dependent variable, and the significant factors including age, LOS, surgery type, and CCs were used as the independent variables, which were included in the decision tree model. The parameters used in this study are as follows:11,12 the maximum depth of the decision tree model is 3, the minimum sample size of the parent node is 100, the minimum sample size of the child node is 50, and the significance level of the split node is α = 0.05. A total of 10 groups were identified (Fig. 1). After grouping, the Kruskal-Wallis non-parametric test was used to test the cost differences between groups. The results showed that the cost distribution of the 10 groups were not identical (Chi-square value was 2152.551, P < 0.05), indicating a good heterogeneity among the different DRG groups and statistically significant differences.

To enhance the comprehensibility of the cost of each DRG group, we included the original value of hospitalization costs in the analysis. As hospitalization costs are abnormal distribution, we used the median as the standard cost of medical resource consumption for UL cases. The calculated RIV value was 0.68, indicating significant heterogeneity between groups and significant classification. The CV for hospitalization costs ranged from 0.08 to 0.43, indicating minimal variation and reasonable grouping result. To establish cost control measures, the warning cost was set at 75(th) quantile of the inpatient cost for each DRG group, while the upper limit was set at 75(th) quantile plus 1.5 times the interquartile distance (P75 + 1.5Q). The results revealed that 40 cases (1.30% of the total) exceeded the upper limit in each DRG group (Table 4).

Fig. 1
figure 1

Decision tree diagram of influencing factors of hospitalization costs of uterine leiomyoma.

Table 4 Results of DRG grouping in patients of uterine leiomyoma.

Discussion

Through the decision tree model, we found that the surgery type was the most significant factor associated with the hospitalization cost in UL patients. Other significant factors included the LOS and CCs. The classification node of the decision tree model in the first layer was the surgery type, this is directly related to the recovery effect of patients and also has a significant impact on their economic burden during hospitalization. In order to improve the quality of medical services and reduce the financial burden on patients, doctors should take full account of patients’ actual situations and formulate personalized treatment plans13. The second level classification node was the LOS, which is an important temporal indicator of healthcare resource consumption, and the results show that the higher the LOS the higher the hospitalization costs of the patients. On the basis of ensuring the quality and safety of medical services, reducing LOS is an effective way of controlling hospitalization costs, which not only reduces the cost of hospitalization for patients and alleviates their financial burden14,15but also improves the turnover rate of hospital beds and the efficiency and effectiveness of medical services16,17. CCs was the third factor to consider. Having CCs means that the patient’s condition is more complex and requires more extensive examinations and treatments; these additional examination items and treatments will increase the patient’s hospitalization costs. In addition to the complexity of the condition, patients may require longer LOS for observation and treatment, which will naturally increase the basic costs such as bed and nursing fees during hospitalization18; hospitals should fully consider patients’ CCs when providing medical services, and adopt reasonable treatment plans and management measures to reduce hospitalization costs and alleviate patients’ financial burden. At the same time, hospitals should also strengthen their internal management, improve the efficiency and quality of medical services, and provide more high-quality, efficient and economical medical services to patients.

Although age was less influential compared to surgery type, LOS, and CCs, it still played a significant role in hospitalization costs among UL patients. Age can impact patients’ overall health status, and improving health conditions can significantly reduce the incidence of age-related diseases and associated medical costs. Therefore, measures such as strengthening disease prevention, optimizing resource allocation, enhancing healthcare coverage, and reinforcing supervision can effectively reduce hospitalization costs and improve the quality of life for elderly patients.

Conclusion

This study identified the factors that affect hospitalization costs in UL patients and constructed DRG payment groups based on the E-CHAID algorithm. The use of DRG grouping to establish hospitalization costs standards is advantageous for the rational allocation of medical resources, enhancing medical quality and efficiency, and reducing the economic burden on patients and society19. At the same time, it can provide a theoretical basis and verification methods for fully implementing DRG payment in the region.