Introduction

There is no doubt that COPD has become one of the major global public problems due to its high morbidity and mortality. It is characterized by persistent (often progressive) airflow limitation and chronic respiratory symptoms caused by airway abnormalities (bronchitis, bronchiolitis) and/or alveolar abnormalities (emphysema) and deserves the attention of people worldwide1,2. COPD is thought to be caused by tobacco smoking, however, in recent years studies have shown that COPD may be caused by impaired lung development3,4.Advancing insights into COPD mechanisms may reveal new strategies for disease prevention and progression control.

Unlike COPD, PRISm is considered to be a non-obstructive pulmonary dysfunction with reduced FEV1and FVC symmetry5,6. PRISm can be categorized into two groups based on the presence or absence of restrictive ventilatory abnormalities (FEV₁/FVC ≥ 0.7 and FVC %pred < 80%): restrictive PRISm and non-restrictive PRISm7,8. Measuring Total Lung Capacity (TLC) helps to more accurately assess the type and severity of pulmonary dysfunction. When TLC is reduced with an FEV₁/FVC ratio ≥ 0.7 and FEV₁ < 80%, it strongly supports the diagnosis of restrictive PRISm. Conversely, if TLC is normal, non-restrictive PRISm may be considered, along with other clinical indicators. Previous studies have shown that non-restrictive PRISm is associated with an increased risk of developing airflow obstruction within 5 years9. Since the term “Preserved Ratio Impaired Spirometry” was introduced by Wan et al. in 20145, it has been widely cited by researchers, and many large cohort studies have also adopted the criteria of FEV1/FVC ≥ 0.7 and FEV1< 80% predicted for defining PRISm10,11,12.Therefore, in our study, we adopted this definition as well. Recent evidence highlights a high prevalence of undiagnosed COPD, and even when COPD cannot be confirmed by PFT, this population shows signs of declining lung function, with PRISm being one of the subtypes at risk for disease progression and delayed treatment13.Moreover, PRISm is considered as a subtype that owns higher risk of progression to COPD14. A study by Kanetake et al. found that PRISm was an independent risk factor for the development of COPD in the Japanese population (OR 3.75, p< 0.001)15. The Rotterdam and COPD Gene study found that the incidence of transition from PRISm to COPD was significantly higher than in the group with normal lung function12,14.

Previous studies have shown that PRISm is a highly heterogeneous and unstable population characterized by a longitudinal transition to different categories of lung function (reverting to normal lung capacity, development of COPD, persistence of PRISm)10,12,14,16,17. Di He et al. used the English Longitudinal Study of Aging (ELSA) cohort, which reported that patients with severe PRISm (both FEV1 and FVC < 80% predicted values, FEV1/FVC ≥ 70%) had increased respiratory symptoms, a higher risk of death, and a greater chance of progressing to COPD than those with mild PRISm (FEV1 < 80% predicted, FEV1/FVC ≥ 70%)17. Park et al.’s research identified that PRISm patients are at risk of developing COPD and should be carefully monitored regardless of lung function, with particular attention to older patients as well as those with wheezing symptoms18. Similar results have been reported in other studies that PRISm subjects have higher mortality and many different lung function trajectories10,17.Therefore, some aspects of PRISm are associated with the deterioration of lung function and development of COPD. However, the incidence of COPD in PRISm patients has been rarely reported. The population-based studies were limited on risk factors for PRISm progression.

It is well known that once pulmonary function deteriorates to COPD, reversal becomes challenging19. Therefore, early identification of PRISm patients who are susceptible to progression to COPD is extremely important. We analyzed clinical features of inpatients progressing from PRISm to COPD, identified progression risk factors, and developed predictive models for early intervention to prevent irreversible lung function decline.

Methods

Study design and population

We extracted electronic medical records of 14,534 hospitalized patients who underwent at least two lung function tests at West China Hospital of Sichuan University between January 2012 and July 2020. After excluding 829 patients with incomplete baseline data, 8,561 with normal lung function, and 4,536 with stable COPD, we included 608 eligible patients (435 with COPD/PRISm and 173 with persistent PRISm) in the final analysis. Subsequent screening excluded 247 patients exhibiting unstable pulmonary function (defined as transitions between COPD, normal, or PRISm status during follow-up). 78 patients lacking complete clinical information and laboratory test results were further excluded. The final analysis included 283 patients: 134 showing disease progression and 149 without progression. The original study initially included 283 patients who were randomly divided into a training cohort (N = 227) and a validation cohort (N = 56) in an 8:2 ratio. The exclusion criteria for this study are listed as follows: (a) participants who had only one lung function test and had incomplete or inaccurate baseline clinical information; (b) participants whose lung function were normal and continued unchanged ; (c) patients diagnosed with COPD at baseline and continued unchanged; (d) Patients with unstable pulmonary function, defined as transitions between COPD, normal, or PRISm status during follow-up, including those whose lung function transitioned from COPD to PRISm or normal; (e) participants with incomplete clinical information or lacking detailed laboratory results; The flow chart illustrating the study participants is presented in Fig. 1. We used their data to establish retrospective cohorts and conduct analyses to explore risk factors for PRISm progression. In this study, 35 baseline variables were included in the prediction model as screening variables, and after further screening, the 6 most meaningful variables were eventually used to build the prediction model.

Fig. 1
figure 1

Flowchart of study participants. The included patients were randomly divided into to the training cohort (N = 227) and the validation cohort (N = 56).

This study was approved by the Ethics Committee of West China Hospital on October 23, 2023.The ethical approval number for this study is Trial in 2023 (No. 1856). As this study was retrospective and all data were used for anonymous analysis, West China Hospital waived the requirement of informed consent.

Diagnosis of COPD and PRISm

Spirometer parameters, including FVC, FEV1 and FEV1/FVC ratio were measured by spirometer (Vmax229D, SensorMedics, CA, USA) measurement, as recommended by the American Thoracic Society20.

PRISm refers to non-obstructive pulmonary dysfunction with reduced FEV1 and FVC symmetry, defined as FEV1/FVC ≥ 70% and FEV1< 80% pred by pulmonary function test (PFT)6.

A fixed ratio of FEV1/FVC less than 70% is the standard for airflow restricted spirometry. This standard is simple and independent of reference values and has been used in many clinical trials to diagnose COPD21.

In our study, the diagnosis of COPD was not solely reliant on pre-bronchodilator spirometry (FEV1/FVC < 70%) but was established through a multi-dimensional clinical evaluation by experienced physicians. This included detailed clinical history (chronic dyspnea, cough, smoking/environmental exposures), chest CT findings (emphysema, bronchial wall thickening, or air trapping), and rigorous exclusion of asthma based on the absence of episodic wheezing, atopy, or symptom reversibility, alongside persistent airflow limitation despite inhaled corticosteroid therapy.

Data collection and processing

We gathered baseline data on various symptoms such as cough and dyspnea, demographic factors like sex, age, body mass index, and ethnicity, as well as information on smoking history, family history of respiratory illness, and comorbidities such as respiratory complications, hypertension, diabetes mellitus, cardiovascular issues, psychiatric disorders, chronic renal problems, and funnel chest. Additionally, we obtained lung function indicators including forced expiratory volume in the first second (FEV1, %pred), the ratio of forced expiratory volume in the first second to forced vital capacity (FEV1/FVC), and forced vital capacity (FVC, %pred). Furthermore, we collected initial laboratory test results upon admission, encompassing routine blood tests, blood biochemistry, and fibrinogen levels, to explore potential associations with lung function progression. The analysis also incorporated several commonly used ratio measures in clinical practice, including monocyte-to-lymphocyte ratio (MLR), neutrophil to lymphocyte ratio (NLR), platelet lymphocyte ratio (PLR). All clinical data, including spirometry measurements, were collected during stable disease phases. Patients with acute exacerbations of COPD or worsening comorbidities during the observation period were excluded to minimize the influence of acute inflammatory responses on the analysis. Subsequent data underwent rigorous quality control and processing procedures.

Statistical analysis

We randomly divided participants into a training cohort and a validation cohort in an 8:2 ratio according to a simple randomization procedure. Continuous variables with normal distributions are represented by means and standard deviations, and skewered distributions are represented by medians and interquartile Range (IQR). For Categorical variables are presented in terms of frequency or percentage. Student’s t tests or Wilcoxon rank sum are used for continuous variables and chi-square tests or Fisher’s exact tests for categorical variables.

We first screened out variables with P values less than 0.05 based on lasso regression analysis, then performed multivariable logistic regression to determine the risk factors for PRISm progression, and finally used these factors to build a prediction model, which was presented in the form of a nomogram. When two-tailed p < 0.05, the results were considered significant. Where appropriate, results were expressed in terms of odds ratios (ORs) and 95% confidence intervals (CIs). Statistical analyses were performed using software R 4.3.2 (R Foundation for Statistical Computing, Vienna, Austria).We performed data extraction and analysis using several R packages, including “glmnet”, “metrics”, “rms”, “broom”, “pROC”, “rmda”. After building the model, we draw calibration curves to assess its calibration, employ ROC curves to evaluate its performance, and calculate the area under the curve (AUC, also known as C-statistics). Additionally, Decision Curve Analysis (DCA) and Clinical Impact Curve (CIC) were utilized to evaluate its clinical application value.

Establishment of a nomogram

We constructed a nomogram based on meaningful risk factors selected by lssso regression and multivariable logistic regression. Set different score values according to the different OR values of each factor. Next, based on the score value for each factor, use the corresponding position on the horizontal axis to determine the score for that factor. The scores for each factor are then combined to calculate an overall score, providing a specific probability of PRISm progress.

Results

Baseline demographics of the study participants

Following rigorous inclusion and exclusion criteria, our final analysis comprised 283 patients, including 134 with disease progression and 149 without progression. The baseline demographic characteristics of participants are shown in Table 1.

Table 1 Baseline demographic characteristics of patients in the progression and progression-free group.

Table 1 shows the clinical and demographic characteristics of patients in the two cohorts. Compared to the progression-free group, patients in the progression group exhibited older age, higher BMI, a higher prevalence of former and current smokers, lower predicted FEV1, lower FEV1/FVC ratio, lower predicted VC, a higher incidence of family history of respiratory disease, respiratory complications, chronic kidney disease, and a higher frequency of CT signs of emphysema. Laboratory tests also revealed significant differences in albumin, glucose, triglycerides, cholesterol, and fibrinogen levels between the two groups. In addition, during the data collection process, we found that the progression-free group showed a higher rate of funnel chest. (P < 0.05, Table 1). It is important to note that the progression-free group included many patients diagnosed with funnel chest, a condition predominantly found in adolescents according to previous studies. Hence, their clinical data may not adequately represent the condition of primarily stable PRISm patients. It is essential to include more representative populations in future studies.

In our comparative analysis of baseline data from the training cohort (N = 227) and validation cohort (N = 56), we observed no statistically significant differences in baseline demography and laboratory tests. Therefore, it is reasonable to use these cohorts as training and validation cohort for the model (Supplementary Table 1).

Feature selection

We utilized a LASSO regression algorithm to conduct feature selection in the training cohort, considering each feature individually. Lasso regression can effectively address the issue of multicollinearity that may arise during modeling. Among all 35 associated characteristic variables, 8 potential predictors were selected. In the LASSO analysis, eight variables with nonzero coefficients were retained (Fig. 2A).The most suitable tuning parameter λ for LASSO regression was determined to be 0.025, corresponding to the minimum value of the partial likelihood binomial deviance (Fig. 2B).

Fig. 2
figure 2

Feature selection by LASSO binary logistic regression model. A LASSO coefficient profiles for clinical features, each coefficient profile plot is produced vs. log (λ) sequence. Eight variables with nonzero coefficients were retained. B Selection of tuning parameter (λ) in the LASSO regression using 10-fold cross-validation via minimum criteria.

Multivariate analyses of risk predictors of lung function progression and create the prediction nomogram model using the training cohort

The eight retained variables were used for multivariable logistic regression analysis. The results of the multivariable logistic analysis for the progression of PRISm in the training cohort are presented in Table 2, and the odds ratios (ORs) for each risk factor, along with a 95% confidence interval, are illustrated using a forest plot (Supplementary Fig. 1). It is evident from the analysis that advanced age (P = 0.024), dyspnea (P = 0.046), comorbidity with respiratory disease (P = 0.005), and higher red blood cell count (P < 0.001) are positively associated with lung function progression. Conversely, patients with higher predicted FEV1 (P = 0.032) and FEV1/FVC ratio (P < 0.001) are at a lower risk of experiencing lung function progression. The nomogram was developed using six variable point scales, each corresponding to a respective variable. The total points were calculated as the sum of the points from these six variables. Finally, we set up a simple nomogram based on the selected risk factors (Fig. 3).

Table 2 Multiple logistic regression of progression for PRISm in the training cohort.
Fig. 3
figure 3

Nomogram for predicting PRISm progression to COPD. This nomogram contains six clinical attributes, including age, dyspnea, FEV1 pred, FEV1/FVC, respiratory complications, red blood cell.

Apparent performance of nomograms to predict PRISm progression

The estimated progress of the nomogram model to predict the progression of PRISm in the training cohort has a good correlation with the actual progress. Our results show that the calibration curve almost coincides with the Y = X line, indicating that the model is well calibrated (Fig. 4).

Fig. 4
figure 4

Calibration curve of nomogram in (A) training cohort and (B) validation cohort.

To assess the accuracy of our model, we performed ROC analyses on patients. The AUCs for discriminating between patients with and without lung function progression were 0.869, and 0.792 in our training and validation cohorts, respectively (Fig. 5). The model’s classification performance was evaluated by assessing its sensitivity and specificity. In the training cohort, the sensitivity of the model was 0.835, while in the validation cohort, it was 0.960. The specificity of the model was 0.771 and 0.613 in the two cohorts, respectively. In addition, DCA results show that nomogram models have good clinical application value. The gray curve shows that all patients received the intervention, the black line shows that no patients received the intervention, and the red curve shows the clinical benefit of our model (Supplementary Fig. 2). CIC showed that the “high risk number” line was relatively close to the “high risk number with event” line, indicating that using this nomogram model to predict PRISm progression has a large clinical net benefit (Supplementary Fig. 3).

Fig. 5
figure 5

The ROC curves for discriminating between patients with and without lung function progression in training (A) and validation (B) cohort. The area under the curve (AUC) is shown.

Discussion

Considering the high heterogeneity within the PRISm population, characterized by various progression or change trajectories, predicting the lung function progression of patients based on their actual changes has become an urgent issue14. In this study, we developed and validated a personalized predictive nomogram to assess the risk of PRISm progression to COPD, using cost-effective and readily available variables to help clinicians identify patients at higher risk of progression from PRISm to COPD for early intervention to delay irreversible lung function progression. The model demonstrates favorable predictive performance, as evidenced by reliable AUC values. Internal validation and evaluation further confirmed the nomogram’s accuracy in risk prediction.

In defining PRISm, we acknowledge the ongoing variability in diagnostic criteria across studies. While some authors classify PRISm as a condition where FEV1/FVC ≥ 0.7 and FVC < 80% of predicted values22,23,24, others include cases where either FEV1or FVC is below 80% of predicted values17. Notably, many large cohort studies have predominantly adopted the definition of FEV1/FVC ≥ 0.7 and FEV1< 80% of predicted values11,12,25. In our study, we aligned with this widely accepted criterion to ensure consistency and comparability with previous research. However, we recognize that the inclusion of cases with FVC < 80% may provide additional insights into the heterogeneity of PRISm. Future studies should explore the clinical and prognostic implications of differing definitions to establish a more standardized diagnostic framework. Additionally, integrating other functional metrics, such as total lung capacity (TLC) and diffusing capacity (DLCO), may further refine the classification and improve our understanding of PRISm’s underlying mechanisms. In recent years, significant changes have occurred in the interpretation of PFTs. The European Respiratory Society (ERS) and the American Thoracic Society (ATS) now recommend using the lower limit of normal (LLN) and Z-scores to define pulmonary abnormalities26,27. These updated criteria improve diagnostic accuracy and reduce misclassification caused by fixed thresholds. In our study, although some analyses still relied on the traditional FEV1/FVC ≥ 70% fixed ratio, we recognize the importance of LLN and Z-scores. Specifically, LLN allows for more precise identification of individuals with preserved FEV1/FVC ratios but impaired lung function, which is particularly relevant in defining PRISm. This approach enhances diagnostic accuracy and provides clearer guidance for early intervention28. Moving forward, we plan to adopt LLN and Z-scores to better classify PRISm patients. We also recommend integrating these updated criteria into clinical practice to improve the diagnostic value of PFTs.

Previous studies have shown that the pattern of lung capacity between COPD and PRISm changes over time29. But it’s worth noting that most of these studies have been conducted in Western populations, what’s more, most previous studies have assessed the prevalence5,30, risk factors31, and prognosis of PRISm10,11,32, but our study is the first retrospective study to predict the progression of PRISm in China. According to the study based on the Japanese population, the rate of progression is varied due to differences in patient background, such as smoking history, age and ethnicity. In Europe and the United States, COPD patients tend to be obese and non-emphysematous, while in East Asia, emphysema and emaciation are more common15. This further explains the necessity of exploring the risk factors of PRISm progression in our country. Our study also identified a subset of COPD patients who regained normal lung function or transitioned to PRISm, consistent with previous findings32. We propose three potential explanations for this phenomenon. First, lifestyle modifications and early interventions may delay or prevent disease progression. Second, the body’s inherent self-regulatory and repair mechanisms could contribute to functional recovery, particularly in patients without severe comorbidities. Third, instrumental variability might account for unstable lung function measurements in borderline cases. These findings highlight the importance of exploring PRISm progression risk factors within the context of China’s unique patient population.

Consistent with previous research, our study shows that the risk of PRISm progressing to COPD increases with age18,33. While age is a non-interventional factor, its significance underscores the importance of monitoring lung function from a young age, engaging in regular health check-ups, and initiating preventive interventions early in life to mitigate the risk of pulmonary function decline. Hye et al.‘s study further affirmed that in the overall sample, age, FVC, FEV1 pred, dyspnea, and wheezing were significant predictors of COPD and among PRISm patients, only age (OR, 1.14; P = 0.002) and wheezing (OR, 4.56; P= 0.04) emerged as significant predictors, which is consistent with our study14. Previous studies have shown a significant association between PRISm and dyspnea, and subjects complaining of dyspnea should be screened for PRISm and followed up on their lung function34. Our data confirm and extend these previous findings and identify additional risk factors for disease progression, enabling early screening of individuals in need of intervention.

At present, physicians often judge airflow limitation based on the decline of FEV1/FVC35. However, although FEV1/FVC is generally within the normal range for most patients with PRISm, our study suggests that a downward trend in this indicator may serve as a risk factor for disease progression. Additionally, patients with seemingly normal FEV1/FVC may not truly represent normal lung function and warrant further attention, confirming findings from previous research36. FEV1pred is a good indicator of moderate and severe airflow limitation. It is used to evaluate the classification and evaluation of the severity of airflow limitation in COPD. Due to its small variability and easy operation, it has been clinically used as a basic item in pulmonary function detection of COPD37. The reduction of FEV1pred was also found to be associated with the progression of pulmonary deterioration38. Our model incorporates this as one of the predictors, aligning with the current guideline recommendation to utilize the predicted percentage of FEV1 for classifying the severity of airflow obstruction. It also suggests that patients should be promptly alerted when a decline in predicted FEV1 is observed. In addition, our study has, for the first time, uncovered the impact of respiratory complications and elevated red blood cell count on the progression of PRISm, and these relationships can be further studied in the future.

There still exist several limitations in our study. First, the comorbidities of each system we collected were not detailed. Second, we lack a specific time interval for pulmonary function deterioration in patients. Third, while our diagnostic approach incorporated comprehensive clinical and imaging criteria to mitigate potential misclassification, the reliance on pre-bronchodilator spirometry remains a limitation. Future studies should adopt standardized post-bronchodilator testing to further reduce diagnostic uncertainty and enhance the validity of COPD phenotyping.Finally, since patients were not followed up regularly, we only included a small sample size of hospitalized patients for retrospective analysis, which may lead to some bias. For example, it is well known that previous/current smoking is a risk factor for worsening lung function, which has been confirmed in many studies29,39, but our model did not include this factor, which is due to the small retrospective sample size and cannot represent the characteristics of the entire population. Future multi - center longitudinal studies should establish external multi - center cohorts for external validation of model performance to identify more risk factors for PRISm progression and enhance the generalizability and reliability of research findings. Furthermore, future research should refine the assessment of comorbidities to explore the individual impact of each comorbidity on disease progression. Regular follow-up of PRISm patients is recommended for understanding the patterns and timing of changes, and early interventions should be implemented. This is the first study in China to predict the progression of PRISm, with a valuable prediction value. The model is well calibrated and the predictions have a strong correlation with the observed outcomes. Physicians can quickly identify PRISm patient who has higher risk of progression according to this simple prediction model.

Conclusion

PRISm is likely to progress to COPD, and older age, dyspnea, lower FEV1 pred, lower FEV1/FVC, respiratory complications and elevated red blood cell count are associated with PRISm progression. Further research is needed to help identify potential risk factors associated with PRISm progression as well as potential treatments and early intervention should be paid attention to in clinical practice to delay further deterioration of lung function.