Introduction

Cervical insufficiency (CI) is characterized by painless dilation or shortening of the cervix during the second trimester of pregnancy1, with incidence ranges between 1% and 3%2,3。CI constitutes 8–9% of preterm births, 8 ~ 25% of extremely preterm births, and 8 ~ 25% of recurrent miscarriages in the second and third trimesters4.

Diagnosis typically relies on a patient’s medical history, physical examinations, ultrasound evaluations during pregnancy.In some cases, preconception cervical function tests are also required3,5,6. Identifying CI among women without a history of cervical insufficiency or who have not undergone cervical function tests can be very challenging since many diagnoses were made when ultrasound results suggest the occurrence of cervical shortening or dilatation, which usually happens during the second trimester and leads to poorer pregnancy outcomes7. For instance, some women may experience premature rupture of membranes or intrauterine infections due to cervical dilation, potentially missing opportunities for timely clinical intervention and resulting in miscarriage or preterm delivery. In some cases, surgical intervention, such as emergency cerclage surgery, may be required. In one study, the abortion rate after emergency cervical cerclage was reported to be 20.89%, with premature birth rates reaching 68.35%8. These figures are notably higher than those associated with preventive cerclage surgery9.This implies that at present, women with CI often experience at least one adverse pregnancy outcome before receiving a diagnosis, delaying preventive treatments aimed at improving future pregnancy outcomes.

Even so, there are currently few established prediction methods or early diagnostic approaches for CI in clinical practice. There are only a limited number of retrospective studies in which risk factors associated with CI, most of the results have been limited to PCOS and testosterone levels10,11, with the occasional study discussing the correlation between past medical history and CI, unfortunately without combining PCOS and testosterone levels12. Thus, these studies considered only a restricted set of factors in their analyses, leading to a constrained predictive effect. Research into the aetiology of CI remains relatively rare and incomplete. Consequently, developing a more comprehensive and accurate method for the early prediction of CI is imperative.

The aim of the current study is to explore the risk factors associated with CI in individuals undergoing IVF-ET. To achieve this, a predictive model for CI using information obtained from women prior to pregnancy will be constructed. Given that these women undergo more comprehensive physical examinations before conception, we can obtain more detailed and extensive data to construct a more comprehensive and accurate predictive model.

Materials and methods

Study population

The study population comprised women who achieved successful pregnancies through IVF-ET at the Reproductive Center of Peking University Third Hospital between January 2016 and November 2022. These women subsequently delivered at the hospital’s Obstetrics Department. Comprehensive data were collected for all cases, covering preconception and pregnancy periods. Women with a history of cervical treatments-including cervical resection, cervical conization, loop electrosurgical excision procedures, laser treatment, and cryotherapy-or with female genital malformations were excluded. Ultimately, 2,494 cases were included in the analysis. Of these, 1,745 cases constituted the training set for model development, while 749 cases were allocated to the validation set for assessing the model’s performance.

Data collection

The data collected for this study included various variables, such as age; prepregnancy body mass index (BMI); prepregnancy hormone levels, including follicle-stimulating hormone (FSH), luteinizing hormone (LH), oestradiol(E2), testosterone (T), androstenedione (A), and anti-Müllerian hormone (AMH); diagnosis of polycystic ovary syndrome (PCOS); embryo cryopreservation (covering both fresh and frozen embryo transfers); the controlled ovarian stimulation protocol used in the fresh embryo transfer cycle (including clomiphene (CC), ultralong GnRH agonist, long GnRH agonist, short GnRH agonist, and standard GnRH agonist protocols); endometrial preparation method in the frozen embryo transfer cycle (natural or artificial cycles); frequency of hysteroscopic surgeries; frequency of curettage procedures; time elapsed between the last intrauterine procedure and pregnancy; previous gravidity (G) and parity (P); twin pregnancy status; prepregnancy hypertension and diabetes; and measurements of uterine and cervical length before the use of ovulation-inducing medications. All the data were extracted from the electronic medical records system of Peking University Third Hospital.

Further specifications were applied to certain variables. Hormone levels were assessed during the initial visit and on days 2–5 of the menstrual cycle; for participants with irregular menstrual cycles, such as those with PCOS, hormone testing was conducted on any day of the cycle. Additionally, to maintain consistency, women with hereditary collagen synthesis disorders, including Marfan syndrome or Ehlers–Danlos syndrome13,14, were excluded, as these conditions are recognized as risk factors for CI. Finally, in this study, multiple pregnancies were classified strictly as twin pregnancies, with cases of triplet or higher-order pregnancies excluded. The cervical length is measured by ultrasound physician as the distance from the internal os to the external os of the cervix and is reported in the ultrasound report.

Diagnostic criteria for CI

The currently recommended diagnostic criteria for CI are as follows3,5,6: (1) history-based diagnosis: defined as two or more instances of painless cervical dilatation resulting in recurrent miscarriage in the second trimester or preterm birth with painless cervical dilatation, after excluding other causes; (2) ultrasound-based diagnosis: which requires a history of one or more pregnancy losses or preterm births and a cervical length ≤ 25 mm before 24 weeks of gestation; and (3) physical examination-based diagnosis: identified by painless cervical dilatation or prolapsed fetal membranes detected on manual or speculum examination before 24 weeks of gestation.

Sample size considerations

The target sample size and the number of events per predictor were calculated using the R package “pmsampsize”, following the methodology outlined by Riley et al.15. The number of candidate predictors was specified as 8, with an event rate of 0.023 and a C-statistic of 0.9. The minimum required sample size for the training set was determined to be 1058, while the minimum number of events per predictor was estimated at 3.04, corresponding to a total of at least 25 events. Subsequently, considering a 70/30 training/validation split, the total sample size for the internal cohort was calculated, requiring a minimum of 1511 patients and 35 events.

Data analysis

Data for this study were obtained from the electronic medical records system of Peking University Third Hospital. Collection and organization were conducted using Microsoft Excel 2020, and subsequent processing and analysis were performed in R (version 4.2.2). In the data, some variables have a certain degree of missing values: LH 39/2494 (1.56%), FSH 143/2494 (5.73%), E2 36/2494 (1.44%), P 80/2494 (3.21%), T 27/2494 (1.08%), A 35/2494 (1.40%), AMH 44/2494 (1.76%), uterine length 51/2494 (2.04%), and cervical length 52/2494 (2.09%). The missing values were filled using the multiple imputation method.

Descriptive statistics, including mean ± standard deviation (x ± s) were used to describe normally distributed measurement data, with intergroup comparisons conducted via the t-test. For non-normally distributed data, median and quartile values M (P25, P75) were used, and intergroup comparisons utilized the rank-sum test. Categorical variables are presented as sample numbers (%), with intergroup comparisons performed using the chi-square test or Fisher’s exact test.

To ensure unbiased analysis, participants were randomly divided into a training set (70%) and a validation set (30%) at a 7:3 ratio using the R sample function. The training set was utilized for model development, while the validation set was reserved for model performance verification. Optimal biomarker thresholds (e.g., BMI, LH/FSH ratio, E2, P, T, A, AMH levels, and uterine and cervical lengths) were determined using the Youden index (J = sensitivity + specificity − 1), which maximizes the combined sensitivity and specificity of the receiver operating characteristic (ROC) curve in training set. This approach prioritizes balanced classification performance for clinical risk stratification.

The model development was initiated with the training set. Univariate logistic regression analysis was first conducted to identify potential predictors of CI occurrence, with a significance level of P < 0.05. LASSO regression was then applied to screen for meaningful predictors. Multivariate logistic regression analysis was used to evaluate variable independence and construct the final prediction model. A nomogram was generated to visualize model outcomes. The model underwent 1,000 rounds of self-sampling validation to assess its robustness, with calibration evaluated via a calibration curve. Model discrimination was assessed through the area under the ROC curve (AUC). A clinical decision curve analysis (DCA) and Calibration curvewas conducted to quantify the net benefit across a range of threshold probabilities, evaluating the model’s clinical utility. Finally, the constructed multifactor model was validated using the validation set.

The “pROC” and “ggplot2” packages were used to plot the ROC curves. The “rms” package was used for nomogram construction and calibration plotting, and “ggDCA” functions were utilized for the DCA. All the statistical tests were two-tailed, with statistical significance defined as P < 0.05.

Results

Incidence rate of CI in women undergoing IVF-ET

A total of 2,494 women undergoing IVF-ET were included in this study. Among them, 80 cases of CI were identified, yielding an incidence rate of 3.2%. The prevalence of CI in the training set was 3.3%(57/1745), and that in the validation set was 3.1%(23/749). Detailed characteristics of the study population are presented in Table 1 and Supplement Table 1.

Table 1 Characteristics in CI and non-CI women.

Logistic regression analysis of High-risk factors for CI

The cut-off point determined by Youden index for BMI is 22.83 kg/m², for testosterone is 0.74 nmol/L, for androstenedione is 11.45 nmol/L, for AMH is 3.50 ng/mL, for cervical length is 3.15 cm.

In the training cohort of 1,745 cases, univariate regression analysis identified 14 high-risk factors associated with CI (Table 2). LASSO regression analysis of the above 14 factors narrowed the selection to 10 significant predictors (Supplement Table 2). Multivariate logistic regression analysis further revealed that the following factors were positively associated with the incidence of CI: BMI > 22.83 kg/m² (OR = 2.499, 95%CI: 1.388 ~ 4.572, p = 0.002), testosterone > 0.74 nmol/L (OR = 3.324, 95%CI: 1.846 ~ 6.048, p < 0.001), androstenedione > 11.45 nmol/L (OR = 4.410, 95%CI: 2.155 ~ 8.826, p < 0.001), AMH > 3.50 ng/ml (OR = 2.073, 95%CI: 1.093 ~ 3.928, p = 0.025), frequency of hysteroscopic surgery (OR = 1.576, 95%CI: 1.200 ~ 2.033, p = 0.001), previous gravidity (OR = 1.404, 95%CI: 1.072 ~ 1.818, p = 0.012), and prepregnancy diabetes (OR = 4.989, 95%CI: 1.472 ~ 14.429, p = 0.005). Conversely, a cervical length > 3.15 cm (OR = 0.365, 95%CI: 0.167 ~ 0.727, p = 0.007) was negatively associated with CI incidence (Table 3).

Table 2 Univariate regression analysis of the training cohort.
Table 3 Multivariate regression analysis of the training cohort.

Prediction model

Based on the findings from the univariate, LASSO, and multivariate regression analyses, we developed a predictive model for CI, as shown in Fig. 1. This model incorporates the following risk factors: BMI > 22.83 kg/m², testosterone level > 0.74 mmol/L, androstenedione level > 11.45 nmol/L, AMH level > 3.50 ng/ml, frequency of hysteroscopic surgeries, presence of prepregnancy diabetes and the number of previous pregnancies. Cervical length > 3.15 cm is a protective factor for CI. Each factor is assigned a corresponding score, with cumulative scoring used to estimate the probability of CI occurrence (see Fig. 1).

Fig. 1
figure 1

Prediction model for CI, incorporating BMI > 22.83 kg/m2, testosterone level > 0.74mmol/L, androstenedione level > 11.45nmol/L, AMH level > 3.50ng/ml, frequency of hysteroscopic surgery, pre-pregnancy diabetes, cervical length > 3.15 cm and number of previous gravidity as risk factors.

Accuracy of the model

The predictive model demonstrated good performance, with an area under the curve (AUC) of 0.819 in the training cohort. The 95% confidence interval ranged from 0.758 to 0.881, indicating strong discriminatory ability (Fig. 2). The AUC of the validation set was 0.768 (95% confidence interval: 0.662 ~ 0.862), which indicated fair performance (Fig. 2).

The optimal cutoff value of the risk score was determined by maximizing the Youden index to achieve the best balance between sensitivity and specificity. In the training set, this threshold was identified as 84.523, yielding a sensitivity of 68.4%, a specificity of 79.5%, and an AUC of 0.819 (95% CI: 0.758 ~ 0.881). The corresponding ROC curve and decision curve analysis (DCA) are presented in Supplementary Fig. 1.

Fig. 2
figure 2

Discrimination and calibration of a model to predict CI in women undergoing IVF-ET treatment. (a) ROC curve of a model in training set to predict CI rate in patients undergoing IVF-ET. (b) The DCA curve of the training set. (c) Calibration curve of training set showing the association between the probability of CI as predicted by the model and the observed CI rate. (d) ROC curve of a model in validation set to predict CI rate in patients undergoing IVF-ET. (e) The DCA curve of the validation set. (f) Calibration curve of validation set showing the association between the probability of CI as predicted by the model and the observed CI rate. Abbreviations: IVF-ET, in vitro fertilization and embryo tranfer; CI, cervical insufficiency; ROC, Receiver Operating Characteristic Curve; DCA: Decision Curve Analysis.

Discussion

In the present study, we developed a predictive model for CI based on data from women who underwent IVF-ET treatment at our hospital and subsequently delivered at our facility.

The incidence of CI in this cohort was 3.2%, higher than the rate reported in the general population of women of childbearing age, which is consistent with findings from previous studies11,16.

A 2020 study involving 4,710 IVF/ICSI cases developed a CI prediction model and identified factors such as BMI >23.9 kg/m², uterine length ≤ 4.5 cm, gravidity, and testosterone levels >0.7 ng/ml as being associated with an increased risk of CI17. Previous studies have demonstrated the utility of hormone levels in predicting clinical pregnancy rates among IVF/ICSI populations. Accordingly, we incorporated hormone level assessments into the design of our study18. Compared to earlier research, our study expanded the scope of analysis by incorporating additional variables, such as AMH and androstenedione levels, specific IVF-ET treatment protocols (e.g., embryo cryopreservation), controlled ovarian stimulation strategies for fresh embryo transfer, endometrial preparation methods for frozen embryo transfer, and comorbid conditions like prepregnancy diabetes and hypertension. Furthermore, we considered not only the frequency of uterine cavity operations but also the time interval between the last intrauterine operation and subsequent pregnancy.

By incorporating these comprehensive factors into our predictive model, we aimed to provide a more robust and reliable tool for estimating the risk of CI. This model has the potential to enhance early detection for women at risk of CI.

Previous research has highlighted the association between BMI and adverse pregnancy outcomes, BMI exceeding 30 kg/m² is linked to an elevated risk of premature delivery19. Additionally, BMI >23.9 kg/m² as a significant high-risk factor17. Our findings are consistent with these observations, as we identified an increased risk of CI at a BMI threshold of >22.83 kg/m².

A potential explanation for this association is that a higher BMI increases abdominal pressure, which, in turn, exerts additional stress on the cervix, potentially leading to the development of CI. Furthermore, the altered endocrine environment observed in women with higher BMIs compared to those with lower BMIs may also contribute to the development of CI.

Hyperandrogenism is recognized as a significant risk factor for CI20. This study found that a testosterone level >0.74 nmol/L was significantly associated with an increased risk of CI. This result is largely consistent with the conclusion reported in previous studies that a testosterone level >0.7 ng/ml is associated with an increased risk of CI17, further supporting the reasonableness and potential reproducibility of the findings in this study. Additionally, the study also revealed that an androstenedione level >11.5 ng/dl was significantly associated with an increased risk of CI. Given that the research center in this study set >11.5 ng/dl of androstenedione as the clinical reference threshold for hyperandrostenedonism, the conclusion of this study not only has statistical significance but also has certain clinical basis and interpretability. While the exact physiological and pathological mechanisms linking hyperandrogenism to CI remain unclear, androgen metabolism may contribute to reduced cervical collagen tissue21,22,23, which could increase susceptibility to CI.

Current research on the relationship between androgen levels and CI has primarily focused on testosterone. Due to the constraints of detection methods, clinical assessments generally measure total testosterone rather than free testosterone, which is more biologically active. This limitation may reduce the sensitivity of testosterone as a diagnostic marker for hyperandrogenism. Despite this, testosterone exhibits higher biological activity compared to androstenedione24,25. Recently, androstenedione has gained recognition in clinical practice for its enhanced sensitivity in diagnosing hyperandrogenism. In this study, both testosterone and androstenedione were identified as independent risk factors for CI through multivariate analysis. Notably, this is the first study to reveal an association between androstenedione levels and CI, highlighting its potential as a valuable marker for assessing CI risk.

AMH is secreted primarily by ovarian granulosa cells26 and serves as the most sensitive biomarker for assessing ovarian reserve. A growing body of evidence indicates that serum AMH concentrations are significantly and positively associated with antral follicle count (AFC), while reduced AMH levels are linked to lower clinical pregnancy rates among infertile women27,28. AMH has also been identified as a predictor of premature delivery in individuals with PCOS29,30,31. However, the relationship between AMH and CI has not been explored in the current literature. This study proposes that AMH may represent a potential risk factor for CI. The obtained threshold of AMH (>3.50 ng/ml) is similar to the AMH levels of the PCOS preterm population in previous studies29,30,31, indicating that the conclusion of this study has clinical plausibility.

AMH, a member of the TGF-β superfamily, binds to specific receptors and activates the TGF-β signaling pathway.The TGF-β pathway is well known for its roles in fibrosis, cell proliferation, and apoptosis. Previous research has shown that in cervical cancer, AMH can influence cancer cell apoptosis by modulating cell cycle progression32. Notably, AMH receptors have been identified in cervical tissues33.

Women with impaired fasting glucose or diabetes prior to pregnancy face an increased risk of spontaneous abortion and preterm delivery. Although previous study found no significant correlation between CI and prepregnancy fasting glucose levels34, our findings demonstrated a positive association between diabetes and the incidence of CI. Meanwhile, among women with PCOS who experience CI, those with concurrent insulin resistance have poorer pregnancy outcomes10. Hyperglycaemia may contribute to degradation of cervical collagen integrity, as diabetes is associated with abnormalities in angiogenesis, cell adhesion, migration, proliferation, differentiation, and extracellular matrix (ECM) deposition35,36. Furthermore, studies have shown that diabetes impairs insulin signaling in fibroblasts, resulting in delayed wound healing37. These mechanisms warrant further investigation specifically in cervical fibroblasts.

Several risk factors identified in this study-including obesity, elevated androgen levels, high AMH levels, and diabetes-are also well-recognized contributors to PCOS38, which itself is considered a significant risk factor for CI. In our analysis, PCOS was positively correlated with CI in univariate regression analysis, but no significant correlation was found in multivariate analyses. This finding is reasonable, as the risk of CI among patients with PCOS likely stems from the presence of these associated risk factors. Excluding PCOS from the predictive model reduces the potential for double counting of risk values, enhancing the model’s reliability.

Our findings suggest that uterine length is not correlated with CI, which contrasts with previous studies that have linked uterine length to premature delivery, abortion39, and CI17. However, we identified a negative correlation between cervical length more than 3.15 cm prior to pregnancy and CI.

In addition to congenital factors such as collagen disorders, uterine malformations, and cervical length, acquired abnormalities in the cervical anatomy may also contribute to CI. The frequency of hysteroscopy emerged as a high-risk factor for CI. In contrast, the number of curettage procedures did not show a significant relationship with CI outcomes. This lack of association could be attributed to the minimal requirements for cervical dilation during curettage in clinical practice.

Previous research has reported that women undergoing hysteroscopic surgery within six months before pregnancy experience higher rates of miscarriage or premature delivery40. However, our study did not identify a significant correlation between CI incidence and intrauterine procedures performed within this timeframe.

Hysteroscopic surgery presents both advantages and disadvantages. While it is associated with an elevated risk of CI, it also improves clinical pregnancy rates. Additionally, hysteroscopic procedures provide an opportunity to evaluate cervical function, particularly in patients with cervical laxity. Given these dual effects, the criteria for performing hysteroscopic surgery should be carefully considered. When operative indications are present, instruments requiring minimal cervical dilation are recommended to mitigate the risk of CI.

Previous studies have reported a higher incidence of CI among women undergoing IVF-ET11,15. In this study, we found the incidence of CI to be 3.2% among patients treated with IVF-ET, slightly exceeding the reported incidence in the general population. Several factors may contribute to this finding. First, women undergoing IVF-ET are often affected by a higher prevalence of endocrine disorders, such as PCOS, which may increase the risk of CI. Second, some patients receiving IVF-ET undergo hysteroscopy prior to embryo transfer, which could also elevate the incidence of CI.

Studies have examined pregnancy outcomes associated with fresh and frozen embryo transfers. For instance, Shi Yuhua41 reported that compared to fresh embryo transfers, frozen embryo transfers significantly reduced the risk of pregnancy loss in women with PCOS, though CI incidence was not specifically addressed. In contrast, another study42 observed a higher incidence of CI in cases involving frozen embryo transfers compared to fresh embryo transfers. In our study, we investigated factors related to infertility and IVF-ET protocols, including methods of embryo cryopreservation and strategies for controlled ovarian stimulation or endometrial preparation. However, no significant association between IVF-ET protocols and the incidence of CI was identified.

We successfully developed a comprehensive predictive model for assessing CI risk in the IVF-ET population. The cutoff value of the total risk score of this model is 84.523, indicating that individuals with a score of ≥ 84.523 are at high risk of CI. It is recommended to enhance the monitoring of cervical length during pregnancy for this group. However, this study has several limitations. First, as a retrospective study, our findings require prospective validation and the lack of an independent external validation cohort may restrict the generalizability of the observed associations. This statistical limitation highlights the critical need for external validation in an expanded multicenter cohort to ensure the robustness of the model. Second, the absence of specific clinical data, such as the duration of hysteroscopy or the type of instruments used, limited our ability to explore the relationship between these factors and CI.

Conclusion

In conclusion, this article develop a predictive model for CI that includes BMI, testosterone level, androstenedione level, AMH level, frequency of hysteroscopic surgery, cervical length as risk factors. For the first time, AMH and androstenedione have been incorporated as predictors in a CI prediction model, Our model suggests potential utility for identifying CI risk in IVF-ET patients, incorporating AMH and androstenedione as exploratory predictors. It is also found that different IVF-ET protocols were not associated with CI. By employing this predictive tool, we can more accurately identify high-risk groups for CI prior to pregnancy. This advancement enhances the monitoring of cervical length during gestation, facilitating timely interventions that may mitigate the risks of miscarriage or preterm birth associated with CI.