Critical factors influencing live birth rates in fresh embryo transfer for IVF: insights from cluster ensemble algorithms

Yu, Zheng; Zheng, Xiaoyan; Sun, Jiaqi; Zhang, Pengfei; Zhong, Ying; Lv, Xingyu; Yuan, Hongwen; Liang, Fanrong; Wang, Dexian; Yang, Jie

doi:10.1038/s41598-025-88210-1

Download PDF

Article
Open access
Published: 30 January 2025

Critical factors influencing live birth rates in fresh embryo transfer for IVF: insights from cluster ensemble algorithms

Zheng Yu¹^na1,
Xiaoyan Zheng^2,3^na1,
Jiaqi Sun²,
Pengfei Zhang¹,
Ying Zhong³,
Xingyu Lv³,
Hongwen Yuan⁴,
Fanrong Liang²,
Dexian Wang¹^na2 &
…
Jie Yang^2,3^na2

Scientific Reports volume 15, Article number: 3734 (2025) Cite this article

3289 Accesses
Metrics details

Subjects

Abstract

Infertility has emerged as a significant global health concern. Assisted reproductive technology (ART) assists numerous infertile couples in conceiving, yet some experience repeated, unsuccessful cycles. This study aims to identify the pivotal clinical factors influencing the success of fresh embryo transfer of in vitro fertilization (IVF). We introduce a novel Non-negative Matrix Factorization (NMF)-based Ensemble algorithm (NMFE). By combining feature matrices from NMF, accelerated multiplicative updates for non-negative matrix factorization (AMU-NMF), and the generalized deep learning clustering (GDLC) algorithm. NMFE exhibits superior accuracy and reliability in analyzing the in vitro fertilization and embryo transfer (IVF-ET) dataset. The dataset comprises 2238 cycles and 85 independent clinical features, categorized into 13 categories based on feature correlation. Subsequently, the NMFE model was trained and reached convergence. Then the features of 13 categories were sequentially masked to analyze their individual effects on IVF-ET live births. The NMFE analysis highlights the significant influence of therapeutic interventions, Embryo transfer outcomes, and ovarian response assessment on live births of IVF-ET. Therapeutic interventions, including ovarian stimulation protocols, ovulation stimulation drugs, and pre-and intra-stimulation cycle acupuncture play prominent roles. However, their impacts on the IVF-ET model are reduced, suggesting a potential synergistic effect when combined. Conversely, factors like basic information, diagnosis, and obstetric history have a lesser influence. The NMFE algorithm demonstrates promising potential in assessing the influence of clinical features on live births in IVF fresh embryo transfer.

Advanced KPI framework for IVF pregnancy prediction models in IVF protocols

Article Open access 27 November 2024

Independent predictors and thresholds of in vitro fertilization outcomes in patients with diminished ovarian reserve

Article Open access 29 May 2025

Pseudo contrastive labeling for predicting IVF embryo developmental potential

Article Open access 15 February 2022

Introduction

Infertility affects approximately one in six couples worldwide¹. Assisted reproductive technology (ART) is recommended for couples with unresolved infertility. However, achieving a satisfactory pregnancy rate remains challenging. It indicates that the live birth rate (LBR) per initiated cycle was 40.1% for women under 35 and 4.5% for women over 42 in the United States in 2013². Previous research³ has highlighted key factors—including weight, ovarian function, and comorbidity—that significantly impact the success of assisted reproduction programs⁴. Recently, additional research has highlighted ethnic origin⁵, male age⁶, and embryo cryopreservation durations⁷ as potential variables. However, identifying the key influencing factors remains challenging.

Machine learning techniques offer a promising solution. By extracting insights from historical data, machine learning allows for comprehensive analysis and ranking of factors influencing ART outcomes. This cutting-edge discipline leverages complex big data to acquire valuable knowledge efficiently⁸ and has found extensive applications across various fields, including healthcare⁹. For instance, dynamic systems design and control in applications such as robotics, autonomous vehicles, and industrial process plants¹⁰. In the medical domain, machine learning has demonstrated its utility in tasks such as COVID-19 diagnosis and epidemic forecasting¹¹, medical image analysis¹², cancer diagnosis and treatment selection¹³, and electronic health record management¹⁴. This technology enables pattern recognition and prediction of disease risk, treatment responses, and patient outcomes¹⁵. Within the realm of ART, machine learning has been used to assess embryo quality¹⁶, analyze sperm characteristics¹⁷, and explore predictive models for ovarian reserve function (such as anti-Mullerian hormone(AMH) level, follicle-stimulating hormone(FSH) level, and age)¹⁸. However, despite these advancements, the relative importance of different influencing factors in the in vitro fertilization and embryo transfer (IVF-ET) process has not been thoroughly studied.

To address this gap, we propose the use of a clustering ensemble approach to analyze the significance of each feature in the IVF-ET algorithm model. Cluster analysis, an unsupervised machine learning technique, is particularly employed to extract insights from unlabeled data¹⁹. Effective clustering algorithms are widely applied across various fields, including Vehicular Ad hoc Networks (VANETs)²⁰ and other contexts where search efficiency and coverage of critical scenarios are key considerations²¹. Ensemble classifiers distinguish themselves in reducing false positives in high-risk scenarios²², thereby enhancing clustering accuracy. Their adaptability to various datasets²³, and robustness against data noise, bolstered by integrating multiple deep networks, further underscore their superiority²⁴.

Effective clustering algorithms such as non-negative matrix factorization (NMF)²⁵, accelerated multiplicative updates for non-negative matrix factorization (AMU-NMF)²⁶, generalized deep learning clustering (GDLC) algorithm based on NMF²⁷, Multi-view clustering (MVC) algorithm based on deep semi-NMF²⁸, generalized deep learning algorithm based on NMF for multi-view clustering²⁹, Meta-CLustering Algorithm (MCLA)³⁰, and dense representation based ensemble clustering (DREC) algorithm³¹ have been developed. These algorithms have been applied to identify signature genes associated with recurrent implantation failure (RIF)³² and gene co-clusters in two species³³, demonstrating their potential in complex biological datasets.

Given the existing gaps in the literature and the promise of machine learning techniques, we conducted a retrospective study to assess the significance of various influencing factors in the IVF-ET process. Data comprising clinical characteristics and live birth outcomes of IVF-ET patients at Sichuan Jinxin Xi’nan Women’s and Children’s Hospital between January 2022 and December 2022 were collected and analyzed using a self-developed ensemble algorithm called NMF-based ensemble algorithm (NMFE). This algorithm combines the strengths of NMF, AMU-NMF, and GDLC, aiming to improve the efficiency of data clustering and provide valuable insights aimed at enhancing the success rate of IVF-ET.

Results

Comparison between different algorithms

We conducted a comparison of the NMFE with some well-known effective algorithms. The algorithms used in the comparison are NMF¹⁷, AMU-NMF¹⁸, GDLC¹⁹, MCLA²² and DREC²³. The accuracy (ACC) and purity (PUR) values serve as metrics to assess the performance and effectiveness of different algorithms³⁴. A higher accuracy value indicates a greater proportion of correct predictions, whereas a higher purity value indicates a higher percentage of instances that are accurately classified. The accuracy and purity value of NMFE are 0.7912 and 0.8605 respectively, surpassing those of other algorithm models. This indicates that NMFE is more effective (Table 1; Fig. 1).

Table 1 Results of accuracy and purity value on the IVF dataset by different clustering algorithms. The accuracy and purity value of NMFE are 0.7912 and 0.8605 respectively, surpassing those of other algorithm models. *The maximum value in the same group.

Full size table

Ranking analysis of influencing factors

In our study, we extended our analysis to compare the efficacy of NMF, AMU-NMF, GDLC, and NMFE algorithms. Following the random masking of the original data by three different sets of random numbers, NMFE consistently exhibited higher accuracy and purity values compared to other algorithms across the majority of cases (Table 2).

Table 2 Accuracy and purity values of different algorithms by masking each feature group by three different sets of random numbers. NMFE consistently demonstrated higher accuracy and purity values compared to other algorithms in the majority of cases. *The maximum value in the same group.

Full size table

To further investigate the impact of specific feature groups on the IVF-ET model, we masked the data for groups such as Therapeutic Interventions, Embryo Transfer Outcomes, and Ovarian Response Assessment Indicators with random numbers. This manipulation resulted in a significant decrease in the overall accuracy value of the IVF-ET model, suggesting that these feature groups exert a substantial influence on the model’s performance.

To quantify the influence of various feature groups on the IVF-ET outcome, we computed ACC-GAP and PUR-GAP values by summing the accuracy and purity values after masking the data with the three sets of random numbers. Smaller calculated values for these gaps indicated a more robust influence of the feature group on the overall model and a greater effect on the IVF-ET outcome (Table 3; Fig. 2.). Among the feature groups, Therapeutic Interventions exhibited the smallest ACC-GAP and PUR-GAP values, suggesting that they contribute the most significantly to the model.

Table 3 The ACC-GAP and PUR-GAP values for each feature group were calculated using NMFE. The feature group therapeutic interventions demonstrated the lowest ACC-GAP and PUR-GAP values. Conversely, the male Basic Information feature group had the highest ACC-GAP value. The female Basic Information feature group had the highest PUR-GAP value. *The maximum value in the same group, # the minimum value in the same group.

Full size table

Based on our influence analysis, we identified the top five groups with the greatest influence on the IVF-ET result: Therapeutic Interventions, Embryo Transfer Outcomes, Ovarian Response Assessment Indicators, Embryo Transfer-Related Indicators, and Complications During Pregnancy. Within the Therapeutic Interventions group, factors such as the ovarian stimulation protocol, ovulation stimulation drugs, and pre-cycle and intra-cycle acupuncture were found to be particularly influential. To gain a deeper understanding of the impact of each treatment plan on the IVF-ET result, we conducted a separate analysis for each intervention factor (Table 4; Fig. 3).

Table 4 ACC-GAP and PUR-GAP value of different intervention factors group by NMFE. Ovarian stimulation protocols, ovulation stimulation drugs, and pre-and intra-cycle acupuncture ranked 7th, 8th, 12th, and 14th based on ACC-GAP values, and 4th, 6th, 13th, and 14th based on PUR-GAP values. * the maximum value in the same group, # the minimum value in the same group.

Full size table

Upon further analyzing the clinical features within the Therapeutic Interventions separately, we observed a shift in the rank of influential factors. Specially, ovulation-stimulating drugs dropped to seventh place, ovarian stimulation protocol dropped to eighth place, and acupuncture treatment was further behind. Whether this shift indicates a synergistic effect among multiple therapies will require further validation to confirm.

Discussion

In this study, we have proposed an ensemble clustering algorithm model to assess the influence of clinical characteristics on IVF-ET live births of fresh embryo transfer. This algorithm surpasses other algorithms in terms of accuracy and purity, demonstrating its robustness and reliability in handling the IVF-ET dataset. The results revealed that the five leading feature groups with the most substantial impact on live births in IVF-ET are Therapeutic Interventions, Embryo Transfer outcomes, Ovarian Response Assessment Indicators, Embryo Transfer-related Indicators, and Complications During Pregnancy. Conversely, factors such as basic male and female information, female diagnosis, and obstetric history had a relatively minor influence.

Among these, Therapeutic interventions as the most influential factor, encompassing multiple aspects of the treatment plan, including ovulation stimulation drugs (recombinant human follicle-stimulating hormone (rFSH) and human menopausal gonadotropin (hMG)), the ovarian stimulation protocol, and the utilization of acupuncture before and during the IVF cycle. The European Society for Human Reproduction (ESHRE) guideline on ovarian stimulation in IVF/ICSI recommends both rFSH and hMG as viable options³⁵. However, the initial dosage of gonadotrophin is pivotal in determining the outcome of controlled ovarian stimulation (COS) and subsequent IVF outcomes³⁶. Thus, it’s crucial to consider an individual’s ovarian potential before initiating stimulation, as a standardized prescription may adversely affect women’s outcomes³⁷. For example, low doses may result in insufficient follicular development in women with normal or high ovarian reserve while excessive doses could lead to ovarian hyperstimulation syndrome (OHSS)³⁸.

Once the follicle has reached a certain size, gonadotropin-releasing hormone (GnRH) -agonists can be used to stimulate the maturation and increase ovum count. On the other hand, recombinant GnRH -antagonists can be employed to inhibit the release of natural luteinizing hormone, thereby preserving eggs for further development. The selection of the ovarian stimulation protocol closely correlates with OHSS occurrence and clinical pregnancy rate³⁹. In the general IVF population, GnRH antagonists were associated with a lower ongoing pregnancy rate after fresh embryo transfer compared to long-protocol agonists with lower OHSS rates. This underscores the challenge of selecting the most suitable protocol for individual patients. Individualizing treatment in IVF aims to maximize pregnancy chances while minimizing ovarian stimulation risks³⁸.

Thus, the selection of ovarian stimulation drugs and protocols is a crucial factor for IVF-ET outcomes, and treatment should be individualized based on ovarian response³⁵. Our model highlights ovarian response as a key factor, recommending antral follicle count (AFC) or AMH for predicting high or poor ovarian response²⁶. Since age and BMI inversely correlate with AMH, they are also important considerations when personalizing treatment plans^40,41,42,43.

Acupuncture, as traditional adjuvant therapy, is being increasingly chosen by subfertility couples to improve the success rate of IVF-ET^44,45. In the United States, 44% of infertile women undergoing IVF-ET administrate acupuncture⁴⁶. However, the potential of acupuncture to enhance the live birth rate of IVF-ET remains debatable^47,48. Recent clinical studies have indicated several positive effects of acupuncture. It has been found to reduce anxiety during embryo transfer⁴⁹, improve oocyte quality⁵⁰, and enhance endometrial blood flow and receptivity⁵¹, ultimately leading to improved outcomes in IVF-assisted pregnancy.

Additionally, when examining the impact of the ovulation stimulation drugs, ovarian stimulation protocols, and acupuncture (pre-cycle and intra-cycle), we observed a significant decrease in their influence on the IVF-ET model, with acupuncture showing the least effect. To investigate this further, we conducted additional analysis and data mining. We found that the majority of patients in our dataset did not receive acupuncture treatment. Only 198 patients received intra-cycle acupuncture and 144 patients received pre-cycle acupuncture. It is important to note that the efficacy of acupuncture is closely related to the number of sessions^52,53 Therefore, the limited use of acupuncture in our dataset may not accurately reflect its true potential in enhancing IVF-ET outcomes. Hence, concluding that acupuncture is ineffective based solely on our findings would be premature. Upon comprehensive consideration of the intervention factors, their combined influence remains significant, hinting at potential synergistic effects among multiple therapies. However, further validation is required to substantiate this observation.

Our results indicated that multiple clinical features after embryo transfer significantly impact the IVF-ET model. Specifically, we considered the Embryo Transfer Outcomes group and the Complications During Pregnancy group. The Embryo Transfer Outcomes group encompassed conditions such as ectopic pregnancy, miscarriage, and premature delivery, while the Complications During Pregnancy group included gestational hypertension, gestational diabetes, intrahepatic cholestasis of pregnancy, fetal transfusion, and premature rupture of membranes. These findings aligned with established clinical patterns^54,55,56,57, suggesting good validity for our model in analyzing the IVF-ET dataset. Additionally, the Embryo Transfer-related Indicators group comprised factors like endometrial thickness, transferred embryo count, and transferred good-quality embryo count. These factors are widely acknowledged as critical determinants of live birth outcomes in the context of IVF-ET^58,59.

Our research findings emphasize the significance of various factors in IVF-ET outcomes. While obstetric history, which includes past pregnancies and deliveries, is generally considered relevant to IVF success, our data mining model does not assign it significant importance compared to other features. A history of successful pregnancies may suggest fertility capability, while previous failed pregnancies or miscarriages could indicate underlying fertility issues. Similarly, cesarean sections or uterine surgeries may affect uterus shape and integrity, potentially impacting embryo implantation. However, our model found that prior obstetric history did not significantly affect IVF-ET outcomes. It is important to note that patients seeking ART assistance often face significant fertility challenges and may have compromised natural conception abilities. Although past reproductive history may influence future pregnancies, it is not decisive in determining IVF-ET success. Our model also indicates that the cause of a woman’s infertility does not play a significant role in IVF-ET outcomes. Additionally, we considered the ethnicity of both partners in our analysis, given China’s multi-ethnic nature. Our dataset included patients from 30 ethnic groups, with the largest representation being Han (n = 2006), followed by Tibetan (n = 52) and Yi (n = 138). Other ethnicities, such as Hui, Tujia, Qiang, and Miao, were less prevalent. Interestingly, our results show minimal impact of ethnicity on the model. Furthermore, the educational background and occupation of both partners had minimal influence on the model, indicating that these factors may not significantly affect IVF-ET success.

Conclusions

Our data mining results indicate that therapeutic intervention, ovarian function, and embryo quality are the primary factors influencing pregnancy outcomes in fresh embryo transfer. Conversely, ethnic background, occupational status, educational levels, female infertility cause, and previous pregnancy history do not significantly impact pregnancy outcomes. Using NMFE, we evaluated and ranked the influence of various factors on patients undergoing fresh embryo transfer.Several limitations point to avenues for future research. Firstly, we did not explore in detail how specific characteristics impact IVF-ET outcomes. For instance, we did not determine optimal ovarian stimulation protocols tailored to individual patients. Similarly, we did not investigate the efficacy of acupuncture administered before and during the IVF cycle, nor did we establish the ideal number of acupuncture sessions. Furthermore, our model did not establish optimal dosages for medications or guide combining clinical interventions to achieve the best results. As such, our next steps involve enriching the dataset and conducting an in-depth analysis of these issues. Additionally, we plan to develop an artificial intelligence-driven personalized IVF support model to assist clinicians in selecting better treatment plans. Moreover, insights from this study will be used to further investigate matters related to frozen embryo transfer, with the ultimate goal of reducing economic costs for patients seeking assisted reproduction.

Materials and methods

Dataset

From January 2022 to December 2022, a total of 9539 patients underwent IVF at Sichuan Jinxin Xi’nan Women’s and Children’s Hospital, of which 3695 patients received fresh embryo transfer and 2238 patients observed pregnancy outcomes (Fig. 4). This study was approved by the Medical Ethics Management Committee of Sichuan Jinxin Xinan Women and Children’s Hospital (Ethnic number: No.2023-043) and was conducted according to all relevant guidelines and regulations. Since the data collected is anonymous, the requirement for informed consent was waived by the committee.

The dataset used in this study contains clinical features before and after IVF-ET. It consists of a total of 85 independent features, 69 clinical features before fresh embryo transfer, and 16 clinical features after transfer. Based on the correlation between features, we grouped them into 13 categories: Female Basic Information (3 items), Male Basic Information (5 items), Menstrual History (3 items), Obstetric History (12 items), Previous History of Assisted Reproduction (3 items), Ovarian Response Assessment Indicators (10 items), Therapeutic Interventions (4 items), Factors Associated with Embryo Quality (13 items), Female Diagnosis (10 items), Embryo Transfer-related Indicators (6 items), Hormone Levels After Transplantation (2 items), Embryo Transfer Outcomes (9 items), and Complications During Pregnancy (5 items) (Table 5; Fig. 5). The dataset contains one dependent feature, referred to as “Result”, which is divided into categories such as no pregnancy, miscarriage, and live birth. Of the cases in the dataset, 1,660 were not pregnant, 298 were miscarriages including ectopic pregnancy, biochemical pregnancy, and abortion, and 280 were live births. A comparative analysis of statistical differences in clinical features between groups with and without live births after fresh embryo transfer is shown in Supplementary Tables 1 & Supplementary Table 2.

Table 5 Features in IVF-ET dataset. The dataset used in this study contains 85 independent clinical features. Based on the correlation between features, we grouped them into 13 categories.

Full size table

Proposed ensemble algorithm

In this paper let $X=\left\{ {{x_1},{x_2}, \ldots {x_n}} \right\} \in {{\mathbb{R}}^{M,N}}$ us denote the dataset of IVF-ET. ${x_n}$denotes the n-th sample in the dataset. M denotes the feature dimension of each sample, and each feature is to portray a sample effective attribute. N denotes the sum of all samples involved in this modeling. In this paper, we use non-negative matrix factorization (NMF)⁶⁰ and its two variants algorithms to construct an ensemble model.

NMF works by approximating the high-dimensional target matrix using two low-dimensional matrices. We obtain effective low representations through multiple variants of the NMF algorithm, and then the ensemble model is constructed by fusing the low-dimensional feature matrices obtained from the training of multiple models. The objective function of NMF is shown in Eq. (1).

$$\mathop {\arg \hbox{min} }\limits_{{U,V}} {J_1}\left( {U,V} \right)=\left\| {X - UV} \right\|_{F}^{2},s.t.{\text{ }}U \geqslant 0,V \geqslant 0.$$

(1)

where U, V are two low-dimensional matrices. U is the weight matrix and V is the feature matrix, and $U \in {{\mathbb{R}}^{M,K}},V \in {{\mathbb{R}}^{N,K}}$. K is the dimension of the low-dimensional matrix, in which$K \ll \hbox{min} \left\{ {M,N} \right\}$. In Eq. 1, ${\left\| {{\text{ }} \cdot {\text{ }}} \right\|_F}$ is the Frobenius norm. To obtain U and V that approximate the original matrix X. The corresponding update rules are usually obtained using multiplicative updating. Furthermore, to accelerate NMF update and improve the effectiveness of the algorithm, a significant acceleration algorithm AMU-NMF was proposed by Gillis et al.²⁶. It improves the efficiency of the algorithm while ensuring convergence. Further, to improve the representation ability and convergence speed of the algorithm. Wang et al. proposed a deep matrix factorization representation learning algorithm GDLC based on element update. Its objective function is shown in Eq. (2).

$$\begin{gathered} \mathop {\arg \hbox{min} }\limits_{{U,V}} {J_1}\left( {U,V} \right)=\left\| {X - UV} \right\|_{F}^{2}+\alpha \left\| U \right\|_{F}^{2}+\beta \left\| V \right\|_{F}^{2},s.t.{\text{ }}U \geqslant 0,V \geqslant 0. \hfill \\ ={\sum\limits_{{m=1}}^{M} {\sum\limits_{{n=1}}^{N} {\left( {{x_{m,n}} - \sum\limits_{{k=1}}^{K} {{u_{m,k}}{v_{n,k}}} } \right)} } ^2}+\alpha \cdot {\sum\limits_{{m=1}}^{M} {\sum\limits_{{k=1}}^{K} {\left( {{u_{m,k}}} \right)} } ^2}+\beta \cdot {\sum\limits_{{n=1}}^{N} {\sum\limits_{{k=1}}^{K} {\left( {{v_{n,k}}} \right)} } ^2} \hfill \\ \end{gathered}$$

(2)

To optimize the objective function, a stochastic gradient descent algorithm^61,62 and an alternate iterative update strategy⁶³ are used to minimize the objective function.

To better improve the effectiveness of the algorithm, we fused the feature matrices obtained from the learning of the three algorithms NMF, AMU-NMF, and GDLC to construct an NMF-based ensemble algorithm (NMFE). Since the feature matrices are all non-negative matrices, to satisfy the effectiveness of the fusion algorithm and to ensure non-negativity, we propose a deep fusion-based method. The objective function of the method is shown below .The algorithm framework for NMFE is shown in Fig. 6.

$$\begin{gathered} \mathop {\arg \hbox{min} }\limits_{{U,V}} {J_1}\left( {U,V} \right)=\sum\limits_{{i=1}}^{I} {\left\| {{V^{(i)}} - E} \right\|_{F}^{2}} +{\alpha _1}\left\| E \right\|_{F}^{2},s.t.{\text{ }}U \geqslant 0,V \geqslant 0. \hfill \\ =\sum\limits_{{i=1}}^{I} {{{\sum\limits_{{n=1}}^{N} {\sum\limits_{{k=1}}^{K} {\left( {v_{{n,k}}^{{(i)}} - {e_{n,k}}} \right)} } }^2}} +{\alpha _1} \cdot {\sum\limits_{{n=1}}^{N} {\sum\limits_{{k=1}}^{K} {\left( {{e_{n,k}}} \right)} } ^2} \hfill \\ \end{gathered}$$

(3)

Where I denote the number of models that are used to construct the ensemble model. ${\text{i}} \in \left\{ {1,2,3} \right\}$,${V^{(1)}}$,${V^{(2)}}$,${V^{(3)}}$ denote the feature matrices obtained by algorithm NMF, AMU-NMF, and GDLC respectively. The objective function of the matrix is written in elemental form and then the SGD is used to optimize the objective function, which can be obtained as follows concerning the variable${e_{n,k}}$.

$${J_2}\left( {{e_{n,k}}} \right)={\left( {v_{{n,k}}^{{(i)}} - {e_{n,k}}} \right)^2}+{\alpha _1} \cdot {\left( {{e_{n,k}}} \right)^2}$$

(4)

Based on SGD its update rule can be obtained as follows.

$$e_{{n,k}}^{{\left( t \right)}} \leftarrow e_{{n,k}}^{{\left( t \right)}} - \eta \left( {\left( {v_{{n,k}}^{{(i)}} - {e_{n,k}}} \right) \cdot \left( { - 1} \right)+{\alpha _1}{e_{n,k}}} \right)$$

(5)

There is a subtraction operation in Eq. (5), which does not guarantee that the update value is non-negative. For this reason, we use an activation function with a non-negative value domain to constrain in Eq. (4), rewriting (4) as

$${J_2}\left( {{e_{n,k}}} \right)={\left( {v_{{n,k}}^{{(i)}} - f\left( {{{\hat {e}}_{n,k}}} \right)} \right)^2}+{\alpha _1} \cdot {\left( {f\left( {{{\hat {e}}_{n,k}}} \right)} \right)^2}$$

(6)

We let $f\left( \cdot \right)=sigmoid\left( \cdot \right)$. Based on the work in (6), the element’s SGD-based gradient values are transformed into weights for constructing the deep network for the update, and we can obtain the following update rule.

$$\left\{ \begin{gathered} {\left( {{{\hat {e}}_{n,k}}} \right)^T}={\left( {{{\hat {e}}_{n,k}}} \right)^1}+\eta {\left( {\Delta {e_{n,k}}} \right)^r}{\text{ }}r<R \hfill \\ {\left( {{{\hat {e}}_{n,k}}} \right)^1}\mathop \leftarrow \limits^{{r+1}} f\left( {{{\left( {{{\hat {e}}_{n,k}}} \right)}^T}} \right){\text{ }}r<R \hfill \\ {e_{n,k}}={\left( {{{\hat {e}}_{n,k}}} \right)^1}+\eta {\left( {\Delta {e_{n,k}}} \right)^r}{\text{ }}r=R \hfill \\ {\left( {\Delta {e_{n,k}}} \right)^r}=\sum\limits_{{t=1}}^{T} {\left( {\left( {v_{{n,k}}^{{(i)}} - f{{\left( {{{\hat {e}}_{n,k}}} \right)}^t}} \right) \cdot \left( { - 1} \right)+{\alpha _1} \cdot f{{\left( {{{\hat {e}}_{n,k}}} \right)}^t}} \right) \cdot f{{\left( {{{\hat {e}}_{n,k}}} \right)}^t} \cdot \left( {1 - f{{\left( {{{\hat {e}}_{n,k}}} \right)}^t}} \right)} \hfill \\ \end{gathered} \right.$$

(7)

Where R denotes the total number of rounds for training, and T denotes the number of times that the element ${\hat {e}_{n,k}}$ is updated in a round. $\eta {\left( {\Delta {e_{n,k}}} \right)^r}{\text{ }}$ denotes the cumulative value of the gradient for update the element ${\hat {e}_{n,k}}$ in the r-th round. By using the update rule of Eq. (7), we can learn to obtain the matrix E, which will be clustered using the k-means algorithm, and the clustering results of the NMFE model can be obtained.

Data availability

Access to source code and dataset: https://github.com/zh6463/raw-data.git.

Abbreviations

ACC:: Accuracy
AFC:: Antral follicle count
AMH:: Anti-Mullerian hormone
AMU-NMF:: Accelerated multiplicative updates for non-negative matrix factorization
ART:: Assisted reproductive technology
BiTSC:: The bipartite tight spectral clustering
COS:: Controlled ovarian stimulation
D3:: Cleavage embryo
D5:: Blastocyst embryo
DREC:: Dense representation-based ensemble clustering
E2:: Estradiol
EMT:: Endometrium thickness
ESHRE:: European Society for Human Reproduction
ET:: Embryo transfer
FSH:: Follicle-stimulating hormone
GDLC:: The generalized deep learning clustering
GnRH:: Recombinant gonadotropin releasing hormone
GnRH-a:: Gonadotropin-releasing hormone agonist
GnRH-A:: Gonadotropin-releasing hormone antagonist
hMG:: Human menopausal gonadotropin
IVF:: In vitro fertilization
IVF-ET:: The in vitro fertilization and embryo transfer
LBR:: Live birth rate
LH:: Luteinizing hormone
MCLA:: Meta-Clustering Algorithm
MVC:: Multi-view clustering
NMF:: Non-negative matrix factorization
NMFE:: NMF-based ensemble algorithm
OHSS:: Ovarian hyperstimulation syndrome
P:: Progesterone
PUR:: Purity
rFSH:: Recombinant human follicle-stimulating hormone
RIF:: Recurrent implantation failure
T:: Testosterone
SGD:: Stochastic gradient descent

References

Cox, C. M. et al. Infertility prevalence and the methods of estimation from 1990 to 2021: A systematic review and meta-analysis. Hum. Reprod. Open. 2022 (4), hoac051 (2022).
Article CAS PubMed PubMed Central Google Scholar
Hornstein, M. D. State of the ART: Assisted Reproductive technologies in the United States. Reprod. Sci. 23 (12), 1630–1633 (2016).
Article PubMed MATH Google Scholar
Bellver, J. & Donnez, J. Introduction: Infertility etiology and offspring health. Fertil. Steril. 111 (6), 1033–1035 (2019).
Article PubMed MATH Google Scholar
Min, J. K. et al. What is the most relevant standard of success in assisted reproduction? The singleton, term gestation, live birth rate per cycle initiated: The BESST endpoint for assisted reproduction. Hum. Reprod. 19 (1), 3–7 (2004).
Article PubMed MATH Google Scholar
Handal-Orefice, R. C. et al. Impact of race versus ethnicity on infertility diagnosis between black American, haitian, African, and white American women seeking infertility care: a retrospective review, 3p. 22–28 (F S Rep, 2022). 2 Suppl.
Kaltsas, A. et al. Impact of advanced paternal age on fertility and risks of genetic disorders in offspring. Genes (Basel), 14(2). (2023).
Wang, X. J. et al. Study on the optimal time limit of frozen embryo transfer and the effect of a long-term frozen embryo on pregnancy outcome. Med. (Baltim). 103 (13), e37542 (2024).
Article CAS Google Scholar
Deo, R. C. Machine learning in Medicine. Circulation 132 (20), 1920–1930 (2015).
Article PubMed PubMed Central MATH Google Scholar
Esteva, A. et al. A guide to deep learning in healthcare. Nat. Med. 25 (1), 24–29 (2019).
Article CAS PubMed MATH Google Scholar
Govindan, V. et al. Optimization-based design and control of Dynamic systems. Babylon. J. Math. 2023, 30–35 (2023).
Article MATH Google Scholar
Rusul Ali, R. Discussing Artificial Intelligence’s role in combatting the COVID-19 pandemic: A review. Mesopotamian J. Artif. Intell. Healthc. 2023, 7–14 (2023).
Google Scholar
Decuyper, M. et al. Artificial intelligence with deep learning in nuclear medicine and radiology. EJNMMI Phys. 8 (1), 81 (2021).
Article PubMed PubMed Central MATH Google Scholar
Tran, K. A. et al. Deep learning in cancer diagnosis, prognosis and treatment selection. Genome Med. 13 (1), 152 (2021).
Article PubMed PubMed Central MATH Google Scholar
Omotunde, H. & Mouhamed, M. R. The Modern Impact of Artificial Intelligence Systems in Healthcare: A Concise Analysis.Mesopotamian J. Artif. Intell. Healthcare, 2023: 66–70. (2023).
Liu, Y. & Wu, M. Deep learning in precision medicine and focus on glioma. Bioeng. Transl Med. 8 (5), e10553 (2023).
Article CAS PubMed PubMed Central Google Scholar
Dimitriadis, I. et al. Artificial intelligence in the embryology laboratory: A review. Reprod. Biomed. Online. 44 (3), 435–448 (2022).
Article PubMed MATH Google Scholar
Abbasi, A., Miahi, E. & Mirroshandel, S. A. Effect of deep transfer and multi-task learning on sperm abnormality detection. Comput. Biol. Med. 128, 104121 (2021).
Article PubMed MATH Google Scholar
Xu, H. et al. An ovarian reserve assessment model based on anti-mullerian hormone levels, follicle-stimulating hormone levels, and age: Retrospective cohort study. J. Med. Internet Res. 22 (9), e19096 (2020).
Article PubMed PubMed Central Google Scholar
Li, J. et al. Multiomics studies investigating recurrent pregnancy loss: An effective Tool for mechanism exploration. Front. Immunol. 13 (13), 826198 (2022).
Article CAS PubMed PubMed Central Google Scholar
Karne, R. & Sreeja, T. K. Clustering algorithms and comparisons in vehicular Ad Hoc networks.Mesopotamian J. Comput. Sci.,2023: 115–123. (2023).
Zhu, B. et al. A critical scenario search method for intelligent vehicle testing based on the social cognitive optimization algorithm. IEEE Trans. Intell. Transp. Syst. 24 (8), 7974–7986 (2023).
Article MATH Google Scholar
Rajora, K. & abdulhussein, N. Reviews research on applying machine learning techniques to reduce false positives for network intrusion detection systems.Babylonian J. Mach. Learn. 2023, 26–30. (2023).
Crase, S. & Thennadil, S. N. An analysis framework for clustering algorithm selection with applications to spectroscopy. Plos One. 17 (3), e0266369 (2022).
Article CAS PubMed PubMed Central Google Scholar
Hlapisi, N. M. Enhancing Hybrid Spectrum Access in CR-IoT Networks: Reducing Sensing Time in Low SNR Environments.Mesopotamian J. Comput. Sci.2023 47–52. (2023).
Lee, D. & Seung, H. S. Algorithms for non-negative matrix factorization. Adv. Neural. Inf. Process. Syst. 13 (2000).
Gillis, N. & Glineur, F. Accelerated multiplicative updates and hierarchical ALS algorithms for nonnegative matrix factorization. Neural Comput. 24 (4), 1085–1105 (2012).
Article MathSciNet PubMed MATH Google Scholar
Wang, D. et al. A generalized Deep learning clustering algorithm based on non-negative matrix factorization. ACM Trans. Knowl. Discovery Data. 17 (7), 1–20 (2023).
ADS MATH Google Scholar
Wang, D. et al. A multi-view clustering algorithm based on deep semi-NMF. Inform. Fusion, 101884. (2023).
Wang, D. et al. A generalized deep learning algorithm based on nmf for multi-view clustering. IEEE Trans. Big Data. 9 (1), 328–340 (2022).
Article MATH Google Scholar
Strehl, A. & Ghosh, J. Cluster ensembles—a knowledge reuse framework for combining multiple partitions. J. Mach. Learn. Res. 3 (Dec), 583–617 (2002).
MathSciNet MATH Google Scholar
Zhou, J., Zheng, H. & Pan, L. Ensemble clustering based on dense representation. Neurocomputing 357, 66–76 (2019).
Article MATH Google Scholar
Zhao, X. et al. Deciphering the endometrial immune landscape of RIF during the window of implantation from cellular senescence by integrated bioinformatics analysis and machine learning. Front. Immunol. 13, 952708 (2022).
Article CAS PubMed PubMed Central Google Scholar
Sun, Y. E., Zhou, H. J. & Li, J. J. Bipartite tight spectral clustering (BiTSC) algorithm for identifying conserved gene co-clusters in two species. Bioinformatics 37 (9), 1225–1233 (2021).
Article CAS PubMed PubMed Central MATH Google Scholar
Salem, S. A. & Nandi, A. K. Development of assessment criteria for clustering algorithms. Pattern Anal. Appl. 12, 79–98 (2009).
Article MathSciNet MATH Google Scholar
Ovarian Stimulation, T. et al. ESHRE guideline: ovarian stimulation for IVF/ICSI(dagger). Hum. Reprod. Open. 2020 (2), hoaa009 (2020).
Article PubMed Google Scholar
Out, H. J. & Thomas, L. E. Optimizing the gonadotrophin dose regimen. Int. Surg. 91 (5 Suppl), S15–24 (2006).
PubMed MATH Google Scholar
Nardo, L. G. et al. Conventional ovarian stimulation no longer exists: Welcome to the age of individualized ovarian stimulation. Reprod. Biomed. Online. 23 (2), 141–148 (2011).
Article CAS PubMed MATH Google Scholar
La Marca, A. & Sunkara, S. K. Individualization of controlled ovarian stimulation in IVF using ovarian reserve markers: From theory to practice. Hum. Reprod. Update. 20 (1), 124–140 (2014).
Article PubMed MATH Google Scholar
Pacchiarotti, A. et al. Ovarian stimulation protocol in IVF: An Up-to-date review of the literature. Curr. Pharm. Biotechnol. 17 (4), 303–315 (2016).
Article CAS PubMed MATH Google Scholar
La Marca, A. et al. The number and rate of euploid blastocysts in women undergoing IVF/ICSI cycles are strongly dependent on ovarian reserve and female age. Hum. Reprod. 37 (10), 2392–2401 (2022).
Article PubMed MATH Google Scholar
Ribeiro, L. M. et al. Overweight, obesity and assisted reproduction: A systematic review and meta-analysis. Eur. J. Obstet. Gynecol. Reprod. Biol. 271, 117–127 (2022).
Article PubMed MATH Google Scholar
Akbari Sene, A. et al. Anti-mullerian hormone predictive levels to determine the likelihood of ovarian hyper-response in infertile women with polycystic ovarian morphology. Int. J. Fertil. Steril. 15 (2), 115–122 (2021).
PubMed PubMed Central Google Scholar
Kotlyar, A. M. & Seifer, D. B. Ethnicity/Race and age-specific variations of serum AMH in women-a review. Front. Endocrinol. (Lausanne). 11, 593216 (2020).
Article PubMed MATH Google Scholar
Nandi, A. et al. Acupuncture in IVF: A review of current literature. J. Obstet. Gynaecol. 34 (7), 555–561 (2014).
Article CAS PubMed MATH Google Scholar
Wang, X. et al. An overview of systematic reviews of acupuncture for infertile women undergoing in vitro fertilization and embryo transfer. Front. Public. Health. 9, 651811 (2021).
Article ADS PubMed PubMed Central Google Scholar
Domar, A. D. et al. Lifestyle behaviors in women undergoing in vitro fertilization: A prospective study. Fertil. Steril. 97 (3), 697–701e1 (2012).
Article PubMed MATH Google Scholar
Zhang, H. R. et al. Pregnancy benefit of acupuncture on in vitro fertilization: A systematic review and Meta-analysis. Chin. J. Integr. Med. 29 (11), 1021–1032 (2023).
Article PubMed MATH Google Scholar
Masoud, A. et al. Systematic review and meta-analysis of the efficacy of acupuncture as an adjunct to IVF cycles in China and the world. Turk. J. Obstet. Gynecol. 19 (4), 315–326 (2022).
Article PubMed PubMed Central MATH Google Scholar
Smith, C. A. et al. The effects of acupuncture on the secondary outcomes of anxiety and quality of life for women undergoing IVF: A randomized controlled trial. Acta Obstet. Gynecol. Scand. 98 (4), 460–469 (2019).
Article PubMed MATH Google Scholar
Xia, Q. et al. The role of acupuncture in women with advanced reproductive age undergoing in vitro fertilization-embryo transfer: a randomized controlled trial and follicular fluid metabolomics study. Med. (Baltim). 102 (36), e34768 (2023).
Article CAS Google Scholar
Dong, H. X. et al. Effect of acupuncture on endometrial blood Flow in Women Undergoing in vitro fertilization embryo transfer: a single Blind, Randomized Controlled Trial. Chin. J. Integr. Med., (2023).
Xu, M., Zhu, M. & Zheng, C. Effects of acupuncture on pregnancy outcomes in women undergoing in vitro fertilization: An updated systematic review and meta-analysis (Arch Gynecol Obstet, 2023).
Quan, K. et al. Acupuncture as treatment for female infertility: A systematic review and meta-analysis of randomized controlled trials. Evid Based Complement Alternat Med2022, 3595033. (2022).
Hart, L. A. & Sibai, B. M. Seizures in pregnancy: epilepsy, eclampsia, and stroke. Semin Perinatol. 37 (4), 207–224 (2013).
Article PubMed MATH Google Scholar
Murgano, D. et al. Outcome of twin-to-twin transfusion syndrome in monochorionic monoamniotic twin pregnancy: Systematic review and meta-analysis. Ultrasound Obstet. Gynecol. 55 (3), 310–317 (2020).
Article CAS PubMed MATH Google Scholar
Ye, W. et al. Gestational diabetes mellitus and adverse pregnancy outcomes: Systematic review and meta-analysis. BMJ 377, e067946 (2022).
Article PubMed PubMed Central Google Scholar
Chen, Y. et al. Maternal anaemia during early pregnancy and the risk of neonatal outcomes: a prospective cohort study in Central China. BMJ Paediatr. Open., 8 (1). (2024).
Anagnostopoulou, C. et al. Oocyte quality and embryo selection strategies: A review for the embryologists, by the embryologists. Panminerva Med. 64 (2), 171–184 (2022).
Article PubMed MATH Google Scholar
Liao, Z. et al. The Effect of Endometrial thickness on pregnancy, maternal, and perinatal outcomes of women in fresh cycles after IVF/ICSI: a systematic review and Meta-analysis. Front. Endocrinol. (Lausanne). 12, 814648 (2021).
Article PubMed Google Scholar
Lee, D. D. & Seung, H. S. Learning the parts of objects by non-negative matrix factorization. Nature 401 (6755), 788–791 (1999).
Article ADS CAS PubMed MATH Google Scholar
Amari, S. Backpropagation and stochastic gradient descent method. Neurocomputing 5 (4–5), 185–196 (1993).
Article MATH Google Scholar
Deng, P. et al. Multi-view clustering guided by unconstrained non-negative matrix factorization. Knowl. Based Syst. 266, 110425 (2023).
Article Google Scholar
Wang, D. et al. Dual graph-regularized sparse concept factorization for clustering. Inf. Sci. 607, 1074–1088 (2022).
Article MATH Google Scholar

Download references

Acknowledgements

This study was supported by Sichuan Science and Technology Program (2022YFS0036), China Postdoctoral Science Foundation (2023MD744127), Postdoctoral Fellowship Program (Grade B) of China Postdoctoral Science Foundation (Nos. GZB20230091), Innovation Team and Talents Cultivation Program of National Administration of Traditional Chinese Medicine (ZYYCXTD-D-202003), and the National Natural Science Foundation of China (82174517).

Author information

Zheng Yu and Xiaoyan Zheng contributed equally to this work.
Dexian Wang and Jie Yang have jointly supervised this work.

Authors and Affiliations

School of Intelligent Medicine, Chengdu University of Traditional Chinese Medicine, Chengdu, 610075, China
Zheng Yu, Pengfei Zhang & Dexian Wang
Acupuncture and Tuina School, Chengdu University of Traditional Chinese Medicine, Chengdu, 610075, China
Xiaoyan Zheng, Jiaqi Sun, Fanrong Liang & Jie Yang
Traditional Chinese Medicine Department, Sichuan Jinxin Xi’nan Women’s and Children’s Hospital, Chengdu, 610066, China
Xiaoyan Zheng, Ying Zhong, Xingyu Lv & Jie Yang
School of Traditional Chinese Medicine, Capital Medical University, Beijing, 100069, China
Hongwen Yuan

Authors

Zheng Yu
View author publications
Search author on:PubMed Google Scholar
Xiaoyan Zheng
View author publications
Search author on:PubMed Google Scholar
Jiaqi Sun
View author publications
Search author on:PubMed Google Scholar
Pengfei Zhang
View author publications
Search author on:PubMed Google Scholar
Ying Zhong
View author publications
Search author on:PubMed Google Scholar
Xingyu Lv
View author publications
Search author on:PubMed Google Scholar
Hongwen Yuan
View author publications
Search author on:PubMed Google Scholar
Fanrong Liang
View author publications
Search author on:PubMed Google Scholar
Dexian Wang
View author publications
Search author on:PubMed Google Scholar
Jie Yang
View author publications
Search author on:PubMed Google Scholar

Contributions

Z.Y. and D.X.W.: Conceptualization; D.X.W. and P.F.Z.: Methodology; X.Y.Z., Y.Z. and X.Y.L.: Data Acquisition; Z.Y., D.X.W. and J.Q.S.: Writing – original draft; X.Y.Z., H.W.Y., F.R.L. and J.Y.: Writing – review & editing.

Corresponding author

Correspondence to Dexian Wang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Supplementary Material 2

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Yu, Z., Zheng, X., Sun, J. et al. Critical factors influencing live birth rates in fresh embryo transfer for IVF: insights from cluster ensemble algorithms. Sci Rep 15, 3734 (2025). https://doi.org/10.1038/s41598-025-88210-1

Download citation

Received: 20 May 2024
Accepted: 24 January 2025
Published: 30 January 2025
DOI: https://doi.org/10.1038/s41598-025-88210-1