Overnight maternal heart rate variability for early prediction of gestational diabetes mellitus: a retrospective cohort study

Wu, Yanqi; Asvadi, Sima; van der Ven, Myrthe; van der Hout-van der Jagt, M. Beatrijs; Peri, Elisabetta; Fonseca, Pedro; Overeem, Sebastiaan; Oei, S. Guid; Mischi, Massimo; Long, Xi

doi:10.1038/s44294-025-00081-z

Download PDF

Article
Open access
Published: 18 June 2025

Overnight maternal heart rate variability for early prediction of gestational diabetes mellitus: a retrospective cohort study

Yanqi Wu^1,2,
Sima Asvadi^1,3,
Myrthe van der Ven^2,4,5,
M. Beatrijs van der Hout-van der Jagt^1,2,5,
Elisabetta Peri^1,2,
Pedro Fonseca^1,2,6,
Sebastiaan Overeem^1,7,
S. Guid Oei^1,2,5,
Massimo Mischi^1,2 &
…
Xi Long^1,2

npj Women's Health volume 3, Article number: 37 (2025) Cite this article

4874 Accesses
12 Altmetric
Metrics details

Subjects

Abstract

A reliable early risk prediction of gestational diabetes mellitus (GDM) allows for early lifestyle modifications during pregnancy to reduce the risk of developing GDM. In this retrospective study, we developed a logistic regression machine learning model with heart rate variability (HRV) characteristics during overnight sleep in early pregnancy as predictors for GDM prediction. The study used the nuMoM2b dataset from 2748 nulliparous women in the USA who underwent a standardized home sleep test between 6 and 15 weeks’ gestation with subsequent GDM assessment at 24–28 weeks. A total of 52 overnight HRV features were analyzed alongside the baseline risk factors recommended by the National Institutes of Health (NIH). The model combining baseline and HRV features achieved an area under the receiver operating characteristic curve (AUC) of 0.73, outperforming the model using only baseline features (AUC = 0.69) and that using only HRV features (AUC = 0.65). These machine learning models all performed better than the early GDM risk assessment based on the NIH guidelines (AUC = 0.63). The findings suggest that overnight maternal HRV characteristics can be used as early predictors of GDM.

Prediction of gestational diabetes mellitus in Asian women using machine learning algorithms

Article Open access 16 August 2023

An explainable machine learning-based clinical decision support system for prediction of gestational diabetes mellitus

Article Open access 21 January 2022

Prediction of gestational diabetes mellitus using machine learning from birth cohort data of the Japan Environment and Children's Study

Article Open access 13 October 2023

Introduction

Gestational diabetes mellitus (GDM) is defined as glucose intolerance with onset or first recognition during pregnancy¹, and is one of the most common adverse pregnancy outcomes (APOs). The prevalence of GDM varies across countries and regions. A recent meta-analysis reported an average prevalence of GDM of 14.7% based on the criteria of the International Association of Diabetes and Pregnancy Study Groups (IADPSG)². In South or East Asia, it can be higher than 20%^3,4. Differences in GDM diagnostic standards can explain differences in prevalence⁵. For decades, less attention was spent on GDM than on other APOs, since it was considered, in comparison with other outcomes that could directly cause maternal or fetal death, to have a relatively mild impact and was often considered fully reversible after delivery. However, it is increasingly understood that in addition to causing maternal hyperglycemia, GDM can induce breathing difficulties^6,7, cesarean section⁸, urinary tract infection⁹, hydramnios, pregnancy induced hypertension¹⁰, and even preeclampsia¹¹ on the mother. Uncontrolled GDM has a possibility of up to 87% in developing type II diabetes in 5–10 years after delivery¹². For the infant, GDM exposes the newborn to an elevated risk of macrosomia¹³, postpartum hypoglycemia¹⁴, and future obesity¹⁵. GDM also imposes an additional economic burden on family and society. Research shows that pregnant women with diagnosed GDM spent up to 34% more costs in maternal care and 49% more in neonatal care than non-GDM pregnancies^4,16.

Currently, clinical GDM risk screening is usually performed by an oral glucose tolerance test (OGTT) between 24 and 28 weeks of gestation. In the second trimester, as pregnancy progresses, surges in local and placental hormones, including estrogen, progesterone, leptin, cortisol, placental lactogen, and placental growth hormone, combine to promote a state of insulin resistance¹⁷. Women who cannot produce enough insulin to adapt to this resistance will have increased blood glucose levels, so OGTT can screen out GDM at this stage¹⁸. However, emerging evidence indicates that abnormal fetal growth can start prior to the commonly used GDM diagnosis period^19,20. Moreover, for pregnant women with advanced maternal age or BMI, GDM diagnosed at 24–28 weeks’ gestation already affected fetal abdominal circumference²¹. Various studies have demonstrated that early lifestyle modifications during pregnancy can help reduce the risk of developing GDM²². By making lifestyle adjustments (such as improving diet and physical activity) as early as possible in pregnancy - typically before 15 weeks —and maintaining them throughout the pregnancy, this effect is enhanced²³. Early screening also offers advantages over the 24–28 weeks OGTT in reducing the incidence of adverse neonatal outcomes and severe perineal injury^24,25, with the greatest benefits observed when interventions are initiated before 14 weeks. Furthermore, early diagnosis enables more precise ultrasound monitoring of fetal growth trajectories, allowing for the early detection of abnormal growth patterns (e.g., slow early growth followed by catch-up or accelerated early growth followed by slowing), both of which may increase the long-term risk of metabolic complications^26,27. Hence, to facilitate effective treatment and lifestyle adjustments, it is pivotal to accurately predict the risk of developing GDM early in pregnancy.

Numerous studies on early risk prediction of GDM indicate that the most used risk factors are maternal demographics, family history and obstetric history factors (e.g., age, BMI, race and ethnicity, parity, history of GDM, family history of diabetes). Although they are attractive due to their good accessibility, their predictive power is unsatisfactory²⁸. Recent research on biomarkers (e.g., fasting blood glucose and hemoglobin A1c level²⁹) has been shown to improve GDM prediction accuracy. However, these markers are limited by their invasive and obtrusive acquisition methods and are not widely used in clinical practice. The National Institute of Health recommended a guideline (National Institutes of Health (NIH) guidelines) for early GDM risk assessment based on eight demographics and history questions. According to the NIH guidelines, the mothers would be graded for the risk of GDM from zero to eight, where a higher grade corresponds to a higher GDM risk.

This study assessed heart rate variability (HRV) as a new predictor for the early prediction of GDM. HRV is the physiological phenomenon of variation in the time interval between heartbeats (known as inter-beat intervals or IBIs). HRV measures can be obtained using, for example, wearable or unobtrusive devices, demonstrating good accessibility, in particular for pregnant women who are already using a wearable device. While technological advancements may enhance accessibility, current adoption requires consideration of device affordability and user literacy. HRV, characterized and quantified using different metrics, can change during pregnancy. In early pregnancy, in order to provide sufficient nutrients for the development of the fetus, maternal cardiac blood volume increases by 30-40% compared with the non-pregnant state, and the average heart rate is also 10-15% faster³⁰. Stein et al.³¹ reported that mean heart rate (HR), standard deviation of normal-to-normal intervals (SDNN) and power of low frequent band (LF) showed significant difference in early pregnancy, compared with non-pregnant state. Bester et al.³² also reported that the worsening of systolic pulse in pregnant women was significantly different from that in non-pregnant women.

The contribution of autonomic nervous system (ANS) imbalance to the metabolic syndrome is well documented^33,34. The overactivation of sympathetic nervous system will stimulate and inhibit pancreatic β-cell insulin secretion³⁵ and may promote insulin resistance by inducing hemodynamic and cellular effects³⁶. A systematic review³⁷ reported that HRV can be used as an indicator of T2DM. Patients with T2DM had significantly higher HR, lower SDNN, RMSSD, LF and HF compared with healthy people. Both GDM and T2DM are associated with impaired insulin secretion and insulin resistance. For pregnancies with potential GDM, due to the fluctuations in hormone levels, automatic imbalances may be more easily induced or amplified.

Yet up to now, the effect of HRV characteristics in GDM prediction has not been explored, and no study has ever established any model using HRV in early pregnancy to predict GDM. This study is the first to develop models based on machine learning (ML) algorithms to verify whether HRV can be used as an early risk predictor for GDM and whether the GDM prediction can be improved when combined with risk factors suggested by existent guidelines for early GDM risk assessment such as the NIH guidelines.

Results

Cohort and HRV characteristics

Among all 9289 participants in the nuMoM2b database, 431 (4.6%) had no OGTT, 134 (1.4%) had pre-existing diabetes mellitus, and 6060 (65.2%) did not participate in the home sleep test or did not have the minimum number of 5-min valid segments, resulting in a total of 2748 (29.6%) pregnant women included in this study. Overnight sleep ECG signals were collected during the home sleep test between 11 and 14 weeks of gestation. In nuMoM2b, individuals’ race and ethnicity were originally classified into 8 categories: Non-Hispanic White, Non-Hispanic Black, Hispanic, American Indian, Asian, Native Hawaiian, Other, Multiracial. The “Other” category refers to individuals with clear race and ethnicity information who do not fall into the first six categories and are not multiracial. As shown in Table 1, compared with participants without GDM, those diagnosed with GDM exhibited a significantly higher age and BMI (p < 0.001), higher (diastolic and systolic) blood pressure measures in early pregnancy (p < 0.001), and a higher prevalence in family history of diabetes (p < 0.05) and Asian (p < 0.001) and Other (p < 0.001) in race and ethnicity. No statistical difference was observed in PCOS (p = 0.35), history of miscarriage (p = 0.60), or in the racial/ethnic categories of Non-Hispanic White (p = 0.21), Non-Hispanic Black (p = 0.18), Hispanic (p = 0.87), American Indian (p = 0.75), Native Hawaiian (p = 0.20), and Multiracial (p = 0.71). The mean (SD) number of valid 5-min segments for individuals with GDM was 95.8 (22.7), the proportion of valid segments compared to the total recording time (TRT) was 70.9% (17.7). For non-GDM individuals, the mean (SD) number of valid segments was 95.5 (22.0), the proportion of valid segments compared to TRT was 70.2% (17.2). There were no significant differences between the GDM and non-GDM groups in the number of valid segments (p = 0.94) or the proportion of valid segments to TRT (p = 0.75). In addition, 46 out of 52 overnight statistics features from HRV during early pregnancy were statistically significantly different between GDM and non-GDM pregnancies (Supplementary Table 4).

Table 1 Baseline characteristics of study cohort for GDM prediction

Full size table

Evaluation of the NIH guidelines for GDM prediction

We first applied the NIH guidelines on all 2748 samples and obtained GDM risk scores from eight binary variables (one not available as explained in the “Methods”). Figure 1a, b depict the number of individuals and GDM prevalence for each risk assessment level from the NIH guidelines. Since the risk factor “history of insulin resistance or blood glucose problems” is not available for all pregnant women in this study, and no pregnant woman meets all of the remaining seven risk factors simultaneously, the risk score range provided by the NIH guidelines is zero to six. The highest risk level (six) coincided with the highest GDM prevalence, with 10%, about 7 times higher than the risk level zero, for which only ten participants had GDM (less than 0.4%). To calculate the ROC curve when using the NIH guidelines for GDM prediction, we varied the risk score from zero to six as the threshold for decision making of GDM or non-GDM, obtaining an AUC of 0.63. As stated, NIH suggests that pregnancies with a score more than one in the assessment are high-risk pregnancies for GDM³⁸, corresponding to the red bars in Fig. 1a. The NIH suggested-high risk group accounted for about 2/3 of the pregnancies in the studied cohort, but the actual GDM prevalence was only slightly higher than the average (5.9% vs. 4.4%), leading to a large number of false positives.

Evaluation of machine learning models for GDM prediction

Figure 2 shows the ROC curves and the resulting AUC after ten-fold cross-validation for GDM prediction using the combined model, the baseline model, the HRV model and the NIH guidelines. The combined model achieved the highest AUC of 0.73 [95% CI, 0.72–0.74], superior to the baseline model (AUC = 0.69, [95% CI, 0.68–0.70]), the HRV model (AUC = 0.65 [95% CI, 0.64–0.66]), and the NIH guidelines (AUC = 0.63). The CI for NIH guidelines is not available since the risk scoring criteria of the NIH guidelines are fixed (seen from Supplementary Table 3) and do not need to be obtained through training. Although the NIH guidelines and the baseline model shared the same risk factors, the AUC of the baseline model is 6.1% higher than that of the NIH guidelines. The PR curves of the combined model, the baseline model, the HRV model and the NIH guidelines were shown in Fig. 3. The area under the PR curve (AUPRC) of the combined model is 0.14 [95% CI 0.13–0.15], superior to the baseline model (AUPRC = 0.11 [95% CI 0.10–0.12]), HRV model (AUPRC = 0.07 [95% CI 0.07–0.08]) and NIH guidelines (AUPRC = 0.08). The weighted F1 scores for the NIH guidelines, HRV model, baseline model, and combined model were 0.22, 0.93 [95% CI 0.93–0.94], 0.94 [95% CI 0.93–0.94], and 0.94 [95% CI 0.94–0.94], respectively. Within the GDM subgroup, the corresponding F1 scores were 0.17, 0.96 [95% CI 0.95–0.96], 0.96 [95% CI 0.96–0.96], and 0.96 [95% CI 0.96–0.96]. In the non-GDM subgroup, F1 scores were lower across all models, at 0.09, 0.13 [95% CI 0.12–0.14], 0.19 [95% CI 0.18–0.20], and 0.22 [95% CI 0.20–0.23], respectively. The calibration curves for the combined model, baseline model and HRV model was illustrated in Supplementary Fig. 1. The combined model can truly reflect the probability of GDM to a degree of 0–15%, superior to the baseline model and the HRV model. Since the nuMoM2b dataset is highly unbalanced and the number of positive samples is limited, the calibration curves of all three models’ experiences fluctuates.

**Fig. 2: Receiver Operating Characteristic (ROC) curves of NIH guidelines (orange dash line), the baseline model (green line), the HRV model (red line) and the combined model (blue line).**

**Fig. 3: Precision-recall (PR) curve of NIH guidelines (orange dashed line), the baseline model (green line), the HRV model (red line) and the combined model (blue line).**

Relative feature importance was ranked based on the average absolute coefficient of the normalized features across cross-validation folds, and top 20 relatively prominent features in the combined model were plotted in Fig. 4. A positive coefficient means that the feature has a higher correlation with GDM, while a negative coefficient means that the feature has a higher correlation with non-GDM. It’s important to note that due to the elastic net regularization, these coefficients reflect relative importance rankings rather than absolute magnitudes of association, as the penalization process shrinks toward zero to improve generalizability. This approach prioritizes stable predictors while maintaining clinical interpretability of top-ranked features. Since the combined model was trained and tested using ten-fold cross validation, the coefficients of each feature are likely different. Thus, the SD of each feature was also provided in Fig. 4 as error bars.

Including HRV features derived from noisy segments resulted in a significant decline in model performance: the AUC of the combined model decreased from 0.73 [95% CI 0.72–0.74] to 0.71 [95% CI 0.70–0.72] (p < 001), and that of the HRV model decreased from 0.65 [0.64–0.66] to 0.64 [0.63–0.65] (p < 001). These findings highlight the importance of excluding invalid segments when computing HRV features. The effect of varying the minimum valid segment threshold on model performance was evaluated using repeated experiments to derive AUC, AUPRC, and corresponding 95% CI. Applying an appropriate threshold improved performance for both the HRV and Combined models, compared with using no threshold (Supplementary Fig. 4 and Supplementary Fig. 5). Although the Baseline model does not incorporate HRV features, its performance was also affected due to changes in sample size. Around the threshold used in this study (minimum of 24 valid segments), all three models demonstrated stable performance with minimal variation.

**Fig. 4: mean and standard deviation of the coefficient for 20 relative important features in the combined (LR) model.**

Subgroup analysis

Figures 5 and 6 compare the performance of GDM prediction in different age and BMI subgroups, respectively. The prevalence of GDM for age <25, 25–30 and >30 was 2.3%, 4.1% and 7.6%, respectively. For subgroups of underweight/healthy, overweight and obese, the GDM prevalence was 3.2%, 5.0% and 8.2%, respectively. The combined model outperformed the baseline and the HRV model for pregnant women in all age and BMI subgroups (p < 0.001, Wilcoxon rank-sum test). Compared with the baseline model, the accuracy improvement of the combined model is higher (10–15%) in the lower and intermediate age and BMI subgroups, and lower in the higher age and/or obese subgroups (2–4%). The women in these two subgroups had a higher prevalence of GDM, and this is consistent with literature^39,40. The HRV model underperformed for most subgroups, except for the subgroup of young pregnant women <25 years, where it outperformed the baseline model.

**Fig. 5: The AUC of the baseline, HRV and combined models in different age subgroups (<25, 25–30, >30 years).**

**Fig. 6: The AUC of the baseline, HRV and combined models in different pre-pregnancy BMI subgroups (<25 (underweight or healthy), 25–30 (overweight), >30 kg/m² (obese)).**

Discussion

Early identification and intervention targeting GDM can help improve pregnancy outcomes^22,23. To investigate predictors of this condition, we first evaluated the NIH guidelines as the clinical practice baseline to predict the risk of GDM in early pregnancy. A baseline ML model based on an LR algorithm using the same risk factors from the NIH guidelines showed significantly higher performance than the NIH guidelines (AUC of 0.69 versus 0.63). This improvement likely results from LR’s ability to assign different weights to individual risk factors based on their data-driven relative importance. As illustrated in Supplementary Fig. 6, the coefficients in the baseline model vary, and its performance exceeds that of an equal-weight model (AUC: 0.69 vs 0.66, Supplementary Fig. 7). These findings highlight the advantage of automatic feature weighting in enhancing model performance. However, potential limitations should be acknowledged, as data-driven models that optimize weights may overfit, particularly when trained on small datasets. Such models may exhibit limited generalizability compared with NIH guidelines.

To investigate whether overnight HRV characteristics in early pregnancy can be early predictors for the risk of GDM, we proposed a model using only HRV characteristics, and another model where the HRV characteristics and the risk factors from the baseline model were combined. The combined model achieved the best performance with an AUC of 0.73, 15.9% higher than that using the NIH guidelines, 12.3% higher than the HRV model and 5.8% higher than the baseline model. Considering the study cohort is highly unbalanced, AUPRC was also used to evaluate model performance. The combined model had the highest AUPRC of 0.14, i.e., 75.0% higher than that using the NIH guidelines, 100% higher than HRV model and 27.3% higher than the baseline model.

As maternal age and/or BMI increase, the frequency of sleep disordered breathing events (such as sleep apnea) and nighttime awakenings may increase^41,42. Their presence would interfere with the measurement of HRV⁴³ and weaken the effectiveness of HRV characteristics in GDM prediction. From Fig. 4, age and BMI had the highest coefficients in the model, indicated that the individuals with advanced age or BMI are assigned higher GDM predicted value in the model. Often in current clinical practice, pregnant women with advanced maternal age and/or BMI at the beginning of pregnancy will be noted that they have a higher risk of developing GDM. This implies that intervention can be started at the beginning of the pregnancy to reduce the risk of GDM and its adverse consequences. For the younger age or healthier BMI individuals, their GDM risks are often overlooked. The comparison results of model performance in different age and BMI groups in Figs. 5 and 6 showed that the combination of HRV characteristics and clinically suggested risk factors can help the model more accurately identify the individual GDM risk in moderate age and BMI groups, and the AUC can be improved by up to 15% in some subgroups.

It is well known that individuals identifying as Asian American/Pacific Islander (AAPI) are at a higher risk of developing GDM compared to the other race and ethnicity groups⁴⁴. In our study cohort, the representation of American Indian and Native Hawaiian participants was substantially lower than that of Asian participants, and their corresponding risk factors were not significantly associated with GDM. To address this, we combined American Indian, Asian, and Native Hawaiian categories into a single AAPI feature and conducted a sensitivity analysis to evaluate the impact of this aggregation on model performance in Supplementary Tables 5 and 6. The results indicated that feature aggregation did not significantly affect the AUPRC across all three ML models. Although a statistically significant change in the AUC was observed, the average change was less than 0.01.

It is challenging to accurately estimate the risk of GDM for nulliparous pregnant women who did not have a history of GDM or avaliable early plasma glucose level. Compared with biomarkers that require blood or body fluid sampling, we investigated the possibility of using the unobtrusive and easily accessible HRV characteristics as additional predictors to predict the risk of GDM. Prior to this study, there was no study considering early pregnancy HRV characteristics used in a prediction model for GDM, and the analysis between HRV and GDM has focused mainly on the end of the second trimester or later^{45,46,47,48,49}. Although some studies^45,46,47,48 reported no difference between GDM and non-GDM pregnant individuals in terms of HRV metrics, Pöyhönen-Alho et al.⁴⁹ reported that the HRV frequency domain components were different between control and GDM groups. In our study, most of the HRV characteristics listed in Supplementary Table 1 showed statistically significant difference (p < 0.05 or p < 0.001) between GDM and non-GDM pregnancies in the nuMoM2b dataset. Furthermore, in the feature importance analysis characterized by logistic regression coefficients, we identified average HR (overnight mean and minimum) as the strongest predictor positively associated with GDM risk. Elevated average heart rate indicates sympathetic overactivation coupled with vagal withdrawal, a pattern consistent with autonomic imbalance observed in metabolic dysregulation⁵⁰. Conversely, RMSSD, PNN50, SD1, SD1/SD2 ratio, and Poincaré plot area (S) exhibited negative associations with predicted GDM probability. The observed reductions in nocturnal RMSSD means and their variability (SD), alongside diminished SD1 variability, lower SD1/SD2 ratios, and decreased S values, reflect impaired parasympathetic tone and attenuated autonomic regulatory capacity⁵¹. Specifically, the decline in SD1 and RMSSD indicates blunted vagally mediated beat-to-beat adjustments, while reduced SD1/SD2 and S values suggest compromised integration of short- and long-term cardiovascular control mechanisms^52,53. This confirmed the hypothesis that ANS activity and its manifestation in cardiac rhythms would appear different already in earlier phases of pregnancy in women who develop GDM in later pregnancy. Qiu et al.⁵⁴ reported that the (daytime, 5-min) resting heart rate in the first trimester is associated with GDM, which also supports our findings. As pregnancy progresses, fetal growth and changes in maternal hormone levels have an impact on the cardiac system and ANS, thereby masking or attenuating differences related to GDM. The findings in our study suggest that the overnight maternal HRV characteristics measured during sleep are novel physiological factors that can aid in the early risk prediction of GDM. From an application perspective, HRV can theoretically be easily measured with a wearable or unobtrusive device at home such as a bracelet, watch, mattress, camera, and radar^55,56,57. These unobtrusive methods make it possible to accurately and continuously monitor HRV for multiple nights or even weeks in a home environment, possibly enabling further improvements in the GDM prediction performance.

Although there is currently no consensus on conducting GDM screening for all mothers in early pregnancy, this may be due to limitations of the OGTT. Despite being the gold standard for diagnosing GDM, the OGTT is invasive, time-consuming, and shows low sensitivity in early pregnancy⁵⁸. Nevertheless, some regional and international organizations have begun to recommend screening for GDM in early pregnancy. For example, a new consensus has been established in the Flanders region to measure fasting blood glucose in early pregnancy⁵⁹. It is recommended that women with a fasting blood glucose of 5.3–6.9 mmol/L before 20 weeks of pregnancy be diagnosed with early GDM. “International Federation of Gynecology and Obstetrics” (FIGO) recommend screening universally in early pregnancy for diabetes and GDM⁶⁰, Italian recommendations state that pregnancies with high risk factors should be tested at 16–18 weeks and an FPG ≥ 5.1 mmol/L is diagnosed as GDM⁶¹. And American Diabetes Association⁶² and National Institute for Health and Care Excellence⁶³ both recommend blood glucose screening in early pregnancy (<15 weeks) for high-risk pregnancies. Importantly, the lack of consensus on early GDM screening not only highlights a research gap but also reinforces the need to explore novel risk factors to enhance early prediction of GDM. This aligns with the aim of our study to advance early risk stratification and ultimately improve maternal and fetal outcomes.

While the LR model with elastic net regularization provided good interpretability and reasonable performance, we also implemented a voting ensemble that combines LR, support vector machine (SVM), and random forest (RF) models. This was done to explore potential performance gains and better capture complex interactions between HRV and other risk factors. The results in Supplementary Table 7 showed that the LR and SVM models performed comparably, while the RF model yielded the poorest results. As a result, the voting model achieved performance similar to the standalone LR model but introduced additional complexity.

This study had several limitations. The nuMoM2b researchers used three different standards of OGTT (non-fasting 50 g OGTT, fasting 75g-2h OGTT, and fasting 100g-3h-OGTT) to diagnose GDM⁴². This might lead to inconsistency in GDM samples used for training a GDM prediction model, and it is difficult for us to evaluate the impact of the different diagnostic criteria on the prediction of GDM in this dataset. Ideally, different diagnostic standards should be applied and evaluated for same pregnancies, which merits further investigation. Second, in the nuMoM2b dataset, the overnight ECG recording was collected only once per participant during early pregnancy, with variable recording quality and duration. Although removing these noisy segments could ameliorate the data quality and improve model’s performance, the influence of reducing the number of segments on the reliability of HRV estimates still existed. In addition to the ECG recording duration and quality, sleep stage is another important factor that may influence HRV characteristics³². Many invalid segments might likely correspond to wakefulness with movement artifacts, which should be further verified when sleep stages are available. However, sleep stage information was not available in this dataset due to the absence of EEG, the required signal modality in the gold standard polysomnography for sleep staging. Given HRV-based sleep staging has been shown to be feasible⁶⁴, the effect of sleep stages should be further studied in the future. Besides, HRV characteristics can also alter as pregnancy progresses⁶⁵. The time-dependent discrepancy of HRV due to different gestational age might confuse the ML model when including HRV for GDM prediction. Future work should investigate the use of prolonged monitoring of HRV for potentially improving the early prediction of GDM. Third, this study only used seven of the eight risk factors recommended by NIH guidelines. The history of GDM, macrosomia or stillbirth and history of insulin resistance or blood glucose problem (except diabetes) are not available in the nuMoM2b dataset, this study is unable to evaluate how the models’ performance would change after including the above unavailable risk factors. Finally, the accessibility of wearable devices used for HRV monitoring may vary across populations, particularly for individuals with lower socioeconomic status (SES) or limited health literacy. While our study demonstrates the technical feasibility of HRV-based prediction, the current reliance on wearable technology could pose barriers to equitable implementation in resource-constrained settings. Future research should validate this approach in socioeconomically diverse cohorts and explore alternative lower cost solutions to improve accessibility.

In conclusion, this study presents the first model designed to predict the risk of GDM early in pregnancy by utilizing overnight maternal HRV, which can potentially be measured unobtrusively at home. By integrating HRV characteristics with risk factors identified in existing guidelines, there was a notable improvement in the accuracy of early pregnancy GDM prediction. This enhancement underscores the utility of maternal HRV characteristics as physiological indicators for forecasting GDM risk.

Methods

Study population

This retrospective study population included individuals who were enrolled in the observational cohort study “Nulliparous Pregnancy Outcomes Study: Monitoring Mothers-to-Be” (nuMoM2b), in which nulliparous women with singleton pregnancies were recruited from hospitals affiliated with eight clinical centers in the USA between March, 2011 and September, 2013, and 4 visits were scheduled at 6–13, 16–21, and 22–29 weeks of gestation and at the time of the delivery⁶⁶. The local institutional review board of each study site approved the study protocol, and all women provided written informed consent before participation. The study population was selected from the original nuMoM2b participants as follows. Women were excluded in case they were diagnosed with diabetes before the current pregnancy, or when they did not undergo OGTT between 24 and 28 weeks gestation, or when they did not join standardized level 3 home sleep test between 6 and 13 weeks’ gestation, or when the sleep monitoring device did not work properly or valid sleep test duration of less than 2 h (equal to 24 valid 5-min segments), resulting in a sample size of n = 2748 participants (121 with GDM [4.4%]). The Eunice Kennedy Shriver National Institute of Child Health and Human Development review board (approval date: December 15, 2021, request ID: 11961) and Eindhoven Technology University ethical review board (approval date: November 1, 2021, reference number: ERB2021BME4) approved the study protocol. Our study complied with all relevant ethical regulations, including the Declaration of Helsinki and the Good Clinical Practice Guidelines.

Assessment of HRV characteristics

Overnight maternal cardiac rhythm was measured through a bipolar ECG sensor of the Embletta-Gold device⁶⁶ during a home sleep test at the first study visit between 6 weeks and 13 weeks plus 6 days of gestation. The overnight ECG signals were divided into non-overlapping 5-min segments, and a linear phase high-pass filter was used on each segment to eliminate the baseline drift of the ECG signal. The filter used a 1.106 s Kaiser window, a cutoff frequency of 0.8 Hz, and a sidelobe attenuation of 30 dB⁶⁷. We then removed segments whose average signal amplitude within 5-min segments was much lower than the normal ECG signal (<0.1 mV) and visually confirmed the removed parts. A low-complexity precise QRS complex localization algorithm⁶⁷ was then used to extract the R interval. We further removed the RR intervals that fell outside the range of more than 0.5–2 s³². In each 5-min interval, if the coverage rate of RR intervals was more than 50%, we considered it as valid segment otherwise it would be treated as noisy segment.

After removing noisy segments disrupted by motion artifacts, for each 5-min segment, HRV properties were quantified as time-domain characteristics (including average HR, SDNN, root mean square of successive differences (RMSSD), proportion of NN intervals differing by >20 ms or >50 ms (PNN20/PNN50) and mean absolute deviation (MAD)), frequency-domain characteristics (including ultra-low-frequency (VLF), low-frequency (LF) and high-frequency (HF)) and non-linear-domain characteristics (including, minor axis of Poincaré ellipse (SD1), major axis of Poincaré ellipse (SD2) and area of Poincaré ellipse (S)). Time-domain metrics capture ANS dynamics through sympathetic-parasympathetic balance (average HR), overall rhythm variability (SDNN), rapid parasympathetic modulation (RMSSD), and vagally mediated heartbeat fluctuation thresholds (PNN20/PNN50)⁶⁸. Frequency-domain oscillations correspond to distinct physiological regulatory processes: VLF associated with metabolic homeostasis⁶⁹, LF oscillations reflecting baroreflex-mediated blood pressure control⁷⁰, and HF components synchronized with respiratory-driven vagal activity⁷¹. Nonlinear-domain parameters characterize cardiac rhythm complexity via Poincaré plot geometry, where SD1 quantifies instantaneous beat-to-beat variability, SD2 represents long-term regulatory trends, and S (equal to SD1 multiplied by SD2) reflecting global autonomic complexity⁷². Detailed description of the HRV characteristics is provided in Supplementary Table 1. Then each HRV characteristic was overnight summarized using sample statistics [maximum, minimum, mean and standard deviation (SD)].

NIH guidelines for early risk assessment of GDM

The NIH recommended a self-administered, eight-binary questionnaire for clinicians to determine GDM risk in early pregnancy³⁸. The eight binary questions were applied to data collected in the first study visit and corresponded to eight risk factors: body mass index (BMI), family history of diabetes, race and ethnicity, age, history of GDM or macrosomia or miscarriage or stillbirth, history of polycystic ovary syndrome (PCOS), history of insulin resistance or blood glucose problem, and high blood pressure (systolic pressure ≥ 140 mmHg or diastolic pressure ≥90 mmHg) or heart disease (Supplementary Table 2). This study selected this early risk screening or assessment tool to represent the current clinical baseline in GDM early risk prediction, called “NIH guidelines”, for comparison with the proposed ML models. Since all the pregnant women in the nuMoM2b were nulliparous and the women with pre-existing diabetes were excluded from our study population, only history of miscarriage in the risk factor “history of GDM, macrosomia, miscarriage or stillbirth” was available. And for the risk factor “history of insulin resistance or blood sugar problem”, the specific pre-pregnancy blood glucose or insulin resistance levels were not available in the nuMoM2b dataset. We were unable to confirm whether there were participants who had insulin resistance or blood glucose problems before pregnancy except for those already diagnosed with diabetes. In principle, the NIH guidelines assessed the individual’s GDM risk from zero to eight through a risk score add-up methodology. For each question answered “yes”, the individual’s risk score was increased by one³⁷. However, since one question was not available on the nuMoM2b dataset, the maximum risk score that the NIH guidelines could provide is seven.

Machine learning prediction models

We developed three logistic regression (LR) based-ML models, each characterized by the set of input features included: a baseline model, an HRV model and a combined model. The baseline model included seven NIH-recommended risk factors, consisting of age, BMI, race and ethnicity, family history of diabetes, history of PCOS, high blood pressure or heart disease and history of miscarriage. The history of insulin resistance or blood glucose problems were not available except for those diagnosed with diabetes before pregnancy, and this risk factor was removed. The HRV model included all 52 overnight statistics [max, min, mean, and SD] features from 13 HRV characteristics mentioned above. The combined model included both the NIH suggested risk factors and the overnight HRV characteristics. The outcome variable for each subject was a binary value indicating whether the participant was diagnosed with GDM (1) or not (0). (Supplementary Table 3).

Given the simplicity and good interpretability of LR, it has been the most widely used algorithm in GDM risk stratification or prediction²⁸, motivating us to employ the same technique in our study. While LR is often positioned at the intersection of statistics and ML, this study considers it within the framework of statistical ML model, given its data-driven optimization of model parameters. Elastic net regularization was applied separately in each fold of cross validation to cope with potential collinearity and overfitting issues. To help with model convergence, features corresponding to the risk factors or HRV characteristics were z-score normalized before being entered into the model.

All the ML models were trained, validated and tested using ten-fold cross-validation. Participants were randomly partitioned into ten equally sized subsets, referred to as “folds”. Of the ten subsets, eight subsets were used as training sets (80%) for model training, one subset was used as the validation set (10%) for optimizing the model’s hyper-parameters and the remaining one subset was held out as the test set (10%) for testing the model. The cross-validation process was repeated ten times, with each of the ten subsets used exactly once as the test set. Within each training set we computed the Cohen’s d effect size for each feature to assess its discriminative capability for GDM and then applied effect size-based feature selection thresholds ranging from 0.2 (small effect) to 0.8 (large effect) during hyperparameter tuning⁷³. The hyperparameters were tuned using the validation set, including the regularization strength (C) over a range from 0.1 to 1, the L1 ratio with values from 0 to 1, and the effect size-based feature selection threshold ranging from 0.2 (small effect) to 0.8 (large effect). The model performance in predicting GDM was assessed using the area under the receiver-operating-characteristic (ROC) curve (AUC) and area under the precision-recall (PR) curve (AUPRC) and F1 score for all test results of cross-validation. The 95% confidence interval (CI) of the AUC, AUPRC and F1 score of each model was provided through 100 times randomly shuffled repeated experiments.

To assess the impact of identifying and excluding invalid segments, we first evaluated how the inclusion of such segments influenced model performance by comparing results when HRV features were derived from all segments versus only valid ones, with the study population being constant. We then examined how the number of valid segments, among samples deemed to contain only valid data, affected the reliability of HRV estimation and its contribution to model performance. Performance metrics, including AUC and AUPRC with corresponding 95% CI, were obtained through repeated experiments.

In the subgroup analysis, we compared the performance of the baseline, the HRV and the combined model under different age and BMI subgroups. Calibration curves were also plotted to evaluate the agreement between predicted probabilities and observed outcomes for the three ML models.

For statistical analysis, the two-sided Wilcoxon rank-sum test was used to examine the statistical significance of risk factors and characteristics between the GDM and non-GDM groups, and Benjamini-Hochberg procedure was used to adjust p value. Model performance was also statistically compared using the Wilcoxon rank-sum test.

Data availability

The datasets used and analyzed during the current study are available from the Eunice Kennedy Shriver National Institute of Child Health and Human Development—Data and Specimen Hub: https://dash.nichd.nih.gov/study/226675.

Code availability

The code generated for data analysis in the current study is available from the corresponding author (X.L.) on reasonable request.

References

Metzger, B. E. Summary and recommendations of the Third International Workshop-Conference on Gestational Diabetes Mellitus. Diabetes 40, 197–201 (1991).
Article PubMed Google Scholar
Saeedi, M. et al. Increasing prevalence of gestational diabetes mellitus when implementing the IADPSG criteria: a systematic review and meta-analysis. Diabetes Res. Clin. Pract. 172, 108642 (2021).
Article PubMed Google Scholar
Yuen, L. et al. Projections of the prevalence of hyperglycaemia in pregnancy in 2019 and beyond: results from the International Diabetes Federation Diabetes Atlas, 9th edition. Diabetes Res. Clin. Pract. 157, 107841 (2019).
Article PubMed Google Scholar
Gillespie, P. et al. Modeling the independent effects of gestational diabetes mellitus on maternity care and costs. Diabetes Care 36, 1111–1116 (2013).
Article PubMed PubMed Central Google Scholar
Behboudi-Gandevani, S. et al. The impact of diagnostic criteria for gestational diabetes on its prevalence: a systematic review and meta-analysis. Diabetol. Metab. Syndr. 11, 11 (2019).
Article PubMed PubMed Central Google Scholar
Bisson, M. et al. Gestational diabetes mellitus and sleep-disordered breathing. Obstet. Gynecol. 123, 634–641 (2014).
Article PubMed Google Scholar
Izci Balserak, B. et al. Obstructive sleep apnea is associated with newly diagnosed gestational diabetes mellitus. Ann. Am. Thorac. Soc. 17, 754–761 (2020).
Article PubMed PubMed Central Google Scholar
Boriboonhirunsarn, D. & Waiyanikorn, R. Emergency cesarean section rate between women with gestational diabetes and normal pregnant women. Taiwan. J. Obstet. Gynecol. 55, 64–67 (2016).
Article PubMed Google Scholar
Yefet, E. et al. The association between gestational diabetes mellitus and infections in pregnancy-systematic review and meta-analysis. Microorganisms 11, 1956 (2023).
Article CAS PubMed PubMed Central Google Scholar
Hwu, L.-J. et al. Risk of subsequent hypertension and diabetes in women with hypertension during pregnancy and gestational diabetes. Mayo Clin. Proc. 91, 1158–1165 (2016).
Article PubMed Google Scholar
Weissgerber, T. L. & Mudd, L. M. Preeclampsia and diabetes. Curr. Diabetes Rep. 15, 9 (2015).
Article Google Scholar
Ratner, R. E. et al. Prevention of diabetes in women with a history of gestational diabetes: effects of metformin and lifestyle interventions. J. Clin. Endocrinol. Metab. 93, 4774–4779 (2008).
Article CAS PubMed PubMed Central Google Scholar
Kc, K. et al. Gestational diabetes mellitus and macrosomia: a literature review. Ann. Nutr. Metab. 66, 14–20 (2015).
Article CAS PubMed Google Scholar
Voormolen, D. N. et al. Neonatal hypoglycemia following diet-controlled and insulin-treated gestational diabetes mellitus. Diabetes Care 41, 1385–1390 (2018).
Article CAS PubMed Google Scholar
Zhao, P. et al. Maternal gestational diabetes and childhood obesity at age 9–11: results of a multinational study. Diabetologia 59, 2339–2348 (2016).
Article CAS PubMed PubMed Central Google Scholar
Kolu, P. et al. Health care costs associated with gestational diabetes mellitus among high-risk women–results from a randomised trial. BMC Pregnancy Childbirth 12, 71 (2012).
Article PubMed PubMed Central Google Scholar
Plows, J. F. et al. The pathophysiology of gestational diabetes mellitus. Int. J. Mol. Sci. 19, 3342 (2018).
Article PubMed PubMed Central Google Scholar
Rani, P., Reddi, P. & Begum, J. Screening and diagnosis of gestational diabetes mellitus, where do we stand. J. Clin. Diagn. Res. 10, QE01–QE04 (2016).
CAS PubMed PubMed Central Google Scholar
Venkataraman, H. et al. Increased fetal adiposity prior to diagnosis of gestational diabetes in South Asians: More evidence for the ‘thin-fat’ baby. Diabetologia 60, 399–405 (2017).
Article CAS PubMed Google Scholar
Sovio, U. et al. Accelerated fetal growth prior to diagnosis of gestational diabetes mellitus: a prospective cohort study of nulliparous women. Diabetes Care 39, 982–987 (2016).
Article CAS PubMed Google Scholar
Kim, W. et al. Gestational diabetes mellitus diagnosed at 24 to 28 weeks of gestation in older and obese Women: Is it too late?. PLoS One 14, e0225955 (2019).
Article CAS PubMed PubMed Central Google Scholar
Song, C. et al. Lifestyle intervention can reduce the risk of gestational diabetes: a meta-analysis of randomized controlled trials. Obes. Rev. 17, 960–969 (2016).
Article CAS PubMed Google Scholar
McIntyre, H. D. et al. Gestational diabetes mellitus. Nat. Rev. Dis. Prim. 5, 47 (2019).
Article PubMed Google Scholar
Hivert, M.-F. et al. Pathophysiology from preconception, during pregnancy, and beyond. Lancet 404, 158–174 (2024).
Article CAS PubMed Google Scholar
Simmons, D. et al. Treatment of gestational diabetes mellitus diagnosed early in pregnancy. N. Engl. J. Med. 388, 2132–2144 (2023).
Article CAS PubMed Google Scholar
Prentice, P. M. et al. Reduced size at birth and persisting reductions in adiposity in recent, compared with earlier, cohorts of infants born to mothers with gestational diabetes mellitus. Diabetologia 62, 1977–1987 (2019).
Article CAS PubMed PubMed Central Google Scholar
van Poppel, M. N. M. et al. Is the biphasic effect of diabetes and obesity on fetal growth a risk factor for childhood obesity?. Diabetes Care 46, 1124–1131 (2023).
Article PubMed Google Scholar
Lamain-de Ruiter, M. et al. Prediction models for the risk of gestational diabetes: a systematic review. Diagn. Progn. Res. 1, 3 (2017).
Article PubMed PubMed Central Google Scholar
Fong, A. et al. Use of hemoglobin A1c as an early predictor of gestational diabetes mellitus. Am. J. Obstet. Gynecol. 211, 641.e1–7 (2014).
Article CAS PubMed Google Scholar
Ciliberto, C. F. & Marx, G. F. Physiological changes associated with pregnancy. Update Anaesth. 9, 1–3 (1998).
Google Scholar
Stein, P. K. et al. Changes in 24-hour heart rate variability during normal pregnancy. Am. J. Obstet. Gynecol. 180, 978–985 (1999).
Article CAS PubMed Google Scholar
Bester, M. et al. Maternal cardiorespiratory coupling: differences between pregnant and nonpregnant women are further amplified by sleep-stage stratification. J. Appl. Physiol. 135, 1199–1212 (2023).
Article PubMed PubMed Central Google Scholar
Hu, M. X. et al. Basal autonomic activity, stress reactivity, and increases in metabolic syndrome components over time. Psychoneuroendocrinology 71, 119–126 (2016).
Article PubMed Google Scholar
Wulsin, L. R. et al. Autonomic imbalance as a predictor of metabolic risks, cardiovascular disease, diabetes, and mortality. J. Clin. Endocrinol. Metab. 100, 2443–2448 (2015).
Article CAS PubMed Google Scholar
Moullé, V. S. et al. The autonomic nervous system regulates pancreatic β-cell proliferation in adult male rats. Am. J. Physiol. Endocrinol. Metab. 317, E234–E243 (2019).
Article PubMed PubMed Central Google Scholar
Jamerson, K. A. et al. Reflex sympathetic activation induces acute insulin resistance in the human forearm. Hypertension 21, 618–623 (1993).
Article CAS PubMed Google Scholar
Benichou, T. et al. Heart rate variability in type 2 diabetes mellitus: a systematic review and meta-analysis. PLoS One 13, e0195166 (2018).
Article PubMed PubMed Central Google Scholar
Eunice Kennedy Shriver National Institute of Child Health and Human Development, NIH, DHHS. Am I at Risk for Gestational Diabetes? 12–4818 (U.S. Government Printing Office, 2012).
Li, Y. et al. Maternal age and the risk of gestational diabetes mellitus: a systematic review and meta-analysis of over 120 million participants. Diabetes Res. Clin. Pract. 162, 108044 (2020).
Article PubMed Google Scholar
Torloni, M. R. et al. Prepregnancy BMI and the risk of gestational diabetes: a systematic review of the literature with meta-analysis. Obes. Rev. 10, 194–203 (2009).
Article CAS PubMed Google Scholar
Louis, J. M. et al. Predictors of sleep-disordered breathing in pregnancy. Am. J. Obstet. Gynecol. 218, 521.e1–521.e12 (2018).
Article PubMed Google Scholar
Haas, D. M. et al. A description of the methods of the Nulliparous Pregnancy Outcomes Study: Monitoring Mothers-to-Be (nuMoM2b). Am. J. Obstet. Gynecol. 212, 539.e1–539.e24 (2015).
Article PubMed Google Scholar
Qin, H. et al. The different facets of heart rate variability in obstructive sleep apnea. Front. Psychiatry 12, 642333 (2021).
Article PubMed PubMed Central Google Scholar
Gregory, E. C. W. & Ely, D. M. Trends and characteristics in gestational diabetes: United States, 2016-2020. Natl. Vital. Stat. Rep. 71, 1–15 (2022).
PubMed Google Scholar
Weissman, A. et al. Power spectral analysis of heart rate variability during the 100-g oral glucose tolerance test in pregnant women. Diabetes Care 29, 571–574 (2006).
Article PubMed Google Scholar
Heiskanen, N. et al. Gestational diabetic patients with adequate management have normal cardiovascular autonomic regulation during the third trimester of pregnancy and 3 months after delivery. J. Diabetes Complicat. 24, 234–241 (2010).
Article Google Scholar
Maser, R. E. et al. Autonomic modulation in gestational diabetes mellitus. J. Diabetes Complicat. 28, 684–688 (2014.
Article Google Scholar
Žákovičová, E. et al. ECG body surface mapping in patients with gestational diabetes mellitus and optimal metabolic compensation. Physiol. Res. 4(63 Suppl), S479–S487 (2014).
Article Google Scholar
Pöyhönen-Alho, M. et al. Imbalance of the autonomic nervous system at night in women with gestational diabetes. Diabet. Med. 27, 988–994 (2010).
Article PubMed Google Scholar
Svensson, M. K. et al. Alterations in heart rate variability during everyday life are linked to insulin resistance. A role of dominating sympathetic over parasympathetic nerve activity?. Cardiovasc. Diabetol. 15, 91 (2016).
Article PubMed PubMed Central Google Scholar
Arroyo-Carmona, R. E. et al. Heart rate variability as early biomarker for the evaluation of diabetes mellitus progress. J. Diabetes Res. 2016, 8483537 (2016).
Article PubMed PubMed Central Google Scholar
Ciccone, A. B. et al. Reminder: RMSSD and SD1 are identical heart rate variability metrics. Muscle Nerve 56, 674–678 (2017).
Article PubMed Google Scholar
Kubičková, A. et al. Heart rate variability analysed by Poincaré plot in patients with metabolic syndrome. J. Electrocardiol. 49, 23–28 (2016).
Article PubMed Google Scholar
Qiu, J. et al. Early-pregnancy maternal heart rate is related to gestational diabetes mellitus (GDM). Eur. J. Obstet. Gynecol. Reprod. Biol. 268, 31–36 (2022).
Article PubMed Google Scholar
Zheng, Y.-L. et al. Unobtrusive sensing and wearable devices for health informatics. IEEE Trans. Biomed. Eng. 61, 1538–1554 (2014).
Article PubMed PubMed Central Google Scholar
Chen, S. et al. Flexible wearable sensors for cardiovascular health monitoring. Adv. Healthc. Mater. 10, e2100116 (2021).
Article PubMed Google Scholar
Werth, J. et al. Unobtrusive sleep state measurements in preterm infants—a review. Sleep. Med. Rev. 32, 109–122 (2017).
Article PubMed Google Scholar
Huhn, E. A. et al. Sensitivity, specificity, and diagnostic accuracy of WHO 2013 criteria for diagnosis of gestational diabetes mellitus in low risk early pregnancies: International, prospective, multicentre cohort study. BMJ Med. 2, e000330 (2023).
Article PubMed PubMed Central Google Scholar
Benhalima, K. et al. The 2024 Flemish consensus on screening for gestational diabetes mellitus early and later in pregnancy. Acta Clin. Belg. 79, 217–224 (2024).
Article PubMed Google Scholar
Hod, M. et al. The International Federation of Gynecology and Obstetrics (FIGO) initiative on gestational diabetes mellitus: a pragmatic guide for diagnosis, management, and care. Int. J. Gynaecol. Obstet. 131, S173–S211 (2015).
Article PubMed Google Scholar
Torlone, E. et al. Italian recommendations for the diagnosis of gestational diabetes during COVID-19 pandemic: position statement of the Italian Association of Clinical Diabetologists (AMD) and the Italian Diabetes Society (SID), diabetes, and pregnancy study group. Nutr. Metab. Cardiovasc. Dis. 30, 1418–1422 (2020).
Article CAS PubMed PubMed Central Google Scholar
Webber, J., Charlton, M. & Johns, N. Diabetes in pregnancy: management of diabetes and its complications from preconception to the postnatal period (NG3). Br. J. Diabetes 15, 107–111 (2015).
Article Google Scholar
American Diabetes Association Professional Practice Committee 15. Management of diabetes in pregnancy: standards of care in diabetes-2025. Diabetes Care 48, S306–S320 (2025).
Article Google Scholar
Radha, M. et al. A deep transfer learning approach for wearable sleep stage classification with photoplethysmography. NPJ Digit. Med. 4, 135 (2021).
Article PubMed PubMed Central Google Scholar
Speranza, G. et al. Autonomic changes during pregnancy: assessment by spectral heart rate variability analysis. J. Electrocardiol. 31, 101–109 (1998).
Article CAS PubMed Google Scholar
Facco, F. L. et al. NuMoM2b sleep-disordered breathing study: objectives and methods. Am. J. Obstet. Gynecol. 212, 542.e1–127 (2015).
Article PubMed Google Scholar
Fonseca, P. et al. A novel low-complexity post-processing algorithm for precise QRS localization. SpringerPlus 3, 376 (2014).
Article PubMed PubMed Central Google Scholar
Shaffer, F. & Ginsberg, J. P. An overview of heart rate variability metrics and norms. Front. Public Health 5, 258 (2017).
Article PubMed PubMed Central Google Scholar
Task Force of the European Society of Cardiology and the North American Society of Pacing and Electrophysiology Heart rate variability: standards of measurement, physiological interpretation and clinical use. Circulation 93, 1043–1065 (1996).
Article Google Scholar
Malliani, A. et al. Cardiovascular neural regulation explored in the frequency domain. Circulation 84, 482–492 (1991).
Article CAS PubMed Google Scholar
Berntson, G. G. et al. Heart rate variability: origins, methods, and interpretive caveats. Psychophysiology 34, 623–648 (1997).
Article CAS PubMed Google Scholar
Hoshi, R. A. et al. Poincaré plot indexes of heart rate variability: relationships with other nonlinear variables. Auton. Neurosci. Basic Clin. 177, 271–274 (2013).
Article Google Scholar
Masino, N. & Quintero-Rincón, A. Effect sizes as a statistical feature-selector-based learning to detect breast cancer. In Proc. IEEE Biennial Congress of Argentina (ARGENCON) (IEEE, 2024).

Download references

Acknowledgements

This study was performed within the framework of the Eindhoven MedTech Innovation Center (e/MTIC) and was funded by the China Scholarship Council (Grant No: 201906340168). The funder played no role in study design, data collection, analysis and interpretation of data, or the writing of this manuscript.

Author information

Authors and Affiliations

Department of Electrical Engineering, Eindhoven University of Technology, Eindhoven, The Netherlands
Yanqi Wu, Sima Asvadi, M. Beatrijs van der Hout-van der Jagt, Elisabetta Peri, Pedro Fonseca, Sebastiaan Overeem, S. Guid Oei, Massimo Mischi & Xi Long
Eindhoven MedTech Innovation Center, Eindhoven, The Netherlands
Yanqi Wu, Myrthe van der Ven, M. Beatrijs van der Hout-van der Jagt, Elisabetta Peri, Pedro Fonseca, S. Guid Oei, Massimo Mischi & Xi Long
Philips Research, Eindhoven, The Netherlands
Sima Asvadi
Department of Biomedical Engineering, Eindhoven University of Technology, Eindhoven, The Netherlands
Myrthe van der Ven
Department of Obstetrics and Gynaecology, Máxima Medical Center, Veldhoven, The Netherlands
Myrthe van der Ven, M. Beatrijs van der Hout-van der Jagt & S. Guid Oei
Philips Sleep and Respiratory Care, Eindhoven, The Netherlands
Pedro Fonseca
Sleep Medicine Center Kempenhaeghe, Heeze, The Netherlands
Sebastiaan Overeem

Authors

Yanqi Wu
View author publications
Search author on:PubMed Google Scholar
Sima Asvadi
View author publications
Search author on:PubMed Google Scholar
Myrthe van der Ven
View author publications
Search author on:PubMed Google Scholar
M. Beatrijs van der Hout-van der Jagt
View author publications
Search author on:PubMed Google Scholar
Elisabetta Peri
View author publications
Search author on:PubMed Google Scholar
Pedro Fonseca
View author publications
Search author on:PubMed Google Scholar
Sebastiaan Overeem
View author publications
Search author on:PubMed Google Scholar
S. Guid Oei
View author publications
Search author on:PubMed Google Scholar
Massimo Mischi
View author publications
Search author on:PubMed Google Scholar
Xi Long
View author publications
Search author on:PubMed Google Scholar

Contributions

Conceptualization: Y.W., X.L., S.A., and M.v.d.V.; Data curation: Y.W., S.A., M.v.d.V., and M.B.v.d.H.; Formal analysis: Y.W., S.A., M.v.d.V., M.B.v.d.H., and X.L.; Investigation and clinical interpretation: M.v.d.V.,. M.B.v.d.H., P.F., S.O., E.P. and S.G.O.; Writing—original draft: Y.W. and X.L.; Writing—review & editing: S.A., M.v.d.V., M.B.v.d.H., S.G.O., M.M., P.F., S.O., and E.P.; Supervision: M.M. and X.L.; Funding acquisition: Y.W. and X.L.; All authors had full access to the data in the study and accepted responsibility for the decision to submit for publication.

Corresponding author

Correspondence to Xi Long.

Ethics declarations

Competing interests

At the time of writing, S.A. and P.F. were employed by or affiliated with Royal Philips, a health-tech company and manufacturer of consumer and medical electronic devices, commercializing products and solutions in the area of mother and child-care. Philips had no role in the study design, decision to publish, or preparation of the manuscript. The other authors have no conflicts of interest.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Wu, Y., Asvadi, S., van der Ven, M. et al. Overnight maternal heart rate variability for early prediction of gestational diabetes mellitus: a retrospective cohort study. npj Womens Health 3, 37 (2025). https://doi.org/10.1038/s44294-025-00081-z

Download citation

Received: 01 January 2025
Accepted: 22 May 2025
Published: 18 June 2025
Version of record: 18 June 2025
DOI: https://doi.org/10.1038/s44294-025-00081-z