Introduction

Pulmonary and cardiovascular development and function are closely linked during fetal development and postnatal life1. In newborn infants, the interaction between the pulmonary and cardiovascular system plays an important role during transition after birth. However, in preterm infants normal development of the pulmonary and cardiac systems is interrupted, which results in delayed cardiopulmonary adaptation and potentially to delayed long-term development of these systems2. The most common long term respiratory complication after preterm birth is bronchopulmonary dysplasia (BPD). BPD affects between five to 30% of all very preterm infants (< 32 w gestation) and its prevalence is increasing due to increased survival rates of extremely preterm infants born at the limits of viability3,4. Evolving BPD is considered a developmental disease of the lung structure, resulting in ventilation-perfusion mismatch, clinically characterized by frequent intermittent hypoxia, prolonged requirement of respiratory support and supplemental oxygen to maintain stable oxyhemoglobin saturation (SpO2)2,5. In addition to structural immaturity of the lung, preterm infants exhibit an immature cardiopulmonary autonomous nervous system2,6,7. Preterm infants typically show irregular breathing and frequently have apnea of prematurity, associated with unstable SpO2, hypoxemia and bradycardia during the first weeks to months of life8.

Maturation of the cardiac autonomous nervous system is associated with an increased parasympathetic activity and can be quantitatively assessed using the heart rate variability (HRV). HRV increases as the parasympathetic activity increases, which explains the lower HRV in preterm infants compared to term infants6,9. Further, the mathematical analysis of HR time series can be used to assess the predictive value of the HR for clinically important events or medium-term clinical outcomes in neonates. For instance, the sample entropy (SampEn) of HR time series, a parameter describing complexity of values within a time series of heart beats, can be used as an early marker of neonatal sepsis10 and as a predictor of stability of the respiratory system in preterm infants over several months11. SampEn decreases not only when physiological data is highly regular but also in the presence of spikes/decelerations, or both12. Such phenomena are typical in time series analyses of physiological signals in preterm infants. The SampEn can not only be calculated from HR but also from time series of other physiological signals such as SpO2, provided the time series are sufficiently long and have been sampled at a frequency corresponding to the natural time scales of the physiological processes involved. In critically ill adults with sepsis, lower SampEn of SpO2 was a predictor of mortality13. Given the impact of immaturity and BPD on SpO2 stability and the predictive value of SampEn of HR on outcomes in preterm neonates, SampEn of time series of SpO2 may offer diagnostic value in this population. E.g., SampEn of SpO2 time series might be associated with maturational processes and respiratory disease in preterm infants. We would expect SampEn of SpO2 to increase with postnatal maturation and with resolution of lung disease due to less frequent hypoxemic events. Therefore, substantial differences in SampEn between infants with and without BPD might enable an earlier diagnosis of BPD by defining a SampEn threshold.

We aimed to investigate the SampEn of time series of SpO2 in preterm compared to term infants. Moreover, we aimed to compare the SampEn of SpO2 in preterm infants with and without BPD across several time points during their hospitalization after birth. We then aimed to identify demographic and respiratory disease related factors associated with SampEn of SpO2 at the different time points. This study also evaluates the technical feasibility and validity of the SampEn of SpO2 in a clinical setting. We hypothesized that SampEn of SpO2 is lower in preterm infants compared to term infants, and that the presence of BPD is associated with lower SampEn among preterm infants. Secondary outcomes include the variability in SpO2 of infants with BPD through calculation of the mean, the standard deviation (SD) and the coefficient of variation (CV).

Results

Demographic data

We included a total of 275 infants in this study. Of these, 180 were preterm infants and 95 term healthy infants. Forty-four of the 180 preterm infants (24.4%) were subsequently diagnosed with BPD (Fig. 1). Anthropometric data are outlined in Table 1. Gestational age, birth weight, and length at birth of infants with BPD were lower compared to infants without BPD. All preterm infants received caffeine during hospitalization. Caffeine administration was stopped later in preterm infants with BPD (caffeine administration at 36 w PMA in 80% of BPD infants vs. 20% of infants without BPD). Duration of supplemental oxygen and duration of respiratory support were longer in infants with BPD compared to those without BPD (Table 2).

Fig. 1
figure 1

Flow of participants through the phases of the study. BPD, bronchopulmonary dysplasia; PMA, postmenstrual age.

Table 1 Demographical characteristics of study participants.
Table 2 Time on respiratory support.

Sample entropy

The durations and effective durations of all SpO2 time series obtained at 32 w PMA, 36 w PMA, and at discharge are outlined in Table 3. Total duration refers to the whole measurement, whereas the effective duration is the duration of the longest uninterrupted interval in the time series, i.e., the longest interval containing no missing values/artefacts. On average, the effective duration was > 5400 s at 36 weeks PMA and at discharge. The average effective duration was 4958 s at 32 weeks PMA due to difficulties obtaining uninterrupted measurements in preterm infants with BPD.

Table 3 Duration of sample entropy measurements.

Overall, the SampEn of SpO2 in preterm infants at discharge was lower than that in term healthy infants (0.18 (0.10) vs. 0.22 (0.11), p < 0.05). Infants with BPD had a significantly lower mean (SD) SampEn compared to infants without BPD at 32 w PMA (0.19 (0.08) vs. 0.26 (0.09); p < 0.001) (Fig. 2). However, the difference in SampEn was no longer present at 36 w PMA and at discharge. In preterm infants without BPD, SampEn decreased from 32 w PMA to 36 w PMA (0.26 (0.09) vs. 0.16 (0.12), p < 0.001), but not from 36 w PMA to discharge (0.16 (0.12) vs. 0.17 (0.12), p 0.75). Infants with BPD had significantly lower SampEn of SpO2 compared to term infants at discharge (Fig. 2). Example waveforms of SpO2 time series are shown in Fig. 3.

Fig. 2
figure 2

Sample Entropy of SpO2 measurements. BPD, bronchopulmonary dysplasia; 32, 32 w postmenstrual age; 36, 36 w postmenstrual age; DG, discharge; TB, term born. * p < 0.05, ** p < 0.01, *** p < 0.001 (Student’s t-test).

Fig. 3
figure 3figure 3

Example waveforms of SpO2 time series of 3 patients at (a) 32 weeks PMA, (b) 36 weeks PMA, (c) discharge, (d) Term born. BPD, bronchopulmonary dysplasia; PMA, postmenstrual age; SampEn, sample entropy.

We identified gestational age, postnatal age, birthweight z-score, the diagnosis of BPD, days on oxygen, days on conventional mechanical ventilation, days on NIPPV (non-invasive positive-pressure ventilation) and days on HHHF (heated humidified high flow) and caffeine at the time of measurement as potential explanatory factors for SampEn of SpO2 at 32 and 36 w PMA and at discharge. Gestational age as an indicator of maturation is more important than postnatal age in preterm infants. Therefore, we omitted postnatal age due to collinearity with gestational age in the final model (R2 = 0.99). Moreover, variables related to respiratory support were collinear with gestational age. Therefore, we computed the unstandardized residual for days on oxygen, conventional mechanical ventilation, days on NIPPV and days on HHHF against gestational age prior to the univariate analysis. We used the unstandardized residuals as independent measures of respiratory support.

Gestational age was the only statistically significant predictor of SampEn of SpO2 at 32 w PMA. Gestational age explained 17.8% of the variability seen in SampEn of SpO2 at 32 w PMA (adjusted R2 = 0.178, p < 0.001). Days on NIPPV independent of gestational age was the only predictor of SampEn of SpO2 at 36 w PMA and explained 4% of the variability (adjusted R2 = 0.039, p < 0.001). At discharge none of the entered variables were statistically significantly associated with SampEn of SpO2.

Additional measures of variability

We evaluated the mean, SD, and CV of SpO2 time series as measures of central tendency and dispersion in the time series (Table 4). SD and CV of SpO2 in preterm infants was higher compared to term infants. In infants with BPD, SpO2 time series had statistically significantly lower mean and higher SD and CV compared to infants without BPD at all three time points. Mean SpO2 increased in all preterm infants over time. Also, SD and CV of SpO2 decreased over time in infants with and without BPD, with still significantly higher SD and CV in infants with BPD vs. infants without BPD at discharge.

Table 4 SpO2.

Discussion

We found that SampEn of time series of SpO2 was lower in preterm infants compared to term healthy infants at discharge. SampEn of SpO2 was lower in preterm infants with BPD compared to infants without BPD at 32 w PMA. However, the main predictor of SampEn of SpO2 at 32 w PMA is maturation at birth. The difference in SampEn of SpO2 between infants with and without BPD was no longer present at 36 w PMA or at discharge. NIPPV was a significant predictor of SampEn of SpO2 at 36 w PMA but explained only 4% of the variability of SampEn of SpO2. Longitudinally, SampEn of SpO2 in preterm infants with BPD did not substantially change from 32 w PMA to discharge while there was a marked decrease in preterm infants without BPD from 32 to 36 w PMA (but not from 36 w PMA to discharge). As expected, conventional measures of dispersion including SD and CV of SpO2 in preterm infants at discharge were higher compared to term infants. In preterm infants with BPD, time series of SpO2 had higher SD and CV than those without BPD across all time points.

To the best of our knowledge, this is the first study comparing SampEn of SpO2 in preterm vs. term infants and, longitudinally, in preterm infants with and without BPD over the course of weeks to months. Recent studies in adults indicate that SampEn of SpO2 is a good predictor of exacerbations in patients with chronic obstructive pulmonary disease and of survival in critically ill patients with sepsis13,14. Small studies in neonates (n = 25–44) showed that low SampEn of SpO2 is associated with a higher risk of intermittent hypoxemic events in the NICU and an important predictor of periventricular leukomalacia following neonatal heart surgery15,16. There are no published studies assessing the association of SampEn of SpO2 with neonatal respiratory disease or with postnatal maturation in preterm neonates. However, low SampEn of time series of HR was previously identified as an indicator of extubation failure and of poor postnatal maturation in preterm infants17,18.

We expected SampEn of SpO2 to be lower in preterm infants with BPD compared to infants without BPD. Furthermore, we expected an increase in SampEn of SpO2 towards discharge, potentially approaching values of healthy term infants, i.e., to reflect both resolution of respiratory disease and postnatal maturation of lung structure and autonomous nervous system. We found that SampEn of SpO2 in preterm infants with BPD was lower at 32 w PMA compared to infants without BPD, but the main predictor of SampEn of SpO2 at 32 w PMA is maturation at birth. Nevertheless, values remained stable until discharge in infants with BPD and decreased significantly in those without BPD from 32 to 36 w PMA (but not from 36 w PMA towards discharge). A decrease in SampEn of SpO2 in infants without BPD over several weeks seems counterintuitive. However, the effect of important co-factors may explain this finding: Almost all preterm infants were prescribed caffeine at 32 w PMA, independent of BPD status (170 of 171 infants). In contrast, 80% of infants with BPD but only 20% of infants without BPD still received caffeine at 36 w PMA which may have resulted in an increased frequency of short hypoxemic events in the former, leading to lower SampEn of SpO2 after cessation of caffeine. Caffeine citrate is a highly effective treatment for apnea of prematurity and prevention of BPD19. The discrepancy in rates of caffeine administration at 36 w PMA is a result of clinical practice as physicians typically discontinue caffeine upon increasing cardiorespiratory stability of patients which occurs earlier in those infants who do not develop BPD. In previous studies, caffeine administration induced an enhanced autonomic nervous system responsiveness, reflected by indices of parasympathetic activity from HR and blood pressure systems20. Thus, the decrease of SampEn in time series of SpO2 from 32 to 36 w PMA in infants without BPD may be related to earlier discontinuation of caffeine administration, resulting in values comparable to those of infants with BPD who remained on caffeine. As a result, caffeine therapy is not statistically significantly associated with SampEn of SpO2 at 36 w PMA. Also, with increasing age and improving lung maturation, preterm infants without BPD will tend to show SpO2 values close to 100% for extended periods of time. Due to this ceiling effect, SampEn of SpO2 may decrease as the time series may become highly regular. SampEn may potentially even approach lower values than that of ‘less stable’ appearing time series (Fig. 3b), which might also explain the longitudinal decrease in SampEn in preterm infants without BPD from 32 to 36 w PMA. The persistently low SampEn of time series of SpO2 in infants with BPD at 36 w PMA may be explained by ongoing respiratory disease. At discharge, SampEn of SpO2 did not differ between preterm infants with and without BPD. Importantly, the average PMA at discharge in preterm infants with BPD was 40 w PMA but only 37 w PMA in those without BPD. Given that the SampEn of a cardiorespiratory signal in infants may be interpreted as an integrative physiological marker of health12, the 3-week advantage in maturation to overcome residual respiratory disease may explain this finding. Overall, values in preterm infants at discharge were lower than those of healthy term infants which may reflect differences in maturation as the mean PMA at discharge in the whole cohort of preterm infants was 37.4 w vs. 40.3 w in healthy term infants. Even though healthy term infants were on average three weeks older at discharge compared to the preterm infants included in our study (40 w PMA compared to 37 w PMA), healthy term infants seem to be the only suitable control group of infants for our discharge measurements. Healthy term infants are ready for discharge within hours to days after birth as they are mature enough and maintain a stable SpO2. Thus, important factors to consider when evaluating SampEn of SpO2 in infants with and without BPD over various timepoints are gestational age, caffeine administration and the ceiling effect of SpO2 values.

In terms of secondary outcome measures, we found an increased variability in SpO2 as measured by the SD and CV of SpO2 in preterm infants with BPD compared to preterm infants without BPD and term healthy infants, potentially reflecting the lack of stability of oxygenation in infants with BPD. This seems biologically plausible and suggests that conventional measures of variation of a set of values, such as SD and CV, are not necessarily aligned with measures of complexity derived from nonlinear time series analysis.

To the best of our knowledge, this is the first study investigating SampEn of SpO2 in preterm infants with and without BPD and comparing them with term healthy infants in a pragmatic clinical care setting. Our study was prospectively designed and included a total of 275 infants, ensuring sufficient statistical power to evaluate and interpret results. Also, the aimed-for duration of the time series was 90 min, resulting in about 5000 data points per measurement which is safely above the 2000 data points where entropy values seem to stabilize21,22. Further, the use of the SpO2 signal derived from peripheral pulse oximeters instead of the HR from electrocardiogram allows for application and repetition of such an analysis in clinical study settings beyond intensive care, as pulse oximeters are commonly available in normal wards and even outpatient areas.

Our study has some limitations that require careful interpretation of the results. Although we included preterm infants from 23 to 32 w gestation, only 44/180 preterm infants had BPD due to the relatively low rate of BPD among infants at our hospital. This may result in spectrum bias. Along with the single-center design, this limits generalizability of the results, particularly since respiratory care practices and incidences of BPD vary widely among hospitals and countries23. We corrected for differences between extrauterine maturation in infants with and without BPD by assessing SampEn of SpO2 at a common PMA. However, BPD is an evolving disease and extrauterine development might be affected by several factors such as preexisting disease, nutritional management, or intercurrent infections. Nevertheless, assessment at a common PMA seems a reasonable approach to correct for extrauterine maturation. There are multiple potential technical and clinical confounders that need to be considered. Restlessness can affect the artefact-free pulse oximetry measurement. By measuring the infants 30 min after a morning feed in a sleeping or quiet awake state, we aimed to minimize these artefacts on the pulse oximetry measurements. Also, clinical confounders like sepsis, inotrope use, and caffeine citrate can affect SampEn data10. Therefore, we assessed for any relevant medication during measurements. Almost all infants received caffeine at 32 w PMA (n = 170/171). At 36 w 80% of infants with BPD and 20% of infants without BPD still received caffeine. However, at 36 PMA, caffeine therapy was not associated with SampEn of SpO2. At 32 w, 5 infants received antibiotic treatment, of which 1 infant (20%) had BPD and died. One infant had already received antibiotic treatment for 14 days and thus did not have an acute infection. At 36 w, one infant without BPD received antibiotics. No inotrope use was documented during any of the measurements. Due to the low number of infants (3%) that received antibiotic treatment, we do not expect there to be a relevant impact on our results. Further, although calculations of SampEn of SpO2 were readily available within minutes after the measurements, they were not available as continuous outputs that could directly be used by clinicians in the NICU which limits clinical utility in routine preterm care. Furthermore, when assessing SampEn of SpO2 time series the underlying algorithm of SpO2 signal processing might differ dependent on the manufacturer thereby affecting the results24.

Given the widespread use and multitude of effects of caffeine on the cardiorespiratory system in preterm infants25, it seems imminent to assess its effects on parameters of nonlinear time series analyses. Particularly when considering that nonlinear time series analyses may soon be implemented in clinical care and integrated into NICU monitoring systems26. Further, we measured SampEn of SpO2 in a clearly defined cohort of preterm infants and a control group of healthy term infants to assess the diagnostic utility related to BPD and postnatal maturation. Arguably, the relative importance of intermittent hypoxic events and a ceiling effect of SpO2 values at 100%, development of reference values and the use of standardized methods for signal acquisition, data processing, and interpretation would allow for comparison across different physiological signals, patient populations, and study settings. Nonlinear time series analysis in preterm infants is possible in a variety of physiological signals such as HR, respiratory rate, SpO2, or body temperature11,12,15,17,27. If and how SampEn estimations correlate across several physiological signals, whether any potential correlations depend on the time scale of measurement, and how they may complement each other to improve diagnostic utility for early detection of disease or monitoring of maturation in preterm infants remains to be investigated in future studies.

SampEn of time series of SpO2 can be utilized to discriminate between preterm infants with and without BPD and preterm vs. healthy term infants but requires careful consideration of confounding factors such as maturation at birth and the natural boundary of 100% of SpO2 values. In contrast, standard measures of variation such as mean, standard deviation and coefficient of variation of SpO2 seem less dependent on confounding factors. The potential of SpO2-SampEn for monitoring postnatal maturation and its relation to other physiological signals suitable for nonlinear time series analysis requires further study. Also, after the initial validation phase, more clinical studies need to be conducted to validate the clinical utility of SpO2-SampEn for early prediction of adverse neonatal outcomes.

Methods

Study design

We conducted a single center observational cohort study at the University Children’s Hospital Basel (UKBB) in Switzerland between December 2018 and July 2022. The study protocol was approved by the Ethics Committee of Northwestern and Central Switzerland (EKNZ, Ref 2018-01,207). All measurements were performed in accordance with the regulations of the ethics committee. Written parental consent was obtained for all participants prior to study enrolment. The study was registered at clinicaltrials.gov (NCT04303494).

Participants

This study is a subproject within the larger study Predictive Value of Heart Rate Variability on Cardiorespiratory Events (NCT04303494). Very preterm infants (< 32 w gestation) and/or those with a birth weight less than 1500 g were eligible for the study. Inclusion criteria for term healthy control infants were (1) chronological age of 3 to 28 days, and (2) hospitalization in the neonatal rooming-in unit as a healthy sibling, or a healthy neonate ready for discharge from the maternity ward. Exclusion criteria included major congenital malformations including neuromuscular disorders, thoracic malformations, or a major cardiac malformation. Eligible infants were allocated into groups according to whether they were born preterm or at term and the presence of a BPD diagnosis according to the NICHD BPD definition published in 20011.

Clinical variables and measurements

We obtained sex, gestational age, birth weight and length at birth from medical records. Body weight was recorded at each test occasion. We measured SpO2 over a 90-min period at 32 w postmenstrual age (PMA), 36 w PMA and at discharge. To rule out the effects of night/day cycling, spontaneous movements, feeding, and nursing care procedures on SpO2, we aimed to start SampEn recordings at 9 am while infants were asleep, or at least in a quiet awake state 30 min after a morning feed. Consequently, if above conditions could not be met after the morning feed, the next feasible observation period after an 11 am feed was chosen. SpO2 was measured via pulse oximetry using the standard neonatal intensive care unit (NICU) monitoring system (Philips IntelliVue MX700, Philips Healthcare, Amsterdam, Netherlands). FiO2 management was based on standard operating procedures and overseen by nurses at the bedside and adapted to achieve an SpO2 range of 87–95% in infants with supplemental oxygen and 87–100% in infants without supplemental oxygen. Raw data was acquired at 0.98 Hz and processed by a commercially available software (ixTrend and Dataplore, ixcellence GmbH, Wildau, Germany). All clinical data was stored in a LabKey database installed on a local server. LabKey is an open-source platform for scientific data integration, analysis, and collaboration28. SampEn for time series of SpO2 measurements was calculated based on the analysis, insights and recommendations outlined by Richman and Moorman21, using custom-made software implemented in R29 and using the following R-Packages: “TSEntropies”30, “lubridate”31 “dplyr”32, “FSA”33, "data.table"34, “Rlabkey”35, “mgsub”36. For the calculation of the SampEn using the algorithm implemented within the R-package “TSEntropies”, the following parameters were used: Embedding dimension m = 3, radius of searched areas r = 0.2*sd(TS), where sd(TS) stands for the standard deviation of the whole time series at hand. No downsampling was used, i.e., the lag parameter was set equal to 1. Furthermore, for the ease of use for our study staff, our R-code was integrated in a graphical user interface (GUI) using the Shiny Framework37,38,39. SampEn is the “negative natural logarithm of the conditional probability that two sequences similar for m points remain similar at the next point”, without allowing self-matches21. Thus, SampEn is a measure of randomness of a time series. In other words, SampEn represents the rate at which new information is produced in a time series. Low SampEn indicates lower complexity within a time series, i.e., less information production. Typically, reduced SampEn of HR time series is associated with short- or medium-term disease in neonates10,11,21,40. Based on previous work by Richman and Moorman21, our goal was to collect time series of SpO2 containing at least 4096 values measured equidistantly in time, which corresponds to measuring uninterruptedly for 69.66 min at a sampling frequency of 0.98 Hz. Thus, and in order to leave a margin, we obtained each time series by measuring for 90 min. However, due to movement artefacts during some measurements, the resulting time series occasionally contained missing values or invalid values. Consequently, we searched in each time series for the longest uninterrupted interval. The duration of this interval is what we called the effective duration of the time series. Oftentimes the effective duration was longer or equal to 69.66 min, i.e., the minimum duration required according to Richman and Moorman. In the few cases in which the effective duration was shorter, we increased the effective duration by imputing missing values/artefacts. Imputation was done using Spline interpolation as implemented in the R-package “imputeTS”41. In such cases, only time series with less than 3% missing values were used.

Statistical methods

We tested for normality using the Shapiro–Wilk’s method combined with visual inspection through Q–Q plots and testing for absence of variance through the F-test. We used mean, SD and the independent two-tailed t-test for normally distributed data and median, interquartile range (IQR) and the Mann–Whitney U test (Wilcoxon test) for non-normally distributed data. Linear regression analysis was used to assess potential explanatory variables of SampEn of SpO2 at the time points 32- and 36-weeks postmenstrual age as well as at discharge. All statistical analyses were performed using the statistical software R (version 4.0.3, R Development Core Team, Vienna Austria)29 and figures were produced using the package ggplot242. Missing values were analyzed through available case analysis. Aiming for 80% power at a 5% significance level, assuming that prematurity status and BPD have a small effect size (f2 = 0.26)43, we planned to analyze data from a sample of n = 180 preterm infants and n = 100 term infants.