Identifying performance differences between two pulse oximetry systems in simulated critical neonatal conditions

King, Brian; Dove, Jake; McGonigle, Scott J.; Ames, Warwick A.; Vesoulis, Zachary A.

doi:10.1038/s41372-025-02364-4

Download PDF

Article
Open access
Published: 30 July 2025

Identifying performance differences between two pulse oximetry systems in simulated critical neonatal conditions

Brian King¹,
Jake Dove²,
Scott J. McGonigle³,
Warwick A. Ames⁴ &
…
Zachary A. Vesoulis ORCID: orcid.org/0000-0001-8290-0069⁵

Journal of Perinatology volume 45, pages 1608–1614 (2025)Cite this article

1744 Accesses
1 Citations
19 Altmetric
Metrics details

Subjects

A Correction to this article was published on 18 September 2025

This article has been updated

Abstract

Objective

Pulse oximetry is used to guide critical clinical decisions in neonatology. We used a vital signs simulator to compare performance of two pulse oximetry systems in conditions not tested in standardized clinical verification studies.

Study design

We devised a set of simulated tissue translucency, perfusion, peripheral oxygen saturation (SpO₂)_, and heart rate (HR) parameters to mimic challenging real-world neonatal data and applied them to two marketed pulse oximetry systems (Nellcor™ and Masimo®). At each combination of input parameters, we used the response from both systems to assess SpO₂ error.

Results

The mean SpO₂ error for Nellcor™ was below 1.1% across all parameters explored, while Masimo® showed significantly higher (p < 0.005) error at lower translucencies.

Conclusion

Significant performance differences can be observed when comparing pulse oximeters at low translucency and perfusion conditions. Patient simulators cannot replace clinical testing but provide a safe and cost-effective method for additional performance profiling.

Open Access Dataset and Common Data Model for Pulse Oximeter Performance Data

Article Open access 03 April 2025

Clinical feasibility of an advanced neonatal epidermal multiparameter continuous monitoring technology in a large public maternity hospital in Nairobi, Kenya

Article Open access 09 July 2022

Smartphone camera oximetry in an induced hypoxemia study

Article Open access 19 September 2022

Introduction

Pulse oximetry has become a standard of care for continuous monitoring in the neonatal intensive care unit (NICU), and is used in a variety of contexts, including guiding resuscitation in the delivery room [1, 2], monitoring of oxygen saturation in the operating room, and more recently, to screen for critical congenital heart disease (CCHD) [3,4,5].

Further, pulse oximetry is used to guide careful titration of respiratory support in the first few days of life, when preterm neonates typically undergo a transition in physiology and an accurate evaluation of subclinical hypoxemia is critical for the evolving diagnostic and medical management. Preterm infants must maintain a tenuous balance when receiving supplemental oxygen, as small changes in the level of oxygenation have been associated with adverse outcomes such as retinopathy of prematurity, intraventricular hemorrhage and increased mortality [6,7,8,9,10,11].

Despite being widely used in neonatal clinical practice, there are specific characteristics of the neonate which can challenge the accuracy of pulse oximetry systems, including low tissue translucency, poor perfusion and motion artifacts [1, 9,10,11,12]. Low translucency conditions are mainly due to dark skin pigmentation or thick tissue sites (pulse oximetry sensors are typically positioned on the neonates’ hands or feet, as their digits are simply too small). Moreover, neonates, especially those born preterm, have proportionally lower blood pressures, including a pulse pressure only about 50% of the magnitude of adults [13]. These infants also have immature peripheral vascular autoregulation, with diminished distal blood flow, further complicated by thermoregulatory instability [14]. The net effect of these factors is a marked reduction in the signal to noise ratio in pulse oximeters readings, which is difficult to replicate in healthy adult subjects on whom these devices are traditionally clinically verified.

Current methods for accuracy verification of a pulse oximeter used on neonates, outlined by the Food and Drug Administration (FDA) guidance [15] and by the International Organization for Standardization (ISO) 80601-2-61 standard [16], involve a controlled hypoxia study—limited to healthy adults for ethical and practical reasons—coupled with a convenience sampling study on neonates with arterial lines already in place for verification of clinical performance. These standardized methods are necessary to confirm the calibration curve for a pulse oximetry system [17], which is key for an accurate conversion from optical signals to peripheral oxygen saturation (SpO₂) readings. However, they do not challenge the pulse oximetry systems over the entire space of possible neonatal signals; specifically, they do not assess accuracy of these systems in the above-mentioned low translucency and low perfusion conditions at varying saturation levels.

Patient simulators, designed to create synthetic signals that mimic human vital signs and equipped with extensive configuration options (including varying simulated vital parameters and various signal artifacts), may fill the gap between controlled standardized studies and uncontrolled real-world clinical settings without posing additional risks to fragile subjects. Simulators cannot replace standardized clinical testing, as they are unable to fully reproduce the actual tissue-sensor interface [17], which is essential to assess the clinical accuracy of pulse oximetry systems. However, simulators can be used to challenge and improve the design of these systems and may provide a streamlined, cost-effective method for supplemental pre-clinical verification of their performance [18].

In this work, a cohort of real-world pulse oximetry data gathered from the NICU was used to inform a bench test comparison of SpO₂ monitoring performance of two widely used pulse oximetry systems in low-translucency and low-perfusion conditions representative of neonatal patients.

Methods

Pulse oximetry systems under test

This work compared the performance of the following two pulse oximetry systems:

(1)
Nellcor™ system: a Nellcor™ OxiMax™ N-600x patient monitor (Medtronic, Minneapolis, MN) coupled with a set of seven distinct DS100A-1 finger sensors.
(2)
Masimo® system: a Rad-97™ Pulse CO-Oximeter® (Masimo®, Irvine, CA) paired with a set of seven distinct RD SET™ DCI® finger sensors.

The number of tested finger sensors was chosen to provide a preliminary insight on performance variability within a pulse oximetry system.

Testing equipment

In pulse oximetry, simulators work by presenting pulse oximetry systems with optical signals appropriately designed to mimic the way in which the LED light emitted by a finger sensor is modulated when passing through real tissues before reaching the corresponding photodetector.

In this in silico study, we assessed how translucency, perfusion, peripheral oxygen saturation and heart rate (HR) impact on error in the SpO₂ readings displayed on the tested systems.

We designed a set of test parameters (detailed in Table 1) encompassing, for each of the four variables of interest, a range of values aligned to those expected in the NICU population. We were specifically interested in simulating the most critical values that could be encountered in NICU patients, to challenge pulse oximetry systems performance in conditions not typically assessed in clinical verification studies under current regulatory requirements.

Table 1 Simulated parameters and rationale for selection.

Full size table

Test parameters were generated using a ProSim™ 8 Vital Signs Simulator (Fluke®, Everett, WA). A SPOT Light SpO₂ Functional Tester (Fluke®, Everett, WA), consisting of an artificial finger on which the tested finger sensors were applied, was connected to the Simulator. Both Nellcor™ and Masimo® tested finger sensors had their calibration information encoded into the Simulator. Figure 1 provides an overview of the test setup.

Simulated clinical parameters

Translucency and perfusion parameters

Translucency and perfusion were the primary variables of interest for this testing.

Translucency represents the property of materials of allowing light to pass through them. In this work it was expressed in terms of % transmission (%T), a parameter approximately representing the percent amount of a sensor’s LED light that reaches the corresponding photodetector after passing through tissue (see Eq. (1) in the Supplementary Materials for further details). In a clinical setting, %T values depend on both tissue thickness and amount of melanin at monitoring sites, with lower %T typically associated to thick and darkly pigmented sites.

Perfusion is a variable representing the amount of blood in tissues, which varies in a pulsatile fashion. In this work, perfusion was defined in terms of % modulation (%MOD), an indicator of pulsatile optical signal strength that relies on correlation between the amount of light able to pass through tissues and hemodynamic changes [19,20,21].

Specifically, %MOD represents the percent ratio between the variable (due to attenuation of pulsing blood) and the constant (due to attenuation of other elements, like venous blood, water, bone, and melanin) components of the signal generated by a finger sensor’s LED light passing through a perfused tissue (see Eq. (2) in the Supplementary Materials for further details).

Simulated %T and %MOD parameters (Table 1) were selected based on the distribution of real-world data collected from NICU patients in two prior multicenter observational clinical studies. These studies were designed to follow FDA guidance [15] on verifying safe form, fit, and function of the OxySoft™ neonatal-adult SpO₂ sensor (Medtronic, Minneapolis, MN) against reference CO-oximetry measurements of arterial oxygen saturation from convenience arterial lines. NICU data were collected using a Nellcor™ OxiMax N-600x patient monitor, equivalent to the one tested in the present work, paired to OxySoft™ neonatal-adult SpO₂ sensors positioned on the neonates’ feet. Skin tone data were also collected from all NICU patients and categorized using a four-level scale.

These studies were conducted in accordance with the Declaration of Helsinki and all local regulatory requirements and were approved by the Western Institutional Review Board (IRB00000533) and the Timpanogos Regional Hospital’s Institutional Review Board (IRB00003926). Written informed consent, including consent to secondary use of clinical data for research purposes, was obtained from the subjects’ parents. All methods were performed in accordance with applicable governmental and institutional guidelines and regulations. The Supplementary Materials provide additional details on the equations and methods used to calculate %T and %MOD values from the reference NICU datasets and thus derive the set of simulated physiological parameters for testing.

SpO₂ and heart rate parameters

Simulated SpO₂ and HR parameters were selected based on supporting literature findings, as reported in Table 1. Of note, HR values were specifically selected to reproduce critical real-world conditions known to be challenging for pulse oximetry systems, such as cases of HR < 100 bpm, that usually prompt positive-pressure ventilation or other emergent interventions [22]. For all test cases the Simulator cardiac waveform was set to “Child Normal Sinus Rhythm” (NSR—Pediatric).

Testing procedure

For both Nellcor™ and Masimo® systems, all seven tested finger sensors were sequentially placed on the SpO₂ Functional Tester, as the Simulator was cycled automatically through all the combinations of the parameters in Table 1. All simulated values were within the performance range of the Simulator [23]. Parameter variation was performed via a pre-programmed set of nested loops: %T varied most slowly, then %MOD, then SpO₂, with HR varying in the innermost loop.

Pulse oximetry systems performance was tested in a best-case scenario, with the Simulator respiration and ambient-light artifacts kept inactivated and the system positioned in a light-shielded box to exclude any ambient-light artifact (Fig. 1). For each combination of parameters, data were collected from each finger sensor for 45 s. There were no predefined exclusion criteria for the collected data. Error-handling techniques were considered based on analysis of the collected data. Throughout all test runs, a fiducial mark was used to ensure finger sensors were consistently placed in the same position and orientation with respect to the SpO₂ Functional Tester.

Performance metric

The mean SpO₂ error (expressed as absolute value) was the performance metric for this test. For each combination of Simulator parameters, the mean SpO₂ error for each system was calculated as the weighted average of the difference over all seven finger sensors between SpO₂ values reported by the pulse oximetry systems and SpO₂ values input from the Simulator (Eq. (3) in the Supplementary Materials).

A mean SpO₂ error of 3% was selected as a threshold to identify potential performance concerns.

Statistical analysis of SpO₂ error

An additional analysis was conducted to support a quantitative comparison between Nellcor™ and Masimo® systems across selected points of interest of a signal space defined by the primary variables (%T, %MOD).

First, the mean SpO₂ error over a defined point of the (%T, %MOD) signal space was calculated for each individual finger sensor (Eq. (4) in the Supplementary Materials).

Then, for each test point, the seven estimates of the mean SpO₂ error for the Nellcor™ system were compared to the same seven estimates for the Masimo® system using a t-test (allowing for unequal variances for the pulse oximeter type).

For this test, we did assume a normal distribution of errors across all test parameters for both systems, as well as a normal distribution of performance amongst the seven tested sensors.

Statistical analyses were run through the statsmodels Python package [24]. Statistical significance was defined as p < 0.05 at 95% confidence level.

Results

Figure 2 shows mean SpO₂ errors for both Nellcor™ and Masimo® systems over the seven tested sensors as a function of the four test parameters. No error-handling techniques were needed on the data acquired from both systems.

**Fig. 2: Mean SpO₂ errors of Nellcor™ and Masimo® pulse oximetry systems as a function of %T (main horizontal axis, logarithmic scale) and %MOD (main vertical axis, logarithmic scale) settings.**

Datapoints in red represent the conditions where the mean SpO₂ error exceeded the pre-defined threshold of 3%. All numerical results are reported in the Supplementary Materials.

The mean SpO₂ error for the Nellcor™ system was ≤1.1% % across all parameters explored (0.23 ± 0.24% at %T > 1%; 0.17 ± 0.21% at %T ≤ 1%), suggesting consistent accuracy and repeatability even at the lowest translucency and perfusion settings. The Masimo® system performed well at higher translucencies (mean absolute error 0.33 ± 0.33% at %T > 1%); however, at the lower-translucency settings higher error was reported (2.79 ± 1.62% at %T ≤ 1%) as shown by the red datapoints in Fig. 2, particularly at lower SpO₂ values, but roughly independently of heart rate. The highest reported mean SpO₂ error was 1.1% for the Nellcor™ system and 6.4% for the Masimo® system.

To understand the challenging nature of the parameters used for this performance comparison, our reference data distribution derived from NICU clinical trials was overlaid to the same (%T, %MOD) signal space where the SpO₂ error data were plotted. The NICU data were collected from 34 neonates (mean age 3.3 ± 2.3 days) with the following representation of skin tones: “extremely dark hue”: 1/34 (2.94%), “dark olive hue”: 8/34 (23.53%), “olive hue”: 15/34 (44.12%) and “very light hue”: 10/34 (29.41%).

Figure 2 shows that most of the benchmark NICU data fall in a region of the (%T, %MOD) signal space where both systems showed an average SpO₂ error <1% (%T > 1%); however, considering the limited representation of neonates with darkly pigmented skin in the reference NICU dataset, it can be hypothesized that a wider real-world NICU distribution could further expand to the left of the plot (at %T < 1%), where the difference in SpO₂ errors between the two pulse oximetry systems was found to be even higher.

A statistical comparison between mean absolute SpO₂ errors for Nellcor™ and Masimo® systems was performed for regions of the (%T, %MOD) signal space selected based on the challenging nature of their parameters or on amount (or expected amount) of NICU data represented within them.

Results, reported in Table 2, show statistically significantly lower SpO₂ errors (p < 0.005) for the Nellcor™ system across all tested points of the space, with more consistent differences between the two systems at low %T values.

Table 2 Comparison between mean absolute SpO₂ errors for Nellcor™ and Masimo® systems across different regions of the (%T, %MOD) signal space.

Full size table

Confidence intervals show relatively small variance of the difference between mean absolute SpO₂ errors, particularly at higher %T and %MOD values.

Discussion

Since the first introduction of pulse oximeters to the NICU in the 1980s, they have become part of the standard of care and SpO₂ is widely considered the “fifth vital sign” [25, 26]. Major professional bodies, including the American Heart Association and the American Academy of Pediatrics, have endorsed the central role of continuous pulse oximetry monitoring in targeted oxygen saturation management during neonatal life support and in guiding judicious use of oxygen therapy during neonatal resuscitation, to avoid major morbidities that have been widely shown to be related to hypoxemia or hyperoxemia [2]. In agreement, consensus European guidelines recommend the use of pulse oximetry to monitor infants’ saturation during the first minutes after birth and to adjust oxygenation levels in neonates with respiratory distress syndrome [27].

Pulse oximetry is used throughout NICU hospitalization to manage many aspects of care including the titration of respiratory support and discharge readiness determination [1]. More recently, pulse oximetry has been used for the screening of CCHD in asymptomatic newborns [3, 4] and has demonstrated higher sensitivity compared to alternative strategies (such as prenatal screening and clinical examination) and a low false-positive rate [5].

In each of these use cases, the accuracy of pulse oximetry is vital; failure to intervene in a timely and appropriate manner could impact morbidity and mortality outcomes.

Although all currently available medical-grade pulse oximetry systems must comply with the same regulatory requirements, their real-world performance can vary significantly in conditions that are not typically tested in standardized clinical verification studies but which represent the most challenging portions (in terms of translucency and perfusion) of real-world NICU data distributions. Unfortunately, these challenging conditions are those in which pulse oximetry systems are relied upon most heavily in the clinical practice, as they are representative of instances of significant pathophysiology, such as critically ill neonates with poor perfusion. Although there has been increasing recognition of the limitations of pulse oximetry in recent years [28] (including for neonates, specifically [29, 30]), the overall awareness among health care providers of these potential technical deficiencies and of their impact on the direction or expediency of clinical intervention is still lacking.

In the last few decades, patient simulators have increasingly been used for supplemental bench testing of pulse oximetry system performance; however, as most of this work is done through manufacturer’s pre-market testing or routine inspection in hospital clinical engineering departments, published data is still scarce. Ganesh Kumar et al. recently tested SpO₂ and pulse rate accuracy of six pulse oximeters using an SpO₂ simulator and observed performance deterioration in over half of the tested devices in the presence of motion artifact and low perfusion conditions [18]. Although their study did not utilize neonate-specific parameters, their testing method parallels the one used in our study. The authors highlighted the ability of simulators to span a much wider range of variables and to achieve higher repeatability compared to what can typically be done in tests on volunteers in a simulated clinical setting. Further, they encouraged the use of simulators to challenge pulse oximeter performance in physiological or pathological conditions representative of specific patient populations. It is worth remembering, however, that while patient simulators can highlight areas of performance concern that would likely be confirmed in a clinical setting, they are not able to reproduce every facet of the complex physiological behaviors that occur in real subjects and other confounding factors.

In this work, we used a vital sign simulator coupled with a functional tester to investigate differences in the response of two pulse oximetry systems in non-clinically verified conditions representative of NICU patients. Over the entire set of test parameters, both Nellcor™ and Masimo® systems were found to perform within the combined accuracy specifications of each of the oximeters and of the Simulator [23, 31, 32], thus meeting performance requirements defined by current regulatory standards. However, when focusing on low translucency conditions (representative, per Fig. 2, of a consistent portion of a real-world NICU distribution), we observed that the two systems started to deviate in performance, with the Masimo® system reporting mean SpO₂ errors up to 3.9% in specific areas of interest (Table 2) and up to 6.4% when looking at specific simulated low saturation conditions (Fig. 2 and Supplementary Materials).

Though current standards for pulse oximeters performance verification define a 4% threshold for SpO₂ accuracy of these equipment when tested on healthy adults [15, 16], we set our threshold for mean SpO₂ error to 3% considering that stricter requirements are currently being envisioned [33] due to a concern for pulse oximeters disparate performance in the clinical practice [28].

While recognizing that synthetic data can only partially reproduce the variability of a clinical setting, we believe the amount of red datapoints in Fig. 2 justifies the concern that, in critical conditions, some pulse oximeters may show inaccurate readings that, in turn, may lead to erroneous clinical decisions.

Based on recently updated recommendations on CCHD screening in newborns [34], a difference greater than 3% in SpO₂ readings from two monitoring sites at SpO₂ levels between 90% and 95% represents a condition for screen failure, and thus prompts additional clinical assessment.

Though the tested oximetry systems didn’t show mean SpO₂ errors greater than 3% at SpO₂ levels above 90%, we believe these new recommendations support our selection of a 3% mean SpO₂ error as a suitable threshold to identify potential performance issues, as the magnitude of reported error shouldn’t exceed the quantity under investigation.

Our findings are further supported by results from a recent study by Gudelunas et al. on healthy adult volunteers with varying skin pigmentation, which also identified performance differences between Nellcor™ and Masimo® pulse oximetry systems [35]. The authors found that SpO₂ error (defined in their study as the difference between SpO₂ and oxygen saturation of arterial blood (SaO₂)) for both devices was dependent on the combined effect of the amount of melanin, perfusion, and degree of hypoxemia. Specifically, they observed that in subjects with darkly pigmented skin and low perfusion, missed hypoxemia events (defined as SaO₂ values < 88% and corresponding SpO₂ values between 92% and 96%) were found in 30.2% of the Masimo® readings and in 7.9% of the Nellcor™ readings.

Interestingly, the hypoxia testing by Gudelunas et al. showed a dependence of SpO₂ error on perfusion, whereas no correlation between SpO₂ error and %MOD was observed in our testing with the simulator. This finding might be explained by the intrinsic nature of the simulation testing, that allows a satisfactory evaluation of the electro-optical response of pulse oximetry systems to changes in translucency but, as mentioned, is not able to adequately reproduce the physiological tissue-related variables at varying perfusion conditions, which also impacts the response and overall accuracy of this equipment.

Considering that low perfusion states are even more frequent and consequential in real-world NICU populations (as shown by the consistent amount of NICU data at %MOD < 1% in Fig. 2) and the role of pulse oximetry systems in the decision to apply oxygen supplementation in such cases, it should be noted again the importance of not relying solely upon simulator testing for equipment performance characterization. Further research on the optical properties of physiological tissues and their optical interactions with pulse oximetry probes may lead to the development of improved simulators able to adequately reproduce a comprehensive range of real-world subjects and is therefore highly encouraged.

This study was designed to pave the way for the development of new pre-clinical testing methods aimed to obviate some of the limitations of current verification testing required for regulatory approval of pulse oximeters. Due to its pilot nature, it presents some limitations. First, performance was compared in a best-case scenario; a simple heart rate rhythm was set, ambient light was excluded, and no respiratory or other artifacts were activated on the simulator. Second, the dimension of our reference real-world dataset was limited, and so was the representation of darkly pigmented subjects. In addition, skin pigmentation was not assessed using a standardized color scale or spectrophotometric measures (whose usefulness has been highlighted in several recent works [36, 37]), which would be beneficial to include in future studies. Further, as our reference data were derived from NICU patients, the varying contributions of skin thickness and melanin to translucency values during development, or low perfusion conditions typical of the delivery room (which could also impact the width of real-world neonatal data distribution, along both the %T and the %MOD axes) could not be accounted for. However, we expect future, larger, real-world datasets used to inform in silico testing will include greater diversity across more dimensions than those included in our reference sample. It is important to recall that current verification testing conducted per FDA and ISO requirements do not incorporate neonates at all. Third, the performance metric we used (mean SpO₂ error), which was chosen for the sake of simplicity, is not the one defined in regulatory standards to determine pulse oximeter accuracy. However, as SpO₂ accuracy is derived from both mean SpO₂ error and its standard deviation, an increase in mean SpO₂ error will also impact more complex performance metrics. Fourth, the sensors used in this bench testing (Nellcor™ DS100A-1 and Masimo® RD SET™ DCI® sensors) are not indicated for use on neonates but were chosen due to their compatibility with the selected simulator. Neonate-specific sensors may have different performance characteristics which, however, would not be expected to invalidate the main findings of this work.

Despite these limitations, our study highlights the role that in silico testing can have in pulse oximeters verification. Specifically, this work shows how simulators can be used to extend the range of tested clinical scenarios to include challenging edge cases where pulse oximeter accuracy is most challenged but also most critical in clinical decision making. Although pulse oximeters should not be used as the sole basis for diagnosis or treatment decisions, it remains intuitive that best care is achieved by having the most accurate measurements. Reliability demonstrated through expanded testing, including all clinical conditions of interest, could have a significant impact on both healthcare equity and costs.

Conclusion

Clinicians should be aware that pulse oximetry system performance may vary based on translucency and perfusion. This laboratory-based work compared SpO₂ error between two commercially available pulse oximetry systems and identified a persistent performance difference between the two systems at low translucency values, which closely model the characteristics of real NICU patients.

While the current standard of clinical hypoxia testing for regulatory approval of oximeters may be sufficient to assess and verify device performance on many patients in many scenarios, such testing is not comprehensive and does not challenge the oximeters in the settings in which accuracy is most important. Further research in the use of synthetic data for bench testing in pulse oximetry is encouraged. Clinicians, manufacturers and regulators should consider the development of a standardized simulator-based pre-clinical testing method that spans a wide set of perfusion, absorption, and cardiovascular parameters aligned with real-world variability and not systematically reproducible in clinical settings. Such a joint effort would contribute towards more transparent and equitable healthcare and more dependable devices.

Data availability

All data that support the findings of this study are included in this article and in the Supplementary Materials. The code used to generate results is available from the corresponding author upon reasonable request.

Change history

18 September 2025
A Correction to this paper has been published: https://doi.org/10.1038/s41372-025-02405-y

References

Dawson JA, Davis PG, O’Donnell CP, Kamlin CO, Morley CJ. Pulse oximetry for monitoring infants in the delivery room: a review. Arch Dis Child Fetal Neonatal Ed. 2007;92:F4–7.
Article CAS PubMed PubMed Central Google Scholar
Aziz K, Lee HC, Escobedo MB, Hoover AV, Kamath-Rayne BD, Kapadia VS, et al. Part 5: neonatal resuscitation: 2020 American Heart Association guidelines for cardiopulmonary resuscitation and emergency cardiovascular care. Circulation. 2020;142:S524–50.
Article PubMed Google Scholar
Kemper AR, Mahle WT, Martin GR, Cooley WC, Kumar P, Morrow WR, et al. Strategies for implementing screening for critical congenital heart disease. Pediatrics. 2011;128:e1259–67.
Article PubMed Google Scholar
Engel MS, Kochilas LK. Pulse oximetry screening: a review of diagnosing critical congenital heart disease in newborns. Med Devices. 2016;9:199–203.
Article Google Scholar
Thangaratinam S, Brown K, Zamora J, Khan KS, Ewer AK. Pulse oximetry screening for critical congenital heart defects in asymptomatic newborn babies: a systematic review and meta-analysis. Lancet. 2012;379:2459–64.
Article PubMed Google Scholar
Carlo WA, Finer NN, Walsh MC, Rich W, Gantz MG, Laptook AR, et al. Target ranges of oxygen saturation in extremely preterm infants. N. Engl J Med. 2010;362:1959–69.
Article CAS PubMed Google Scholar
Stenson BJ, Tarnow-Mordi WO, Darlow BA, Simes J, Juszczak E, Askie L, et al. Oxygen saturation and outcomes in preterm infants—Boost Ii United Kingdom Collaborative Group—Boost Ii Australia Collaborative Group—Boost Ii New Zealand Collaborative Group. N Engl J Med. 2013;368:2094–104.
PubMed Google Scholar
The Boost-II Australia and United Kingdom Collaborative G, Tarnow-Mordi W, Stenson B, Kirby A, Juszczak E, Donoghoe M, et al. Outcomes of two trials of oxygen-saturation targets in preterm infants. N Engl J Med. 2016;374:749–60.
Article Google Scholar
Wackernagel D, Blennow M, Hellstrom A. Accuracy of pulse oximetry in preterm and term infants is insufficient to determine arterial oxygen saturation and tension. Acta Paediatr. 2020;109:2251–7.
Article CAS PubMed Google Scholar
Poets CF, Wilken M, Seidenberg J, Southall DP, von der Hardt H. Reliability of a pulse oximeter in the detection of hyperoxemia. J Pediatr. 1993;122:87–90.
Article CAS PubMed Google Scholar
Hay WW Jr, Brockway JF, Eyzaguirre M. Neonatal pulse oximetry: accuracy and reliability. Pediatrics 1989;83:717–22.
Khoury R, Klinger G, Shir Y, Osovsky M, Bromiker R. Monitoring oxygen saturation and heart rate during neonatal transition. Comparison between two different pulse oximeters and electrocardiography. J Perinatol. 2021;41:885–90.
Article CAS PubMed Google Scholar
Vega-Barrera C, Muraskas J, Guo R, Ray B. The pulse pressure in a premature infant less than 37 weeks gestational age with a patent ductus arteriosus. Open J Pediatr. 2013;03:99–104.
Article Google Scholar
Kissack CM, Weindling AM. Peripheral blood flow and oxygen extraction in the sick, newborn very low birth weight infant shortly after birth. Pediatr Res. 2009;65:462–7.
Article PubMed Google Scholar
FDA. U.S. Department of Health and Human Services, Food and Drug Administration—Pulse Oximeters—Premarket Notification Submissions [510(k)s]—Guidance for Industry and Food and Drug Administration Staff. Silver Spring, MD USA: FDA; 2013.
ISO. International Organization for Standardization—Medical electrical equipment Part 2-61: particular requirements for basic safety and essential performance of pulse oximeter equipment. Geneva, Switzerland: ISO; 2017.
Mannheimer PD. The light-tissue interaction of pulse oximetry. Anesth Analg. 2007;105:S10–7.
Article PubMed Google Scholar
Ganesh Kumar M, Kaur S, Kumar R. Laboratory evaluation of performance of pulse oximeters from six different manufacturers during motion artifacts produced by Fluke 2XL SpO(2) simulator. J Clin Monit Comput. 2022;36:1181–91.
Article PubMed Google Scholar
Sahni R. Noninvasive monitoring by photoplethysmography. Clin Perinatol. 2012;39:573–83.
Article PubMed Google Scholar
Lima A, Bakker J. Noninvasive monitoring of peripheral perfusion. Intensive Care Med. 2005;31:1316–26.
Article PubMed Google Scholar
Kroese JK, van Vonderen JJ, Narayen IC, Walther FJ, Hooper S, te Pas AB. The perfusion index of healthy term infants during transition at birth. Eur J Pediatr. 2016;175:475–9.
Article PubMed Google Scholar
Dawson JA, Saraswat A, Simionato L, Thio M, Kamlin CO, Owen LS, et al. Comparison of heart rate and oxygen saturation measurements from Masimo and Nellcor pulse oximeters in newly born term infants. Acta Paediatr. 2013;102:955–60.
Article CAS PubMed Google Scholar
Fluke Biomedical; ProSim(TM) 8 Vital Signs Simulator Users Manual. Rev. 3, 2011. available at https://www.flukebiomedical.com/sites/default/files/resources/ProSim8_umeng0300.pdf.
Statsmodels version 0.14.1. available at: https://www.statsmodels.org/stable/index.html.
Chan ED, Chan MF, Chan MM. Pulse oximetry: understanding its basic principles facilitates appreciation of its limitations. Respir Med. 2013;107:789–99.
Mower WR, Sachs C, Nicklin EL, Baraff LJ. Pulse oximetry as a fifth pediatric vital sign. Pediatrics. 1997;99:681–6.
Article CAS PubMed Google Scholar
Sweet DG, Carnielli VP, Greisen G, Hallman M, Klebermass-Schrehof K, Ozek E, et al. European consensus guidelines on the management of respiratory distress syndrome: 2022 update. Neonatology. 2023;120:3–23.
Article PubMed Google Scholar
Sjoding MW, Dickson RP, Iwashyna TJ, Gay SE, Valley TS. Racial bias in pulse oximetry measurement. N Engl J Med. 2020;383:2477–8.
Article PubMed PubMed Central Google Scholar
Vesoulis Z, Tims A, Lodhi H, Lalos N, Whitehead H. Racial discrepancy in pulse oximeter accuracy in preterm infants. J Perinatol. 2022;42:79–85.
Article PubMed Google Scholar
Foglia EE, Whyte RK, Chaudhary A, Mott A, Chen J, Propert KJ, et al. The effect of skin pigmentation on the accuracy of pulse oximetry in infants with hypoxemia. J Pediatr. 2017;182:375–7.e2.
Article PubMed Google Scholar
Masimo. Rad-97™ Pulse CO-Oximeter Operator’s Manual—38281/LAB-9275D-0518. 2018.
Covidien. Nellcor N-600x Pulse Oximeter Operator’s Manual—Part No. 10074884 Rev C 2014-02. 2014.
FDA. U.S. Department of Health and Human Services, Food and Drug Administration—Pulse Oximeters for Medical Purposes—Non-Clinical and Clinical Performance Testing, Labeling, and Premarket Submission Recommendations—Draft Guidance for Industry and Food and Drug Administration Staff. Silver Spring, MD USA: FDA; 2025.
Oster ME, Pinto NM, Pramanik AK, Markowsky A, Schwartz BN, Kemper AR, et al. Newborn screening for critical congenital heart disease: a new algorithm and other updated recommendations: clinical report. Pediatrics. 2025;155:5.
Gudelunas MK, Lipnick M, Hendrickson C, Vanderburg S, Okunlola B, Auchus I, et al. Low perfusion and missed diagnosis of hypoxemia by pulse oximetry in darkly pigmented skin: a prospective study. Anesth Analg. 2024;138:552–61.
Article CAS PubMed Google Scholar
Verkruysse W, Jaffe MB, Lipnick M, Zemouri C. Challenges of subjective skin color scales: the case for the use of objective pigmentation measurement methods in regulatory pulse oximetry studies. Anesth Analg. 2024;139:e15–7.
Article PubMed Google Scholar
Lipnick MS, Chen D, Law T, Moore K, Lester JC, Monk EP. et al. Comparison of methods for characterizing skin pigment diversity in research cohorts. medRxiv 2025; 2025.02.21.25322707.
Dawson JA, Kamlin CO, Vento M, Wong C, Cole TJ, Donath SM, et al. Defining the reference range for oxygen saturation for infants after birth. Pediatrics. 2010;125:e1340–7.
Article PubMed Google Scholar

Download references

Acknowledgements

Medical writing support was provided by Ilaria Pietta of Medtronic (Milan, Italy) in accordance with Good Publication Practice (GPP) guidelines.

Funding

All work completed in this report was sponsored and funded by Medtronic. The authors had access to all data and final responsibility for the decision to submit for publication. The authors were not paid to write this article by the sponsor or any other agency.

Author information

Authors and Affiliations

StarFish Medical, Victoria, BC, Canada
Brian King
Research and Development, Acute Care Monitoring, Medtronic, Boulder, CO, USA
Jake Dove
Research and Development, Acute Care Monitoring, Medtronic, Midlothian, UK
Scott J. McGonigle
Duke University Medical Center, Durham, NC, USA
Warwick A. Ames
Department of Pediatrics, Washington University School of Medicine, St. Louis, MO, USA
Zachary A. Vesoulis

Authors

Brian King
View author publications
Search author on:PubMed Google Scholar
Jake Dove
View author publications
Search author on:PubMed Google Scholar
Scott J. McGonigle
View author publications
Search author on:PubMed Google Scholar
Warwick A. Ames
View author publications
Search author on:PubMed Google Scholar
Zachary A. Vesoulis
View author publications
Search author on:PubMed Google Scholar

Contributions

BK and JD contributed to conception and design of testing, acquisition and analysis of data, and to writing of the manuscript. ZAV contributed to conception of the manuscript flow, interpretation of the data and to writing of the manuscript. SMG contributed to analysis and interpretation of the data. WA contributed to interpretation of the data. All authors critically reviewed and approved the final manuscript and agreed to be accountable for the accuracy or integrity of the work.

Corresponding author

Correspondence to Zachary A. Vesoulis.

Ethics declarations

Competing interests

JD and SMG are full-time employees of Medtronic. BK (or his institution) received research support from Medtronic to conduct the bench testing. WA and ZV are paid consultants to Medtronic. Per Medtronic internal publication procedure, any undue influence, contribution or review from Medtronic employees who could indirectly benefit from publication of this work was prohibited. No one except the authors was entitled to approve for submission.

Ethical approval

IRB approval was obtained for the human subject portion of this study from the Western Institutional Review Board and the Timpanogos Regional Hospital’s Institutional Review Boards. Informed written consent was obtained from the parents or legal guardian prior to any study procedures. This study was performed in accordance with the Declaration of Helsinki.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplemental material

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

King, B., Dove, J., McGonigle, S.J. et al. Identifying performance differences between two pulse oximetry systems in simulated critical neonatal conditions. J Perinatol 45, 1608–1614 (2025). https://doi.org/10.1038/s41372-025-02364-4

Download citation

Received: 19 December 2024
Revised: 20 June 2025
Accepted: 09 July 2025
Published: 30 July 2025
Version of record: 30 July 2025
Issue date: November 2025
DOI: https://doi.org/10.1038/s41372-025-02364-4

Subjects

Abstract

Objective

Study design

Results

Conclusion

Similar content being viewed by others

Open Access Dataset and Common Data Model for Pulse Oximeter Performance Data

Clinical feasibility of an advanced neonatal epidermal multiparameter continuous monitoring technology in a large public maternity hospital in Nairobi, Kenya

Smartphone camera oximetry in an induced hypoxemia study

Introduction

Methods

Pulse oximetry systems under test

Testing equipment

Simulated clinical parameters

Translucency and perfusion parameters

SpO2 and heart rate parameters

Testing procedure

Performance metric

Statistical analysis of SpO2 error

Results

Discussion

Conclusion

Data availability

Change history

18 September 2025

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Ethical approval

Additional information

Supplementary information

Supplemental material

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links

SpO₂ and heart rate parameters

Statistical analysis of SpO₂ error