Development of a prediction model for mortality in infants undergoing therapeutic hypothermia for neonatal encephalopathy

Mitchell, Jill M.; Rodrigues, Clare L.; Dunworth, Margo; Mc Ginley, Julie; O’Byrne, Laura; Hall, Roisin; Corcoran, Paul; Campillo, Indra San Lazaro; Mc Kenna, Peter; Higgins, John R.; McKernan, Joye; Khashan, Ali S.; Maher, Gillian M.; Murphy, John; Walsh, Brian H.; Greene, Richard A.; McCarthy, Fergus P.

doi:10.1038/s41372-025-02547-z

Download PDF

Article
Open access
Published: 05 January 2026

Development of a prediction model for mortality in infants undergoing therapeutic hypothermia for neonatal encephalopathy

Jill M. Mitchell ORCID: orcid.org/0000-0001-5195-0522¹,
Clare L. Rodrigues¹,
Margo Dunworth²,
Julie Mc Ginley²,
Laura O’Byrne¹,
Roisin Hall³,
Paul Corcoran³,
Indra San Lazaro Campillo³,
Peter Mc Kenna²,
John R. Higgins¹,
Joye McKernan³,
Ali S. Khashan^4,5,
Gillian M. Maher ORCID: orcid.org/0000-0002-6722-0484^4,5,
John Murphy²,
Brian H. Walsh ORCID: orcid.org/0000-0003-4665-6884⁴,
Richard A. Greene ORCID: orcid.org/0000-0003-0247-523X^1,3 &
…
Fergus P. McCarthy^1,4

Journal of Perinatology (2026)Cite this article

1722 Accesses
15 Altmetric
Metrics details

Subjects

Abstract

Objective

To develop and internally validate a model predicting neonatal mortality in infants with neonatal encephalopathy requiring therapeutic hypothermia (TH), using national data.

Study design

Data from 385 infants treated with TH across 19 hospitals (2016–2021) were analysed. Multivariable logistic regression with backward stepwise selection was applied. Discrimination was assessed using the C-statistic, with internal validation by bootstrapping. The THERM (Therapeutic Hypothermia Early Risk Model for Mortality) tool was developed to calculate individualised mortality risk.

Results

Forty-six infants (11.9%) died within 28 days. Four predictors were retained: prelabour Caesarean section, adrenaline use, base excess ≤–22 mmol/L, and seizures during the first day of life. The model demonstrated excellent discrimination [optimism-adjusted C-statistic 0.885 (95% CI: 0.827–0.936)].

Conclusions

Four routinely collected variables predicted mortality in infants undergoing TH. The THERM tool provides a practical resource for clinicians, enabling personalised risk assessment and supporting parental counselling during the first day of life.

Evaluation of heterogeneity in effect of therapeutic hypothermia by sex among infants with neonatal encephalopathy

Article 03 April 2023

Process variations between Swiss units treating neonates with hypoxic-ischemic encephalopathy and their effect on short-term outcome

Article Open access 21 July 2021

Impact of teleneonatology on time to goal temperature in outborn neonates with hypoxic-ischemic encephalopathy requiring therapeutic hypothermia

Article 29 May 2025

Introduction

Neonatal encephalopathy (NE) is a significant cause of morbidity and mortality in term infants [1, 2]. Therapeutic hypothermia (TH) is the standard treatment for moderate to severe NE and has been shown to reduce mortality and improve neurodevelopmental outcomes in survivors [3, 4]. Despite its effectiveness, there remains a need for accurate early prediction models to enhance parental counselling.

Among outcomes following NE, death is the most immediate to prognosticate and carries profound implications for both clinicians and families [5]. Accurate early prognostication of death allows parents to begin processing the possibility of goals of care discussions or loss, and to make preparations per family, cultural, or religious needs [5]. Prediction models can aid this process by offering individualised, evidence-based mortality risk estimates. Importantly, such tools are intended to support parental counselling and emotional preparation, not to guide decisions about initiating or continuing therapeutic hypothermia, which should be made based on established clinical criteria.

Ambalavanan et al. developed a prediction model for neonatal death in infants with NE, using American data from over 20 years ago [6]. Advances in clinical practices may limit the accuracy of this model in a modern setting, and it has not been validated. More recent studies have been region-specific or rely on magnetic resonance imaging (MRI), which limits their applicability in high-income settings or for early clinical use [7,8,9]. Glass et al. developed a prediction model for the composite outcome of death or severe neurodevelopmental impairment, but did not provide performance metrics for death as an isolated outcome and included death up to 2 years of life [10].

This study aims to develop a prediction model using early neonatal risk factors to predict neonatal mortality in infants with neonatal encephalopathy requiring TH, using data from 19 maternity hospitals. The model is designed to facilitate clearer, more informed parental counselling in the early days of life, without influencing clinical decision-making regarding the use of TH.

Methods

Study population

We used data from paired mother-infant medical records of infants who received TH across 19 maternity hospitals in the Republic of Ireland between 2016 to 2021, collected through the National Neonatal Therapeutic Hypothermia Development Project.

Data access approval was granted from the National Neonatal Steering Committee, the National Perinatal Epidemiology Centre, the National Women and Infant Health Programme, and the National Clinical Programme for Paediatrics and Neonatology for the use of anonymised data for this project.

Infants diagnosed with neonatal encephalopathy and requiring TH were included in the study. Those with genetic anomalies were excluded. This study followed the TRIPOD guidelines (available at https://www.tripod-statement.org).

Variable selection

We selected routinely measured predictors by considering antenatal factors, delivery factors, early neonatal evaluations, and laboratory results. Our approach combined expert opinions from neonatologists and obstetricians, alongside a review of existing literature, taking into account the available data [2, 4, 6,7,8, 11,12,13,14]. The variables; parity, maternal age, body mass index (BMI), employment status, smoking status at booking, maternal medical condition in current pregnancy, previous Caesarean section, presence of meconium, gestational age at delivery, sex of infant, birth weight, mode of delivery, birth status, occurrence of acute perinatal event, time of birth, day of birth, 5-min Apgar score < 3, establishment of spontaneous respiration during resuscitation, use of adrenaline during resuscition, chest compressions, intubation, base excess ≤ −22 mmol/L, pH ≤ 6.70, seizures during day one of life were considered as potential predictors. Parity was recorded as the number of completed pregnancies ≥24 weeks and recategorised as nulliparous or multiparous. Maternal age, measured in years, was retained as a continuous variable and was also assessed as a categorical variable (≤24 years, 25–34 years, 35–39 years and ≥40 years). BMI was categorised into ≤24.9 kg/m², 25.0–29.9 kg/m² and ≥30 kg/m². Employment status was categorised into employed, unemployed/student, and homemaker. Smoking status at booking was categorised as non-smoker and smoker. Maternal medical condition in pregnancy was defined as hypertensive disorders (including pre-eclampsia, pregnancy-induced hypertension, and essential hypertension), diabetes mellitus (either gestational or pre-existing), or thyroid disease (hypothyroidism or hyperthyroidism) in the current pregnancy. Mode of delivery was categorised as spontaneous vaginal delivery, operative vaginal delivery, Caesarean section after the onset of labour, or prelabour Caesarean section. Descriptive statistics were employed to analyse the category “prelabour Caesarean section” further to differentiate elective prelabour Caesarean sections and emergency prelabour Caesarean sections, whereby the Cesarean section was performed before the onset of labour, due to concerns for the well-being of either the mother or the baby. Birth status was defined as inborn or outborn. TH is exclusively provided in the four tertiary NICUs in Ireland. All other maternity sites transfer eligible newborns to one of these four units to receive TH [15]. “Inborn” was defined as birth at a site performing TH, while “outborn” included all infants born in an external location and transferred to the tertiary units for TH. The variables: ‘presence of meconium, 5-min Apgar score < 3, intake of a spontaneous breath during resuscitation, use of adrenaline during resuscitation, chest compressions, and intubation’ were also categorised separately as yes/no. Similarly, the variable “occurrence of an acute perinatal event” was also binary (yes/no), with perinatal event defined as umbilical cord prolapse, uterine rupture, antenatal haemorrhage, shoulder dystocia, or severe fetal heart rate abnormality [4]. Thresholds for base excess (≤−22 mmol/L) and pH (≤6.70) were derived from previously published research [6]. These measurements included umbilical artery and venous blood gases, as well as the first blood gas analysis from the infant within the first hour of life. The variables “seizures during day one of life, intake of a spontaneous breath during the initial resuscitation and use of adrenaline during resuscitation” were categorised separately as yes/no. Seizure activity during the first day of life was classified following review of medical notes by the study team. Infants were classified as having seizures if there was documented clinical seizure activity during the first day of life, confirmed by electroencephalogram (EEG) when available, and initiation of antiepileptic treatment. Diagnoses were made by senior neonatologists (consultants or senior registrars).

Outcome measure

The outcome of interest was neonatal mortality, defined as neonatal death within the first 28 days of life, consistent with the World Health Organization definition of neonatal death [16].

Statistical analysis

Statistical analysis was performed using Stata BE 18.5, SPSS version 29, and Microsoft Excel 16.90.2. Demographic factors were examined using descriptive analysis. Unadjusted logistic regression analysis examined associations between candidate predictors and the odds of neonatal death. Significant variables (p < 0.1) were entered into the multivariable logistic regression analysis. Variables with more than 15% missing data were excluded from the analysis; however, separate multivariable analyses were subsequently performed to assess the effect of these variables on the model’s predictive ability.

Initial prediction model

A multivariable logistic regression with backward stepwise selection (p ≥ 0.1 for exclusion) was used to develop the initial prediction model [17]. All candidate predictors were included, and the least statistically significant predictors were removed one by one [17, 18]. We used Little’s test, utilising the mcartest command, to assess the assumption of missingness being completely at random (MCAR) [19]. A complete case analysis of all variables included was conducted to address missing values [20].

Simplified prediction model

We then developed a simplified prediction model by removing one candidate variable at a time, completely excluding it from the candidate set, prior to running the backward stepwise selection. This iterative process continued until we obtained the simplest model that retained its predictive ability.

Sample size

The sample size calculation was performed using the pmsampsize command. Assuming an outcome event proportion (prevalence) of 0.12, a target shrinkage factor of 0.85, a c-statistic of 0.81 [6,7,8], and 10 candidate predictors/categories, a minimum sample size of 353 (with 42 events) would be required to minimise overfitting [21].

Model performance and internal validation

Model performance was evaluated by examining overall fit, discrimination, and calibration. Overall fit was assessed with the Brier Score and Cragg & Uhler’s (Nagelkerke) R². Discrimination was evaluated using the area under the ROC C-statistic. Calibration was assessed using calibration-in-the-large (CITL) and calibration slope (C-slope). Bootstrapping was performed for internal validation (with 100 repetitions) to assess overfitting and calculate the optimism-adjusted C-statistic, CITL, and C-slope. Supplementary information 1 (A1) provides additional details on the accepted thresholds for model performance metrics.

Development of an interactive prediction tool

To facilitate the practical application of our prediction model, we developed an interactive Excel-based tool called the THERM tool (Therapeutic Hypothermia Early Risk Model for Mortality). The tool incorporates variables identified in the final multivariable logistic regression model, with each variable weighted according to its corresponding regression coefficient, reflecting its relative contribution to the overall risk. The tool computes the total log-odds using the logistic regression formula and transforms it into a probability using the inverse logit function. Additionally, an explanatory sheet provides clear definitions and coding for each variable, ensuring ease of use and accessibility for clinicians.

Confusion matrix

To provide additional clinical context, we used the simplified prediction model, selected for its parsimony and clinical applicability, to classify infants as predicted to survive or die at a 50% probability threshold, and constructed a confusion matrix to compare predicted with observed outcomes.

Results

Descriptive statistics

Characteristics of the study participants are presented in Table 1. A complete descriptive analysis of the cohort, including maternal demographics and medical history, labour and delivery details, neonatal resuscitation, and biochemical characteristics, is documented in Tables A1 to A3.

Table 1 Characteristics of study participants.

Full size table

Out of 415 infants who required TH, nine were excluded due to genetic anomalies. In the initial prediction model, 6.9% of the data for candidate variables were missing, while 5.1% were missing in the simplified model. Little’s test indicated that the missing data did not deviate from the missing completely at random (MCAR) assumption (p = 0.908). The complete case analysis resulted in a final cohort of 378 infants for the initial prediction model, among whom 44 (11.6%) died, and 385 infants for the simplified prediction model, among whom 46 (11.9%) died (Fig. 1). All deaths occurred within the first 28 days of life; no additional deaths were recorded up to two years of follow-up. Descriptive statistics were performed for 385 mother–infant pairs included in the simplified model.

**Fig. 1: Graphical overview of methodology and sample size.**

Of the included mothers, 58.7% (n = 228) were primiparous, and the majority (89.5%, n = 342) were White. Most deliveries occurred at term with 2.7% (n = 11) delivered before 36 weeks, 5.7% (n = 23) delivered between 36 + 0 and 36 + 6 weeks, 68.0% (n = 261) between 37 + 0 and 40 + 6 weeks, and 23.4% (n = 90) at or beyond 41 weeks (with 0.3%, n = 1 not documented).

Regarding mode of delivery, 23.9% (n = 92) had a spontaneous vaginal delivery, 31.9% (n = 123) had an operative vaginal delivery, 22.9% (n = 88) underwent a Caesarean section after labour onset, and 21.3% (n = 82) had a prelabour Caesarean section.

Most prelabour Caesarean sections in this cohort were emergency procedures (97.6%, n = 80). Acute perinatal events occurred in 75.1% (n = 289) of cases overall, rising to 90.2% (n = 74) among prelabour Caesareans. Rates were also high for Caesareans after labour onset (85.2%), operative vaginal deliveries (79.7%), and lower for spontaneous vaginal deliveries (45.7%). Differences across delivery modes were statistically significant (p < 0.001; Table A4).

Unadjusted logistic regression analysis

The variables; sex of infant, presence of meconium, occurrence of an acute perinatal event, mode of delivery (specifically prelabor Caesarean section), use of adrenaline during resuscitation, chest compressions, 5 min Apgar score < 3, establishment of spontaneous respiration during resuscitation, intubation, base excess ≤ −22 mmol/L, pH ≤ 6.7 and seizures during day one of life were significant in the unadjusted logistic regression analysis (p < 0.1). The following variables were not statistically significant at p < 0.1: parity, body mass index, employment status, smoking status at booking, maternal condition in the current pregnancy, previous Cesarean section, gestational age at delivery, birth status, time of birth, day of birth, and maternal age.

Initial prediction model

The initial analysis identified four variables as the best combined predictors of neonatal mortality: mode of delivery (prelabour Caesarean section), use of adrenaline during resuscitation, base excess ≤ −22 mmol/L, and seizures during the first day of life (Table 2).

Table 2 Best combined predictors of neonatal mortality in infants requiring therapeutic hypothermia, initial model.

Full size table

Simplified Prediction Model. For the simplified model, we removed one candidate variable at a time, completely excluding it from the candidate set, before running the multivariable logistic regression with backwards stepwise selection. This resulted in the inclusion of the candidate variables: mode of delivery, use of adrenaline during resuscitation, establishment of spontaneous respiration during resuscitation, intubation, base excess ≤ −22 mmol/L, and seizures during day one of life into the multivariable logistic regression model. The same four variables were identified in the simplified model as in the initial model (mode of delivery [prelabour Caesarean section], use of adrenaline during resuscitation, base excess ≤ −22 mmol/L, and seizure during day one of life) (Table 3).

Table 3 Best combined predictors of neonatal mortality in infants requiring therapeutic hypothermia, simplified model.

Full size table

Prediction tool

These four predictors were used to develop the THERM tool (Supplementary Material 2). For example, when a hypothetical case with a prelabour Caesarean section, adrenaline use during resuscitation, and base excess ≤ −22 mmol/L was entered, the tool generated a predicted mortality probability of 67%. The predicted probability of death increases to 87% when all four risk factors are present.

Model performance and internal validation

For the initial model, the overall performance was good, with a Brier Score of 0.069 and Cragg & Uhler’s R² score of 0.43 [22,23,24]. The original apparent C-statistic was 0.900 (95% CI 0.854, 0.945), indicating excellent discriminative ability [25]. Bootstrapping adjusted the C-statistic to 0.873 (95% CI 0.816, 0.917), supporting the model’s capability to predict neonatal mortality effectively (Table 4).

Table 4 Assessment of model performance.

Full size table

Similarly, the simplified model performed well, with a Brier Score of 0.069 and Cragg & Uhler’s R² score of 0.459. The original apparent C-statistic was 0.903 (95% CI, 0.860–0.948), demonstrating the excellent discriminative ability of the model [25]. The original apparent C-statistic was 0.903 (95% CI 0.860–0.948), and after bootstrapping, the adjusted C-statistic remained high at 0.885 (95% CI 0.827–0.936) (Table 4). The miscalibration in CITL and C-slope was small in both the initial and simplified models, suggesting that overfitting was unlikely to be an issue.

In the unadjusted analysis, the variable “presence of meconium” was significant (OR = 1.84, 95% CI: 0.91–3.70, p = 0.09). However, due to 16.5% missing data, it was excluded from the primary multivariable analysis. A separate multivariable logistic regression model that included the variable’ presence of meconium’ did not improve the model’s predictive ability (AUC = 0.892, 95% CI: 0.840–0.944; see Table A5).

The calibration plots (Figs. A1 and A2) suggest that the average model predictions closely match the observed outcomes across ten groups of patients (i.e., deciles of risk were used as cut-off points to compare observed and expected probabilities within groups of individuals), indicating good calibration for both models. Most deciles were clustered at the lower end of the risk spectrum, reflecting the majority of individuals having low predicted probabilities of the event. The Lowess smoother highlighted minimal miscalibration at higher predicted probabilities; however, these estimates are based on limited data, as indicated by the sparse spike plot at higher risk levels.

At a 50% probability threshold, the simplified model correctly identified 329 of 354 infants predicted to survive (92.9%) and 21 of 31 infants predicted to die (67.7%) (Table A7).

Discussion

Principal findings

This study developed and internally validated a risk prediction model for neonatal mortality in infants with neonatal encephalopathy undergoing therapeutic hypothermia. We identified four variables routinely collected during the early neonatal resuscitation period to predict mortality in this population. These variables included mode of delivery (prelabour Caesarean section), use of adrenaline during resuscitation, base excess ≤ −22 mmol/L, and the presence of seizures during day one of life. The concordance of the same four predictors in both the initial and simplified models demonstrates their association with neonatal mortality in infants undergoing TH, reinforcing the credibility and stability of the THERM tool for individualised risk assessment. The association between prelabour Caesarean section and neonatal mortality is likely due to the high percentage of infants that had an acute perinatal event within this category. Notably, nearly all prelabour Caesarean sections were emergency procedures rather than elective procedures, meaning that the Caesarean section was performed before the onset of labour, due to concerns for the well-being of either the mother or the baby.

Both the initial and simplified models demonstrated excellent discrimination (optimism-adjusted C-statistic 0.873 in the initial model and 0.885 in the simplified model) and good calibration. The calibration plot indicated that model predictions closely matched observed outcomes across risk groups. To provide a more clinically intuitive summary, we examined how well the simplified model’s predictions aligned with actual outcomes at a 50% probability threshold. Presenting the results in this way provides a clearer clinical perspective: if the model predicts survival, almost 93% of infants do in fact survive, whereas if it predicts death, about two-thirds of infants die. This highlights that the model is most reliable when predicting survival, while predictions of death are less certain but still useful to inform counselling conversations with families. These findings suggest that the model may be most helpful in reassuring families when survival is predicted, while still offering valuable, though less certain, guidance when death is predicted.

Comparison to previous research

This model showed improved discrimination compared to previous models [6, 7]. Ambalavanan et al. developed a prediction model for neonatal death in infants with neonatal encephalopathy, achieving an AUC of 0.81 [6]. Similar to the present study, their model identified base deficit >22 mmol/L as a significant predictor of neonatal mortality. While the authors also used variables routinely collected during the early neonatal resuscitation period, their chosen predictors differed from those in our study (decerebrate posture, absent suck, absence of antepartum haemorrhage and base deficit of first postnatal gas > 22 mmol/L). Significantly, this model was based on American data collected over 20 years ago and requires updating to reflect current clinical practices and populations. Additionally, the study did not include a sample size calculation, nor did it perform internal validation or apply methods to adjust for overfitting.

Tegegne et al. developed a prediction model to predict neonatal mortality in asphyxiated neonates admitted to the neonatal intensive care unit [7]. The model achieved an optimism-adjusted C-statistic of 0.775. This study was conducted in a low-income country, where perinatal care practices and outcomes may differ significantly from those in high-income settings. Notably, this study had a high incidence of neonatal mortality (27.2% versus 11.9% in the present study).

Lew et al. developed a deep learning algorithm that combined MRI and clinical data to predict 2-year neurodevelopmental outcomes in neonates with hypoxic-ischaemic encephalopathy [8]. Their model demonstrated strong discriminative performance for predicting mortality alone, achieving an AUC of 0.92. While MRI data are valuable, it is important to note that these data are not available until several days of age [26, 27]. In contrast, our prediction model is based on readily available clinical and biochemical data available within the first day of life, providing a practical tool to improve parental counselling.

Clinical implications

Clear and compassionate communication with families is essential in the early days following birth, when uncertainty about outcomes can significantly contribute to parental distress [5]. This study provides an evidence-based tool to predict neonatal mortality in infants with neonatal encephalopathy undergoing TH using routinely available neonatal data. The model is intended solely for parental counselling and was not designed to determine whether TH should be initiated or continued. If a model were to be developed for clinical decision-making, it would need to undergo not only external validation but also prospective evaluation and formal impact assessment to determine whether its use improves clinical decision-making and patient outcomes in routine clinical practice [28, 29].

The model’s excellent discriminative ability (optimism-adjusted C-statistic 0.885) supports its utility in guiding parental counselling. Furthermore, the inclusion of readily available clinical and biochemical variables ensures the model’s practical application in similar real-world high-income settings. The THERM tool, implemented as a simple and interactive Excel-based calculator, allows clinicians to quickly generate individualised risk estimates before family meetings.

Strengths and limitations

Our study contained several strengths. First, we used a comprehensive national dataset, incorporating data from 19 maternity units over a 6-year period. This ensures that the findings are nationally inclusive, enhancing the generalisability of the results to similar healthcare settings. Second, our predictors are routinely collected. This practical approach ensures that the model can be readily implemented in clinical settings without requiring additional, specialised resources or equipment, thereby enhancing its utility in everyday clinical practice.

Third, we created a user-friendly interactive tool to calculate individual neonatal mortality risk. Dropdown menus for categorical variables and clear variable definitions enhance usability and reduce the likelihood of data entry errors [30]. This tool provides a practical and user-friendly approach for applying the prediction model in clinical settings, enabling clinicians to make individualized, evidence-based assessments of neonatal mortality risk.

Fourth, this study adhered to the TRIPOD guidelines, ensuring transparency and reproducibility of the research.

Some limitations should be noted. First, as a retrospective study, this analysis is inherently subject to limitations, particularly related to missing data. However, the proportion of missing data in this study was relatively low at 5.17% for the simplified model and 6.90% for the initial model, which is below the 10% threshold associated with a high risk of bias [31]. Furthermore, since the data satisfy the MCAR assumption, the missing data can be considered a random subset of the complete dataset [32]. Consequently, excluding cases with missing data under the MCAR assumption is unlikely to introduce bias into the findings [32]. Second, “seizures during the first day of life” was included in the model. While neonatal seizures can be challenging to recognise clinically, the variable is included in many candidacy checklists for TH [15, 33]. The model assumes experienced clinicians will assess it, as its purpose is to support parental counselling during initial neonatal resuscitation. We acknowledge that continuous EEG monitoring, the gold standard for seizure detection, was not uniformly available across all centres, and subclinical seizures may therefore have been missed. This limitation could contribute to misclassification, although we attempted to reduce this risk by requiring documentation of clinical seizure activity, EEG changes where available, and antiepileptic treatment. Third, the model was developed using a recent, nationally representative dataset from a high-income setting, which enhances generalisability to similar healthcare environments. Nonetheless, temporal and geographical external validation will be required to assess performance and applicability in other settings with different healthcare resources and practices. However, it is recommended that an independent research team conduct external validation to evaluate performance objectively [34]. Therefore, we have included the values to calculate the linear predictor of our model, allowing researchers to conduct independent external validation (Supplementary Material 3).

Fourth, although the data extraction form used for national data collection was not formally validated, it was employed by expert clinicians in obstetrics and neonatology and was overseen by the National TH Coordinator and the Therapeutic Hypothermia Steering Committee to ensure the accuracy of the information.

Fifth, although TH is of uncertain benefit in infants born at 34–35 weeks, we retained these cases to reflect real-world clinical practice, where TH may be used in late preterm infants [35, 36]. Their small number (n = 11, 2.7%) makes it unlikely that their inclusion influenced the model’s performance meaningfully.

Sixth, this study used unadjusted logistic regression as part of the predictor selection process, in addition to expert input from neonatologists and obstetricians, and a review of the existing literature, while also considering data availability. While this is a widely used method, there is an argument that it may lead to the exclusion of predictors that, while not individually significant, could provide valuable information when considered in combination with other variables [37]. However, this method has been used in previously published similar research, and there is no universally accepted standard for predictor selection [38,39,40,41]. This approach allowed us to explore associations between individual predictors and neonatal mortality in a national cohort.

Seventh, although methods such as machine learning are increasingly used in prediction modelling, we selected logistic regression because it is transparent, easy to interpret, and widely applied in similar research [36, 42,43,44]. It is also recognised in methodological guidance as a common starting point for binary outcomes [37]. Importantly, this approach allowed us to translate the model into a simple tool for use in clinical practice.

Eighth, long-term neurodevelopmental outcomes, which are highly relevant for families, were beyond the scope of this study but should be addressed in future research.

Lastly, although the final model met the conventional events per parameter threshold of 10 (11.5 events per parameter), the modest number of deaths (n = 46) may still limit stability. Internal validation using bootstrapping was therefore undertaken. This approach quantifies and corrects for optimism, yielding adjusted estimates of discrimination and calibration that better reflect expected performance in new populations. Nonetheless, external validation remains essential.

Conclusion

This study developed and internally validated a risk prediction model for neonatal mortality in infants with neonatal encephalopathy undergoing therapeutic hypothermia, using national data from 19 maternity hospitals and units across the Republic of Ireland. By incorporating four routinely collected clinical and biochemical variables, the model demonstrated excellent discriminative ability (optimism-adjusted C-statistic, 0.885) and good calibration. The use of our prediction tool (THERM) could potentially assist with individualised risk assessment for neonatal mortality and aid in parental counselling in this setting. External validation and prospective studies are needed to confirm its utility in clinical practice.

Data availability

The datasets analysed during the current study are available from the corresponding author on reasonable request.

References

D’Alton ME, Hankins GDV, Berkowitz RL, Bienstock J, Ghidini A, Goldsmith J, et al. Neonatal encephalopathy and neurologic outcome. Obstet Gynecol. 2014;123:896–901.
Article Google Scholar
Kurinczuk JJ, White-Koning M, Badawi N. Epidemiology of neonatal encephalopathy and hypoxic-ischaemic encephalopathy. Early Hum Dev. 2010;86:329–38.
Article PubMed Google Scholar
Edwards AD, Brocklehurst P, Gunn AJ, Halliday H, Juszczak E, Levene M, et al. Neurological outcomes at 18 months of age after moderate hypothermia for perinatal hypoxic ischaemic encephalopathy: synthesis and meta-analysis of trial data. BMJ. 2010;340:c363.
Article PubMed PubMed Central Google Scholar
Shankaran S, Laptook AR, Ehrenkranz RA, Tyson JE, McDonald SA, Donovan EF, et al. Whole-body hypothermia for neonates with hypoxic-ischemic encephalopathy. N Engl J Med. 2005;353:1574–84.
Article PubMed CAS Google Scholar
Cawley P, Chakkarapani E. Fifteen-minute consultation: Therapeutic hypothermia for infants with hypoxic ischaemic encephalopathy—translating jargon, prognosis and uncertainty for parents. Arch Dis Child Educ Pract Ed. 2020;105:75–83.
Article PubMed Google Scholar
Ambalavanan N, Carlo WA, Shankaran S, Bann CM, Emrich SL, Higgins RD, et al. Predicting outcomes of neonates diagnosed with hypoxemic-ischemic encephalopathy. Pediatrics. 2006;118:2084–93.
Article PubMed Google Scholar
Tegegne YS, Birhan TY, Takele H, Mekonnen FA. Incidence and development of validated mortality prediction model among asphyxiated neonates admitted to neonatal intensive care unit at Felege Hiwot Comprehensive Specialized Hospital, Bahir Dar, Northwest Ethiopia, 2021: retrospective follow-up study. BMC Pediatr. 2024;24:219.
Article PubMed PubMed Central Google Scholar
Lew CO, Calabrese E, Chen JV, Tang F, Chaudhari G, Lee A, et al. Artificial intelligence outcome prediction in neonates with encephalopathy (AI-OPiNE). Radiology Artif Intell. 2024;6:e240076.
Article Google Scholar
Jeong JW, Lee M, Fernandes N, Deol S, Mody S, Arslanturk S, et al. Neonatal encephalopathy prediction of poor outcome with diffusion-weighted imaging connectome and fixel-based analysis. Pediatr Res. 2022;91:1505–15.
Article PubMed CAS Google Scholar
Glass HC, Wood TR, Comstock BA, Numis AL, Bonifacio SL, Cornet MC, et al. Predictors of death or severe impairment in neonates with hypoxic-ischemic encephalopathy. JAMA Netw Open. 2024;7:e2449188.
Article PubMed PubMed Central Google Scholar
Doreswamy SM, Ramakrishnegowda A. Prediction of encephalopathy in perinatal asphyxia score: reaching the unreached. J Perinat Med. 2021;49:748–54.
Article PubMed Google Scholar
Talati AJ, Yang W, Yolton K, Korones SB, Bada HS. Combination of early perinatal factors to identify near-term and term neonates for neuroprotection. J Perinatol. 2005;25:245–50.
Article PubMed Google Scholar
Debillon T, Sentilhes L, Kayem G, Chevallier M, Zeitlin J, Baud O, et al. Risk factors for unfavorable outcome at discharge of newborns with hypoxic-ischemic encephalopathy in the era of hypothermia. Pediatr Res. 2023;93:1975–82.
Article PubMed Google Scholar
Wayock CP, Meserole RL, Saria S, Jennings JM, Huisman TAGM, Northington FJ, et al. Perinatal risk factors for severe injury in neonates treated with whole-body hypothermia for encephalopathy. Am J Obstet Gynecol. 2014;211:41.e1–.e8.
Article PubMed Google Scholar
Meaney S, McGinley J, Corcoran P, McKenna P, Filan P, Greene R, et al. Neonatal therapeutic hypothermia in Ireland, annual report 2019. Cork: National Perinatal Epidemiology Centre; 2021.
World Health Organization. Neonatal and perinatal mortality: country, regional and global estimates. WHO; 2006.
Royston P, Moons KGM, Altman DG, Vergouwe Y. Prognosis and prognostic research: Developing a prognostic model. BMJ. 2009;338:b604.
Article PubMed Google Scholar
Chowdhury MZI, Turin TC. Variable selection strategies and its importance in clinical prediction modelling. Fam Med Community Health. 2020;8:e000262.
Article PubMed PubMed Central Google Scholar
Li C. Little’s test of missing completely at random. Stata J. 2013;13:795–809.
Article Google Scholar
Harrell FE. Missing Data. In: Harrell JFE, editor. Regression modeling strategies: with applications to linear models, logistic and ordinal regression, and survival analysis. Cham: Springer International Publishing; 2015. p. 45–61.
Riley RD, Van Calster B, Collins GS. A note on estimating the Cox-Snell from a reported C statistic (AUROC) to inform sample size calculations for developing a prediction model with a binary outcome. Stat Med. 2021;40:859–64.
Article PubMed Google Scholar
Faraway JJ. Extending the linear models with R: generalised linear, mixed effects and nonparametric regression models. Chapman and Hall; 2006.
Nagelkerke NJD. A note on a general definition of the coefficient of determination. Biometrika. 1991;78:691–2.
Article Google Scholar
Steyerberg EW, Vickers AJ, Cook NR, Gerds T, Gonen M, Obuchowski N, et al. Assessing the performance of prediction models: a framework for traditional and novel measures. Epidemiology. 2010;21:128–38.
Article PubMed PubMed Central Google Scholar
Gravesteijn BY, Sewalt CA, Venema E, Nieboer D, Steyerberg EW. Missing Data in prediction research: a five-step approach for multiple imputation, illustrated in the CENTER-TBI study. J Neurotrauma. 2021;38:1842–57.
Article PubMed Google Scholar
O’Kane A, Vezina G, Chang T, Bendush N, Ridore M, Gai J, et al. Early versus late brain magnetic resonance imaging after neonatal hypoxic ischemic encephalopathy treated with therapeutic hypothermia. J Pediatrics. 2021;232:73–9.e2.
Article Google Scholar
Li Y, Wisnowski JL, Chalak L, Mathur AM, McKinstry RC, Licona G, et al. Mild hypoxic-ischemic encephalopathy (HIE): timing and pattern of MRI brain injury. Pediatr Res. 2022;92:1731–6.
Article PubMed PubMed Central CAS Google Scholar
Kappen TH, van Klei WA, van Wolfswinkel L, Kalkman CJ, Vergouwe Y, Moons KGM. Evaluating the impact of prediction models: lessons learned, challenges, and recommendations. Diagnostic Prognostic Res. 2018;2:11.
Article Google Scholar
Moons KG, Altman DG, Reitsma JB, Ioannidis JP, Macaskill P, Steyerberg EW, et al. Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis (TRIPOD): explanation and elaboration. Ann Intern Med. 2015;162:W1–73.
Article PubMed Google Scholar
Nyame-Mensah A. Decrease Data Entry Errors with Drop Down Menus [Available from: https://www.anyamemensah.com/blog/drop-down-menus.
Bennett DA. How can I deal with missing data in my study?. Aust N Z J Public Health. 2001;25:464–9.
Article PubMed CAS Google Scholar
Dong Y, Peng CY. Principled missing data methods for researchers. Springerplus. 2013;2:222.
Article PubMed PubMed Central Google Scholar
Panayiotopoulos CP. The epilepsies: seizures, syndromes and management, Chapter 5, Neonatal seizures and neonatal syndromes, Oxfordshire (UK): Bladon Medical Publishing; 2005 Available from: https://www.ncbi.nlm.nih.gov/books/NBK2599/#.
Collins GS, Altman DG. An independent and external validation of QRISK2 cardiovascular disease risk score: a prospective open cohort study. BMJ. 2010;340:c2442.
Article PubMed PubMed Central Google Scholar
Kim SH, El-Shibiny H, Inder T, El-Dib M. Therapeutic hypothermia for preterm infants 34-35 weeks gestational age with neonatal encephalopathy. J Perinatol. 2024;44:528–31.
Article PubMed Google Scholar
Faix RG, Laptook AR, Shankaran S, Eggleston B, Chowdhury D, Heyne RJ, et al. Whole-body hypothermia for neonatal encephalopathy in preterm infants 33 to 35 weeks’ gestation: a randomized clinical trial. JAMA Pediatrics. 2025;179:396–406.
Article PubMed PubMed Central Google Scholar
Efthimiou O, Seo M, Chalkou K, Debray T, Egger M, Salanti G. Developing clinical prediction models: a step-by-step guide. BMJ. 2024;386:e078276.
Article PubMed PubMed Central Google Scholar
de Wit HAJM, Winkens B, Mestres Gonzalvo C, Hurkens KPGM, Mulder WJ, Janknegt R, et al. The development of an automated ward independent delirium risk prediction model. Int J Clin Pharm. 2016;38:915–23.
Article PubMed Google Scholar
Qiu Y, Xue W, Chen Y, He X, Zhao L, Tang M, et al. Development and validation of a prediction model for dysphagia in community-dwelling older adults. Biol Res Nurs. 2025;27:300–15.
Article PubMed CAS Google Scholar
Zeng Y, Liu R, Li S, Wei J, Luo F, Chen Y, et al. Analysis of risk factors and development of a nomogram prediction model for renal tubular acidosis in primary Sjogren syndrome patients. Arthritis Res Ther. 2024;26:151.
Article PubMed PubMed Central CAS Google Scholar
Maher GM, McKernan J, O’Byrne L, Corcoran P, Greene RA, Khashan AS, et al. Predicting risk of postpartum haemorrhage during the intrapartum period in a general obstetric population. Eur J Obstet Gynecol Reprod Biol. 2022;276:168–73.
Article PubMed Google Scholar
Fu YQ, Bai K, Liu CJ. The impact of admission serum lactate on children with moderate to severe traumatic brain injury. PLoS ONE. 2019;14:e0222591.
Article PubMed PubMed Central CAS Google Scholar
López Jiménez N, García Sánchez F, Pailos RH, Rodrigo Álvaro V, Pascual Pedreño A, Moreno Cid M, et al. Prediction of an effective cervical ripening in the induction of labour using vaginal dinoprostone. Sci Rep. 2023;13:6855.
Article PubMed PubMed Central Google Scholar
Beninati MJ, Ramos SZ, Danilack VA, Has P, Savitz DA, Werner EF. Prediction model for vaginal birth after induction of labor in women with hypertensive disorders of pregnancy. Obstet Gynecol. 2020;136:402–10.
Article PubMed CAS Google Scholar

Download references

Funding

Open Access funding provided by the IReL Consortium.

Author information

Authors and Affiliations

Department of Obstetrics and Gynaecology, University College Cork, Cork, Ireland
Jill M. Mitchell, Clare L. Rodrigues, Laura O’Byrne, John R. Higgins, Richard A. Greene & Fergus P. McCarthy
National Women and Infants Health Programme, Cork, Ireland
Margo Dunworth, Julie Mc Ginley, Peter Mc Kenna & John Murphy
National Perinatal Epidemiology Centre, Cork, Ireland
Roisin Hall, Paul Corcoran, Indra San Lazaro Campillo, Joye McKernan & Richard A. Greene
INFANT Research Centre, Cork, Ireland
Ali S. Khashan, Gillian M. Maher, Brian H. Walsh & Fergus P. McCarthy
School of Public Health, University College Cork, Cork, Ireland
Ali S. Khashan & Gillian M. Maher

Authors

Jill M. Mitchell
View author publications
Search author on:PubMed Google Scholar
Clare L. Rodrigues
View author publications
Search author on:PubMed Google Scholar
Margo Dunworth
View author publications
Search author on:PubMed Google Scholar
Julie Mc Ginley
View author publications
Search author on:PubMed Google Scholar
Laura O’Byrne
View author publications
Search author on:PubMed Google Scholar
Roisin Hall
View author publications
Search author on:PubMed Google Scholar
Paul Corcoran
View author publications
Search author on:PubMed Google Scholar
Indra San Lazaro Campillo
View author publications
Search author on:PubMed Google Scholar
Peter Mc Kenna
View author publications
Search author on:PubMed Google Scholar
John R. Higgins
View author publications
Search author on:PubMed Google Scholar
Joye McKernan
View author publications
Search author on:PubMed Google Scholar
Ali S. Khashan
View author publications
Search author on:PubMed Google Scholar
Gillian M. Maher
View author publications
Search author on:PubMed Google Scholar
John Murphy
View author publications
Search author on:PubMed Google Scholar
Brian H. Walsh
View author publications
Search author on:PubMed Google Scholar
Richard A. Greene
View author publications
Search author on:PubMed Google Scholar
Fergus P. McCarthy
View author publications
Search author on:PubMed Google Scholar

Contributions

JM: Conceptualisation, data curation, formal analysis, visualization, writing—original draft preparation. CR: Investigation, methodology. MD: Investigation, methodology, project administration, JMcG: Investigation, methodology, project administration, LOB: Investigation, methodology, writing—review and editing. RH: Investigation, methodology. PC: Investigation, methodology, writing—review and editing. IC: Project administration, methodology, writing—review and editing. PMcK: Conceptualisation, methodology, supervision, writing—review and editing. AK: Conceptualisation, methodology, supervision, writing—review and editing. GM: Conceptualisation, methodology, supervision, writing—review and editing. JM: Conceptualisation, methodology, supervision, writing—review and editing. BW: Conceptualisation, methodology, supervision, writing—review and editing. RG: Conceptualisation, methodology, supervision, writing—review and editing. FMcC: Conceptualisation, methodology, supervision, writing—review and editing.

Corresponding author

Correspondence to Jill M. Mitchell.

Ethics declarations

Competing interests

The authors declare no competing interests.

Ethical approval and consent to participate

Data access approval for the use of anonymised data in this project was granted on 05 January 2024 by the National Neonatal Steering Committee, the National Perinatal Epidemiology Centre (NPEC), the National Women and Infant Health Programme, and the National Clinical Programme for Paediatrics and Neonatology. Informed consent was not required because the study involved secondary analysis of fully anonymised data. Full ethical approval for the National Neonatal Therapeutic Hypothermia Development Project was granted by the Clinical Research and Ethics Committee (ECM 4 (g) 05/08/08).

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

41372_2025_2547_MOESM1_ESM.docx (download DOCX )

Appendix: Development of a Prediction Model for Mortality in Infants Undergoing Therapeutic Hypothermia for Neonatal Encephalopathy

Data Set 1

Data Set 2 (download XLSX )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Mitchell, J.M., Rodrigues, C.L., Dunworth, M. et al. Development of a prediction model for mortality in infants undergoing therapeutic hypothermia for neonatal encephalopathy. J Perinatol (2026). https://doi.org/10.1038/s41372-025-02547-z

Download citation

Received: 02 September 2025
Revised: 11 November 2025
Accepted: 05 December 2025
Published: 05 January 2026
Version of record: 05 January 2026
DOI: https://doi.org/10.1038/s41372-025-02547-z

Subjects

Abstract

Objective

Study design

Results

Conclusions

Similar content being viewed by others

Evaluation of heterogeneity in effect of therapeutic hypothermia by sex among infants with neonatal encephalopathy

Process variations between Swiss units treating neonates with hypoxic-ischemic encephalopathy and their effect on short-term outcome

Impact of teleneonatology on time to goal temperature in outborn neonates with hypoxic-ischemic encephalopathy requiring therapeutic hypothermia

Introduction

Methods

Study population

Variable selection

Outcome measure

Statistical analysis

Initial prediction model

Simplified prediction model

Sample size

Model performance and internal validation

Development of an interactive prediction tool

Confusion matrix

Results

Descriptive statistics

Unadjusted logistic regression analysis

Initial prediction model

Prediction tool

Model performance and internal validation

Discussion

Principal findings

Comparison to previous research

Clinical implications

Strengths and limitations

Conclusion

Data availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Ethical approval and consent to participate

Additional information

Supplementary information

41372_2025_2547_MOESM1_ESM.docx (download DOCX )

Data Set 1

Data Set 2 (download XLSX )

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links