Introduction

Extremely preterm infants under 28 weeks of gestation often undergo invasive procedures, such as blood sampling, central venous catheterization, and intubation.1,2 These invasive procedures potentially cause severe complications, including intraventricular hemorrhage (IVH) and periventricular leukomalacia (PVL).3 To reduce severe complications in preterm infants, sedation and analgesic drugs such as opioids, midazolam, and phenobarbital are administered,1,4 which are essential because IVH and PVL can cause neurodevelopmental impairment.1,3,4,5 Thus, opioids (e.g., fentanyl and morphine) have been used in preterm infants in the neonatal intensive care unit (NICU).6

Previous research on early-phase opioid use in extremely preterm infants is limited due to insufficient adjustments for confounding factors. A retrospective study of 4781 such infants found increased in-hospital mortality, IVH, PVL, bronchopulmonary dysplasia (BPD), and retinopathy of prematurity associated with early-phase opioid use without adjusting for crucial confounding factors, such as drug use and procedures.7 Other small previous studies have linked early-phase opioid use with unfavorable mortality and morbidities without adjusting for background factors.8,9 Moreover, multiple randomized controlled trials and meta-analyses have not confirmed the effectiveness of early-phase opioid use in preterm infants (including extremely preterm infants) due to small sample sizes.10,11,12,13,14,15,16,17 These short-term outcomes affect the life of infants and are often associated with long-term outcomes such as neurodevelopmental impairment.

Given the high prevalence of preterm birth in high-income countries,18 investigating the benefits of early-phase opioid use is necessary. Investigation of the short-term outcomes of early-phase opioid use is important to improve the treatment in NICUs. Therefore, we aimed to investigate the association between early-phase opioid use and outcomes in extremely preterm infants after adjusting for potential confounders using a nationwide database in Japan.

Methods

Database

This nationwide retrospective study used the Diagnostic Procedure Combination database. This database contains administrative claims data and discharge abstracts of 8,000,000 inpatients per year from > 1200 hospitals in Japan.19 The database covers approximately 90% of hospitals with NICUs.20 The participation of community hospitals was voluntary although all 82 university hospitals in Japan were required to join the database. The requirement for informed consent was waived due to the anonymity of the database. The Institutional Review Board of the University of Tokyo approved this study (approval number: 3501-(5) [May 19th, 2021]).

The database includes the following data: sex of the patient, gestational age; birth weight; age at admission; main diagnoses; comorbidities at admission; and complications after admission recorded in Japanese text and using the International Classification of Disease, Tenth Revision (ICD-10) codes indexed by the original Japanese codes; patients’ transportation from other hospitals or in-hospital births; length of stay; discharge status; and hospitalization costs. Hospitalization costs were calculated based on the reference prices in the Japanese national fee schedule for item-by-item prices for surgical, pharmaceutical, laboratory, and other inpatient services. The attending physicians recorded all abstract discharge data at discharge. Previous studies have reported the favorable sensitivity and specificity of the diagnoses and procedure records in the database.21,22 Furthermore, several perinatal studies have been conducted using this database.23,24

Study protocol

We identified patients under 28 weeks of gestational age who were admitted to the NICUs between July 2010 and March 2021. We included patients who were transferred from other NICUs and non-NICU settings. We excluded patients who were admitted after 2 days of age and discharged before 2 days of age. Additionally, we excluded those who were diagnosed with trisomies 21, 18, and 13 (defined by the ICD-10 codes; Supplementary Table S1) and those who underwent cardiac surgeries other than the surgery for patent ductus arteriosus.

The original Japanese procedure codes were used to identify the cardiac surgeries. We categorized eligible patients into those who received opioids before 2 days of age (opioid group) and those who did not receive opioids (non-opioid group). We defined the exposure period as 2 days of age per previously established methodologies24,25 to evaluate the effects of early-phase opioid use because hemodynamics soon after birth can change drastically;26 thus, the effects of opioid use after 2 days of age were not assessed and the non-opioid group could receive opioids after 2 days of age. The period was defined by days instead of hours because the time of day for events was unavailable in the database. We consider the definition of 2 days appropriate because a single-day exposure period hardly allows a patient born at midnight to be classified into the exposure group.

The primary outcomes were in-hospital mortality, IVH, and PVL. The ICD-10 codes were used to define IVH and PVL (Supplementary Table S1).24 The grade of IVH and the status of in-utero IVH were unavailable because they were indistinguishable by ICD-10 codes. The secondary outcomes included other in-hospital morbidities (BPD, hydrocephalus, necrotizing enterocolitis, and retinopathy of prematurity), discharge destination, home assistive technology use (home oxygen therapy, tracheostomy, and home mechanical ventilation), length of hospital stay, length of intubation, and total hospitalization cost. BPD was defined as receiving oxygen, non-invasive positive-pressure ventilation, or invasive mechanical ventilation at 36 weeks of corrected gestational age. The ICD-10 codes were used to define hydrocephalus and necrotizing enterocolitis (Supplementary Table S1).24 We selected hydrocephalus as a secondary outcome as a proxy for severe IVH because we were unable to determine the actual grade of IVH. Retinopathy of prematurity was defined as receiving an antivascular endothelial growth factor injection or laser photocoagulation.24 We defined the currency exchange rate as 140 Japanese yen per 1 US dollar.

The following patient characteristics were investigated: sex, gestational age, birth weight, in-hospital birth, presence of neonatal asphyxia, and general comorbidities. Gestational age was categorized into three groups: 22–23, 24–25, and 26–27 weeks. Additionally, birth weight was categorized into four groups: <500, 500–999, 1000–1499, and ≥1500 g. The ICD-10 codes and original Japanese procedure codes were used to define neonatal asphyxia (Supplementary Table S1).24 The Pediatric Complex Chronic Conditions Classification System version 2 (CCC) was used to identify general comorbidities.27 CCC includes a variety of disease categories, such as cardiovascular diseases, genetic disorders, and neurological disorders. General comorbidities were categorized into four groups according to the number of CCC (0, 1, 2, and ≥3).

Subsequently, we investigated the following treatments before 2 days of age: drug use (adrenalin, dopamine or dobutamine, milrinone, cyclooxygenase inhibitors [ibuprofen or indomethacin], antibiotics, caffeine citrate, granulocyte-colony stimulating factor, steroids, muscle relaxants [rocuronium and vecuronium], and other sedative agents [midazolam, dexmedetomidine, and phenobarbital]), blood transfusion (albumin, fresh frozen plasma, gamma-globulin, platelets, and red blood cell), and procedures (arterial catheterization, central venous catheterization, inhaled nitric oxide, intratracheal surfactant, intubation, surgery for patent ductus arteriosus, and jaundice phototherapy). Additionally, we collected data on the hospital type (whether a patient was admitted to a general perinatal medical center or not, and to a teaching hospital or not), hospital volume, and fiscal year. NICUs in general perinatal medical centers in Japan are similar to the level 3 NICUs in the United States.28 Hospital volume was defined as the number of patients with gestational age < 28 weeks at each hospital and was categorized into three groups (low, medium, and high). The fiscal years were categorized into two groups: 2010–2015 and 2016–2021.

Statistical analysis

To compare the outcomes between the two groups, we performed propensity score matching analysis at a 1:2 ratio.29 Propensity scores were calculated using a logistic regression model in which the independent variables included patient backgrounds and treatments. Each patient in the opioid group was matched with two patients with the closest estimated propensity scores within a caliper (≤ 0.2 of the pooled standard deviation) in the non-opioid group. The matched samples were obtained using the nearest-neighbor matching with replacement. Furthermore, we calculated the standardized differences to confirm the balance in baseline characteristics between the two groups for all patients, as well as for the 1:2 propensity score-matched cohort. A standardized difference of <10% implied a negligible difference between the two groups.30 We performed a χ2 test to compare proportions for categorical outcomes, and the Mann–Whitney U test to compare medians for continuous outcomes in the 1:2 propensity score-matched pair cohort. After generating the propensity score-matched pair cohort, we used a logistic regression model to calculate the odds ratio for the opioid group compared with the non-opioid group.

For sensitivity analysis, we conducted the following analyses to examine the robustness of our inference: overlap propensity score-weighted and instrumental variable analyses. Overlap propensity score-weighted analysis is an extension of the propensity score analysis that balances covariates between the two groups and imitates a randomized trial without excluding most observations from the study sample.31 In this analysis, we used the same propensity score as in the main analysis. Next, using a two-stage residual inclusion method, we conducted instrumental variable analysis. Instrumental variable analysis can theoretically adjust for the unmeasured background characteristics and resemble trials.32,33,34,35 We used the facility treatment rate as an instrumental variable because it is a typical instrumental variable in epidemiological studies.33 In the present study, the facility treatment rate was defined as the number of extremely preterm infants receiving opioids divided by the total number of eligible extremely preterm infants in each hospital. The validity of the instrumental variable was tested using its F-statistic; an F-statistic <10 was considered an invalid instrumental variable.33 A logistic regression model was used to calculate the odds ratio for the opioid group compared with the non-opioid group in each sensitivity analysis. Moreover, after generating the propensity score-matched pair cohort, we performed logistic regression analysis using the propensity score and all variables to calculate the odds ratio for the opioid group compared with the non-opioid group.

Finally, we performed subgroup analyses based on the gestational age and opioid type. We conducted a 1:2 propensity score-matching analysis in patients under 25 weeks of gestational age. This analysis included re-categorizing the birth weight into four groups (<500, 500–599, 600–699, and ≥700 g) for balancing the matched patients. Additionally, we further divided the opioid group into two groups (fentanyl and morphine groups) according to the opioid type used before 2 days of age and separately compared the outcomes with those of the non-opioid group. The subgroup analysis involved adjusting the matching ratio according to the number of patients in each group.

The significance threshold was set at 0.05, and all reported p-values were two-sided. All statistical analyses were conducted using Stata/MP 17.0 (StataCorp, College Station, TX).

Results

We identified 22,312 patients with a gestational age of <28 weeks who were treated in NICUs between July 2010 and March 2021. We excluded 3518 patients who met the following exclusion criteria: (i) 59 patients with trisomies 21, 18, and 13; (ii) 55 patients who underwent cardiac surgery other than the surgery for patent ductus arteriosus; and (iii) 3404 patients who were admitted or discharged after or before 2 days of age, respectively. Of the 18,794 eligible patients, the opioid and non-opioid groups comprised 4806 (26%) and 13,988 patients (74%), respectively (Fig. 1).

Fig. 1: Patient flow.
figure 1

The non-opioid group comprised the patients who did not receive opioids within the first 2 days of age irrespective of opioids use after 2 days of age.

Table 1 summarizes the baseline characteristics of all patients and the patients with 1:2 propensity score matching. Before matching, the opioid group demonstrated a lower gestational age and birth weight and was more likely to receive almost all drugs, transfusions, and procedures than the non-opioid group. The non-opioid group comprised 9612 patients, and the background characteristics were well balanced between the two groups after the 1:2 propensity score matching.

Table 1 Background characteristics of all patients and patients with 1:2 propensity score matching.

Table 2 demonstrates the crude outcomes of all patients and the 1:2 propensity score-matched cohort. Among all patients, the in-hospital mortality was 8.5%, and IVH and PVL occurred in 2365 (13%) and 388 (2.1%) patients, respectively. The in-hospital mortality and IVH did not differ between the two groups after the 1:2 propensity score matching. The opioid group exhibited significantly lower PVL (1.7% vs. 2.2%), hydrocephalus (3.3% vs. 4.3%), and home discharge occurrences (66% vs. 68%) but higher BPD (65% vs. 58%) and home assistive technology use occurrences (19% vs. 15%) than the non-opioid group. Moreover, the lengths of hospital stay (125 vs. 122 days) and intubation (73 vs. 71 days) were longer, and total hospitalization costs (89,485 vs. 86,827 US dollars) were higher in the opioid group than in the non-opioid group. Similar results were observed in the logistic regression analysis after generating the propensity score-matched pair cohort (Fig. 2).

Table 2 Comparisons of the outcomes between the opioid and non-opioid groups among all patients and patients with 1:2 propensity score matching.
Fig. 2: Odds ratios for the opioid versus the non-opioid groups for each outcome in the propensity score and instrumental variable analyses.
figure 2

Odds ratios are with reference to patients in the non-opioid group. The non-opioid group comprised the patients who did not receive opioids within the first 2 days of age irrespective of opioids use after 2 days of age. CI confidence interval, IV instrumental variable analysis, PS propensity score.

The weighted background characteristics revealed an exact balance between the two groups in the overlap propensity score-weighted analysis (Supplementary Table S2) and the results corresponded with the main analysis (Supplementary Table S3). Additionally, the instrumental variable analysis had 1310 as the F-statistic and similar results to the propensity score analyses (Fig. 2). Similar results were observed in the logistic regression analysis using the propensity score and all variables after generating the propensity score-matched pair cohort (Supplementary Table S4).

The subgroup analyses’ background characteristics after propensity score matching were well-balanced (Supplementary Tables S56). The in-hospital mortality and morbidity rates of patients <25 weeks of gestational age were higher than those of all patients (Table 3). Although mortality did not differ between the two groups in line with the main analysis, opioid use was associated with decreased IVH (17% vs. 21%). Regarding the opioid type, the fentanyl group demonstrated results similar to those of the main analysis (e.g., decreased occurrence of PVL); however, the morphine group was less likely to develop IVH than the non-opioid group (13% vs. 17%) (Table 4).

Table 3 Comparisons of the outcomes between the opioid and non-opioid groups among the patients < 25 weeks of gestational age and patients with 1:1 propensity score matching.
Table 4 Comparisons of the outcomes between the fentanyl group or the morphine group and non-opioid group among patients and patients with propensity score matching.

Discussion

This study investigated the relationship between early-phase opioid use and outcomes in extremely preterm infants using a nationwide database in Japan. Previous studies reported higher mortality (odds ratio 1.57 [95% confidence interval, 1.13–2.18]) and IVH (1.63 [1.30–2.04]) in opioid users;7,17 however, after adjusting for background factors, our analysis found no significant differences in mortality or IVH between the opioid and non-opioid groups. Additionally, this study revealed that early-phase opioid use was linked to decreased PVL, but also increased BPD, length of hospital stay, and hospitalization costs.

Preventing IVH is a critical goal in perinatal medicine as it is a leading cause of neurodevelopmental impairment in extremely preterm infants.3,5 A previous observational study with inadequate adjustment for confounders found that opioid use in extremely preterm infants was linked to increased in-hospital mortality and IVH.7 In the current study, after adjusting for various clinical backgrounds and treatments, opioid use was not linked to IVH but was associated with a reduced incidence of hydrocephalus. This reflects a decrease in severe IVH in the opioid group as hydrocephalus usually develops with severe IVH.36 In other words, early-phase opioid use may reduce IVH severity in extremely preterm infants.

Patients in the opioid group were more likely to exhibit BPD than those in the non-opioid group. Opioid use may extend intubation duration as previously reported,1 and lead to BPD from ventilator-associated lung damage in extremely preterm infants.37

Long-term intubation may also lead to increased home assistive technology use, longer hospital stays, and higher hospitalization costs in the opioid group. Clinicians should avoid long intubation duration when using early-phase opioids in extremely preterm infants. This study showed that early-phase opioid use was significantly associated with an increase in the occurrence of BPD, with only a slight decrease in the occurrence of PVL. Therefore, clinicians should consider limiting routine early-phase opioid use until further data are available from well-designed and adequately powered prospective cohort studies.

Extremely preterm infants born <25 weeks of gestation are at a particularly high risk for developing IVH.38,39 However, limited evidence exists regarding the effectiveness of early-phase opioid use in these infants. The present study involving >6000 extremely preterm infants showed that early-phase opioid use at <25 weeks of gestational age was associated with reduced occurrence of IVH and hydrocephalus. Preventing IVH is crucial as it is an important risk factor for developing neurodevelopmental impairment.3 Nevertheless, early-phase opioids markedly increased BPD in this population. Clinicians should avoid routine early-phase opioid use without considering the increased BPD risk, even in this population.

Although morphine is a major analgesic opioid, fentanyl is often used as an alternative due to its fewer cardiovascular and gastrointestinal side effects than morphine40; indeed, fentanyl was used approximately four times more frequently than morphine in this study. However, fentanyl was not linked to decreased IVH, whereas morphine was associated with reduced IVH. Fentanyl might have a weak analgesic effect in preventing IVH. Additionally, the fentanyl group (as well as morphine groups) exhibited a higher BPD incidence, longer hospital stays and intubation duration, and increased hospitalization costs than the non-opioid group. Fentanyl should be carefully selected as an early-phase opioid to reduce side effects as its use demonstrated no benefit (other than a small reduction in PVL).

This study has several limitations. First, we could not obtain information on the drug dosage and administration methods, such as continuous or bolus administration. Therefore, this study did not examine the dose-dependent effects of early-phase opioid use. Second, we lacked access to laboratory data (e.g., blood gas tests) and imaging findings (e.g., magnetic resonance imaging and echocardiography) to assess the severity of patient backgrounds and outcomes. We could not determine the severity of IVH and the actual onset of IVH in this study. However, even in clinical practice, it is difficult to determine the actual onset of IVH by ultrasound, which is commonly used for the diagnosis of IVH in neonates. Information on maternal history, including antenatal corticosteroid administration, delivery mode, and maternal comorbidities, was not obtained. However, we adjusted for neonatal conditions after birth, including patient background, drugs, and procedures, through propensity scores and instrumental variable analyses. We believe these adjustments can account for measured and unmeasured covariates. Fourth, we did not assess the effects of opioid use beyond 2 days of age because we originally aimed to evaluate the effects of early-phase opioid use. Finally, we did not assess the long-term effects of early-phase opioid use in extremely preterm infants. However, our study focused on the short-term effects of early-phase opioid use in extremely preterm infants because data on these is limited, even in short-term outcomes.

Conclusions

We performed propensity score and instrumental variable analyses with a large nationwide cohort of extremely preterm infants born under 28 weeks of gestation to compare the short-term effects of early-phase opioid use. Early-phase opioids for extremely preterm infants were markedly associated with decreased PVL and hydrocephalus, increased BPD, prolonged length of hospital stay and intubation, and higher hospitalization costs. Furthermore, Early phase opioid use was not considerably linked to increased IVH incidence.