Introduction

Brain development during gestation and after birth is rapid1. The brain undergoes sequential, coordinated and hierarchical development that is also reflected in functional brain networks during the neonatal period and infancy2. Previous work has demonstrated the presence and patterns of major brain networks' functional connectivity in fetuses3 as well as in preterm4,5,6 and term infants7,8. The maturation of motor and sensory systems occurs early9, and in infants, cortical functional networks exhibit large confinement to primary sensory and motor areas, suggesting a perception-action task-related functional network architecture9,10.

Preterm birth (birth before 37 weeks of gestation) is a common condition, occurring in about 1 in 10 births globally11. It is associated with an increased likelihood for multiple adverse health outcomes in later life, including brain, cognitive, and neural impairments4,12,13,14. Considering brain maturation, preterm birth has been associated with altered brain development as younger gestational age at birth predicts smaller brain volumes15,16,17,18,19 and higher diffusivity in the prefrontal, parietal, motor, somatosensory, and visual cortices, likely suggesting delayed maturation of these cortical areas20. Prior research has also demonstrated sex-specific alterations in the preterm brain21. In addition to the neural correlates of preterm birth, studies have unraveled relationships of many key steps of brain development with increasing postmenstrual age (PMA) at scan22. However, the strength of associations between PMA and brain metrics varies across imaging modalities. Structural connectivity measures have demonstrated strong correlation with PMA in neonatal subjects (r = 0.83)23, as well as with gestational age in fetal subjects (R2 = 0.71)24, whereas functional connectivity measures have demonstrated relatively weaker correlations (r = 0.18–0.21)25. In a recent work, however, Sun et al.26 showed that both structural and functional connectomes accurately predict PMA in term and preterm infants scanned within the first months of life, although structural connectomes still had greater predictive accuracy. This underscores the potential of advanced functional imaging approaches to enhance assessments of brain maturation in neonates.

Electrophysiological activity of the brain exhibits both periodic (or oscillatory) and aperiodic (non-oscillatory) properties. The overall level of aperiodic activity, and how it changes as a function of frequency, can be modeled with two parameters: offset and exponent, respectively27. The parameters change as a function of age, with the exponent decreasing in infants28, and both exponent and offset decreasing during the early-childhood29,30 and early-adulthood31. Based on these EEG studies, we propose that the parameters of aperiodic brain activity could serve as markers for the pace of (functional) brain maturation. Given that aperiodic brain activity is assumed to be ‘scale-free’ (i.e., it is not restricted to a specific temporal scale)32, age-related changes could also be detected using aperiodic parameters of the resting-state functional MRI (rs-fMRI) blood-oxygen-level-dependent (BOLD) signal. While the utility of parameterizing brain activity in terms of its periodic and aperiodic components has been demonstrated in numerous electrophysiological studies27,28,33,34, the methodology has not been widely adopted for the analysis of fMRI datasets.

An effective neurovascular coupling is already present in the developing brain before the time of normal birth35, and the hemodynamic response function (HRF) can be reliably identified in neonatal participants despite being weaker in amplitude as compared to adult participants36. While the aperiodic component of the fMRI BOLD signal remains relatively unexplored, evidence from concurrent multimodal measurements with electrocorticography (ECoG) and EEG connects the scale-free activity to fMRI BOLD signal response, suggesting a common neural origin37,38,39. Wen and Liu40 argue that whereas the periodic activity in BOLD signal supports the modular networks of the brain, the aperiodic activity is related to the global BOLD signal, which is increasingly being recognized as carrying valuable information about neuronal processes and related to various phenomena such as levels of vigilance and arousal, as well as quasi-periodic patterns of functional activity of the brain41. In neonatal participants, higher aperiodic exponents of the fMRI BOLD signal have been observed in the sensory cortex42, potentially suggesting local brain maturation. Although the exact origin or the significance of the aperiodic component in the fMRI BOLD signal is not yet fully understood, there is little doubt as to its origin in the neural processes of the brain, or that it carries important information relating to various processes of the brain.

In the present study, we parameterized the aperiodic components of the rs-fMRI BOLD signal power spectra (0.01–0.15 Hz) from 90 cortical and subcortical regions of interest (ROIs) in terms of their exponent and offset, using data from the dHCP neonatal rs-fMRI dataset. We then mapped the associations between the aperiodic offset and exponent parameters from the pre- and postcentral gyri and preterm birth, and postmenstrual and postnatal age, as well as sex, were included in linear mixed-effect models. Finally, we performed machine learning-based regression to predict postmenstrual and postnatal age from the aperiodic parameters from all 90 ROIs. Based on previous electrophysiological studies28,29,30,43, particularly a recent one exploring aperiodic activity and gestational duration in neonates and toddlers44, as well as diffusion tensor imaging studies in preterm infants20, we hypothesized that postmenstrual age is positively associated, while postnatal age is negatively associated, with aperiodic parameters. Additionally, we expected preterm-born neonates to exhibit higher aperiodic parameters compared to term-born neonates, reflecting delayed neural developmental processes in the preterm group. In our statistical analysis, PMA had a statistically significant positive association with the offset of the aperiodic component, while postnatal age had a statistically significant negative association with the exponent of the aperiodic component, both in accordance with our hypothesis. Prematurity was negatively associated with the exponent with statistical significance; however, this disagrees with our hypothesis. The association between prematurity and the offset was also negative but did not reach statistical significance. Our machine learning-based elastic net regression model achieved relatively high predictive performance using aperiodic parameters as predictors, supporting the aperiodic component of the BOLD signal as a measure of brain maturation in neonatal populations.

Results and discussion

Descriptive statistics for the full sample are presented in Table 1. Descriptive statistics for the full-term and preterm subsamples are presented in Supplementary Tables S1 and S2, respectively.

Table 1 Descriptive statistics for the full sample

The aperiodic fit was generally acceptable across the ROIs, but it was highest for somatosensory areas (Fig. 1). There was also a trend of poorer fits towards the inferior parts of the brain that may be due to poor signal-to-noise ratio in those regions45.

Fig. 1: FOOOF model goodness of fit.
Fig. 1: FOOOF model goodness of fit.
Full size image

Group mean R² values (N = 599) from the FOOOF model mapped onto cortical and subcortical regions defined by the Automated Anatomical Labeling (AAL90) atlas. The color gradient indicates the goodness of fit, with red representing higher R2 and blue representing lower R2.

Effects of postmenstrual age at scan

Postmenstrual age at scan was positively associated with the offset (Table 3: estimate = 0.06, SE = 0.01, p < 0.001). Further, interaction effects showed that this association was statistically significantly dependent on ROI (Table 3: estimate = 0.04, SE = 0.01, p < 0.001). This means that the increase of one standard deviation (SD) in child PMA predicted a +0.06 change in offset in postcentral gyri and a +0.10 change in precentral gyri.

The main effect of PMA at scan was not statistically significantly associated with the exponent, but its interaction effect with ROI was statistically significant (Table 2: estimate = –0.03, SE = 0.01, p = 0.002). This means that there was a crossover interaction, i.e., the association between PMA and exponent was dependent on the ROI. In postcentral gyri, the increase of one SD in child PMA predicted +0.02 change in exponent, while in precentral gyri, the corresponding change was negligible, –0.01. For the relationship between predicted aperiodic parameters and child PMA (see Supplementary Fig. S1).

Table 2 Results of the linear mixed-effects regression model for exponent

During gestation, there is a proliferation of neuroblasts and synaptogenesis, ultimately leading to the capability of neurons to fire repetitive action potentials. Additionally, spontaneous neural activity begins to occur in mid-gestation (approximately at gestation week 18)46,47. Given that asynchronous signals (i.e., offset) have been identified as a principal source of BOLD responses38 and further, electrophysiological power shifts have been shown to be a reliable predictor of neuronal spikes48, our results could reflect increasing neuronal firing rates as a function of postmenstrual age during gestation.

We chose the somatosensory regions for closer examination as they are known to be among the regions that mature early3. Offset values were higher in precentral compared to postcentral gyri, reflecting an elevated baseline level of BOLD signal power in the precentral gyri. Considering offset as an indirect proxy of neural activity, this suggests either higher neural activity or higher voxel-wise temporal concordance (i.e., more synchronized BOLD signal fluctuations across voxels) in the precentral (motor) gyri compared to postcentral (somatosensory) gyri. However, this difference narrows down after birth as postnatal age increases. We observed an increase in exponent values in the postcentral gyri as a function of PMA, while in the precentral gyri no noticeable change was detected. This suggests that the aperiodic activity distribution steepens in the postcentral gyri as a function of postmenstrual age. Given that in previous EEG studies flattening of the power spectrum has been linked to postnatal aging28,29,30,31,43, our results could indicate earlier maturation of the precentral (motor) gyri compared to postcentral (somatosensory) gyri during gestation. This aligns with an earlier diffusion tensor imaging study of preterm newborns, which found higher fractional anisotropy and lower diffusivity in motor tracts compared to sensory tracts, reflecting earlier white matter maturation of the motor system compared to sensory pathways49.

It is important to note that our study focused on changes in the low-frequency power spectrum of the BOLD signal (0.01–0.15 Hz), and caution is warranted when comparing our results to prior EEG literature. However, previous studies on fractional amplitude of low-frequency fluctuation (fALFF) activation in adults have reported negative correlations between age and the fALFF activation in adults50,51, and fALFF activation has been demonstrated to be a valid predictor of age in children52. These findings suggest a trend towards less prominent low-frequency fluctuations as a function of age, also in the BOLD signal. Additionally, we found slightly higher offset values in the right hemisphere compared to the left hemisphere, indicating greater power of the BOLD signal in the right hemisphere. However, it should be noted that the difference was negligible, only 0.01. Previous studies align with our findings, suggesting earlier maturation of the right hemisphere, as cortical folding occurs earlier in the right hemisphere in fetuses. Furthermore, a general functional dominance (except for linguistic stimuli) of the right hemisphere has been described in fetuses, neonates, and preterm-born infants53.

Effects of postnatal age

The negative association between the child’s postnatal age and exponent was statistically significant (Table 2: estimate = –0.17, SE = 0.03, p < 0.001). This association did not differ statistically significantly between the pre- and postcentral gyri, meaning that the increase of one SD in postnatal age predicts –0.17 change in exponent in both ROIs.

The negative effect of postnatal age on offset was not statistically significant. However, its interaction effect with ROI was statistically significant (Table 3: estimate = –0.02, SE = 0.01, p = 0.048). The result indicates that an increase of one standard deviation in postnatal age decreases the difference in offset values between the precentral and postcentral gyri by –0.02, resulting in the region-related difference disappearing at approximately 20 weeks of postnatal age. For the relationship between predicted aperiodic parameters and child postnatal age (see Supplementary Fig. S1).

Table 3 Results of the linear mixed-effects regression model for offset

Our finding of decreasing exponent values (flattening power spectrum) as a function of postnatal age is consistent with several previous EEG studies, such as a longitudinal study in infants28, and several cross-sectional studies during early childhood to adolescence29,30, and into early-adulthood31. However, these associations between age and aperiodic parameters are not necessarily linear throughout the lifespan, as McSweeney et al.43 demonstrated quadratic age-related changes of offset and exponent values during childhood (in 4–11 year-olds).

Our findings suggest that after birth, in the very early stages of brain maturation, the distribution of aperiodic activity (power of BOLD signal) on different frequencies changes (flattens). This could reflect the emergence of more widespread or global neural processes during aging, or, as discussed above, changes in brain volume or myelination. In EEG, changes in the steepness of the slope of the power spectrum have been thought to reflect cortical synaptic excitation–inhibition (E–I) balance54, undergoing changes from the very early stages of development onwards55. In animal studies, a steeper spectral slope (higher exponent) has been correlated to reduced E–I balance in macaque and rat cortices, possibly due to greater gamma-aminobutyric acid (GABA) synapse density54. The aperiodic exponent has been implicated in pathology in attention deficit hyperactivity disorder (ADHD)56,57, and autistic traits58. An optimal E–I balance may play a crucial role in typical brain development, as medical treatment has been shown to lead to the “normalization” of E–I balance (exponent) in ADHD children57. More studies examining the age-related changes of the periodic and aperiodic parameters of the fMRI BOLD signal, especially longitudinal ones, are required.

Effects of prematurity

Prematurity was negatively associated with the exponent (Table 2: estimate = –0.17, SE = 0.08, p = 0.036). This means that exponent values were statistically significantly smaller (–0.17 difference) in neonates born preterm compared to term-born neonates.

Prematurity was almost statistically significantly associated with offset (Table 3: estimate = −0.09, SE = 0.05, p = 0.057). If true, this would mean that offset values are, in general, 0.09 smaller in preterm neonates compared to term-born neonates. However, the interaction effect of prematurity and ROI was statistically significant (Table 3: estimate = 0.06, SE = 0.03, p = 0.048), indicating that the effect of prematurity is even weaker in precentral gyri. To see the differences in predicted aperiodic parameters between the term-born versus preterm neonates visualized (see Fig. 2).

Fig. 2: Predicted aperiodic parameters in term-born and preterm neonates by sex.
Fig. 2: Predicted aperiodic parameters in term-born and preterm neonates by sex.
Full size image

Predicted values of aperiodic parameters from linear mixed-effects models for the precentral and postcentral gyri, conditioned on random effects. Green denotes males and orange denotes females. Vertical lines present 95% confidence intervals around the average predicted value for each group A Aperiodic exponents (N = 599 biologically independent samples; 2370 observations after removal of 26 outliers). B Aperiodic offsets (N = 599 biologically independent samples; 2396 observations).

For the cortical and subcortical projected estimates of prematurity from the linear regression models (exponent/offset) across all 90 different brain areas (see Fig. 3). After FDR correction, the effect of prematurity on aperiodic exponent remained statistically significant only in two brain areas: right hippocampal (adjusted p = 0.009) and right parahippocampal (adjusted p = 0.029) areas. No statistically significant effects of prematurity on aperiodic offset remained in any of the brain areas. All estimates, their p-values, and FDR corrected p-values are presented in Supplementary Tables S3 and S4.

Fig. 3: Estimates of prematurity.
Fig. 3: Estimates of prematurity.
Full size image

Linear regression estimates for prematurity (term-born = 0, preterm = 1) mapped onto cortical and subcortical regions defined by the Automated Anatomical Labeling (AAL90) atlas. The color gradient indicates estimated magnitude (red = larger, blue = smaller). A Aperiodic exponent (N = 599). B Aperiodic offset (N = 599).

In our study, we observed smaller exponents in preterm neonates compared to term-born neonates, indicating a more flattened BOLD power distribution across frequencies. A parallel trend was seen with offsets, but the association did not reach statistical significance. Given the increasing trend in exponent values during postmenstrual aging, our findings likely reflect the shorter duration of pregnancy in preterm compared to term-born neonates. Our finding is consistent with a previous fMRI BOLD signal study demonstrating altered low-frequency fluctuation amplitudes in preterm newborns, particularly in the motor and primary sensor cortices59. As discussed earlier, a flattened power spectrum, at least according to prior EEG literature, may indicate a shift towards excitation, reflecting the changes in the E–I balance. Prior research has shown that preterm-born children often face a delay in inhibition abilities, though most behavioral studies suggest that they catch up with their peers by late childhood60. Alternatively, smaller exponents may reflect more rapid postnatal neural maturational processes in preterm versus term-born neonates. Follow-up studies are needed to investigate whether the trajectory of aperiodic parameters continues to evolve throughout the lifespan.

Effects of sex

In females, aperiodic exponents were smaller (Table 2: estimate = –0.09, SE = 0.03, p = 0.006) and aperiodic offsets higher compared to males (Table 3: estimate = 0.04, SE = 0.02, p = 0.038). No statistically significant interactions between the child’s sex and ROI were found.

Sex-dependent differences in brain development have been described in previous studies, including increased functional connectivity of the visual association network in female infants25 and higher fractional anisotropy in females within the posterior and temporal white matter in 5-year-olds61. Our findings align with these studies, tentatively suggesting faster brain maturation (stronger flattening of the power spectrum) in females. Further studies are needed to examine whether these sex-specific differences persist into later childhood.

Regional and hemisphere-specific differences

The offset values were higher in precentral gyri when compared to postcentral gyri (Table 3: estimate = 0.12, SE = 0.01, p < 0.001). There was no statistically significant association between ROI (postcentral/precentral gyri) and exponent. In the right hemisphere, offset values were negligibly 0.01 higher than in the left hemisphere (Table 3: estimate = 0.01, SE = 0.01, p = 0.048). Exponent values were not associated with the side of the hemisphere (right/left).

Machine learning postmenstrual and postnatal age prediction

Elastic net machine learning model performance was estimated separately using only offset-based predictors, only exponent-based predictors, and both types of predictors at once. Performance was also estimated separately for postnatal age and PMA at the time of scan as target variables, as well as for a subset of the data containing only the term-born participants and all participants to control for the effect of preterm birth. For an estimate of the model’s generalization ability, the mean values of the repeatedly cross-validated test and train set performance estimates over the cross-validation folds were reported in terms of the coefficient of determination (R²) and the mean absolute error (MAE) in weeks of age. The performance estimates for each of the different analysis settings are presented in Tables 46.

Table 4 Cross-validated performance estimates for analyses predicting postnatal and postmenstrual age at scan using offset features
Table 5 Cross-validated performance estimates for analyses predicting postnatal and postmenstrual age at scan using exponent features
Table 6 Cross-validated performance estimates for analyses predicting postnatal and postmenstrual age at scan using both offset and exponent features

The cross-validated mean performance scores associated with the different settings of input and output variables demonstrate moderate to relatively high test performance for all such settings (Test R²’s 0.20–0.41). Offset predictors yielded generally somewhat higher performance estimates (0.24–0.36) for predicting both types of target variable than exponent predictors (0.20–0.36). The models trained with both types of predictors at once performed better than either type separately (0.24–0.41), implying that the two aperiodic parameters carry complementary information related to brain age that the model can effectively leverage for better predictive performance. As for the target variables, the models predicting postnatal age seem to perform better on term-born only subsample (0.36–0.40) as compared to full sample (0.22–0.24), whereas models predicting PMA seem to perform better on full sample (0.20–0.41) than term-only subsample (0.30–0.35), with the exception of exponent predictor model predicting PMA (0.20). Comparing mean train performance scores to test performance scores demonstrates some amount of model overfitting despite the regularization mechanism inherent to the elastic net.

For model interpretation via feature importance analysis, the mean elastic net model coefficients were computed across the cross-validation folds. Figures 4 and 5 display the mean model coefficients across cross-validation folds mapped onto cortical and subcortical regions of interest for models predicting postnatal and postmenstrual age in full-term participants, based on aperiodic exponent (Fig. 4) and offset (Fig. 5) variables. Supplementary Figs. S2 and S3 present the top-ranking positive and negative mean coefficients, while Supplementary Figs. S4S11 present the complete reports on the model coefficients for each analysis setting.

Fig. 4: Elastic net coefficients for predicting age from aperiodic exponents.
Fig. 4: Elastic net coefficients for predicting age from aperiodic exponents.
Full size image

Mean elastic net regression coefficients over cross-validation folds for predicting age variables in term-born participants using aperiodic exponent parameters, mapped onto cortical and subcortical regions defined by the Automated Anatomical Labeling (AAL90) atlas. Color gradient indicates the magnitude of coefficients (red = larger, blue = smaller). A Prediction of postnatal age using exponents (N = 483). B Prediction of postmenstrual age using exponents (N = 483).

Fig. 5: Elastic net coefficients for predicting age from aperiodic offsets.
Fig. 5: Elastic net coefficients for predicting age from aperiodic offsets.
Full size image

Mean elastic net regression coefficients over cross-validation folds for predicting age variables in term-born participants using aperiodic offset parameters, mapped onto cortical and subcortical regions defined by the Automated Anatomical Labeling (AAL90) atlas. Color gradient indicates the magnitude of coefficients (red = larger, blue = smaller). A Prediction of postnatal age using offsets (N = 483). B Prediction of postmenstrual age using offsets (N = 483).

In our machine learning data-analysis, we found that both aperiodic parameters of the BOLD signal can be used to predict neonate age with moderate to relatively high performance (R²’s 0.20–0.41), providing support for the viability of the aperiodic component of the rs-fMRI BOLD signal as a measure of brain maturation. The offset parameter generally yielded slightly higher prediction performance estimates than the exponent parameters, and using both kinds of parameters further improved the prediction performance estimates. Previously, Kardan et al.62, predicted infants’ postnatal age from rs-fMRI data as a regression task with high performance (median R2 = 0.51), but it should be noted that the dataset differed from that of the present study, especially in that the subjects’ ages were considerably higher and spanned a wider range (8–26 months). The majority of preceding studies in predicting neonate or infant age have been performed using structural23,63,64 or multi-modal65,66,67 MRI data. The studies focusing on predicting neonate or infant age using functional MRI are often implemented as classification tasks68,69, with Smyser et al.69 also modeling age from gestation at term age as a regression task. Regarding regional associations between rs-fMRI and age, Dosenbach et al.70 found that the anterior prefrontal cortex and precuneus contribute most to age predictions on older subjects between 7 and 30 years. Recent efforts in machine learning based brain age prediction using the dHCP resting-state fMRI dataset, specifically, have resulted in high predictive performance (r’s = 0.42–0.69), further demonstrating the viability of a functional brain age prediction framework26,71. Brain age prediction has many potential uses as a means to gain insights into normal development, and the brain age estimates in turn can be used as a predictor for various brain conditions and disorders, to be further used in the clinical context72,73. Systematic bias in age estimations has, however, been reported74, inviting for more comprehensive models to be used to avoid overestimation of age in young subjects.

In this cross-sectional study, we used the aperiodic parameters of the BOLD signal power spectrum as a prospective index of brain maturation. Prematurity is associated with lower offsets and exponents of aperiodic activity compared to term-born neonates. This reflects a flatter distribution of aperiodic signal activity (i.e., flattening of the power spectrum) across frequencies. The difference between preterm vs. term-born neonates was narrowly non-significant in offsets (p = 0.057) and significant in exponents (p = 0.036). The offsets were positively associated with PMA and negatively with postnatal age, while for the exponents, the association with PMA was not statistically significant, and the association with postnatal age was strongly negative. Our findings indicate an increase in aperiodic activity with postmenstrual aging, followed by a decrease in the postnatal period. Finally, in our machine learning analyses, we observed relatively strong associations (R²’s = 0.20–0.41) between aperiodic parameters of the rs-fMRI BOLD signal and neonate age, demonstrating the feasibility of the methodology for studying functional brain maturation. The ‘conventional’ statistical testing and machine learning are complementary approaches and should not be compared directly. For instance, PMA was not associated with the exponent in the somatosensory area (Table 2), whereas in the case of elastic net regression, the exponent showed substantial prediction performance (Table 5). This is likely explained by the higher dimensionality of the data used to train the machine learning model, allowing the model to use all available information from all regions of interest, while the statistical model was limited to the preselected regions of interest.

The developing cortex exhibits spontaneous, intermittent neuronal activity, which is believed to play a critical role in the maturation and wiring of early brain networks75,76. In preterm infants, these spontaneous activity transients evolve with development and are reflected in changes in EEG measures such as decreased low-frequency power and reduced waveform variance77. In the fMRI domain, Fransson et al.10 demonstrated that spontaneous low-frequency BOLD fluctuations in term-born infants are organized around cortical hubs, primarily located in primary sensory and motor regions. They suggested that these early hub structures could possibly gradually reorganize and expand into higher-order association networks across development.

Although the exact neural origin of low-frequency BOLD fluctuations remains unresolved, evidence from a simultaneous EEG-fMRI study in adults shows that changes in aperiodic EEG power (1/f component) are coupled with hemodynamic fluctuations in auditory-salience-cerebellar and prefrontal regions39. Given this cross-modal correspondence and the fact that both EEG and BOLD signals in neonates exhibit structured, non-random dynamics, it is reasonable to infer that the aperiodic component of the neonatal BOLD signal carries neurophysiologically meaningful information. The application of novel derived brain metrics to predict age parameters in neonates is a common approach to test the initial validity of the derived brain metrics78,79,80. Nevertheless, we emphasize that this study lacks simultaneous or parallel EEG data collection that would allow us to characterize the links between EEG and fMRI aperiodic components. This remains an important topic for future studies.

It should be noted that our study design is cross-sectional and, therefore, no assumptions about individual changes in the pace of brain maturation can be made. However, owing to our large sample size, we are confident that our group-level findings are generalizable to a larger population. While the cross-validated test performance estimates demonstrate relatively high generalization ability of the elastic net model, it should be noted that the training performances are considerably better, implying overfitting despite model regularization81. Among newborn infants, sleep-wake patterns start to evolve soon after birth, first changing rapidly during the first few months after birth from relatively quiet and active sleep phases to structured non-rapid eye-movement and rapid eye-movement sleep82,83,84. Their sleep is dispersed unevenly throughout the day85, and uncertainty as to the state of wakefulness or the stage of sleep at the time of scan can complicate the interpretation of fMRI86. Because such an effect is very difficult to avoid in this age group, we assume that changes in aperiodic parameters are related to aging, premature birth, and sex of the neonates. It has been demonstrated that functional connections strongly related to age as a target variable are more intercorrelated than expected by chance, providing redundant information, ultimately resulting in unreliable weights87. While our study used BOLD signal parameters as features, rather than functional connections, a similar limitation may apply to the interpretability of our analysis as well. Finally, in our data set, the variation of the postnatal age in neonates was low, due to the choice of performing the scans close to birth. This led to a highly skewed distribution of the infant age data. However, the visually inspected diagnostics of the linear regression models were acceptable.

In this study, we showed differences in aperiodic parameters of the rs-fMRI BOLD signal in preterm vs. term-born neonates and between males and females. We also demonstrated the applicability of those aperiodic parameters for the purpose of machine learning-based neonate age prediction with relatively high performance (R² = 0.41) and identified the most important ROIs to that end. Based on our results, we consider the rs-fMRI BOLD signal and its aperiodic component a viable functional brain metric for quantifying brain maturation in neonates. In addition to that, we hope that our results will enable more precise surveying of the neural origins of developmental conditions and disorders.

Methods

The present study used preprocessed data released as part of the Developing Human Connectome Project (dHCP) data release 3.0, and the methods are described in detail previously25.

Research participants were prospectively recruited as part of the dHCP, an observational, cross-sectional Open Science program approved by the UK National Research Ethics Authority (14/LO/1169). Written consent was obtained from all participating families prior to imaging. All ethical regulations relevant to human research participants were followed. Term-born neonates were recruited from the postnatal wards and approached on the basis of being clinically well. Preterm-born neonates were recruited from the neonatal unit and postnatal wards. Neonates were not approached for study inclusion if there was a history of severe compromise at birth requiring prolonged resuscitation, a diagnosed chromosomal abnormality, or any contraindication to MRI scanning (e.g., due to incompatible implants). No neonates included in the final study group required treatment for a clinically significant brain injury. The third data release includes a total of 887 datasets from 783 neonates, including healthy term-born neonates, preterm neonates, and neonates at high risk for atypical neurocognitive development88.

Participants

The openly available dHCP data set (third release) consists of 887 rs-fMRI scans from 783 neonatal research subjects between 26–45 weeks of post-menstrual age (359 females), 205 of whom were born preterm. Minor incidental findings were not considered an exclusion criterion. For a more detailed description of the data set, see Edwards et al.88. 105 scans were flagged for failed fMRI quality control and excluded from the present analyses. Further, 129 scans were from non-singleton participants and were excluded from the present analyses to avoid indirect information leakage during cross-validation. Finally, we only included the last non-flagged scan from each participant, resulting in 605 unique, non-flagged scans from singleton participants. The spectral parameterization model fit failed for six participants, so the analyses were performed on a subset of 599 participants (276 female, PMA 27–45 weeks), 116 of them preterm-born.

fMRI data acquisition and preprocessing

The MR imaging was carried out using a 3T Philips Achieva scanner running modified Release 3.2.2 software. The length of the fMRI acquisition was 15 min and was collected with parameters: TE/TR = 38/392 ms, 2300 volumes, with an acquired spatial resolution of 2.15 mm isotropic88. Participants were scanned non-sedated using Neonatal Brain Imaging System (NBIS), consisting of a dedicated 32-channel array coil and a positioning device89. The functional data imaging acquisition used an optimized multiband sequence tuned for neonatal participants90, with phase-optimized multiband pulses used throughout91. For detailed descriptions of the automated processing pipeline and the motion and distortion correction techniques applied to all dHCP open access pre-processed fMRI data, see Fitzgibbon et al.45 and Andersson et al.92,93,94,95.

Derived brain measures

An algorithmic method known as Fitting Oscillations and One-Over-F (FOOOF) for parameterizing the power spectrum into periodic and aperiodic component parameters has been suggested27. While the periodic component is characterized by such parameters as its center frequency, power, and bandwidth, the 1/f-like aperiodic component is characterized by its offset and exponent parameters. The performance of the algorithm in characterizing neural power spectra has been validated against both simulated power spectra with known ground truth and human expert labeling on real EEG and simulated data, demonstrating a low error rate on both test arrangements27.

The FOOOF algorithm parameterizes the power spectrum into its periodic and aperiodic components by creating an initial fit of the aperiodic component and subtracting it from the total power spectrum, resulting in a flat spectrum, from which, in an iterative process, maxima are identified as peaks and used for fitting Gaussians, which are then subtracted from the flattened spectrum. When no further peaks surpassing a threshold value can be identified, a final periodic multi-Gaussian model is fit using all maxima thus identified. The periodic model is subtracted from the original spectrum, resulting in a noisy aperiodic component, to which the final aperiodic model is fit. The aperiodic component of the model is characterized by offset and exponent parameters according to the following formula:

$$L\left(F\right)=b-{\log }_{10}\left({F}^{X}\right)$$
(1)

where F refers to frequency, b to the offset and X to the exponent parameter, which are resolved by fitting the model as described above. Notably, because the power values are presented on a logarithmic scale, negative offset values are possible. For a detailed description of the fitting procedure of the whole model, see “Algorithmic parameterization” under “Methods” in the original publication of the algorithm27. For each time series representing an ROI specific BOLD signal, we estimated and parameterized the power spectrum between 0.01 and 0.15 Hz using multitaper96 spectral analysis, and the FOOOF algorithm (Python implementation version 1.1.0), resulting in parameters for the periodic and aperiodic components of the signal, as well as an R² score for the goodness of fit for each ROI. For a summary on frequency range selection in earlier studies on BOLD signal, see Glerean et al.97. In choosing the FOOOF model parameters, we did not set a maximum width for the oscillation peaks, while the minimum width of oscillatory peaks to detect was set to 0.001 Hz. Due to the sparsity of prior relevant research into the characteristics of oscillatory peaks in the rs-fMRI BOLD signal power spectrum, we used a prior work studying the rs-fMRI BOLD signal power spectrum in white matter to guide us, setting the threshold value for oscillatory peaks to 1.5 SD over the mean, as well as the maximum number of peaks to find to two98.

Spectral parameterization of the fMRI BOLD signal was performed for 90 ROIs of the AAL90 atlas99, encompassing both cortical and subcortical gray matter. From these regions, the precentral (motor) and postcentral (somatosensory) gyri were selected for further statistical analyses. These regions were chosen due to their early functional maturation in infancy and because the aperiodic model fit was particularly robust in these areas, with high mean R² values (0.94 for the precentral and 0.93 for the postcentral gyri). The pre- and postcentral gyri are located anterior and posterior to the central sulcus, respectively, and can be identified in Fig. 1 by their dark red coloring, signifying high goodness of fit. Finally, all of the regions were used in a machine learning-based regression analysis.

Linear mixed-effect regression models

Statistical analyses were performed using RStudio (2022.07.1+554) and JASP (2022, version 0.16.3). To estimate the associations between the aperiodic parameters (exponent or offset) of the fMRI BOLD signal and preterm birth, sex, PMA at scan, and postnatal age, the lmer function from the afex package of RStudio was used. Linear mixed effects regression model analyses were performed using maximum likelihood estimation. Fixed effects were PMA at scan, postnatal age, sex, and premature birth, as well as hemisphere (right/left). Interaction terms with ROI (precentral gyri/postcentral gyri) were included. In addition, in the offset model (Model 2) exponent was included as a fixed effect to control the possible effects of exponent changes on offset values. The intercepts of subjects were modeled as random effects, including random slopes for the motion outliers. The linear mixed effects regression models had the following structure:

Model 1: Exponent  PMA at scan*ROI + Age*ROI + Sex*ROI + Prematurity*ROI + Hemisphere+(1 + Motion|ID)

Model 2: Offset  PMA at scan*ROI + Age*ROI + Sex*ROI + Prematurity*ROI + Hemisphere + Exponent+(1 + Motion | ID)

PMA at scan means the postmenstrual age (PMA) of the neonate at scan (weeks; z-transformed), and Age means the postnatal age of the child (weeks; z-transformed). Sex means the sex of the child (male = 0, female = 1), and Prematurity means term versus preterm birth (term-born = 0, preterm = 1; postmenstrual age at birth < 37 weeks). ROI means region-of-interest, either postcentral (=0) or precentral (=1) gyri, and Hemisphere means left (=0) or right (=1) side of the brain. Motion means the number of motion-compromised volumes (outliers based on DVARS; z-transformed). In Model 2, the exponent was z-transformed.

The models were originally run with only the main effects of independent variables included, but as interaction terms with ROI were added into the models, the fit of the models improved (based on lower Akaike information criterion (AIC) and statistically significant result of the likelihood ratio test before vs. after interaction term inclusion). The Prematurity variable had moderate correlation with other age variables (for the correlation matrix of all variables included in models: see Supplementary Fig. S12), but a lower AIC (when Prematurity was included) supported the inclusion of Prematurity as a predictor. Likelihood ratio test (Prematurity not included vs. included) was not statistically significant in Model 1 (p = 0.09) and in Model 2 (p = 0.06). The possibility of singularity of the models was checked and was not detected.

Further, the model assumptions were visually assessed for both models. At first, the Q–Q plot of Model 1 showed a slightly S-shaped pattern, while other visual assessments supported normal distribution of the residuals. After residual outlier detection and exclusion (±3 SD, N = 26), the Q–Q plot improved. In Model 2, visual assessments supported normal distribution of the residuals, and the need for outlier exclusion was not established. For more information about model diagnostics (see Supplementary Figs. S13S16 and Tables S5, S6).

Linear regression models

To map the estimates of Prematurity (term-born = 0, preterm = 1) for aperiodic parameters (exponent or offset) of fMRI BOLD signal across 90 different cortical and subcortical brain areas, the linear models following the formulas of the linear mixed effect regression models described above were used. For a more detailed description, see “Linear regression models” in the Supplementary material.

Machine learning models

We predicted the participants’ PMA and postnatal age as target variables using a supervised machine learning regression model. For each target variable, we trained and tested a regression model with either exponent, offset, or both kinds of parameters from all ROIs as predictors using a repeated and nested 10-fold cross-validation algorithm. To control for the effect of preterm birth, we trained and tested the regression models separately for the full sample and term-born only participants. In addition to the exponent and offset parameters of the aperiodic component of the power spectra of the BOLD signals, no additional predictors were introduced into the machine learning models. Performing the machine learning-based regression analysis separately using three sets of predictors (i.e., offset, exponent, and both) on two subsets (term-born only and all participants) of the data in predicting two different age target variables (postnatal and postmenstrual age at scan) resulted in 12 predictive models in total.

We used elastic net100, an l1- and l2-regularized linear regression-based machine learning model to make predictions on the target variables and rank the predictors for interpretation using the model's beta coefficients. While l1-regularized regression (aka. LASSO) performs feature selection by enforcing sparsity, thus making the model more interpretable, it can lead to model instability if the features are highly correlated, which typically is the case with neuroimaging data86. In contrast, elastic net accomplishes a grouping effect via l2-regularization, which is known to correct the instability-inducing tendency for arbitrary choice between correlated predictors101 and is considered a standard predictive algorithm in machine learning based neuroimaging studies86. Elastic net finds the optimal model coefficients by minimizing \(\beta\) in the following formula:

$${\min}_{\beta }\left(\frac{1}{2N}\right){|{{{\rm{X}}}}{{{\rm{\beta}}}} -y|}_{2}^{2}+\alpha * \rho {|{{{\rm{\beta}}}} |}_{1}^{1}+\left(\frac{\alpha (1\,-\rho )}{2}\right) * {|{{{\rm{\beta}}}} |}_{2}^{2}$$
(2)

where β stands for the beta coefficients (aka. weights) of the model, α stands for the overall regularization strength and ρ stands for the ratio of l1-norm used in the regularization.

To estimate the models’ predictive performance on previously unseen data with low bias and variance, we used 10-times repeated 10-fold cross-validation with nested 5-fold cross-validation for the purpose of grid search-based hyperparameter tuning, the hyperparameters in question being the ”α” and ”ρ” in the elastic net formula (2) (i.e., ”alpha” and ”l1_ratio” respectively, in the scikit-learn ElasticNetCV implementation of the algorithm). The beta coefficients of the trained models were averaged over folds and ranked based on the average coefficient value for the purpose of feature importance analysis.

Prior to fitting the elastic net model, the data were standardized by subtracting the mean and scaling to unit variance—a recommended preprocessing step that is considered to have the effect of increasing the interpretability of the model86. This preprocessing operation was performed during cross-validation in order to avoid data leakage from the test set to the model, which could occur if such preprocessing were performed prior to splitting the data.

Statistics and reproducibility

Statistical analyses were conducted using RStudio (version 2022.07.1 + 554) and JASP (version 0.16.3, 2022). No power analysis was conducted to determine sample sizes in these experiments, as the effect sizes were unknown. The sample consisted of N = 599 neonates. In the linear mixed-effect models, each participant contributed repeated measurements across two hemispheres (left/right) and two regions of interest (ROIs: precentral/postcentral gyri), resulting in 4 observations per subject. These repeated measures were treated as replicates, defined as multiple observations from the same subject under different anatomical conditions (ROI and hemisphere). Residual outliers (±3 SD) were excluded where necessary (Model 1: N = 26 observations removed, Model 2: N = 0). Model comparisons were conducted using likelihood ratio tests and Akaike Information Criterion (AIC). To map the estimates of prematurity (preterm/term-born) for aperiodic parameters (exponent and offset) across all 90 cortical and subcortical regions, linear regression models were applied, and mapped effects were corrected for multiple comparisons using the false discovery rate (FDR) method.

Machine learning analyses were conducted using Python 3.8.20102 with external libraries Numpy 1.23.5103, Pandas 2.0.3104, Neurokit2 0.2.7105, and Scikit-learn 1.2.2106 in an interactive Jupyter notebook environment. The sample sizes for machine-learning analyses were 599 (full sample) and 483 (term-born only). The model evaluation was performed using 10-times repeated 10-fold cross-validation, and the elastic net hyperparameters (alpha and l1-ratio) were optimized using five-fold nested cross-validation. Random seed was fixed (random_state = 42) for the repeated k-fold cross-validation and the ElasticNetCV predictive model to ensure reproducibility.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.