Introduction

Pregnancy is a period characterized by profound changes, including hormonal surges, increased blood volume, heightened oxygen consumption, as well as morphological adjustments such as uterine expansion and relocation of organs1,2,3. These adaptations support fetal development and preparation for labor and delivery. The body’s response to pregnancy may also change the pregnant person’s brain, potentially as adaptive, nonadaptive, or merely evolutionary byproducts4,5. However, we know remarkably little about pregnancy-related brain changes. This knowledge gap regarding how, why, and to what extent the brain changes during pregnancy is, at least in part, attributable to historical reluctance of medical studies to include pregnant individuals due to concerns over potential risks and ethical complexities6,7,8,9. However, magnetic resonance imaging (MRI) has been used clinically among pregnant patients for the last four decades10, and numerous studies find no evidence that MRI is associated with increased risk for adverse effects on pregnant individuals or their offspring (e.g., congenital anomalies, neoplasm, or vision or hearing loss)11,12,13. Given that approximately 85% of US women give birth in their lifetimes14 and the existence of safe imaging procedures, it is critical to identify how a pregnant person’s brain changes across pregnancy.

Non-human animal studies of pregnancy report diminished hippocampal cell proliferation15,16 and reduced microglial cell density17, both of which may manifest as detectable reductions in gray matter volume. Research on structural brain changes associated with pregnancy in humans remains limited. While several studies suggest potential reductions in brain volume during pregnancy18,19,20,21,22, these findings, however, come with limitations. For example, only four participants were assessed repeatedly (3–7 times) during pregnancy in Oatridge et al.21, and anatomical scans were collected via a 1.0-T MRI system, which has relatively low signal-to-noise ratio (SNR) and spatial resolution. Further, neither Hoekzema et al.18,19, which compared women’s brains before conception to 2-3 months postpartum, nor Paternina-Die et al.20, which examined differences between pregnant individuals and nulliparous controls at late pregnancy and early postpartum, addressed changes occurring within pregnancy. More recently, Pritschet et al.22 leveraged precision imaging to map neuroanatomical changes in a single individual from preconception through two years postpartum, suggesting decreases in gray matter volume and cortical thickness across the brain during pregnancy. While this work offers intriguing evidence of remarkable neuroplasticity during pregnancy, the findings are based on a single subject, which limits the generalizability and scope of broader inferences. As such, the current literature regarding brain changes during pregnancy remains inconclusive. Within-subject longitudinal studies are essential for mapping brain dynamics throughout gestation, offering insights into pregnancy-related neuroplasticity.

In terms of potential pregnancy-related changes in white matter microstructure, non-human animal research has observed an increase in myelinated axons during pregnancy, potentially resulting from augmented proliferation of oligodendrocytes—a type of brain cell vital for axonal myelination23,24,25. In particular, Chan et al.26 reported a general increase in diffusivity across the whole brain during pregnancy, as well as regional increases in mean diffusivity (MD) and fractional anisotropy (FA) that were derived from Diffusion Tensor Imaging (DTI). The current human studies, however, have generated inconsistent findings. Hoekzema et al.19 found no significant changes in white matter diffusion metrics derived from DTI or in white matter volume when comparing pregnant women to nulliparous controls. In contrast, Pritschet et al.22 observed nonlinear increases in white matter quantitative anisotropy across the brain—indicating greater tract integrity—as gestational weeks progressed in a single subject.

In vivo longitudinal studies are crucial for detecting and understanding the dynamics of brain changes during pregnancy. Moreover, there are increasing calls for a more inclusive approach to biomedical research that recognizes and investigates the distinctive biological experiences of females27,28,29. Responding to this need, we initiated a pilot longitudinal study of pregnant people (n = 10, assessed 1–6 times [median = 3], spanning 12–39 gestational weeks). We used anatomical MRI to assess brain morphometry (e.g., gray matter volume and cortical thickness). White matter microstructure was evaluated with multi-shell diffusion-weighted MRI and advanced neurite orientation dispersion and density (NODDI)30 modeling, which provided metrics such as the neurite density index (NDI) and orientation dispersion index (ODI). The primary goals of the current study were twofold. The first objective was to chart the brain structural changes occurring during pregnancy. The second goal was to explore potential mechanisms driving these changes, particularly focusing on pregnancy hormones, inflammatory cytokines, and the intracranial cerebrospinal fluid (CSF) volume.

Results

Brain changes during pregnancy

Brain morphometry changes

Covarying for total intracranial volume (TIV) and age, total brain volume decreased across pregnancy (β = −0.105, 95% CI [−0.169, −0.040], p = 0.004). This translates to a 2.6% decline over the observed gestational range in this study (i.e., 12–39 weeks). We further conducted leave-one-out analyses, which resulted in estimated β values that ranged from −0.184 to −0.090. Similarly, total gray matter volume exhibited decreases across pregnancy (β = −0.192, 95% CI [−0.301, −0.082], p = 0.002). This translates to a 3.9% decline over the observed gestational range in this study (see Fig. 1). With leave-one-out analyses, β values ranged from −0.328 to −0.179. There were decreases in total cortical volume (β = −0.238, 95% CI [−0.374, −0.102], p = 0.002) and in subcortical volume (β = −0.094, 95% CI [−0.196, 0.008], p = 0.068), although the latter was not statistically significant. Additionally, there was a large decrease in cortical thickness (β = −0.516, 95% CI [−0.915, −0.116], p = 0.015), yet cortical surface area remained unchanged (β = −0.018, 95% CI [−0.052, 0.017], p = 0.297). These results suggest that the observed reduction in total gray matter volume may primarily result from decreases in cortical volume, particularly cortical thickness (see Fig. S2 for detailed statistics). Lastly, total white matter volume did not exhibit a statistically significant change across pregnancy (β = −0.031, 95% CI [−0.120, 0.057], p = 0.460). All findings were consistent using leave-one-out analyses, as detailed in Table S2.

Fig. 1: The observed decrease in brain volumes throughout the course of pregnancy.
figure 1

The plots are based on the linear mixed-effects model shown in Eq. (1). Specifically, brain metrics were the dependent variables, and gestational week was the predictor, covarying for age and total intracranial volume. GM: gray matter, WM: white matter. The shaded area represents the 95% confidence interval (95% CI) for the fitted line. Note. **p < 0.01.

Changes in cortical regions

To further characterize the reduction in total cortical volume, we fitted linear mixed-effects (LME) models for all Desikan–Killiany atlas regions, using gestational week as the fixed effect and hemisphere, TIV, and age as covariates. A surface map with the standardized regression coefficients for all cortex regions is shown in Fig. 2a. Overall, we observed a consistent trend of decreasing cortical volume in most cortical regions, except for three regions in the occipital lobe (pericalcarine, lingual, and cuneus) and two regions in the medial temporal lobe (entorhinal and temporal pole). Statistically significant reductions in cortical volume were observed in prefrontal cortex regions, including the medial orbitofrontal, superior frontal, and lateral orbitofrontal cortex, as well as two regions surrounding the central sulcus: the precentral and postcentral cortex (Fig. 2b). Detailed statistical reports, including leave-one-out analyses consistent with the full sample results, are provided in Table S3.

Fig. 2: Illustration of the association between cortical region volumes, as defined by the Desikan-Killiany atlas, and gestational week.
figure 2

a The figure is color-coded to reflect the standardized regression coefficients derived from linear mixed-effects models, which were fitted for each cortical region volume against gestational week while covarying for age and total intracranial volume (Eq. 2). b The figure selectively presents the cortical regions that exhibited a significant volume decrease throughout pregnancy. No multiple comparison correction was implemented.

White matter microstructure changes

In our investigation of changes in white matter microstructure, we implemented LME models for each of the NODDI metrics (NDI and ODI) across each of the 50 tracts generated by TractSeg with gestational week as the fixed effect and hemisphere (where applicable), age, and relative motion as covariates. To facilitate the understanding of the global pattern of metric changes, a heat map was generated, representing the standardized regression coefficients for all segmented tracts (see Fig. S3). Throughout the gestation period, we observed a predominant increase in NDI across most tracts, whereas ODI metrics remained relatively stable. Notably, the most pronounced increases in NDI were observed in projection tracts such as the corticospinal tract (CST), fronto-pontine tract (FPT), parieto-occipital pontine (POPT), in addition to the striato-premotor tract (ST_PREM) and arcuate fascicle (AF) (see Fig. 3). In line with these NDI findings, our analysis of DTI metrics revealed a predominant trend of increasing FA. Tracts that exhibited the most significant FA increases largely overlapped with those displaying notable NDI increases. Comprehensive statistical results for NDI and ODI metrics are available in Tables S6 and S7, and analyses of DTI metrics can also be found in the supplementary materials results section. Lastly, an examination of average measures across all segmented tracts revealed no significant global changes in NDI and ODI metrics during pregnancy, consistent with our secondary method for assessing global diffusion metrics (see supplementary materials results section).

Fig. 3: Illustration of the association between Neurite Density Index (NDI) and the progression of pregnancy.
figure 3

The figure is color-coded to represent the standardized regression coefficients from linear mixed-effects models fitted for NDI against gestational week while covarying for age and relative motion (Eq. 2). CST: Corticospinal Tract, FPT: Fronto-Pontine Tract, POPT: Parieto‐Occipital Pontine, SCP: Superior Cerebellar Peduncle, STR: Superior Thalamic Radiation, T_PREM: Thalamo-Premotor, ST_PREM: Striato-Premotor.

CSF changes

We observed a statistically significant increase in total intracranial CSF (β = 0.187, 95% CI [0.021, 0.353], p = 0.030) after covarying for age and TIV (see Fig. 4). Using a leave-one-out cross-validation approach, β values ranged from 0.145 to 0.267 (see Table S4). This pattern was consistent when varying the threshold of CSF volume fraction between 60% and 95% (see Table S5).

Fig. 4: The increase in total intracranial cerebrospinal fluid (CSF) was observed across pregnancy.
figure 4

The plot is based on the linear mixed-effects model shown in Eq. (1). Specifically, CSF was the dependent variable and gestational week was the predictor, covarying for age and total intracranial volume. The shaded area represents the 95% confidence interval (95% CI) for the fitted line. Note. *p < .05.

Associations between brain dynamics and salivary hormones in pregnancy

Salivary hormones and inflammatory cytokine changes during pregnancy

Covarying for age, we observed statistically significant increases in waking salivary levels of progesterone (β = 0.717, 95% CI [0.391, 1.042], p < 0.001), estradiol (β = 0.792, 95% CI [0.533, 1.050], p < 0.001), and testosterone (β = 0.393, 95% CI [0.061, 0.724], p = 0.024) across pregnancy (see Fig. 5a). However, the increase in cortisol levels was not statistically significant (β = 0.198, 95% CI [−0.014, 0.410], p = 0.065). Additionally, we did not observe statistically significant changes in C-reactive protein (CRP) levels throughout the course of pregnancy (β = −0.067, 95% CI [−0.248, 0.114], p = 0.441).

Fig. 5: The observed changes in hormone levels during pregnancy and associations between progesterone levels and brain structural metrics.
figure 5

a The observed changes in hormone levels (progesterone, estradiol, testosterone, and cortisol) during pregnancy. Linear mixed-effects models were applied with hormone levels as the dependent variable and gestational week as the predictor, covarying for age. b Associations between progesterone levels and brain structural metrics, including total brain volume, total gray matter volume, total cortical volume, and cortical thickness. The plot is based on Eq. 3. Specifically, brain metrics were the dependent variable and progesterone was the predictor, covarying for age and total intracranial volume. The shaded area represents the 95% confidence interval (95% CI) for the fitted line. Note. **p < .01. ***p < .001.

Associations between salivary hormones and brain structure

After covarying for TIV and age, we found statistically significant negative associations between total brain volume and salivary levels of progesterone (β = −0.103, 95% CI [−0.170, −0.036], p = 0.006) and cortisol (β = −0.213, 95% CI [−0.393, −0.032], p = 0.025), indicating that increasing salivary levels of progesterone and cortisol are associated with decreasing total brain volume. Associations between total brain volume and either salivary estradiol or testosterone levels were not statistically significant (ps > 0.144). Additionally, total gray matter volume was negatively associated with salivary progesterone (β = −0.189, 95% CI [−0.305, −0.073], p = 0.003), but not with salivary estradiol, testosterone, or cortisol (all ps > 0.134). Total cortical volume (β = −0.254, 95% CI [−0.390, −0.118], p = 0.002) and mean cortical thickness (β = −0.691, 95% CI [−1.049, −0.333], p = 0.001) were also negatively associated with salivary progesterone (see Fig. 5b), but not with salivary estradiol, testosterone, or cortisol levels (all ps > 0.137).

Associations between salivary hormones and diffusion metrics

Fig. S8 shows the standardized regression coefficients of hormone levels across all segmented tracts, illustrating the associations between NDI and salivary hormone levels. Most notably, our results indicated that NDI across nearly all tracts was positively associated with salivary estradiol levels and moderately with salivary progesterone (see Fig. 6). Moreover, NDI demonstrated a negative link with salivary cortisol levels, while no significant association was found with salivary testosterone. Detailed statistical outcomes of these analyses are available in Tables S12S15.

Fig. 6: The association between Neurite Density Index (NDI) and hormone levels (specifically progesterone, estradiol, and cortisol).
figure 6

The color-coding reflects the standardized regression coefficients obtained from linear mixed effects models, which were used to analyze the relation between NDI and each of the three hormones while covarying for age and relative motion (Eq. 4). Note. *p < .05. **p < .01. ***p < .001. An asterisk in the color bar indicates the level of significance for standardized regression coefficients.

Associations between salivary hormones and CSF

Covarying for TIV and age, no statistically significant associations were detected between CSF volume changes and salivary hormonal levels, including progesterone (β = 0.130, 95% CI [−0.060, 0.319], p = 0.160), estradiol (β = 0.151, 95% CI [−0.060, 0.361], p = 0.143), testosterone (β = 0.153, 95% CI [−0.119, 0.425], p = 0.242), or cortisol (β = 0.187, 95% CI [−0.284, 0.668], p = 0.401; see Table S16S19).

Associations between CSF and brain structure

There were statistically significant negative associations between total intracranial CSF and total brain volume (β = −0.277, 95% CI [−0.452, −0.102], p = 0.004), total gray matter volume (β = −0.388, 95% CI [−0.678, −0.097], p = 0.013), and total cortical volume (β = −0.391, 95% CI [−0.754, −0.028], p = 0.037), suggesting that reductions in brain volumetrics may be associated with increases in total intracranial CSF volume. However, the link between total intracranial CSF and mean cortical thickness was not statistically significant (β = −0.515, 95% CI [−1.069, 0.039], p = 0.066).

Associations between CSF and diffusion metrics

Across all segmented tracts, there were no statistically significant associations between total intracranial CSF and NDI. Detailed statistical outcomes of these analyses are available in Table S20.

Discussion

In a sample of 10 pregnant participants who underwent multimodal MRI scans, hormone assessments, and inflammatory cytokine assessments between 1 and 6 times during pregnancy (median = 3 times), we explored whether, and to what degree, neurobiological changes occur during pregnancy. There were four main findings. First, there were pronounced reductions in overall brain and gray matter volume, while an increase was found in the total intracranial CSF volume. Second, using NODDI metrics to assess white matter microstructure, we found an increase in NDI across the majority of segmented tracts, with the most pronounced increase observed in tracts associated with sensorimotor processing. Third, there were significant correlations between pregnancy-related salivary hormones and observed brain changes (i.e., volumetric reduction in overall brain and gray matter, and increased NDI in segmented tracts). Fourth, the increase in the total intracranial CSF volume was associated with the decrease in brain volumetrics, but not with NDI changes.

The decreases in estimates of total brain volume in the present study were consistent with earlier evidence involving both humans18,19,20,21,22 and non-human animal models15,16,17. Comparing mothers before and after their first pregnancy, Hoekzema et al.18,19 reported a reduction in gray matter volume across multiple brain regions, notably within the anterior and posterior midline structures. Another study found that during late pregnancy, mothers exhibited smaller global cortical volume and thickness compared to nulliparous controls, with the most pronounced differences in the somatomotor, attentional, and default mode networks20. A recent study by Pritschet et al.22 focused on neuroanatomical changes from preconception to two years postpartum in a single individual, documenting significant reductions in gray matter volume and cortical thickness. Non-human animal studies suggest that the observed brain volume reduction may be linked to decreased microglial proliferation, as evidenced by the reduced microglial cell density in pregnant rats during late gestation and the early postpartum period compared to virgin rats17. Expanding upon previous pre- and post-pregnancy comparisons and cross-sectional analyses, our findings demonstrate a significant rate of reduction in gray matter volume during pregnancy. Additionally, we observed a significant correlation between CSF volume and overall brain and gray matter volumes. This association suggests that intracranial CSF volume increases during pregnancy, consistent with the prior report of ventricular volume expansion21, may exert pressure on the brain and compress the cortex, possibly contributing to the observed decreases in brain volume31,32.

Additionally, we observed an increase in NDI across tracts, especially those linked with sensorimotor processing, whereas ODI remained largely unchanged. NDI quantifies the tissue volume fraction occupied by neurites, such as axons and dendrites, and is considered a proxy for axonal density in white matter33,34. Meanwhile, ODI reflects the variance in neurite orientation. Our findings suggest that, across pregnancy, there may be an increase in axonal density, potentially stemming from increased myelination and/or axonal diameter, without significant shifts in white matter geometric complexity. This aligns with prior non-human animal studies that have documented an increase in myelinated axons during pregnancy, potentially as a result of amplified proliferation of oligodendrocytes24,25. However, there is a discrepancy between our results and those reported by Hoekzema et al.19 who did not observe significant changes in DTI metrics during pregnancy. The discrepancy may arise from differences in imaging sequence parameters. For instance, Hoekzema et al.19 used a single shell protocol (b = 1000) in contrast to the multi-shell protocol (b = 500, 1500, 2500) in the present study, which has been shown to affect the estimated scalar diffusion metrics35. Another consideration is the methodological divergence between the two studies. Our study adopted a longitudinal design, capturing dynamic changes over the course of pregnancy, while Hoekzema et al.19 employed a pre-post pregnancy comparison approach. Their post-pregnancy data acquisition took place, on average, 100 days after childbirth with a wide variability (SD = 70 days). This extended interval could potentially obscure white matter alterations occurring during pregnancy. Indeed, the recent study by Pritschet et al.22 demonstrated an increase in global quantitative anisotropy (QA), an index of white matter microstructural integrity, during gestation, which was concomitant with rises in estradiol and progesterone. However, extended analyses into the postpartum period revealed non-linear patterns of global QA, with increases during the first and second trimesters, returning to baseline levels postpartum. Thus, it is possible that white matter quickly reverts to its pre-pregnancy state, thereby remaining undetected in the delayed post-pregnancy assessment. Alternatively, the postpartum period itself, characterized by hormonal fluctuations, physiological adaptations, and new caregiving demands, might induce its own set of white matter microstructural adaptations, which might counteract those taking place during pregnancy. Additionally, the absence of global changes in white matter microstructural metrics, specifically NDI, may be due to regional variability. While we observed increases in NDI across most tracts, other tracts exhibited small to subtle increases. As such, the effects may not be detectable at the global level. Although it is also possible that our study is underpowered to detect subtle global changes that may indeed be present, this observation, if generalized, highlights the importance of understanding focal tract changes in NDI that occur during pregnancy in future studies.

The current study also explored candidate mechanisms underlying the observed neurobiological changes during pregnancy. First, our data revealed a correlation between pregnancy-associated hormones and brain alterations in both gray matter and white matter. Estradiol and progesterone are pivotal neuroprotective agents, modulating processes like neurogenesis36,37, dendritic spine formation and branching38,39, myelination40,41, axonal growth, and synaptogenesis42. Our findings of positive correlations between white matter NDI and progesterone and estradiol levels are consistent with results from animal studies. However, the observed inverse associations between pregnancy-related hormones and gray matter volumes are somewhat counterintuitive, especially when considering that non-human animal research suggests estradiol and progesterone promote neurogenesis43,44. It might be that increased dendritic length, branching, and spine density in gray matter due to pregnancy-related hormones38,39,42,45 and/or myelination lead to misclassification of voxels at tissue interfaces, thereby appearing as reductions in gray matter volume46,47, either in addition to or instead of fluid-related compression. It is also possible that hormones influence other physiological functions that are linked to a decrease in gray matter volume. For example, suppressed microglial proliferation might underlie the connections between hormonal shifts and grey matter reduction during pregnancy47. Furthermore, the observed inverse associations raise the possibility of a non-linear, U-shaped relationship, where progesterone’s links to brain structure may not be straightforward and could vary depending on concentration levels48. Indeed, inverted U-shaped relationships have also been observed for other hormones such as estradiol49 and cortisol50 in humans.

Additionally, it is interesting that we observed a link between brain morphometry and progesterone, but not estradiol. However, we should acknowledge that salivary hormone assays can be affected by numerous factors, including collection time and sample storage51,52. If these effects generalize, it might suggest that progesterone and estradiol have distinct links to brain morphometry. This distinction has been evidenced by their different effects on receptors53 and functional network organization54, as well as their complex interactions in synaptic plasticity and neuroprotection55. It has been reported that intrinsic fluctuations in progesterone, but not estradiol, were associated with medial temporal lobe morphology across the human menstrual cycle56. In mice, during pregnancy, both estradiol and progesterone affect the galanin (Gal)-expressing neurons in the mouse medial preoptic area (MPOA), but in different ways—estradiol silences MPOAGal neurons and increases neuronal excitability, while progesterone promotes dendritic spine formation. Those effects on MPOAGal neurons are linked to increased selectivity for pup stimuli57. However, whether these previously mentioned findings from animal studies translate to humans remains unclear, given the challenges inherent in comparing animal models to human experiences58. It is important to note that our findings reflect associations and cannot be interpreted as evidence of causality. The observed relations between hormone levels and brain structure could be due to a variety of factors, such as underlying physiological changes that occur during pregnancy and/or the influence of other biological processes not measured in the current study.

Moreover, the influence of pregnancy-related hormones likely represents only one of the factors contributing to observed brain changes. Other pregnancy-associated alterations, such as increased water content59,60 and blood flow variations61,62 during pregnancy, may also play significant roles. For example, studies have documented a significant and progressive rise in total body water, as well as extracellular and intracellular water values throughout pregnancy59,60, which might help explain the observed elevation in NDI. Yet, it is important to note that in vivo MRI methods offer insights at a macroscopic level and inherently lack the resolution to elucidate the cellular and molecular mechanisms at play during pregnancy. Future research across multiple levels of analysis is needed to supplement this work and enhance our understanding of the neurobiological changes observed during pregnancy.

Although cortisol levels in our study did not reach statistical significance at the .05 level, they demonstrated a small to moderate increasing trend over the course of pregnancy. This observation is consistent with previous literature indicating that salivary cortisol levels progressively rise during pregnancy63,64. Additionally, we observed negative associations between cortisol levels and total brain volume, as well as between cortisol levels and white matter NDI. These findings align with existing research that implicates cortisol in neurodevelopmental processes, including neurogenesis65,66, synaptic plasticity67,68, and myelination69. Higher cortisol levels have been linked to reduced brain volume and alterations in white matter microstructures70. Therefore, despite the lack of statistical significance in the observed cortisol changes, the upward trend might still hold biological relevance in influencing brain structure in pregnant individuals. However, it is important to acknowledge the limitations of the waking salivary cortisol assay in our study, as it was assessed only once upon waking71,72, and hormone assays can be affected by various factors, as mentioned above. As such, interpretations should be made with caution. Future research may benefit from more frequent cortisol measurements to better capture dynamic changes during pregnancy and explore their implications for brain structure.

Additionally, we did not observe significant changes in CRP levels during the observed period (12 to 39 gestational weeks). This finding aligns with previous literature, which demonstrates that CRP levels can vary widely among individuals during pregnancy—some studies report an increase73,74,75, some report consistency73,76,77, and others report a decrease73,74 in CRP levels across gestation. Therefore, while CRP is an important marker of inflammation78, it has not been consistently shown to track with gestation and is likely influenced by various factors.

While not the focus of the current study, there are multiple hypotheses regarding the potential implications of pregnancy-related brain changes. While some contend that changes to the pregnant person’s brain are most likely byproducts of promoting fetal survival and childbirth4, others point to the possibility that these changes are in the service of promoting the parent–child relationship19,79,80. There is substantial evidence that interactions with children post-birth shape the brain. Indeed, evidence from non-birthing parents, which allows for distinctions between pregnancy-specific vs. caregiving-related brain changes, shows brain alterations following childbirth81,82,83,84 and child rearing85,86. Additionally, research has shown that brain changes can persist up to six years postpartum87, and persistent changes may relate to the cognitive and emotional challenges of caring for a child88,89. Future work would benefit from careful consideration of the potential functions of these changes, with designs that help to rule out plausible rival hypotheses.

Several limitations merit consideration. First, our study was constrained by a small sample size. This limited our power to detect subtle effects as well as our ability to adjust for potential confounds and examine the potential effect of moderators. For example, prior research suggests that the number of live births is related to brain age88. It is possible that the number of live births might moderate the neurobiological changes during pregnancy, such that individuals with more births experience less (or more) neurobiological changes (see supplementary materials results section for our exploration of these associations within our sample). Given that the current literature has focused nearly exclusively on first-time mothers, it is especially important for future research to consider the number of prior births on pregnancy-related brain changes. Second, our study is limited by the number of data points available (27 unique scan sessions from 10 individuals), with some participants attending only a single session. For example, while our calculation of percentage declines in brain morphometry using estimated LME models offers a useful estimate of average changes during pregnancy, it is important to interpret these data with caution. The focus on the observed range of gestational weeks (i.e., 12–39 weeks) is intentional to accommodate the limited coverage of gestational weeks and small sample size, which could nevertheless affect the precision of this estimate. Third, our study does not have benchmarking data. Including such comparisons in future research would provide additional context for interpreting the uniqueness of pregnancy-related neurobiological changes. For example, benchmarking against non-birthing parents could help isolate changes unique to pregnancy from those due to shared environmental or experiential factors, offering a more nuanced understanding of the impacts of pregnancy. Fourth, our sample did not cover the entire duration of pregnancy. Without preconception data, we have an incomplete picture of when during pregnancy such alterations become evident. Fifth, given the preliminary nature of our investigation, we opted not to implement corrections for multiple comparisons. Although this decision increases sensitivity to potential effects, it simultaneously raises the risk of type I errors. However, the consistency in patterns and lack of randomness suggest that the results may not be artifacts but are likely indicative of genuine underlying phenomena. It is important for subsequent, larger-scale studies to incorporate appropriate corrections to further validate and replicate our observations. Sixth, our exploration of the underlying mechanisms of neural dynamics primarily focused on hormonal fluctuations during pregnancy. Although these hormonal shifts are important, they might represent just one aspect of a more multifaceted interplay. Other potential factors, such as metabolic changes, vascular dynamics, and alterations in neurotransmitter levels, may also significantly contribute. Future research adopting an interdisciplinary approach, integrating insights from these domains would shed light on the multifaceted nature of brain changes during pregnancy. Lastly, careful monitoring of pregnancy complications (e.g., preeclampsia and eclampsia) which may significantly affect brain structure21, is crucial to ensure accurate interpretation of the findings in the future.

The present study elucidates neurobiological changes occurring across pregnancy. Specifically, we observed a decline in total brain and gray matter volume, along with increases in total intracranial CSF volume and NDI in white matter tracts across pregnancy. Our data also suggest that increases in CSF and pregnancy-related hormones may, at least partially, contribute to these observed neural alterations. These findings highlight the neuroplasticity associated with pregnancy, establishing a foundation for further investigations in this essential yet understudied area of research.

Methods

Participants

A total of 10 pregnant women aged between 20.00 and 39.25 years (M = 28.97, SD = 6.50) years were recruited for the current longitudinal study. In terms of race, 7 identified as White, 2 as Black or African American, and 1 selected “Other.” In terms of ethnicity, 1 participant identified as Hispanic/Latine. Participants were recruited through online advertisements and referral/word of mouth and 1 participant was the senior author. Participant eligibility was initially screened via telephone. Inclusion criteria required participants to be pregnant, at least 18 years old, English-proficient, and have no immediate plans to relocate. Exclusion criteria encompassed a medical history of neurological injury or impairment, a history of psychosis, non-removable metal or electronic implants, having undergone in vitro fertilization (IVF), and use of medication(s) that could potentially influence hormonal or neurological changes during pregnancy (e.g., progesterone). The mean number of prior pregnancies carried to term was 0.50 (SD = 0.71, range = 0–2). The mean weeks gestation at data collection was 23.90 (SD = 7.70, range = 12.29–39.14). The number of data collection sessions for each participant ranged from 1 and 6 times (mean number of repeated measures = 2.70 times, SD = 1.49), contingent on individual circumstances and pregnancy progression. Figure 7 provides a detailed illustration of the gestational timing range captured for each participant. Each session included an MRI scan and collection of hair samples and dried blood spots. Additionally, participants were asked to collect saliva samples at their homes on the morning prior to and the morning of each session. All participants provided written informed consent prior to participation. Study procedures were approved by the Vanderbilt University Institutional Review Board. All ethical regulations relevant to human research participants were followed.

Fig. 7: Illustration of the gestational weeks range captured for participants.
figure 7

Zero on the x-axis denotes the first day of the last menstrual period, with conception following approximately two weeks later.

Hormone assessments

Hormonal assessments were performed by measuring levels of progesterone, estradiol, testosterone, and cortisol in saliva samples obtained from participants, in accordance with Salimetrics collection and assaying procedures. Two saliva samples were collected for each participant: one on the morning prior to the MRI scan and another on the day of the scan. The obtained values were then averaged for analysis. Additionally, we obtained estimates of cortisol, cortisone, and progesterone from hair samples. Further details on the saliva and hair sample collection and assay procedures can be found in the supplementary materials sections titled Salivary hormone assessments and Hair hormone assessments. We primarily reported on salivary hormones, and results from hair hormone analysis were included in the supplementary materials results section.

Inflammatory cytokines assessments

Inflammatory cytokine assessments were performed by measuring levels of Interleukin-6 (IL-6), Interleukin-10 (IL-10), Tumor Necrosis Factor alpha (TNFɑ), and CRP in dried blood spots (DBS) obtained from participants. At each visit, up to 5 drops of capillary whole blood were collected on filter paper (Whatman 903) following a finger stick with a sterile microlancet. Blood was then dried at room temperature and stored at −20 degrees Celsius in accordance with protocols established by Northwestern University’s Laboratory for Human Biology Research90,91, which performed assays on DBS samples. Further details on the DBS collection and assay procedures can be found in the supplementary materials section titled Inflammatory cytokines assessments. The usability of the samples for IL-6, IL-10, and TNFɑ was constrained by the high volume requirements, resulting in a low yield of successfully assessed data for these cytokines (less than 50%). Thus, only CRP was utilized in the analyses. Detailed descriptive statistics are provided in Supplementary Table S1.

MRI acquisition

MRI data acquisition was conducted on a 3T Philips Achieva scanner using a 32-channel head coil at Vanderbilt University Institute of Imaging Science. High-resolution T1- and T2-weighted images were obtained for each participant through 3D-quantification using an interleaved Look–Locker acquisition sequence with a T2 preparation pulse (3D-QALAS) sequence92 with the following parameters: repetition time (TR) of 5.0 ms, echo time (TE) of 2.3 ms, a flip angle of 4 degrees, a field of view (FOV) of 230 ×193 × 150 mm, a matrix size of 224 × 224 × 146, and a voxel size of 1 × 1 × 1 mm. The acquisition parameters of diffusion-weighted imaging (DWI) were as follows: TR of 4300 ms, TE of 91.5 ms, flip angle of 90 degrees, FOV of 240 ×240 x 135 mm, matrix size of 112 × 112 × 63, yielding a voxel size of 2 × 2 × 2 mm. The diffusion weighting was distributed across 23 directions with a b-value of 500 s/mm², 47 directions with a b-value of 1500 s/mm², and 70 directions with a b-value of 2500 s/mm². The scanner used in this study underwent regular calibrations and quality control assessments at least weekly, conducted by the Vanderbilt University Institute of Imaging Science. Additionally, vendor engineers performed quarterly preventative maintenance and periodic image quality tests to assess system performance across various specifications, including SNR, geometry, scale, and uniformity. Furthermore, the scanner was remotely monitored by Philips, with daily performance reports covering parameters such as cryogenic system stability, noise levels, coil functionality, hardware integrity, cooling efficiency, and environmental factors like temperature and humidity. Throughout the study, all reports and test results indicated no significant deviations in scanner performance. Further details can be found in the supplementary materials section titled MRI acquisitions.

MRI Processing

Anatomical MRI

Synthetic T1-weighted and T2-weighted images were found from 3D-QALAS images using SyntheticMR software (version 0.45.35; Linkoping, Sweden). Both T1-weighted and T2-weighted anatomical images were then used to extract structural MRI data through FreeSurfer v7.1.0’s longitudinal pipeline93. Scans at every time point were independently processed for each participant (including skull stripping, Talairach transformation, subcortical structure labeling, surface extraction, spherical registration, and cortical parcellation), followed by the creation of an unbiased within-subject template space using all the subject’s time points94. Note that the FreeSurfer longitudinal pipeline assumes a fixed head size over time, meaning that TIV is considered constant across time in the final longitudinal output. By accounting for the expected stability, the longitudinal pipeline is particularly sensitive to detecting small, gradual changes over time. To ensure that TIV remained relatively stable and to better support the inferences made from our subsequent morphological analyses, we used the cross-sectional estimates, treating each dataset as a separate scan before feeding it into the longitudinal pipeline. The results showed that total TIV did not exhibit a meaningful or statistically significant change over the course of pregnancy (β = −0.014, 95% CI [−0.060, 0.033], p = 0.539; see Fig. S1). Following the automated processing steps, all the cortical reconstructions were visually inspected for potential inaccuracies. We observed no processing failures and no manual edits were required. Please refer to the supplementary materials for the Euler number report and example anatomical images in the section titled Anatomical MRI. Subsequently, we obtained measurements of volume for 34 cortical regions in both the left and right hemispheres (68 regions in total) using FreeSurfer’s cortical parcellation based on the Desikan-Killiany atlas95, which were used for cortical morphometric analyses.

CSF

To quantify the total CSF volume in the entire brain, we used the aggregated volume fraction of the CSF image generated by SyntheticMR software from the 3D-QALAS images. Specifically, we first applied a threshold to retain voxels in which the CSF volume faction exceeded this threshold. Then, we multiplied the CSF volume fraction by the voxel volume across the brain. For our primary analyses, we set a threshold at 80%, meaning that a voxel was classified as containing CSF if it has 80% or more CSF. To test if this threshold influenced findings, we repeated analyses using different thresholds across a range of 60% to 95% (reported in Table S4). We observed consistent results across these different probability thresholds.

Diffusion-weighted MRI

The raw diffusion data were processed using MRtrix396 and FMRIB’s Software Library (FSL)97. Preprocessing steps involved noise estimation and removal via the dwidenoise function from MRtrix398,99. The eddy and susceptibility distortion corrections were performed using the dwifslpreproc function in MRtrix397,100,101,102. Subsequent to preprocessing, we computed the NDI and ODI using the NODDI Matlab Toolbox30. All preprocessing steps and NODDI metrics were carried out in native space. For comparison, we also fitted the standard DTI model103. In order to analyze changes across tracts, we first estimated fiber orientation distributions (FOD) using MRtrix3’s dwi2fod msmt_csd and then normalized them using MRtrix3’s mtnormalise104,105. The normalized FODs were then fed to TractSeg to automatically segment fiber tracts, resulting in NODDI and DTI metrics for each tract across 50 established tracts, which were subsequently used in our statistical analyses106,107. Lastly, we characterized the global changes for each NODDI metric and investigated if observed tract-level changes reflected a general alteration across all segmented tracts using two different approaches. For a more detailed description of diffusion data processing and metric quantification, please refer to the supplementary materials section titled Diffusion-weighted MRI.

Statistics and reproducibility

Given the longitudinal nature of our data, analyses were conducted using LME models, utilizing the “lme4” package in R108. One of the main advantages of using LME models with longitudinal data is their flexibility and robustness in handling missing data points and designs with variable numbers and intervals between measurements109. This approach can weigh the data points based on the number of observations per subject, thus subjects with different numbers of observations can be included without biasing the overall estimates, enabling us to use all available data points110. Note that we included all available time points, including those from subjects with only a single time point, to maximize data usage. While individuals with data at only a single-time point do not contribute to the estimation of within-subject changes, they provide information about between-subject variability, and inclusion results in improved intercept estimation and overall model accuracy111.

We organized analyses into two primary sections. First, we examined alterations in brain metrics throughout the course of pregnancy, where brain metrics (BrainMetricij) were the dependent variable and gestational week (GesWeekij) was the predictor. We also included age (Agej) and total intracranial volume (TIVj) as covariates for volumetric analyses (see Eq. 1), and included Agej and relative motion (Motionij) as covariates for diffusion metric analyses112; see Eq. (2).

$$\begin{array}{cc}{{BrainMetric}}_{{ij}}={\beta }_{0j}+{\beta }_{1j}{{GesWeeks}}_{{ij}}+{e}_{{ij}} & {e}_{{ij}} \sim N\left(0,\,{\sigma }_{e}^{2}\right)\\ {\beta }_{0j}={\gamma }_{00}+{\gamma }_{01}{{Age}}_{j}+{\gamma }_{02}{{TIV}}_{j}+{u}_{0j} & {u}_{0j} \sim N\left(0,{\tau }_{00}\right)\\ {\beta }_{1j}={\gamma }_{10} & \end{array}$$
(1)
$$\begin{array}{cc}{{BrainMetric}}_{{ij}}={\beta }_{0j}+{\beta }_{1j}{{GesWeeks}}_{{ij}}+{\beta }_{2j}{{Motion}}_{{ij}}+{e}_{{ij}} & {e}_{{ij}} \sim N\left(0,\,{\sigma }_{e}^{2}\right)\\ {\beta }_{0j}={\gamma }_{00}+{\gamma }_{01}{{Age}}_{j}+{u}_{0j} & {u}_{0j} \sim N\left(0,{\tau }_{00}\right)\\ {\beta }_{1j}={\gamma }_{10} & \\ {\beta }_{2j}={\gamma }_{20} & \end{array}$$
(2)

where \({\gamma }_{10}\) is the fixed effect of gestational weeks (i.e., slope), and \({\gamma }_{00}\) and \({u}_{0j}\) are the fixed and random effects of the model intercept which allow each subject to have their own baseline level. To calculate the average percentage change in brain morphometry, we used the estimated LME models to predict the average change in brain morphometry over the range of observed gestational weeks (i.e., 12–39 weeks). Specifically, we used the estimated model to predict the brain metric at 12 and 39 weeks, respectively, and then computed the percentage change as

$$r=\frac{|{{BrainMetric}}_{{week}39}-{{BrainMetric}}_{{week}12}|}{{{BrainMetric}}_{{week}12}}$$

The second set of analyses focused on the potential mechanisms underlying the observed brain alterations by examining the associations between brain metrics and potential influencing factors, including hormone levels, CRP, and CSF volume. We first applied LME models to examine variations in hormone levels (progesterone, estradiol, testosterone, and cortisol) and CRP throughout pregnancy, with hormone levels and CRP as the dependent variable and gestational week as the fixed effect. CRP was log-transformed using a base 10 logarithm to normalize their distribution113. For brain metrics demonstrating significant alterations during pregnancy, we examined potential links with hormone levels and the total intracranial CSF volume using LME models in which BrainMetricij was the dependent variable and hormone levels (Hormoneij) or CSF volume was the predictor. Given the absence of significant changes in CRP, associational analyses between CRP levels and observed changes in brain metrics were not implemented. Moreover, we included Agej and total TIVj as covariates for volumetric analyses (see Eq. 3); we included Agej and Motionij as covariates for diffusion metric analyses (see Eq. 4).

$$\begin{array}{cc}{{BrainMetric}}_{{ij}}={\beta }_{0j}+{\beta }_{1j}{{Hormone}}_{{ij}}+{e}_{{ij}} & {e}_{{ij}} \sim N\left(0,\,{\sigma }_{e}^{2}\right)\\ {\beta }_{0j}={\gamma }_{00}+{\gamma }_{01}{{Age}}_{j}+{\gamma }_{02}{{TIV}}_{j}+{u}_{0j} & {u}_{0j} \sim N\left(0,{\tau }_{00}\right)\\ {\beta }_{1j}={\gamma }_{10} & \end{array}$$
(3)
$$\begin{array}{cc}{{BrainMetric}}_{{ij}}={\beta }_{0j}+{\beta }_{1j}{{Hormone}}_{{ij}}+{\beta }_{2j}{{Motion}}_{{ij}}+{e}_{{ij}} & {e}_{{ij}} \sim N\left(0,\,{\sigma }_{e}^{2}\right)\\ {\beta }_{0j}={\gamma }_{00}+{\gamma }_{01}{{Age}}_{j}+{u}_{0j} & {u}_{0j} \sim N\left(0,{\tau }_{00}\right)\\ {\beta }_{1j}={\gamma }_{10} & \\ {\beta }_{2j}={\gamma }_{20} & \end{array}$$
(4)

where \({\gamma }_{10}\) is the fixed effect of gestational weeks (i.e., slope), and \({\gamma }_{00}\) and \({u}_{0j}\) are the fixed and random effects of the model intercept. We reported standard beta values and 95% confidence intervals (95% CI), with p-values tested based on a two-sided significance test.

Given the small sample size and the exploratory nature of our study, we did not correct for multiple comparisons114. Results with p-values < 0.05 were considered statistically significant. This decision was made to prioritize sensitivity over specificity, as it is particularly important in exploratory research to identify potential effects for further investigation in larger studies. Our results should therefore be interpreted with caution. We mitigated potential concerns by emphasizing our interpretation of the direction and magnitude of effect sizes. Additionally, recognizing the possibility that individual outliers could disproportionately influence the outcome in our small sample, we implemented leave-one-out analyses for our main findings, in which each participant was systematically removed and the analyses recomputed. Specifically, a custom R script was programmed to loop through the subject IDs, sequentially removing each participant by instructing the script to filter out the subject ID for each iteration. In each iteration, one subject was automatically excluded from the dataset and the LME models were estimated using the remaining subjects. This approach helps to validate that the observed effects are consistent and not artifacts of unique data points, thus enhancing the reliability of our findings115. Additionally, we included all available data, including those with single time points, in our LME models to maximize the use of the data and increase the reliability of the findings110. To further validate our approach, we have also included results excluding the three participants with only a single time point in the supplementary materials results section. These results are consistent with those presented in the main text. We also considered the patterns of results across related dependent measures, as opposed to individual tests in isolation. The results of our study thus serve as a preliminary exploration into the complex domain of neurobiological dynamics during pregnancy.