Introduction

A physically active lifestyle is associated with a higher likelihood of having a healthy brain, both in younger and older populations. In school-aged children, increased aerobic fitness has been associated with executive function1 and larger hippocampi2. In older adults, physical activity is associated with indices of brain structure3,4,5 and even delayed onset or less likely occurrence of neurological illness, such as dementia6. Attempts to translate these associations into physical activity interventions that promote healthy brain ageing have yielded promising, yet mixed, results7,8,9,10. Across observational and interventional studies, considerable individual variability in the physical activity-brain relationship can be observed, plausibly contributing to the heterogeneity of findings. One plausible, yet often overlooked, moderator of the physical activity-brain association is biological sex11.

Globally, the prevalence of insufficient physical activity differs between men and women, with women showing reduced activity levels12. In terms of cognition, physical activity may have greater beneficial effects on cognitive function in women than in men13,14,15, although the reverse has also been reported16. Given that sex differences may influence the relationship between physical activity and cognition, one would expect a similar sex difference in the efficacy of physical activity on brain structure—particularly the neural correlates of cognitive function. In terms of brain structure, no consensus has yet been reached regarding the direction of any potential interaction, even for the hippocampus, the most commonly examined region of interest in the exercise literature. Studies of the moderating effect of sex on associations between physical activity and hippocampus have also given contradictory results, including stronger associations in women17, men18, or not differing between sexes19. Accordingly, further investigation of the interaction between physical activity and sex in their effect on brain structures is needed to reveal robust and replicable associations.

Using pooled data from five independent European cohorts of the Horizon 2020 Lifebrain consortium20, we investigated the moderating effect of biological sex on the relationship between self-reported physical activity and MRI-derived measures of brain structures. In a previous publication, we found that a cumulative unhealthy lifestyle score was negatively associated with total grey matter volume21. Using a binary classification of physical inactivity, Binnewies and colleagues21 further found no evidence that sex moderated the association between being classified as physically inactive and total grey matter or hippocampal volumes. However, the use of categorical data for physical activity levels may not be sensitive enough to the full range of physical activities nor the small effects often observed in lifestyle-brain associations22. To account for the full continuum of physical activity levels, and their potential interaction with sex on brain outcomes, the current analysis used the continuous scale of derived metabolic equivalents of task (METs) minutes per week. We selected as outcome measures grey matter regions often examined in the exercise-brain literature, i.e., hippocampal volume and the dorsolateral prefrontal cortex (dlPFC). The hippocampus has a track-record of observed plasticity during physical activity interventions10,23,24,25 and may also show sex differences in its vulnerability to ageing26. Similarly, the dlPFC shows age-related changes27 and has been associated with physical activity, both in interventions28 and in observational studies29,30,31. Here, we tested if, in line with findings in the cognitive domain, volumetric indices of brain structure would show a stronger association in females than in males.

Methods

Study samples

The Lifebrain consortium aims to promote harmonised multicohort analyses across European studies on brain structure and function20. Here, we used all Lifebrain cohorts with available physical activity (in METs) and structural MRI data. Cohorts outside of Lifebrain were not included in the analyses. This resulted in a total of 1809 participants, drawn from five different Lifebrain cohorts (Table 1): the Cambridge Centre for Ageing and Neuroscience study (Cam-CAN, UK)32, Center for Lifebrain Changes in Brain and Cognition longitudinal studies (LCBC, Norway)33, Live Active Successful ageing study (LISA, Denmark)34, Whitehall-II Imaging Sub-study (UK)35 and the Netherlands Study of Depression and Anxiety (NESDA, Netherlands)36. In longitudinal studies (LISA, LCBC), only data from baseline were used. From the NESDA study, which recruited a combination of participants diagnosed with depression and anxiety and healthy controls, only the healthy controls were selected for the present analysis. We further excluded participants who had missing physical activity data or no T1-weighted MRI brain scan. A description of each study’s sample is available in Appendix A.

Table 1 Characteristics of each cohort (mean ± SD).

All studies were conducted in accordance with the guidelines and regulations set out by their relevant authorities and the Declaration of Helsinki. Informed consent was obtained from all participants, and each study received approval from their local ethical review board: the Whitehall II Imaging Sub-study from the Oxford Central University Ethics Committee (MSD‐IDREC‐C1‐2011‐71), Cam-CAN from the Cambridgeshire 2 Research Ethics Committee (reference: 10/H0308/50), LCBC from the Norwegian Regional Committee for Medical and Health Research Ethics, LISA from the Capital Region, Copenhagen (H-3-2014-017) and the Danish Data Protection Agency, and NESDA from the ethical review board of each participating research centre in Amsterdam, Leiden, and Groningen (METC Number 2003-183).

Measures

Physical activity

Physical activity questionnaires were used to estimate total physical activity in metabolic equivalent of task minutes per week (MET-min/week). The questionnaires included the international physical activity questionnaire (IPAQ) in LCBC and NESDA, the community healthy activities model program for seniors (CHAMPS) in Whitehall-II, the EPIC-EPAQ2 in Cam-CAN and the physical activity scale for the elderly (PASE) in LISA. Although the questionnaires differed between cohorts, all enabled the calculation of MET-min/week and reflected the type, frequency, and intensity of current physical activities.

MRI acquisition and processing

T1-weighted images were automatically processed in a harmonised manner across studies with the recon-all stream of FreeSurfer (version 6.0 or 5.3)37. The derived regions of interest (ROIs) included the hippocampus, dlPFC (“caudal middle frontal”) and total grey matter (GMV). Left and right hippocampal volumes were considered separately in light of previous reports that the left hippocampus may be more susceptible to the effects of exercise10. For the dlPFC, the average of the left and right volumes was used because we had no a priori hypothesis for hemispheric differences in relation to physical activity. Estimated intracranial volumes (ICV), as outputted by FreeSurfer (eTIV), were included as covariates in the model. For an overview of the scanner type, strength and MR acquisition parameters used in each cohort, see Appendix A.

Statistical analysis

Statistical analyses were carried out in RStudio version 1.4.110338, running on R version 3.6.039, with the “stats” (R Core Team, 2016) and “meta”40 packages.

For each cohort, a generalised linear model (GLM) was fitted for each volumetric brain outcome (total GM, left hippocampus, right hippocampus, dlPFC), with physical activity (MET-min/week), age, sex and ICV as the independent variables (model 1). The GLMs were then repeated to include the physical activity*sex interaction term (model 2). Sex variables were coded in the same direction across cohorts (female = 1, male = 0). As two different scanners were used in the Whitehall II data collection, “scanner” was added as an additional covariate for this study. In a supplementary analysis, we tested whether the inclusion of BMI (kg/m2) as a covariate altered results. For each brain outcome, the standardised coefficients for physical activity (from model 1) and physical activity*sex (from model 2) were then pooled across cohorts in a random-effects meta-analysis. As a measure of heterogeneity, Cochran’s Q (henceforth “Q”) was calculated for each meta-analysis. The alpha-level was Bonferroni corrected to account for the four ROIs (α = 0.0125). Finally, to compare the fit of the proposed models, model evaluation was conducted using Akaike information criteria (AIC), and Akaike weights. The AIC for each model was obtained using the R package “stats”. The AIC values were then transformed to Akaike weights, as defined in Wagenmakers and Farrell41.

Two post-hoc sensitivity analyses were conducted. First, we explored a three-way interaction term (physical activity*sex*age) in each cohort separately. In line with previous literature focusing solely on older adults, we then repeated the random-effects meta-analyses in participants aged 60 years or older. Second, to better understand which aspects of physical activity the MET values capture, we correlated METs with other measures of physical activity, physical function, and fitness in the LISA study. The LISA study was selected for this additional analysis as it offered a wide range of physical activity and function measures. To generate further hypotheses, an exploratory analysis also probed the interaction between physical activity and sex on other brain regions reported to show marked age-related changes (ref.42; Appendix B).

Results

Pooled across cohorts, physical activity levels did not significantly vary with age or sex (Appendix C), even though there was a large negative age effect in CAM-CAN, which included the widest age range, and a large positive sex effect (wherein men reported more METs than women) in Whitehall, which had the smallest proportion of females across the five studies. At the same time, there was a large heterogeneity of effects across the five cohorts, for both age (Q = 81.98, p < 0.001) and sex (Q = 35.59, p < 0.001).

No significant pooled estimates for physical activity (Figs. 1, Fig. 2), or the interaction between sex and physical activity (Fig. 3), were observed. No significant pooled estimates were observed when the analysis was stratified by sex (Appendix D), nor when BMI was included as a covariate (Appendix E).

Fig. 1
Fig. 1
Full size image

For each brain ROI, a random effects meta-analysis was conducted to pool the associations between physical activity and grey matter volumes across cohorts, including age, sex and ICV as covariates in the model.

Fig. 2
Fig. 2
Full size image

A scatterplot to illustrate the association between physical activity and total grey matter volume (GMV), with one linear model per cohort, using z-scores.

Fig. 3
Fig. 3
Full size image

For each brain ROI, a random effects meta-analysis was conducted. Each forest plot illustrates the cohort-specific and pooled beta values for the sex \(\times\) physical activity interaction for one of the structural brain outcomes, including age, sex, ICV and physical activity effects as covariates in the model.

In a post-hoc analysis, we used AIC model selection to distinguish among three possible models describing the relationship between age, sex, ICV, physical activity and volumetric brain outcomes (Appendix F). For all outcomes, there was no evidence in favour of including the interaction term (physical activity*sex). For three cohorts, the best-fit model included physical activity as a parameter in relationship to the left (AICweight (LISA) = 0.45, AICweight (NESDA) = 0.43) or right (AICweight (LCBC) = 0.50, AICweight (NESDA) = 0.54) hippocampus. For all other outcomes, the best fitting model did not include physical activity, or its interaction, as a parameter.

Exploratory analyses, including additional regions of interest, showed that sex moderated the association between physical activity and volume of the medial temporal lobe, with women showing a stronger association between physical activity and volume in the medial temporal lobe. However, this was not significant after correcting for multiple comparisons (b = 0.06; 95% CI 0.01–0.11; p = 0.023). No other pooled estimates for the sex*physical activity interaction were observed (Appendix B).

Sensitivity analyses

Older adults

Analyses were repeated with samples restricted to participants aged 60 years or older (n = 1,647). Given the low number of older participants in the LCBC (n = 5) and NESDA (n = 0) cohorts, these samples were not included in this sensitivity analysis. Again, across cohorts, there was no evidence for an association between grey matter volumes and physical activity, nor for the interaction between sex and physical activity (Appendix G). An additional exploratory analysis of the three-way interaction between sex, age and physical activity is also presented in Appendix G.

What’s in a MET?

To test the validity of total MET mins/week as a measure of physical activity, we conducted post-hoc correlational analyses in the LISA cohort (n = 401), which included multiple indices of physical function, physical activity, and physical fitness (Fig. 4; Appendix H). Total MET mins/week was associated with accelerometer-measured step count (r = 0.31, p < 0.001), but not with gait speed (r = 0.04, p = 0.42) or estimated cardiorespiratory fitness (r = 0.03, p = 0.55).

Fig. 4
Fig. 4
Full size image

Heatmap illustrating correlation coefficients from the associations between measures of physical function, physical activity and estimated physical fitness in the LISA cohort (n = 401). eCRF estimated cardiorespiratory fitness.

Discussion

Using data pooled from five European cohorts, the present analyses aimed to test whether sex moderated associations between physical activity (PA) levels and brain structure. Contrary to our hypotheses, there was no evidence of such an interaction. Our interpretations are, however, limited by the finding that there was also no significant relationship between physical activity levels and grey matter volumes. Accordingly, this analysis did not reveal any of the expected associations.

Previous observational studies have repeatedly associated physical activity with volumetric indices of brain structure43,44,45. For example, in the UK Biobank an accelerometer-derived estimate of physical activity, was associated with larger total grey matter volume as well as right hippocampal volume in 10,083 adults aged 50 years or older. In the same analysis, sex was a significant moderator - whereby the physical activity relationship was only significant in men46. In contrast, and in a much smaller sample (n = 92), Varma and colleagues found that daily step count was associated with larger hippocampal volumes in older women, but not in older men (≥ 60 years;17). Unlike these two studies, our samples covered a much wider age range (18–87 years old). Others have noted that the link between physical activity and grey matter volume was only significant in older adults19 and that physical activity may have different associations with brain structure at different stages of the lifespan47, raising a plausible explanation for the discrepancy in our findings. However, our results remained unchanged (i.e., null) when restricting the sample to those aged 60 years or older.

In contrast to the above-described studies, our PA outcome measure was derived from self-reported PA questionnaires. Others have, like us, also not found a significant association when using self-reported outcomes48,49, with some exceptions50,51. Here, physical activity levels were defined by METs, or the “metabolic equivalent of task”, as estimated from PA questionnaires. Physical activity questionnaires are cheap and easy to administer, making them a common feature of large datasets. However, estimated METs have several confounds. First, MET intensity values are typically obtained from a compendium of physical activities, which was developed based on data from healthy adults aged 18 to 65 years old. Importantly, these values may not reflect energy costs of older adults or children performing the same task52. While this can be circumvented by using age-adjusted energy cost values, as done in questionnaires specifically designed for older adults (e.g., community healthy activities model program for seniors (CHAMPS)), this limits the comparability of values across age groups. Regarding comparability across studies, it is also important to note that the included MET estimates were calculated from different physical activity questionnaires (PASE, CHAMPS, IPAQ and EPAQ). All values were z-scored in the analyses to increase the comparability between studies. As a result, we have lost the ability to interpret how much physical activity (in METs) corresponds to a change in brain volume. Finally, as recently argued by Gill and colleagues53, questionnaires overestimate physical activity during vigorous efforts but underestimate physical activity during intermittent activities of daily living (e.g. going up a flight of stairs).

Further, as observed in our sensitivity analysis in the LISA study, the correlations between objectively-measured physical activity (e.g., accelerometer-based step count, estimated cardiorespiratory fitness) and MET values estimated from self-report were modest at best, a limitation which has been well-described elsewhere54. In older adults, a tendency to under-report physical activity has previously been noted55, and the accuracy of self-reported physical activity levels may even vary with sex and BMI56. Accordingly, and in light of previous null results48,49, these limitations raise the question of whether the use of objective measures may be preferable to PA questionnaires for assessing the brain-PA relationship. In the likely event that a full adoption of objective measures is not feasible for larger studies (but see57), an in-sample validation of the applied physical activity questionnaires relative to more objective measures would be a step forward for the field. The literature on this topic is unlikely to become any clearer before ensuring the validity of our physical activity measures.

Once the quality of the measurements has been strengthened, the next step will be to identify the key ingredients of physical activity and fitness that most effectively modify brain structure. For example, when looking at the functional integrity of ageing-related brain networks, cardiorespiratory fitness (CRF), but not physical activity, has been shown to play a role58. Sex differences in the relationship between CRF and brain structure have also been reported—as increased CRF was associated with greater bilateral subiculum volume in older women, but not men59. As new cohort studies require a decision about which variables to include in their protocols, we encourage researchers to broaden their scope and include measures beyond conventional physical activity questionnaires, such as fitness (or resting heart rate as a proxy for fitness e.g., Jackson, Sui60) and physical function.

Pooling data across multiple countries also invites questions around the cross-cultural validity of the applied questionnaires. For example, using data from five countries, a study examining the level of agreement between accelerometer- and questionnaire-based physical activity found language-specific differences in the level of agreement when comparing tests administered in English, Czech or Danish. There may, therefore, also be systematic linguistic (or cultural) differences in the interpretation of questionnaire items61.

In line with these differences, variations between study samples may have contributed to a potential loss in sensitivity when combining across cohorts. In a previous report, we observed that despite an overall similarity in the association of lifestyle factors and brain structure, the underlying pattern of associations between lifestyle factors and brain structure outcomes differed substantively between two cohorts of older adults also included in the present analyses (Whitehall and LISA)62. If associations between gray matter volumes and physical activity are small, pooling across varied populations with self-reported measurements may wash out any (predominantly modest) lifestyle-brain associations. Combining data from five European cohorts also had its advantages. First, the access to individual-level data enabled us to run harmonised analyses in each cohort before estimating the overall effect of physical activity, and physical activity by sex, on structural brain outcomes. Second, it resulted in a larger sample size. No a priori power analysis was conducted, raising the question of whether we were powered to detect potential interactions. Unlike analyses of main effects, interactions require much larger samples63. Even so, previous significant findings in a sample of 92 individuals17, suggest this was not the case in the present paper.

Conclusion

Understanding factors that influence the relationship between self-reported physical activity and brain structure may enable the development of tailored, and ultimately more effective, physical activity interventions for brain health. Here, using data from five European datasets, we found no evidence to suggest that the relationship between physical activity and indices of brain structure was moderated by biological sex. We also did not observe an association between grey matter volumes and physical activity. We propose that the reliance on self-report measures may not be sufficiently sensitive to probe associations, much less interactions, between physical activity levels and brain outcomes across the lifespan.