Consortium profile: the methylation, imaging and NeuroDevelopment (MIND) consortium

Schuurmans, Isabel K.; Mulder, Rosa H.; Baltramonaityte, Vilte; Lahtinen, Alexandra; Qiuyu, Fan; Rothmann, Leonardo Melo; Staginnus, Marlene; Tuulari, Jetro J.; Burt, S. Alexandra; Buss, Claudia; Craig, Jeffrey M.; Donald, Kirsten A.; Eriksson, Johan G.; Felix, Janine F.; Freeman, Tom P.; Grassi-Oliveira, Rodrigo; Huels, Anke; Hyde, Luke W.; Jones, Scott A.; Karlsson, Hasse; Karlsson, Linnea; Koen, Nastassja; Lawn, Will; Mitchell, Colter; Monk, Christopher S.; Mooney, Michael A.; Muetzel, Ryan; Nigg, Joel T.; Belangero, Síntia Iole Nogueira; Notterman, Daniel; Ong, Yi Ying; O’Connor, Tom; O’Donnell, Kieran J.; Pan, Pedro Mario; Paunio, Tiina; Ryabinin, Peter; Saffery, Richard; Salum, Giovanni A.; Seal, Marc; Silk, Tim J.; Stein, Dan J.; Tan, Ai Peng; Teh, Ai Ling; Wang, Dennis; Zar, Heather; Walton, Esther; Cecil, Charlotte A. M.

doi:10.1038/s41380-025-03203-w

Download PDF

Perspective
Open access
Published: 06 September 2025

Consortium profile: the methylation, imaging and NeuroDevelopment (MIND) consortium

Molecular Psychiatry volume 31, pages 1177–1189 (2026)Cite this article

6045 Accesses
4 Citations
Metrics details

Subjects

Abstract

Epigenetic processes, such as DNA methylation, show potential as biological markers and mechanisms underlying gene-environment interplay in the prediction of mental health and other brain-based phenotypes. However, little is known about how peripheral epigenetic patterns relate to individual differences in the brain itself. An increasingly popular approach to address this is by combining epigenetic and neuroimaging data; yet, research in this area is almost entirely comprised of cross-sectional studies in adults. To bridge this gap, we established the Methylation, Imaging and NeuroDevelopment (MIND) Consortium, which aims to bring a developmental focus to the emerging field of Neuroimaging Epigenetics by (i) promoting collaborative, adequately powered developmental research via multi-cohort analyses; (ii) increasing scientific rigor through the establishment of shared pipelines and open science practices; and (iii) advancing our understanding of DNA methylation-brain dynamics at different developmental periods (from birth to emerging adulthood), by leveraging data from prospective, longitudinal pediatric studies. MIND currently integrates 16 cohorts worldwide, comprising (repeated) measures of DNA methylation in peripheral tissues (blood, buccal cells, and saliva) and neuroimaging by magnetic resonance imaging across up to five time points over a period of up to 21 years (N_{pooled DNAm} = 12,877; N_{pooled neuroimaging} = 10,899; N_{pooled combined} = 6074). By triangulating associations across multiple developmental time points and study types, we hope to generate new insights into the dynamic relationships between peripheral DNA methylation and the brain, and how these ultimately relate to neurodevelopmental and psychiatric phenotypes.

A systematic review of neuroimaging epigenetic research: calling for an increased focus on development

Article Open access 25 April 2023

A longitudinal DNA methylation atlas and its link to brain structure and mental health

Article Open access 31 March 2026

Cross-tissue correlations of genome-wide DNA methylation in Japanese live human brain and blood, saliva, and buccal epithelial tissues

Article Open access 27 February 2023

Background

DNA methylation (DNAm) is an epigenetic process that can regulate gene activity in response to both genetic and environmental influences [1, 2], beginning in utero. While DNAm plays a key role in healthy development and function [3], alterations in DNAm have also been linked to the emergence of disease states, including brain-based disorders such as neurodevelopmental, psychiatric, and neurological conditions [4,5,6,7,8,9,10]. In addition, peripheral markers of DNAm are often more accessible to measure than several brain-based phenotypes. Together, these properties make DNAm a promising molecular system in the search for both biological markers and mechanisms underlying gene-environment impacts on brain-based disorder and phenotypes. However, we still know little about how peripheral DNAm patterns relate to individual differences in the brain structure and function itself – the target organ of interest.

To address this gap, a growing number of researchers have examined associations between DNAm and brain features (typically measured using magnetic resonance imaging; MRI), giving rise to the new field of Neuroimaging Epigenetics [11,12,13]. Already more than 100 articles combining DNAm and brain MRI have been published to date, but important limitations remain. Most studies have assessed DNAm and the brain only at a single (cross-sectional) time point, are based on adult samples (~80%), have small to moderate sample sizes (median N = 98, range 14–715), and have primarily used a candidate gene approach (i.e., focusing on a limited set of CpGs; 67%). The first large-scale multi-cohort investigations using an epigenome-wide approach are now emerging; these generally show modest associations, with for example only two CpGs in blood found to be associated with hippocampal volume in a pooled analysis of 3337 participants after genome-wide correction [14].

Why was the consortium set up?

Rapid advances in Neuroimaging Epigenetics have also been accompanied by several key challenges. First, studies in this field have been highly heterogeneous in terms of their design, characteristics, and methodology, with few shared practices, limiting comparability between findings and the identification of robust Neuroimaging Epigenetic associations. Second, the reliance on cross-sectional measurements, mainly in adults, likely ignores important temporal dynamics, as both DNAm and brain structure and volume are known vary over time [15,16,17,18,19] and in different ways (i) depending on the region examined (i.e., where in the genome or in the brain), (ii) in terms of their developmental trajectory (i.e., linear or non-linear), (iii) in light of (sensitive) periods with more change/growth (e.g., childhood and adolescence), (iv) considering the directionality of associations (i.e., whether changes in DNAm predict changes in the brain or vice versa), and (v) in association with different downstream phenotypes (e.g., neurodevelopmental, psychiatric). For example, recent evidence points to epigenetic ‘timing effects’ [6,7,8], whereby DNAm patterns in cord blood at birth have been shown to prospectively associate with certain neurodevelopmental problems in childhood (e.g., ADHD symptoms [6, 7], social communication deficits [8]) more strongly than DNAm patterns measured cross-sectionally during childhood. Similar ‘timing effects’ have also been reported for changes in brain volume, with the strength of phenotypic associations varying depending on timing of assessment [20]. Yet, how DNAm and neuroimaging measures relate to each other at different developmental stages is largely unknown. Finally, studies to date are typically based on single cohorts with small sample sizes of n < 100, with limited statistical power to detect what are likely to be small associations. While neighboring fields have already showed how large-scale collaborative efforts between cohorts [21,22,23,24] can be achieved, such initiatives at the intersection of epigenetics, imaging and development are lacking.

Recently, we established The Methylation, Imaging and NeuroDevelopment (MIND) Consortium to address these gaps. Our overarching goal is to understand how peripheral DNAm patterns relate to variation in brain structure and function across development. Specifically, we aim to bring a developmental focus to the emerging field of Neuroimaging Epigenetics by (i) promoting collaborative, adequately powered developmental research via multi-cohort analyses; (ii) increasing scientific rigor through the establishment of shared pipelines and open science practices; and (iii) elucidating directionality of associations between DNAm and the brain via the use of prospective, longitudinal cohorts across development (from birth through emerging adulthood).

Who is in the consortium and what has been measured?

The MIND consortium brings together a global set of research teams and cohorts from North America, South America, Asia, Europe, African and Australia (Fig. 1). It currently includes 16 cohorts, spanning population-based, twin and high-risk cohorts. The unifying feature of these cohorts is the availability of genome-wide DNAm data (Illumina 450K or Illumina EPIC chip) and neuroimaging data (e.g., T1-weighted images, diffusion-weighted images, and/or resting state functional MRI) collected at one or more time points across development, with at least one assessment prior to age 18 years. Most cohorts have data from fetal life into childhood, with several cohorts also covering adolescence and early adulthood. With ongoing data collection in many cohorts, new DNAm and neuroimaging data are also expected to become available at additional time points. Key characteristics of cohorts included in the MIND consortium can be found in Table 1 and Fig. 2. Full cohort descriptions and study-specific acknowledgements can be found in the Supplementary Materials.

**Fig. 1: World map of sample size per country (overlap DNAm and neuroimaging), as covered in MIND.**

Table 1 Key characteristics of cohorts participating in MIND.

Full size table

To date, the MIND cohorts comprise a total of 12,877 participants with DNAm profiles, 10,899 participants with neuroimaging data, and 6074 participants with both data types for neuroimaging epigenetic analyses. This is a significant expansion in sample size compared to the current average for developmental neuroimaging epigenetic studies (median N = 80, range 33–715 [11]), enabling us to characterize time-varying DNAm-brain associations in a more robust and nuanced manner. All cohorts have at least one time point of measurement, but 94% have two or more, up to five repeated time points. Cohorts participating in MIND are also notably diverse, including participants from various backgrounds in terms of ethnicity, geographical location and socio-economic environments, enabling the investigation of neuroimaging epigenetic associations across different settings, and increasing the inclusivity and generalizability of the findings. In addition, participants from most of the included cohorts have undergone extensive environmental, molecular and phenotypic profiling. As shown in Table 2, all cohorts have genetic data of participants – and in some cases also for relatives – which can be leveraged to account for population stratification, calculate polygenic scores, investigate gene-environment correlations and interactions as well as in some cases to study genetic nurture effects (i.e., using trio genetic data). Further, most cohorts have data on developmental and (mental) health outcomes, typically comprising measures of neurodevelopment, behavior, cognition and psychiatric symptoms, but also often feature other phenotypes such as anthropometrics and physical health assessments. Most cohorts have also measured environmental exposures, typically beginning in utero for the birth-cohorts (e.g., maternal smoking, diet, psychopathology and psychosocial stress during pregnancy) and during childhood/adolescence (e.g., early life stressors, parental influences, home environment and broader socio-economic factors). Finally, almost all cohorts have data on additional biological markers (e.g., inflammation) and profiling of other omics (e.g., microRNAs, metabolomics, proteomics and the gut microbiome).

Table 2 Overview of available measures different cohorts.

Full size table

How do we work together?

MIND operates on a federated model, wherein methods, practices, and results are shared among consortium members, while maintaining the confidentiality of the underlying individual-level data. Contributors have the flexibility to propose new projects, which operate on an opt-in basis. Participation from different cohorts in each project is optional and contingent on data availability, data sharing policies, resource allocation, and cohort-specific research interests. Similarly to consortia such as the Pregnancy And Childhood Epigenetics (PACE) Consortium [22], project leads are responsible for producing detailed analysis plans, which wherever possible will be pre-registered for transparency, and project leads will also be responsible for developing and testing analysis scripts. In the case of meta-analytic projects, scripts are disseminated across the consortium by the project lead for implementation by participating cohorts, who then share cohort-specific summary statistics with the project lead for pooling of results. Options for mega-analyses (i.e., pooled analyses of individual participant data) will also be explored in light of the increasing availability and uptake of federated data sharing platforms.

Getting started: identifying and addressing challenges in collaborative research on developmental Neuroimaging Epigenetics

Modeling (potentially) time-varying associations between two high-dimensional data modalities within a consortium framework offers new opportunities but also considerable challenges. Here, we outline three anticipated challenges within MIND, and reflect on potential strategies that could be used to address them (Fig. 3).

Challenge #1: Separating developmental from technical variability

A key goal of MIND is to characterize neuroimaging epigenetic associations across development. We therefore plan to integrate data from cohorts covering different ages and developmental periods. While all cohorts in MIND have collected DNAm and neuroimaging data using comparable techniques, they still show substantial variability in sample characteristics and data processing methods. Notably, technical variability could increase generalizability of findings, e.g., MIND findings are more generalizable when using a range of methods, whereas studies using a single technique alone may be less generalizable to cases where different techniques are used. Yet, technical variability also complicates integration of different findings across cohorts. Furthermore, sources of technical variability across cohorts are tied to (or coincide with) differences in the timing of assessments, making it difficult to separate (developmental) signal from (technical) noise. This raises the challenge of balancing the need of maximizing statistical power (typically by increasing pooled sample size as much as possible) with maximizing comparability across diverse cohorts (instead calling for more focused pooling of data based on shared characteristics). We highlight below main sources of variability in sample characteristics, DNAm, and neuroimaging.

Sample characteristics

MIND cohorts vary widely with respect to sample size, from numerous smaller studies with neonatal MRI (e.g., N = 86, UCI cohort) to larger cohorts with MRI data at different time points in childhood and adolescence (N = 822 at around age 18, ALSPAC cohort). This variation in sample size leads to uneven data availability across different developmental periods, resulting in unbalanced representation and differences in statistical power between these developmental periods. Furthermore, while almost all cohorts have repeated measures of DNAm or neuroimaging (or both), a small number has only a single time point available. This discrepancy results in partial overlap of samples when analyzing different developmental periods. In other words, the same group of participants is not consistently tracked across all time points in every study. To complicate matters, the ‘spacing’ of time points between DNAm and neuroimaging differs between cohorts. For example, while some cohorts have both DNAm and neuroimaging available in the neonatal period (maximum gap of a few weeks), others have DNAm at birth but only started with neuroimaging assessments in mid-childhood, resulting in a gap of several years. To address these sample differences, careful consideration of covariates (e.g., age at DNAm and neuroimaging assessments and the time gap between them) as well as methods to address partial sample overlap and sample size imbalance across cohorts (e.g., through the use of weighted approaches and leave-one-out analyses to test the stability of results) will be needed. Moreover, sex is a key covariate for consideration in DNAm-neuroimaging analyses. Studies have reported widespread autosomal sex differences in DNA methylation as early as at birth, with sex-differentiated sites enriched in genes related to nervous system development and mental health phenotypes [25, 26]. These differences may contribute to the observed sex disparities in psychiatric outcomes. Similarly, brain structure development also follows sex-dependent trajectories [27]. Given these findings, sex will be systematically accounted for in MIND analyses as a covariate, or where possible examined as a potential moderator through interaction analyses, to better understand the role of sex in DNAm-neuroimaging associations.

DNAm

Another source of variability across studies is in how DNAm is measured, both in terms of DNAm array and tissue examined. The type of DNAm array used varies not only between MIND cohorts but at times also within longitudinal cohorts. The gold standard for DNAm assessment is whole-genome bisulfite sequencing (WGBS), offering single-nucleotide precision and covering about 95% of all CpGs on the genome, or roughly 28 million CpG sites [28]. However, its high cost and computational burden limits its use in large-scale cohorts. As an alternative, Illumina has developed a series of more affordable BeadChip microarrays, starting with the HumanMethylation27 BeadChip in 2008, expanding to over 450,000 CpGs with the 450 K array in 2011 [29], to over 850,000 CpGs with the EPICv1 in 2016 [30] and featuring an additional 186,000 CpGs with EPICv2 in 2023 [31]. The latest version, the Methylation Screening Array (MSA), contains over 284,000 probes of which only around 145,000 overlap with EPICv2 [32]. The cost-effective nature of these arrays (~3 times cheaper than WGBS) has made it possible for cohorts to measure hundreds of thousands of CpGs across the genome in larger samples, and as a result these methods have been eagerly adopted by many cohort studies. Yet, DNAm arrays and their technology change over time, and while these changes are marked by the removal of poor-quality sites [33] and addition of new probes with evidence for biological or clinical importance, the availability of data across several arrays presents challenges for longitudinal cohorts transitioning between versions. The discrepancies at the individual CpG level could reduce consistency in longitudinal research [34, 35]. Consequently, it may be difficult for longitudinal cohorts to distinguish between developmental changes vs variations due to array or batch-related factors, especially as measuring the same sample using multiple arrays is hardly ever done. Fortunately, studies comparing arrays have reported generally high concordance [31, 34,35,36].

Even within a single array, processing-related biases can arise, particularly due to batch effects linked to plate assignment. For instance, the 450 K BeadChip accommodates 12 samples per chip, with a standard processing plate holding 8 chips, totaling 96 samples being processed per plate. Meanwhile, the EPICv1 and EPICv2 arrays accommodate 8 samples per chip, with 8 chips per plate, allowing for a total of 64 samples being processed per plate. These processing plates can introduce systematic biases, which are especially problematic in longitudinal studies, where plate effects may become confounded with time points if all samples from a given time point are processed on the same plate. To mitigate this, studies planning to generate repeated measures of DNAm could apply a two-stage pseudo-randomization process, whereby (i) samples from the same individual at different time points are processed on the same plate, minimizing batch effects in within-subject longitudinal analyses, while also (ii) ensuring that individuals are adequately randomized across plates, to limit batch effects in between-subject analyses. If new time points are added to a cohort with an existing DNAm assessment, it would also be recommended to run ‘bridge samples’ (i.e. re-estimating a set of samples from the existing time point together with the new time points), in order to better quantify - and control for – resulting batch effects. This is for example the strategy that one of the MIND cohorts, Generation R, has recently applied to its new longitudinal epigenetic data (encompassing repeated DNAm measures at age 6, 10, 14, and 18 years), whereby repeated measures per individual were assigned to the same plate, and bridge samples were run to link this new resource to existing DNAm data at birth.

Another key challenge lies in the choice of tissue for DNAm measurement [37, 38]. Since in vivo DNAm assessment in brain tissue is not feasible, studies typically rely on peripheral DNAm (measured in more accessible tissues) to investigate associations with neuroimaging parameters. However, peripheral DNAm may not accurately capture brain methylation patterns, as only ~10% of CpG sites are estimated to be both variable in blood (range > 0.1) and concordant between blood and brain (based on correlation, r > 0.33, brain region-dependent) [38]. These CpGs can be identified using blood-brain methylation databases (e.g., BECon [38], Blood-Brain Comparison Tool [39], IMAGE-CpG [40]), which provide cross-tissue comparisons. Alternatively, significant findings can be validated using brain tissue derived from brain banks (e.g., The Netherlands brain bank for psychiatry [41]). However, in both cases, these data are primarily derived from adult postmortem brain samples and are typically limited to specific brain regions, leaving it unclear if these correlations generalize to the developmental context, in vivo, across different brain regions, or across different cell populations (neuronal versus non-neuronal). Efforts are emerging to profile DNAm in postmortem brains from fetal life onwards, although these studies have yet to establish the comparability of DNAm patterns between brain and more accessible peripheral tissues [42, 43]. Notably, even when cross-tissue correspondence is low, peripheral DNAm patterns may still be associated with brain features measured by MRI and hold biological relevance – for example, with blood-derived CpG sites reflecting inflammatory processes that influence brain morphology or connectivity. Additionally, such sites may have practical value in biomarker development, serving as indicators of risk factors for poor brain health, such as smoking.

In addition, there is also variability in across development [44]. At birth, DNAm is often measured from cord blood or other accessible tissues such as placenta, whereas later in life, samples are typically obtained from peripheral blood, saliva, or buccal cells. Even within the same tissue, cell-type variability may exist (e.g., cord blood versus peripheral blood). Most cohorts collect DNAm in only one tissue type, although MIND does feature a small subset of cohorts that collected DNAm in multiple tissues (e.g., saliva and blood, Kids2health). A common practice for many multi-cohort studies (e.g., [6, 14, 45]), has been to focus on blood tissue, including cord, heel prick and peripheral whole blood during development. While utilizing data obtained from a single tissue can enhance comparability and reduce noise between studies, it can come at the cost of lower sample size and limited generalizability to different tissues. Yet, even within the same tissue, cell-type composition can change dramatically across development, both in terms of the type and proportion of cells in bulk tissue. For example, nucleated red blood cells (nRBCs) are abundant in cord blood, but largely absent in blood later in development. Cell-type composition in bulk tissue can be adjusted for using age/tissue-appropriate reference panels (e.g., reference for cord blood versus peripheral blood at later time points, [6, 45]). However, this also introduces a further source of technical variability in longitudinal analyses (given that each panel has been estimated using different data sources). For example, a previous study characterizing DNAm trajectories across development found widespread age-related changes in DNAm patterns over the first two decades of life [15]. In some cases, these changes were non-linear (e.g. DNAm changing more rapidly from birth to mid-childhood, compared to later time points), raising questions about whether these changes are due to developmental timing or differences between cord and peripheral blood. In this respect, the creation of a single, integrated panel which estimates a more comprehensive set of cell-types, including (i) cells that may be present at only one time point (e.g., nRBCs at birth), as well as (ii) cell types that may be ubiquitous across development, but still vary in abundance over time (e.g., naïve versus memory B cells), could offer an attractive solution to better separate tissue from timing-related differences.

Neuroimaging

Measurement heterogeneity is also a factor for neuroimaging assessments. MRI scanner types typically vary across study sites, and even within one site, scanners are often upgraded or replaced over time. Different strategies that have been proposed for reducing this unwanted geographical and temporal variation, include harmonization techniques like ComBat [46,47,48], deep learning [49], or hierarchical Bayesian regression [50]. However, removing the geographical and temporal variation from neuroimaging assessments can be challenging in developmental research: consider participants scanned repeatedly at different ages (e.g., as newborns and later in development) at various locations or with different scanners. While harmonization techniques aim to eliminate unwanted scanner or site variations, there is a risk they might also inadvertently remove developmental variation-of-interest. Currently, there is a lack of consensus about whether these methods should or should not be used for developmental cohorts. For example, one large-scale study by Ge et al. [19], involving almost 40,000 participants from 86 cohorts, used ComBat to control for unwanted site effects while comparing statistical methods for normative modeling of brain morphometric data. Another large-scale study by Kia et al. [50], also including almost 40,000 participants from 79 scanners, harmonized their data by using federated hierarchical Bayesian regression. In contrast, another large study by Bethlehem et al. [18], including over 100,000 participants from 100 cohorts, chose not to perform harmonization techniques for constructing normalized brain charts. Overall, it is unclear how best to strike a balance between reducing unwanted scanner variation while also preserving developmental changes crucial for accurate interpretation.

Most neuroimaging processing software packages - such as FreeSurfer, Statistical Parametric Mapping (SPM), and FMRIB Software Library (FSL) – have been developed for and tested in adult samples, raising concerns about how these methods perform in data from infants and children. For example, although the standard FreeSurfer suite has been successfully applied to data from children aged 4 years [51], Infant FreeSurfer has been developed for infants aged 0–2 [52]. Other software packages developed especially for fetal or neonatal brain scans include iBEAT [53] and NEOCIVET [54]. An important question that follows is whether cohorts should prioritize methodological consistency (i.e., reduce technical variability at the cost of potentially lower accuracy, by processing all neuroimaging data with the same software) or developmental sensitivity (i.e., use accurate tools at the cost of methodological consistency, by processing neuroimaging data with developmentally appropriate software) when harmonizing cross-cohort neuroimaging data across different life stages. Both strategies have been used in the past. For example, a recent analysis from the ENIGMA Lifespan consortium, including data from age 3 to 90 years across 86 cohorts, only used standard FreeSurfer [19]. In contrast, Bethlehem et al. [18] created normalized brain charts for mid-gestation up to age 100, while allowing for the use of developmentally-sensitive processing software across cohorts. Consequently, all adult cohorts were processed using FreeSurfer, but there was considerable heterogeneity of methods in data from pre-birth to early childhood (e.g., manual segmentation, NEOCIVET, customized FreeSurfer versions, Infant FreeSurfer, and standard Freesurfer). It is still unclear whether the use of age-tailored software tools adds or reduces noise in longitudinal analyses that span several developmental periods (e.g., birth to early adulthood). Other strategies include leveraging age-appropriate software, while clustering analyses by developmental period, and accounting for software-related variations during the meta-analysis phase. Alternatively, findings can be triangulated using different processing software. Overall, research evaluating the best options for leveraging developmentally sensitive data is needed to provide evidence-based guidelines for the field.

Challenge #2: Modeling time-varying DNAm-brain associations in multi-cohort analyses

Besides considering how best to account for variability between MIND cohorts, decisions must also be made regarding how best to statistically perform multi-cohort integration. Both DNAm and brain structure are highly dynamic across development, meaning their associations may differ depending on when they are assessed [15,16,17,18,19]. In consortium settings, standard meta-analyses remain the most popular option, where an exposure (e.g., DNAm at specific loci) is associated with an outcome (e.g., a brain feature) using regression models within each individual cohort, after which cohort-level summary statistics are pooled through meta-analysis. Such methods were used by Jia et al. [14], which is currently the largest DNAm-brain multi-cohort study [12], meta-analyzing data from 3337 participants (mainly adults) across 11 cohorts. Notably, only two CpGs were found to be associated with hippocampal volume after correction for multiple testing, and none for the thalamus or the nucleus accumbens – the other two regions investigated. While this could suggest that the meta-analysis was either underpowered to detect modest effects or that peripheral DNAm and volumetric brain differences are largely unrelated, it could also indicate that rather than being constant, associations vary over time. Unless explicitly modeled, however, this information is lost in standard meta-analytic approaches, which inherently assume constant exposure-outcome associations; i.e., that the link between DNAm and the brain does not change over time. Although this restriction could have the advantage of identifying age-averaged associations (which may be more generalizable across age groups), it comes at the cost of obscuring time-varying effects. This may be particularly problematic for studies which pool data from both pediatric and adult cohorts, as changes in DNAm patterns and brain features are expected to be more drastic earlier in life.

An alternative approach to address developmental timing in multi-cohort studies involves strategic pooling of summary data at specific age ranges, as opposed to pooling all cohorts together regardless of age. This approach is commonly used in the PACE consortium (e.g., [6, 45]), prioritizing one epigenetic time point of interest (e.g., birth, because it is generally the time point with the largest sample size, or because of a focus on pregnancy exposures) and then quering later DNAm assessments by (i) adopting a follow-forward approach, where significant CpG sites are selected for further analysis and tested at subsequent time points (e.g., [7, 9, 55]) or by (ii) conducting a completely new set of analyses at later developmental stages (e.g., running separate epigenome-wide analyses using DNAm at birth and DNAm in childhood in relation to the same child outcome [6, 45]). While this strategy helps to partition analyses according to more developmentally-specific age groups, it does not directly quantify temporal changes in DNAm-brain associations, nor does it account for the potential non-linearity of this relationship over time. For this, meta-regression methods could be applied, which allow to test, within the same model (still based on cohort-specific summary statistics), (i) how DNAm and brain features associate at each available time points/developmental period, (ii) whether these associations change across time periods, and (iii) whether changes are linear or non-linear [15,16,17]. For an example of applying longitudinal meta-regression to quantify DNAm timing effects on child health outcomes using cohort-level summary statistics from the PACE Consortium, see [56].

While the above strategies can help us get started, ultimately, the transition toward a mega-analysis framework using individual-level data will offer greater modeling flexibility and new opportunities to address developmental questions in collaborative multi-cohort studies. Three concrete examples of potential applications here include longitudinal mixed models (LMM), structured life-course modeling analysis (SLCMA) and structural equation modeling (SEM). Mixed models test whether changes in a predictor (measured at repeated time points, e.g., DNAm at birth, age 6, and age 10) associates with an outcome (measured at single time points, e.g., brain feature at age 14) over time. While we are not aware of any existing multi-cohort studies using LMM to model DNAm-brain associations, this method was recently applied (i) in a single-cohort study to examine whether DNAm sites across time prospectively associate with the amygdala:hippocampus ratio in early adulthood [57], and (ii) to individual-level data from two longitudinal cohorts in order to characterize how epigenome-wide DNAm patterns vary over the first two decades of life [15]. As an alternative to LMM, SLCMA uses repeated measures of a predictor to test competing temporal hypotheses about its effect on a particular outcome (measured at a single time point). It has been used for example to test the developmental effect of stress exposure on child DNAm, showing that stressful experiences are more likely to associate with changes in DNAm if they occur early rather than late in childhood [58, 59]). This approach could be extended to model repeated DNAm as a predictor, in order to establish whether epigenetic effects on brain outcomes may also be developmentally-specific (orange line, Fig. 3), or might diminish or accumulate over time (red and purple lines, Fig. 3), as opposed to being temporally invariant (green line, Fig. 3). Finally, SEM allows modeling of repeated measures for both the predictor and the outcome; it has been previously applied to DNAm and neuroimaging data, although not within the same study. In the context of MIND, this model could be used to integrate DNAm-brain data, for example, (i) to estimate the degree of temporal stability in either epigenetic patterns and brain features across development (i.e., autoregressive paths), (ii) to examine relationships between these DNAm and the brain over time (i.e., cross-lagged paths), and (iii) to test genetic/environmental predictors or phenotypic (health) outcomes of identified DNAm-brain associations, as well as mediating effects (i.e., indirect paths). Despite the potential of these advanced methods to model complex (repeated) DNAm-brain measures for time-varying associations, their implementation within a multi-cohort framework remains a more distant goal. Success in this respect will first depend on overcoming existing barriers in sharing of individual-level data, increasing availability of meta-analyses for advanced statistical models, accessing a large enough set of cohorts with sufficiently comparable repeated DNAm/neuroimaging measures, and deciding how best to balance the complexity of longitudinal methods with the high-dimensionality of these data types, as discussed in more detail below.

Challenge #3: Analyzing high-dimensional data

A third set of challenges relates to the dimensionality of DNAm and MRI data. In MIND, if we were to focus on single predictor–single outcome models, with a sample size of approximately >6000, we would be powered to detect an effect size (R²) of 0.001 at p < 0.05. However, DNAm data is commonly investigated through univariate epigenome-wide associations (with covariate adjustment), where DNAm-phenotype relationships are studied for a large number of individual sites across the genome. Epigenome-wide investigations currently interrogate up to 900,000 data points per person, depending on which array is used (note that even more data points can be measured with some technologies [28]). A commonly employed statistical threshold in such studies is p < 9 × 10⁻⁸ [60]. Under this threshold, MIND would be powered to detect associations with an effect size (R²) of 0.006. Yet, brain data can be treated as high-dimensional as well, and are often examined voxel-wide, where local density of gray matter is compared between participants (e.g., per 1mm³ of the brain), again resulting in hundreds of thousands of data points per participant. Findings from other epigenetic or neuroimaging consortia already suggest that detecting small to medium effects typically requires thousands of participants to ensure robust and reproducible associations when analyzing epigenetic these data types independently [61, 62]. When these datasets are combined, the challenge of collating adequately powered sample sizes becomes even greater. If epigenome- and voxel-wide data are analyzed together in the same model, even if just at a single time point, this would add up to examining more than a billion associations (CpG by voxel). At present, there are no concrete estimates for the sample sizes required for such large-scale Neuroimaging Epigenetics analyses. However, assuming a strict Bonferroni-adjusted threshold of p < 1 × 10⁻¹³ (as based on genome by voxel-wide analyses [63]), we would still be powered to detect small effects with an R² of 0.011 with an sample size >6000. Notably, recent efforts have been made to apply such high-dimensional models within a single time point in the context of gene-environment interplay on DNAm, with some promising findings [64]. Yet, such models, especially if extended longitudinally, are currently neither practical (due to computational burden) nor feasible (due to limited power in smaller subsets of the data) within Neuroimaging Epigenetics. Therefore, in the context of MIND, some level of data manipulation is necessary.

One approach to managing high-dimensional DNAm data is preselecting CpGs, e.g. by focusing on variable CpGs known to differ across individuals, those with known cross-tissue correspondence (identified within blood-brain methylation databases [38]), or by focusing on CpGs close to genes implicated in neuronal development. However, candidate-based approaches have inherent limitations, including reduced discovery potential, the risk of overlooking novel or developmentally relevant CpG sites, and the challenge of ensuring generalizability across diverse populations and tissues. Alternatively, high-dimensional data can be handled by applying dimension reduction techniques, such as principal component analysis, (parallel) independent component analysis, local Fisher’s discriminant analysis, and canonical correlation analysis [65, 66]. These methods have indeed been applied to neuroimaging epigenetic data in single cohorts, although their application to multi-cohort analyses can be challenging due to the potential of obtaining different model solutions (e.g., factors or components comprised of different CpGs/brain features, or different weights) in different cohorts.

Additionally, we can condense high-dimensional data into singular aggregate scores that can be computed in a comparable way across cohorts, as done for example in the case of biological age estimates such as epigenetic age and brain age [67, 68]. In particular, the use of methylation profile scores (MPS) is gaining popularity in epigenetic research [69], following in the footsteps of polygenic scores (PGS) within the field of genetics. Within MIND, we could use MPSs to proxy at an epigenetic level brain-relevant exposures (e.g., prenatal smoking or insufficient sleep), biological processes (e.g., inflammation, neuroendocrine function), and health outcomes (e.g., mental or cardiometabolic phenotypes). We could also seek to develop MPSs of brain features themselves, an approach not yet attempted. In this context, it would be particularly interesting to conduct pathway analyses on the included CpGs to identify the underlying biological mechanisms driving associations in Neuroimaging Epigenetics. Another application of MPSs is to use them as a proxy for missing covariates or to enhance the quality of data imputation, which could be particularly helpful in the case of multi-cohort analyses (as certain cohorts may not have collected important covariates, or at least not at the specific age of interest). Overall, MPSs may offer a practical solution to manage high dimensionality while also aggregating together many epigenetic loci of small effect; yet, they also have their challenges. For example, MPSs are increasingly constructed using penalized linear regression techniques, which require adequate consideration and availability of training, tuning and testing datasets. The selection and splitting of datasets, and the amount of data reserved for training versus testing, can reduce sample size and power. Of note, to obtain reliable estimates for PGSs in complex traits, discovery samples from tens to hundreds of thousands of individuals have been required. Sample sizes required to construct reliable MPSs remain undefined. Furthermore, the predictive power of PRSs is influenced by ancestry [70]. While some evidence shows this may be less of an issue with MPSs [71], it will be of interest to test performance of MPSs in MIND to generate MPSs that can ideally be applied across ancestries. Finally, MPSs cannot be reliably constructed without accounting for key covariates known to influence the epigenome, such as age, sex, prenatal and postnatal exposure to smoking, and cell-type composition. All these factors must be carefully adjusted for or integrated into model construction to ensure valid MPSs.

Conclusions

The potential of large-scale collaborative efforts between cohorts has been clearly demonstrated by the success of genome-wide association studies (for example by the Psychiatric Genetic Consortium [21]), epigenome-wide association studies (for example by PACE [22]), and neuroimaging studies (for example by ENIGMA [23] or specifically ENIGMA-ORIGINs [24]). The MIND consortium aims to apply these principles to the emerging field of Neuroimaging Epigenetics, with a particular emphasis on development. Priorities include fostering multi-cohort analyses to access collaborative, adequately powered developmental research; to establish shared pipelines; and to elucidate the time-varying relationship between peripheral DNAm and brain development through prospective, longitudinal cohorts spanning pre-birth to young adulthood. MIND is committed to pursuing these goals while supporting open science practices, including the use of pre-registration of studies, sharing of analysis scripts, and making full results (e.g., meta-analysis summary statistics) openly accessible. Through these activities, the consortium aims to facilitate an integrative and efficient research environment and to enhance the transparency and reproducibility of our research. Overall, we hope that MIND will represent a significant step towards shedding light into the complex, dynamic relationship between peripheral DNAm and the brain, and to understanding how these ultimately relate to neurodevelopmental and psychiatric phenotypes.

Joining us

The MIND Consortium operates as an open network, welcoming researchers interested in joining with one or more of its cohorts. In this consortium, each participant manages and analyses their data locally. Detailed protocols for each study can be accessed through their individual websites (see Supplementary materials) or by contacting the study investigators. In order to participate in MIND, cohorts must feature at least one assessment of DNAm (peripheral, array-based) and neuroimaging collected before the age of 18 years. Those interested in collaborating with the MIND Consortium are encouraged to contact the corresponding authors (Dr. Esther Walton, E.Walton@bath.ac.uk; Dr. Charlotte Cecil, c.cecil@erasmusmc.nl). More information can also be found on our website: https://www.erasmusmc.nl/en/research/groups/methylation-imaging-and-neurodevelopment-mind-consortium#.

References

Szyf M, McGowan P, Meaney MJ. The social environment and the epigenome. Environ Mol Mutagen. 2008;49:46–60.
Article PubMed CAS Google Scholar
Van Dongen J, Nivard MG, Willemsen G, Hottenga J-J, Helmer Q, Dolan CV, et al. Genetic and environmental influences interact with age and sex in shaping the human methylome. Nat Commun. 2016;7:11115.
Article PubMed PubMed Central Google Scholar
Jones MJ, Goodman SJ, Kobor MS. DNA methylation and healthy human aging. Aging Cell. 2015;14:924–32.
Article PubMed PubMed Central CAS Google Scholar
Gao X, Chen Q, Yao H, Tan J, Liu Z, Zhou Y, et al. Epigenetics in Alzheimer’s disease. Front Aging Neurosci. 2022;14:911635.
Article PubMed PubMed Central CAS Google Scholar
Abdolmaleky HM, Smith CL, Faraone SV, Shafa R, Stone W, Glatt SJ, et al. Methylomics in psychiatry: modulation of gene–environment interactions may be through DNA methylation. Am J Med Genet B Neuropsychiatric Genet. 2004;127:51–59.
Article Google Scholar
Neumann A, Walton E, Alemany S, Cecil C, González JR, Jima DD, et al. Association between DNA methylation and ADHD symptoms from birth to school age: a prospective meta-analysis. Transl Psychiatry. 2020;10:398.
Article PubMed PubMed Central CAS Google Scholar
Walton E, Pingault JB, Cecil CAM, Gaunt TR, Relton CL, Mill J, et al. Epigenetic profiling of ADHD symptoms trajectories: a prospective, methylome-wide study. Mol Psychiatry. 2017;22:250–6.
Article PubMed CAS Google Scholar
Rijlaarsdam J, Cecil CAM, Relton CL, Barker ED. Epigenetic profiling of social communication trajectories and co-occurring mental health problems: a prospective, methylome-wide association study. Dev Psychopathol. 2022;34:854–63.
Article PubMed Google Scholar
Luo M, Walton E, Neumann A, Thio CHL, Felix JF, van Ijzendoorn MH, et al. DNA methylation at birth and lateral ventricular volume in childhood: a neuroimaging epigenetics study. J Child Psychol Psychiatry. 2023;65:77–90.
Article PubMed PubMed Central Google Scholar
Caramaschi D, Hatcher C, Mulder RH, Felix JF, Cecil CAM, Relton CL, et al. Epigenome-wide association study of seizures in childhood and adolescence. Clin Epigenetics. 2020;12:1–13.
Article Google Scholar
Walton E, Baltramonaityte V, Calhoun V, Heijmans BT, Thompson PM, Cecil CA. A systematic review of neuroimaging epigenetic research: calling for an increased focus on development. Mol Psychiatry. 2023;28:2839–47.
Article PubMed PubMed Central CAS Google Scholar
Wheater EN, Stoye DQ, Cox SR, Wardlaw JM, Drake AJ, Bastin ME, et al. DNA methylation and brain structure and function across the life course: a systematic review. Neurosci Biobehav Rev. 2020;113:133–56.
Article PubMed PubMed Central CAS Google Scholar
Berger A. How does it work?: Magnetic resonance imaging. BMJ: British Medical Journal. 2002;324:35.
Article PubMed PubMed Central Google Scholar
Jia T, Chu C, Liu Y, Van Dongen J, Papastergios E, Armstrong NJ, et al. Epigenome-wide meta-analysis of blood DNA methylation and its association with subcortical volumes: findings from the ENIGMA Epigenetics Working Group. Mol Psychiatry. 2021;26:3884–95.
Article PubMed Google Scholar
Mulder RH, Neumann A, Cecil CAM, Walton E, Houtepen LC, Simpkin AJ, et al. Epigenome-wide change and variation in DNA methylation in childhood: trajectories from birth to late adolescence. Hum Mol Genet. 2021;30:119–34.
Article PubMed PubMed Central CAS Google Scholar
Oh ES, Petronis A. Origins of human disease: the chrono-epigenetic perspective. Nat Rev Genet. 2021;22:533–46.
Article PubMed CAS Google Scholar
Shaw P, Kabani NJ, Lerch JP, Eckstrand K, Lenroot R, Gogtay N, et al. Neurodevelopmental trajectories of the human cerebral cortex. J Neurosci. 2008;28:3586–94.
Article PubMed PubMed Central CAS Google Scholar
Bethlehem RAI, Seidlitz J, White SR, Vogel JW, Anderson KM, Adamson C, et al. Brain charts for the human lifespan. Nature. 2022;604:525–33.
Article PubMed PubMed Central CAS Google Scholar
Ge R, Yu Y, Qi YX, Fan YN, Chen S, Gao C, et al. Normative modeling of brain morphometry across the lifespan using CentileBrain: algorithm benchmarking and model optimization. Lancet Digit Health. 2024;6:211–22.
Fuhrmann D, Knoll LJ, Blakemore S-J. Adolescence as a sensitive period of brain development. Trends Cogn Sci. 2015;19:558–66.
Article PubMed Google Scholar
Sullivan PF. The psychiatric GWAS consortium: big science comes to psychiatry. Neuron. 2010;68:182–6.
Article PubMed PubMed Central CAS Google Scholar
Felix JF, Joubert BR, Baccarelli AA, Sharp GC, Almqvist C, Annesi-Maesano I, et al. Cohort profile: pregnancy and childhood epigenetics (PACE) consortium. Int J Epidemiol. 2018;47:22–23u.
Article PubMed Google Scholar
Thompson PM, Stein JL, Medland SE, Hibar DP, Vasquez AA, Renteria ME, et al. The ENIGMA Consortium: large-scale collaborative analyses of neuroimaging and genetic data. Brain Imaging Behav. 2014;8:153–82.
Article PubMed PubMed Central Google Scholar
Alex AM, Buss C, Davis EP, de Los Campos G, Donald KA, Fair DA, et al. Genetic influences on the developing young brain and risk for neuropsychiatric disorders. Biol Psychiatry. 2023;93:905–20.
Article PubMed PubMed Central CAS Google Scholar
Maschietto M, Bastos LC, Tahira AC, Bastos EP, Euclydes VLV, Brentani A, et al. Sex differences in DNA methylation of the cord blood are related to sex-bias psychiatric diseases. Sci Rep. 2017;7:44547.
Article PubMed PubMed Central CAS Google Scholar
Solomon O, Huen K, Yousefi P, Küpers LK, González JR, Suderman M, et al. Meta-analysis of epigenome-wide association studies in newborns and children show widespread sex differences in blood DNA methylation. Mutat Res Rev Mutat Res. 2022;789:108415.
Article PubMed PubMed Central CAS Google Scholar
Koolschijn PCMP, Crone EA. Sex differences and structural brain maturation from childhood to early adulthood. Dev Cogn Neurosci. 2013;5:106–18.
Article PubMed PubMed Central Google Scholar
Kurdyukov S, Bullock M. DNA methylation analysis: choosing the right method. Biology. 2016;5:3.
Article PubMed PubMed Central Google Scholar
Dedeurwaerder S, Defrance M, Calonne E, Denis H, Sotiriou C, Fuks F. Evaluation of the Infinium Methylation 450K technology. Epigenomics. 2011;3:771–84.
Article PubMed CAS Google Scholar
Pidsley R, Zotenko E, Peters TJ, Lawrence MG, Risbridger GP, Molloy P, et al. Critical evaluation of the Illumina MethylationEPIC BeadChip microarray for whole-genome DNA methylation profiling. Genome Biol. 2016;17:1–17.
Article Google Scholar
Noguera-Castells A, García-Prieto CA, Álvarez-Errico D, Esteller M. Validation of the new EPIC DNA methylation microarray (900K EPIC v2) for high-throughput profiling of the human DNA methylome. Epigenetics. 2023;18:2185742.
Article PubMed PubMed Central Google Scholar
Goldberg DC, Cloud C, Lee SM, Barnes B, Gruber S, Kim E, et al. MSA: scalable DNA methylation screening BeadChip for high-throughput trait association studies. Cell Genom. 2025;5:100929.
Sugden K, Hannon EJ, Arseneault L, Belsky DW, Corcoran DL, Fisher HL, et al. Patterns of reliability: assessing the reproducibility and integrity of DNA methylation measurement. Patterns. 2020;1:100014.
Article PubMed PubMed Central CAS Google Scholar
Olstad EW, Nordeng HME, Sandve GK, Lyle R, Gervin K. Low reliability of DNA methylation across Illumina Infinium platforms in cord blood: implications for replication studies and meta-analyses of prenatal exposures. Clin Epigenetics. 2022;14:80.
Article PubMed PubMed Central CAS Google Scholar
Solomon O, MacIsaac J, Quach H, Tindula G, Kobor MS, Huen K, et al. Comparison of DNA methylation measured by Illumina 450K and EPIC BeadChips in blood of newborns and 14-year-old children. Epigenetics. 2018;13:655–64.
Article PubMed PubMed Central Google Scholar
Fernandez-Jimenez N, Allard C, Bouchard L, Perron P, Bustamante M, Bilbao JR, et al. Comparison of Illumina 450K and EPIC arrays in placental DNA methylation. Epigenetics. 2019;14:1177–82.
Article PubMed PubMed Central Google Scholar
Walton E, Hass J, Liu J, Roffman JL, Bernardoni F, Roessner V, et al. Correspondence of DNA methylation between blood and brain tissue and its application to schizophrenia research. Schizophr Bull. 2016;42:406–14.
Article PubMed Google Scholar
Edgar RD, Jones MJ, Meaney MJ, Turecki G, Kobor MS. BECon: a tool for interpreting DNA methylation findings from blood in the context of brain. Transl Psychiatry. 2017;7:e1187.
Article PubMed PubMed Central CAS Google Scholar
Hannon E, Lunnon K, Schalkwyk L, Mill J. Interindividual methylomic variation across blood, cortex, and cerebellum: implications for epigenetic studies of neurological and neuropsychiatric phenotypes. Epigenetics. 2015;10:1024–32.
Article PubMed PubMed Central Google Scholar
Braun PR, Han S, Hing B, Nagahama Y, Gaul LN, Heinzman JT, et al. Genome-wide DNA methylation comparison between live human brain and peripheral tissues within individuals. Transl Psychiatry. 2019;9:47.
Article PubMed PubMed Central Google Scholar
Rademaker MC, de Lange GM, Palmen SJMC. The Netherlands brain bank for psychiatry. Handb Clin Neurol. 2018;150:3–16.
Article PubMed Google Scholar
Spiers H, Hannon E, Schalkwyk LC, Smith R, Wong CC, O’Donovan MC, et al. Methylomic trajectories across human fetal brain development. Genome Res. 2015;25:338–52.
Article PubMed PubMed Central CAS Google Scholar
Franklin A, Davies JP, Clifton NE, Blake GET, Bamford R, Walker EM, et al. Cell-type-specific DNA methylation dynamics in the prenatal and postnatal human cortex. bioRxiv:2025.02.21.639467 [Preprint]. 2025. [cited 21 Feb 2025, https://www.biorxiv.org/content/10.1101/2025.02.21.639467v1].
Gunasekara CJ, Scott CA, Laritsky E, Baker MS, MacKay H, Duryea JD, et al. A genomic atlas of systemic interindividual epigenetic variation in humans. Genome Biol. 2019;20:1–12.
Article CAS Google Scholar
Rijlaarsdam J, Cosin-Tomas M, Schellhas L, Abrishamcar S, Malmberg A, Neumann A, et al. DNA methylation and general psychopathology in childhood: an epigenome-wide meta-analysis from the PACE consortium. Mol Psychiatry. 2023;28:1128–36.
Article PubMed CAS Google Scholar
Beer JC, Tustison NJ, Cook PA, Davatzikos C, Sheline YI, Shinohara RT, et al. Longitudinal ComBat: a method for harmonizing longitudinal multi-scanner imaging data. Neuroimage. 2020;220:117129.
Article PubMed Google Scholar
Fortin J-P, Cullen N, Sheline YI, Taylor WD, Aselcioglu I, Cook PA, et al. Harmonization of cortical thickness measurements across scanners and sites. Neuroimage. 2018;167:104–20.
Article PubMed Google Scholar
Radua J, Vieta E, Shinohara R, Kochunov P, Quidé Y, Green MJ, et al. Increased power by harmonizing structural MRI site differences with the ComBat batch adjustment method in ENIGMA. Neuroimage. 2020;218:116956.
Article PubMed Google Scholar
Cetin-Karayumak S, Zhang F, Zurrin R, Billah T, Zekelman L, Makris N, et al. Harmonized diffusion MRI data and white matter measures from the Adolescent Brain Cognitive Development Study. Sci Data. 2024;11:249.
Article PubMed PubMed Central Google Scholar
Kia SM, Huijsdens H, Rutherford S, de Boer A, Dinga R, Wolfers T, et al. Closing the life-cycle of normative modeling using federated hierarchical Bayesian regression. PLoS ONE. 2022;17:e0278776.
Article PubMed PubMed Central CAS Google Scholar
Ghosh SS, Kakunoori S, Augustinack J, Nieto-Castanon A, Kovelman I, Gaab N, et al. Evaluating the validity of volume-based and surface-based brain image registration for developmental cognitive neuroscience studies in children 4 to 11 years of age. Neuroimage. 2010;53:85–93.
Article PubMed Google Scholar
Zöllei L, Iglesias JE, Ou Y, Grant PE, Fischl B. Infant FreeSurfer: an automated segmentation and surface extraction pipeline for T1-weighted neuroimaging data of infants 0–2 years. Neuroimage. 2020;218:116946.
Article PubMed Google Scholar
Dai Y, Shi F, Wang L, Wu G, Shen D. iBEAT: a toolbox for infant brain magnetic resonance image processing. Neuroinformatics. 2013;11:211–25.
Article PubMed Google Scholar
Kim H, Lepage C, Maheshwary R, Jeon S, Evans AC, Hess CP, et al. NEOCIVET: towards accurate morphometry of neonatal gyrification and clinical applications in preterm newborns. Neuroimage. 2016;138:28–42.
Article PubMed Google Scholar
Gruzieva O, Xu C-J, Yousefi P, Relton C, Merid SK, Breton CV, et al. Prenatal particulate air pollution and DNA methylation in newborns: an epigenome-wide meta-analysis. Environ Health Perspect. 2019;127:057012.
Article PubMed PubMed Central Google Scholar
Neumann A, Sammallahti S, Cosin-Tomas M, Reese SE, Suderman M, Alemany S, et al. Epigenetic timing effects on child developmental outcomes: a longitudinal meta-regression of findings from the Pregnancy And Childhood Epigenetics Consortium. Genome Med. 2025;17:39.
Article PubMed PubMed Central Google Scholar
Walton E, Cecil CAM, Suderman M, Liu J, Turner JA, Calhoun V, et al. Longitudinal epigenetic predictors of amygdala: hippocampus volume ratio. J Child Psychol Psychiatry. 2017;58:1341–50.
Article PubMed PubMed Central Google Scholar
Dunn EC, Soare TW, Zhu Y, Simpkin AJ, Suderman MJ, Klengel T, et al. Sensitive periods for the effect of childhood adversity on DNA methylation: results from a prospective, longitudinal study. Biol Psychiatry. 2019;85:838–49.
Article PubMed PubMed Central CAS Google Scholar
Lussier AA, Zhu Y, Smith BJ, Cerutti J, Fisher J, Melton PE, et al. Association between the timing of childhood adversity and epigenetic patterns across childhood and adolescence: findings from the Avon Longitudinal Study of Parents and Children (ALSPAC) prospective cohort. Lancet Child Adolesc Health. 2023;7:532–43.
Article PubMed PubMed Central Google Scholar
Mansell G, Gorrie-Stone TJ, Bao Y, Kumari M, Schalkwyk LS, Mill J, et al. Guidance for DNA methylation studies: statistical insights from the Illumina EPIC array. BMC Genomics. 2019;20:1–15.
Article Google Scholar
Marek S, Tervo-Clemmens B, Calabro FJ, Montez DF, Kay BP, Hatoum AS, et al. Reproducible brain-wide association studies require thousands of individuals. Nature. 2022;603:654–60.
Article PubMed PubMed Central CAS Google Scholar
Campagna MP, Xavier A, Lechner-Scott J, Maltby V, Scott RJ, Butzkueven H, et al. Epigenome-wide association studies: current knowledge, strategies and recommendations. Clin Epigenetics. 2021;13:1–24.
Article Google Scholar
Luo Q, Chen Q, Wang W, Desrivières S, Quinlan EB, Jia T, et al. Association of a schizophrenia-risk nonsynonymous variant with putamen volume in adolescents: a voxelwise and genome-wide association study. JAMA Psychiatry. 2019;76:435–45.
Article PubMed PubMed Central Google Scholar
Mulder RH, Baltramonaityte V, Defina S, Trajanoska K, Suderman M, Schwarz E, et al. Interactive effects of genotype with prenatal stress on DNA methylation at birth. medRxiv:2024.11.20.24317575 [Preprint]. 2024. [cited 20 Nov 2024, https://www.medrxiv.org/content/10.1101/2024.11.20.24317575v1].
Ray P, Reddy SS, Banerjee T. Various dimension reduction techniques for high dimensional data analysis: a review. Artif Intell Rev. 2021;54:3473–515.
Article Google Scholar
Pearlson GD, Liu J, Calhoun VD. An introductory review of parallel independent component analysis (p-ICA) and a guide to applying p-ICA to genetic data and imaging phenotypes to identify disease-associated biological pathways and systems in common complex disorders. Front Genet. 2015;6:276.
Article PubMed PubMed Central Google Scholar
Horvath S, Raj K. DNA methylation-based biomarkers and the epigenetic clock theory of ageing. Nat Rev Genet. 2018;19:371–84.
Article PubMed CAS Google Scholar
Cole JH, Franke K. Predicting age using neuroimaging: innovative brain ageing biomarkers. Trends Neurosci. 2017;40:681–90.
Article PubMed CAS Google Scholar
Nabais MF, Gadd DA, Hannon E, Mill J, McRae AF, Wray NR. An overview of DNA methylation-derived trait score methods and applications. Genome Biol. 2023;24:28.
Article PubMed PubMed Central CAS Google Scholar
Wang Y, Kanai M, Tan T, Kamariza M, Tsuo K, Yuan K, et al. Polygenic prediction across populations is influenced by ancestry, genetic architecture, and methodology. Cell Genomics. 2023;3:100408.
Article PubMed PubMed Central CAS Google Scholar
Chen J, Gatev E, Everson T, Conneely KN, Koen N, Epstein MP, et al. Pruning and thresholding approach for methylation risk scores in multi-ancestry populations. Epigenetics. 2023;18:2187172.
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We are extremely grateful to all the families who took part in this study, the midwives for their help in recruiting them, and the whole ALSPAC team, which includes interviewers, computer and laboratory technicians, clerical workers, research scientists, volunteers, managers, receptionists and nurses. The UK Medical Research Council and Wellcome (Grant ref: 217065/Z/19/Z) and the University of Bristol provide core support for ALSPAC. GWAS data was generated by Sample Logistics and Genotyping Facilities at Wellcome Sanger Institute and LabCorp (Laboratory Corporation of America) using support from 23andMe. This publication is the work of the authors and they will serve as guarantors for the contents of this paper. A comprehensive list of grants funding is available on the ALSPAC website (http://www.bristol.ac.uk/alspac/external/documents/grant-acknowledgements.pdf). The Brazilian High-Risk Cohort – Happy Mums is supported by the National Institute of Developmental Psychiatry for Children and Adolescents, a science and technology institute funded by the Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq; National Council for Scientific and Technological Development; grant number 573974/2008‐0), Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP; Research Support Foundation of the State of São Paulo; grant number 2008/57896‐8), and Horizon – Health, through the project Understanding, predicting, and treating depression in pregnancy to improve mothers and offspring mental health outcomes (grant number 101057390). The Drakenstein Child Health study is funded by the Bill and Melinda Gates Foundation [grant number OPP 1017641], National Institute on Alcohol Abuse and Alcoholism [grant numbers R21AA023887, R01AA026834-01], US Brain and Behavior Research Foundation [grant number 24467], Collaborative Initiative on Fetal Alcohol Spectrum Disorders (CIFASD) [grant number U24 AA014811], South African Medical Research Council, UK Government’s Newton Fund [grant number NAF002/1001], Wellcome Trust [grant number 203525/Z/16/Z], South Africa’s National Research Foundation [grant numbers 105865, 120432], ABMRF/The Foundation for Alcohol Research, and Harry Crossley Foundation. FFCW data collection is funded by National Institutes of Health (NIH) R01-HD-036916 and a consortium of other funders. Methylation analyses are funded by the NIH: R01 HD076592, R01 MH103761, R01HL149869, and R01 MD011716. Brain imaging (SAND) is funded by the NIH: R01 MH103761 and R01 MH121079. The Growing Up in Singapore Towards healthy Outcomes (GUSTO) study is supported by the National Research Foundation (NRF) under the Open Fund-Large Collaborative Grant (OF-LCG; MOH-000504) administered by the Singapore Ministry of Health’s National Medical Research Council (NMRC) and the Agency for Science, Technology and Research (A*STAR). In RIE2025, GUSTO is supported by funding from the NRF’s Human Health and Potential (HHP) Domain, under the Human Potential Programme. Kids2health is funded by BMBF 01GL1743A. MTwiNS was supported by funds from the National Institute of Mental Health of the National Institutes of Health (NIH): UH3MH114249 & R01 MD081813; the Eunice Kennedy Shriver National Institute of Child Health and Human Development of the NIH: R01HD093334 & R01 HD104297 & R01 HD066040; the Brain and Behavior Foundation: NARSAD young Investigator Grant, The Avielle Foundation, and Institutional funds from the University of Michigan and Michigan State University. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the NIH. The authors would like to thank the staff of the Twin Study of Behavioral and Emotional Development— Child (TBED-C) and Michigan Twins Neurogenetics Study (MTwiNS) studies for their hard work, and the authors thank the families who participated in TBED-C and MTwiNS for sharing their lives with us. The Oregon ADHD-1000 cohort is supported by grants from the National Institute of Mental Health of the National Institutes of Health under award numbers R37MH059105 (JTN), R01MH099064 (JTN), R01MH115357 (JTN), R01MH131685 (JTN, MAM). The CannTeen study was funded by the UK Medical Research Council (MR/P012728/1). The FinnBrain study was supported by the Academy of Finland, the Sigrid Juselius Foundation, the Signe and Ane Gyllenberg Foundation, the Jane and Aatos Erkko, and the Foundation and State Grants for Clinical Research (ERVA). The general design of the Generation R Study is made possible by financial support from the Erasmus MC, Erasmus University Rotterdam, the Netherlands Organization for Health Research and Development and the Ministry of Health, Welfare and Sport. The EWAS data were funded by a grant from the Netherlands Genomics Initiative (NGI)/Netherlands Organisation for Scientific Research (NWO) Netherlands Consortium for Healthy Aging (NCHA; project nr. 050-060-810), by funds from the Genetic Laboratory of the Department of Internal Medicine, Erasmus MC, and by a grant from the National Institute of Child and Human Development (R01HD068437). We gratefully acknowledge the contribution of children and parents, general practitioners, hospitals, midwives and pharmacies in Rotterdam. The NICAP cohort is supported by the National Health and Medical Research Council of Australia (NHMRC #1065895 and #2029361) and a grant from the Waterloo Foundation. PETS was supported by grants from the Australian National Health and Medical Research Council (#437015, #607358, #114333), the Bonnie Babes Foundation (#BBF20704), the Financial Markets Foundation for Children (#032-2007), and the Victorian Government’s Operational Infrastructure Support Program. The UCIrvine Daily Experiences in Pregnancy Study was funded in part by a European Research Area Network (ERA Net) Neuron grant MecTranGen 01EW1407A to CB and National Institutes of Health grants R01 HD-060628 and R01 MD-017387.

Funding

The work of CAMC is supported by the European Union’s HorizonEurope Research and Innovation Programme (FAMILY, grant agreement No 101057529; HappyMums, grant agreement No 101057390) and the European Research Council (TEMPO; grant agreement No 101039672). This research was conducted while CAMC was a Hevolution/AFAR New Investigator Awardee in Aging Biology and Geroscience Research. EW and CAMC are supported by the European Union’s Horizon 2020 Research and Innovation Programme (EarlyCause, grant agreement No 848158). EW, MS and VB received funding from UK Research and Innovation (UKRI) under the UK government’s Horizon Europe / ERC Frontier Research Guarantee [BrainHealth, grant number EP/Y015037/1]. EW also received funding from the National Institute of Mental Health of the National Institutes of Health (award number R01MH113930).

Author information

These authors contributed equally: Esther Walton, Charlotte A. M. Cecil.

Authors and Affiliations

Department of Child and Adolescent Psychiatry and Psychology, Erasmus MC University Medical Center Rotterdam, Rotterdam, the Netherlands
Isabel K. Schuurmans, Rosa H. Mulder, Ryan Muetzel & Charlotte A. M. Cecil
Department of Psychology, University of Bath, Bath, UK
Vilte Baltramonaityte, Marlene Staginnus, Tom P. Freeman & Esther Walton
Research Program in Systems Oncology, Research Programs Unit, Faculty of Medicine, University of Helsinki, Helsinki, Finland
Alexandra Lahtinen
National Institute of Health and Welfare, Department of Public Health and Welfare, Population Health Unit, Helsinki, Finland
Fan Qiuyu & Tiina Paunio
Translational Neuropsychiatry Unit, Department of Clinical Medicine, Aarhus University, Aarhus, Denmark
Leonardo Melo Rothmann & Rodrigo Grassi-Oliveira
FinnBrain Birth Cohort Study, Turku Brain and Mind Center; Department of Clinical Medicine, University of Turku, Turku, Finland
Jetro J. Tuulari, Hasse Karlsson & Linnea Karlsson
Centre for Population Health Research, Turku University Hospital and University of Turku, Turku, Finland
Jetro J. Tuulari & Linnea Karlsson
Michigan State University, East Lansing, MI, USA
S. Alexandra Burt
Development, Health and Disease Research Program, University of California, Irvine, CA, USA
Claudia Buss
Department of Pediatrics, University of California, Irvine, CA, USA
Claudia Buss
Department of Medical Psychology, Charité - Universitätsmedizin Berlin, Berlin, Germany
Claudia Buss
Deakin University, IMPACT – The Institute for Mental and Physical Health and Clinical Translation, School of Medicine, Geelong, VIC, Australia
Jeffrey M. Craig
Murdoch Children’s Research Institute, Royal Children’s Hospital, Parkville, VIC, Australia
Jeffrey M. Craig & Richard Saffery
Department of Paediatrics, University of Melbourne, Royal Children’s Hospital, Parkville, VIC, Australia
Jeffrey M. Craig, Richard Saffery & Marc Seal
Department of Paediatrics and Child Health, Red Cross War Memorial Children’s Hospital, University of Cape Town, Cape Town, South Africa
Kirsten A. Donald
Neuroscience Institute, University of Cape Town, Cape Town, South Africa
Kirsten A. Donald & Nastassja Koen
Institute for Human Development and Potential (IHDP), Agency for Science Technology and Research (A*STAR), Singapore, Singapore
Johan G. Eriksson, Ai Peng Tan, Ai Ling Teh & Dennis Wang
Department of Obstetrics & Gynaecology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
Johan G. Eriksson
The Generation R Study Group, Erasmus MC University Medical Center Rotterdam, Rotterdam, the Netherlands
Janine F. Felix
Department of Pediatrics, Erasmus MC, University Medical Center Rotterdam, Rotterdam, The Netherlands
Janine F. Felix
School of Medicine, Developmental Cognitive Neuroscience Lab, Pontifical Catholic University of Rio Grande Do Sul (PUCRS), Porto Alegre, Brazil
Rodrigo Grassi-Oliveira
Department of Epidemiology, Rollins School of Public Health, Emory University, Atlanta, GA, USA
Anke Huels
Gangarosa Department of Environmental Health, Rollins School of Public Health, Emory University, Atlanta, GA, USA
Anke Huels
Department of Biostatistics and Bioinformatics, Rollins School of Public Health, Emory University, Atlanta, GA, USA
Anke Huels
Department of Psychology, University of Michigan, Ann Arbor, MI, USA
Luke W. Hyde & Christopher S. Monk
Center for Mental Health Innovation, Oregon Health & Science University, Portland, OR, USA
Scott A. Jones, Michael A. Mooney, Joel T. Nigg & Peter Ryabinin
Department of Psychiatry, Oregon Health & Science University, Portland, OR, USA
Scott A. Jones, Michael A. Mooney & Joel T. Nigg
Department of Psychiatry, Turku University Hospital & University of Turku, Turku, Finland
Hasse Karlsson
Department of Clinical Medicine, Pediatrics and Adolescent Medicine, Turku University Hospital and University of Turku, Turku, Finland
Linnea Karlsson
Department of Child Psychiatry, Turku University Hospital, Turku, Finland
Linnea Karlsson
Department of Psychiatry and Mental Health, University of Cape Town, Cape Town, South Africa
Nastassja Koen
SAMRC Unit on Risk and Resilience in Mental Disorders, Cape Town, South Africa
Nastassja Koen & Dan J. Stein
Department of Psychology, Institute of Psychiatry Psychology and Neuroscience, King’s College London, London, UK
Will Lawn
Institute for Social Research, University of Michigan, Ann Arbor, MI, USA
Colter Mitchell & Christopher S. Monk
Knight Cancer Institute, Oregon Health & Science University, Portland, OR, USA
Michael A. Mooney & Peter Ryabinin
Federal University of São Paulo, São Paulo, Brazil
Síntia Iole Nogueira Belangero & Pedro Mario Pan
Department of Molecular Biology, Princeton University, Princeton, NJ, USA
Daniel Notterman
Department of Paediatrics, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
Yi Ying Ong
University of Rochester Medical Center, Departments of Psychiatry, Psychology, Neuroscience, and Obstetrics and Gynecology, Rochester, NY, USA
Tom O’Connor
Yale Child Study Center, Yale School of Medicine, New Haven, CT, USA
Kieran J. O’Donnell
Department of Obstetrics, Gynecology, and Reproductive Sciences, Yale School of Medicine, New Haven, CT, USA
Kieran J. O’Donnell
Sleepwell program and Department of Psychiatry, University of Helsinki and Helsinki University Hospital, Helsinki, Finland
Tiina Paunio
Department of Psychiatry, Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, Brazil
Giovanni A. Salum
Graduate Program in Psychiatry and Behavioral Sciences, Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, Brazil
Giovanni A. Salum
Child Mind Institute, New York, NY, USA
Giovanni A. Salum
Developmental Imaging, Murdoch Children’s Research Institute, Parkville, VIC, Australia
Marc Seal & Tim J. Silk
School of Psychology and Centre for Social and Early Emotional Development (SEED), Deakin University, Geelong, VIC, Australia
Tim J. Silk
Department of Diagnostic Radiology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
Ai Peng Tan
Bioinformatics Institute (BII), Agency for Science Technology and Research (A*STAR), Singapore, Singapore
Ai Ling Teh & Dennis Wang
Department of Paediatrics & Child Health, Red Cross Children’s Hospital and SA-MRC Unit on Child & Adolescent Health, University of Cape Town, Cape Town, South Africa
Heather Zar
Molecular Epidemiology, Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, the Netherlands
Charlotte A. M. Cecil

Authors

Isabel K. Schuurmans
View author publications
Search author on:PubMed Google Scholar
Rosa H. Mulder
View author publications
Search author on:PubMed Google Scholar
Vilte Baltramonaityte
View author publications
Search author on:PubMed Google Scholar
Alexandra Lahtinen
View author publications
Search author on:PubMed Google Scholar
Fan Qiuyu
View author publications
Search author on:PubMed Google Scholar
Leonardo Melo Rothmann
View author publications
Search author on:PubMed Google Scholar
Marlene Staginnus
View author publications
Search author on:PubMed Google Scholar
Jetro J. Tuulari
View author publications
Search author on:PubMed Google Scholar
S. Alexandra Burt
View author publications
Search author on:PubMed Google Scholar
Claudia Buss
View author publications
Search author on:PubMed Google Scholar
Jeffrey M. Craig
View author publications
Search author on:PubMed Google Scholar
Kirsten A. Donald
View author publications
Search author on:PubMed Google Scholar
Johan G. Eriksson
View author publications
Search author on:PubMed Google Scholar
Janine F. Felix
View author publications
Search author on:PubMed Google Scholar
Tom P. Freeman
View author publications
Search author on:PubMed Google Scholar
Rodrigo Grassi-Oliveira
View author publications
Search author on:PubMed Google Scholar
Anke Huels
View author publications
Search author on:PubMed Google Scholar
Luke W. Hyde
View author publications
Search author on:PubMed Google Scholar
Scott A. Jones
View author publications
Search author on:PubMed Google Scholar
Hasse Karlsson
View author publications
Search author on:PubMed Google Scholar
Linnea Karlsson
View author publications
Search author on:PubMed Google Scholar
Nastassja Koen
View author publications
Search author on:PubMed Google Scholar
Will Lawn
View author publications
Search author on:PubMed Google Scholar
Colter Mitchell
View author publications
Search author on:PubMed Google Scholar
Christopher S. Monk
View author publications
Search author on:PubMed Google Scholar
Michael A. Mooney
View author publications
Search author on:PubMed Google Scholar
Ryan Muetzel
View author publications
Search author on:PubMed Google Scholar
Joel T. Nigg
View author publications
Search author on:PubMed Google Scholar
Síntia Iole Nogueira Belangero
View author publications
Search author on:PubMed Google Scholar
Daniel Notterman
View author publications
Search author on:PubMed Google Scholar
Yi Ying Ong
View author publications
Search author on:PubMed Google Scholar
Tom O’Connor
View author publications
Search author on:PubMed Google Scholar
Kieran J. O’Donnell
View author publications
Search author on:PubMed Google Scholar
Pedro Mario Pan
View author publications
Search author on:PubMed Google Scholar
Tiina Paunio
View author publications
Search author on:PubMed Google Scholar
Peter Ryabinin
View author publications
Search author on:PubMed Google Scholar
Richard Saffery
View author publications
Search author on:PubMed Google Scholar
Giovanni A. Salum
View author publications
Search author on:PubMed Google Scholar
Marc Seal
View author publications
Search author on:PubMed Google Scholar
Tim J. Silk
View author publications
Search author on:PubMed Google Scholar
Dan J. Stein
View author publications
Search author on:PubMed Google Scholar
Ai Peng Tan
View author publications
Search author on:PubMed Google Scholar
Ai Ling Teh
View author publications
Search author on:PubMed Google Scholar
Dennis Wang
View author publications
Search author on:PubMed Google Scholar
Heather Zar
View author publications
Search author on:PubMed Google Scholar
Esther Walton
View author publications
Search author on:PubMed Google Scholar
Charlotte A. M. Cecil
View author publications
Search author on:PubMed Google Scholar

Contributions

IKS, CAMC and EW conceptualized the project. Formal analyses was performed by IKS and EW Project administration and visualization was performed by IKS. Further, CAMC and EW supervised the project and acquired funding. All authors, IKS, RHM, VB, AL, FQ, LMR, M St., JJT, SAB, CB, JMC, KAD, JGE, JFF, TPF, RGO, AH, LWH, SAJ, HK, LK, NK, WL, CM, CSM, MAM, RM, JTN, SINB, DN, YYO, TOC, KJO, PMP, TP, PR, RS, GAS, M Se., TJS, DJS, APT, ALT, DW, HZ, EW, and CAMC contributed to data curation, writing of the original draft, as well as reviewing and editing.

Corresponding authors

Correspondence to Esther Walton or Charlotte A. M. Cecil.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary materials (download PDF )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Schuurmans, I.K., Mulder, R.H., Baltramonaityte, V. et al. Consortium profile: the methylation, imaging and NeuroDevelopment (MIND) consortium. Mol Psychiatry 31, 1177–1189 (2026). https://doi.org/10.1038/s41380-025-03203-w

Download citation

Received: 23 June 2024
Revised: 12 May 2025
Accepted: 22 August 2025
Published: 06 September 2025
Version of record: 06 September 2025
Issue date: February 2026
DOI: https://doi.org/10.1038/s41380-025-03203-w

This article is cited by

Bringing methylation profile scores to early life
- Isabel K. Schuurmans
- Janine F. Felix
- Charlotte A. M. Cecil
Nature Reviews Genetics (2026)