Genome-wide analysis of brain age identifies 59 associated loci and unveils relationships with mental and physical health

Jawinski, Philippe; Forstbach, Helena; Kirsten, Holger; Beyer, Frauke; Villringer, Arno; Witte, A. Veronica; Scholz, Markus; Ripke, Stephan; Markett, Sebastian

doi:10.1038/s43587-025-00962-7

Download PDF

Article
Open access
Published: 03 October 2025

Genome-wide analysis of brain age identifies 59 associated loci and unveils relationships with mental and physical health

Nature Aging volume 5, pages 2086–2103 (2025)Cite this article

14k Accesses
4 Citations
228 Altmetric
Metrics details

Subjects

Abstract

Neuroimaging and machine learning are advancing research into the mechanisms of biological aging. In this field, ‘brain age gap’ has emerged as a promising magnetic resonance imaging-based biomarker that quantifies the deviation between an individual’s biological and chronological age of the brain. Here we conducted an in-depth genomic analysis of the brain age gap and its relationships with over 1,000 health traits. Genome-wide analyses in up to 56,348 individuals unveiled a heritability of 23–29% attributable to common genetic variants and highlighted 59 associated loci (39 novel). The leading locus encompasses MAPT, encoding the tau protein central to Alzheimer’s disease. Genetic correlations revealed relationships with mental health, physical health, lifestyle and socioeconomic traits, including depressed mood, diabetes, alcohol intake and income. Mendelian randomization indicated a causal role of high blood pressure and type 2 diabetes in accelerated brain aging. Our study highlights key genes and pathways related to neurogenesis, immune-system-related processes and small GTPase binding, laying the foundation for further mechanistic exploration.

Brain age gap as a predictive biomarker that links aging, lifestyle, and neuropsychiatric health

Article Open access 24 October 2025

Neuroimaging-derived biological brain age and its associations with glial reactivity and synaptic dysfunction cerebrospinal fluid biomarkers

Article Open access 12 April 2025

Diversity-sensitive brain clocks linked to biophysical mechanisms in aging and dementia

Article 18 September 2025

Main

Aging is a complex phenomenon inherent to most organisms^1,2,3. As human lifespans extend and global populations age, age-related disabilities, including dementia, are rising⁴. Thus, understanding the biological mechanisms of aging is an urgent priority for social systems, to sustain longer lives with reduced periods of disability.

The use of neuroimaging methods in conjunction with machine learning has become a promising avenue in biomedical research to capture an individual’s biological age, particularly ‘brain age’^5,6. Brain age is typically assessed by training an age prediction model on in vivo magnetic resonance imaging (MRI) data from a normative lifespan sample. This model is then applied to the MRI data of unseen individuals to predict their age. The discrepancy between an individual’s brain-predicted and chronological age is termed ‘brain age gap’ (BAG) and it is used to infer typical and atypical aging trajectories^6,7.

A positive BAG, interpreted as accelerated aging, has been linked to reduced mental and physical health⁵; including weaker grip strength, higher blood pressure, diabetes, adverse drinking and smoking behavior, poorer cognitive abilities and depression^{8,9,10,11,12,13}. BAG is also enhanced in neurological and psychiatric disorders such as Alzheimer’s disease (AD), schizophrenia and bipolar disorder^14,15. While previous genetic studies suggested that BAG exhibits a substantial heritable component, few genetic variants have been identified^{15,16,17,18,19,20,21,22,23,24}. To refine the genetic architecture of BAG and identify potential therapeutic targets for healthy aging, further research is imperative.

In this article, we present what is to our knowledge the largest genome-wide association study (GWAS) of BAG to date. We begin by discovering new loci in a sample of 32,634 individuals of White British ancestry and replicate our findings in a multi-ancestry sample of up to 23,714 individuals. Next, we conduct meta-analyses across the discovery and replication samples, with an aggregated sample size of up to 56,348 individuals. This represents a 79% increase (~25,000 more) over previous GWAS^23,24. To prioritize genes, we use complementary fine-mapping, annotation and transcriptomic analyses that integrate multiple omics resources. We also calculate polygenic scores (PGS) to estimate the present yield in variance explanation, compute genetic correlations with over 1,000 traits and test causal effects using Mendelian randomization. Finally, we examine the degree of polygenicity of BAG and project future discovery potential. Through these efforts, we unravel new biological mechanisms behind BAG, including pathways related to neurogenesis, immune system processes and binding of small GTPases—evolutionarily conserved proteins that act as cellular timers.

Results

We estimated brain age with a well-established and extensively validated workflow based on CAT12 voxel-based morphometry^5,25. Using T1-weighed MRI scans and supervised machine learning, we estimated brain age through cross-prediction in a discovery sample of 32,634 individuals of White British ancestry from the UK Biobank (UKB) cohort (age range = 45–81 years)²⁶. To capture tissue-specific aging patterns, we conducted separate analyses for gray matter (GM) and white matter (WM) segmentations¹⁸. Brain age estimation relied on an ensemble of complementary machine learning algorithms: the sparse Bayesian relevance vector machine²⁷ (RVM) and extreme gradient boosting (XGBoost) with tree and linear boosters²⁸. Models were stacked within and across tissue classes, yielding three brain-predicted age estimates per individual: for GM, WM and combined GM and WM.

In the discovery sample, we observed accurate predictions for chronological age, with mean absolute errors (MAEs) reaching 3.09 years and correlation coefficients attaining r = 0.86 (Fig. 1a, Table 1, Supplementary Table 1 and Extended Data Fig. 1). Model performances were similar in the multi-ancestry UKB replication samples (n = 21,881; age range = 45-81 years) and the European-ancestry Leipzig Research Centre for Civilization Diseases (LIFE)-Adult replication sample (n = 1,833; age range = 45-80 years)^29,30. Genetic association analyses were performed on BAG, that is, the difference between brain-predicted and chronological age. These BAG estimates—residualized for sex, age, age², scanner site and total intracranial volume—showed high test–retest reliabilities, with intraclass correlation coefficients (ICCs) ranging from 0.89 to 0.92.

**Fig. 1: Phenotypic characteristics and associations of combined BAG.**

Table 1 Prediction accuracies of the stacked age estimation models stratified according to tissue class

Full size table

Phenotypic associations

To validate our BAG estimates and extend previous evidence on their health relevance^31,32, we conducted cross-trait association analyses between BAG and 7,088 non-imaging-derived phenotypes using PHESANT³³. A total of 210 associations reached Bonferroni significance (P < 7.1 × 10⁻⁶) for at least 1 of the 3 BAG traits (Supplementary Table 2 and Extended Data Fig. 2). Figure 1b presents the cross-trait results for combined GM and WM BAG.

Top associations for combined BAG (all P ≤ 1.8 × 10⁻¹²) included pack years of smoking (r = 0.091), diastolic blood pressure (DBP) (r = 0.084), number of symbol digit matches made correctly (that is, a measure of cognitive performance; r = −0.082), diabetes diagnosed by doctor (r = 0.079), amount of alcohol drunk on a typical drinking day (r = 0.076) and overall health rating (r = 0.039; note that higher scores indicate poorer health). These results corroborate earlier BAG associations³¹ and expand known health-related links.

To examine regional contributions to BAG and facilitate comparisons with prior surface-based morphometry studies^15,19,34, we analyzed associations with FreeSurfer-derived³⁵ cortical surface measures and subcortical volumes (Fig. 1c, Supplementary Fig. 3 and Supplementary Table 3). For combined BAG, the strongest associations (all P ≤ 1.1 × 10⁻²⁰⁹) were observed with the volumes of the accumbens (r = −0.31), lateral ventricles (r = 0.30), amygdala (r = −0.25), hippocampus (r = −0.23) and thalamus (r = −0.22), and the cortical thickness of the superior frontal (r = −0.20) and inferior parietal (r = −0.17) cortex. These findings suggest that our models capture patterns of aging distributed throughout the brain, rather than being confined to specific areas.

Following up on Smith et al.³¹, we also performed sex-stratified analyses, which revealed largely similar BAG associations in males and females for non-imaging-derived (Supplementary Figs. 1 and 2 and Supplementary Table 4) and brain structure phenotypes (Supplementary Figs. 4 and 5 and Supplementary Table 5). Some differences emerged, for example, stronger associations between GM BAG and body fat percentage in males, but were generally small in magnitude.

Concordant genomic signals in discovery and replication GWAS

An overview of our genomic analysis workflow is shown in Fig. 2. To examine the reliability and replicability of our findings, we first compared results from a discovery GWAS of 32,634 individuals of White British ancestry (UKB imaging release v.1.7) with a replication GWAS of 22,256 individuals of European ancestry (UKB imaging release v.1.10 and LIFE-Adult).

**Fig. 2: Genomic analysis workflow and gene prioritization strategy.**

The discovery and replication GWAS results were highly consistent, with strong genetic correlations derived from bivariate linkage disequilibrium (LD) score regression (LDSC) (all r_g > 0.996, all P > 2.9 × 10⁻³⁶; Supplementary Table 6)³⁶. We discovered 25 independent genome-wide significant loci across the 3 BAG traits, all showing concordant effect directions (binomial test: P = 3.0 × 10⁻⁰⁸), and 18 reaching one-tailed nominal significance in replication (binomial test: P = 1.3 × 10⁻¹⁸; Supplementary Table 7). These findings closely align with our power analysis, which predicted 19 nominal replications. Among 45 additional suggestive discoveries (P < 1.0 × 10⁻⁶), 36 showed concordant directions (binomial test: P = 3.3 × 10⁻⁵) and 24 reached nominal significance in replication (binomial test: P = 8.0 × 10⁻²⁰). Incorporating 1,458 non-European UKB participants into an extended multi-ancestry GWAS also yielded above-chance consistency (Supplementary Table 8). Together, these findings strongly support locus replicability, reinforcing the robustness of our results. Additional details are provided in Supplementary Figs. 6–23.

Identification of 59 associated loci

To maximize statistical power and improve genetic discovery, we meta-analyzed GWAS data from 54,890 individuals of European ancestry, combining the UKB discovery (n = 32,634), UKB-EUR replication (n = 20,423) and LIFE-Adult (n = 1,833) cohorts. Analyses included 9.6 million single-nucleotide polymorphisms (SNPs) and insertions and deletions (indels) with a minor allele frequency (MAF) greater than 0.01 and imputation quality score (INFO) greater than 0.80. We modeled additive genetic effects with covariates for sex, age, age², total intracranial volume, scanner site and type of genotyping array, and up to 20 genetic principal components. Results for the three BAG traits are shown in Fig. 3 (multi-ancestry results are shown in Extended Data Fig. 3).

**Fig. 3: Genome-wide association meta-analyses of BAG traits.**

LDSC intercepts did not indicate a bias of test statistics due to reasons other than polygenicity, suggesting no confounding inflation caused by population stratification (intercept range: 1.011–1.017; Supplementary Table 9)³⁷. SNP-based heritability estimates ranged from 22.7% (GM BAG) to 29.0% (WM BAG). The genetic correlation between GM and WM BAG was r_g = 0.74 (s.e. = 0.023), indicating both shared and distinct genetic influences (phenotypic correlation: r_P = 0.62, s.e. = 0.003). Combined BAG showed strong genetic correlations with both GM (r_g = 0.91, s.e. = 0.008; cf. r_P = 0.88, s.e. = 0.002) and WM (r_g = 0.951, s.e. = 0.006; cf. r_P = 0.91, s.e. = 0.002) BAG. Strong genetic correlations were observed between sex-stratified results (all r_g ≥ 0.934; Supplementary Table 10), indicating highly concordant architectures across sexes.

Partitioned LDSC³⁸ (Supplementary Table 11 and Extended Data Fig. 4) revealed an enrichment of heritability (false discovery rate (FDR) < 0.05) in regions conserved across mammals (fold enrichment (FE) = 12.4) and primates (FE = 10.5). Additional enrichment was observed in super-enhancer (FE = 2.6), flanking bivalent transcriptional start sites/enhancers (FE = 8.7) and epigenetically modified H3K27ac (FE = 1.8) and H3K9ac (FE = 2.9) regions. Cell-type group analyses (Supplementary Table 12 and Extended Data Fig. 5) revealed enrichment near genes expressed in central nervous system (FE = 3.4), connective or bone (FE = 3.0) and kidney (FE = 3.7) tissue.

To identify independent genome-wide significant associations, we conducted stepwise conditional analyses using genome-wide complex trait analysis–conditional and joint association analysis (GCTA–COJO)³⁹. This resulted in 26, 34 and 39 independent discoveries for GM, WM and combined BAG, respectively (regional plots are shown in Supplementary Figs. 25–42). After cross-trait LD clumping of index variants (r² > 0.1, 10,000-kb window), we identified 59 distinct loci (≥460 kb apart; Table 2 and Supplementary Table 13). Of these, 39 represent novel discoveries not previously reported in BAG GWAS (see Methods for the definition of novelty)^{17,18,19,20,21,22,23,24}.

Table 2 Identification of 59 genomic loci associated with BAG in n = 54,890 individuals

Full size table

We observed most index variants in intronic regions of protein-coding genes. ANNOtate VARiation (ANNOVAR) enrichment tests confirmed that variants in high LD with the lead variants were underrepresented in intergenic regions and overrepresented in 3′-UTR, 5′-UTR, intronic, exonic noncoding RNA and intronic noncoding RNA regions (Supplementary Fig. 24 and Supplementary Table 14).

Fine-mapping and gene prioritization

To identify putative causal genes, we used several fine-mapping, functional annotation and transcriptomic analyses that integrate information from multiple omics resources. For each genome-wide significant locus, we (1) constructed 95% credible sets of variants that likely include the causal variant using SBayesRC^40,41, complemented by susieR and FINEMAP^42,43; (2) physically mapped credible variants to genes using ANNOVAR⁴⁴; (3) predicted the transcript consequences of nonsynonymous exonic variants and scored their deleteriousness using combined annotation dependent depletion (CADD)⁴⁵; (4) mapped variants to genes using expression quantitative trait locus (eQTL) lookup in 49 Genotype-Tissue Expression (GTEx) Project v.8 tissues⁴⁶; (5) conducted summary-data-based Mendelian randomization (SMR)⁴⁷ with the RNA sequence (RNA-seq) data of 2,865 brain cortex samples⁴⁸ to test for mediation through gene expression and splicing; and (6) calculated polygenic priority scores (PoPS)⁴⁹ that incorporate data from single-cell RNA-seq datasets, curated biological pathways and protein–protein interaction networks. We integrated these results to compute a gene priority score and selected the most plausible candidate per locus (Methods). Figure 2 shows an overview of the analysis workflow; full results are found in Supplementary Table 13 (with further details in Supplementary Tables 14–24). The key findings are summarized below.

Across the 59 discovered loci, SBayesRC genome-wide fine-mapping produced, on average, the smallest 95% credible sets (median = 9 variants), compared to region-specific approaches using susieR (median = 34 variants) and FINEMAP (median = 36 variants). susieR and FINEMAP showed strong concordance, with a median overlap of 97.2%. Variants from SBayesRC were included in the susieR and FINEMAP credible sets at median rates of 75.0% and 80.0%, respectively. Most loci contained a single signal, although susieR and FINEMAP identified four loci with potential secondary signals. Estimated regional heritability ranged from 0.03% to 0.67%, reflecting modest variant effects, with few exceptions.

We observed the strongest association at locus 17q21.31 (index variant: rs2260227, P = 9.4 × 10⁻⁸³), which tags a well-known 900-kb inversion polymorphism^50,51. Consistent with the strong LD cluster in the inverted region⁵⁰, the region-specific fine-mapping approaches yielded large credible sets of variants (>1,500 variants). A National Human Genome Research Institute GWAS Catalog search⁵² revealed associations with many locus-associated traits, including educational attainment⁵³, depressed affect⁵⁴, alcohol consumption⁵⁵, sleep duration⁵⁶, lung function⁵⁷, male puberty timing⁵⁸, age at onset of menarche⁵⁷ and AD⁵⁹. The region spans multiple genes, including MAPT, STH, KANSL1 and CRHR1. Several genome-wide significant variants in these genes are GTEx single-tissue and multi-tissue eQTLs (Supplementary Tables 21 and 22). SMR analyses implicated expression and splicing of MAPT and KANSL1 (along with other genes) in mediating variant effects on BAG (Supplementary Tables 19 and 20). We also identified credible exonic variants causing amino acid changes (Supplementary Table 18); notably, rs17651549 (P = 1.5 × 10⁻⁸¹), which had the highest deleteriousness (CADD score = 34), results in an arginine-to-tryptophan substitution at MAPT protein position 370. MAPT encodes the well-known tau protein implicated in AD and other neurodegenerative diseases⁶⁰. Altogether, we prioritized MAPT as the most likely causal gene for brain aging at this locus.

In 6 regions, all fine-mapping methods yielded 95% credible sets with fewer than 10 likely causal variants. One locus, with only a single credible variant (rs12146713, posterior inclusion probability (PIP) > 0.99; P = 4.7 × 10⁻¹²), lies in an intron of NUAK1 at 12q23.3 and tags a multi-tissue eQTL for the long noncoding RNA gene Lnc-NUAK1-1, expressed in the brain cortex and cerebellum. GWAS Catalog matches link this region to cortical thickness⁶¹, surface area⁶² and subcortical volume⁶¹.

A second locus, led by rs1452628 (P = 2.5 × 10⁻¹⁹), with up to 7 credible variants, refers to an intergenic region at 1q41, 41 kb upstream of KCNK2, encoding a potassium channel subunit. KCNK2 is also the prioritized gene supported by the GTEx and PoPS analyses. KCNK2 has been implicated in neuroinflammation, blood–brain barrier dysfunction, and cerebral ischemia^63,64. GWAS Catalog matches include associations with cortical thickness⁶¹, surface area⁶² and sulcal opening⁶⁵.

The third locus refers to the well-known apolipoprotein E (APOE) gene region, led by rs483082 (P = 1.0 × 10⁻¹⁰), with four credible variants. The APOE ε4 allele, defined by rs429358 and rs7412, is the strongest known genetic risk factor for AD. Notably, the exonic variant rs429358 was identified by SBayesRC as the most likely causal variant (PIP = 0.62).

We also discovered several novel loci that offer new insights into the mechanisms of brain aging. Among these loci, one is led by rs2215590 (P = 2.9 × 10⁻⁹) and maps to an intronic region of DPF3, which encodes double PHD fingers 3. Supported by GTEx eQTLs and PoPS, DPF3 was prioritized as the most likely causal gene. DPF3 also represents the prioritized gene supported by the GTEx eQTL lookup and PoPS analyses. GWAS Catalog matches link this locus to pulse pressure⁶⁶ and serum urate levels⁶⁷. DPF3 serves as a subunit of the neuron-specific chromatin remodeling nBAF complex, which is crucial for neurogenesis and neurodevelopment⁶⁸.

Another novel locus, led by rs776970253 (P = 2.4 × 10⁻⁹), implicates an intronic region of TNIK, encoding TRAF2-interacting and NCK-interacting kinases. Notably, TNIK has been recognized for its role in several biological pathways linked to the hallmarks of aging and has been identified as a promising drug target to improve neuronal health⁶⁹.

Altogether, by integrating fine-mapping, functional annotation and transcriptomic data, we prioritized several genes potentially involved in brain aging, thereby offering new, testable hypotheses about its biological underpinnings.

Polygenic score analysis

To evaluate the predictive power of the genetic variants identified in our GWAS, we performed PGS analyses using SBayesRC (Supplementary Table 25). Compared to previous reports (~2% prediction accuracy)²³, our PGS showed substantially improved performance. Using the discovery GWAS data alone (n = 32,634), PGS explained 4.1% (GM BAG) to 7.0% (WM BAG) of the phenotypic variance in the European-ancestry replication sample. Incorporating the meta-analysis results (n = 52,890, excluding 2,000 test individuals) further improved variance explanation to 6.8% (GM BAG) and 10.3% (WM BAG). As expected, prediction accuracy was lower in the non-European UKB replication samples, explaining 0.4–3.2% of the BAG variance in African (AFR) ancestry (n = 337), 3.1–3.9% in Central/South Asian (CSA) ancestry (n = 638) and 4.1–9.1% in East Asian (EAS) ancestry (n = 291) individuals.

Gene-based analysis

To assess the contribution of protein-coding genes, we performed gene-based association analyses using GCTA fastBAT⁷⁰. Gene-based analyses aggregate variant-level signals across genes, reducing the multiple-testing burden. We tested 18,639 genes and identified 528, 886 and 776 genes significantly associated (FDR < 0.05) with the GM, WM and combined GM and WM BAG, respectively. To define independent loci, we applied P value-informed clumping to genes within 3,000 kb, yielding 151 loci, 230 loci and 203 loci per trait, of which 285 were unique (Supplementary Table 26 and Extended Data Fig. 6). The strongest signal was again observed at 17q21.31 covering MAPT. In total, gene-based analyses provide evidence for an extended set of genomic loci involved in human brain aging.

Pathway analysis

To gain deeper insight into the biological mechanisms underlying brain aging, we performed gene set enrichment analyses using GOfuncR⁷¹, testing for the enrichment of Gene Ontology (GO) terms—sets of genes known to serve a common biological function⁷². After correcting for hierarchical dependencies (Methods), we identified 25 significant GO terms (Supplementary Table 27). Analyses highlighted immune-related and pathogen-related processes in brain aging, with significant enrichments for the major histocompatibility (MHC) protein complex (GO:0042611), peptide antigen binding (GO:0042605) and regulation of viral transcription (GO:0046782). Further significant terms, such as positive regulation of neurogenesis (GO:0050769) and regulation of axon extension (GO:0050769) align with the conceptualization of BAG as a neurodevelopmental marker. We also observed enrichment for small GTPase binding (GO:0031267l), a superfamily of evolutionary conserved proteins that act as biological timers of essential cellular processes⁷³, including cell differentiation, proliferation and signal transduction⁷⁴. Several small GTPase proteins are implicated in premature senescence^75,76.

Genetic correlations with other complex traits

To assess a potential shared genetic architecture between BAG and other traits, we applied bivariate LDSC^36,37 to GWAS summary statistics, calculating genetic correlations with 38 commonly studied mental and physical health traits (Supplementary Table 28)^77,78,79, as well as 989 heritable traits from a broader set of GWAS⁸⁰.

Among the 38 selected traits, 17 showed significant correlations (FDR < 0.05) with at least 1 BAG phenotype (Fig. 4a and Supplementary Table 29). GM BAG showed the highest number of associations (17) relative to WM (4) and combined GM and WM (11) BAG. Notable associations for GM BAG included substance use (cigarettes per day: r_g = 0.134), neurological (stroke: r_g = 0.217), psychological (well-being: r_g = 0.100), cognition-related (educational attainment: r_g = −0.083), anthropometric (body mass index (BMI): r_g = 0.075) and cardiovascular and metabolic syndrome traits (DBP: r_g = 0.115).

**Fig. 4: Genetic correlations between BAG and a wide range of complex traits.**

Similarly, for the 989 traits, we found 118, 7 and 48 significant associations (FDR < 0.05) for GM, WM and combined GM and WM BAG, respectively (Figs. 4b,c and Supplementary Table 30). BAG showed significant genetic correlations with parental longevity (mother’s age at death, r_g = −0.214; father’s age at death, r_g = −0.158), socioeconomic status (average total household income before tax, r_g = −0.160), mental health (frequency of tiredness/lethargy in the last 2 weeks, r_g = 0.122), medical conditions (vascular/heart problems diagnosed by doctor: high blood pressure, r_g = 0.122), cognitive function (snap game—mean time to correctly identify matches, r_g = −0.069), respiratory traits (forced expiratory volume in 1 s (FEV1), best measure, r_g = −0.128), blood markers (white blood cell (leukocyte) count, r_g = 0.121) and early life exposures (maternal smoking around birth, r_g = 0.099), among others (Supplementary Table 30). These results suggest genetic overlap between BAG and a broad range of health-related traits.

Mendelian randomization analyses

We used two-sample generalized summary-data-based Mendelian randomization (GSMR)⁸¹ to investigate the potential causal effects of 12 modifiable risk and resilience factors on BAG. These included BMI, waist–hip ratio adjusted for BMI, low-density lipoprotein cholesterol, high-density lipoprotein cholesterol, triglycerides, systolic blood pressure (SBP), DBP, pulse pressure, type 2 diabetes, coronary artery disease, schizophrenia and years of education (Supplementary Table 31). Across all BAG traits, we found significant effects of DBP (combined BAG: β_xz = 0.550, P = 6.9 × 10⁻¹⁰) and SBP (combined BAG: β_xz = 0.382, P = 1.4 × 10⁻⁵), indicating that a one standard deviation increase in blood pressure causally contributes to an ~0.5-year increase in BAG (Supplementary Figs. 43–48 and Extended Data Fig. 7). We also observed significant effects of type 2 diabetes (combined BAG: β_xz = 0.118, P = 5.6 × 10⁻⁴) and coronary artery disease (combined BAG: β_xz = 0.114, P = 0.01)⁸². These findings were largely confirmed by nine alternative MR analyses, except for coronary artery disease (Supplementary Table 32).

Reverse GSMR analyses (Supplementary Table 33) indicated potential negative feedback effects of WM BAG on SBP, DBP and pulse pressure (all β_xz = −0.009, all P ≤ 3.2 × 10⁻³), which is consistent with lower blood pressure in late life and frailty⁸³. Additionally, reverse GSMR analyses hinted at BAG effects on elevated low-density lipoprotein cholesterol and increased risk for coronary artery disease, although these were not supported by other MR methods.

Polygenicity and projection of discoveries to future GWAS

To quantify the BAG degree of polygenicity and estimate discovery potential in future GWAS, we used GENESIS⁸⁴ to estimate the number of underlying susceptibility variants and their effect sizes. We also selected height and neuroticism as benchmark traits because of their distinct degrees of polygenicity^84,85,86,87. The number of susceptibility variants was estimated at 8.7k (s.e. = 1.8k) for GM, 9.8k (s.e. = 1.3k) for WM and 11.0k (s.e. = 1.3k) for combined GM and WM BAG (Supplementary Table 34). For comparison, height showed 12.6k (s.e. = 1.3k) and neuroticism 16.2k (s.e. = 1.2k) susceptibility variants. Effect-size distributions (Fig. 5a) revealed that BAG includes a larger proportion of variants with stronger effects compared to neuroticism, but similar to height. Our projections also suggest rapid growth in the number of BAG discoveries with increasing sample size (Fig. 5b). Approximately 1 million individuals are needed to explain 80% of the SNP-based heritability for BAG via genome-wide significant variants (Fig. 5c), a threshold comparable to height but lower than for neuroticism (~6 million). These findings suggest that while BAG is genetically complex, its relatively lower polygenicity enhances discovery prospects in future studies.

**Fig. 5: Genetic effect-size distribution analysis of BAG.**

Discussion

In this study, we leveraged genomic and neuroimaging methods to establish BAG as a promising biomarker of aging with potential utility for therapeutic discovery. Using machine learning and MRI, we derived highly reliable brain age estimates that capture aging-related structural patterns across the brain. BAG showed robust phenotypic associations with many health traits and was substantially heritable, with 23–29% of variance attributable to common genetic variation. We identified 59 independent genome-wide significant loci, of which 39 are novel. The genomic signals unveiled enriched biological pathways, for example, immune-system-related processes and small GTPase binding, prompting further mechanistic exploration. Through genetic correlations, we demonstrated shared genetic influences between BAG and a broad spectrum of physical and mental health traits. Mendelian randomization supported a causal role of elevated blood pressure and diabetes in accelerated brain aging. Finally, BAG showed a relatively low degree of polygenicity, which increases the likelihood of variant discovery in future studies.

Brain aging is not a uniform process; rather, it encompasses diverse aspects of structural and functional change. Studying distinct aspects of brain aging has been advocated to increase the yield of biologically meaningful insights¹⁸. In this study, we estimated separate BAG scores for GM and WM, alongside a composite measure. While all three showed similar age prediction accuracy and test–retest reliability, GM BAG exhibited stronger phenotypic and genetic associations, potentially reflecting a closer link to health-related outcomes. One explanation is that our volumetric approach to brain age estimation captures biologically meaningful differences more effectively in GM—tied to dendritic complexity and synaptic density—than in WM, which depends on microstructural features less directly represented in bulk volume. As such, GM BAG may better reflect deviations from normative aging across diverse health domains.

While brain aging follows multiple biological trajectories, differential aging rates across systems underscore the need to conceptualize bodily aging as a heterogeneous process⁶. Consistent with this, previous research showed weak correlations between biological age measures derived from telomere length, epigenetic clocks, transcriptomics and immunometabolic markers⁸⁸. Similarly, in our previous work, we found modest correlations between brain age, epigenetic clocks, skin age, clinical biological age composites and subjective age^89,90. Although combining multiple biomarkers may improve chronological age prediction, it may also obscure biologically meaningful deviations at the domain level¹⁸. Recognizing this heterogeneity could help refine aging biomarker panels, and improve specificity and prognostic value for disease risk.

Several GWAS of BAG have been conducted previously^{15,17,18,19,20,21,22,23,24}; however, few have incorporated extensive post-GWAS analyses. Leveraging increased statistical power, we present the most comprehensive genetic analysis of BAG to date, identifying 26, 34 and 39 loci across GM, WM and combined GM and WM BAG—59 in total, of which 39 are new. This surpasses the fewer than ten loci reported per phenotype in earlier studies. Expanding prior fine-mapping efforts²², we constructed credible sets of likely causal variants (median = 9 per locus) and developed PGS explaining 7–10% of BAG variance, up from 2% previously²³. Additionally, we here report results in groups of non-European ancestry, showing reduced yet significant polygenic predictions in AFR, CSA and EAS populations. We also identified new gene sets linked to neurodevelopment, immune function and signal transduction. Extending previous Mendelian randomization efforts^21,22,23,24, we confirmed a causal role of diabetes in BAG and newly identified blood pressure as a risk factor. Moreover, we expanded genetic correlations^{15,21,22,23,24} and identified over 140 BAG associations across health domains. Finally, we introduced genetic effect-size distribution modeling, estimating ~8,700–11,000 contributing variants and providing key projections for discovery potential in future studies.

In addition to these new insights, our study also reinforces previously reported genetic associations. We confirmed the inversion locus at 17q21.31 as the strongest genetic contributor to BAG^{17,18,19,20,21,22,23,24}, with MAPT—a gene encoding the tau protein implicated in AD—prioritized as the most likely causal gene. We also identified the well-known AD risk gene APOE and other apolipoprotein genes. The presence of both tau-related and apolipoprotein-related signals suggests that key hallmarks of AD are reflected in accelerated brain aging, reinforcing BAG’s relevance as a marker for neurodegenerative risk⁹¹.

Our findings suggest that BAG integrates signals from neurodevelopmental and neurodegenerative processes, lifestyle factors, vascular and metabolic health, and immune function. This supports the idea that BAG is not purely a neurodegenerative marker but reflects genetic susceptibility, systemic health and environmental exposures. This supports viewing BAG as composite indicator shaped by aging, disease susceptibility and lifestyle exposures.

By reporting both phenotypic and genetic correlations for BAG, our study allows a direct comparison between the two. We observed a positive relationship between genetic and phenotypic correlations (Extended Data Fig. 8), with correlation coefficients ranging from 0.39 (WM BAG) to 0.51 (GM BAG). Although weaker than previously reported, probably because of the restricted range of correlation strengths in our dataset, the small mean absolute difference between genetic and phenotypic estimates suggests that phenotypic correlations approximate genetic ones and vice versa^92,93.

A future research direction is to explore how model precision affects the genetic and phenotypic associations of BAG⁹⁴. Improved age prediction accuracy may enhance heritability estimates and genetic signals, but it could also obscure meaningful deviations from normative aging. Future studies may investigate this balance to understand the trade-off between predictive performance and biological interpretability.

The current study has several limitations. First, the gene prioritization techniques face challenges in pinpointing causal genes⁴⁹, particularly in loci characterized by high gene density and complex linkage structures. To streamline interpretation, we adopted a winner-takes-all approach, prioritizing a single candidate gene per locus. However, this may overlook other plausible genes with similarly high prioritization scores. Second, BAG was estimated from cross-sectional data, typically interpreted as accelerated or decelerated aging. However, an alternative view posits BAG as stable, early-emerging individual differences that persist into old age⁹⁵. Third, although we used an ensemble of three machine learning models, expanding the number and diversity of models may further enhance prediction by leveraging complementary strengths. Fourth, polygenic overlap was assessed using genetic correlations, which do not capture shared variants with opposing effects. Future studies could apply tools such as MiXeR to quantify genetic overlap by considering mixtures of variant effects⁹⁶. Fifth, polygenicity estimates probably underestimate the true number of contributing variants because models may classify those with very small effects as null. Finally, as our primary GWAS was based on individuals of European ancestry, PGS predicted less accurately in groups of non-European ancestry, limiting generalizability across ancestries. This reflects known transfer challenges due to differences in allele frequencies, LD and environmental context. Expanding genomic data and sample sizes in diverse populations will be essential to improve accuracy and broaden the applicability of brain age genetics.

In conclusion, our study refines the genetic architecture of BAG and its relationships to other traits. We added 39 new genetic loci and nominated plausible candidate genes, including DPF3, which is implicated in neurodevelopment, and TNIK, which is linked to neuronal health and aging-related diseases. This will facilitate further work on the pathway mechanisms of BAG and potential therapy targets.

Methods

Ethical approval

This study used individual-level data from the UKB (www.ukbiobank.ac.uk) and LIFE-Adult (www.uniklinikum-leipzig.de/einrichtungen/life)^26,29,30. Both studies were conducted in accordance with applicable ethical regulations and the principles of the Declaration of Helsinki (2008). The UKB received approval from the North West–Haydock Research Ethics Committee (ref. nos. 11/NW/0382, 16/NW/0274, 21/NW/0157). LIFE-Adult was approved by the Ethics Committee of Leipzig University (ref. nos. 263–2009-14122009, 263/09-ff, 201/17-ek). All participants provided written informed consent. LIFE-Adult participants received a fixed compensation of 20 EUR per visit. UKB participants could claim travel reimbursement.

Statistics and reproducibility

This study presents results from a GWAS alongside a broad set of post-GWAS analyses, including fine-mapping, polygenic scoring, genetic correlation and Mendelian randomization. To enhance transparency and reproducibility, we have provided all analysis scripts, conda environments and software details in a public GitHub repository (https://github.com/pjawinski/ukb_brainage). Analyses were run on Debian GNU/Linux 11 (kernel 5.10.0-23-amd64). Unless stated otherwise, all P values are two-sided. Associations with P < 0.05 were considered nominally significant; Bonferroni correction and FDR control according to the Benjamini–Hochberg procedure were used to adjust for multiple testing. No formal power calculation was used to predetermine sample size. Instead, we included all eligible individuals from the UKB and LIFE-Adult who passed predefined quality control criteria.

Data exclusions were limited to prespecified quality control steps, described in detail elsewhere in the Methods. Analytical assumptions were addressed at each stage of the analysis. In cross-trait association testing, regression models were automatically selected based on the type of variable, with continuous variables normalized to meet distributional assumptions. In the GWAS, standard variant-based and sample-based quality control was applied; LDSC confirmed that the test statistic inflation was driven by polygenicity rather than confounding. This study was observational and nonexperimental; thus, participants were not randomly assigned and no blinding was applied. We report how our target samples were defined, all data exclusions, quality control procedures and all measures used in the study. A full list of UKB variables is provided in the UKB data dictionary (https://biobank.ndph.ox.ac.uk/showcase/) and LIFE-Adult data portal (https://ldp.life.uni-leipzig.de/).

Sample characteristics

Participants were drawn from the UKB under application no. 423032. A detailed description of the UKB study design and quality control methods has been published previously²⁶. For our discovery sample, participants were drawn from the UKB January 2020 imaging release (v.1.7). These data contained 40,681 participants with structural T1-weighted MRI scans (UKB data-field 20252). Scans in folders labeled ‘unusable’ were excluded, leaving 39,679 participants. Voxel-based morphometry preprocessing was successfully completed for 39,677 MRI scans (see the ‘MRI preprocessing’ section of the Methods). Analyses were restricted to participants whose self-reported sex matched the genetic sex (data-fields 31 and 2200), without sex chromosome aneuploidy (data-field 22019) and who were no outliers in heterozygosity and missingness (data-field 22027). We only included unrelated participants as suggested by pairwise kinship coefficients below 0.0442 (precalculated coefficients retrieved using ‘ukbgene rel’). We included participants of White British ancestry (data-field 22006), yielding a final discovery sample of 32,634 participants (17,084 female, age range = 45.2–81.9 years, mean age = 64.3 years).

For replication, we selected all remaining individuals without White British ancestry from the UKB January 2020 release (n = 4,870). Applying the same inclusion criteria, we added European and non-European UKB participants with imaging data released until May 2024 (v.1.10), yielding 25,668 individuals. None of them were related to the discovery participants. We included individuals with valid ancestry assignment from the Pan-ancestry return (no. 2442; https://pan.ukbb.broadinstitute.org/). This resulted in 337 African, 94 Admixed American, 638 Central/South Asian, 291 East Asian, 20,423 European and 98 Middle Eastern ancestry participants. In total, we included 21,881 UKB participants for replication (11,451 female, age range = 45.5–81.9 years, mean age = 67.1 years). From the LIFE-Adult study^29,30, we included another 1,833 unrelated participants of European ancestry (888 female, age range = 45.2–80.3 years, mean age = 65.3 years) with available T1-weighted MRI and genotype data, selected to match the UKB age range⁹⁷. Altogether, the final replication sample included 23,714 participants (12,339 female, age range = 45.2–81.9 years, mean age = 67.0 years) from 7 subsamples.

MRI data acquisition

The UKB imaging acquisition protocol and processing pipeline have been detailed previously (http://biobank.ctsu.ox.ac.uk/crystal/refer.cgi?id=1977). Brain MRI data were acquired at four UKB imaging centers (Cheadle, Newcastle, Reading and Bristol) on Siemens Skyra 3T MRI scanners (Siemens Healthcare) running the VD13A SP4 software, with a standard 32-channel radiofrequency head coil. We used T1-weighted structural MRI scans (UKB data-field 20252) acquired using a 3D magnetization-prepared rapid gradient-echo (MPRAGE) sequence in the sagittal plane, with a voxel size of 1 × 1 × 1 mm, 208 × 256 × 256 acquisition matrix, 2,000-ms repetition time (TR), 2.01-ms echo time (TE), 880-ms inversion time (TI), 6.1-ms echo spacing, 8 ° flip angle, a bandwidth of 240 Hz per pixel, an in-plane acceleration factor of R = 2 and duration of 4 min 54 s.

In LIFE-Adult, brain imaging was performed on a 3T Verio MRI scanner (Siemens Healthcare) with a standard 32-channel head coil. T1-weighted images were obtained using a 3D MPRAGE sequence with a voxel size of 1 × 1 × 1 mm, 256 × 240 × 176 acquisition matrix, TR = 2,300 ms, TE = 2.98 ms, TI = 900 ms and 9 ° flip angle.

MRI preprocessing

T1-weighted MRI scans in NIfTI format were preprocessed using the voxel-based morphometry pipeline of CAT12 (r1364, http://dbm.neuro.uni-jena.de) for SPM12 (r7487) in MATLAB R2021a (MathWorks). CAT12 preprocessing included affine and DARTEL registration to a reference brain, segmentation into GM, WM and cerebrospinal fluid, bias correction for intensity inhomogeneity and modulation to account for volume changes because of spatial registration. Images were then smoothed using an 8 × 8 × 8-mm full-width-at-half-maximum Gaussian kernel and resampled to a voxel size of 8 mm³. Only scans with a CAT12 overall image quality rating of less than 3.0 were retained, excluding 119 (~0.3%) of 39,677 scans from the UKB imaging release v.1.7, and 101 (0.4%) of 23,000 additional scans from release v.1.10.

Feature set for machine learning

Machine learning features were derived from CAT12-preprocessed GM and WM segmentations. Each smoothed, resampled brain image included 16,128 voxels. Voxels without interindividual variation were excluded, yielding 5,416 GM and 5,123 WM voxels. Because of spatial correlation across voxels, we applied principal component analysis (PCA) in MATLAB to reduce dimensionality. The first 500 principal components—explaining ~90% of the total variance—were selected as features.

Machine learning algorithms

We implemented three complementary algorithms to model age from the brain imaging data: the sparse Bayesian RVM using the MATLAB toolbox SparseBayes v.2 with the wrapper and kernel in refs. ^27,98, and extreme gradient boosting using XGBoost v.0.82.1 in R²⁸, using both decision tree (gbtree) and linear (gblinear) boosters. These algorithms were chosen for their demonstrated efficacy in previous brain age studies^5,15,99,100. XGBoost was configured with a learning rate of η = 0.02, 5,000 training iterations, early stopping after 50 iterations without improvement and maximum tree depth of 3. Default settings were used for all other training parameters. To exploit their complementary strengths in handling high-dimensional data, modeling linear and nonlinear relationships, and regularization, we combined all three (RVM, XGBoost tree and XGBoost linear) in an ensemble.

Age estimation models and BAG calculation

Age estimation models were trained using PCA-derived brain imaging features to predict chronological age. Training and application were performed in the discovery sample using tenfold cross-prediction with 100 repeats. This cross-prediction approach was chosen to maximize precision and avoid bias from external datasets with differing MRI protocols, similar to previous studies^15,20,21,22. The discovery sample was split into ten equally sized subsets. In each iteration, nine subsets served for model training and one for testing. PCA was performed on the training data and the transformation parameters were applied to the test set. This procedure was cycled through all ten folds, so each subset served once as the test set. The entire tenfold cross-prediction procedure was repeated 100 times, generating 100 predictions for each individual. This process was run for each tissue type (GM and WM) and model type (RVM, XGBoost tree and XGBoost linear), yielding 600 brain-predicted age estimates per individual (2 tissues × 3 models × 100 repeats). A nested tenfold cross-prediction was used to stack model-type predictions into tissue-specific ensemble estimates for GM and WM. To derive estimates for combined GM and WM, we stacked tissue-specific predictions rather than training new models on combined inputs. This yielded 100 age estimates per tissue type (GM, WM, combined), which were averaged to obtain 1 final brain-predicted age estimate per tissue type. Model performance was evaluated using the product-moment correlation coefficient (r), the coefficient of determination (R²) and MAE¹⁰⁰.

In the replication samples, age predictions were generated using all tenfold discovery models and compared to predictions from models trained on the full discovery sample. Results were highly concordant for all three tissue types (r > 0.997; Supplementary Fig. 15). For improved practicability, subsequent replication analyses used models trained on the full discovery sample.

BAG was calculated as the difference between predicted brain age and chronological age as:

$${\rm{BAG}}={\hat{{\rm{A}}}}_{{\rm{brain}}}-{{\rm{A}}}_{{\rm{chron}}}$$

where BAG reflects the brain age gap estimate. Â_brain is the predicted (modeled) age based on an individual’s brain imaging data and A_chron is the actual chronological age of the individual.

Because of regression dilution, BAG is typically confounded by age, with younger individuals showing higher and older individuals lower BAG values³¹. To correct this bias, we included both age and age², alongside additional covariates (sex, scanner site, total intracranial volume, genotyping array, genetic principal components), in all association analyses^31,32,100.

Cross-trait association analysis

We performed cross-trait association analyses using PHESANT v.1.1 (ref. ³³), an automated pipeline for phenome-wide association analyses in the UKB. Each BAG phenotype was tested against 7,088 nonimaging UKB variables. Covariates included sex (field 31), age (derived from fields 34, 52 and 53), age², scanner site (field 54) and total intracranial volume (from CAT12 segmentation). PHESANT selected regression models (linear, logistic, ordinal logistic or multinomial logistic) based on the type of variable. Continuous variables were inverse-rank-normalized before linear regression. To obtain standardized effect sizes, we calculated product-moment correlations (r) via corresponding z-statistics: $r={\mathrm{sign}}\left(\beta \right)\sqrt{{z}^{2}/({z}^{2}+(N-k-2))}$. For visualization, variables were grouped into categories based on the UKB data dictionary path. We also performed sex-stratified analyses and tested sex differences by comparing PHESANT beta coefficients in males (β_m) and females (β_f): $z=({{{\beta }}}_{{\rm{m}}}-{{{\beta }}}_{{\rm{f}}})/\sqrt{({\rm{s.}}{{\rm{e.}}}_{{\rm{m}}}^{2}+{\rm{s.}}{{\rm{e.}}}_{{\rm{f}}}^{2})}$. The resulting z-values were converted into P values using standard normal probabilities.

FreeSurfer associations

To examine associations between BAG and individual brain regions, we analyzed brain measures from the FreeSurfer aparc and aseg output files (UKB data-field 20263)³⁵, including surface area, cortical thickness and volume from 34 bilateral cortical and 8 bilateral subcortical regions (220 measures in total). We calculated partial product-moment correlations between BAG and brain measures, adjusting for sex, age, age², scanner site and total intracranial volume. Visualizations were created using the ENIGMA toolbox v.2.0.3 for MATLAB¹⁰¹. We also performed sex-stratified analyses and tested sex differences using Fisher’s r-to-z transformation with the cocor R package¹⁰². Associations between brain regions and chronological age are reported in Supplementary Table 3.

UKB genotyping and imputation

We retrieved genotype data (called: BED; imputed: BGEN v.3) from the UKB. Genotype collection, processing and quality control have been described previously^26,103. Genotyping was performed on DNA from EDTA blood using 2 Affymetrix arrays with 95% marker overlap: the UKB BiLEVE Axiom Array (807,411 markers used in 49,950 participants) and the UKB Axiom Array (825,927 markers used in 438,427 participants). Marker-based quality control included a call rate greater than 0.90, tests for batch, plate, array and sex effects, and Hardy–Weinberg equilibrium (P < 1.0 × 10⁻¹²). Sample-based quality control excluded individuals with a missingness greater than 0.05, high heterozygosity, sex discordance or sex chromosome aneuploidy. Relatedness was inferred using KING¹⁰⁴. White British ancestry (data-field 22006) was defined via self-report and genetic principal components. Genotypes were phased using SHAPEIT3 and imputed using IMPUTE4 (https://jmarchini.org/software/) with the Haplotype Reference Consortium, UK10K Project and 1000 Genomes Project Phase 3 serving as reference. Imputation yielded ~97 M markers. We selected biallelic SNPs and indels with MAF > 0.01 and INFO > 0.80. Biallelic variants were defined as those without duplicate coordinates or duplicate identifiers. This resulted in 9,669,330 variants for the discovery GWAS. In the replication samples, the number of variants passing quality control ranged between 8,345,339 (EAS ancestry) and 15,371,587 (AFR ancestry).

LIFE-Adult genotyping and imputation

Genotype collection, processing and quality control in LIFE-Adult have been described previously⁹⁷. DNA from peripheral blood leukocytes was genotyped on the Axiom Genome-Wide CEU 1 Array (Applied Biosystems) (587,352 markers). Marker-based quality control removed variants with call rate lower than 0.97, Hardy–Weinberg equilibrium P < 1.0 × 10⁻⁶ or plate effects P < 1.0 × 10⁻⁷. Sample quality control excluded individuals with Dish QC < 0.82, missingness > 0.03, sex discordance or cryptic relatedness. Genotypes were phased using SHAPEIT and imputed with IMPUTE2 using the 1000 Genomes Project Phase 3 as reference. This yielded 85,063,807 markers in 7,776 individuals. Quality control after imputation (MAF > 0.01, INFO > 0.8) left 9,472,504 biallelic SNPs or indels that also passed UKB quality control for inclusion in the meta-analysis.

Control for population structure

In the discovery sample, we calculated 20 genetic principal components using the randomized PCA algorithm (--pca 20 approx) implemented in PLINK v.2.00a2LM¹⁰⁵, based on the same variants used by the UKB group (resource 1955; 146,988 markers passing our own quality checks). For the UKB replication samples, we used principal components from the Pan-ancestry UKB project (return 2442), using 20 components for the larger UKB European-ancestry sample and 4 for all other groups.

Heritability and partitioned heritability

Estimates of SNP-based heritability (h²_SNP) were derived by applying LDSC^36,37 to our GWAS summary statistics, with precalculated LD scores and regression weights from the 1000 Genomes Project Phase 3. Analyses were limited to HapMap3 variants with MAF > 0.01, excluding the MHC region. Partitioned heritability was assessed using stratified LDSC³⁸ with baseline-LD model v.2.2. We tested the 33 main annotations reported in ref. ¹⁰⁶, considering annotations with an FDR < 0.05 as significant.

Genome-wide association analysis

GWAS analyses were performed in PLINK v.2.00a2LM¹⁰⁵ using allelic dosage data, including autosomal (chromosomes 1 and 22), gonosomal (chromosomes X and Y) and pseudoautosomal (chromosomes XY) variants. Dosage scales were 0–2 for diploid regions (chromosomes 1–22, chromosomes XY), 0–1 for haploid chromosome Y and 0–2 for chromosome X. We modeled additive genetic effects and used sex, age, age², total intracranial volume, scanner site, type of genotyping array and the first 20 genetic principal components as covariates (4 components for the LIFE-Adult and non-European-ancestry samples).

Genome-wide association meta-analysis

The European-ancestry GWAS results were meta-analyzed using fixed-effects inverse-variance-weighted models in METAL (v.2020-05-05)¹⁰⁷. Variants with a sample size of less than 67% of the 90th percentile (adapted from LDSC)³⁷ or heterogeneity P < 1.0 × 10⁻⁶ were excluded. The final GWAS meta-analysis in individuals of European ancestry included 9,628,877 variants for GM and WM BAG, and 9,628,868 variants for the combined BAG, analyzed in up to 54,890 individuals. Multi-ancestry meta-analyses were performed with MR-MEGA v.0.2 (ref. ¹⁰⁸), including the White British discovery sample, six UKB replication samples (European, African, Admixed American, Central/South Asian, East Asian, Middle Eastern ancestry) and the European-ancestry LIFE-Adult cohort. Ancestry effects were modeled using three axes of genetic variation derived from allele frequency differences. For comparison, fixed-effects and random-effects meta-analyses were also conducted in GWAMA v.2.2.2 (ref. ¹⁰⁹). The multi-ancestry GWAS results included 8,618,923 variants in up to 56,348 individuals.

Identification of independent discoveries

We identified independent association signals using stepwise conditional analyses in GCTA-COJO³⁹. A 10,000-kb window size and collinearity cutoff of 0.9 were applied. Multiple signals within a locus were only considered independent if the P value of the subsidiary signal did not increase by more than two orders of magnitude relative to its unadjusted value. Variants reaching P < 5.0 × 10⁻⁸ in the conditional analysis were considered genome-wide significant; those with P < 1.0 × 10⁻⁶ were deemed suggestive. We refer to lead variants from these signals as index variants. To identify nonredundant signals across the three BAG GWAS, all index variants were LD-clumped (r² < 0.1, window size = 10,000 kb) using PLINK v.1.90b6.8.¹⁰⁵.

Definition of variant replication and power calculations

From the discovery GWAS, we selected index variants from genome-wide significant loci (conditional P < 5.0 × 10⁻⁸) and 45 additional suggestive loci (conditional P values between 1.0 × 10⁻⁶ and 5.0 × 10⁻⁸) for replication. Consistency between discovery and replication was tested using sign tests (binomial), based on the agreement in effect direction. Variants with a replication P < 0.05 (one-tailed nominal significance) were considered replicated. To estimate the expected replication yield, we performed power calculations based on standardized discovery betas, MAF and replication sample size¹¹⁰. Beta coefficients were corrected for winner’s curse¹¹¹. Expected replications were computed as the sum of individual variant-level power estimates.

Novelty of the discovered loci

To assess novelty, we compared our findings against nine prior BAG GWAS reporting genome-wide significant loci^{15,17,18,19,20,21,22,23,24}. Using PLINK v.1.90b6, we clumped variants based on LD (R² > 0.1, window size = 10,000 kb)¹⁰⁵ and defined loci as novel if they did not cluster with previously reported variants. Parameter choices were guided by GCTA-COJO³⁹ and Psychiatric Genomics Consortium studies^79,112. Of the 59 loci identified, 39 were classified as novel, a result consistent across clumping thresholds (R² of 0.10 or 0.05) and window sizes (10,000 kb or 3,000 kb).

ANNOVAR enrichment test

We used the ANNOVAR (v.2017-07-17)⁴⁴ enrichment test implemented in FUMA v.1.6.0 (https://fuma.ctglab.nl/)¹¹³ to evaluate whether genome-wide significant regions were enriched for specific functional annotations. All candidate variants in LD (R² > 0.6) with independent significant autosomal variants (P < 5.0 × 10⁻⁸) were included. Candidate variants were defined as those with P < 0.05 and R² > 0.60 with an independent significant variant. UKB release 2 served as the LD reference panel. If a variant had multiple annotations, each was counted separately. Enrichment was computed as the proportion of candidate variants with a given annotation relative to the proportion of variants with that annotation among all variants in the reference panel. Significance was tested using a two-tailed Fisher’s exact test.

Credible sets of variants

We used SBayesRC, a Bayesian mixture model implemented in GCTB v.2.5.2 (refs. ^40,41), to construct 95% credible sets of variants per locus, capturing the cumulative posterior probability of including a causal variant. Unlike region-specific fine-mapping approaches, such as susieR⁴² and FINEMAP⁴³, SBayesRC jointly models multiple genomic regions alongside functional annotations. We used SBayesRC with the eigendecomposition data of LD matrices from our discovery dataset of 32,634 individuals (~9.7 M imputed SNPs), and functional annotations from the stratified LDSC baseline-LF UKB model (v.2.2)¹¹⁴. We used default settings with five mixture components (scaling factors of 0, 0.001, 0.01, 0.1 and 1%). Credible sets were assigned to a discovered locus if they contained at least 1 genome-wide significant credible variant in strong LD (R² > 0.8) within 3,000 kb from the index variant. We report sets with PIP > 0.95 and posterior enrichment probability > 0.50. For comparison, we also applied susieR v.0.12.35 (ref. ⁴²) and FINEMAP v.1.4.2 (ref. ⁴³). For each locus, a 10,000-kb window was used to identify the outermost variants in LD (R² > 0.1), defining region boundaries. LD matrices were computed using LDstore v.2.0 (ref. ¹¹⁵) in 53,074 individuals of European ancestry from the combined discovery and UKB replication sample. For FINEMAP, we allowed up to k = 10 causal variants per region, reporting 95% credible sets for the most probable k model. For susieR, we allowed up to L = 10 causal signals per region, reporting 95% credible sets with minimum purity greater than 0.5.

Functional annotation of variants

Variants were annotated using ANNOVAR⁴⁴, which assigns functional categories based on physical position relative to genes. RefSeq gene annotations (hg19) were retrieved from the UCSC Genome Browser (https://genome.ucsc.edu/)¹¹⁶. The nearest gene was identified using ANNOVAR’s default prioritization of variant function and genomic distance. The transcript consequences of nonsynonymous exonic variants were predicted; deleteriousness scores from CADD were obtained from dbnsfp35a (hg19)^45,117.

Gene nomination through functional annotation of credible variants

Credible variants were annotated using ANNOVAR⁴⁴, and variant posterior probabilities were aggregated per gene implicated. Genes were then ranked according to their total variant posterior probabilities and nominated for gene prioritization. Additionally, genes implicated by nonsynonymous exonic variants were ranked based on the highest CADD Phred-scaled score among those variants.

Gene nomination through SMR

We applied summary-data-based Mendelian randomization implemented in SMR v.1.03 (refs. ^47,48) to test whether variant effects were potentially mediated by gene expression or splicing. SMR integrates GWAS summary statistics with omics data to prioritize gene targets and regulatory elements. It adopts the Mendelian randomization strategy by using a single genetic instrument (z) to test for pleiotropic association between gene regulation (exposure, x) and a trait of interest (outcome, y). The effect of gene regulation on a trait (β_xy) is calculated as a two-step least squares estimate and defined as the ratio of the instrument’s effect on the outcome (β_zy) to its effect on the exposure (β_zx), that is, β_xy = β_zy/β_yz. To distinguish pleiotropy from linkage, SMR incorporates the HEterogeneity In Dependent Instruments (HEIDI) test, which leverages multiple instruments in the regulatory region. We used cis-eQTL (gene expression) and cis-sQTL (gene splicing) summary statistics from BrainMeta v.2, derived from RNA-seq data of 2,865 brain cortex samples from 2,443 individuals of European ancestry⁴⁸. Our GWAS variants were mapped to 16,375 eQTL and 58,941 sQTL probes. We retained results with an FDR < 0.05, P_HEIDI > 0.01 and those mapping to genome-wide-significant GWAS loci. Significant SMR hits were assigned to index variants using PLINK clumping (window size = 3,000 kb; R² > 0.80). Genes implicated by eQTL and sQTL SMR were nominated separately and ranked using the SMR P value.

Gene nomination through GTEx eQTL lookup

Index variants and their genome-wide significant neighbors in strong LD (R² > 0.8) were mapped to cis-QTLs from the GTEx v.8 database⁴⁶. Single-tissue QTLs were retrieved from the prefiltered file (GTEx_Analysis_v8_eQTL.tar), covering 49 tissues. Multi-tissue QTLs were obtained from the METASOFT results (GTEx_Analysis_v8.metasoft.txt.gz), retaining variant–gene associations available in 10 or more tissues and with an m value equal to or greater than 0.9 (that is, the posterior probability that the effect exists) in 50% or more of tissues. To ensure robustness, only associations with meta-analytical P < 5.0 × 10⁻⁸ (Han and Eskin’s random-effects (RE2) model) were considered, yielding 4,420,841 multi-tissue QTLs. Variant mapping was done using the GTEx hg19 liftover variant IDs. If multiple variants implicated the same gene within a locus, we reported the variant in strongest LD with the index variant. Ensembl gene IDs were converted to HUGO Gene Nomenclature Committee symbols using biomaRt¹¹⁸. Genes implicated by single-tissue and multi-tissue QTLs were nominated separately for prioritization and ranked according to the number of significant tissue associations.

Gene nomination through polygenic priority scores

We used PoPS v.0.2 (ref. ⁴⁹) to identify likely causal genes within significant GWAS loci. PoPS builds on MAGMA¹¹⁹ gene-level associations and leverages subthreshold polygenic signals, integrating more than 57,000 features from sources such as single-cell RNA-seq datasets, curated biological pathways and protein–protein interaction networks. We used the same PoPS feature map and MAGMA gene annotation file as in the original publication (www.finucanelab.org/data)⁴⁹. MAGMA v.1.10 was applied to our GWAS summary statistics using SNP-wise mean gene analysis, with LD data from 53,057 individuals of European ancestry (combined discovery and UKB replication sample). For each index variant identified through the conditional analyses, up to 3 genes within 500 kb with the highest PoPS scores were nominated for gene prioritization.

Gene prioritization

Genes were prioritized based on seven evidence streams: (1) ANNOVAR functional annotation of credible variants, summing the posterior probabilities per gene; (2) transcript consequences of nonsynonymous exonic variants, ranked according to CADD score; (3) SMR eQTLs ranked according to P value; (4) sQTLs ranked according to P value; (5) GTEx single-tissue eQTLs; and (6) multi-tissue eQTLs, ranked according to the number of significant associations across tissues; and (7) PoPS ranked according to prioritization score. A composite priority score was calculated for each nominated gene as described below.

Let i denote a gene and j denote the index of the nomination category. The priority score (P_i) for gene i combines the cumulative posterior probability (C_i) from variant annotations and the gene’s rank (R_ij) across six additional evidence categories (${{j}}\in \left[1,6\right]$) as

$${{{P}}}_{{{i}}}={{{C}}}_{{{i}}}+\mathop{\sum }\limits_{{{j}}=1}^{6}\frac{2\left({{{n}}}_{{{j}}}+1-{{{R}}}_{{{ij}}}\right)}{{{{n}}}_{{{j}}}\left({{n}}_{{j}}+1\right)}$$

where P_i denotes the priority score for gene i, C_i denotes the cumulative posterior probability of variants mapped to gene i, n_j denotes the number of genes ranked in nomination category j and R_ij denotes the rank of gene i in nomination category j.This formulation assigns greater weight to top-ranked genes and ensures that each category contributes equally (one point per category). The gene with the highest P_i per locus was designated the prioritized gene.

GWAS Catalog search

We queried the National Human Genome Research Institute GWAS Catalog (13 September 2024 release; gwas_catalog_v1.0-associations_e112_r2024-09-13.tsv)⁵² for all index variants identified using conditional analysis and their genome-wide significant neighbors in strong LD (3,000-kb window, R² > 0.8). Neighboring variants were identified through P-value-informed clumping in PLINK v.1.90b6.8 (ref. ¹⁰⁵). Only GWAS Catalog entries reaching genome-wide significance were retained.

Gene-based analysis

We performed gene-based analyses using fastBAT in GCTA v.1.93.1f⁷⁰. Gene coordinates were obtained from the RefSeq GFF3 annotation file (GRCh37.p13; release 105.20201022)¹²⁰. NCBI chromosome names were converted to UCSC format. We selected protein-coding genes located on chromosomes 1–22, X and Y, removing duplicates gene names, by keeping the first entry sorted according to chromosome, symbol and coordinates. This yielded 19,299 genes, of which 18,632 were successfully mapped to GWAS variants. Analyses used linkage data from the combined UKB discovery and replication sample (n = 53,057, European ancestry), applying no flanking window to reduce gene-level dependency. Genes with an FDR < 0.05 were considered significant. To identify independent associations, we applied P-value-informed clumping with a 3,000-kb window size. Distinct associations across the 3 BAG traits were determined with second-level clumping, using each gene’s top P value, again with a 3,000-kb window size.

Pathway analyses

We conducted GO pathway analyses using the R package GOfuncR v.1.14.0, based on the GO.db v.3.14.0 and Homo.sapiens v.1.3.1 annotations^71,121,122. GO provides a curated framework to categorize genes based their molecular function, cellular components where they perform actions and the higher-order biological processes they contribute to. Gene set enrichment analyses were performed on the full fastBAT gene-based results, testing for lower-than-expected P value ranks using the Wilcoxon rank-sum test. By default, GOfuncR calculates family-wise error rates (FWERs) in each of the three GO aspects using random permutations. To reduce false discoveries, we joined these permutation-based results to calculate FWERs across the three GO aspects. We further refined significant results (FWER < 0.05) by applying the elim algorithm to decorrelate overlapping terms and retain the most specific¹²³. For interpretation, we also determined the number of distinct loci contributing to each enriched term, applying 3,000 kb clumping to account for spatial gene clustering.

PGS analysis

To estimate the variance in BAG explained by PGS, we used a conventional clumping and P thresholding (C + P) approach implemented in PRSice-2 v.2.3.3 (ref. ¹²⁴) along with two Bayesian polygenic prediction methods, SBayesR and SBayesRC, implemented in GCTB v.2.5.2 (ref. ⁴⁰). For the C + P approach, we used R² > 0.1, a 500-kb window size and 10 predefined P value thresholds^79,112. Unlike the C + P approach, SBayesR and SBayesRC jointly model all variant effects, with SBayesRC additionally incorporating functional annotations. SBayesR/RC were used with eigendecomposed LD matrices (~7 M variants), and stratified LDSC baseline-LD v.2.2 annotations. Missing variants were imputed. Default settings were used (--gamma 0,0.001,0.01,0.1,1 --pi 0.99,0.005,0.003,0.001,0.001 --chain-length 3,000 --burn-in 1,000). The resulting weights were applied to calculate the PGS in target samples using PLINK (--score)¹⁰⁵.

PGS weights derived from the discovery sample (n = 32,634) were tested in the European-ancestry replication sample (n = 20,423). To estimate PGS performance in the combined discovery and replication sample, we reran the GWAS meta-analysis excluding 2,000 individuals of European ancestry held out as a target set (total training n = 52,890). Transferability was assessed in AFR (n = 337), CSA (n = 638) and EAS (n = 291) UKB subsamples. Associations between PGS and BAG were evaluated using partial product-moment correlations, adjusting for sex, age, age², scanner site, total intracranial volume, genotyping array and 20 genetic principal components (4 for non-European samples).

Genetic correlations

We used bivariate LDSC (v.1.0.1)³⁶ to compute pairwise genetic correlations among BAG traits and between BAG and other complex traits. These included 38 commonly studied traits spanning the mental and physical health domains^77,78,79 (Supplementary Table 28), as well as 989 heritable UKB traits with publicly available GWAS summary statistics⁸⁰. Analyses were restricted to HapMap3 variants, excluding the MHC region. Genetic correlations with an FDR < 0.05 were considered significant.

Mendelian randomization

Potential causal associations were examined using GSMR implemented in GCTA v.1.93.1f⁸¹. GSMR uses multiple genetic variants (here clumped with an R² < 0.001 and a 10,000-kb window) as instruments (z) to test for causality between an exposure (x) and outcome variable (y), using the ratio β_xy = β_zy/β_zx. Designed for two-sample scenarios, GSMR estimates the exposure–outcome effects using GWAS summary statistics from independent samples. Estimates from multiple instruments are integrated using generalized least squares. Instrument heterogeneity is assessed via HEIDI (P < 0.01), removing outliers deviating from the causal model. To facilitate effect-size comparisons, we standardized instrument effects on continuous exposures (β_zx) based on z-statistic, allele frequency and sample size. GSMR has been demonstrated with superior power to inverse-variance-weighted MR and MR Egger regression⁸¹. We used GSMR as the primary method for inferring causality and conducted sensitivity analyses using nine alternative MR approaches: inverse-variance-weighted MR (simple, debiased and penalized); MR Egger regression; weighted median-base; maximum-likelihood; mode-based; MR lasso; and contamination-mixture MR, implemented in MendelianRandomization v.0.10.0 (ref. ¹²⁵). These methods used the same variants as GSMR but without HEIDI-based outlier removal. Twelve risk factors were selected based on the availability of large-scale GWAS, not including UKB individuals^79,81. Both forward and reverse MR were performed to assess any potential bidirectional effects between risk factors and BAG.

Polygenicity

We used GENESIS v.1.0 (commit e4e6894) to infer genetic effect-size distributions and estimate the number of susceptibility variants underlying BAG under a normal-mixture model of variant effects⁸⁴. Analyses included 1.07 million HapMap3 variants with MAF > 0.05, excluding the MHC region, SNPs with a sample size of less than 0.67 × 90th percentile and those with extreme effect sizes (z² > 80). We fitted the GENESIS three-component model, which assumes that 99% of variant effects are null, while the remaining 1% follow a mixture of 2 normal distributions, allowing a subset of susceptibility variants to exhibit larger effects. We chose the three-component model over the simpler two-component model because it provides better fits across diverse traits, is robust to model misspecification and reduces downward bias in polygenicity estimates⁸⁴. Default settings were used for defining tagging SNPs (R² > 0.1 and 1,000-kb window). Neuroticism and height served as benchmark traits for comparison^86,87.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The individual-level data used in this study were obtained from the UKB and LIFE-Adult study. Access to these datasets is restricted to researchers with approved projects. The GWAS summary statistics and polygenic score weights generated from our analyses are available via Zenodo at https://doi.org/10.5281/zenodo.14826943 (ref. ¹²⁶). Genetic correlation analyses involving UKB traits were conducted using the GWAS summary statistics provided in ref. ⁸⁰ (https://doi.org/10.5281/zenodo.7186871). Additional GWAS summary statistics used for genetic correlation and Mendelian randomization analyses are detailed in Supplementary Tables 28 and 31.

Code availability

All scripts, conda environments and a complete list of software and resources used in this study are available via GitHub at https://github.com/pjawinski/ukb_brainage.

References

Kirkwood, T. B. L. Understanding the odd science of aging. Cell 120, 437–447 (2005).
CAS PubMed Google Scholar
da Silva, R., Conde, D. A., Baudisch, A. & Colchero, F. Slow and negligible senescence among testudines challenges evolutionary theories of senescence. Science 376, 1466–1470 (2022).
CAS PubMed Google Scholar
Finch, C. E. & Austad, S. N. History and prospects: symposium on organisms with slow aging. Exp. Gerontol. 36, 593–597 (2001).
CAS PubMed Google Scholar
Vos, T. et al. Years lived with disability (YLDs) for 1160 sequelae of 289 diseases and injuries 1990–2010: a systematic analysis for the Global Burden of Disease Study 2010. Lancet 380, 2163–2196 (2012).
PubMed PubMed Central Google Scholar
Franke, K. & Gaser, C. 10 years of BrainAGE as an neuroimaging biomarker of brain aging: what insights did we gain? Front. Neurol. 10, 789 (2019).
PubMed PubMed Central Google Scholar
Cole, J. H., Marioni, R. E., Harris, S. E. & Deary, I. J. Brain age and other bodily ‘ages’: implications for neuropsychiatry. Mol. Psychiatry 24, 266–281 (2019).
PubMed Google Scholar
Liem, F. et al. Predicting brain-age from multimodal imaging data captures cognitive impairment. Neuroimage 148, 179–188 (2017).
PubMed Google Scholar
Franke, K., Gaser, C., Manor, B. & Novak, V. Advanced BrainAGE in older adults with type 2 diabetes mellitus. Front. Aging Neurosci. 5, 90 (2013).
PubMed PubMed Central Google Scholar
Franke, K., Ristow, M. & Gaser, C. Gender-specific impact of personal health parameters on individual brain aging in cognitively unimpaired elderly subjects. Front. Aging Neurosci. 6, 94 (2014).
PubMed PubMed Central Google Scholar
Steffener, J. et al. Differences between chronological and brain age are related to education and self-reported physical activity. Neurobiol. Aging 40, 138–144 (2016).
PubMed PubMed Central Google Scholar
Richard, G. et al. Assessing distinct patterns of cognitive aging using tissue-specific brain age prediction based on diffusion tensor imaging and brain morphometry. PeerJ 2018, e5908 (2018).
Google Scholar
Cole, J. H. et al. Brain age predicts mortality. Mol. Psychiatry 23, 1385–1392 (2018).
CAS PubMed Google Scholar
Jawinski, P. et al. Linking brain age gap to mental and physical health in the Berlin Aging Study II. Front. Aging Neurosci. 14, 791222 (2022).
PubMed PubMed Central Google Scholar
Shahab, S. et al. Brain structure, cognition, and brain age in schizophrenia, bipolar disorder, and healthy controls. Neuropsychopharmacology 44, 898–906 (2019).
PubMed Google Scholar
Kaufmann, T. et al. Common brain disorders are associated with heritable patterns of apparent aging of the brain. Nat. Neurosci. 22, 1617–1623 (2019).
CAS PubMed PubMed Central Google Scholar
Cole, J. H. et al. Predicting brain age with deep learning from raw imaging data results in a reliable and heritable biomarker. Neuroimage 163, 115–124 (2017).
PubMed Google Scholar
Jonsson, B. A. et al. Brain age prediction using deep learning uncovers associated sequence variants. Nat. Commun. 10, 5409 (2019).
CAS PubMed PubMed Central Google Scholar
Smith, S. M. et al. Brain aging comprises many modes of structural and functional change with distinct genetic and biophysical associations. eLife 9, e52677 (2020).
CAS PubMed PubMed Central Google Scholar
Ning, K., Zhao, L., Matloff, W., Sun, F. & Toga, A. W. Association of relative brain age with tobacco smoking, alcohol consumption, and genetic variants. Sci. Rep. 10, 10 (2020).
CAS PubMed PubMed Central Google Scholar
Ning, K. et al. Improving brain age estimates with deep learning leads to identification of novel genetic factors associated with brain aging. Neurobiol. Aging 105, 199–204 (2021).
CAS PubMed PubMed Central Google Scholar
Kim, J., Lee, J., Nam, K. & Lee, S. Investigation of genetic variants and causal biomarkers associated with brain aging. Sci. Rep. 13, 1526 (2023).
CAS PubMed PubMed Central Google Scholar
Leonardsen, E. H. et al. Genetic architecture of brain age and its causal relations with brain and mental disorders. Mol. Psychiatry 28, 3111–3120 (2023).
PubMed PubMed Central Google Scholar
Wen, J. et al. The genetic architecture of multimodal human brain age. Nat. Commun. 15, 2604 (2024).
CAS PubMed PubMed Central Google Scholar
Yi, F. et al. Genetically supported targets and drug repurposing for brain aging: a systematic study in the UK Biobank. Sci. Adv. 11, eadr3757 (2025).
CAS PubMed PubMed Central Google Scholar
Gaser, C. et al. CAT: a computational anatomy toolbox for the analysis of structural MRI data. Gigascience 13, giae049 (2024).
PubMed PubMed Central Google Scholar
Bycroft, C. et al. The UK Biobank resource with deep phenotyping and genomic data. Nature 562, 203–209 (2018).
CAS PubMed PubMed Central Google Scholar
Tipping, M. E. Sparse Bayesian learning and the relevance vector machine. J. Mach. Learn. Res. 1, 211–244 (2001).
Google Scholar
Chen, T. & Guestrin, C. XGBoost: A scalable tree boosting system. In Proc. 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (eds Krishnapuram, B. et al.) 785–794 (Association for Computing Machinery, 2016).
Loeffler, M. et al. The LIFE-Adult-Study: objectives and design of a population-based cohort study with 10,000 deeply phenotyped adults in Germany. BMC Public Health 15, 691 (2015).
PubMed PubMed Central Google Scholar
Engel, C. et al. Cohort profile: the LIFE-Adult-Study. Int. J. Epidemiol. 52, e66–e79 (2023).
PubMed Google Scholar
Smith, S. M., Vidaurre, D., Alfaro-Almagro, F., Nichols, T. E. & Miller, K. L. Estimation of brain age delta from brain imaging. Neuroimage 200, 528–539 (2019).
PubMed Google Scholar
Cole, J. H. Multimodality neuroimaging brain-age in UK biobank: relationship to biomedical, lifestyle, and cognitive factors. Neurobiol. Aging 92, 34–42 (2020).
PubMed PubMed Central Google Scholar
Millard, L. A. C., Davies, N. M., Gaunt, T. R., Davey Smith, G. & Tilling, K. Software application profile: PHESANT: a tool for performing automated phenome scans in UK Biobank. Int. J. Epidemiol. 47, 29–35 (2018).
PubMed Google Scholar
Han, L. K. M. et al. Brain aging in major depressive disorder: results from the ENIGMA major depressive disorder working group. Mol. Psychiatry 26, 5124–5139 (2021).
CAS PubMed Google Scholar
Dale, A. M., Fischl, B. & Sereno, M. I. Cortical surface-based analysis: I. Segmentation and surface reconstruction. Neuroimage 9, 179–194 (1999).
CAS PubMed Google Scholar
Bulik-Sullivan, B. et al. An atlas of genetic correlations across human diseases and traits. Nat. Genet. 47, 1236–1241 (2015).
CAS PubMed PubMed Central Google Scholar
Bulik-Sullivan, B. et al. LD score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat. Genet. 47, 291–295 (2015).
CAS PubMed PubMed Central Google Scholar
Finucane, H. K. et al. Partitioning heritability by functional annotation using genome-wide association summary statistics. Nat. Genet. 47, 1228–1235 (2015).
CAS PubMed PubMed Central Google Scholar
Yang, J. et al. Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits. Nat. Genet. 44, 369–375 (2012).
CAS PubMed PubMed Central Google Scholar
Zheng, Z. et al. Leveraging functional genomic annotations and genome coverage to improve polygenic prediction of complex traits within and between ancestries. Nat. Genet. 56, 767–777 (2024).
CAS PubMed PubMed Central Google Scholar
Wu, Y. et al. Genome-wide fine-mapping improves identification of causal variants. Preprint at medRxiv https://doi.org/10.1101/2024.07.18.24310667 (2024).
Wang, G., Sarkar, A., Carbonetto, P. & Stephens, M. A simple new approach to variable selection in regression, with application to genetic fine mapping. J. R. Stat. Soc. Series B Stat. Methodol. 82, 1273–1300 (2020).
PubMed PubMed Central Google Scholar
Benner, C. et al. FINEMAP: efficient variable selection using summary data from genome-wide association studies. Bioinformatics 32, 1493–1501 (2016).
CAS PubMed PubMed Central Google Scholar
Wang, K., Li, M. & Hakonarson, H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 38, e164 (2010).
PubMed PubMed Central Google Scholar
Rentzsch, P., Schubach, M., Shendure, J. & Kircher, M. CADD-Splice—improving genome-wide variant effect prediction using deep learning-derived splice scores. Genome Med. 13, 31 (2021).
CAS PubMed PubMed Central Google Scholar
Aguet, F. et al. Genetic effects on gene expression across human tissues. Nature 550, 204–213 (2017).
Google Scholar
Zhu, Z. et al. Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets. Nat. Genet. 48, 481–487 (2016).
CAS PubMed Google Scholar
Qi, T. et al. Genetic control of RNA splicing and its distinct role in complex trait variation. Nat. Genet. 54, 1355–1363 (2022).
CAS PubMed PubMed Central Google Scholar
Weeks, E. M. et al. Leveraging polygenic enrichments of gene features to predict genes underlying complex traits and diseases. Nat. Genet. 55, 1267–1276 (2023).
CAS PubMed PubMed Central Google Scholar
Stefansson, H. et al. A common inversion under selection in Europeans. Nat. Genet. 37, 129–137 (2005).
CAS PubMed Google Scholar
Okbay, A. et al. Genetic variants associated with subjective well-being, depressive symptoms, and neuroticism identified through genome-wide analyses. Nat. Genet. 48, 624–633 (2016).
CAS PubMed PubMed Central Google Scholar
Sollis, E. et al. The NHGRI-EBI GWAS Catalog: knowledgebase and deposition resource. Nucleic Acids Res. 51, D977–D985 (2023).
CAS PubMed Google Scholar
Lee, J. J. et al. Gene discovery and polygenic prediction from a genome-wide association study of educational attainment in 1.1 million individuals. Nat. Genet. 50, 1112–1121 (2018).
CAS PubMed PubMed Central Google Scholar
Nagel, M. et al. Meta-analysis of genome-wide association studies for neuroticism in 449,484 individuals identifies novel genetic loci and pathways. Nat. Genet. 50, 920–927 (2018).
CAS PubMed Google Scholar
Karlsson Linnér, R. et al. Genome-wide association analyses of risk tolerance and risky behaviors in over 1 million individuals identify hundreds of loci and shared genetic influences. Nat. Genet. 51, 245–257 (2019).
PubMed Google Scholar
Jansen, P. R. et al. Genome-wide analysis of insomnia in 1,331,010 individuals identifies new risk loci and functional pathways. Nat. Genet. 51, 394–403 (2019).
CAS PubMed Google Scholar
Kichaev, G. et al. Leveraging polygenic functional enrichment to improve GWAS power. Am. J. Hum. Genet. 104, 65–75 (2019).
CAS PubMed Google Scholar
Hollis, B. et al. Genomic analysis of male puberty timing highlights shared genetic basis with hair colour and lifespan. Nat. Commun. 11, 1536 (2020).
CAS PubMed PubMed Central Google Scholar
Jun, G. et al. A novel Alzheimer disease locus located near the gene encoding tau protein. Mol. Psychiatry 21, 108–117 (2016).
CAS PubMed Google Scholar
Rademakers, R., Cruts, M. & van Broeckhoven, C. The role of tau (MAPT) in frontotemporal dementia and related tauopathies. Hum. Mutat. 24, 277–295 (2004).
CAS PubMed Google Scholar
Shadrin, A. A. et al. Vertex-wise multivariate genome-wide association study identifies 780 unique genetic loci associated with cortical morphology. Neuroimage 244, 118603 (2021).
CAS PubMed Google Scholar
van der Meer, D. et al. Understanding the genetic determinants of the brain with MOSTest. Nat. Commun. 11, 3512 (2020).
CAS PubMed PubMed Central Google Scholar
Bittner, S., Ruck, T., Fernández-Orth, J. & Meuth, S. G. TREK-King the blood–brain-barrier. J. Neuroimmune Pharmacol. 9, 293–301 (2014).
PubMed Google Scholar
Wang, W. et al. Lig4-4 selectively inhibits TREK-1 and plays potent neuroprotective roles in vitro and in rat MCAO model. Neurosci. Lett. 671, 93–98 (2018).
CAS PubMed Google Scholar
Le Guen, Y. et al. eQTL of KCNK2 regionally influences the brain sulcal widening: evidence from 15,597 UK Biobank participants with neuroimaging data. Brain Struct. Funct. 224, 847–857 (2019).
CAS PubMed Google Scholar
Mullins, N. et al. Genome-wide association study of more than 40,000 bipolar disorder cases provides new insights into the underlying biology. Nat. Genet. 53, 817–829 (2021).
CAS PubMed PubMed Central Google Scholar
Cho, C. et al. Large-scale cross-ancestry genome-wide meta-analysis of serum urate. Nat. Commun. 15, 3441 (2024).
CAS PubMed PubMed Central Google Scholar
Cornejo, K. G. et al. Activity-assembled nBAF complex mediates rapid immediate early gene transcription by regulating RNA polymerase II productive elongation. Cell Rep. 43, 114877 (2024).
CAS PubMed PubMed Central Google Scholar
Ewald, C. Y. et al. TNIK’s emerging role in cancer, metabolism, and age-related diseases. Trends Pharmacol Sci. 45, 478–489 (2024).
CAS PubMed Google Scholar
Bakshi, A. et al. Fast set-based association analysis using summary data from GWAS identifies novel gene loci for human complex traits. Sci. Rep. 6, 32894 (2016).
Grote, S. GOfuncR: Gene ontology enrichment using FUNC. R package version 1.14.0 (2021).
Ashburner, M. et al. Gene Ontology: tool for the unification of biology. Nat. Genet. 25, 25–29 (2000).
CAS PubMed PubMed Central Google Scholar
Takai, Y., Sasaki, T. & Matozaki, T. Small GTP-binding proteins. Physiol. Rev. 81, 153–208 (2001).
CAS PubMed Google Scholar
Wennerberg, K., Rossman, K. L. & Der, C. J. The Ras superfamily at a glance. J. Cell Sci. 118, 843–846 (2005).
CAS PubMed Google Scholar
Ejaz, A., Mattesich, M. & Zwerschke, W. Silencing of the small GTPase DIRAS3 induces cellular senescence in human white adipose stromal/progenitor cells. Aging 9, 860–879 (2017).
CAS PubMed PubMed Central Google Scholar
Wang, L., Yang, L., Debidda, M., Witte, D. & Zheng, Y. Cdc42 GTPase-activating protein deficiency promotes genomic instability and premature aging-like phenotypes. Proc. Natl Acad. Sci. USA 104, 1248–1253 (2007).
CAS PubMed PubMed Central Google Scholar
Abdellaoui, A. & Verweij, K. J. H. Dissecting polygenic signals from genome-wide association studies on human behaviour. Nat. Hum. Behav. 5, 686–694 (2021).
PubMed Google Scholar
Abdellaoui, A. et al. Genetic correlates of social stratification in Great Britain. Nat. Hum. Behav. 3, 1332–1342 (2019).
PubMed Google Scholar
Wray, N. R. et al. Genome-wide association analyses identify 44 risk variants and refine the genetic architecture of major depression. Nat. Genet. 50, 668–681 (2018).
CAS PubMed PubMed Central Google Scholar
rkwalters & Palmer. D. Nealelab/UKBB_ldsc: v2.0.0 (Round 2 GWAS update). Zenodo https://doi.org/10.5281/zenodo.7186871 (2022).
Zhu, Z. et al. Causal associations between risk factors and common diseases inferred from GWAS summary data. Nat. Commun. 9, 224 (2018).
PubMed PubMed Central Google Scholar
Evangelou, E. et al. Genetic analysis of over 1 million people identifies 535 new loci associated with blood pressure traits. Nat. Genet. 50, 1412–1425 (2018).
CAS PubMed PubMed Central Google Scholar
Chen, Y. et al. Relationship between changes in late-life blood pressure and the risk of frailty and mortality among older population in China: a cohort study based on CLHLS. Hypertens. Res. 47, 1881–1891 (2024).
PubMed Google Scholar
Zhang, Y., Qi, G., Park, J.-H. & Chatterjee, N. Estimation of complex effect-size distributions using summary-level statistics from genome-wide association studies across 32 complex traits. Nat. Genet. 50, 1318–1326 (2018).
CAS PubMed Google Scholar
Montag, C., Ebstein, R. P., Jawinski, P. & Markett, S. Molecular genetics in psychology and personality neuroscience: on candidate genes, genome wide scans, and new research strategies. Neurosci. Biobehav. Rev. 118, 163–174 (2020).
CAS PubMed Google Scholar
Wood, A. R. et al. Defining the role of common variation in the genomic and biological architecture of adult human height. Nat. Genet. 46, 1173–1186 (2014).
CAS PubMed PubMed Central Google Scholar
Baselmans, B. M. L. et al. Multivariate genome-wide analyses of the well-being spectrum. Nat. Genet. 51, 445–451 (2019).
CAS PubMed Google Scholar
Robinson, O. et al. Associations of four biological age markers with child development: a multi-omic analysis in the European HELIX cohort. eLife 12, e85104 (2023).
CAS PubMed PubMed Central Google Scholar
Drewelies, J. et al. There are multiple clocks that time us: Cross-sectional and longitudinal associations among 14 alternative indicators of age and aging. J. Gerontol. A Biol. Sci. Med. Sci. 80, glae244 (2025).
PubMed Google Scholar
Vetter, V. M. et al. Comprehensive comparison of sixteen markers of biological aging: cross-sectional and longitudinal results from the Berlin Aging Study II (BASE-II). Preprint at medRxiv https://doi.org/10.1101/2025.04.09.25325514 (2025).
Gaser, C., Franke, K., Klöppel, S., Koutsouleris, N. & Sauer, H. BrainAGE in mild cognitive impaired patients: predicting the conversion to Alzheimer’s Disease. PLoS ONE 8, e67346 (2013).
CAS PubMed PubMed Central Google Scholar
Sodini, S. M., Kemper, K. E., Wray, N. R. & Trzaskowski, M. Comparison of genotypic and phenotypic correlations: Cheverud’s conjecture in humans. Genetics 209, 941–948 (2018).
PubMed PubMed Central Google Scholar
Cheverud, J. M. A comparison of genetic and phenotypic correlations. Evolution 42, 958–968 (1988).
PubMed Google Scholar
Nebe, S. et al. Enhancing precision in human neuroscience. eLife 12, e85980 (2023).
CAS PubMed PubMed Central Google Scholar
Vidal-Pineiro, D. et al. Individual variations in ‘brain age’ relate to early-life factors more than to longitudinal brain change. eLife 10, e69995 (2021).
CAS PubMed PubMed Central Google Scholar
Frei, O. et al. Bivariate causal mixture model quantifies polygenic overlap between complex traits beyond genetic correlation. Nat. Commun. 10, 2417 (2019).
PubMed PubMed Central Google Scholar
Jawinski, P. et al. Human brain arousal in the resting state: a genome-wide association study. Mol. Psychiatry 24, 1599–1609 (2019).
CAS PubMed Google Scholar
Qiu, K., Wang, J., Wang, R., Guo, Y. & Zhao, L. Soft sensor development based on kernel dynamic time warping and a relevant vector machine for unequal-length batch processes. Expert Syst. Appl. 182, 115223 (2021).
Google Scholar
Franke, K., Ziegler, G., Klöppel, S. & Gaser, C. Estimating the age of healthy subjects from T1-weighted MRI scans using kernel methods: exploring the influence of various parameters. Neuroimage 50, 883–892 (2010).
PubMed Google Scholar
de Lange, A.-M. G. et al. Mind the gap: performance metric evaluation in brain-age prediction. Hum. Brain Mapp. 43, 3113–3129 (2022).
PubMed PubMed Central Google Scholar
Larivière, S. et al. The ENIGMA Toolbox: multiscale neural contextualization of multisite neuroimaging datasets. Nat. Methods 18, 698–700 (2021).
PubMed PubMed Central Google Scholar
Diedenhofen, B. & Musch, J. cocor: a comprehensive solution for the statistical comparison of correlations. PLoS ONE 10, e0121945 (2015).
PubMed PubMed Central Google Scholar
Welsh, S., Peakman, T., Sheard, S. & Almond, R. Comparison of DNA quantification methodology used in the DNA extraction protocol for the UK Biobank cohort. BMC Genomics 18, 26 (2017).
PubMed PubMed Central Google Scholar
Manichaikul, A. et al. Robust relationship inference in genome-wide association studies. Bioinformatics 26, 2867–2873 (2010).
CAS PubMed PubMed Central Google Scholar
Chang, C. C. et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience https://doi.org/10.1186/s13742-015-0047-8 (2015).
Gazal, S. et al. Functional architecture of low-frequency variants highlights strength of negative selection across coding and non-coding annotations. Nat. Genet. 50, 1600–1607 (2018).
CAS PubMed PubMed Central Google Scholar
Willer, C. J., Li, Y. & Abecasis, G. R. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics 26, 2190–2191 (2010).
CAS PubMed PubMed Central Google Scholar
Mägi, R. et al. Trans-ethnic meta-regression of genome-wide association studies accounting for ancestry increases power for discovery and improves fine-mapping resolution. Hum. Mol. Genet. 26, 3639–3650 (2017).
PubMed PubMed Central Google Scholar
Mägi, R. & Morris, A. P. GWAMA: software for genome-wide association meta-analysis. BMC Bioinformatics 11, 288 (2010).
PubMed PubMed Central Google Scholar
Visscher, P. M. et al. 10 years of GWAS discovery: biology, function, and translation. Am. J. Hum. Genet. 101, 5–22 (2017).
CAS PubMed PubMed Central Google Scholar
Palmer, C. & Pe’er, I. Statistical correction of the Winner’s Curse explains replication variability in quantitative trait genome-wide association studies. PLoS Genet. 13, e1006916 (2017).
PubMed PubMed Central Google Scholar
Ripke, S. et al. Biological insights from 108 schizophrenia-associated genetic loci. Nature 511, 421–427 (2014).
CAS PubMed Central Google Scholar
Watanabe, K., Taskesen, E., van Bochoven, A. & Posthuma, D. Functional mapping and annotation of genetic associations with FUMA. Nat. Commun. 8, 1826 (2017).
PubMed PubMed Central Google Scholar
Weissbrod, O. et al. Functionally informed fine-mapping and polygenic localization of complex trait heritability. Nat. Genet. 52, 1355–1363 (2020).
CAS PubMed PubMed Central Google Scholar
Benner, C. et al. Prospects of fine-mapping trait-associated genomic regions by using summary statistics from genome-wide association studies. Am. J. Hum. Genet. 101, 539–551 (2017).
CAS PubMed PubMed Central Google Scholar
Nassar, L. R. et al. The UCSC Genome Browser database: 2023 update. Nucleic Acids Res. 51, D1188–D1195 (2023).
CAS PubMed Google Scholar
Liu, X., Wu, C., Li, C. & Boerwinkle, E. dbNSFP v3.0: a one-stop database of functional predictions and annotations for human nonsynonymous and splice-site SNVs. Hum. Mutat. 37, 235–241 (2016).
PubMed Google Scholar
Durinck, S., Spellman, P. T., Birney, E. & Huber, W. Mapping identifiers for the integration of genomic datasets with the R/Bioconductor package biomaRt. Nat. Protoc. 4, 1184–1191 (2009).
CAS PubMed PubMed Central Google Scholar
de Leeuw, C. A., Mooij, J. M., Heskes, T. & Posthuma, D. MAGMA: generalized gene-set analysis of GWAS data. PLoS Comput. Biol. 11, e1004219 (2015).
PubMed PubMed Central Google Scholar
O’Leary, N. A. et al. Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation. Nucleic Acids Res. 44, D733–D745 (2016).
PubMed Google Scholar
Carlson, M. GO.db: A set of annotation maps describing the entire Gene Ontology. R package version 3.14.0 (2021).
Bioconductor Core Team. Homo.sapiens: Annotation package for the Homo.sapiens object. R package version 1.3.1 (2015).
Alexa, A., Rahnenführer, J. & Lengauer, T. Improved scoring of functional groups from gene expression data by decorrelating GO graph structure. Bioinformatics 22, 1600–1607 (2006).
CAS PubMed Google Scholar
Choi, S. W. & O’Reilly, P. F. PRSice-2: polygenic risk score software for biobank-scale data. Gigascience 8, giz082 (2019).
PubMed PubMed Central Google Scholar
Patel, A. et al. MendelianRandomization v0.9.0: updates to an R package for performing Mendelian randomization analyses using summarized data. Wellcome Open Res. 8, 449 (2023).
PubMed PubMed Central Google Scholar
Jawinski, P. & Markett, S. GWAS summary statistics for brain age gap. Zenodo https://doi.org/10.5281/zenodo.14826943 (2025).

Download references

Acknowledgements

This research was conducted using the UKB resource under application no. 423032. The UKB was established by the Wellcome Trust medical charity, the Medical Research Council, the Department of Health, the Scottish Government and the Northwest Regional Development Agency. The UKB also received funding from the Welsh Government, the British Heart Foundation, Cancer Research UK and Diabetes UK. This publication was supported by LIFE – Leipzig Research Centre for Civilization Diseases, University of Leipzig. LIFE was funded by means of the European Social Fund and the Free State of Saxony. The funders had no role in study design, data collection and analysis, decision to publish or preparation of the manuscript. We thank the International Consortium for Blood Pressure for providing the GWAS summary statistics for SBP, DBP and pulse pressure. We thank all participants of the UKB and LIFE-Adult cohorts for their essential contributions, which made this research possible.

Funding

Open access funding provided by Humboldt-Universität zu Berlin.

Author information

Authors and Affiliations

Department of Psychology, Humboldt-Universität zu Berlin, Berlin, Germany
Philippe Jawinski, Helena Forstbach & Sebastian Markett
LIFE-Leipzig Research Center for Civilization Diseases, Leipzig University, Leipzig, Germany
Philippe Jawinski, Holger Kirsten & Markus Scholz
Institute for Medical Informatics, Statistics and Epidemiology, Leipzig University, Leipzig, Germany
Holger Kirsten & Markus Scholz
Cognitive Neurology, University of Leipzig Medical Center & Department of Neurology, Max Planck Institute for Human Cognitive and Brain Sciences, Leipzig, Germany
Frauke Beyer, Arno Villringer & A. Veronica Witte
Stanley Center for Psychiatric Research, Broad Institute of the Massachusetts Institute of Technology and Harvard University, Cambridge, MA, USA
Stephan Ripke
Department of Psychiatry and Psychotherapy, Charité-Universitätsmedizin, Berlin, Germany
Stephan Ripke

Authors

Philippe Jawinski
View author publications
Search author on:PubMed Google Scholar
Helena Forstbach
View author publications
Search author on:PubMed Google Scholar
Holger Kirsten
View author publications
Search author on:PubMed Google Scholar
Frauke Beyer
View author publications
Search author on:PubMed Google Scholar
Arno Villringer
View author publications
Search author on:PubMed Google Scholar
A. Veronica Witte
View author publications
Search author on:PubMed Google Scholar
Markus Scholz
View author publications
Search author on:PubMed Google Scholar
Stephan Ripke
View author publications
Search author on:PubMed Google Scholar
Sebastian Markett
View author publications
Search author on:PubMed Google Scholar

Contributions

P.J. and S.M. conceived and designed the study and co-wrote the original manuscript. S.M. supervised the project. P.J. performed the analyses, curated the data and generated the visualizations. H.F. and H.K. supported data curation and the genetic analyses. F.B., A.V., A.V.W., M.S. and S.R. contributed to the study design, the interpretation of the results and the critical revision of the manuscript. S.M., M.S., A.V.W. and A.V. were involved in funding acquisition. All authors reviewed and approved the final version of the manuscript.

Corresponding author

Correspondence to Philippe Jawinski.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Aging thanks Lars Bertram, James Cole and Andrew Schork for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 Accuracy and reliability of brain age estimation models across tissue classes and samples.

(a) grey matter, (b) white matter, and (c) combined grey and white matter age estimation models. Blue dots in the first three plots (from left to right) show brain-predicted age estimates plotted against the chronological age in the UKB discovery sample (n = 32,634; white-British ancestry), UKB replication sample (n = 21,881; multi-ancestry), and LIFE-Adult replication sample (n = 1,833; European ancestry). To facilitate comparisons, results of the UKB discovery sample are also shown as grey dots in the background of the LIFE replication plots. At this stage, brain-predicted age estimates have not yet been bias-corrected for regression dilution, that is, younger participants’ ages are systematically overestimated and vice versa, as indicated by the linear regression line (solid) crossing the identity line (dashed). The fourth plot shows the test-retest reliabilities of brain age gap (i.e., the difference between brain-predicted and chronological age) in a subset of the UKB discovery (grey dots, n = 3,751) and UKB replication sample (blue dots, n = 395). For test-retest comparisons, brain age gap was residualized for effects of age, age², sex, scanner site, and total intracranial volume. MAE: mean absolute error; rho: product-moment correlation coefficient. ICC: intraclass correlation coefficient (C, 1).

Extended Data Fig. 2 Cross-trait associations between brain age gap and other UK Biobank phenotypes.

Each dot represents an association between brain age gap and one of 7,088 non-imaging derived phenotypes in the UKB discovery sample (n = 32,634). P values (-log10 scale) are shown on the y-axis, with phenotypes on the x-axis categorized by their UK Biobank data dictionary path. Analyses were conducted using PHESANT, which applies data-type-specific regression models (linear, logistic, ordered logistic, or multinomial logistic regression). All models included sex, age, age², scanner site, and total intracranial volume as covariates. Horizontal lines indicate the Bonferroni-adjusted (solid) and FDR-adjusted (dashed) level of significance (two-sided). The top associations in each category are annotated. (a) GM BAG, (b) WM BAG, (c) combined BAG.

Extended Data Fig. 3 Results of the multi-ancestry GWAS meta-regression for brain age gap.

Manhattan plots (a-c) and quantile-quantile (QQ) plots (d-f) show the results of the multi-ancestry meta-regression (MR-MEGA) for the three brain age gap traits (up to N = 56,348). Multi-ancestry analyses combine results from the discovery sample (n = 32,634) and up to seven replication samples: UKB African ancestry (n = 337), UKB Admixed American ancestry (n = 94); UKB Central/South Asian ancestry (n = 638), UKB East Asian ancestry (n = 291), UKB European ancestry (n = 20,423), UKB Middle Eastern ancestry (n = 98), and LIFE-Adult (European ancestry; n = 1,833). Manhattan plots show the P values (-log₁₀ scale) of the tested genetic variants on the y-axis and base-pair positions along the chromosomes on the x-axis. P values were derived from two-sided linear regression models using PLINK, followed by meta-regression using MR-MEGA. The solid horizontal line indicates the threshold of genome-wide significance (p = 5.0e-08, two-sided, accounting for multiple testing). Pseudoautosomal variations have been added to chromosome ‘X’. Index variants are highlighted by diamonds and were identified by positional clumping with 3,000 kb window-size (no LD threshold was applied in multi-ancestry analyses). Quantile-quantile plots show the observed P values from the association analysis vs. the expected P values under the null hypothesis of no effect (-log₁₀ scale). For illustrative reasons, the y-axis has been truncated at p = 1.0e-40. a,d grey matter brain age gap (Manhattan and QQ); b,e white matter brain age gap (Manhattan and QQ); c,f combined brain age gap (Manhattan and QQ).

Extended Data Fig. 4 Functional partitioning of heritability for brain age gap.

Enrichment of heritability (h²) across functional genomic categories for (a) grey matter brain age gap, (b) white matter brain age gap, (c) combined brain age gap. Results are based on GWAS meta-analyses in individuals of European ancestry (n = 54,890). Heritability enrichment is defined as the proportion of h² attributable to a given category divided by the proportion of SNPs in that category. Error bars represent jackknife standard errors around the estimates of enrichment. Please note that significance is tested based on the covariance matrix for the tau coefficient estimates (adjusted for all other baseline categories), which may differ from interpretations based solely on jackknife SEs. Functional categories are ordered by the proportion of SNPs they contain (x-axis), with percentages shown at tick marks. Categories with significant enrichment are marked by red diamonds.

Extended Data Fig. 5 Heritability enrichment in genes specifically expressed in selected tissue groups.

SNP-based heritability (h²) enrichment across genomic regions near genes with specific expression in various tissue types, for (a) grey matter brain age gap, (b) white matter brain age gap, (c) combined brain age gap. Results are based on GWAS meta-analyses in individuals of European ancestry (n = 54,890). Heritability enrichment is defined as the proportion of h² attributable to a given category divided by the proportion of SNPs in that category. Error bars represent jackknife standard errors around the estimates of enrichment. Please note that significance is tested based on the covariance matrix for the tau coefficient estimates (adjusted for all other baseline categories), which may differ from interpretations based solely on jackknife SEs. Functional categories are ordered by the proportion of SNPs they contain (x-axis), with percentages shown at tick marks. Categories with significant enrichment are marked by red diamonds.

Extended Data Fig. 6 Gene-based association results for brain age gap traits.

Manhattan plots (a-c) and quantile-quantile (QQ) plots (d-f) showing the results of the gene-based association analyses for the three brain age gap traits (n = 54,890 individuals of European ancestry). Manhattan plots display the P values (-log₁₀ scale) of the tested genes on the y-axis and base-pair positions (gene start coordinates) along the chromosomes on the x-axis. In total, 18,639 protein-coding genes (RefSeq assembly GRCh37.p13, 09-05-2019) have been included. The solid horizontal line reflects the Bonferroni-corrected level of significance. The dashed horizontal line reflects the FDR-corrected level of significance. Diamonds and circles highlight the index gene in each genomic locus (diamonds: Bonferroni-significant index gene; red diamonds: Bonferroni-significant index gene exceeding the y-axis limit; circles: FDR-significant index gene). Genes within 3,000 kb of each other are considered to belong to the same locus and share the same index gene. Quantile-quantile plots show the observed P values from the association analysis vs. the expected P values under the null hypothesis of no effect (-log₁₀ scale). For illustrative reasons, the y-axis has been truncated at p = 1.0e-40. a,d grey matter brain age gap (Manhattan and QQ); b,e white matter brain age gap (Manhattan and QQ); c,f combined brain age gap (Manhattan and QQ).

Extended Data Fig. 7 Mendelian randomization analyses linking modifiable risk factors to brain age gap.

Results of the generalized summary-data-based Mendelian randomization (GSMR) results for 12 risk factors (exposures) and combined brain age gap (outcome). Each plot shows multiple genetic variants serving as instruments to test for causality between the exposure and outcome variable. Under a causal model, variant effects on the outcome (b_zy; y-axis) are expected to be linearly proportional to the variant effects on the exposure variable (b_zx; x-axis). The ratio between b_zy and b_zx provides an estimate of the mediation effect of x on y (b_xy). Variants flagged for potential horizontal pleiotropy were excluded using the HEIDI-outlier method. s.e. standard error of the mediation effect; p_xy P value of the mediation effect.

Extended Data Fig. 8 Relationship between genetic and phenotypic correlations of brain age gap with other complex traits.

This figure compares genetic and phenotypic correlation coefficients between brain age gap and 673 other complex traits. Genetic correlations were estimated using GWAS meta-analysis summary statistics for BAG (n = 54,890, European ancestry), while phenotypic correlations were calculated in the UK Biobank discovery sample (n = 32,634, European ancestry) using PHESANT. Summary statistics for other UK Biobank traits were obtained from Neale et al. (10.5281/zenodo.7186871). Results indicate a positive association between genetic and phenotypic correlation estimates. r: product-moment correlation coefficient, p: P value, MAD: mean absolute difference. (a) grey matter brain age gap, (b) white matter brain age gap, (c) combined brain age gap.

Supplementary information

Supplementary Information

Supplementary Figs. 1–48 and Tables 1–34.

Reporting Summary

Supplementary Tables 1–34

Supplementary Tables 1–34.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Jawinski, P., Forstbach, H., Kirsten, H. et al. Genome-wide analysis of brain age identifies 59 associated loci and unveils relationships with mental and physical health. Nat Aging 5, 2086–2103 (2025). https://doi.org/10.1038/s43587-025-00962-7

Download citation

Received: 30 January 2024
Accepted: 13 August 2025
Published: 03 October 2025
Version of record: 03 October 2025
Issue date: October 2025
DOI: https://doi.org/10.1038/s43587-025-00962-7

This article is cited by

No causal links between estradiol and female’s brain and mental health using Mendelian randomization
- Hannah Oppenheimer
- Dennis van der Meer
- Claudia Barth
Nature Communications (2025)

Subjects

Abstract

Similar content being viewed by others

Main

Results

Phenotypic associations

Concordant genomic signals in discovery and replication GWAS

Identification of 59 associated loci

Fine-mapping and gene prioritization

Polygenic score analysis

Gene-based analysis

Pathway analysis

Genetic correlations with other complex traits

Mendelian randomization analyses

Polygenicity and projection of discoveries to future GWAS

Discussion

Methods

Ethical approval

Statistics and reproducibility

Sample characteristics

MRI data acquisition

MRI preprocessing

Feature set for machine learning

Machine learning algorithms

Age estimation models and BAG calculation

Cross-trait association analysis

FreeSurfer associations

UKB genotyping and imputation

LIFE-Adult genotyping and imputation

Control for population structure

Heritability and partitioned heritability

Genome-wide association analysis

Genome-wide association meta-analysis

Identification of independent discoveries

Definition of variant replication and power calculations

Novelty of the discovered loci

ANNOVAR enrichment test

Credible sets of variants

Functional annotation of variants

Gene nomination through functional annotation of credible variants

Gene nomination through SMR

Gene nomination through GTEx eQTL lookup

Gene nomination through polygenic priority scores

Gene prioritization

GWAS Catalog search

Gene-based analysis

Pathway analyses

PGS analysis

Genetic correlations

Mendelian randomization

Polygenicity

Reporting summary

Data availability

Code availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Extended data

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links