Machine learning reveals distinct neuroanatomical signatures of cardiovascular and metabolic diseases in cognitively unimpaired individuals

Govindarajan, Sindhuja Tirumalai; Mamourian, Elizabeth; Erus, Guray; Abdulkadir, Ahmed; Melhem, Randa; Doshi, Jimit; Pomponio, Raymond; Tosun, Duygu; Bilgel, Murat; An, Yang; Sotiras, Aristeidis; Marcus, Daniel S.; LaMontagne, Pamela; Benzinger, Tammie L. S.; Espeland, Mark A.; Masters, Colin L.; Maruff, Paul; Launer, Lenore J.; Fripp, Jurgen; Johnson, Sterling C.; Morris, John C.; Albert, Marilyn S.; Bryan, R. Nick; Resnick, Susan M.; Habes, Mohamad; Shou, Haochang; Wolk, David A.; Nasrallah, Ilya M.; Davatzikos, Christos

doi:10.1038/s41467-025-57867-7

Published: 19 March 2025

Nature Communications volume 16, Article number: 2724 (2025)

Abstract

Comorbid cardiovascular and metabolic risk factors (CVM) differentially impact brain structure and increase dementia risk, but their specific magnetic resonance imaging (MRI) signatures remain poorly characterized. To address this, we developed and validated machine learning models to quantify the distinct spatial patterns of atrophy and white matter hyperintensities related to hypertension, hyperlipidemia, smoking, obesity, and type 2 diabetes mellitus at the patient level. Using harmonized MRI data from 37,096 participants (45–85 years) in a large multinational dataset of 10 cohort studies, we generated five in silico severity markers that: i) outperformed conventional structural MRI markers with a ten-fold increase in effect sizes, ii) captured subtle patterns at sub-clinical CVM stages, iii) were most sensitive in mid-life (45–64 years), iv) were associated with brain beta-amyloid status, and v) showed stronger associations with cognitive performance than diagnostic CVM status. Integrating personalized measurements of CVM-specific brain signatures into phenotypic frameworks could guide early risk detection and stratification in clinical studies.

Introduction

Age-related progressive accumulation of chronic and often co-occurring modifiable health risks is estimated to contribute to up to 50% of all incident dementia cases globally¹, with population-attributable risks of 23.8% for hypertension¹, 14.1% for smoking², 20.9% for obesity, and 12.5% for type 2 diabetes³. The prevalence of these conditions and their contribution to dementia vary significantly by race, ethnicity, and socioeconomic status, resulting in health inequalities³. Robust detection of early neuroanatomical changes associated with these cardiovascular and metabolic risk factors (CVMs) could pave the way for early risk stratification, longitudinal monitoring of disease progression, and proactive management to mitigate cognitive decline. Understanding the distinct associations between specific CVMs and in vivo brain changes is crucial to disentangle the combined effects of comorbid CVMs and prioritize intervention targets.

Structural magnetic resonance imaging (sMRI) investigations based on diagnostic CVM labels have uncovered neuroanatomical changes such as hippocampal and whole brain atrophy or development of white matter hyperintensities that are hallmark signatures of cerebrovascular damage^4,5. However, individual differences in vulnerability to specific CVMs are not understood through group-level investigations, hindering the generalizability of findings to individual patients. This limitation stems from several factors: (1) conventional sMRI measures are unable to distinguish between the different CVMs, a key concern since each CVM carries varying dementia risks; (2) the underlying neuropathological processes are highly variable, leading to a spectrum of sMRI presentations that are not fully captured by diagnostic labels; (3) group-level investigations on small, selective samples are often underpowered to study the complex interplay between comorbid conditions present in real-world patients. To detect cerebrovascular changes early and measure disease severity and progression, therefore, robust and generalizable in vivo imaging markers that can quantify a specific CVM’s impact on an individual patient’s sMRI are needed. Crucially, such markers could help determine the factors that influence an individual’s vulnerability to CVM impacts, and potentially inform clinical trials in target selection and treatment measurement.

Quantifying brain health from neuroimaging data is feasible with machine-learning techniques that enable the mapping of multivariate sMRI measures into low-dimensional composite indices. For instance, the Spatial Patterns of Abnormalities for Recognition of Alzheimer’s Disease (SPARE-AD) is an individualized index reflecting the presence and severity of Alzheimer’s disease (AD)-like patterns of atrophy in the brain⁶ and is predictive of future cognitive decline⁷. Fueled by the integration and harmonization of multi-cohort MRI datasets⁸, these techniques yield interpretable and generalizable markers^7,8,9,10, facilitating patient-level evaluations of disease severity. The current study aims to determine (i) whether machine-learning techniques can detect and quantify subtle brain imaging signatures related to individual CVMs, and (ii) whether these signatures can be detected even in the presence of additional CVMs.

Here, we leverage the SPARE framework to investigate the neuroimaging signatures of specific CVMs in a cognitively asymptomatic population. Using a large, diverse cohort from 10 neuroimaging studies, we characterize sMRI signatures of hypertension, hyperlipidemia, smoking, obesity, and type 2 diabetes, and quantify their severity at the individual level. We show that the resulting markers, known collectively as SPARE-CVMs, are better at detecting CVM-related brain changes when compared to conventional sMRI markers, particularly in mid-life and early sub-clinical stages of the corresponding CVMs. Additionally, we validate the models on an external dataset, assess their robustness across demographic subgroups, and evaluate the impact of co-occurring CVMs on SPARE-CVM scores. SPARE-CVMs are associated with brain beta-amyloid status and cognitive performance, suggesting their potential to inform early risk detection and clinical decision-making. We demonstrate that SPARE-CVMs capture a nuanced phenotypic spectrum of imaging signatures, providing a more granular understanding of the impact of CVM on brain health beyond simple diagnostic categories.

Results

Overview of SPARE-CVM modeling and individualized severity estimation

An overview of the study workflow is provided in Fig. 1. A total of 20,000 participants from 10 studies in the Imaging-based coordinate SysTem for AGing and NeurodeGenerative diseases (iSTAGING) dataset, for whom sMRI measures were available, were used for training and validation of CVM signatures (Table 1, Supplementary Information S1). Participants were between 45 and 85 years of age (mean age (standard deviation, SD) = 64.1 (8) years, 54.5% female), and had no known cognitive impairment as defined by study-specific criteria. An independent validation dataset of N = 17,096 participants (mean age (SD) = 65.4 (7.4) years, 53.4% female) from the UK-Biobank study who were added to the 2020 data release (UKBIOBANK v1.7) was used to validate model results.

**Fig. 1: Development of SPARE-CVM models.**

Table 1 Overview of the data used for modeling imaging signatures of cardiovascular and metabolic risk factors (CVM)

Five separate support vector classification models described in the Methods section were trained to detect and quantify spatial sMRI patterns for each CVM: hypertension (HTN), hyperlipidemia (HL), smoking (SM), obesity (OB), and type 2 diabetes mellitus (T2D), to derive SPARE-HTN, SPARE-HL, SPARE-SM, SPARE-OB, and SPARE-T2D indices, respectively. CVM statuses were dichotomized as present (CVM+) or absent (CVM−) based on study-provided categorical responses and medication status where available, and augmented using traditional cut-offs applied to the continuous clinical measures (Supplementary Information S3). Clinical definitions for ground truth CVM status and sample sizes used in training are provided in Supplementary Table 3. The distribution of the ground truth CVM− and CVM+ labels for the five target conditions indicates a highly heterogeneous sample with co-occurring CVMs (Fig. 2). More than 30% of the samples had two or more co-occurring conditions (Supplementary Fig. 2). Our ML configuration achieved better performance compared to other commonly employed ML models (Supplementary Information S4, Supplementary Fig. 3) despite moderate area under the receiver operating characteristic curve (AUC) values for the training (and validation) datasets, which ranged from 0.64 (0.63) for SPARE-SM to 0.70 (0.72) for SPARE-OB (Supplementary Table 4). Three-dimensional projections of the resulting SPARE-CVM indices are shown in Fig. 2A–D. Supplementary Fig. 4 illustrates the heterogeneity of clinical profiles and neuroimaging signatures observed at the individual level. Greater expression of CVM severity is quantified as large positive values along the corresponding bar in the graph. The diverse magnitudes of SPARE-CVMs within this marker panel highlight their ability to detect subtle, spatially distributed sMRI patterns that are not easily discernible through visual inspection.
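As an illustration of the AUC values quoted above, the following minimal sketch computes a rank-based AUC from binary CVM labels and continuous scores. The function name `roc_auc` and the toy labels and scores are hypothetical and are not study data; the sketch assumes that a higher score indicates stronger expression of the CVM pattern and does not reproduce the authors' evaluation pipeline.

```python
def roc_auc(labels, scores):
    """Rank-based AUC: probability that a randomly chosen CVM+ case
    has a higher score than a randomly chosen CVM- case (ties count 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one CVM+ and one CVM- case")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical toy example (1 = CVM+, 0 = CVM-); values are illustrative only.
labels = [0, 0, 0, 1, 1, 1]
scores = [-1.2, -0.3, 0.4, -0.1, 0.8, 1.5]
auc = roc_auc(labels, scores)
```

An AUC of 0.5 corresponds to chance-level separation and 1.0 to perfect separation, which places the reported range of 0.64 to 0.70 in the moderate range.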

**Fig. 2: CVM co-occurrence and multi-morbidity influence SPARE-CVM profiles across phenotypic dimensions.**

SPARE-CVMs reveal distinct CVM-related spatial patterns on sMRI

Overall, higher SPARE-CVMs were associated with lower GM and WM volumes and higher WMH volumes, although the spatial patterns and strengths of correlation varied (Fig. 3 and Supplementary Fig. 5). While SPARE-SM was associated with global volume loss (blue colors), the other SPARE-CVMs were associated with more spatially specific patterns of volume differences. It is important to note that age was treated as a confounding variable for sMRI measures and SPARE-CVMs in multiple regression analyses. Hence, the positive associations noted below may be interpreted not as increasing volume, but rather as relatively preserved volume at older ages, suggesting potential resilience to atrophy in CVM+.

**Fig. 3: SPARE-CVMs capture distinct spatial sMRI patterns of CVMs.**

Cortical GM patterns—higher SPAREs for all CVMs were associated with a pattern of cortical atrophy in frontal GM regions including the anterior and posterior insula, the frontal and central opercular regions, and parts of the inferior frontal gyri, in parietal regions including the postcentral and supramarginal gyri, and temporal GM regions including the planum polare and planum temporale. In the frontal lobe, lower volumes in the middle frontal gyri, and the orbital gyri were associated with higher SPARE-HTN, SPARE-HL, and SPARE-SM, the subcallosal area with higher SPARE-HTN, SPARE-HL, and SPARE-OB, the posterior orbital gyri with higher SPARE-HTN and SPARE-T2D, and the supplementary motor cortex and the medial parts of the precentral and superior frontal gyri with SPARE-HL and SPARE-SM. In the parietal lobe, lower volumes in the angular gyrus were associated with higher SPARE-SM and SPARE-T2D. In the temporal lobe, lower volumes in the entorhinal area were associated with higher SPARE-HTN, SPARE-SM, SPARE-OB, and SPARE-T2D, and the superior temporal gyri with higher SPARE-SM and SPARE-T2D. In the occipital lobe, lower volumes in the lingual gyri were associated with higher SPARE-HTN, SPARE-SM, and SPARE-T2D, and lower volumes in the cuneus and calcarine cortices were associated with higher SPARE-T2D. Relatively higher volumes in the middle occipital gyri and the cingulate gyri were associated with higher SPARE-HL and SPARE-OB, the gyrus rectus with SPARE-HL, the supplementary motor cortex, the precuneus and the occipital gyri with higher SPARE-OB.

Deep GM patterns—among deep GM structures, lower volumes in the accumbens area were associated with higher SPARE scores for all CVMs except SPARE-HL, the thalamus with higher SPARE-HL, SPARE-SM, and SPARE-T2D, and the pallidum associated with SPARE-SM, SPARE-OB, and SPARE-T2D. Relatively higher volumes in the hippocampus were associated with higher SPARE-HL and SPARE-OB, the putamen with SPARE-HTN and SPARE-HL, and the caudate nuclei with SPARE-HTN.

WM patterns—lower volumes of WM regions were associated with SPARE-HL, SPARE-SM, SPARE-OB, and SPARE-T2D. The strongest associations were observed between volumes of the anterior internal capsule and cerebellar WM and SPARE-HL, SPARE-SM, and SPARE-T2D. In contrast, higher WM volumes were associated with higher SPARE-HTN. We speculate that this positive association is driven by the increased WMH presence which would contribute to the regional summary measures of WM extracted from T1-weighted sMRI.

WMH patterns—higher SPARE-HTN was associated with larger WMH volumes across most of the sub-cortical and deep WM partitions, but not with WMH in the occipital lobe. Higher WMH volumes in deep WM structures were associated with higher SPARE-HL, SPARE-SM, and SPARE-T2D.

SPARE-CVMs are more sensitive to target CVMs than other imaging markers and are robust across demographic subgroups

All 5 SPARE-CVMs showed medium-to-large Cohen’s d effect sizes in separating the corresponding CVM+ from CVM− individuals, with d = 0.67, 0.61, 0.49, 0.74, and 0.67 for HTN, HL, SM, OB, and T2D, respectively. SPARE-CVMs showed the highest effect sizes for their target CVMs, as seen along the diagonal elements outlined in blue in Fig. 4A. Replication on the independent dataset confirmed this finding (Supplementary Fig. 6A). Logistic regression analyses revealed that a unit increase in SPARE-CVM was associated with a higher odds ratio of having a positive status on the target CVM, ranging from OR = 1.79 (1.73–1.85) for SPARE-SM to OR = 2.39 (2.31–2.47) for SPARE-OB (Supplementary Fig. 6B). Additional sensitivity analyses revealed that SPARE-CVM effects are robust across self-identified sex, self-identified race, and levels of education (Supplementary Fig. 7).
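For readers less familiar with the two summary statistics reported above, the sketch below shows one common way to compute them. It assumes Cohen's d with a pooled standard deviation and a Wald-type 95% interval for the odds ratio; the source does not state which variants were used, and the function names and toy values are hypothetical.

```python
import math
from statistics import mean, variance


def cohens_d(group_pos, group_neg):
    """Cohen's d with pooled SD (assumed variant): standardized mean
    difference between CVM+ and CVM- scores."""
    n1, n2 = len(group_pos), len(group_neg)
    pooled_var = ((n1 - 1) * variance(group_pos)
                  + (n2 - 1) * variance(group_neg)) / (n1 + n2 - 2)
    return (mean(group_pos) - mean(group_neg)) / math.sqrt(pooled_var)


def odds_ratio_ci(beta, se, z=1.96):
    """Convert a logistic regression coefficient and its standard error
    into an odds ratio per unit increase, with a Wald interval."""
    return math.exp(beta), math.exp(beta - z * se), math.exp(beta + z * se)


# Hypothetical toy scores, not study data.
spare_pos = [1.0, 2.0, 3.0]   # scores in CVM+ individuals
spare_neg = [0.0, 1.0, 2.0]   # scores in CVM- individuals
d = cohens_d(spare_pos, spare_neg)

# A coefficient of ln(2) corresponds to a doubling of the odds per unit score.
odds_ratio, ci_low, ci_high = odds_ratio_ci(math.log(2.0), 0.1)
```

Under this convention, d = 0.5 means the CVM+ group mean lies half a pooled standard deviation above the CVM− group mean.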

**Fig. 4: SPARE-CVMs detect brain patterns more effectively across the clinical stages.**

In contrast, effect sizes for univariate volumetric measures and the ML-based SPARE-AD and SPARE-BA-Gap measures were largely negligible (Fig. 4A, Supplementary Fig. 8). Small effect sizes were observed for WMH volumes in separating HTN+/HTN− (d = 0.29) and for ventricular volumes in separating T2D+/T2D− (d = 0.29) individuals. SPARE-BA-Gap also showed small effect sizes, with CVM+ participants having older brain ages than their chronological age: d = 0.16 (+1.16 years) for HTN+, d = 0.09 (+0.6 years) for HL+, d = 0.29 (+2.1 years) for SM+, d = 0.11 (+0.9 years) for OB+, and d = 0.34 (+2.5 years) for T2D+ individuals.

SPARE-CVMs capture brain changes in sub-clinical stages

Associations between SPARE-CVMs and clinical stage or continuous measures of severity are shown in Fig. 4B, C. SPARE-HTN was significantly higher in participants categorized as “Stage 1” and “Stage 2” (+0.1 and +0.56, p << 0.0001) when compared to “Normal”. SPARE-HL was significantly higher only in participants categorized as “Very high” (+0.52, p << 0.0001) when compared to “Normal”, but not in “Elevated” or “High” categories. SPARE-T2D was significantly higher in participants categorized as “Prediabetic”, with values on par with the Diabetic group (+0.69 and +0.61, p << 0.0001), when compared with “Normal”. SPARE-HTN and SPARE-HL were elevated in medicated CVM+ individuals when compared to the unmedicated CVM+ individuals (SPARE-HTN +0.28 and SPARE-HL +0.52, p << 0.0001), but SPARE-T2D did not differ based on T2D medication status.

SPARE-OB correlated positively with BMI (r = 0.31, p << 0.0001). SPARE-SM was positively associated with years of SM after adjustment for age, increasing with each additional decade of smoking from +0.09 (p < 5 × 10⁻³) for 5–9 years to +0.42 (p << 0.0001) for >50 years of smoking.

SPARE-CVMs are more pronounced in mid-life

Age associations of effect sizes showed that effect sizes peaked at the 45–50 years age interval for SPARE-T2D, the 50–55 interval for SPARE-HL and the 60–65 interval for SPARE-HTN and SPARE-OB, and tapered off in older ages (Fig. 5). The decline in SPARE-CVM effect sizes with age was also independently confirmed through sensitivity analyses by training multiple SPARE models at separate age ranges (Supplementary Information S4.2, Supplementary Fig. 9).

**Fig. 5: SPARE-CVMs detect strong CVM effects in mid-life.**

Simultaneous presence of multiple CVMs is associated with co-expression of respective SPARE indices

Three-dimensional projections of SPARE-CVM distributions in Fig. 2A–D and Supplementary Fig. 10 show the influence of single- vs multi-morbidity on the corresponding SPARE scores. SPARE-CVMs for CVM+ individuals without comorbid conditions showed a clear separation in the corresponding dimension. For example, SPARE-OB in only-OB+ cases showed separation from SPARE-HTN, SPARE-HL, and SPARE-SM dimensions (Fig. 2B–D). By contrast, SPARE-CVMs for commonly comorbid CVMs showed higher co-expression, likely due to the shared brain patterns and the co-occurrence of CVMs in the population. HTN+ and HL+ were separable by the non-target SPARE model with small-to-medium effect sizes (Fig. 4A), and SPARE-HTN and SPARE-HL scores overlapped in participants with only-HTN+ and only-HL+ (Fig. 2A). T2D was rarely present without comorbidities in our dataset, resulting in small effect sizes observed for non-target SPARE-CVMs. When the analysis was restricted to comparing individuals with only one CVM against individuals with no CVMs, the effect sizes remained highest for the SPARE models and target CVMs (Supplementary Fig. 10B). Additional logistic regression analyses with CVM status (+/−) as the outcome variable and all five SPARE-CVMs as predictor variables demonstrate higher specificity of SPARE-CVM to the target CVM but not the comorbid CVMs (Supplementary Fig. 10C).

SPARE-CVMs had variable associations with amyloid deposition

A subset of our sample (N = 407) had amyloid status available within ±1 year of the MRI scan included in our dataset (Supplementary Table 5) and was included in multiple regression analyses to evaluate the interaction between amyloid deposition (Aβ+), CVM status and age on SPARE-CVM scores (Supplementary Fig. 11). Aβ+ participants were significantly older (p < 0.001) than Aβ− participants. Fewer participants were Aβ+ and OB+ when compared to Aβ+ and OB− in this cohort of CN participants (p < 0.001), perhaps suggesting that OB+ participants with Aβ+ were likely already experiencing cognitive symptoms warranting an MCI/dementia diagnosis or due to inclusion/exclusion criteria of the parent studies. SPARE-HTN in Aβ+ individuals was lower in the younger ages of this sample (<70 years, p < 0.01) and increased with advancing age (p < 0.05), but did not reveal a significant interaction between HTN+ and Aβ+ status. SPARE-SM was lower in SM−Aβ+ individuals (p < 0.05) and higher in SM+Aβ+ individuals (p < 0.05), but did not show significant interactions between Aβ+ and age. Significant age interactions were observed between OB status and Aβ+, where SPARE-OB decreased with age in OB−Aβ+ individuals (p < 0.05) and trended towards an increase with age in OB+Aβ+ individuals (p = 0.06). No significant associations were found between Aβ+ and HL status or T2D status on the corresponding SPARE scores.

SPARE-CVMs negatively correlated with cognitive performance

Significant negative associations (p < 0.05, corrected for multiple comparisons, Fig. 6) were observed between higher values for all SPARE-CVMs and lower DSST scores, longer time to complete TMT-A and TMT-B, and lower MOCA scores. Higher SPARE-HTN, SPARE-HL, SPARE-SM, and SPARE-T2D, but not SPARE-OB, were also associated with lower (by >10%) odds ratio of correct recall on the first attempt in the P-Mem test. Higher SPARE-HTN, SPARE-SM, SPARE-OB, and SPARE-T2D, but not SPARE-HL, were also associated with lower MMSE scores (Supplementary Fig. 12). In comparison, CVM status (+/−) was associated with performance on fewer cognitive tests.

**Fig. 6: SPARE-CVMs exhibit stronger associations with cognitive performance than CVM labels.**

SPARE-CVMs from harmonized data were more generalizable across study sites

To evaluate the impact of the unwanted technical variations caused by site differences on SPARE-CVMs, we compared them with identical ML models trained with unharmonized MUSE ROI volumes as input features. We found that harmonization drastically reduced site-related variability in SPARE-CVMs, as evidenced by lower analysis of variance F values (Supplementary Fig. 13). In contrast, SPARE scores trained using unharmonized MUSE ROI exhibited substantial site effects, limiting their generalizability beyond the original study sites despite effective CVM classification within those sites.
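The site-effect comparison described above can be sketched with a one-way analysis of variance across sites. The code below is a minimal illustration using only the standard library; the function name and the toy per-site scores are hypothetical, and the exact ANOVA model used in the study (for example, any covariates) is not specified in this section.

```python
from statistics import mean


def one_way_anova_f(groups):
    """One-way ANOVA F statistic: between-site variance of the scores
    relative to within-site variance. Larger F means stronger site effects."""
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / n_total
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)
    return (ss_between / (k - 1)) / (ss_within / (n_total - k))


# Hypothetical toy scores for three sites, not study data.
# "Unharmonized": each site is shifted by a site-specific offset.
unharmonized = [[1.0, 2.0, 3.0], [2.0, 3.0, 4.0], [3.0, 4.0, 5.0]]
# "Harmonized": the site offsets have been removed.
harmonized = [[1.0, 2.0, 3.0], [1.0, 2.0, 3.0], [1.0, 2.0, 3.0]]

f_unharmonized = one_way_anova_f(unharmonized)
f_harmonized = one_way_anova_f(harmonized)
```

In this toy case the F value drops once the site offsets are removed, which mirrors the direction of the comparison reported in Supplementary Fig. 13.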

Discussion

We leveraged machine-learning methods on a large multi-study sMRI dataset to tackle the persistent challenge of individual variability in CVM severity across the phenotypic spectrum in neuroimaging. ML-derived SPARE-CVM markers quantified characteristic neuroanatomical patterns associated with CVMs, detecting sub-clinical changes at the crucial period of mid-life ages, and showed strong associations with cognitive performance even in the absence of discernible cognitive impairments. Incorporating SPARE-CVMs into brain phenotyping can help assess the variable impact of CVMs on brain structural changes and their interactions with AD-related neurodegeneration at the individual level.

SPARE-CVMs showed better discrimination and more specific patterns of brain structure corresponding to each CVM, compared to established imaging markers including regional or whole brain volumes and other imaging markers such as SPARE-AD or SPARE-BA-Gap. In our models, we did not impose homogeneity or CVM exclusivity restrictions on our CVM+ participants, as evidence suggests that CVMs are more likely to occur together than separately at older ages and are commonly managed together in clinical practice⁹. Our results show that the shared expression of comorbid CVMs is captured by the individualized SPARE scores, enabling the investigations of the cumulative effects of comorbid CVMs.

The brain patterns associated with SPARE-CVMs are consistent with reported literature on CVM associations. Elevated lesion burden, particularly in the frontal WM, is a well-known indicator of vascular pathology. Similarly, the associations between elevated blood pressure and frontal and temporal lobe atrophy¹⁰, between smoking and total GM atrophy⁷, between obesity and inferior frontal lobe atrophy, including the insular regions¹¹, and between diabetes and superior temporal gyri¹² have been reported in previous investigations. The positive association between GM volumes and CVMs suggests relatively preserved volume at older ages, a result which appears unexpected but has nevertheless been previously reported in group-level investigations and meta-analytic studies^11,13,14. Further multimodal imaging investigations are warranted to determine the possible mechanisms driving the positive association, as it is unclear whether these associations were due to slower rates of age-related atrophy, tissue hypertrophy driven by compensatory or inflammatory processes, or reduced myelin concentration exacerbated by CVMs, which can lead to poor contrast between sub-cortical GM and deep WM structures in sMRI. Importantly, these positive associations might also reflect a relatively higher brain reserve, especially in relatively unaffected brain regions, in older individuals who remain cognitively normal despite the presence of CVMs.

SPARE-CVM scores showed associations with CVM severity. SPARE-HTN was higher in Stage 1 when compared to HTN−, SPARE-OB correlated with BMI, and SPARE-SM increased with years of smoking, suggesting a dose-dependent association between these CVMs and brain patterns. SPARE-T2D was higher in pre-diabetic and diabetic participants when compared to those with normal fasting glucose and HbA1c, confirming the association between poor glycemic regulation and brain alterations. Early brain changes have been observed in middle age¹² and older adults¹⁵ in association with insulin resistance¹⁶ and impaired glucose regulation resulting in high-normal glycemic levels¹⁷, and even within a year of diabetes diagnosis¹⁸.

We found greater separation in SPARE-CVMs between CVM− and CVM+ participants in mid-life ages when compared to older ages. This is likely to be driven by the multiplicity and heterogeneous progression of pathologic processes associated with aging and neurodegenerative diseases, confounding our ability to disentangle patterns of brain change specifically associated with individual CVMs at older ages. Conversely, relatively small brain changes expected at mid-life can stand out more vividly against a relatively lower background brain variability at this age range. Nevertheless, our models demonstrated the strongest effect sizes during the age range when the onset of CVMs carries the highest dementia risk. Age-varying associations of CVM onset with risk for cognitive decline and brain atrophy have been reported, with elevated risk for mid-life onset of T2D¹⁹, elevated blood pressure²⁰, and higher BMI²¹, after adjustments for disease duration. SPARE-CVMs could thus have a profound impact in identifying individuals vulnerable to cerebrovascular changes at mid-life ages, a crucial time window for treatment and lifestyle interventions.

Our sensitivity analyses investigating the influence of amyloid deposition on SPARE-CVMs revealed interactive effects between Aβ+ and CVM+ status consistent with the literature. Individuals exhibiting elevated Aβ burden and vascular risk demonstrate more severe brain atrophy both cross-sectionally^22,23,24 and longitudinally^25,26,27 in regions vulnerable to AD, such as the parietal lobe and hippocampus, compared to individuals with only one risk factor. This was also observed in our whole brain summary indices, with higher SPARE-HTN, SPARE-SM, or SPARE-OB values in participants with HTN+, SM+, or OB+ CVM status, respectively. CVMs occurring at mid-life appear to work synergistically with age, apolipoprotein E ε4, sex, and amyloid deposition to diminish brain integrity (atrophy, formation of white matter hyperintense lesions, and tau deposition)^24,25,26 and drive cognitive impairment (AD, vascular dementia). It is worth noting that studies such as ADNI excluded participants with high vascular burden, and investigations on imaging and Aβ, including those reported here, may not capture a broad range of CVM risk. Further investigations with more population-representative datasets and stratified mediation analyses are needed to provide insights into the mechanistic pathways connecting CVMs, brain structural and functional alterations, amyloid pathology, and dementia.

Higher SPARE-CVMs were associated with lower cognitive test scores despite no overall group differences between CVM− and CVM+ participants, highlighting the value of analyzing the nuances of CVM impact on the brain beyond investigating clinical labels. This finding further emphasizes the potential of SPARE-CVMs in the early identification of individuals at risk of greater cognitive decline. Additionally, since these modifiable risk factors for dementia and cardiovascular disease converge^28,29,30, lifestyle interventions targeting blood pressure, lipid, and glucose control, along with reducing body mass and tobacco use in high-risk individuals, can potentially alleviate the burden of cardiovascular diseases.

Our study has limitations. We did not study participants with cognitive impairment, or follow-up AD diagnoses, and hence we did not evaluate whether SPARE-CVMs can predict future dementia. ML models were constructed to optimally separate CVM+ from CVM− individuals. CVM labels were derived from varying sources (self-reports, diagnosis codes, and health records), which were not uniformly available across all studies. However, to avoid training errors due to mislabeled false negatives, we used clinical measures where available and excluded participants with missing data. The cross-sectional nature of our study and the unavailability of information regarding disease or treatment duration across the sample pose a challenge in interpreting results related to medication status. SPARE-HL and SPARE-HTN were higher in individuals who received medical treatment for HL and HTN respectively, whereas SPARE-T2D was similar between treated and untreated T2D+ individuals. Further longitudinal investigation is needed as it is likely that these are not direct treatment effects, but rather arise from pharmacological intervention being more common for participants who presented with worse CVM symptoms and hence more pronounced brain differences. Our SPARE-CVMs show only moderate AUC levels around 0.7 for CVM classification. This study primarily aimed to develop quantitative markers for CVM-related sMRI patterns, rather than establish a diagnostic tool based on MRI. The expression or manifestation of these CVM signatures at the time of MRI acquisition can be highly variable across participants due to factors like differing disease duration, severity management, and individual susceptibility to brain and cognitive changes. By quantifying the CVM-related neuroanatomical effects, SPARE-CVMs are intended to help identify individuals with increased severity of brain alterations and not replace the far simpler and routine clinical diagnostic tools (e.g., blood pressure or weight measurements). For risk factors such as smoking, the severity of the impact on the brain may depend on additional factors such as the number of pack years and the time since smoking cessation. This could partially explain the low effect size observed for SPARE-SM, where CVM+ was determined based on the number of years of smoking to maximize available data across studies.

While this study has limitations, it benefits from a large multi-study dataset consisting of cohorts from diverse geographical and demographic backgrounds, and sMRI data harmonized using state-of-the-art techniques. SPARE-CVMs leverage sMRI, a commonly used neuroimaging tool in clinical settings, and can be applied for retrospective investigations of existing clinical datasets as well as secondary analyses of prior clinical trials such as the systolic blood pressure intervention trial (SPRINT) for hypertension treatment³¹. Being individualized measures, SPARE-CVMs can potentially inform and influence the design of future clinical trials targeting modifiable risk factors. In particular, trials targeting a particular CVM might be better powered to detect treatment effects by using the respective SPARE-CVM as the outcome measure, rather than using non-specific brain measures such as global or regional brain volumes. Moreover, such trials can use SPARE-CVMs for the selection and stratification of patients by risk for cerebrovascular changes. Furthermore, our integrated approach creates new opportunities to investigate the susceptibility to CVM-related neurodegeneration from non-modifiable risk factors such as sex, race, and genetic factors.

In conclusion, this study presents a novel approach to quantify CVM-related brain changes using machine-learning-derived MRI markers. Our findings demonstrate that these markers have greater discriminatory power than conventional imaging markers, particularly in middle-aged individuals, enriching the array of imaging biomarkers for characterizing neuroanatomical signatures. By providing a more nuanced understanding of the impact of CVM on brain health, these markers have the potential to inform personalized diagnostics and patient management, guide the design of future clinical trials through increased precision of treatment effect evaluations, and advance our understanding of the mechanisms underlying CVM-related neurodegeneration through enhanced detection of subtle CVM-related brain changes in longitudinal studies.

Methods

Study participants

The University of Pennsylvania institutional review board approved the protocols of this research study. We used a large multi-study collection of sMRI pooled and harmonized for the iSTAGING dataset. The studies in iSTAGING were carried out in diverse international geographic locations, with varying scanners and acquisition parameters between 1995 and 2020 (Supplementary Information S1). Participants provided written informed consent to the corresponding source studies. The analysis focused on cross-sectional scans for a subgroup of participants within the iSTAGING dataset, all of whom had complete sMRI measures available (Table 1).

MRI preprocessing and harmonization

T1-weighted anatomical images were segmented into gray matter (GM) and white matter (WM) regions of interest using the multi-atlas, multi-warp segmentation (MUSE) tool³², which uses an ensemble of diverse brain atlases and is relatively robust to MRI scanner and protocol variations. Regional volumes from the multi-site studies were harmonized using ComBat-GAM⁸. This technique corrects for systematic variations in brain volumes caused by differences in scanning equipment (such as the manufacturer and model of the scanner, the magnetic field strength, and the hardware and sequences used) while ensuring that inherent biological differences among individuals, such as age, gender, and brain size, are preserved. DeepMRSeg, a deep-learning-based segmentation tool, was used to derive total intracranial volumes (ICV) from T1-weighted images and WMH from fluid-attenuated inversion recovery (FLAIR) and/or T2-weighted images³³. WMH volumes were summarized within the lobar and deep WM regions. Raw images and segmentation masks underwent a previously established two-step semi-automated quality control procedure³⁴, available as a standalone software package, MRISnapshot (https://cbica.github.io/MRISnapshot/). The procedure automatically ranked scans based on a quality score derived from segmented ROI volumes and flagged segmentations that deviated most from expected volume distributions. Flagged segmentations were visually inspected using visualization reports created by MRISnapshot to identify and exclude poor-quality segmentations. All regional volume measures of GM, WM, and WMH were adjusted for age, sex, and ICV, and the residuals were standardized to zero mean and unit variance for the ML models. See Supplementary Information S2 for a list of imaging features and an illustration of harmonization outcomes.
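The final adjustment step (regressing out age, sex, and ICV, then standardizing the residuals) can be illustrated with a minimal sketch. It assumes a plain linear adjustment fitted on the whole sample with NumPy; the text does not state the functional form or the reference sample used, and the function name and synthetic data are hypothetical.

```python
import numpy as np

def residualize_and_standardize(volumes, covariates):
    """Regress covariates out of each volume column, then z-score the residuals.

    Assumption: a linear model with an intercept; the study's exact
    adjustment model is not specified in the text.
    """
    volumes = np.asarray(volumes, dtype=float)
    covariates = np.asarray(covariates, dtype=float)
    # Design matrix: intercept plus covariates (here age, sex, ICV).
    design = np.column_stack([np.ones(len(covariates)), covariates])
    # Ordinary least squares fit for all volume columns at once.
    beta, *_ = np.linalg.lstsq(design, volumes, rcond=None)
    residuals = volumes - design @ beta
    # Standardize each column to zero mean and unit variance.
    return (residuals - residuals.mean(axis=0)) / residuals.std(axis=0)

# Synthetic stand-in data (not study data).
rng = np.random.default_rng(0)
n = 200
age = rng.uniform(45, 85, n)
sex = rng.integers(0, 2, n)
icv = rng.normal(1500, 120, n)
covariates = np.column_stack([age, sex, icv])
volumes = np.column_stack([
    8.0 - 0.02 * age + 0.002 * icv + rng.normal(0, 0.3, n),
    4.0 - 0.01 * age + 0.3 * sex + rng.normal(0, 0.2, n),
])
z = residualize_and_standardize(volumes, covariates)
```

After this step each feature column is uncorrelated with the adjusted covariates in the fitting sample, so a downstream classifier cannot separate groups on linear age, sex, or head-size effects carried by those features.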

Clinical data consolidation

Clinical measurements for CVM status were collected across studies using standard operating procedures—systolic and diastolic blood pressure for HTN, fasting blood measures for type 2 diabetes and hyperlipidemia, height and weight for estimating body mass index, and self-reports for smoking status. CVM statuses were dichotomized as present (CVM+) or absent (CVM−) based on study-provided categorical responses and medication status where available, and augmented using traditional cut-offs applied to the continuous clinical measures (Supplementary Information S3).

Machine-learning models

Supervised classifiers were trained independently for each of the five CVMs to separate the input features of CVM+ and CVM− participants, with no restrictions imposed for comorbidities. Linear support vector classifiers are well suited to our study goals: the decision boundary between classes is a linear combination of high-dimensional features, which makes the models more interpretable and reduces the risk of overfitting compared with more complex decision boundaries. Age, sex, ICV, harmonized local GM and WM volumes, and lobar WMH volumes were used as input features (n = 157), and the models were trained using a nested cross-validation procedure to reduce overfitting (Supplementary Information S4.1)³⁵. The training sets were stratified by CVM status for the nested k-fold cross-validation procedure to ensure an equivalent distribution of CVM− and CVM+ samples in each fold. Additionally, model performance was evaluated using scikit-learn’s balanced accuracy scorer, which averages the recall obtained on each class (equivalent to weighting each sample by the inverse prevalence of its class), thereby making the models less susceptible to class imbalance issues. The classifiers output continuous scalar values that summarize the degree to which SPARE-CVMs are expressed in an individual’s features. High/positive values suggest a clear presence and low/negative values suggest a relative absence of CVM-related patterns in the brain.
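A nested, stratified cross-validation of a linear support vector classifier scored by balanced accuracy can be sketched with scikit-learn as follows. This is an illustration on synthetic data, not the study's pipeline: the fold counts, the grid of C values, the scaling step, and `class_weight="balanced"` are all assumptions, and the study's actual configuration is described in its Supplementary Information S4.1.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import (GridSearchCV, StratifiedKFold,
                                     cross_val_predict, cross_val_score)
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import LinearSVC

# Synthetic stand-in for 157 features with imbalanced CVM-/CVM+ labels.
X, y = make_classification(n_samples=300, n_features=157, n_informative=10,
                           weights=[0.7, 0.3], random_state=0)

# Inner folds tune the regularization strength; outer folds estimate performance.
inner = StratifiedKFold(n_splits=3, shuffle=True, random_state=0)
outer = StratifiedKFold(n_splits=5, shuffle=True, random_state=1)

model = make_pipeline(
    StandardScaler(),
    LinearSVC(class_weight="balanced", dual=False),  # assumed settings
)
search = GridSearchCV(model, {"linearsvc__C": [0.001, 0.01, 0.1]},
                      scoring="balanced_accuracy", cv=inner)

# Balanced accuracy on each outer fold (hyperparameters chosen on inner folds only).
scores = cross_val_score(search, X, y, scoring="balanced_accuracy", cv=outer)

# Out-of-fold signed distances to the decision boundary: a continuous,
# SPARE-like score where positive values lean toward the CVM+ class.
spare_like = cross_val_predict(search, X, y, cv=outer,
                               method="decision_function")
```

Because every score in `spare_like` comes from a model that never saw that participant during training or tuning, the values can be compared across individuals without the optimism of in-sample predictions.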

Dissecting SPARE-CVM spatial patterns

To understand which imaging features contributed most to the SPARE-CVMs, associations between SPARE-CVMs and ROI/WMH volumes were assessed using multiple linear regressions for each sMRI feature, adjusting for age, sex, and ICV. The resulting p values were corrected for multiple comparisons using the Bonferroni method at an alpha of 0.001.
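The Bonferroni step amounts to comparing each p value with alpha divided by the number of tests. The sketch below assumes that 0.001 is the family-wise alpha, which is one reading of the text; the function name and example p values are hypothetical.

```python
def bonferroni_significant(p_values, alpha=0.001):
    """Flag tests that pass the Bonferroni-adjusted threshold alpha / m."""
    m = len(p_values)
    threshold = alpha / m
    return [p < threshold for p in p_values], threshold

# Hypothetical p values from four per-feature regressions.
p_values = [1e-8, 2e-4, 4e-6, 0.03]
flags, threshold = bonferroni_significant(p_values)
```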

Evaluating the separability and sensitivity of SPARE-CVMs

Cohen’s d effect sizes for SPARE-CVM differences between corresponding sets of CVM+ and CVM− groups were calculated. Higher d values indicate greater separability between the CVM+ and CVM− groups, with empirical benchmarks for small (d = 0.2), medium (d = 0.5), and large (d = 0.8) effect sizes³⁶. Similar analyses were performed on general neuroimaging measures, such as the volumes of the lateral ventricles, hippocampus, temporal GM, total GM, and total WMH, as well as previously established ML-based imaging markers for the brain age gap, defined as the difference between brain age estimated from sMRI and chronological age (SPARE-BA-Gap)³⁷, and Alzheimer’s disease (SPARE-AD)⁶. Effect size analyses were replicated for all ML-based imaging markers in the independent dataset (UKBIOBANK v1.7). We performed logistic regression to assess how a unit increase in each SPARE-CVM is associated with the odds of having a positive status for the target CVM.
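For reference, Cohen’s d with a pooled standard deviation can be computed as in the short sketch below. The pooled-variance form is the common textbook definition and is assumed here; the scores are made-up values, not study data.

```python
from math import sqrt
from statistics import mean, variance

def cohens_d(group_pos, group_neg):
    """Standardized mean difference using the pooled sample variance."""
    n1, n2 = len(group_pos), len(group_neg)
    pooled_var = ((n1 - 1) * variance(group_pos)
                  + (n2 - 1) * variance(group_neg)) / (n1 + n2 - 2)
    return (mean(group_pos) - mean(group_neg)) / sqrt(pooled_var)

# Hypothetical marker scores for CVM+ and CVM- participants.
cvm_pos = [0.9, 1.4, 0.2, 1.1, 0.7]
cvm_neg = [-0.3, 0.4, -0.8, 0.1, -0.5]
d = cohens_d(cvm_pos, cvm_neg)
```

Because d is expressed in units of the pooled standard deviation, it allows markers on different scales (for example a regional volume and a classifier score) to be compared directly.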

Evaluating the associations between SPARE-CVMs and clinical measures of CV health

To assess the clinical relevance of CVM signatures beyond the dichotomized CVM categories, we investigated the association between each SPARE-CVM and the underlying clinical measures. SPARE-CVMs were estimated for participants with sub-diagnostic clinical measures who were excluded from the model training: “Stage 1” for HTN (systolic blood pressure: 130–150 mmHg OR diastolic blood pressure: 80–95 mmHg), “Elevated” for HL (triglycerides: 150–199 mg/dL OR LDL: 130–159 mg/dL), and “Prediabetes” for T2D (fasting glucose: 100–125 mg/dL OR HbA1c: 5.7–6.4%). Additionally, SPARE-CVMs were compared between medicated and unmedicated CVM+ individuals with HTN, HL, and T2D. Pearson’s correlation was used to evaluate the association between SPARE-OB and continuous measures of BMI, including overweight (BMI 25–30 kg/m²) individuals who were not part of the model training. SPARE-SM was tested for its association with the number of years of smoking, a continuous measure that offers a partial estimate of risk severity.
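The sub-diagnostic ranges quoted above can be encoded as a small lookup, sketched below. The numeric ranges are copied from the text; the dictionary keys, field names, and function are hypothetical, and the sketch does not reproduce the full diagnostic criteria (for example, it does not exclude someone whose other measure exceeds the range).

```python
# Ranges copied from the text; labels and field names are hypothetical.
SUBDIAGNOSTIC_RANGES = {
    "HTN_stage1": {"systolic_mmHg": (130, 150), "diastolic_mmHg": (80, 95)},
    "HL_elevated": {"triglycerides_mg_dL": (150, 199), "ldl_mg_dL": (130, 159)},
    "T2D_prediabetes": {"fasting_glucose_mg_dL": (100, 125),
                        "hba1c_percent": (5.7, 6.4)},
}

def subdiagnostic_labels(measures):
    """Return categories for which ANY listed measure falls in its range (OR logic)."""
    labels = []
    for label, ranges in SUBDIAGNOSTIC_RANGES.items():
        for name, (low, high) in ranges.items():
            value = measures.get(name)  # missing measures are skipped
            if value is not None and low <= value <= high:
                labels.append(label)
                break
    return labels

# Hypothetical participant with no triglyceride measurement.
participant = {"systolic_mmHg": 134, "diastolic_mmHg": 78,
               "fasting_glucose_mg_dL": 96, "hba1c_percent": 5.9,
               "ldl_mg_dL": 120}
labels = subdiagnostic_labels(participant)
```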

Evaluating the associations between SPARE-CVMs and markers of AD pathology

We performed additional sensitivity analyses to evaluate the interactions between age, CVM status, and the presence of AD pathology on SPARE-CVMs using multiple linear regressions. The presence of amyloid pathology (Aβ+) was determined using cerebrospinal fluid (CSF) biomarkers and mean cortical positron emission tomography standardized uptake value ratios (SUVR). Study-specific cut-offs for CSF amyloid beta (Aβ42) concentrations were <192 pg/mL for ADNI³⁸ and <374.5 pg/mL for BIOCARD³⁹; for Pittsburgh compound B ([11C]PiB), SUVR ≥ 1.6 for ADNI⁴⁰, ≥1.5 for AIBL⁴¹, ≥1.06 for BLSA⁴², and >1.5 for OASIS⁴³; and for florbetapir ([18F]AV-45), SUVR ≥ 1.11 in ADNI and OASIS⁴⁴,⁴⁵.
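Because the cut-offs differ by study and by marker, amyloid status is easiest to derive from a lookup table. The sketch below copies the thresholds and comparison directions from the text; the marker labels and function name are hypothetical.

```python
import operator

# (study, marker) -> (comparison, threshold), as listed in the text.
# Marker labels such as "PiB_SUVR" are hypothetical names.
AMYLOID_CUTOFFS = {
    ("ADNI", "CSF_Abeta42"): ("<", 192.0),      # pg/mL
    ("BIOCARD", "CSF_Abeta42"): ("<", 374.5),   # pg/mL
    ("ADNI", "PiB_SUVR"): (">=", 1.6),
    ("AIBL", "PiB_SUVR"): (">=", 1.5),
    ("BLSA", "PiB_SUVR"): (">=", 1.06),
    ("OASIS", "PiB_SUVR"): (">", 1.5),
    ("ADNI", "AV45_SUVR"): (">=", 1.11),
    ("OASIS", "AV45_SUVR"): (">=", 1.11),
}
_OPS = {"<": operator.lt, ">=": operator.ge, ">": operator.gt}

def amyloid_positive(study, marker, value):
    """Return True if the value meets the study-specific positivity cut-off."""
    comparison, threshold = AMYLOID_CUTOFFS[(study, marker)]
    return _OPS[comparison](value, threshold)
```

Note the direction of the comparison: low CSF Aβ42 indicates positivity, whereas high PET SUVR does, and the OASIS PiB cut-off is a strict inequality while the AIBL one is not.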

Evaluation of associations between SPARE-CVMs and cognitive performance

We examined the association between SPARE-CVMs and cognitive performance in the following tests within the UK Biobank cohort: the digit symbol substitution test (DSST) for processing speed, the trail-making test (TMT-A and TMT-B) for executive function, and the prospective memory test (P-Mem). We implemented multivariate linear regression models (for continuous measures) and logistic regression models (for P-Mem) predicting cognitive performance from SPARE-CVMs while adjusting for the confounding covariates of age, sex, and years of education. The resulting p values were corrected for multiple comparisons using the Benjamini–Hochberg procedure for false discovery rates. Similar models were constructed for other cognitive tests that were available for a subset of studies, with additional corrections for the study data origin as a confounding variable (Supplementary Information S5.2).
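The Benjamini–Hochberg adjustment can be written in a few lines. This is a generic standard-library sketch of the step-up procedure with hypothetical p values, not the study's code, which presumably relied on one of the statistics libraries it lists.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p values in the original order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted

# Hypothetical p values from four cognitive-test models.
p_values = [0.01, 0.04, 0.03, 0.005]
q_values = benjamini_hochberg(p_values)
```

An association is then declared significant when its adjusted value falls below the chosen false discovery rate.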

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

SPARE-CVM indices derived in this study have been uploaded as Supplementary Data in the Source Data file. Original imaging and clinical data used in this study were obtained through data-sharing agreements from the following ten individual studies: Alzheimer’s Disease Neuroimaging Initiative (ADNI), Australian Imaging, Biomarker and Lifestyle Flagship Study of Ageing (AIBL), Biomarkers of Cognitive Decline Among Normal Individuals (BIOCARD), Baltimore Longitudinal Study of Aging (BLSA), Coronary Artery Risk Development in Young Adults (CARDIA), Open Access Series of Imaging Studies (OASIS), Penn Memory Center (PENN), UK Biobank (UKBB), Women’s Health Initiative Memory Study (WHIMS), and Wisconsin Registry for Alzheimer’s Prevention (WRAP). The data-sharing agreements do not include permission for us to share the data further. Investigators must apply to the source data providers to access additional data and match their subject IDs to those used in this study under the current protocol (primarily for UKBB). Data from ADNI and AIBL are available from the Imaging and Data Archive database (https://ida.loni.usc.edu) upon registration and compliance with the data usage agreement. Data from the UKBB are available upon request from the UKBB website (https://www.ukbiobank.ac.uk/). Data from the BLSA study are available upon request at https://www.blsa.nih.gov/how-apply. Data from the OASIS study are available upon request at https://www.oasis-brains.org/. Data requests for BIOCARD, PENN, WRAP, CARDIA, and WHIMS datasets should be directed to M.S.A., D.A.W., S.C.J., L.J.L., and M.A.E., respectively. Upon obtaining access to the source data, investigators can match our derived SPARE-CVM indices to the rest of the data from these studies. Further assistance in matching the R-indices can be requested from the corresponding last author, C.D., at Christos.Davatzikos@pennmedicine.upenn.edu, with responses typically provided within 2 weeks.
Moreover, we are actively following protocols to upload our derived measures to the UKBB and ADNI websites, making them directly accessible to investigators who obtain access to those studies. Source data are provided with this paper.

Code availability

Modeling and analyses used Python (version 3.8.1), and the models were developed using scikit-learn (version 1.3.2). We used several other Python and R libraries to support data analysis and visualization, including pandas (version 1.5.3), statsmodels (version 0.13.2), numpy (version 1.22.4), matplotlib (version 3.5.13), seaborn (version 0.12.2), scipy (version 1.7.3), ggplot2 (version 3.4.4), and venn (version 1.11). Python scripts for data processing are available on GitHub: https://github.com/CBICA/NiChart. Machine-learning models used in this project are available on GitHub via Zenodo: https://doi.org/10.5281/zenodo.14872922. This study’s SPARE-CVM models and normative distributions are available in the NiChart: Neuro Imaging Chart of AI-based Imaging Biomarkers platform (https://neuroimagingchart.com/). NiChart allows researchers across the globe to upload their study data, process structural MRI to derive volumetric features, harmonize these features to the iSTAGING dataset, and predict SPARE-CVMs. The visualization modules in NiChart then allow comparisons of the derived imaging signatures with the normative data in the dimensional coordinate system or as distribution plots.

References

Borelli, W. V. et al. Preventable risk factors of dementia: population attributable fractions in a Brazilian population-based study. Lancet Reg. Health Am. 11, 100256 (2022).
Livingston, G. et al. Dementia prevention, intervention, and care: 2020 report of the Lancet Commission. Lancet 396, 413–446 (2020).
Lee, M. et al. Variation in population attributable fraction of dementia associated with potentially modifiable risk factors by race and ethnicity in the US. JAMA Netw. Open 5, e2219672 (2022).
Erus, G. et al. Spatial patterns of structural brain changes in type 2 diabetic patients and their longitudinal progression with intensive control of blood glucose. Diabetes Care 38, 97–104 (2015).
Habes, M. et al. White matter hyperintensities and imaging patterns of brain ageing in the general population. Brain 139, 1164–1179 (2016).
Davatzikos, C. et al. Longitudinal progression of Alzheimer’s-like patterns of atrophy in normal older adults: the SPARE-AD index. Brain 132, 2026–2035 (2009).
Elbejjani, M. et al. Cigarette smoking and gray matter brain volumes in middle age adults: the CARDIA Brain MRI sub-study. Transl. Psychiatry 9, https://doi.org/10.1038/s41398-019-0401-1 (2019).
Pomponio, R. et al. Harmonization of large MRI datasets for the analysis of brain imaging patterns throughout the lifespan. Neuroimage 208, 116450 (2020).
Davis, J. W., Chung, R. & Juarez, D. T. Prevalence of comorbid conditions with aging among patients with diabetes and cardiovascular disease. Hawaii Med. J. 70, 209–213 (2011).
Beauchet, O. et al. Blood pressure levels and brain volume reduction: a systematic review and meta-analysis. J. Hypertens. 31, 1502–1516 (2013).
Herrmann, M. J. et al. Grey matter alterations in obesity: a meta-analysis of whole-brain studies. Obes. Rev. 20, 464–471 (2019).
Fang, F. et al. Brain atrophy in middle-aged subjects with type 2 diabetes mellitus, with and without microvascular complications. J. Diab. 10, 625–632 (2018).
Morys, F., Dadar, M. & Dagher, A. Association between midlife obesity and its metabolic consequences, cerebrovascular disease, and cognitive decline. J. Clin. Endocrinol. Metab. 106, e4260–e4274 (2021).
Trofimova, O. et al. Brain tissue properties link cardio-vascular risk factors, mood and cognitive performance in the CoLaus|PsyCoLaus epidemiological cohort. Neurobiol. Aging 102, 50–63 (2021).
Moran, C. et al. Type 2 diabetes mellitus, brain atrophy, and cognitive decline. Neurology 92, e823–e830 (2019).
Neth, B. J. & Craft, S. Insulin resistance and Alzheimer’s disease: bioenergetic linkages. Front. Aging Neurosci. 9, 345 (2017).
Cherbuin, N., Sachdev, P. & Anstey, K. J. Higher normal fasting plasma glucose is associated with hippocampal atrophy: The PATH Study. Neurology 79, 1019–1026 (2012).
Lee, J. H. et al. Morphometric changes in lateral ventricles of patients with recent-onset type 2 diabetes mellitus. PLoS ONE 8, e60515 (2013).
Xu, W. et al. Mid- and late-life diabetes in relation to the risk of dementia: a population-based twin study. Diabetes 58, 71–77 (2009).
Shang, X. et al. The association of age at diagnosis of hypertension with brain structure and incident dementia in the UK Biobank. Hypertension 78, 1463–1474 (2021).
Lane, C. A. et al. Investigating the relationship between BMI across adulthood and late life brain pathologies. Alzheimer’s Res. Ther. 13, 91 (2021).
Villeneuve, S. et al. Vascular risk and Abeta interact to reduce cortical thickness in AD vulnerable brain regions. Neurology 83, 40–47 (2014).
Jang, H. et al. Association of glycemic variability with imaging markers of vascular burden, beta-amyloid, brain atrophy, and cognitive impairment. Neurology 102, e207806 (2024).
Ye, B. S. et al. Amyloid burden, cerebrovascular disease, brain atrophy, and cognition in cognitively impaired patients. Alzheimers Dement. 11, 494–503.e3 (2015).
Rabin, J. S. et al. Association of beta-amyloid and vascular risk on longitudinal patterns of brain atrophy. Neurology 99, e270–e280 (2022).
Lo, R. Y. et al. Vascular burden and Alzheimer disease pathologic progression. Neurology 79, 1349–1355 (2012).
Hohman, T. J. et al. Stroke risk interacts with Alzheimer’s disease biomarkers on brain aging outcomes. Neurobiol. Aging 36, 2501–2508 (2015).
Newman, A. B. et al. Dementia and Alzheimer’s disease incidence in relationship to cardiovascular disease in the Cardiovascular Health Study cohort. J. Am. Geriatr. Soc. 53, 1101–1107 (2005).
Noale, M., Limongi, F. & Maggi, S. Epidemiology of cardiovascular diseases in the elderly. Adv. Exp. Med Biol. 1216, 29–38 (2020).
Muqtadar, H., Testai, F. D. & Gorelick, P. B. The dementia of cardiac disease. Curr. Cardiol. Rep. 14, 732–740 (2012).
Nasrallah, I. M. et al. Association of intensive vs standard blood pressure control with magnetic resonance imaging biomarkers of Alzheimer disease: secondary analysis of the SPRINT MIND Randomized Trial. JAMA Neurol. 78, 568–577 (2021).
Doshi, J. et al. MUSE: MUlti-atlas region Segmentation utilizing Ensembles of registration algorithms and parameters, and locally optimal atlas selection. Neuroimage 127, 186–195 (2016).
Doshi, J. et al. DeepMRSeg: a convolutional deep neural network for anatomy and abnormality segmentation on MR images. Preprint at https://doi.org/10.48550/arXiv.1907.02110 (2019).
Srinivasan, D. et al. A comparison of Freesurfer and multi-atlas MUSE for brain anatomy segmentation: Findings about size and age bias, and inter-scanner stability in multi-site aging studies. NeuroImage 223, 117248 (2020).
Govindarajan, S. T. et al. Machine learning reveals distinct neuroanatomical signatures of cardiovascular and metabolic diseases in cognitively unimpaired individuals. GitHub via Zenodo. https://doi.org/10.5281/zenodo.14872923 (2025).
Cohen, J. Statistical Power Analysis for the Behavioral Sciences (Academic Press, 2013).
Habes, M. et al. The Brain Chart of Aging: machine-learning analytics reveals links between brain aging, white matter disease, amyloid burden, and cognition in the iSTAGING consortium of 10,216 harmonized MR scans. Alzheimers Dement. 17, 89–102 (2021).
Shaw, L. M. et al. Cerebrospinal fluid biomarker signature in Alzheimer’s disease neuroimaging initiative subjects. Ann. Neurol. 65, 403–413 (2009).
Soldan, A. et al. Hypothetical preclinical Alzheimer disease groups and longitudinal cognitive change. JAMA Neurol. 73, 698–705 (2016).
Ewers, M. et al. CSF biomarker and PIB-PET-derived beta-amyloid signature predicts metabolic, gray matter, and cognitive changes in nondemented subjects. Cereb. Cortex 22, 1993–2004 (2012).
Rowe, C. C. et al. Amyloid imaging results from the Australian Imaging, Biomarkers and Lifestyle (AIBL) study of aging. Neurobiol. Aging 31, 1275–1283 (2010).
Kamil, R. J. et al. Vestibular function and beta-amyloid deposition in the Baltimore longitudinal study of aging. Front. Aging Neurosci. 10, 408 (2018).
Lopes Alves, I. et al. Strategies to reduce sample sizes in Alzheimer’s disease primary and secondary prevention trials using longitudinal amyloid PET imaging. Alzheimer’s Res. Ther. 13, 82 (2021).
Joshi, A. D. et al. Performance characteristics of amyloid PET with florbetapir F 18 in patients with Alzheimer’s disease and cognitively normal subjects. J. Nucl. Med. 53, 378–384 (2012).
Johnson, K. A. et al. Florbetapir (F18-AV-45) PET to assess amyloid burden in Alzheimer’s disease dementia, mild cognitive impairment, and normal aging. Alzheimers Dement. 9, S72–S83 (2013).

Acknowledgements

The iSTAGING study is a multi-institutional effort funded by the National Institute on Aging (NIA) by RF1 AG054409 (C. Davatzikos). Data used in preparation of this article were obtained from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database (adni.loni.usc.edu). As such, the investigators within the ADNI contributed to the design and implementation of ADNI and/or provided data but did not participate in the analysis or writing of this report. A complete listing of ADNI investigators can be found at: http://adni.loni.usc.edu/wp-content/uploads/how_to_apply/ADNI_Acknowledgement_List.pdf. ADNI is funded by the NIA, the National Institute of Biomedical Imaging and Bioengineering, and through generous contributions from the following: AbbVie, Alzheimer’s Association; Alzheimer’s Drug Discovery Foundation; Araclon Biotech; BioClinica; Biogen; Bristol-Myers Squibb; CereSpir; Cogstate; Eisai; Elan Pharmaceuticals; Eli Lilly and Company; EuroImmun; F. Hoffmann-La Roche and its affiliated company Genentech; Fujirebio; GE Healthcare; IXICO; Janssen Alzheimer Immunotherapy Research & Development; Johnson & Johnson Pharmaceutical Research & Development; Lumosity; Lundbeck; Merck & Co; Meso Scale Diagnostics; NeuroRx Research; Neurotrack Technologies; Novartis Pharmaceuticals Corporation; Pfizer; Piramal Imaging; Servier; Takeda Pharmaceutical Company; and Transition Therapeutics. The Canadian Institutes of Health Research is providing funds to support ADNI clinical sites in Canada. Private sector contributions are facilitated by the Foundation for the National Institutes of Health (www.fnih.org). The grantee organization is the Northern California Institute for Research and Education, and the study is coordinated by the Alzheimer’s Therapeutic Research Institute at the University of Southern California. ADNI data are disseminated by the Laboratory for Neuro Imaging at the University of Southern California. 
Data used in the preparation of this article were obtained from the Australian Imaging Biomarkers and Lifestyle flagship study of ageing (AIBL) funded by the Commonwealth Scientific and Industrial Research Organisation (CSIRO) which was made available at the ADNI database (www.loni.usc.edu/ADNI). The AIBL researchers contributed data but did not participate in the analysis or writing of this report. AIBL researchers are listed at www.aibl.csiro.au. The BIOCARD study is partly supported by NIH grant U19-AG033655 (M.S.A.). The BLSA neuroimaging study is funded by the Intramural Research Program, NIA, National Institutes of Health (NIH), and by HHSN271201600059C (S.M.R., M.B., Y.A.). CARDIA study is conducted and supported by the NHLBI in collaboration with the University of Alabama at Birmingham (HHSN268201300025C and HHSN268201300026C), Northwestern University (HHSN268201300027C), University of Minnesota (HHSN268201300028C), Kaiser Foundation Research Institute (HHSN268201300029C), and Johns Hopkins University School of Medicine (HHSN268200900041C). CARDIA is also partially supported by the Intramural Research Program of the National Institute on Aging (NIA) and an intra-agency agreement between NIA and NHLBI (AG0005) (L.J.L.). Data used in the preparation of this article was obtained from the OASIS study funded in part by grants P50 AG05681, P01 AG03991, P01 AG026276, R01 AG021910, P20 MH071616, U24 RR021382 for OASIS-1, P50 AG05681, P01 AG03991, P01 AG026276, R01 AG021910, P20 MH071616, U24 RR021382 for OASIS-2, and NIH P30 AG066444, P50 AG00561, P30 NS09857781, P01 AG026276, P01 AG003991, R01 AG043434, UL1 TR000448, R01 EB009352 for OASIS-3 (T.B., D.M., J.M., P.L.). Data used in the preparation of this article was obtained at Penn Alzheimer’s Disease Research Center funded in part by grant P30 AG072979 (D.A.W.). Data used in the preparation of this article was obtained from the UK Biobank Resource under application number 35148. 
The Women’s Health Initiative was funded by the National Heart, Lung and Blood Institute of the NIH, US Department of Health and Human Services. Contracts HHSN268200464221C and N01-WH-4-4221 provided additional support. The WHIMS (M.A.E.) was funded in part by Wyeth Pharmaceuticals. The WRAP study was supported by grants: NIH R01AG027161 and R01AG054047 (S.C.J.). The authors would like to acknowledge the clinical and neuropathology diagnostic support provided by the Wisconsin ADRC’s Clinical, Neuropathology and Biomarkers Cores, and biostatistical support provided by the Data Management and Biostatistics Core. S.T. Govindarajan was partly supported by the Alzheimer’s Association Research Fellowship AARFD-23-1151286. A.A. was funded through grants 191026 and 206795 awarded by the Swiss National Science Foundation. M.H. was supported by grant 1R01AG080821 from the National Institutes of Health. Funding sources had no role in the study design, data collection, analysis, interpretation, or writing of the study report. The opinions and conclusions contained in this publication are solely those of the authors and are not necessarily endorsed by the associated studies, institutions, and funding agencies and should not be assumed to reflect their opinions or conclusions.

Author information

Authors and Affiliations

Center for Biomedical Image Computing and Analytics, University of Pennsylvania, Philadelphia, PA, USA
Sindhuja Tirumalai Govindarajan, Elizabeth Mamourian, Guray Erus, Randa Melhem, Jimit Doshi, Raymond Pomponio, Haochang Shou, Ilya M. Nasrallah & Christos Davatzikos
Centre for Artificial Intelligence, ZHAW School of Engineering, Winterthur, Switzerland
Ahmed Abdulkadir
Department of Radiology and Biomedical Imaging, University of California, San Francisco, San Francisco, CA, USA
Duygu Tosun
Laboratory of Behavioral Neuroscience, National Institute on Aging, National Institutes of Health, Baltimore, MD, USA
Murat Bilgel, Yang An & Susan M. Resnick
Department of Radiology, Washington University School of Medicine, St. Louis, MO, USA
Aristeidis Sotiras, Daniel S. Marcus, Pamela LaMontagne & Tammie L. S. Benzinger
Sticht Center for Healthy Aging and Alzheimer’s Prevention, Wake Forest School of Medicine, Winston-Salem, NC, USA
Mark A. Espeland
Department of Biostatistics and Data Science, Wake Forest School of Medicine, Winston-Salem, NC, USA
Mark A. Espeland
Florey Institute, The University of Melbourne, Parkville, VIC, Australia
Colin L. Masters & Paul Maruff
Neuroepidemiology Section, Intramural Research Program, National Institute on Aging, Bethesda, MD, USA
Lenore J. Launer
CSIRO Health and Biosecurity, Australian e-Health Research Centre CSIRO, Brisbane, Queensland, Australia
Jurgen Fripp
Wisconsin Alzheimer’s Institute, University of Wisconsin School of Medicine and Public Health, Madison, WI, USA
Sterling C. Johnson
Knight Alzheimer Disease Research Center, Washington University in St. Louis, St. Louis, MO, USA
John C. Morris
Department of Neurology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
Marilyn S. Albert
Department of Radiology, University of Pennsylvania, Philadelphia, PA, USA
R. Nick Bryan & Ilya M. Nasrallah
Biggs Alzheimer’s Institute, University of Texas San Antonio Health Science Center, San Antonio, TX, USA
Mohamad Habes
Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania, Philadelphia, PA, USA
Haochang Shou
Department of Neurology, University of Pennsylvania, Philadelphia, PA, USA
David A. Wolk

Contributions

Study concept and design was by S.T.G. and C.D. Model development was by S.T.G. Data interpretation was by S.T.G., H.S., I.M.N., and C.D. Drafting of the manuscript was by S.T.G., I.M.N., and C.D. Statistical analysis was by S.T.G. Data collection and processing was by S.T.G., E.M., G.E., A.A., R.M., J.D., R.P., D.T., M.B., Y.A., A.S., D.S.M., P.L., T.L.S.B., M.A.E., C.L.M., P.M., L.J.L., J.F., S.C.J., J.C.M., M.S.A., R.N.B., S.M.R., D.A.W., and C.D. Critical revision of the manuscript for important intellectual content was by S.T.G., E.M., G.E., A.A., R.M., J.D., R.P., D.T., M.B., Y.A., A.S., D.S.M., P.L., T.L.S.B., M.A.E., C.L.M., P.M., L.J.L., J.F., S.C.J., J.C.M., M.S.A., R.N.B., S.M.R., M.H., H.S., D.A.W., I.M.N., and C.D.

Corresponding authors

Correspondence to Sindhuja Tirumalai Govindarajan or Christos Davatzikos.

Ethics declarations

Competing interests

T.L.S.B. has received investigator-initiated research funding from the NIH, the Alzheimer’s Association, the Foundation at Barnes-Jewish Hospital, Siemens Healthineers, and Avid Radiopharmaceuticals (a wholly owned subsidiary of Eli Lilly and Company). She participates as a site investigator in clinical trials sponsored by Eli Lilly and Company, Biogen, Eisai, Janssen, and Roche. She has served as a paid and unpaid consultant to Eisai, Siemens, Biogen, Janssen, and Bristol-Myers Squibb. J.C.M. has served as a paid consultant to the Barcelona Brain Research Center and the Native Alzheimer Disease-related Resource Center in Minority Aging Research. He also received payments for presentations at the AAIM meeting, Longer Life Foundation, and the International Brain Health Symposium. J.C.M. has received travel support to attend meetings including AAIM, DIAN, AD/PD, ATRI/ADNI, ADRC, ADC, the International Conference on Health Aging & Biomarkers, and the International Brain Health Symposium. He has served on the advisory board for the Cure Alzheimer’s Fund and LEADS at Indiana University. S.M.R. is an NIA IRP employee and has served on the advisory board of Dementia Platforms, UK, the Canadian Consortium on Neurodegeneration in Aging, and the Adult Aging Brain Connectome. She has received travel support from the McKnight Foundation to attend an annual meeting. D.A.W. has served as a paid consultant to Beckman Coulter and Eli Lilly. He also received grants from the NIH and Biogen paid to his institution and received travel support from the Alzheimer’s Association. He has served on the DSMB of studies by Functional Neuromodulation and GSK. The other authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Raphael Castilhos, Yaou Liu, and Jakub Nalepa for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

For the iSTAGING study, the Preclinical AD consortium, the ADNI, and the CARDIA studies: Christos Davatzikos.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Govindarajan, S.T., Mamourian, E., Erus, G. et al. Machine learning reveals distinct neuroanatomical signatures of cardiovascular and metabolic diseases in cognitively unimpaired individuals. Nat Commun 16, 2724 (2025). https://doi.org/10.1038/s41467-025-57867-7

Received: 09 May 2024
Accepted: 03 March 2025
Published: 19 March 2025
DOI: https://doi.org/10.1038/s41467-025-57867-7