Abstract
Background
Drug development and disease prevention of heart failure (HF) and atrial fibrillation (AF) are impeded by a lack of robust early-stage surrogates. We determined to what extent cardiac magnetic resonance (CMR) measurements act as surrogates for the development of HF or AF.
Methods
Genetic data were sourced on the association with 21 atrial and ventricular CMR measurements. Mendelian randomization was used to determine CMR associations with AF, HF, non-ischaemic cardiomyopathy (NICM), and dilated cardiomyopathy (DCM), noting that the definition of NICM includes DCM as a subset. Additionally, for the CMR surrogates of AF and HF, we explored their association with non-cardiac traits potentially influenced by cardiac disease liability.
Results
In total we find that 7 CMR measures (biventricular ejection fraction (EF) and end-systolic volume (ESV), as well as LV systolic volume (SV), end-diastolic volume (EDV), and mass to volume ratio (MVR)) associate with the development of HF, 5 with the development of NICM (biventricular EDV and ESV, LV-EF), 7 with DCM (biventricular EF, ESV, EDV, and LV end-diastolic mass (EDM), and 3 associate with AF (LV-ESV, RV-EF, RV-ESV). Higher EF of both ventricles associate with lower risk of HF and DCM, with biventricular ESV associating with all four cardiac outcomes. Higher values of biventricular EDV associate with lower risk of HF, and DCM. Exploring the associations of these CMR cardiac disease surrogates with non-cardiac traits confirms a strong link with diastolic blood pressure, as well as more specific associations with lung function (LV-ESV), HbA1c (LV-EDM), and type 2 diabetes (LV-SV).
Conclusions
The current paper identifies key CMR measurements that may act as surrogate endpoints for the development of HF (including NICM and DCM) or AF.
Plain language summary
Heart failure and irregular heart rhythms are common and serious heart conditions. Finding early warning signs could help prevent these diseases or enable the development of better treatments. This study looked at heart measurements taken using heart scans to see whether these could act as early indicators of future heart problems. Using information about the genes people had inherited, we investigated whether changes in heart size or function are linked to a person’s risk of developing heart disease. We found several measurements that are strongly linked to later heart failure or irregular heart rhythms. These findings suggest that heart-based measurements could help identify people at higher risk of heart problems earlier. In the future, this might help guide treatment decisions or identify new ways to prevent heart disease.
Similar content being viewed by others
Introduction
Heart failure (HF) and atrial fibrillation (AF) are major cardiac diseases that cause a considerable burden in terms of health and economic costs, as well as mortality1,2,3. HF is a clinical diagnosis secondary to dysfunction of the right ventricle (RV) or left ventricle (LV), while AF is defined by uncoordinated electrical activation and consequently ineffective contraction of the atria. Both diseases are intricately related, and while the causative relationship between the two conditions has not been fully determined, it is clear these two diseases frequently co-occur4.
Despite recent advances in medicines, for example, offered by sodium-glucose co-transporter-2 inhibitors, drug development for cardiac disease suffers from high failure rates, often occurring during costly late-stage clinical testing5,6,7. Unlike the cholesterol content on low-density lipoprotein particles for coronary heart disease, drug development for AF and HF is impeded by a lack of robust early-stage surrogates (or intermediates) for cardiac disease.
Cardiac magnetic resonance (CMR) imaging is the gold standard for the quantification of atrial and ventricular function and morphology and has become an integral diagnostic modality for cardiac diseases. It is, however, unclear to what extent CMR measurements act as surrogates for the development of cardiac disease in otherwise healthy individuals.
Both HF and AF are associated with multimorbidity, including non-cardiac diseases, such as stroke, chronic kidney disease (CKD), diabetes mellitus (T2DM), and neurological diseases, such as Alzheimer’s disease (AD). Because HF and AF are clinical manifestations of underlying changes in cardiac function and structure, patients with similar diagnoses may vary considerably in underlying pathophysiology and disease progression. Unlike HF or AF diagnoses, CMR measurements directly reflect cardiac physiology and therefore, provide an opportunity to explore the effects changes in cardiac function and structure may elicit in other organs.
Recently, CMR measurements of thousands of subjects have been linked to genetic data and analysed through genome-wide association studies (GWAS). Aggregate data from GWAS, consisting of variant-specific point estimates and standard errors, can be used in Mendelian randomization analyses to ascertain the causal effects a CMR trait may have on disease. In the current manuscript, we leveraged data from two recent GWAS of CMR measurements of cardiac structure and function8, and left atrial (LA) volume9, jointly consisting of 21 measurements conducted in over 35,000 UK biobank (UKB) participants. These data were used to conduct Mendelian randomization analyses to determine the potential association between CMR traits and cardiac events, specifically focussing on AF and HF, as well as the aetiological HF subtypes: dilated cardiomyopathy (DCM) and non-ischaemic cardiomyopathy (NICM)10,11. Subsequently, we explored the association of CMR surrogates for HF or AF with 19 clinically relevant non-cardiac traits, focussing on traits potentially influenced by cardiac disease liability, such as blood pressure, kidney and lung function markers, diabetes, and stroke subtypes. In this study, we identify seven CMR measures associated with the development of HF, seven with DCM, five with the development of NICM, and three with AF. Moreover, we find that many of these CMR measures are associated with diastolic blood pressure, while specific individual measures also relate to lung function and glucose metabolism.
Methods
Genetic data on CMR and cardiac traits
We leveraged aggregate data (i.e. point estimates and standard errors) from two GWAS of deep-learning derived CMR measurements conducted using UKB participants; please see the Supplementary Methods and the specific study references for details on the derivation methods. Ahlberg et al.9 provided measurements on LA volume (LA-V (max) and LA-V (min)), LA total emptying fraction (LA-TF), LA active emptying fraction (LA-AF), and passive emptying fraction (LA-PF) from 35,658 subjects. Schmidt et al.8 provided (n: 36,548) data on biventricular ejection fraction (EF), stroke volume (SV), peak filling rate (PFR), peak atrial filling rate (PAFR) peak ejection rate (PER), end-diastolic or end-systolic volumes (EDV, ESV), LV end-diastolic mass (LV-EDM), and LV mass to volume ratio (LV-MVR). The deep-learning algorithms employed for automated CMR analyses included outlier detection steps and removing CMRs from the minority of subjects who were, for example, out of sinus rhythm.
GWAS data were included on the following cardiac outcomes: HF (52,496 cases)12, NICM (1816 cases)13, DCM (2719 cases)14, and AF (60,620 cases)15. Here, NICM and DCM were sourced from distinct samples, where the phenotypic overlap between DCM and NICM (with DCM being a more homogenous subgroup of NICM) allowed for indirectly replication of our findings. The following 19 traits were used in the non-cardiac phenome-wide scan: five stroke subtypes16, venous thromboembolism (VTE)17, abdominal aortic aneurysm (AAA)17, systolic/diastolic blood pressure (SBP/DBP)18, body mass index (BMI)19, T2DM20, glycated haemoglobin (HbA1c) from the Neale UKB analysis (http://www.nealelab.is/uk-biobank), C-reactive protein (CRP)21, lung function measurement from the Neale UKB analysis (http://www.nealelab.is/uk-biobank, forced expiratory volume: FEV1, forced vital capacity: FVC, peak expiratory flow: PEF), CKD22, estimated glomerular filtration rate (eGFR)22, and AD23; please see the data availability section for more detail. The original GWA studies sought to prevent bias due to population stratification or cryptic relatedness by either removing non-European individuals (based on genetic principal component analysis), related individuals, or accounting for this through mixed-effects models such as BOLT-LMM.
Mendelian randomization analysis
Genetic instruments were selected from throughout the genome using an F-statistic > 24 and a minor allele frequency of at least 0.01. Variants were clumped to a linkage disequilibrium (LD) R-squared threshold of 0.30, with residual LD modelled using a generalized least square (GLS) solution24 and a reference panel from a random sample of 5000 of white British ancestry UKB participants. By actively modelling the remaining LD structure, the employed GLS estimators prevent potential bias in the standard error estimates due to the correlation between variants while at the same time optimising precision and stability by including additional variants24. To maximise the number of available variants, we did not perform any additional distance-based clumping. Steiger filtering25 was employed to remove variants with a likely direct causal effect on the outcome instead of on the exposure (i.e., removing variants affecting the outcome before the change in exposure occurred: variant -> outcome -> exposure), ensuring the remaining genetic instruments supported an association model where the exposure occurred before the outcome.
Mendelian randomization was conducted using the GLS implementation of the inverse-variance weighted (IVW) estimator, as well as with an Egger correction to protect against horizontal pleiotropy26. To further minimize the potential influence of horizontal pleiotropy, we excluded variants with a leverage value of more than 3 times the mean or an outlier Chi-square statistic above 11, with the Q-statistic identifying possible remaining violations27. Noting the lack of power of the Egger intercept test, we instead report the p-value of the Q-test27. The Rucker model selection framework was applied to select the most appropriate estimator (IVW or MR-Egger) for each individual exposure-outcome relation27,28. To ensure a sufficient number of variants were available to accurately explore possible horizontal pleiotropy effects, we dropped CMR measurement with less than 5 variants. The influence of the horizontal pleiotropy assumption was additionally evaluated using the weighted median estimator, which assumes at least 50% of the information is derived from valid instruments. Importantly, other than assuming at least 50% of the variants are valid instruments, the median estimator does not make any specific assumptions on the type of horizontal pleiotropy affecting the invalid genetic instruments. The weighted median estimator, therefore, provides a middle ground between the IVW estimator (which assumes the complete absence of horizontal pleiotropy) and the MR-Egger estimator (which allows for 100% of the variants acting through a horizontal pleiotropy pathway)29,30. Given that the Median estimator does not allow for the inclusion of correlated variants, the genetic instruments used in this analysis were pruned to an R-squared of 0.05.
Previous Mendelian randomisation studies have often applied a p-value threshold of 5 × 10-8 (approximately equal to an F-statistic of 30) to identify instruments with a sufficiently strong exposure association. While this conservative threshold does protect against weak-instrument bias, applying a lower F-statistic threshold may beneficially increase the number of available variants and thereby decrease the type 2 error rate. We ensured the presented results remained sufficiently protected against the potential influence of weak-instrument bias by applying a reasonably high F-statistic threshold and by prioritising outcome GWAS with limited overlap with UKB data used by the CMR studies31. Nevertheless, the following outcomes were based on GWAS where over 50% of the participants were sourced from the UKB: AF, NICM, SBP & DBP, BMI, HbA1c, lung function metrics. It is therefore important to note that in large sample size settings (where the estimated F-statistic converges to the true F-statistic) the multiplicative inverse of the estimated F-statistic approximates the amount of bias in one-sample settings (with complete sample overlap), in our cases this implies the average amount of potential bias with complete sample overlap is about 4%31.
Where appropriate, results were presented as odds ratio (OR, for binary traits) with 95% confidence interval (95%CI) or mean difference (MD, for continuous traits). Given the close interrelationship between CMR traits themselves, as well as between the considered cardiac traits (e.g., with HF, DCM, and NCIM reflecting closely related diseases) we sought to identify the subset of CMR traits with strong support for cardiac involvement. Importantly, because of the aforementioned phenotypic interrelationship, associations with multiple cardiac traits should be viewed as supportive rather than penalised. We, therefore, applied a two-stage approach to address multiple testing. First, individual associations with cardiac outcomes were declared significant using the standard alpha of 0.05. Next, following Storey32, multiplicity was addressed by using Kolmogorov-Smirnov “KS”-tests to de-prioritise CMR traits for which the p-value distribution followed a uniform distribution indicative of an overall null-association. Focussing on the CMR traits that were rejected by the KS-tests we conducted a targeted phenome-wide scan to explore their potential association with the aforementioned non-cardiac traits, applying a multiplicity corrected alpha of 2.63 × 10-4 (correcting for the 10 remaining CMR traits and 19 exposure).
As outlined in the supplementary methods, we additionally investigate the extent to which increased HF liability is associated with a rise in drug prescriptions33, which might act as mediators.
Institutional review board approval
Aside from the UKB LD reference data, the current study exclusively uses summary-level GWAS statistics, with download URLs provided in the data availability section. For all included GWA studies, all participants provided informed consent, and study protocols were approved by their respective local ethical committee. The UK Biobank has ethical approval from the North West Multi-centre Research Ethics Committee to handle human participant data. Written informed consent was obtained from all participants, and all data was deidentified for analysis. This research has been conducted using the UK Biobank Resource under Application Number 12113. Given that the current study uses aggregate GWAS data, we did not seek further ethical approval.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Results
Biventricular and atrial CMR associations with cardiac outcomes
We employed Mendelian randomization to determine the potential association between CMR traits and the liability of AF, HF, DCM, and/or NICM. Due to the limited number of available variants (fewer than 5), the following CMR parameters were excluded from the analysis: LA-PF and LA-TF, biventricular PAFR and PFR, as well as LV-PER.
Prioritising the initial Rucker based MR analysis (Supplementary Figs. 1, 2, Data 1) on results which passed multiplicity filtering (Fig. 1), we identified 10 CMR traits which were associated (multiple) cardiac outcomes: biventricular EF, ESV, EDV, LV-SV, LV-EDM, LV-MVR, LA -V (min). We subsequently compared these Rucker-based point estimates to those obtained using the weighted median estimator (which assumes horizontal pleiotropy is absent in at least 50% of the used genetic variants), which showed strong agreement: Spearman’s correlation of 0.86 (p-value 1.10 × 10-11); Figs. 2, 3, Data 2.
Nota bene: the bar chart in (a) represent the -log10 (p-value) of Kolmogorov-Smirnov test results, were a significant p-value indicates the associations are unlikely driven by multiple testing. The horizontal line indicates the significance threshold of 0.05/14 – representing the 14 CMR traits with at least five available variants in the cardiac outcome analysis. The bar chart (b) represents the counts of significant Mendelian randomization CMR effects grouped by chamber. These counts disregard CMRs which did not pass the Kolmogorov-Smirnov test for multiplicity, and are based on the weighted median results. The bar chart in (c) represents the counts of significant Mendelian randomization CMR effects on the considered phenome-wide traits. Results are based on the CMRs which passed Kolmogrov-Smirnov test for multiplicity, with the Rucker selected (IVW/Egger) results depicted as the wider yellow bars, and the thinner red bars representing the weighted median results. The following abbreviations were used, LV left-ventricle, RV right-ventricle, RA right-atrial, LA right-atrial, HCM hypertrophic cardiomyopathy, DCM dilated cardiomyopathy, AF atrial fibrillation, T2DM type 2 diabetes, CKD chronic kidney disease, VTE venous thromboembolism, AAA abdominal aortic aneurysm, SBP/DBP systolic/diastolic blood pressure, BMI body mass index, CRP c-reactive protein, FVC forced vital capacity, FEV1 forced expiratory volume, PEF peak expiratory flow, eGFR: estimated glomerular filtration rate, HbA1c glycated haemoglobin, CE cardioembolic, LA large artery, IS ischaemic, SV small vessel, AD Alzheimer’s disease. See Data 1–4 for the underlying data.
Nota bene: point estimates reflect odds ratios (OR) with 95% confidence intervals presented as horizontal line segments. CMR measurements that passed the Kolmogorov-Smirnov test for multiplicity are indicated with a star. LV left-ventricle, RV right-ventricle, EF ejection fraction, SV stroke volume, PFR peak filling rate, PER peak ejection rate, EDV/ESV diastolic or systolic volumes, EDM end diastolic mass, MVR mass to volume ratio. Outcome data were available on HF (heart failure, 52,496 cases), DCM (dilated cardiomyopathy, 2719 cases), NICM (non-ischaemic cardiomyopathy, 1816 cases), and AF (atrial fibrillation, 60,620 cases). See Data 1 and 2 for the underlying data.
Nota bene: point estimates reflect odds ratios (OR) with 95% confidence intervals presented as horizontal line segments. CMR measurements that passed the Kolmogorov-Smirnov test for multiplicity are indicated with a star. LA left-atrial, V (max) maximum volume, V (min) minimum volume, AF active emptying fraction, PF passive emptying fraction, PFR peak filling rate. Outcome data were available on HF (heart failure, 52,496 cases), DCM (dilated cardiomyopathy, 2719 cases), NICM (non-ischaemic cardiomyopathy, 1816 cases), and AF (atrial fibrillation, 60,620 cases). See Data 2 for the underlying data. See Data 1 and 2 for the underlying data.
Focussing on the subset of estimates with a constant association across the considered MR estimators, we found that increased biventricular EF associated with lower risk of HF and DCM, with LV-EF additionally associated with lower risk of NICM. Similar biventricular associations were observed for EDV (lower risk of NICM, DCM), and ESV (higher risk of AF, HF, DCM, NICM). Additionally, higher LV-EDM was associated with higher DCM risk (OR 3.93, 95%CI 1.43; 10.83), and higher LV-SV was associated with higher HF risk (OR 1.33, 95%CI 1.06; 1.67). Contrary to the Rucker MR analysis, in the MR-median analysis, LA-V (min) did not associate with cardiac disease; Figs. 1–3.
Associations of CMR-derived indices of cardiac function and structure with non-cardiac traits
Focusing on the same 10 CMR measurements, we next explored whether changes in cardiac function and structure could be associated with non-cardiac traits. We note that the correlation between the Rucker-based point estimates and weighted median-based estimates was lower for non-cardiac trait associations: 0.54 (p-value 2.06 × 10-14), suggesting the specific horizontal pleiotropy assumption was more influential. In line with this the weighted median-based estimator identified substantially fewer significant associations with non-cardiac traits (Figs. 1 and 4, Data 3, 4, Supplementary Fig. 3, 4). Focussing on the subset of non-cardiac traits with concordant results between estimators identified five CMR traits which were strongly associated with a decrease in DPB (LV-EF, LV-SV, LV-EDV, RV-EDV, and LV-EDM), as well as single CMR traits associating with FEV1, HbA1c, and T2DM.
Nota bene: p-values passing the 0.05 threshold are indicated by an open diamond, with stars indicating results passing a threshold of 2.6 × 10−4. Cells were coloured by effect direction times -log10(p-value); where p-values were truncated at 8 for display purposes. CMR traits were selected based on the Kolmogorov-Smirnov test for multiplicity, with effects estimated using the weighted median estimator. The following abbreviations were used, LV left-ventricle, RV right-ventricle, LA right-atrial, EF ejection fraction, SV stroke volume, EDV/ESV diastolic or systolic volumes, EDM end diastolic mass, MVR mass to volume ratio V (max): maximum volume, AF atrial fibrillation, T2DM type 2 diabetes, CKD chronic kidney disease, VTE venous thromboembolism, AAA abdominal aortic aneurysm, SBP/DBP systolic/diastolic blood pressure, BMI: body mass index, CRP c-reactive protein, FVC forced vital capacity, FEV1 forced expiratory volume, PEF peak expiratory flow, eGFR estimated glomerular filtration rate, HbA1c glycated haemoglobin, CE cardioembolic, LA large artery, IS: ischaemic, SV small vessel, AD Alzheimer’s disease. See Data 3–6 for the underlying data.
Comparison to HF and AF associations with non-cardiac traits
Next, as comparison, we leveraged genetic instruments associated with a clinical diagnosis of HF or AF and performed Mendelian randomization to determine the association an increased liability of HF or AF had on non-cardiac traits (Fig. 4, Supplementary Fig. 4, Data 5, 6). An increased HF liability was strongly associated with the development of (any/ischaemic) stroke, as well as an increased blood pressure, increased risk of CKD, and T2DM. AF liability was strongly associated with the development of cardioembolic stroke. Please refer to the supplementary methods and Supplementary Table 1 for additional analyses exploring potential associations between HF liability and drug prescription.
Discussion
In the current manuscript, we employed Mendelian randomization combined with CMR measurements to identify surrogate outcomes for the onset of HF (52,496 cases) and AF (60,620 cases). We showed that biventricular EF was associated with the development of HF and DCM, with RV-EF additionally associating with AF. Biventricular ESV was associated with the development of all four cardiac outcomes, with biventricular EDV associating with DCM and NICM, and LV-EDV additionally with HF, and LV-EDM associating with the onset of DCM. Importantly, we observed strong consistency in terms of effect direction and magnitude across HF, DCM and NICM, which represent strongly related cardiac events. We found that the development of HF or AF reflects a combination of changes in cardiac function and structure.
In total, we identified 7 CMR measures (biventricular EF and ESV, LV-SV, LV-EDV, LV-MVR) associated with the development of HF, 5 with development of NICM (biventricular EDV and ESV, LV-EF), 7 with DCM (biventricular EF, ESV, EDV, LV-EDM), and 3 with AF (LV-ESV, RV-EF, RV-ESV). This implies that CMR measurements may be useful to monitor disease occurrence in subjects without pre-existing cardiac disease and help identify high-risk patients in need of preventative measures. Additionally, our findings suggest that CMR measurement might be used as surrogate endpoints in early clinical studies, which can assist in prioritizing compounds for confirmatory outcome trials.
We explored the phenotypic effects that changes in cardiac function and structure may have on non-cardiac traits (Figs. 1 and 4), finding strong support for an association with diastolic blood pressure. Despite a moderate correlation between the Rucker-based MR results and the weighted median results (correlation of 0.54), there was substantial disagreement between the amount of significant findings. whereas the Rucker-based method supported more prolific associations between CMR measurements and a substantial number of non-cardiac traits. The disagreement in terms of significant findings suggests that the CMR associations with non-cardiac traits are at least partially influenced by horizontal pleiotropy. The moderate correlation between point estimates does however, suggest that some of the difference in significance might be due to the lower power of the median estimator29. By discounting discordant results, we identified the subset of effect estimates that are relatively robust to the specific horizontal pleiotropy assumption and are, therefore, less likely to be affected by this type of bias.
We compared the CMR associations with non-cardiac traits to the associations of HF/AF liability with these same non-cardiac traits, suggesting that the associations of CMR traits and HF/AF liability are distinct. For example, a higher liability of HF was associated with a higher risk of (ischaemic) stroke, diabetes, and kidney disease, while a higher AF liability was associated with a higher cardioembolic stroke risk. While diabetes is a known risk factor for HF, we found that an increased HF liability was associated with an increased risk of diabetes. Rather than reflecting a direct effect of HF, we hypothesize that this association may reflect a mediation pathway where an increased HF liability is associated with an increased prescription rate of diabetes associated medicines34,35. To show this, we conducted an MR analysis of HF liability and its association with drug prescriptions using a GWAS from Wu et al.33, confirming that an increased risk of HF is associated with cardiovascular drug prescriptions affecting diabetes risk such as statins/HMGCR inhibitors; Supplementary Table 1.
The study has a number of limitations that deserve consideration. First, while we sourced genetic associations with CMR measurements taken from subjects without pre-existing cardiac conditions, a proportion of subjects may have had undiagnosed diseases. The UKB, however, represents a relatively healthy subset of the UK population, likely minimizing the number of individuals with latent disease. The potential influence of this is further limited by employing a two-sample MR design, where the exposure and outcome GWAS are sourced from (partially) non-overlapping samples with distinct disease patterns. This two-sample design further ensures that any potential weak-instrument bias acts towards a null effect. We do emphasise that a subset of GWAS used as a source of outcome data did include UKB data (Supplemental Table 2), which may have slightly increased anticipated bias proportional to the sample overlap (or overlap in cases for binary outcomes). For example31, with 100% sample overlap, our employed F-statistic (used to select instruments) would result in a small amount of bias 1/24 ≈ 0.04, with a sample overlap of 30% this would be 0.04 × 0.3 ≈ 0.01, and zero in the absence of sample overlap. As such, by combining large sample size GWAS data with limited sample overlap and relatively strong instruments the presented results are protected against potential weak-instrument bias. Second, our choice of CMR measurement was limited by the publicly available data, for example preventing us from exploring the association between the ratio of measure (such as PEF/EDV or PFR/EDV) not available in the original GWAS. Third, while Mendelian randomization is robust against bias due to reverse causality and confounding, it critically assumes the absence of horizontal pleiotropy, where the genetic variant only affects the outcome through its association with the CMR measurement. In the current analysis, we performed automatic model selection to decide between an IVW or more robust MR-egger models, removed potentially pleiotropic variants through the identification and removal of outliers and high-leverage points, and prioritised results with a significant and directionally consistent effect using the Median MR estimator. Fourth, the conducted Mendelian randomization analyses implicitly assess a linear trend between CMR and outcome. In the presence of nonlinearity, the presented Mendelian randomization estimates represent a population average effect, which may not necessarily apply to any single individual but often offers a reasonable approximation. While non-linear Mendelian randomization methods have been developed36,37, these require access to individual participant data, which, even for UKB-sized data, only offers a fraction of the disease cases we have been able to leverage here. Finally, due to the cumulative nature of GWAS, where results from previous studies are typically meta-analysed in subsequent studies, there was insufficient independent data to replicate our findings. Potentially, the growing number of non-European GWAS will provide avenues for cross-ancestry replication.
In conclusion, we have identified biventricular CMR measurements that may act as surrogate endpoints for future cardiac events, including heart failure, cardiomyopathies, and atrial fibrillation. We additionally show that changes in cardiac function and structure affect blood pressure, as well as identifying potential associations with lung function and glucose homoeostasis.
Data availability
The source data for Fig. 1 is available as Data 1-4, the source data for Figs. 2, 3 is available as Data 1-2, the source data for Fig. 4 is available as Data 2-6. The individual variants and their trait associations have been included as Supplementary Data 7. The UKB data can be requested from www.ukbiobank.ac.uk, conditional on an approved project. Most of the GWAS data are publicly available from the following download links: for the CMR associations from Ahlberg et al. (https://zenodo.org/records/5074929) and Schmidt et al. (https://www.ebi.ac.uk/gwas/publications/37126556), for HF (http://results.globalbiobankmeta.org/), for NICM (https://cvd.hugeamp.org/), for AF, for stroke (subtypes) from GIGASTROKE (https://www.ebi.ac.uk/gwas/publications/36180795), venous thromboembolism and abdominal aortic aneurysm from (https://www.globalbiobankmeta.org/), blood pressure from Evangelou et.al. (https://www.nature.com/articles/s41588-018-0205-x), glycemic traits, and lung function measurement were sourced from (http://www.nealelab.is/uk-biobank); type 2 diabetes from DIAGRAM; BMI from GIANT; CRP from (https://www.ebi.ac.uk/gwas/publications/30388399); the CKDGen consortium provided GWAS associations on estimated glomerular filtration rate, and chronic kidney disease (http://ckdgen.imbi.uni-freiburg.de/); Alzheimer’s disease data were sourced from Jansen et.al. (https://ctg.cncr.nl/software/summary_statistics), and drug prescriptions from (https://www.ebi.ac.uk/gwas/publications/31015401) Finally, the aggregate GWAS results for DCM can be requested from the GWAS corresponding authors.
Code availability
Analyses were conducted using Python v3.7.10 (for GNU Linux), Pandas v1.3.5, Numpy v1.20.3, and Matplotlib 3.3.2. Scripts and data necessary to generate the illustrations have been deposited: 10.5522/04/2863919338.
References
Taylor, C. J. et al. Trends in survival after a diagnosis of heart failure in the United Kingdom 2000-2017: population based cohort study. BMJ 364, l223 (2019).
Lee, D. S. et al. A systematic assessment of causes of death after heart failure onset in the community: impact of age at death, time period, and left ventricular systolic dysfunction. Circ. Heart Fail 4, 36–43 (2011).
Al-Khayatt, B. M. et al. Paradoxical impact of socioeconomic factors on outcome of atrial fibrillation in Europe: trends in incidence and mortality from atrial fibrillation. Eur. Heart J. 42, 847–857 (2021).
Anter, E., Jessup, M. & Callans, D. J. Atrial Fibrillation and Heart Failure. Circulation 119, 2516–2525 (2009).
Redfield, M. M. et al. Isosorbide mononitrate in heart failure with preserved ejection fraction. N. Engl. J. Med. 373, 2314–2324 (2015).
Gheorghiade, M. et al. Effect of Vericiguat, a Soluble guanylate cyclase stimulator, on natriuretic peptide levels in patients with worsening chronic heart failure and reduced ejection fraction: The SOCRATES-REDUCED Randomized Trial. JAMA 314, 2251–2262 (2015).
Fordyce, C. B. et al. Cardiovascular drug development: is it dead or just hibernating? J. Am. Coll. Cardiol. 65, 1567–1582 (2015).
Schmidt, A. F. et al. Druggable proteins influencing cardiac structure and function: Implications for heart failure therapies and cancer cardiotoxicity. Sci. Adv. 9, eadd4984 (2023).
Ahlberg, G. et al. Genome-wide association study identifies 18 novel loci associated with left atrial volume and function. Eur. Heart J. 42, 4523–4534 (2021).
Arbelo, E. et al. 2023 ESC Guidelines for the management of cardiomyopathies. Eur. Heart J. 44, 3503–3626 (2023).
Pinto, Y. M. et al. Proposal for a revised definition of dilated cardiomyopathy, hypokinetic non-dilated cardiomyopathy, and its implications for clinical practice: a position statement of the ESC working group on myocardial and pericardial diseases. Eur. Heart J. 37, 1850–1858 (2016).
Wu, K.-H. H. et al. Polygenic risk score from a multi-ancestry GWAS uncovers susceptibility of heart failure. 2021.12.06.21267389 Preprint at https://doi.org/10.1101/2021.12.06.21267389 (2021).
Aragam, K. G. et al. Phenotypic refinement of heart failure in a National Biobank facilitates genetic discovery. Circulation 139, 489–501 (2019).
Garnier, S. et al. Genome-wide association analysis in dilated cardiomyopathy reveals two new players in systolic heart failure on chromosomes 3p25.1 and 22q11.23. Eur. Heart J. 42, 2000–2011 (2021).
Nielsen, J. B. et al. Biobank-driven genomic discovery yields new insight into atrial fibrillation biology. Nat. Genet. 50, 1234–1239 (2018).
Mishra, A. et al. Stroke genetics informs drug discovery and risk prediction across ancestries. Nature 611, 115–123 (2022).
Zhou, W. et al. Global Biobank Meta-analysis Initiative: Powering genetic discovery across human disease. Cell Genom. 2, 100192 (2022).
Evangelou, E. et al. Genetic analysis of over 1 million people identifies 535 new loci associated with blood pressure traits. Nat. Genet. 50, 1412–1425 (2018).
Pulit, S. L. et al. Meta-analysis of genome-wide association studies for body fat distribution in 694,649 individuals of European ancestry. Hum. Mol. Genet. 28, 166–174 (2019).
Mahajan, A. et al. Refining the accuracy of validated target identification through coding variant fine-mapping in type 2 diabetes. Nat. Genet. 50, 559–571 (2018).
Ligthart, S. et al. Genome Analyses of >200,000 Individuals Identify 58 Loci for chronic inflammation and highlight pathways that link inflammation and complex disorders. Am. J. Hum. Genet. 103, 691–706 (2018).
Wuttke, M. et al. A catalog of genetic loci associated with kidney function from analyses of a million individuals. Nat. Genet. 51, 957–972 (2019).
Jansen, I. E. et al. Genome-wide meta-analysis identifies new loci and functional pathways influencing Alzheimer’s disease risk. Nat. Genet. 51, 404–413 (2019).
Burgess, S., Zuber, V., Valdes-Marquez, E., Sun, B. B. & Hopewell, J. C. Mendelian randomization with fine-mapped genetic data: Choosing from large numbers of correlated instrumental variables. Genet. Epidemiol. 41, 714–725 (2017).
Hemani, G., Tilling, K. & Davey Smith, G. Orienting the causal relationship between imprecisely measured traits using GWAS summary data. PLoS Genet. (2017) https://doi.org/10.1371/journal.pgen.1007081.
Bowden, J., Smith, G. D. & Burgess, S. Mendelian randomization with invalid instruments: Effect estimation and bias detection through Egger regression. Int J. Epidemiol. 44, 512–525 (2015).
Bowden, J. et al. A framework for the investigation of pleiotropy in two-sample summary data Mendelian randomization. Stat. Med. 36, 1783–1802 (2017).
Bowden, J. et al. Improving the visualization, interpretation and analysis of two-sample summary data Mendelian randomization via the Radial plot and Radial regression. Int J. Epidemiol. 47, 1264–1278 (2018).
Bowden, J., Davey-Smith, G., Haycock, P. C. & Burgess, S. Consistent estimation in Mendelian randomization withsome invalid instruments using a weighted median estimator. Genet. Epidemiol. 40, 304–14 (2016).
Schmidt, A. F. & Dudbridge, F. Mendelian randomization with Egger pleiotropy correction and weakly informative Bayesian priors. Int. J. Epidemiol. 47, 1217–1228 (2018).
Burgess, S., Davies, N. M. & Thompson, S. G. Bias due to participant overlap in two-sample Mendelian randomization. Genetic Epidemiology. 40, 597–608 (2016).
Storey, J. D. A direct approach to false discovery rates. J. R. Stat. Soc. Ser. B: Stat. Methodol. 64, 479–498 (2002).
Wu, Y. et al. Genome-wide association study of medication-use and associated disease in the UK Biobank. Nat. Commun. 10, 1891 (2019).
Shen, L. et al. Role of diuretics, β blockers, and statins in increasing the risk of diabetes in patients with impaired glucose tolerance: reanalysis of data from the NAVIGATOR study. BMJ 347, f6745 (2013).
Swerdlow, D. I. et al. HMG-coenzyme A reductase inhibition, type 2 diabetes, and bodyweight: evidence from genetic analysis and randomised trials. Lancet 385, 351–361 (2015).
Silverwood, R. J. et al. Testing for non-linear causal effects using a binary genotype in a Mendelian randomization study: application to alcohol and cardiovascular traits. Int J. Epidemiol. 43, 1781–1790 (2014).
Staley, J. R. & Burgess, S. Semiparametric methods for estimation of a nonlinear exposure-outcome relationship using instrumental variables with application to Mendelian randomization. Genet. Epidemiol. 41, 341–352 (2017).
A Mendelian randomization analysis of cardiac MRI measurements as surrogate outcomes for heart failure and atrial fibrillation. https://doi.org/10.5522/04/28639193.
Acknowledgements
This research has been conducted using the UK Biobank Resource under Application Numbers 24711, 12113, and 44972. The authors are grateful to UK Biobank participants. UK Biobank was established by the Wellcome Trust medical charity, Medical Research Council, Department of Health, Scottish Government, and the Northwest Regional Development Agency. It has also had funding from the Welsh Assembly Government and the British Heart Foundation. AFS is supported by BHF grants PG/18/5033837, PG/22/10989, and the UCL BHF Research Accelerator AA/18/6/34223. CF and AFS received additional support from the National Institute for Health Research University College London Hospitals Biomedical Research Centre, and the Rosetrees Trust. This work was funded by UK Research and Innovation (UKRI) under the UK government’s Horizon Europe funding guarantee EP/Z000211/1. JvS is supported by Dutch Heart Foundation grant 03-004-2019-T045. ATR is supported by the CardioVascular Research Initiative of the Netherlands Heart Foundation (CVON 2015-12 eDETECT) and ZonMW Off Road. This work was supported by grant [R01 LM010098] from the National Institutes of Health (USA) and by EU/EFPIA Innovative Medicines Initiative 2 Joint Undertaking BigData@Heart grant no. 116074, as well as by the UKRI/NIHR Multimorbidity fund Mechanism and Therapeutics Research Collaborative MR/V033867/1, and the Rosetrees Trust. This publication is part of the project “Computational medicine for cardiac disease” with file number 2023.022 of the research programme “Computing Time on National Computer Facilities” which is (partly) financed by the Dutch Research Council (NWO).
Author information
Authors and Affiliations
Contributions
A.F.S., F.W.A., A.T.R. contributed to the idea and design of the study. A.F.S. performed the analyses. A.F.S. and A.T.R. drafted the manuscript. C.F., Jv.S., E.P.A., B.R., M.B., A.I.A., B.K.V., F.W.A., A.T.R. provided critical input on the analyses and the drafted manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare the following competing interests: AFS and FWA have received Servier funding for unrelated work. AFS and CF have received funding from NewAmsterdam Pharma for unrelated work. AFS is an Editorial Board Member for Communications Medicine but was not involved in the editorial review or peer review, nor in the decision to publish this article. The remaining authors declare no competing interests.
Peer review
Peer review information
Communications Medicine thanks the anonymous reviewers for their contribution to the peer review of this work.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Schmidt, A.F., Finan, C., van Setten, J. et al. A Mendelian randomization analysis of cardiac MRI measurements as surrogate outcomes for heart failure and atrial fibrillation. Commun Med 5, 130 (2025). https://doi.org/10.1038/s43856-025-00855-1
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s43856-025-00855-1






