Partitioned polygenic scores show mechanistic heterogeneity in type 2 diabetes and hypertension comorbidity

Pascat, Vincent; Zudina, Liudmila; Maurin, Lucas; Ulrich, Anna; Maina, Jared G.; Demirkan, Ayse; Balkhiyarova, Zhanna; Pupko, Igor; Sharhorodska, Yevheniya; Pattou, François; Staels, Bart; Kaakinen, Marika; Khamis, Amna; Bonnefond, Amélie; Munroe, Patricia; Froguel, Philippe; Prokopenko, Inga

doi:10.1038/s41467-025-67449-2

Download PDF

Article
Open access
Published: 09 February 2026

Partitioned polygenic scores show mechanistic heterogeneity in type 2 diabetes and hypertension comorbidity

Nature Communications volume 17, Article number: 1446 (2026) Cite this article

6165 Accesses
1 Citations
97 Altmetric
Metrics details

Subjects

This article has been updated

Abstract

Type 2 diabetes and hypertension are common health conditions that often occur together, suggesting shared biological mechanisms. To explore this relationship, we analyse large-scale multiomic data to uncover genetic factors underlying type 2 diabetes and blood pressure comorbidity. We curate 1304 independent single-nucleotide variants associated with type 2 diabetes and blood pressure, grouping them into five clusters related to metabolic syndrome, inverse type 2 diabetes/blood pressure risk, impaired pancreatic beta-cell function, higher adiposity, and vascular dysfunction. Colocalization with tissue-specific gene expression highlights significant enrichment in pathways related to thyroid function and fetal development. Partitioned polygenic scores derived from these clusters improve risk prediction for type 2 diabetes/hypertension comorbidity, identifying individuals with more than twice the usual susceptibility. These results reveal a mechanistically heterogeneous genetic architecture shared between type 2 diabetes and blood pressure, enhancing comorbidity risk prediction. Partitioned polygenic risk scores offer a promising approach for early risk stratification, personalised prevention, and improved management of these interconnected conditions.

Polygenic risk score for type 2 diabetes shows context-dependent effects across populations

Article Open access 01 October 2025

Diabetes mellitus polygenic risk scores: heterogeneity and clinical translation

Article 04 June 2025

The effect of type 2 diabetes genetic predisposition on non-cardiovascular comorbidities

Article Open access 10 October 2025

Introduction

Hypertension and type 2 diabetes (T2D) pose major public health challenges, affecting approximately 1.28 billion¹ and 537 million adults worldwide², respectively, with the prevalence of T2D expected to rise to 1.3 billion by 2050. T2D and high blood pressure (BP) frequently co-occur in the same individual^3,4,5. The T2D-BP comorbidity further increases the risk of major health outcomes, and individuals with both conditions often face challenges in achieving treatment objectives⁶. T2D and high BP are key components of the Metabolic Syndrome (MetS), also encompassing various cardiovascular risk factors, including central obesity, dyslipidaemia, microalbuminuria, and insulin resistance (IR)^7,8.

Extensive genetic research, notably through recent genome-wide association studies (GWAS), has dissected the underlying genetic architecture of both T2D and BP traits independently. Latest reports associated 1289 independent variants in DNA with T2D⁹, while 2103 variants are implicated in BP control¹⁰. These findings highlight the complex genetic architecture of T2D and BP traits/hypertension, emphasising their diverse genetic drivers.

Despite significant advances in understanding the genetics of T2D and high BP as independent conditions, the shared genetic basis underlying their frequent comorbidity remains largely unexplored. This gap persists even though T2D–high BP comorbidity has been consistently observed, including within genetic datasets^11,12,13. Several Mendelian randomisation (MR) studies have yielded conflicting evidence about the causal relationship between T2D and high BP. For instance, Sun et al. identified T2D as a driver of high BP, while Aikens et al. proposed the opposite^14,15. Additionally, another study found that two of four types of hypertensive medications were protective against T2D risk, while the others increased its risk¹⁶. These findings highlight the diverse and complex pathways underpinnings the T2D-BP relationship.

In this study, we aimed to characterise the shared pathophysiological processes underlying the T2D-BP relationship by harnessing large-scale genomic datasets from recent research on both conditions. Using common genetic variation, we sought to enhance the mechanistic understanding of these diseases comorbid status and suggest potential avenues for targeted interventions and precision health.

We aggregated genomic data from 45 GWAS for related conditions and traits/endophenotypes, 50 tissue-specific expression quantitative trait loci (eQTL)^17,18,19, assay for transposase-accessible chromatin using sequencing peaks from single-cell (scATAC-seq) atlas²⁰, and the UK Biobank (UKB) cohort²¹. By leveraging GWAS summary statistics, we assessed the genetic correlation between T2D²² and systolic BP (SBP), diastolic BP (DBP), and pulse pressure (PP = SBP-DBP)¹². We clustered the T2D-BP-associated independent single-nucleotide variant (SNV) effects into distinct groups based on their underlying pathogenetic processes. We observed the cluster-associated changes in gene expression through colocalization analysis with eQTL and enrichment in the scATAC-seq atlas. We finally evaluated the cluster-specific risks of complication using partitioned polygenic scores (PGS) in 459,247 individuals (Fig. 1).

Results

Genetic overlap between T2D and BP

We explored the genetic relationships between T2D and BP by evaluating the overall genetic correlation and associated loci that overlap between the two conditions. We performed linkage disequilibrium (LD) score regression using ldsc²³ and observed a direct genetic correlation between T2D and SBP (r_g[SE] = 0.25[0.028], p = 1.56 × 10⁻¹⁹), DBP (r_g[SE] = 0.18[0.027], p = 1.38 × 10⁻¹¹), and PP (r_g[SE] = 0.23[0.029], p = 2.25 × 10⁻¹⁵), consistent with previous research.

To further validate the LD score regression results, we constructed PGSs for T2D, SBP, DBP and PP in the UKB (Supplementary Table 1) using independent weights from GWASs (“Methods”). We probed whether a genetic predisposition towards one condition could predict the risk of the other using comorbidPGS²⁴. T2D PGS was consistently associated with a modest increase in SBP, DBP, and PP (Beta_PP[SE] ≥ 0.37[0.017] change in PP mmHg per one-unit increase in T2D PGS, p ≤ 1.83 × 10⁻¹⁰⁶). SBP and PP PGSs were significantly associated with a higher risk of T2D (OR_SBP[95% CI] = 1.07 [1.06–1.09] change in T2D odds per one-unit increase in SBP PGS, p = 9.36 × 10⁻³⁵; OR_PP[95% CI] = 1.07 [1.06–1.08], p = 8.02 × 10⁻³¹). In contrast, DBP PGS had no impact on T2D risk (OR_DBP[95% CI] = 1.01[0.999–1.02], p = 0.062, Supplementary Table 2).

We gathered a collection of 1401 SNVs associated with T2D, high SBP, DBP, and/or PP (“Methods”). We revealed 24/19/26 overlapping genetic loci between T2D and SBP/DBP/PP, respectively, determined by LD and/or genomic proximity (Supplementary Data 1). Of the 1401 SNVs, 9 were directly reported as lead signals for both T2D and BP traits. Additionally, we identified 97 SNV pairs that overlapped (within 500 kb or LD r² > 0.2) and were associated with T2D and BP traits. We observed several well-known loci, such as those at GRB14-COBLL1 (associated with reduced insulin level, pulse pressure, and mean arterial pressure)^25,26, ADCY5 (beta cell function and lipodystrophy)²⁷, and ACE (renin-angiotensin system, hypertension) genes²⁸. Overlapping loci at JAZF1 (regulating glucose, lipid, and inflammation)²⁹, ADRB1 (beta-adrenergic receptors regulating cardiac contractility and heart rate)³⁰, TCF7L2 (controlling Langerhans islet proliferation)³¹, and SGIP1 (signalling in energy homoeostasis)³² contribute to the inverse relationship between T2D and BP. These results highlight the dense and complex genetic relationships between high BP predisposition and T2D risk.

Clusters of pathogenetic processes

To dissect the complexity of shared biological pathways between T2D and BP, we curated and refined the SNV list to 1304 independent variants (LD r² < 0.2), including 500 T2D-associated and 813 BP-associated SNVs, to cluster them based on their effects on 45 related (endo)phenotypes, including T2D/BP traits (Supplementary Data 2). Our investigation encompassed a wide array of related endpoints or risk factors, including biomarkers of inflammation and hepatic function, circulating plasma lipids, cardiovascular health indicators, anthropometric measures, glycaemic traits, and sex hormones (Supplementary Data 2). All SNVs (originally associated with T2D, BP traits, or both) were aligned to the T2D risk allele. When information for a particular phenotype at an SNV was unavailable, we used LD proxies (“Methods”)³³, and performed imputation of the remaining missing data by random forest algorithm implemented in the imputeSCOPA software tool³⁴. We used an unsupervised hierarchical clustering approach, given the anticipated heterogeneity within our SNV set induced by the inherent complexity in both T2D and BP signals³⁵. To ensure robustness of our clustering, we ran extensive sensitivity analyses (“Methods”) with different sets of metabolic traits and other clustering methods, such as using Z-score adjusted for GWAS sample size, MRClust³⁶ and Bayesian nonnegative matrix factorization (bNMF) (“Methods”, Supplementary Figs. 1–4). We identified five clusters of distinct pathogenetic mechanisms (Fig. 2), highlighting mechanistic heterogeneity in T2D-BP comorbidity. We compared the SNV assignments of our T2D-BP clusters with recent T2D hierarchical clustering⁹ and bNMF clustering^37,38 (“Methods”, Supplementary Data 3, 4, Supplementary Figs. 5, 6). The pathophysiological processes identified across the five clusters were consistent with existing evidence and highlight mechanistic insights (“Methods”, Supplementary Figs. 4–7)^9,12,37.

**Fig. 2: Clustering heat map of endophenotypes with five pathogenetic SNV clusters associated with high BP and/or risk of T2D.**

The Metabolic Syndrome cluster included 215 variants, and displayed the most distinct pathogenetic signature. It highlights attributes consistent with the metabolic syndrome, including lower levels of sex hormones (sex-hormone binding globulin, insulin, testosterone)^39,40, higher central adiposity (waist-to-hip ratio [WHR] adjusted for body-mass index [BMI])⁴¹ without higher overall adiposity, measured by BMI, systemic higher IR evaluated by the homoeostasis model assessment of insulin resistance, HOMA-IR, using both fasting plasma glucose and insulin (alongside higher HOMA-B, proinsulin level, and insulin fold change), lower high-density lipoprotein (HDL) cholesterol, higher triglycerides (TG), and altered cardiovascular functions (higher heart rate, increased cardiovascular event risk, higher renin-angiotensin-aldosterone system activity)⁸. SNVs within this cluster also strongly associate with shorter stature (lower height) and lower birth weight.

Previous findings reported that shorter stature is associated with a higher risk of T2D⁴² and cardiovascular events^38,43. Other studies linked greater height with insulin and insulin-like growth factor signalling pathways⁴⁴. The impaired insulin sensitivity may be one of the underlying factors in this association^45,46.

The T2D–high BP comorbidity is high in this cluster and was consistent with our bNMF clustering (Supplementary Fig. 3). The origin of the SNVs is an equal mix of T2D and BP (Supplementary Fig. 1). When comparing SNVs with previous T2D clustering, we observed an overlap between the SNVs in our Metabolic Syndrome cluster and the Type 2 Diabetes Global Genetics Initiative (T2DGGI) Metabolic syndrome cluster, as well as the bNMF cluster Lipodystrophy 1 (Supplementary Fig. 5).

In the Inverse T2D-BP risk cluster, we noted an inverse relationship of associated SNVs effects on higher T2D risk related to lower SBP/DBP/PP. Predominantly originating from associations with BP traits (Supplementary Fig. 1), the 353 SNVs within this cluster, when aligned to the T2D risk allele, are associated with a lower risk of cardiovascular events, such as atrial fibrillation (AF), coronary artery disease (CAD), stroke and heart failure. Additionally, these SNVs demonstrated associations with BMI and systemic higher IR (higher HOMA-IR). Comparison with previously reported BP-related clusters showed partial overlap with the Hypolipidaemia and Short stature SNV groups (Supplementary Fig. 6)³⁸.

The Higher adiposity cluster contained 137 SNVs—predominantly T2D signals—which showcased effects on higher BMI, reduced sex hormones (testosterone and SHBG), higher TG along with lower HDL- and LDL-cholesterol, higher risk of cardiovascular events (CAD, heart rate, stroke), and insulin resistance (higher HOMA-IR/HOMA-B). This cluster, distinct from the Metabolic Syndrome one, showed a high number of obesity-related SNVs within previous T2D clustering (Supplementary Fig. 5). The Vascular Dysfunction cluster included 287 SNVs mostly originating as BP signals. They are associated with cardiovascular traits (higher risk of AF, stroke, CAD, heart failure), lower birth weight and show strong effects on both T2D–high BP. This cluster showed a number of hypolipidemia SNVs from previous BP clustering (Supplementary Fig. 6). Lastly, Reduced beta-cell function cluster exhibited characteristics of impaired beta-cell function including lower homoeostasis model assessment of beta cell function (HOMA-B), higher glucose/glycated haemoglobin levels (random glucose [RG], HbA1c), metabolic dysregulation (TG, sex hormones), higher inflammation (C-reactive protein [CRP], IGF-1) and taller stature (height). The Reduced beta-cell function cluster contained 312 SNVs, predominantly T2D signals, found in the Beta cell 1 and Beta cell 2 clusters from the latest published T2D bNMF clustering (Supplementary Fig. 5b).

The five distinct mechanistic groups of genetic variants, revealed through clustering, contribute to the shared susceptibility to T2D and high BP. They provide a foundation for further exploration of the biological pathways.

Multiomic characterisation of T2D-BP clusters

To further characterise the T2D-BP comorbidity clusters, we evaluated the changes in gene expression and regulatory elements associated with the clustered SNVs. We first conducted a colocalization analysis to elucidate the impact of studied SNVs on gene expression patterns. We explored the genomic landscape within a 200 kb window surrounding each clustered SNV to assess the likelihood of a shared causal variant between our clusters and gene expression changes across 50 tissues from various eQTLs datasets using a hypothesis-free approach and the coloc R package (“Methods”)⁴⁷. We identified a total of 6321 colocalizations across the 50 tissues, involving 1558 genes and 448 clustered variants (Fig. 3a top, Supplementary Data 5, Supplementary Fig. 8).

**Fig. 3: Characterisation of pathogenetic clusters: genetic expression, and regulatory mechanisms.**

Our analysis revealed distinct gene expression signatures for each cluster, corroborating the diversity of the biological pathways involved. The Inverse T2D-BP risk cluster displays colocalization in the brain, particularly brain cerebellum (MGRN1, HELLS, SLC39A13) and adrenal glands (SLC7A1, RHOC, NUDT2). The Metabolic Syndrome cluster variants colocalized in adipose subcutaneous (JAZF1, ALKAL2, LCORL). The Higher adiposity cluster shows colocalization in skin (MYO19, EIF3C, SLC39A10) and whole blood (WFS1, CCDC134, MED27). The Vascular dysfunction cluster colocalized with fibroblasts (ERI1, FOXD4, RSRC1) and thyroid (CSTB, ZNF638, SNX31) and the Reduced beta-cell function cluster with pancreatic islets (C2CD4B, ADCY5, PHB).

We identified 99 tissue-specific colocalizations (Fig. 3a bottom, “Methods”). While thyroid and adipose subcutaneous tissues showed a high number of total colocalizations, pancreatic islets showed the highest number (14) of single-tissue (i.e., specific) colocalizations, particularly among the clusters strongly associated with risk of T2D such as the Reduced beta-cell function, Metabolic Syndrome and Higher adiposity clusters. Notably, the pancreatic islet tissue-specific colocalized genes include TH (synthesis of catecholamines)⁴⁸ in the Higher adiposity cluster, MTNR1B (circadian rhythms and glucose metabolism)⁴⁹, FXYD2 (Na,K-ATPase pump regulator)⁵⁰, G3BP2 (cellular stress)⁵¹ in the Reduced beta-cell function cluster, SYNDIG1L (synapse development), LTBP3 (cell growth, differentiation and repair)⁵², CLEC18A (immune function)⁵³ in the Metabolic Syndrome cluster (Supplementary Data 6). This demonstrates the predominant role of pancreatic islets in T2D pathogenesis and its related complications.

To explore the underlying mechanisms in the Inverse T2D-BP risk cluster, we conducted pathway analysis using Metascape⁵⁴ for the 202 colocalized genes identified within this cluster (Supplementary Fig. 8). This analysis revealed an overwhelming enrichment in the retinol metabolic process (GO:0042572), which involves one of three compounds that make up vitamin A (retinol, retinal, and retinoic acid). All components of the retinol metabolism are associated with both T2D and CVD⁵⁵.

We then dissected the localisation of our SNVs in a cluster-specific manner using chromatin accessibility atlases from CATLAS, based on scATAC-seq peaks. The atlas encompasses 222 cell types from 30 human adult tissues and 15 fetal tissues, allowing examination of the enrichment of candidate cis-regulatory elements (cCREs) in each cluster across different cell types (Fig. 3b and Supplementary Data 7). The clusters were enriched in diverse regulatory mechanisms. Specifically, the Inverse T2D-BP cluster exhibited significant (p ≤ 2.25 × 10⁻⁴) enrichment for regions of open chromatin in mesothelial cells, endocardial cells, endothelial in exocrine tissue cells as well as fetal endocardial and mesangial cells. The Reduced beta-cell function cluster demonstrated strong enrichment in several fetal cell types, such as islets, gastric goblet, alveolar epithelial, cardiomyocyte, as well as follicular cells and cells from the pancreas tissues, including delta, gamma, beta and alpha. This suggests that, beyond islet dysregulation and insulin impairment, pathways involved in fetal development also play an important role in adult metabolic health. Moreover, nominal enrichments were observed in fetal adrenal cortical cells for the Metabolic Syndrome and Higher adiposity clusters, suggesting hormone regulatory implications beginning as early as intrauterine development⁵⁶.

Given the large number of colocalized SNVs observed across clusters in tissues such as the thyroid, subcutaneous adipose tissue, tibial artery, tibial nerve, and lower leg skin (Fig. 3a), we further explored the colocalized genes by identifying their enrichment in the primary cell types corresponding to these tissues—namely, follicular cells, adipocytes, smooth muscle cells, Schwann cells, keratinocytes (Fig. 3b). Notably, we identified 15 genes that both colocalized in thyroid and were enriched in follicular cell cCREs (Supplementary Table 3). While some of these genes were previously associated with T2D such as CAMK1D (energy homoeostasis and beta-cell receptor signalling pathway)⁵⁷ or KCNH6 (insulin secretion and glucose homoeostasis)⁵⁸, and with BP regulation such as ACE (renin-angiotensin system)²⁸, the remaining are potential candidate genes for the T2D-BP pathogenesis. Among them are SAE1 (known in cancer)⁵⁹, GSAP (known in Alzheimer’s disease)⁶⁰, DCAF7 (cellular differentiation)⁶¹, MAP3K3 (stress and inflammation)⁶². Subsequent Metascape⁵⁴ pathway analysis highlighted fundamental cellular processes, including protein ubiquitination (GO:0016567) and positive regulation of protein modification process (GO:0031401). The overlap of signals between colocalized genes and cCREs in the other four tissues consistently highlighted the MAP3K3 gene. Subsequent pathway analyses did not yield conclusive results (Supplementary Table 4).

Overall, changes in gene expression within clusters highlighted the importance of the thyroid tissue in T2D-BP shared pathophysiology and retinol metabolism within the Inverse T2D-BP risk cluster, while the high number of fetal cell regulatory elements enrichment suggests a strong contribution of intrauterine growth pathways in both T2D and high BP.

T2D-BP comorbidity using partitioned PGSs

To evaluate the ability of the T2D-BP SNV clusters to predict comorbidities and complications, we used the individual-level data from UKB and built unweighted partitioned PGSs for each cluster group. Using the R software environment tool, comorbidPGS, we aligned partitioned PGS to the T2D risk allele, i.e., each allele increasing the risk of T2D is counted as one in the PGS calculation (“Methods”)²⁴.

To illustrate the influence of genetic predisposition, we computed the relative risk of comorbidity among the UKB individuals (“Methods”), based on the top 10% percentiles of each unweighted partitioned PGS. Whereas the overall prevalence of T2D-high BP comorbidity in the UKB was 5.49% (Fig. 4a), individuals in top 10% of the unweighted risk score of Higher adiposity, Metabolic Syndrome, and Reduced beta-cell function clusters had a relative risk RR[95% CI] of 1.36[1.32–1.41], 1.44[1.39–1.48], 1.55[1.51–1.60], respectively. Moreover, the individuals in the top 10% distribution of Metabolic Syndrome and Reduced beta-cells function combined PGSs, derived from 536 SNVs, showed a 2.13[1.96–2.31] fold increased risk of having comorbidity (Fig. 4b and Supplementary Table 5), reaching the same RR as a traditional pruning-and-thresholding (P + T) weighted T2D PGS. Survival analysis, using cumulative hazard plots, indicated that this elevated comorbidity risk was consistent and linear over 15 years of follow-up (year 0 representing the date of first diagnosis with either hypertension or T2D, Fig. 4c and Supplementary Fig. 11). This suggests that individuals with high PGS distributions remain at increased risk of comorbidity throughout their life course. Consequently, partitioned PGSs enhance the predictive ability to identify high-risk individuals at an earlier age^63,64.

**Fig. 4: The T2D-BP comorbidity risks stratified by partitioned PGSs in the UKB.**

Using the partitioned PGSs, we evaluated the association between PGS and multiple sets of complications based on the UKB hospital records (Fig. 5 and Supplementary Data 8). We detected a reciprocal protective effect of the Inverse T2D-BP cluster PGS on essential hypertension (OR[95% CI] = 0.91[0.90–0.92], p < 1.00 × 10⁻⁴⁰) alongside other circulatory system disorders such as coronary artery disease (CAD, OR[95% CI] = 0.94[0.93–0.95], p = 5.08 × 10⁻²²), angina pectoris (OR[95% CI] = 0.95[0.94–0.97], p = 1.29 × 10⁻¹²), AF (OR[95% CI] = 0.96[0.94–0.97], p = 1.01 × 10⁻¹⁰), chronic ischaemic heart disease (OR[95% CI] = 0.96[0.95–0.97], p = 9.08 × 10⁻¹⁰). The Inverse T2D-BP risk cluster PGS also associates with lower risk of gout (OR[95% CI] = 0.92[0.89–0.95], p = 1.10 × 10⁻⁶) and hypercholesterolaemia (OR[95% CI] = 0.98[0.97–0.99], p = 1.90 × 10⁻¹⁷). These results support previous research on the heterogeneous effects of hypertensive medications on the risk of T2D, indicating that some biological processes between T2D and high BP may reduce the risk of comorbidity¹⁶.

**Fig. 5: Association between complications and partitioned PGSs after clustering in the UKB.**

We confirmed the high contribution into T2D-BP comorbidity of the Metabolic Syndrome cluster, by showing that its relatively small number of SNVs could predict risk of multiple metabolic disorders, including T2D (OR[95% CI] = 1.24[1.23–1.26], p < 1.00 × 10⁻⁴⁰), hypertension (OR[95% CI] = 1.13[1.12–1.14], p ≤ 1.00 × 10⁻⁴⁰), hypercholesterolaemia (OR[95% CI] = 1.10[1.09–1.10], p ≤ 1.00 × 10⁻⁴⁰), hyperlipidaemia (OR[95% CI] = 1.10[1.08–1.13], p = 6.52 × 10⁻²⁰), fatty liver (OR[95% CI] = 1.13[1.09–1.18], p = 6.08 × 10⁻¹⁰), and hypothyroidism (OR[95% CI] = 1.03[1.02–1.04], p = 1.65 × 10⁻⁶). Additionally, the Metabolic Syndrome cluster PGS showed significant association with risk of cardiovascular complications such as CAD, angina pectoris, ischaemic heart disease, heart failure, and myocardial infarction. We also detected association with higher risk of kidney failure and calculus of kidney.

The Higher adiposity PGS showed the strongest risk prediction of obesity-related diseases, including T2D (OR[95% CI] = 1.22[1.20–1.23], p ≤ 1.00 × 10⁻⁴⁰), hypertension (OR[95% CI] = 1.09[1.09–1.10], p ≤ 1.00 × 10⁻⁴⁰), sleep apnoea (OR[95% CI] = 1.17[1.14–1.20], p = 1.28 × 10⁻³⁴), osteoarthritis (OR[95% CI] = 1.08[1.07–1.10], p = 1.51 × 10⁻³¹), carpal tunnel syndrome (OR[95% CI] = 1.09[1.07–1.11], p = 6.61 × 10⁻²⁰), and pneumonia (OR[95% CI] = 1.08[1.06–1.10], p = 3.92 × 10⁻¹²). This cluster PGS was associated with higher mental disorders, such as major depressive disorder, delirium or behavioural disorders due to use of tobacco, highlighting the intertwined relations between obesity and depressive conditions.

Among other clusters, the Vascular Dysfunction cluster was more predictive for cardiovascular complications, including hypertension (OR[95% CI] = 1.12[1.11–1.13], p < 1.00 × 10⁻⁴⁰), CAD (OR[95% CI] = 1.06[1.05–1.07], p = 4.25 × 10⁻²³), or ischaemic heart disease (OR[95% CI] = 1.06[1.04–1.07], p = 1.05 × 10⁻¹⁵). The Reduced beta-cell function unweighted PGS showed the strongest association with risk of T2D (OR[95% CI] = 1.33[1.32–1.35], p < 1.00 × 10⁻⁴⁰) and its related consequences, including obesity (OR[95% CI] = 1.03[1.02–1.05], p = 4.38 × 10⁻⁹), hyperlipidaemia (OR[95% CI] = 1.08[1.06–1.10], p = 5.17 × 10⁻¹³), fatty liver (OR[95% CI] = 1.09[1.05–1.14], p = 1.62 × 10⁻⁵), chronic kidney disease (OR[95% CI] = 1.10[1.07–1.12], p = 6.47 × 10⁻¹⁹), and hypothyroidism (OR[95% CI] = 1.03[1.01–1.04], p = 2.43 × 10⁻⁵). The Reduced beta-cell function weighted PGS (“Methods”) showed strong association with high SBP and PP, albeit not with DBP (Supplementary Data 9).

The partitioned PGSs effectively delineated the differences in prediction among the T2D-BP cluster SNVs. Partitioned PGSs shows that grouping of SNVs can highlight related comorbidities through different pathophysiological processes.

Discussion

In this large-scale multiomic study, we explored the complex genetic underpinnings of the comorbid relationship between T2D and high BP. Our analysis confirms and extends prior evidence of a direct genetic correlation and a large overlap in genetic signals shared between T2D and high BP^65,66. This observation is not unexpected, given the well-established comorbidity between T2D and hypertension—largely attributable to shared environmental and biological risk factors such as adiposity. Our approach extends beyond this by leveraging clustering of genetic variants to partition the T2D-BP genetic architecture into biologically coherent groups.

We curated a set of genome-wide significant common SNVs associated with T2D and high BP, thereby enriching for variants with stronger and trait-specific effects, and reducing the influence of broader, less specific cross-trait associations^67,68. Through hierarchical clustering using T2D-BP related endpoints and risk factors, we identified five clusters of SNVs, each highlighting unique pathogenetic processes underlying the T2D-BP relationship. This clustering approach provided a clearer delineation of genetic relationships, reducing heterogeneity compared to other clustering methods. Four of these SNV clusters—Metabolic Syndrome, Higher adiposity, Vascular dysfunction, and Reduced beta-cell function—align with established findings in high BP or T2D^27,69,70,71. We discovered an intriguing cluster of variants with an Inverse T2D-BP risk profile, implicating retinol metabolism. Although recent meta-analysis on the role of retinol in T2D and BP regulation have yielded inconsistent results, one retinol derivative, retinoic acid, has consistently been linked to higher IR and reduced cardiovascular events⁵⁵. While we propose a mechanistically plausible pathophysiological hypothesis for inverse T2D-BP risk effects, these patterns may also arise from incomplete overlap in the genetic architectures underlying T2D and BP. The Metabolic Syndrome cluster was associated with features such as shorter stature, higher WHR, and no detectable effect on BMI. These features are consistent with the International Diabetes Federation (IDF) definition of MetS⁷², which emphasises central obesity—better captured by WHR than BMI—as the primary driver. No detectable effect on BMI in this cluster likely reflects the specific contribution of visceral adiposity to cardiometabolic risk, rather than overall body adiposity, highlighting the relevance of WHR-linked pathways in MetS pathophysiology. Additionally, we observed an enrichment in colocalizations specific to the thyroid across all five T2D-BP clusters, suggesting a mechanistic role for thyroid function in T2D-BP comorbidity. This feature further highlights a need for better thyroid health in the general population to reduce impact of dysthyroidism on BP and T2D management⁷³.

We bring forward a property of partitioned PGS to differentially predict related T2D–high BP conditions⁷⁴. While PGS has shown predictive value for an expanding array of common diseases, such as for instance CAD^75,76, its clinical application remains limited due to a range of pitfalls⁷⁷. In this study, we report how specific clusters, particularly the Metabolic Syndrome and Reduced beta-cell function, can identify a sub-population of individuals bearing over twice the general population risk of T2D-BP comorbidity. Our study calls for partitioning of PGSs to predict complications in metabolic disorders like T2D and high BP, suggesting that in comorbidity risks, fewer SNVs could be more impactful than an entire genome-wide PGS to stratify the individuals at high risk of complications. For the survival analysis, individuals were assigned to the cluster corresponding to their highest partitioned PGS, to emulate a potential clinical framework where patients could be stratified into their predominant mechanistic risk pathway for targeted prevention. While this approach simplifies the genetic architecture, it illustrates how refined, cluster-specific PGSs could ultimately complement existing clinical tools such as QDiabetes⁷⁸ and QRisk⁷⁹ to enable earlier, pathway-informed interventions. This work paves the way for improved risk stratification and precision health approaches⁷⁶.

We acknowledge several caveats in this work. The pathophysiological mechanisms involved in both T2D and high BP are not fully explained by genetics alone, and are influenced by a variety of external and environmental factors, such as salt consumption or western diet. Moreover, the heterogeneity in sample origins, study designs, and sample sizes across the diverse included datasets could potentially reduce our ability to identify independent pathways. While the inclusion of individuals with potentially pre-existing comorbid conditions in GWAS studies of specific phenotypes, here T2D and BP, is often inevitable to capture the genetic architecture, it could somewhat inflate the estimated shared genetics due to phenotypic correlation. To mitigate the possibility of sample contamination bias, we used different subsets of T2D-BP GWASs for analyses that are sensitive to linkage disequilibrium structure. We mitigated the sample origin bias by prioritising datasets with multiple ancestries including European⁸⁰. Hierarchical clustering can force an SNV to fit a cluster that may not fully capture its effect. To mitigate this, we compared and found consistent results with previous clustering studies and sensitivity analyses using other clustering methods, such as bNMF²⁷. Finally, we chose to focus on clustered genome-wide significant common variants to provide a clear and interpretable framework for dissecting the T2D-BP shared genetic relationships. While this strategy highlights robust mechanistic clusters, it may overlook a sizeable portion of the polygenic signal. Future studies could apply complementary approaches such as expanding this study to whole-exome or whole-genome datasets, or using multivariate GWAS or genomicSEM^40,81 to further refine the understanding of T2D-BP relationship.

In this study, we highlight the mechanistic heterogeneity underlying T2D and high BP, demonstrating that their genetic relationship is not driven by a single shared pathophysiological process but instead arises from five distinct mechanistic pathways contributing to the T2D-BP comorbidity. The partitioned PGSs, derived through clustering, enable modelling of differential lifetime risk trajectories associated with T2D-BP comorbidity. These results provide a framework for future investigations into stratified risk prediction and pathophysiology-informed approaches to manage comorbid conditions.

Methods

Material description

We collected 45 publicly available GWAS summary statistics (Supplementary Data 2), spanning from 2017 to 09 June 2023. GWAS were selected if they included a large number of participants (N ≥ 10,000) and a preference for datasets with diverse ancestral backgrounds, involving the majority of individuals of European ancestry, to enhance genetic diversity in our study. European-only studies were included when multi-ancestry data were unavailable. Particularly for T2D, we used Mahajan et al.²² GWAS (2018b) as it allows us to alternate between a GWAS with UKB individuals or not, accessible on the DIAGRAM/DIAMANTE/T2DGGI webpage https://diagram-consortium.org/downloads.html. We alternated between using the Warren et al. GWAS⁷¹ based on the UKB and a subset version of the Evangelou et al. GWAS¹² without UKB (only ICBP data). Both T2D/BP GWASs without UKB individuals were used only for two specific analyses: LD score regression and PGS.

To ensure robustness in uncovering shared aetiologies, we gathered from multiple sources a list of T2D^{82,83,84,85,86} and BP SNVs^{13,30,87,88,89,90,91,92,93,94,95,96,97} reaching genome-wide significance (i.e., p < 5 × 10^-8) and characterised by minor allele frequency (MAF) > 0.01. SNV could have been reported for either one of the three BP metrics (SBP/DBP/PP) or it was reported for T2D. We identified a total of 1401 SNVs. This list was further curated for clustering to keep only independent SNVs (LD r² < 0.2). We did not apply additional pruning based on genetic distance. This decision was made to preserve the potential for capturing pleiotropic variants and regional architectures that could contribute to comorbidity between T2D and BP traits. The resulting list encompasses 1304 SNVs, including 500 T2D and 283/272/270 SBP/DBP/PP signals, with 9 unique SNVs demonstrating associations with T2D-BP comorbidity.

The UK Biobank (UKB, https://ukbiobank.ac.uk/) is a large prospective cohort study with genotypic and phenotypic data²¹. The dataset in this study derived from genome-wide imputed data, including 459,247 individuals of European ancestry. To identify individuals with T2D, we leveraged hospital admission records, self-reports, and ICD10/9 codes, successfully defining 33,446 T2D individuals. Hypertension was assessed using three blood pressure metrics: SBP, DBP, and PP, with both automated and manual records available in the UKB¹². We used the mean value when more than one value was found for a given individual and adjusted for medication use by adding 15 mmHg to SBP and 10 mmHg to DBP for individuals with reported data on blood pressure-lowering medication⁹⁸. We defined the outcome “hypertension” used in the last section of the Results (unweighted PGS evaluation) as either having SBP ≥ 150 mmHg, or DBP ≥ 90 mmHg, or taking blood pressure lowering medication. The complete criteria used to identify individuals with T2D and hypertension are illustrated in Supplementary Fig. 10. We identified 217,599 hypertensive individuals in the UKB (Supplementary Table 1).

The Genotype-Tissue Expression (GTEx, https://www.gtexportal.org/home/datasets/) is a project gathering samples from 49 non-diseased tissue across 1000 deceased individuals¹⁷. We used the expression quantitative trait loci (eQTLs) mapping genetic variants with changes in the expression of nearby genes.

The ABOS cohort is an ongoing prospective study that aims to identify the determinants of bariatric surgery outcomes, initiated at Lille University Hospital (Lille, France) in 2006. The study protocol has been previously detailed elsewhere (clinicaltrials.gov, NCT01129297)¹⁹. A total of 372 individuals of European descent were included in the ABOS liver eQTL study.

The translational human pancreatic islet genotype tissue-expression resource (TIGER, http://tiger.bsc.es/) is a large meta-analysis of cohorts aggregating more than 500 human islet genomic datasets from five cohorts in the Horizon 2020 consortium T2DSystems¹⁸.

The cis-element ATLAS (CATLAS, http://catlas.org/humanenhancer/) is a comprehensive resource gathering the genome-wide cCREs in 222 human cell types. The dataset encompasses chromatin accessibility data derived from single-cell Assay for Transposase-Accessible Chromatin (ATAC-seq) peaks spanning 30 human adult and 15 human fetal tissues.

We systematically applied multiple correction testing and adjusted the appropriate individual-level associations for age, sex, genotyping array and the first six principal components derived from genetic data.

Genetic correlation

We conducted genetic correlation analysis by using the whole T2D²² and BP¹² GWAS datasets of European ancestry only without UKB individuals. We estimated the liability-scale heritability (${h}_{2}$) of each phenotype and their genetic correlation (${r}_{g}$) by employing linkage disequilibrium score regression via ldsc v1.0.1²³. We used pre-computed LD scores from 1000 Genome Phase 3 SNVs in individuals of European ancestry. We manually aligned the GWAS datasets and excluded outlier SNVs, that are multi-allelic, poorly-imputed (by filtering to HapMap3 SNVs), have a MAF > 0.01 and 0 < p ≤ 1.

Reciprocal risk prediction using PGS

We constructed PGS before and after clustering with plink v1.9⁹⁹, using specific SNV sets, weights from T2D²², SBP, DBP, and PP¹² summary statistics (GWAS without UKB individuals) and individual-level data from the UKB. We carefully selected weights without UKB (Supplementary Data 2 in italic) to avoid sample overlap between the base data (composed of GWAS significant SNVs plus their weights) and the target data (individual-level data from the UKB). The Pruning and Thresholding (P + T) method was used before making PGS by selecting the only GWAS significant SNVs for each desired outcome^100,101. We subsequently used comorbidPGS v1 to conduct linear or logistic regression to evaluate the shared predisposition between PGS for the i-th trait and the j-th target phenotype Y, correcting for covariates including age, genetically-inferred sex, genetic array, and the first six principal components as following²⁴:

$$E\left({Y}_{{kj}}\right) ={\alpha }_{i}{{PGS}}_{{ki}}+{\alpha }_{{{\mathrm{age}}}}{{{\mathrm{age}}}}_{k}+{\alpha }_{{{\mathrm{sex}}}}{{{\mathrm{sex}}}}_{k}+{\alpha }_{{{\mathrm{array}}}}{{{\mathrm{array}}}}_{k} \\ +{\sum}_{x=1}^{6}{\alpha }_{x}P{C}_{{kx}}+{\alpha }_{0}$$

Where ${Y}_{{kj}}$ is the value of the j-th phenotype of the k-th individual (1 or 0 for T2D, continuous value in mmHg for BP traits), ${{{\mathrm{PGS}}}}_{{ki}}$ is the PGS for the i-th trait and the k-th individual, ${{{\mathrm{age}}}}_{k}$, ${{{\mathrm{sex}}}}_{k}$, ${{{\mathrm{array}}}}_{k}$ and $P{C}_{{kx}}$ being the respective covariates for the k-th individual. The $\alpha$ are the regression coefficients, ${\alpha }_{0}$ is the intercept.

Genetic overlap

We estimated the genetic overlap among the pre-curated non-independent 1401 SNVs by identifying variants within 500 kb of each other, or in LD r² > 0.2 for individuals of European ancestry. The reported loci are the proximal ones, identified by biomaRt¹⁰². We identified and coloured the signals per clusters in Supplementary Data 1.

Clusters of pathogenetic processes

We aggregated the 1304 independent T2D-BP SNVs and investigated 45 endophenotypes GWAS summary statistics including T2D/BP (Supplementary Data 2) to cluster them into distinct groups based on pathogenetic processes. To reduce/avoid missingness across datasets, we identified high-LD proxies (LD r² > 0.6) using European ancestry reference panel from the 1000 Genomes Project Phase 3. Although some of the GWAS datasets were of multiple ancestries, we prioritised datasets comprising European-only studies or those with a high proportion of European ancestry participants. This approach mitigated potential discrepancies in LD patterns between the GWAS datasets and the reference panel. For the i-th SNV of the j-th trait, we extracted the beta coefficient, ${\beta }_{{ij}}$, along with its standard error, ${s}_{{ij}}$, to build z-scores using the following formula ${Z}_{{ij}}=\frac{{\beta }_{{ij}}}{{s}_{{ij}}}$. Any signal exhibiting more than 20% missingness across the GWAS was excluded. We conducted imputation of the remaining missing z-scores using a random forest algorithm from imputeSCOPA³⁴. For each GWAS, we truncated SNVs if their absolute z-score exceeded two standard deviations.

We performed hierarchical clustering with the Ward method from the R function hclust, using Euclidean distance and the R package pheatmap for plotting (Supplementary Fig. 1). We determined the number of clusters by taking half of the maximum Euclidean tree-row distance. Our methodology, termed ‘hard clustering’, ensures precise assignment of each SNV to a single cluster. Each variant is treated as a unit, in opposition with approaches that treat positive and negative associations independently. Using a ‘hard clustering’ method offers a clearer delineation of clustering patterns. However, it may force a SNV into a group where is does not truly belong^9,37.

To validate our clusters, we compared them with those identified in the latest T2DGGI study⁹ and both latest ‘soft’ clustering for T2D³⁷ and BP³⁸. First, we looked at SNV assignment between our T2D-BP clusters, T2DGGI clusters (Supplementary Fig. 5a), latest T2D ‘soft’ clusters (Supplementary Fig. 5b), and latest BP ‘soft’ clusters (Supplementary Fig. 6 and Supplementary Data 3). To do so, we systematically looked for LD proxies (LD r² > 0.6) in the three clusters. We assign a SNV into ‘soft’ clusters based on weight >0.75 (Supplementary Data 3 and Supplementary Figs. 5, 6). Second, we performed GWAS weight comparison specifically between our T2D-BP hierarchical clusters (k = 5) and the T2DGGI hierarchical clusters (k = 8) by extracting each cluster weights for the common GWAS endophenotypes. We then calculated the Pearson correlation coefficient between the sets of trait cluster weights. Notably, our T2D-specific clusters showed significant correlations: the Higher adiposity cluster aligned with T2DGGI ‘Obesity’ cluster, the Metabolic Syndrome cluster was correlated with ‘Lipodystrophy’, ‘Metabolic syndrome’, and ‘Residual glycaemic’. The Reduced beta-cell cluster exhibited correlation across all T2DGGI clusters, particularly with ‘Residual glycaemic’ and ‘Obesity’ (Supplementary Fig. 7).

Additional sensitivity analyses include the use of three alternative clustering methods: adjusting Z-score for GWAS sample size prior to clustering (using the following formula ${Z}_{{ij}}=\frac{{\beta }_{{ij}}}{{s}_{{ij}}\times \sqrt{{N}_{j}}}$, Supplementary Fig. 2), MRClust³⁶ (Supplementary Fig. 3) and a ‘soft clustering’ method, bNMF²⁷ (Supplementary Fig. 4). In Supplementary Fig. 2a, b, we showed a comparative hierarchical clustering using our approach without GWAS sample size adjustment, and another approach by adjusting for sample size (dividing Z by $\sqrt{N}$, N being the sample size for a given GWAS), as it has been done in other studies⁹. We decided to use unadjusted Z-scores as main analysis as we wanted to prioritise robust, high-confidence associations. We showed in Supplementary Fig. 2c that the cluster assignment has followed the same patterns with or without adjustment (Chi-squared test of independence p < 2 × 10⁻¹⁶). We compared our bNMF results (k = 8) with our hierarchical clustering (k = 5) by merging the variant weights for each cluster of both methods. Logistic regression models were then employed to assess the association between the ‘hard’ cluster memberships and the ‘soft’ clustering weights. Specifically, for each hierarchical cluster, we modelled the binary cluster membership (0/1) as a function of the corresponding bNMF-derived weights. The models’ deviances were compared through ANOVA to evaluate the enrichment and significance of the clustering concordance (Supplementary Fig. 4c).

To improve interpretability of the clustering, we performed linear regression with the lm R function, across the SNVs within the k-th cluster on the i-th phenotype, defined by $E\left({Z}_{{ij}}\right)={\sum }_{k}{\alpha }_{{ik}}{C}_{{kj}}$, where ${C}_{{kj}}$ is a variable taking the value 1 if the j-th SNV was assigned to the k-th cluster, 0 otherwise. The resulting regression (Supplementary Data 4) is represented in Fig. 2 heatmap, with the p value denoting the colour intensity, and the colour indicating the sign of the regression coefficient. PAI 1, oestradiol (f and m), and age at menopause are not shown in Fig. 2 to improve readability.

Colocalization analysis

To identify the signals that share the same causal variants in T2D-BP GWASs and eQTLs, we systematically conducted Bayesian colocalization with coloc.abf ⁴⁷, between the clustered SNVs and eQTLs derived from 50 tissues from GTEx (48 tissues), ABOS (liver) and TIGER (pancreatic islet) datasets. We used an hypothesis-free approach. We assessed colocalization with the ‘lead’ GWAS per cluster (i.e., the main origin of the clustered SNVs, SBP GWAS for Inverse T2D-BP and Vascular dysfunction, T2D GWAS for Metabolic syndrome, Higher adiposity and Beta cell clusters). For each SNV within the 1304 clustered SNVs, we considered a genomic region of 100 kb up and downstream, and evaluated whether the variant colocalized with genetic expression in that region. Colocalization was asserted if the region was well-characterised (containing between 100 and 1000 SNV within eQTL), the posterior probability of the model sharing a single causal variant (H4) exceeded 80%, and the posterior probability of distinct causal variants with both traits (H3) was below 50%. We excluded variants within the MHC region (chr6: 25,000,000–35,000,000).

We present in Fig. 3a the count of colocalized genes across tissues (bar) and clusters (colours). We represented in this figure the top tissues based on their overall number of colocalization (N_coloc ≥ 100). The lower portion depicts the number of tissue-specific colocalizations, i.e., when the index SNV is associated to only one tissue-specific eQTL. The complete number of colocalization per cluster and tissue is represented in Supplementary Fig. 8. The colour intensity indicates the percentage of colocalized genes per cluster. Caution is warranted in interpreting the results as there is currently no robust method to rule out horizontal pleiotropy¹⁰³.

Subsequent pathway analysis of the thyroid eQTL and Inverse T2D-BP risk cluster colocalized genes was performed using metascape⁵⁴. For thyroid eQTL, we identified 244 colocalized genes and 75 SNVs in cCRE regions in follicular cells (used as major cell type within the thyroid). Among them, 15 signals were both associated with a colocalized gene in thyroid and a cCRE region in follicular cells. We used these remaining 15 colocalized genes with metascape default parameters. We performed the pathway analysis using 926 background genes expressed in the GTEx Thyroid RNA-seq v8 (TPM > 120). Similar analyses were conducted for the four other top colocalized tissues: adipose subcutaneous (adipocytes as major cell type), artery tibial (smooth muscle cells), nerve tibial (Schwann general cells), and skin (sun-exposed) lower leg (keratinocytes as major cell type). These analyses were also performed using metascape, with background genes expressed in GTEx RNA-seq v8 for each respective tissue: 1053 genes for adipose subcutaneous, 1053 for artery tibial, 1029 for nerve tibial, and 791 genes for skin (TPM > 120).

scATAC-seq enrichment analysis

To gain insights in the associated regulatory processes, we took the 1304 clustered SNVs to look for enrichment of regions of open chromatin within clusters. Specifically, we aggregated independent SNVs (LD r² < 0.2) within 50 kb of each clustered SNV from the 1000 Genome Project Phase 3. We followed a similar protocol by the T2DGGI paper⁹, and conducted a Firth bias-reduced logistic regression, using logistf R package and the following equation:

$$E\left({Y}_{i}\right)={\alpha }_{0}+{\alpha }_{{{\mathrm{EXON}}}}{{{\mathrm{EXON}}}}_{i}+{\alpha }_{3{{\mathrm{UTR}}}\,}{3{{\mathrm{UTR}}}}_{i}+{\alpha }_{5{{\mathrm{UTR}}}}{5{{\mathrm{UTR}}}}_{i}+{\sum }_{j}{\alpha }_{{ij}}{X}_{{ji}}$$

With ${Y}_{i}$ taking the value 1 if the i-th SNV is within one of the clusters, ${X}_{{ji}}$ taking the value 1 if the i-th SNV mapped to an ATAC-seq peak for the j-th cell type, 0 otherwise. We defined ${{{\mathrm{EXON}}}}_{j}$, $3{{{\mathrm{UTR}}}}_{j}$, $5{{{\mathrm{UTR}}}}_{j}$ indicators taking the value 1 if the i-th SNV is located to the respective annotation (from the Ensembl Project), 0 otherwise. The $\alpha$ are the coefficients of log fold enrichments, and ${\alpha }_{0}$ is the intercept.

We performed the logistic regression twice, one with ${\alpha }_{{ij}}=0$ and one without constraint. We reported on the heatmap of Fig. 3b the p value associated to Chi-squared tests between the two logistic regression models to observe the enrichment of the j-th cell type across clusters. We used multiple correction testing for Chi-squared tests across cell types and clusters, p_threshold = 2.25 × 10⁻⁴. More information on the 222 cell types gathered in CATLAS can be found in CATLAS website.

Prevalence and relative risk of T2D-BP comorbidity using partitioned PGS

The prevalence of T2D-BP comorbidity was defined by the proportion of cases divided by the total number of individuals in the UKB, ${\mbox{Prev}}=\frac{{{{\mbox{N}}}}_{{\mbox{cases}}}}{{{{\mbox{N}}}}_{{\mbox{total}}}}$. The summary table in Fig. 4b provides the prevalence of individuals with T2D-BP comorbidity (of having both T2D and hypertension) based on being in the top decile (top 10%) of the partitioned unweighted PGS distribution. Additional results are available in Supplementary Table 5, showing both the top 10% and 33% of those partitioned PGS and comparing them with weighted T2D and SBP PGS. For individuals associated with multiple clusters, assignment was based on the cluster where they ranked the highest. The relative risk (RR) is calculated as the ratio of the proportion of cases within a cluster subgroup to the proportion of cases in the overall population ${{\mathrm{RR}}}=\frac{{{{\mathrm{Prev}}}}_{{{\mathrm{cases}}},{{\mathrm{cluster}}}}}{{{{\mathrm{Prev}}}}_{{{\mathrm{cases}}},{{\mathrm{overall}}}}}$. The 95% confidence interval of the RR was estimated using the following equation:

$${{{\mathrm{CI}}}}_{95\%}=\exp \left[{{\mathrm{ln}}}({{\mathrm{RR}}})\pm 1.96\sqrt{\frac{1}{{N}_{{{\mathrm{cases}}},{{\mathrm{cluster}}}}}-\frac{1}{{N}_{{{\mathrm{cases}}},{{\mathrm{overall}}}}}}\right]$$

Survival analysis using partitioned PGS

Survival analysis was conducted using the survival R package v3.6-4 with a Cox proportional hazards model. In this model, time zero was defined as the date of the first diagnosis of either hypertension or type 2 diabetes (T2D). The primary event of interest was the subsequent development of a T2D-BP comorbidity, marked by the diagnosis of the second disease. To ensure adequate sample sizes within groups, individuals were assigned to a cluster group if their maximum partitioned unweighted PGSs was in the top 33% of distribution. For individuals associated with multiple clusters, assignment was based on the cluster where they ranked the highest. Figure 4c displays the cumulative hazard with two-sided 95% confidence intervals. Sensitivity analysis includes comparing this survival analysis using T2D weighted PGS and SBP weighted PGS respectively, and is included in Supplementary Fig. 11 and Supplementary Data 10.

Partitioned PGS and risk of complications

Following the same pipeline developed in Reciprocal risk prediction using PGS, partitioned PGSs were calculated using clustered SNVs in the risk-increasing direction of T2D, assuming an additive model (Supplementary Data 8). We do not associate a weight with partitioned PGSs, meaning each allele increasing the risk of T2D is counting as one. Sensitivity analysis included using weights derived from T2D and BP GWAS to predict the risk of the other disorders (Supplementary Data 9). Association with complications were performed using the UKB ICD-10 codes, updated with hospital data up to December 2023. Associations were corrected for covariates including age, genetically-inferred sex, genetic array, and the first six principal components. Briefly, we extracted all the UKB ICD-10 available codes and refined them to subcategories of interest, namely: Endocrine, nutritional, or metabolic; Mental and behavioural disorders; Nervous system, Eyes, ears, nose and throat; Circulatory system; Respiratory system; Digestive system; Skin; Musculoskeletal system; Genitourinary system. All reported associations in Fig. 5 were derived using unweighted partitioned PGS, comorbidPGS using binary logistic regression, and a P-value corrected for multiple testing, p_threshold = 7.75 × 10⁻⁵.

Inclusion and ethics

This study is based exclusively on previously published and publicly available datasets; no new data involving human participants or animals were collected. Hence, no additional ethical approval was required for this research. We made use of multi-ancestry datasets to the fullest extent possible. When robust data across multiple ancestry groups were available, all were included in the analysis. In cases where no suitable alternative was available, analyses were restricted to individuals of genetically inferred European ancestry. All references to ancestry are based on genetically inferred population structure and not on self-reported race or ethnicity. Similarly, the reported sex in this work was determined using genetic data.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The GWAS used in this study are all publicly available and listed in Supplementary Data 2. The UK Biobank Resource [https://ukbiobank.ac.uk/] was accessed using the Application Number 236. GTEx [https://www.gtexportal.org/home/downloads/adult-gtex/qtl] and TIGER eQTL data used in this study are publicly available via these links, respectively. Data from the ABOS cohort are not publicly available, since they are subject to national French data protection laws and restrictions imposed by the ethics committee to ensure data privacy of the study participants. Data can be accessed through an individual project agreement with the principal investigator of the University Hospital of Lille (Lille, France), F.P., using the email address: francois.pattou@univ-lille.fr. The ATAC-seq data from CATLAS are publicly available and can be accessed via the following link: https://catlas.org/humanenhancer/data/.

Code availability

The software used for this analysis can be found on Zenodo [https://doi.org/10.5281/zenodo.17448298].

Change history

15 May 2026
In this article the funding from ‘the National Institute for Health and Care Research Barts Biomedical Research Centre (NIHR203330); a delivery partnership of Barts Health NHS Trust, Queen Mary University of London, St George’s University Hospitals NHS Foundation Trust and St George’s University of London’ was omitted. The original article has been corrected.

References

The World Health Organization. Hypertension. https://www.who.int/news-room/fact-sheets/detail/hypertension (2023).
International Diabetes Federation. IDF Diabetes Atlas. https://diabetesatlas.org/ (2021).
Iglay, K. et al. Prevalence and co-prevalence of comorbidities among patients with type 2 diabetes mellitus. Curr. Med. Res. Opin. 32, 1243–1252 (2016).
Article PubMed Google Scholar
Colussi, G. L., Da Porto, A. & Cavarape, A. Hypertension and type 2 diabetes: lights and shadows about causality. J. Hum. Hypertens. 34, 91–93 (2020).
Lastra, G., Syed, S., Kurukulasuriya, L. R., Manrique, C. & Sowers, J. R. Type 2 diabetes mellitus and hypertension. Endocrinol. Metab. Clin. North Am. 43, 103–122 (2014).
Article PubMed Google Scholar
Schmieder, R. E. et al. Achievement of individualized treatment targets in patients with comorbid type-2 diabetes and hypertension: 6 months results of the DIALOGUE registry. BMC Endocr. Disord. 15, 23 (2015).
Article PubMed PubMed Central Google Scholar
Alberti, K. G. M. M., Zimmet, P. & Shaw, J. Metabolic syndrome—a new world-wide definition. A consensus statement from the International Diabetes Federation. Diabet. Med. 23, 469–480 (2006).
Article CAS PubMed Google Scholar
Huang, P. L. A comprehensive definition for metabolic syndrome. Dis. Model Mech. 2, 231–237 (2009).
Article CAS PubMed PubMed Central Google Scholar
Suzuki, K. et al. Genetic drivers of heterogeneity in type 2 diabetes pathophysiology. Nature 627, 347–357 (2024).
Article ADS CAS PubMed PubMed Central Google Scholar
Keaton, J. M. et al. Genome-wide analysis in over 1 million individuals of European ancestry yields improved polygenic risk scores for blood pressure traits. Nat. Genet. 56, 778–791 (2024).
Article CAS PubMed PubMed Central Google Scholar
Qi, Q. et al. Genetic predisposition to high blood pressure associates with cardiovascular complications among patients with type 2 diabetes. Diabetes 61, 3026–3032 (2012).
Article CAS PubMed PubMed Central Google Scholar
Evangelou, E. et al. Genetic analysis of over 1 million people identifies 535 new loci associated with blood pressure traits. Nat. Genet. 50, 1412–1425 (2018).
Article CAS PubMed PubMed Central Google Scholar
Ehret, G. B. et al. Genetic variants in novel pathways influence blood pressure and cardiovascular disease risk. Nature 478, 103–109 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Sun, D. et al. Type 2 diabetes and hypertension: a study on bidirectional causality. Circ. Res. 124, 930–937 (2019).
Article CAS PubMed PubMed Central Google Scholar
Aikens, R. C. et al. Systolic blood pressure and risk of type 2 diabetes: a Mendelian randomization study. Diabetes 66, 543–550 (2017).
Article CAS PubMed Google Scholar
Nazarzadeh, M. et al. Blood pressure lowering and risk of new-onset type 2 diabetes: an individual participant data meta-analysis. Lancet 398, 1803–1810 (2021).
Article CAS PubMed PubMed Central Google Scholar
Aguet, F. et al. The GTEx Consortium atlas of genetic regulatory effects across human tissues. Science 369, 1318–1330 (2020).
Article ADS CAS Google Scholar
Alonso, L. et al. TIGER: the gene expression regulatory variation landscape of human pancreatic islets. Cell Rep. 37, 109807 (2021).
Article CAS PubMed PubMed Central Google Scholar
Margerie, D. et al. Hepatic transcriptomic signatures of statin treatment are associated with impaired glucose homeostasis in severely obese patients. BMC Med. Genomics 12, 80 (2019).
Article PubMed PubMed Central Google Scholar
Zhang, K. et al. A single-cell atlas of chromatin accessibility in the human genome. Cell 184, 5985–6001.e19 (2021).
Article CAS PubMed PubMed Central Google Scholar
Sudlow, C. et al. UK Biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 12, e1001779 (2015).
Article PubMed PubMed Central Google Scholar
Mahajan, A. et al. Fine-mapping type 2 diabetes loci to single-variant resolution using high-density imputation and islet-specific epigenome maps. Nat. Genet. 50, 1505–1513 (2018).
Article CAS PubMed PubMed Central Google Scholar
Bulik-Sullivan, B. et al. LD score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat. Genet. 47, 291–295 (2015).
Article CAS PubMed PubMed Central Google Scholar
Pascat, V. et al. comorbidPGS: an R package assessing shared predisposition between phenotypes using polygenic scores. Hum. Hered. https://doi.org/10.1159/000539325 (2024).
Wain, L. V. et al. Genome-wide association study identifies six new loci influencing pulse pressure and mean arterial pressure. Nat. Genet. 43, 1005–1011 (2011).
Article CAS PubMed PubMed Central Google Scholar
Mancina, R. M. et al. The COBLL1 C allele is associated with lower serum insulin levels and lower insulin resistance in overweight and obese children. Diab. Metab. Res. Rev. 29, 413–416 (2013).
Article CAS Google Scholar
Udler, M. S. et al. Type 2 diabetes genetic loci informed by multi-trait associations point to disease mechanisms and subtypes: a soft clustering analysis. PLoS Med. 15, e1002654 (2018).
Article PubMed PubMed Central Google Scholar
Tsai, C.-T. et al. Angiotensinogen gene haplotype and hypertension. Hypertension 41, 9–15 (2003).
Article CAS PubMed Google Scholar
Liao, Z., Wang, Y., Qi, X. & Xiao, X. JAZF1, a relevant metabolic regulator in type 2 diabetes. Diab. Metab. Res Rev. 35, e3148 (2019).
Article Google Scholar
Johnson, A. D. et al. Association of hypertension drug target genes with blood pressure and hypertension in 86 588 individuals. Hypertension 57, 903–910 (2011).
Article CAS PubMed PubMed Central Google Scholar
Lyssenko, V. et al. Mechanisms by which common variants in the TCF7L2 gene increase risk of type 2 diabetes. J. Clin. Investig. 117, 2155–2163 (2007).
Article CAS PubMed PubMed Central Google Scholar
Trevaskis, J. et al. Src homology 3-domain growth factor receptor-bound 2-like (endophilin) interacting protein 1, a novel neuronal protein that regulates energy balance. Endocrinology 146, 3757–3764 (2005).
Article CAS PubMed Google Scholar
Myers, T. A., Chanock, S. J. & Machiela, M. J. LDlinkR: an R package for rapidly calculating linkage disequilibrium statistics in diverse populations. Front. Genet. 11, 1–5 (2020).
Article Google Scholar
Mägi, R. et al. SCOPA and META-SCOPA: software for the analysis and aggregation of genome-wide association studies of multiple correlated phenotypes. BMC Bioinforma. 18, 4–11 (2017).
Article Google Scholar
Ward, J. H. Hierarchical grouping to optimize an objective function. J. Am. Stat. Assoc. 58, 236–244 (1963).
Article MathSciNet Google Scholar
Foley, C. N., Mason, A. M., Kirk, P. D. W. & Burgess, S. MR-Clust: clustering of genetic variants in Mendelian randomization with similar causal estimates. Bioinformatics 37, 531–541 (2021).
Article CAS PubMed PubMed Central Google Scholar
Smith, K. et al. Multi-ancestry polygenic mechanisms of type 2 diabetes. Nat. Med. 30, 1065–1074 (2024).
Article CAS PubMed PubMed Central Google Scholar
Vaura, F. et al. Multi-trait genetic analysis reveals clinically interpretable hypertension subtypes. Circ. Genom. Precis. Med. 15, e003583 (2022).
Article CAS PubMed PubMed Central Google Scholar
Laaksonen, D. et al. Sex hormones, inflammation and the metabolic syndrome: a population-based study. Eur. J. Endocrinol. 601–608 https://doi.org/10.1530/eje.0.1490601 (2003).
Park, S. et al. Multivariate genomic analysis of 5 million people elucidates the genetic architecture of shared components of the metabolic syndrome. Nat. Genet. 56, 2380–2391 (2024).
Article CAS PubMed PubMed Central Google Scholar
Després, J.-P. & Lemieux, I. Abdominal obesity and metabolic syndrome. Nature 444, 881–887 (2006).
Article ADS PubMed Google Scholar
Asao, K. et al. Short stature and the risk of adiposity, insulin resistance, and type 2 diabetes in middle age. Diab. Care 29, 1632–1637 (2006).
Article Google Scholar
Stefan, N., Häring, H.-U., Hu, F. B. & Schulze, M. B. Divergent associations of height with cardiometabolic disease and cancer: epidemiology, pathophysiology, and global implications. Lancet Diab. Endocrinol. 4, 457–467 (2016).
Article Google Scholar
Ben-Shlomo, Y. et al. An investigation of fetal, postnatal and childhood growth with insulin-like growth factor I and binding protein 3 in adulthood. Clin. Endocrinol. 59, 366–373 (2003).
Article CAS Google Scholar
Wittenbecher, C., Kuxhaus, O., Boeing, H., Stefan, N. & Schulze, M. B. Associations of short stature and components of height with incidence of type 2 diabetes: mediating effects of cardiometabolic risk factors. Diabetologia 62, 2211–2221 (2019).
Article CAS PubMed PubMed Central Google Scholar
Johnston, L. W. et al. Short leg length, a marker of early childhood deprivation, is associated with metabolic disorders underlying type 2 diabetes. Diab. Care 36, 3599–3606 (2013).
Article CAS Google Scholar
Giambartolomei, C. et al. Bayesian test for colocalisation between pairs of genetic association studies using summary statistics. PLoS Genet. 10, e1004383 (2014).
Article PubMed PubMed Central Google Scholar
Bueno-Carrasco, M. T. et al. Structural mechanism for tyrosine hydroxylase inhibition by dopamine and reactivation by Ser40 phosphorylation. Nat. Commun. 13, 74 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Hu, C. & Jia, W. Linking MTNR1B variants to diabetes: the role of circadian rhythms. Diabetes 65, 1490–1492 (2016).
Article CAS PubMed Google Scholar
Zhou, K. et al. FXYD2 mRNA expression represents a new independent factor that affects survival of glioma patients and predicts chemosensitivity of patients to temozolomide. BMC Neurol. 21, 438 (2021).
Article CAS PubMed PubMed Central Google Scholar
Kang, W. et al. Research progress on the structure and function of G3BP. Front. Immunol. 12, 718548 (2021).
Zhu, G. et al. Novel LTBP3 mutations associated with thoracic aortic aneurysms and dissections. Orphanet. J. Rare Dis. 16, 513 (2021).
Article PubMed PubMed Central Google Scholar
Chang, C.-M., Chang, W.-C. & Hsieh, S. Characterization of the genetic variation and evolutionary divergence of the CLEC18 family. J. Biomed. Sci. 31, 53 (2024).
Article CAS PubMed PubMed Central Google Scholar
Zhou, Y. et al. Metascape provides a biologist-oriented resource for the analysis of systems-level datasets. Nat. Commun. 10, 1523 (2019).
Article ADS PubMed PubMed Central Google Scholar
Olsen, T. & Blomhoff, R. Retinol, retinoic acid, and retinol-binding protein 4 are differentially associated with cardiovascular disease, type 2 diabetes, and obesity: an overview of human studies. Adv. Nutr. 11, 644–666 (2020).
Article PubMed PubMed Central Google Scholar
Horikoshi, M. et al. New loci associated with birth weight identify genetic links between intrauterine growth and adult height and metabolism. Nat. Genet. 45, 76–82 (2013).
Article CAS PubMed Google Scholar
Vivot, K. et al. CaMK1D signalling in AgRP neurons promotes ghrelin-mediated food intake. Nat. Metab. 5, 1045–1058 (2023).
Article CAS PubMed Google Scholar
Wang, H. et al. KCNH6 channel promotes insulin exocytosis via interaction with Munc18-1 independent of electrophysiological processes. Cell. Mol. Life Sci. 81, 86 (2024).
Article CAS PubMed PubMed Central Google Scholar
Yang, Y. et al. SAE1 promotes human glioma progression through activating AKT SUMOylation-mediated signaling pathways. Cell Commun. Signal. 17, 82 (2019).
Article PubMed PubMed Central Google Scholar
He, G. et al. Gamma-secretase activating protein is a therapeutic target for Alzheimer’s disease. Nature 467, 95–98 (2010).
Article ADS CAS PubMed Google Scholar
Melo-Cardenas, J., Bezavada, L., Cotton, A. & Crispino, J. D. DDB1 and CUL4 associated factor 7 (DCAF7) is essential for hematopoiesis. Blood 140, 8586–8587 (2022).
Article Google Scholar
Guan, J., Fan, Y., Wang, S. & Zhou, F. Functions of MAP3Ks in antiviral immunity. Immunol. Res. 71, 814–832 (2023).
Article CAS PubMed PubMed Central Google Scholar
Jiang, X., Holmes, C. & McVean, G. The impact of age on genetic risk for common diseases. PLoS Genet. 17, e1009723 (2021).
Article CAS PubMed PubMed Central Google Scholar
Thompson, D. J. et al. A systematic evaluation of the performance and properties of the UK Biobank Polygenic Risk Score (PRS) Release. PLoS ONE 19, e0307270 (2024).
Article CAS PubMed PubMed Central Google Scholar
Wielscher, M. et al. Genetic correlation and causal relationships between cardio-metabolic traits and lung function impairment. Genome Med. 13, 104 (2021).
Article CAS PubMed PubMed Central Google Scholar
Vattikuti, S., Guo, J. & Chow, C. C. Heritability and genetic correlations explained by common SNPs for metabolic syndrome traits. PLoS Genet. 8, e1002637 (2012).
Article CAS PubMed PubMed Central Google Scholar
Watanabe, K. et al. A global overview of pleiotropy and genetic architecture in complex traits. Nat. Genet. 51, 1339–1348 (2019).
Article CAS PubMed Google Scholar
Spence, J. P. et al. Specificity, length and luck drive gene rankings in association studies. Nature https://doi.org/10.1038/s41586-025-09703-7 (2025).
Scott, R. A. et al. An expanded genome-wide association study of type 2 diabetes in Europeans. Diabetes 66, 2888–2902 (2017).
Article CAS PubMed PubMed Central Google Scholar
Dimas, A. S. et al. Impact of type 2 diabetes susceptibility variants on quantitative glycemic traits reveals mechanistic heterogeneity. Diabetes 63, 2158–2171 (2014).
Article CAS PubMed Google Scholar
Warren, H. R. et al. Genome-wide association analysis identifies novel blood pressure loci and offers biological insights into cardiovascular risk. Nat. Genet. 49, 403–415 (2017).
Article CAS PubMed PubMed Central Google Scholar
The IDF Consensus. Worldwide Definition of the METABOLIC SYNDROME. https://idf.org/media/uploads/2023/05/attachments-30.pdf (2023).
Gavrila, A. & Hollenberg, A. N. The Hypothalamic-Pituitary-Thyroid Axis: Physiological Regulation and Clinical Implications. In: Luster, M., Duntas, L., Wartofsky, L. (eds) The Thyroid and Its Diseases. Springer, Cham. (2019). https://doi.org/10.1007/978-3-319-72102-6_2.
Lambert, S. A., Abraham, G. & Inouye, M. Towards clinical utility of polygenic risk scores. Hum. Mol. Genet. 28, R133–R142 (2019).
Article CAS PubMed Google Scholar
Surakka, I. et al. Sex-specific survival bias and interaction modeling in coronary artery disease risk prediction. Circ. Genom. Precis. Med. 16, e003542 (2023).
Article CAS PubMed Google Scholar
Ma, Y. & Zhou, X. Genetic prediction of complex traits with polygenic scores: a statistical review. Trends Genet. 37, 995–1011 (2021).
Article CAS PubMed PubMed Central Google Scholar
Novembre, J. et al. Addressing the challenges of polygenic scores in human genetic research. Am. J. Hum. Genet. 109, 2095–2100 (2022).
Article CAS PubMed PubMed Central Google Scholar
Hippisley-Cox, J. & Coupland, C. Development and validation of QDiabetes-2018 risk prediction algorithm to estimate future risk of type 2 diabetes: cohort study. BMJ j5019 https://doi.org/10.1136/bmj.j5019 (2017).
Fuat, A. et al. A polygenic risk score added to a QRISK®2 cardiovascular disease risk calculator demonstrated robust clinical acceptance and clinical utility in the primary care setting. Eur. J. Prev. Cardiol. 31, 716–722 (2024).
Article PubMed Google Scholar
Graham, S. E. et al. The power of genetic diversity in genome-wide association studies of lipids. Nature 600, 675–679 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Grotzinger, A. D. et al. Genomic structural equation modelling provides insights into the multivariate genetic architecture of complex traits. Nat. Hum. Behav. 3, 513–525 (2019).
Article PubMed PubMed Central Google Scholar
Morris, A. P. et al. Large-scale association analysis provides insights into the genetic architecture and pathophysiology of type 2 diabetes. Nat. Genet. 44, 981–990 (2012).
Article CAS PubMed PubMed Central Google Scholar
Mahajan, A. et al. Refining the accuracy of validated target identification through coding variant fine-mapping in type 2 diabetes. Nat. Genet. 50, 559–571 (2018).
Article CAS PubMed PubMed Central Google Scholar
Saxena, R. et al. Genome-wide association study identifies a novel locus contributing to type 2 diabetes susceptibility in Sikhs of Punjabi origin from India. Diabetes 62, 1746–1755 (2013).
Article CAS PubMed PubMed Central Google Scholar
Gaulton, K. J. et al. Genetic fine mapping and genomic annotation defines causal mechanisms at type 2 diabetes susceptibility loci. Nat. Genet. 47, 1415–1425 (2015).
Article CAS PubMed PubMed Central Google Scholar
Cho, Y. S. et al. Meta-analysis of genome-wide association studies identifies eight new loci for type 2 diabetes in east Asians. Nat. Genet. 44, 67–72 (2012).
Article CAS Google Scholar
Tragante, V. et al. Gene-centric meta-analysis in 87,736 individuals of European ancestry identifies multiple blood-pressure-related loci. Am. J. Hum. Genet. 94, 349–360 (2014).
Article CAS PubMed PubMed Central Google Scholar
Liang, J. et al. Single-trait and multi-trait genome-wide association analyses identify novel loci for blood pressure in African-ancestry populations. PLoS Genet. 13, e1006728 (2017).
Article PubMed PubMed Central Google Scholar
Levy, D. et al. Genome-wide association study of blood pressure and hypertension. Nat. Genet. 41, 677–687 (2009).
Article CAS PubMed PubMed Central Google Scholar
Takeuchi, F. et al. Interethnic analyses of blood pressure loci in populations of East Asian and European descent. Nat. Commun. 9, 5052 (2018).
Article ADS PubMed PubMed Central Google Scholar
Franceschini, N. et al. Genome-wide association analysis of blood-pressure traits in African-ancestry individuals reveals common associated genes in African and non-African populations. Am. J. Hum. Genet. 93, 545–554 (2013).
Article CAS PubMed PubMed Central Google Scholar
Giri, A. et al. Trans-ethnic association study of blood pressure determinants in over 750,000 individuals. Nat. Genet. 51, 51–62 (2019).
Article CAS PubMed Google Scholar
Kato, N. et al. Trans-ancestry genome-wide association study identifies 12 genetic loci influencing blood pressure and implicates a role for DNA methylation. Nat. Genet. 47, 1282–1293 (2015).
Article CAS PubMed PubMed Central Google Scholar
Wang, Y. et al. Whole-genome association study identifies STK39 as a hypertension susceptibility gene. Proc. Natl. Acad. Sci. USA 106, 226–231 (2009).
Article ADS CAS PubMed Google Scholar
Hoffmann, T. J. et al. Genome-wide association analyses using electronic health records identify new loci influencing blood pressure variation. Nat. Genet. 49, 54–64 (2017).
Article CAS PubMed Google Scholar
Liu, C. et al. Meta-analysis identifies common and rare variants influencing blood pressure and overlapping with metabolic trait loci. Nat. Genet. 48, 1162–1170 (2016).
Article CAS PubMed PubMed Central Google Scholar
Wain, L. V. et al. Novel blood pressure locus and gene discovery using genome-wide association study and expression data sets from blood and the kidney. Hypertension 70, e4–e19 (2017).
Tobin, M. D., Sheehan, N. A., Scurrah, K. J. & Burton, P. R. Adjusting for treatment effects in studies of quantitative traits: antihypertensive therapy and systolic blood pressure. Stat. Med. 24, 2911–2935 (2005).
Article MathSciNet PubMed Google Scholar
Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575 (2007).
Article CAS PubMed PubMed Central Google Scholar
Choi, S. W. & O’Reilly, P. F. PRSice-2: Polygenic Risk Score software for biobank-scale data. Gigascience 8, 1–6 (2019).
Article Google Scholar
Choi, S. W., Mak, T. S. H. & O’Reilly, P. F. Tutorial: a guide to performing polygenic risk score analyses. Nat. Protoc. 15, 2759–2772 (2020).
Article CAS PubMed PubMed Central Google Scholar
Durinck, S., Spellman, P. T., Birney, E. & Huber, W. Mapping identifiers for the integration of genomic datasets with the R/Bioconductor package biomaRt. Nat. Protoc. 4, 1184–1191 (2009).
Article CAS PubMed PubMed Central Google Scholar
Richardson, T. G. et al. Systematic Mendelian randomization framework elucidates hundreds of CpG sites which may mediate the influence of genetic variants on disease. Hum. Mol. Genet. 27, 3293–3304 (2018).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This research has been conducted using the UK Biobank Resource under application number 236. This project was in part funded by the Agence Nationale de la Recherche under the Programme d’Investissement d’Avenir (PreciDIAB, ANR-18-IBHU-0001 and RHU PreciNASH ANR-16-RHUS-0006), by the European Union through the “Fonds Européen de Développement Regional” (FEDER), by the “Conseil Régional des Hauts-de-France” (Hauts-de-France Regional Council), by the “Métropole Européenne de Lille” (MEL, European Metropolis of Lille), and by the European Research Council (ERC OpiO – 101043671, to A.B.). I.Pr. and Z.B. were in part funded by the Diabetes UK (BDA number: 20/0006307), UKRI (EP/Z535072/1), European Foundation for the Study of Diabetes (EFSD), and Novo Nordisk A/S Programme for Diabetes Research in Europe—2025. P.M. acknowledges the support of the National Institute for Health and Care Research Barts Biomedical Research Centre (NIHR203330); a delivery partnership of Barts Health NHS Trust, Queen Mary University of London, St George’s University Hospitals NHS Foundation Trust and St George’s University of London. The research of Y.S. was funded by The British Academy (RaR\100084). The authors would like to thank all the investigators from different consortia that built and shared the GWAS meta-analysis, eQTLs, and scATAC-seq atlases used in this study, as well as the UK Biobank participants and dedicated staff.

Author information

These authors jointly supervised this work: Philippe Froguel, Inga Prokopenko.

Authors and Affiliations

Université de Lille, Inserm UMR1283, CNRS UMR8199, European Genomic Institute for Diabetes (EGID), Institut Pasteur de Lille, Lille University Hospital, Lille, France
Vincent Pascat, Lucas Maurin, Jared G. Maina, François Pattou, Amna Khamis, Amélie Bonnefond, Philippe Froguel & Inga Prokopenko
Department of Metabolism, Digestion, and Reproduction, Imperial College London, London, UK
Vincent Pascat, Liudmila Zudina, Anna Ulrich, Ayse Demirkan, Zhanna Balkhiyarova, Marika Kaakinen, Amna Khamis, Amélie Bonnefond & Philippe Froguel
Section of Statistical Multi-omics, Department of Clinical and Experimental Medicine, University of Surrey, Guildford, UK
Liudmila Zudina, Ayse Demirkan, Zhanna Balkhiyarova, Igor Pupko, Yevheniya Sharhorodska, Marika Kaakinen & Inga Prokopenko
People-Centred Artificial Intelligence Institute, University of Surrey, Guildford, UK
Ayse Demirkan, Zhanna Balkhiyarova, Marika Kaakinen & Inga Prokopenko
Department of Life Sciences and Biotechnology, University of Ferrara, Ferrara, Italy
Yevheniya Sharhorodska
Department of General and Endocrine Surgery, CHU Lille, Lille, France
François Pattou
Université de Lille, Inserm, CHU Lille, Institut Pasteur de Lille, U1011-EGID, Lille, France
Bart Staels
Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland
Marika Kaakinen
William Harvey Research Institute, Barts and the London Faculty of Medicine and Dentistry, Queen Mary University of London, London, UK
Patricia Munroe
National Institute of Health and Care Research, Barts Cardiovascular Biomedical Research Centre, Queen Mary University of London, London, UK
Patricia Munroe

Authors

Vincent Pascat
View author publications
Search author on:PubMed Google Scholar
Liudmila Zudina
View author publications
Search author on:PubMed Google Scholar
Lucas Maurin
View author publications
Search author on:PubMed Google Scholar
Anna Ulrich
View author publications
Search author on:PubMed Google Scholar
Jared G. Maina
View author publications
Search author on:PubMed Google Scholar
Ayse Demirkan
View author publications
Search author on:PubMed Google Scholar
Zhanna Balkhiyarova
View author publications
Search author on:PubMed Google Scholar
Igor Pupko
View author publications
Search author on:PubMed Google Scholar
Yevheniya Sharhorodska
View author publications
Search author on:PubMed Google Scholar
François Pattou
View author publications
Search author on:PubMed Google Scholar
Bart Staels
View author publications
Search author on:PubMed Google Scholar
Marika Kaakinen
View author publications
Search author on:PubMed Google Scholar
Amna Khamis
View author publications
Search author on:PubMed Google Scholar
Amélie Bonnefond
View author publications
Search author on:PubMed Google Scholar
Patricia Munroe
View author publications
Search author on:PubMed Google Scholar
Philippe Froguel
View author publications
Search author on:PubMed Google Scholar
Inga Prokopenko
View author publications
Search author on:PubMed Google Scholar

Contributions

V.P. and I.Pr. designed the experiments and led the manuscript writing. L.Z. contributed to the study design for clustering and partitioned PGS. L.M. contributed to the analysis for colocalization, pathway analysis, interpretation, and revision. A.U., J.G.M., A.D., Z.B., I.Pu., and Y.S. defined the phenotypes of interest in the UK Biobank and provided handmade GWAS summary statistics. F.P. and B.S. provided the ABOS cohort. M.K. contributed to the overall statistics evaluation. A.K. contributed to the colocalization evaluation. V.P., A.B., P.M., P.F., and I.Pr. contributed to the evaluation of the results. P.F. and I.Pr. jointly supervised the study. All authors read and approved the final paper.

Corresponding author

Correspondence to Inga Prokopenko.

Ethics declarations

Competing interests

V.P. is employed by Genomics Ltd., L.Z. is employed by Lifebit Biotech Inc. The authors declare that they have no other financial or non-financial competing interests. The views expressed in this study are the personal views of V.P. and L.Z., and do not represent the views of their current employers.

Peer review

Peer review information

Nature Communications thanks Ozan Dikilitas, Viktor H. Ahlqvist and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. [A peer review file is available].

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information (download PDF )

Supplementary Data (download XLSX )

Reporting Summary (download PDF )

Transparent Peer Review file (download PDF )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Pascat, V., Zudina, L., Maurin, L. et al. Partitioned polygenic scores show mechanistic heterogeneity in type 2 diabetes and hypertension comorbidity. Nat Commun 17, 1446 (2026). https://doi.org/10.1038/s41467-025-67449-2

Download citation

Received: 03 March 2025
Accepted: 01 December 2025
Published: 09 February 2026
Version of record: 09 February 2026
DOI: https://doi.org/10.1038/s41467-025-67449-2

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Genetic overlap between T2D and BP

Clusters of pathogenetic processes

Multiomic characterisation of T2D-BP clusters

T2D-BP comorbidity using partitioned PGSs

Discussion

Methods

Material description

Genetic correlation

Reciprocal risk prediction using PGS

Genetic overlap

Clusters of pathogenetic processes

Colocalization analysis

scATAC-seq enrichment analysis

Prevalence and relative risk of T2D-BP comorbidity using partitioned PGS

Survival analysis using partitioned PGS

Partitioned PGS and risk of complications

Inclusion and ethics

Reporting summary

Data availability

Code availability

Change history

15 May 2026

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links