Abstract
We aimed to assess the role of traditional cardiovascular risk factors on carotid artery intima-media thickness (IMT) and proteins associated with IMT. IMT was measured in 50,704 participants from the UK Biobank. Plasma levels of 2923 proteins were analyzed in 6328 individuals. Mendelian randomization (MR) analyses used genetic data on IMT, proteins, and risk factors. Observational analyses showed positive associations of LDL-cholesterol, body mass index (BMI), diabetes, and systolic blood pressure (SBP) with IMT, while HDL-cholesterol was inversely related. MR analysis confirmed causality for LDL, BMI, diabetes, but not HDL. Of 63 proteins observationally linked to IMT, 17 were replicated externally. Five proteins (BCAM, NTproBNP/NPPB, RABEPK, ACAN, FN1) were associated with IMT in MR, with RABEPK showing concordant directional relationships. MR supports causal links between LDL, BMI, diabetes, SBP, and IMT. RABEPK emerged as a potential therapeutic target, warranting further investigation.
Similar content being viewed by others
Introduction
Measurements of intima-media thickness (IMT) with ultrasound is commonly used in epidemiological studies to evaluate the atherosclerotic burden. Although the use of IMT as a substitute for plaque measurements has been questioned1,2, IMT measurements have been linked to overt atherosclerotic diseases, like myocardial infarction and ischemic stroke in epidemiological studies3,4,5. Furthermore, Mendelian randomization (MR) and colocalization studies have suggested that IMT is causally related to both myocardial infarction and ischemic stroke6, indicating that IMT indeed is a vascular characteristic of interest.
Risk factors for a thickened IMT is mainly the same risk factors as being well-known for myocardial infarction and ischemic stroke, such as high levels of BMI, blood pressure, LDL-cholesterol, diabetes, smoking and low levels of HDL-cholesterol7. However, while MR studies have confirmed that BMI, blood pressure, LDL-cholesterol, and diabetes likely are causally related to myocardial infarction and ischemic stroke8,9,10,11,12,13,14,15,16, the causal role of HDL-cholesterol as a protective factor has been questioned11,12. Previous MR studies have suggested causal associations of some lipids traits and BMI with IMT17,18,19,20,21, and it is of value to conduct a larger study including more cardiometabolic traits simultaneously using the UK Biobank data.
In the recent years, techniques for multiplex measurements of proteins have emerged and the circulating protein profile of many traits have been presented to deepen our understanding of disease pathophysiology and improve disease prediction22,23,24. In our previous study, we have identified eleven out of eighty-six proteins associated with thickened IMT using observational design25. But since that study the number of measurable proteins has grown considerably, we are now motivated to include a much greater number of proteins to be associated with IMT, using both observational and genetic analyses.
The aims of the present study were three-fold. First, to compare measured levels of traditional risk factors for IMT with genetically determined levels to evaluate causality. Second, to disclose the protein profile of a thick IMT. Third, to use MR and colocalization analyses to evaluate if some of protein-IMT associations were causal and therefore a suitable target for drug development regarding vascular diseases. For these aims, we used the UK Biobank as the major resource where almost 3000 proteins have been measured along with IMT and traditional risk factors.
Results
A flowchart describing the analyses conducted in this work can be found in Fig. 1. Basic characteristics of the studied sample is given in Table 1. For MR, 1676 cis-pQTL SNPs were included as genetic instruments (Supplementary Table 1).
Comparing the associations of measured and genetically determined levels of traditional risk factors with IMT
As could be seen in Fig. 2 and Supplementary Table 2, the observational data using measured levels of the risk factors disclosed the expected results, namely that LDL-cholesterol, BMI, diabetes, SBP were positively related to IMT, while HDL-cholesterol was related to IMT in an inverse fashion. The same pattern was also seen using the PRS for the risk factors, but in this case the estimates were generally lower than those of the measured levels. MR analyses showed LDL-cholesterol, BMI, and SBP were significantly associated with IMT in the expected fashion, while diabetes and HDL-cholesterol were not significant. IVW and MR Egger showed generally very similar estimates.
Relationships between IMT of the carotid artery and measured levels (Obs) or genetically determined levels of the traditional CVD risk factors, including using polygenetic risk scores (PRS), and Mendelian randomization [inverse-variance weighted (IVW) and MR Egger (Egger)] techniques. The regression coefficient (beta) and 95%CI are shown.
Measured protein levels vs IMT
Of the 2923 proteins evaluated in the observational part, 478 showed FDR < 0.05 in the age and sex-adjusted model. Of those, only 63 showed p < 0.05 in the model also adjusted for CVD risk factors (Fig. 3 and Supplementary Table 3). Top findings were PON3, IGFBP1, HAVCR1, TACSTD2 and LEP. LEP and FABP4 did however change the sign of the estimate from positive to negative following the additional adjustment for CVD risk factors (including BMI).
When the 63 proteins of interest were subjected to pathway enrichment analysis (www.reactome.org), a number of pathways were enriched (p < 0.05), including regulation of insulin-growth factor transport, post-translational protein phosphorylation, white adipocyte differentiation, ATF4 activates genes in response to endoplasmatic reticulum stress, and negative regulation of FGFR4 signaling (Supplementary Table 4). Out of the 63 proteins of interest, 47 were measured in the POEM study (Supplementary Table 5), of which, 17 were found significantly (p < 0.05) associated with IMT, showing concordant signs of the beta coefficients with those in UKB. Top findings included FAP (Prolyl endopeptidase FAP), NCAN (Neurocan core protein), CHGB (Secretogranin-1), PON3 and GAL (Galanin peptides).
Genetically determined protein levels vs IMT or risk factors
Of the 1676 proteins evaluated in the MR analysis, only 5 showed FDR < 0.05 vs IMT (BCAM, NTproBNP/NPPB, RABEPK, ACAN, FN1)(Supplementary Table 6). None of these proteins were amongst the 63 proteins identified to be significant in the observational part (FDR-adjusted), and only RABEPK showed the same sign of direction in the MR analysis as in the observational analysis (age and sex adjusted model), as could be seen in Fig. 4. The MR sensitivity analyses showed very similar estimates of MR-Wald ratio and other MR methods using multiple SNPs. No horizontal pleiotropy was observed for all associations. For BCAM-IMT association, heterogeneity was found for the estimates of SNPs used in MR-IVW and MR-Egger methods (Supplementary Table 7).
Of the 5 proteins, NTproBNP/NPPB showed strong evidence of colocalization with IMT (PP.H4 > 80%) as shown in Fig. 5, while RABEPK and ACAN showed medium evidence (PP.H4 > 70%). FN1 showed a PP.H4 of 58% in the colocalization analysis. BCAM showed no evidence of colocalization using the traditional method (Supplementary Table 8), but showed a strong evidence of colocalization (PP.H4 of 90%) when using the SuSiE method (Supplementary Table 9). As could be seen in Fig. 6A, all but NPPB were expressed in arterial tissues.
A Expression analysis in three relevant arteries for genes of five proteins being significant in the Mendelian randomization (MR) analysis vs the intima-media thickness (IMT). B Heat-map of MR-Wald ratio estimate of selected proteins with traditional risk factors of IMT. Significant levels are expressed as ***, **, * or lack of * if nominal p-value is <0.001, <0.01, < 0.05 or > 0.05. A blank square indicates that no association was estimated because of lack of genetic instruments. C Mediation analysis of the effect of proteins on IMT via SBP using MR.
Of the 63 proteins found significant in the observational analysis, seven showed p < 0.05 in the MR analysis (CST6, DSG4, FDX1, PRELP, SMOC2, SPINT3, TOR1AIP1). However, DSG4 and FDX1 showed opposite signs of the estimates in the observational and MR analyses. Five of the seven proteins excluding DSG4 and FDX1 were further submitted for colocalization analysis but none showed evidence (PP.H4 < 0.5) of colocalization (Fig. 5).
Out of the five proteins significantly associated with IMT in the MR analysis, all except ACAN were also associated with SBP (p < 0.05), where the associations of RABEPK (beta = 0.07, p < 0.001) and NPPB (beta = −0.11, p < 0.001) were more pronounced (Fig. 6B, Supplementary Table 10). NPPB was also positively and inversely associated with BMI and T2D risk, respectively.
The effect of proteins on IMT mediated by SBP using MR analysis
As shown in the Fig. 6C, the MR-IVW estimate of the association of RABEPK with IMT was 0.018 (p < 0.001). After adjusting for SBP, the estimate was 0.0097 (p = 0.036). Thus, 46% of the effect of RABEPK on IMT was mediated by SBP. However, no effect was mediated via SBP for the association of BCAM with IMT.
Druggability
For the proteins colocalized with IMT, 3 out of 4 proteins have the druggability tiers. FN1, ACAN and BCAM were listed as Tier 1, Tier 3 A and Tier 3 A, respectively. Only FN1 was targeted by drug in the clinical trial but not for atherosclerosis-related outcomes (Supplementary Table 11).
Discussion
The present study showed that traditional risk factors for CVD were related to IMT in the expected manner when using measured levels or PRS. Using MR, support for a causal role was only seen for LDL-cholesterol, BMI and SBP. Plasma levels of 63 proteins were related to IMT whereas 17 were replicated in the observational analyses, but using MR and colocalization, support for a causal role was only seen for 5 proteins. However, only the RABEPK- IMT association showed concordant direction in the observational and MR analyses.
Using observational data, it has previously been shown that the traditional risk factors for CVDs, like myocardial infarction and ischemic stroke, also are linked to a thick IMT7.
This picture is reproduced in the present study using observational data and PRS, but in the two-sample MR analysis only LDL-cholesterol, BMI and SBP were causally linked to IMT, while HDL-cholesterol and diabetes were not. Thus, using genetic information in two different ways (PRS and MR) gave diverging results for diabetes and HDL-cholesterol.
A causal role for HDL-cholesterol in myocardial infarction has been questioned since early genetic studies did not support a causal role for HDL-cholesterol11,12. Also the failure of HDL-rising drugs to improve CVD outcomes has raised concerns of a causal role of HDL-cholesterol in atherosclerotic diseases, as reviewed in26. Regarding diabetes, this was a more unexpected finding. Using the same genetic loci as instrumental variables for diabetes, we have previously shown genetically determined diabetes to be linked to both myocardial infarction and stroke, two atherosclerotic diseases, in similar MR studies as used in the present evaluation16. Since the occurrence of myocardial infarction and ischemic stroke are most often the result of an atherosclerotic plaque rupture and subsequent clot formation and arterial conclusion, it might be that diabetes is mainly a causal risk factor for these latter events and maybe not so much involved in early vascular pathology.
The discrepant findings for PRS and MR regarding the role of diabetes for IMT also raise questions how these two ways to use genetic information differ. One obvious difference is that the PRS used in the present evaluation is a PRS for type 2 diabetes, while the GWAS for diabetes used in the MR evaluation does include all kinds of diabetes. However, the vast majority of the diabetic cases included in the GWAS for diabetes are type 2, since the majority of subjects were middle-aged or elderly. Another difference is that having a high value at a PRS for diabetes is seen in a number of subjects with normal glucose tolerance, as well as in a number of subjects with overt diabetes at the instant when data are collected, as compared to the MR situation in which the cases in the diabetes GWAS all have overt diabetes.
Of the proteins suitable for MR analysis, only 5 showed evidences of a causal role for IMT when adjustment for multiple testing was performed (BCAM, NT-proBNP/NPPB, RABEPK, ACAN, FN1), but none of those were amongst the 63 proteins identified in the observational analysis. Colocalization analysis supported a causal role for all of those proteins, but NPPB was not found to have expression in arteries.
A previous MR analysis also identified NT-proBNP to be inversely related to IMT25. In that study, measured NT-proBNP levels were positively related to IMT (in the observational setting) and therefore the relationship between NT-proBNP and IMT was regarded not to be causal. It is more likely that a thick IMT representing a deranged hemodynamic situation that by an increased volume or pressure load on the myocardium results in increased production of NT-proBNP (and BNP (NPPB)). It was also seen in the expression analysis in Fig. 5A that NPPB was not abundantly expressed in the evaluated arteries.
Rab9 effector protein with kelch motifs (RABEPK), also called only Rab9, is a protein known to be involved in Retrograde transport at the Trans-Golgi Network and vesicle-mediated transport, as well as autophagosome formation in unconventional macroautophagy, as reviewed by Zhang et al.27. Genetic variation in this gene has previously been linked to cigarette use, drug abuse, bipolar disorders, as well as BMI28, but its role in atherosclerosis is less known. However, as recently reviewed by Nah29, RABEPK could by its role in alternative mitophagy potentially be involved in several myocardial disorders. In UK Biobank, the measured levels of RABEPK were not closely related to any of the traditional CVD risk factors (R2 < 1% for all comparisons, results not shown in detail). A look-up in the public databases regarding the potential druggability of RABEPK yielded no findings. Thus, based on the above-mentioned findings, RABEPK might well be a target worthwhile to be explored in future studies and drug development for vascular diseases.
The basal cell adhesion molecule (BCAM), also denoted Lutheran blood group glycoprotein (Lu), is a transmembrane receptor with five immunoglobulin-like domains and belongs to the immunoglobulin gene family. BCAM is involved in blood cell–endothelium interactions, such seen in sickle-cell anemia, polycytemia vera and retinal-vein occlusion, as reviewed by Wautier and Wautier30. However, the role of BCAM in CVD in not known. Genetic variation in the BCAM gene has mainly been linked to Alzheimer´s disease31 and blood lipid levels32. However, measured BCAM levels were not closely related to any of the traditional CVD risk factors (R2 < 1% for all comparisons) in UK Biobank. This protein has been judged as a Tier 3 A target of drug.
Aggrecan (ACAN) is a large chondroitin sulfate proteoglycan of the extracellular matrix. This protein forms giant hydrated aggregates with hyaluronan in the extracellular matrix (ECM). ACAN is a major component of cartilage, but is also expressed the cardiovascular system. It is expressed in preatherosclerotic lesions in human carotid arteries, but was reduced in calcified lesions33. Aggrecan has in another study been expressed in both stable and unstable advanced plaques34. Genetic variation in ACAN has mainly been linked to height35, and other anthropometric index, such as waist circumference36. In our analysis of measured ACAM levels in UK Biobank, the levels were inversely related to LDL-cholesterol (R2 = 2%) and inversely to BMI (R2 = 3%). ACAM has been judged as a Tier 3 A target of drug and an experimental compound is available.
Fibronectin 1 (FN1) linking matrices and cells and is found in both cell surface and extracellular matrix in many types of cells. As reviewed by Tang et al, fibronectin is involved in wound healing, hematopoiesis, infections, as well as cancer development37. A role of fibronectin 1 has also been suggested for atherosclerosis in which this protein is expressed mainly in the intima and is involved in the transformation of vascular smooth muscle cell to a secretory type of cell38. In a cross-sectional human study, low levels of fibronectin 1 was found in patients with coronary heart disease (CHD), but the levels of this protein was not related to the severity of CHD at coronary angiography39. SNPs in this gene have been linked to CHD40 and a drug, Ocriplasmin, is on the market since a decade for the treatment of symptomatic vitreomacular adhesion. In UK Biobank, measured levels of FN1 is associated with blood pressure (R2 = 3%), BMI (R2 = 5%) and LDL-cholesterol (R2 = 3%).
Since correction for multiple testing might produce false negative results, we evaluated if any of the 63 proteins found related with IMT in the observational analysis showed same directions of the associations in the MR analysis, and also showed p < 0.05, even though FDR was >0.05. Five such proteins were found (CST6, PRELP, SMOC2, SPINT3 and TOR1AIP1). To further examine if these proteins could be causally related to IMT, we performed colocalization analysis, but none of those met the colocalization criterion.
It may raise some concerns that the observational and genetic results did not converge as no overlapped proteins were found in both. The reasons behind this are multi-factorial, including heterogeneity of biases in different methods and of statistical estimation approaches. It has been recognized that MR is instinctively less biased than the observational study design, and some of MR results were even comparable to the randomized controlled trial (RCT) studies41,42. Our MR findings of NTproBNP/NPPB associated with IMT can serve as a reassurance that our MR results are meaningful to some extent because NTproBNP is a well-established biomarker of CVD used in clinical practice43. The MR association of NTproBNP was also supported by the strong colocalization evidence.
In the present study, we used FDR < 0.05 for the age and sex-adjusted model and nominal p < 0.05 for the multiple-adjusted model as significance levels in the discovery phase in UK biobank, given that we could validate most of the proteins in the independent POEM sample. It might be argued that we should have applied also FDR < 0.05 for the multiple-adjusted model. If FDR < 0.05 was applied also for the multiple-adjusted model, no protein would be found to be related IMT, which obviously not is the case, given that we have validated 11 (out of 86 tested) proteins to be linked to IMT in our previous study 25.
The strength of the present study is the large number of subjects in whom IMT has been measured together with traditional risk factors and PRS for those risk factors. The number of individuals with parallel protein measurements was smaller, but is still comparable with the largest study conducted so far25 and the protein-IMT associations were replicated in an external cohort, although not every protein was measured in the replication cohort. One limitation is that the imaging of IMT was performed some years following the collection of traditional risk factors and proteins, and thus the study design is more longitudinal than cross-sectional in that respect. Changes in risk factors and proteins may well have occurred during this time gap, but any such deviations would possibly only lead to underestimations of the evaluated relationships, and the analysis of the measured traditional risk factors with IMT showed the expected results compared to previous studies in this area. These proteins were also replicated in an independent cohort where IMT, proteins and risk factors were all measured at the baseline. Another limitation is that the discordant results found by observational and genetic analyses may raise concerns over the screening procedure of proteins of interest, and thus interpretation of such results should be cautious. Last, no analysis was conducted for carotid plaque which has been considered a superior measurement to IMT, because of lack of observational data of carotid plaque in the UK Biobank. However, both clinical trials and observational studies have implied strong evidence to support IMT as a useful surrogate marker for cardiovascular risk44. Thus our results are of great value to help us further understand vascular biology and atherosclerotic physiopathology.
In conclusion, MR suggested that LDL-cholesterol, BMI, diabetes and SBP are causally related to IMT. MR also identified five proteins of interest for IMT, but further analyses suggested mainly RABEPK to be an interesting target deserving further investigations.
Methods
Study samples
The UK Biobank study was approved by the UK North West Multi-Centre Research Ethics Committee (Project Application Number 90143) and the Swedish Ethical Review Authority (Nr. 2023-00148-01), and performed in accordance with the Declaration of Helsinki and other relevant guidelines/regulations. All participants provided written informed consent.
The PEOM study was approved by the Ethics committee of Uppsala University and all participants gave their written informed consent.
UK Biobank is a large, multi-center, prospective cohort study conducted across the UK (https://www.ukbiobank.ac.uk). In 2006–10, over 500,000 individuals aged 40–69 years underwent physical measurements, and blood samples were biobanked for later analysis of genes and biomarkers. The present study used data from the 50,704 individuals with an available measurement of IMT. Participants information on sex, age, and race was self-reported.
Fasting glucose, HbA1c, LDL- and HDL-cholesterol and creatinine were measured by a Beckman coulter AU5800, by standard methods. Blood pressure was measured twice in the sitting position with the automated Omron device. Glomerular filtration rate was estimated (eGFR) by the updated CKD-EPI formula45. Body mass index (BMI) was calculated from measured height and weight (weight/height^2). Ethnicity was categorized into four groups; White, Black, Asian, and other. Townsend social deprivation index was used as a marker of socioeconomic status. Smoking was categorized as never, previous or current. PRSs for 28 diseases and 25 quantitative traits have previously been calculated on the individuals in UK Biobank by Thompson and co-workers46. These data are, as other variables, available for scientists upon request from UK Biobank. The Standard PRS set of 28 diseases and 8 quantitative traits was generated from external GWAS data. The current study used the “Standard PRSs” calculated in the majority of subjects. PRS algorithms were built from trait-specific meta-analyses using a Bayesian approach, where appropriate combining data across multiple ancestries and related traits. Per-individual PRS values were calculated as the genome-wide sum of the per-variant posterior effect size multiplied by allele dosage. Further details on the calculations of the PRSs could be found in the reference46. The present study used the PRS for diabetes type2, hypertension, LDL and HDL cholesterol and BMI. Plasma levels of 2923 proteins measured by the Olink proteomics assay Explore (OLINK, Uppsala, Sweden) were determined by the proximity extension assay (PEA) technique in 52,164 UKB participants. An extensive quality control was performed, given in detail in the reference47. The far wall of the common carotid artery was scanned by ultrasound (CardioHealth Station, Panasonic Healthcare Corporation of North America, Newark, NJ, USA) at two prespecified angles for each of the two arteries (right 150°, right 120°, left 210°, and left 240°). The mean, min and max of the four measured angles were recorded. The present study used the mean value of the four measurements of maximal IMT. Details on the protocol and quality assurance have previously been published48. The measurements of IMT took place on average after 9.6 years following the baseline examination. The overlap between the individuals with protein measurements and the individuals with IMT measurements were 6328 individuals.
The Prospective investigation of ObEsity and Metabolism (POEM) study is a population-based cohort conducted between 2010 and 2016 in Uppsala, Sweden. The cohort is composed of 502 individuals all aged 50 years (50% women). Detailed description has been given in reference49. Participants information on sex, age, and race was self-reported. Information on gender and socioeconomic status was not collected. Plasma levels of 1463 proteins measured by the Olink proteomics assay Explore (OLINK, Uppsala, Sweden) were determined by the same PEA technique as in UK Biobank. Proteins detected in less than 75% of the individuals were not used in the analysis. The carotid artery was assessed by external B-mode ultrasound imaging (Acuson XP128 with a 10 MHz linear transducer, Mountain View, California, USA). IMT was evaluated in the far wall in the common carotid artery 1–2 cm proximal to the bulb. The images were digitized and imported into the AMS (Artery Measurement Software, Gothenburg, Sweden) automated software for dedicated analysis of IMT. A 10 mm segment with good image quality was chosen for IMT-analysis from the carotid artery. The value obtained was the mean of around 100 discrete measurements over the 10 mm segment. The given value for carotid artery IMT was the mean value from both sides50.
Selection of genetic instruments
Summary data of each risk factor were obtained from already published GWAS studies: For LDL and HDL, data from the Global Lipids Consortium were used51. Data from the GIANT consortium were used for BMI52. Data from the DIAGRAM consortium were used for diabetes53. For systolic blood pressure (SBP), own calculations using UK biobank were used. Only independent (in low linkage disequilibrium, R2 < 0.01) SNPs with p < 5e-8 were used as instrument.
We obtained the summary-level GWAS statistics from the UKB-Pharma Proteomics Project (PPP) which comprehensively mapped protein quantitative trait locus (pQTL) of 2923 proteins in 54,219 UK Biobank participants47. We only used the cis-pQTL SNPs for our MR analysis of proteins with various outcomes to minimize the likelihood of horizontal pleiotropy. The genetic instruments were curated from three sources: 1). The cis-pQTLs defined in the combined cohort; 2). If the sentinel SNP (cis-pQTL) is multiallelic in the combined cohort, the meta-analyzed statistics of the cis-pQTL SNP in the discovery and replication cohorts were used; 3). For a cis-pQTL SNP being multiallelic in both combined and discovery-replication cohorts, the SNP at the cis-region (defined as the 1 Mb pair window of the protein-coding gene) with the lowest p-value and being biallelic in the combined cohort was used as the instrument. Detailed definition of the combined and the discovery-replication cohorts can be found in the original publication47. Totally, 1676 cis-pQTL SNPs were included as genetic instruments. The minimum value of F-statistic for all instruments was 41.
Data source of outcome GWAS
Summary-level GWAS statistics of the ultrasound-measured carotid artery IMT (n = 71,128) were obtained from a GWAS of Europeans6.
Statistics
Observational analyses
The measured risk factors were transformed to the standard deviation (SD)-scale in order to be comparable with the PRS. Also IMT was transformed to the SD scale. Linear regression models between IMT and either measured levels of the risk factors (history of diabetes, systolic blood pressure, LDL and HDL cholesterol and BMI) or PRS for the risk factors (diabetes type 2, hypertension, LDL and HDL cholesterol and BMI) were applied. Included in these multiple models were also the covariates age, sex, smoking, Townsend deprivation index and race. P < 0.05 was regarded as significant in these multiple models.
Linear regression models between IMT and each of the 2923 available proteins (one by one) were applied in two sets of adjustment, including model 1 for age, sex and fasting time and model 2 for age, sex and fasting time plus smoking, Townsend deprivation index and race and the CVD risk factors diabetes, systolic blood pressure, LDL and HDL cholesterol, BMI, and eGFR. In order to be deemed significant, the estimate of proteins should show a false discovery rate (FDR) < 0.05 in model 1 and p < 0.05 in model 2. Measured levels of each of proteins of interest were inverse normalized rank transformed and were then associated with IMT in the POEM sample using linear regression models, including sex as confounder (age same in all subjects). P < 0.05 was considered significant in this validation analysis.
Genetics
Multivariable MR was performed with IMT as the outcome and diabetes, systolic blood pressure, LDL and HDL cholesterol and BMI as the exposures. The inverse-variance weighted (IVW) method was considered as the primary analysis, and MR Egger as a sensitivity test. P < 0.05 was considered to be significant for IVW, and we also demanded that the estimate of the MR Egger test to be in the same direction as the estimate of IVW. STATA 16.1 was used for the calculations of this part of the MR studies. We used a linear fixed-effects model (R package “metafor”)54 to generated the meta-analyzed statistics of the cis-pQTLs found in the discovery and replication cohorts where the weighted estimation with IVW was implemented. Since we only included the sentinel cis-pQTL SNP per protein, only the Wald-ratio method was used for the MR analysis. Multiple testing was addressed by the FDR controlling procedure and adjusted p-value (shortly as FDR) < 0.05 was regarded statistically significant. To examine if the hypothetic causal direction was oriented from proteins to IMT, the MR-Steiger test was used. For the protein-IMT associations found significant by MR, further sensitivity analyses employing multiple GWAS-significant (p-value < 5e-8) independent (in low linkage disequilibrium, R2 < 0.01) SNPs with F-statistic > 10 at the cis-locus (defined as the above) were performed including MR-IVW, MR-Egger, MR-Weighted median and MR-Weighted mode methods. Horizontal pleiotropy was examined based on the p-value of the MR-Egger intercept. Heterogeneity of the instruments was assessed by Cochrane’ Q value. Nominal p-value < 0.05 was regarded significant in the sensitivity analyses. All the MR analyses were conducted by using R package “TwoSampleMR”55.
To examine whether the significant associations of proteins with IMT found by MR were confounded by linkage disequilibrium, we furthered applied genetic colocalization for proteins of interest (FDR < 0.05) and IMT. We included the SNPs at the cis-region (as above) of each protein and defined a posterior probability (PP) of the hypothesis that two traits of interest are colocalized (H4) > 0.8 as strong support of colocalization whereas 0.5 < PP.H4 ≤ 0.8 as medium support. Three prior probabilities were defined as follows: p1 (a SNP is associated with trait 1) = 1e-4; p2 (a SNP is associated with trait 2) = 1e-4; For p12 (a SNP is associated with both traits), a more conservative value 5e-6 was used for improved robustness56. To relax the assumption by the traditional coloc method that at most one causal variant per trait, we further performed colocalization analysis in the framework of Sum of Single Effects (SuSiE) where applicable57. The sensitivity of the inference to variation of p12 was examined by visualization. The colocalization analyses were conducted by using R package “coloc”58.
We used the same genetic instruments of the proteins in the last step and evaluated the association of genetically predicted levels of proteins and risk factors of IMT by the MR-Wald ratio method and p-value < 0.05 was deemed significant.
To disentangle the mediate effect of proteins (RABEPK and BCAM) on IMT through SBP, we first employed multiple cis-SNPs of each protein (the same source as used for the MR sensitivity analyses of protein and IMT) as instruments to estimate the effect of proteins on IMT by univariable MR-IVW method (β1), which was regarded as the total effect. In the second step, using the same source of instruments as in the first step, the multivariable MR method was implemented to estimate the direct effect of proteins on IMT (β2), adjusted for SBP. The effect mediated by SBP was thus (β1- β2)/ β1. The mvmr function in the R package “TwoSampleMR” was used for the multivariable MR analysis.
Lookups of Genotype-Tissue Expression (GTEx) and druggability
For proteins of interest, we further looked up the gene expression levels in the human artery tissue at the GTEx Portal (https://gtexportal.org/home, dbGaP accession number phs000424.v8.p2). The tier of the proteins’ druggability was assigned according to the previous work by Finan et al.59 where Tier 1 as proteins targeted by approved drugs and drugs in clinical development; Tier 2 as proteins closely related to drug targets or with associated drug-like compounds; Tier 3 as extracellular proteins and members of key drug target families. Detailed drug development information was curated from DrugBank (https://go.drugbank.com/), https://clinicaltrials.gov/ and ChEMBL (https://www.ebi.ac.uk/chembl/) if applicable.
Data availability
I. Observational data were obtained from the UK Biobank. The UK Biobank and its data is an open research resourse available following submission of a research plan at https://www.ukbiobank.ac.uk. According to the Swedish law, individual health data cannot be made publicly available (POEM). Data supporting the findings of the present study are available for researchers at a reasonable request to L.L. (email: lars.lind@medsci.uu.se).
II. Summary-level GWAS data of
• proteins available at https://metabolomips.org/ukbbpgwas/.
• IMT available at https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000930.v6.p1.
Code availability
No customized code generated for analysis in this work. STATA 16.1 was used for the calculations based on the observational data. R (version 4.1.0) was used for the MR and colocalization analyses.
References
Spence, J. D. IMT is not atherosclerosis. Atherosclerosis 312, 117–118 (2020).
Inaba, Y., Chen, J. A. & Bergmann, S. R. Carotid plaque, compared with carotid intima-media thickness, more accurately predicts coronary artery disease events: a meta-analysis. Atherosclerosis 220, 128–133 (2012).
Den Ruijter, H. M. et al. Common carotid intima-media thickness measurements in cardiovascular risk prediction: a meta-analysis. JAMA 308, 796–803 (2012).
Lorenz, M. W., Markus, H. S., Bots, M. L., Rosvall, M. & Sitzer, M. Prediction of clinical cardiovascular events with carotid intima-media thickness: a systematic review and meta-analysis. Circulation 115, 459–467 (2007).
O’Leary, D. H. et al. Carotid-artery intima and media thickness as a risk factor for myocardial infarction and stroke in older adults. Cardiovascular Health Study Collaborative Res. Group. N. Engl. J. Med. 340, 14–22 (1999).
Franceschini, N. et al. GWAS and colocalization analyses implicate carotid intima-media thickness and carotid plaque loci in cardiovascular outcomes. Nat. Commun. 9, 5141 (2018).
Lind, L., Andersson, J., Ronn, M. & Gustavsson, T. The echogenecity of the intima-media complex in the common carotid artery is closely related to the echogenecity in plaques. Atherosclerosis 195, 411–414 (2007).
Fall, T. et al. The role of adiposity in cardiometabolic traits: a Mendelian randomization analysis. PLoS Med. 10, e1001474 (2013).
Hagg, S. et al. Adiposity as a cause of cardiovascular disease: a Mendelian randomization study. Int J. Epidemiol. 44, 578–586 (2015).
Allara, E. et al. Genetic Determinants of Lipids and Cardiovascular Disease Outcomes: A Wide-Angled Mendelian Randomization Investigation. Circ. Genom. Precis Med. 12, e002711 (2019).
Holmes, M. V. et al. Mendelian randomization of blood lipids for coronary heart disease. Eur. Heart J. 36, 539–550 (2015).
White, J. et al. Association of Lipid Fractions With Risks for Coronary Artery Disease and Diabetes. JAMA Cardiol. 1, 692–699 (2016).
Gan, W. et al. Genetic Predisposition to Type 2 Diabetes and Risk of Subclinical Atherosclerosis and Cardiovascular Diseases Among 160,000 Chinese Adults. Diabetes 68, 2155–2164 (2019).
Leong, A. et al. Mendelian Randomization Analysis of Hemoglobin A1c as a Risk Factor for Coronary Artery Disease. Diab Care 42, 1202–1208 (2019).
Ross, S. et al. Mendelian randomization analysis supports the causal role of dysglycaemia and diabetes in the risk of coronary artery disease. Eur. Heart J. 36, 1454–1462 (2015).
Lind, L., Ingelsson, M., Sundstrom, J. & Arnlov, J. Impact of risk factors for major cardiovascular diseases: a comparison of life-time observational and Mendelian randomisation findings. Open Heart 8. https://doi.org/10.1136/openhrt-2021-001735 (2021).
Kivimaki, M. et al. Conventional and Mendelian randomization analyses suggest no association between lipoprotein(a) and early atherosclerosis: the Young Finns Study. Int J. Epidemiol. 40, 470–478 (2011).
Cupido, A. J., Asselbergs, F. W., Schmidt, A. F. & Hovingh, G. K. Low-Density Lipoprotein Cholesterol Attributable Cardiovascular Disease Risk Is Sex Specific. J. Am. Heart Assoc. 11, e024248 (2022).
Soremekun, O. et al. Genetically Predicted Lipid Traits, Diabetes Liability, and Carotid Intima-Media Thickness in African Ancestry Individuals: A Mendelian Randomization Study. Circ. Genom. Precis Med. 15, e003910 (2022).
Schmidt, A. F., et al. Biomedical consequences of elevated cholesterol-containing lipoproteins and apolipoproteins on cardiovascular and non-cardiovascular outcomes. Commun. Med. (Lond.) 3, 9 (2023).
Shah, S. et al. Causal relevance of blood lipid fractions in the development of carotid atherosclerosis: Mendelian randomization analysis. Circ. Cardiovasc Genet 6, 63–72 (2013).
Schuermans, A. et al. Integrative proteomic analyses across common cardiac diseases yield mechanistic insights and enhanced prediction. Nat. Cardiovasc Res 3, 1516–1530 (2024).
Lind, L., Mazidi, M., Clarke, R., Bennett, D. A. & Zheng, R. Measured and genetically predicted protein levels and cardiovascular diseases in UK Biobank and China Kadoorie Biobank. Nat. Cardiovasc Res 3, 1189–1198 (2024).
Zhang, X. et al. Associations of 2923 plasma proteins with incident inflammatory bowel disease in a prospective cohort study and genetic analysis. Nat. Commun. 16, 2813 (2025).
Lind, L. et al. Plasma Protein Profile of Carotid Artery Atherosclerosis and Atherosclerotic Outcomes: Meta-Analyses and Mendelian Randomization Analyses. Arterioscler Thromb. Vasc. Biol. 41, 1777–1788 (2021).
Armitage, J., Holmes, M. V. & Preiss, D. Cholesteryl Ester Transfer Protein Inhibition for Preventing Cardiovascular Events: JACC Review Topic of the Week. J. Am. Coll. Cardiol. 73, 477–487 (2019).
Zhang, J., Jiang, Z. & Shi, A. Rab GTPases: The principal players in crafting the regulatory landscape of endosomal trafficking. Comput Struct. Biotechnol. J. 20, 4464–4472 (2022).
Huang, J. et al. Genomics and phenomics of body mass index reveals a complex disease network. Nat. Commun. 13, 7973 (2022).
Nah, J. The Role of Alternative Mitophagy in Heart Disease. Int J Mol Sci 24. https://doi.org/10.3390/ijms24076362 (2023).
Wautier, J. L. & Wautier, M. P. Cellular and Molecular Aspects of Blood Cell-Endothelium Interactions in Vascular Disorders. Int J Mol Sci 21. https://doi.org/10.3390/ijms21155315 (2020).
Jansen, I. E. et al. Genome-wide meta-analysis identifies new loci and functional pathways influencing Alzheimer’s disease risk. Nat. Genet 51, 404–413 (2019).
Klarin, D. et al. Genetics of blood lipids among ~300,000 multi-ethnic participants of the Million Veteran Program. Nat. Genet 50, 1514–1523 (2018).
Talusan, P. et al. Analysis of intimal proteoglycans in atherosclerosis-prone and atherosclerosis-resistant human arteries by mass spectrometry. Mol. Cell Proteom. 4, 1350–1357 (2005).
Novikova, O. A., Laktionov, P. P. & Karpenko, A. A. Mechanisms Underlying Atheroma Induction: The Roles of Mechanotransduction, Vascular Wall Cells, and Blood Cells. Ann. Vasc. Surg. 53, 224–233 (2018).
Weedon, M. N. et al. Genome-wide association analysis identifies 20 loci that influence adult height. Nat. Genet 40, 575–583 (2008).
Shungin, D. et al. New genetic loci link adipose and insulin biology to body fat distribution. Nature 518, 187–196 (2015).
Tang, X. et al. FN1 promotes prognosis and radioresistance in head and neck squamous cell carcinoma: From radioresistant HNSCC cell line to integrated bioinformatics methods. Front Genet 13, 1017762 (2022).
Magnusson, M. K. & Mosher, D. F. Fibronectin: structure, assembly, and cardiovascular implications. Arterioscler Thromb. Vasc. Biol. 18, 1363–1370 (1998).
Peng, L., Deng, H., Li, J., Lu, G. & Zhai, Y. S. Plasma Fibronectin as a Novel Predictor of Coronary Heart Disease: A Retrospective Study. J. Cardiovasc. Dev. Dis.10. https://doi.org/10.3390/jcdd10100415 (2023).
van der Harst, P. & Verweij, N. Identification of 64 Novel Genetic Loci Provides an Expanded View on the Genetic Architecture of Coronary Artery Disease. Circ. Res 122, 433–443 (2018).
Richmond, R. C. & Davey Smith, G. Mendelian Randomization: Concepts and Scope. Cold Spring Harb. Perspect. Med. 12. https://doi.org/10.1101/cshperspect.a040501 (2022).
Zhao, S. S., Mackie, S. L. & Zheng, J. Why clinicians should know about Mendelian randomization. Rheumatol. (Oxf.) 60, 1577–1579 (2021).
Welsh, P. et al. Reference Ranges for NT-proBNP (N-Terminal Pro-B-Type Natriuretic Peptide) and Risk Factors for Higher NT-proBNP Concentrations in a Large General Population Cohort. Circ. Heart Fail 15, e009427 (2022).
Tschiderer, L. et al. Association of Intima-Media Thickness Measured at the Common Carotid Artery With Incident Carotid Plaque: Individual Participant Data Meta-Analysis of 20 Prospective Studies. J. Am. Heart Assoc.12. https://doi.org/10.1161/Jaha.122.027657 (2023).
Inker, L. A. et al. New Creatinine- and Cystatin C-Based Equations to Estimate GFR without Race. N. Engl. J. Med 385, 1737–1749 (2021).
Thompson, D. J. et al. UK Biobank release and systematic evaluation of optimised polygenic risk scores for 53 diseases and quantitative traits. medRxiv, 2022.2006.2016.22276246. https://doi.org/10.1101/2022.06.16.22276246 (2022).
Sun, B. B. et al. Plasma proteomic associations with genetics and health in the UK Biobank. Nature 622, 329–338 (2023).
Coffey, S. et al. Protocol and quality assurance for carotid imaging in 100,000 participants of UK Biobank: development and assessment. Eur. J. Prev. Cardiol. 24, 1799–1806 (2017).
Lind, L., Strand, R., Michaelsson, K., Ahlstrom, H. & Kullberg, J. Voxel-wise Study of Cohort Associations in Whole-Body MRI: Application in Metabolic Syndrome and Its Components. Radiology 294, 559–567 (2020).
Zheng, R. & Lind, L. A combined observational and Mendelian randomization investigation reveals NMR-measured analytes to be risk factors of major cardiovascular diseases. Sci. Rep. 14, 10645 (2024).
Discovery and refinement of loci associated with lipid levels. Nat. Genet. 45, 1274–1283. https://doi.org/10.1038/ng.2797 (2013).
Locke, A. E. et al. Genetic studies of body mass index yield new insights for obesity biology. Nature 518, 197–206 (2015).
Scott, R. A. et al. An Expanded Genome-Wide Association Study of Type 2 Diabetes in Europeans. Diabetes 66, 2888–2902 (2017).
Viechtbauer, W. Conducting Meta-Analyses in R with the metafor Package. J. Stat. Softw. 36, 1–48 (2010).
Hemani, G. et al. The MR-Base platform supports systematic causal inference across the human phenome. Elife 7. https://doi.org/10.7554/eLife.34408 (2018).
Wallace, C. Eliciting priors and relaxing the single causal variant assumption in colocalisation analyses. Plos Genet. 16, e1008720 (2020).
Wallace, C. A more accurate method for colocalisation analysis allowing for multiple causal variants. Plos Genet. 17, e1009440 (2021).
Giambartolomei, C. et al. Bayesian Test for Colocalisation between Pairs of Genetic Association Studies Using Summary Statistics. Plos Genet. 10, e1004383 (2014).
Finan, C. et al. The druggable genome and support for target identification and validation in drug development. Sci. Transl. Med. 9, eaag1166 (2017).
Acknowledgements
This research has been conducted using the UK Biobank resource under Application Number 90143. We acknowledge the investigators and participants of the UK Biobank study. POEM was supported by grants from Uppsala University hospital (to L.L.). The funder had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Funding
Open access funding provided by Uppsala University.
Author information
Authors and Affiliations
Contributions
L.L. and R.Z. conceived and designed the project and contributed to acquisition of the data. R.Z. and L.L. contributed to analysis or interpretation of the data. L.L. wrote the main manuscript text. Both authors contributed to critical revision of the manuscript, read and approved the final version for publication.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Lind, L., Zheng, R. Associations between carotid artery intima-media thickness, traditional risk factors and proteins. npj Cardiovasc Health 2, 35 (2025). https://doi.org/10.1038/s44325-025-00073-7
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s44325-025-00073-7








