Introduction

Lipoprotein(a) is synthesized by the covalent linkage of apolipoprotein A to apolipoprotein B via disulfide bonds, and its concentration demonstrates an inverse association with the size of apolipoprotein A. Moreover, this concentration is predominantly influenced by genetic factors at a level of 90%1. The significance of Lp(a) as a risk factor for cardiovascular diseases2,3, degenerative aortic stenosis4, atrial fibrillation5 and heart failure6 has gained recognition. Lp(a) has a tendency to undergo oxidative alterations and produce oxidized phospholipids (OxPLs). Lp(a) carries more than 80% OxPLs in its particles, and OxPLs induce inflammatory responses by increasing secretion of inflammatory cytokines by macrophages, such as IL-1β, TNF-α and IL-6, multiplying the inflammatory effect7,8. Lipoprotein(a) also carries monocyte chemoattractant protein-1 (MCP-1), a key chemokine in the initiation and progression of vascular inflammationm, and Lp(a)-associated MCP-1 enhances recruitment of monocytes to the vascular wall9. On the other hand, despite the genetic determination of Lp(a) levels, several studies have demonstrated that chronic inflammation disrupts Lp(a) expression and elevates plasma levels of Lp(a)10,11. Collectively, the available evidence suggests a reciprocal association between Lp(a) and inflammation.

Immune-mediated inflammatory diseases encompass a diverse array of pathological conditions, such as celiac disease, Crohn’s disease and inflammatory bowel disease. Multiple observational studies have consistently indicated increased concentrations of Lp(a) in individuals diagnosed with different inflammatory disorders, such as rheumatoid arthritis, systemic lupus erythematosus, and acquired immunodeficiency syndrome12,13,14,15,16,17,18,19,20. However, the majority of clinical investigations are limited in scope and rely on an observational design. While epidemiological studies indicate potential links between Lp(a) and autoimmune disorders, the underlying causality of these connections remains uncertain. The presence of unmeasured covariates and reverse causation in these studies introduces bias, making it challenging to establish a definitive causal relationship. Nonetheless, exploring the potential causal linkage between Lp(a) and autoimmune disorders could offer valuable insights into specific biological pathways and contribute to the development of preventive strategies.

Mendelian randomization is a methodologically robust approach to establish causal associations between exposures and outcomes in epidemiological studies. In MR analyses, genetic variants, predominantly single nucleotide polymorphisms (SNPs), serve as instrumental variables (IVs) for putative risk factors. The principle of MR is grounded in Mendel’s second law, which posits that during gametic formation, gene alleles segregate independently when DNA is transmitted from parent to offspring. As these variants are randomly assigned during conception, MR has the potential to mitigate bias arising from environmental confounders when conducted appropriately21,22,23. In this study, we employed bidirectional and multivariable MR approaches to investigate the causal association between immune-mediated inflammatory diseases and lipoprotein(a), employing summary statistics obtained from GWAS conducted on European populations for both characteristics. Our study seeks to offer fresh perspectives and empirical data regarding the correlation between lipoprotein(a) and IMIDs.

Methods

Study design

To ascertain the causal direction between lipoprotein(a) and immune-mediated inflammatory diseases, we conducted univariable and multivariable MR analyses using GWAS datasets24. Firstly, we conducted univariable Mendelian randomization (UVMR) analyses to establish a bidirectional causal link between Lp(a) and IMIDs. Additionally, multivariable Mendelian randomization analyses25 were performed on lipid traits (Lp(a), HDL-C, LDL-C, TG) for autoimmune diseases to assess the independent association of Lp(a) with autoimmune diseases. The overall design of this study is depicted in Fig. 1 through a comprehensive flow chart. The assumptions outlined below were applied to all MR analyses and are collectively described for both the UVMR and MVMR approaches. We adopted three fundamental hypotheses of classical MR analysis, as follows: (1) IVs exhibit a direct relationship with exposure. (2) Confounding variables do not affect the independence of IVs. (3) IVs solely influence the outcomes through exposure22,26. We didn’t pre-register any study protocol or details.

Fig. 1
Fig. 1
Full size image

Diagram of the univariable and multivariable Mendelian randomization study for the association between lipoprotein(a) and risk of IMIDs.

Data sources

To perform our MR analyses, we used summary-level data from the publicly available GWAS for each trait27,28,29,30,31,32,33,34. Genetic IVs for HDL-C, LDL-C and TG were obtained through genome-wide association studies conducted in the UK Biobank35. GWAS data for exposure and outcomes were obtained from different databases to ensure minimal overlap. All study participants were of European descent, avoiding racial differences. The original publications provide comprehensive information regarding recruitment procedures and diagnostic criteria. The detail information of used GWAS datasets was listed in Table 1.

Table 1 Data sources used to identify genetic variants in this study.

Instrumental variable selection

Initially, we identified single nucleotide polymorphism associated with each trait using a threshold of p = 5 × 10 − 8 based on the comprehensive summary-level GWAS statistics. Subsequently, to ensure the independence among the SNPs, a strict linkage disequilibrium (LD) threshold of r2 = 0.001 was applied when clustering IVs within 10 Mb. We ensured the effect estimates were standardized for exposure and outcome, while excluding any alleles that could potentially cause incompatibility or palindromic SNPs. For consistency, we used only SNPs that were tested for the trait as IVs and did not use proxies to replace SNPs that were missing in the outcome data. Additionally, we eliminated any SNPs linked to the confounding variable affecting the result using the PhenoScanner. For Lp(a) as the exposure and IMIDs as the outcomes, we regard the C-reactive protein levels (CRP) as the confounding variable. For IMIDs as the exposures and Lp(a) as the outcome, we regard the fat content, blood lipid and the coronary disease as the confounding variables. We employed F statistics (beta2/se2)36 to evaluate the robustness of genetically determined instrumental variables, with a threshold of F > 10 in accordance with the first assumption of Mendelian randomization and to avoid bias towards weak IVs37,38.

Mendelian randomization analysis

We utilized the statistical software R (V4.4.1, http://www.r-project.org) and employed the TwoSampleMR (http://gittub.com/MRCIEU/TwoSampleMR) and MR-PRESSO (http://gittub.com/rondolab/MR-PRESSO) packages for conducting all analyses39. Multiple MR methods, including inverse variance weighting (IVW)40, weighted median (WM)41, MR-Egger42, and MR-pleiotropy residual sum and outlier (MR-PRESSO)42 were utilized for deducing causal connections between lipoprotein(a) and IMIDs. The IVW method was primarily employed for fundamental causal estimates, which would provide the most precise results when all selected SNPs were valid IVs. The IVW method calculates a weighted average of Wald ratio estimates. Under the assumption of Instrument Strength Independent of Direct Effect (InSIDE), the MR-Egger regression executes a weighted linear regression and yields a consistent causal estimate, even though the genetic IVs are all invalid42. However, it exhibits low precision and is susceptible to outlying genetic variants. Additionally, we utilized the weighted median technique, which computes the midpoint of the weighted approximations and ensures consistent effects even in scenarios where 50% of instrumental variables exhibit pleiotropy. The Weighted Median regression method, which does not demand the InSIDE hypothesis, calculates a weighted median of the Wald ratio estimates and is robust to horizontal pleiotropic bias41. It is confirmed that the Weighted Median method has some advantages over the MR-Egger regression, as it provides lower type I error and higher causal estimate power. MR-PRESSO identifies and eliminates outliers in IVW linear regression to offer refined MR estimations. To evaluate the potential for horizontal pleiotropy of the SNPs, we used MR Egger regression. In addition, we performed a sensitivity analysis using the “leave-one-out” method to detect any SNPs that may have a significant impact. In this method, every SNP was methodically eliminated and its impact on the correlation was evaluated43. The heterogeneity of selected SNPs was assessed using the Cochrane’s Q test (P < 0.05). In instances where significant heterogeneity was observed, we employed the random effects IVW test to obtain more cautious and reliable estimates42. The mr_funnel_plot function was utilized to generate funnel plots for visualizing the heterogeneity of IVs. We additionally estimated FDR corrected P values for the multivariable analyses to adjust for the multiple tests performed on each exposure.

Results

Instrumental variables

For lipoprotein(a) as the exposure, IVs were chosen as SNPs linked to lipoprotein(a) (4 SNPs for CeD, 8 SNPs for CD, 7 SNPs for IBD, 52 SNPs for MS, 65 SNPs for Pso, 46 SNPs for RA, 57 SNPs for SLE, 77 SNPs for T1D, 8 SNPs for UC). For lipoprotein(a) as the outcome, IVs were chosen as SNPs linked to IMIDs (10 SNPs for CeD, 44 SNPs for CD, 40 SNPs for IBD,12 SNPs for MS, 6 SNPs for Pso, 15 SNPs for RA, 9 SNPs for SLE, 21 SNPs for T1D, 29 SNPs for UC). The absence of weak instrument bias was indicated by all F-statistics > 10. The comprehensive details regarding the instrumental variables can be found in Table S1S3.

Causal estimates of genetic susceptibility to lipoprotein(a) and IMIDs risk

The findings from the MR analysis exploring the causal association between lipoprotein(a) and nine IMIDs traits are depicted in Fig. 2. Lipoprotein(a) exhibited no causal association with CeD (OR = 0.797, 95% CI 0.498–1.274), CD (OR = 1.231, 95% CI 0.519–2.920), IBD (OR = 1.026, 95% CI 0.804–1.309), MS (OR = 1.007, 95% CI 0.822–1.232), Pso (OR = 1.059, 95% CI 0.910–1.231), RA (OR = 0.936, 95% CI 0.782–1.120), SLE (OR = 1.039, 95% CI 0.801–1.349), T1D (OR = 1.065, 95% CI 0.931–1.218), and UC (OR = 1.016, 95% CI 0.707–1.461). This discovery aligns with the outcomes derived from alternative MR techniques, including MR Egger and weighted median.

Fig. 2
Fig. 2
Full size image

MR Estimates from Mendelian randomization analysis of lipoprotein(a) and risk of IMIDs.

There was noticeable heterogeneity observed in our instrumental variables for Lp(a) in relation to CD (Q P.val = 7.65E−11), SLE (Q P.val = 3.66E−5), MS (Q P.val = 0.0001), Pso (Q P.val = 0.0043) and T1D (Q P.val = 0.0051) as outcome (Table S4). We observed that all MR-Egger regression intercepts were not significantly different from zero, indicating no indication of horizontal pleiotropy between the Lp(a) instrumental variables and IMIDs (intercept p > 0.05), except for T1D where a marginal deviation was found (intercept p = 0.021) (Table S4). The MR-PRESSO analysis revealed significant horizontal pleiotropy in certain analysis. Nevertheless, the causal estimates of Lp(a) with MS, Pso, SLE, and T1D remained consistent even after conducting outlier-corrected analyses (Table S5).

In addition, the sensitivity analysis plots indicated that no individual SNP was expected to have a substantial impact on the causal relationship, thus affirming the reliability of our findings (Fig. S1S4). Taken collectively, these findings provide compelling evidence supporting the absence of a causal association between Lp(a) and IMIDs.

Fig. 3
Fig. 3
Full size image

MR Estimates from Mendelian randomization analysis of IMIDs and risk of lipoprotein(a).

Causal estimates of genetic susceptibility to IMIDs and lipoprotein(a) levels

Furthermore, conducting reverse studies investigating the association between exposure to the risk of 9 IBIDs and the outcome of Lp(a) levels, we found no significant association between CeD (OR = 0.998, 95% CI 0.995–1.002), CD (OR = 1.000, 95% CI 0.991–1.008), IBD (OR = 0.995, 95% CI 0.987–1.003), MS (OR = 0.992, 95% CI 0.983–1.001), Pso (OR = 1.003, 95% CI 0.989–1.018), RA (OR = 1.003, 95% CI 0.998–1.009), SLE (OR = 0.998, 95% CI 0.993–1.004), T1D (OR = 0.998, 95% CI 0.994–1.002), UC (OR = 1.004, 95% CI 0.994–1.013) and Lp(a) in the IVW analysis results (Fig. 3). The outcomes obtained from each of the three MR techniques exhibited concurrence. There was noticeable diversity observed in our instrumental variables for CD (Q P.val = 1.60E−6), and UC (Q P.val = 0.0024 (Table S6). The presence of imbalanced horizontal pleiotropy was not indicated by the MR-Egger intercept, as it exhibited a central tendency around zero in all MR analyses (Table S6). Although in certain analyses, MR-PRESSO revealed the presence of substantial horizontal pleiotropy, the causal estimates of Lp(a) with UC and CD remained robust after outlier-corrected analyses (Table S7). Additionally, the sensitivity analysis plots, which employed a leave-one-out approach, indicated that the individual impact of each SNP on the causal association was not significant. This finding further strengthens our conclusions (Figs. S5S8).

Multivariable MR

To account for potential pleiotropic pathways arising from the relationship between different lipid traits, we employed a multivariable Mendelian randomization model incorporating Lp(a), HDL-C, LDL-C, and TG as joint exposures for each IMIDs outcome. Following adjustment for HDL-C, LDL-C, and TG, genetically elevated Lp(a) showed no causal association with the onset of IMIDs, consistent with the findings of univariable MR analysis. Additionally, The association between genetically predicted HDL-C and type 1 diabetes remained marginally significant even after adjusting for multiple lipid traits (ORMVMR = 0.80, 95% CI 0.68–0.95; adjust P = 0.043). Furthermore, no significant associations were observed between other lipid traits and the IMID diseases of concern (Table S8).

Discussion

The correlation between levels of lipoprotein(a) and inflammatory conditions has garnered increasing attention. However, to our knowledge, this study represents the first systematic exploration of potential causal relationships between lipoprotein(a) levels and IMIDs using MR methods. Our results didn’t observe a causal relationship between genetic susceptibility to lipoprotein(a) and the risk of IMIDs. To differentiate between a genuine adverse outcome and the lack of validity in the MR studies, we conducted various sensitivity analyses to ensure adherence to the three MR assumptions. Taking into account the consistency of our MR findings across these diverse methodologies, we possess confidence regarding the veracity of our MR analyses.

Although most studies have suggested that Lp(a) is associated with inflammatory levels, whether a causal relationship exists between lipoprotein(a) and autoimmune diseases remains undetermined. Numerous observational studies have consistently reported elevated levels of Lp(a) in active autoimmune diseases, including rheumatoid arthritis14,44 and systemic lupus erythematosus45. However, Holm et al. performed a cross-sectional observational investigation and found no statistically significant disparities in serum Lp(a) levels among individuals with coronary artery disease, regardless of the presence or absence of inflammatory rheumatic disease46. Regarding the underlying mechanisms behind most of these observations, several investigations have indicated that Lp(a) itself may promote inflammation in diverse cellular populations, encompassing endothelial cells, monocytes and macrophages, through its association with oxidized phospholipids, mediating the increased cardiovascular disease risk17,47,48. In addition to the oxidation of lipoprotein (a), its glycosylated form, such as beta2-GPI-Lp(a), was detected in patients with rheumatoid arthritis49 and nephrotic syndrome50. Furthermore, elevated levels of Lp(a) are correlated with increased concentrations of acute phase proteins. In patients with rheumatoid arthritis, Lp(a) levels are positively associated with elevated C-reactive protein and erythrocyte sedimentation rate, indicating its significant role in the inflammatory cascade during the acute phase14. The formation of antibodies against Lp(a), which be related to its oxidation and glycosylation, appears to be triggered by autoimmune disease. Anti-malondialdehyde (MDA)-Lp(a) was detected in patients with antiphospholipid syndrome51. Some studies suggested that autoimmune processes may occur particularly in individuals with inherited high Lp(a) levels and certain HLA Class II genotypes, triggered by concurrent infections52.

Our findings suggest that there is no evidence from Mendelian randomization studies supporting a causal relationship between lipoprotein(a) levels and the risk of immune-mediated inflammatory diseases. The inconsistencies between our findings and previous observational studies that have reported a causal relationship can be attributed to confounding factors or the presence of reverse causality. Traditional observational research is susceptible to clinical confounding factors. The unmeasured variables associated with both the exposure and outcome lead to biased estimates of the effect of the exposure on the outcome. which can impact both exposure and outcome variables, thereby weakening the ability of observational studies to accurately establish causality. Second, these studies may be affected by reverse causality, where the outcome influences the exposure (rather than vice versa)53. Consequently, even if an observational study reports a strong correlation, it cannot definitively prove the existence of a direct causal association and the direction of relationship.

A considerable body of research indicates that conditions associated with abnormal blood lipid composition may influence immune-related diseases, and vice versa. In our analysis, after adjusting for HDL-C, LDL-C, and TG as covariates, only marginally statistical significance was observed between HDL-C and T1D. Here’s the possible explanation that the matter of lipids is complex as the composition and distribution of different kinds of lipids described in distinct organism species, organs, tissues, cells, and even cellular organelles is highly variable. In the future, by including more lipid characteristics and analyzing their relationship with inflammation-related diseases through lipidomics and other methods, it will provide more understanding of the relationship between lipid disorders and inflammation-related diseases.

Mendelian randomization overcomes this limitation by utilizing genetic instrumental variables to mitigate the influence of confounding factors and provide a relatively accurate assessment of causality. With the growing availability of large genetic data sets, MR has become a powerful and accessible tool for studying the risk factors for diseases. However, similar to other observational study designs, Mendelian Randomization possesses inherent limitations. Besides potential violations of the core MR assumptions, additional sources of bias may have influenced the study outcomes. Weak instruments, which are not strongly associated with the exposure of interest, can lead to biased MR estimates. Genetic instruments may be correlated with variants that are associated with the outcome of interest due to linkage disequilibrium, violating the MR assumptions54. In addition, diverse populations have been underrepresented in genomics research. Overall, while MR alone can never prove a causal relationship beyond reasonable doubt, MR offers a rigorous approach for investigating possible causal relationships in observational data and has the potential to transform our understanding of the etiology and treatment of diseases.

We would like to emphasize several strengths of our study and acknowledge certain limitations. Firstly, this study represents the inaugural investigation aimed at evaluating the bidirectional causal association between lipoprotein(a) and immune-mediated inflammatory diseases using a two sample Mendelian randomization approach. This methodology offers enhanced resilience against confounding factors, reverse causation, and non-differential exposures compared to observational studies. Secondly, we meticulously selected a diverse range of relatively prevalent autoimmune diseases along with their associated genome-wide association study data comprising large sample sizes. Lastly, we performed a sensitivity analysis to maintain the coherence of causal estimation and verify the reliability of our results. Our study also has several limitations. First and foremost, it is crucial to acknowledge that this research was carried out specifically within a European demographic. Consequently, prudence must be exercised when extrapolating our discoveries to alternative populations. Additionally, it is worth considering the possibility of other autoimmune diseases associated with lipoprotein(a) that were not encompassed within the scope of our analysis. Additionally, some of our Mendelian randomization analyses had inadequate statistical power for detecting minor impacts owing to the restricted variability explained by the single nucleotide polymorphism instruments or the relatively small sample sizes in the GWAS for outcome traits. In this regard, the exclusion of ambiguous or palindromic SNPs may have potentially compromised the statistical power of our MR study. The implementation of larger genome-wide association studies on autoimmune traits will markedly augment the statistical power of subsequent MR studies intended to detect and establish associations between these traits and potential risk factors or comorbidities.

Conclusion

Our investigate doesn’t observe the presence of a bilateral causal link between lipoprotein(a) levels and the susceptibility to immune-mediated inflammatory diseases in Europeans. This suggests that the observed associations could be attributed to shared genetic factors or confounding environmental influences. Future studies, especially those using other MR techniques or experimental models, could further explore the relationship and potential mechanism.