Introduction

Type 2 diabetes (T2D) is a common chronic condition that is known to increase risk of macro- and micro- vascular atherosclerotic diseases1,2,3,4. Over the last 30 years, prevalence and incidence of T2D has increased markedly5, and the age at which it is first diagnosed has decreased6. These changes are largely thought to be driven by the global obesity epidemic7, including increased testing for T2D in people who are obese. Whilst higher body mass index (BMI) is an established causal risk factor for T2D8, other risk factors have been proposed.

Both smoking and alcohol have been suggested as potential risk factors that may causally affect T2D. Observationally, smoking is associated with higher risk, with heavier smokers having the greatest risk9,10,11,12. Additionally, former smokers seem to have a higher risk for T2D compared to never smokers, with risk lowering over time since they quit11,13. For alcohol consumption, there is a long history of observational studies suggesting a J-shaped association with cardiovascular diseases14,15,16, with some studies finding evidence of a similar pattern with T2D17,18. This slightly higher risk among those who report no alcohol consumption may be an artefact, for example due to misreporting, or because some people stop drinking (or never start) for health reasons12,19. Despite this apparently higher risk at lower levels, across most of the distribution, higher alcohol consumption is associated with higher risk for T2D. The associations of smoking and alcohol with T2D might be causal or they might be influenced by confounding due to limited adjustment for socioeconomic position and related factors or only partially accounting for these in previous studies. In this study we aim to determine whether the relationships between smoking and alcohol and T2D are causal.

Mendelian randomisation (MR) is a causal inference method, which commonly uses genetic variants, typically single nucleotide polymorphisms (SNPs), as instrumental variables (IVs) for the exposure of interest. MR is less prone to confounding by socioeconomic, environmental and behavioural characteristics, or reverse causation than conventional observational studies20. It can be biased by violation of the core assumptions underlying MR (Box 1).

Three previous two-sample MR studies have found evidence for a potential causal effect of smoking initiation21 and lifetime smoking22,23 on increased risk of T2D. The smoking initiation study used data from a genome wide association study (GWAS) of diabetes with 74,124 cases and 824,006 controls and focused on smoking initiation, which does not capture other important smoking related behaviours, like smoking duration and intensity. One of the lifetime smoking studies used data from the same diabetes GWAS and was an atlas study that examined many outcomes, and therefore did not focus on this relationship, nor did it consider continuous traits related to diabetes. The other lifetime smoking study included T2D as well as continuous glycaemic outcomes and found no effects on the glycaemic outcomes. However, there are now more recent and larger GWAS available for all of these outcomes, increasing the power to detect effects.

We identified three previous MR studies examining alcohol and T2D. The first reported no effect. It used one SNP from the alcohol dehydrogenase 1B gene which encodes an enzyme involved in alcohol metabolism (i.e., it directly influences the amount of alcohol consumed in those who have ever drunk alcohol), in a one-sample MR approach within 261,991 adults of European ancestry (with 14,549 cases)24. This is a useful approach as there is unlikely to be bias due to horizontal pleiotropy. Therefore, further exploration, using two-sample MR and exploring underlying continuous traits may strengthen the conclusion of no effect, if results were consistent. The second study also used a one-sample MR approach and a genetic risk score for alcohol within UK Biobank (UKBB) (N = 408,540 with 33,656 cases) and found a potential causal effect of higher alcohol intake on T2D risk, with the strongest effects found in heavier drinkers in analyses stratified by alcohol intake25. The third study included both one-sample MR and two-sample MR analyses and used two genetic instruments, one for a single genetic variant and one for multiple genetic variants. They also included glycated haemoglobin (HbA1c), a continuous glycaemic trait, as an outcome. However, there are more recent and larger GWAS available now and the two-sample MR analyses with the genetic instrument including multiple genetic variants only included one method and no sensitivity analyses26.

The aim of this study was to use MR to explore the effects of lifetime smoking (capturing smoking status, duration and heaviness) and alcohol consumption (capturing the number of alcoholic drinks consumed on average per week) on T2D risk and underlying glycaemic traits (fasting glucose, fasting insulin and HbA1c). This adds to previous studies by exploring both T2D and related continuous traits, exploring effects of both smoking and alcohol behaviours in the same study, undertaking more sensitivity analyses to test genetic instrument validity, and the influence of unbalanced horizontal pleiotropy on our results, and comparing results from our main two-sample MR with those from one-sample MR, where this was possible, and data was available. We also use more recent and larger GWAS as genetic instruments for the outcomes compared to previous studies. In this study we used lifetime smoking as one exposure as this can be applied to non-smokers (unlike smoking heaviness) too and allows for a richer phenotype incorporating a range of smoking behaviours. We used alcohol consumption as our other exposure (i.e., average number of drinks participants reported consuming each week across all types of alcohol) to capture drinking over the whole distribution.

Results

Two-sample Mendelian randomisation with lifetime smoking as the exposure

We conducted two-sample Mendelian randomisation (MR) analyses to examine potential causal effects of lifetime smoking index (LSI) on T2D and the continuous glycaemic traits of fasting glucose, fasting insulin and HbA1c. We used the largest available European ancestry GWAS of T2D (from the Million Veteran Program, DIAMANTE and Biobank Japan)27 and for fasting glucose28, fasting insulin28 and glycated haemoglobin (HbA1c)28 (all using data from the Meta-Analyses of Glucose and Insulin-related traits Consortium [MAGIC]) as outcomes in these analyses (see Methods for further details). For these analyses we selected genetic instruments from GWAS data for the exposure and identified these single nucleotide polymorphisms (SNPs) in the GWAS data for the outcomes as described in the Methods.

Mean F-statistics for LSI were all 44.27 (Supplementary Table S1), indicating instruments were not weak. Figures 1 and 2 and Supplementary Table S2 provide the results from the main inverse-variance weighted (IVW) method and all sensitivity analyses of the effects of LSI on outcomes. For T2D (see Fig. 1), the main IVW result suggested a causal effect of higher LSI on T2D risk (OR per 1SD higher LSI = 1.42, 95% CI = 1.22 to 1.64). Details of the sensitivity analyses are presented in the Methods and Supplementary Materials Sect. 7, but results from weighted median, MR Pleiotropy RESidual Sum and Outlier (MR-PRESSO) (with and without outlier correction) and the Generalised Summary-data-based MR (GSMR) sensitivity analyses were consistent with there being an effect of higher LSI on T2D risk. By contrast MR-Egger and simulation extrapolation (SIMEX) adjusted MR-Egger results were in the opposite direction, though with wide confidence intervals (OR per 1SD higher LSI = 0.80, 95% CI = 0.46 to 1.41 for MR Egger and OR 0.75, 95% CI 0.43 to 1.31 for SIMEX adjusted MR-Egger). This may suggest that horizontal pleiotropy is influencing these effects. However, we also note that these approaches are not as well powered as the other approaches which could explain the wide confidence intervals. There was also evidence of between SNP heterogeneity (Cochran’s Q p-value = 5.42 × 10− 66; Rucker’s Q p-value = 1.10 × 10− 62) and potential bias due to unbalanced horizontal pleiotropy based on the MR-Egger and SIMEX adjusted MR Egger intercepts (p = 0.04 and p = 0.02, respectively) and the MR-PRESSO global test (p < 0.0003).

Fig. 1
figure 1

Two-sample Mendelian randomisation results of the potential causal effect of lifetime smoking on type 2 diabetes. Results from the main IVW two-sample Mendelian randomisation (MR) analysis of lifetime smoking on type 2 diabetes and the sensitivity analysis results from MR Egger, SIMEX, weighted median, PRESSO and GSMR analyses. Results are the odds ratios (OR) of type 2 diabetes per 1 SD higher lifetime smoking index (LSI) score, with 95% confidence intervals (CI), noting that 1 SD higher LSI value is equivalent to an individual smoking 20 cigarettes per day for 15 years and stopping 17 years ago or smoking 60 cigarettes a day for 13 years and stopping 22 years ago. SNP single nucleotide polymorphism, IVW inverse-variance weighted, SIMEX simulation extrapolation, PRESSO pleiotropy RESidual sum and outlier, GSMR generalised summary-data-based Mendelian randomization.

Our main IVW analyses also suggested that LSI was not causally related to fasting glucose (difference in mean fasting glucose in mmol/l per 1SD higher LSI = 0.003, 95% CI −0.03 to 0.04), fasting insulin (difference in mean fasting insulin in pmol/l per 1SD higher LSI = −0.03, 95% CI = −0.07 to 0.02) or HbA1c (difference in mean HbA1c in NGSP percent or equivalent per 1SD higher LSI = 0.009, 95% CI −0.01 to 0.03) (see Fig. 2). Sensitivity analyses were mostly consistent with these results, with the exception of MR-Egger and SIMEX adjusted MR-Egger, where imprecise estimates suggested that higher LSI might reduce fasting insulin (Fig. 2). The direction of effect in the MR-Egger and SIMEX adjusted MR-Egger analyses was also negative for these outcomes. This opposite direction may reflect the presence of potential unbalanced horizontal pleiotropy. Additional Multivariable MR (MVMR) analyses accounting for body mass index (BMI) were conducted where fasting glucose and fasting insulin were the outcomes due to the GWAS used adjusting for BMI. This approach can help overcome this issue and provide unbiased estimates (see Methods). These results were in line with the main IVW results (Supplementary Table S3).Please confirm the section headings are correctly identified.This is correct

Fig. 2
figure 2

Two-sample Mendelian randomisation results of the potential causal effect of lifetime smoking on glycaemic traits. Results from the main IVW two-sample Mendelian randomisation (MR) analysis of lifetime smoking on fasting glucose, fasting insulin and HbA1c and the sensitivity analysis results from MR Egger, SIMEX, weighted median, PRESSO and GSMR analyses. Results are the difference in mean fasting glucose (mmol/l), fasting insulin (pmol/l) and HbA1c (NGSP percent or equivalent) per 1 SD higher lifetime smoking index (LSI) value, with 95% confidence intervals (CI), noting that 1 SD higher LSI value is equivalent to an individual smoking 20 cigarettes per day for 15 years and stopping 17 years ago or smoking 60 cigarettes a day for 13 years and stopping 22 years ago. Multivariable Mendelian randomisation analyses including body mass index for the outcomes of fasting glucose and fasting insulin are not presented here but results with this approach were in line with the main IVW results. SNP single nucleotide polymorphism, IVW inverse-variance weighted, SIMEX simulation extrapolation, PRESSO pleiotropy RESidual sum and outlier, GSMR generalised summary-data-based Mendelian randomization.

Two-sample Mendelian randomisation with drinks per week as the exposure

We conducted two-sample Mendelian randomisation (MR) analyses to examine potential causal effects of alcoholic drinks per week (DPW) on T2D and the continuous glycaemic traits of fasting glucose, fasting insulin and glycated haemoglobin (HbA1c). The same outcome GWAS were used as described for LSI. For these analyses we selected genetic instruments from GWAS data for the exposure and identified these single nucleotide polymorphisms (SNPs) in the GWAS data for the outcomes as described in the Methods.

Mean F-statistics for alcohol consumption were all 31.55 (Supplementary Table S1). Figures 3 and 4 and Supplementary Table S2 provide the results from the main IVW and all sensitivity analyses of the effects of alcohol consumption on outcomes. For T2D, the main IVW results suggested that alcohol consumption was not causally related to T2D (OR per 1 SD higher log-transformed drinks per week = 1.04, 95% CI 0.40 to 2.65) (see Fig. 3). Some of the sensitivity analyses were consistent with these results, however, MR-Egger, weighted median and GSMR estimates suggested a potential causal effect of higher drinks per week on T2D risk, but confidence intervals were wide (Fig. 3). There was also evidence of between SNP heterogeneity (Cochran’s Q p-value = 2.86 × 10− 89; Rucker’s Q p-value = 2.00 × 10− 70) and potential bias due to unbalanced horizontal pleiotropy based on the MR Egger intercept (p = 0.0005) and the MR-PRESSO global test (p < 5 × 10− 04).

Fig. 3
figure 3

Two-sample Mendelian randomisation results of the potential causal effect of drinks per week on type 2 diabetes. Results from the main IVW two-sample Mendelian randomisation (MR) analysis of drinks per week on type 2 diabetes and the sensitivity analysis results from MR Egger, SIMEX, weighted median, PRESSO and GSMR analyses. Results are the odds ratios (OR) of type 2 diabetes per 1 SD higher log-transformed drinks per week, with 95% confidence intervals (CI). SNP single nucleotide polymorphism, IVW inverse-variance weighted, SIMEX simulation extrapolation, PRESSO pleiotropy RESidual sum and outlier, GSMR generalised summary-data-based Mendelian randomization.

Our main IVW analyses suggested a potential casual effect of higher drinks per week on higher fasting glucose (difference in mean fasting glucose in mmol/l per 1SD higher log-transformed drinks per week = 0.34, 95% CI = 0.09 to 0.59). This was consistent across all sensitivity analyses (see Fig. 4). However, there was also evidence of between SNP heterogeneity (Cochran’s Q p-value = 5.18 × 10− 40; Rucker’s Q p-value = 7.13 × 10− 40) and potential bias due to unbalanced horizontal pleiotropy based on the MR-PRESSO global test (p < 5 × 10− 04), but not the MR-Egger intercept (p = 0.46). Given that we still find an effect for the outlier-corrected PRESSO approach, this may suggest that there is little impact of horizontal pleiotropy on these results. It is worth noting that the MR-PRESSO global test result may also reflect other sources of effect heterogeneity, for example, the presence of multiple causal pathways from the exposure to the outcome. Our main IVW analyses suggested that drinks per week was not causally related to fasting insulin (difference in mean fasting insulin in pmol/l per 1SD higher log-transformed drinks per week = 0.08, 95% CI -0.16 to 0.32) or HbA1c (difference in mean HbA1c in NGSP percent or equivalent per 1SD higher log-transformed drinks per week = −0.05, 95% CI −0.17 to 0.07) (see Fig. 4). Sensitivity analyses were mostly consistent with these results, with the exception of the GSMR estimate suggesting that higher drinks per week might reduce fasting insulin (Fig. 4). In the additional MVMR analyses accounting for BMI where fasting glucose was the outcome, the effect we observed in the main analyses was attenuated and did not provide evidence of a causal effect. This may suggest that BMI is driving this effect. For fasting insulin, the MVMR analysis results were in line with the main IVW results (Supplementary Table S3).

Fig. 4
figure 4

Two-sample Mendelian randomisation results of the potential causal effect of drinks per week on glycaemic traits. Results from the main IVW two-sample Mendelian randomisation (MR) analysis of drinks per week on fasting glucose, fasting insulin and HbA1c and the sensitivity analysis results from MR Egger, SIMEX, weighted median, PRESSO and GSMR analyses. Results are the difference in mean fasting glucose (mmol/l), fasting insulin (pmol/l) and HbA1c (NGSP percent or equivalent) per 1 SD higher log-transformed drinks per week, with 95% confidence intervals (CI). Multivariable Mendelian randomisation analyses including body mass index for the outcomes of fasting glucose and fasting insulin are not presented here but effects for fasting glucose were attenuated with this approach, whilst results for fasting insulin were in line with the main IVW results. SNP single nucleotide polymorphism, IVW inverse-variance weighted, SIMEX simulation extrapolation, PRESSO pleiotropy RESidual sum and outlier, GSMR generalised summary-data-based Mendelian randomization.

One-sample Mendelian randomisation for drinks per week on type 2 diabetes and HbA1c in UK Biobank

Sample characteristics for those UK Biobank (UKBB) participants included in the one-sample MR analyses are shown in Supplementary Table S4. We were only able to assess the effect of drinks per week on T2D and HbA1c because the LSI genetic instruments were obtained from UKBB and the glucose and insulin measures in UKBB were from non-fasting samples.

Table 1 provides the results from the one-sample MR analyses. Results suggested a causal effect of higher drinks per week on T2D risk (OR per 1 SD higher log-transformed drinks per week = 1.71, 95% CI 1.24 to 2.36) and on lower HbA1c levels (but only when we excluded participants with a possible or probable diabetes diagnosis or HbA1c\(\:\ge\:\)6.5%) (difference in mean SD of log transformed HbA1c (mmol/mol) per 1 SD higher log-transformed drinks per week = −0.07, 95% CI −0.11 to −0.02). We also conducted analyses additionally adjusting for the genotyping chip used (this was either the UKBB axiom array or the UK BiLEVE array; see Methods for details). Adjusting for genotyping chip did not impact our results.

Table 1 One-sample mendelian randomisation results.

Discussion

Overall, the results of this study suggest that there is not strong evidence to support potential causal relationships of LSI and drinks per week on risk of T2D. Whilst there were some suggestive results, these were not consistent across analyses and outcomes. For example, we found evidence of a possible causal effect of higher LSI values on risk of T2D in our main IVW analysis, but this was not consistent across sensitivity analyses and results may be biased due to unbalanced horizontal pleiotropy. Furthermore, the lack of evidence of an effect on the underlying glycaemic traits suggests that we cannot confidently interpret this as a causal effect on T2D risk. Furthermore, we found little evidence of a possible causal effect of higher drinks per week on T2D risk in our two-sample MR analyses. We did observe a possible causal effect of drinks per week on fasting glucose, which was consistent across sensitivity analyses, but this was not supported by analyses with the other underlying glycaemic traits. In addition, when accounting for BMI this effect was no longer observed. This may suggest that BMI confounding is driving this effect. Results from our one-sample MR analyses (only examining drinks per week on T2D and HbA1c) suggested that there may be a causal effect of more drinks per week on T2D risk and lower HbA1c levels. However, given these mixed results across the two-sample and one-sample MR analyses, this study does not provide strong evidence of causal effects but do suggest that LSI and drinks per week may have some small effect on some of these outcomes. The explanations for these small effects may be complicated, for example, acting through pleiotropic pathways.

We attempted to avoid or address any potential violations to the MR assumptions (see Box 1) in this study. We used genetic instruments that were robustly associated with the exposures and also conducted both one-sample and two-sample MR analyses, where possible, as well as sensitivity analyses, to provide greater confidence in our results. To reduce potential biases due to population stratification we used data from samples of European ancestry and included principal components (PCs) in our one-sample analyses. Finally, we assessed whether horizontal pleiotropy might be present and influencing our results. We found that results for the effect of LSI on T2D, and the effect of drinks per week on fasting glucose, may have been influenced by horizontal pleiotropy. This could suggest that the genetic instruments for lifetime smoking and alcohol consumption are influencing T2D risk and potentially fasting glucose through pathways other than through the corresponding exposures. It is possible that the genetic instruments for LSI and drinks per week are also capturing other behaviours e.g., risk taking that may influence T2D risk and fasting glucose. We also found that when accounting for BMI the effect of higher drinks per week on higher fasting glucose was attenuated, suggesting confounding from BMI may partially account for this. Given we have attempted to address potential violations in the MR assumptions, for example using pleiotropy robust methods, we have greater confidence in our results.

Previous studies have largely shown associations using multivariable regression, and potential causal effects using MR, between smoking and T2D risk10,11,21,22,29 and between higher alcohol consumption and T2D risk17,18,24,25. However, we note that a previous MR study using a functional variant for alcohol metabolism and a previous study using smaller GWAS for alcohol consumption and T2D both also did not find an effect on T2D24,26, in line with our results, suggesting it is unlikely there is an effect of alcohol on T2D. Given that previous studies report mixed findings for alcohol and T2D and given the results we present in this study, it is unlikely that there is a notable effect of alcohol on T2D. In addition, our finding with decreased HbA1c in the one-sample MR analyses is in the opposite direction to what we might expect given that higher levels of HbA1c are observed in those with T2D. However, this may be due to the exclusion of individuals with a diagnosis of diabetes, and we do not find an effect in our two-sample MR, so results should be interpreted with caution. However, this finding also provides further evidence against alcohol being a strong risk factor for the outcomes examined in this study. Our results suggest that future studies examining risk factors for T2D should triangulate results across different analyses and consider underlying glycaemic traits as outcomes in order to arrive at the correct conclusions.

Limitations

The key strengths of this study and how this study adds to previous literature is described in the introduction and above. In terms of limitations, we were unable to explore a non-linear effect as recent evidence suggests that current methods for doing so in MR are potentially biased30. Some observational studies have suggested a J-shaped association between alcohol and coronary heart disease, which T2D is a risk factor for14,15,16,19. However, that J-shaped association, even if causal, suggests a linear association across most of the distribution. Our one-sample MR analyses in UKBB may be subject to selection bias31,32. Given that participants in UK Biobank are less likely to drink as much alcohol and be healthier compared to the general population33, it is possible that selection bias could have distorted the one-sample MR effect estimates in this study. Specifically, we observed effects in our one-sample MR analyses but not in our two-sample MR analyses where drinks per week was the exposure and type 2 diabetes and HbA1c were the outcomes. Therefore, we cannot rule out the possibility that selection bias could have induced associations in our one-sample MR analyses and therefore the one-sample MR results should be interpreted with caution alongside our two-sample MR results. Future studies replicating these analyses in other samples would be useful to examine whether selection bias in UKBB may be an issue here, however our two-sample MR analyses in part overcome this for alcohol but not lifetime smoking where the GWAS was conducted in UK Biobank as well. Our analyses were conducted in samples of European ancestry, due to the data available, so results are not generalisable beyond this group. Finally, it is possible that measurement error in the exposure or outcome could bias our results. This is more likely to be the case for the exposures, which could be subjectively influenced, for example misreporting of cigarettes smoked per day, time since cessation or duration of smoking and number of drinks consumed per week. In particular, in UKBB drinks per week is measured based on number of glasses of alcohol consumed and this is similar to the definitions used in the GWAS. This does not account for units, and glass/drink size can vary, so there is likely variation in how this was reported. If measurement error is random, which is likely to be the case here, this could have attenuated the one-sample MR results.

Conclusions

In summary, we found limited evidence of a possible causal effect of higher lifetime smoking index score and drinks per week score on T2D risk. We found further evidence of a possible causal effect of higher drinks per week on higher fasting glucose, although it is possible that confounding by BMI was driving this effect given the effect attenuated in the MVMR analyses. Overall results were not consistent across analyses and some results may be biased by horizontal pleiotropy. Therefore, we do not find strong evidence of smoking and alcohol influencing risk of T2D. Whilst public health interventions to prevent and/or reduce smoking and alcohol consumption are important for reducing the risk of other chronic diseases (e.g., heart disease and certain types of cancer), our results suggest this is unlikely to directly influence T2D risk to a large degree. However, it would be useful to explore some aspects of these relationships further before drawing any strong conclusions on these relationships. For example, future research should include triangulation approaches and glycaemic traits to allow for a more in depth understanding of the causal influence of risk factors on T2D.

Methods

Exposure GWAS and selection of genetic instruments

We used the largest GWAS of lifetime smoking index (LSI)34 and alcohol consumption (drinks per week)35, avoiding sample overlap with the outcome GWAS, which can bias results. Both GWAS only included participants of European ancestry and with complete genotype and phenotypic data (for relevant smoking and alcohol phenotypes), resulting in 462,690 participants from UKBB in the LSI GWAS and 941,280 participants from GSCAN (GWAS and Sequencing Consortium of Alcohol and Nicotine) in the drinks per week GWAS. GWAS adjusted for PCs to further control for population substructure. SNPs (and associations with the relevant exposures) were selected if they reached genome-wide statistical significance (p ≤ 5 × 10-08) and were independent (i.e., we excluded SNPs in linkage disequilibrium; r2 of 0.001; window of 10,000 kb; European 1000 genomes reference panel). For any palindromic SNPs we tried to infer the positive strand based on allele frequencies, but if this was not possible, these SNPs were excluded. Where a SNP was available for the exposure and not the outcome, we attempted to identify proxy SNPs using LDlink36 and an LD r2 threshold of > 0.8. After exclusions and identifying any proxies, we searched for the remaining LSI SNPs in the outcome GWAS (118 SNPs for all outcomes) and the remaining alcohol consumption SNPs in the outcome GWAS (70 SNPs for all outcomes). Details of the exposure GWAS, including derivation of the LSI and drinks per week of alcohol are provided in Supplementary Materials Sect. 1. To summarise, the LSI reflects a combination of smoking related behaviours including smoking status, duration and heaviness, where never smokers have a score of 0. Drinks per week reflects the average number of drinks/glasses consumed per week by participants.

LSI is in standard deviation (SD) units, therefore, in our MR analyses we explore effects per 1 SD higher LSI. To give context, 1 SD higher LSI value is equivalent to an individual smoking 20 cigarettes per day for 15 years and stopping 17 years ago or smoking 60 cigarettes a day for 13 years and stopping 22 years ago. Natural log-transformed drinks per week were used in the GWAS, therefore, in our MR analyses we explore effects per 1 SD higher log-transformed drinks per week. To give context, in UKBB (the sample used in our one-sample MR) 1 SD was equal to 2.14 of the log transformed drinks per week.

Outcome GWAS and harmonisation of exposure SNPs

We obtained associations of the exposure SNPs with outcomes from the largest GWAS of T2D27, fasting glucose28, fasting insulin28 and glycated haemoglobin (HbA1c)28. We only used GWAS data including participants of European ancestry, resulting in 148,726 cases and 965,732 controls from the Million Veteran Program, DIAMANTE and Biobank Japan for T2D. The continuous traits all used data from the Meta-Analyses of Glucose and Insulin-related traits Consortium (MAGIC), with 209,605 participants with data for fasting glucose, 158,550 with data for fasting insulin and 149,289 with data for HbA1c. GWAS summary statistics for the exposure and outcome were harmonised so that the SNP allele-exposure and SNP allele-outcome associations were in the same direction. Details of these GWAS can be found in Supplementary Materials Sect. 2. To summarise, the T2D GWAS included cases with a diagnosis of T2D and controls without, fasting glucose was measured in mmol/l, fasting insulin in pmol/l in serum and HbA1c as a percentage. Therefore, in our MR analyses results are reported as the odds of T2D and the difference in mean fasting glucose (mmol/l), fasting insulin (pmol/l) and HbA1c (NGSP percent or equivalent) per 1 SD higher LSI or log-transformed drinks per week.

UK biobank data for one-sample mendelian randomisation

We used data from the UKBB, a large population-based prospective health research resource of 503,317 participants (5.5% response of those invited), recruited between 2006 and 2010, aged between 38 and 73 years and from the UK37. Further details are included in the Supplementary Materials (Sect. 3 and on the study website (www.ukbiobank.ac.uk)).

Drinks per week

The drinks per week phenotype was constructed from responses to questions on the average weekly intake of a range of different alcoholic beverages (defined as number of glasses they had). Where this information was not available, weekly consumption was estimated from measures of average monthly intake (see Supplementary Materials Sect. 4 for further details). Data were natural log-transformed due to being right skewed and standardised (1 SD was equal to 2.14 of the log transformed drinks per week).

Type 2 diabetes

We derived possible or probable T2D using the Eastwood algorithm38 (see Supplementary Materials Sect. 5). In one-sample MR analyses we excluded individuals who had possible or probable type 1 diabetes as per the Eastwood algorithm.

HbA1c

Serum HbA1c (mmol/mol) was assayed using five Bio-Rad Variant II Turbo analysers, values outside of the reportable range of 15 to 184 mmol/mol, or invalidated for any other reason, were excluded (further information can be found at https://biobank.ctsu.ox.ac.uk/crystal/ukb/docs/serum_hb1ac.pdf). These analysers used high performance liquid chromatography (HPLC) to determine the relative concentration of HbA1c in packed red blood cells, from blood samples (approximately 9 ml) collected at recruitment. Many studies examining continuous traits related to T2D exclude participants with a diabetes diagnosis or above thresholds indicative of diabetes. This means that results are not necessarily applicable to the whole population from which the study sample is drawn and can result in selection bias39. On the other hand, people with a diagnosis of diabetes will have made changes to their lifestyles and/or be on medications that impact these continuous traits and associations with them. Therefore, we conducted analyses with and without excluding those with possible or probable type 1 or type 2 diabetes using the Eastwood algorithm and those who had a HbA1c measure of \(\:\ge\:\)6.5% (or 48 mmol/mol) at baseline. Data were natural log-transformed due to being right skewed and standardised (1 SD was equal to 0.15 log mmol/mol). As UKBB did not collect fasting samples we have not conducted one-sample MR on fasting glucose and insulin.

Genetic data

A total of 488,377 participants had genotyped samples. Pre-imputation quality control, phasing and imputation are described elsewhere40 and summarised in the Supplementary Materials Sect. 6.

Statistical analysis

We pre-registered the analysis plan for this study on the Open Science Framework in March 2021 (https://osf.io/ygucn). All analyses were conducted in R41 (version 3.6.2).

Two-sample mendelian randomisation main analyses

We conducted two-sample MR analyses using the TwoSampleMR package in R42.

We used the inverse-variance weighted (IVW) method as our main analysis43. This fits a linear regression model of the mean SNP-outcome value on mean SNP-exposure value across all SNPs and constrains the intercept of the regression slope to be zero, with the slope providing an unbiased effect estimate under the assumption that there is no horizontal pleiotropy44.

Two-sample mendelian randomisation sensitivity analyses

Sensitivity analyses used to explore the assumption that there is no horizontal pleiotropy were done using MR-Egger45, weighted median46, MR Pleiotropy RESidual Sum and Outlier (MR-PRESSO)47 and Generalised Summary-data-based MR (GSMR)48 methods. Full details of these sensitivity analyses are provided in Supplementary Sect. 7. However, we provide a summary of these below.

MR Egger is identical to IVW with the exception that the intercept reflects the best fitted regression model and is not constrained to zero45. The slope provides a causal estimate controlling for potential unbalanced horizontal pleiotropy and a non-null intercept is indicative of unbalanced horizontal pleiotropy.

The weighted median provides an unbiased causal estimate if no more than 50% of the weight of the SNPs used in the genetic instrument are influenced by horizontal pleiotropy46.

MR-PRESSO is used to detect and correct for potential horizontal pleiotropic outliers in the instrument47. Uncorrected and outlier-corrected effects are estimated and there are additional tests to detect whether horizontal pleiotropy is present.

The GSMR approach allows estimation of a causal effect when removing potential outliers and including SNPs in the instrument that are correlated, by estimating the LD between SNPs from a reference sample48.

In addition, we explored between SNP heterogeneity, which might be an indicator of horizontal pleiotropy or violation of other assumptions, using Cochran’s Q, where a p-value < 0.05 may indicate the presence of between SNP heterogeneity. We also assessed heterogeneity between SNPs, whilst adjusting for any horizontal pleiotropy for the MR-Egger method, using the Rucker’s Q-test, again with a p-value threshold of < 0.05.

The IVW and MR-Egger methods assume that there is no measurement error in the SNP-exposure estimates46, known as the ‘NO Measurement Error’ (NOME) assumption. The extent of the NOME assumption violation can be quantified using regression dilution I-squared statistics, where a lower value indicates greater violation. An I-squared of less than 0.9, indicates that MR-Egger estimates should be interpreted with caution due to regression dilution and where this is the case, we have conducted simulation extrapolation (SIMEX) corrections as a sensitivity analysis. The SIMEX approach is a bias adjustment method which provides an estimate for the case where NOME had been satisfied. We also estimated the mean F-statistic for each analysis, which indicates instrument strength, where a value under 10 may indicate a weak instrument46.

Overall, by using these different methods, which make different assumptions, we were able to assess the robustness of evidence for causal effects against violations of the MR assumptions. We were interested in whether there was evidence of causal effects. However, we have previously shown that causal effect estimates when using exposures related to cigarette smoking may be unreliable49. Therefore, we considered consistency of evidence (e.g., direction of effect estimate, p-value as a measure of strength of evidence against the null) across analyses to guide our inference regarding whether or not a causal effect may be operating, but did not attempt to directly estimate the magnitude of any such effect50.

Finally, the GWAS used for the outcomes of fasting glucose and fasting insulin adjust for body mass index (BMI), which can bias our results. Multivariable MR (MVMR) analyses including BMI can help overcome this issue and provide unbiased estimates of the exposure of interest (LSI and drinks per week) on the outcome51 (see Supplementary Materials Sect. 8 for further details).

One-sample mendelian randomisation analyses

One-sample MR analyses were conducted using the OneSampleMR and Applied Econometrics with R (AER) packages, respectively, in R52,53. We generated weighted allele genetic risk scores in UKBB for alcohol consumption using the per-allele regression coefficients from each independent genome-wide significant SNP for each exposure as weights and then summing those weighted values (see Supplementary Materials Sect. 9). We used two-stage least squares regression with adjustment for age, sex, the first 10 PCs (derived from PC analysis of UKBB genotype data, imputed to a reference set combining UK10K haplotype and Haplotype Reference Consortium [HRC] reference panels), assessment centre and genotyping chip. Two genotyping chips were, the UKBB axiom array (which 90% of participants were genotyped with) and the UK BiLEVE array. The latter was used for those in the UK BiLEVE study54, which was oversampled for smokers, and therefore adjusting for genotyping chip may introduce collider bias. Therefore, we performed analyses with and without adjustment for chip.

Ethics

All studies that contributed to the exposure and outcome GWAS used in MR analyses had ethics approval and participant consent for their data to be used in genetic analyses. UKBB (data used in one-sample MR analyses) received ethics approval from the UK National Health Service Research Ethics Committee (11/NW/0382).