Introduction

Diabetes mellitus (DM) is a chronic metabolic disorder characterized by hyperglycemia due to insulin resistance and relative insulin deficiency. The global incidence of DM has been rising, driven largely by increasing rates of obesity1. Obesity is a major risk factor for DM as it contributes to insulin resistance and beta-cell dysfunction2. Insulin resistance, a hallmark of DM, often precedes the onset of the disease and is exacerbated by excess adipose tissue3. The interplay between obesity and insulin resistance significantly elevates the risk of developing DM4. Global data from the International Diabetes Federation reveals that in 2019, approximately 463 million individuals aged 20–79 were diagnosed with DM, resulting in an incidence rate of 9.3%5. The burden of diabetes is particularly significant in Japan, where the prevalence has been steadily increasing. According to the National Health and Nutrition Survey Japan, the prevalence of diabetes among adults aged 20 and older has risen from 13.7% in 1997 to 20.3% in 20166. This alarming trend underscores the need for focused public health strategies to address this escalating issue in the Japanese population. While DM is generally considered irreversible, preventive measures can play a pivotal role in mitigating its impact7. Numerous lifestyle-related risk factors, including excessive alcohol consumption, have been found to be significantly associated with the development of DM8. Specifically, heavy alcohol consumption has been shown to elevate the risk of hypertension, obesity, and insulin resistance9,10,11, all of which are recognized as pivotal contributors to the onset of DM.

In recent years, the relationship between alcohol consumption and the risk of developing DM has garnered significant interest. Several meta-analyses and Mendelian randomized studies have shown that heavy alcohol consumption increases the risk of diabetes12,13,14. However, some studies have discovered inconsistent conclusions. A meta-analysis of 26 studies involving 706,716 participants showed that heavy drinking does not increase the risk of diabetes15. In addition, a negative association between alcohol intake and diabetes risk was recently revealed in the Atherosclerosis Risk in Communities Study16. Current research on the relationship between alcohol consumption and DM is still controversial, and the number of cohort studies is limited. Therefore, the association between alcohol consumption and DM still needs further study.

The propensity score (PS) quantifies the likelihood of experiencing a particular exposure, such as varying levels of alcohol intake, based on a predefined array of covariates. Employing the propensity score matching (PSM) technique provides significant benefits in studies facing the challenge of outcomes influenced by a plethora of covariates, where confounding factors might skew results, and when the feasibility of conducting randomized clinical trials is limited due to resource constraints and ethical issues. Traditional regression models, when faced with numerous confounders, may inadvertently introduce bias if they fail to account for all relevant variables or if there's lingering confounding. On the other hand, incorporating too many variables can lead to overfitting, obscuring the true relationship between the exposure and the outcome. Therefore, this research utilized PSM analysis to explore the authentic relationship between high levels of alcohol intake and the occurrence of diabetes mellitus among the Japanese populace.

Methods

Study design and data source

This research utilized open-source data from the NAGALA (NAFLD in Gifu Area, Longitudinal Analysis) database as a secondary analysis of a medical examination program. The center where these programs were conducted, established in 1994, performed over 8000 medical examinations annually, with 60% of participants undergoing one to two exams per year. Due to the high frequency of repeated examinations, the original study cohort comprised all participants who underwent repeated examinations between 2004 and 2015. Researchers can access the original study data from the Dryad Digital Repository (https://datadryad.org/). The dataset (https://doi.org/10.5061/dryad.8q0p192) comprises data for 15,464 participants free of DM at baseline. These data were employed for a secondary analysis in full compliance with Dryad's terms of service. We utilized data from this cohort, which was established to investigate the relationship between ectopic obesity and diabetes and has been previously published17. We have used their data to explore the relationship between excessive alcohol consumption and DM. The principal objective of this investigation was to assess the influence of excessive alcohol consumption as an independent variable on the incidence of diabetes mellitus, which was considered the dependent variable.

Study participants

In the initial study, informed written consent was obtained from all participants after receiving approval from the Clinical Research Ethics Committee at Murakami Memorial Hospital17. Additionally, the ethical endorsement of this study was obtained from the Ethics Committee of the Shenzhen Dapeng New District Nan'ao People's Hospital (2,022,082,201). Moreover, the study adhered to the principles outlined in the Declaration of Helsinki, and all procedures detailed in the Declarations section were conducted in accordance with applicable regulations and guidelines.

The initial sample for this study comprised 20,944 Japanese individuals who underwent physical examinations and participated in at least two examinations between 2004 and 201517. Subjects meeting any of the following criteria were excluded from the study at baseline: (1) missing data for variables; (2) pre-existing liver diseases, such as hepatitis C or hepatitis B; (3) ethanol consumption over 60 g/day for men and 40 g/day for women; (4) medication usage; (5) diabetes; (6) fasting plasma glucose (FPG) levels ≥ 6.1 mmol/L. After excluding 5480 individuals, a total of 15,464 participants (7034 females and 8430 males) were included in the data analysis (Fig. 1).

Figure 1
figure 1

Study population.

Definition of excessive alcohol consumption

The evaluation of alcohol consumption included inquiries to participants about their alcohol consumption in the preceding month, followed by the calculation of their average weekly alcohol consumption. Excessive alcohol consumption was defined as consuming more than 30 g per day for males and more than 20 g per day for females18,19,20,21,22,23.

Diagnosis of incident diabetes

DM was defined as fasting plasma glucose levels of ≥ 7 mmol/L, glycosylated hemoglobin levels of ≥ 6.5%24, or self-reported during the follow-up period.

Covariates

This analysis, including covariates, was guided by clinical insights and findings from prior studies25,26,27,28,29. Based on the principles mentioned earlier, the following variables were used as covariates: gender, age, BMI, smoking status, regular exerciser, systolic blood pressure (SBP), diastolic blood pressure (DBP), high-density lipoprotein cholesterol (HDL-C), triglycerides (TG), total cholesterol (TC), gamma-glutamyl transferase (GGT), alanine aminotransferase (ALT), aspartate aminotransferase (AST), glycosylated hemoglobin (HbA1c), and fasting plasma glucose (FPG). Data collection on lifestyle and medical history was facilitated through a standardized self-management questionnaire, while measurements of weight, height, and blood pressure were conducted by trained personnel. Laboratory tests were performed uniformly under controlled conditions to gather additional data.

Statistical analyses

Continuous variables adhering to a normal distribution were presented as mean ± standard deviation (SD), while those with a skewed distribution were depicted using the median (interquartile range). Frequencies and percentages were used to report categorical variables. Group differences were assessed using the Wilcoxon rank-sum test (skewed distribution), two-sample t-tests (normal distribution), and the χ2 test (categorical variables). Incidence rates were expressed in terms of person-years and cumulative incidence.

The present study employed propensity score analysis to ensure comparability between the excessive alcohol consumption and non-excessive alcohol consumption groups at baseline. This was achieved by creating a single group of subjects with similar baseline characteristics, as detailed in the flowchart in Fig. 1. As previously described, employing a comprehensive multivariate logistic regression model, wherein excessive alcohol consumption was designated as the independent variable and fifteen baseline variables served as covariates, the propensity score was meticulously derived30. A greedy-matching algorithm with a caliper width of 0.01 was used to build a one:one matching protocol without replacement. Standardized differences were used to assess group equilibrium, with a standardized difference below 10.0% indicating a relatively minor imbalance in a covariate31. Furthermore, the Kaplan–Meier method was utilized to calculate the likelihood of diabetes-free survival in each group, and statistical significance was assessed using the log-rank test. The Cox proportional hazards regression model was employed to investigate the relationship between excessive alcohol consumption and the incidence of diabetes within the propensity score-matched cohort. This study applied a doubly robust estimation method that combined the propensity score model and multivariate regression model to examine the association between excessive alcohol consumption and the risk of developing diabetes32.

Propensity score matching may result in a reduced sample size, leading to unstable estimates and biased results, so we assessed the stability of our results by conducting sensitivity analyses. Inverse probability of treatment weighting (IPTW) helps to mitigate this issue by utilizing all available data, thus preserving the cohort size and enhancing the robustness of our findings. We used the estimated propensity score from the original cohort to compute the IPTW. Specifically, IPTW was calculated as one divided by the PS for individuals with excessive alcohol consumption and as one divided by (1 minus the PS) for individuals with non-excessive alcohol consumption. This approach allowed us to establish a weighted cohort33. Our study employed a range of sensitivity analyses to evaluate the robustness of the results and the impact of employing various models for associative inference on the conclusions. Two connection inference models were applied to the weighted and original populations in the sensitivity analysis. The resulting P-values and effect sizes were documented and compared across all models.

Prespecified subgroup analyses were conducted based on BMI and age. Continuous variables were converted to categorical variables using clinical cut-off points. Each stratification accounted for all factors, excluding the stratification factor itself. To maintain the balance of baseline characteristics between the excessive alcohol consumption and non-excessive alcohol consumption groups, only the corresponding matched pairs within the same subgroup were selected for the subgroup analyses. For instance, in the subgroup of participants with a BMI < 24 kg/m2, only matched pairs from both the excessive alcohol consumption and non-excessive alcohol consumption groups within the BMI < 24 kg/m2 category were included in the subgroup analysis. Likelihood ratio tests were employed to evaluate modifications and interactions within the subgroups. Our findings were presented according to the STROBE statement34.

Statistical computations were carried out with Empower Stats (R) version 2.2 (http://www.empowerstats.com, X&Y Solutions, Inc., Boston, MA) and R software 3.6.1 (http://www.R-project.org, The R Foundation). A threshold of P < 0.05 in bidirectional tests established the benchmark for statistical significance.

Ethics approval and consent to participate

In the initial study, informed written consent was obtained from all participants after receiving approval from the Clinical Research Ethics Committee at Murakami Memorial Hospital. In addition, the study has also been approved by the Ethics Committee of the Shenzhen Dapeng New District Nan'ao People's Hospital (2022082201), and was conducted in accordance with the ethical principles of the Declaration of Helsinki.

Results

Characteristics of participants

In our study, a total of 15,464 individuals were included, with 54.51% being male. In the original cohort, 4735 participants had alcohol-free consumption, and the mean alcohol consumption was 4.10 g per day in the non-excessive alcohol consumption group and 39.69 g per day in the excessive alcohol consumption group. After matching, 233 participants had no alcohol consumption, and the mean alcohol consumption was 8.23 g per day in the non-excessive alcohol consumption group and 32.47 g per day in the excessive drinking group. Among these individuals, 1184 (7.66%) reported excessive alcohol consumption. The study's participants had an average age of 43.71 ± 8.90 years. With the exception of TC, all baseline traits indicated in Table 1 that were compared between the excessive and non-excessive alcohol consumption groups showed statistically significant differences. Table 1 indicated that individuals engaging in excessive alcohol consumption typically presented elevated blood pressure, increased BMI, older age, and heightened levels of GGT, ALT, HDL-C, FPG, TG, and AST. Additionally, subjects with excessive alcohol consumption had lower HbA1c levels, a higher prevalence of males, current smokers, and individuals with regular exerciser. Through a one:one propensity score matching technique, we successfully matched 1149 individuals with excessive alcohol consumption to 1149 individuals who did not engage in excessive alcohol consumption. Following the implementation of PSM, the standardized differences for nearly all baseline characteristics were less than 10.0%, indicating a good match. Thus, the baseline characteristics of the non-excessive and excessive drinking groups differed only slightly.

Table 1 Baseline characteristics before and after propensity score matching.

Incidence rate of diabetes

Table 2 displayed diabetes mellitus incidence rates pre- and post-PSM. Following a median monitoring duration of 5.39 years, diabetes was diagnosed in 373 subjects. Before applying PSM, the prevalence rates for the overall population, non-excessive alcohol consumption group, and excessive alcohol consumption group were 2.41%, 2.27%, and 4.14%, respectively. The cumulative rates per 1000 person-years were 3.99, 3.75, and 6.18 for the total population, non-excessive alcohol consumption group, and excessive alcohol consumption group, respectively. Following propensity score matching, the prevalence rates of diabetes were as follows: 3.26% in the overall group, 2.35% in the non-excessive alcohol consumption group, and 4.18% in the excessive alcohol consumption group. Additionally, the cumulative rates of diabetes were found to be 5.42 per 1000 person-years in the overall group, 3.93 per 1000 person-years in the non-excessive alcohol consumption group, and 6.89 per 1000 person-years in the excessive alcohol consumption group. Kaplan–Meier analysis revealed a significant difference in the likelihood of diabetes-free survival between individuals with excessive alcohol consumption and those with non-excessive alcohol consumption before propensity score matching (P = 0.001; Fig. 2a). Following propensity score matching, this difference remained significant (P = 0.021; Fig. 2b).

Table 2 Incidence rate of incident diabetes before and after propensity score matching.
Figure 2
figure 2

Kaplan–Meier event-free survival curve based on Excessive alcohol consumption and Non-Excessive alcohol consumption. (a) Kaplan–Meier event-free survival curve based on Excessive alcohol consumption and Non-Excessive alcohol consumption in the original cohort. (b) Kaplan–Meier event-free survival curve based on Excessive alcohol consumption and Non-Excessive alcohol consumption in the propensity score matching cohort.

The results of the relationship between excessive alcohol consumption and diabetes risk before and after propensity score matching.

Table 3 showcased the connection between excessive alcohol consumption and diabetes mellitus occurrence before and after propensity score matching. The crude model that did not adjust for any covariates revealed a notable link between excessive alcohol consumption and diabetes risk (HR = 1.79, 95% CI 1.33–2.42, P = 0.0001 before matching; HR = 1.73, 95% CI 1.08–2.77, P = 0.0227 after matching). This trend persisted in Model 1 with HR of 1.33 (95% CI 0.96–1.83) before matching and 2.02 (95% CI 1.25–3.29) after matching when we adjusted for age, BMI, gender, smoking status, regular exerciser, SBP, DBP, ALT, AST, GGT, TC, TG, and HDL-C. Further refinement (Model 2) incorporated additional variables, including age, BMI, gender, smoking status, regular exerciser, SBP, DBP, ALT, AST, GGT, TC, TG, HDL-C, HbA1c, and FPG, maintaining the significant link (HR = 1.41, 95% CI 1.01–1.97, P = 0.0433 before matching; HR = 1.78, 95% CI 1.08–2.93, P = 0.0242 after matching). This association between excessive alcohol consumption and diabetes risk persisted, even after propensity score adjustments (HR = 1.60, 95% CI 1.14–2.24, P = 0.0064 before matching; HR = 1.73, 95% CI 1.08–2.78, P = 0.0223 after matching).

Table 3 Association between excessive alcohol consumption and incident diabetes before and after propensity score matching.

Sensitivity analysis

A sensitivity test was conducted to ascertain the stability of the link between excessive alcohol consumption and diabetes risk, as detailed in Table 4. These analyses encompassed weighted cohorts. Our findings uniformly demonstrated a significant association between excessive alcohol consumption and diabetes onset. When accounting for confounding factors, we observed an increased diabetes risk of a 77% elevation (HR = 1.77, 95% CI 1.53–2.04, P < 0.001) in the weighted group.

Table 4 Association between excessive alcohol consumption and incident diabetes in a weighted cohort model.

The results of the subgroup analysis

We used a subgroup analysis to detect the effect of potential confounders, which might affect the relationship between excessive alcohol consumption and the risk of diabetes. We treated age and BMI as the stratification variables to evaluate the trend of effect sizes in these variables. Table 5 showed that the association between excessive alcohol consumption and diabetes was notably stronger in individuals with BMI < 24 kg/m2.

Table 5 Effect size of excessive alcohol consumption on incident diabetes in prespecified and exploratory subgroups.

Discussion

Findings from the PSM cohort study reveal an independent correlation between heavy alcohol intake and diabetes occurrence. Post-adjustment for the PS, individuals engaging in excessive alcohol use demonstrated a 73% increased likelihood of diabetes onset. Sensitivity analyses across both weighted and original cohorts uniformly underscored a strong link between heavy drinking and diabetes, thereby bolstering the credibility of our results. Furthermore, in subgroup analysis, we observed that the association between excessive alcohol consumption and diabetes was notably stronger in individuals with BMI < 24 kg/m2.

Alcohol consumption is one of the key risk factors contributing to the global burden of disease through its harmful effects on health, including cardiovascular disease, cirrhosis of the liver, neuropsychiatric disorders, and cancer35,36,37. Numerous studies have demonstrated that heavy alcohol consumption significantly increases the incidence of DM12,13,14. For instance, a meta-analysis of 38 observational studies, encompassing over 1.9 million participants, indicated an elevated diabetes risk associated with heavy drinking12. Nevertheless, any potential risk reduction in moderate drinkers may be limited to specific populations, such as women and non-Asians12. A Mendelian randomized study including 72,299 middle-aged and elderly volunteers found that heavy alcohol consumption was strongly associated with the risk of diabetes mellitus (HR = 1.295, 95% CI 1.059–1.583), after adjusting for age, residence areas, gender, BMI, hypertension, hyperlipidemia, education, daily activity, energy intake, and smoking status13. Furthermore, a Mendelian randomization study involving 408,540 European participants revealed a 1.10-fold increased odds of DM (95% CI 1.06–1.13) among those consuming more than 14 drinks per week14. Notably, these findings align with our own results. In our analysis, employing the doubly robust estimation method in the PSM cohort, we observed a significant 73% increase in the risk of incident diabetes associated with excessive alcohol consumption. Importantly, this risk remained unchanged even after adjusting for propensity scores.

Our research offers an in-depth analysis of the link between excessive alcohol consumption and diabetes risk within real-world contexts. We conducted meticulous adjustments for various covariates, including sociodemographic factors, lifestyle habits, and clinical parameters. Furthermore, our study benefits from a substantial sample size. The findings highlight a significant association between excessive alcohol consumption and the incidence of diabetes. Our results suggest that people with excessive alcohol consumption have a high risk of getting diabetes in the future. Additionally, our study contributes to the advancement of propensity score methods in correlation studies.

However, it is important to note that some researchers have reported contrasting findings regarding the association between excessive alcohol consumption and the risk of diabetes. For example, in a meta-analysis incorporating 706,716 participants (275,711 males and 431,005 females) from 26 studies, the findings indicated that, in comparison to the minimal category of alcohol consumption, the relative risk for diabetes in the heavy alcohol consumption category was 0.95 (95% CI 0.83, 1.09; P = 0.480)15. Similarly, another prospective analysis, spanning a median follow-up period of 21 years and involving 12,042 study participants, revealed intriguing results16. Over the course of this extended observation, 3795 participants ultimately developed diabetes16. Surprisingly, this prospective study identified a negative correlation between alcohol intake and the risk of diabetes, suggesting that moderate to high alcohol consumption could act as a protective factor against the development of diabetes16. Interestingly, among participants categorized as heavy alcohol consumers, an interesting observation was made: when they reduced their alcohol intake during follow-up, their risk of developing diabetes appeared to increase16.

The inconsistent findings in the existing literature regarding the relationship between excessive alcohol consumption and diabetes can be attributed to several factors. First, study participants exhibit diversity in terms of race, gender, nationality, age, and other demographic characteristics. Second, different studies may choose to include or exclude certain covariates, leading to variations in the results. Third, variations in follow-up time across studies could also contribute to inconsistencies. Shorter follow-up periods may not capture the long-term effects of alcohol consumption on insulin resistance and impaired beta-cell function, whereas longer follow-up periods may introduce additional confounding factors and lead to changes in confounders. Fourth, different approaches have been employed to address confounding factors. Our study aligns with the prevailing evidence in the literature, supporting the hypothesis that excessive alcohol consumption is associated with an increased risk of diabetes. This underscores the significance of interventions aimed at reducing alcohol consumption as a preventive measure for diabetes.

The exact pathway through which alcohol influences the onset of diabetes remains to be fully elucidated. It is hypothesized that the association between alcohol-induced diabetes risk involves mechanisms such as insulin resistance and compromised β-cell functionality. Direct pancreatic damage from excessive alcohol intake has been posited38. Evidence from animal research indicates that high levels of ethanol consumption impair islets and β-cells, disrupting insulin pathways, diminishing β-cell mass, and thus reducing insulin output while elevating fasting glucose concentrations39,40. Ethanol exposure is also thought to induce β-cell apoptosis via mitochondrial dysfunction, marked by an uptick in reactive oxygen species and a drop in adenosine triphosphate production41. Additionally, ethanol might impede the insulin-stimulated activation of phosphatidylinositol 3-kinase and the expression of glucose transporter four in skeletal muscle, further leading to insulin resistance39. Our research corroborates the theory that excessive alcohol intake significantly elevates the risk of diabetes development.

Study strengths and limitations

Our study exhibits several notable strengths. Firstly, it represents the first investigation to employ Propensity Score Matching in assessing the relationship between excessive alcohol consumption and the risk of diabetes. PSM is a robust methodology that addresses diverse data requirements by minimizing inter-group disparities, balancing confounding variables, and achieving an effect comparable to randomization. Secondly, our study reinforces the robustness of its findings through a range of sensitivity analyses. By using Inverse Probability of Treatment Weights, we established a weighted cohort to further evaluate the association between excessive alcohol consumption and diabetes incidence. This approach provides an additional means to assess the performance of traditional regression models.

However, it's important to acknowledge the limitations of this research. Firstly, the study exclusively involved individuals of Japanese ancestry, which may restrict the generalizability of our findings to other ethnic groups. Further research in diverse populations is necessary to validate the broader applicability of our results. Secondly, the raw data lacked information from the 2-h oral glucose tolerance test, and participants with FPG levels ≥ 6.1 mmol/L were not included. As a result, our study may underestimate the actual relationship between excessive alcohol consumption and diabetes risk. In future research, we intend to design a study that includes participants with FPG levels ≥ 6.1 mmol/L and incorporates a 2-h oral glucose tolerance test to explore the relationship between excessive alcohol consumption and diabetes more accurately. Third, in the non-excessive alcohol consumption group, 233 participants had no alcohol consumption. In the non-excessive alcohol consumption group, participants may have had a sick-quitter effect bias, which may have affected our results' stability. In future studies, we will exclude participants who did not consume alcohol to minimize this bias further. Lastly, while the PSM method helps ensure the balance of measured covariates, it cannot address potential imbalances in unmeasured confounding variables, such as family history of diabetes or dietary habits. To minimize the potential impact of these unmeasured variables on our results, we adopted a caliper width of 0.01.

Conclusions

Based on the findings of this retrospective cohort study employing PSM, a substantial correlation exists between excessive alcohol consumption and diabetes. Consequently, this study offers clinical evidence that can be referenced for managing the risk of diabetes by controlling excessive alcohol consumption.