Introduction

The pathophysiology of type 2 diabetes mellitus (T2DM) is multifactorial1. The presentation of clinical features at diagnosis varies from those related to insulin resistance to insulin deficiency and mild age-related diabetes1,2. While white Europeans show higher proportion of insulin resistance-related diabetes, individuals of South Asian, East Asian, and African ancestry have a higher proportion of insulin deficiency3,4,5,6. This, in part, leads to the higher frequency of young-onset diabetes in people of Asian and African ancestry7. Insulin deficiency is driven by pancreatic beta-cell dysfunction, which could be due an insufficient number of beta-cells, reduced mass, function, early cell death, increasing visceral fat, or perhaps a combination of these factors.

The efficiency of pancreatic beta-cells depends on a highly competent endoplasmic reticulum (ER) due to the high amount of protein synthesis and secretion required, especially in high-demand states8. The unfolded protein response (UPR) is a key cellular process reversing ER stress9,10. Therefore, an unimpaired UPR is crucial for continued insulin synthesis throughout life. Persistent ER stress, caused by impairments in UPR, has been linked to beta-cell dysfunction and death, while a functional UPR promotes increased insulin secretion9,11,12,13.

XBP1 is a highly conserved transcription factor produced in response to ER stress, and its function is to regulate the UPR14,15. Impairment of the UPR is linked to a decline in pancreatic islet function. Islets of eight individuals with T2DM showed decreased spliced XBP1 expression16. We elucidate this mechanism in the visual abstract. In 2022, Lee et al. showed that knocking out XBP1 in high-fat diet-fed mice resulted in the development of T2DM through failed beta-cell compensation and increased apoptosis17. Pancreatic beta-cells in XBP1 knock-out mice showed greater beta-cell dedifferentiation, beta-to-alpha cell transdifferentiation, and impaired proinsulin processing. Together this rendered the beta-cells ineffective when challenged with metabolic stressors, leading to reduced beta-cell function and eventually hyperglycaemia. In 2023, Wang et al. used human single-cell sequencing and multi-omic experiments to identify mechanisms of beta-cell dysfunction in type 2 diabetes. They identified XBP1 as a key transcription factor that was down-regulated in individuals with diabetes compared to those with pre-diabetes and healthy controls, which sensitizes beta-cells to ER stress.

The genetics of beta-cell function, specifically insulin secretion, in humans is not as well studied as other aspects of T2DM. This is largely because most genetic studies of diabetes have been performed in well-powered cohorts of European ancestry, who have a lower burden of beta-cell dysfunction compared to other ancestries18. Furthermore, the largest meta-genome-wide association studies (GWAS) on beta-cell function were performed in healthy white Europeans19 and remain the only contributor even in recent studies20. To address this gap in both data and knowledge, it is imperative to build on biological evidence of beta-cell susceptibility pathways and test their role in T2DM in non-white populations using approaches other than GWAS. Summary statistics from the most recent trans-ancestry genome-wide association study of T2DM reported a credible set of 126 variants in the genomic region on chromosome 22, but so far, none have been implicated with XBP1 expression21. The role of XBP1 expression in maintaining glucose homeostasis, glycaemic control, and response to diabetes therapeutics across human populations has not been previously studied.

We test the hypothesis that a SNPs associated with the expression of XBP1 in pancreatic islet cells drives an increased risk of T2DM, reduced beta-cell function (HOMA-B), impaired pre-treatment glycaemic control (HbA1c), and impaired response to insulin secretagogues. We find that decreased XBP1 expression in pancreatic islet cells is associated with increased type 2 diabetes risk, and that the variants associated with reduced XBP1 expression are more common in East and South Asians as well as African populations. Using data from South Asians cohorts, we demonstrate that the variant is associated with lower beta-cell function and higher stimulated response. In trans-ancestry meta-analyses, we observe the effect of this variant on higher HbA1c. Finally, we demonstrate that drugs like sulphonylureas are less likely to work in carriers of these variants.

Methods

To aid readers, we have included table with acronyms for study names and details in Supplementary Table 1. Throughout the manuscript, where individual-level data are presented, we have used genetically-inferred ancestry.

Identifying potentially lead variant for pancreatic islet expression of XBP1

To identify a Single Nucleotide Polymorphism (SNP) affecting the expression of XBP1 (rs7287124, eSNP), we used cis-eQTLs results from pancreatic islets (n = 420) and pancreatic beta-cell samples (n = 26) from the Integrated Network for Systematic analysis of Pancreatic Islet RNA Expression (InsPIRE) study (details in Supplementary Methods 1). Previous studies have estimated that the most significant variant in cis-eQTL analyses has a causal role 50% of the time (including when the causal variant has not been genotyped or imputed)22. This cis-eQTL was also replicated by the Translational Human Pancreatic Islet Genotype tissue-Expression Resource (TIGER)23. A study flowchart is provided in Supplementary Fig. 1.

Assessing the probability of the lead variant being associated with type 2 diabetes using colocalisation analyses

We performed colocalisation analyses to test if the lead variant identified (rs7287124) was associated with both expression of XBP1 and type 2 diabetes risk. Colocalisation analyses compare the distribution of summary statistics from large-scale association studies of complex traits and eQTLs, while reducing false positive discoveries by correcting for linkage disequilibrium across a genetic region24,25. For colocalisation analyses, we used the programme, ezQTL, which computes the probability of colocalisation using two established methods, HyPrColoc (Hypothesis Prioritisation in multi-trait Colocalization)26 and eCAVIAR (CAusal Variants Identification in Associated Regions)27. Since colocalisation analyses were used to check for linkage contamination, only single-ancestry GWA studies could be utilised. Therefore, we used summary statistics from the two largest available ancestry-specific GWAS of T2DM: one of white Europeans (n cases = 80,154) and the another of East Asian descent (n = 36,614) from the DIAMANTE consortium and Biobank Japan, respectively21,28. Unfortunately, a similarly powered GWAS in south Asians is not available, and therefore colocalisation with south Asians was not undertaken. We then assessed the direction of effect between expression and T2DM risk using inverse variance weighted and weighted median methods in the MR-Egger package29.

All resources used for summary statistics have been published previously and are publicly available, additional details on these are provided in Supplementary Table 1 and Methods 1.

Association of XBP1 expression with glycaemic traits

We used individual-level data from 4 biobanks in this analysis: (1) the Tayside Diabetes Cohort based in Scotland, UK (TDC, comprised of white Europeans), (2) Dr. Mohan’s Diabetes Specialities Centre (DMDSC) and (3) Telemedicine Project for Screening diabetes and complications in rural Tamil Nadu (TREND), both comprised of South Asian Indians based in southern India, and finally (4) Genes & Health (G&H, British South Asian Pakistani and Bangladeshi) based in London, UK. Study design and data collection for these biobanks have been described previously, and additional details in Supplementary Methods30,31,32. Briefly, TDC and G&H are cohorts with record linkage of electronic health data from their local National Health Service boards, which have been used for ancestry-specific diabetes research3,33. DMDSC has record-linked clinical data from visits to a private diabetes clinic and hospital chain in South India. In cohorts with linked health records: TDC, G&H, and DMDSC, only participants with newly diagnosed T2D (glycaemic measures 12 months before and 1 month after diagnosis) were included. In TREND, a survey of diabetes prevalence in rural South India, only participants with screen-detected diabetes were considered. Additional details about the cohorts, including genotyping methods, are provided in Supplementary Methods. All studies used Standard criteria for diagnosis of T2DM using fasting or random plasma glucose, Oral Glucose Tolerance Test (OGTT), or glycated haemoglobin (HbA1c). Serum C peptide (where available), fasted and stimulated glucose, and HbA1c were restricted to measures taken 12 months prior to, and up to, 1 month after diagnosis of T2DM to reduce contamination of effects by diabetes therapies.

Glycaemic traits

Serum C-peptide and plasma glucose were measured only in the DMDSC bioresource and using a fixed protocol, fasted and stimulated measures34. Patients visit the facility in a fasted state, and a blood draw is made. These are then used in Homeostatic Models of Assessment (HOMA) to estimate pancreatic beta-cell function and insulin sensitivity35,36. Similarly, we assessed the effect of the eQTL variant on stimulated blood glucose (SBG, 2hrG) levels in rural-dwelling Asian Indians who were screen-detected as having diabetes during the TREND survey.

HbA1c is commonly recorded and provides a measure of long-term blood sugar control. We tested the association of the variant rs7287124 with HbA1c from all four biobanks. For meta-analyses of the effect of rs7187124 with HbA1c in people without diabetes, we utilised summary estimates from the Meta-Analysis of Glucose and Insulin-related traits Consortium (MAGIC). Summary estimates from the three largest ancestry-specific contributors: Europeans(70%), East Asians(13%), and South Asians were used20.

Sub-group analyses

Lower XBP1 expression impacts beta-cell viability under high metabolic stress17,37. We hypothesised that individuals with lower expression of XBP1 would be susceptible to metabolic stress, leading to a younger age of T2DM onset with non-obese BMI. Therefore, we expect that the effects of the lead SNP would vary by BMI and age at diagnosis, and this would be detectable in an interaction term between the two variables when predicting the association between the lead SNP and glycaemic traits in the full population.

Thresholds for non-obese and obese BMI followed ethnicity-specific guidelines; non-obese for white Europeans was BMI <30 kg/m2, whereas for South Asian Indians <25 kg/m2, and South Asian Pakistanis and Bangladeshis living in the UK, <27.5 kg/m38,39. Associations between the XBP1 lead SNP (rs7287124) were tested in the full cohort, and in those with (1) young onset and non-obese BMI at diagnosis, and (2) in those with older onset and obese BMI. Genetic data from these cohorts have been used previously, and an overview of genotyping methods is provided in Supplementary Methods 3.

Statistical models used

For HOMA-B associations, we used the following 3 models: one unadjusted model assessing genotype effect on beta-cell function, second adjusted for HOMA-S (insulin sensitivity), age, BMI, and sex, and a third to determine if there was an interaction between the variant, age, and BMI at diagnosis. For stimulated blood glucose, models were adjusted for BMI and age at diagnosis, as well as sex. Finally, for HbA1c association testing, the full model was:

HbA1c ~ rs7287124 + Age at diagnosis + BMI at diagnosis + Sex + Age at diagnosis* BMI at diagnosis

Sub-group analyses models were not adjusted for the interaction, the model below was used:

HbA1c ~ rs7287124 + Age at diagnosis + BMI at diagnosis + Sex

All analyses were undertaken using R4.2.140. We meta-analysed the overall and sub-group stratified associations assuming random-effects using the R package Metafor41.

Pharmacogenomic effects on sulphonylurea and GLP1-RA response

Finally, to test the hypothesis that XBP1 expression exerts an effect on glycaemic control through beta-cell function, we used the largest available pharmacogenomic studies from the MetGen Plus and the DIRECT consortia to examine the effect of rs7287124 on response to an incretin mimetic like glucagon-like peptide 1- receptor agonist (GLP-1RA) and insulin secretagogues like sulphonylureas42,43.

Tayside, Scotland UK

Ethical approval for the study was provided by the Tayside Medical Ethics Committee (REF:053/04) and the study has been carried out in accordance with the Declaration of Helsinki. All volunteers provide written informed consent, allowing analysis of data for biomedical research and publication of results. Data access was through application.

Genes and Health, UK

Ethical approval is from the National Research Ethics Committee (London and Southeast), and the Health Research Authority (ref. 13/LO/124), and Queen Mary University of London is Sponsor. All volunteers provide written informed consent, allowing analysis of biomedical data and publication of results. Data access was through application.

All other studies and cohorts used (as summarised in Supplementary Table 1) are publicly available and links to publications have been provided.

Ethical committee approvals

DMDSC and TREND

NIHR Global Health Research Unit on Global Diabetes Outcomes Research, Institutional Ethics Committee of Madras Diabetes Research Foundation, Chennai, India. IRB number IRB00002640. Granted 24th August 2017. All volunteers provide written informed consent, allowing analysis of data for biomedical research and publication of results. Data access was through application and as part of the NIHR Global Health Research Unit.

Results

rs7287124 is the lead eQTL variant for XBP1 expression in pancreatic islets

A single intronic variant; rs7287124 (Chromosome 22: 29239157A > G) was reported to act as a cis-eQTL for XBP1 expression in pancreatic islets (βexpression = −0.18, p = 2 × 10−4, Table 1) by the InsPIRE study44. The direction of expression effect was consistent in pancreatic beta-cells eQTL results (N = 26, βexpression = −0.219, p = 0.54). Using TIGER summary statistics, we confirm that the G allele is associated with lower XBP1 expression (Table 1 and Supplementary Table 2) and is not an eQTL for any other genes in the pancreas. Further, we observed that the effect of rs7287124 in both Biobank Japan28 and the DIAMANTE trans-ancestry meta-GWAS summary estimates21, is directionally consistent with increased diabetes risk (Table 1). The effect allele frequency (EAF) in European populations is 24%, whereas South Asians and East Asians have higher EAF of 54% and 65% respectively, suggesting that these ethnicities will have lower genetically predicted XBP1 levels in pancreatic islets.

Table 1 Allele frequencies across ancestries and effects of lead eQTL variant on XBP1 expression in pancreatic beta-cells, T2DM risk

rs7287124 is associated with XBP1 expression and type 2 diabetes risk in East Asians but not white Europeans

We found a strong posterior probability that the same locus is associated with T2DM risk and XBP1 expression in East Asians (islets = 1%, pancreas 5%) (Fig. 1a), while weaker probabilities were observed in white Europeans (islets = 0.3%, pancreas = 0.55%) using the eCAVIAR method. The HyPrColoc method showed probabilities of 0.88 and 0.87 in islets and pancreas for East Asians, and 0 for both tissues in Europeans. (Supplementary Figs. 24 and Table 3). In East Asians, the variant identified in the colocalisation analyses was rs58004020; however, this variant is not present in build 37 of the human genome for South Asians. Therefore, we identified a proxy variant that is in strong LD with this lead variant and is uniformly available across all ancestry groups and has survived LD-pruning in GWAS summary statistics. The eQTL variant rs7287124 is in LD with rs58004020 in East Asians, Europeans, and South Asians: R2 = 0.99, 0.98, and 0.98 respectively and D’ = 1, 0.72, and 0.85, respectively. Both variants are in the gene ZNRF3 but have no effect on the expression of ZNRF3 in the pancreas or pancreatic islets. Since rs7287124 is the lead SNP of the eQTL and is in complete LD with rs58004020 in East Asians, where it was identified, rs7287124 was selected for downstream investigation with glycaemic traits and drug response. Therefore, colocalisation analyses strongly suggest that XBP1 expression mediates the effect of rs7287124 on T2DM risk.

Fig. 1: Colocalisation and Mendelian Randomisation methods for XBP1 expression and type 2 diabetes risk.
Fig. 1: Colocalisation and Mendelian Randomisation methods for XBP1 expression and type 2 diabetes risk.
Full size image

a Colocalisation probabilities using eCAVIAR and HyPrColoc between XBP1 expression in both pancreatic islets and GTEx pancreas samples and type 2 diabetes (T2D) risk in East Asians (EAS) from Biobank Japan and white Europeans (EUR) from DIAMANTE. Orange line indicates eCAVIAR recommended threshold for significant colocalisation. In both cases, East Asians are more likely to have overlapping causal variants. b Mendelian Randomisation (MR) ratio estimates of association between XBP1 expression in pancreatic islets and T2D risk in Biobank Japan. These plots have been made using 25 variants with LD < 0.4 in the XBP1 region +/−5KB. Weighted median estimate for the effect was −0.083 (SE: 0.025), P value = 9.2 × 10−3, and inverse variance weighted estimate was −0.065 (SE: 0.020), P value = 7.17  × 10 −4.

We then investigate the direction of effect by testing the hypothesis that lower XBP1 expression in pancreatic islets was associated with higher T2DM risk. Using inverse variance weighted and weighted median models, we find this association between the two traits to be significant (B −0.065, SE 0.02, and P < 7.17 × 10−4) and directionally consistent with our hypothesis (Fig. 1b).

Association with human beta-cell function in Asian Indians

Baseline characteristics of cohorts with individual-level data are described in Supplementary Table 4. In 470 participants of the Asian Indian cohort, rs7287124 was associated with HOMA-B in unadjusted (Fig. 2a) and adjusted models (Supplementary Table 5). In additive models of genetic effect adjusted for HOMA-S (insulin sensitivity), age, BMI, and sex, the G allele was associated with lower HOMA-B (B: −0.14, SE: 0.05, P = 5×10-3). An interaction between age and BMI at diagnosis on HOMA-B was observed (P = 4.3 × 10−5). In individuals diagnosed young and non-obese, this effect was greater (n = 82, B: −0.30, SE: 0.14, P = 0.036). However, the effect was not significant in the older and non-lean sub-group (n = 134, P = 0.27). Recessive models were run to confirm the genetic model is additive (Supplementary Table 5). The variant was not associated with insulin resistance or sensitivity (P = 0.12).

Fig. 2: Effect of variant on beta-cell function and 2 hour stimulated glucose levels.
Fig. 2: Effect of variant on beta-cell function and 2 hour stimulated glucose levels.
Full size image

a Box plot showing additive models of XBP1 eQTL variant (rs7287124) and log-transformed Homeostasis Model of Assessment of Beta-Cell function (HOMA-B) calculated using fasted glucose and stimulated C-peptide in 470 individuals with newly diagnosed type 2 diabetes at DMDSC, Chennai, India. After adjustment for age, sex, BMI, and HOMA-B, the association remained significant (P = 5 × 10−3). b Boxplot showing effect of rs7287124 on 2 hr stimulated blood sugar in 484 rural-dwelling Indians from the TREND study. After adjustment for age, sex, and BMI, the association remained significant (P = 0.02). Per allele was associated with 0.7 mmol/L higher 2 h glucose (95% CI: 0.1–1.4).

Association with stimulated blood glucose levels

We then assess the association between rs7287124 and stimulated 2-hour blood glucose (SBG) levels in 484 rural-dwelling Asian Indians from TREND. The variant was associated with higher SBG in models adjusted for BMI, age, and sex. (B = 0.7 mmol/L, SE = 0.3, P = 0.02) (Fig. 2b) However, no effect was observable in either of the sub-groups.

Association with HbA1c levels, especially in individuals diagnosed young and with non-obese BMI

We then tested the hypothesis that the variant will likely be associated with longer-term glycaemic control using HbA1c, which is much more routinely measured. To assess the effect of rs7287124 across ancestries, we included individual-level data from newly-diagnosed South Asian Indians represented by DMDSC (n = 459) and TREND (n = 471), South Asian Bangladeshi and Pakistani represented by Genes and Health (n = 644), and white Europeans from the TDC (4908). Additionally, we used summary statistics from existing GWAS in individuals without T2DM.

The variant was in Hardy-Weinberg equilibrium in all cohorts. The trans-ancestry meta-analyses showed a significant increase in HbA1c per risk allele regardless of diabetes status (Fig. 3a). Random-effects meta-analysis showed the pooled effect across T2DM status to be 4.32 mmol/mol per risk allele (95%CI: 2.60, 6.04), P value 8 × 10−7. Therefore, homozygous carriers of the G allele are estimated to have 8.64 mmol/mol higher HbA1c levels compared to non-carriers. In meta-analyses of the effect in healthy individuals, each risk allele was associated with 5.37 mmol/mol higher HbA1c (95%CI:0.22, 10.52), while in people with T2DM, the effect was 4.37 (95% CI: 2.60– 6.55).

Fig. 3: Trans-ancestry meta-analyses of the effect of rs7287124 on HbA1c levels overall and in young, lean T2DM cases.
Fig. 3: Trans-ancestry meta-analyses of the effect of rs7287124 on HbA1c levels overall and in young, lean T2DM cases.
Full size image

Forest plots showing random effects of XBP1 eQTL variant on HbA1c (mmol/mol). Effects are pooled across summary statistics available from the MAGIC consortium for East Asians, Europeans, and South Asians without T2DM (n = 173,186) and for 6482 individuals with newly-diagnosed T2DM across 4 cohorts: Tayside Scotland (TDS), urban-dwelling South Asian Indians (DMDSC), rural-dwelling South Asian Indians (TREND), and British South Asian Pakistani and Bangladeshis (G&H). The random effect meta-analysis shows an increase of 4.32 mmol/mol of HbA1c per risk allele (95% CI: 2.60–6.04). Forest plots of sub-groups of people with newly diagnosed T2DM showed a stronger effect in 477 individuals diagnosed young with non-obese BMI (6.41 mmol/mol), compared to no effect in those diagnosed older with non-lean BMI (n = 3382).

Approximately 7% of individuals (n = 477) with newly-diagnosed diabetes had young onset (<40 years) with non-obese BMI. Despite using ethnicity-specific BMI thresholds, this proportion differed across cohorts: in Asian Indians, 30% in DMDSC, 34% in TREND (<25 kg/m2), in British South Asians from G&H, 9.5% (<27.5 kg/m2), and in white Europeans from TDS, 2.4% (<30 kg/m2). The meta-analysis in sub-groups shows that the effect of this variant is driven by the young, non-obese sub-group, as the per allele effect was 6.41 mmol/mol (3.04, 9.79), P = 2 × 10−4 (Fig. 3b). In contrast, those diagnosed older with obese BMI (n = 3382) showed no genetic effect (P = 0.97). In our study cohorts, 25% of South Asian Indians (DMDSC + TREND), 20% of South Asian Pakistani and Bangladeshi (G&H), and 0.05% of white Europeans (TDC) were homozygous carriers of the risk allele (G/G) (Supplementary Fig. 5).

rs7287124 is associated with worse response to sulphonylureas but no differential response to GLP1-RA

We hypothesized that carriers of an allele associated with reduced beta-cell function are likely to have limited response to insulin secretagogues. To test this hypothesis, we examined the effect of these variants on glycaemic response to sulphonylureas and GLP-1RA. Carriers of the G allele showed no difference in response to GLP-1RA compared to those with the C allele (P value = 0.42), Supplementary Table 6). For sulphonylureas, we found that the allele was associated with a worse response, with each G allele associated with 0.061% (0.7 mmol/mol) lesser reduction in HbA1c (SE: 0.023, P value = 0.008). The variant is independent of all known sulphonylurea pharmacogenetic, dynamic, or kinetic variants (Supplementary Table 7).

Discussion

Using genetic information, we found that T2DM risk was associated with decreased expression of XBP1. An eQTL for XBP1 reported in pancreatic islets and beta-cells (rs728712(A > G)) was also associated with lower beta-cell function using HOMA-B and higher stimulated glucose in Asian Indians with newly-diagnosed T2DM. In a trans-ancestry meta-analysis, we observed that variant carriers had higher HbA1c at diagnosis (P value 8 × 10−7), and that this effect was stronger in individuals diagnosed young with non-obese BMI (P = 2 × 10−4). The variant is more common in those of South Asian (46–50%), East Asian (65–68%), and West African (45%) ancestry compared with Europeans (22–25%). Finally, we detected that for variant carriers, treatment with insulin secretagogue: sulphonylurea was associated with higher post-treatment HbA1c. A visual abstract summarizing the findings can be found Supplementary Fig. 6.

To examine pre-treatment glycaemic control in individuals with new-onset diabetes, we applied strict inclusion criteria around time of diagnosis. Our trans-ancestry meta-analysis showed the average per allele effect of rs7287124 on HbA1c was 4.32 mmol/mol (P = 8 × 10−7). This is equivalent to an 8.64 mmol/mol (0.79%) difference when comparing homozygotes (those with AA v. GG genotypes). Individuals diagnosed young and with non-obese BMI, the per allele effect was 6.41 mmol/mol, which is equivalent to a difference of 12.82 mmol/mol (1.2%) in HbA1c comparing homozygotes.

All else being equal (age at diagnosis, sex, BMI, and ancestry), the per allele difference for rs7287124 was equivalent to or greater than newer diabetes therapies. A meta-analysis of the efficacy of recently introduced diabetes therapies showed the average reduction in HbA1c in response to DPP4 inhibitors, SGLT2 is, and GLP1-RA was 0.53%, 0.79%, and 0.78% respectively. These differences are more modest than the observed effect of rs7287124 on HbA1c in this study, particularly in the young and non-obese sub-group.

Sulphonylureas, which close the KATP channel and force beta-cells to secrete insulin independently of glucose concentrations, may place an additional load on the ER and secretory pathway. Moreover, sulphonylureas have been demonstrated to directly induce apoptosis in human islets45. We find that reduced islet XBP1 levels in carriers of rs7287124 likely render beta-cells more sensitive to ER stress and therefore have diminished glycaemic control in response to sulphonylurea therapy. Future studies will be required to investigate the impact of sulphonylurea treatment on beta-cell function, differentiation, and survival in risk allele carriers.

GLP-1RA response is possibly limited by the fact that very few users are on monotherapy. Beyond that, there is evidence that GLP-1RA have an endoplasmic stress-reducing effect46,47. This may negate or overcome the diminished ER-stress response associated with reduced XBP1 expression.

The use of BMI as a surrogate for body composition and high metabolic stress is a limitation. BMI has been shown to be less specific than measures such as waist-to-hip ratio, particularly in South Asians, who are more likely to carry fat ectopically. However, waist-to-hip ratio is not widely measured and, therefore, not available across all study cohorts. Our use of ethnicity-specific BMI thresholds in part mitigates the misclassification of obesity. Furthermore, while we have undertaken robust replication, some well-known resources, such as the UK Biobank, were not used as there are under 2000 individuals of South Asian ancestry and 60 of East Asian.

Another limitation of the study is the use of a candidate gene approach necessitated by a lack of data availability in ancestrally diverse populations. The association observed in our study is similar to those seen in GWA studies; the most recent meta-GWAS of HbA1c demonstrated an effect size for common variants (such as rs2971669 in GCK) with similar allele frequency as rs7287124 (in Europeans), was ~3.3 mmol/mol. However, signals such as XBP1 are likely to be masked in such GWAS, as research in populations particularly susceptible to insulin deficiency or beta-cell dysfunction (such as South Asians, East Asians, and West Africans) is limited3,4,5,6,20. While efforts to increase the representation of non-European ancestries have been growing, studies of glycaemic traits are still more strongly representative of Europeans (70%)18,20, leaving the remaining 30% to capture genetic effects in all remaining ancestry groups. Similarly, pharmacogenomic studies are almost entirely representation of European cohorts42,43,48. Finally, due to the challenge of capturing treatment-naïve glycaemic measures, large-scale meta-GWAS for glycaemic traits are undertaken in healthy individuals who do not have diabetes. The lack of individual-level data for these studies, particularly around the age and BMI at diagnosis, precludes them from subgroup analyses.

Therefore, we utilised a candidate gene approach, building on a strong foundation of in vitro, in vivo, and human single-cell sequencing studies. To support this approach, we validate our target using colocalisation analyses with cell-specific transcriptomics, validating the signal in all available resources of pancreatic islet expression data in humans23,44. We observe the same pattern whether using eQTLs in the pancreas or islet cells. The stronger association with pancreas samples is likely due to the larger single study in GTEx relative to islet-specific transcriptomics.

The lower risk allele frequency in Europeans could be a result of positive selection. Indeed, calculating a gene-based score for recent selection based on data from the 1000 Genomes Selection Browser49, we find XBP1 (and a 20 kb region flanking it which includes ZNRF3) is ranked in the top 6.8% of all genes and long non-coding RNAs in terms of evidence of recent selection in the European CEU population relative to the Chinese CHB population (details in Supplementary). Selection conversion suggests that protective alleles for increased XBP1 expression have been selected for to a greater degree in Europeans, who also have a lower risk of beta-cell dysfunction and therefore lower frequency of young and lean onset diabetes. This would explain why risk of lower XBP1 expression is colocalised with T2DM risk in East Asians rather than white Europeans.

Since rs7287124 is located in ZNRF3, we validate its involvement in T2DM by examining evidence for genetic burden of common variants located in the gene using the AMP-T2DM knowledge portal. Using summary statistics of all available GWAS, the Human Genetic Evidence guidelines (HuGE) tool calculates a Bayes Factor of 45 × 5−8, concluding that there is very strong evidence for the involvement of variants located in this gene (eQTL and tagger variants for XBP1 expression) with T2DM risk in humans (Supplementary Fig. 7)50. Interestingly, a test for the well-known beta-cell gene TCF7L2 produces a similar level of evidence.

Our findings are generalizable to the populations included in the study and have particular value to those with the phenotype of young-onset diabetes in the absence of obesity. People of South Asian descent are nearly twice as likely to have young-onset T2DM and nearly four times more likely to have young onset with lean BMI compared to white Europeans3,5. A unique feature of this study is the use of ancestrally diverse populations, and in particular, the dissection of South Asian ancestries. We observe differences across these two South Asian groups in risk allele frequency: in South Indians (DMDSC + TREND), we observe a higher percentage of homozygous risk allele carriers amongst compared to British Pakistanis and Bangladeshis in G&H. The average age of diagnosis, BMI were lower in the South Indians, while HbA1c was higher (Supplementary Table 3). Reflecting this diversity both in location and genetics, we have used different thresholds to classify non-obese BMI in non-migrant Asian Indians compared to migrant South Asians39.

Our findings suggest that ethnicities such as South Asians have lower genetically-determined XBP1 expression, resulting in worse beta-cell response to metabolic stress, leading to T2DM. Our findings support the value of genomics in precision diagnosis and therapeutics. Further studies of reversal of glycaemic deterioration for carriers of XBP1 variant would be useful and could be undertaken using recall-by-genotype trials. The discovery of this association in type 2 diabetes and glycaemic traits in humans highlights the potential for development of strategies to therapeutically enhance XBP1 expression in pancreatic beta-cells to reverse beta-cell decline and dysfunction.