Allelic effects on KLHL17 expression underlie a pancreatic cancer genome-wide association signal at chr1p36.33

Connelly, Katelyn E.; Hullin, Katherine; Abdolalizadeh, Ehssan; Zhong, Jun; Eiser, Daina; O’Brien, Aidan; Collins, Irene; Das, Sudipto; Duncan, Gerard; Chanock, Stephen J.; Stolzenberg-Solomon, Rachael Z.; Klein, Alison P.; Wolpin, Brian M.; Hoskins, Jason W.; Andresson, Thorkell; Smith, Jill P.; Amundadottir, Laufey T.

doi:10.1038/s41467-025-59109-2

Download PDF

Article
Open access
Published: 30 April 2025

Allelic effects on KLHL17 expression underlie a pancreatic cancer genome-wide association signal at chr1p36.33

Nature Communications volume 16, Article number: 4055 (2025) Cite this article

6110 Accesses
1 Citations
6 Altmetric
Metrics details

Subjects

Abstract

Pancreatic Ductal Adenocarcinoma (PDAC) is the third leading cause of cancer-related deaths in the U.S. Both rare and common germline variants contribute to PDAC risk. Here, we fine-map and functionally characterize a common PDAC risk signal at chr1p36.33 (tagged by rs13303010) identified through a genome wide association study (GWAS). One of the fine-mapped SNPs, rs13303160 (OR = 1.23 (95% CI 1.15-1.32), P-value = 2.74×10⁻⁹, LD r² = 0.93 with rs13303010 in 1000 G EUR samples) demonstrated allele-preferential gene regulatory activity in vitro and binding of JunB and JunD in vitro and in vivo. Expression Quantitative Trait Locus (eQTL) analysis identified KLHL17 as a likely target gene underlying the signal. Proteomic analysis identified KLHL17 as a member of the Cullin-E3 ubiquitin ligase complex with vimentin and nestin as candidate substrates for degradation in PDAC-derived cells. In silico differential gene expression analysis of high and low KLHL17 expressing GTEx pancreas samples suggested an association between lower KLHL17 levels (risk associated) and pro-inflammatory pathways. We hypothesize that KLHL17 may mitigate cell injury and inflammation by recruiting nestin and vimentin for ubiquitination and degradation thereby influencing PDAC risk.

Upregulation of KLHL17 promotes the proliferation and migration of non-small cell lung cancer by activating the Ras/MAPK signaling pathway

Article 17 August 2022

Discovering key transcriptomic regulators in pancreatic ductal adenocarcinoma using Dirichlet process Gaussian mixture model

Article Open access 12 April 2021

Deeper insights into long-term survival heterogeneity of pancreatic ductal adenocarcinoma (PDAC) patients using integrative individual- and group-level transcriptome network analyses

Article Open access 30 June 2022

Introduction

Pancreatic cancer is currently the third leading cause of cancer-related deaths and is expected to move to second place by 2030 in the United States¹. Pancreatic ductal adenocarcinoma (PDAC) comprises over 90% of pancreatic cancer cases. While its survival rate has improved over the years, detection, prevention, and treatment of PDAC remains a challenge². Epidemiological factors known to increase the risk of PDAC include Type 2 diabetes (T2D), pancreatitis, smoking, and obesity³. Additionally, both rare high-risk and common, low effect size germline variants are known to contribute to PDAC susceptibility^4,5,6,7,8.

The Pancreatic Cancer Cohort Consortium and Pancreatic Cancer Case-Control Consortium have sought to identify common germline variants that influence risk of PDAC through genome-wide association studies (GWAS). Previous GWAS phases, PanScan I⁵, II⁶, and III^7,8 and PanC4⁹, have identified 17 independent risk signals for PDAC. A 2018 meta-analysis of these four studies (9040 cases and 12,496 controls) and TaqMan replication using samples from the PANcreatic Disease ReseArch (PANDoRA) consortium (2,497 cases and 4611 controls) uncovered five new risk loci⁴. One of the newly identified loci was at chr1p36.33. The initial meta-analysis of PanScan I, II, III and PanC4 identified chr1p36.33 (tagged by single nucleotide polymorphism (SNP) rs13303010; P-value = 7.3 × 10⁻⁷; odds ratio (OR) = 1.20; 95% confidence interval (CI) = 1.12–1.29) as a suggestive risk locus for PDAC (Table 1). A meta-analysis including samples from the PANDoRA consortium (11,537 cases and 17,107 controls) improved the signal beyond the GWAS significance threshold (rs13303010, OR = 1.26, 95% CI 1.19–1.35, P-value = 8.36 × 10⁻¹⁴) (Table 1)⁴, further supporting chr1p36.33 as a PDAC risk locus.

Table 1 Summary statistics for rs13303010 on 1p36.33 PDAC risk locus

Full size table

GWAS have been instrumental in estimating disease risk, identifying candidate genes, and uncovering novel pathways underlying disease development. However, functional characterization of GWAS loci is critical to pinpoint the biological mechanism underlying risk. Most GWAS signals map to non-coding, regulatory regions of the genome and are hypothesized to influence disease risk through allele-specific changes in gene expression¹⁰. Further, each locus is decorated with tens to hundreds of highly correlated variants complicating the identification of functional variant(s) and gene(s) underlying the risk signal. Statistical fine mapping and genomic assays (e.g., ATAC-seq and massively parallel reporter assays) have been beneficial for reducing the number of candidate functional variants to move forward for testing^11,12,13. Chromatin capture and expression quantitative trait locus (eQTL) analysis are valuable for identifying putative functional genes^14,15. While these methods greatly assist in the process of functionally characterizing GWAS risk loci, it is still a time-consuming process. In fact, the biological mechanisms underlying only a handful of the 22 published PDAC risk signals have been functionally characterized to date: chr5p15.33/TERT¹⁶, chr16q23.1/CTRB2¹⁷, and chr13q22.1/DIS3¹⁸.

Interestingly, some overlap has been observed between GWAS loci for PDAC and associated epidemiological risk factors. A number of PDAC risk loci have common SNPs or colocalize with GWAS for traits that influence PDAC risk: chr1q32.1/NR5A2/T2D, chr2p13.3/ETAA1/T2D/waist-hip-ratio (obesity measure), chr8q24.21/MIR1208/T2D, chr9q34/ABO/T2D/body fat percentage, chr12q24.31/HNF1A/Maturity-onset Diabetes of the Young/T2D, chr16q23.1/BCAR1/T2D, chr18q21.32/GRP/T2D/BMI/obesity (https://mvp-ukbb.finngen.fi/ and NHGRI GWAS Catalog¹⁹). Further pathway enrichment analysis of genes ±100 kb of PDAC GWAS risk loci indicate an enrichment of genes associated with Maturity-onset diabetes of the young (KEGG), Sequence-specific DNA-binding transcription factor activity (GO Molecular Function), and Cellular response to UV (GO Biological Process)⁴. Further, DEPICT enrichment analysis indicated that genes associated with GWAS risk loci are highly expressed in numerous gastrointestinal tissues⁴.

Here, we apply fine-mapping methods to identify candidate functional variants for in vitro testing. We subsequently identify rs13303160 as a functional variant at chr1p36.33 likely mediating the expression of KLHL17 through allele-preferential binding of JunB and JunD transcription factors. Our work to characterize KLHL17’s function in PDAC risk suggests a role in mitigating acinar to ductal metaplasia (ADM) and epithelial to mesenchymal transition (EMT) through protein homeostasis.

Results

Fine-mapping of the chr1p36.33 PDAC risk locus

To identify candidate functional variants at the chr1p36.33 risk locus, we first performed a meta-analysis using GWAS data from PanScan I-III^5,6,7,8, PanC4⁹ and an additional 1066 PDAC cases and 9399 controls from the UK Biobank (UKBB)²⁰ resulting in 10,106 cases and 21,895 control subjects with imputed GWAS data. In this analysis, rs13303010 remained the most statistically significant SNP at chr1p36.33 (OR = 1.24, 95% CI 1.16–1.32, P-value = 2.09 × 10⁻¹⁰) (Fig. 1, Table 1).

**Fig. 1: Overview of chr1p36.33 PDAC risk locus.**

To identify credible causal variants (CCVs) underlying this association signal, we applied fine-mapping approaches to the summary statistics for this meta-analysis. We implemented the Bayesian approach Sum of Single Effects Linear Regression (SuSiE)¹³ to identify credible sets of CCV with 90% confidence. SuSiE identified one credible set with five variants (Table 2). We also applied likelihood ratio (LLR < 1:100) and linkage disequilibrium (LD r² > 0.8) thresholds which identified two additional CCVs (Table 2). To be inclusive, we moved all seven variants forward for in vitro functional analysis.

Table 2 Summary statistics and fine-mapping for the CCVs in UK Biobank meta-analysis

Full size table

Most GWAS variants are noncoding and thought to affect gene expression of target genes in an allele specific manner. As such, GWAS variants have been shown to be enriched in active gene regulatory elements indicated by posttranslational modifications of histones (H3K4me3, H3K4me1, H3K27ac) and accessible chromatin^11,21,22,23. We examined the set of statistically fine-mapped variants in the context of maps of pancreas gene regulatory elements (using a ChromHMM 8-state model and ATAC-seq) we generated in PDAC and normal-derived pancreas cell lines²⁴. All the fine-mapped variants at chr1p36.33 lie within active and bivalent transcriptional start sites and active enhancer elements (Fig. 1b, c). Two variants lie within regions of open chromatin (Fig. 1c, Table 2) lending further support for the fine-mapped variants as candidate functional SNPs influencing gene expression in cis or trans.

Assessing allele-preferential binding and gene regulatory activity of candidate functional variants

To identify functional variants underlying this GWAS signal, we first sought to identify variants that exhibited allele-preferential transcription factor (TF) binding. We tested the set of seven fine-mapped variants at chr1p36.33 in electrophoretic mobility shift assays (EMSAs) using nuclear extracts from the PANC-1 pancreatic cancer cell line. Three of the seven variants demonstrated allele-preferential protein binding: rs13303010, rs13303327, and rs13303160 (Fig. 2). The GWAS tag SNP, rs13303010, showed preferential binding to the protective alternate allele (A) (Fig. 2a). Additionally, we observed consistent allele-preferential binding with the protective alternate allele (A) at rs13303327 (Fig. 2b). The third variant, rs13303160, exhibited some preferential binding to the risk-increasing reference (G) allele (Fig. 2c). These preferential binding patterns were also observed with MIA PaCa-2 (Supplementary Fig. 1a–c) and HeLa nuclear lysate (Supplementary Fig. 1d, e).

**Fig. 2: Identification of allele-preferential binding and activity using EMSA and luciferase reporter assays.**

We next moved the three variants with allele-preferential binding forward to evaluate allele-preferential gene regulatory activity using luciferase reporter activity assays. As these variants lie near active transcriptional start sites (TSS) (Fig. 1c), we first tested allele-preferential promoter activity by placing 141-201 base pair (bp) sequences centered on the variant of interest (see Methods) into a luciferase vector without a minimal promoter (in the pGL4.14 vector). We observed strong promoter activity for the rs13303010 constructs compared to the empty vector (EV) but minimal allele-preferential luciferase activity in MIA PaCa-2, PANC-1 and HEK293T cells (Fig. 2d). The second SNP, rs13303327, demonstrated allele-preferential luciferase activity with the alternate (A) allele having a stronger regulatory effect (Fig. 2e). The third SNP, rs13303160, exhibited strong promoter activity with the alternate (A) allele demonstrating an allele-preferential effect in the reverse orientation in the MIA PaCa-2 and HEK293T cells (Fig. 2f).

While rs13303010 and rs13303327 lie in a 1328 bp region between the TSS of the NOC2L and KLHL17 genes, rs13303160 is located 360 bp downstream of the 3’UTR of KLHL17 and 303 bp upstream of the TSS for PLEKHN1 (Fig. 1c), suggesting it could influence promoter and/or enhancer activity at this locus. We therefore tested the sequence surrounding rs13303160 as an enhancer upstream of a minimal promoter and the luciferase gene (in the pGL4.23 vector). As an enhancer element, the rs13303160 constructs demonstrated strong enhancer activity in all three cell lines with the alternate (A) allele exhibiting stronger activity (Fold change (FC) 1.5–2.8, P-value = 1 × 10⁻⁶–0.02; Fig. 2g). Thus, through EMSA and luciferase assays, we narrowed the set of seven fine-mapped candidate functional variants down to two (rs13303327 and rs13303160) that each demonstrate allele-preferential protein binding and gene regulatory activity.

Identifying allele-preferential protein binding

To identify TFs potentially mediating the allele-preferential regulatory activity we observed, we performed an in silico TF motif analysis using PERFECTOS-APE²⁵. This analysis predicts TF binding potential for both alleles of a variant and provides a P-value for the estimated strength of binding. We then calculated the fold-change between P-values as a proxy for the binding affinity change between alleles.

For rs13303327, we identified several E74-like factors (ELF), members of the E-twenty-six (ETS) family of transcription factors, having a 13-24-fold difference in binding P-values between the A and G alleles (Table 3, Supplementary Table 1). The ELF transcription factors recognize the motif GGAA (Fig. 3a) which is disrupted by the risk allele-G at rs13303327 by replacing the last A with a G.

Table 3 Predicted TFs with allelic binding preferences for rs13303327 and rs13303160

Full size table

**Fig. 3: Allele-preferential binding of ELF to rs13303327 in vitro and in vivo.**

To validate TF binding predictions for rs13303327, we performed EMSAs with recombinant ELF1, ELF2, ELF3, and ELF4 proteins to assess allele-preferential binding in vitro (Fig. 3b). ELF2 consistently demonstrated preferential binding to the A allele as compared to the G allele (Fig. 3b, Supplementary Fig. 2). ELF1, 3 and 4 did not demonstrate consistent or differences in allelic binding (Fig.3b). Additionally, including an ELF2 antibody led to a loss of the observed binding further confirming ELF2 as the protein responsible for the allele-preferential band in the original EMSAs (Fig. 3c).

To assess if ELF2 is enriched at rs13303327 in an allele-preferential manner in the context of the native DNA, we performed chromatin immunoprecipitation followed by quantitative PCR (ChIP-qPCR) in two pancreatic cancer cell lines heterozygous at this SNP (Hs766T and SW1990). We were unable to enrich ELF2 at rs13303327 or a predicted positive control region from the SW1990 PDAC cell line. In Hs766T cells, we observed minimal enrichment of ELF2 relative to the IgG and negative control region (Fig. 3d). We then assessed allele-preferential enrichment of the ChIP DNA using a TaqMan genotype probe for rs13303327 and observed an enrichment of ELF2 over IgG; however, an allele-preferential enrichment compared to the input DNA was not noted (Fig. 3e). Based on these results, we conclude that ELF2 does not bind rs13303327 in an allele-preferential manner in PDAC cell lines.

For rs13303160, allele-preferential binding prediction pointed towards preferential binding of TFs to the A allele, which was opposite of what we observed in the EMSA (Fig. 2c) but consistent with the luciferase assay (Fig. 2f, g). The transcription factors with the largest fold change in predicted binding P-values (20-87-fold) were the Activator Protein 1 (AP-1) transcription factors (Jun and Fos) (Table 3 and Supplementary Table 1) with the risk allele (G) disrupting the AP-1 motif (Fig. 4a).

**Fig. 4: Allele-preferential binding of AP-1 proteins to rs13303160 in vitro and in vivo.**

The binding motif for AP-1 (and sequence flanking rs13303160) is a TPA (12-O-Tetradecanoylphorbol-13-Acetate) response element indicating that TPA may induce expression and binding of AP-1 to these elements²⁶. We therefore repeated both the EMSA and luciferase experiments using cells treated with TPA. Upon TPA treatment, we observed an induction of AP-1 as demonstrated by the increase in JunB protein expression (Supplementary Fig. 3). EMSAs with TPA-treated HeLa nuclear extract demonstrated allele-preferential binding opposite what was originally seen (Fig. 2c) with preferential binding to the A allele over the G allele (Fig. 4b). In luciferase assays, cells treated with TPA demonstrated a stronger induction of enhancer activity in the PANC-1 cell line compared to vehicle control (Fig. 4c). However, the allele-preferential effects remained the same with the protective A allele showing higher activity as compared to the risk increasing G allele (FC 2-3.9, P-value = 6.3 × 10⁻³–0.018). This indicates that AP-1 proteins may be responsible for the gene regulatory activity observed in the luciferase assays.

The AP-1 family of proteins includes the c-Fos and c-Jun proteins that can homo- or heterodimerize and play different roles in transcriptional regulation depending on the context²⁶. The in silico analysis did not predict which of the AP-1 protein family members bind rs13303160 as they all recognize the same motif. To determine which AP-1 protein(s) may exhibit allele-preferential binding in vitro, we performed EMSA using recombinant proteins for c-Fos, FosB, Fos-related antigen 1 (FRA-1), Fos-related antigen 2 (FRA-2) (Fig. 4d), Jun, JunB, and JunD (Fig. 4e). c-Fos and FosB demonstrated some allele-preferential binding, though inconsistently. Recombinant JunB and JunD proteins, on the other hand, demonstrated consistent allele-preferential binding. We subsequently performed supershift EMSAs with antibodies against JunB and JunD and observed a shift in the allele-preferential band when the antibody is added to the binding reaction indicating that the allele-preferential bands observed include JunB and JunD (Fig. 4f,g).

We then assessed if the in vitro allele-preferential binding of JunB and JunD translated to the context of genomic DNA. We performed ChIP-qPCR for JunB and JunD in the PDAC SW1990 and Hs766T cell lines (both heterozygous at rs13303160) and observed an enrichment of JunB with two primer sets that encompass the SNP and a third primer set just upstream of the SNP in SW1990 cells (Fig. 4h). We were unable to observe consistent enrichment in the Hs766T cell line with these primers. We additionally examined JunD localization at rs13303160 in the SW1990 cell line and observed a significant enrichment of JunD with primer set 2 (PS2) that encompasses the SNP (Fig. 4h). To assess allele-preferential enrichment in SW1990 cells, we quantified the immunoprecipitated DNA using a TaqMan genotyping assay. Compared to the input DNA, we observed an increased enrichment of the A allele over the G allele for both JunB (FC = 5.2; P-value = 2.4 × 10⁻³) and JunD (FC = 3.4 P-value = 1 × 10⁻⁵) (Fig. 4i).

In summary, functional characterization of seven fine-mapped SNPs at chr1p36.33 led to the identification of three variants with allele-preferential binding in vitro, two of which also displayed allele-preferential gene regulatory activity. Transcription factor binding predictions and in vitro binding assays identified ELF2 and JunB/JunD as likely mediators of the allele-preferential effect at rs13303327 and rs13303160, respectively. Further in vivo analysis using ChIP-qPCR highlighted allele-preferential binding of JunB/D at rs13303160. Thus, we conclude rs13303160 represents a functional variant at the chr1p36.33 pancreatic cancer risk locus.

Identification of likely target genes mediating risk at chr1p36.33

Chr1p36.33 is gene dense with over 30 protein coding genes, microRNAs, and long non-coding RNAs. Using eQTL analysis, we identified two potential target genes at this locus for the tag SNP rs13303010 using the pancreas-specific GTEx (v8, n = 305) dataset: KLHL17 (Normalized Effect Size, NES = 0.35; P-value = 4.9 × 10⁻⁹) and NOC2L (NES = −0.25; P-value = 3.2 × 10⁻⁵) (Fig. 5a). Colocalization analysis²⁷ of the GTEx (v7) pancreas eQTL and the PDAC GWAS signals indicated that the KLHL17 and NOC2L eQTL may share a single causal variant with the GWAS signal (posterior probability = 0.99 and 0.75, respectively). Further, a transcriptome-wide association study (TWAS) from our group identified KLHL17 as a borderline significant gene in the pancreas and multi-tissue models (Z = −3.96, FDR = 0.052; FDR = 0.063, respectively)²⁸. Due to the stronger colocalization probability, suggestive TWAS results, and corroborating in vitro allele-preferential activity, we focused on KLHL17 as a likely target gene underlying the chr1p36.33 risk signal.

**Fig. 5: Analysis of the effects of altered KLHL17, a Cullin3-E3 complex member, expression on cellular growth of PDAC cells in vitro.**

Characterizing the function of KLHL17 in the pancreas

KLHL17 is a member of the Kelch family of proteins. These proteins are substrate recognition proteins for the Cullin3-E3 ubiquitin complex (CRL3)²⁹, but the function of KLHL17 in the pancreas has not been defined. We therefore generated doxycycline-inducible KLHL17-FLAG tagged overexpressing PANC-1 and MIA PaCa-2 cell lines (Supplementary Fig. 4a, b) to study its function in the pancreas. We first examined KLHL17’s cellular localization using immunofluorescence. In the MIA PaCa-2 and PANC-1 overexpressing cell lines, KLHL17 localized throughout the cell (Fig. 5b). This is in contrast to findings from the Human Protein Atlas in A-431 (epidermoid carcinoma), U-251MG (glioblastoma), and U2OS (osteosarcoma) cells, which indicate localization in the nucleoplasm and nuclear bodies³⁰ suggesting possible cell-type and/or context-specific functions for KLHL17. We then assessed whether KLHL17 is a member of the Cullin3-E3 ubiquitin complex. Using whole cell lysates from the PANC-1 induced overexpression and empty vector control cell lines, we performed an immunoprecipitation (IP) with a FLAG antibody (Supplementary Fig. 4c, d) followed by mass spectrometry (IP-MS) to identify proteins interacting with KLHL17. We identified enrichment of KLHL17 along with Cullin-3 (CUL3), E3 ubiquitin-protein ligase RBX1 (RBX1), and ubiquitin-ribosomal protein eL40 fusion protein (UBA52) in the KLHL17-FLAG IP but not in the empty vector control IP (Fig. 5c) indicating that KLHL17 is a part of the CRL3 ubiquitin ligase complex in pancreatic cells.

To identify candidate substrates for degradation by KLHL17, we applied a 1.5-fold enrichment filter (KLHL17-FLAG/Empty Vector) to the KLHL17 IP-MS data and identified 62 proteins (Supplementary Data 1). To further narrow this set of proteins down, we performed a global proteomics experiment where we titrated KLHL17 expression in our PANC-1 inducible overexpression system to examine the candidate protein(s) expression as KLHL17 expression levels increase. As KLHL17 is associated with the CRL3 ubiquitin complex, we would expect that KLHL17 substrates would decrease with increasing KLHL17 protein levels. This narrowed the list of candidate substrates to 23 proteins. Of interest are vimentin/VIM (P-value = 0.0029) and nestin (P-value = 0.027) (Fig. 5c, Supplementary Fig. 4e, f, Supplementary Data 1) as both have roles in early pancreatic carcinogenesis and are upregulated in PDAC^{31,32,33,34,35}. In a follow-up global proteomics experiment using the MIA PaCa-2 KLHL17 overexpression system, we observed a downward trend in VIM with increasing KLHL17 expression, albeit not significant (P-value = 0.17) (Supplementary Fig. 4g, Supplementary Data 1). Nestin, on the other hand, did not display a decrease in expression with increasing KLHL17 (Supplementary Fig. 4h, Supplementary Data 1) in MIA PaCa-2 cells, possibly due to the fact that NES/nestin mRNA³⁰ and protein expression (Supplementary Fig. 4h) is 20-70 fold lower as compared to PANC-1 cells. This suggests that KLHL17 may recruit vimentin and nestin to the CRL3 ubiquitin ligase complex for ubiquitination and degradation.

Assessment of cell growth after KLHL17 over-expression and knockdown

We next assessed if KLHL17 plays a role in pancreas cell proliferation and cell viability. Knockdown of KLHL17 using an siRNA pool in the PANC-1 and MIA PaCa-2 PDAC cell lines revealed a significant reduction in proliferation for both cell lines four days following knockdown (Supplementary Fig. 5a, c). However, we observed limited knockdown efficiency for KLHL17 and a non-specific reduction of the nearby NOC2L gene (Supplementary Fig. 5b, d). As these results did not distinguish whether the reduced levels of KLHL17 or NOC2L mediated the growth effects observed, we implemented CRISPR interference (CRISRPi) in PANC-1 and MIA PaCa-2 cells using guide RNAs targeting the 5’ untranslated region (UTR) of KLHL17. CRISPRi-mediated knockdown of KLHL17 significantly reduced KLHL17 mRNA levels in PANC-1 and MIA PaCa-2 cell lines (Fig. 5e, g) while minimally affecting NOC2L expression (Supplementary Fig. 5e, f). Surprisingly, despite the significant reduction in KLHL17 mRNA levels, KLHL17 protein levels remain unchanged even 36 days after transduction (Supplementary Fig. 5g, h). Subsequent growth analysis revealed that inhibition of KLHL17 mRNA expression did not affect cell proliferation in the two PDAC cell lines (Fig. 5d, f). Furthermore, overexpression of KLHL17 in PANC-1 and MIA PaCa-2 cells did not alter cell growth (Fig. 5h, i, Supplementary Fig. 4a). We hypothesize that the initial growth phenotype observed with KLHL17 siRNA is likely a result of the non-specific reduction in NOC2L expression as targeted knockdown of NOC2L with siRNA also reduces cell growth (Supplementary Fig. 5i–l).

Interrogating the functional consequences of decreased KLHL17

We sought to investigate additional potential functional consequences of altered KLHL17 expression in the pancreas using an in silico knockdown analysis as previously described^28,36. We performed this analysis using the GTExv8 Pancreas RNA-seq dataset. Samples were separated into quartiles based on KLHL17 gene expression and the highest (n = 82) and lowest (n = 82) quartiles (Supplementary Fig. 6a) were subjected to differential gene expression analysis with EdgeR³⁷. We then used Gene Set Enrichment Analysis (GSEA)^38,39 and QIAGEN Ingenuity Pathway Analysis (IPA)⁴⁰ on the significantly differentially expressed genes (GSEA: n = 13,090, FDR < 0.05; IPA: n = 3511, FDR < 0.05 and log2 fold change > |0.5| (FC = 1.4)) to identify enriched pathways. We observed a positive enrichment of gene sets involved in inflammation or inflammation-related diseases in the group with the lowest KLHL17 expression (Fig. 6a–c). Further, IPA indicated similar results with nine of the ten most significant pathways being related to inflammation (Supplementary Fig. 6b) and eight of the nine pathways predicted to be activated in the GTEx samples with lower KLHL17 expression. This indicates that lower KLHL17 expression may be associated with a pro-inflammatory environment in the pancreas. We hypothesize that KLHL17 plays a role in mitigating cell injury and inflammation by recruiting vimentin and nestin for ubiquitination and degradation. This suggests that lower expression of KLHL17 that is associated with the risk promotes a pro-inflammatory environment that is prime for tumorigenesis (Fig. 6d).

**Fig. 6: Gene Set Enrichment Analysis (GSEA) of differentially expressed genes from an in silico *KLHL17* knockdown.**

Discussion

Here, we functionally characterize a common PDAC risk locus at chr1p36.33. Fine mapping identified a set of seven candidate functional variants for further analysis. In vitro binding assays narrowed this set to three SNPs, namely rs13303010, rs13303327, and rs13303160 that exhibited allele-preferential protein binding. Subsequent luciferase experiments revealed in vitro allele-preferential gene regulatory activity for rs13303327 and rs13303160, but not rs13303010. We next identified ELF2 and JunB/D as transcription factors mediating the binding preference to the protective alleles of rs13303327 and rs13303160, respectively. This binding preference for JunB and JunD at rs13303160 was confirmed in the context of native chromatin. Taken together, we conclude that rs13303160 is a functional SNP at chr1p36.33 whose effect is mediated through allele-preferential binding of JunB and JunD.

eQTL analysis in GTEx pancreas samples identified two possible target genes, KLHL17 and NOC2L. NOC2L encodes the protein Novel INHAT Repressor which is known to repress p53 and histone acetyltransferase activity⁴¹. KLHL17 encodes Actinfilin, a known substrate recognition protein for the Cullin3-RING ligases, in neurons⁴². We focused on characterizing KLHL17 as a functional gene underlying the chr1p36.33 GWAS signal for multiple reasons: (1) strong colocalization of the KLHL17 eQTL with the GWAS signal; (2) KLHL17 is a suggestive PDAC TWAS gene²⁸; and (3) allele-preferential luciferase regulatory activity and TF binding that were congruent with the KLHL17 eQTL.

The KLHL family, comprised of 42 proteins, has been reported to have countless roles in cancer including gastrointestinal cancers^29,43. Canonically, KLHL proteins are substrate adaptor proteins for CRL3, an E3 ubiquitin ligase complex⁴⁴, that are responsible for mediating the recognition, ubiquitination and degradation of their protein substrate(s) and are involved in a variety of cellular processes²⁹. Changes in KLHL protein expression affect substrate protein expression and downstream pathways^45,46. Depending on the KLHL family member and the cancer context, both increased and decreased KLHL family member expression has been described in cancer^29,43. Recently, there has been an interest in targeting KLHL proteins to uncover their mechanisms and as a therapeutic strategy for associated diseases such as cancer⁴⁷.

Only one study has characterized KLHL17’s role in cancer⁴⁸. In Liu et al., the authors highlighted a non-canonical role for KLHL17 in the Ras/Map Kinase (MAPK) pathway where KLHL17 overexpression enhances cell proliferation, migration, and colony forming ability for non-small cell lung cancer (NSCLC)⁴⁸. They did not indicate if the involvement with MAPK could be through protein ubiquitination, as is known for the KLHL proteins and KLHL17 in neurons⁴². In PDAC-derived cells, we observed that KLHL17 associates with members of the CRL3 family supporting a canonical function for this protein in the pancreas, as a CRL3 substrate recognition protein. We further identified candidate substrates that KLHL17 may recruit for ubiquitination and degradation. Two potential protein substrates of interest were the intermediate filament proteins vimentin and nestin, which have documented roles in inflammation^31,33,34, tumorigenesis^31,49, and metastasis^32,50,51,52.

Further, Liu and colleagues reported an overexpression of KLHL17 in NSCLC tumors compared to adjacent-normal tissue from TCGA⁴⁸. In the pancreas, there was no significant difference in KLHL17 expression between TCGA PAAD tumor samples and normal or normal-adjacent tissue³⁵. However, eQTL and TWAS results indicate that lower KLHL17 expression levels are associated with increased PDAC risk^14,28. We sought to characterize KLHL17’s function on pancreas cell growth but were presented with several technical challenges: (1) our initial knockdown experiments using siRNA against KLHL17 revealed a growth suppression, but knockdown efficiency was minimal and non-specific; (2) CRISPRi-mediated knockdown of KLHL17, which did not demonstrate a growth phenotype, reduced KLHL17 mRNA expression, however, protein levels remained unchanged, even after 36 days. This confounding finding means that we cannot rule out a growth phenotype for different KLHL17 protein levels. We hypothesize that the initial growth phenotype observed with KLHL17 siRNA can be contributed to the non-specific knockdown of NOC2L, as specific knockdown of NOC2L using siRNA also suppressed cell growth. This suggests that NOC2L may play a role in PDAC tumorigenesis.

To uncover KLHL17’s role as a functional gene underlying the chr1p36.33 risk signal, we utilized an agnostic approach to start to characterize its function in the pancreas. In silico differential gene expression analysis using GTEx pancreas samples comparing those with low KLHL17 to high KLHL17 mRNA expression highlighted an enrichment of upregulated genes involved in inflammation-related pathways and gene sets. This suggests that lower levels of KLHL17 may associate with a pro-inflammatory state in the pancreas, or reduced ability to resolve the consequences of inflammatory signals. While such an in silico approach has been shown to have a strong concordance with knockout mouse RNA-sequencing data³⁶, there are limitations to consider when assessing the results. First, for KLHL17, there is a relatively small difference in expression between samples in the upper and lower quartile (~2 fold). Second, because the dataset is derived from 168 bulk pancreas tissue samples, heterogeneity (e.g. cell type composition, genetics, and environmental factors) is likely to add noise. However, this approach provided us with a basis to develop hypotheses regarding KLHL17’s function in the pancreas.

Inflammation is a contributing risk factor to the development of PDAC⁵³. PDAC arises from pancreatic intraepithelial neoplasia (PanIN) that display cancerous and pancreatic duct cell properties. Although PanIN exhibit duct-like properties, multiple lines of evidence indicate that pancreatic acinar cells that have undergone acinar-to-ductal metaplasia (ADM) are precursors for PDAC⁵⁴. ADM is a trans-differentiation process in which acinar cells lose acinar-specific markers and gain duct cell markers. The plasticity of acinar cells makes them highly sensitive to external stimuli⁵⁵. Acinar cells can recover from an acute stimulus, but with a more sustained stimulus, such as during chronic inflammation, ADM can become irreversible resulting in progression to PanIN⁵⁶. In addition to ADM, epithelial to mesenchymal transition (EMT) is observed in pre-malignant PanIN lesions and is prevalent in regions of ADM with inflammation^31,57.

Vimentin and nestin expression, identified in our proteomics as candidate KLHL17 substrates, are induced upon cell injury and stress. Vimentin plays an important role in EMT, the recruitment of inflammatory cells to resident tissues, the activation of the inflammasome, and fibrosis⁵⁸. Nestin is a binding partner of vimentin and a marker of multi-potent progenitors. Upon stress and injury, nestin expression is induced and nestin-positive cells have the ability to re-enter the cell cycle and differentiate in the repair process^33,34,50,59. In the pancreas, both proteins have been implicated in ADM and pre-malignant EMT and are upregulated in tumors^28,30,35. During ADM following pancreatic injury, a transition population of nestin-positive cells is formed³⁴. Additionally, changes in nestin expression correlate with changes in EMT markers^32,52. Finally, induction of pancreatitis promotes EMT and vimentin expression³¹. As lower expression of KLHL17 is associated with increased risk, this suggests that higher vimentin and nestin expression are associated with an increased risk of PDAC, likely promoting inflammation, ADM and EMT.

Unfortunately, our in vitro model system is limited with regards to understanding the effects of inflammation, ADM, and EMT in the context of KLHL17 expression in the pancreas in vivo. Most pancreas cell lines are derived from tumors or metastatic lesions and have ductal characteristics. Available human normal-derived pancreatic cell lines are duct epithelial cells and would not recapitulate ADM under such stimuli. Primary human pancreas cells, particularly acinar cells, are difficult to maintain in culture as the acinar cell population is quickly lost due to ADM and cell death⁶⁰. Due to these limitations, additional studies are needed to investigate the role of KLHL17 in the context of pancreatic injury, inflammation, and pancreatic tissue.

Taken together, we propose an explanatory model for the chr1p36.33 PDAC GWAS locus wherein lower KLHL17 expression allows for unresolved inflammation and acinar cell injury resulting in increased likelihood of the development of PanINs and progression to PDAC. Under cellular stress, JunB and JunD are induced by extracellular stimuli, such as cytokines⁶¹. Under such conditions, the preferential binding of JunB/D to the protective allele at rs13303160 may drive KLHL17 expression, resulting in the recognition of vimentin and nestin for ubiquitination and degradation and mitigation of ADM, EMT and associated inflammation. In contrast, when the risk allele is present, JunB/D may not sufficiently induce KLHL17 expression to resolve inflammation (Fig. 6d). Further in vivo studies are needed to fully explore the role of KLHL17 in the pancreas especially in modulating carcinogenesis.

Methods

Ethics

All studies obtained consent from participants and Institutional Review Board (IRB) approvals including IRB certifications permitting data sharing in accordance with the NIH Policy for Sharing of Data Obtained in NIH Supported or Conducted Genome-Wide Association Studies (GWAS). Additionally, the PanScan study was approved by the NCI Special Studies Institutional Review Board. All methods and procedures followed the international criteria outline in the Declaration of Helinski.

UK Biobank PDAC GWAS and meta-analysis

We obtained the cancer registry and hospital inpatient information phenotype data from UKBB on August 6^th, 2021 (Approval #29565, Laufey T. Amundadottir). Case criteria for patients diagnosed with pancreatic ductal adenocarcinoma (PDAC): [22006-0.0] = 1 (white British), [22010-0.0] ≠ 1 (recommended genomic analysis exclusions), [22019-0.0] ≠ 1 (sex chromosome aneuploidy exclusion), [31-0.0] (self-reported sex; 0 = female, 1 = male) is consistent with [22001-0.0] (genetic sex), [40012] (Tumor behavior) include only: ‘3’—malignant, primary site, [40008] (age at cancer diagnosis), [40006]—Type of Cancer ICD10 codes (C25*—except for C25.4). Control criteria: exclude control if any of the following Data Fields have values > 0, [134]—Number of self-reported cancers, [2453]—Cancer diagnosed by doctor; exclude if any ICD10 codes in the following starting with C*, [40006]—Type of Cancer ICD10 codes, [41202]—Diagnoses—main ICD10 summary information, [41204]—Diagnoses—secondary ICD10 summary information, [41270] Diagnoses—ICD10 from Hospital inpatient information; exclude if any question in the following with a value that is not “NA”, [20001]—Cancer code self-reported, [20007]—Interpolated age when cancer first occurred, [40001]—Underlying primary cause of death ICD10; [40008]—Age at cancer diagnosis, [40011]—Histology of cancer tumor, [40012]—Behavior of cancer tumor, [40013]—Type of cancer ICD9 codes, [84]—Cancer year/age first occurred (Medical conditions), [40009]—Reported occurrences of cancer—Cancer Register; [31-0.0] (self-reported sex) is consistent with [22001-0.0] (genetic sex); [22010]—Recommended genomic analysis exclusions—filter samples based on poor heterozygosity/missingness as per UKB analysis (exclude if “1”); [22018] - Genetic Relatedness exclusions - exclude those with “1” or “2”; [22019]—Sex chromosome aneuploidy—those with ‘Yes’ were excluded; [22021]—Genetic kinship to other participants—we only used those with “No kinship found”. We selected up to 10 controls for each case based on similar age groups (same gender and birthyear ±5 years) and genetic background (using PCAmatchR⁶²). The imputed genotype data were downloaded from UKBB (November 2021). We removed variants with MAF < 0.5%, INFO < 0.3, and completion rates > 10%. Genome-wide association tests were performed using the “Frequentist” additive model in SNPTEST(v2.5.4-beta3)⁶³ with covariates (age, gender, significant principal components, and array type).

Summary statistics from four previously published GWAS phases (PanScan I-III)^5,6,7,8 and PanC4⁹, and the UK Biobank summary statistics described above (case subjects: n = 10,106 and control subjects: n = 21,895) were used for the meta-analysis. The meta-analysis was performed using Metal (03/25/2011)⁶⁴.

Fine-mapping of the GWAS signal

The chr1p36.33 GWAS region was fine-mapped using the chi-squared likelihood ratio test of the GWAS P-values for all SNPs. Linkage disequilibrium (LD) with the tag SNP rs13303010 was determined for every GWAS variant between chr1:1-2,300,000 (hg19) using LDLink⁶⁵ European population. Variants with a LLR < 1:100 and an LD r² > 0.8 were considered likely functional SNPs for experimental validation. Additionally, the Sum of Single Effects (SuSiE v.012.35 in RStudio 2022.02.3 + 492), a Bayesian approach, was used to identify credible sets of variants likely harboring functional variant(s)¹³. A threshold of 0.9 probability that the credible set of variants contains a causal SNP and L = 10 (up to 10 variants in a credible set) was used. Plink v1.07 and PanScan III data (including cases and controls) was used to generate the required LD matrix^7,8.

Electrophoretic mobility shift assays (EMSA)

SNP-centered 31 bp oligonucleotides (Supplementary Table 2) were labeled with IRDye©700 fluorescent dye on the 5’ end and HPLC purified (Integrated DNA Technologies, Coralville, IA). Competitor oligonucleotides were unlabeled (IDT). Oligonucleotides were annealed at 99 °C and cooled slowly to room temperature. EMSA binding reactions were as follows: 10X binding buffer, 50% glycerol, polyDiDC, nuclear lysate (2.5–5 µg, ActiveMotif, Carlsbad, CA), labeled oligo (5 nM), water. Competition binding reactions included unlabeled oligos at 50X and 100X of the labeled oligo concentration. Reactions were incubated at room temperature in the dark for 20 min. Reactions were then loaded on 4–12% gradient TBE gels (Invitrogen, Waltham, MA) with 0.5X TBE and ran for 100 min at 90 V. Gels were imaged on the BioRad ChemiDoc^TM with the IR680 setting. For the TPA-stimulated EMSAs, 2-h TPA stimulated HeLa nuclear extract (ActiveMotif, Carlsbad, CA) was used.

For supershift assays, the nuclear lysate and antibodies for ELF2, JunB or JunD were incubated for 20 min at room temperature prior to the addition of other binding reaction components. The complete reactions were incubated for an additional 20 min.

For EMSAs with recombinant proteins, 135 ng or 270 ng of recombinant protein was used in place of nuclear lysate. Recombinant ELF1 (TP760629), ELF2 (TP760288), ELF3 (TP300631), ELF4 (TP761826), JUNB (TP303595), JUND (TP316958 4), c-FOS (TP760257), FOS1L (TP302104), FOSB (TP762032), FOS2L (TP760114) proteins were purchased from Origene (Rockville, MD). c-JUN was purchased from Abcam (Waltham, MA) (ab84134). ELF5, which has negligible expression in the pancreas, was not tested.

Plasmids

Luciferase backbone plasmids used for reporter assays were pGL4.23 and pGL4.14 (Promega, Madison, WI). Four gene blocks (forward orientation reference allele, forward orientation alternate allele, reverse orientation reference allele, reverse orientation alternate allele) of varying sizes (141-201 bps) were synthesized for rs13303160, rs13303010, and rs7524174 (Integrated DNA Technologies, Coralville, IA). Sequences to be assayed (Supplementary Table 3) were determined based on ATAC-seq and ChromHMM annotations previously generated in PDAC cell lines²⁴ with the goal of being inclusive of regulatory elements as well as sequence complexity. A 165 bp sequence for rs13303327 was cloned from a heterozygous HapMap CEPH subject (NA12716) (Primers in Supplementary Table 3). The gene blocks were ligated into the luciferase plasmid as either enhancers or promoters. KLHL17 pcDNA3 plasmid was purchased from GenScript (Piscataway, NJ) and subcloned into the pFUGW with a TREG3 promoter for tetracycline-inducible lentiviral expression (Frederick National Cancer Research Laboratories). For CRISPRi experiments, stably expressing dCas9-KRAB-ZIM3 cell lines were generated using pLX303-ZIM3-KRAB-dCas9 (Addgene # 154472, Watertown, MA). Guide RNAs were purchased from Integrated DNA Technologies (Coralville, IA) and cloned into pU6-sgRNA EF1Alpha-puro-T2A-BFP (Addgene #60955, Watertown, MA #60955). Guide RNA sequences (Supplementary Table 4) for CRISPRi were determined using the UCSC Genome Browser CRISPR targets track that uses the CRISPOR program⁶⁶. The negative targeting control (sgNegative) is targeted to an open chromatin region within the same topologically associated domain as KLHL17. Sequences can be found in Supplementary Table 4. siRNA pools for NOC2L (L-020539-02-0010, Horizon Discovery Dharmacon Lafayette, CO), KLHL17 (L-031770-00-0020, Horizon Discovery Dharmacon Lafayette, CO) were purchased.

Cell culture

Cell lines (MIA PaCa-2 (CRM-CRL-1420), PANC-1 (CRL-1469), Hs766T (HTB-134), and HEK293T (CRL-3216)) were purchased from ATCC (Manassas, VA). SW1990 cell line was a gift from Dr. Jaiswal Kshama. PANC-1, Hs766T, and HEK293T cells were maintained in DMEM with 10% FBS. MIA PaCa-2 cells were maintained in DMEM with 2.5% horse serum and 10% FBS. SW1990 cells were maintained in RPMI and 10% FBS. All cells were grown at 37 °C with 5% CO₂. For virus production, HEK293T cells were grown in high-glucose DMEM media supplemented with 10% FBS, 1% glutamine, and 1% sodium pyruvate. Cell lines were routinely tested for mycoplasma and were always found to be negative. Cell lines were also tested for authentication with a panel of short tandem repeats (STRs) via the Identifiler kit (Life Technologies, Carlsbad, CA) and compared with ATCC and DSMZ (German Collection of Microorganisms and Cell Cultures—https://www.dsmz.de/) STR profile datasets. All cell lines with profiles in the databases matched and those not with profiles in this database matched earlier passages of these cell lines in use in our laboratory. Commonly misidentified cell lines were not used in this study.

Lentivirus production

HEK293T cells were plated 24 h prior to transfection. The plasmid of interest and packaging vectors (psPAX2 Addgene #12260, pMD2.G Addgene #12259) were transfected using Lipofectamine 3000 (Thermo Fisher Scientific, Waltham, MA) and media was changed 6–8 h post-transfection. Forty-eight hours later, virus was harvested, filtered with a 0.45-micron syringe filter, and precipitated overnight at 4 °C with 2X PEG precipitation buffer. The precipitated virus was centrifuged at 1000 × g for 30 min, 4 °C, the supernatant was removed and pelleted virus resuspended in PBS.

Generation of stable cell lines

MIA PaCa-2 and PANC-1 cell lines were used to make stably expressing dCas9-KRAB-Zim3 and doxycycline-inducible KLHL17-FLAG overexpressing lines. For stably expressing dCas9 lines, cells were transduced with the virus for 24 h. After 24 h, selection with 10 µg/mL of blasticidin was initiated. Once selection was complete, cells were diluted in a 96-well plate at a seeding density of 0.5 cells/well to isolate individual cell colonies. These colonies were assessed for dCas9-KRAB-Zim3 protein expression by western blot. Transduction with CRISPRi gRNAs was performed in the same manner using the stably expressing dCas9 cells and selected with 8–10 µg/mL of puromycin. To generate doxycycline-inducible KLHL17-FLAG cell lines, previously generated MIA PaCa-2 and PANC-1 cells stably transduced with the Clontech (Mountain View, CA) TetOn3G transactivator plasmid (pLVX-Tet3G)⁶⁷, were transduced with the TREG3p-FUGW-KLHL17 lentiviral expression plasmid (described above) in a 12-well plate. Media was changed 24 h post-transduction and puromycin selection (8-10 µg/mL) initiated at 48 h.

Luciferase assays

MIA PaCa-2, PANC-1, and HEK293T cells were seeded in 48-well plates 24 h prior to transfection. Cells were co-transfected with 1 µg of the luciferase plasmid pGL4.14 or pGL4.23, (Promega, Madison, WI) and 35 ng of pGL4.74 Renilla vector (Promega, Madison, WI) using Lipofectamine 2000 (Thermo Fisher Scientific, Waltham, MA). Forty-eight hours following transfections, cells were washed twice with PBS. Luciferase assays were performed with the Promega Dual Luciferase Reporter Assay following the manufacturer’s instructions. Luciferase activity was normalized to Renilla luciferase activity and reported as the fold-change relative to the empty luciferase vector. A Student’s two-tailed t-test was performed to test for statistically significant differences between alleles. For luciferase assays with TPA, cells were simultaneously transfected and stimulated with 200 nM Phorbol 12-myristate 13-acetate (TPA, Millipore Sigma, Burlington, MA) or DMSO 24 h after plating and harvested 48 h after transfection. Luciferase activity was determined as described above. Significance testing was performed on the ratio of A to G relative luciferase activity using an unpaired two-tailed t-test.

Colocalization analysis

Colocalization analysis was performed on the 2018 GWAS summary statistics⁴ and the GTExv7 pancreas eQTL data (downloaded from the GTEx portal in 2019) using coloc (version 4.0)²⁷. SNPs within 1 Mb of rs13303010 in the GWAS summary statistics were used. This list of SNPs was used to filter the gene-SNP pairs in the GTEx data and colocalization analysis was performed.

Cell growth assays

For KLHL17 overexpression cell proliferation assays, overexpression in stable cell lines was induced with 100 ng/mL of doxycycline 24 h prior to plating. Cells were then plated in 12-well plates. Twenty-four hours after plating (defined as 0 h), cell images were taken on the Lionheart (Biomek, Brea, CA) plate reader every 48 h for seven days.

For KLHL17 and NOC2L knockdown cell growth assays, cells were plated 24 h prior to transfection in a 6-well plate. Cells were then transfected with siRNA pools against NOC2L (L-020539-02-0010, Horizon Discovery Dharmacon Lafayette, CO), KLHL17 (L-031770-00-0020, Horizon Discovery Dharmacon Lafayette, CO) using RNAiMAX (Thermo Fisher Scientific Waltham, MA non-targeting control (D-001220-01-05, Horizon Discovery Dharmacon Lafayette, CO) using RNAiMAX (Thermo Fisher Scientific Waltham, MA). Cell counts were then taken using the Lionheart (0 h). Counts were taken daily for 7 days. At 72 h, cells were re-transfected with siRNA. For CRISPRi growth assays, cells were plated in a 12-well plate following complete selection with puromycin. The first count was taken 24 h after plating and considered 0 h. Counts were subsequently taken every 24 h for 7 days. For all cell proliferation assays, the cell count for each day was normalized to the 0-h cell count and is represented as a fold change relative to the initial cell count. Significance testing was performed using an unpaired two-tailed t-test on the non-targeting/negative controls and the knockdown at each time point.

RNA isolation and reverse transcriptase quantitative PCR

RNA was isolated using the QIAGEN RNeasy kit with a DNase digest and the QIAcube (QIAGEN, Germantown, MD). RNA was reverse transcribed to cDNA using SuperScript III Reverse Transcriptase (Invitrogen, Waltham, MA). Gene expression levels were quantified by qRT-PCR using TaqMan (Thermo Fisher Scientific, Waltham, MA) assays: NOC2L (Hs00610834_g1), KLHL17 (Hs00938625_g1), HPRT (Hs99999909_m1).

Chromatin Immunoprecipitation

Chromatin Immunoprecipitation was performed using the ActiveMotif (Carlsbad, CA) High Sensitivity ChIP kit with SW1990 and Hs766T PDAC cell lines. For JunB/D ChIP-qPCR, cell media was replaced approximately 16 h prior with serum free media and cells were stimulated with 200 nM of TPA (Millipore Sigma, Burlington, MA) for 3.5 h prior to cross-linking. Following crosslinking, cells were lysed with a 25-gauge syringe prior to sonication. Samples were sonicated using the Covaris ME220 (Woburn, MA). Shearing efficiency and chromatin concentration was assessed for the input. Immunoprecipitation was set up following the ActiveMotif protocol. Two hundred microliters of sheared chromatin were used for IPs with either ELF2 (4 µg), JunD (4 µg), JunB (10 µL) or IgG (4 µg) antibodies. IPs were incubated at 4 °C overnight, then protein G agarose beads were added for 3 h. Following DNA purification, enrichment of ELF2, JunB, or JunD at the regions of interest was assessed using qPCR (Supplementary Table 5 for primers) using SYBR Green Master Mix. Percent input was calculated following ActiveMotif’s protocol using a standard curve from input DNA for each primer set. Allelic enrichment was determined using TaqMan genotyping assays (C__57466801_10 and custom design). The ratio of A to G alleles was calculated and compared to the input DNA allelic ratio. Control regions were determined from ELF2 and JunB ChIP-seq data (GSE177468, GSE119930, respectively). Significance testing was performed between the percent input for IgG and the antibody of interest using an unpaired two-tailed t-test. For the allele-specific enrichment, unpaired two-tailed t-tests were performed on the ratio of A to G in the IP compared to the input.

Antibodies

Antibodies for EMSA: ELF2 (ab28726, Abcam; 12499-1-AP, Proteintech Rosemont, IL), JunB (C37F9, Cell Signaling Technologies, Danvers, MA), JunD (D17G2, Cell Signaling Technologies, Danver, MA). For Western Blot and Immunoprecipitation: JunB (1:1000, C37F9, Cell Signaling Technologies, Danver, MA), JunD (1:1000, D17G2, Cell Signaling Technologies, Danver,MA), FLAG (1:1000, F1804, Millipore Sigma, Burlington, MA), KLHL17 (1:500, PA5-56689 Thermo Fisher Scientific, Waltham, MA); 1:500, HPA031251, Millipore Sigma), GAPDH (1:1000, ab125247, Abcam, Waltham, MA), SP1 (1:000 ab13370, Abcam); mouse anti-rabbit light chain specific antibody HRP (1:5000, C840Z39 Jackson ImmunoResearch, West Grove, PA); donkey anti-mouse secondary HRP (1:5000, ab7061, Abcam, Waltham, MA), donkey anti-rabbit secondary HRP (1:5000, ab205722, Abcam, Waltham, MA); For ChIP: ELF2 (4 µg,12499-1-AP, Proteintech, Rosemont, IL), JunB (10 µL, C37F9,Cell Signaling Technologies, Danver, MA), JunD (4 µg, 720035, Invitrogen, Waltham, MA), Rabbit IgG (4 µg, 2729S, Cell Signaling Technologies, Danver, MA). Immunofluorescence: KLHL17 (1 µg/mL, HPA031251, Millipore Sigma, Burlington, MA), AlexaFluor647 (1:1000, A-31573, Thermo Fisher Scientific, Waltham, MA)

Western blot analysis

Cells were lysed with either NP-40-DOC-SDS lysis buffer (150 mM NaCl, 50 mM Tris, 1% NP-40, 1% Sodium Deoxycholate, 1% Sodium dodecyl sulfate) or NE-PER^TM Nuclear and Cytoplasmic extraction kit (Thermo Fisher Scientific, Waltham, MA). Lysates were run on Criterion^TM XT precast 3-8% Tris-Acetate gels (Bio-Rad, Hercules, CA) using XT running buffer and transferred to a PDVF membrane using a standard wet transfer or Bolt^TM 4-12% Bis-Tris Plus gels (Invitrogen, Waltham, MA) using MOPS running buffer and transferred to PDVF membranes using the iBlot^TM Transfer Stacks (Invitrogen, Waltham, MA). Membranes were blocked in 5% Bovine Albumin Serum and incubated with primary antibody overnight at 4 °C. The appropriate HRP secondary antibody was added for a 1 h incubation at room temperature. Following washes with TBS-t, chemiluminescence was detected with SuperSignal™ West Femto Maximum Sensitivity Substrate (Thermo Fisher Scientific, Waltham, MA) and imaged on the Bio-Rad ChemiDoc^TM.

Immunoprecipitation

Cells were treated with 1 µM of MG-132 proteosome inhibitor (Millipore Sigma, Burlington, MA) 16 h prior to harvest. Cells were harvested and lysed using 50 mM HEPES (pH 7.4), 150 mM NaCl, 0.5 mM EDTA, 0.1% NP-40, protease inhibitor, and 10 µg MG-132 on ice for 5 min then freeze-thawed once. Samples were centrifuged at 14,000 × g for 5 min to pellet cell debris. One milligram of protein extract was incubated with 2 µg of FLAG antibody at 4 °C for 2 h. Protein G beads (Invitrogen, Waltham, MA) were added and incubated for an additional 30 min. Five IPs (for a total of 5 mg of protein used) were combined and washed three times with 50 mM HEPES. Ten percent of the IP was used for western blot analysis and the remaining 90% was subjected to mass spectrometry.

Protein digestion and TMT labeling

Samples for mass-spectrometry included 3 independent FLAG-KLHL17 IPs, 4 independent Empty Vector FLAG IPs, as a background control, 3 biological replicates of MIA PaCa-2-KLHL17 and PANC-1 KLHL17 cells treated with 0.1, 1, 10, and 100 ng/mL of doxycycline, and 3 biological replicates of CRISPRi sgNeg (control) and sgKLHL17-1 in PANC-1 and MIA PaCa-2 cells at both Day 8 and Day 36. The cell pellets were lysed in EasyPrep Lysis buffer (Thermo Fisher, CA) according to the manufacturer’s protocol. Lysates were clarified by centrifugation and protein concentration was quantified using the BCA protein estimation kit (Thermo Fisher, CA). Fifteen micrograms of lysate were reduced, alkylated and digested by the addition of trypsin at a ratio of 1:50 (Promega) and incubating overnight at 37 °C.

For TMT labeling 100 µg of TMTpro label (Thermo Fisher, CA) in 100% ACN was added to each sample. After incubating the mixture for 1 h at room temperature with occasional mixing, the reaction was terminated by adding 50 µL of 5% hydroxylamine, 20% formic acid. The peptide samples for each condition were pooled and peptide clean-up was performed using the proprietary peptide clean up columns from the EasyPEP Mini MS Sample Prep kit (Thermo Fisher, CA).

High pH reverse phase fractionation

The first dimensional separation of the peptides was performed using a Waters Acquity UPLC system coupled with a fluorescence detector (Waters, Milford, MA) using a 150 mm × 3.0 mm Xbridge Peptide BEM^TM 2. 5 µm C18 column (Waters, MA) operating at 0.35 mL/min. The dried peptides were reconstituted in 100 µL of mobile phase A solvent (10 mM Ammonium Formate, pH 9.4). Mobile phase B was 10 mM Ammonium Formate/90% acetonitrile, pH 9.4. The column was washed with mobile phase A for 5 min followed by gradient elution 10–50% B (5–60 min) and 50−75% B (60–70 min). The fractions were collected every minute. These 60 fractions were pooled into 24 fractions. The fractions were vacuum centrifuged to dryness and stored at −80 °C until analysis by mass spectrometry.

Mass spectrometry acquisition and data analysis

The dried peptide fractions were reconstituted in 0.1% TFA and subjected to nanoflow liquid chromatography (Thermo Easy nLC 1200, Thermo Scientific, Thermo Scientific) coupled to an Orbitrap LUMOS mass spectrometer (Thermo Scientific, CA). Peptides were separated using a low pH gradient using a 5-50% ACN over 120 min in mobile phase containing 0.1% formic acid at 300 nL/min flow rate. MS scans were performed in the Orbitrap analyzer at a resolution of 120,000 with an ion accumulation target set at 4e⁵ and max IT set at 50 ms over a mass range of 400–1600 m/z. Ions with determined charge states between 2 and 5 were selected for MS2 scans. A cycle time of 3 s was used and a quadrupole isolation window of 0.7 m/z was used for MS/MS analysis. An Orbitrap at 50,000 resolutions with a normalized AGC set at 250 followed by maximum injection time set as “Auto” with a normalized collision energy setting of 38 was used for MS/MS analysis.

Acquired MS/MS spectra were searched against a human Uniprot protein database using a SEQUEST HT and percolator validator algorithms in the Proteome Discoverer 2.4 software (Thermo Scientific, CA). The precursor ion tolerance was set at 10 ppm and the fragment ions tolerance was set at 0.02 Da along with methionine oxidation included as dynamic modification. Carbamidomethylation of cysteine residues and TMT16 plex (304.2071 Da) was set as a static modification of lysine and the N-termini of the peptide. Trypsin was specified as the proteolytic enzyme, with up to 2 missed cleavage sites allowed. Searches used a reverse sequence decoy strategy to control for the false peptide discovery and identifications were validated using the percolator software. Only peptides with less than 50% co-isolation interference were used for quantitative analysis.

Reporter ion intensities were adjusted to correct for the impurities according to the manufacturer’s specification and the abundances of the proteins were quantified using the summation of the reporter ions for all identified peptides. The reporter abundances were normalized across all the channels to account for equal peptide loading. Data analysis and visualization were performed in Microsoft Excel and R.

On-bead trypsin digestion and LC-MS/MS analysis

Beads were resuspended in 30 μL of 50 mM HEPES (pH 8.0) and heated at 95 °C for 10 min. Samples were treated with 2 μg of trypsin and incubated at 37 °C overnight with constant shaking. The supernatant containing the tryptic digests was collected after centrifugation. The residual beads were washed twice with 50 mM HEPES (pH 8.0), and the supernatant and washes combined for maximum recovery. Peptides were desalted using EasyPep MS sample prep kit (Thermo Scientific, CA) and lyophilized. The dried peptides were suspended in 15 µL of 0.1% TFA and analyzed using an EASY-nLC 1200 (ThermoFisher Scientific, Waltham, MA) in front of an Q Exactive HF (ThermoFisher Scientific, Waltham, MA) equipped with an EasySpray ion source. The desalted tryptic peptide was loaded onto an Acclaim PepMap 100 (75 µM × 2 cm) C18 trap column (ThermoFisher Scientific, Waltham, MA) followed by a separation on PepMap RSLC C18 (75 µM × 25 cm) analytical column. The peptides were eluted with a 5% to 27% gradient of Acetonitrile with 0.1% Formic acid over 60 min and 27% to 40% gradient of Acetonitrile with 0.1% Formic acid over 45 min with a flow rate of 300 nL/min. The MS1 was performed at 60,000 resolutions over mass range of 380 to 1580 m/z, with a maximum injection time of 120 ms and an AGC target of 3e6. The MS2 scans were performed at resolution of 15,000, normalized collision energy set at 27, maximum injection time of 50 ms and an AGC target of 2e5.

MS files were searched with Proteome Discoverer 2.4 using the Sequest node. Data were searched against the Uniprot human database using a full tryptic digest, 2 max missed cleavages, minimum peptide length of 6 amino acids and maximum peptide length of 40 amino acids, an MS1 mass tolerance of 10 ppm, MS2 mass tolerance of 0.02 Da

Substrate identification

For analysis to identify enriched proteins in the IP mass spectrometry data we used the peptide matching spectral (PSM) counts. We first added a pseudo count of 1 was added to all peptide matching spectral (PSM) counts to adjust for proteins that had 0 PSM counts in order to calculate a fold-enrichment relative to the empty vector IP. The median of the pseudo count transformed PSM counts was calculated (n = 2 for KLHL17-FLAG; n = 3 for empty vector IP), and the fold change between the KLHL17-FLAG IP/Empty vector IP was calculated. P-values were calculated using the PSM + 1 counts. Proteins that were significantly (P < 0.05) enriched in the KLHL17 IP over the empty vector IP and had an enrichment fold change > 1.5 were considered candidate substrates. We then compared this list to the enriched proteins identified in the pilot IP-MS experiment. The overlapping proteins (n = 62) were moved forward and cross-referenced with the global proteomics experiment examining protein expression with different levels of KLHL17 expression. Proteins that were enriched and displayed a doxycycline dose-dependent decrease in protein expression were determined to be likely KLHL17 substrates.

Immunofluorescence analysis

MIA PaCa-2 and PANC-1 cells overexpressing KLHL17 were plated on coverslips and induced with 100 ng/mL of doxycycline for 72 h. Cells were then washed with PBS and fixed with 4% paraformaldehyde (Thermo Fisher Scientific, Waltham, MA) for 15 min. Following fixation, cells were permeabilized with 0.25% Triton-X for 15 min. Samples were then blocked in 2% BSA (Millipore Sigma, Burlington, MA) for 1 h at room temperature. Primary KLHL17 antibody was added to the cells at 1 µg/mL and incubated at 4 °C overnight. Cells were washed with PBS and incubated with an AlexaFluor 647 secondary antibody for 1 h at room temperature in the dark. Coverslips were then mounted on slides using ProLong^TM Diamond Antifade Mountant with DAPI (Thermo Fisher Scientific, Waltham, MA) and allowed to cure for at least 24 h at 4 °C in the dark. Slides were imaged with a Zeiss Microscope.

In silico knockdown and pathway analysis

An in silico knockdown analysis for KLHL17 was performed using GTExv8 normal pancreas tissue sample-derived RNA-seq data (downloaded from the GTEx portal in 2020) as previously described²⁸. Briefly, we scaled GTExv8 pancreas gene expression counts to account for sequencing depth and RNA composition across all samples (n = 328) to give normalized counts of the trimmed mean of M-values (TMM) using EdgeR (3.38.1 in RStudio 2022.02.3 + 492)³⁷. Genes with no reads for > 20% of the samples were excluded. Normalized reads were used to segregate samples into quartiles based on KLHL17 expression. Only the samples in the top and bottom quartiles (n = 82 each) were used for downstream analysis. The raw counts for these selected samples were re-normalized for sequencing depth to obtain pseudo-counts which were analyzed using the quantile-adjusted conditional maximum likelihood (qCML) method in EdgeR. Differential expression (log2[bottom/top quartile] and P values) was then assessed using an exact test. The statistically significant (FDR < 0.05) differentially expressed genes were subjected to Gene Set Enrichment Analysis (GSEA) using webgestalt.org³⁹. The ranked list for the GSEA was based on log2 fold change. Without an FDR filter, there was no significant enrichment of gene sets. For Ingenuity Pathway Analysis, a FDR < 0.05 and a log2 fold change > |0.5| was used to filter genes for input (IPA, QIAGEN, Germantown, MD). For IPA, both the log2 fold change and FDR were considered in the analysis of enriched pathways.

Transcription factor binding prediction

In silico Transcription Factor binding prediction was performed using PrEdict Regulatory Functional Effect of SNPs by Approximate P value Estimation (PERFECTOS-APE; https://opera.autosome.org/perfectosape/). Briefly, the SNPs with allele-preferential activity were submitted for analysis to determine the probability of a TF binding site from the position matrices in the HOCOMOCO11 transcription factor database. Once a P-value of predicted TF binding sequence was determined for each allele, a fold change was calculated²⁵.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The proteomics data generated in this study have been deposited in the MassIVE database under accession code #MSV000096025 [https://massive.ucsd.edu/ProteoSAFe/private-dataset.jsp?task=e357b08275e34f89b280e716ccd20d8f]. The PDAC GWAS data used in this study are available under controlled access because of data use limitations and can be requested through dbGaP: phs000206.v6.p3 [https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000206.v6.p3] and phs000648.v1.p1 [https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000648.v1.p1]. UK Biobank data is available through the UK Biobank (https://www.ukbiobank.ac.uk/). Source data are provided with this paper.

References

Rahib, L. et al. Projecting cancer incidence and deaths to 2030: the unexpected burden of thyroid, liver, and pancreas cancers in the United States. Cancer Res. 74, 2913 (2014).
Article CAS PubMed Google Scholar
Cronin, K. A. et al. Annual report to the nation on the status of cancer, part 1: National cancer statistics. Cancer 128, 4251–4284 (2022).
Article PubMed Google Scholar
Capasso, M. et al. Epidemiology and risk factors of pancreatic cancer. Acta Biomed. 89, 141–146 (2018).
PubMed Google Scholar
Klein, A. P. et al. Genome-wide meta-analysis identifies five new susceptibility loci for pancreatic cancer. Nat. Commun. 9, 556–556 (2018).
Article ADS PubMed PubMed Central Google Scholar
Amundadottir, L. et al. Genome-wide association study identifies variants in the ABO locus associated with susceptibility to pancreatic cancer. Nat. Genet 41, 986–990 (2009).
Article CAS PubMed PubMed Central Google Scholar
Petersen, G. M. et al. A genome-wide association study identifies pancreatic cancer susceptibility loci on chromosomes 13q22.1, 1q32.1 and 5p15.33. Nat. Genet 42, 224–228 (2010).
Article CAS PubMed PubMed Central Google Scholar
Wolpin, B. M. et al. Genome-wide association study identifies multiple susceptibility loci for pancreatic cancer. Nat. Genet. 46, 994–1000 (2014).
Article CAS PubMed PubMed Central Google Scholar
Zhang, M. et al. Three new pancreatic cancer susceptibility signals identified on chromosomes 1q32.1, 5p15.33 and 8q24.21. Oncotarget 7, 66328–66343 (2016).
Article PubMed PubMed Central Google Scholar
Childs, E. J. et al. Common variation at 2p13.3, 3q29, 7p13 and 17q25.1 associated with susceptibility to pancreatic cancer. Nat. Genet. 47, 911–916 (2015).
Article CAS PubMed PubMed Central Google Scholar
Merkulov, V. M., Leberfarb, E. Y. & Merkulova, T. I. Regulatory SNPs and their widespread effects on the transcriptome. J. Biosci. 43, 1069–1075 (2018).
Article CAS PubMed Google Scholar
Maurano, M. T. et al. Systematic localization of common disease-associated variation in regulatory DNA. Science 337, 1190–1195 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Long, E. et al. Massively parallel reporter assays and variant scoring identified functional variants and target genes for melanoma loci and highlighted cell-type specificity. Am. J. Hum. Genet. 109, 2210–2229 (2022).
Article CAS PubMed PubMed Central Google Scholar
Wang, G., Sarkar, A., Carbonetto, P. & Stephens, M. A simple new approach to variable selection in regression, with application to genetic fine mapping. J. R. Stat. Soc. Ser. B: Stat. Methodol. 82, 1273–1300 (2020).
Article MathSciNet Google Scholar
Aguet, F. et al. The GTEx Consortium atlas of genetic regulatory effects across human tissues. bioRxiv https://doi.org/10.1101/787903 (2019).
Trang, K. B. et al. 3D genomic features across >50 diverse cell types reveal insights into the genomic architecture of childhood obesity. medRxiv https://doi.org/10.1101/2023.08.30.23294092 (2024).
Fang, J. et al. Functional characterization of a multi-cancer risk locus on chr5p15.33 reveals regulation of TERT by ZNF148. Nat. Commun. 8, 15034–15034 (2017).
Article ADS PubMed PubMed Central Google Scholar
Jermusyk, A. et al. A 584 bp deletion in CTRB2 inhibits chymotrypsin B2 activity and secretion and confers risk of pancreatic cancer. Am. J. Hum. Genet. https://doi.org/10.1101/2023.08.30.23294092 (2021).
Hoskins, J. W. et al. Functional characterization of a chr13q22.1 pancreatic cancer risk locus reveals long-range interaction and allele-specific effects on DIS3 expression. Hum. Mol. Genet. 25, 4726–4738 (2016).
CAS PubMed PubMed Central Google Scholar
Cerezo, M. et al. The NHGRI-EBI GWAS Catalog: standards for reusability, sustainability and diversity. Nucleic Acids Res. 53, D998–D1005 (2025).
Article PubMed Google Scholar
Sudlow, C. et al. UK Biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLOS Med. 12, e1001779 (2015).
Article PubMed PubMed Central Google Scholar
Schaub, M. A., Boyle, A. P., Kundaje, A., Batzoglou, S. & Snyder, M. Linking disease associations with regulatory information in the human genome. Genome Res. 22, 1748–1759 (2012).
Article CAS PubMed PubMed Central Google Scholar
Finucane, H. K. et al. Partitioning heritability by functional annotation using genome-wide association summary statistics. Nat. Genet. 47, 1228–1235 (2015).
Article CAS PubMed PubMed Central Google Scholar
Hartl, C. et al. CREdb: a comprehensive database of Cis-regulatory elements and their activity in human cells and tissues. Epigenetics Chromatin 17, 21 (2024).
Article PubMed PubMed Central Google Scholar
Zhong, J. et al. Large-scale multi-omic analysis identifies noncoding somatic driver mutations and nominates ZFP36L2 as a driver gene for pancreatic ductal adenocarcinoma. medRxiv https://doi.org/10.1101/2024.09.22.24314165 (2024).
Vorontsov, I.E., Kulakovskiy, I.V., Khimulya, G., Nikolaeva, D.D. & Makeev, V.J. PERFECTOS-APE-predicting regulatory functional effect of SNPs by approximate p-value estimation. In: Proc. International Conference on Bioinformatics Models, Methods and Algorithms 102–108 (2015) https://doi.org/10.5220/0005189301020108.
Garces de Los Fayos Alonso, I. et al. The Role of Activator Protein-1 (AP-1) Family Members in CD30-Positive Lymphomas. Cancers (Basel) 10 https://doi.org/10.3390/cancers10040093 (2018).
Giambartolomei, C. et al. Bayesian test for colocalisation between pairs of genetic association studies using summary statistics. PLoS Genet. 10, e1004383–e1004383 (2014).
Article PubMed PubMed Central Google Scholar
Zhong, J. et al. A transcriptome-wide association study (TWAS) identifies novel candidate susceptibility genes for pancreatic cancer. J. Natl Cancer Inst. djz246, https://doi.org/10.1093/jnci/djz246 (2020).
Ye, G. et al. The roles of KLHL family members in human cancers. Am. J. Cancer Res. 12, 5105–5139 (2022).
CAS PubMed PubMed Central Google Scholar
Uhlén, M. et al. Proteomics. Tissue-based map of the human proteome. Science 347, 1260419 (2015).
Article PubMed Google Scholar
Huang, X. et al. Luteolin inhibits pancreatitis‑induced acinar‑ductal metaplasia, proliferation and epithelial‑mesenchymal transition of acinar cells. Mol. Med. Rep. 17, 3681–3689 (2018).
CAS PubMed Google Scholar
Hagio, M., Matsuda, Y., Suzuki, T. & Ishiwata, T. Nestin regulates epithelial-mesenchymal transition marker expression in pancreatic ductal adenocarcinoma cell lines. Mol. Clin. Oncol. 1, 83–87 (2013).
Article PubMed Google Scholar
Ishiwata, T. et al. Defined localization of nestin-expressing cells in l-Arginine-induced Acute Pancreatitis. Pancreas 32 (2006).
Means, A. L. et al. Pancreatic epithelial plasticity mediated by acinar cell transdifferentiation and generation of nestin-positive intermediates. Development 132, 3767–3776 (2005).
Article CAS PubMed Google Scholar
Tang, Z., Kang, B., Li, C., Chen, T. & Zhang, Z. GEPIA2: an enhanced web server for large-scale expression profiling and interactive analysis. Nucleic Acids Res. 47, W556–W560 (2019).
Article CAS PubMed PubMed Central Google Scholar
Cobo, I. et al. Transcriptional regulation by NR5A2 links differentiation and inflammation in the pancreas. Nature 554, 533–537 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Robinson, M. D., McCarthy, D. J. & Smyth, G. K. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26, 139–140 (2009).
Article PubMed PubMed Central Google Scholar
Subramanian, A. et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl Acad. Sci. 102, 15545–15550 (2005).
Article ADS CAS PubMed PubMed Central Google Scholar
Elizarraras, J. M. et al. WebGestalt 2024: faster gene set analysis and new support for metabolomics and multi-omics. Nucleic Acids Res. 52, W415-W421, https://doi.org/10.1093/nar/gkae456 (2024).
Krämer, A., Green, J., Pollard, J. Jr & Tugendreich, S. Causal analysis approaches in Ingenuity Pathway Analysis. Bioinformatics 30, 523–530 (2013).
Article PubMed PubMed Central Google Scholar
Hublitz, P. et al. NIR is a novel INHAT repressor that modulates the transcriptional activity of p53. Genes Dev. 19, 2912–2924 (2005).
Salinas, G. D. et al. Actinfilin is a Cul3 substrate adaptor, linking GluR6 kainate receptor subunits to the ubiquitin-proteasome pathway. J. Biol. Chem. 281, 40164–40173 (2006).
Article CAS PubMed Google Scholar
Fu, A. B., Xiang, S. F., He, Q. J. & Ying, M. D. Kelch-like proteins in the gastrointestinal tumors. Acta Pharm. Sin. 44, 931–939 (2023).
Article CAS Google Scholar
Shi, X. et al. Kelch-like proteins: physiological functions and relationships with diseases. Pharmacol. Res. 148, 104404 (2019).
Article CAS PubMed Google Scholar
Bertocci, B. et al. Klhl6 deficiency impairs transitional B cell survival and differentiation. J. Immunol. 199, 2408–2420 (2017).
Article CAS PubMed Google Scholar
Ohta, T. et al. Loss of Keap1 function activates Nrf2 and provides advantages for lung cancer cell growth. Cancer Res. 68, 1303–1309 (2008).
Article CAS PubMed Google Scholar
Zhou, Y. et al. Targeting kelch-like (KLHL) proteins: achievements, challenges and perspectives. Eur. J. Med. Chem. 269, 116270 (2024).
Article CAS PubMed Google Scholar
Liu, Z. et al. Upregulation of KLHL17 promotes the proliferation and migration of non-small cell lung cancer by activating the Ras/MAPK signaling pathway. Lab. Investig. 102, 1389–1399 (2022).
Article CAS PubMed Google Scholar
Carrière, C., Seeley, E. S., Goetze, T., Longnecker, D. S. & Korc, M. The Nestin progenitor lineage is the compartment of origin for pancreatic intraepithelial neoplasia. Proc. Natl Acad. Sci. USA 104, 4437–4442 (2007).
Article ADS PubMed PubMed Central Google Scholar
Ishiwata, T., Matsuda, Y. & Naito, Z. Nestin in gastrointestinal and other cancers: effects on cells and tumor angiogenesis. World J. Gastroenterol. 17, 409–418 (2011).
Article PubMed PubMed Central Google Scholar
Satelli, A. & Li, S. Vimentin in cancer and its potential as a molecular target for cancer therapy. Cell Mol. Life Sci. 68, 3033–3046 (2011).
Article CAS PubMed PubMed Central Google Scholar
Su, H.-T. et al. Stem cell marker nestin is critical for TGF-β1-mediated tumor progression in pancreatic cancer. Mol. Cancer Res. 11, 768–779 (2013).
Article CAS PubMed Google Scholar
Gukovsky, I., Li, N., Todoric, J., Gukovskaya, A. & Karin, M. Inflammation, autophagy, and obesity: common features in the pathogenesis of pancreatitis and pancreatic cancer. Gastroenterology 144, 1199–1209.e1194 (2013).
Article CAS PubMed Google Scholar
Grimont, A., Leach, S. D. & Chandwani, R. Uncertain beginnings: acinar and ductal cell plasticity in the development of pancreatic cancer. Cell Mol. Gastroenterol. Hepatol. https://doi.org/10.1016/j.jcmgh.2021.07.014 (2021).
Wang, L., Xie, D. & Wei, D. in Pancreatic Cancer: Methods and Protocols (ed Gloria H.) 299–308 (Springer New York, 2019).
Pinho, A. V. et al. Adult pancreatic acinar cells dedifferentiate to an embryonic progenitor phenotype with concomitant activation of a senescence programme that is present in chronic pancreatitis. Gut 60, 958 (2011).
Article CAS PubMed Google Scholar
Rhim, A. D. et al. EMT and dissemination precede pancreatic tumor formation. Cell 148, 349–361 (2012).
Article CAS PubMed PubMed Central Google Scholar
Ridge, K. M., Eriksson, J. E., Pekny, M. & Goldman, R. D. Roles of vimentin in health and disease. Genes Dev. 36, 391–407 (2022).
Article CAS PubMed PubMed Central Google Scholar
Bornstein, S. R., Berger, I. & Steenblock, C. Are Nestin-positive cells responsive to stress? Stress 23, 662–666 (2020).
Article CAS PubMed Google Scholar
Baldan, J., Houbracken, I., Rooman, I. & Bouwens, L. Adult human pancreatic acinar cells dedifferentiate into an embryonic progenitor-like state in 3D suspension culture. Sci. Rep. 9, 4040 (2019).
Article ADS PubMed PubMed Central Google Scholar
Ren, F.-J., Cai, X.-Y., Yao, Y. & Fang, G.-Y. JunB: a paradigm for Jun family in immune response and cancer. Front. Cell. Infection Microbiol. 13 https://doi.org/10.3389/fcimb.2023.1222265 (2023).
Brown, D. W., Myers, T. A. & Machiela, M. J. PCAmatchR: a flexible R package for optimal case-control matching using weighted principal components. Bioinformatics 37, 1178–1181 (2021).
Article CAS PubMed Google Scholar
Marchini, J., Howie, B., Myers, S., McVean, G. & Donnelly, P. A new multipoint method for genome-wide association studies by imputation of genotypes. Nat. Genet. 39, 906–913 (2007).
Article CAS PubMed Google Scholar
Willer, C. J., Li, Y. & Abecasis, G. R. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics 26, 2190–2191 (2010).
Article CAS PubMed PubMed Central Google Scholar
Machiela, M. J. & Chanock, S. J. LDlink: a web-based application for exploring population-specific haplotype structure and linking correlated alleles of possible functional variants. Bioinformatics 31, 3555–3557 (2015).
Article CAS PubMed PubMed Central Google Scholar
Concordet, J.-P. & Haeussler, M. CRISPOR: intuitive guide selection for CRISPR/Cas9 genome editing experiments and screens. Nucleic Acids Res. 46, W242–W245 (2018).
Article CAS PubMed PubMed Central Google Scholar
Hoskins, J. W. et al. Transcriptome analysis of pancreatic cancer reveals a tumor suppressor function for HNF1A. Carcinogenesis 35, 2670–2678 (2014).
Article CAS PubMed PubMed Central Google Scholar
Milan, M. et al. FOXA2 controls the cis-regulatory networks of pancreatic cancer cells in a differentiation grade-specific manner. EMBO J. 38, e102161 (2019).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This study utilized the high-performance computational capabilities of the Biowulf Linux cluster at the NIH, Bethesda, MD, USA (http://biowulf.nih.gov). The authors would like to thank the Frederick National Cancer Research Laboratories for the generation of inducible plasmids. We also thank participants and clinical coordinators participating in the Pancreatic Cancer Cohort Consortium, Pancreatic Cancer Case-Control Consortium and the UK Biobank for providing samples for the GWAS studies. The authors acknowledge the research contributions of the Cancer Genomics Research Laboratory for their expertise, execution, and support of this research in the areas of project planning, wet laboratory processing of specimens, and bioinformatics analysis of generated data. The data used for the analyses described in this manuscript were obtained from the GTEx Portal versions 7 and 8 pancreas data in 2019 and 2020. This research has been conducted using data from UK Biobank (Approval #29565, Laufey T. Amundadottir), a major biomedical database (www.ukbiobank.ac.uk)²⁰. This work was supported by the Intramural Research Program (IRP) of the Division of Cancer Epidemiology and Genetics (DCEG), National Cancer Institute (NCI), US National Institutes of Health (NIH). This project has been funded in whole or in part with Federal funds from the National Cancer Institute, National Institutes of Health, under NCI Contract No. 75N910D00024 (L.T.A.). The content of this publication does not necessarily reflect the views or policies of the Department of Health and Human Services, nor does mention of trade names, commercial products, or organizations imply endorsement by the U.S. Government. The American Cancer Society (ACS) funds the creation, maintenance, and updating of the Cancer Prevention Study II cohort. The authors express sincere appreciation to all Cancer Prevention Study-II participants, and to each member of the study and biospecimen management group. The authors would like to acknowledge the contribution to this study from central cancer registries supported through the Centers for Disease Control and Prevention’s National Program of Cancer Registries and cancer registries supported by the National Cancer Institute’s Surveillance Epidemiology and End Results Program. The American Cancer Society funds the creation, maintenance, and updating of the Cancer Prevention Study-II cohort (and/or Cancer Prevention Study-3). The authors express sincere appreciation to all Cancer Prevention Study-II participants, and to each member of the study and biospecimen management group. The authors would like to acknowledge the contribution to this study from central cancer registries supported through the Centers for Disease Control and Prevention’s National Program of Cancer Registries and cancer registries supported by the National Cancer Institute’s Surveillance Epidemiology and End Results Program. Where authors are identified as personnel of the International Agency for Research on Cancer/World Health Organization, the authors alone are responsible for the views expressed in this article and they do not necessarily represent the decisions, policy or views of the International Agency for Research on Cancer / World Health Organization. We acknowledge funding for the Women’s Health Study (WHS) source of data: CA047988, CA182913, HL043851, HL080467, and HL099355. We acknowledge WHI investigators listed here: https://www-whi-org.s3.us-west-2.amazonaws.com/wp-content/uploads/WHI-Investigator-Short-List.pdf. The WHI program is funded by the National Heart, Lung, and Blood Institute, National Institutes of Health, U.S. Department of Health and Human Services through 75N92021D00001, 75N92021D00002, 75N92021D00003, 75N92021D00004, 75N92021D00005. The EPIC-Norfolk study (https://doi.org/10.22025/2019.10.105.00004) has received funding from the Medical Research Council (MR/N003284/1, MC-UU_12015/1 and MC_UU_00006/1) and Cancer Research UK (C864/A14136). We are grateful to all the participants who have been part of the project and to the many members of the study teams at the University of Cambridge who have enabled this research. Support for title page creation and format was provided by AuthorArranger, a tool developed at the National Cancer Institute.

Funding

Open access funding provided by the National Institutes of Health.

Author information

Authors and Affiliations

Laboratory of Translational Genomics, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD, USA
Katelyn E. Connelly, Katherine Hullin, Ehssan Abdolalizadeh, Jun Zhong, Daina Eiser, Aidan O’Brien, Irene Collins, Jason W. Hoskins & Laufey T. Amundadottir
Protein Characterization Laboratory, Frederick National Laboratory for Cancer Research, Leidos Biomedical Research Inc, Frederick, MD, USA
Sudipto Das, Gerard Duncan & Thorkell Andresson
Laboratory of Genomic Susceptibility, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD, USA
Stephen J. Chanock
Metabolic Epidemiology Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD, USA
Demetrius Albanes, Steven C. Moore, Rachael Z. Stoltzenberg-Solomon & Rachael Z. Stolzenberg-Solomon
Department of Oncology, Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins School of Medicine, Baltimore, MD, USA
Kala Visvanathan & Alison P. Klein
Department of Pathology, Sol Goldman Pancreatic Cancer Research Center, Johns Hopkins School of Medicine, Baltimore, MD, USA
Michael Goggins & Alison P. Klein
Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA
Chen Yuan & Brian M. Wolpin
Department of Medicine, Georgetown University, Washington, USA
Jill P. Smith
Occupational and Environmental Epidemiology Branch, Division of Cancer Epidemiology, National Cancer Institute and Genetics, Rockville, MD, USA
Gabriella Andreotti, Laura Beane-Freeman, Sonja I. Berndt, Nathaniel Rothman & Debra Silverman
Departments of Obstetrics and Gynecology and Population Health, NYU Grossman School of Medicine, NYU Perlmutter Comprehensive Cancer Center, New York, NY, USA
Alan A. Arslan
Division of Preventive Medicine, Department of Medicine, Brigham and Women’s Hospital, Boston, MA, USA
Julie E. Buring, J. Michael Gaziano & I-Min Lee
Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
Julie E. Buring, A. Heather Eliassen, Edward L. Giovannucci, I-Min Lee, Lorelei Mucci, Howard D. Sesso, Meir J. Stampfer & Walter C. Willett
Unit of Genetics, Department of Biology, University of Pisa, Pisa, Italy
Daniele Campa
Genomic Epidemiology Group, German Cancer Research Center (DKFZ), Heidelberg, Germany
Federico Canzian
Department of Population Health, NYU Grossman School of Medicine, NYU Perlmutter Comprehensive Cancer Center, New York, NY, USA
Yu Chen
Cancer Genomics Research Laboratory, Frederick National Lab for Cancer Research, Frederick, MD, USA
Charles C. Chung, Belynda Hicks, Amy Hutchinson & Michelle R. Manning
Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA, USA
A. Heather Eliassen & Meir J. Stampfer
Department of Nutrition, Harvard T. H. Chan School of Public Health, Boston, MA, USA
A. Heather Eliassen, Edward L. Giovannucci, Meir J. Stampfer & Walter C. Willett
Division of Aging, Brigham and Women’s Hospital, Boston, MA, USA
J. Michael Gaziano & Howard D. Sesso
Boston VA Healthcare System, Boston, MA, USA
J. Michael Gaziano
SWOG Statistical Center, Fred Hutchinson Cancer Center, Seattle, WA, USA
Phyllis J. Goodman
Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
Christopher A. Haiman
Department of Epidemiology, Johns Hopkins School of Public Health, Baltimore, MD, USA
Miranda R. Jones & Kala Visvanathan
Division of Cancer Epidemiology, German Cancer Research Center (DKFZ), Heidelberg, Germany
Verena Katzke
Division of Public Health Sciences, Fred Hutchinson Cancer Center, Seattle, WA, USA
Charles Kooperberg, Ulrike Peters & Emily White
Trans-Divisional Research Program, Division of Cancer Epidemiology, National Cancer Institute and Genetics, Rockville, MD, USA
Peter Kraft
Cancer Epidemiology Program, University of Hawaii Cancer Center, Honolulu, HI, USA
Loic LeMarchand
Genetic and Molecular Epidemiology Group, Spanish National Cancer Research Center (CNIO), Madrid, Spain
Núria Malats
CIBERONC, Madrid, Spain
Núria Malats & Francisco X. Real
Department of Public Health, Finnish Institute for Health and Welfare (THL), Helsinki, Finland
Satu Männistö
Cancer Epidemiology Division, Cancer Council Victoria, East Melbourne, VIC, Australia
Roger Milne & Melissa C. Southey
Precision Medicine, School of Clinical Sciences at Monash Health, Monash University, Clayton, VIC, Australia
Roger Milne & Melissa C. Southey
Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health, The University of Melbourne, Parkville, VIC, Australia
Roger Milne
Department of Population Science, American Cancer Society, Atlanta, GA, USA
Alpa V. Patel & Caroline Um
Epithelial Carcinogenesis Group, Molecular Oncology Programme, Spanish National Cancer Research Center (CNIO), Madrid, Spain
Francisco X. Real
Universitat Pompeu Fabra, Barcelona, Spain
Francisco X. Real
Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
Veronica W. Setiawan
Division of Epidemiology, Department of Medicine, Vanderbilt Epidemiology Center, Vanderbilt-Ingram Cancer Center, Vanderbilt University School of Medicine, Nashville, TN, USA
Xiao-Ou Shu & Wei Zheng
Department of Clinical Pathology, The University of Melbourne, Melbourne, VIC, Australia
Melissa C. Southey
Division of Cancer Epidemiology National Cancer Institute and Genetics, Rockville, MD, USA
Geoffrey S. Tobias
Paris-Saclay University, UVSQ, Inserm, Gustave Roussy, CESP, Villejuif, France
Therese Truong
Clinical Genetics Branch, Division of Cancer Epidemiology, National Cancer Institute and Genetics, Rockville, MD, USA
Nicolas Wentzensen
Department of Epidemiology, Fred Hutchinson Cancer Center, Seattle, WA, USA
Emily White
Department of Epidemiology and Environmental Health, University of Buffalo, Buffalo, NY, USA
Jean Wactawski-Wende
Department of Quantitative Health Sciences, Mayo Clinic College of Medicine, Jacksonville, FL, USA
Samuel O. Antwi
Department of Epidemiology and Biostatistics, University of California San Francisco, San Francisco, CA, USA
Paige M. Bracci & Elizabeth A. Holly
Lunenfeld-Tanenbaum Research Institute, Sinai Health System and University of Toronto, Toronto, Canada
Steven Gallinger & Rayjean J. Hung
Department of Gastrointestinal Medical Oncology, University of Texas MD Anderson Cancer Center, Houston, TX, USA
Manal Hassan & Donghui Li
Population Health Program, QIMR Berghofer Medical Research Institute, Brisbane, Australia
Rachel E. Neale
Department of Quantitative Health Sciences, Mayo Clinic College of Medicine, Rochester, MN, USA
Kari G. Rabe
Department of Chronic Disease Epidemiology, Yale School of Public Health, New Haven, CT, USA
Harvey A. Risch
Epidemiology Program, University of Hawaii Cancer Center, Honolulu, HI, USA
Herbert Yu

Authors

Katelyn E. Connelly
View author publications
Search author on:PubMed Google Scholar
Katherine Hullin
View author publications
Search author on:PubMed Google Scholar
Ehssan Abdolalizadeh
View author publications
Search author on:PubMed Google Scholar
Jun Zhong
View author publications
Search author on:PubMed Google Scholar
Daina Eiser
View author publications
Search author on:PubMed Google Scholar
Aidan O’Brien
View author publications
Search author on:PubMed Google Scholar
Irene Collins
View author publications
Search author on:PubMed Google Scholar
Sudipto Das
View author publications
Search author on:PubMed Google Scholar
Gerard Duncan
View author publications
Search author on:PubMed Google Scholar
Stephen J. Chanock
View author publications
Search author on:PubMed Google Scholar
Rachael Z. Stolzenberg-Solomon
View author publications
Search author on:PubMed Google Scholar
Alison P. Klein
View author publications
Search author on:PubMed Google Scholar
Brian M. Wolpin
View author publications
Search author on:PubMed Google Scholar
Jason W. Hoskins
View author publications
Search author on:PubMed Google Scholar
Thorkell Andresson
View author publications
Search author on:PubMed Google Scholar
Jill P. Smith
View author publications
Search author on:PubMed Google Scholar
Laufey T. Amundadottir
View author publications
Search author on:PubMed Google Scholar

Consortia

Pancreatic Cancer Cohort Consortium

Jun Zhong
, Demetrius Albanes
, Gabriella Andreotti
, Alan A. Arslan
, Laura Beane-Freeman
, Sonja I. Berndt
, Julie E. Buring
, Daniele Campa
, Federico Canzian
, Stephen J. Chanock
, Yu Chen
, Charles C. Chung
, A. Heather Eliassen
, J. Michael Gaziano
, Edward L. Giovannucci
, Phyllis J. Goodman
, Christopher A. Haiman
, Belynda Hicks
, Amy Hutchinson
, Miranda R. Jones
, Verena Katzke
, Charles Kooperberg
, Peter Kraft
, I-Min Lee
, Loic LeMarchand
, Núria Malats
, Michelle R. Manning
, Satu Männistö
, Roger Milne
, Steven C. Moore
, Lorelei Mucci
, Alpa V. Patel
, Ulrike Peters
, Francisco X. Real
, Nathaniel Rothman
, Howard D. Sesso
, Veronica W. Setiawan
, Xiao-Ou Shu
, Debra Silverman
, Meir J. Stampfer
, Melissa C. Southey
, Geoffrey S. Tobias
, Therese Truong
, Caroline Um
, Kala Visvanathan
, Nicolas Wentzensen
, Emily White
, Chen Yuan
, Wei Zheng
, Jean Wactawski-Wende
, Walter C. Willett
, Brian M. Wolpin
, Rachael Z. Stoltzenberg-Solomon
& Laufey T. Amundadottir

Pancreatic Cancer Case-Control Consortium

Samuel O. Antwi
, Paige M. Bracci
, Steven Gallinger
, Michael Goggins
, Manal Hassan
, Elizabeth A. Holly
, Rayjean J. Hung
, Donghui Li
, Núria Malats
, Rachel E. Neale
, Kari G. Rabe
, Harvey A. Risch
, Herbert Yu
& Alison P. Klein

Contributions

K.E.C. and L.T.A. conceived, designed, and oversaw the study. K.E.C., K.H., E.A., I.C. performed the experiments. K.E.C., K.H., and E.A. performed data analysis. S.D., G.D., and T.A. performed the proteomics experiments and provided expertise for experiment design and data analysis. J.Z., D.R.E., A.O.B., and J.W.H. assisted with bioinformatics and statistical analysis of the GWAS, ChromHMM, and fine-mapping. L.T.A., S.J.C., R.Z.S., B.M.W., A.P.K., J.P.S., and authors from the Pancreatic Cancer Cohort Consortium and Pancreatic Cancer Case-Control Consortium provided the samples, analysis, and oversaw the GWAS. K.E.C. and L.T.A. wrote the manuscript. All authors reviewed the manuscript.

Corresponding authors

Correspondence to Katelyn E. Connelly or Laufey T. Amundadottir.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Alvaro Monteiro and José Rodríguez-Martínez for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Description of Additional Supplementary Files

Supplementary Data 1

Reporting Summary

Transparent Peer Review file

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Connelly, K.E., Hullin, K., Abdolalizadeh, E. et al. Allelic effects on KLHL17 expression underlie a pancreatic cancer genome-wide association signal at chr1p36.33. Nat Commun 16, 4055 (2025). https://doi.org/10.1038/s41467-025-59109-2

Download citation

Received: 27 September 2024
Accepted: 11 April 2025
Published: 30 April 2025
Version of record: 30 April 2025
DOI: https://doi.org/10.1038/s41467-025-59109-2

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Fine-mapping of the chr1p36.33 PDAC risk locus

Assessing allele-preferential binding and gene regulatory activity of candidate functional variants

Identifying allele-preferential protein binding

Identification of likely target genes mediating risk at chr1p36.33

Characterizing the function of KLHL17 in the pancreas

Assessment of cell growth after KLHL17 over-expression and knockdown

Interrogating the functional consequences of decreased KLHL17

Discussion

Methods

Ethics

UK Biobank PDAC GWAS and meta-analysis

Fine-mapping of the GWAS signal

Electrophoretic mobility shift assays (EMSA)

Plasmids

Cell culture

Lentivirus production

Generation of stable cell lines

Luciferase assays

Colocalization analysis

Cell growth assays

RNA isolation and reverse transcriptase quantitative PCR

Chromatin Immunoprecipitation

Antibodies

Western blot analysis

Immunoprecipitation

Protein digestion and TMT labeling

High pH reverse phase fractionation

Mass spectrometry acquisition and data analysis

On-bead trypsin digestion and LC-MS/MS analysis

Substrate identification

Immunofluorescence analysis

In silico knockdown and pathway analysis

Transcription factor binding prediction

Reporting summary

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Consortia

Pancreatic Cancer Cohort Consortium

Pancreatic Cancer Case-Control Consortium

Contributions

Corresponding authors

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Supplementary information

Source data

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links