Introduction

Endometrial cancer is a significant health issue for women. In China, the number of new malignant endometrial cancer cases in 2022 was approximately 80,000, with about 15,000 deaths. This disease has a high incidence and mortality rate among female patients with tumors1. Morphologically, endometrial cancer is categorized into nine subtypes, including endometrioid, serous, clear cell, dedifferentiated carcinoma, and others2. According to The Cancer Genome Atlas (TCGA), endometrial cancer can be divided into four molecular subtypes: non-specific molecular profile, MMR (Mismatch Repair) deficiency, POLE (Polymerase ɛ) mutation, and p53 abnormality3. These new molecular classifications provide valuable tools for diagnosing endometrial cancer, although some challenges remain4. The integration of molecular and histological classifications has led to diverse therapeutic approaches, introducing greater complexity in diagnostics and treatment. Developing a prognostic system that includes molecular classifications enhances risk stratification, potentially benefiting certain cases5. While molecular classification is useful for diagnosing uterine corpus endometrial carcinoma (UCEC), it is important to recognize that endometrial cancer lacks a definitive early biomarker. Thus, molecular classifications should be viewed as supplementary to histomorphometry classification and not serve as a complete replacement6. In addition, unresolved issues with molecular classification necessitate the identification of new biomarkers for early UCEC diagnosis and the prioritization of target discoveries for UCEC therapy.

CDKN2A is located on chromosome 9p21 and is a member of the INK4 family of cyclin-dependent kinase inhibitors. It is expressed in various human cells and encodes several proteins, including p14ARF and p16INK4a, which regulate cell cycle and inhibit tumor development and progression7. CDKN2A deficiency correlates with worse outcomes and poorer prognosis in non-small cell lung cancer, while high CDKN2A expression is closely related to immune infiltration in hepatocellular carcinoma. CDKN2A is a potential biomarker and therapeutic target in both renal cell carcinoma and triple-negative breast cancer8,9,10,11. CDKN2A abnormalities, such as aberrant mutations, deletions, and copy number variations (CNVs), have varied effects on the cell, and specific CDKN2A mechanisms in UCEC should be further explored12.

The roles of CDKN2A in various aspects of UCEC were investigated in the present study. Bioinformatics and public datasets were utilized to evaluate its potential to offer novel predictive or therapeutic insights for UCEC patients. CDKN2A expression levels were analyzed in different cancer stages and histological subtypes, noting an upregulation at both transcriptional and protein levels, which correlated with poor prognosis. CDKN2A was also found to be related to immune-related pathways. Thus, the relationship between CDKN2A and immune therapy was investigated. In addition, CDKN2A somatic copy number alterations (SCNAs) were examined in UCEC samples, revealing significant enrichment of CDKN2A-associated SCNA genes in drug metabolism and suggesting a tight link between CDKN2A and tumor therapy.

Results

Screening process for target gene CDKN2A relates with cell growth in UCEC

Genes identified via intersection analysis are statistically robust and biologically relevant, potentially engaging in disease’s molecular underpinnings, like signal transduction, cell cycle governance, and apoptosis. An intersection of two gene sets from GEPIA2 (Gene Expression Profiling Interactive Analysis) was performed in the present study. One was associated with endometrial cancer prognosis and included 500 genes (p-value < 0.05), and the other was correlated with differential gene expression between endometrial cancer and normal tissue, comprising 7,131 genes (p-value < 0.05). The study aimed to identify key genes that are significantly associated with tumor prognosis and play a crucial role in tumorigenesis, progression, and therapeutic response.

The intersection included 115 genes. A Venn diagram was created using DAVID online (Fig. 1A). The functions and pathways of these genes were then categorized using gene ontology (GO) (Fig. 1B) and Kyoto Encyclopedia of Genes and Genomes (KEGG) (Fig. 1C) analyses.

Fig. 1
figure 1

Screening process for target genes and CDKN2A related to cell growth in UCEC. (A) Venn diagram for genes associated with survival and differential expression. (B, C) GO and KEGG enrichment analysis show different functions of 115 intersecting genes. Genes associated with “growth” and cancer-related genes were selected (red box).

Then, the set of genes related to “growth” from the GO analysis was intersected with the set of cancer-related genes from the KEGG analysis. The genes identified using this intersection were apoptosis factors BAX and CDKN2A. BAX functions as an anti- or pro-apoptotic regulator in various cellular activities, and CDKN2A plays a key role in cell cycle regulation, with somatic mutations frequently observed in most human cancers. Since CDKN2A is a cell cycle-related regulator, it was selected as the focus of the present research due to its alterations in UCEC.

CDKN2A transcription level is upregulated in UCEC associated with poor prognosis

First, CDKN2A transcription levels were evaluated across 24 different cancer types, including UCEC (Fig. 2A). Subsequently, CDKN2A transcription levels were examined in 35 normal and 546 primary tumor tissues from UCEC patients. A differential mRNA expression was observed between normal and tumor tissues, with elevated CDKN2A mRNA levels in the tumor samples (Fig. 2B).

Fig. 2
figure 2

CDKN2A is upregulated in UCEC patients and associated with poor prognosis. (A) CDKN2A expression in pan-cancer. Blue represents normal, red shows tumor. (B) CDKN2A expression between normal and primary tumor tissue. (C) CDKN2A expression in different cancer stages. (D) CDKN2A expression in different histological subtypes. (E) Overall survival rate of low and high CDKN2A expression groups. 174 tumor samples from GTEx were analyzed. Log-rank p = 0.0055. Group cutoff: median. Hazard ratio (HR) = 2.9, p = 0.0078. HR was calculated based on Cox PH model. 95% confidence interval is shown as dotted line. *: p < 0.05; **: p < 0.01; ***: p < 0.001.

Furthermore, elevated CDKN2A transcription levels were noted in tumor tissues across various cancer stages (Fig. 2C) and different histological subtypes (Fig. 2D), which correlated with tumor progression. This suggests that increased CDKN2A mRNA levels are proportional to tumor severity. Concurrently, an increased CDKN2A transcription level was observed in serous and mixed endometrial cancers, suggesting variability in its expression across different UCEC types. Tumor progression and elevated CDKN2A transcription in tumor tissue may be associated with decreased patients’ survival rates (Fig. 2E).

CDKN2A protein expression is upregulated in UCEC

CDKN2A protein levels were analyzed using the Clinical Proteomic Tumor Analysis Consortium (CPTAC) dataset and The University of Alabama at Birmingham Cancer (UALCAN) data analysis portal. CDKN2A protein expression exhibited a positive correlation with UCEC, as evidenced by data from 31 normal and 100 UCEC samples (Fig. 3A). Further analysis of CDKN2A protein expression across different UCEC stages (Fig. 3B), grades (Fig. 3C), and histological subtypes (Fig. 3D) revealed significant upregulation in the early stages and across different histological subtypes.

Fig. 3
figure 3

CDKN2A protein expression is upregulated in UCEC patients. (A) CDKN2A protein expression increases significantly in UCEC samples. Z-values represent standard deviations from the median across samples. CDKN2A protein expression level in different UCEC (B) stages, (C) grades, and (D) histological subtypes. (E) Statistical analysis of all CDKN2A IHC images from the Human Protein Atlas. (F) IHC image of CDKN2A in normal and UCEC tissue. *: p < 0.05; **: p < 0.01; ***: p < 0.001.

To enhance the visualization of CDKN2A protein expression, immunohistochemistry (IHC) UCEC images were collected from the Human Protein Atlas (HPA) and thoroughly analyzed (Fig. 3E). The majority of IHC images of normal endometrial tissues within the HPA indicated “not detected,” with only a few displaying moderate or low expression. Conversely, the expressions in tumor tissues varied. Seven showed strong expression, fourteen were moderate, ten were weak, and only three were unexpressed (Fig. 3E). Patient information, expression status, and IHC images for both normal and strongly expressed samples were assembled (Fig. 3F), facilitating the visualization of elevated CDKN2A expression levels in UCEC.

According to the protein expression analysis, CDKN2A expression was already significant at stage 1 and grade 1 of endometrial cancer, indicating that the increase in CDKN2A expression occurs early in the disease process. For patients, IHC should be one of the most intuitive and convenient methods to diagnose tumors. Therefore, early disease detection using IHC could be highly significant.

Two CDKN2A signaling pathways involved in UCEC

To further elucidate the role of CDKN2A in UCEC pathogenesis, the CPTAC-UCEC dataset was utilized to explore the potential signaling pathways associated with CDKN2A. The CPTAC-UCEC dataset consists of 99 UCEC cases divided into “pathway altered” and “others” based on whether the pathway is altered. The CDKN2A protein expression levels in these two groups were compared to those in 31 normal adjacent cases. The findings indicated that the CDKN2A protein levels were closely correlated with various signaling pathways, such as Hippo, Wnt, p53/Rb, mTOR, RTK, MYC/MYCN, chromatin modifiers, and NRF2 (Fig. 4). Notably, CDKN2A protein expression was closely linked to significant changes in the p53/Rb and NRF2 pathways related to inflammation (indicated in the red box), suggesting CDKN2A’s involvement in the p53 pathway and tumor immunity.

Fig. 4
figure 4

Alteration in p53/Rb- and NRF2-related pathways associated with CDKN2A protein expression. UALCAN analysis of CDKN2A proteomic expression based on Hippo, Wnt, p53/Rb, mTOR, RTK, MYC/MYCN, SWI/SNF complex, chromatin modifier, and NRF2 status. CDKN2A protein expression was determined using mass spectrometry. Z-values represent standard deviations from the median across UCEC samples. A total of 99 CPTAC-UCEC cases were used. *: p < 0.05; **: p < 0.01; ***: p < 0.001.

CDKN2A has interactions with immune therapy in UCEC.

CDKN2A protein expression was observed to be associated with alterations in the p53/Rb and NRF2 inflammation-related pathways (Fig. 4). Considering that the p53 and NRF2 pathways are closely linked to immunotherapy, the association between CDKN2A and immune therapies was investigated.

To explore the correlation between CDKN2A expression and immune cells in UCEC patients, TCGA-UCEC data (n = 545) were processed using TIMER (Tumor Immune Estimation Resource). The results showed that CDKN2A was more negatively correlated with CD8+ T cell activity (p = 2.43e-08) and partially correlated with CD4+ T cells activity (p = 0.0111) (Fig. 5A). A significant correlation was noted between CDKN2A expression and UCEC patients’ survival (p = 7.88e-05) (Fig. 5B). Among the different immune cell types, CD8+ T cells (log-rank p = 0.022) and B cells (log-rank p = 0.019) were shown to correlate with UCEC patients’ survival rate (Fig. 5B). The relationship between CDKN2A, interleukin-2 (IL2), interferon alpha 1 (IFNA1), tumor necrosis factor (TNF), and TP53 were also investigated (Fig. 5C). CDKN2A expression was found to be associated with IL2, IFNA1, TNF, and TP53, with respective correlation coefficients of -0.155 (log-rank p = 2.83e-04), 0.157 (log-rank p = 2.3e-04), 0.144 (log-rank p = 7.54e-04), and − 0.178 (log-rank p = 2.87e-05).

Fig. 5
figure 5

Interaction between CDKN2A and immune therapy in UCEC. (A) CDKN2A correlation with immune cells in the immune system shows a purity-corrected fraction of Spearman rho value and statistical significance. Log2(TPM) is the log2 of the transcript count per million. (B) Clinical relevance of B cells, CD8+T cells, CD4+T cells, macrophages, neutrophils, dendritic cells, and CDKN2A expression with UCEC samples from TCGA. Group cutoff: median. (C–F) IL2, IFNA1, TNF, and TP53 correlation. A total of 545 TCGA-UCEC cases were used.

CDKN2A somatic copy number alteration relates to drug metabolism in UCEC

SCNA is a prevalent characteristic of human cancer and plays a crucial role in tumorigenesis by influencing gene expression. Abnormal DNA CNV constitutes a significant molecular mechanism in various conditions, including cancer, hereditary diseases, and cardiovascular diseases. In UCEC, copy number alterations significantly influence prognosis.

In LinkedOmics, CDKN2A-SCNA data from the TCGA-UCEC dataset (n = 539) was integrated with RNA-seq data from the same cohort for analysis. As shown in the volcano plot in Fig. 6A, the genes on the right (dark red dots) have a significant positive correlation with CDKN2A-SCNA, while the genes on the left (dark green dots) have a significant negative correlation. The expression of the top 50 correlated genes with the most significant p-value were represented using a heatmap (Fig. 6B). Enrichment analysis of the obtained up- and down-regulated genes showed that multiple biological functions were associated with CDKN2A SCNA (Fig. 6C). Drug metabolism analysis revealed the most significant correlation (Fig. 6D). Therefore, because the down-regulated genes associated with CDKN2A-SCNA were correlated with drug metabolism, CDKN2A drug sensitivity was analyzed in both the GDSC (Genomics of Drug Sensitivity in Cancer) and CTRP (The Cancer Therapeutics Response Portal) databases to investigate the relationship between CDKN2A and different drugs (Fig. 6E). Drug sensitivity analysis demonstrated that CDKN2A is related to the sensitivity of numerous drugs. Cerulenin, manumycin A, and PRIMA-1 significantly negatively correlated with CDKN2A. Conversely, Nutlin-3a (-) and PD-0332991 show a significant positive correlation with CDKN2A.

Fig. 6
figure 6

Somatic copy number alteration in CDKN2A is related to drug metabolism in UCEC. (A) Differential genes from RNA-seq and CDKN2A mutant-related SCNA data for TCGA-UCEC differential genes. P-value obtained from Pearson correlation test. (B) Heatmap of differential genes. (C) Top 25 KEGG pathways for differential genes. (D) GSEA plot of CDKN2A-related differential genes. FDR (false discovery rate) < 0.05, t-test corrected for multiple testing. (E) Correlation between CDKN2A expression and sensitivity of GDSC and CTRP drugs (top 30) in pan-cancer. Orange shows positive correlation; purple represents negative correlation.

Discussion

The CDKN2A gene is located at human chromosome band 9p21 and is expressed in a variety of human cells. It encodes several proteins, including p14ARF and p16INK4a, which are known to regulate the cell cycle and inhibit tumor development and progression7. In normal tissue, p16INK4a binds to CDK4/6 with Cyclin D to prevent the cell from entering the S phase and initiating DNA synthesis. It also dephosphorylates pRb (Retinoblastoma protein) to inhibit cell proliferation. However, CDKN2A alterations can impair p16INK4a, increase pRb phosphorylation, and promote tumorigenesis13,14. In normal tissue, p14ARF typically forms a trisomy complex with MDM2 and p53 15. However, p14ARF loses its ability to bind to MDM2 (Mouse double minute 2) and cannot form a complex when abnormalities occur in CDKN2A. Consequently, MDM2 promotes p53 degradation, leading to tumorigenesis. Therefore, CDKN2A is crucial for maintaining tissue homeostasis and a balanced interplay between tumor suppression and senescence.

CDKN2A expression was found to be upregulated in UCEC in the present study, and this upregulation was closely associated with a reduction in patients’ survival time. Additionally, CDKN2A expression was significantly elevated at the early stage of tumor development, with corresponding increases in protein expression. The IHC statistical data from the HPA indicated that CDKN2A protein expression was higher in UCEC compared to the level in normal endometrial tissue.

Tumor development is a complex process that involves the interaction of multiple signaling pathways. In the present proteomics study, CDKN2A was linked to the p53/Rb and NRF2 pathways. Previous studies have shown that the role of NRF2 in immune metabolism and redox balance is significant for immunotherapy16. Consequently, CDKN2A was found to be correlated with CD8+ T cells, which are vital for immune defense against infections and cancer17. Observations in Fig. 5 indicate that a decrease in CD8+ T cell activity correlates with increased CDKN2A expression, suggesting a negative relationship between CDKN2A and CD8+ T cells. The specific molecular mechanisms behind this correlation remain unclear. In addition, the evolutionarily conserved p53 signaling pathway is pivotal in either controlling cell death or maintaining cellular homeostasis through various regulations, and it plays a crucial role in immune therapy18. IL2, IFNA1, and TNF also have important roles in tumor immunotherapy19,20,21.

In the study, CDKN2A expression was associated with the p53/Rb and NRF2 inflammatory pathways, as evidenced by proteomic mass spectrometry data. This discovery prompted further investigation into the potential connections between CDKN2A and immune therapy.

CNV is the duplication or deletion events of the genome22. Uterine serous carcinoma is a subtype of UCEC that is linked to TP53 misexpression, has high mortality and recurrence, and is often CNV-high, marked by cell cycle deregulation and CDKN2A misexpression23. CNV-low-UCEC has a moderate prognosis, with few TP53 mutations but many in the Wnt pathway and in genes like PTEN3,24,25. CNV-high-UCEC is characterized by frequent mutations in TP53, PIK3CA, and PPP2R1A and is associated with poor prognosis25,26. The TCGA database shows that CNV-high-UCEC is heterogeneous and covers nearly all subtypes3,27, with most patients having poor outcomes. Previous studies have indicated that CNV influences drug response and personalized therapy, while integrating mRNA and CNV omics data improves drug sensitivity prediction28,29. Given the link between CNV and UCEC prognosis, TCGA UCEC data were utilized in order to combine SCNA data for CDKN2A with RNA-seq data. GSEA was then conducted on differentially expressed genes. The results showed that down-regulated genes were significantly enriched in drug metabolism in the analysis of SCNA differential genes associated with CDKN2A. Thus, the relationship between CDKN2A and drug sensitivity was evaluated in a variety of cell lines. We found that PRIMA-1, Nutlin-3a, and PD-0332991 interact with the CDKN2A-encoded proteins p16INK4a and p14ARF in the process of regulating the cell cycle and tumor growth30,31,32, while the relationship between Cerulenin and Manumycin A with CDKN2A requires further study.

Next, we plan to delve deeper into the role of CDKN2A in drug metabolism. Molecular assessments of UCEC cases will be collected and performed, then we’ll categorize them into two groups depending on whether CDKN2A exhibits SCNA. Following the sequencing process and other experiments, the variations in gene expression will be analyzed to further investigate the implications of CDKN2A-SCNA.

Using GTEx UCEC data, the present study identified 115 genes associated with prognosis by intersecting the top 500 survival-related genes with those genes expressed differentially. CDKN2A was highlighted via the GO and KEGG analyses, which showed its increased mRNA and protein levels in UCEC and correlation with poor survival. Changes in CDKN2A protein levels were linked to alterations in the p53/Rb and NRF2 pathways, suggesting its relevance in immune therapy. In addition, GSEA of CNV-related downregulated genes in TCGA-UCEC cases indicated significant enrichment in drug metabolism. Elevated CDKN2A expression was closely tied to UCEC prognosisand various treatment strategies, including drug sensitivity, positioning it as a potential biomarker and therapeutic target for UCEC. Further research is needed to explore its mechanisms.

Methods

Gene of interest filtration

GEPIA2 33 (GEPIA 2 (cancer-pku.cn)) is an interactive web application for gene expression analysis based on 8,587 normal and 9,736 tumor samples from the GTEx34 (https://gtexportal.org/home/). There are 174 UCEC cases (with 174 tumor samples paired with 13 normal adjacent tissue samples) in the GTEx. Based on the GTEx-UCEC cases, the top 500 most significant survival-related genes related to UCEC were collected. The median of expression served as the group cutoff. The genes met the predefined statistical significance criteria (p-value < 0.05) and a minimum fold change threshold. A total of 7,131 differential expression genes were used based on the requirements of adjusted p-value < 0.05 and |log2 fold change| > 1.

At https://bioinformatics.psb.ugent.be/webtools/Venn/, The two gene types were intersected to generate a Venn plot and obtain 115 genes with intersections.

Integrated bioinformatics analysis

DAVID35 (https://bioinformatics.psb.ugent.be/webtools/Venn/) is an online analysis tool with integrated discovery and annotation capabilities that provides typical batch annotation and GO terminology augmentation analysis to highlight the most relevant content with a list of GO terms associated with a given gene.

Function enrichment analysis based on the GO36 and KEGG37,38,39 pathway map of 115 correlative genes was carried out using the online OMICSHARE cloud platform tool, which is an open data analysis platform that was used to visualize the enrichment analysis (http://www.omicshare.com/tools/Home/Soft/pathwaygsea).

Investigation of expression

UALCAN is an interactive and convenient web resource for comprehensively analyzing cancer OMICS data40. UALCAN was utilized to examine gene expression across various forms of cancer (http://ualcan.path.uab.edu). Initially, CDKN2A transcription was investigated in 24 different cancer types using data from TCGA41. To explore the mRNA expression levels of CDKN2A in UCEC patients, the TCGA-UCEC dataset encompassing 546 cases was subsequently employed. Further details regarding this dataset are accessible on the TCGA portal via the following link: https://portal.gdc.cancer.gov/. Graphs and plots were generated to delineate the expression profiles for outcome visualization.

UALCAN was used to investigate CDKN2A protein expression in UCEC tissue and to explore the potential signaling pathways involved in CDKN2A activation. The CPTAC-UCEC dataset from the Clinical Proteomic Tumor Analysis Consortium (CPTAC)42 was utilized. This dataset comprises 100 cases (with 100 tumor samples paired with 31 normal adjacent tissue samples) and is accessible at https://pdc.cancer.gov/pdc/. CDKN2A protein expression in UCEC tissue was quantified using tissue mass spectrometry with isobaric labels (tandem mass tags − 10). The method protocol can be found in43. This dataset was also used to investigate the expression profile of CDKN2A based on various pathways (HIP, mTOR, MYC/MYCN, RTK, SWI-SNF complex, Wnt, p53/Rb-related, chromatin modifier, and NRF2 pathways). Z-values represent standard deviations from the median across samples for the given cancer type. Log2 spectral count ratio values from CPTAC were first normalized within each sample profile and then across samples.

Survival rate analysis

Kaplan-Meier (KM) plotter was used to determine the association between target gene expression and OS in UCEC patients. A total of 174 cases (174 tumor samples paired with 13 normal adjacent tissue samples) were analyzed. The correlation between CDKN2A expression and clinical prognosis of UCEC patients was explored based on the different expression levels in GEPIA2 33. Median expression served as the group cutoff. The hazard ratio (HR) was calculated based on the Cox proportional hazards model with 95% confidence interval (CI) (dotted line). Log-rank p-values were also computed.

TIMER analysis

The TIMER (https://cistrome.shinyapps.io/timer/) database was employed to investigate the correlation between CDKN2A expression and immune cell infiltrates, which offers a comprehensive resource for the analysis of immune infiltrates across various cancer types44. TIMER provides estimates of immune cell abundance using several immune deconvolution methods, allowing users to dynamically generate high-quality figures that explore immunological, clinical, and genomic features. TIMER was utilized to analyze the correlation between CDKN2A expression and immune infiltrates, focusing on the clinical relevance of CDKN2A based on TCGA datasets. Specifically, the relationship between CDKN2A and immune cell types, including CD8+ T cells, CD4+ T cells, B cells, neutrophils, dendritic cells, and others, was examined.

Somatic copy number alteration analysis

LinkedOmics (https://www.linkedomics.org/login.php) is a publicly accessible portal that encompasses multi-omics data from all 32 TCGA cancer types and 10 CPTAC cancer cohorts45.

Integrated analysis using LinkedOmics was carried out to gain a deeper understanding of CDKN2A expression patterns and its association with clinical outcomes in UCEC. Initially, analytical tools provided by LinkedOmics were used to perform a joint analysis of the CDKN2A-SCNA and RNA-seq datasets related to 548 UCEC samples from TCGA database (accessible at https://pdc.cancer.gov/pdc/) in order to explore the relationship between CDKN2A CNVs and its mRNA expression levels. The analysis focused on statistically significant gene expression (adjusted p-value < 0.05) and the magnitude of CNVs (log2 fold change > 1). This comprehensive approach was aimed at identifying whether CDKN2A gene CNVs influence its expression in UCEC and further assessing the impact of these variations on patient prognosis. All analyses were conducted following standardized bioinformatics protocols and were aided by the advanced visualization tools offered by LinkedOmics to facilitate the interpretation of the results.

Drug sensitivity analysis

Gene Set Cancer Analysis (GSCA) (GSCA: Gene Set Cancer Analysis (wchscu.cn)) is a database that enables the searching, investigating, and exploring of gene-set cancer analyses pertaining to mRNA expression, mutations, immune infiltration, and drug resistance. GSCA integrates over 10,000 multidimensional genomic data points for 33 distinct cancer types from TCGA and more than 750 small molecule drugs from GDSC (Genomics of Drug Sensitivity in Cancer) and CTRP (The Cancer Therapeutics Response Portal)46.

The GSCA platform was used to examine CDKN2A drug sensitivity in a range of cell lines, thereby offering insights that can inform the integration of clinical data and small molecule drugs to identify potential therapeutic agents.