Introduction

Cancer is a major global health challenge and one of the leading causes of death worldwide1. As a complex polygenic disease, the occurrence and development of cancer involve multiple genetic and environmental factors2. In recent years, with the rapid development of high-throughput sequencing technologies, cancer research has entered a new era. Traditional cancer research typically focuses on specific cancer types, whereas pan-cancer analysis offers a global perspective aimed at revealing the commonalities and differences between different cancer types3,4. This approach not only helps to understand the general biological characteristics of cancer but also promotes the development of cross-cancer treatment strategies.

In recent years, the role of immune-modulatory related proteins in cancer has gradually become a research focus, with cyclophilin A (CypA, encoded by the PPIA gene) attracting significant attention for its multifunctionality. PPIA is a ubiquitously expressed peptidyl-prolyl cis-trans isomerase, initially identified as the intracellular receptor for the immunosuppressant cyclosporine A (CsA), involved in regulating T cell activation5. When PPIA binds to its cell surface receptor CD147, it may enable tumor cells to evade T cell immune surveillance6. The immunosuppressive activity of CsA is believed to originate from the binding of the CsA-PPIA complex to calcineurin7, and PPIA gene knockout mice are resistant to the immunosuppression by CsA8. However, besides its immunoregulatory role, PPIA also plays a critical part in cancer development by influencing cell cycle progression, resistance to apoptosis, and inflammatory responses through catalyzing protein folding, nuclear translocation, and regulating signal transduction. Studies have shown that knocking down PPIA expression can inhibit the tyrosine phosphorylation and nuclear translocation of the signal transducer and activator of transcription 3 (Stat3) and induce interleukin-6 (IL-6)-dependent apoptosis in multiple myeloma cells9. PPIA enhances NF-κB transcriptional activity by directly binding to a specific sequence in the N-terminal of p65/RelA, promoting its nuclear translocation while maintaining protein stability10. Knocking down PPIA can inhibit the IL-8 activity induced by the NF-κB pathway, and hamper the proliferation and tumor growth of glioblastoma cells11. Additionally, PPIA can bind to the extracellular receptor CD147 and stimulate the expression of cyclin D1 and survivin through STAT3 phosphorylation, which co-localizes with the cancer stem cell marker CD44, promoting tumorigenesis and the growth of pancreatic cancer12. PPIA is upregulated in lung cancer, where it directly interacts with nuclear factor E2-related factor 2 (NRF2) through a hydrophobic sequence containing trans-proline 174, preventing NRF2 ubiquitination and degradation, thus promoting the proliferation of lung cancer cells13. Moreover, PPIA has also been shown to be upregulated in esophageal cancer14 and breast cancer15. Although the mechanisms of PPIA in cancer regulation have been reported in some cancer types, a systematic analysis of its expression heterogeneity, epigenetic modification characteristics, and association with genomic instability at a pan-cancer level is still lacking.

In this study, we conducted a comprehensive pan-cancer analysis of PPIA using pan-cancer data from multiple databases and cohorts, including TCGA, GTEx, and GEO. Our analysis covered various dimensions, including gene expression, single-cell expression distribution, epigenetic modifications, tumor immune characteristics, drug sensitivity, and prognosis analysis. Figure 1 summarizes the findings of this study.

Fig. 1
figure 1

A schematic summary of the research content.

Materials and methods

Data collection

We obtained the pan-cancer expression data and methylation data in TPM processed by the STAR pipeline from the TCGA database (https://portal.gdc.cancer.gov/)16. Clinical information, survival data, transcript expression data, and GTEx expression data for pan-cancer were retrieved from the UCSC Xena database (http://xena.ucsc.edu/)17. Single-cell transcriptome sequencing data for 13 types of cancer were obtained from the GEO database (https://www.ncbi.nlm.nih.gov/geo/), with GEO accession numbers GSE161529 (triple-negative breast cancer)18, GSE138709 (intrahepatic cholangiocarcinoma)19, GSE166555 (colorectal cancer)20, GSE196756 (esophageal squamous cell carcinoma)21, GSE139324 (HPV + HNSC)22, GSE121636 (KIRC)23, GSE149614 (hepatocellular carcinoma)24, GSE154109 (acute myeloid leukemia)25, GSE123902 (LUAD)26, GSE184880 (high-grade serous ovarian cancer)27, GSE137829 (castration-resistant prostate cancer)28, GSE167297 (diffuse-type gastric cancers)29, and GSE163203 (papillary thyroid carcinoma)30. To increase the reliability of our study, HCC data from the ICGC database (https://dcc.icgc.org/)31 was obtained as a validation set. Additionally, we validated the expression level of PPIA in HCC using multiple independent cohorts from the GEO database, with GEO accession numbers GSE1452032, GSE3637633, GSE4543634, GSE5423635, GSE6389836, GSE7642737, GSE11279038, and GSE12124839. Detailed cohort information is provided in Table S1. The study encompasses 33 cancer types, including Adrenocortical carcinoma (ACC), Bladder urothelial carcinoma (BLCA), Breast invasive carcinoma (BRCA), Cervical squamous cell carcinoma and endocervical adenocarcinoma (CESC), Cholangio carcinoma (CHOL), Colon adenocarcinoma (COAD), Lymphoid neoplasm diffuse large B-cell lymphoma (DLBC), Esophageal carcinoma (ESCA), Glioblastoma multiforme (GBM), Head and neck squamous cell carcinoma (HNSC), Kidney chromophobe (KICH), Kidney renal clear cell carcinoma (KIRC), Kidney renal papillary cell carcinoma (KIRP), Acute myeloid leukemia (LAML), Brain lower grade glioma (LGG), Liver hepatocellular carcinoma (LIHC), Lung adenocarcinoma (LUAD), Lung squamous cell carcinoma (LUSC), Mesothelioma (MESO), Ovarian serous cystadenocarcinoma (OV), Pancreatic adenocarcinoma (PAAD), Pheochromocytoma and paraganglioma (PCPG), Prostate adenocarcinoma (PRAD), Rectum adenocarcinoma (READ), Sarcoma (SARC), Skin cutaneous melanoma (SKCM), Stomach adenocarcinoma (STAD), Testicular germ cell tumors (TGCT), Thyroid carcinoma (THCA), Thymoma (THYM), Uterine corpus endometrial carcinoma (UCEC), Uterine carcinosarcoma (UCS), and Uveal melanoma (UVM).

Pan-cancer expression and prognosis analysis

The expression level of PPIA in pan-cancer and its corresponding normal tissues was explored using R software. The impact of PPIA on pan-cancer cell lines was analyzed through the DepMap platform (https://depmap.org/portal/)40. Within the “Dependent Cell Lines” module, the effect of PPIA knockdown on the viability of pan-cancer cell lines was assessed, where a gene score of 0 indicates no impact on cancer cell viability, and a negative score with a lower value suggests stronger inhibition of cancer cell viability, with the median score for common oncogenes being − 1. Survival data for 31 cancer types, including Overall Survival (OS), Disease-Specific Survival (DSS), Disease-Free Interval (DFI), and Progression-Free Interval (PFI), were obtained from UCSC Xena41. Subsequently, PPIA expression data were matched with survival data for each sample, and univariate Cox regression analysis and Kaplan-Meier survival analysis were performed using the “survival” and “forestplot” packages in R software to explore the correlation between PPIA expression levels and pan-cancer prognosis. Similarly, clinical information on 31 cancer types, including age, T, N, M stage, and overall stage status, was obtained from UCSC Xena. PPIA expression data were then matched with clinical information for each sample to analyze PPIA expression levels across different clinical groups. The “timeROC” package was used to plot Receiver Operating Characteristic (ROC) curves to evaluate the sensitivity and accuracy of PPIA for overall survival prognosis. Finally, the expression level of PPIA was also examined across different tumor stages.

Single-cell expression distribution in pan-cancer

To further explore the cellular expression distribution of PPIA in pan-cancer, we analyzed 14 scRNA datasets. The “Seurat (Version 5.2.1)” package was used to filter the datasets with specific criteria as follows: number of genes > 300, sequencing depth < 30,000, and mitochondrial ratio < 20%. NormalizeData, FindVariableFeatures, ScaleData, and RunPCA functions were applied for data normalization, identification of highly variable genes, and dimension reduction. Subsequently, the harmony method in the IntegrateLayers function was used to remove batch effects between samples and integrate the data, and Uniform Manifold Approximation and Projection (UMAP) was employed for further dimension reduction. FindNeighbors and FindClusters functions were utilized for cell clustering analysis. Finally, cell type annotation for each unit was performed based on marker gene expression, including T cells (CD3D, CD3E, TRAC, CD8A), NK cells (KLRD1, GNLY, NKG7), B cells (CD19, CD79A, MS4A1), macrophages (CD68, CD163, CD14), DC cells (CLEC9A, XCR1, CCL22, CD1E), mast cells (KIT, TPSAB1, TPSB2), fibroblast cells (COL1A1, FGF7, MME), endothelial cells (VWF, PECAM1, PLPP1), and malignant epithelial cells (EPCAM, KRT18, KRT19).

Tumor immune microenvironment and genetic alteration analysis

Three algorithms were used to assess the immune microenvironment score of pan-cancer samples: the ssGSEA algorithm provided by the R package “GSVA”42, which calculates immune infiltration based on markers of 24 types of immune cells43; the CIBERSORTx website (https://cibersortx.stanford.edu/) offers markers for 22 types of immune cells and uses the CIBERSORT algorithm script to calculate immune infiltration44; and the ESTIMATE algorithm provided by the R package “estimate” for calculating immune infiltration45. PPIA expression data for each sample were matched with immune infiltration scores, followed by correlation analysis. The ‘maftools’ package in R was used to evaluate the tumor mutation burden (TMB) score, while the microsatellite instability (MSI) score was obtained from previous reports46. Expression data for immune-related genes (including major histocompatibility complex (MHC) and immune checkpoints) and RNA modification-related genes (including m1A, m5C, and m6A) were extracted. Subsequently, the correlations between PPIA expression levels and TMB, MSI, and the aforementioned related genes were assessed.

Drug sensitivity analysis

Cancer samples were divided into high expression and low expression groups based on the median expression level of PPIA. Differential analysis was conducted to identify differentially expressed genes (FDR < 0.05, |LogFC|>1). Subsequently, the top 100 most significantly upregulated mRNA genes from each cancer type were uploaded to the SPIED3 platform (http://92.205.225.222/HGNC-SPIED3-QF.py). Using data from 1,309 compounds in the CMAP database as the reference dataset, compounds with negative correlation in the analysis indicated potential reversal of the expression trend of differential genes. Expression data for anti-PD-L1 therapy were obtained from the IMvigor210 dataset47, and the correlation between PPIA expression and immunotherapy response was analyzed. NCI-60 drug scoring data and RNA-seq expression data were downloaded from the CellMiner database (https://discover.nci.nih.gov/cellminer/home.do)48, and drugs approved by the FDA and clinically verified were filtered out to explore their correlation with PPIA.

Pan-cancer enrichment analysis

Cancer-related signature gene set files were downloaded from the Molecular Signatures Database (https://www.gsea-msigdb.org/gsea/msigdb)49. Based on the differential analysis results between the high and low PPIA expression groups, a GSEA enrichment analysis was conducted using the “clusterProfiler” package. In this study, the KEGG gene set and Hallmark gene set were selected as references, and the normalized enrichment score (NES) and false discovery rate (FDR) were calculated.

Expression validation in hepatocellular carcinoma cell lines

The cell lines HepG2, HUH-7, and THLE2 were acquired from ProCell (Wuhan, China), while MHCC-97 H was sourced from Fenghui Biological (Hunan, China). The human HCC cell lines, including HepG2, HUH-7, and MHCC-97 H, were cultured using high-glucose DMEM (Gibco, USA). In contrast, THLE2, representing normal liver cells, were grown in BEGM (Thermo fisher scientific, USA). Both types of culture media were enhanced with 10% fetal bovine serum and kept in a 5% CO2 incubator (Heal force, China) at a stable temperature of 37 °C.

Proteins were extracted from cells using RIPA buffer (Solarbio, China), and their concentrations were determined with a BCA kit (Solarbio, China). The expression of PPIA (1:1000, A0993, ABclonal, China) across these cell lines was analyzed through western blotting (WB).

To assess the mRNA expression of PPIA in the above cells, RT-qPCR was conducted. Total RNA was extracted using the Promega kit and quantified. cDNA was synthesized with a reverse transcription kit. The PCR protocol included 40 cycles with the following conditions: pre-denaturation at 95°C for 2 minutes, denaturation at 95°C for 30 seconds, annealing at 60°C for 30 seconds, and extension at 72°C for 30 seconds. GAPDH served as an internal control, and the relative expression level of PPIA was calculated using the 2−∆∆Ct method. Primer sequences (Bioengineering Co., Ltd, Shanghai): PPIA, forward 5’-TACAGTGCTTGCTGGCAGTT-3’ and reverse 5’-TCCGCCACCTCTAGGATAGG-3’; GAPDH, forward 5’-TGCACCACCAACTGCTTAGC-3’ and reverse 5’- GGCATGGACTGTGGTCATGAG-3’.

Statistical analysis

Correlation analyses between two variables were performed using the Spearman method. Differences between two groups were analyzed using the Wilcoxon rank-sum test. For comparisons among more than two groups, the Kruskal-Wallis test was employed. Kaplan-Meier (K-M) curves and the Cox proportional hazards regression model were used for survival analysis. Statistical analyses and graphical plotting were conducted using R software (version 4.3.2). A P-value of less than 0.05 was considered statistically significant. A p-value of less than 0.05 was deemed statistically significant.

Results

Abnormal expression of PPIA in pan-cancer

We used expression data from TCGA and GTEx to comprehensively analyze the differential expression of PPIA in various tumor tissues and normal tissues. Overall, PPIA is dysregulated in most tumors, primarily with upregulated expression, including BLCA, BRCA, CESC, CHOL, COAD, ESCA, GBM, HNSC, KICH, KIRC, KIRP, LIHC, LUAD, LUSC, PRAD, READ, STAD, THCA, and UCEC, while it is downregulated in LAML (Fig. 2A-B). Given the widespread dysregulation of PPIA in pan-cancer, we further analyzed the expression differences of different PPIA transcripts in normal and tumor tissues. The results show that protein-coding transcripts are upregulated in various cancers, while transcript expression is downregulated in LAML (Fig. 2C). Notably, a nonsense-mediated decay (NMD)-related transcript of the PPIA gene (ENST00000415933) is upregulated in a variety of tumor tissues, which may be associated with NMD escape, gene mutations, and abnormal splicing50. To enhance the reliability of our study, we conducted expression validation in liver cancer. Validation of PPIA upregulation in liver cancer was carried out in 10 independent cohorts from the GEO database (Fig. 2D). We conducted WB and qPCR validation in one normal liver cell line and three liver cancer cell lines, and the results indicated significant upregulation of PPIA in liver cancer (Fig. 2E-F). Further analysis using the DepMap database examined the impact of PPIA knockdown on the viability of pan-cancer cell lines, revealing dependency scores below 0.5 in most cancer cell lines, suggesting that PPIA knockdown can inhibit the viability of these cancer cell lines (Figure S1).

Fig. 2
figure 2

Abnormal expression of PPIA in pan-cancer. (A,B) Box plots compare the expression levels of PPIA mRNA in tumor tissues (yellow) and normal tissues (blue) from the TCGA and GTEx databases. The statistical method used is the Wilcoxon rank sum test. (C) The heatmap compares the expression levels of PPIA transcripts in tumor and normal tissues from the combined TCGA and GTEx databases. Blue, orange, and white squares indicate downregulation, upregulation, and no difference in tumor tissues, respectively. The statistical method used is the Wilcoxon rank sum test. (D) Box plots compare the expression levels of PPIA mRNA in liver cancer tissues (yellow) and normal tissues (blue) from the GEO database. The statistical method used is the Wilcoxon rank sum test. (E) WB validates the protein expression level of PPIA in liver cancer cell lines. (F) qPCR validates the relative expression level of PPIA in liver cancer cell lines. Symbols *, **, and *** denote P < 0.05, P < 0.01, and P < 0.001, respectively.

Distribution of PPIA expression in single-cell transcriptome across pan-cancer

After conducting initial analysis on the expression of PPIA across pan-cancer, we found that it exhibits dysregulated states in most tumors, predominantly manifesting as upregulated expression. To gain deeper insights into the cell-specific expression patterns of PPIA in different cancers, we further analyzed single-cell transcriptome data. The results indicated that PPIA is widely distributed across different cell groups in various cancer types, yet its expression patterns vary among cancer cells (Fig. 3). For instance, in COAD, ESCA, and LUAD, PPIA mainly distributes in epithelial cells and keratinocytes, while in KIRC, LIHC, and LAML, its expression is potentially concentrated in immune cell populations. This differentiated expression distribution suggests that PPIA may play diverse roles in the tumor microenvironment and might be associated with specific biological behaviors of cancers.

Fig. 3
figure 3

Distribution of PPIA expression in single-cell transcriptome across pan-cancer. The size of the bubbles represents the proportion of PPIA expression in cells, while the depth of color indicates the relative expression level of PPIA in the cells.

Prognostic and diagnostic value of PPIA in pan-cancer

To explore the prognostic role of PPIA in various cancers, we used univariate Cox regression and KM survival analysis to assess the correlation of PPIA with four prognostic outcomes, including OS, DSS, DFI, and PFI. The results showed that high expression of PPIA is associated with poor prognosis in ACC, BLCA, BRCA, HNSC, KICH, LGG, LIHC, LUAD, MESO, PAAD, PRAD, SARC, SKCM, TGCT, and UVM, whereas it is associated with better prognosis in DLBC and THYM (Fig. 4A). Forest plots and KM survival curves detail the prognostic information (Fig. S2-3). To better understand the association between PPIA and clinical characteristics of cancer patients, we analyzed the expression differences of PPIA across different ages, TMN stages, and tumor stages. The results indicated that PPIA is correlated with clinical characteristics in some cancers. For instance, in ACC, KICH, LIHC, LUAD, and TGCT, higher stage corresponds to higher PPIA expression, whereas in KIRC and SKCM, the opposite is true (Fig. 4B). Diagnostic ROC analysis based on expression data from TCGA combined with GTEx cohorts showed that PPIA had an AUC greater than 0.9 in various cancers, indicating high diagnostic efficacy (Fig. 4C). These results suggest that PPIA is a potential molecular marker for predicting cancer prognosis and diagnosis.

Fig. 4
figure 4

Prognostic and Diagnostic Value of PPIA in Pan-Cancer. (1) Univariate Cox regression analysis and KM survival analysis were used to explore the correlation between PPIA expression and OS, DSS, DFI, and PFI. Red squares indicate PPIA as a prognostic risk factor, while blue squares indicate PPIA as a prognostic beneficial factor. (2) The heatmap shows the correlation analysis between PPIA expression and clinical characteristics. Blue, red, white, and gray squares represent negative correlation, positive correlation, no correlation, and missing data, respectively. (3) The forest plot presents the ROC analysis of PPIA expression for cancer diagnostic efficacy.

Immune characteristics of PPIA in the pan-cancer microenvironment

The development of malignant tumors is closely related to the immune microenvironment in which tumor cells reside. To further analyze the impact of PPIA on the immune status across pan-cancer, we used the “CIBERSORT,” “ssGSEA,” and “ESTIMATE” algorithms to assess the correlation between immune cell infiltration and PPIA. Overall, PPIA is closely related to the immune cell infiltration status of most cancers (Fig. 5A-C). PPIA is negatively correlated with most immune cells, such as B cells, T cells, and Th17 cells, but positively correlated with these cells in THYM. Certain cell subgroups are positively correlated in most cancers, such as Th2 cells, activated CD4 memory T cells, and Macrophages M0. The ESTIMATE algorithm showed that PPIA expression is negatively correlated with the immune score, estimate score, and stromal score in most cancers. Previous studies confirmed that the CypA protein encoded by PPIA can regulate the T cell-specific tyrosine kinase Itk for interleukin-2 and mediate intracellular signaling pathways for T cell activation51. When mutation of Itk disrupts binding with PPIA, it specifically increases the secretion of Th2 cell cytokines52.

Furthermore, there is also a correlation between PPIA expression and MHC, as well as immune checkpoints, although these correlations vary among different cancers and molecules (Figure S4). For example, it is positively correlated with MHC molecules in OV and SARC, and negatively correlated in ESCA, SKCM, and THYM. This suggests that PPIA may be related to tumor immune evasion mechanisms. These findings reveal dynamic interactions between PPIA expression and specific components of the tumor microenvironment in different cancer types. For instance, in THYM, PPIA expression is positively correlated with certain anti-tumor immune cells and negatively correlated with MHC molecules, which may be related to the complexity and dynamics of the tumor immune microenvironment. Studies have shown that MHC molecules do not always positively select T cells in the thymus53. Additionally, some thymic T cells (such as γδT cells) possess natural anti-tumor activity, with development dependent on specific signals in the thymic cortex, potentially independent of antigen presentation mediated by classical MHC molecules54.

Fig. 5
figure 5

Immune Characteristics of PPIA in the Tumor Microenvironment. (A,C) The ssGSEA algorithm (A), CIBERSORT algorithm (B), and ESTIMATE algorithm (C) were used to assess the correlation between immune cell infiltration and PPIA mRNA expression. Red squares indicate positive correlation, while blue squares indicate negative correlation, with deeper colors representing stronger correlations. Symbols *, **, and *** denote P < 0.05, P < 0.01, and P < 0.001, respectively.

Correlation between PPIA and pan-cancer genetic alterations

Gene mutations play a crucial role in cancer regulation. Additionally, TMB and MSI influence the response of cancers to immunotherapy. Tumor mutational burden assessment shows that PPIA expression is positively correlated with TMB in cancers such as THYM, KICH, and UCS, while it is inversely correlated in COAD (Figure S5). Microsatellite instability analysis also shows some associations with PPIA, such as in cancers like HNSC, KIRC, and LIHC (Figure S5), suggesting that PPIA may be related to DNA repair mechanisms or genomic stability and may influence the occurrence and development of tumors.

RNA modifications can affect mRNA stability, translation efficiency, splicing, and localization. In various cancers, the abnormal expression of RNA modification enzymes is closely associated with tumor occurrence, development, and metastasis, with some RNA modification enzymes potentially serving as cancer biomarkers. Our analysis indicates that the expression of PPIA is correlated with the expression of several RNA modification enzymes (including m1A, m5C, m6A) (Fig. 6A). For instance, in the m5C modification of most cancer types, NOP2, NSUN5, ALYREF, and YBX1, which perform “writer” and “reader” functions, are positively correlated with PPIA, while TET1, TET2, and TET3, which perform “eraser” functions, are negatively correlated with PPIA. This suggests that PPIA expression is positively correlated with m5C modification in pan-cancer.

DNA methylation is also a common epigenetic modification. Generally, methylation site levels in genic regions are positively correlated with gene expression, while increased methylation levels in gene promoter regions usually inhibit the transcription process, thereby affecting gene expression. Our study discovered 16 methylation sites correlated with PPIA expression (Fig. 6B). Interestingly, some methylation sites exhibit expression trends opposite to what is expected, such as the cg06436997 site in the gene transcription start region being positively correlated with PPIA. This might be related to the complex regulatory mechanisms of methylation, such as transcription factor binding preferences, uneven CpG island methylation, and the methylation paradox phenomenon, allowing genes to be effectively transcribed and expressed even under high methylation conditions.

Fig. 6
figure 6

Analysis of Genetic Alterations. (A) The heatmap illustrates the correlation between PPIA mRNA expression and mRNA expression of RNA modification genes (including m1A, m5C, m6A). Spearman correlation analysis was used for statistical evaluation. (B) The heatmap shows the correlation between PPIA mRNA expression and DNA methylation sites. Spearman correlation analysis was employed for statistical evaluation. Symbols *, **, and *** denote P < 0.05, P < 0.01, and P < 0.001, respectively.

Analysis of potential drugs targeting PPIA

To explore potential drugs that can target PPIA, we collected the top 100 upregulated genes with the most significant differences in each cancer from pan-cancer differential analysis and performed correlation analysis with perturbation gene expression data from the CMAP database. We selected the top 30 negatively correlated compounds, which may reverse the expression trends of differential genes in cancer (Fig. 7A). We also evaluated the relationship between PPIA and the response to anti-PD-L1 immunotherapy, finding that the group with a response to immunotherapy had higher PPIA expression levels compared to the non-responsive group (Fig. 7B). Additionally, we screened FDA-approved and clinically validated drugs from the CellMiner database for analysis, and the results showed that the sensitivity of 16 anti-tumor drugs was positively correlated with PPIA, indicating that patients with higher PPIA expression are more sensitive to these drugs (Figure S6).

Fig. 7
figure 7

Analysis of Potential Drugs Targeting PPIA. (A) Top 30 compounds negatively correlated with PPIA expression in the CMAP database. (B) Differences in PPIA expression among patients in the IMvigor210 cohort who received anti-PD-L1 treatment, across different treatment response conditions.

Pan-cancer enrichment analysis

To further explore the potential biological functions of PPIA’s extensive involvement in cancer occurrence and development, we divided cancer samples into high and low PPIA expression groups, then performed differential gene expression analysis and GSEA analysis. In KEGG pathway enrichment analysis, various cancers are enriched in genome stability-related pathways such as DNA replication, Mismatch repair, and Base excision repair and are in an activated state. Meanwhile, enrichment in cancer-related pathways such as Calcium signaling pathway, cAMP signaling pathway, and Cytokine-cytokine receptor interaction shows a suppressed state (Fig. 8A). In the Hallmark analysis, cell cycle-related pathways are significantly activated in various cancers, including G2M_CHECKPOINT and E2F_TARGETS. The activation of genome stability-related pathways may collectively contribute to uncontrolled proliferation in tumor cells, including DNA_REPAIR and MITOTIC_SPINDLE. Additionally, several cancer-related pathways are enriched, including KRAS_SIGNALING, MYC_TARGETS, INFLAMMATORY RESPONSE, and EPITHELIAL MESENCHYMAL TRANSITION (Fig. 8B). Previous studies have confirmed that PPIA can induce the transition of tumor cells from the G1 phase to the S phase to promote cell proliferation55. It can also regulate cell cycle-related proteins to enhance tumor cell proliferation12.

Fig. 8
figure 8

Enrichment Analysis in pan-cancer. (A) Based on the differential analysis results between high and low PPIA expression groups across 33 types of cancer, GSEA enrichment analysis was performed using the KEGG pathway gene set as a reference. (B) Similarly, GSEA enrichment analysis was conducted using the Hallmark gene set as a reference, based on the differential analysis results between high and low PPIA expression groups across 33 types of cancer. Red squares indicate pathways upregulated in the high PPIA expression group, blue squares indicate pathways downregulated, and white squares indicate pathways not enriched.

Discussion

PPIA, a key molecule involved in diverse biological functions such as molecular chaperoning, protein folding/transport, immune regulation, and cell signaling, is strongly implicated in cancer development and progression. Although previous studies established PPIA dysregulation in breast, esophageal, and pancreatic cancers, its broader roles and molecular characteristics across pan-cancer remained unclear. Our combined TCGA and GTEx data analysis revealed widespread PPIA dysregulation in most cancer types. Excluding MESO and UVM (lacking normal tissue data), only KICH and PCPG showed no significant expression differences. Significantly, PPIA was upregulated in the vast majority of cancers, with downregulation observed only in LAML. These findings strongly suggest a potential oncogenic role for PPIA across cancers. We further observed differential expression among PPIA transcripts. Protein-coding transcripts were frequently upregulated in multiple cancers. Notably, one NMD transcript of PPIA was upregulated in most cancers, possibly indicating NMD escape. Given that the hypoxic tumor microenvironment can inhibit NMD56, this finding warrants further investigation. Single-cell analysis confirmed PPIA’s broad expression across cell types, including macrophages. Tumor-associated macrophages (TAMs) are key drivers of pro-tumorigenic inflammation and progression57, potentially linking PPIA expression within these cells to its oncogenic role.

The tumor microenvironment (TME) comprises diverse components58. As research shows, the activation states and spatial distribution of immune cells within the TME significantly impact treatment efficacy59]– [60. Therefore, identifying predictive biomarkers and understanding immune infiltration status are essential for personalized immunotherapy. Our study reveals that PPIA expression correlates significantly with immune cell infiltration in pan-cancer analyses. Notably, PPIA predominantly showed negative correlations with anti-tumor immune cells like CD8 + T cells and NK cells. However, positive correlations were observed with pro-tumorigenic cells such as Th2 cells61. Intriguingly, certain cancers exhibited distinct patterns; for example, PPIA positively correlated with CD8 + T cells in THYM but negatively with Th2 cells in THCA. This variability likely stems from TME complexity and tumor heterogeneity, where specific immune contexts can alter cell function—recent studies even suggest Th2 cells may promote tumor growth62. Furthermore, PPIA expression associated with key immune regulatory genes, including immunosuppressive immune checkpoints (exploited by tumors to evade immunity and key therapeutic targets)63 and MHC molecules (core to antigen presentation and immune activation, enhancing checkpoint inhibitor efficacy)64. Collectively, our findings highlight PPIA’s widespread relevance in the pan-cancer immune landscape, advancing understanding of its role in cancer immune dysregulation.

Genetic alterations, including mutations, rearrangements, amplifications, and deletions, drive cancer initiation, progression, and treatment response by disrupting normal cellular controls65]– [66. In immunotherapy, TMB and MSI are key predictors of response to immune checkpoint inhibitors, with higher levels generally indicating greater immunogenicity and favorable outcomes67]– [68. Significantly, our analysis links PPIA expression to TMB in 18 cancer types and MSI in 14 types. These associations highlight PPIA’s connection to the tumor microenvironment and its potential as an immunotherapy biomarker in specific cancers. Beyond genetics, epigenetic mechanisms—notably RNA modifications—profoundly influence cancer by regulating gene expression and cellular functions, presenting promising therapeutic targets69]– [70. PPIA demonstrates significant correlations with multiple RNA modification regulators (m6A, m5C, m1A) across cancers. Intriguingly, PPIA frequently correlates positively with m5C “writers” (e.g., NOP2, NSUN5) and “readers” (e.g., ALYREF, YBX1), but negatively with “erasers” (TET1-3). This pattern suggests PPIA may cooperate with writers/readers to modulate mRNA stability, translation, or degradation, thereby influencing tumor progression. PPIA is also linked to DNA methylation—another key epigenetic alteration71. Promoter hypermethylation often silences genes72, while gene body methylation can promote activation/maintenance73.

To elucidate PPIA’s functional mechanisms across cancers, we performed GSEA. We found that PPIA is broadly associated with pathways related to genomic stability and the cell cycle. These results indicate that PPIA is associated with numerous oncogenic mechanisms: dysregulated proliferation, persistent DNA damage stress, and adaptive repair activation, collectively fueling tumor progression and therapy resistance74. Enrichment of E2F targets and G2/M checkpoint activation are hallmarks of aberrant cell cycle control in cancer. Normally, E2F activity is tightly restrained by Rb and confined to G1/S transition. However, Rb pathway inactivation in tumors leads to constitutive E2F activation, driving unscheduled S-phase gene expression and uncontrolled proliferation75. While DNA damage typically activates the G2/M checkpoint to halt division, this function is frequently impaired in TP53-mutant tumors. Consequently, cells enter mitosis with unrepaired DNA, exacerbating genomic instability76. This paradoxical activation yet functional deficiency of the checkpoint may represent adaptive tumor evolution. Upregulated DNA replication pathways indicate heightened replication stress, commonly induced by oncogenes (e.g., MYC, RAS—whose targets were also enriched). Oncogene-driven proliferation stalls replication forks and causes double-strand breaks, triggering ATM/ATR-mediated repair77. Persistent stress, however, can overwhelm repair capacity, leading to chromosomal aberrations. The concurrent upregulation of DNA repair mechanisms suggests tumors counteract chronic DNA damage. While potentially delaying early transformation, these repair pathways may be co-opted in advanced cancers to sustain genomic integrity and survival78.

While our study elucidates the role of PPIA in pan-cancer from multiple dimensions and cohorts, certain limitations should be acknowledged. First, the research primarily relies on data obtained from public databases and statistical analyses. While integrating multiple cohorts strengthens generalizability, it inevitably introduces heterogeneity stemming from diverse sources. Crucially, differences in sample processing protocols, sequencing platforms, and inherent biases within individual databases could lead to systematic variations in gene expression quantification and other molecular measurements. Consequently, the results derived from these bioinformatics analyses should be interpreted with appropriate caution regarding this inherent technical and batch variation. Secondly, correlation analysis cannot demonstrate causality, and the impact of PPIA on tumor progression lacks sufficient experimental validation. However, the related findings indicate directions for future research. In summary, our research demonstrates that PPIA is significantly upregulated in most cancers and is closely associated with cancer prognosis, the immune microenvironment, and epigenetic modifications. These findings warrant further in-depth exploration in future studies.