Abstract
Cervical squamous cell carcinoma and endocervical adenocarcinoma (CESC) tumorigenesis involves a combination of multiple genetic alteration processes. Constructing a survival-associated competing endogenous RNA (ceRNA) network and a multi-mRNA-based prognostic signature model can help us better understand the complexity and genetic characteristics of CESC. In this study, the RNA-seq data and clinical information of CESC patients were downloaded from The Cancer Genome Atlas. Differentially expressed mRNAs, lncRNAs and miRNAs were identified with the edgeR R package. A four-mRNA prognostic signature was developed by multivariate Cox regression analysis. Kaplan–Meier survival with the log-rank tests was performed to assess survival rates. The relationships between overall survival (OS) and clinical parameters were evaluated by Cox regression analysis. A survival-associated ceRNA network was constructed with the multiMiR package and miRcode database. Kyoto encyclopedia of genes and genomes (KEGG) analysis and gene ontology analyses were used to identify the functional role of the ceRNA network in the prognosis of CESC. A total of 298 differentially expressed mRNAs, 8 miRNAs, and 29 lncRNAs were significantly associated with the prognosis of CESC. A prognostic signature model based on 4 mRNAs (OPN3, DAAM2, HENMT1, and CAVIN3) was developed, and the prognostic ability of this signature was indicated by the AUC of 0.726. Patients in the high-risk group exhibited significantly worse OS. The KEGG pathways, TGF-β and Cell adhesion molecules, were significantly enriched. In this study, a CESC-associated ceRNA network was constructed, and a multi-mRNA-based prognostic model for CESC was developed based on the ceRNA network, providing a new perspective for cancer pathogenesis research.
Similar content being viewed by others
Introduction
Cervical squamous cell carcinoma and endocervical adenocarcinoma (CESC) is one of the most common malignancies worldwide, with more than 570,000 new cases and 274,000 deaths per year1,2. The long-term survival rates of patients with early-stage disease have been greatly improved in recent years. However, the 5-year survival rate of patients with recurrent or metastatic disease remains less than 16.8%. Currently, the prediction of CESC prognosis mainly depends on the tumor-node-metastasis (TNM) stage. However, the TNM stage is based on anatomical information and does not reflect the biological heterogeneity of CESC. Hence, it is urgent to find novel biomarkers based on transcriptomics data that can act as prognostic indicators to guide precise individualized treatment.
Recently, the competing endogenous RNA (ceRNA) hypothesis has provided novel insights into the cancer research. CeRNA links the function of message RNA (mRNA) with long-noncoding RNA (lncRNA) and microRNA (miRNA)3. A ceRNA is a transcript targeted by a miRNA that sequesters the activity of the bound miRNA, effectively de-repressing other targets of that miRNA4. MiRNAs are small (20–22 nucleotides long) noncoding RNA that have been recognized as important negative regulators of mRNA translation5. LncRNAs are larger noncoding RNAs then miRNAs with more than 200 nucleotides6. The ceRNA hypothesis was proposed as a unique pathway for regulating the expression of RNAs7. The ceRNA hypothesis states that miRNAs act as the hub genes that suppress mRNA translation but lncRNAs compete for binding to one or more sites in miRNAs to suppress the function of miRNAs and participate in post-transcriptional control8.
Previous studies reported that the ceRNA networks might act as the biomarkers for prognosis in CESC. Song et al.9 constructed a CESC-associated ceRNA network which composed of 50 lncRNAs, 81 mRNAs and 18 miRNAs, and found that several RNAs were associated with the prognosis. Chen et al.10 constructed a CESC-associated ceRNA network composed of 17 lncRNAs, 5 miRNAs, and 7 mRNAs by weighted correlation network analysis (WGCNA), and found that E2F1 and hsa-mir-204 were related with worse prognosis. Ding et al.11 constructed a CESC-associated ceRNA network and revealed that ADGRF4, ANXA8L1, HCAR3, IRF6 and PDE2A were associated with the prognosis. However, those studies did not construct a prognostic model based on the prognostic RNAs for CESC. CESC is a heterogeneous disease with multiple gene alterations and interactions. Hence, it is of great significance to construct a CESC-associated ceRNA network and develop a multigene prognostic model based on the ceRNA network.
As shown in the workflow diagram (Fig. 1), we first downloaded the clinical information and RNA-seq data of CESC patients from The Cancer Genome Atlas (https://portal.gdc.cancer.gov) (TCGA) database (Fig. 1A). Then, we performed differential expression analysis between normal and tumor samples (Fig. 1B). Then, we analyzed the relationships between differentially expressed RNAs and overall survival (OS) (Fig. 1C). Next, we utilized Kyoto encyclopedia of genes and genomes (KEGG) pathway enrichment analysis and gene ontology (GO) functional annotation to further investigate the function of the survival-associated RNAs (Fig. 1D). In addition, we constructed a ceRNA network based on those prognostic-associated RNAs (Fig. 1E). Afterward, four prognosis-associated mRNAs (OPN3, DAAM2, HENMT1, and CAVIN3) were identified through multivariate Cox regression (Fig. 1F). Finally, we developed a prognostic signature for CESC based on multiple mRNAs (Fig. 1G). This study aimed to provide a novel biomarker to guide personalized medicine and to facilitate an understanding of the molecular mechanisms for CESC.
The workflow of the present study. (A) First, we downloaded the clinical information and RNA-seq data of CESC patients from The Cancer Genome Atlas (https://portal.gdc.cancer.gov) (TCGA) database. (B) Second, we performed differential expression analysis between normal and tumor samples. (C) Then, we used univariate Cox regression analysis to identify the prognosis-associated mRNAs, miRNAs, and lncRNAs. (D) Then, functional annotation and pathway enrichment were performed to investigate the identified prognosis-associated mRNAs. (E) In addition, we constructed a ceRNA network based on these prognosis-associated RNAs. (F) Next, multivariate Cox regression analysis was used to screen the prognostic mRNAs for development of the prognostic model construction. (G) Finally, the prognostic model was developed using the 4 mRNAs selected by screening.
Results
Differentially expressed RNAs analysis
In total, we identified 1398 downregulated and 1495 upregulated mRNAs (Fig. 2A), 5 downregulated and 15 upregulated miRNAs (Fig. 2B), and 493 downregulated and 658 upregulated lncRNAs (Fig. 2C), as shown in more detail in an additional file. A total of 19,545 mRNAs, 2713 miRNAs, and 13,977 lncRNAs were extracted from transcriptome data. The overall differential expression landscapes of mRNAs, miRNAs, and lncRNAs between 3 adjacent normal samples and 240 tumor samples are presented in Fig. 2D–F, respectively (see Additional file 1).
Differentially expressed RNAs between CESC and adjacent normal tissues. Volcano plot of the differentially expressed (A) mRNAs, (B) miRNAs and (C) lncRNAs. Red indicates high expression, and green indicates low expression. Black indicates no significant difference in expression. Heat map of the overall expression landscape of the differentially expressed (D) mRNAs, (E) miRNAs, and (F) lncRNAs.
Please confirm the section headings are correctly identified.yes
Survival-associated RNAs
The relationships between the differentially expressed lncRNAs, miRNAs, and mRNAs and OS were evaluated in 240 CESC patients. Univariate Cox regression analysis was used to identify overall survival related RNAs (prognostic RNAs). Finally, we found that 298 mRNAs, 8 miRNAs, and 129 lncRNAs were significantly associated with OS. The top 15 mRNAs, miRNAs, and lncRNAs ranked by p value are shown in Fig. 3A–C, respectively, as shown in more detail in an additional file (see Additional file 2).
Functional annotation and pathway enrichment analysis
GO functional annotation and KEGG pathway enrichment analysis were utilized to investigate the biological functions of the prognostic mRNAs. GO functional annotations included biological process (BP), cellular component (CC), and molecular function (MF) annotations. The top 10 GO terms and KEGG pathways are listed in Fig. 4. GO BP analysis showed that the targeted mRNAs were significantly enriched in the terms epithelial cell proliferation and endothelial development (Fig. 4A). CC analysis revealed enrichment in the terms ruffle and catenin complex (Fig. 4B). In the MF analysis, the targeted mRNAs were significantly enriched in the terms BMP binding receptor and fatting acid synthase activity (Fig. 4C). KEGG pathway enrichment analysis revealed that the targeted genes were significantly enriched in the TGF-β and cell adhesion molecules signaling pathways (Fig. 4D). These results are shown in more detail in an additional file (see Additional file 3).
Survival-associated ceRNA network
A comprehensive survival-associated lncRNA-miRNA-mRNA ceRNA network was constructed by combining the lncRNA-miRNA interactions with the miRNA-mRNA interactions. The ceRNA network contained 24 lncRNAs, 6 miRNAs, and 34 mRNAs (Table S1, Fig. 5).
Predictive model for overall survival
To develop a multi-mRNA-based prognostic signature model for CESC, 8 mRNAs with P < 0.0001 (unadjusted) in the univariate Cox regression analysis were included in multivariate Cox regression analysis. Finally, 4 mRNAs (OPN3, DAAM2, HENMT1, and CAVIN3) were identified. Information on those mRNAs is shown in Table 1. In addition, we compared the transcript and protein levels of above 4 genes between adjacent normal and CESC tissues using a t test and immunohistochemistry (IHC). The transcript levels of OPN3 (P = 0.013, unadjusted) and HENMT1 (P = 0.0095, unadjusted) were significantly upregulated but the transcript levels of DAAM2 (P < 0.0001, unadjusted) and CAVIN3 (P < 0.0001, unadjusted) were significantly decreased in CESC tissues compared with adjacent normal tissues (Fig. 6). On the other hand, the Human Protein Atlas database was used to evaluate expression levels of the above 4 proteins in adjacent normal and CESC tissues as assessed by IHC. Based on the immunohistochemical staining images, the protein expression levels of OPN3 and HENMT1 were higher but the protein expression level of CAVIN3 was lower in CESC samples than in normal samples, which was consistent with the transcriptomics data (Fig. 7).
Comparison of the transcript levels of the 4 key mRNAs between CESC and normal tissues. The transcript levels of (A) OPN3 and (C) HENMT1 were significantly increased but the transcript levels of (B) DAAM2 and (D) CAVIN3 were significantly decreased in CESC tissues compared with normal tissues. *P < 0.05; **P < 0.01; ***P < 0.001; ****P < 0.0001 (unadjusted).
A multi-mRNA-based prognostic signature RS model was developed based on the above 4 mRNAs. The RS was calculated by the following equation: RS = (ExpOPN3 × βOPN3) + (ExpDAAM2 × βDAAM2) + (ExpHENMT1 × βHENMT1) + (ExpCAVIN3 × βCAVIN3). The “Exp” value represents the expression level and the “β” value represents the regression coefficient derived from the multivariate Cox regression model.
The RS of each patient was calculated according to the above equation. Then, CESC patients were divided into a low-risk group and a high-risk group with the median RS as the cut-off value. A t test was used to compare the expression levels of the 4 mRNAs between the low-risk group and the high-risk group. The expression levels of OPN3 (P < 0.0001, unadjusted), DAAM2 (P < 0.0001, unadjusted), and CAVIN3 (P < 0.0001, unadjusted) were higher but the expression levels of HENMT1 (P < 0.0001, unadjusted) were lower in the high-risk group than those in the low-risk group (Fig. 8). Figure 9 shows the performance of the mRNA-based model. Figure 9A shows the ranking of patients according to the RS. The scatter plot shows that the OS of CESC patients decreased along with the increasing RS (Fig. 9B). The heat map shows that the expression level of HENMT1 decreased but the expression levels of OPN3, DAAM2 and CAVIN3 increased with increasing RS (Fig. 9C).
Comparison of the transcript levels of the 4 key mRNAs between the low-risk group and high-risk group. The expression levels of (A) OPN3, (B) DAAM2, and (D) CAVIN3 were higher but the expression level of (C) HENMT1 was lower in the high-risk group than those in the low-risk group. *P < 0.05; **P < 0.01; ***P < 0.001; ****P < 0.0001 (unadjusted).
Performance of the prognostic model in classifying patients into the low-risk and high-risk groups. (A) Risk score distribution based on the four-mRNA signature. (B) Risk score distribution in the groups stratified by survival status based on the four-mRNA signature. (C) Heat map showing the expression of the 4 key mRNAs in low-risk and high-risk groups. The color from blue to red shows a trend from low expression to high expression.
Kaplan–Meier survival analysis with the log-rank tests was performed to identify the relationships between different RS groups and OS, and the ROC curves were used to evaluate the sensitivity and specificity of the model. CESC patients in the high-risk group exhibited significantly shorter OS times (P < 0.0001, unadjusted) (Fig. 10A). The AUC of the RS (0.726) revealed that the model showed prognostic assessment ability (Fig. 10B).
Several clinical parameters were found to have some prognostic value by univariate analysis, for example, M stage (P = 0.02, unadjusted), N stage (P = 0.01, unadjusted), T stage (P = 0.001), pathological stage (P = 0.001, unadjusted) and the signature RS (P = 1.5e − 6, unadjusted); however, only the signature RS remained statistically significant with the confirmation by multivariate analysis (HR = 6.35, P = 0.01, unadjusted; Table 2).
Discussion
In this study, distinct mRNAs, lncRNAs, and miRNAs were identified to further understanding of the molecular events related to CESC prognosis. In addition, the constructed prognostic ceRNA network provided new insights into the prognosis of CESC.
CESC tumorigenesis involves a combination of multiple genetic alteration processes. However, almost all studies to date have focused on a single ‘driver’ gene or a single cluster of driver genes of CESC. Daniel et al.12 reported that Keratin-17 is a prognostic biomarker in CESC. Li et al.13 reported that FAM83A is a potential biomarker regulated by miR-20, which promotes the development of CESC through the PI3K/AKT/mTOR signaling pathway. To date, no single pivotal driver gene or gene cluster has been reported to be superior for evaluating the prognosis of CESC. Moreover, TNM staging, used as a major prognostic indicator, is based on anatomical information and does not reflect the biological heterogeneity of CESC. Hence, it is of great importance to construct a prognostic ceRNA network and develop a multi-mRNA-based model based on survival-associated biomarkers to predict the prognosis of CESC.
Through bioinformatics analysis, we found that 298 mRNAs, 8 miRNAs, and 129 lncRNAs were associated with the prognosis of CESC. In addition, we constructed a CESC-associated ceRNA network that contained 24 lncRNAs, 6 miRNAs, and 34 mRNAs from those prognostic RNAs. The ceRNA network can help us better understand the pathogenesis and prognosis of CESC from the multidimensional perspective of gene expression. We even developed a prognostic model using four key prognostic mRNAs (OPN3, DAAM2, HENMT1, and CAVIN3) in the ceRNA network, which showed excellent prognostic ability.
CeRNA networks of CESC have been constructed in other studies, but there are some limitations. Song et al.9 constructed a CESC-associated ceRNA network that consisted of 50 lncRNAs, 81 mRNAs and 18 miRNAs and found that several RNAs were associated with the prognosis. However, a prognostic model based on the prognostic RNAs for CESC was not developed in that study. In another study, Chen et al.10 also constructed a CESC-associated ceRNA network; however, the relationship between OS and only a single gene was assessed in that study. Although some researchers have constructed a CESC-associated ceRNA network and simultaneously developed a prognostic model, they did not use ROC curves to evaluate the prognostic ability of the model. Prognostic models based on multiple mRNAs could provide more accurate predictions than those based on single gene. We constructed a ceRNA network and identified several novel potential prognostic RNAs for CESC. Moreover, a prognostic model was developed based on the four key prognostic mRNAs. The prognostic ability of this model was indicated by AUC of 0.726 in ROC analysis.
Several prognostic models for CESC based on multiple genes have been developed in previous studies. Meng et al.14 developed a prognostic model for CESC based on DSG2, ITM2A, CENPM, RIBC2, and MEIS2. Liu et al.15 developed a multi-mRNA prognostic model composed of ITGA5, HHEX, and S1PR4. Similarly, we also developed a CESC-associated prognostic model based on OPN3, DAAM2, HENMT1, and CAVIN3, and the prognostic ability of this model was indicated by AUC of 0.726 in ROC analysis. Moreover, in our study, a Cox proportional hazards model was used, and we corrected for the following confounding factors that can greatly affect prognosis: T stage, N stage, M stage, and pathological stage. The RS signature of this signature was significantly associated with the clinical outcome of CESC patients. These results indicate that the signature we developed is an independent prognostic factor.
These four mRNAs have also been reported to play vital roles in tumorigenesis in various cancers. OPN3 was found to be related to tumor metastasis and drug sensitivity. Chao et al.16 reported that OPN3 enhanced tumor metastasis in lung adenocarcinoma. Jiao et al.17 revealed that OPN3 sensitized hepatocellular carcinoma cells to 5-fluorouracil treatment by regulating the apoptotic pathway, a process was related to phospho-AKT and the Bcl2/Bax ratio. DAAM2 drives tumorigenesis via various pathways. Chen et al.18 found that DAAM2 promotes invasion in colorectal cancer by activating PAK1 and promoting MMP7 expression. Zhu et al.19 uncovered that DAAM2 driven degradation of VHL promotes gliomagenesis. Consistent with our findings, Huang et al.20 reported that HENMT1 plays protective roles in CESC. CAVIN3 plays opposing roles in different types of cancers. Sun et al.21 found that CAVIN3 promotes the migration, proliferation, and invasion of lung cancer cells and that process was related to the mammalian target of rapamycin (mTOR) signaling pathway. In contrast, CAVIN3 functions as a metastasis suppressor by inhibiting the AKT pathway in breast cancer22. According to the above literature, we speculated that the AKT/mTOR signal pathway could be the primary enriched pathway of these four mRNAs.
In this study, we also discovered some prognostic miRNAs that extensively reported in previous studies. MiRNA 210 was generally reported to exhibit oncogenic properties in breast, lung, head and neck, pancreatic cancer, and glioblastoma23. In consistent with our data, the overexpressed miRNA 210 in breast cancer is related with a poor prognosis, that result is correlated with aggressiveness and shorter time to distant metastasis24,25. MiRNA 200a as a potential biomarker was widely reported to be associated with a poor prognosis in epithelial ovarian cancer26. MiRNA 200b was widely reported in cancer chemosensitivity, that process was related with EMT, cancer stem cells proliferation, angiogenesis, apoptosis, and cell cycle distribution27. MiRNA 126 was reported to be a new and promising player in lung cancer that related to its promoting effect of metastasis and angiogenesis28. MiRNA 4664, miRNA 4258, and AP001205.1 were rarely reported in previous studies and their oncogenic properties remained unknown, that would provide new lights on miRNA field. In summary, various studies indicated that our discovered prognostic miRNAs were potential biomarkers of cancers.
Excepted for USP30-AS1 and DDN-AS1, few studies reported our discovered prognostic lncRNAs. Contrary to our data, USP30-AS1 was reported to be related with poor prognosis in both primary and recurrent glioma patients29. USP30-AS1 also promotes tumor cell survival by cis-regulating USP30 and ANKRD13A in acute myeloid leukemia30. DDN-AS1 was reported to be a poor prognosis factor with cervical cancer via DDN-AS1-miR-15a/16-TCF3 feedback loop regulates tumor progression31, that result support our study. All of these studies indicated that our discovered prognostic lncRNAs play vital roles in tumorigenesis and these unreported lncRNAs shed new lights on lncRNA filed.
The KEGG pathway analysis of the RNAs in the ceRNA network indicated that the targeted RNAs were significantly enriched in the TGF-beta signaling pathway and the cell adhesion molecules pathway. The TGF-beta pathway is a critical cancer-associated signaling pathway that is involved in proliferation, apoptosis, differentiation, migration, and epithelia-mesenchymal transition (EMT) of cancer32,33. The TGF-β signaling pathway also plays vital role in CESC. Deng et al.34 reported that CD36 and TGF-β interact to promote the EMT in CESC. Yang et al.35 found that downregulation of SEMA4C inhibited EMT, invasion, and metastasis in CESC via inhibition of TGF-β. The cell adhesion molecules pathway plays a critical role in the development of CESC. Carvalho et al.36 reported that cell adhesion molecule L1 is associated with a poor prognosis. Biological process analysis revealed that the main biological process altered by the survival-associated RNAs in the ceRNA network is the epithelial cell proliferation. Cytokinetic homeostasis is controlled by the balance between cell proliferation and apoptosis. Previous studies have reported that excessive cell proliferation activity can lead to precancerous lesions in some types of cancer. Obara37 indicated that the epithelial cell proliferation activity is significantly increased in a stepwise manner from normal gallbladder mucosa to cancerous tissue. However, different pathways are usually perturbed by different molecules and thus need to be further investigated in the laboratory.
Our study still has some limitations. First, although the effects of OPN3, DAAM2, HENTM1, and CAVIN3, composing our prognostic signature, on tumorigenesis have been reported in other types of cancer16,19,21,38, their exact effects on CESC have yet to be fully elucidated and need to be verified by experiments. Second, the research data came from a single online database, and another independent cohort is needed to verify the above results in the future. Third, the information of cervical cancer patients obtained from TCGA should be assessed with another experimental method.
Conclusion
In conclusion, we identified multiple potential prognostic markers and constructed a ceRNA network that provides novel insights for studying gene complexity in CESC and helps us better understand the pathogenesis and prognosis of CESC from the multidimensional perspective of gene expression. Here, we developed a multi-mRNA-based prognostic model that may compensate for limitations of the commonly used prognostic evaluation system. This study may provide novel biomarkers and facilitate the design of molecular targeted therapies for CESC. Although we investigated the potential prognostic value of these genes, due to work limitations, we did not perform an evaluation in another independent cohort to verify above results. In addition, the biological functions and mechanisms of above genes have yet to be fully elucidated. In summary, more experimental and clinical studies are needed to explore their functions and verify their prognostic value of these genes in the future.
Materials and methods
Data processing
The clinical information and RNA-seq data of CESC patients were downloaded from The Cancer Genome Atlas (https://portal.gdc.cancer.gov) (TCGA) database. The inclusion criteria were as follows: (1) Patients with complete clinical information, including T stage, M stage, N stage, pathological stage, survival status, age, and histological type; and (2) Patients with complete lncRNA-seq, miRNA-seq, and mRNA-seq data. Finally, 240 CESC samples and 3 adjacent non-tumor samples were examined. All of the human subjects/data were downloaded and analyzed for scientific purposes, which was in accordance with TCGA ethics approval.
Differentially expressed RNAs analysis
The edgeR R package39 was used to identify differentially expressed mRNAs, lncRNAs, and miRNAs between CESC and adjacent non-tumor samples. A false discovery rate (FDR)-adjusted P value < 0.05 and an absolute log2 fold change |log2FC| value > 2 were considered to indicate statistical significance. The expression data for lncRNAs, miRNAs, and mRNAs were converted to log2(count + 1) values after normalization with the edgeR R package for further analysis.
Survival analysis and development of the prognosis-associated signature
The survival R package was used for univariate Cox regression analysis to assess the relationships between differentially expressed RNAs (i.e., lncRNAs, miRNAs, and mRNAs) and OS, and mRNAs with a P < 0.05 (unadjusted) were considered to statistically significant and termed to prognostic RNAs40. To further develop a multi-mRNA-based prognostic model, prognostic mRNAs with a P < 0.001 (unadjusted) in the univariate Cox regression analysis were included in multivariate Cox regression analysis, and prognostic mRNAs with P < 0.05 (unadjusted) were considered to statistically significant. Finally, we obtained four mRNAs (OPN3, DAAM2, HENTM1, and CAVIN3) to for the development of the prognostic signature.
The mRNA-based prognostic signature risk score model was developed on the basis of a linear combination of the expression level (Exp) multiplied by the regression coefficient (β) derived from the multivariate Cox regression model and was represented by the following equation, as previously reported41,42: Risk Score (RS) = (ExpOPN3 × βOPN3) + (ExpDAAM2 × βDAAM2) + (ExpHENMT1 × βHENMT1) + (ExpCAVIN3 × βCAVIN3).
With the median RS as cut-off value, CESC patients were divided into a low-risk group and a high-risk group. Time-dependent receiver operating characteristic (ROC) analysis was applied to assess the prognostic accuracy of the model, and the area under the curve (AUC) was used to assess the specificity and sensitivity of the model. The pROC R package was used to perform the time-dependent ROC analysis43.
Analysis of the protein expression levels of the 4 key mRNAs
The Human Protein Atlas (https://www.proteinatlas.org) provides tissue and cellular distribution information for approximately 26,000 human proteins. This information was obtained via immunoassay techniques (immunohistochemistry, immunofluorescence, and western blotting) to detect protein expression in 64 cancer cell lines, 48 normal human tissues and 20 tumor tissues44. In this study, we compared the expression levels of the 4 key genes (OPN3, DAAM2, HENTM1, and CAVIN3) between normal and CESC tissues using immunohistochemical images from the Human Protein Atlas database.
Functional annotation and pathway enrichment analysis
The clusterProfiler R package45 and ggplot2 R package46 were used to process the data for prognostic mRNAs and visualize the enrichment results of gene ontology (GO) enrichment analysis and Kyoto encyclopedia of genes and genomes (KEGG) pathway enrichment analysis47,48. GO enrichment analysis included biological processes (BP), molecular functions (MF), and cellular components (CC). The statistical significance threshold was set at Benjamini–Hochberg (BH)-adjusted P value < 0.05 for enrichment analyses.
ceRNA network construction
We constructed a ceRNA network based on the identified prognostic RNAs. The multiMiR R package49 and miRcode (http://www.mircode.org/) database were applied to predict miRNA-mRNA interactions and lncRNA-miRNA interactions. Finally, Cytoscape (version 3.7.0) software was utilized to visualize the ceRNA network based on the lncRNA-miRNA-mRNA axes by combining the lncRNA-miRNA interactions with the miRNA-target gene interactions50. In the ceRNA network, lncRNAs and mRNAs act as natural miRNA sponges to suppress miRNA functions by binding to one or more sites in miRNA.
Statistical analysis
The relationships between different groups and the gene expression profiles were assessed by t tests, and P < 0.05 (unadjusted) was considered to statistical significance. Univariate Cox regression and multivariate Cox regression analysis was used to analyze the relationships between the clinicopathological parameters and OS. R software (version 4.0.2; https://mirrors.tuna.tsinghua.edu.cn/CRAN/) was used to generate figures and perform statistical analyses.Please note we have moved the section “Ethics approval and Patient consent for publication” to the end of the methods, as per house style.yes
Ethics approval
All of the data involved to human was obtained reasonably basing on TCGA ethical statements.
Patient consent for publication
Not applicable.
Data availability
The datasets used during the current study are available from TCGA (https://portal.gdc.cancer.gov) and The Human Protein Atlas (https://www.proteinatlas.org). Analyzed data is available within supplementary information or from the corresponding author upon reasonable request.
Abbreviations
- CESC:
-
Cervical squamous cell carcinoma and endocervical adenocarcinoma
- TCGA:
-
The Cancer Genome Atlas
- ceRNA:
-
Competitive endogenous RNA
- KEGG:
-
Kyoto encyclopedia of genes and genomes
- GO:
-
Gene ontology
- BP:
-
Biological progress
- CC:
-
Cellular components
- MF:
-
Molecular function
- ROC:
-
Receiver operating characteristic
- AUC:
-
Area under curve
- |FC|:
-
Absolute value of fold change
- RNA-seq:
-
RNA sequence
- LncRNA:
-
Long non-coding RNA
- miRNA:
-
MicroRNA
- mRNA:
-
Message RNA
- EMT:
-
Epithelia-mesenchymal transition
- OS:
-
Overall survival
- RS:
-
Risk score
- HR:
-
Hazard risk
- Exp:
-
Expression level
- β:
-
Regression coefficient
- IHC:
-
Immunohistochemistry
- BMP:
-
Bone morphogenetic protein
- WGCNA:
-
Weighted correlation network analysis
- FDR:
-
False discovery rate
References
Paavonen, J. Human papillomavirus infection and the development of cervical cancer and related genital neoplasias. Int. J. Infect. Dis. 11(Suppl 2), S3-9. https://doi.org/10.1016/s1201-9712(07)60015-0 (2007).
Bray, F. et al. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 68, 394–424. https://doi.org/10.3322/caac.21492 (2018).
Qi, X. et al. ceRNA in cancer: Possible functions and clinical implications. J. Med. Genet. 52, 710–718. https://doi.org/10.1136/jmedgenet-2015-103334 (2015).
Karreth, F. A. & Pandolfi, P. P. ceRNA cross-talk in cancer: When ce-bling rivalries go awry. Cancer Discov. 3, 1113–1121. https://doi.org/10.1158/2159-8290.Cd-13-0202 (2013).
Mishra, S., Yadav, T. & Rani, V. Exploring miRNA based approaches in cancer diagnostics and therapeutics. Crit. Rev. Oncol. Hematol. 98, 12–23. https://doi.org/10.1016/j.critrevonc.2015.10.003 (2016).
Peng, W. X., Koirala, P. & Mo, Y. Y. LncRNA-mediated regulation of cell signaling in cancer. Oncogene 36, 5661–5667. https://doi.org/10.1038/onc.2017.184 (2017).
Thomson, D. W. & Dinger, M. E. Endogenous microRNA sponges: Evidence and controversy. Nat. Rev. Genet. 17, 272–283. https://doi.org/10.1038/nrg.2016.20 (2016).
Salmena, L., Poliseno, L., Tay, Y., Kats, L. & Pandolfi, P. P. A ceRNA hypothesis: The Rosetta Stone of a hidden RNA language?. Cell 146, 353–358. https://doi.org/10.1016/j.cell.2011.07.014 (2011).
Song, J. et al. Reconstruction and analysis of the aberrant lncRNA-miRNA-mRNA network based on competitive endogenous RNA in CESC. J. Cell Biochem. 119, 6665–6673. https://doi.org/10.1002/jcb.26850 (2018).
Chen, W. et al. Construction and analysis of lncRNA-mediated ceRNA network in cervical squamous cell carcinoma by weighted gene co-expression network analysis. Med. Sci. Monit. 25, 2609–2622. https://doi.org/10.12659/msm.913471 (2019).
Ding, H. et al. The new biomarker for cervical squamous cell carcinoma and endocervical adenocarcinoma (CESC) based on public database mining. Biomed. Res. Int. 2020, 5478574. https://doi.org/10.1155/2020/5478574 (2020).
Mockler, D. et al. Keratin 17 is a prognostic biomarker in endocervical glandular neoplasia. Am. J. Clin. Pathol. 148, 264–273. https://doi.org/10.1093/ajcp/aqx077 (2017).
Rong, L. et al. FAM83A as a potential biological marker is regulated by miR-206 to promote cervical cancer progression through PI3K/AKT/mTOR pathway. Front Med (Lausanne) 7, 608441. https://doi.org/10.3389/fmed.2020.608441 (2020).
Meng, H. et al. Identification of key genes in association with progression and prognosis in cervical squamous cell carcinoma. DNA Cell Biol. 39, 848–863. https://doi.org/10.1089/dna.2019.5202 (2020).
Liu, J. et al. Methylation-driven genes and their prognostic value in cervical squamous cell carcinoma. Ann. Transl. Med. 8, 868. https://doi.org/10.21037/atm-19-4577 (2020).
Xu, C. et al. Expression of OPN3 in lung adenocarcinoma promotes epithelial-mesenchymal transition and tumor metastasis. Thorac. Cancer 11, 286–294. https://doi.org/10.1111/1759-7714.13254 (2020).
Jiao, J. et al. Opsin3 sensitizes hepatocellular carcinoma cells to 5-fluorouracil treatment by regulating the apoptotic pathway. Cancer Lett. 320, 96–103. https://doi.org/10.1016/j.canlet.2012.01.035 (2012).
Chen, A., Liu, Z., Wu, Q. & Li, H. Disheveled-associated activator of morphogenesis 2 promotes invasion of colorectal cancer by activating PAK1 and promoting MMP7 expression. Genes Genomics 43, 923–935. https://doi.org/10.1007/s13258-021-01111-1 (2021).
Zhu, W. et al. Daam2 driven degradation of VHL promotes gliomagenesis. Elife https://doi.org/10.7554/eLife.31926 (2017).
Huang, Z., Li, F. & Li, Q. Expression profile of RNA binding protein in cervical cancer using bioinformatics approach. Cancer Cell Int. 21, 647. https://doi.org/10.1186/s12935-021-02319-7 (2021).
Sun, G. & Ni, K. The role of Cavin3 in the progression of lung cancer and its mechanism. Biomed. Res. Int. 2020, 6364801. https://doi.org/10.1155/2020/6364801 (2020).
An, X. et al. Cavin3 suppresses breast cancer metastasis via inhibiting AKT pathway. Front. Pharmacol. 11, 01228. https://doi.org/10.3389/fphar.2020.01228 (2020).
Bavelloni, A. et al. MiRNA-210: a current overview. Anticancer Res. 37, 6511–6521. https://doi.org/10.21873/anticanres.12107 (2017).
Radojicic, J. et al. MicroRNA expression analysis in triple-negative (ER, PR and Her2/neu) breast cancer. Cell Cycle 10, 507–517. https://doi.org/10.4161/cc.10.3.14754 (2011).
Volinia, S. et al. Breast cancer signatures for invasiveness and prognosis defined by deep sequencing of microRNA. Proc. Natl. Acad. Sci. U. S. A. 109, 3024–3029. https://doi.org/10.1073/pnas.1200010109 (2012).
Teng, Y. et al. miRNA-200a/c as potential biomarker in epithelial ovarian cancer (EOC): Evidence based on miRNA meta-signature and clinical investigations. Oncotarget 7, 81621–81633. https://doi.org/10.18632/oncotarget.13154 (2016).
Feng, B., Wang, R. & Chen, L. B. Review of miR-200b and cancer chemosensitivity. Biomed. Pharmacother. 66, 397–402. https://doi.org/10.1016/j.biopha.2012.06.002 (2012).
Chen, Q., Chen, S., Zhao, J., Zhou, Y. & Xu, L. MicroRNA-126: A new and promising player in lung cancer. Oncol. Lett. 21, 35. https://doi.org/10.3892/ol.2020.12296 (2021).
Wang, N., Li, J., Xin, Q. & Xu, N. USP30-AS1 contributes to mitochondrial quality control in glioblastoma cells. Biochem. Biophys. Res. Commun. 581, 31–37. https://doi.org/10.1016/j.bbrc.2021.10.006 (2021).
Zhou, W., Xu, S., Deng, T., Zhou, R. & Wang, C. LncRNA USP30-AS1 promotes the survival of acute myeloid leukemia cells by cis-regulating USP30 and ANKRD13A. Hum. Cell 35, 360–378. https://doi.org/10.1007/s13577-021-00636-7 (2022).
Liu, Z. et al. DDN-AS1-miR-15a/16-TCF3 feedback loop regulates tumor progression in cervical cancer. J. Cell Biochem. 120, 10228–10238. https://doi.org/10.1002/jcb.28307 (2019).
Syed, V. TGF-β signaling in cancer. J. Cell Biochem. 117, 1279–1287. https://doi.org/10.1002/jcb.25496 (2016).
Xie, F., Ling, L., van Dam, H., Zhou, F. & Zhang, L. TGF-β signaling in cancer metastasis. Acta Biochim. Biophys. Sin. (Shanghai) 50, 121–132. https://doi.org/10.1093/abbs/gmx123 (2018).
Deng, M. et al. CD36 promotes the epithelial-mesenchymal transition and metastasis in cervical cancer by interacting with TGF-β. J. Transl. Med. 17, 352. https://doi.org/10.1186/s12967-019-2098-6 (2019).
Yang, L. et al. Downregulation of SEMA4C inhibit epithelial-mesenchymal transition (EMT) and the invasion and metastasis of cervical cancer cells via inhibiting transforming growth factor-beta 1 (TGF-β1)-induced hela cells p38 mitogen-activated protein kinase (MAPK) activation. Med. Sci. Monit. 26, e918123. https://doi.org/10.12659/msm.918123 (2020).
Mancusi de Carvalho, J. P. et al. L1 cell adhesion molecule (L1CAM) in stage IB cervical cancer: Distinct expression in squamous cell carcinomas and adenocarcinomas. J. Clin. Pathol. 73, 748–753. https://doi.org/10.1136/jclinpath-2020-206500 (2020).
Obara, T. et al. Epithelial cell proliferation and gene mutation in the mucosa of gallbladder with pancreaticobiliary malunion and cancer. J. Hepatobiliary Pancreat. Surg. 6, 229–236. https://doi.org/10.1007/s005340050112 (1999).
Hoff, A. M. et al. Identification of novel fusion genes in testicular germ cell tumors. Cancer Res. 76, 108–116. https://doi.org/10.1158/0008-5472.Can-15-1790 (2016).
Robinson, M. D., McCarthy, D. J. & Smyth, G. K. edgeR: A Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26, 139–140. https://doi.org/10.1093/bioinformatics/btp616 (2010).
Terry, M., Therneau, P. M. G. Modeling Survival Data: Extending the Cox Model. (2000).
Lossos, I. S. et al. Prediction of survival in diffuse large-B-cell lymphoma based on the expression of six genes. N. Engl. J. Med. 350, 1828–1837. https://doi.org/10.1056/NEJMoa032520 (2004).
Alizadeh, A. A. et al. Prediction of survival in diffuse large B-cell lymphoma based on the expression of 2 genes reflecting tumor and microenvironment. Blood 118, 1350–1358. https://doi.org/10.1182/blood-2011-03-345272 (2011).
Robin, X. et al. pROC: An open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinform. 12, 77. https://doi.org/10.1186/1471-2105-12-77 (2011).
Wu, Z. H., Cai, F. & Zhong, Y. Comprehensive analysis of the expression and prognosis for GBPs in head and neck squamous cell carcinoma. Sci. Rep. 10, 6085. https://doi.org/10.1038/s41598-020-63246-7 (2020).
Yu, G., Wang, L. G., Han, Y. & He, Q. Y. clusterProfiler: An R package for comparing biological themes among gene clusters. OMICS 16, 284–287. https://doi.org/10.1089/omi.2011.0118 (2012).
Wickham, H. ggplot2: Elegant graphics for data analysis. (2016).
Kanehisa, M., Furumichi, M., Tanabe, M., Sato, Y. & Morishima, K. KEGG: New perspectives on genomes, pathways, diseases and drugs. Nucleic Acids Res. 45, D353-d361. https://doi.org/10.1093/nar/gkw1092 (2017).
Kanehisa, M., Furumichi, M., Sato, Y., Ishiguro-Watanabe, M. & Tanabe, M. KEGG: Integrating viruses and cellular organisms. Nucleic Acids Res. 49, D545-d551. https://doi.org/10.1093/nar/gkaa970 (2021).
Ru, Y. et al. The multiMiR R package and database: Integration of microRNA-target interactions along with their disease and drug associations. Nucleic Acids Res. 42, e133. https://doi.org/10.1093/nar/gku631 (2014).
Shannon, P. et al. Cytoscape: A software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504. https://doi.org/10.1101/gr.1239303 (2003).
Acknowledgements
We thank those researchers who shared the data and R packages for this study.
Funding
This study was sponsored by “the Postgraduate Scientific Research Fund” of Zhejiang Chinese Medical University (2020YKJ01).
Author information
Authors and Affiliations
Contributions
L.L. performed the data analysis. L.L., G.C.L., and Q.S.G. prepared the manuscript. G.C.L., W.W.W., and F.L. revised the manuscript. G.C.L. acquired the funding. X.M.L. designed the study.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Li, L., Guo, Q., Lan, G. et al. Construction of a four-mRNA prognostic signature with its ceRNA network in CESC. Sci Rep 12, 10691 (2022). https://doi.org/10.1038/s41598-022-14732-7
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-022-14732-7