Introduction

Lung cancer is prevalent and has the highest cancer-related mortality worldwide1. Non-small cell lung cancer (NSCLC) is the most common type and accounts for about 85% of this disease. It is increasingly accepted that NSCLC exhibits notably sexual dimorphism2 based on the following facts: an increasing incidence of NSCLC among young women than men was observed3, the driver mutations such as epidermal growth factor receptor (EGFR) and anaplastic lymphoma kinase (ALK) alterations occur more frequently in females4, the expression level of programmed cell death ligand-1 (PD-L1) is higher in males5 and correspondingly, male patients can benefit more from anti-PD1/PD-L1 immunotherapy6,7. Thus, it is very urgent to identify the reasons underlying the sexual dimorphism of NSCLC to develop strategies preventing and curing this disease.

In the last 20 years, a large number of studies had been done to investigated whether estrogens or their cognate estrogen receptor (ER)-α and -β play roles during the pathogenesis and development of this disease; however, little progress has been made8. The G protein-coupled estrogen receptor1 (GPER1), formerly named GPR30, is the third and novel type of ER, which is different from ER-α and -β, due to its ability that yields a rapid non-genomic signaling upon binding with estrogen9. In the past two decades, GPER1 was found to play a broad range of biological roles under many physiological and pathological contexts, including regulation of metabolism10, inflammatory or immune response11; additionally, it was involved in the tumorigenesis and progression of several tumors, ranging from estrogen-related cancers such as breast12 and ovarian cancers13 to estrogen-unrelated cancers such as thyroid14, prostate15, liver16 and gastric cancers17. We and others had previously demonstrated that positive expression rate of GPER1 was more than 80% in NSCLC18,19; furthermore, our work showed that its subcellular localization was associated with EGFR mutation status, resistance to EGFR-tyrosine kinase inhibitor (EGFR-TKI) and patients’ outcomes18,20. However, the expression changes and roles of GPER1 during tumorigenesis of NSCLC remain unclear.

In this study, by using bioinformatic and self-sequencing data, we first performed a pan-cancer analysis to compare the expression levels of GPER1 between cancer and healthy tissue, and to analyze the correlation of GPER1 expression with patient prognosis. Furthermore, we conducted a comprehensive analysis regarding GPER1 functions in NSCLC, with a focus on the modulating tumor immune microenvironment (TIME). We found the expression of GPER1 was reduced in the tumorigenesis in most of the solid tumors, and it was broadly involved in TIME modulation and extracellular matrix (ECM) assembly in NSCLC. Indeed, our analysis provides new insights regarding GPER1 in tumorigenesis and cancer progression, therefore presenting a theoretical basis for subsequent studies aimed to developing new therapeutic targets and biomarkers for NSCLC.

Methods

Data collection

The expression data of 34 and 31 cancers such as LUAD, LUSC, GBM and OV were downloaded from TIMER database and GEPIA database, respectively. In addition, RNA-seq data for 21 cancers including LUAD, LUSC, ESCA, and BRCA were downloaded from The Cancer Genome Atlas (TCGA, https://portal.gdc.cancer.gov/) database. The TCGA-LUAD dataset included 526 LUAD samples and 59 normal samples. In addition, this study also downloaded LUAD-related expression data from the Gene Expression Omnibus (GEO, https://www.ncbi.nlm.nih.gov/) database; 46 LUAD samples and 45 control samples were included in the GSE18842 dataset, and the sequencing data of 58 LUAD tumor samples and non-tumor samples were included in the GSE32863 dataset. The TCGA-LUSC dataset included 501 LUSC samples and 49 normal samples. Similarly, this study also downloaded LUSC-related expression data from the GEO (https://www.ncbi.nlm.nih.gov/geo/query) database; 48 LUSC samples and 30 control samples were included in the GSE2088 dataset, and 16 LUSC samples and 15 control samples were included in the GSE31446. Furthermore, mRNA sequencing was performed on cells in four subgroups. The four subgroups were human lung adenocarcinoma A549 cells (A), A549 cells with activated GPER1 (A549 + G1, AG), A549 cells with knocked down GPER1 (Aal cells, AA), and Aal cells with GPER1 agonist added (Aal + G1, AAG), with three samples in each group.

Analysis of GPER1 in Pan-cancer

GPER1 plays an important role in cancer. First, this study verified the expression of GPER1 in 34 different cancers and 31 different cancers such as LUAD, GBM, OV and normal tissues in TIMER database and GEPIA database, respectively. In addition, cancers were classified into high and low expression groups based on the median GPER1 expression in the 21 cancers of TCGA datasets, and survival differences between high and low expression groups were compared. To determine whether GPER1 expression was associated with patient outcomes, cox regression analysis was further performed for 21 cancers.

Analysis of GPER1 in LUAD and LUSC

To analyze the role played by GPER1 in LUAD and LUSC, respectively, we compared the expression of GPER1 in cancer tissues with that in paracancer samples using log-rank sum test in TCGA database, GES32863, GSE18842, GSE2088 and GSE31446 datasets, and assessed the diagnostic ability of GPER1 plotting receiver operating characteristic (ROC) curves.

Identification of hub genes

In the TCGA dataset, LUAD and LUSC patients were divided into GPER1 high and low expression groups, respectively, based on the median value of GPER1, and then differential analysis was performed for GPER1 high and low expression groups using the “limma” package21. The screening conditions for differentially expressed genes (DEGs) were P < 0.05, |log2FC| >0.5. To investigate the interactions between DEGs, we used the STRING (https://string-db.org) website to construct a protein-protein interactions (PPI) network based on DEGs. After that, we analyzed the DEGs in the PPI network by Degree, MCC, and MNC through cytoHubba plugin in Cytoscape, and selected the top 20 concurrent genes of the three algorithms as hub genes.

Functional enrichment analysis

This study performed Kyoto Encyclopedia of Genes and Genomes (KEGG)22,23,24 and gene ontology (GO) enrichment analyses of hub genes using the “clusterProfiler” package25, and the enriched pathways were screened by P < 0.05. In order to analyze the effect of high and low expression groups on disease progression, this study also used the “clusterProfiler” package25 to perform enrichment analysis of high and low expression groups. First, the “limma” package21was used to analyze the differences between the high and low expression groups, and then the sorted gene sets were sorted according to FC in descending order and entered into “clusterProfiler” for enrichment analysis with a screening condition of P < 0.05.

Immune infiltration analysis

The activity of immune cells in each sample could be obtained by the Cell type Identification by Estimating Relative Subsets Of RNA Transcripts (CIBERSORT) algorithm. In this study, we calculated the proportion of immune cells by the CIBERSORT algorithm and compared immune cells between high and low expression groups by rank sum test. After that, we also calculated the relationship between differential immune cells between high and low expression groups and GPER1 using Pearson correlation analysis.

Cell culture and whole transcriptome sequencing

Human LUAD cell line A549 was purchased from the American Type Culture Collection and cultured in 1640 medium supplemented with 10% fetal bovine serum at 37 °C with 5% CO2, as described previously20. GPER agonist G1 was purchased from Tocris (Ellisville, USA), and GPER1 knockdown or overexpression and activation with G1 were done as described previously20. Whole transcriptome sequencing assay was conducted by an external service provider (LC Sciences, Houston, TX).

Identification and enrichment analysis of GPER1 repression-related genes of sequencing data

Firstly, Trimmomatic software was used to filter the low-quality data from the original data to remove contamination and splice sequences, and finally clean data was obtained. Then the quality of sequencing data was evaluated by FastQC26 (v0.11.9) software, and the Clean Data was compared with the reference genome (GRCh37). The comparison tool was hisat2 (v2.2.1). After obtaining the gene expression matrix for each sample, we used the “DESeq2” package26 to perform differential analysis between AA and control A. KEGG and GO enrichment analysis of DEGs were performed using the “clusterProfiler” package25 to filter the enriched pathways. Then we also performed differential analysis between AAG and AA and enrichment analysis for DEGs. The screening conditions for DEGs were P < 0.05 and |Log2FC|>1.The enriched pathways were screened with P < 0.05. Next, genes with opposite expression trends between DEGs in the AA and A groups and DEGs in the AAG and AA groups were crossed as GPER1 repression-related genes. And the “clusterProfiler” package25 was used to perform KEGG and GO enrichment analysis of GPER1 repression-related genes.

Identification of GPER1 activation-related genes

Differential analysis was performed between AG and A and between AAG and AG groups using the “DESeq2” package26 with the screening conditions of P < 0.05 and |Log2FC|>1. KEGG and GO enrichment analysis was performed on DEGs using the “clusterProfiler” package25. Next, DEGs with opposite expression trends between AG and A and between AAG and AG groups were crossed to obtain GPER1 activation-associated genes, and KEGG and GO enrichment analysis of GPER1 activation-associated genes were performed using the “clusterProfiler” package25.

Screening of GPER1 activation and repression-related hub genes

We used STRING (https://string-db.org) website to construct PPI networks for GPER1 activation and repression-related genes with a confidence level of 0.2 to help us identify the nodes that play key roles in the network. Then we analyzed the genes in the PPI network by using the cytoHubba plugin in Cytoscape for Degree, MCC, and MNC, respectively, taking the top 20 genes for each algorithm and selecting the genes that were present in all three metrics as hub genes for subsequent analysis.

Construction of upstream and downstream regulatory networks of hub genes

We used the “DESeq2” package26 to screen for differential miRNAs, lncRNAs, and circRNAs between AA and A at P < 0.05 and |Log2FC|>1. Similarly, we also screened for differential miRNAs, lncRNAs, and circRNAs between the AG and A groups. After that, the Hub gene was entered into the Starbase database for mRNA–miRNA regulatory relationship analysis to obtain the miRNAs that interacted with the Hub gene, and the predicted miRNAs were intersected with all the differential miRNAs between AA and A and between AG and A to obtain the intersecting miRNAs. Meanwhile, we input the hub gene into the network analyst online database for mRNA-TF interaction prediction, preserving the TF-mRNA–miRNA triplet regulatory interaction. In this way, we obtained a TF-mRNA–miRNA regulatory network. Next, the obtained intersecting miRNAs were entered into Starbase database to obtain miRNA–circRNA correlation and miRNA–lncRNA correlation, and again only all the differential lncRNAs and differential circRNAs obtained between AA and A, between AG and A, and the predicted lncRNAs and circRNAs were retained. Thus, we obtained an mRNA–miRNA–lncRNA regulatory network and an mRNA–miRNA–circRNA regulatory network.

Real-time reverse transcription PCR (qRT-PCR) experiment

TRlzol was used to extract RNA from the cells of A549 group, Aal group, Aal + G1 group and A549 + G1 group, respectively. The concentration of RNA was detected by NanoPhotometer N50. The mRNA was reverse transcribed using the SureScript-First-strand-cDNA-synthesis-kit kit from Saville, and then the reverse transcription product cDNA was subjected to qPCR reaction. GAPDH was the internal reference gene, and the primer sequences were shown in Table 1. Finally, the expression of hub gene in the four groups of cells was compared using F-test.

Table 1 Primer sequences.

Western blot analysis

For this experiment, cells were treated using G1 at the concentrations of 0.1 µM for various times indicated in the figure legend. The Protein cell lysates and Western blotting procedures were performed as described previously20. Cellular proteins were firstly separated by 10% SDS-PAGE and then, transferred to PVDF membranes by half wet transfer on the ice. The monoclonal rabbit antibody against ERK1/2 (ab184699) and ERK1/2 (pT202/T185) (ab201015) were obtained from Abcam (Cambridge, UK), and these specific primary antibodies were incubated with the membranes at 4 °C overnight, then the corresponding secondary antibody conjugated with HRP were incubated for 1 h at room temperature. Membranes incubated by ECL luminescent solution were then imaged through chemiluminescent HRP substrate. Images were analyzed using Image J software. Each experiment was repeated at least three times with representative gels shown.

F-actin cytoskeleton observation

Cells were grown on glass coverslips for 48 h, fixed with 4% paraformaldehyde, permeabilized in 0.1% triton X-100 and blocked with 5% goat serum. Cells were then incubated with TRITC-conjugated phalloidin to label filamentous actin (F-actin). Coverslips were mounted on glass slides and subsequently imaged using a Nikon Eclipse 80i epifluorescence microscope (Tokyo, Japan).

Transwell migration assay

Cells (5 × 104 in 100 µl) were seeded into the upper well of a transwell chamber, which contained 500 µl of medium in the lower well. Chambers were then incubated at 37 °C for 24 h. Rho kinase inhibitor Y27632 (0.1 µM) was used in the experiments.

Statistical analysis

The SPSS System 17.0 was used for the statistical analysis of data sets. Student’s t-test or ANOVA were used to evaluate difference between subgroups. Two-tailed p-values of 0.05 or less were considered to be statistically significant.

Results

The expression of GPER1 was significantly lower in most tumor tissues than in non-cancerous tissues

In the TIMER database, GPER1 expression was lower than normal tissue in 13 cancers, including BLCA, BRCA, CHOL, COAD, ESCA, KICH, LUAD, LUSC, PRAD, READ, STAD, THCA, and UCEC (Fig. 1A). Likewise, in the GEPIA database, GPER1 expression was lower in 13 cancers than their normal tissues, including ACC, BLCA, BRCA, CESC, CHOL, COAD, KICH, LUAD, LUSC, READ, STAD, UCEC and UCS (Fig. 1B). Overall, the expression of GPER1 was significantly lower in most tumor tissues than in non-cancerous tissues. Importantly, GPER1 was a protective factor in ESCA, KIRC and UCEC; however, there was only a marginally protective effect in BRCA and LUAD (Fig. 1C).

Fig. 1
figure 1

Pan-cancer analysis of GPER1. (A) The expression profiles of GPER1 in the TIMER database. (B) The expression profiles of GPER1 in the GEPIA database. (C) Prognostic analysis of GPER1 in a cancer dataset.

GPER1 had good diagnostic value as distinguishing cancer from normal samples

The expression of GPER1 in cancer tissues was significantly lower than that in normal tissues in the TCGA, GSE32863, and GSE18842 datasets of LUAD (Fig. 2A) as well as in TCGA, GSE2088, and GSE31446 datasets of LUSC (Fig. 2B). In addition, the area under the curve (AUC) values of GPER1 gene in these datasets were 0.92, 0.948, and 0.955 in LUAD (Fig. 2C) as well as 0.946, 0.912 and 0.792 in LUSC (Fig. 2D), respectively, indicating that this gene had good diagnostic value as distinguishing cancer from normal samples.

Fig. 2
figure 2

The expression and diagnosis of GPER1 in LUAD and LUSC. (A) The expression of GPER1 in LUAD tissues and normal tissues in TCGA, GES32863 and GSE18842 datasets. (B) The expression of GPER1 in LUSC tissues and normal tissues in TCGA, GSE2088 and GSE31446 datasets. (C) The ROC curve of GPER1 in TCGA, GES32863, and GSE18842 datasets, evaluating the diagnostic value of GPER1 in distinguishing LUAD samples from normal samples. (D) The ROC curve of GPER1 in TCGA, GSE2088 and GSE31446 datasets, evaluating the diagnostic value of GPER1 in distinguishing LUSC samples from normal samples.

The 14 and 13 hub genes were identified in LUAD and LUSC, respectively

In LUAD, the median value of GPER1 (0.736) divided TCGA-LUAD into 263 GPER1 high expression samples and 263 GPER1 low expression groups. There were 198 DEGs in high expression patients compared to low expression patients, with 142 upregulated genes and 56 downregulated genes (Fig. 3A). The heat map of the top 20 DEGs was shown in Fig. 3B. The confidence level in the PPI network was chosen to be 0.4, and the network contained 154 nodes and 466 edges (Fig. 3C), in which CAV1 and ITGAM were connected with most genes and they were key genes in the PPI network. 14 common mRNAs, which were hub genes, were analyzed in Degree, MCC, and MNC, namely ITGA2, ITGB4, VWF, ITGA3, ITGB6, ICAM1, FGA, A2M, ITGAM, PLAU, CAV1, PLAT, SERPINA1, KIT (Fig. 3D).

Fig. 3
figure 3

Identification of hub genes in LUAD and LUSC. (A) Volcano map of DEGs between GPER1 high expression group and GPER1 low expression group in LUAD. (B) The heat map of the top 20 DEGs between GPER1 high expression group and GPER1 low expression group in LUAD. (C) The PPI network of DEGs between GPER1 high expression group and GPER1 low expression group in LUAD. (D) The Venn diagram of 14 hub genes between GPER1 high expression group and GPER1 low expression group in LUAD. (E) Volcano map of DEGs between GPER1 high expression groups and GPER1 low expression groups in LUSC. (F) The heat map of the top 20 DEGs between GPER1 high expression group and GPER1 low expression group in LUSC. (G) The PPI network of DEGs between GPER1 high expression group and GPER1 low expression group in LUSC. (H) The Venn diagram of 13 hub genes between GPER1 high expression group and GPER1 low expression group in LUSC.

In LUSC, the median value of GPER1 (0.597) divided TCGA-LUSC into 250 GPER1 high expression samples and 251 GPER1 low expression groups. There were 432 DEGs in high expression patients compared to low expression patients, with 337 upregulated genes and 95 downregulated genes (Fig. 3E). The heat map of the top 20 DEGs was shown in Fig. 3F. The confidence level in the PPI network was chosen to be 0.7, and the network contained 225 nodes and 734 edges (Fig. 3G). 13 common mRNAs, which were hub genes, were analyzed in Degree, MCC, and MNC, namely COL1A1, COL1A2, COL3A1, COL5A1, COL5A2, MMP2, SPARC, BMP1, FN1, BGN, DCN, FBN1, ITGA3 (Fig. 3H).

The main enrichment pathways of GPER1 in LUAD and LUSC were similar

In LUAD, functional enrichment analysis of the Hub genes revealed that 14 hub genes were enriched for a total of 633 GO functions and 25 KEGG pathways. GO functions included wound healing, integrin binding, coagulation, cell-matrix adhesion, focal adhesion, integrin complex, collagen-containing extracellular matrix, regulation of fluid levels, protease binding, plasma membrane signaling receptor complex, and response to hypoxia (Fig. 4A). Figure 4B showed 25 KEGG pathways, such as complement and coagulation cascades, extracellular matrix (ECM)-receptor interactions, focal adhesion, regulation of actin cytoskeleton, arrhythmogenic right ventricular cardiomyopathy, PI3K-Akt signaling pathway, hematopoietic cell lineage, human papillomavirus infection, platelet activation, etc. Importantly, GSVA functional enrichment analysis showed that the GPER1 high expression group was enriched to immune response, T cell response, angiogenesis and other pathways (Fig. 4C).

Fig. 4
figure 4

Enrichment analyses of hub genes in LUAD and LUSC. (A) GO enrichment map of hub genes in LUAD. (B) KEGG enrichment map of hub genes in LUAD. (C) GSVA functional enrichment analysis between GPER1 high expression group and GPER1 low expression group in LUAD. (D) GO enrichment map of hub genes in LUSC. (E) KEGG enrichment map of hub genes in LUSC. (F) GSVA functional enrichment analysis between GPER1 high expression group and GPER1 low expression group in LUSC.

In LUSC, functional enrichment analysis of the hub genes revealed that 13 hub genes were enriched for a total of 287 GO functions and 15 KEGG pathways. GO functions included collagen-containing ECM, ECM structural constituent, collagen fibril organization, platelet-derived growth factor binding, etc. (Fig. 4D). Figure 4E showed 15 KEGG pathways, such as AGE-RAGE signaling pathway in diabetic complications, protein digestion and absorption, proteoglycans in cancer, ECM-receptor interactions, relaxin signaling pathway, amoebiasis, focal adhesion, diabetic cardiomyopathy, PI3K-Akt signaling pathway, human papillomavirus infection, regulation of actin cytoskeleton, etc. In addition, GSVA functional enrichment analysis showed that the GPER1 high expression group was enriched to immune response, T cell response and other pathways (Fig. 4F), which was similar to that in LUAD.

Together, these results suggest that GPER1 signaling participated in cell-matrix adhesion, ECM assembly, regulation of actin cytoskeleton and immune response in NSCLC.

Immune infiltration analysis of LUAD and LUSC

Next, we performed immune infiltration analysis in view of GPER1 high expression being associated with immune response as showed by GSVA functional enrichment analysis. In LUAD, the proportions of 22 types of immune cells in the low and high GPER1 expression groups were shown in Fig. 5A, in which five immune cell scores, namely Plasma cells, gamma delta T cells, Monocytes, M1 Macrophages, and resting Mast cells, differed between the low and high GPER1 expression groups (Fig. 5B). Plasma cells, M1 Macrophages and gamma delta T cells were more concentrated in the low expression group, while Monocytes and resting Mast cells were more concentrated in the high expression group (Fig. 5C). In addition, GPER1 was positively correlated with resting Mast cells and Monocytes, while negatively with Plasma cells, gamma delta T cells and M1 Macrophages (Fig. 5D,E).

Fig. 5
figure 5

Immune infiltration analysis in LUAD and LUSC. (A) Column stacking diagram of immune cell abundance in high and low GRER1 expression groups in LUAD. (B) The expression of immune cells in the high and low GRER1 expression groups in LUAD. (C) Box plot reveals the expression of differential immune cells in high and low GRER1 expression groups in LUAD. (D,E) Correlation analysis between differential immune cells and GPER1 in LUAD. (F) Column stacking diagram of immune cell abundance in GRER1 high and low expression groups in LUSC. (G) The expression of immune cells in the high and low GRER1 expression groups in LUSC. (H) Box plot reveals the expression of differential immune cells in high and low GRER1 expression groups in LUSC. (I) Correlation analysis between differential immune cells and LUSC.

In LUSC, the proportions of 22 types of immune cells in the low and high GPER1 expression groups were shown in Fig. 5F, in which eight immune cell scores, namely naive B cells, memory B cells, memory CD8 T cells, resting memory CD4 T cells, activated memory CD4 T cells, regulatory T cells (Tregs), etc., differed between the low and high GPER1 expression groups (Fig. 5G). The naive B cells, gamma delta T cells, CD8 T cells and activated memory CD4 T cells were more abundant in the low expression group, while memory B cells, M0 Macrophages, resting memory CD4 T cells and Tregs were more concentrated in the high expression group (Fig. 5H). In addition, GPER1 was positively correlated with memory B cells, M0 Macrophages and Tregs, while negatively with naive B cells, gamma delta T cells, CD8 T cells and activated memory CD4 T cells (Fig. 5I).

GRER1 had strong correlation with common immune checkpoints in LUAD and LUSC

Next, the expression levels of 9 immune check point genes were compared between low and high GPER1 expression groups in LUAD and LUSC, respectively, including CD27, CD274, CD80, CTLA4, IDO1, LAG3, PDCD1, PDCD1LG2 and TIGIT. Strikingly, in LUAD, GPER1 expression is consistently positively correlated with CD274 (PD-L1), CD80, CTLA4, PDCD1, PDCD1LG2 and TIGIT (Fig. 6A); in LUSC, GPER1 expression is positively correlated with CD27, CD80, CTLA4, PDCD1and TIGIT, but negatively with CD274 (PD-L1) (Fig. 6B).

Fig. 6
figure 6

Analysis of difference of immune checkpoint and TIDE between high and low GPER1 groups. (A) Immune checkpoint gene expression patterns between high and low GPER1 groups in LUAD. (C) TIDE analysis between high and low GPER1 groups in LUAD. (B) Immune checkpoint gene expression patterns between high and low GPER1 groups in LUSC. (D) TIDE analysis between high and low GPER1 groups in LUSC.

GPER1 expression predict response to ICI

In view that GPER1 expression level was significantly associated with both immune cells infiltration and immune checkpoint genes expression in LUAD and LUSC, respectively, we further evaluated the predictive effect of GPER1 expression on response to PD1/PD-L1 immune checkpoint inhibitors (ICIs) using TCGA NSCLC cohort by TIDE analysis. As shown in Fig. 6D, the low expression of GPER1 could predict benefit from ICIs treatment only in LUSC (P < 0.001), but did not in LUAD (Fig. 6C).

Identification and enrichment analysis of GPER1 repression-related genes of sequencing data

The matching rate was higher than 84.19% in all 12 sequenced samples, indicating that the matching rate was high and not interfered by exogenous RNA molecules, which could meet the need of subsequent data analysis. A total of 1520 DEGs were counted in AA compared with control A, including 801 up-regulated genes and 719 down-regulated genes (Fig. 7A,B). 1520 DEGs were enriched to a total of 1942 GO functions and 76 KEGG pathways. The main functions were collagen-containing ECM, axon development, epithelial cell proliferation, cell–cell junctions, membrane potential regulation, regulation of ion transport across membranes, synaptic organization, metal ion transport protein activity, response to allosteric stimuli, receptor ligand activity, and leukocyte migration (Fig. 7C). Figure 7D showed 25 KEGG entries, such as neuroactive ligand-receptor interactions, MAPK signaling pathway, cytokine-cytokine receptor interactions, PI3K-Akt signaling pathway, calcium signaling pathway, cell adhesion molecules, axon guidance, Rap1 signaling pathway, cAMP signaling pathway, proteoglycans in cancer, chemoattractant-receptor activation, Ras signaling pathway, regulation of the actin cytoskeleton, and other entries. AAG had 240 up-regulated genes and 439 down-regulated genes compared to the AA group (Fig. 7E,F), and they were enriched to 1208 GO functions and 46 KEGG pathways, mainly including collagen-containing ECM, regulation of epithelial cell proliferation, amoeboid migration, fatty acid metabolic processes, regulation of trans-synaptic signaling, extracellular matrix organization (Fig. 7G), and ECM-receptor interactions, protein digestion and uptake, PI3K-Akt signaling pathway, Apelin signaling pathway, bile secretion and other pathways (Fig. 7H). The up-regulated genes between AA and A groups crossed with the down-regulated genes between AAG and AA groups to obtain 104 crossover genes, and the down-regulated genes between AA and A groups crossed with the up-regulated genes between AAG and AA groups to obtain 28 crossover genes, which 132 genes are GPER1 repression-related genes (Fig. 8A), and they were enriched to 244 GOs and 8 KEGG pathways. The main functions include regulation of chemical synaptic transmission, regulation of small molecule metabolic processes, response to peptide hormones, synaptic organization, phospholipid binding, fatty acid metabolic processes, response to insulin, and the main pathways include protein digestion and absorption, metabolism of xenobiotics by cytochrome P450, pentose and glucuronide interconversion, insulin resistance, and glutamatergic synapses (Fig. 8B, C).

Fig. 7
figure 7

Identification and enrichment analysis of differentially expressed genes. (A,B) Volcano map and heat map of differentially expressed genes between AA and A groups. (C) GO enrichment map of differentially expressed genes between AA and A groups. (D) KEGG enrichment map of differentially expressed genes between AA and A groups. (E,F) Volcano map and heat map of differentially expressed genes between AAG and AA groups. (G) GO enrichment map of differentially expressed genes between AAG and AA groups. (H) KEGG enrichment map of differentially expressed genes between AAG and AA groups.

Fig. 8
figure 8

Acquisition and enrichment analysis of GPER1 repression-related genes. (A) Venn diagram of GPER1 repression-related genes. (B) GO enrichment map of GPER1 repression-related genes. (C) KEGG enrichment map of GPER1 repression-related genes.

Identification of GPER1 activation-related genes

There were 271 DEGs in AG compared with control A, including 110 up-regulated genes and 161 down-regulated genes (Fig. 9A,B). 271 DEGs were enriched for 1942 GOs, including muscle system processes, T cell activation, positive regulation of cytokine production, regulation of ion transport across the membrane, response to nutrient levels, receptor ligand activity, signal receptor activator activity, muscle contraction, lipid transport, lipid localization, and signal release (Fig. 9C). 271 DEGs were also enriched for 2 KEGG pathways, tryptophan metabolism and renin secretion (Fig. 9D). The AAG had 744 up-regulated genes and 971 down-regulated genes compared to the AG group (Fig. 9E,F), and they enriched 2057 GOs and 73 KEGG pathways, including collagen-containing ECM, passive transmembrane transporter protein activity, metal ion transmembrane transporter protein activity, regulation of ion transmembrane transport, epithelial cell proliferation, actin binding, positive regulation of cell adhesion, receptor ligand activity, axonogenesis, and other functions as well as receptor interactions, cytokine-cytokine receptor interactions, PI3K-Akt signaling pathway, MAPK signaling pathway, calcium signaling pathway, and natural killer cell-mediated cytotoxicity pathways (Fig. 9G,H). In addition, 39 GPER1 activation-associated genes were obtained and they were enriched to 275 GOs and 7 KEGG pathways (Fig. 10A). GPER1 activation-associated genes were mainly associated with positive regulation of cell secretion, positive regulation of hormone secretion, hormone metabolic processes, multicellular biological processes, positive regulation of protein transport, steroid metabolic processes, cortisol synthesis and secretion, neuroactive ligand-receptor interactions, complement and coagulation cascades, IL-17 signaling pathway, and aldosterone synthesis and secretion pathways (Fig. 10B,C).

Fig. 9
figure 9

Identification and enrichment analysis of differentially expressed genes. (A,B) Volcano map and heat map of differentially expressed genes between AG and A groups. (C) GO enrichment map of differentially expressed genes between AG and A groups. (D) KEGG enrichment map of differentially expressed genes between AG and A groups. (E,F) Volcano map and heat map of differentially expressed genes between AAG and AG groups. (G) GO enrichment map of differentially expressed genes between AAG and AG groups. (H) KEGG enrichment map of differentially expressed genes between AAG and AG groups.

Fig. 10
figure 10

Acquisition and enrichment analysis of GPER1 activation-associated genes. (A) Venn diagram of GPER1 activation-associated genes. (B) GO enrichment map of GPER1 activation-associated genes. (C) KEGG enrichment map of GPER1 activation-associated genes.

Identification of 13 GPER1 activation and repression-related hub genes

The PPI network contains 147 nodes and 582 edges (Fig. 11A). The ENO2, LEP, ADM were related to many genes such as KLKB1, LEP, and KIF1A. In addition, the network diagram of the top 20 genes in each algorithm of Degree, MCC and MNC analysis was shown in Fig. 11B. A total of 13 GPER1 activation and repression-related hub genes were present in the three analyses, namely SPP1, KIF1A, SNCA, GRM4, SNAP91, TLR2, AMPH, LEP, MME, PGR, MUC5AC, ADM and ENO2 (Fig. 11C).

Fig. 11
figure 11

Identification of GPER1 activation and repression-related hub genes. (A) PPI interaction network of GPER1 activation and repression-related genes. (B) The network diagram of the top 20 genes in each algorithm of Degree, MCC and MNC analysis. (C) Venn diagram of GPER1 activation and repression-related hub genes.

Construction of upstream and downstream regulatory networks of hub genes

The volcano plots of differential miRNAs, differential lncRNAs, and differential circRNAs between AA and A groups were shown in Fig. 12A–C. A total of 66 differential miRNAs (66 down-regulated differential miRNAs), 316 differential lncRNAs (177 up-regulated differential lncRNAs and 139 down-regulated differential lncRNAs), and 3 differential circRNAs (1 up-regulated differential circRNA, 2 down-regulated differential circRNAs) between AG and A were obtained (Fig. 12D–F). The TF-miRNA–mRNA network contains 26 nodes with 36 edges (4 mRNAs, 8 TFs, 14 miRNAs). Both UBTF and hsa-miR-31-5P can regulate ENO2 and KIF1A, and both hsa-miR-23b-3p and ETS1 can regulate ENO2 (Fig. 12G). In addition, the lncRNA-miRNA–mRNA network contains 32 nodes with 48 edges (5 mRNAs, 15 miRNAs, 12 lncRNAs). LUCAT1 can regulated ADM by hsa-miR-182(1)a-5p, and TEX41 can regulated KIF1A by hsa-miR-107 (Fig. 12H). Moreover, the circRNA-miRNA-mRNA regulatory network analysis contained 4 nodes with 3 edges (2 mRNAs, 1 circRNA, 1 miRNA), hsa-circ-0001460 regulated ADM and SNAP91 by hsa-miR-137 (Fig. 12I).

Fig. 12
figure 12

Construction of upstream and downstream regulatory networks of GPER1 activation and repression-related hub genes. (A–C) The volcano plots of differential miRNAs, differential lncRNAs and differential circRNAs between AA and A groups. (D–F) The volcano plots of differential miRNAs, differential lncRNAs and differential circRNAs between AG and A groups. (G) The TF-mRNA–miRNA network. (H) The mRNA–miRNA–lncRNA network. (I) The mRNA–miRNA–circRNA regulatory network.

The expression of hub genes was verified by qRT-PCR

FGFR3, CFS1 were lowly expressed in Aal and highly expressed in Aal + G1, A549 + G1 compared to A549 and were lower in Aal than in Aal + G1 and A549 + G1 groups. LEP, FGF5, TGFβ2, TLR2 had the highest expression in Aal group and the lowest expression in A549 group. In addition, the expression of LAMB3, ADM was progressively higher in Aal, Aal + G1, and A549 + G1 compared with A549 cells. The expression of HSPG2 was high in Aal, A549 + G1 and low in Aal + G1 compared to A549, and the expression of CHRM3 was lowest in A549 and highest in Aal + G1 (Fig. 13).

Fig. 13
figure 13

The barchart revealed the expression of hub genes in A549 group, Aal group, Aal + G1 group and A549 + G1 group.

Enhanced response of A549 cells to G1 stimulation after overexpression of GPER1

Since sequencing data of A549 cells showed both GPER1 repression- and activation-related genes were involved in MAPK signaling, we assayed the effects of GPER1 expression on the phosphorylation of erk1/2, a downstream effector of MAPK, and on the response of A549 cells to G1, a specific agonist of GPER1. A549 cells with GPER1 knockdown responded with reduced sensitivity to G1 while cells with GPER1 overexpression with enhanced sensitivity when compared with control cells. After treatment with G1 for the indicated time, A549 cells with GPER1 overexpression showed a robust and durable phosphorylation of erk1/2 while cells with GPER1 knockdown presented a delayed phosphorylation of erk1/2 when compared with the control cells (Fig. 14).

Fig. 14
figure 14

Knockdown or overexpression of GPER1 influenced response of A549 cells to G1 stimulation. knockdown of GPER1 by short hairpin RNA (GPER1-sh), GPER1 overexpression (GPER1-oe).

Decreased expression of GPER1 promoted actin cytoskeleton rearrangement

Considering that bioinformatic analysis suggested GPER1 involving in cytoskeleton rearrangement, cell–cell junctions and cellular interaction with extracellular matrix, and it is well known that cytoskeleton rearrangement can exert significant influence on the latter two biological processes, we interrogated whether expression alteration of GPER1 affected cytoskeleton rearrangement. As expected, knocking down expression of GPER1 resulted in dramatic actin fibers (F-actin) polymerization, including the formation of actin stress fiber and lamellipodia (GPER1-sh vs. control for actin stress fiber, 42.0 ± 3.1% cells/view vs. 1.2 ± 0.3 cells/view, P < 0.0001), while GPER1 overexpression had little effect on the actin cytoskeleton rearrangements (GPER1-oe vs. control for actin stress fiber, 0.7 ± 0.3% cells/view vs. 1.2 ± 0.3% cells/view, P > 0.05) (Fig. 15).

Fig. 15
figure 15

Actin cytoskeleton rearrangement observation. GPER1 knockdown induced actin fibers (F-actin) polymerization in A549 cells, including the formation of actin stress fiber (thin arrows) and lamellipodia (thick arrows). F-actin were stained with TRITC-conjugated phalloidin (red labeling). knockdown of GPER1 by short hairpin RNA (GPER1-sh), GPER1 overexpression (GPER1-oe).

Knocking down GPER1 promoted the migration of A549 cells

It is well known that cytoskeletal rearrangement is a prerequisite for cell migration. We therefore investigated whether F-actin cytoskeleton rearrangement caused by GPER1 knockdown enhanced the migratory ability of A549 cells using transwell assays. As expect, the migration of A549 cells with GPER1 knockdown (102.0 ± 8.2 cells/view) was significantly enhanced, while cells with GPER1 overexpression (20.3 ± 3.5 cells/view) was significantly decreased when compared with controls (55.7 ± 2.3 cells/view; P < 0.001, P = 0.0042, respectively). Furthermore, Y27632, a pharmacological inhibitor of Rho kinase commonly playing a critical role in F-actin polymerization, notably inhibited migratory ability of A549 cells with GPER1 knockdown (67.0 ± 3.5 cells/view, P = 0.0045). These results revealed that decreased expression of GPER1 could promote the migration of A549 cells, and Rho kinase activity was required for this process (Fig. 16).

Fig. 16
figure 16

GPER1 expression and Y27632 influenced the migration of A549 cells. knockdown of GPER1 by short hairpin RNA (GPER1-sh), GPER1 overexpression (GPER1-oe).

Discussion

Lung cancer has the first highest mortality and second highest incidence among all malignances globally, thus understanding its pathogenesis is very essential to develop strategies preventing and curing this disease. In this study, we systematically demonstrated that the expression of GPER1 at the mRNA level was significantly and consistently decreased during tumorigenesis in 22 solid tumors, including NSCLC, which was in line with previous findings in other cancer types, such as breast cancer27, thyroid cancer14,28, gastric cancer17, ovarian cancer13 and hepatocellular carcinoma16. Our results further indicated GPER1 acted as a protective factor in ESCA, KIRC, UCEC, BRCA and LUAD, but was only marginally statistically significant in BRCA and LUAD; in addition, GPER1 was involved in the modulation of immune microenvironment in NSCLC and its low expression could predict the likelihood that LUSC patients will benefit from treatment with ICIs.

By constructing the PPI network, we identified 14 hub genes in LUAD, namely ITGA2, ITGB4, VWF, ITGA3, ITGB6, ICAM1, FGA, A2M, ITGAM, PLAU, CAV1, PLAT, SERPINA1 and KIT. ITGA2, ITGB4, ITGA3, ITGB6 and ITGAM all belong to the integrin family, which mainly mediate the cellular adhesion and interaction with ECM, being associated with drug resistance and worse prognosis in LUAD29,30. Up-regulation of ICAM131 and loss of FGA32 could facilitate tumor metastasis, respectively; the expression of VWF was reported to be decreased during pathogenesis and be related to favorable overall survival in LUAD33; A2M was associated with tumorigenesis and modulation of TIME in LUAD34,35; PLAU was identified as a upregulated gene in early-stage NSCLC36; caveolin-1(CAV1) and PLAT were involved in radio-resistance37 and gefitinib resistance in LUAD38, respectively; SERPINA1 promoted the occurrence of lung cancer in patients with chronic obstructive pulmonary disease39,40; KIT, a proto-oncogene, was upregulated in small cell lung cancer41.

In LUSC, data analysis identified 13 hub genes, including MMP2, SPARC, BMP1, COL1A1, COL1A2, COL3A1, COL5A1, COL5A2, FN1, BGN, DCN, FBN1 and ITGA3. Strikingly, all of these genes were also predominantly involved in ECM assembly and cellular interactions with ECM42. Functional enrichment analysis of hub genes further suggested that GPER1 signaling was mainly involved in cellular adhesion, ECM assembly, regulation of energy, metabolism and cytoskeleton both in LUAD and LUSC. These biological processes were tightly associated with tumor initiation, development and resistance to multiple drugs; furthermore, ECM was demonstrated to can protect tumor cells against attacks from immune cells43,44. Additionally, our data suggested that the expression of GPER1 was concordantly and positively correlated with the expression of six immune checkpoints in LUAD, and positively with five but negatively with only one in LUSC. Beyond these, GPER1 expression tended to be negatively associated with anti-tumor immune cells infiltration, such as Plasma cells, gamma delta T cells and M1 Macrophages in LUAD as well as naive B cells, gamma delta T cells, activated memory CD4 and CD8 T cells in LUSC. However, the TIDE analysis identified that only LUSC with low expression of GPER1 could response well to PD1/PD-L1 inhibitors. We speculated such different functions of GPER1 between LUAD and LUSC may be in that GPER1 can displays different dominant molecular subtypes and subcellular localization according to driver mutation or histology types, as reported in our previous work18,20. Therefore, further research is very needed to investigate whether GPER1 protein undergo posttranslational modification in the future.

In addition, our sequencing data analysis from A549 cell identified 13 hub genes associated with GPER1 activation and repression, including SPP1, KIF1A, SNCA, GRM4, SNAP91, TLR2, AMPH, LEP, MME, PGR, MUC5AC, ADM and ENO2. Among them, LEP45, ADM46 and ENO247 were reported to be elevated in NSCLC and were linked to the diagnosis or prognosis of NSCLC; particularly, ENO2 is being widely utilized as a predictive biomarker in lung cancer in clinical practice. Both SNAP9148 and MUC5AC49 were upregulated and associated with poor prognosis in NSCLC, while the AMPH is a suppressor of lung cancer50; the expression level of PGR could predict better overall survival in LUAD51. Both SPP151,52 and SNCA52 could facilitate immune cells infiltration and are correlated with NSCLC progression. Besides, TLR2 can connect the innate to the adaptive immune system and modulate lung cancer tumorigenesis and metastases52,53. Importantly, KIF1A, GRM4 and MME were firstly reported by us to be related to LUAD; the promoter hypermethylation of KIF1A is associated with tumorigenesis in primary head and neck squamous carcinoma53, and GRM4 was recently found to play an important role in suppressing antitumor immunity in melanoma54.

We conducted whole-transcriptome sequencing and identified GPER1-related differential miRNAs, lncRNAs, circRNAs and mRNAs for the first time in LUAD cell line; meanwhile, comprehensive regulatory networks among them were built, including TF-miRNA–mRNA, lncRNA–miRNA–mRNA, and circRNA–miRNA–mRNA regulatory networks. A study showed that hsa-miR-31-5P played an important role in the pathogenesis of NSCLC55; Chen et al. reported hsa-miR-23b-3p was involved in mTOR signaling pathway and acted as an oncogenic miRNA in cancer stem cells derived from A549 cells56. The hsa-miR-181a-5p was linked to tumor immune infiltration and cancer progression in NSCLC. The hsa-miR-107 could promote NSCLC cell proliferation by up-regulating FGFRL157. Moreover, the hsa-circ-0001460, also named circNEIL3, was related to radiotherapy resistance in lung cancer58 and was linked to poor prognosis of patients with glioma59, and the hsa-miR-137 was involved in cisplatin resistance in NSCLC60. Besides, the lncRNA LUCAT1 was first discovered in smoking-related lung cancer61 and further demonstrated to be associated with cisplatin resistance in NSCLC62; the lncRNA TEX41 was reported to play an important role in LUAD bone metastasis by promoting autophagy63. These data provide valuable information for further investigating GPER1-related regulatory networks in the future.

Finally, a total of 10 hub genes (CSF1, TLR2, LEP, ADM, HSPG2, LAMB3, CHRM3, FGFR3, FGF5 and TGFB2) were verified to be regulated by GPER1 using qRT-PCR, and Phalloidin-staining showed knocking down expression of GPER1 caused F-actin cytoskeleton rearrangement, being known to function in the initiation and development of tumors64,65. In addition, cytoskeleton rearrangement is essential for cellular interaction with ECM, ECM remolding, cell-cell adhesion and junction as well as malignant progression of tumor cells66,67.

In conclusion, our results showed that the expressions of GPER1 in mRNA level were significantly decreased during the course of tumorigenesis and it appeared to be a protective factor in many types of solid tumor. GPER1 widely participated in various biological functions in NSCLC, including cytoskeleton rearrangement, ECM assembly, regulation of energy and metabolism, and modulation of TIME. Moreover, its low expression could predict benefit from PD1/PD-L1 inhibitors in LUSC patients. However, several limitations should be considered in interpreting our findings. Firstly, these results mainly acquired based on the mRNA expression levels of GPER1. Therefore, in vitro and in vivo experiments are warranted to investigate how GPER1 protein participates in tumorigenesis and immune regulation. Secondly, although our results revealed GPER1 expression is decreased in cancer tissues than their counterpart healthy tissues, its roles in health lung tissue remain less known. Thus, elucidating its functions and relevant mechanisms in both healthy and tumor tissues can provide promising strategies for preventing and curing lung cancer in the future.