Introduction

Lung cancer is one of the most frequently diagnosed cancer and remains the major cause of cancer-related deaths worldwide1,2. More than 80% of total diagnoses were NSCLC, and lung adenocarcinoma (LUAD) is the most common subtype within NSCLC classifications3. The 5-year survival rate of NSCLC highly depends on the stage, with roughly 80% in stage I, only 13–60% in stage II to stage III and less than 10% in metastatic disease4. Although the standard of care for early-stage NSCLC is still surgical resection, the treatment of advanced or metastatic diseases has undergone remarkable changes during the past decade due to the advances in molecular targeted therapy and immunotherapy5,6,7. However, because of targeted therapy restricted to LUAD containing driver mutations and responses from immunotherapy occurring uncommonly, only a minority of patients benefit from these therapies8. Acquired resistances are also the major challenge of both therapies9. Therefore, detection of novel biomarkers is crucial for further therapeutic advances.

Tubulin epsilon and delta complex 2 (TEDC2), also named Chromosome 16 open reading frame 59 (C16orf59), is a protein coding gene whose functions are poorly identified yet. Previous studies reported that TEDC2 could highly express in central nervous system lymphoma and might contribute to the tumorigenesis of LUAD10,11, but the prognostic and immune features of TEDC2 in LUAD have not been comprehensively characterized. Therefore, this study extracted LUAD samples from The Cancer Genome Atlas (TCGA) and Gene-Expression Omnibus (GEO) databases to identify the diagnostic and prognostic potential of TEDC2, and characterize the association between TEDC2 and immune microenvironment of LUAD via the application of various algorithms. Our bioinformatics analysis would provide a novel insight into the immune features of TEDC2 and reveal the therapeutic potential of targeting TEDC2 in LUAD.

Material and methods

Data acquisition

A total of 535 LUAD samples and their clinicopathologic characteristics were downloaded from The Cancer Genome Atlas (TCGA) database (https://portal.gdc.cancer.gov/) and analyzed in this study. The datasets (GSE18842, GSE7670, GSE27262 and GSE140797) were obtained from Gene Expression Omnibus (GEO) database (https://www.ncbi.nlm.nih.gov/gds) to verify the differential expression of TEDC2 between LUAD and normal tissues. The protein levels of TEDC2 between LUAD and normal tissues were compared in the Human Protein Atlas (https://www.proteinatlas.org/) which provides protein immunohistochemistry in normal human tissues and tumor tissues.

Co-expression gene analysis

The co-expression genes positively and negatively correlated with TEDC2 expression were explored through the LinkFinder module in the LinkedOmics database (http://www.linkedomics.org/login.php), which is a web portal that analyzes multi-omics data from TCGA datasets12. LinkInterpreter module was used to perform the gene set enrichment analyses (GSEA) of Gene Ontology biological process (GO_BP) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways.

Correlation between TEDC2 and immune microenvironment

The assessment of immune cell infiltration was conducted by the single-sample gene set enrichment analysis (ssGSEA) which contained 28 immune cells13,14. The infiltration of stromal and immune cells was further assessed by Estimation of STromal and Immune cells in MAlignant Tumor tissues using Expression data (ESTIMATE) algorithm using “estimate” R package15. The correlation of TEDC2 expression with the abundance of six types of infiltrating immune cells (CD8+ T cells, CD4+ T cells, macrophages, B cells, dendritic cells (DCs) and neutrophils) in LUAD was evaluated by TIMER2.0 database (http://timer.cistrome.org/)16.

To study the relationship between TEDC2 gene expression and immune checkpoints in LUAD, we extracted 47 common immune checkpoints and used Spearman’s rank correlation coefficient to analyze the correlation. The associations of TEDC2 expression with multiple markers for immune cells and immune checkpoints leading to T cell exhaustion were further investigated in the Gene Expression Profiling Interactive Analysis (GEPIA) 2 database which is an online database providing key interactive and customizable functions17.

Statistical analysis

The patients were divided into two groups according to the median expression of TEDC2. The measurement data were statistically analyzed with t test if normally distributed. Otherwise, Mann–Whitney U test was used. The receiver operating characteristic (ROC) curve was used to evaluate the potential diagnostic value of TEDC2. Overall survival (OS), disease specific survival (DSS) and progress free survival (PFS) analyses were performed by Kaplan–Meier curve and log-rank test. Univariate and multivariable analyses were conducted by the Cox proportional hazards regression models to determine the prognostic value of TEDC2 in survival outcomes. Multivariate regression model included the factors that P value was less than 0.1 in univariate analysis. R statistical software (version 3.6.3) and SPSS software (version 24.0) were used to perform the analyses in the study. Statistical significance was set at P value < 0.05 (All P values presented were 2-sided).

Results

TEDC2 is upregulated in LUAD

The baseline characteristics of patients with LUAD obtained from TCGA database were summarized in Table 1. The expression of TEDC2 in LUAD samples was significantly upregulated at mRNA levels in TCGA database (P < 0.001, Fig. 1A), and the mRNA expression of TEDC2 was positively correlated with advanced T stage (T1 vs. T2, P = 0.008, Fig. 1B), N stage (N0 vs. N2, P = 0.011, Fig. 1C), M stage (M0 vs. M1, P = 0.044, Fig. 1D) and pathologic stage (stage I vs. stage III, P = 0.025, Fig. 1E), but the significant differences were not observed in T3 and T4, N1 and N3, stage II and stage IV.

Table 1 Baseline characteristics of the LUAD patients.
Figure 1
figure 1

TEDC2 expression status in LUAD. (A) TEDC2 expression was higher in LUAD tissues than in normal tissues. (B) High TEDC2 expression was associated with advanced T stage. (C) High TEDC2 expression was associated with advanced N stage. (D) High TEDC2 expression was associated with advanced M stage. (E) High TEDC2 expression was associated with advanced pathologic stage. *P < 0.05; **P < 0.01; ***P < 0.001. The figure was created by R statistical software (version 3.6.3).

The differential expression of TEDC2 between LUAD and normal tissues was verified by the data sets GSE18842 (44 pairs LUAD and adjacent normal tissues), GSE7670 (28 pairs LUAD and adjacent normal tissues), GSE27262 (25 pairs LUAD and adjacent normal tissues) and GSE140797 (7 pairs LUAD and adjacent normal tissues). The results confirmed that mRNA expression of TEDC2 was higher in LUAD tissues than that in paracancer tissues (Fig. 2A–D). The expression of TEDC2 at the protein level was also analyzed in the Human Protein Atlas, and we found that TEDC2 protein expression increased significantly in patients with LUAD (Fig. 3).

Figure 2
figure 2

Validation of higher TEDC2 expression in LUAD than that in normal tissues in (A) GSE18842, (B) GSE7670, (C) GSE27262 and (D) GSE140797 datasets. **P < 0.01; ***P < 0.001. The figure was created by R statistical software (version 3.6.3).

Figure 3
figure 3

The protein level of TEDC2 was higher in LUAD tissue than normal tissue in the Human Protein Atlas (Antibody HPA055389, 10X). The figure was download in Human Protein Atlas (https://www.proteinatlas.org/).

TEDC2 possesses diagnostic and prognostic value for LUAD

The diagnostic value of TEDC2 was determined by ROC curve analyses which were performed between tumor tissues with different stage and normal tissues. As shown in Fig. 4A, high TEDC2 levels could effectively distinguish LUAD tissues from normal tissues. Then, subtype analyses were performed in different TNM stage, and the results suggested that TEDC2 possessed satisfactory diagnostic value regardless of early or advanced stage (Fig. 4B–N).

Figure 4
figure 4

Analysis of the diagnostic value of TEDC2. (A) ROC curve analysis of the diagnostic value of TEDC2 in patients with LUAD. (BE) ROC curve analysis of the diagnostic value of TEDC2 in patients with LUAD at T stage. (FH) ROC curve analysis of the diagnostic value of TEDC2 in patients with LUAD at N stage. (I,J) ROC curve analysis of the diagnostic value of TEDC2 in patients with LUAD at M stage. (KN) ROC curve analysis of the diagnostic value of TEDC2 in patients with LUAD at pathologic stage. The figure was created by R statistical software (version 3.6.3).

Kaplan–Meier analysis suggested that high expression of TEDC2 was significantly associated with poor OS (P < 0.001, Fig. 5A), DSS (P = 0.002, Fig. 5B) and PFS (P = 0.009, Fig. 5C) in LUAD patients. Univariate Cox regression analysis indicated that TEDC2 expression was correlated with OS (HR 1.704, 95% CI 1.273–2.282), DSS (HR 1.796, 95% CI 1.240–2.601) and PFS (HR 1.422, 95% CI 1.092–1.851) (Table 2). Furthermore, multivariate Cox regression revealed that TEDC2 expression was an independent adverse prognostic indicator for OS (HR 1.498, 95% CI 1.057–2.124, P = 0.023) and DSS (HR 1.774, 95% CI 1.120–2.810, P = 0.015) in LUAD patients (Fig. 6).

Figure 5
figure 5

Kaplan–Meier survival curves for (A) overall survival (OS), (B) disease specific survival (DSS) and (C) progress free survival (PFS) of the LUAD patients with high and low TEDC2 expression level. The figure was created by R statistical software (version 3.6.3).

Table 2 Cox regression analysis of TEDC2 and clinicopathologic characteristics with survival outcomes in LUAD.
Figure 6
figure 6

Multivariate Cox regression analysis of TEDC2 expression and clinicopathologic characteristics with survival outcomes in LUAD. (A) Multivariate Cox regression analysis in overall survival (OS). (B) Multivariate Cox regression analysis in disease specific survival (DSS). (C) Multivariate Cox regression analysis in progress free survival (PFS). The figure was created by R statistical software (version 3.6.3).

TEDC2 co-expression network in LUAD

The LinkFinder module in the LinkedOmics database was used to explore the co-expression pattern of TEDC2 in LUAD. The results showed that 10,003 genes were positively correlated with TEDC2, while 9985 genes were negatively correlated with TEDC2 (Supplementary figure 1). Heat maps displayed the top 50 genes positively and negatively associated with TEDC2 (Fig. 7A,B), and correlation coefficients and P values were presented in supplementary table 1 and 2. In the LinkInterpreter module, GO term annotation showed that co-expressed genes of TEDC2 were mainly involved in chromosome segregation, DNA replication, double-strand break repair, mitotic cell cycle phase transition, spindle organization, cell cycle checkpoint, DNA recombination, telomere organization, chromatin assembly or disassembly, ncRNA processing, etc. (Fig. 8A). KEGG pathway analysis indicated enrichment in cell cycle, DNA replication, spliceosome, homologous recombination, proteasome, mismatch repair, Fanconi anemia pathway, RNA transport, ribosome biogenesis in eukaryotes, base excision repair, etc. (Fig. 8B).

Figure 7
figure 7

The co-expression genes with TEDC2 in LUAD from the LinkedOmics database. (A,B) Heat maps of top 50 genes positively and negatively correlated to TEDC2 in LUAD, respectively. The figure was created by the LinkedOmics database (http://www.linkedomics.org).

Figure 8
figure 8

(A) GO annotations and (B) KEGG pathways of TEDC2 in LUAD (Kanehisa, M. and Goto, S., 2020. KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Res. 28, 27-30). The figure was created by the LinkedOmics database (http://www.linkedomics.org).

TEDC2 is correlated with immune infiltration in LUAD

The ssGSEA and the ESTIMATE algorithms were employed to characterize the immune features of TEDC2. According to the ssGSEA algorithm, the expression of TEDC2 was negatively correlated with most infiltrated immune cells (Fig. 9A). Moreover, the expression of TEDC2 was significantly negatively associated with the stromal score of LUAD (R = − 0.325, P < 0.001, Figs. 9B), as well as the immune score (R = − 0.285, P < 0.001, Fig. 9C). The ESTIMATE score which was calculated as the sum of the stromal and immune scores was also negatively correlated with the expression of TEDC2 (R = − 0.332, P < 0.001, Figs. 9D). The correlation of TEDC2 expression with immune cell infiltration was further explored in the TIMER2.0 database. The result indicated significantly negative association of TEDC2 expression with DC (R = − 0.138, P = 2.18e−03) and B cell (R = − 0.108, P = 1.65e−02) (Fig. 10).

Figure 9
figure 9

The correlation between TEDC2 expression and immune microenvironment in LUAD. (A) The correlation of TEDC2 expression with infiltrated immune cells by ssGSEA algorithm. (BD) The correlation of TEDC2 expression with stromal score, immune score, and ESTIMATE score by ESTIMATE algorithm, respectively. The figure was created by R statistical software (version 3.6.3).

Figure 10
figure 10

The correlation of TEDC2 expression with immune infiltration in LUAD from the TIMER2.0 database. The figure was created by TIMER2.0 database (http://timer.cistrome.org/).

The analysis of the relationship between TEDC2 expression and immune checkpoints showed that TEDC2 was associated with most immune checkpoints in LUAD (Fig. 11), among which TEDC2 was significantly positively correlated with lymphocyte activating 3 (LAG3), CD276, programmed cell death 1 (PDCD1), tumor necrosis factor receptor superfamily member (TNFRSF) 25, TNFRSF4 and TNFRSF18 (Table 3). To further investigate the impact of TEDC2 on immune microenvironment in LUAD, the associations of TEDC2 with immune marker sets of immune cells and immune checkpoints leading to T cell exhaustion were analyzed in the GEPIA2 database (Table 4). We found that TEDC2 expression was negatively associated with the levels of some marker sets marking DC, B cell, tumor-associated macrophage (TAM), monocyte and neutrophil. Moreover, TEDC2 expression was positively correlated with immune checkpoints leading to T cell exhaustion such as PDCD1, LAG3 and CD276.

Figure 11
figure 11

The correlation of TEDC2 expression with common immune checkpoints. The figure was created by R statistical software (version 3.6.3).

Table 3 The relationship between TEDC2 and immune checkpoints.
Table 4 Correlation analysis between TEDC2 and markers of immune cells in GEPIA2 database.

Discussion

The past decade in the LUAD research has been characterized by a greater understanding of cancer biology and management, with targeted therapy and immunotherapy providing significant survival benefits and manageable toxicity profiles in selected patients. However, major challenges still remain, including low response rate and drug resistance18,19. Thus, there is a clear urgent need to identify new driver gene alterations to expand the population that benefit from targeted therapy or immunotherapy, predict treatment responses and prevent or overcome the drug resistance. In this study, through employing open-access databases for comprehensive analyses, we found that TEDC2 could be involved in the tumorigenesis and progression of LUAD, and might contribute to the formation of immunosuppressive microenvironment in LUAD patients. These results suggested that TEDC2 could be regarded as a novel potential target for the treatment of LUAD.

The role of TEDC2 has been explored in few studies. Lim, D. H. et al. reported that TEDC2 could be predominantly expressed in primary central nervous system (CNS) diffuse large B-cell lymphoma (DLBCL) compared to non-CNS DLBCL11. Hsu, M. K. et al. suggested that TEDC2 might be one of the potential genes for the tumorigenesis of LUAD and the construction of accurate classification systems distinguishing tumor from normal tissues10. TEDC2 was also seemed to act as a potential marker for treatment effect in male schizophrenia patients20. However, TEDC2 is not yet thoroughly studied and currently its function remains unclear. Our study preliminarily demonstrated a part of functions of TEDC2 in LUAD.

According to the analyses of data from TCGA and GEO databases, the mRNA expression of TEDC2 was significantly upregulated in LUAD compared to normal tissues. It was worth mentioning that although the sample of GSE140797 was relatively small, significant differences of TEDC2 expression were detected in all pairs. Moreover, we have used other datasets from GEO database to confirm the result of the GSE140797 analysis. The protein level of TEDC2 was also confirmed to be higher in LUAD by immunohistochemistry analysis in Human Protein Atlas. In addition, ROC curve analysis found that TEDC2 could distinguish patients with LUAD from the normal population regardless of tumor stage. These evidences indicated that TEDC2 might play an important role in the tumorigenesis of LUAD and could serve as a new diagnostic marker for LUAD patients. To determine whether TEDC2 could be used as a prognostic marker in LUAD, we investigated the prognosis of LUAD patients with different TEDC2 expression levels. Kaplan–Meier curve analysis revealed that high TEDC2 expression was associated with inferior survival outcomes including OS, DSS and PFS. Cox regression analysis proved that high TEDC2 expression was an independent risk factor for poor prognosis, suggesting the potential prognostic value of TEDC2 in LUAD.

To unravel the biological functions of TEDC2, co-expression analysis and functional enrichment analysis were performed. LinkedOmics database analysis pointed out that most co-expressed genes with TEDC2 were mainly enriched in mitotic cell cycle processes, including chromosome segregation, DNA replication and cell cycle phase transition, suggesting that these genes could act as oncogenes to promote LUAD by accelerating cell cycle phase.

Another vital aspect of this study was that TEDC2 might be involved in regulating immune microenvironment in LUAD. There is increasing evidence proving the important role of tumor immune microenvironment in cancers21,22. With the application of ssGSEA and ESTIMATE algorithms, we identified that the expression of TEDC2 was significantly negatively associated with immune infiltrates, which implied that TEDC2 might induce immunosuppressive context. The relationships between TEDC2 and immune infiltrates in LUAD were also analyzed by TIMER2.0 and GEPIA2 databases. The results demonstrated that TEDC2 expression showed negative correlation with DC and B cell. DC is one of the major regulators of immune response and can elicit T cell response, and previous studies have proved that DC could be associated with cytotoxic T cell infiltration and predict favorable outcome23,24. B cell has emerged as a key player in immune microenvironment and correlates with better prognosis in NSCLC25,26.

In addition, we conducted a systematic analysis of more than 40 common immune checkpoint genes and found that PDCD1, LAG3 and CD276 were highly correlated with TEDC2 expression, and the analysis of GEPIA2 database also proved these correlations. PDCD1 is an inhibitory receptor and negative regulator of T cell function, which can promote disease progression in patients with NSCLC27. LAG3 is able to function in coordination with other checkpoints such as PDCD1 to inhibit the activity of effector T cells and promote the suppressive activity, but effects and signaling events after LAG3 activation have not been completely understood28. CD276 expression on lung cancer leads to a lower number of tumor infiltrating lymphocytes and promotes lymph node metastasis, suggesting a role for CD276 in immune evasion and tumor progression29. Besides, TEDC2 expression was also correlated with TNFRSF25, TNFRSF4 and TNFRSF18 in our analysis, which might be correlated with immune evasion and poor outcome in lung cancer, but the molecular mechanisms of these immune checkpoint molecules still remain elusive and need further investigation30,31.

This study preliminarily demonstrated the diagnostic and prognostic values of TEDC2 in LUAD, as well as the immune characteristics. However, there were certain limitations in our study. First, data heterogeneity was inevitable due to all the data in this study obtained from online databases. Second, gene expression analysis based on open-source databases might not be sufficiently accurate and the experiment examining biological functions of TEDC2 lacked, which required additional experiments to provide a better understanding of the underlying biological mechanisms of TEDC2. Finally, although the regulatory effect of TEDC2 on immune microenvironment was evaluated by various algorithms in this study and the results reached statistical significance, the actual status of specific immune processes should be further investigated by in vitro and vivo models.

Conclusion

The preliminary results in this study characterize the clinical and immune features of TEDC2 in LUAD. The high expression of TEDC2 was associated with poor prognosis and immunosuppressive microenvironment. TEDC2 could be used as a biomarker to predict the prognosis and as a potential target for treatment in LUAD patients.