Introduction

Global Cancer Statistics 2022 showed that lung cancer is not only the most common type of cancer worldwide, accounting for 12.4% of all cancer cases, but also the leading cause of cancer deaths, accounting for 18.7% of cancer-related deaths worldwide1. Non-small cell lung cancer (NSCLC) is the most common type of lung cancer, accounting for approximately 85% of all lung cancer cases with a survival rate of less than 15%, with lung adenocarcinoma (LUAD) and lung squamous cell carcinoma (LUSC) being the most common NSCLC subtypes2. Lung squamous cell carcinoma tends to occur in men and is strongly associated with smoking, while LUAD usually occurs in women who do not smoke3. Remarkably, the Xuanwei region of China’s Yunnan Province has one of the highest rates of lung cancer in the world and often exhibits familial clustering4. Xuanwei Lung cancer (XWLC) refers to a lung cancer in Xuanwei City and Fuyuan County, Yunnan Province, China, with high morbidity and mortality due to the extensive use of local bituminous coal, characterized by a younger age of onset, significant gender differences, and a mortality rate much higher than the national average5. Lung cancer in this region is dominated by adenocarcinoma, and the risk is higher among female residents6. Studies have shown that the high incidence of lung cancer is not significantly related to gender or smoking habits, but is mainly closely related to the behavior of burning bituminous coal in the family7. While immunotherapy and targeted therapy have significantly improved the efficacy of NSCLC treatment over the past 20 years, the majority of NSCLC patients develop resistance to targeted drugs and disease progression at advanced stages8. Therefore, the search for new diagnostic and prognostic biomarkers for NSCLC remains critical.

Keratin, also known as cytokeratin, is an intermediate filament (IF) forming protein of epithelial cells and is expressed in both normal and malignant epithelial cells9. These small molecular weight structural proteins (molecular weight from 40 to 76 kDa) are able to interact with a variety of regulatory proteins to initiate a signaling cascade10. As a typical marker of epithelial cells, keratin is widely expressed in a variety of cancers and plays an important role in regulating cell migration, invasion and epithelial-mesenchymal transition (EMT)11. The Keratin 80 (KRT80) gene, located on chromosome 12q13, encodes a protein of 452 amino acids that promotes the formation of intermediate filaments (IF) heterodimers, structurally closer to type II hair keratin12. KRT80 is overexpressed in a variety of tumors and can effectively enhance the proliferation, invasiveness and migration of cancer cells, which is closely related to poor prognosis of cancer patients13.

In this study, the expression of KRT80 in NSCLC was investigated by immunohistochemistry, and its correlation with clinicopathological features and prognosis were analyzed. We also paid special attention to the differences in the expression of KRT80 in different geographical regions (Xuanwei and Non-xuanwei region) and different histological subtypes (LUAD and LUSC). The study results revealed that the expression level of KRT80 in NSCLC patients in Xuanwei area was significantly higher than that in Non-xuanwei area, and the expression level in lung adenocarcinoma was significantly higher than that in lung squamous cell carcinoma. In addition, the expression of KRT80 may have a certain correlation with the clinicopathological features such as lymph node metastasis and tumor size in NSCLC patients, but this relationship still needs further study to clarify. Although the association between KRT80 and patient outcomes is not significant, its potential as a diagnostic biomarker for NSCLC is of concern. The findings of this study not only provide preliminary evidence for the potential of KRT80 as a biomarker for NSCLC, but also reveal differences in its expression across geographic regions and lung cancer subtypes, providing new directions for future research. Future studies can further explore the molecular mechanism of KRT80 in NSCLC and its interaction with the tumor immune microenvironment, in order to provide a new perspective for the diagnosis and treatment of NSCLC.

Materials and methods

The online database

To assess the expression of KRT80 in NSCLC, our comprehensive database from the Gene Expression Omnibus (GEO, https://www.ncbi.nlm.nih.gov/gds/?term) website has been extracted using Affymetrix HG - U133_Plus_2 chip data. The GSE19188 dataset, which contained genome-wide gene expression analyses from 91 NSCLC and 65 normal lung tissues, was incorporated into the analysis14. In addition, we incorporated the GSE75037 dataset, which used Illumina WG6-V3 expression chips to analyze gene expression profiles in 83 lung adenocarcinomas and 83 matched adjacent non-malignant lung tissues, to analyze the expression of KRT80 in NSCLC15. To further validate and complement our findings, we utilized the GEPIA2 database and Kaplan-Meier (K-M) plotter database to analyze the potential impact of KRT80 mRNA expression on NSCLC. GEPIA2 is an online database (http://gepia2.cancer-pku.cn/#index) that integrates TCGA RNA-seq expression and clinical data for survival analysis of selected genes16. Through the expression analysis and survival analysis of KRT80 in the GEPIA2 database, we further explored the role of KRT80 in NSCLC and further explained the association between KRT80 and patient prognosis. Kaplan-Meier (K-M) plotter database (https://kmplot.com) includes gene expression data from databases such as the Gene Expression Omnibus (GEO), the European Genome Genome Archive (EGA), and the Cancer Genome Atlas (TCGA)17. It allows us to assess the correlation between specific gene expression levels and patient survival time by analyzing data from a large number of patients. Using this database, we were capable of exploring the potential association between KRT80 mRNA expression levels and survival in patients with NSCLC, providing us with an important tool to evaluate the potential of KRT80 as a prognostic biomarker.

Clinical data collection

In this study, we collected clinical data from patients with NSCLC, which has been approved by the Ethics Committee of the First Affiliated Hospital of Kunming Medical University. All participants provided written informed consent. The study included 175 patients, including 107 men and 68 women, with a median age of 57 years and an age range of 27 to 78 years. NSCLC patients were divided into the Xuanwei lung cancer (XWLC) group (72 patients) and the non-Xuanwei lung cancer group (103 patients), based on the definition of three consecutive generations of residents in coal producing areas and a history of coal burning for more than 10 years. The sample collection period was from December 2008 to October 2015, and 175 pairs of samples were obtained from the Third Affiliated Hospital of Kunming Medical University. The samples were treated with paraffin embedding after being fixed with 10% formaldehyde at room temperature for 24 h. We obtained samples of primary NSCLC tissue and its paired non-cancerous lung tissue (5 cm from the tumor edge). The tumor samples were staged according to the TNM Staging 7 th edition of Lung cancer and the American Joint Committee on Cancer (AJCC) staging system, including clinical and pathological staging18.

The patients included in this study were all diagnosed with stage I-III NSCLC, and patients receiving preoperative radiation, chemotherapy, or biologic therapy were excluded. None of the patients had a diagnosis of multiple primary cancers in other organs or tissues. Clinical data are derived from patient records. All 175 NSCLC patients underwent complete surgical resection, including primary tumor and mediastinal and bronchial lymph node dissection. All patients had valid and accurate follow-up data, ranging from 2 months to 110 months. Patient follow-up data were updated through medical records, interviews, and telephone calls.

According to the National Comprehensive Cancer Network (NCCN) guidelines19, adjuvant chemotherapy is not recommended for patients with stage IA, but may be beneficial for some patients with stage IB. For stage II and III NSCLC, which are challenging and controversial to treat, four cycles of platinum-based adjuvant chemotherapy are recommended to address micrometastatic disease.

Immunohistochemistry (IHC)

To assess the expression levels and distribution patterns of KRT80 in 175 pairs of non-small cell lung cancer (NSCLC) tissues, immunohistochemical (IHC) staining was conducted following established methodologies5. Tissue specimens were sectioned into 4 μm thick slices and subjected to heat treatment at 70 °C for 2 h. Dewaxing was performed using xylene, followed by rehydration through a graded alcohol series. For the detection of the KRT80 protein, the tissue sections were incubated with a rabbit polyclonal KRT80 antibody at a dilution of 1:500 (Catalog No: 16835-1-AP, Proteintech) at 4 °C for 12 h. Subsequently, the slices were rinsed three times with phosphate-buffered saline (PBS) and incubated with an avidin-biotin complex (Vector Laboratories, Inc.). The chromogenic reaction was conducted utilizing 3,3’-diaminobenzidine as the chromogen. A volume of 30 µL of the secondary antibody, provided in its original concentration from the DAB kit (Dako; Agilent Technologies, Inc.), was applied to the tissue sections and incubated at 37 °C for 30 min. Subsequently, the tissue sections underwent dehydration through a graded ethanol series at room temperature and were mounted in neutral gum for examination via optical microscopy (Leica Microsystems GmbH) at magnifications of 100x and 200x.

Evaluation of IHC

In the immunohistochemical (IHC) analysis, brown-yellow staining was predominantly localized within the cytoplasm, with occasional presence in the nuclei of KRT80-positive cells. The staining results were evaluated by two independent pathologists from the Third Affiliated Hospital of Kunming Medical University, who employed fluorescent inverted microscopy at magnifications of x100 and x200, while remaining blinded to the clinical parameters. The magnitude and intensity of the immune response were evaluated using a semi-quantitative approach. The specific scoring criteria were defined as follows: a score of 0 indicated 0% immune response cells; a score of 1 corresponded to less than 5% immunoreactive cells; a score of 2 represented 5–50% immune response cells; and a score of 3 denoted more than 50% immune response cells. The intensity of immunoreactivity was scored as follows: 0 for negative, 1 for weak, 2 for intermediate, and 3 for strong. These scores were categorized based on the cumulative immunoreactivity scores: 0 was classified as negative, 1–2 as weak, 3 as moderate, and 4–6 as strong20. A final immunoreactivity score greater than 0 was considered positive, whereas a score of 0 was deemed negative.

Statistical analysis

All statistical analyses were performed using R software (version 4.0.4) and SPSS version 26 (SPSS for Windows, release 26.0; SPSS, Chicago, IL, USA) and Prism 9.8 (GraphPad software, Inc., San Diego, CA, USA). Kaplan-Meier curve and log-rank test were used to evaluate the difference in survival rate among all groups. Wilcoxon test or Student ‘st test was used for continuous data, and Fisher exact test or Chi-square test was used for categorical variables. P < 0.05 was the threshold of significance, and the difference was statistically significant.

Results

KRT80 in online database

In this study, we performed a preliminary analysis of the GSE19188 dataset, which included 91 NSCLC cases and 65 normal lung tissue samples, and the GSE75037 dataset, which included 83 lung adenocarcinomas and 83 corresponding adjacent normal lung tissue. The analysis results (Fig. 1a) showed that the expression level of KRT80 in NSCLC was significantly higher than that in normal lung tissue (p < 0.001), and there was significant heterogeneity in the expression level of KRT80 in different pathological subtypes of NSCLC. Specifically, the expression level of KRT80 in both LUAD and LUSC was significantly higher than that in normal lung tissue (p < 0.001), and the expression level in lung adenocarcinoma was higher than that in lung squamous cell carcinoma. In contrast, there was no significant difference in KRT80 expression in large cell carcinoma compared with normal lung tissue. We performed a confirmatory analysis through the GEPIA2 database and the results were consistent with our preliminary analysis (Fig. 1b-c). In addition, we searched KRT80 survival curve data from the GEPIA2 database and Kaplan-Meier plotter database (Fig. 1d). The results showed that in the GEPIA2 database, the survival prognosis of NSCLC patients with high expression of KRT80 was worse (p = 0.017), while in LUAD and LUSC patients, there was no significant difference in survival prognosis between the high expression and low expression groups of KRT80. In Kaplan-Meier plotter database, there was no significant correlation between the expression level of KRT80 and survival prognosis.

Fig. 1
figure 1

KRT80 in Online databases. (a) Expression of KRT80 in NSCLC in GEO database. (b-c) Box diagram of KRT80 expression difference in GEPIA2 database and its pan-cancer expression. (d) Survival curve of KRT80 in GEPIA2 database and Kaplan-meier plotter database (231849_at). LCC: Large cell lung cancer, LUAD: lung adenocarcinoma, LUSC: lung squamous cell carcinoma, * Indicates a statistical difference.

The expression of KRT80 in NSCLC and its correlation with clinicopathological parameters

In this study, 175 samples were divided into KRT80-negative group and KRT80-positive group according to KRT80 expression. The positive and negative expressions of KRT80 in cancer tissues and adjacent normal lung tissues were shown in Fig. 2a. Table 1 shows in detail the expression pattern of KRT80 in NSCLC patients and its correlation with multiple clinicopathological features. Overall, the positive expression rate of KRT80 in cancer tissues of NSCLC patients (64.6%, 113/175) was significantly higher than that in para-cancer tissues (29.7%, 52/175) (p = 0.000). Chi-square test results showed that the positive expression of KRT80 in NSCLC was significantly correlated with area (P = 0.000), age (p = 0.045) and pathological type (p = 0.001), but was not significantly correlated with gender, smoking history, drinking history, pathological type, pT stage, pTNM, and tumor size. Regionally, the positive rate of KRT80 expression was 83.3%(60/72) in xuanwei area and 51.5% (53/103) in Non-xuanwei area. In terms of pathological types, the positive rate of KRT80 was 71.8% (94/131) in LUAD and 43.2% (19/44) in LUSC. The tumor stages were: stage I 86 cases (49.1%), stage II 34 cases (19.4%), and stage III 55 cases (31.5%). The median follow-up time was 39 months, and the follow-up time ranged from 2 to 110 months. Of the 175 patients, 71 (40.6%) died from non-small cell lung cancer, while 104 (59.4%) were still alive at the end of follow-up.

Fig. 2
figure 2

Expression of KER80 in patients with NSCLC. (a) Immunohistochemical staining of KRT80 in patients with NSCLC. (b) Univariate and multivariate cox regression analysis of 175 patients with NSCLC. (c-d) Survival curve and ROC curve of NSCLC patients. p < 0.05 was statistically significant.

Table 1 Expression of KRT80 in NSCLC patients and its correlation with clinicopathological parameters.

In order to further explore the influence of factors such as KRT80 expression on the prognosis of patients with NSCLC, univariate and multivariate cox regression analysis was performed (Fig. 2b). Univariate analysis showed that differentiation, pT stage, pN stage, pTNM stage and tumor size were significantly correlated with prognosis (p < 0.05). However, in multivariate analysis, only the pN period remained significant (p = 0.020). Patients with non-small cell lung cancer with lymph node metastasis had a 2.656 times greater risk of disease progression or death than those without lymph node metastasis (HR: 2.656, 95% CI :1.169–6.032). Kaplan-Meier (K-M) survival curve analysis showed that there was no significant difference in survival prognosis between KRT80 positive expression group and negative group in NSCLC patients (p = 0.35, Fig. 2c). To assess the potential of KRT80 as a biomarker for NSCLC, receiver operating characteristic (ROC) curves were constructed. The area under the curve (AUC) is 0.6686 (p < 0.0001, Fig. 2d), which corresponds to a exploratory diagnostic potential classifier in NSCLC.

The expression of KRT80 in NSCLC and its correlation with clinicopathological parameters in different regions

We divided 175 patients into Xuanwei group (72 cases) and non-Xuanwei group (103 cases) according to the region. Table 2 details the expression of KRT80 in both Xuanwei and non-Xuanwei patients and its correlation with clinicopathological parameters. The results showed that the expression of KRT80 was not significantly correlated with most clinicopathological features (p > 0.05). Specifically, in the Xuanwei group, the expression of KRT80 was moderately correlated with the degree of tumor pathological differentiation (Low vs. Moderately-Gigh vs. Unknown, p = 0.032). In the Non-xuanwei group, the expression of KRT80 was correlated with the histological type of the tumor (p = 0.037). In Xuanwei group, univariate cox analysis showed that pN stage (p = 0.000), pTNM stage (p = 0.004) and Tumor size (p = 0.003) were significantly correlated with prognosis (p < 0.05)(Fig. 3a). However, in the multivariate cox analysis, only the significance of Tumor size(p = 0.012) was retained. The K-M curve showed (Fig. 3b) that there was no significant correlation between the expression of KRT80 and patient survival in Xuanwei group (p = 0.54). As shown in the ROC curve (Fig. 3c), the AUC value is 0.7014 (p < 0.0001), and the maximum AUC is 1, which indicates a perfect classifier for Xuanwei NSCLC. In the non-Xuanweil group, univariate cox analysis showed that pT stage (p < 0.05), pN stage (p = 0.001), pTNM stage(p = 0.000) were significantly correlated with prognosis (Fig. 3d). However, in the multivariate cox analysis, only the significance of the pT stage was retained (regardless of the T1 vs. T2 vs. T3 vs. T4 group or the T1-2 vs. T3-4 group). As shown in the K-M curve (Fig. 3e), there was no significant correlation between KRT80 expression and survival prognosis in the non-Xuanwei group (p = 0.43). As shown in the ROC curve (Fig. 3.f), the AUC value is 0.6456 (p = 0.0003), which indicates a moderate diagnostic performance for non-Xuanwei NSCLC.

Table 2 The expression of KRT80 in Xuanwei and Non-xuanwei patients and its correlation with clinicopathological parameters.
Fig. 3
figure 3

Expression of KRT80 in Xuanwei and Non-xuanwei region. (a) Univariate and multivariate cox analysis of KRT80 in Xuanwei patients. (b-c) Survival curve and ROC curve of KRT80 patients in Xuanwei area. (d) Univariate and multivariate cox analysis of KRT80 in Non-xuanwei patients. (e-f) Survival curve and ROC curve of KRT80 patients in Non-xuanwei area. Statistical significance was shown by p-values that were less than 0.05.

The expression of KRT80 and its correlation with clinicopathological parameters in different lung cancer subtypes

We divided 175 patients into lung adenocarcinoma group (72 cases) and lung squamous cell carcinoma group (103 cases) according to histological type. Table 3 details the expression of KRT80 in lung adenocarcinoma group and lung squamous cell carcinoma group and its correlation with clinicopathological parameters. The results showed that in the lung adenocarcinoma group, the expression of KRT80 was only correlated with the region (p = 0.001), and the positive expression rate of KRT80 in Xuanwei region (54/63, 85.7%) was significantly higher than that in Non-xuanwei region (40/68, 58.8%). There was no significant correlation between the expression of KRT80 and clinicopathologic features in lung squamous cell carcinoma group (p > 0.05). In the lung adenocarcinoma group, univariate cox analysis showed that KRT80 expression (p = 0.049), area (p = 0.009), differentiation (p = 0.049), pT stage (p = 0.017), pN stage (p = 0.000), pTNM stage (p = 0.000) and tumor size (p = 0.000) were significantly correlated with prognosis (Fig. 4a). In multivariate cox analysis, KRT80 expression (p = 0.002), differentiation (p = 0.049), pN stage(p = 0.022) and tumor size (p = 0.006) were still significantly correlated with prognosis. K-M curve showed that KRT80 expression was significantly correlated with survival in lung adenocarcinoma group (p = 0.045, Fig. 4b). As shown in the ROC curve (Fig. 4c), the AUC value was 0.6908 (p < 0.0001), suggesting a reasonably effective classifier for lung adenocarcinoma. In the lung squamous cell carcinoma group, univariate cox analysis showed a significant correlation between pN stage (p = 0.033) and prognosis (Fig. 4d). As shown in the K-M curve (Fig. 4e), there was no significant correlation between KRT80 expression and survival in lung squamous cell carcinoma group (p = 0.22). As shown in the ROC curve (Fig. 4f), the AUC value was 0.6023 (p = 0.0985), indicating that the classifier may not be sufficiently effective for diagnosing lung squamous cell carcinoma.

Table 3 The expression of KRT80 in LUAD and LUSC patients and its correlation with clinicopathological parameters.
Fig. 4
figure 4

Expression of KRT80 in LUAD and LUSC. (a) Univariate and multivariate analysis of patients with LUAD. (b-c) KRT80 survival curve and ROC curve of LUAD patients. (d) Univariate and multivariate analysis of patients with LUSC. (e-f) KRT80 survival curve and ROC curve of LUSC patients. A p value less than 0.05 indicates a statistically significant difference.

Discussion

NSCLC is an aggressive tumor with a high mortality rate and is prone to distant metastasis21. In Xuanwei, Yunnan Province, China, the incidence and mortality of lung cancer is notably elevated, which is mainly related to the long-term exposure of local residents to polycyclic aromatic hydrocarbons (PAHs) and particulate pollution from indoor coal burning22. Keratin is the intermediate filament of epithelial cells that is expressed in a tissue-specific manner and is used as a biomarker for cancer diagnosis23,24. In the field of lung cancer, keratin members such as KRT6 A, KRT14, KRT17, and keratin-19 fragment (Cyfra21-1) can be used as prognostic biomarkers for lung cancer25,26,27,28. KRT80 is a unique member of the keratin family because it is widely expressed in a variety of epithelial cells13. KRT80 has been shown to be overexpressed in a variety of cancers and associated with poor prognosis12,29,30,31. However, the research on the role of KRT80 in lung cancer remains limited. At present, studies on the role and impact of KRT80 in NSCLC have mainly relied on bioinformatics methods. Our study is among the few that validate KRT80 expression in NSCLC using clinical samples. Compared with Jing Jiang et al. 's study32, this study only included 16 samples, while our study had a larger sample size, including a total of 175 cases, and included more comprehensive clinicopathological parameters. Such a large sample size provides us with a more solid data base, thus enhancing the reliability and generalizability of our findings. In addition, our study focused on the expression differences of KRT80 in different regions and different histological subtypes (LUAD and LUSC).

We systematically analyzed KRT80 expression and its association with clinicopathological features in 175 patients with NSCLC. The results showed that KRT80 was highly expressed in cancer tissues. The expression of KRT80 was significantly correlated with the region and pathological type. Specifically, the positive expression rate of KRT80 in patients in Xuanwei area was significantly higher than that in Non-xuanwei area, and the positive expression rate in patients with lung adenocarcinoma was also significantly higher than that in patients with lung squamous cell carcinoma. These results suggest that the positive expression of KRT80 may be influenced by region-specific factors and different histological subtypes of lung cancer. Through univariate and multivariate cox regression analysis, postoperative lymph node staging (pN stage) is an independent prognostic factor for NSCLC patients, but its confidence interval is wide, and the accuracy of this prognostic factor needs to be further verified in future studies. The expression level of KRT80 is not significantly associated with the survival prognosis of patients, so it may not be suitable as a prognostic biomarker for total population of NSCLC. The area under the ROC curve (AUC) is often used to evaluate the predictive performance of biomarker models33. For cancer biomarkers with potential clinical utility, the AUC value should be higher than 0.6, while an AUC value higher than 0.7 is obtained from high-quality cancer biomarkers34. The results of our ROC curve analysis showed that the AUC value corresponding to KRT80 was close to 0.7, indicating that KRT80, as a biomarker, has a relatively significant diagnostic value and can effectively distinguish cancer tissues from normal tissues.

Based on these important findings, we further explored the differences in the expression of KRT80 in different geographical regions (Xuanwei region and Non-xuanwei region) and different histological types (lung adenocarcinoma and lung squamous cell carcinoma), aiming to further understand the role of KRT80 in NSCLC and its influencing factors. Unique environmental factors in Xuanwei, such as long-term exposure to polycyclic aromatic hydrocarbons (PaHs) and indoor coal-fired particles22, may lead to differences in KRT80 expression patterns. In the Xuanwei group, KRT80 expression was correlated with the degree of pathological differentiation of tumors, while in the non-Xuanwei group, KRT80 expression was correlated with the histological type of tumors. These results may indicate that KRT80 plays a different role in NSCLC in different regions. Although this phenomenon has been preliminarily observed in the current study, the complex underlying molecular mechanism behind it remains to be further explored and elucidated. Although KRT80 has been studied as a potential prognostic marker in a variety of cancers, its application as a prognostic marker has certain limitations. For example, the effects of KRT80 on prognosis and clinically relevant indicators in patients with various cancers have not been extensively studied, and even opposite conclusions have been drawn in different studies of the same cancer13. Therefore, it is necessary to deepen the molecular mechanism research and increase clinical studies to clarify the clinical application prospect of KRT80. Multivariate Cox regression analysis showed that tumor size was an independent prognostic factor in the Xuanwei group, while pT stage was an independent prognostic factor in the non-Xuanwei group. These results further emphasize the importance of tumor stage and size in NSCLC prognosis. This conclusion is contrary to the findings of Jing Jiang et al.32, who suggested that KRT80 is associated with the prognosis of patients with NSCLC. This difference may be due to differences in the selection of subjects and sample size between the two studies, which may have affected the consistency of the findings. However, ROC curve analysis indicated that KRT80 has a good ability to distinguish cancerous tissue from non-cancerous lung tissue in Xuanwei and Non-xuanwei region.

Finally, we divided the patients into lung adenocarcinoma group and lung squamous cell carcinoma group according to histological type, and analyzed the correlation between KRT80 expression and clinicopathological parameters. In the lung adenocarcinoma cohort, KRT80 expression exhibited a significant correlation with region, and KRT80 serves as an independent prognostic factor for patients with lung adenocarcinoma. However, there was no significant correlation between KRT80 expression and survival prognosis in lung squamous cell carcinoma group. These results suggest that KRT80 may have different biological significance and prognostic value in different pathological types of NSCLC. It is noteworthy that KRT80, as a biomarker, may exhibit sufficient effectiveness in diagnosing lung adenocarcinoma but not lung squamous cell carcinoma.

In the field of lung cancer research, several studies have revealed the role of KRT80 in NSCLC. KRT80 promotes the proliferation, migration, invasion and EMT of non-small cell lung cancer by regulating TGF-β/SMAD signaling pathway, which may be a potential target for NSCLC treatment. The regional and histological variation in KRT80 expression may stem from microenvironmental TGF-β/SMAD gradients, receptor availability specific to cell types, and SMAD - dependent transcriptional regulation. Together, they shape the spatial heterogeneity in KRT80 - driven EMT and invasion, underscoring the TGF-β/SMAD pathway’s crucial role in modulating KRT80’s oncogenic effects in NSCLC11. In addition, Hiroki Sanada et al. studied tumor suppressive microRNAs (miRNAs) and found that high expression of KRT80 was significantly correlated with poor prognosis of lung adenocarcinoma (LUAD) patients, with the 5-year survival rate and disease-free survival rate were significantly reduced35. Gang Li et al. also found that the expression level of KRT80 was increased in lung adenocarcinomas and was associated with poor prognosis of patients when studying KRT family genes36. Fan Feng et al.. revealed the correlation between the high expression of KRT80 and the lower overall survival rate of LUAD patients based on the differential expression genes shared by tuberculosis and LUAD37. At the same time, it was observed that the expression level of KRT80 increased with the increase of clinicopathological stage, suggesting that KRT80 may be associated with the progression of tumor metastasis. Jing Jiang et al. identified KRT80 as a new prognostic and predictive biomarker of human lung adenocarcinoma by bioinformatics methods, and the transcription level of KRT80 in LUAD samples was positively correlated with clinical parameters such as lymph node metastasis stage, distant metastasis, and pathological stage32. However, these studies mainly rely on bioinformatics methods and experiments at the cell sample level. Our study, based on 175 clinical samples, adopted a more direct and systematic approach to clinical data analysis. We not only verified the expression pattern of KRT80 in NSCLC, but also explored its relationship with various clinicopathological features, including region, age, pathological type and prognosis. In addition, our study evaluated the independent effect of KRT80 expression on patient prognosis through Cox regression analysis, providing more comprehensive clinical evidence.

In summary, our study clearly demonstrated the up-regulation of KRT80 expression in cancer tissues of NSCLC patients, especially in lung adenocarcinoma patients and people in Xuanwei area. The expression of KRT80 showed significant regional specificity and histological heterogeneity among subtypes. Although KRT80 expression was not directly associated with patient prognosis, it showed a high ability to distinguish cancerous tissue from adjacent normal lung tissue, suggesting that KRT80 may be used primarily as a potential diagnostic biomarker rather than an independent prognostic indicator.