Mitochondria-related gene-based molecular subtypes of lung adenocarcinoma and their prognostic implications

Zhanghuang, Ziyi; Xie, Fei; Ma, Xuemei; Chen, Jinfeng

doi:10.1038/s41598-025-07982-8

Download PDF

Article
Open access
Published: 22 July 2025

Mitochondria-related gene-based molecular subtypes of lung adenocarcinoma and their prognostic implications

Ziyi Zhanghuang¹^na1,
Fei Xie¹^na1,
Xuemei Ma¹ &
…
Jinfeng Chen²

Scientific Reports volume 15, Article number: 26577 (2025) Cite this article

2971 Accesses
1 Altmetric
Metrics details

Subjects

Abstract

Lung adenocarcinoma (LUAD) is the most prevalent subtype of lung cancer, presenting significant challenges in treatment and prognostic prediction. Despite advancements in therapeutic approaches in recent years, personalized medicine has not yet achieved a notable breakthrough. Given the poor prognosis of patients, there is an urgent need to enhance the ability for precise prediction. Mitochondria play a crucial role in the metabolism and energy production of cancer cells, yet their specific impact in lung adenocarcinoma warrants further investigation. This study leveraged data from the TCGA and GEO databases to stratify 515 lung adenocarcinoma patients into two distinct subtypes based on mitochondrial-related genes. We systematically evaluated survival outcomes and biological pathway activities between subtypes, characterized their immune infiltration profiles, and developed a prognostic model using subtype-specific differentially expressed genes. Drug sensitivity disparities were further assessed. Single-cell RNA sequencing data were analyzed using an XGBoost classifier to delineate cell-type heterogeneity across subtypes at single-cell resolution. In LUAD, we identified two distinct subtypes. One subtype exhibited active mitochondrial metabolism, which was associated with poor prognosis and higher tumor purity. Moreover, this subtype showed greater sensitivity to Osimertinib. Further single-cell analysis revealed that this subtype was characterized by substantial macrophage infiltration, potentially promoting tumor progression through the NF-κB signaling pathway. Overall, our study identified novel LUAD subtypes and provided new insights into the clinical treatment of LUAD.

Integrative analysis of single-cell and bulk RNA-sequencing data revealed T cell marker genes based molecular sub-types and a prognostic signature in lung adenocarcinoma

Article Open access 10 January 2024

SMR-guided molecular subtyping and machine learning model reveals novel prognostic biomarkers and therapeutic targets in non-small cell lung adenocarcinoma

Article Open access 10 January 2025

Prognostic signature based on mitochondria quality control proteins for the prediction of lung adenocarcinoma patients survival

Article Open access 25 September 2023

Introduction

Lung cancer is one of the most common cancer worldwide, with the highest incidence and mortality rates among all cancers^1,2. Non-small cell lung cancer (NSCLC), accounting for approximately 85% of all lung cancer cases, represents the predominant pathological type³. Lung adenocarcinoma (LUAD) is the most frequent histological subtype of NSCLC, comprising around 40% of newly diagnosed lung cancer cases⁴. The complex biological characteristics and high degree of heterogeneity of LUAD pose significant challenges for its treatment and prognostic prediction. Currently, clinical treatment strategies include surgical resection, chemotherapy, radiotherapy, molecular targeted therapy⁵, and immunotherapy⁶. Although these approaches have markedly improved the prognosis of LUAD patients⁷, issues such as drug resistance, adverse side effects, and the lack of clear patient selection criteria still limit the overall clinical benefits⁸.

The prognosis of LUAD patients remains generally poor, with an overall 5-year survival rate of only 10%–15%⁹. Studies have shown that tumor heterogeneity is a key factor affecting treatment efficacy and prognosis¹⁰, manifested across multiple levels including genomic alterations, individual genetic variability¹¹, and dynamic changes within the tumor microenvironment¹². While existing prognostic evaluation systems, such as Tumor-Node-Metastasis (TNM) staging system and molecular subtyping¹³, provide important guidance for clinical decision-making, they still exhibit significant limitations in terms of predictive accuracy, individualized application, and dynamic monitoring of the tumor microenvironment. Thus, there is an urgent need to develop more precise and comprehensive prognostic assessment systems.

Mitochondria, as central organelles for cellular energy metabolism, play essential roles in a variety of critical biological processes, including Ca²⁺ signaling, reactive oxygen species (ROS) homeostasis, regulation of apoptosis and cell proliferation, and lipid transport¹⁴. Recent studies have revealed that mitochondrial dysfunction is closely associated with the initiation and progression of multiple cancer types¹⁵. For example, enhanced mitochondrial biogenesis has been shown to promote tumorigenesis by altering glycolytic metabolism in gastric and colorectal cancers¹⁶; heterogeneity of mitochondrial oxidative phosphorylation (OXPHOS) function has been linked to immunotherapy resistance in triple-negative breast cancer¹⁷; and in LUAD, KRAS mutations can lead to increased mitochondrial DNA copy number, promoting tumor progression through enhanced oxidative phosphorylation¹⁸, while Targeting mitochondrial transcription elongation factor (TEFM) has been demonstrated to effectively inhibit tumor growth¹⁹. Moreover, mitochondrial-mediated programmed cell death signatures have been implicated in prognostic prediction and immunotherapy response evaluation in LUAD²⁰. These findings provide novel insights for the development of mitochondria-targeted therapeutic strategies. However, the impact of mitochondrial gene expression patterns on LUAD progression and prognosis remains poorly understood, limiting their clinical application.

To better predict the prognosis of LUAD, this study conducted a molecular classification of patients based on mitochondrial-related gene expression profiles. Utilizing publicly available datasets, LUAD patients were stratified into distinct subtypes according to the expression patterns of these genes. We systematically evaluated the differences among these subtypes in terms of gene expression, metabolic pathways, immune microenvironment, and clinical outcomes, and further conducted detailed investigations of cellular heterogeneity at single-cell resolution. Additionally, to gain insights into potential therapeutic responses, we assessed the sensitivity of various chemotherapeutic agents across these subtypes. Collectively, the molecular classification system established in this study may provide a novel theoretical basis for precision therapy in LUAD.

Materials and methods

Data acquisition and processing

A cohort comprising a total of 515 LUAD patients was enrolled in this investigation. Their RNA sequencing data, somatic mutation profiles, and corresponding clinical data were procured from The Cancer Genome Atlas (TCGA, https://portal.gdc.cancer.gov/), accessed via the TCGAbiolinks²¹ package (version 2.30.4) in the R programming environment. Clinical and transcriptomic data from an additional LUAD dataset, GSE31210²², were retrieved from the Gene Expression Omnibus (GEO; https://www.ncbi.nlm.nih.gov/gds/) database using GEOquery²³ package (version 2.70.0) in the R programming environment. 1136 mitochondria-related genes that are essential for mitochondrial function were obtained from the MitoCarta 3.0²⁴ database (https://www.broadinstitute.org/mitocarta/mitocarta30-inventory-mammalian-mitochondrial-proteins-and-pathways). Six gene sets related to mitochondrial function, five gene sets related to inflammatory pathways, and eight gene sets associated with immune evasion were obtained from the Molecular Signatures Database (http://www.gsea-msigdb.org/gsea/msigdb/index.jsp)²⁵.

Univariate Cox regression analysis was performed on a total of 59,423 genes derived from TCGA-LUAD tumor samples to identify genes significantly associated with prognosis. Based on a selection criterion of p < 0.01, 5,567 genes were identified. By intersecting these genes with 1,136 mitochondria-related genes, 259 prognostic mitochondria-related genes (PMGs) were further determined. The gene symbols of the PMGs were subsequently converted to ENTREZ IDs using the clusterProfiler²⁶ R package (version 4.10.1), and Gene Ontology (GO) enrichment analysis was conducted using the enrichGO function. The enrichment results were visualized with the ggplot2²⁷ package (version 3.5.1).

Mitochondrial-related subtypes analysis by consensus clustering

Based on the expression profiles of the 259 PMGs, consensus clustering²⁸ (CC) was performed on the TCGA-LUAD gene expression matrix using the ConsensusClusterPlus²⁸ R package (version 1.66.0). To assess the clustering stability of different LUAD subtypes, 80% resampling of the samples was conducted. The optimal number of clusters was determined to be k = 2. Random sampling and clustering were repeated for 1,000 iterations to ensure the robustness and reproducibility of the findings. Accordingly, the TCGA-LUAD cohort was divided into two distinct PMGs-related subtypes, designated as C1 and C2. Principal component analysis²⁹ (PCA) was performed to evaluate the homogeneity between the identified clusters, and the results were visualized using the ggplot2 package (version 3.5.1). Kaplan–Meier survival curves were generated using the survival³⁰ R package (version 3.5.8), and differences in survival rates between clusters were assessed by the log-rank test, with p < 0.05 considered statistically significant.

Gene expression and pathway differences in PMGs-related subtypes

The GSVA³¹ package (version 1.50.5) was utilized to calculate the enrichment scores of PMGs-related subtypes based on the Hallmark³² gene sets. The results were visualized using the pheatmap package (version 1.0.12). To evaluate the mitochondrial functional differences between PMGs-related subtypes, particular attention was given to six key mitochondrial function-associated pathways, including Fatty Acid β-Oxidation, Mitochondrial Biogenesis, Mitochondrial Protein Degradation, Mitophagy, Oxidative Phosphorylation (OXPHOS), and the Tricarboxylic Acid (TCA) cycle. The differences in GSVA scores for these metabolic pathways between subtypes were visualized using the ggplot2 package (version 3.5.1), with a statistical significance threshold set at p < 0.05.

Evaluation of immune microenvironment heterogeneity in PMGs-related subtypes

The estimate³³ R package (version 1.0.13) was employed to evaluate the immune scores, stromal scores, and ESTIMATE scores of PMGs-related subtypes, as well as to infer tumor purity for each sample. The CIBERSORT³⁴ algorithm (https://cibersortx.stanford.edu/) was utilized to estimate the relative proportions of 22 common immune cell types within the PMGs-related subtypes. Statistical significance of differences between groups was assessed using the Wilcoxon rank-sum test.

Establishment of prognostic models for lung adenocarcinoma based on DEGs

Differentially expressed genes (DEGs) between the identified cohorts were analyzed using the limma³⁵ R package (version 3.58.1), with statistical thresholds set at |logFC|> 1 and p < 0.05. The results were visualized using the pheatmap package (version 1.0.12). A univariate Cox proportional hazards regression model was applied to evaluate the association between DEG expression levels and patient survival outcomes. To further reduce the number of candidate variables for model construction, Least Absolute Shrinkage and Selection Operator (LASSO) regression analysis was performed using the glmnet³⁶ package (version 4.1.8). Subsequently, key genes selected through LASSO regression were subjected to stepwise multivariate Cox proportional hazards regression analysis. A prognostic risk score model was then constructed based on the regression coefficients, with the risk score formula defined as follows:\({\text{Risk Score }} = \, \beta_{1} \times {\text{Gene}}_{1} + \beta_{2} \times {\text{Gene}}_{2} + \beta_{3} \times {\text{Gene}}_{3} + \cdots + \beta_{n} \times {\text{Gene}}_{n} .\)

Patients were stratified into high-risk and low-risk groups based on the median value of the risk score. Kaplan–Meier (K–M) survival analysis and receiver operating characteristic (ROC) curve analysis were employed to evaluate the predictive performance of the prognostic model. Additionally, a nomogram was constructed using the rms R package (version 6.8.0) to assess the potential clinical utility of the risk score as an independent prognostic indicator.

Drug sensitivity analysis and genomic alteration analysis

The oncoPredict³⁷ R package (version 0.2) was utilized to predict the sensitivity of patients with different PMG-related subtypes to commonly used therapeutic agents in LUAD. Somatic mutation data for the TCGA-STAD cohort were downloaded using the TCGAmutations R package (version 0.4.0), retaining only samples with complete mutation information for subsequent analyses. Single nucleotide variant (SNV) data were analyzed and visualized using the maftools³⁸ R package (version 2.18.0).

Investigating cellular type differences among mitochondria-related subtypes at single-cell resolution

Single-cell RNA sequencing (scRNA-seq) datasets from GSE117570³⁹, GSE131907⁴⁰, and GSE148071⁴¹ were selected for further analysis. The scDblFinder⁴² R package (version 1.16.0) was employed to identify and remove doublets. Cells with fewer than 500 UMI counts and those with mitochondrial gene expression exceeding 15% were further filtered out. Batch effects across different datasets were corrected using the harmony R package (version 1.2.1). The top 3000 highly variable genes were selected as features for subsequent dimensionality reduction and clustering using the FindVariableGenes function in the Seurat⁴³ package (version 4.4.0). Dimensionality reduction was performed based on the first 30 principal components, and clustering was conducted with a resolution parameter set to 0.5. Cluster-specific marker genes were identified using the FindAllMarkers function with parameters set to logfc.threshold = 0.25 and min.pct = 0.25. Clusters were annotated by integrating information from the original dataset publications and known classical marker genes for cell types.

A total of 37 untreated primary tumor samples were selected for subsequent in-depth analysis. The utilized scRNA-seq data were not accompanied by bulk RNA-seq profiles. According to the method described by Zhou et al.⁴⁴, an approximate bulk RNA-seq expression profile for each single-cell sample was generated by averaging the gene expression values across all cells within each sample. Subsequently, the TCGA-LUAD dataset was randomly divided into a training set and a validation set at a 7:3 ratio. An XGBoost⁴⁵-based supervised classification model was constructed using the training set to classify single-cell samples based on PMG expression profiles. The model’s performance was validated using the validation set, with the area under the curve (AUC) employed as the evaluation metric. Finally, bar plots were generated using the ggplot2 R package (version 3.5.1) to compare differences in cell type distributions between the C1 and C2 subtypes in the single-cell samples.

Assessment of gene set scoring at the single-cell level

Enrichment scores of Immune evasion signaling pathways in single-cell samples were evaluated using the “AUCell⁴⁶,” “UCell⁴⁷,” “singscore⁴⁸,” and “ssgsea⁴⁹” methods implemented in the irGSEA⁵⁰ software package (https://github.com/chuiqin/irGSEA/) (version 3.3.2). The rank aggregation (RRA) algorithm was applied to integrate the results from different methods and to identify immune evasion pathways that were consistently and significantly enriched across all approaches.

Cell–cell communication analysis

Intercellular communication networks were constructed using the CellChat⁵¹ R package (version 2.1.0) (https://github.com/sqjin/CellChat) to explore potential interactions among various cell types. After creating the CellChat object, the CellChatDB.human database was selected to infer communication probabilities and specific receptor-ligand interactions within the defined cell populations.

Statistical analysis

All of the statistical analyses and data visualizations were performed in the R programming environment (R version 4.3.1). The Wilcoxon rank-sum test was employed for comparative analysis between two groups. The K–M method was employed to draw survival curves. p-value < 0.05 was considered statistically significant. Unless stated otherwise, all parameter settings for the R packages utilized in this study were set to their default values.

Results

The expression patterns of mitochondrial-related genes identify two distinct LUAD subtypes

To stratify patients with lung adenocarcinoma (LUAD), we initially conducted a genome-wide univariate Cox regression analysis, applying a stringent threshold of p < 0.01, and identified 5,567 genes significantly associated with overall survival (OS). Concurrently, 1,136 mitochondria-related genes were obtained from the MITOCARTA3.0 database (Supplementary Table 1), referred to as mitochondrial genes (MGs). By intersecting these two gene sets (Fig. 1A), a total of 259 genes were identified that were both associated with mitochondrial function and prognostically relevant in LUAD (Supplementary Table 2), hereafter termed prognostic mitochondrial genes (PMGs). These PMGs possess dual functional and prognostic significance, thus providing a molecular foundation for the classification of mitochondria-associated subtypes in LUAD. Gene Ontology (GO) enrichment analysis revealed that the PMGs were predominantly involved in biological processes related to mitochondrial function and energy metabolism (Fig. 1B).

Based on the expression profiles of the PMGs, consensus clustering analysis was performed on the TCGA-LUAD cohort to uncover potential molecular subtypes. When the number of clusters was set to k = 2, the consensus matrix heatmap exhibited distinct and well-defined boundaries (Fig. 1C), indicating high intra-cluster similarity and substantial inter-cluster heterogeneity. Furthermore, the cumulative distribution function (CDF) curve plateaued at k = 2 (Fig. 1D), suggesting that increasing the cluster number beyond two did not markedly improve clustering stability. Therefore, k = 2 was selected as the optimal clustering solution. Accordingly, the TCGA-LUAD cohort was stratified into two molecular subtypes, designated C1 and C2. Principal component analysis (PCA) further validated the clustering results, demonstrating a clear separation between the two subtypes in two-dimensional space (Fig. 1E), thereby underscoring the distinct gene expression patterns between C1 and C2. Kaplan–Meier survival analysis showed that patients classified into the C1 subtype exhibited significantly poorer OS compared to those in the C2 subtype (p = 0.0011), indicating that this molecular classification has robust prognostic relevance (Fig. 1F).

To elucidate differences in pathway activity between the two subtypes, we performed Gene Set Variation Analysis (GSVA) based on the Hallmark gene sets. Among the 50 pathways evaluated, 33 displayed significant differences in activity scores between C1 and C2 (p < 0.05) (Fig. 1G). Notably, the majority of pathways, particularly those involved in metabolic processes, were more highly activated in the C1 subtype, suggesting an enhanced metabolic phenotype. In contrast, only four pathways—primarily immune-related, including allograft rejection and IL6/JAK/STAT3 signaling—were more activated in the C2 subtype.

Further, to specifically investigate mitochondrial functional differences between the subtypes, we examined GSVA scores of six key mitochondrial pathways, including fatty acid β-oxidation, mitochondrial biogenesis, mitochondrial protein degradation, mitophagy, oxidative phosphorylation, and the tricarboxylic acid (TCA) cycle. Across all assessed pathways, C1 exhibited significantly higher GSVA scores compared to C2 (Fig. 1H), reflecting a global upregulation of mitochondrial metabolic activity.

The tumor immune microenvironment was also compared between the two subtypes using the ESTIMATE algorithm. The C1 subtype demonstrated lower stromal, immune, and ESTIMATE scores, but higher tumor purity relative to the C2 subtype (Fig. 1I,J). Furthermore, immune cell infiltration was assessed using the CIBERSORT algorithm, revealing that C1 tumors harbored higher levels of T follicular helper cells, M0 macrophages, M2 macrophages, and activated mast cells. Conversely, plasma cells, resting memory CD4+ T cells, resting dendritic cells, and resting mast cells were more abundant in the C2 subtype (Fig. 1K). These findings collectively suggest that mitochondrial gene expression patterns are intricately associated with both metabolic reprogramming and immune landscape remodeling in LUAD.

Construction and validation of a prognostic model based on DEGs associated with PMGs-related subtypes

To identify robust biomarkers for prognostic evaluation, we first analyzed differentially expressed genes (DEGs) between the two subtypes. A total of 932 DEGs were identified, including 446 genes upregulated in the C1 subtype and downregulated in C2, and 486 genes exhibiting the opposite expression pattern (Fig. 2A). To further screen for candidate genes associated with the prognosis of lung adenocarcinoma (LUAD) patients, univariate Cox regression analysis was performed. Based on gene expression profiles and survival data from 515 patients in the TCGA-LUAD cohort, 190 genes were initially identified as significantly associated with prognosis (p < 0.05) (Supplementary Table 3). To refine these candidates, least absolute shrinkage and selection operator (LASSO) regression analysis was conducted. The feature selection process and cross-validation results of the LASSO model are presented in Fig. 2B,C, respectively. According to the changes in variable coefficients with increasing regularization parameter (λ), 16 key genes were selected for subsequent multivariate Cox regression analysis (Supplementary Table 4).

Multivariate Cox regression analysis was then performed to validate the prognostic value of these 16 genes and to construct the final prognostic model. Ultimately, eight genes were identified as independent prognostic factors. As shown in Fig. 2D, these genes exhibited significantly different expression levels between the high- and low-risk groups. The forest plot further confirmed that all genes had statistically significant hazard ratios (HRs) (global p = 2.33 × 10⁻¹²) (Fig. 2E). Based on these eight independent prognostic factors, we constructed a prognostic risk score model, with the formula as follows: \({\text{Risk score }} = \, \left( {0.{26573}} \right) \times {\text{PLK1 }} + \, \left( {0.{11685}} \right) \times {\text{LYPD3 }} + \, \left( {0.{11713}} \right) \times {\text{FOSL1 }} + \, \left( {0.{53113}} \right) \times {\text{DLGAP5 }} + \, \left( { - 0.{32573}} \right) \times {\text{IKZF3}}\)\(+ \, \left( {0.{17318}} \right) \times {\text{C4orf48 }} + \, \left( { - 0.{55832}} \right) \times {\text{CENPW }} + \, \left( { - 0.{21994}} \right) \times {\text{TRIP13}}\).

Time-dependent ROC curve analysis (Fig. 2F) demonstrated that the model had good predictive accuracy for overall survival (OS), with an area under the curve (AUC) of approximately 0.7. Kaplan–Meier survival analysis revealed that patients in the high-risk group had significantly shorter OS compared to those in the low-risk group (p < 0.001) (Fig. 2G), indicating the model’s ability to effectively stratify patients by risk level. A nomogram incorporating sex and risk score was subsequently developed to further evaluate the independent prognostic value of the risk score (Fig. 2H). The calibration plot showed excellent agreement between predicted and observed OS (Fig. 2I).

Further statistical analysis showed that the majority of patients with the poor-prognosis C1 subtype were classified into the high-risk group (198/308), whereas patients with the favorable-prognosis C2 subtype were predominantly distributed in the low-risk group (145/206), supporting the accuracy of the risk model in distinguishing patients with different prognostic risks (Fig. 2J). Finally, the robustness of the model was validated using the independent GSE31210 cohort. Consistent with the findings in the TCGA-LUAD cohort, the model maintained good predictive performance (Fig. 2K), and Kaplan–Meier survival analysis confirmed a significant OS difference between high- and low-risk groups, with shorter survival observed in the high-risk group (p = 0.012) (Fig. 2L). Together, these results suggest that our risk score model has strong clinical potential as a robust prognostic tool for LUAD patients.

Drug sensitivity differences and TMB characteristics between PMGs subtypes

To investigate the differences in drug sensitivity between the two lung adenocarcinoma (LUAD) subtypes, we used the OncoPredict package to estimate the half-maximal inhibitory concentration (IC50) values for several chemotherapeutic and targeted agents commonly used in LUAD treatment. The results showed that Cisplatin, Carboplatin, Gemcitabine, Irinotecan, and Vinorelbine exhibited higher IC50 values in the C1 subtype compared to the C2 subtype, whereas Osimertinib displayed a significantly lower IC50 value in the C1 subtype (Fig. 3A). To further explore the genomic alterations that might contribute to differential therapeutic responses, we generated waterfall plots of somatic mutations for the C1 (Fig. 3B) and C2 (Fig. 3C) subtypes using the “maftools” package. Interestingly, the C2 subtype exhibited a higher overall mutation rate and a broader spectrum of gene mutations. Tumor mutational burden (TMB) analysis revealed that the C1 subtype had a higher TMB (Fig. 3D,E), which may partially explain its increased sensitivity to Osimertinib.

Single-cell insights into cellular diversity of C1 and C2 subtypes

To further explore the differences in various cell types between the two lung adenocarcinoma (LUAD) subtypes, we conducted a comprehensive analysis using 37 primary tumor samples from three publicly available single-cell RNA sequencing datasets: GSE131907, GSE148071, and GSE117570. After implementing rigorous quality control measures, we obtained a total of 64,667 cells for subsequent analyses. Batch effect correction results, shown in Fig. 4A,B, demonstrate that technical variations among datasets were effectively mitigated. At a resolution parameter of 0.5, we distinguished 14 major cell types based on classical marker genes, including T lymphocytes, Monocytes, B lymphocytes, Macrophages, Dendritic cells, Plasma cells, Mast cells, Natural Killer(NK) cells, Alveolar cells, Epithelial cells, Endothelial cells, Fibroblasts, Oligodendrocytes, and Malignant cells (Fig. 4C). Figure 4D displays the top five highly expressed genes in each cell type.

Since the single-cell RNA-seq data lacked corresponding bulk RNA-seq data, we simulated bulk expression profiles by averaging gene expression levels across all cells within each single-cell sample following the method proposed by Zhou et al. Based on previous subtype classifications, we then constructed a classification prediction model using the XGBoost algorithm, associating complex mitochondrial gene expression patterns with known subtype labels to classify single-cell samples. We randomly divided the TCGA-LUAD cohort into two parts at a 7:3 ratio, using the larger portion for training and the smaller for validation. A classification prediction model was trained on the training set using the expression profiles of 259 PMGs, achieving an AUC of 1.0. Remarkably, the model also demonstrated high predictive accuracy in the validation set, with an AUC of 0.93 (Fig. 4E), indicating its excellent performance in PMGs-based sample classification. Using this classifier, we categorized the single-cell samples and identified 17 C1 subtype samples and 20 C2 subtype samples. Gene set variation analysis (GSVA) revealed distinct pathway activation patterns between the C1 and C2 subtypes (Fig. 4F). Notably, although fewer significantly enriched pathways were detected in the single-cell data compared to the bulk sequencing data, the pathways commonly enriched in both datasets exhibited highly consistent activity trends, further validating the robustness and reliability of the classification model. The distribution patterns of different cell types between the C1 and C2 subtypes are illustrated in Fig. 4G. Compared with the C2 subtype, the C1 subtype exhibited a higher proportion of macrophages and malignant tumor cells, whereas the C2 subtype showed a higher proportion of T lymphocytes and B lymphocytes. This observation suggests that macrophages in the C1 subtype may not exert their inherent anti-tumor effects but instead participate in tumor-promoting processes.

Heterogeneous impact of macrophages on prognosis across PMGs-related subtypes

Given the higher abundance of malignant tumor cells and macrophages in the C1 subtype, along with the significantly poorer prognosis associated with C1, we hypothesized that macrophages might play a critical role in tumor progression among patients with this subtype. GO enrichment analysis revealed that DEGs in macrophages were predominantly enriched in pathways related to energy metabolism and biosynthesis (Fig. 5A). To further elucidate the role of macrophages in LUAD prognosis, particularly in the context of immune evasion, we conducted a detailed gene set enrichment analysis of eight key immune escape signaling pathways. As shown in Fig. 5B, we integrated the results from four different gene set enrichment methods to comprehensively assess the activity of macrophages between the C1 and C2 subtypes. The analysis demonstrated that macrophages in the C1 subtype exhibited significantly upregulated activity in the NF-κB signaling pathway, while those in the C2 subtype displayed relatively suppressed activity, suggesting that the NF-κB pathway may play a central role in immune evasion within the C1 subtype.

Using CellChat for cell–cell communication analysis, we identified frequent and complex interactions between macrophages and malignant tumor cells. Heatmaps depicting the number (Fig. 5C) and the probability (Fig. 5D) of ligand–receptor interactions between these two cell types revealed extensive intercellular communication. In the signaling from macrophages to tumor cells, the PPIA-BSG and LGALS9-P4HB ligand–receptor pairs were significantly enriched (Fig. 5E); conversely, in the signaling from tumor cells to macrophages, the PPIA-BSG, MIF-(CD74 + CD44) complex, MT-RNR2-FPRL2, and MDK-NCL interactions were frequently observed (Fig. 5F).

Discussion

Lung adenocarcinoma (LUAD) is a complex multifactorial disease. Despite significant advances in targeted therapies and immunotherapies in recent years², challenges such as heterogeneous treatment responses and poor prognoses persist¹⁰. The heterogeneity of LUAD at clinical, behavioral, cellular, and molecular levels is a major factor influencing patient outcomes¹⁰. However, current LUAD classification methods are primarily based on histopathological features, which often fail to fully capture the tumor’s molecular characteristics and biological functions, thereby limiting their effectiveness in accurately predicting patient prognosis.

Mitochondria, as the primary sites for cellular energy production, are critically involved in biological processes closely related to cancer initiation and progression, including anabolic metabolism and oxidative stress¹⁴. Although the role of mitochondrial function in cancer progression has been studied across several cancer types¹⁵, its specific mechanisms and contributions in LUAD remain insufficiently explored.

In this study, we identified two potential LUAD subtypes based on the expression profiles of key mitochondrial genes and systematically characterized their distinct features. The C1 subtype exhibited significantly enhanced mitochondrial functional activity and was notably associated with poorer survival outcomes, higher tumor purity, and a more immunosuppressive microenvironment. Despite the heightened mitochondrial activity observed in C1, this did not translate into favorable prognostic outcomes; rather, it correlated with shorter survival times. This phenomenon may reflect an increased dependency of tumor cells on energy metabolism, thereby promoting tumor growth and metastasis. Furthermore, the combination of higher tumor purity and a significantly immunosuppressive microenvironment in the C1 subtype likely provides a niche conducive to tumor cell survival and dissemination. Enhanced mitochondrial function may not only facilitate metabolic adaptation of tumor cells but also exacerbate immune suppression by modulating immune cell functions, ultimately leading to poor patient outcomes. Notably, a higher infiltration level of M2 macrophages was observed in the C1 subtype compared to C2. Macrophages exhibit remarkable plasticity and can differentiate into distinct functional phenotypes through polarization processes. Among these, M2 macrophages, characterized by their anti-inflammatory properties, tissue repair capabilities, and immunosuppressive functions, play a critical role in tumor progression. Substantial evidence suggests that mitochondria play a pivotal role in regulating macrophage polarization⁵². The study by Willenborg et al. demonstrated that mitochondrial metabolism drives the pro-inflammatory and angiogenic functions of M1 macrophages through the early glycolysis-mtROS-HIF1α axis, followed by promotion of M2 macrophage survival and tissue remodeling functions via late-stage OXPHOS-hormesis. This metabolic reprogramming orchestrates the temporal transition of macrophage phenotypes from M1 to M2⁵³. These findings suggest that mitochondrial metabolism may facilitate LUAD progression through dual mechanisms: directly supporting the energy demands and proliferative capacity of tumor cells, while simultaneously remodeling the immune microenvironment (particularly by inducing M2 macrophage polarization) to evade immune surveillance.

In contrast, the C2 subtype exhibited lower mitochondrial functional activity, better survival outcomes, lower tumor purity, and higher levels of immune cell infiltration. Additionally, the C2 subtype demonstrated an enrichment trend in certain pathways related to inflammatory responses and immune regulation. However, upon evaluating the expression activity of five classical inflammation-related pathways—JAK-STAT, NF-κB, inflammasome, cGAS-STING, and MAPK—no significant differences were observed between the C1 and C2 subtypes. These findings suggest that the C2 subtype may represent a collection of samples characterized by weaker mitochondrial function and lacking the metabolic features observed in the C1 subtype.

To elucidate the clinical relevance of mitochondrial-related subtyping, we constructed a prognostic model based on eight differentially expressed genes (DEGs) between the subtypes: PLK1, LYPD3, FOSL1, DLGAP5, IKZF3, C4orf48, CENPW, and TRIP13. These genes are critically involved in tumorigenesis and tumor progression: PLK1 regulates cell cycle progression and its inhibitors have advanced to phase III clinical trials⁵⁴; LYPD3 functions as a potential antibody–drug conjugate (ADC) target and is implicated in immune evasion, and its elevated expression is significantly associated with poor prognosis in LUAD⁵⁵; FOSL1 regulates tumor cell progression in solid malignancies by modulating epithelial-mesenchymal transition (EMT)⁵⁶; DLGAP5 functions as a master cell cycle regulator that drives lung adenocarcinoma progression via PLK1 upregulation, substantiating its oncogenic role⁵⁷; CENPW is a gene implicated in nucleosome assembly, has been identified as an oncogene associated with poor prognosis in hepatocellular carcinoma⁵⁸ and breast cancer⁵⁹. However, its role in the progression of lung adenocarcinoma remains to be elucidated; IKZF3 influences lymphocyte differentiation and is a potential therapeutic target in breast cancer⁶⁰; TRIP13 participates in DNA damage repair and its overexpression associated with poor prognosis in LUAD⁶¹; and C4orf48 promotes renal fibrosis through RNA-binding mechanisms, whereas its biological functions in tumorigenesis and cancer progression remain poorly characterized⁶². These genes play pivotal roles in critical biological processes including tumorigenesis, immune regulation, and cell cycle control. While some have been established as potential therapeutic targets or prognostic biomarkers, the precise molecular mechanisms and clinical implications of others in lung adenocarcinoma remain to be fully elucidated.

It is important to note that our prognostic model achieved an AUC (area under the curve) value of 0.7, indicating moderate predictive performance. This is lower compared to deep learning-based approaches such as SpaRx⁶³ (AUC = 0.938) and DrugFormer⁶⁴ (AUC = 0.975), likely due to the limitations of the currently used Lasso-Cox regression model in capturing complex nonlinear relationships. Future studies should aim to enhance predictive accuracy by leveraging more advanced deep learning models or establishing integrated predictive frameworks. Moreover, extensive validation in larger cohorts and further clinical application testing will be necessary.

To explore the specific therapeutic strategies for the two molecular subtypes of LUAD, we systematically assessed the IC50 of six commonly used clinical antitumor drugs. These drugs include: cisplatin, a platinum-based chemotherapeutic agent that exerts cytotoxic effects by forming DNA cross-links, and is widely used in first-line combination chemotherapy for advanced NSCLC; carboplatin, a second-generation platinum drug with a similar mechanism of action but a more favorable toxicity profile, commonly used in combination chemotherapy for NSCLC; gemcitabine, an antimetabolite that inhibits DNA synthesis, an essential component of basic NSCLC chemotherapy regimens; irinotecan, an antitumor agent targeting topoisomerase I, primarily used in second-line therapy for small cell lung cancer (SCLC) and NSCLC; vinorelbine, a plant alkaloid chemotherapeutic agent that inhibits microtubule polymerization to exert anti-mitotic effects, suitable for palliative treatment of NSCLC; and osimertinib, a third-generation EGFR tyrosine kinase inhibitor that specifically targets EGFR mutations, a first-line treatment for EGFR-mutated NSCLC.The results showed that osimertinib was the only drug that exhibited significantly enhanced drug sensitivity in the C1 subtype (IC50 value decreased by 42% compared to the C2 subtype, p < 0.001), while other drugs demonstrated better efficacy in the C2 subtype. To further investigate the molecular mechanisms underlying this difference, we conducted genomic feature analysis. Somatic mutation spectrum analysis revealed that the KRAS mutation frequency was similar between the two subtypes, and the EGFR gene did not appear in the top 30 mutated genes. Based on this finding, we turned to TMB analysis.TMB analysis revealed significant differences between the subtypes: while the TMB values in both subtypes were positively correlated with the number of mutations, the TMB values in the C1 subtype were more concentrated and significantly higher than those in the C2 subtype. The C2 subtype exhibited a broader range of TMB values, with 23.5% of samples falling below the clinical reference threshold of 0.5. These data suggest that the higher TMB in the C1 subtype may enhance sensitivity to osimertinib by increasing genomic instability, whereas the relatively lower TMB in the C2 subtype may explain its better response to conventional chemotherapeutic agents. This finding provides important biological insights into the treatment response differences between LUAD molecular subtypes.

We extended the subtyping strategy to single-cell resolution using the XGBoost algorithm, enabling a more detailed comparison of cellular composition differences between subtypes. Given the lack of matched bulk RNA-seq profiles for single-cell sequencing data, we constructed pseudo-bulk RNA-seq expression profiles by averaging the gene expression values of individual cells within each sample⁴⁴. It should be noted that, while this arithmetic averaging strategy offers advantages in computational simplicity and stability and can reflect the overall expression characteristics of a cellular population, it presents two important limitations: first, it may obscure the heterogeneity of specific cellular subpopulations; second, simple arithmetic averaging fails to adequately account for the impact of cellular composition proportions on the expression profiles, where existing pseudo-bulk methods demonstrate superior performance⁶⁵.

In the C1 subtype, the proportions of macrophages and malignant tumor cells were significantly higher than those observed in the C2 subtype. Conversely, the proportions of other immune cell types, aside from macrophages, were markedly lower in C1 compared to C2. This suggests that a high abundance of macrophages may synergize with tumor cells to suppress anti-tumor immune responses, thereby enhancing the proliferative and invasive capacities of tumor cells. Moreover, tumor-associated macrophages (TAMs) may facilitate chronic inflammatory responses through mitochondrial-associated pathways, thereby promoting tumor progression and immune evasion⁶⁶. This finding aligns with the unfavorable prognosis observed in the C1 subtype.Therefore, we propose that the heightened mitochondrial metabolic activity observed in C1 may be linked to macrophage-mediated immunosuppression and tumor-promoting effects, which is further corroborated by the elevated infiltration of M2-polarized macrophages in this subtype.

Further in-depth cell–cell communication analyses revealed that interactions between macrophages and malignant tumor cells were closely associated with activation of the NF-κB signaling pathway. This indicates that macrophages may enhance tumor survival and exert immunosuppressive functions through NF-κB pathway activation, thereby facilitating tumor immune evasion. In this process, reactive oxygen species (ROS) serve as critical signaling molecules for NF-κB pathway activation, with mitochondria being one of the primary sources of ROS⁶⁷. Mitochondria-derived ROS can activate NF-κB signaling within macrophages, leading to the secretion of both pro-inflammatory and immunosuppressive cytokines, thus shaping a tumor-promoting microenvironment⁶⁸. Consequently, mitochondrial regulation of macrophage activation states plays a pivotal role in modulating the tumor microenvironment and represents a key factor influencing lung adenocarcinoma progression and patient prognosis.

Ligand–receptor interaction analysis further elucidated the regulatory mechanisms of the tumor microenvironment in the C1 subtype. Notably, the interaction between galectin-9 (LGALS9) and prolyl 4-hydroxylase subunit beta (P4HB) was significantly enriched in macrophage-to-tumor cell signaling, potentially mediating T cell suppression via NF-κB pathway activation⁶⁹. This finding aligns with our observation of significantly lower T cell infiltration in the C1 subtype compared to C2. Moreover, the binding of cyclophilin A (PPIA) to basigin (BSG) may facilitate tumor invasion and recruit M0 macrophages, consistent with the elevated proportion of M0 macrophages in the C1 subtype⁷⁰. Importantly, the interaction between macrophage migration inhibitory factor (MIF) and CD74 antigen (CD74) was more prominent in the context of TP53 mutations, aligning with the higher tumor mutational burden (TMB) and TP53 mutation rates observed in C1⁷¹. Additionally, the binding of midkine (MDK) to nucleolin (NCL) may inhibit wild-type EGFR signaling pathways, potentially contributing to the increased sensitivity of C1 tumors to osimertinib treatment.

Previous work by Zhou et al. proposed a novel classification of 1,076 TCGA-BRCA samples based on the expression of 29 mitophagy-related genes, resulting in two distinct subtypes. Among them, the cluster 1 subtype exhibited high expression of mitophagy-related genes and was associated with poorer survival outcomes, consistent with our finding that the C1 subtype, characterized by worse prognosis, exhibited significantly higher mitophagy pathway activity compared to C2. Interestingly, Zhou et al. also reported that triple-negative breast cancer (TNBC) and metastatic breast cancer were more prevalent in the cluster 2 subtype, which may be attributed to excessive ROS levels leading to lower expression of mitophagy genes in cluster 2⁷². Wang et al. further classified samples based on the expression of nuclear-encoded mitochondrial genes (NMGs), identifying three subtypes associated with immune cell infiltration patterns and exploring their potential involvement in cuproptosis, a recently discovered form of cell death with anti-cancer properties⁷³.

In LUAD, previous studies have predominantly focused on the role of specific genes in regulating mitochondrial function. In contrast, our study systematically established, for the first time, a molecular subtyping framework for lung adenocarcinoma based on mitochondrial-related gene signatures through multi-omics analyses. We further elucidated the interplay between mitochondrial function and the tumor immune microenvironment, offering novel insights for precision diagnosis and individualized treatment of lung adenocarcinoma. Nonetheless, several limitations should be acknowledged. First, our study is based solely on bioinformatic analyses and lacks validation in solid clinical specimens. Second, limitations also exist in the single-cell analysis component: although 64,667 cells from 37 samples provided substantial cellular data, the relatively small sample size may hinder comprehensive assessment of population heterogeneity. Future studies should aim to validate these findings in larger cohorts.

Data availability

The data that support the findings of this study are available in GEO (https://www.ncbi.nlm.nih.gov/geo/, GSE31210, GSE117570, GSE131907 and GSE148071)and TCGA (https://portal.gdc.cancer.gov/repository, TCGA-LUAD).

Abbreviations

AUC:: Area under the curve
CC:: Consensus clustering
C1:: Cluster 1
C2:: Cluster 2
CDF:: Cumulative distribution function
DEGs:: Differentially expressed genes
EMT:: Epithelial-mesenchymal transition
ESTIMATE:: Estimation of STromal and Immune cells in MAlignant Tumour tissues using Expression data
GEO:: Gene Expression Omnibus
GO:: Gene Ontology
GSVA:: Gene Set Variation Analysis
IC50:: Half of the maximal inhibitory concentration
KM:: The Kaplan–Meier method
LUAD:: Lung adenocarcinoma
MGs:: Mitochondrial genes
NSCLC:: Non-small cell lung cancer
OS:: Overall Survival
OXPHOS:: Oxidative phosphorylation
PCA:: Principal component analysis
PMGs:: Prognostic mitochondria-related genes
ROC:: Receiver operating characteristic
ROS:: Reactive oxygen species
SNV:: Single nucleotide variant
TAMs:: Tumor-associated macrophages
TCGA:: The Cancer Genome Atlas
TEFM:: Targeting mitochondrial transcription elongation factor
TMB:: Tumor mutational burden
TNM:: Tumor, node, metastasis staging system

References

Adams, S. J. et al. Lung cancer screening. The Lancet 401, 390–408. https://doi.org/10.1016/S0140-6736(22)01694-4 (2023).
Article Google Scholar
Bray, F. et al. Global cancer statistics 2022: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 74, 229–263. https://doi.org/10.3322/caac.21834 (2024).
Article PubMed Google Scholar
Molina, J. R., Yang, P., Cassivi, S. D., Schild, S. E. & Adjei, A. A. Non-small cell lung cancer: Epidemiology, risk factors, treatment, and survivorship. Mayo Clin. Proc. 83, 584–594. https://doi.org/10.4065/83.5.584 (2008).
Article PubMed Google Scholar
Travis, W. D. et al. The 2015 World Health Organization classification of lung tumors: Impact of genetic, clinical and radiologic advances since the 2004 classification. J. Thorac. Oncol. 10, 1243–1260. https://doi.org/10.1097/JTO.0000000000000630 (2015).
Article PubMed Google Scholar
Makarem, M. & Jänne, P. A. Top advances of the year: Targeted therapy for lung cancer. Cancer 130, 3239–3250. https://doi.org/10.1002/cncr.35423 (2024).
Article CAS PubMed Google Scholar
Cable, J. et al. Frontiers in cancer immunotherapy-a symposium report. Ann. N. Y. Acad. Sci. 1489, 30–47. https://doi.org/10.1111/nyas.14526 (2021).
Article CAS PubMed Google Scholar
Zhang, Y., Lu, L. & Zheng, R. Emerging trends and focus on immune checkpoint inhibitors for non-small cell lung cancer treatment: Visualization and bibliometric analysis. Front. Pharmacol. 14, 1140771. https://doi.org/10.3389/fphar.2023.1140771 (2023).
Article CAS PubMed PubMed Central Google Scholar
Meyer, M.-L. et al. New promises and challenges in the treatment of advanced non-small-cell lung cancer. The Lancet 404, 803–822. https://doi.org/10.1016/S0140-6736(24)01029-8 (2024).
Article CAS Google Scholar
Beadsmoore, C. J. & Screaton, N. J. Classification, staging and prognosis of lung cancer. Eur. J. Radiol. 45, 8–17. https://doi.org/10.1016/S0720-048X(02)00287-5 (2003).
Article CAS PubMed Google Scholar
He, Y. et al. Mechanisms of progression and heterogeneity in multiple nodules of lung adenocarcinoma. Small Methods 5, 2100082. https://doi.org/10.1002/smtd.202100082 (2021).
Article CAS Google Scholar
Hua, X. et al. Genetic and epigenetic intratumor heterogeneity impacts prognosis of lung adenocarcinoma. Nat. Commun. 11, 2459. https://doi.org/10.1038/s41467-020-16295-5 (2020).
Article CAS PubMed PubMed Central Google Scholar
Wang, Y. et al. Spatial transcriptomics delineates molecular features and cellular plasticity in lung adenocarcinoma progression. Cell Discov. 9, 96. https://doi.org/10.1038/s41421-023-00591-7 (2023).
Article CAS PubMed PubMed Central Google Scholar
Tsao, M.-S., Shepherd, F. A., Brambilla, E. & Soria, J.-C. Predictive versus prognostic value of lung adenocarcinoma classification. J. Clin. Oncol. 33, 225–226. https://doi.org/10.1200/JCO.2014.58.7022 (2014).
Article PubMed Google Scholar
Suomalainen, A. & Nunnari, J. Mitochondria at the crossroads of health and disease. Cell 187, 2601–2627. https://doi.org/10.1016/j.cell.2024.04.037 (2024).
Article CAS PubMed Google Scholar
Harrington, J. S., Ryter, S. W., Plataki, M., Price, D. R. & Choi, A. M. K. Mitochondria in health, disease, and aging. Physiol. Rev. 103, 2349–2422. https://doi.org/10.1152/physrev.00058.2021 (2023).
Article CAS PubMed PubMed Central Google Scholar
Liu, Y. et al. The role of mitochondrial biogenesis, mitochondrial dynamics and mitophagy in gastrointestinal tumors. Cancer Cell Int. 25, 46. https://doi.org/10.1186/s12935-025-03685-2 (2025).
Article CAS PubMed PubMed Central Google Scholar
Zhang, Z. et al. A mitochondria-interfering nanocomplex cooperates with photodynamic therapy to boost antitumor immunity. Biomaterials 317, 123094. https://doi.org/10.1016/j.biomaterials.2025.123094 (2025).
Article CAS PubMed Google Scholar
Mennuni, M. et al. High mitochondrial DNA levels accelerate lung adenocarcinoma progression. Sci. Adv. 10, eadp3481. https://doi.org/10.1126/sciadv.adp3481 (2024).
Article CAS PubMed PubMed Central Google Scholar
Hu, W. et al. Identification of TEFM as a potential therapeutic target for LUAD treatment. J. Transl. Med. 22, 692. https://doi.org/10.1186/s12967-024-05483-2 (2024).
Article CAS PubMed PubMed Central Google Scholar
Zhang, L., Cui, Y., Zhou, G., Zhang, Z. & Zhang, P. Leveraging mitochondrial-programmed cell death dynamics to enhance prognostic accuracy and immunotherapy efficacy in lung adenocarcinoma. J. Immunother. Cancer 12, e010008. https://doi.org/10.1136/jitc-2024-010008 (2024).
Article PubMed PubMed Central Google Scholar
Colaprico, A. et al. TCGAbiolinks: An R/Bioconductor package for integrative analysis of TCGA data. Nucleic Acids Res. 44, e71. https://doi.org/10.1093/nar/gkv1507 (2016).
Article CAS PubMed Google Scholar
Okayama, H. et al. Identification of genes upregulated in ALK-positive and EGFR/KRAS/ALK-negative lung adenocarcinomas. Cancer Res. 72, 100–111. https://doi.org/10.1158/0008-5472.Can-11-1403 (2012).
Article CAS PubMed Google Scholar
Davis, S. & Meltzer, P. S. GEOquery: A bridge between the Gene Expression Omnibus (GEO) and BioConductor. Bioinformatics 23, 1846–1847. https://doi.org/10.1093/bioinformatics/btm254 (2007).
Article CAS PubMed Google Scholar
Rath, S. et al. MitoCarta3.0: An updated mitochondrial proteome now with sub-organelle localization and pathway annotations. Nucleic Acids Res. 49, D1541–D1547. https://doi.org/10.1093/nar/gkaa1011 (2021).
Article CAS PubMed Google Scholar
Liberzon, A. et al. Molecular signatures database (MSigDB) 3.0. Bioinformatics 27, 1739–1740. https://doi.org/10.1093/bioinformatics/btr260 (2011).
Article CAS PubMed PubMed Central Google Scholar
Yu, G., Wang, L.-G., Han, Y. & He, Q.-Y. clusterProfiler: An R package for comparing biological themes among gene clusters. OMICS: J. Integr. Biol. 16, 284–287. https://doi.org/10.1089/omi.2011.0118 (2012).
Article CAS Google Scholar
Valero-Mora, P. M. ggplot2: Elegant graphics for data analysis. J. Stat. Softw. Book Rev. 35, 1–3. https://doi.org/10.18637/jss.v035.b01 (2010).
Article Google Scholar
Wilkerson, M. D. & Hayes, D. N. ConsensusClusterPlus: A class discovery tool with confidence assessments and item tracking. Bioinformatics 26, 1572–1573. https://doi.org/10.1093/bioinformatics/btq170 (2010).
Article CAS PubMed PubMed Central Google Scholar
Lever, J., Krzywinski, M. & Altman, N. Principal component analysis. Nat. Methods 14, 641–642. https://doi.org/10.1038/nmeth.4346 (2017).
Article CAS Google Scholar
Gray, R. J. Modeling survival data: Extending the Cox model. J. Am. Stat. Assoc. 97, 353–354. https://doi.org/10.1198/jasa.2002.s447 (2002).
Article Google Scholar
Hänzelmann, S., Castelo, R. & Guinney, J. GSVA: Gene set variation analysis for microarray and RNA-Seq data. BMC Bioinform. 14, 7. https://doi.org/10.1186/1471-2105-14-7 (2013).
Article Google Scholar
Liberzon, A. et al. The Molecular Signatures Database (MSigDB) hallmark gene set collection.
Yoshihara, K. et al. Inferring tumour purity and stromal and immune cell admixture from expression data. Nat. Commun. 4, 2612. https://doi.org/10.1038/ncomms3612 (2013).
Article CAS PubMed Google Scholar
Newman, A. M. et al. Determining cell type abundance and expression from bulk tissues with digital cytometry. Nat. Biotechnol. 37, 773–782. https://doi.org/10.1038/s41587-019-0114-2 (2019).
Article CAS PubMed PubMed Central Google Scholar
Ritchie, M. E. et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 43, e47–e47. https://doi.org/10.1093/nar/gkv007 (2015).
Article CAS PubMed PubMed Central Google Scholar
Friedman, J. H., Hastie, T. & Tibshirani, R. Regularization paths for generalized linear models via coordinate descent. J. Stat. Softw. 33, 1–22. https://doi.org/10.18637/jss.v033.i01 (2010).
Article PubMed PubMed Central Google Scholar
Maeser, D., Gruener, R. F. & Huang, R. S. oncoPredict: An R package for predicting in vivo or cancer patient drug response and biomarkers from cell line screening data. Brief. Bioinform. 22, bbab260. https://doi.org/10.1093/bib/bbab260 (2021).
Article CAS PubMed PubMed Central Google Scholar
Mayakonda, A., Lin, D. C., Assenov, Y., Plass, C. & Koeffler, H. P. Maftools: Efficient and comprehensive analysis of somatic variants in cancer. Genome Res. 28, 1747–1756. https://doi.org/10.1101/gr.239244.118 (2018).
Article CAS PubMed PubMed Central Google Scholar
Song, Q. et al. Dissecting intratumoral myeloid cell plasticity by single cell RNA-seq.
Kim, N. A.-O. X. et al. Single-cell RNA sequencing demonstrates the molecular and cellular reprogramming of metastatic lung adenocarcinoma.
Wu, F. et al. Single-cell profiling of tumor heterogeneity and the microenvironment in advanced non-small cell lung cancer.
Germain, P. L., Lun, A., Garcia Meixide, C., Macnair, W. & Robinson, M. D. Doublet identification in single-cell sequencing data using scDblFinder. F1000Res 10, 979. https://doi.org/10.12688/f1000research.73600.2 (2021).
Article PubMed Google Scholar
Luecken, M. A.-O. & Theis, F. A.-O. Current best practices in single-cell RNA-seq analysis: A tutorial.
Zhou, Q. et al. Cancer functional states-based molecular subtypes of gastric cancer. J. Transl. Med. 21, 80. https://doi.org/10.1186/s12967-023-03921-1 (2023).
Article CAS PubMed PubMed Central Google Scholar
Chen, T. & Guestrin, C. in Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 785–794 (Association for Computing Machinery, San Francisco, California, USA, 2016).
Aibar, S. A.-O. et al. SCENIC: single-Cell regulatory network inference and clustering.
Andreatta, M. & Carmona, S. J. UCell: Robust and scalable single-cell gene signature scoring.
Foroutan, M. A.-O. et al. Single sample scoring of molecular phenotypes.
Hänzelmann, S., Castelo R Fau - Guinney, J. & Guinney, J. GSVA: Gene set variation analysis for microarray and RNA-seq data.
Fan, C. et al. irGSEA: The integration of single-cell rank-based gene set enrichment analysis. Brief. Bioinform. https://doi.org/10.1093/bib/bbae243 (2024).
Article PubMed PubMed Central Google Scholar
Jin, S. et al. Inference and analysis of cell-cell communication using Cell Chat. Nat. Commun. 12, 1088. https://doi.org/10.1038/s41467-021-21246-9 (2021).
Article CAS PubMed PubMed Central Google Scholar
Yunna, C., Mengru, H., Lei, W. & Weidong, C. Macrophage M1/M2 polarization. Eur. J. Pharmacol. 877, 173090. https://doi.org/10.1016/j.ejphar.2020.173090 (2020).
Article CAS PubMed Google Scholar
Willenborg, S. et al. Mitochondrial metabolism coordinates stage-specific repair processes in macrophages during wound healing. Cell Metab. 33, 2398-2414.e2399. https://doi.org/10.1016/j.cmet.2021.10.004 (2021).
Article CAS PubMed Google Scholar
Liu, Z., Sun, Q. & Wang, X. PLK1, a potential target for cancer therapy. Transl. Oncol. 10, 22–32. https://doi.org/10.1016/j.tranon.2016.10.003 (2017).
Article PubMed Google Scholar
Hu, P. et al. Elevated expression of LYPD3 is associated with lung adenocarcinoma carcinogenesis and poor prognosis. DNA Cell Biol. 39, 522–532. https://doi.org/10.1089/dna.2019.5116 (2020).
Article CAS PubMed Google Scholar
Sobolev, V. V. et al. Role of the transcription factor FOSL1 in organ development and tumorigenesis. Int. J. Mol. Sci. 23, 1521 (2022).
Article CAS PubMed PubMed Central Google Scholar
Chen, M. et al. DLGAP5 promotes lung adenocarcinoma growth via upregulating PLK1 and serves as a therapeutic target. J. Transl. Med. 22, 209. https://doi.org/10.1186/s12967-024-04910-8 (2024).
Article CAS PubMed PubMed Central Google Scholar
Zhou, Y. et al. Knockdown of CENPW inhibits hepatocellular carcinoma progression by inactivating E2F signaling. Technol. Cancer Res. Treat. 20, 15330338211007252. https://doi.org/10.1177/15330338211007253 (2021).
Article CAS PubMed PubMed Central Google Scholar
Wang, L. et al. Investigating CENPW as a novel biomarker correlated with the development and poor prognosis of breast carcinoma. Front. Genet. 13, 900111 (2022).
Article CAS PubMed PubMed Central Google Scholar
Lin, C.-Y. et al. IKZF3 amplification frequently occurs in HER2-positive breast cancer and is a potential therapeutic target. Med. Oncol. 39, 242. https://doi.org/10.1007/s12032-022-01812-x (2022).
Article CAS PubMed Google Scholar
Li, W. et al. Thyroid hormone receptor interactor 13 (TRIP13) overexpression associated with tumor progression and poor prognosis in lung adenocarcinoma. Biochem. Biophys. Res. Commun. 499, 416–424. https://doi.org/10.1016/j.bbrc.2018.03.129 (2018).
Article CAS PubMed Google Scholar
Yang, J. et al. The secreted micropeptide C4orf48 enhances renal fibrosis via an RNA-binding mechanism. J. Clin. Investig. 134, e178392. https://doi.org/10.1172/JCI178392 (2024).
Article CAS PubMed PubMed Central Google Scholar
Tang, Z. et al. SpaRx: Elucidate single-cell spatial heterogeneity of drug responses for personalized treatment. Brief. Bioinform. 24, bbad338. https://doi.org/10.1093/bib/bbad338 (2023).
Article CAS PubMed PubMed Central Google Scholar
Liu, X. et al. DrugFormer: Graph-enhanced language model to predict drug sensitivity. Adv. Sci. (Weinheim, Baden-Wurttemberg, Germany) 11, e2405861. https://doi.org/10.1002/advs.202405861 (2024).
Article CAS Google Scholar
You, Y. et al. Modeling group heteroscedasticity in single-cell RNA-seq pseudo-bulk data. Genome Biol. 24, 107. https://doi.org/10.1186/s13059-023-02949-2 (2023).
Article CAS PubMed PubMed Central Google Scholar
Pathria, P., Louis, T. L. & Varner, J. A. Targeting tumor-associated macrophages in cancer. Trends Immunol. 40, 310–327. https://doi.org/10.1016/j.it.2019.02.003 (2019).
Article CAS PubMed Google Scholar
Kuo, C.-L. et al. Mitochondrial oxidative stress in the tumor microenvironment and cancer immunoescape: foe or friend?. J. Biomed. Sci. 29, 74. https://doi.org/10.1186/s12929-022-00859-2 (2022).
Article PubMed PubMed Central Google Scholar
Yu, M. et al. BCAT1 promotes lung adenocarcinoma progression through enhanced mitochondrial function and NF-kappaB pathway activation. J. Zhejiang Univ. Sci. B 23, 760–769. https://doi.org/10.1631/jzus.B2100985 (2022).
Article CAS PubMed PubMed Central Google Scholar
Lv, Y., Ma, X., Ma, Y., Du, Y. & Feng, J. A new emerging target in cancer immunotherapy: Galectin-9 (LGALS9). Genes & Diseases 10, 2366–2382. https://doi.org/10.1016/j.gendis.2022.05.020 (2023).
Article CAS Google Scholar
Tian, P. et al. Dual stimulus responsive borosilicate glass (BSG) scaffolds promote diabetic alveolar bone defectsrepair by modulating macrophage phenotype. Bioact. Mater. 26, 231–248. https://doi.org/10.1016/j.bioactmat.2023.02.023 (2023).
Article CAS PubMed PubMed Central Google Scholar
Fey, R. M., Nichols, R. A., Tran, T. T., Vandenbark, A. A. & Kulkarni, R. P. MIF and CD74 as emerging biomarkers for immune checkpoint blockade therapy. Cancers 16, 1773 (2024).
Article CAS PubMed PubMed Central Google Scholar
Zhou, Y., Wei, X., Li, W., Zhang, S. & Zhao, Y. Comprehensive analysis of mitophagy-related subtypes of breast cancer and the association with immune related characteristics. Heliyon 9, e23267. https://doi.org/10.1016/j.heliyon.2023.e23267 (2023).
Article CAS PubMed PubMed Central Google Scholar
Wang, X. et al. Construction of molecular subtype and prognostic model for gastric cancer based on nucleus-encoded mitochondrial genes. Sci. Rep. 14, 28491. https://doi.org/10.1038/s41598-024-78729-0 (2024).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

All authors would like to thank Lisha Huang from the College of Chemistry and Life Sciences, Beijing University of Technology, for providing technical assistance during the course of this study.

Funding

This study was supported by the Naval Logistics Department, Comprehensive Planning Bureau of the Central Military Commission Logistics Support Department (Grant No. BHJ17L018).

Author information

Ziyi Zhanghuang and Fei Xie contributed equally to this work.

Authors and Affiliations

College of Chemistry and Life Science, Beijing University of Technology, Beijing, 100124, China
Ziyi Zhanghuang, Fei Xie & Xuemei Ma
Key Laboratory of Carcinogenesis and Translational Research (Ministry of Education/Beijing), Department of Thoracic Surgery II, Peking University Cancer Hospital & Institute, Beijing, 100142, China
Jinfeng Chen

Authors

Ziyi Zhanghuang
View author publications
Search author on:PubMed Google Scholar
Fei Xie
View author publications
Search author on:PubMed Google Scholar
Xuemei Ma
View author publications
Search author on:PubMed Google Scholar
Jinfeng Chen
View author publications
Search author on:PubMed Google Scholar

Contributions

Z. Z. and F. X. designed the article；Z. Z. compiled ,combined and examined the data. F. X. wrote the paper. Z. Z. , F. X. ,X.M and J.C. revised and finalized the manuscript. All authors agree to be accountable for the content of the work. All authors contributed to the article and approved the submitted version.

Corresponding authors

Correspondence to Xuemei Ma or Jinfeng Chen.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Supplementary Material 2

Supplementary Material 3

Supplementary Material 4

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Zhanghuang, Z., Xie, F., Ma, X. et al. Mitochondria-related gene-based molecular subtypes of lung adenocarcinoma and their prognostic implications. Sci Rep 15, 26577 (2025). https://doi.org/10.1038/s41598-025-07982-8

Download citation

Received: 08 November 2024
Accepted: 18 June 2025
Published: 22 July 2025
Version of record: 22 July 2025
DOI: https://doi.org/10.1038/s41598-025-07982-8