Abstract
Gliomas are the most common and aggressive primary tumors of the central nervous system. Dysregulated transcription factors (TFs) and genes have been implicated in glioma progression, yet these tumors’ overall structure of gene regulatory networks (GRNs) remains undefined. We analyzed transcriptional data from 989 primary gliomas in The Cancer Genome Atlas (TCGA) and the Chinese Glioma Genome Atlas (CGGA) to address this. GRNs were reconstructed using the RTN package which identifies regulons—sets of genes regulated by a common TF based on co-expression and mutual information. Regulon activity was evaluated through Gene Set Enrichment Analysis. Elastic net regularization and Cox regression identified 31 and 32 prognostic genes in the TCGA and CGGA datasets, respectively, with 11 genes overlapping, many of which are associated with neural development and synaptic processes. GAS2L3, HOXD13, and OTP demonstrated the strongest correlations with survival outcomes among these. Single-cell RNA-seq analysis of 201,986 cells revealed distinct expression patterns for these genes in glioma subpopulations, particularly oligoprogenitor cells. This study uncovers key GRNs and prognostic genes in gliomas, offering new insights into tumor biology and potential therapeutic targets.
Similar content being viewed by others
Introduction
Gliomas represent the most common and lethal primary tumors of the central nervous system (CNS)1. These tumors exhibit remarkable molecular and clinical heterogeneity, making them challenging to treat effectively. The latest World Health Organization (WHO) 2021 classification has refined the molecular characterization of gliomas, classifying them into three main categories2,3. In adults, diffuse gliomas are categorized according to the status of isocitrate dehydrogenase (IDH) 1 and 2 genes, as well as the presence of a 1p/19q chromosomal co-deletion: 1) astrocytoma, IDH-mutant; 2) oligodendroglioma, IDH-mutant with 1p/19q co-deletion; and 3) glioblastoma, IDH wild-type. Tumor grading follows a nine-step protocol that evaluates tumor location, histology, and molecular markers4.
The integration of molecular targets into the classification of gliomas has significantly advanced our understanding of their pathogenesis3,5. This refined system facilitates more accurate diagnoses and tailored treatment strategies, which have notably improved survival rates, particularly for WHO grades 02 and 036. Nonetheless, recurrence remains a frequent outcome for these patients, and the prognosis for grade 04 gliomas has remained stagnant over the decades7. Therefore, there is a need for a deeper understanding of the molecular mechanisms underlying glioma development.
Transcriptional regulation plays a crucial role in the biology of gliomas8–10. Alterations in chromatin structure and epigenetic modifications can significantly affect tumor aggressiveness and phenotype, suggesting that disruptions in genetic regulation are central to tumor formation and progression11–13. In this context, investigating gene regulatory networks (GRNs) is essential for identifying and characterizing transcription factors (TFs) along with their target genes14–16. GRNs represent intricate regulatory interactions that control gene expression, dictating cellular fate and response to external signals. A core element of GRNs is the regulon, which refers to a transcription factor and the set of genes it directly regulates. These regulons reflect coordinated regulatory programs that can be inferred from co-expression and mutual information patterns in transcriptomic data.
In gliomas, the dysregulation of GRNs may contribute to tumor heterogeneity, therapy resistance, and progression. Computational inference of GRNs from transcriptomic data provides valuable insights into these regulatory interactions, enabling the identification of key regulators involved in glioma biology.
To reconstruct GRNs, we utilized the RTN package16, a framework specifically designed for regulatory network analysis based on mutual information and transcriptional regulons. RTN employs the ARACNe (Algorithm for the Reconstruction of Accurate Cellular Networks) algorithm to infer TF-target interactions17, followed by bootstrapping and statistical refinement to enhance robustness. This approach allows the identification of master regulators and their downstream targets, uncovering potential molecular drivers of glioma progression.
In light of the pressing need for deeper insights into glioma biology, this study seeks to employ an integrative systems approach to uncover potential mechanisms and molecular targets of prognostic significance. We analyzed two RNA-seq datasets comprising 989 samples from primary glioma patients. We reconstructed gene GRNs for both datasets and identified key prognostic genes, contributing to a deeper understanding of the pathology of these dynamic and challenging diseases.
Results
Reconstruction and activity analysis of gene regulatory networks in glioma
To address the inherent diversity and complexity of gliomas, we reconstructed their GRNs, grouping genes into regulatory units centered around TFs. Using the RTN package, we identified regulons to investigate regulatory mechanisms driving glioma biology.
We utilized two publicly available datasets: The Cancer Genome Atlas (TCGA) and the Chinese Glioma Genome Atlas (CGGA)18. The TCGA dataset comprised samples reclassified according to the 2021 WHO glioma classification, whereas the CGGA dataset followed the 2016 WHO classification due to the lack of updated annotations.
Following GRN reconstruction, we evaluated regulon activity using two-tailed Gene Set Enrichment Analysis (GSEA), enabling the assessment of regulatory directionality in both TCGA and CGGA datasets (Fig. 1a). This analysis facilitated the assignment of regulon activity scores to individual samples, providing a quantitative measure of their functional roles in glioma progression.
Least absolute shrinkage and selection operator (LASSO) analysis of regulons from TCGA and CGGA: (a) Schematic workflow. (b) Cross-validation results demonstrating the performance of LASSO for variable coefficient selection. The plot illustrates the partial likelihood deviance across different values of the regularization parameter (lambda), aiding in the selection of the optimal lambda. (c, d) LASSO coefficients for the identified regulons in the TCGA and CGGA dataset respectively. Each dot represents the magnitude and direction of the effect of each variable on the model. (e) Hierarchical tree-and-leaf representation of the identified regulons; the size of each circle corresponds to the number of genes within each regulon. Green circles indicate regulons derived from TCGA, while orange circles represent those from CGGA. (e) Heatmaps displaying the activity levels of the identified regulons across all samples from CGGA and TCGA.
Survival-associated regulons and their functional roles in glioma
To identify potential regulons associated with survival in glioma samples, we applied the Least Absolute Shrinkage and Selection Operator (LASSO) method in conjunction with Cox regression, using age and tumor grade as covariates (Fig. 1b; Supplementary Fig. 1a and 1b; Supplementary Table 1). This analysis revealed 22 regulons (FOXM1, DMRTA2, GPBP1L1, GSX2, ZNF675, SOX10, SATB2, SHOX2, POU4F1, NHLH2, NEUROG1, ZNF501, ZIM2, EBF3, ZNF474, ZNF607, SPZ1, TP63, ZNF683, FOXG1, GATA5, and HNF4A) with non-zero coefficients in the CGGA (Fig. 1c) dataset and 28 (ATF6B, DBP, DMRTA2, GATA4, GLI4, GRHL3, HOXA2, IRX5, KIN, MEIS1, NEUROG3, NFX1, OTP, POU3F1, SETBP1, SNAI3, SOX10, TBPL1, THAP1, TRPS1, XPA, ZNF20, ZNF333, ZNF423, ZNF736, ZNF845, ZNF91 and ZSCAN32.) in the TCGA dataset (Fig. 1d), suggesting their potential prognostic relevance.
Further analysis using RTN revealed that several regulons displayed similarities in network clusterization, as shown in the hierarchical tree and leaf network representation. Despite minimal overlap between the regulons identified in the two datasets, with only SOX10 common to both (Fig. 1e), the tree-and-leaf representation highlighted shared network similarities across datasets. Interestingly, these shared clusters corresponded to distinct transcription factors (TFs), underscoring potential functional convergence among different regulons.
Hierarchical cluster analysis of regulon activity levels in the TCGA and CGGA datasets, arranged from top to bottom, distinctly identified a cluster corresponding to WHO grade 04 samples, alongside other samples associated with poor overall survival (Fig. 1f). Despite differences in the identified regulons, we focused on the genes associated with each regulon, as their analysis provided deeper insights into the underlying regulatory networks while mitigating batch effects.
Using gene lists extracted from the RTN results, we visualized the associated gene networks, identifying 1700 genes across the 28 TCGA regulons and 1083 genes across the 22 CGGA regulons (Supplementary Figure 1c). While the regulons differed between datasets, enrichment analysis of the associated gene sets revealed shared biological processes (BPs) across regulons in both datasets. For instance, the DMRTA2 and SHOX2 regulons in the CGGA shared biological processes with the IRX5 regulon in the TCGA dataset, primarily related to synaptic signaling and cell proliferation (Fig. 2a; Supplementary Table 2).
Characterization of genes within the regulons: (a) Functional enrichment analysis of all identified regulons from CGGA and TCGA, highlighting the biological processes significantly associated with the gene sets. (b) Venn diagram illustrating the overlap of genes within prognostically relevant regulons from CGGA and TCGA. The diagram indicates the number of unique and shared genes. (c, d) Cross-validation results for Elastic Net variable coefficient screening, demonstrating the stability and performance of selected variables across different parameter settings. The plots shows partial likelihood deviance as a function of the regularization parameters, aiding in the identification of the optimal model, significant variables coefficient values and elastic net coefficients for the identified regulons in the CGGA and TCGA dataset respectively. Each dot represents the magnitude and direction of the effect of each variable on the model.
Additionally, individual regulons specific to either TCGA or CGGA highlighted key aspects of glioma biology, including immune infiltration, apoptosis, and protein metabolism. These findings underscore the ability of regulons to capture critical prognostic processes, as previously described19–23.
To further evaluate the relationship between these genes and survival outcomes, we focused on the 162 genes common to both datasets (Fig. 2b). Enrichment analysis revealed that these genes were significantly associated with pathways related to synaptic signaling, embryonic development, and cell division (Supplementary Figure 2a: Supplementary Table 3). These findings strengthen the hypothesis that synaptic integration plays a pivotal role in glioma development and underscore the potential involvement of glioma stem cells in these regulatory networks24–26.
Prognostic significance of common genes identified across TCGA and CGGA
To identify potential prognostic variables, we performed elastic net regularization combined with Cox regression analysis, focusing on the 162 genes common to both the TCGA and CGGA datasets. This integrative approach yielded 31 genes from the TCGA dataset and 32 genes from the CGGA dataset, each with significant positive or negative coefficient values (Fig. 2c,d; Supplementary Table 4).
In the CGGA dataset, genes with positive and negative coefficients included GAS2L3, HOXA3, OTP, TXLNB, HOXD13, IRX1, KIF14, SLC2A10, SSH3, MOXD1, SOX10, HDAC4, MADD, GFRA1, ALDH5A1, CSTB, WBP1L, GABRB3, GBF1, COPZ2, CRY2, CRTAC1, SLC1A4, GNL1, ARPP21, KCNJ11, SLITRK5, RNF157-AS1, GNAL, BICC1, LRRTM4, and LCE1C (Fig. 2c). Similarly, in the TCGA dataset, genes with positive and negative coefficients included PLEK2, ISG20, GAS2L3, OTP, TXLNB, ATP10B, HOXD13, IRX1, EFEMP2, TKTL1, EMP3, ERBB3, PLAT, TNR, NSG2, GABRB3, NRG3, DGCR2, LUZP2, CRY2, MRPS16, CRTAC1, GNL1, TEF, RNF180, SLITRK5, CTBP2, GRID1, GRHPR, LCE1C, and CTB-1I21.1 (Fig. 2d). Among these genes, 11 were common to both datasets: CRTAC1, GABRB3, GNL1, CRY2, SLITRK5, LCE1C, GAS2L3, HOXD13, TXLNB, OTP, and IRX1 (Fig. 3a).
Common prognostic genes. (a) Upset plot illustrating the overlap of genes identified by elastic net in the TCGA and CGGA datasets. (b) Hazard ratios for the eleven common prognostic genes, adjusted for tumor grade and patient age. The plot presents the hazard ratios with corresponding confidence intervals. (c, d) Survival curves of CGGA and TCGA respectively, stratified by the maximization of the log-rank test statistic, demonstrating the differential survival probabilities of patients with high versus low expression levels. The curves are accompanied by log-rank test p-values to assess the statistical significance of the observed differences. Benjamini–Hochberg (BH) was used to adjust for multiple hypothesis.
To further explore the prognostic significance of the 11 candidate genes, we conducted individual evaluations using a Cox proportional hazards model, adjusting for tumor grade and age as covariates. Of these genes, GAS2L3, TXLNB, HOXD13, OTP, and IRX1 had positive coefficients in both datasets, while LCE1C, CRTAC1, GNL1, SLITRK5, and CRY2 showed negative coefficients across both datasets (Fig. 3b).
Among these genes, GAS2L3, HOXD13, and OTP emerged as the only ones with significant hazard ratios consistently observed in both datasets. Stratifying samples based on high and low expression of each gene revealed that these three genes effectively distinguished survival outcomes across different tumor grades and datasets (Fig. 3c,d).
Survival analysis revealed significant differences between high- and low-expression groups, with these analyses performed for the 11 genes described above (Supplementary Fig. 3 and Supplementary Table 5). In the CGGA dataset, GAS2L3 had a pronounced impact on survival probabilities across WHO grades II, III, and IV, with p-values of 0.00039, < 0.0001, and 0.00061, respectively. Similarly, in the TCGA dataset, GAS2L3 demonstrated significant associations with survival outcomes for grades 03 and 04 (p-values of 0.00453 and 0.00283, respectively), while no significant effect was observed for grade 02.
For the HOXD13 gene, survival analysis revealed significance for grades II, III and IV as well in the CGGA dataset (p = 0.00827; p < 0.0001 and 0.00635 respectively), and in the TCGA dataset, only for grades 03 and 04 ( p = 0.00064 and 0.00478 respectively). The OTP gene was significant in grades II and IV in the CGGA dataset (p = 0.00039 and p = 0.0052, respectively), whereas in the TCGA dataset, significance was observed only for grade 04 (p = 0.0348 ).
Functional characterization of prognostic genes in glioma subpopulations
To explore the potential functions and specificities of the targeted genes, we utilized a single-cell RNA sequencing (scRNA-seq) dataset compiled by Abdelfattah et al.27, which includes 201,986 cells from patients with primary and recurrent glioblastoma, as well as grade 02 astrocytoma and oligodendrogliomas, following the workflow illustrated in Fig. 4a. The analysis revealed eight distinct cell clusters: B cells, T cells, endothelial cells (Endo), glioma cells, myeloid cells, oligodendrocytes (Oligo), pericytes, and other cells, as previously determined by Abdelfattah et al. (Fig. 4b).
Gene expression at the single-cell level. (a) Schematic representation of the analysis workflow used to assess gene expression at the single-cell level. (b) UMAP plot displaying the distribution of all 201,989 single cells, with cells color-coded according to their respective labels. (c) Feature plots illustrating the expression levels of the eleven prognostic genes across all cells. Each plot showcases the spatial distribution of gene expression within the cell population. (d) Dotplot showcasing average expression (color gradient) and percentage of cells expressing (dot size) of genes across cell populations.
In addition, we explored the expression of eleven genes associated with patient’s overall survival (OTP, GAS2L3, TXLNB, HOXD13, IRX1, GABRB3, LCE1C, CRTAC1, GNL1, SLITRK5, and CRY2) across cell clusters. Most of these genes showed high expression levels in cluster enriched in glioma cells (GCs), indicating their potential roles in tumor biology. However, GAS2L3, CRY2, and GNL1 exhibited expression across both tumor and non-tumor cells (Fig. 4c,d). To further investigate the roles of these genes within glioma cells, we subsetted the data and re-clustered it (Fig. 5a; Supplementary Fig. 4a and b).
Glioma subcluster exploration. (a) UMAP plot illustrating glioma subclusters identified through clustering analysis. Seven distinct GCs are specified. (b) UMAP plot depicting the cell cycle phases of the cells included in the analysis. Cells are color-coded according to their respective cell cycle phases (G1, S, G2/M). (c) Heatmap displaying the top 5 marker genes for each GC. (d) UMAP plot showcasing the module scores of the four glioma cell states (AC, MES, NPC, OPC). (e) Quadrant plot illustrating the distribution of each GCs across four quadrants, each representing one of the glioma cell states.
Based on this analysis, we identified seven distinct GCs with varying proportions across the tumor types (Supplementary Fig. 4c,d). Notably, GC03 and GC04 were characterized by the presence of dividing cells, indicating active cell proliferation (Fig. 5b). The remaining clusters were further characterized based on their top marker genes, with unique expression profiles visualized in the heatmap (Fig. 5c; Supplementary Table 5).
Evidently, GC03 and GC04 exhibited high expression of proliferative markers, including TOP2A, NUSAP1, BIRC5, PBK, UBE2C, PCLAF, TYMS, CLSPN, MAD2L1, and CENPK, indicating a highly proliferative profile. GC00 was characterized by markers associated with an oligodendrocytic phenotype (OLIG1 and OLIG2), while GC02 displayed a mesenchymal profile, evidenced by the expression of NDRG1, VEGFA, AKAP12, and ADM. GC01 showed specific astrocytic markers, namely AQP4 and GJA1. GC05, although not strongly expressing its top markers, exhibited immune-related characteristics, highlighted by the presence of CG2, CD3D, CD3E, GZMH, and GZMK. Finally, GC06 was defined by markers associated with a neuroprogenitor state, including RND3, ELAVL4, STMN2, CD24, and INSM1.
Additionally, we classified these clusters using the functional modules described by Neftel et al28. With this, we labeled GCs as oligodendrocyte precursor cell (OPC), astrocytic (AC), mesenchymal (MES), and neural precursor cell (NPC) (Fig. 5d). Using the module scores from these four profiles we generated coordinates for each cell score and plotted these in a quadrant plot, for visualization of each cluster’s profile (Fig. 5e). This approach allowed us to see how each cell cluster aligns with different cellular profiles (OPC, AC, MES, and NPC).
Despite some overlap, GC00 distinctly segregated into the OPC quadrant. In contrast, GC01 was more associated with the AC quadrant, and GC02 was situated in the MES quadrant. Furthermore, GC03 and GC04 lacked a specific signature, which was anticipated as they comprised cells in both G2 and S phases. The final cluster, GC06, was well characterized as an NPC cluster, incorporating features from all four cellular profiles (Fig. 5d,e).
We further explored the expression of key genes within the glioma cluster with UMAP visualizations. This method helps to understand which genes are expressed in different cell clusters and their association with specific cellular profiles. We focused on the expression of eleven genes (OTP, GAS2L3, TXLNB, HOXD13, IRX1, GABRB3, LCE1C, CRTAC1, GNL1, SLITRK5, and CRY2) within the glioma cluster (Fig. 6a,b). The OTP expression was predominantly observed in GC06, associated with an NPC profile. In contrast, HOXD13 did not exhibit a specific cluster or profile. GAS2L3 was broadly expressed, with notable concentration in GC03, which consists of cells in the G2 phase without a defined profile. Interestingly, OTP, GAS2L3, TXLNB, and HOXD13 showed low expression in the core of the OPC cluster. Conversely, GABRB3, CRTAC1, SLITRK5, and CRY2 displayed preferential expression in the OPC cluster.
Gene expression across glioma subclusters (GCs). (a) Feature plot displaying the expression levels of eleven selected genes within the glioma subset. Each cell is color-coded according to the expression intensity of the respective gene. (b) Dotplot of the mean and percent expression from the eleven genes across GCs.
Discussion
Our study provides critical insights into the mechanisms driving survival in primary glioma patients, highlighting specific prognostic genes that may serve as valuable references for therapeutic strategy development, such as GAS2L3, HOXD13 and OTP. By analyzing data from TCGA and CGGA, we identified key BPs (e.g. synaptic integration and cell differentiation) likely involved in glioma progression and recurrence, focusing on the relationships between genes, regulons, and biological pathways. This work not only enhances our understanding of glioma but also appoints to a specific group of genes (OTP, GAS2L3, TXLNB, HOXD13, IRX1, GABRB3, LCE1C, CRTAC1, GNL1, SLITRK5, and CRY2) that may impact patients’ survival. These findings underscore the potential for targeted interventions that modulate cellular plasticity, paving the way for improving therapeutic outcomes.
Gliomas are known for their high heterogeneity, both within and between histological grades and subtypes29–33. To address this complexity, we reconstructed the GRN using two distinct datasets, enabling us to account for the variability inherent to these tumors34. Although the regulons themselves did not show direct overlap, we observed a remarkable degree of similarity in the genes of these regulons and the BPs associated with these genes.
The discrepancy between the TCGA and CGGA may be attributed to a potential batch effect, which could introduce technical and biological variability, or it may also arise from populational backgrounds, requiring future investigation. Meanwhile the overlap of the genes within the regulons allowed us to identify the existence of underlying processes that are consistent between the datasets. It is worth noting that the difference in regulons may reflect the complexity of regulatory mechanisms in glioma, while the similarity in regulated genes highlights that networks can achieve the same biological results through different pathways, reinforcing the importance of central gene interactions over specific regulatory differences35.
Reconstructing GRNs from RNA-seq data sourced from TCGA and CGGA enabled us to identify a shared set of 162 genes across both datasets. These genes are involved in critical BPs, including synaptic function, cell cycle regulation, and developmental pathways. These findings underscore the parallels between glioma biology and normal neural embryonic development36–38, where synaptic communication plays a pivotal role in modulating cellular plasticity39,40. The disruption of early synaptic formation may trigger cascading effects on downstream pathways24,41–44, highlighting the central role of synaptic integration in both tumor initiation and progression.
Our detailed analysis of these 162 genes, using elastic net regularization and Cox regression, identified 31 genes associated with survival outcomes in TCGA and 32 in CGGA, with 11 genes common to both analyses. Genes linked to better prognoses, CRTAC1, GABRB3, GNL1, CRY2, and SLITRK5 stand out, whereas GAS2L3, HOXD13, TXLNB, OTP, and IRX1 are associated with poorer survival. From these genes, OTP, HOXD13, IRX1, CRY2, GNL1, SLITRK5, GABRB3 CRTCA1, LCE1C and GAS2L3 have been previously described as potential prognostic or participating in glioma biology45–56, while TXLNB has never been reported in this context. Notably, the shared presence of IRX1, OTP, and HOXD13 within the latter group underscores the critical role of transcriptional regulation in maintaining cellular plasticity47,50,57,58. However, further studies are necessary to elucidate the functional relevance of these candidate genes through validation in independent cohorts and experimental models.
GAS2L3, a key cytoskeletal regulator, plays a pivotal role in microtubule and actin filament crosslinking, which is essential for processes such as chromosome segregation and cell division59,60. Dysregulation of GAS2L3 can disrupt these mechanisms, potentially leading to genomic instability and enhancing the invasive characteristics of glioma cells61,62. Similarly, TXLNB, may be involved in vesicle transport and nerve regeneration, by promoting syntaxin binding, underscoring the interconnected roles of intracellular trafficking and cytoskeletal dynamics63. Together, these genes together, suggest a critical interplay between vesicle transport, and cytoskeletal dynamics in promoting glioma cell adaptability and aggressiveness.
Furthermore, our individual gene analyses, adjusted for age and tumor grade, indicated that HOXD13, GAS2L3, and OTP significantly stratifying survival groups, emphasizing their previously reported relevance as prognostic molecules45,48,49,51. These findings advocate that the expression levels of these genes are strongly associated with glioma aggressiveness and patient prognosis.
To assess gene expression heterogeneity within tumor cells, we examined it at the single-cell level. Our results revealed that while most of these genes were expressed within glioma cells, GAS2L3, GNL1, and CRY2 also exhibit expression in immune cells. These results highlight the potential interactions between glioma and the immune microenvironment, offering insights into the crosstalk that may influence tumor progression27,64–71.
The identification of seven distinct GCs at the single-cell level further elucidates the complexity of glioma biology. Using the cellular state framework established by Neftel et al.28 we accurately characterized these clusters based on astrocytic, mesenchymal, neuroprogenitor, and oligoprogenitor states. Notably, we observed a correlation between the hazard ratios of the identified genes and their expression within the OPC cluster. Genes with hazard ratios indicating a protective effect were predominantly expressed in this cluster, suggesting that the OPC state may confer a more favorable prognosis, potentially due to its association with less aggressive tumor phenotypes72–75.
In conclusion, our findings highlight the key roles of specific genes and processes in glioma biology, providing a foundation for future research into their regulatory mechanisms. We identify intricate relationships between regulons, genes, and biological processes that shape cell profiles, such as NPCs, MES, ACs, and OPCs (Fig. 7). Ten of the eleven genes studied have been previously linked to gliomas, with TXLNB being novel in this context. These genes shed light on crucial aspects of glioma malignancy, including synaptic integration, differentiation, and proliferation. Remarkably, the correlation between gene expression in the OPC cluster and survival outcomes emphasizes the potential of targeting these pathways to improve therapeutic strategies and patient prognosis.
Methods
Transcriptome datasets
Transcriptional data from 989 primary gliomas were obtained from TCGA (https://tcga-data.nci.nih.gov/tcga/) and the CGGA (http://www.cgga.org.cn/)18. From the TCGA, we selected the Lower Grade Glioma and Glioblastoma (GBMLGG) project, which comprises 661 primary tumor samples, from wich 567 had the reclassification made available by Zakharova et al. This project gathers data from both the Glioblastoma and Lower Grade Glioma studies. The samples obtained from tumor biopsies are categorized into grades 02, 03, and 04 based on the 2021 WHO classification criteria76. Sequencing was performed using the Illumina HiSeq platform and clinical data was retrieved for each sample.
In the CGGA dataset, we specifically selected the "mRNAseq_693 (batch 1)" dataset, which encompasses 693 tumor biopsy samples of gliomas18,77. To align with the restriction of primary tumors in TCGA, we applied a filter to the mRNAseq_693 dataset, resulting in 422 primary tumor samples spanning grades II, III, and IV. These samples were classified according to the WHO 2016 classification criteria and were also sequenced using the Illumina HiSeq platform.
Reconstruction of gene regulatory networks
To assess gene transcriptional regulatory networks, we utilized the RTN package from Bioconductor, R version 4.3.2 and RStudio78,79. First, we normalized the data to counts per million (CPM) using the EdgeR package80–82. Gene regulatory networks were constructed based on the normalized expression data from both TCGA and CGGA datasets individually. This process yielded a network of regulons, where each regulon consists of a TF and the genes it regulates, with the names of the regulons corresponding to those of the TFs. Following the reconstruction, we assessed regulon activity using the RTN package, performing a Two-Tailed GSEA2 for each regulon to provide insights into the expression of genes within the regulon for individual samples. Regulons network was visualized with RedeR package and support of igraph83,84.
Identification of prognostic regulons via LASSO regularization
To identify significant predictors associated with survival outcomes, we employed a LASSO Cox regression model using regulon activity. The glmnet package in R facilitated fitting the LASSO Cox regression model with cross-validation (cv.glmnet)85,86. The family argument was specified as “cox” to indicate Cox regression. Cross-validation was conducted to select the optimal value of the penalty parameter (lambda) for the LASSO regression model. The cv.glmnet function determined the optimal lambda (lambda_optimal) through cross-validation, identifying the model with the minimum error. We extracted the coefficients of the model corresponding to the optimal lambda. These coefficients represent the strength and direction of the association between predictor variables and survival outcomes. The final regression model was fitted using the optimal lambda (lambda_optimal), utilizing the glmnet function to specify the penalty parameter derived from the cross-validation process. Visualization was performed using ggplot2 package.
Functional enrichment analysis
To visualize the genes in the selected regulons, we conducted functional enrichment analysis focusing on common gene clusters in the R environment using the “ClusterProfiler” package87. We identified and filtered enriched terms in the Gene Ontology database, separating analyses by database and regulon group, applying a statistical criterion of adjusted p-value < 0.05. The results were visually presented through dot plots, offering an intuitive graphical representation of the enriched BPs associated with the analyzed gene groups.
Prognostic gene detection via elastic net regularization
We employed Elastic Net regression, a hybrid of LASSO and Ridge regression techniques, to identify significant predictors associated with glioma patient survival. The glmnet package in R facilitated the fitting of the Elastic Net regression model with cross-validation, utilizing the cv.glmnet function85,86. This approach allowed for simultaneous variable selection and regularization, enabling the identification of relevant features while mitigating potential issues such as multicollinearity. The optimal values of the regularization parameters (alpha and lambda) were determined through cross-validation to balance model complexity and predictive performance. The resulting coefficients from the Elastic Net regression model were interpreted to discern the strength and direction of the association between predictor variables and survival outcomes.
Evaluating gene prognostic values
We applied a proportional hazards regression model to associate gene expression with survival outcomes and time. The R package ‘survival’ was utilized, specifically the coxph function, which fits the regression model to the survival object88. This model allows for the determination of the influence of variables on survival outcomes over time, providing hazard ratio values with confidence intervals (CIs) for each variable. The survminer package was used to plot Kaplan Meyer curves, cutoff points were determined by surv_cutpoint() function89,90. Benjamini–Hochberg (BH) was used to adjust for multiple hypothesis.
Single-cell RNA-seq analysis
Data for single-cell analysis was obtained from the Broad Institute Single Cell Platform, specifically from the study by Abdelfattah et al27. The dataset comprises 201,986 cells, and we utilized the Seurat package for basic preprocessing, following the analytic pipeline described by Abdelfattah et al. Cells with mitochondrial content exceeding 20% and ribosomal content exceeding 50% were excluded. Harmony software was employed to correct batch effects related to patient and sex using the runHarmony() function91.
We utilized the classifications provided by Abdelfattah et al. to subset only glioma cells. Clustering and normalization were conducted on the glioma subset, clustree was employed, and clusters with less than 1000 cells and no spatial distinction were excluded92. The cell profiles were evaluated using AddModuleScore() from Seurat, and for visualization of each cluster’s profile, the scores were transformed into coordinates and plotted in a quadrant plot using ggplot2.
Data availability
All data used in this study are publicly available. RNA-seq data from TCGA can be accessed through the TCGA Data Portal (https://tcga-data.nci.nih.gov/tcga/). CGGA data can be downloaded from the CGGA Portal (http://www.cgga.org.cn/). Single-cell data were obtained from the Broad Institute Single Cell Portal (https://singlecell.broadinstitute.org/single_cell).
References
Ostrom, Q. T. et al. CBTRUS statistical report: Primary brain and other central nervous system tumors diagnosed in the United States in 2009–2013. Neuro Oncology 18, v1–v75 (2016).
Mathur, R. et al. Glioblastoma evolution and heterogeneity from a 3D whole-tumor perspective. Cell 187, 446–463 (2024).
Torp, S. H., Solheim, O. & Skjulsvik, A. J. The WHO 2021 Classification of Central Nervous System tumours: A practical update on what neurosurgeons need to know—A minireview. Acta Neurochir. 164, 66 (2022).
Louis, D. N. et al. The 2021 WHO classification of tumors of the central nervous system: A summary. Neuro Oncology 23, 1231–1251 (2021).
Antonelli, M. & Poliani, P. L. Adult type diffuse gliomas in the new 2021 WHO classification. Pathologica https://doi.org/10.32074/1591-951X-823 (2022).
Smith, H. L., Wadhwani, N. & Horbinski, C. Major features of the 2021 WHO classification of CNS tumors. Neurotherapeutics https://doi.org/10.1007/s13311-022-01249-0 (2022).
Śledzińska, P., Bebyn, M., Furtak, J., Koper, A. & Koper, K. Current and promising treatment strategies in glioma. Rev. Neurosci. 34, 483–516 (2023).
Wang, Y. & Wu, Z. The clinical significance and transcription regulation of a DNA damage repair gene, SMC4, in low-grade glioma via integrated bioinformatic analysis. Front. Oncol. 11, 761693 (2021).
Giannopoulou, A. I., Kanakoglou, D. S. & Piperi, C. Transcription factors with targeting potential in gliomas. Int. J. Mol. Sci. https://doi.org/10.3390/ijms23073720 (2022).
Gupta, M. K. et al. Altered transcriptional regulatory proteins in glioblastoma and YBX1 as a potential regulator of tumor invasion. Sci. Rep. 9, 66 (2019).
Phillips, R. E., Soshnev, A. A. & Allis, C. D. Epigenomic reprogramming as a driver of malignant glioma. Cancer Cell https://doi.org/10.1016/j.ccell.2020.08.008 (2020).
Garrett, M. C. et al. Chromatin structure predicts survival in glioma patients. Sci. Rep. 12, 8221 (2022).
Stępniak, K. et al. Mapping chromatin accessibility and active regulatory elements reveals pathological mechanisms in human gliomas. Nat. Commun. 12, 3621 (2021).
Huynh-Thu, V. A. & Sanguinetti, G. Gene regulatory network inference: An introductory survey. In Methods in Molecular Biology 1883 (2019).
Roura, A. J. et al. Regulatory networks driving expression of genes critical for glioblastoma are controlled by the transcription factor c-Jun and the pre-existing epigenetic modifications. Clin. Epigenet. 15, 29 (2023).
Chagas, V. S. et al. RTNduals: An R/bioconductor package for analysis of co-regulation and inference of dual regulons. Bioinformatics 35, 5357–5358 (2019).
Margolin, A. A. et al. ARACNE: An algorithm for the reconstruction of gene regulatory networks in a mammalian cellular context. BMC Bioinform. 7, 1–15 (2006).
Zhao, Z. et al. Chinese Glioma Genome Atlas (CGGA): A comprehensive resource with functional genomic data from chinese glioma patients. Genomics Proteomics Bioinform. 19, 1–12 (2021).
Chang, C. Y. et al. Endoplasmic reticulum stress contributes to indomethacin-induced glioma apoptosis. Int. J. Mol. Sci. 21, 557 (2020).
Ogunrinu, T. A. & Sontheimer, H. Hypoxia increases the dependence of glioma cells on glutathione. J. Biol. Chem. 285, 37716–37724 (2010).
Rocha, C. R. R. et al. Glutathione depletion sensitizes cisplatin- and temozolomide-resistant glioma cells in vitro and in vivo. Cell Death Dis. 5, e1505 (2014).
Wang, F. et al. CDC6 is a prognostic biomarker and correlated with immune infiltrates in glioma. Mol. Cancer https://doi.org/10.1186/s12943-022-01623-8 (2022).
Grabowski, M. M. et al. Immune suppression in gliomas. J. Neuro-Oncol. https://doi.org/10.1007/s11060-020-03483-y (2021).
Venkatesh, H. S. et al. Electrical and synaptic integration of glioma into neural circuits. Nature 573, 539–545 (2019).
Liu, Y. et al. Interactions of glial cells with neuronal synapses, from astrocytes to microglia and oligodendrocyte lineage cells. Glia https://doi.org/10.1002/glia.24343 (2023).
Venkataramani, V. et al. Glutamatergic synaptic input to glioma cells drives brain tumour progression. Nature 573, 532–538 (2019).
Abdelfattah, N. et al. Single-cell analysis of human glioma and immune cells identifies S100A4 as an immunotherapy target. Nat. Commun. 13, 767 (2022).
Neftel, C. et al. An integrative model of cellular states, plasticity, and genetics for glioblastoma. Cell 178, 835–849 (2019).
Richards, L. M. et al. Gradient of developmental and injury response transcriptional states defines functional vulnerabilities underpinning glioblastoma heterogeneity. Nat. Cancer 2, 157–173 (2021).
Dirkse, A. et al. Stem cell-associated heterogeneity in Glioblastoma results from intrinsic tumor plasticity shaped by the microenvironment. Nat. Commun. 10, 1787 (2019).
Nicholson, J. G. & Fine, H. A. Diffuse glioma heterogeneity and its therapeutic implications. Cancer Discov. https://doi.org/10.1158/2159-8290.CD-20-1474 (2021).
Finch, A. et al. Advances in research of adult gliomas. Int. J. Mol. Sci. https://doi.org/10.3390/ijms22020924 (2021).
Markouli, M., Strepkos, D., Papavassiliou, K. A., Papavassiliou, A. G. & Piperi, C. Bivalent genes targeting of glioma heterogeneity and plasticity. Int. J. Mol. Sci. https://doi.org/10.3390/ijms22020540 (2021).
Ke, C. et al. Linking differential radiation responses to glioma heterogeneity. Oncotarget 5, 1657 (2014).
Babal, Y. K., Kandemir, B. & Kurnaz, I. A. Gene regulatory network of ets domain transcription factors in different stages of glioma. J. Pers. Med. 11, 138 (2021).
Curry, R. N. & Glasgow, S. M. The role of neurodevelopmental pathways in brain tumors. Front. Cell Dev. Biol. https://doi.org/10.3389/fcell.2021.659055 (2021).
Lee, S. D., Song, J., LeBlanc, V. G. & Marra, M. A. Integrative multi-omic analysis reveals neurodevelopmental gene dysregulation in CIC-knockout and IDH1-mutant cells. J. Pathol. 256, 297–309 (2022).
Aldaz, P. et al. High SOX9 maintains glioma stem cell activity through a regulatory loop involving STAT3 and PML. Int. J. Mol. Sci. 23, 4511 (2022).
Pan, Y. & Monje, M. Neuron–glial interactions in health and brain cancer. Adv. Biol. https://doi.org/10.1002/adbi.202200122 (2022).
De Silva, M. I., Stringer, B. W. & Bardy, C. Neuronal and tumourigenic boundaries of glioblastoma plasticity. Trends Cancer https://doi.org/10.1016/j.trecan.2022.10.010 (2023).
Fossati, G. et al. Pentraxin 3 regulates synaptic function by inducing AMPA receptor clustering via ECM remodeling and β1-integrin. EMBO J 38, e99529 (2019).
Drexler, R. et al. Epigenetic neural glioblastoma enhances synaptic integration and predicts therapeutic vulnerability. bioRxiv https://doi.org/10.1101/2023.08.04.552017 (2023).
Hua, T. et al. Glioma-neuronal interactions in tumor progression: Mechanism, therapeutic strategies and perspectives (review). Int. J. Oncol. https://doi.org/10.3892/ijo.2022.5394 (2022).
Taylor, K. R. & Monje, M. Neuron–oligodendroglial interactions in health and malignant disease. Nat. Rev. Neurosci. https://doi.org/10.1038/s41583-023-00744-3 (2023).
Trong, P. D. et al. Identification of a prognostic hypoxia-associated gene set in IDH-mutant glioma. Int. J. Mol. Sci. 19, 2903 (2018).
Zhang, P. et al. Clinical significance of Iroquois homeobox gene—IRX1 in human glioma. Mol. Med. Rep. 17, 4651–4656 (2018).
Zhang, P. et al. The genomic organization and function of IRX1 in tumorigenesis and development. Cancer Transl. Med. 3, 6666 (2017).
Zhou, Y. et al. Elevated GAS2L3 expression correlates with poor prognosis in patients with glioma: A study based on bioinformatics and immunohistochemical analysis. Front. Genet. 12, 649270 (2021).
Li, Y. et al. Identification of hoxd10 as a marker of poor prognosis in glioblastoma multiforme. Onco Targets Ther. 14, 5183–5195 (2021).
Zhang, J. et al. A novel miR-7156-3p-HOXD13 axis modulates glioma progression by regulating tumor cell stemness. Int. J. Biol. Sci. 16, 3200 (2020).
Morosini, N. S., Sabedot, T. S., Cazacu, S. L., deCarvalho, A. & Noushmehr, H. P17.12.B HOXD13 gene plays a role as a putative master epigenetic regulator in glioma recurrence and progression. Neuro Oncology 24, ii91 (2022).
Badalotti, R. et al. Gene expression of GABAA receptor subunits and association with patient survival in glioma. Brain Sci. 14, 275 (2024).
Lu, F., Jiang, X., Zheng, G. & Zeng, G. The Prognosis Model of Glioblastoma was Constructed Based on Lactic Acid Metabolism-Related Genes. https://doi.org/10.21203/rs.3.rs-3784359/v1 (2024).
Tu, Z. et al. A novel signature constructed by RNA-binding protein coding genes to improve overall survival prediction of glioma patients. Front. Cell Dev. Biol. 8, 588368 (2021).
Liu, Y. et al. The role of SliTrk5 in central nervous system. BioMed Res. Int. https://doi.org/10.1155/2022/4678026 (2022).
Deng, Z. et al. Late Cornified Envelope Group I, a novel target of p53, regulates PRMT5 activity. Neoplasia 16, 656–664 (2014).
Acampora, D., Postiglione, M. P., Avantaggiato, V., Di Bonito, M. & Simeone, A. The role of Otx and Otp genes in brain development. Int. J. Dev. Biol. 44, 669–678 (2000).
Boulet, A. M. & Capecchi, M. R. Multiple roles of Hoxa11 and Hoxd11 in the formation of the mammalian forelimb zeugopod. Development 131, 299–309 (2004).
Zhang, N. et al. Growth arrest–specific 2 protein family: Structure and function. Cell Prolif. https://doi.org/10.1111/cpr.12934 (2021).
Sharaby, Y. et al. Gas2l3 is essential for brain morphogenesis and development. Dev. Biol. 394, 305–313 (2014).
Wolter, P. et al. GAS2L3, a target gene of the DREAM complex, is required for proper cytokinesis and genomic stability. J. Cell Sci. 125, 2393–2406 (2012).
Fackler, M., Wolter, P. & Gaubatz, S. The GAR domain of GAS2L3 mediates binding to the chromosomal passenger complex and is required for localization of GAS2L3 to the constriction zone during abscission. FEBS J. 281, 2123–2135 (2014).
Gayà-Vidal, M. & Albà, M. M. Uncovering adaptive evolution in the human lineage. BMC Genomics 15, 1–12 (2014).
Ghouzlani, A. et al. Immune checkpoint inhibitors in human glioma microenvironment. Front. Immunol. https://doi.org/10.3389/fimmu.2021.679425 (2021).
Qiu, R., Zhong, Y., Li, Q., Li, Y. & Fan, H. Metabolic remodeling in glioma immune microenvironment: intercellular interactions distinct from peripheral tumors. Front. Cell Dev. Biol. https://doi.org/10.3389/fcell.2021.693215 (2021).
Liu, H. et al. Neutralizing IL-8 potentiates immune checkpoint blockade efficacy for glioma. Cancer Cell 41, 693–710 (2023).
Khalili, N. et al. Radiomics for characterization of the glioma immune microenvironment. npj Precis. Oncol. https://doi.org/10.1038/s41698-023-00413-9 (2023).
Liu, H. et al. The role of lysosomal peptidases in glioma immune escape: Underlying mechanisms and therapeutic strategies. Front. Immunol. https://doi.org/10.3389/fimmu.2023.1154146 (2023).
Robilliard, L. D. et al. Comprehensive assessment of secreted immuno-modulatory cytokines by serum-differentiated and stem-like glioblastoma cells reveals distinct differences between glioblastoma phenotypes. Int. J. Mol. Sci. 23, 14164 (2022).
Yasinjan, F. et al. Immunotherapy: A promising approach for glioma treatment. Front. Immunol. 14, 1255611 (2023).
Hatcher, A. et al. Pathogenesis of peritumoral hyperexcitability in an immunocompetent CRISPR-based glioblastoma model. J. Clin. Investig. 130, 2286–2300 (2020).
Sojka, C. & Sloan, S. A. Gliomas: A reflection of temporal gliogenic principles. Commun. Biol. https://doi.org/10.1038/s42003-024-05833-2 (2024).
Tsankova, N. M. & Canoll, P. Advances in genetic and epigenetic analyses of gliomas: A neuropathological perspective. J. Neuro-Oncol. https://doi.org/10.1007/s11060-014-1499-x (2014).
Verma, R. et al. Olig1/2-expressing intermediate lineage progenitors are predisposed to PTEN/p53-loss–induced gliomagenesis and harbor specific therapeutic vulnerabilities. Cancer Res. 83, 890–905 (2023).
Guo, R. et al. Context-dependent regulation of Notch signaling in glial development and tumorigenesis. Sci. Adv. 9, eadi2167 (2023).
Zakharova, G. et al. Reclassification of TCGA diffuse glioma profiles linked to transcriptomic, epigenetic, genomic and clinical data, according to the 2021 WHO CNS tumor classification. Int. J. Mol. Sci. 24, 157 (2023).
Zhang, K. et al. Clinical management and survival outcomes of patients with different molecular subtypes of diffuse gliomas in China (2011–2017): A multicenter retrospective study from CGGA. Cancer Biol. Med. 19, 1460–1476 (2022).
Castro, M. A. A. et al. Regulators of genetic risk of breast cancer identified by integrative network analysis. Nat. Genet. 48, 12–21 (2015).
Fletcher, M. N. C. et al. Master regulators of FGFR2 signalling and breast cancer risk. Nat. Commun. 4, 2464 (2013).
Chen, Y., Lun, A. T. L. & Smyth, G. K. From reads to genes to pathways: Differential expression analysis of RNA-Seq experiments using Rsubread and the edgeR quasi-likelihood pipeline. F1000Res 5, 1438 (2016).
McCarthy, D. J., Chen, Y. & Smyth, G. K. Differential expression analysis of multifactor RNA-Seq experiments with respect to biological variation. Nucleic Acids Res. 40, 4288–4297 (2012).
Robinson, M. D., McCarthy, D. J. & Smyth, G. K. edgeR: A Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26, 139–140 (2009).
Castro, M. A. A., Wang, X., Fletcher, M. N. C., Meyer, K. B. & Markowetz, F. RedeR: R/Bioconductor package for representing modular structures, nested networks and multiple levels of hierarchical associations. Genome Biol. 13, 1–11 (2012).
Csardi, G. & Nepusz, T. The igraph software package for complex network research. InterJ. Complex Syst. 1695, 1–9 (2006).
Tay, J. K., Narasimhan, B. & Hastie, T. Elastic net regularization paths for all generalized linear models. J. Stat. Softw. 106, 1–31 (2023).
Friedman, J., Hastie, T. & Tibshirani, R. Regularization paths for generalized linear models via coordinate descent. J. Stat. Softw. 33, 1–22 (2010).
Yu, G., Wang, L. G., Han, Y. & He, Q. Y. ClusterProfiler: An R package for comparing biological themes among gene clusters. OMICS 16, 284–287 (2012).
Lin, H. & Zelterman, D. Modeling survival data: Extending the Cox model. Technometrics 44, 85–86 (2002).
Kosinski, A. K. M., Biecek, P. & Fabian, S. Package ‘survminer’. The Comprehensive R Archive Network (2020).
Kassambara, A., Kosinski, M., Biecek, P. & Fabian, S. Package ‘survminer’. In Drawing Survival Curves Using ‘ggplot2’(R package version 0.3.1.). R package version 0.4.3 Preprint at (2017).
Korsunsky, I. et al. Fast, sensitive and accurate integration of single-cell data with Harmony. Nat. Methods 16, 1289–1296 (2019).
Zappia, L. & Oshlack, A. Clustering trees: A visualization for evaluating clusterings at multiple resolutions. Gigascience 7, giy083 (2018).
Acknowledgements
We thank all patients that agreed to participate in the construction of publicly available databases used in this study and all staff that worked and contributed to this open science projects.
Funding
We thank the São Paulo Research Foundation (FAPESP Grants 2020/07069-0 to O.C.M.; 2020/16246-2 and 2023/13356-0 to D.L.M.F.; 2023/12268-0 to A.S.A.; 2023/06086-6 to P.M.B.; 2023/14417-2 to J.N.U.; and 2023/07806-2 to I.S.F.) for financial support. We acknowledge the National Council for Scientific and Technological Development (CNPq) Brazil (Grant 309482/2022-4 to O.C.M. and 310376/2022-0 to M.B) and the Coordination of Superior Level Staff Improvement (CAPES) (Grants 88887.801068/2023-00 to A.L.N., CAPES/PROEX Grant 88887.699840/2022-00 to F.Y.N.V.). This work was also supported by Fundação de Amparo à Pesquisa do Estado do Rio de Janeiro (FAPERJ—E-26/205.530/2022 (284921), E-26/201.322/2022 (272260), E-26/211.648/2021 (269502), and E-26/210.302/2022 (270388) for M.B).
Author information
Authors and Affiliations
Contributions
P.M.B., D.L.M.F., and O.C.M. wrote the manuscript. P.M.B., D.L.M.F., I.S.F., O.C.M., F.Y.N.V., Y.L.G.C., J.N.U., A.L.N., D.G.A., I.A.F.B., R.G.N., H.I.N., L.F.S., R.D H.D.D., G.C.M., M.B. and A.S.A. provided scientific insights. P.M.B., D.L.M.F. and O.C.M., performed data and bioinformatics analyses, while D.L.M.F. and O.C.M. revised and edited the manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Barcelos, P.M., Filgueiras, I.S., Nóbile, A.L. et al. Gene regulatory networks analysis for the discovery of prognostic genes in gliomas. Sci Rep 15, 14034 (2025). https://doi.org/10.1038/s41598-025-98542-7
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-025-98542-7