Introduction

Osteosarcoma, a rare yet aggressive primary malignant bone tumor, predominantly affects pediatric and adolescent populations1,2. Despite advancements in surgical resection, chemotherapy, and immunotherapy, survival outcomes remain poor, largely due to the high incidence of metastasis and chemotherapy resistance3,4. The five-year survival rate for osteosarcoma patients exhibiting chemotherapy resistance or metastasis is a mere 20-30%5,6,7. Although the TNM staging system provides basic prognostic insights8, it has limited capacity to capture the molecular complexity and heterogeneity of osteosarcoma. Consequently, researchers are prioritizing the discovery of novel molecular biomarkers to improve clinical outcome prediction and optimize therapeutic strategies9,10,11.

The treatment landscape for sarcomas, including osteosarcoma, is evolving through the integration of molecular and genomic insights. Immune checkpoint inhibitors, tumor microenvironment-targeted therapies, and combination regimens show promise in sarcoma management; however, their application in osteosarcoma remains exploratory. For example, genetic profiling of uterine leiomyosarcoma has uncovered frequent mutations in tumor suppressor genes such as TP53, RB1, and PTEN, pointing towards potential therapeutic targets12. Nonetheless, adjuvant chemotherapy yields inconsistent results across early-stage uterine leiomyosarcoma subtypes13,14. Similarly, while immune checkpoint inhibitors demonstrate efficacyin advanced patientscancers15,16,17,18,19,20, their role in osteosarcoma requires further validation. However, accurately forecasting prognosis has proved to be challenging due to the variations in the efficacy of treatment approaches21,22,23,24. These findings emphasize the difficulty of establishing universal adjuvant therapies for diverse sarcoma subtypes and accentuate the necessity for individualized approaches grounded in tumor-specific biology. To significantly improve osteosarcoma survival rates, it is important to ascertain innovative biomarkers capable of accurately predicting clinical outcomes and therapeutic responsiveness. Therefore, high-confidence prognostic gene signatures are essential for guiding prognosis and treatment regimens.

As high-throughput sequencing technology advances, researchers are increasingly confronted with vast amounts of biological and clinical data, collectively termed “Big Data”25,26,27. These datasets enable the systematic identification of gene signatures and molecular pathways driving cancer progression28, including osteosarcoma29,30,31. Previous studies have constructed prognostic signatures associated with hypoxia32, apoptosis33, energy metabolism22, and the immune tumor microenvironment (TME)34. Nevertheless, the absence of reliable biomarkers often leads to either overtreatment or undertreatment, causing significant financial burden, severe adverse effects, or accelerated disease progression35,36.

To address this gap, machine learning (ML), a subfield of artificial intelligence, has emerged as a powerful tool for constructing predictive models from large-scale datasets37,38. ML algorithms has proven instrumental in biomedical research, facilitating biomarker discovery, therapeutic target identification, and mechanistic insights across diseases39,40,41. For instance, Zhang et al.42 leveraged ML to develop a six-gene oxidative stress signature for predicting acute myeloid leukemia outcomes. Similarly, recent studies have employed ML to identify diagnostic and prognostic biomarkers in osteosarcoma43,44, such as Shi et al.’s six-gene risk model (MYC, COL13A1, UHRF2, MT1A, ACTB, GBP1)al23.Nevertheless, integrating ML with bioinformatics to link prognostic signatures to diagnostic biomarkers and elucidate their clinical significance remains underexplored in osteosarcoma.

To identify an optimal biomarker, we constructed an ML-derived prognostic signature (MLDPS) by implementing 66 ML algorithm combinations based on consensus prognostic signature genes (CPSGs). The MLDPS demonstrated robust predictive capabilities for overall survival (OS), metastasis, immunotherapy response, and drug treatment efficacy in osteosarcoma cohorts. After integrating key clinicopathological characteristics and 10 established osteosarcoma signatures, MLDPS exhibited consistent and significantly improved predictive performance. Overall, our research provides a valuable resource for guiding early diagnosis, prognostic evaluation, stratified treatment, personalized therapy, and post-treatment monitoring of osteosarcoma in clinical settings.

Materials and methods

Public osteosarcoma dataset collection and preprocessing

The TARGET-osteosarcoma cohort (https://ocg.cancer.gov/programs/target) consists of 85 samples with survival data and follow-up periods exceeding zero. We also acquired the GSE21257 dataset from the Gene Expression Omnibus (https://www.ncbi.nlm.nih.gov/geo/), which includes 53 osteosarcoma samples with survival information and follow-up periods greater than zero for all patients. In total, 138 samples were included in this study, encompassing 14,174 genes from the TARGET and GSE21257 cohorts.

Univariate Cox regression analysis

Univariate Cox regression analysis was conducted on the GSE21257 and TARGET cohorts using intersecting genes. CPSGs were selected for further investigation based on the criteria of P < 0.01 and consistent hazard ratios (HRs) that were either above or below 1 in both cohorts.

Functional enrichment analysis

The CPSGs were used to perform Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analyses. The R package ‘clusterProfiler’ was employed for Gene Set Enrichment Analysis (GSEA) along with the GO and KEGG analyses45,46,47,48,49. Gene Set Variations Analysis (GSVA) was conducted using the R package ‘GSVA’50. The biocarta, KEGG, Reactome, wikipathways, and Hallmark gene sets used in GSVA were downloaded from the Molecular Signatures Database (MSigDB; http://software.broadinstitute.org/gsea/msigdb/index.jsp) version 6.2.

Development of MLDPS

To construct the MLDPS and ensure high accuracy and generalizability, we integrated 10 widely used ML algorithms: gradient boosting machine (GBM), survival support vector machine (Survival-SVM), supervised principal components (SuperPC), ridge regression, partial least squares regression for Cox (plsRcox), elastic network (Enet), random forest (RSF), least absolute shrinkage and selection operator (LASSO), CoxBoost, and Stepwise Cox. Notably, RSF, LASSO, CoxBoost, and Stepwise Cox effectively reduce dimensions and screen variables.

The development pipeline of the MLDPS is outlined as follows:

  • Univariate Cox analysis was performed on the merged cohort (created by removing batch effects and redundancies from the TARGET and GSE21257 cohorts) to identify CPSGs, defined as genes with a significance level of P < 0.01 and consistent HR orientation across the two cohorts.

  • Ten ML algorithms were used to create 66 algorithms combinations to develop the most predictive MLDPS with superior C-index performance.

  • After establishing the model on the training cohort, we evaluated it on two separate testing cohorts. The average C-index was calculated for each model, with the model exhibiting the highest value being deemed optimal.

Prognostic value of MLDPS and potential clinical application

Patients from the TARGET, GSE21257, and merged cohorts were stratified into high- and low-risk score groups based on the median MLDPS value. The prediction accuracy of the MLDPS was assessed using receiver operating characteristic (ROC) curves and by calculating the time-dependent area under the ROC curve (AUC). The prognostic significance of the MLDPS was evaluated using Kaplan–Meier (KM) curves.

Collection and calculation of osteosarcoma’s published signatures

Numerous prognostic signatures have been developed for osteosarcoma, focusing on stratified management and precise treatment, including those based on glycolysis- and cholesterol synthesis–related genes22, M1 macrophage–related genes24, and apoptosis–associated genes33. To compare the predictive efficacy of the MLDPS with established prognostic signatures, we systematically reviewed 10 published articles on prognostic models. Subsequently, we computed the risk scores for each signature in the three cohorts using the genes and coefficients outlined in the respective articles. In addition, we conducted a comparative analysis of key clinical characteristics, such as gender, age, and metastasis, between the high- and low-risk score groups in the merged cohort.

Assessment of immune infiltrations

Immune infiltrations were evaluated in the merged cohort using the R ‘IOBR’ package51 and various algorithms, including MCPcounter52, EPIC53, xCell54, CIBERSORT55, IPS, quanTIseq56, ESTIMATE57, and TIMER58. We compared microenvironment-related scores, such as ESTIMATEScore, ImmuneScore, MicroenvironmentScore, and StromalScore, between the high- and low-risk score groups in the merged cohort. The correlation between the MLDPS risk score and chemokines, as well as chemokine receptors, in the combined cohort was determined utilizing the Pearson method to calculate the correlation coefficient and P-value. Correlation coefficients greater than 0 signify a positive correlation, while those less than 0 indicate a negative correlation. A P-value below 0.05 was considered statistically significant.

Response to immunotherapy

Tumor Immune Dysfunction and Exclusion (TIDE) (http://tide.dfci.harvard.edu/)59 is an integrated web tool used to forecast patients’ responses to immunotherapy. TIDE scores were calculated for each patient in the merged cohort. Subsequently, immunotherapeutic responses, represented by dysfunction scores and TIDE scores, were compared between high- and low-risk score groups.

Prediction of drug sensitivity

To further compare differences between high- and low-risk score groups, drug sensitivity analysis was performed to calculate the IC50 values of drugs for each sample using the ‘oncoPredict’ software package. Higher IC50 values indicate decreased sensitivity to treatment.

Statistical analysis

All statistical analyses were conducted using R software (version 4.2.2). AUC analysis was performed using the ‘timeROC’ package. The KM method, combined with the log-rank test, was employed to assess differences in survival rates among various patient subgroups.

Results

Development of osteosarcoma CPSGs using univariate Cox regression

Prior to biological data analysis, the collected datasets were assessed for batch effects, which were found to be present in both datasets (Fig. 1A). To address this, the datasets were merged for downstream analysis, using the R packages ‘limma’ and “sva” to remove batch effects and reduce dataset size (Fig. 1B). Next, CPSGs were screened from the merged dataset consisting of 138 samples and 14,174 genes using univariate Cox regression analysis. The robust MLDPS was subsequently constructed using the 14 CPSGs, which showed consistent prognostic outcomes in both the GSE21257 and TARGET cohorts (Fig. 1C). The 14 CPSGs comprised 12 risky genes (P-value < 0.01, HR > 1) and two protective genes (P-value < 0.01, HR < 1) (Fig. 1C).

Fig. 1
figure 1

Removal of batch effects and univariate regression analysis in osteosarcoma. (A) Principle-component Analysis (PCA) plot before batch effect removal. (B) PCA plot after batch effect removal. (C) Univariate Cox regression analysis results for 14 CPSGs in the GSE21257 and TARGET cohorts.

GO enrichment analysis revealed that the 14 CPSGs were predominantly enriched in the biological process (BP) of “pattern specification process” (Fig. 2A), the cell component (CC) of “mitochondrial inner membrane” (Fig. 2B), and the molecular function (MF) of “transcription coactivator binding,” “telomeric DNA binding,” and “telomerase RNA binding” (Fig. 2C). KEGG analysis identified the “Arginine and proline metabolism” as the most significant signaling pathway (Fig. 2D). These findings lay a groundwork for deeper investigations into the mechanisms of osteosarcoma and uncover pivotal biological processes and pathways that may influence its prognosis.

Fig. 2
figure 2

GO and KEGG pathway analyse of CPSGs. The GO enrichment analysis of 14 CPSGs. The enriched BPs (A), CCs (B), and MFs (C) are noted. (D) The KEGG pathway enrichment analysis of 14 CPSGs.

Construction of MLDPS

Risk prediction models were developed in the TARGET training cohort using a combination of 10 well-established ML algorithms. The predictive performance of all models was evaluated by calculating the average C-index across the TARGET, GSE21257 and merged cohorts. The optimal model (MLDPS) was built by selecting the combination of LASSO and RSF with the highest average C-index (0.866) (Fig. 3A). The LASSO algorithm effectively identified the most critical CPSGs, while the RSF algorithm filtered the most valuable model. Finally, seven potential gene signatures (CTNNBIP1, CORT, DLX2, TERT, BBS4, SLC7A1, and NKX2-3) were identified, as illustrated in Fig. 3B.

Fig. 3
figure 3

Construction and evaluation of the MLDPS effectiveness. (A) The C-indexes values of 10 well-established ML algorithm combinations across three cohorts. (B) Seven CPSGs selected for the final model using the LASSO algorithm. (C) Time-dependent ROC analysis for predicting OS in the TARGET cohort. (D) AUC values of the time-dependent ROC curves in the TARGET cohort. (E) KM curves for OS in the TARGET cohort.

Robust prediction performance of MLDPS

To estimate the discriminatory capability and clinical utility of the MLDPS, we plotted ROC and survival curves. As shown in Fig. 3C and D, the AUCs for OS in the TARGET cohort at 1, 3, and 5 years were 0.949, 0.957, and 0.963, respectively. Similarly, the model exhibited robust performance in the GSE21257 cohort, achieving AUCs of 0.954, 0.869, and 0.852 (Fig. 4A and B). The merged cohort had AUCs of 0.935, 0.913, and 0.912 (Fig. 4D and E). These findings indicate the robust prognostic capability of the MLDPS model for osteosarcoma patients, with AUCs ranging from 0.852 to 0.963 for OS at 1, 3, and 5 years across all three cohorts. Furthermore, KM analysis confirmed that the MLDPS could effectively stratify osteosarcoma patients into distinct high- and low-risk score groups. Patients in the high-risk score group, exhibited significantly worse survival outcomes compared to the low-risk score group across all three cohorts (P < 0.0001 for all cohorts) (refer to Figs. 3E and 4C, and Fig. 4F).

Fig. 4
figure 4

Prediction performance of MLDPS in the GSE21257 and merged cohorts. Time-dependent ROC analysis for predicting OS in the (A) GSE21257 and (D) merged cohorts. AUC values of the time-dependent ROC curves in the (B) GSE21257 and (E) merged cohorts. KM curves for OS in the (C) GSE21257 and (F) merged cohorts.

Re-evaluation of 10 previously published signatures in osteosarcoma

The expeditious advancement of high-throughput sequencing has boosted the stratified management and precise therapeutic approaches for tumors60. Recently, numerous prognostic signatures for osteosarcoma have been developed using ML algorithms, including RSF and CoxBoost, leveraging extensive and reliable datasets23,61,62. A total of 10 prognostic signatures were collected from published sources to perform a comparative assessment of the MLDPS’ predictive efficacy relative to these models. The evaluation was carried out using the C-index metric across three distinct cohorts. These results demonstrated that the MLDPS achieved significantly higher predictive accuracy than the other models across all three cohorts, consistently ranking first (Fig. 5). This underscores the robustness and reliability of the MLDPS. Notably, certain prognostic signatures, such as those reported in PMID3489417723, PMID372342549, and PMID3433707532, exhibited higher C-index values in one or two cohorts but performed poorly in the others. This discrepancy may stem from impaired generalization ability caused by overfitting, as plotted in Fig. 5.

Fig. 5
figure 5

Comparisons of C-indexes values between MLDPS and 10 expression-based signatures. C-indexes values of MLDPS and 10 published signatures in the TARGET, GSE21257, and merged cohorts. *P < 0.05, **P < 0.01, ***P < 0.001, and ****P < 0.0001.

The clinical signature of MLDPS

Univariate Cox regression analysis revealed that the MLDPS, published signatures (excluding PMID3559838233), and metastasis were statistically significant within the merged cohort (Fig. 6A). However, no statistically significant differences were observed for gender or age (Fig. 6A). Further analysis was conducted to compare OS and metastasis between high- and low-risk score groups. Compared to the low-risk score group, the high-risk score group exhibited significantly inferior OS (Fig. 6B). Patients in the high-risk score group showed a greater propensity for developing metastasis, potentially contributing to their unfavorable prognosis (Fig. 6C).

Fig. 6
figure 6

The clinical signature of MLDPS and other signatures in the merged cohort. (A) Univariate Cox regression analysis of MLDPS and other signatures in the merged cohort. (B) Comparison of osteosarcoma between the high- and low-risk score groups. Left: Differences in the MLDPS risk scores between the OS groups. Right: Composition percentage of the high- and low-risk score groups in osteosarcoma. (C) Comparison of metastasis between the high- and low-risk score groups. Left: Differences in the MLDPS risk scores between the metastasis groups. Right: Composition percentage of the high- and low-risk score groups in metastasis.

Collectively, the timeROC curve, KM survival analysis, and Cox regression analysis outcomes consistently demonstrated that the MLDPS accurately forecasted the prognosis of osteosarcoma patients across all three cohorts. These findings suggest that the MLDPS holds promise as a valuable tool for clinical application.

The underlying biological mechanisms of MLDPS

Heatmaps (Fig. 7A and B) depicted the top 50 genes showing the most significant positive and negative correlations with the risk score. GO enrichment analysis of co-expressed genes positively associated with the MLDPS exhibited significant enrichment in various BPs, namely “ossification,” “ncRNA processing,” “embryonic organ morphogenesis,” “ribosome biogenesis,” “rRNA metabolic process,” and “rRNA processing” (Fig. 7C). Enrichment was observed in CCs such as “ribosome,” “mitochondrial matrix,” “ribosomal subunit,” “asymmetric synapse,” and “cytosolic ribosome” (Fig. 7D). With respect to MFs, “structural constituent of ribosome” (Fig. 7E) was significantly enriched. Notably, KEGG analysis showed “Ribosome” as the only pathway with significant enrichment (Fig. 7F).

Fig. 7
figure 7

Function and pathway analysis of MLDPS. (A) Heatmap of the top 50 positive correlation genes with the MLDPS risk score; (B) Heatmap of the top 50 negative correlation genes with the MLDPS risk score; The GO enrichment analysis of the top 500 positive correlation genes, including BP (C), CC (D) and MF (E); (F) The KEGG pathways enrichment analysis of the top 500 positive correlation genes.

To understand the biological implications of MLDPS, we performed a GSEA analysis. The findings indicated a positive correlation between a high-risk score and the upregulation of key gene sets involved in translation, rRNA processing in the nucleus and cytosol, the major pathway of rRNA processing in the nucleolus and cytosol, and other biological pathways (Supplementary Fig. 1A). GSVA was utilized to analyze the effects of MLDPS on biological pathways in osteosarcoma. A heatmap was used to visualize the top 20 biological processes with statistically significant differences, ranked in ascending order of their P-values. As depicted in Supplementary Fig. 1B, significant pathway differences were observed between the high- and low-risk score groups. These observations may offer a potential explanation for the unfavorable prognosis associated with the high-risk score group.

Immune landscape of MLDPS

A thorough analysis of the TME in osteosarcoma was conducted using the ‘IOBR’ R package51. The heatmap exhibited variations in the immunological composition between high- and low-risk score groups (Fig. 8A). As shown in Fig. 8B, the high-risk score group demonstrated notably diminished ESTIMATEScore, ImmuneScore, MicroenvironmentScore, and StromalScore, suggesting an immunosuppressive condition.

Fig. 8
figure 8

Landscapes of the TME between the high- and low-risk score groups. (A) Heatmap indicates the difference in the TME between the high- and low-risk score groups. (B) The diversities of the ESTIMATEscore, ImmuneScore, MicroenvironmentScore, and StromalScore between the high- and low-risk score groups. *P < 0.05, **P < 0.01, ***P < 0.001, ****P < 0.0001.

Further analysis identified a negative correlation between the risk score and the gene expression levels of most chemokines and chemokine receptors (Fig. 9A). Specifically, the correlation coefficients between the risk score and the expression levels of CCL8, CSF1, CCR1, CCR2, CCR4, and CCR7 were all below − 0.2, with corresponding P-values less than 0.05 (Fig. 9B).

Fig. 9
figure 9

The correlations between MLDPS and gene expression of chemokines and their receptors. (A) Correlation heatmap between MLDPS and expression of chemokines and their receptors. (B) Correlation analysis of MLDPS and CCL8, CSF1, CCR1, CCR2, CCR4 and CCR7 gene expression. *P < 0.05, **P < 0.01, ***P < 0.001, ****P < 0.0001.

TIDE algorithms, widely used for predicting responses to cancer immunotherapy, indicate suboptimal immunotherapy outcomes when TIDE values are elevated59. In the merged cohort, TIDE algorithms predicted responses to immunotherapy for 138 patients. Of these, 46 were identified as responders (TIDE score < 0), whereas the remaining 92 were classified as non-responders (Fig. 10A). To evaluate the predictive power of the MLDPS for cancer immunotherapy response, dysfunction and TIDE scores were calculated. Osteosarcoma patients with elevated MLDPS risk scores exhibited decreased dysfunction and TIDE scores, indicating a higher likelihood of responding to immunotherapy in the merged cohort (Fig. 10B − D). Overall, these findings indicat that individuals in the high-risk score group are more likely to benefit from immunotherapy.

Fig. 10
figure 10

Immunotherapeutic response prediction by TIDE. (A) Patients were classified as potential responders or non-responders to immunotherapy based on their TIDE values in the merged cohort. (B) Differences in Dysfunction scores between high- and low-risk score groups (P < 0.001). (C) Percent weight of clinical response to immunotherapeutic in high- and low-risk score groups. (D) Differences in TIDE scores between high- and low-risk score groups (P = 0.02).

Comparative analysis of drug sensitivity between high- and low-risk score groups

Precision medicine necessitates clinicians to promptly recognize patients sensitive to different treatments in order to provide tailored and personalized interventions63,64. Hence, the drug sensitivity between the two risk score groups was evaluated by the ‘oncoPredict’ R package. Drug sensitivity analysis demonstrated a notable discrepancy in the drug response between the two risk score groups. Specifically, the high-risk score group exhibited a correlation with decreased IC50 values for drugs such as ABT737_1910, AZ6102_2109, AZD5153_1706, AZD5991_1720, AZD6738_1917, and BDP.00009066_1866. Conversely, the low-risk score group was correlated with decreased IC50 values for drugs such as AZ960_1250, BMS.754807_2171, Entospletinib_1630, IAP_5620_1428, PF.4708671_1129, and Ribociclib_1632 (Fig. 11). Precise prediction of drug sensitivity may have the potential to enhance patient treatment outcomes.

Fig. 11
figure 11

Association between the risk model and drug sensitivity in osteosarcoma. Drug sensitivity of osteosarcoma patients with high- and low-risk score. *P < 0.05, **P < 0.01, ***P < 0.001.

Discussion

Osteosarcoma is a rare malignant tumor originating in bone tissue that predominantly afflicts adolescents and young adults65,66. Clinicians and researchers face critical challenges due to the lack of readily available biomarkers for screening, stratified treatment, and prognostic follow-up, potentially resulting in inadequate treatment67. To bridge these gaps, novel methodologies are needed to identify osteosarcoma biomarkers that accurately reflect the biological characteristics of the disease.

Recent studies have developed various prognostic signatures for osteosarcoma, many of which focus on specific biological pathways such as glycolysis, ferroptosis, and immunity68,69,70. However, this focused approach neglects crucial biological processes that may also contribute to the oncogenesis and progression of osteosarcoma. Moreover, existing research indicates that researchers tend to select modeling algorithms based on their knowledge limitations and personal preferences23,61,71. To compensate for these shortcomings, we devised an effective computational strategy for identifying optimal prognostic genes and constructed a highly accurate risk prediction model for stratifying risk levels. First, 14 CPSGs were identified by univariate Cox regression analysis from the 14,174 intersection genes in the TARGET and GSE21257 cohorts. Enrichment analysis revealed that the identified CPSGs were mainly enriched in “pattern specification process,” “mitochondrial inner membrane,” “transcription coactivator binding,” “telomeric DNA binding,” “telomerase RNA binding,” and “Arginine and proline metabolism”. These results uncover the biological functions of genes linked to osteosarcoma prognosis and offer valuable insights into their roles in the molecular dynamics of the disease, paving the way for further targeted research.

A total of 66 combinations of 10 widely used ML algorithms were generated to develop prognostic signatures. Among the 10 acknowledged ML algorithms, RSF, LASSO, CoxBoost, and Stepwise Cox possess the capability for variable screening and reduction of data dimensionality. Ultimately, we built the optimal model (MLDPS) by selecting the combination of LASSO and RSF based on their highest average C-index (0.866) across all three cohorts. Overfitting is a common challenge in ML, often causing models to perform well on training dataset but poorly on external validation datasets37,72. After reducing redundant information with LASSO, a seven-gene signature referred to as MLDPS was ultimately derived through RSF.

The timeROC curve analysis, KM survival analysis, and Cox regression analysis consistently indicated that MLDPS accurately and reliably predicted the prognosis of osteosarcoma patients across all three cohorts. The immune-related six-gene signature23, TGFβ-related two-gene signature9, and hypoxia-related four-gene signature32 achieved higher C-index values in one or two cohorts but exhibited suboptimal performance in others, likely due to limited generalization caused by overfitting73. In contrary, our MLDPS consistently ranked first in C-index values and exhibited superior predictive efficacy across all cohorts. Stratification of osteosarcoma patients into high- and low-risk score groups indicated that individuals in the high-risk score group had a notably poorer OS in comparison to those in the low-risk score group. Additionally, patients with high-risk score exhibited an increased likelihood of developing metastasis, contributing to their unfavorable prognosis.

The MLDPS model comprised seven potential gene signatures: CTNNBIP1, CORT, DLX2, TERT, BBS4, SLC7A1, and NKX2-3. The seven novel genes discovered in our study have been validated in previous research as being linked to tumor initiation and progression. CTNNBIP1 has been suggested as a potential tumor suppressor in colorectal74, breast75, cervical76, and gastric cancers77. CORT has been identified as a regulator of the growth and metastasis in lung and thyroid cancers78,79. Additionally, elevated CORT expression has been correlated with a high-risk subgroup of osteosarcoma patients80, consistent with our findings. DLX2 has been identified as an independent prognostic factor for colorectal cancer81. Recent research has provided increasing support for the safety of TERT-targeted immunotherapy82,83. SLC7A1, an arginine transporter, has been suggested to be associated with liver cancer84. NKX2-3 has been confirmed as an oncogenic driver in marginal-zone B-cell lymphomas85. However, the functional roles of the BBS gene remain unclear, nor has it been linked to cancer progression.

Functional and pathway analyses were performed to elucidate the biological mechanisms underlying MLDPS. Enrichment analysis of the co-expressed genes correlated with MLDPS revealed significant enrichment in various biological processes and cellular components, namely “ossification,” “ncRNA processing,” “embryonic organ morphogenesis,” “ribosome biogenesis,” “rRNA metabolic process,” “rRNA processing,” “ribosome,” “mitochondrial matrix,” “ribosomal subunit,” “asymmetric synapse,” “cytosolic ribosome,” “structural constituent of ribosome,” and “Ribosome”. GSEA revealed a positive correlation between the high-risk score group and the upregulation of key gene sets involved in translation, rRNA processing in the nucleus and cytosol, the primary pathway of rRNA processing in the nucleolus and cytosol, and other biological pathways. The activation of the pathways associated with MLDPS was further validated using GSVA. The findings partially elucidate the mechanisms underlying metastasis and poor prognosis in the high-risk score group.

Next, correlation analysis was conducted to explore the differences in the immune landscape between high- and low-risk score groups. The findings of the TME indicated a negative correlation between MLDPS risk scores and ESTIMATEScore, ImmuneScore, MicroenvironmentScore, and StromalScore, and the gene expression levels of many chemokines and chemokine receptors. These observations suggest that MLDPS can differentiate the TME, and individuals in the high-risk score group may experience an immunosuppressive state. However, lower TIDE scores suggested that individuals with high MLDPS risk scores are more likely to respond to immunotherapy. These observations underscore the necessity for further exploration of immunotherapeutic approaches and indicate the existence of unidentified immune evasion mechanisms that warrant investigation86,87. Our findings indicate that MLDPS might contribute to resistance against small-molecule drugs in human cancer cell lines. MLDPS has potential as both a prognostic indicator and a predictor of immunotherapy response, representing a significant advancement in osteosarcoma treatment.

To our knowledge, no prior studies have applied these feature extraction methods in the creation of a prognostic risk model for osteosarcoma. First, in contrast to existing prognostic models for a specific pathway, our model incorporated factors from CPSGs obtained from diverse cohorts, improving the robustness and prognostic relevance of the included genes. Second, to minimize bias from subjective preferences in modeling, we integrated 10 established ML algorithms into 66 combinations and identified the optimal model based on accuracy evaluation. Third, we methodically gathered data from two multicenter cohorts and selected the algorithm with the highest average C-index from the TARGET, GSE21257, and merged cohorts to construct the MLDPS. The seven-gene signature elucidated in this study empowers clinicians to better identify high-risk patients who may benefit from more rigorous surveillance and treatment strategies. By correlating MLDPS scores with drug sensitivity and immune microenvironment characteristics, our model offers valuable insights for personalized treatment strategies. Enrichment analyses illuminate molecular processes involved in osteosarcoma progression, including pathways related to ribosome biogenesis, rRNA processing, and immune suppression. These findings establish a foundation for further mechanistic studies and targeted therapeutic development.

While promising, our study has certain limitations. Despite presenting robust computational findings, experimental validation is necessary to affirm the biological roles of the identified genes in osteosarcoma progression and metastasis. Functional studies employing both in vitro and in vivo models may provide profound insights into the mechanisms underlying the MLDPS. Our results suggest an immunosuppressive tumor microenvironment in patients with high MLDPS scores, yet the exact mechanisms are unknown. Further studies are needed to investigate how these patients evade immune responses and to determine whether combination strategies could overcome immunotherapy resistance. Additionally, despite correction algorithms, the analyzed cohorts exhibited variations in size and sequencing platform. Therefore, larger prospective multicenter cohort studies are warranted to validate the clinical utility of MLDPS.

Over the next five years, we expect major progress in integrating ML with omics data for cancer research, particularly in osteosarcoma. The combination of genomic, transcriptomic, proteomic, and metabolomic data could lead to better prognostic models. With growing access to electronic health records and patient-specific omics data, ML-based tools like MLDPS could enhance clinical decision-making platforms by enabling real-time risk assessment and therapy optimization. As immunotherapy advances, the identification of predictive biomarkers for patient response will gain increasing important. Our findings establish a preliminary framework for linking risk stratification with immunotherapy response, potentially guiding future clinical trials. Collaboration among researchers, clinicians, and bioinformaticians is crucial to tackle osteosarcoma data variability and to create standardized protocols for data processing and model validation.

Conclusion

In summary, leveraging 14 CPSGs from the GSE21257 and TARGET cohorts, we developed and validated a robust MLDPS model based on 66 ML algorithm combinations. The MLDPS exhibited enhanced prediction accuracy for patient prognosis across diverse cohorts and showed significant associations with immunotherapy response and chemotherapy efficacy. Consequently, our MLDPS has significant potential as a tool to guide clinical decision-making and customize treatment strategies for osteosarcoma patients. However, additional research is needed to fully evaluate the capabilities of MLDPS and clarify its clinical relevance.