Introduction

Colorectal cancer (CRC) is one of the most prevalent malignancies worldwide, characterized by its high morbidity and mortality rates1,2. Despite the diagnostic and therapeutic strategies having achieved significant advancements in recent years3, the long-time prognosis of CRC patients, especially those with high-risk characteristics, remains unsatisfactory4. It’s important to develop biomarkers which can precisely stratify CRC patients based on their potential malignancy and treatment response5.

Long non-coding RNAs (lncRNAs), a class of transcripts longer than 200 nucleotides while rarely coded protein, have garnered increasing attention for recent cancer research6. LncRNAs played a vital role in various biological processes, including gene regulation, immune modulation, and tumor progression7,8. Previous studies have reported that specific lncRNAs have been implicated in CRC progression by regulating immune cell infiltration9 and promoting tumor metastasis10, underscoring their potential as prognostic biomarkers and therapeutic targets.

Migrasome, a newly discovered type of extracellular vesicle, mediates intercellular communication by transporting bioactive molecules during cell migration11, providing a novel perspective on the mechanisms underlying tumor progression and immune evasion in the tumor microenvironment (TME)12,13. TME has been proved to be essential in shaping CRC progression by regulating immune cell infiltration to affect immunotherapy14,15 and communicating with stromal cells such as cancer-associated fibroblast and macrophages16,17. However, the interplay between migrasome and CRC remained largely unexplored. As key regulators of gene expression, lncRNAs can be stably detected via routine transcriptome sequencing, which allowed preliminary exploration for migrasome functions based on existing data and clinical translation.

In this study, we mainly identified and validated a migrasome-related lncRNA signature for prognosis prediction in CRC, with a particular focus on its implications for tumor immunity and targeted therapy. By elucidating the roles of these lncRNAs in CRC progression, immune regulation, and therapeutic resistance, our findings may pave the way for the development of novel biomarkers for tumor stratification and therapeutic strategies for CRC patients.

Method

Patients and data collection

All the patients enrolled in this study were derive from TCGA COAD and READ cohorts (https://portal.gdc.cancer.gov/), which contained 650 patients with pathologically confirmed colorectal adenocarcinoma. To identify migrasome-related transcriptional variation, the RNA-sequence data and corresponding clinical information of each patient were acquired from TCGA database.

Identification of migrasome-related lncRNAs

The lncRNA expression matrix was screened from the RNA-sequence data based on reference annotation for the human genome from GENCODE. 11 migrasome-related genes were identified from previously published studies (Supplementary Table 1)18,19,20,21,22,23. Then, to identify migrasome-related lncRNAs, correlation analysis was performed for each lncRNA with migrasome-related genes. The lncRNAs with correlation coefficient > 0.6 and P < 0.001 were determined as migrasome-related lncRNAs. Subsequently, we used limma to identify differentially expressed migrasome-related lncRNAs between tumoral and normal samples in TCGA COREAD cohort in R sofware (version 4.0.5). The cutoff was |log2 fold change (FC)|≥ 1.0 and false discovery rate (FDR) < 0.05.

Construction and validation of the migrasome-related lncRNAs prognostic signature

When model training, the TCGA COREAD cohort was divided into training and test sets in a 7:3 ratio, with the training set employed for developing the prognosis prediction model and the testing set utilized to independently assess the prediction performance. To construct a robust signature for predicting prognosis of CRC patients, the differentially expressed migrasome-related lncRNAs were firstly applied into univariate Cox regression analysis and lncRNAs with P < 0.05 were further enrolled into the least absolute shrinkage and selection operator (LASSO) model to reduce overfitting (Supplementary Method). Finally, the independent prognostic indicators were identified for model construction via multivariate Cox regression analysis based on the Akaike information criterion (AIC) value. The prognostic risk score was linearly calculated based on the expression level of lncRNAs and their corresponding coefficients.

The patients were divided into the high-risk and low-risk groups based on the median of risk scores. The Kaplan–Meier analysis was performed to demonstrate the prognostic implications of the developed model based on migrasome-related lncRNAs. Time-dependent ROC curve analysis was also utilized by using “timeROC” R packages to assess the prediction accuracy of the risk score in one, three and five years, respectively.

The predictive nomogram

The clinical characteristics including age, gender, stage was integrated with our risk scores for univariate and multivariate Cox regression analyses to evaluate the independent prognostic ability of each characteristic and construct the multi-parameter predicting model. To facilitate the clinical application of the multi-parameter predicting model we developed, a nomogram was further constructed for prediction of overall survival (OS) of CRC patients. The calibration curves and concordance index (C-index) were performed to assess the robustness of the nomogram.

Functional enrichment analysis

To clarify the genomic variation between patients in the high-risk and low-risk groups, the differentially expression analysis of transcriptome array was performed to screen for differentially expressed genes (DEGs) (|logFC|> 1 and FDR < 0.05). The enrichment analysis for well-known pathways including Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG)24,25,26 based on DEGs were conducted to illustrate the functions and biological pathways linked to the migrasome-related lncRNAs. In addition, the Gene Set enrichment Analysis (GSEA) was performed to further investigate the potential signaling pathways in both groups.

Immune cell infiltration and immunity analysis

As previously reported, migrasome was associated with immune cell infiltration in the tumor microenvironment20,27. Therefore, we also explored the relationship between immune cells compositions and migrasome-related risk scores. The immune cells compositions were determined by CIBERSORT for each patient and the immune-related function of patients in TCGA CRC cohort were quantified via ssGSEA enrichment of well-known pathways (Supplementary Method). The Wilcoxon test was used to analyze the differential expression of immune checkpoint genes between the two groups. In addition, the potential of immune evasion to predict the immune checkpoint blockade (ICB) response was also generated by tumor immune dysfunction and exclusion (TIDE) tool and compared between low-risk and high-risk groups.

Drug sensitivity analysis

Recent studies have proposed various drugs which may exerts potentially antitumor efficacy against CRC including traditional chemotherapy and novel targeted drugs. To assess the predictive ability of our model for drug sensitivity in CRC, the anticancer drug dataset was acquired from Genomics of Drug Sensitivity in Cancer (GDSC, https://www.cancerrxgene. org/) and the relationship of IC50 values for various drugs between two risk groups was calculate using the “oncoPredict” package.

Statistical analysis

Statistical analysis was performed using R sofware (version 4.0.5). P < 0.05 was considered statistically significant.

Results

Identification of migrasome-related differentially expressed lncRNAs

Co-expression analysis revealed 1857 migrasome-related lncRNAs which were highly correlated to the expression level of 10 previously reported migrasome-related genes in 650 CRC samples (Fig. 1A & Supplementary Table 2). Figure 1B presented the differential expression of these migrasome-related lncRNAs between normal and tumor samples. Overall, 463 lncRNAs were up-regulated while 37 lncRNAs were down-regulated in tumors (Fig. 1C & Supplementary Table 3).

Fig. 1
figure 1

Identification of migrasome-related differentially expressed lncRNAs and screening of prognostic lncRNAs (A). Co-expression of lncRNAs that were highly correlated to 10 migrasome-related genes; (B) Volcano plot of 463 up-regulated and 37 down-regulated migrasome-related lncRNAs between normal and tumor samples; (C). Heatmap of differential expressed genes between normal and tumor samples; (D). LASSO coefficient profiles of the ten prognostic LncRNAs; E. Plots of cross-validation partial likelihood deviance of the LASSO model; F. Correlation matrix of the selected 10 prognostic lncRNAs with migrasome-related genes.

To avoid overfitting, 434 migrasome-related lncRNAs which potentially associated with prognosis of CRC were initially screened by the univariable Cox model from the training set. Then we used the LASSO Cox regression model to further select 12 lncRNAs that could predict survival outcomes of CRC more robustly (Fig. 1D, E). Finally, the multivariate Cox regression analysis was performed to avoid covariance of the predicting model and successfully identified 10 prognostic migrasome-related lncRNAs (AC012313.5, AC083967.1, AC019118.1, TNFRSF10A-AS1, ZEB1-AS1, LINC02516, AC114730.3, JARID2-AS1, AC020558.2, and AC016831.4) to build the risk model. Combining the corresponding coefficients, the risk score was computed as follow: AC012313.5 × (-0.99453) + AC083967.1 × (0.72723) + AC019118.1 × (1.21762) + TNFRSF10A-AS1 × (-0.27760) + ZEB1-AS1 × (0.60705) + LINC02516 × (-4.93759) + AC114730.3 × (0.50692) + JARID2-AS1 × (-0.47708) + AC020558.2 × (0.32755) + AC016831.4 × (-2.54820). Figure 1F showed that the selected 10 prognostic lncRNAs were closely related to expression levels of migrasome-related genes.

Development and validation of prognostic signature based on migrasome-related lncRNAs

650 CRC patients were divided into high-risk and low-risk groups according to their respective risk scores. The Principal Component Analysis (PCA) indicated that compared with the whole expression profiles, the selected migrasome-related lncRNAs could better discriminate patients from high-risk and low-risk subsets (Fig. 2A). The Kaplan–Meier analysis showed that patients with higher risk scores had significantly worse overall survival than those with lower risk scores in the training and testing sets (P < 0.001, Fig. 2B, C). The analysis of progression-free survival also confirmed the prognostic efficacy of our risk scores for CRC patients (P < 0.001, Fig. 2D). The risk score distribution map revealed that the risk scores were positively correlated with the mortality rate among CRC patients and the expression profiles of 10 prognostic migrasome-related lncRNAs were various between high-risk and low-risk groups (Fig. 2E). An independent external validation was performed based on the CRC cohort from GEO dataset (GSE72970), which showed that patients in the high-risk group had significantly shorter OS (P < 0.001) compared to the low-risk group (Supplementary Fig. 1A). In addition, we preformed the survival-related ROC curve analysis to assess the sensitivity and specificity of the risk score, which demonstrated that the area under the ROC curves (AUC) of the migrasome-related risk score for 1-, 3- and 5-year OS was 0.700, 0.738, and 0.782, respectively (Fig. 2F). The survival-related ROC curve analysis also achieved a AUC of 0.633 in the GSE72970 cohort (Supplementary Fig. 1B).

Fig. 2
figure 2

Development and validation of the migrasome-related lncRNA prognostic signature (A). PCA analysis of CRC patients based on the whole expression profiles and selected migrasome-related lncRNAs; (B and C). The Kaplan–Meier analysis of overall survival for patients with high risk scores and low risk scores in the training and testing sets; (D). The Kaplan–Meier analysis of progression-free survival for patients with high risk scores and low risk scores; (E). The risk score distribution map and the expression profiles of 10 prognostic migrasome-related lncRNAs among high-risk and low-risk patients; (F). The survival-related ROC curve analysis of patients based on the migrasome-related signature.

Construction of the multimodal model and nomogram for predicting long-term prognosis

Based on the clinical profiles acquired from TCGA COREAD dataset, we first compared the predictive value of the migrasome-related risk score of with classical clinical characteristics. The survival-related ROC curve analysis indicated that the performance of migrasome-related risk score was superior to other clinical factors including age, gender and stage for predicting OS of CRC patients (AUC = 0.782 VS 0.637, 0.492, 0.718), which was further validated by the C-index comparison (Fig. 3A, B).

Fig. 3
figure 3

Construction and validation of the multimodal prognostic model and nomogram (A). The survival-related ROC curve analysis for migrasome-related signature and clinical factors; (B). C-index comparison of the migrasome-related risk score and other clinical factors; (C and D). The Kaplan–Meier analysis of overall survival for patients with high risk scores and low risk scores in the early-onset and later-onset subgroups; (E and F). The Kaplan–Meier analysis of overall survival for patients with high risk scores and low risk scores in the stage I-II and III-IV subgroups; (G). Construction of the nomogram based on the multimodal model; (H). The calibration curves of the nomogram for 1-, 3-, and 5-year OS.

The sub-group survival analysis was also performed to confirm the value of migrasome-related risk score in patients from different clinical background. The result revealed that OS was significantly worse in the low-risk group than the high-risk group in both early-onset (P < 0.001) and later-onset (P < 0.001) CRC patients (Fig. 3C, D). For stage I-II (P < 0.001) or III-IV (P < 0.001) CRC, patients with higher risk scores had significantly worse overall survival than those with lower risk scores (Fig. 3E, F).

To assess the independent predicting value of migrasome-related risk score, we performed the multivariate Cox regression analysis and found that our risk score could effectively predict the survival of CRC patients independent of other clinical characteristics (Table. 1, HR = 1.034, 95% CI 1.018–1.050, P < 0.001). We thereby developed a nomogram based on the multivariate Cox model described above (Fig. 3G). The calibration curves proved the robustness of its consistent predicting value in 1-, 3-, and 5-year OS in CRC patients (Fig. 3H). In addition, the C-index of our nomogram was 0.753 (95% CI, 0.724–0.783) which indicated that the model held a good predictive capacity.

Table. 1 Multivariate cox regression analysis for migrasome-related risk score with other clinical characteristics.

Transcriptional variation comparison and functional enrichment of high-risk and low-risk patients

We performed the differential expression analysis of transcriptome array between high-risk and low-risk groups to explore their genomic variation. 254 up-regulated and 233 down-regulated genes were found in high-risk patients (Fig. 4A, B). For GO enrichment analysis, these DEGs were mainly enriched in signaling receptor activator activity, receptor ligand activity, G protein-coupled receptor binding and endocytic vesicle (Fig. 4C). Enrichment analysis for KEGG pathways indicated that neuroactive ligand-receptor interaction, VEGF signaling pathway and cytokine-cytokine receptor interaction were shown to be important pathways for the tumor progression in patients with high migrasome-related risk scores (Fig. 4D).

Fig. 4
figure 4

Transcriptional variation and functional enrichment analysis of high- and low-risk patients (A). Volcano plot of 254 up-regulated and 233 down-regulated genes between high-risk and low-risk patients; (B). Heatmap of differential expressed genes between high-risk and low-risk patients; C&D. GO and KEGG enrichment analysis of DEGs between high-risk and low-risk patients; E&F. GSEA enrichment analysis of GO pathways for high-risk and low-risk patients; G&H. GSEA enrichment analysis of KEGG pathways for high-risk and low-risk patients.

The GSEA enrichment analysis was further conducted, which showed that nucleosome-related pathways were enriched in low-risk patients, as well as chromatin structural constituents (Fig. 4E, G). For patients with high migrasome-related risk scores, immune-related pathways such as antigen processing and presentation exhibited notable enrichment, which may indicate a close association between migrasome and tumor immunity (Fig. 4F, H).

Immune cell infiltration analysis and immunotherapy efficacy evaluation of the migrasome-related risk score

Considering that migrasome has been previously reported to affect the infiltration and activation of immune cells in the tumor microenvironment, we determine the proportions of 22 types of immune cells and investigated the relationship between immune cell status and migrasome-related risk scores. A significant reduction of infiltrating plasma cells, resting CD4 + memory T cells and activated dendritic cells was observed in the high-risk group (Fig. 5A). As immunotherapy has been increasingly important for clinical application, we further used ssGSEA to quantify the function of immune cells and perform comparison in two groups. We demonstrated that immune scores of HLA and macrophages were significantly higher in the high-risk group (Fig. 5B), which sheds new light for current immunotherapy. The expression levels of 47 immune checkpoint-related genes collected from previously published studies were also compared between patients from the high-risk and low-risk groups, which demonstrated that the risk score had a potential association with 14 key regulator of immune checkpoint including TNF family, CD70 and CD276 (Fig. 5C).

Fig. 5
figure 5

Immune cell infiltration, immunotherapy efficacy, and drug sensitivity analysis (A). Fraction of immune cells in tumor microenvironment for high-risk and low-risk patients; (B). Scores of immune function for high-risk and low-risk patients; (C). The expression levels of immune checkpoint-related genes for high-risk and low-risk patients; (D). TIDE scores for high-risk and low-risk patients; (E-L). The sensitivity for targeted drugs including PLX-4720, JAK-8709, JAK-8517, Irinotecan, ERK-6604, ERK-2440, Epirubicin and Foretinib for high-risk and low-risk patients.

In addition, the sensitivity to immunotherapy and potential of immune escape of CRC patients were evaluated using the TIDE algorithm and compared between patients in the high- and low-risk groups. We found that the TIDE scores were significantly higher in the high-risk group than in the low-risk group (Fig. 5D), which indicated that patients with high migrasome-related risk scores tend to resist immunotherapy. However, whether immunotherapy is better for low-risk CRC patients still needs further exploration.

Drug sensitivity screening for high-risk and low-risk patients

Although adjuvant chemoradiotherapy has been widely used for CRC patients, targeted drugs still faced a significant gap in the treatment of CRC, which hindered precision medicine. It’s essential to develop predicting marker for treatment response of various kinds of targeted drugs, and thus recommend high-sensitivity personalized treatment for each patient. We compared the sensitivity of various targeted drugs of patients in the high-risk and low-risk groups based on the IC50 of each drug. The results showed that patients in the low-risk group were significantly more sensitive to PLX-4720 (B-RafV600E inhibitor), JAK-8517 (JAK/STAT inhibitor), ERK-2440 (ERK pathway inhibitor) and Epirubicin than those in the high-risk group (Fig. 5E–L).

Discussion

In this study, we developed and validated a prognostic signature based on migrasome-related lncRNAs for CRC patients, addressing a critical need for precise prognostic biomarkers and potential therapeutic targets. Our findings highlighted the intricate interplay between migrasome-related lncRNAs, tumor progression, and immune regulation.

The prognostic signature effectively stratified CRC patients into high- and low-risk groups, with significant differences in OS and PFS. This is consistent with prior studies indicating the prognostic value of specific lncRNA signatures in CRC28. For instance, Wang et al. reported a lncRNA-based model predicting the prognosis in colon adenocarcinoma with comparable accuracy29, but their work did not explore the role of migrasome in tumor biology. Our work builds on these findings by focusing on the novel role of migrasome-related lncRNAs and their association with immune features.

The functional enrichment analysis of differentially expressed genes revealed that the variation between high-risk and low-risk patients was mainly enriched in cell communication pathways such as signaling receptor activator activity, endocytic vesicle and cytokine-cytokine receptor interaction. This was in line with previous studies about the importance of migrasome in tumor microenvironment via regulating cell–cell communication30,31. GSEA analysis provided further interpretation that high-risk patients exhibited upregulation of pathways associated with immunity such as antigen processing and presentation, while low-risk patients may achieve better prognosis via nucleosome-related mechanism including chromatin stabilization32,33.

Our analysis of immune cell infiltration revealed reduced levels of plasma cells and activated dendritic cells in high-risk patients. This observation is in line with the findings of Cao et al., who reported that a decrease in dendritic cell activity correlates with inactive response to immunotherapy in tumor microenvironment34. Additionally, the elevated TIDE scores in high-risk patients suggest a greater propensity for immune evasion35, a phenomenon widely observed in advanced CRC stages36. These findings underscore the potential immunosuppressive effects of migrasome-related lncRNAs and their implications for therapy resistance.

Therapeutically, our results demonstrated that low-risk patients are more sensitive to targeted agents such as PLX-4720 and JAK-8517. This aligns with prior studies indicating that patients with active immune microenvironments tend to respond better to targeted therapies37. High-risk patients, in contrast, showed greater resistance to both immunotherapy and certain targeted drugs, potentially due to their immunosuppressive tumor microenvironment38. These insights suggest that migrasome-related lncRNAs could serve as predictive biomarkers for personalized therapy, similar to what has been reported for other lncRNA signatures in recent years39,40.

However, our study has limitations. First, the retrospective design based on TCGA data necessitates prospective validation in independent cohorts. While we identified associations between migrasome-related lncRNAs and immune features, the underlying mechanisms remain speculative. Further experimental studies are required to elucidate how these lncRNAs influence immune cell infiltration and tumor behavior.

Conclusion

This study establishes a novel migrasome-related lncRNA prognostic model for CRC, offering valuable insights into tumor biology and therapeutic responses. By integrating our findings with existing literature, we provide a comprehensive perspective on the potential of migrasome-related lncRNAs as biomarkers and therapeutic targets.