Abstract
Our objective was to capture subgroups of soft-tissue sarcoma (STS) using handcraft and deep radiomics approaches to understand their relationship with histopathology, gene-expression profiles, and metastatic relapse-free survival (MFS). We included all consecutive adults with newly diagnosed locally advanced STS (N = 225, 120 men, median age: 62 years) managed at our sarcoma reference center between 2008 and 2020, with contrast-enhanced baseline MRI. After MRI postprocessing, segmentation, and reproducibility assessment, 175 handcrafted radiomics features (h-RFs) were calculated. Convolutional autoencoder neural network (CAE) and half-supervised CAE (HSCAE) were trained in repeated cross-validation on representative contrast-enhanced slices to extract 1024 deep radiomics features (d-RFs). Gene-expression levels were calculated following RNA sequencing (RNAseq) of 110 untreated samples from the same cohort. Unsupervised classifications based on h-RFs, CAE, HSCAE, and RNAseq were built. The h-RFs, CAE, and HSCAE grouping were not associated with the transcriptomics groups but with prognostic radiological features known to correlate with lower survivals and higher grade and SARCULATOR groups (a validated prognostic clinical-histological nomogram). HSCAE and h-RF groups were also associated with MFS in multivariable Cox regressions. Combining HSCAE and transcriptomics groups significantly improved the prognostic performances compared to each group alone, according to the concordance index. The combined radiomic-transcriptomic group with worse MFS was characterized by the up-regulation of 707 genes and 292 genesets related to inflammation, hypoxia, apoptosis, and cell differentiation. Overall, subgroups of STS identified on pre-treatment MRI using handcrafted and deep radiomics were associated with meaningful clinical, histological, and radiological characteristics, and could strengthen the prognostic value of transcriptomics signatures.
Similar content being viewed by others
Introduction
Soft-tissue sarcomas (STS) represent prevalent malignant mesenchymal tumors characterized by diverse clinical and radiological presentations, along with distinctive histologic and molecular features, influencing their prognosis1. Contrast-enhanced (CE) MRI stands out as the optimal imaging modality for local staging of locally advanced STS, revealing various radiological phenotypes (radiophenotypes) and substantial intra- and inter-tumoral heterogeneity within and between STS2. A qualitative assessment of these radiophenotypes using conventional MRI sequences has been correlated with the French Federation of Cancer Center (FNCLCC) histologic grading, metastasis-free survival (MFS) and overall survival (OS)3,4, notably peritumoral enhancement after Gadolinium chelates injection, necrotic signal and marked heterogeneous signal intensity on T2-weigted imaging. However, this assessment remains subjective, inadequately reproducible, and incapable of capturing the intricate intra-tumoral patterns present in STS.
Radiomics entails the extensive quantification of the radiophenotype in medical imaging from any modality. It employs mathematical operators to derive numeric data, referred to as radiomics features (RFs), capturing aspects of tumor shape and texture, including three-dimensional rearrangements of gray levels within the tumor5. These RFs are subsequently explored in supervised machine-learning algorithms for predicting various oncologic outcomes or in unsupervised clustering algorithms to unveil hidden patterns within the data. The underlying hypothesis is that radiomics, on a macroscopic scale, reflects the molecular features of cancers, potentially serving as a non-invasive virtual biopsy6. Previous studies have successfully trained radiomics models to predict FNCLCC grade7,8, response to neoadjuvant chemotherapy or radiotherapy9,10,11, and patient survival12,13,14. However, only one exploratory study has directly correlated gene-expression profiles of STS with radiomics on a subset of 21 patients, suggesting links between radiomics clusters and pathways involved in apoptosis, immune infiltrates, and cell proliferation15. Conversely, predictive gene-expression signatures dedicated to STS have never been put in perspective with conventional radiological features or radiomics data16.
Recently, handcrafted RFs (h-RFs) have been complemented with deep RFs (d-RFs) obtained from the last fully connected layer of pre-trained convolutional neural networks. They aim to refine and personalize radiophenotypic quantification, though preliminary findings and concerns about their lack of explainability exist17,18. In particular, convolutional auto-encoder neural networks (CAE) are powerful deep learning techniques that extract the most meaningful features from medical images through encoder–decoder architecture, which compresses medical images into a low dimensional space and then reconstruct them as accurately as possible after retaining the most relevant characteristics (i.e., the d-RFs). Although these networks can remain completely unsupervised, it is also possible to provide output labels to each image to guide the learning process towards specific objectives, such as prognostication in half-supervised CAE (HSCAE). Comparing CAE and HSCAE can help in understanding the benefits of including labeled data (in terms of better image reconstruction, feature extraction, or better classification) and be valuable to estimate the intrinsic information contained in medical images. An objective and larger assessment of the added value and potentiation of h-RF, d-RF (either CAE or HSCAE), and transcriptomics data in STS patients is currently lacking. Additionally, understanding their relationships with semantic radiological features3,4 (i.e., explainable with medical language by radiologists) and the clinical–histological SARCULATOR nomogram of reference would facilitate the acceptance of these complex ‘-omics’ data19.
Therefore, the overarching objective of this study was to conduct an exploratory and comprehensive assessment of the inter-relations and potentialities of multi-omics data in newly diagnosed, locally advanced STS without preconceived notions. Our specific aims were (i) to identify patterns of STS using h-RFs, d-RFs, and transcriptomics data, (ii) to investigate the associations of the corresponding h-RF, d-RF, and transcriptomics groups among themselves with the SARCULATOR predictions, and with semantic radiological features known to correlate with grade and metastatic relapse-free survival (MFS), and (iii) to explore their prognostic value alone and in combination.
Results
Patient characteristics
The study flowchart is shown in Fig. 1. Briefly, out of 829 patients initially identified, 225 were included (median age: 61.2 years, range: 18–95, 120/225 [53.3%] men). Characteristics are presented in Table 1, with the most prevalent histotype being undifferentiated pleomorphic sarcoma (UPS, 71/225 [31.6%]). The majority of STS were strictly deep-seated (136/225 [70.4%]), located in the lower limbs (126/225 [56%]), and had a high (III) histologic grade (116/222 [52.3%], with 3 patients having non-available grade). One hundred and ten patients had sufficient materials for the transcriptomics analysis, whose characteristics are also shown in Table 1.
Comprehensive patient clustering
Unsupervised consensus clustering on h-RFs identified three groups: Ah-RF (71/225, 31.6%), Bh-RF (87/225, 38.7%), and Ch-RF (67/225, 28.9%).
Regarding the development of the CAE and HSCAE neural networks for the extraction of d-RFs, the Training and Testing cohorts only differed regarding tumor size (P = 0.0350) and the number of patients who underwent chemotherapy in addition to surgery (P < 0.0001) (Table 1). After training the two convolutional neural network models, we verified that the image reconstruction error according to mean square error (MSE) remains below 1% in both Training and Testing cohorts (Supplementary Table ST1). The unsupervised clustering developed in the HSCAE and CAE d-RFs from the Training cohort systematically provided two groups, named A and B, with cluster A (i.e., ACAE and AHSCAE) for each classification referring to the most numerous group. We then applied the centroid assignment technique to label the observations from the Testing cohort.
Unsupervised Consensus clustering on the transcriptomics data identified two groups: ARNA (69/110, 62.7%) and BRNA (41/110, 37.3%), which were significantly associated with the histological type (P < 0.0001).
Understanding patient clustering
Significant associations between radiomics-based clustering and clinical, histological, radiological assessment, SARCULATOR groups, and transcriptomics clustering were observed (Table 2). Radiomics clusters were systematically associated with size, SARCULATOR groups, and semantic radiophenotypes (all P-values < 0.001). Moreover, the d-RF clusters (using both the CAE and HSCAE approaches) were consistently associated with FNCLCC grade (P-value range: 0.0168–0.0274), while h-RFs groups were not (P = 0.1559). Thus, the Bh-RF corresponded to STS with the largest size (131 ± 52 mm versus 96 ± 39 mm for Ah-RF and 59 ± 33 mm for Ch-RF—P < 0.0001), the higher rate of low Pr-OS STS (42.5% versus 22.5% for Ah-RF and 12.5% for Bh-RF—P = 0.0001), and high-risk semantic radiophenotype (71.3% versus 53.5% for Ah-RF and 38.8% for Ch-RF, P = 0.0005). Regarding the deep-radiomics clusters, the same trend towards more aggressive presentations was observed for the STS in the ACAE and AHSCAE groups compared to BCAE and BHSCAE groups, respectively (i.e., larger size, lower Pr-OS, and higher rates of high-risk semantic radiophenotype). Moreover, there were significantly more FNCLCC grade III STS in ACAE and AHSCAE groups (58.7% and 58.8%, respectively) compared to the BCAE and BHSCAE groups (41.7% [P = 0.0274] and 41.9% [P = 0.0168], respectively).
The h-RF clustering was also significantly associated with the histological type (P = 0.0101), with a higher proportion of UPS in the Ah-RF group (28/71, 39.4%) compared to the Ch-RF group (16/67, 23.9%). This weak but significant association between imaging patterns and histological types was also observed with the semantic radiophenotypes (P = 0.0005—Chi-square test), again, with higher rates of UPS in the high-risk group (53/126, 42.1%) compared to the low-risk group (18/99, 18.2%).
However, none of the radiomics groups showed an association with the transcriptomics groups (P-value range: 0.1867–0.9942), but there were strong associations within themselves in pairwise comparisons (all P-values < 0.0001). In particular, the CAE and HSCAE grouping were strongly concordant, except for 18/225 (8%) patients, including 10 patients in the ACAE but BHSCAE groups and 8 patients in the BCAE but AHSCAE groups.
Lastly, the transcriptomics groups were significantly associated with the histological types (P < 0.0001) and the FNCLCC grade, with higher rates of grade III STS in the ARNA group compared to the BRNA group (44/67 [65.7%] versus 11/41 [26.8%]—P < 0.0001, 2 out of 110 patients without available grade).
Prognostic value of radiomics groups
Metastatic relapses occurred in 70/225 (31.1%) patients. MFS probability at 2 and 5 years was 78.9% (95%CI: 73.7–84.5) and 66.4% (95%CI: 59.8–73.7), respectively.
Survival analyses for each radiomics cluster (Table 3) revealed worse MFS in univariable analysis, with Bh-RF and AHSCAE groups remaining associated in multivariable analysis (univariable analyses for covariables are provided in Supplementary Table ST2).
Hence, regarding the h-RF clustering, hazard ratios (HRs) were 1.97 (95% confidence interval [CI]: 0.90-4.29, P = 0.0899) for Ah-RF group and 2.84 (95% CI: 1.23-6.57, P = 0.0146) for Bh-RF group, compared to the Ch-RF group.
Regarding d-RF clustering, multivariable analyses provided HR = 1.50 (95% CI: 0.79–2.85, P = 0.2112) for ACAE group compared to BCAE group, and HR = 2.73 (95% CI: 1.37–5.42, P = 0.0043) for AHSCAE group compared to BHSCAE group.
Stepwise Cox regression selected the HSCAE clustering for further analyses in its last step (minimal Akaike information criterion [AIC] = 645.8). Kaplan–Meier curves for h-RF and HSCAE clustering are presented in Fig. 2a, b.
Kaplan–Meier curves (with their risk tables) for (a) unsupervised clustering based on handcrafted radiomics features (h-RF groups), for (b) unsupervised clustering based on the half-supervised convolutional autoencoder neural network deep radiomics features (d-RF HSCAE groups) and for (c) the unsupervised transcriptomics groups (TG). The panel (d) shows the 5-fold cross-validated concordance-indices (c-index, with 95% confidence interval) for Cox regression models based on the h-RF, HSCAE, and transcriptomics groups alone and combined. e, f Creation of a binary hybrid variable accounting for the d-RF HSCAE and transcriptomics groups (exhaustive and binarized). P-values correspond to the Log-rank tests. *P < 0.05, ***P < 0.001.
Complementarity of radiomics and transcriptomics
The complementarity of h-RF clustering and the relevant d-RF clustering (HSCAE) with transcriptomics clustering was assessed in 110 patients having both the radiomics and transcriptomics data.
Transcriptomics clustering was associated with MFS in univariable analysis (HR for ARNA of 2.60, 95% CI: 1.12–6.02, P = 0.0261) (Table 4, Fig. 2c). Bh-RF and AHSCAE groups remained associated with lower MFS in this subcohort (univariable HR = 3.55, 95% CI: 1.29–9.72, P = 0.0140, and HR = 3.90, 95% CI: 1.60–9.52, P = 0.0028, respectively).
In a Cox regression model including the h-RF clustering and the transcriptomics clustering, the ARNA and Bh-RF groups were still associated with lower MFS (HR = 3/08, 95% CI: 1.32-7.17, P = 0.0092, and HR = 4.23, 95% CI: 1.54-11.65, P = 0.0052, respectively).
Similarly, in a Cox regression model including the HSCAE clustering and the transcriptomics clustering, the ARNA and AHSCAE groups remained associated with lower MFS (HR = 2.93, 95% CI: 1.25-6.83, P = 0.0129, and HR = 4.28, 95% CI: 1.74-10.50, P = 0.0015, respectively).
In order to identify the most relevant combination of radiomics and transcriptomics clustering, we then investigated and compared the prognostic value of univariable and combined models using a 5-fold cross-validation scheme. Significant increment in Harrell concordance index (c-index) with radiomics-based clustering combined with transcriptomics clustering was observed (Fig. 2d). Thus, the c-index for the Transcriptomics groups alone was 0.603 (95% CI: 0.574–0.675), versus 0.633 (95% CI: 0.599–0.771) for h-RF group alone and 0.666 (95% CI: 0.643–0.820) for the Transcriptomics × h-RF combined model (P = 0.0380 against Transcriptomics group alone, and P = 0.0469 against h-RF group alone). Regarding deep radiomics, the c-index was 0.709 (95% CI: 0.651–0.788) for the Transcriptomics × HSCAE combined model, which was significantly higher than for the HSCAE model alone (c-index = 0.661, 95% CI: 0.615–0.805, P = 0.0220) and for Transcriptomics group alone (P = 0.0110).
Consequently, we selected the HSCAE and transcriptomics combined model. Kaplan–Meier analysis reveals that the ARNA × AHSCAE combination group has a worse MFS prognosis with respect to all the other combination groups (hereafter called the ‘Other’ group, made of the BRNA × AHSCAE, ARNA × BHSCAE and BRNA × BHSCAE combinations) (Fig. 2e, f). The HR for the ARNA × AHSCAE group against other Other groups was 4.29 (95% CI: 2.07–8.90, P < 0.0001, Table 4). The ARNA × AHSCAE group remains significantly associated with MFS in the most frequent histotype, namely UPS (n = 29, with HR = 8.77, 95% CI: 1.04–73.6, P = 0.0456). We proceeded then with the differential gene expression analysis between the ARNA × AHSCAE group and the Other group.
Lastly, we investigated whether the prognostic performance of the clinical–histological SARCULATOR nomogram would be increased with deep radiomics and transcriptomics. The average c-indices in 5-fold cross-validation increased from 0.698 (95% CI: 0.608–0.787) for the SARCULATOR groups alone to 0.728 (95% CI: 0.650–0.807) for a model combining the SARCULATOR, HSCAE, and transcriptomics groups, despite the difference is not significant (P = 0.1458). In this last Cox model, two characteristics remained independently associated with MFS: the high Pr-OS SARCULATOR group (HR = 6/04, 95% CI: 2.48–14.73, P < 0.0001) and the AHSCAE group (HR = 2.69, 95% CI: 1/01–7/14, P = 0.0477). The Supplementary Table ST3 shows the corresponding survival table for this sub-analysis.
Gene-expression analysis
The Volcano plot analysis of differential gene expression (DGE) between ARNA × AHSCAE and Other groups identified 1230 differentially expressed genes (Fig. 3a). The full list of genes and their associated annotations are reported in Supplementary Table ST4. GeneSet enrichment analysis on the differentially expressed genes revealed 292 significantly enriched pathways (Supplementary Table ST5 reports the description of the gene set, the adjusted p-value, the enrichment status, and the activation or inhibition status of the other group). Figure 3b shows the list of the most significantly enriched pathways. The analysis of the genes participating in those genesets showed that the ARNA × AHSCAE group of worse prognosis activates the inflammatory response, epithelial-mesenchymal transition, hypoxia, apoptosis inhibition, G2M checkpoint, UV response, E2F targets, and xenobiotic metabolism gene sets. We performed an extensive review of the role of the genes belonging to those pathways whose results are reported in Supplementary Table ST6. For each geneset, we reviewed the description of the participating genes, their role played in a specific tumor type and the reference to the scientific study showing evidence of this role. We identified important genes having an oncogenic role in a wide variety of tumors where the role of the gene seems to be coherent with the one played in the ARNA × AHSCAE group of worse prognosis. A discriminant signature between the ARNA × AHSCAE and other groups, performed via PAMr, a machine learning method, identified a subset of 162 discriminant genes (Fig. 3c). The detailed annotation of those genes (reported in Supplementary Table ST7) highlighted genes belonging to the epithelial–mesenchymal transition, hypoxia, and apoptosis inhibition genesets in the ARNA × AHSCAE group.
Volcano plot of the DGE analysis (a). Significantly enriched genes and pathways in the ARNA × AHSCAE group are in red, whereas those significantly downregulated are in blue. b Summary report of the most significantly enriched gene sets belonging to the HALLMARK of Cancer and the LM22 human immune cells collections; values on the right side of the bars correspond to the proportion of genes activated in the ‘Other’ groups. c Heatmap of the 162 activated and inhibited genes from the PAMr cross-validated discriminant signature of the ARNA × AHSCAE group versus the ‘Others’ groups. All P-values on the plots were adjusted according to the Benjamini–Hochberg procedure. FC fold change.
MRI examples of patients with opposite radiomic-transcriptomics grouping and outcomes are presented in Fig. 4.
a A 67 years old man was diagnosed with a deep-seated, 89 mm long, FNCLCC grade III leiomyosarcoma of the thigh (white arrowheads). The tumor was classified in the AHSCAE group as well as in the ARNA group (i.e., hybrid group associated with worse survivals). The patient showed a lung metastatic relapse 8 months after diagnosis and died of disease 4 months later. b A 51 years old woman was diagnosed with a deep-seated, 115 mm long, FNCLCC grade II (on biopsy) dedifferentiated liposarcoma of the shoulder (black arrowheads). The tumor was classified in the BHSCAE group as well as in the BRNA group (i.e., groups associated with higher survivals). The patient has shown no relapse at the last follow-up, i.e., 5 years after diagnosis, and is still alive. Thus, although being rather similar for human eyes on conventional MRI, combining deep radiomics and transcriptomics data enabled us to distinguish these two patients with opposite outcomes. CAM class activation mapping (Supplementary Method M1) highlighting the important areas for predictions made by the half-supervised autoencoder neural network (HSCAE), T1: T1 weighted imaging, T2: T2 weighted imaging, T1+ contrast-enhanced fat-suppressed T1-weighted imaging.
Discussion
In this study, we devised methods to classify soft tissue sarcomas (STS) imaging, independent of their initial presentation on conventional MRI, employing both handcrafted radiomics (h-RFs) and deep radiomics (d-RFs, using CAE and HSCAE models). Cluster analysis identified classes that were consistently correlated with the clinical and histological SARCULATOR nomogram and prognostic semantic radiological phenotypes while displaying a notable disconnection from transcriptomic clusters19. Our findings suggest that combining pre-treatment radiomics and transcriptomic data through a combined radiomic-transcriptomic signature could enhance the predictive performance of transcriptomics in STS patients.
To our knowledge, no study has concurrently analyzed radiomics and transcriptomic datasets in a large cohort of STS patients. Radiomics data, obtained through either handcrafted or deep radiomics approaches, demonstrated strong associations with semantic radiological features linked to aggressive tumor characteristics, higher grade, and poorer survival outcomes3,4. Moreover, the h-RF grouping was significantly associated with the histological types, which illustrates the potentiality of imaging to pre-type STS from radiological images directly. The d-RF groups additionally correlated with FNCLCC grade and SARCULATOR groups, i.e., with histological and clinical variables strongly associated with metastasis and death risk. Collectively, these associations between radiomics groups, identified without prior assumptions, and clinical, histological, and radiological features having prognostic significance contribute to validating the relevance of radiomics data.
Notably, although the fully unsupervised h-RFs, the CAE clustering, and the HSCAE clustering exhibited c-indices lower than some previously published supervised models, they remained significantly better than random in both univariable and multivariable analyses accounting for cofounding covariables underscoring their intrinsic prognostic value for STS patients7,14,18,20,21,22.
Transcriptomics groups, while exhibiting weaker but significant associations with patient survivals, showed a substantial increase in prognostic performance (c-index = 0.709) when combined with HSCAE groups. The Kaplan–Meier curves for the combined deep-radiomics-transcriptomics variable demonstrated a clear gradient in survival probability from less to more aggressive groups. This emphasizes the complementary nature of radiomics and transcriptomics, suggesting that adding radiomics data to prognostic gene-expression signatures could enhance their predictive value. We believe this complementarity could be explained by the difference and complementarity in scale and nature of radiomics and transcriptomics data. Indeed, radiomics are macroscopic data assessed over a digital image of the entire tumor volume, whereas transcriptomics data are extracted from millimetric tumor samples. Even though our findings do not provide immediate clinical application, we believe they should encourage the investigations of multi-omics prognostic signatures for STS patients through multi-center collaborations. Moreover, our findings also suggest that radiomics-transcriptomics data could enhance pre-existing clinical-histological nomograms such as the SARCULATOR.
Lastly, our attempt to elucidate the gene-expression levels of the ARNA × AHSCAE group (indicating worse outcomes) involved analyzing 1230 differentially expressed genes (including 707 overexpressed genes) and 262 pathways in 110 patients. These findings implicated the tumor micro-environment and tumorigenesis through the dysregulation of adaptive immune reactions, cell growth, inflammation, cell differentiation, apoptosis, and hypoxia16,22. Interestingly, although the transcriptomics groups were significantly associated with the histological types, the ARNA × AHSCAE group remains associated with significantly lower MFS in the most frequent histotype, i.e., UPS. It must be emphasized that gene-expression signatures dedicated to STS exist, notably the complexity index in sarcoma (CINSARC) signature16,23, which would enhance the prognostic performances of the SARCULATOR nomogram24. The aim of this study was not to challenge CINSARC (established on 310 samples versus 110 in our study). However, we found 17 common underlying pathways between CINSARC and the pathways identified in our transcriptomics analysis (Supplementary Table ST8), as well as strong associations (P = 0.0069, Chi-Square test) between CINSARC and the transcriptomics groups in a subset of 54/110 (49.1%) patients with available data (unshown data). This highlights the consistency of molecular features involved in the aggressiveness of STS.
Despite these valuable insights, our study has limitations, including its retrospective nature, heterogeneous imaging protocols necessitating MRI processing, and the availability of transcriptomic data for only half of the study population. Additionally, deep radiomics analyses were restricted to a central slice of CE-T1-WI due to current limitations in freely available deep learning algorithms for multiple co-registered 3D volumes. Lastly, although large compared to prior radiomics studies in STS, the size of the study population was too small to enable the development of a reliable and more powerful supervised deep-learning model to predict patient MFS. For instance, the DeepSurv model required datasets with more than 1500 observations for its training25. Similarly, even though the MSEs of the reconstructions by the CAE and HSCAE were high in the Training and Testing cohorts, the small size of the Testing cohort (n = 25 patients) suggests the need to validate the approach in larger independent cohorts. Addressing these limitations could further enhance the performance of deep-learning models.
In conclusion, our study provides a comprehensive analysis of MRI radiomics profiles for STS, revealing relationships with patient outcomes, semantic radiological features, and the SARCULATOR nomogram. Moreover, it highlights the synergistic potential of radiomics and transcriptomic data to refine the prognostication of STS patients, opening avenues for the development of a radiomic-transcriptomic prognostic signature for sarcoma patients.
Methods
Study design
This single-center study was approved by the institutional review board of Bergonié (Sarcoma Reference Center of Nouvelle-Aquitaine, Bordeaux, France). Written informed consent was waived by its retrospective nature. The study was achieved in agreement with good clinical practice and applicable laws. All research procedures and protocols adhered to the principles set forth in the Declaration of Helsinki.
Patients were identified by the prospectively held surgical database from our sarcoma reference center. We included all consecutive patients between May 2008 and May 2020 as they presented with a newly diagnosed locally advanced STS, with histopathological confirmation according to a senior pathologist with expertise in STS from our institution (F.L.L., J.M.C., and R.P.), with available pre-treatment MRI including a gadolinium-chelates injection, and treated in a curative intent including surgery.
We excluded patients with atypical lipomatous tumors, metastases at initial staging (i.e., on chest CT scan), and patients whose pre-treatment MRI did not include at least one T1-weighted imaging (WI), one T2-WI, and one fat-suppressed (FS) CE T1-WI (CE-T1-WI).
Figure 1 shows the flow chart. Figure 5 shows the overall study workflow.
CE contrast enhanced, CAE convolutional autoencoder neural network, d-RF deep radiomics features, h-RF handcraft radiomics features, HSCAE half-supervised convolutional autoencoder neural network, MFS metastatic relapse-free survival, rCV repeated cross-validation, RNA ribonucleic acid, WI weighted imaging.
Histologic and clinical data collection
Data collection from medical records comprised the patient’s age at diagnosis, sex, World Health Organization Performance status (WHO-PS), the tumor depth, location, longest diameter (LD), pre-treatment histological type, and FNCLCC grade (either on biopsy or on entire specimen)26, and the initial therapeutic management, namely radiotherapy or anthracycline-based chemotherapy (categorized as none, neoadjuvant or adjuvant) always combined with curative surgery, and the surgical margins (categorized as R0 versus R1–R2).
Moreover, the 10-year predicted probability of OS (Pr-OS) according to the SARCULATOR nomogram for extremity and trunk wall STS was calculated for each patient using the free application19. Patients were then divided into three categories of 10-year Pr-OS: low (≤51%), intermediate (>51% and ≤66%), and high (>66%), as previously described in prior studies from the same authors27, 28. The corresponding new categorical variable was named SARCULATOR.
Follow-ups consisted of clinical examinations and chest radiographs every 3 months for 2 years, then every 6 months for 5 years, and then annually, with complementary local MRI and chest CT-scan in case of doubtful findings. The main outcome was MFS. MFS, local relapse-free survival (LFS), and overall survival (OS) corresponded to the time (in months) elapsed from surgery to metastatic relapse, local relapse, and death related to disease (or last follow-up), respectively. Patients without events during the study period were censored. All relapses were histologically proven.
MRI acquisition
The pre-treatment MRI examinations were acquired on different 1.5-Tesla MR-systems with adjustments of the coils, field-of-views, and matrices depending on tumor size, location, and depth. The protocols systematically included at least one T1-WI prior to contrast injection, one T2-WI, and one FS CE-T1-WI. Various methods were accepted for FS, i.e., fat saturation, Dixon, subtraction, fluid-sensitive, and short tau inversion-recovery sequences29. The ranges of repetition time and echo time were 500–700 and 10–15 ms for T1-WI and 2400–4500 and 70–130 ms for T2-WI, respectively. The ranges of in-plane resolution and thickness were 0.75 × 0.75–1.4 × 1.4 mm2 and 1–7 mm, respectively.
Conventional radiological analysis
The conventional (or semantic) radiological analysis was performed in consensus by two senior radiologists with expertise in STS (A.C. and M.K.) on a picture archiving and communication system (Enterprise Imaging, Agfa Healthcare, Mortsel, Belgium). They reproduced the same analysis as in the prognostic study by Crombé et al.4 and reported if at least 2 out of the 3 following semantic radiological features were present: (i) heterogeneous signal intensity (SI) on T2-WI ≥ 50% (i.e., when ≥50% of the tumor volume showed areas with high, intermediate and low SI), (ii) presence of an area compatible with necrosis (i.e., defined as high fluid-like SI on T2-WI without enhancement at T1-WI after gadolinium chelate injection), and (iii) presence of peritumoral enhancement (i.e., defined as contrast enhancement beyond the apparent tumor borders without mass effects), as the resulting semantic radiophenotype (named ‘high risk’) was a significant predictor of lower MFS and OS in this study. Conversely, if one or less of these three radiological features was seen, the tumor was categorized as ‘low risk’.
Radiomics groups
Handcrafted radiomics pipeline
Since the sequences were obtained on different MR-systems and given the lack of standardized units for conventional MRI, a 3-steps post-processing pipeline was achieved on the T1-WI, T2-WI, and CE-T1-WI sequences in order to homogenize the imaging dataset using the ITK library (https://github.com/InsightSoftwareConsortium/ITK). First, a bilinear interpolation was applied to resample the voxel size to a common resolution of 1 × 1 × 4 mm3. Second, non-uniform SIs due to magnetic field heterogeneity were corrected using N4-ITK bias field correction30. Third, SIs were homogenized with simple ITK histogram-matching and ranged between −10,000 and +1000031. Afterward, the MRIs were uploaded to the LIFEx freeware (version 4.70, Saclay, France), compliant with the guidelines from the International Biomarker Standardization Initiative32,33. One senior radiologist manually segmented the entire tumor volumes of interest (VOIs), slice-by-slice, on the CE-T1-WI and then propagated this VOI on the T1-WI and T2-WI with adjustments of the segmentation boundaries if needed. Afterward, the SIs were discretized into 256 gray-levels and 59 texture h-RFs were extracted per sequence, namely 13 first-order texture h-RFs, 46 second-order texture h-RFs (21 from the gray-level co-occurrence matrix [GLCM] using 1, 2 and 4 pixel distance to neighbors; 11 from the gray-level run length matrix [GLRLM], 3 from the neighborhood gray-level difference matrix [NGLDM], and 11 from the gray-level zone length matrix [GLZLM]) in addition to 4 shape features from CE-T1-WI. The precise definitions and formulas for all RFs can be found at: https://www.lifexsoft.org/index.php/resources/19-texture/radiomic-features.
The same procedure (from segmentation to RF extraction) was reproduced on 30 randomly selected patients in order to estimate the reproducibility of RFs according to intra-class correlation coefficient (ICC). We only selected the RFs with ICC ≥ 0.90 for the remaining analyses, namely 46 texture h-RFs from T1-WI, 52 texture h-RFs from T2-WI, 52 texture h-RFs from CE-T1-WI and 2 shape h-RFs (i.e., n = 152 h-RFs, listed in Supplementary Table T9). The consensus clustering algorithm was applied to the robust h-RFs from the entire cohort. After center-scaling those h-RFs, each clustering was resampled 10,000 times by leave-one-out of 40% of the samples, based on hierarchical clustering using the Pearson distance and the average link34.
Deep radiomics pipeline
For each patient, the mask resulting from the tumor segmentation was propagated on the initial fat-suppressed CE-T1-WI to remove the background surrounding the tumor with manual adjustments made if necessary. Subsequently, the slice at the tumor’s midpoint was selected for further analysis. Two types of neural networks were built: convolutional auto-encoder neural network (CAE) and half-supervised convolutional auto-encoder neural network (HSCAE), which included a prognostic loss function. Details on data preprocessing, augmentation, model architecture, loss function, optimization, performance evaluation, and d-RF extraction for each network are provided in Supplementary Method M135,36,37,38. Overall, the output of a convolutional autoencoder neural network is an image that is the best ‘replicate’ of the input image. To do so, the network articulates in two sequential sub-networks. The first is a ‘decoder network’, similar to a classical convolutional neural network used for supervised classification of images, where the input image is progressively reduced to a vector of ‘latent features’. The second part is an ‘encoder network’ which takes the ‘latent features’ and progressively reconstructs the input image by deconvolution. The fidelity of the reconstruction was optimized using the mean square error (MSE) between the input and output data. We trained the CAE and HSCAE models using a leave-10%-out cross-validation technique on a randomly selected subset of 200 patients (Training cohort), repeating the process 100 times. The resulting vector of ‘latent features’ (i.e., 1024 d-RFs per image) was subjected to unsupervised hierarchical clustering (with the Pearson distance and average link, after center-scaling the d-RFs) to assign each patient to a cluster (CAE and HSCAE grouping). Additional methodological details notably about artefactual morphological augmentation techniques, are given in Supplementary Method M1. Regarding the 25 remaining patients from the independent test set of 25 samples (Testing cohort), we verified the quality of the reconstructions provided by CAE and HSCAE according to MSE. Afterward, their CAE and HSCAE clusters were secondarily identified using the centroid method, which consists of calculating the Pearson distance between each new observation and the centroid of each cluster and to attribute the cluster with the shortest distance.
Gene expression analysis
We included in the analysis all patients with frozen material or paraffin-embedded tissue of good quality before treatment and contemporary to the baseline MRI used for radiomics. After whole-RNA sequencing, the produced RNA sequences were quality-controlled and aligned to the transcriptome. Gene Expression was then estimated by the counts of high-quality sequences aligned per Gene. ComBat harmonization method was applied to correct batch effect39. Finally, gene expression counts were normalized using the Voom method (Supplementary Method M2)40. Unsupervised transcriptomics grouping was similarly built on RNA-sequencing data with hierarchical consensus clustering.
Statistical analyses
Statistical analyses were also performed with R (v4.1.0). All tests were two-tailed. A P-value < 0.05 was deemed significant.
Understanding the ‘omics’ groups
Associations between the h-RFs group, d-RFs CAE and HSCAE groups, transcriptomics groups, histologic types, histologic grade, SARCULATOR groups, and semantic radiophenotypes (i.e., categorical variables) were tested with Chi-square tests. Associations with the SARCULATOR Pr-OS and tumor size were investigated using unpaired t-test of the Mann–Whitney test depending on the Shapiro–Wilk normality test.
Prognostic value of the ‘omics’ group alone and in combinations
The Kaplan–Meier curves for MFS, depending on the radiomics and transcriptomics groups, were drawn, and the differences in survivals were tested with the log-rank test. Univariable and multivariable Cox regressions were performed to estimate the hazard ratio (HR) with a 95% confidence interval (CI) of each group. Multivariable models comprised the following covariables involved in the SARCULATOR nomograms and accounting for patient management: age (continuous), histologic type (according to the SARCULATOR categorization, with myxoid/round cells liposarcoma as reference), histologic grade (I and II [reference] versus III), tumor size (continuous), chemotherapy (no [reference] versus yes) radiotherapy (no versus yes [reference]), and surgical margins (R0 [reference] versus R1 and R2). To identify the most relevant deep-RF grouping among the three we developed, a stepwise backward Cox regression (minimizing the Akaike information criterion [AIC]) including all d-RF clusters and the covariables was built. Lastly, to evaluate the complementarity of radiomics groups and transcriptomics groups to predict patient outcome, the Harrell concordance indices (c-indices) of the best d-RF grouping alone, the h-RF grouping alone, the transcriptomics grouping alone, and their combination (with and without interaction term) were evaluated in 5-fold cross-validation and compared using a bootstrapped distribution of their difference over 1000 random replicates of the study population. The same approach was applied to evaluate the potential synergy between the SARCULATOR, Transcriptomics, and d-RF groups. Patients with any missing value among the input variables of the multivariable Cox regressions were removed from the analyses.
Gene-expression profiling of the prognostic radiomics groups
Differential gene expression (DGE) and geneset enrichment analyses were performed between the radiomics-transcriptomics grouping with the highest association with patient MFS. Exploratory DGE was performed by t-test calculation per gene. To discriminate significant up/down-regulated genes, the fold change was set to 2. The P-value cut-off was adjusted with the Benjamini–Hochberg procedure. We then assessed geneset enrichment in biological pathways based on Broad Institute’s Molecular Signature database and the CIBERSORT LM22 immuno-genesets (Supplementary Method M3)41,42. Lastly, the prediction analysis for microarrays (PAM) method was applied to provide class prediction from gene expression profiling based on an enhancement of the simple nearest prototype (centroid) classifier (Supplementary Method M3)43,44,45.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Data availability
The raw and processed data generated in this study have been deposited in NCBI’s Gene Expression Omnibus (GEO) and are accessible through GEO Series accession number GSE262937 for RNA-seq experiment. The differential gene expression analyses and pathways analyses are available in the supplementary materials. The radiomics datasets and raw MRIs used and/or analyzed during the current study are available from the corresponding author on reasonable request. Any additional results can be obtained from the corresponding author.
Code availability
The underlying code for this study is not publicly available but may be made available to qualified researchers on reasonable request from the corresponding author. The R studio and Conda environments and the packages with their versions used for the analyses are detailed in Supplementary Table ST10.
References
Fletcher, C. D. M. et al. WHO Classification of Soft Tissue and Bone Tumours. 5th edn (International Agency for Research on Cancer, IARC Press, Lyon, France, 2020). https://doi.org/10.1038/s41698-024-00616-8
Gronchi, A. et al. Soft tissue and visceral sarcomas: ESMO-EURACAN-GENTURIS Clinical Practice Guidelines for diagnosis, treatment and follow-up☆. Ann. Oncol. S0923-S7534, 02184-0 (2021).
Zhao, F. et al. Can MR imaging be used to predict tumor grade in soft-tissue sarcoma? Radiology 272, 192–201 (2014).
Crombé, A. et al. Soft-tissue sarcomas: assessment of MRI features correlating with histologic grade and patient outcome. Radiology 291, 710–721 (2019).
lambin, P. et al. Radiomics: the bridge between medical imaging and personalized medicine. Nat. Rev. Clin. Oncol. 14, 749–762 (2017).
Martin-Gonzalez, P. et al. Integrative radiogenomics for virtual biopsy and treatment monitoring in ovarian cancer. Insights Imaging 11, 94 (2020).
Peeken, J. C. et al. Tumor grading of soft tissue sarcomas using MRI-based radiomics. EBioMedicine 48, 332–340 (2019).
Yan, R. et al. Magnetic resonance imaging-based radiomics nomogram for prediction of the histopathological grade of soft tissue sarcomas: a Two-Center Study. J. Magn. Reson. Imaging 53, 1683–1696 (2021).
Crombé, A. et al. T2-based MRI delta-radiomics improve response prediction in soft-tissue sarcomas treated by neoadjuvant chemotherapy. J. Magn. Reson. Imaging 50, 497–510 (2019).
Crombé, A. et al. High-grade soft-tissue sarcomas: can optimizing dynamic contrast-enhanced MRI postprocessing improve prognostic radiomics models? J. Magn. Reson. Imaging 52, 282–297 (2020).
Peeken, J. C. et al. MRI-based delta-radiomics predicts pathologic complete response in high-grade soft-tissue sarcoma patients treated with neoadjuvant therapy. Radiother. Oncol. J. Eur. Soc. Ther. Radiol. Oncol. 164, 73–82 (2021).
Crombé, A. et al. Can radiomics improve the prediction of metastatic relapse of myxoid/round cell liposarcomas? Eur. Radiol. 30, 2413–2424 (2020).
Yang, Y., Ma, X., Wang, Y. & Ding, X. Prognosis prediction of extremity and trunk wall soft-tissue sarcomas treated with surgical resection with radiomic analysis based on random survival forest. Update. Surg 74, 355–365 (2022).
Peeken, J. C. et al. Prognostic assessment in high-grade soft-tissue sarcoma patients: a comparison of semantic image analysis and radiomics. Cancers 13, 1929 (2021).
Crombé, A. et al. Distinct patterns of the natural evolution of soft tissue sarcomas on pre-treatment MRIs captured with delta-radiomics correlate with gene expression profiles. Eur. Radiol. 33, 1205–1218 (2023).
Merry, E., Thway, K., Jones, R. L. & Huang, P. H. Predictive and prognostic transcriptomic biomarkers in soft tissue sarcomas. NPJ Precis. Oncol. 5, 17 (2021).
Frated, G. et al. Prediction of lipomatous soft tissue malignancy on MRI: comparison between machine learning applied to radiomics and deep learning. Eur. Radiol. Exp. 6, 41 (2022).
Yang, Y., Zhou, Y., Zhou, C., Zhang, X. & Ma, X. MRI-based computer-aided diagnostic model to predict tumor grading and clinical outcomes in patients with soft tissue sarcoma. J. Magn. Reson. Imaging 56, 1733–1745 (2022).
Callegaro, D. et al. Development and external validation of two nomograms to predict overall survival and occurrence of distant metastases in adults after surgical resection of localised soft-tissue sarcomas of the extremities: a retrospective analysis. Lancet Oncol 17, 671–680 (2016).
Crombé, A. et al. Radiomics and artificial intelligence for soft-tissue sarcomas: current status and perspectives. Diagn. Interv. Imaging 104, 567–583 (2023).
Hu, Y. et al. A contrast-enhanced MRI-based nomogram to identify lung metastasis in soft-tissue sarcoma: a multi-centre study. Med. Phys. 50, 2961–2970 (2022).
Aggerholm-Pedersen, N. et al. A prognostic profile of hypoxia-induced genes for localised high-grade soft tissue sarcoma. Br. J. Cancer 115, 1096–1104 (2016).
Chibon, F. et al. Validated prediction of clinical outcome in sarcomas and multiple types of cancer on the basis of a gene expression signature related to genome complexity. Nat. Med. 16, 781–787 (2010).
Crombé, A. et al. Gene expression profiling improves prognostication by nomogram in patients with soft-tissue sarcomas. Cancer Commun. (Lond.) 42, 563–566 (2022).
Katzman, J. L. et al. DeepSurv: personalized treatment recommender system using a Cox proportional hazards deep neural network. BMC Med. Res. Methodol. 18, 24 (2018).
Trojani, M. et al. Soft-tissue sarcomas of adults; study of pathological prognostic variables and definition of a histopathological grading system. Int. J. Cancer 33, 37–42 (1984).
Callegaro, D. et al. A soft tissue sarcoma nomograms and their incorporation into practice. Cancer 123, 2802–2820 (2017).
Pasquali, S. et al. High-risk soft tissue sarcomas treated with perioperative chemotherapy: improving prognostic classification in a randomised clinical trial. Eur. J. Cancer Oxf. Engl 93, 28–36 (2018).
Delfaut, E. M. et al. Fat suppression in MR imaging: techniques and pitfalls. RadioGraphics 19, 373–382 (1999).
Tustison, N. J. et al. N4ITK: improved N3 bias correction. IEEE Trans. Med. Imaging 29, 1310–1320 (2010).
Nyúl, L. G. & Udupa, J. K. On standardizing the MR image intensity scale. Magn. Reason. Med. 42, 1072–1081 (1999).
Nioche, C. et al. LIFEx: a freeware for radiomic feature calculation in multimodality imaging to accelerate advances in the characterization of tumor heterogeneity. Cancer Res. 78, 4786–4789 (2018).
Zwanenburg, A. et al. The image biomarker standardization initiative: standardized quantitative radiomics for high-throughput image-based phenotyping. Radiology 295, 328–338 (2020).
Wilkerson, M. D. & Hayes, D. N. ConsensusClusterPlus: a class discovery tool with confidence assessments and item tracking. Bioinforma. Oxf. Engl. 26, 1572–1573 (2010).
Ioffe, S. & Szegedy, C. Batch normalization: accelerating deep network training by reducing internal covariate shift. https://doi.org/10.48550/arXiv.1502.03167 (2015).
Glorot, X. & Bengio, Y. Understanding the difficulty of training deep feedforward neural networks. J. Mach. Learn. Res. 9, 249–256 (2010).
Kingma, D. P. & Ba, J. Adam: a method for stochastic optimization. https://doi.org/10.48550/arXiv.1412.6980 (2017).
Selvaraju, R. R. et al. Grad-CAM: visual explanations from deep networks via gradient-based localization. Int. J. Comput. Vis. 128, 336–359 (2020).
Johnson, W. E., Li, C. & Rabinovic, A. Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostat. Oxf. Engl. 8, 118–127 (2007).
Law, C. et al. voom: precision weights unlock linear model analysis tools for RNA-seq read counts. Genome. Biol. 15, R29 (2014).
Subramanian, A. et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl Acad. Sci. USA 102, 15545–15550 (2005).
Newman, A. M. et al. Robust enumeration of cell subsets from tissue expression profiles. Nat. Methods 12, 453–457 (2015).
Tibshirani, R., Hastie, T., Narasimhan, B. & Chu, G. Diagnosis of multiple cancer types by shrunken centroids of gene expression. Proc. Natl Acad. Sci. USA 99, 6567–6572 (2002).
Ritchie, M. E. et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 43, e47 (2015).
Atanas, K. et al. The ConsensusPathDB interaction database: 2013 update. Nucleic Acids Res. 41, d793–d800 (2013).
Acknowledgements
This work received support from the French government, managed by the National Research Agency, under the France 2030 program with the reference ANR 21 RHUS 0010.
Author information
Authors and Affiliations
Contributions
A.C.: Conceptualization, methodology, software, validation, formal analysis, resources, data curation, writing—original draft, writing—review and editing, visualization. C.L.: Conceptualization, methodology, software, validation, formal analysis, writing—original draft, writing—review and editing, visualization. F.B.: Methodology, software, validation, formal analysis, writing—original draft, writing—review and editing. M.K.: Validation, formal analysis, resources, data curation, writing—review and editing. M.S.C.: Validation, formal analysis, data curation, writing—review and editing. M.T.: Validation, formal analysis, data curation, writing—review and editing. V.C.: Validation, formal analysis, resources, data curation, writing—review and editing. A.M.: Validation, formal analysis, data curation, writing—review and editing. J.M.C.: Validation, formal analysis, resources, data curation, writing—review and editing. R.P.: Validation, formal analysis, resources, data curation, writing—review and editing. F.L.L.: Validation, formal analysis, resources, data curation, writing—review and editing. A.B.: Methodology, software, validation, formal analysis, data curation, writing—review and editing. A.I.: Conceptualization, methodology, validation, formal analysis, resources, writing—review and editing, supervision, project administration.
Corresponding author
Ethics declarations
Competing interests
A.I.: Research grant: AMGEN, AstraZeneca, Bayer, BMS, Merck, MSD, Novartis, CHUGAI, Parthenon, Roche; Advisory board: Bayer, BMS, Merck, MSD, Novartis, CHUGAI, Parthenon, Roche. The other authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Crombé, A., Lucchesi, C., Bertolo, F. et al. Integration of pre-treatment computational radiomics, deep radiomics, and transcriptomics enhances soft-tissue sarcoma patient prognosis. npj Precis. Onc. 8, 129 (2024). https://doi.org/10.1038/s41698-024-00616-8
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41698-024-00616-8
This article is cited by
-
A prediction method for radiation proctitis based on SAM-Med2D model
Scientific Reports (2025)
-
A CT-based radiomics model for predicting lymph node metastasis in hepatic alveolar echinococcosis patients to support lymph node dissection
European Journal of Medical Research (2024)
-
Integration of pre-treatment computational radiomics, deep radiomics, and transcriptomics enhances soft-tissue sarcoma patient prognosis
npj Precision Oncology (2024)