Integration of pre-treatment computational radiomics, deep radiomics, and transcriptomics enhances soft-tissue sarcoma patient prognosis

Crombé, Amandine; Lucchesi, Carlo; Bertolo, Frédéric; Kind, Michèle; Spalato-Ceruso, Mariella; Toulmonde, Maud; Chaire, Vanessa; Michot, Audrey; Coindre, Jean-Michel; Perret, Raul; Le Loarer, François; Bourdon, Aurélien; Italiano, Antoine

doi:10.1038/s41698-024-00616-8

Article
Open access
Published: 07 June 2024

Amandine Crombé ORCID: orcid.org/0000-0003-0098-6482^1,2,3^na1,
Carlo Lucchesi ORCID: orcid.org/0000-0001-6657-2341⁴^na1,
Frédéric Bertolo⁴,
Michèle Kind¹,
Mariella Spalato-Ceruso ORCID: orcid.org/0000-0002-7582-3365^3,5,
Maud Toulmonde^3,5,
Vanessa Chaire^3,6,
Audrey Michot^3,7,
Jean-Michel Coindre^3,6,
Raul Perret ORCID: orcid.org/0000-0003-2698-0249⁶,
François Le Loarer^3,6,
Aurélien Bourdon⁴ &
Antoine Italiano^3,5

npj Precision Oncology volume 8, Article number: 129 (2024)

Abstract

Our objective was to capture subgroups of soft-tissue sarcoma (STS) using handcrafted and deep radiomics approaches to understand their relationship with histopathology, gene-expression profiles, and metastatic relapse-free survival (MFS). We included all consecutive adults with newly diagnosed locally advanced STS (N = 225, 120 men, median age: 62 years) managed at our sarcoma reference center between 2008 and 2020, with contrast-enhanced baseline MRI. After MRI postprocessing, segmentation, and reproducibility assessment, 175 handcrafted radiomics features (h-RFs) were calculated. Convolutional autoencoder neural network (CAE) and half-supervised CAE (HSCAE) were trained in repeated cross-validation on representative contrast-enhanced slices to extract 1024 deep radiomics features (d-RFs). Gene-expression levels were calculated following RNA sequencing (RNAseq) of 110 untreated samples from the same cohort. Unsupervised classifications based on h-RFs, CAE, HSCAE, and RNAseq were built. The h-RF, CAE, and HSCAE groupings were not associated with the transcriptomics groups but were associated with prognostic radiological features known to correlate with lower survival, and with higher grade and SARCULATOR groups (a validated prognostic clinical-histological nomogram). HSCAE and h-RF groups were also associated with MFS in multivariable Cox regressions. Combining HSCAE and transcriptomics groups significantly improved the prognostic performance compared to each group alone, according to the concordance index. The combined radiomic-transcriptomic group with worse MFS was characterized by the up-regulation of 707 genes and 292 genesets related to inflammation, hypoxia, apoptosis, and cell differentiation. Overall, subgroups of STS identified on pre-treatment MRI using handcrafted and deep radiomics were associated with meaningful clinical, histological, and radiological characteristics, and could strengthen the prognostic value of transcriptomics signatures.

Introduction

Soft-tissue sarcomas (STS) represent prevalent malignant mesenchymal tumors characterized by diverse clinical and radiological presentations, along with distinctive histologic and molecular features, influencing their prognosis¹. Contrast-enhanced (CE) MRI stands out as the optimal imaging modality for local staging of locally advanced STS, revealing various radiological phenotypes (radiophenotypes) and substantial intra- and inter-tumoral heterogeneity². A qualitative assessment of these radiophenotypes using conventional MRI sequences has been correlated with the French Federation of Cancer Centers (FNCLCC) histologic grading, metastasis-free survival (MFS) and overall survival (OS)^3,4, notably peritumoral enhancement after gadolinium chelate injection, necrotic signal and marked heterogeneous signal intensity on T2-weighted imaging. However, this assessment remains subjective, inadequately reproducible, and incapable of capturing the intricate intra-tumoral patterns present in STS.

Radiomics entails the extensive quantification of the radiophenotype in medical imaging from any modality. It employs mathematical operators to derive numeric data, referred to as radiomics features (RFs), capturing aspects of tumor shape and texture, including three-dimensional rearrangements of gray levels within the tumor⁵. These RFs are subsequently explored in supervised machine-learning algorithms for predicting various oncologic outcomes or in unsupervised clustering algorithms to unveil hidden patterns within the data. The underlying hypothesis is that radiomics, on a macroscopic scale, reflects the molecular features of cancers, potentially serving as a non-invasive virtual biopsy⁶. Previous studies have successfully trained radiomics models to predict FNCLCC grade^7,8, response to neoadjuvant chemotherapy or radiotherapy^9,10,11, and patient survival^12,13,14. However, only one exploratory study has directly correlated gene-expression profiles of STS with radiomics on a subset of 21 patients, suggesting links between radiomics clusters and pathways involved in apoptosis, immune infiltrates, and cell proliferation¹⁵. Conversely, predictive gene-expression signatures dedicated to STS have never been put in perspective with conventional radiological features or radiomics data¹⁶.

Recently, handcrafted RFs (h-RFs) have been complemented with deep RFs (d-RFs) obtained from the last fully connected layer of pre-trained convolutional neural networks. They aim to refine and personalize radiophenotypic quantification, though findings remain preliminary and concerns exist about their lack of explainability^17,18. In particular, convolutional auto-encoder neural networks (CAE) are powerful deep learning techniques that extract the most meaningful features from medical images through an encoder–decoder architecture, which compresses medical images into a low-dimensional space and then reconstructs them as accurately as possible after retaining the most relevant characteristics (i.e., the d-RFs). Although these networks can remain completely unsupervised, it is also possible to provide output labels for each image to guide the learning process towards specific objectives, such as prognostication in half-supervised CAE (HSCAE). Comparing CAE and HSCAE can help in understanding the benefits of including labeled data (in terms of better image reconstruction, feature extraction, or better classification) and be valuable to estimate the intrinsic information contained in medical images. An objective and larger assessment of the added value and potentiation of h-RF, d-RF (either CAE or HSCAE), and transcriptomics data in STS patients is currently lacking. Additionally, understanding their relationships with semantic radiological features^3,4 (i.e., explainable with medical language by radiologists) and the clinical–histological SARCULATOR nomogram of reference would facilitate the acceptance of these complex ‘-omics’ data¹⁹.

Therefore, the overarching objective of this study was to conduct an exploratory and comprehensive assessment of the inter-relations and potentialities of multi-omics data in newly diagnosed, locally advanced STS without preconceived notions. Our specific aims were (i) to identify patterns of STS using h-RFs, d-RFs, and transcriptomics data, (ii) to investigate the associations of the corresponding h-RF, d-RF, and transcriptomics groups among themselves, with the SARCULATOR predictions, and with semantic radiological features known to correlate with grade and metastatic relapse-free survival (MFS), and (iii) to explore their prognostic value alone and in combination.

Results

Patient characteristics

The study flowchart is shown in Fig. 1. Briefly, out of 829 patients initially identified, 225 were included (median age: 61.2 years, range: 18–95, 120/225 [53.3%] men). Characteristics are presented in Table 1, with the most prevalent histotype being undifferentiated pleomorphic sarcoma (UPS, 71/225 [31.6%]). The majority of STS were strictly deep-seated (136/225 [60.4%]), located in the lower limbs (126/225 [56%]), and had a high (III) histologic grade (116/222 [52.3%], with 3 patients having non-available grade). One hundred and ten patients had sufficient material for the transcriptomics analysis; their characteristics are also shown in Table 1.

Table 1 Patients’ characteristics

Comprehensive patient clustering

Unsupervised consensus clustering on h-RFs identified three groups: A_h-RF (71/225, 31.6%), B_h-RF (87/225, 38.7%), and C_h-RF (67/225, 29.8%).

Regarding the development of the CAE and HSCAE neural networks for the extraction of d-RFs, the Training and Testing cohorts only differed regarding tumor size (P = 0.0350) and the number of patients who underwent chemotherapy in addition to surgery (P < 0.0001) (Table 1). After training the two convolutional neural network models, we verified that the image reconstruction error according to mean square error (MSE) remained below 1% in both Training and Testing cohorts (Supplementary Table ST1). The unsupervised clustering performed on the HSCAE and CAE d-RFs from the Training cohort systematically provided two groups, named A and B, with cluster A (i.e., A_CAE and A_HSCAE) referring to the larger group in each classification. We then applied the centroid assignment technique to label the observations from the Testing cohort.
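The centroid assignment step can be illustrated with a minimal sketch. It assumes a Euclidean distance between feature vectors and uses toy two-dimensional data; the distance actually used by the authors is not stated here, and the function names are hypothetical.

```python
from math import sqrt

def centroid(rows):
    """Component-wise mean of a list of equal-length feature vectors."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]

def euclidean(u, v):
    """Euclidean distance (an assumption; the source does not name the metric)."""
    return sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))

def assign_to_nearest_centroid(train_features, train_labels, test_features):
    """Label each test vector with the cluster whose training centroid is closest."""
    groups = {}
    for vec, lab in zip(train_features, train_labels):
        groups.setdefault(lab, []).append(vec)
    centroids = {lab: centroid(vecs) for lab, vecs in groups.items()}
    return [min(centroids, key=lambda lab: euclidean(vec, centroids[lab]))
            for vec in test_features]

# Toy data standing in for d-RFs of Training patients and their cluster labels.
train = [[0.0, 0.1], [0.2, 0.0], [1.0, 0.9], [0.8, 1.1]]
labels = ["A", "A", "B", "B"]
print(assign_to_nearest_centroid(train, labels, [[0.1, 0.2], [0.9, 0.8]]))  # ['A', 'B']
```

With this approach the Testing observations never influence the cluster definitions, which keeps the Training and Testing cohorts separate.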

Unsupervised consensus clustering on the transcriptomics data identified two groups: A_RNA (69/110, 62.7%) and B_RNA (41/110, 37.3%), which were significantly associated with the histological type (P < 0.0001).

Understanding patient clustering

Significant associations between radiomics-based clustering and clinical, histological, radiological assessment, SARCULATOR groups, and transcriptomics clustering were observed (Table 2). Radiomics clusters were systematically associated with size, SARCULATOR groups, and semantic radiophenotypes (all P-values < 0.001). Moreover, the d-RF clusters (using both the CAE and HSCAE approaches) were consistently associated with FNCLCC grade (P-value range: 0.0168–0.0274), while h-RF groups were not (P = 0.1559). Thus, the B_h-RF group corresponded to STS with the largest size (131 ± 52 mm versus 96 ± 39 mm for A_h-RF and 59 ± 33 mm for C_h-RF, P < 0.0001), the highest rate of low Pr-OS STS (42.5% versus 22.5% for A_h-RF and 12.5% for C_h-RF, P = 0.0001), and the highest rate of high-risk semantic radiophenotype (71.3% versus 53.5% for A_h-RF and 38.8% for C_h-RF, P = 0.0005). Regarding the deep-radiomics clusters, the same trend towards more aggressive presentations was observed for the STS in the A_CAE and A_HSCAE groups compared to B_CAE and B_HSCAE groups, respectively (i.e., larger size, lower Pr-OS, and higher rates of high-risk semantic radiophenotype). Moreover, there were significantly more FNCLCC grade III STS in A_CAE and A_HSCAE groups (58.7% and 58.8%, respectively) compared to the B_CAE and B_HSCAE groups (41.7% [P = 0.0274] and 41.9% [P = 0.0168], respectively).

Table 2 Associations with the radiomics-based clustering

The h-RF clustering was also significantly associated with the histological type (P = 0.0101), with a higher proportion of UPS in the A_h-RF group (28/71, 39.4%) compared to the C_h-RF group (16/67, 23.9%). This weak but significant association between imaging patterns and histological types was also observed with the semantic radiophenotypes (P = 0.0005—Chi-square test), again, with higher rates of UPS in the high-risk group (53/126, 42.1%) compared to the low-risk group (18/99, 18.2%).

However, none of the radiomics groups showed an association with the transcriptomics groups (P-value range: 0.1867–0.9942), but there were strong associations within themselves in pairwise comparisons (all P-values < 0.0001). In particular, the CAE and HSCAE grouping were strongly concordant, except for 18/225 (8%) patients, including 10 patients in the A_CAE but B_HSCAE groups and 8 patients in the B_CAE but A_HSCAE groups.

Lastly, the transcriptomics groups were significantly associated with the histological types (P < 0.0001) and the FNCLCC grade, with higher rates of grade III STS in the A_RNA group compared to the B_RNA group (44/67 [65.7%] versus 11/41 [26.8%]—P < 0.0001, 2 out of 110 patients without available grade).

Prognostic value of radiomics groups

Metastatic relapses occurred in 70/225 (31.1%) patients. MFS probability at 2 and 5 years was 78.9% (95%CI: 73.7–84.5) and 66.4% (95%CI: 59.8–73.7), respectively.
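As a reminder of how survival probabilities such as the 2- and 5-year MFS are typically obtained, here is a minimal Kaplan–Meier sketch in plain Python. The data are invented for illustration and the function name is hypothetical; it is not the authors' code.

```python
def kaplan_meier(times, events):
    """Return [(time, survival_probability)] at each distinct event time.

    times: follow-up in months; events: 1 if metastatic relapse, 0 if censored.
    """
    survival = 1.0
    curve = []
    event_times = sorted({t for t, e in zip(times, events) if e == 1})
    for t in event_times:
        at_risk = sum(1 for u in times if u >= t)  # still followed just before t
        relapses = sum(1 for u, e in zip(times, events) if u == t and e == 1)
        survival *= 1.0 - relapses / at_risk
        curve.append((t, survival))
    return curve

# Invented follow-up data: five patients, two of them censored.
print(kaplan_meier([6, 12, 12, 24, 30], [1, 1, 0, 0, 1]))
```

Censored patients contribute to the number at risk until their last follow-up, which is why they lower the denominator without causing a drop in the curve.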

Survival analyses for each radiomics clustering (Table 3) revealed associations with worse MFS in univariable analysis, with the B_h-RF and A_HSCAE groups remaining associated with worse MFS in multivariable analysis (univariable analyses for covariables are provided in Supplementary Table ST2).

Table 3 Survival analysis for metastatic relapse-free survival

Hence, regarding the h-RF clustering, hazard ratios (HRs) were 1.97 (95% confidence interval [CI]: 0.90–4.29, P = 0.0899) for the A_h-RF group and 2.84 (95% CI: 1.23–6.57, P = 0.0146) for the B_h-RF group, compared to the C_h-RF group.

Regarding d-RF clustering, multivariable analyses provided HR = 1.50 (95% CI: 0.79–2.85, P = 0.2112) for A_CAE group compared to B_CAE group, and HR = 2.73 (95% CI: 1.37–5.42, P = 0.0043) for A_HSCAE group compared to B_HSCAE group.
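For readers less familiar with Cox regression output, the relation between a fitted coefficient and a reported HR can be written as below. This assumes Wald-type confidence intervals, which the text does not state explicitly; the symbols \hat{\beta} (estimated log-hazard coefficient) and SE (its standard error) are introduced here for illustration only.

```latex
\mathrm{HR} = \exp(\hat{\beta}), \qquad
95\%\ \mathrm{CI} = \left[\exp\!\left(\hat{\beta} - 1.96\,\mathrm{SE}(\hat{\beta})\right),\;
                          \exp\!\left(\hat{\beta} + 1.96\,\mathrm{SE}(\hat{\beta})\right)\right]
```

Under this assumption the interval is symmetric on the log scale, so the geometric mean of its bounds should approximately recover the HR. For the A_HSCAE group, √(1.37 × 5.42) ≈ 2.72, close to the reported 2.73.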

Stepwise Cox regression selected the HSCAE clustering for further analyses in its last step (minimal Akaike information criterion [AIC] = 645.8). Kaplan–Meier curves for h-RF and HSCAE clustering are presented in Fig. 2a, b.

**Fig. 2: Summary of the survival analyses for metastatic relapse free survival (MFS) in the subcohort of patients with both radiomics and transcriptomics data (n = 110).**

Complementarity of radiomics and transcriptomics

The complementarity of h-RF clustering and the relevant d-RF clustering (HSCAE) with transcriptomics clustering was assessed in 110 patients having both the radiomics and transcriptomics data.

Transcriptomics clustering was associated with MFS in univariable analysis (HR for A_RNA of 2.60, 95% CI: 1.12–6.02, P = 0.0261) (Table 4, Fig. 2c). B_h-RF and A_HSCAE groups remained associated with lower MFS in this subcohort (univariable HR = 3.55, 95% CI: 1.29–9.72, P = 0.0140, and HR = 3.90, 95% CI: 1.60–9.52, P = 0.0028, respectively).

Table 4 Survival analysis and predictive performances in the subcohort of n = 110 patients with both radiomics and transcriptomics data

In a Cox regression model including the h-RF clustering and the transcriptomics clustering, the A_RNA and B_h-RF groups were still associated with lower MFS (HR = 3.08, 95% CI: 1.32–7.17, P = 0.0092, and HR = 4.23, 95% CI: 1.54–11.65, P = 0.0052, respectively).

Similarly, in a Cox regression model including the HSCAE clustering and the transcriptomics clustering, the A_RNA and A_HSCAE groups remained associated with lower MFS (HR = 2.93, 95% CI: 1.25–6.83, P = 0.0129, and HR = 4.28, 95% CI: 1.74–10.50, P = 0.0015, respectively).

In order to identify the most relevant combination of radiomics and transcriptomics clustering, we then investigated and compared the prognostic value of univariable and combined models using a 5-fold cross-validation scheme. Significant increment in Harrell concordance index (c-index) with radiomics-based clustering combined with transcriptomics clustering was observed (Fig. 2d). Thus, the c-index for the Transcriptomics groups alone was 0.603 (95% CI: 0.574–0.675), versus 0.633 (95% CI: 0.599–0.771) for h-RF group alone and 0.666 (95% CI: 0.643–0.820) for the Transcriptomics × h-RF combined model (P = 0.0380 against Transcriptomics group alone, and P = 0.0469 against h-RF group alone). Regarding deep radiomics, the c-index was 0.709 (95% CI: 0.651–0.788) for the Transcriptomics × HSCAE combined model, which was significantly higher than for the HSCAE model alone (c-index = 0.661, 95% CI: 0.615–0.805, P = 0.0220) and for Transcriptomics group alone (P = 0.0110).
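To make the comparison metric concrete, the following is a minimal sketch of Harrell's concordance index for right-censored data. It is a simplified illustration with invented inputs, not the authors' implementation: pairs with tied event times are ignored, and the cross-validation and confidence-interval steps are omitted.

```python
def harrell_c_index(times, events, risk_scores):
    """Harrell's concordance index.

    times: follow-up times; events: 1 if relapse observed, 0 if censored;
    risk_scores: higher score means higher predicted risk of relapse.
    """
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        for j in range(n):
            # A pair is comparable when patient i relapsed before patient j's time.
            if events[i] == 1 and times[i] < times[j]:
                comparable += 1
                if risk_scores[i] > risk_scores[j]:
                    concordant += 1.0
                elif risk_scores[i] == risk_scores[j]:
                    concordant += 0.5  # tied predictions count for half
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable

# Invented example: four patients, the third one censored.
times = [5, 10, 15, 20]
events = [1, 1, 0, 1]
risk = [3, 2, 2, 1]
print(harrell_c_index(times, events, risk))  # 0.9
```

Because cluster membership is a categorical predictor, many patient pairs share the same risk score; the half-credit for ties is therefore a substantial part of the c-index of group-based models.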

Consequently, we selected the HSCAE and transcriptomics combined model. Kaplan–Meier analysis revealed that the A_RNA × A_HSCAE combination group had a worse MFS than all the other combination groups (hereafter called the ‘Other’ group, made of the B_RNA × A_HSCAE, A_RNA × B_HSCAE and B_RNA × B_HSCAE combinations) (Fig. 2e, f). The HR for the A_RNA × A_HSCAE group against the Other group was 4.29 (95% CI: 2.07–8.90, P < 0.0001, Table 4). The A_RNA × A_HSCAE group remained significantly associated with MFS in the most frequent histotype, namely UPS (n = 29, with HR = 8.77, 95% CI: 1.04–73.6, P = 0.0456). We then proceeded with the differential gene expression analysis between the A_RNA × A_HSCAE group and the Other group.

Lastly, we investigated whether the prognostic performance of the clinical–histological SARCULATOR nomogram would be increased with deep radiomics and transcriptomics. The average c-indices in 5-fold cross-validation increased from 0.698 (95% CI: 0.608–0.787) for the SARCULATOR groups alone to 0.728 (95% CI: 0.650–0.807) for a model combining the SARCULATOR, HSCAE, and transcriptomics groups, although the difference was not significant (P = 0.1458). In this last Cox model, two characteristics remained independently associated with MFS: the low Pr-OS SARCULATOR group (HR = 6.04, 95% CI: 2.48–14.73, P < 0.0001) and the A_HSCAE group (HR = 2.69, 95% CI: 1.01–7.14, P = 0.0477). Supplementary Table ST3 shows the corresponding survival table for this sub-analysis.

Gene-expression analysis

The Volcano plot analysis of differential gene expression (DGE) between A_RNA × A_HSCAE and Other groups identified 1230 differentially expressed genes (Fig. 3a). The full list of genes and their associated annotations are reported in Supplementary Table ST4. GeneSet enrichment analysis on the differentially expressed genes revealed 292 significantly enriched pathways (Supplementary Table ST5 reports the description of the gene set, the adjusted p-value, the enrichment status, and the activation or inhibition status of the other group). Figure 3b shows the list of the most significantly enriched pathways. The analysis of the genes participating in those genesets showed that the A_RNA × A_HSCAE group of worse prognosis activates the inflammatory response, epithelial-mesenchymal transition, hypoxia, apoptosis inhibition, G2M checkpoint, UV response, E2F targets, and xenobiotic metabolism gene sets. We performed an extensive review of the role of the genes belonging to those pathways whose results are reported in Supplementary Table ST6. For each geneset, we reviewed the description of the participating genes, their role played in a specific tumor type and the reference to the scientific study showing evidence of this role. We identified important genes having an oncogenic role in a wide variety of tumors where the role of the gene seems to be coherent with the one played in the A_RNA × A_HSCAE group of worse prognosis. A discriminant signature between the A_RNA × A_HSCAE and other groups, performed via PAMr, a machine learning method, identified a subset of 162 discriminant genes (Fig. 3c). The detailed annotation of those genes (reported in Supplementary Table ST7) highlighted genes belonging to the epithelial–mesenchymal transition, hypoxia, and apoptosis inhibition genesets in the A_RNA × A_HSCAE group.

**Fig. 3: Summary of the differential gene expression (DGE) and pathway analyses between the A_RNA × A_HSCAE group and the ‘Others’ groups.**

MRI examples of patients with opposite radiomic-transcriptomics grouping and outcomes are presented in Fig. 4.

Discussion

In this study, we devised methods to classify soft tissue sarcomas (STS) imaging, independent of their initial presentation on conventional MRI, employing both handcrafted radiomics (h-RFs) and deep radiomics (d-RFs, using CAE and HSCAE models). Cluster analysis identified classes that were consistently correlated with the clinical and histological SARCULATOR nomogram and prognostic semantic radiological phenotypes while displaying a notable disconnection from transcriptomic clusters¹⁹. Our findings suggest that combining pre-treatment radiomics and transcriptomic data through a combined radiomic-transcriptomic signature could enhance the predictive performance of transcriptomics in STS patients.

To our knowledge, no study has concurrently analyzed radiomics and transcriptomic datasets in a large cohort of STS patients. Radiomics data, obtained through either handcrafted or deep radiomics approaches, demonstrated strong associations with semantic radiological features linked to aggressive tumor characteristics, higher grade, and poorer survival outcomes^3,4. Moreover, the h-RF grouping was significantly associated with the histological types, which illustrates the potentiality of imaging to pre-type STS from radiological images directly. The d-RF groups additionally correlated with FNCLCC grade and SARCULATOR groups, i.e., with histological and clinical variables strongly associated with metastasis and death risk. Collectively, these associations between radiomics groups, identified without prior assumptions, and clinical, histological, and radiological features having prognostic significance contribute to validating the relevance of radiomics data.

Notably, although the fully unsupervised h-RFs, the CAE clustering, and the HSCAE clustering exhibited c-indices lower than some previously published supervised models, they remained significantly better than random in both univariable and multivariable analyses accounting for confounding covariables, underscoring their intrinsic prognostic value for STS patients^7,14,18,20,21,22.

Transcriptomics groups, while exhibiting weaker but significant associations with patient survivals, showed a substantial increase in prognostic performance (c-index = 0.709) when combined with HSCAE groups. The Kaplan–Meier curves for the combined deep-radiomics-transcriptomics variable demonstrated a clear gradient in survival probability from less to more aggressive groups. This emphasizes the complementary nature of radiomics and transcriptomics, suggesting that adding radiomics data to prognostic gene-expression signatures could enhance their predictive value. We believe this complementarity could be explained by the difference and complementarity in scale and nature of radiomics and transcriptomics data. Indeed, radiomics are macroscopic data assessed over a digital image of the entire tumor volume, whereas transcriptomics data are extracted from millimetric tumor samples. Even though our findings do not provide immediate clinical application, we believe they should encourage the investigations of multi-omics prognostic signatures for STS patients through multi-center collaborations. Moreover, our findings also suggest that radiomics-transcriptomics data could enhance pre-existing clinical-histological nomograms such as the SARCULATOR.

Lastly, our attempt to elucidate the gene-expression levels of the A_RNA × A_HSCAE group (indicating worse outcomes) involved analyzing 1230 differentially expressed genes (including 707 overexpressed genes) and 292 pathways in 110 patients. These findings implicated the tumor micro-environment and tumorigenesis through the dysregulation of adaptive immune reactions, cell growth, inflammation, cell differentiation, apoptosis, and hypoxia^16,22. Interestingly, although the transcriptomics groups were significantly associated with the histological types, the A_RNA × A_HSCAE group remained associated with significantly lower MFS in the most frequent histotype, i.e., UPS. It must be emphasized that gene-expression signatures dedicated to STS exist, notably the complexity index in sarcoma (CINSARC) signature^16,23, which would enhance the prognostic performance of the SARCULATOR nomogram²⁴. The aim of this study was not to challenge CINSARC (established on 310 samples versus 110 in our study). However, we found 17 common underlying pathways between CINSARC and the pathways identified in our transcriptomics analysis (Supplementary Table ST8), as well as strong associations (P = 0.0069, Chi-square test) between CINSARC and the transcriptomics groups in a subset of 54/110 (49.1%) patients with available data (data not shown). This highlights the consistency of molecular features involved in the aggressiveness of STS.

Despite these valuable insights, our study has limitations, including its retrospective nature, heterogeneous imaging protocols necessitating MRI processing, and the availability of transcriptomic data for only half of the study population. Additionally, deep radiomics analyses were restricted to a central slice of CE-T1-WI due to current limitations in freely available deep learning algorithms for multiple co-registered 3D volumes. Lastly, although large compared to prior radiomics studies in STS, the study population was too small to enable the development of a reliable and more powerful supervised deep-learning model to predict patient MFS. For instance, the DeepSurv model required datasets with more than 1500 observations for its training²⁵. Similarly, even though the MSEs of the reconstructions by the CAE and HSCAE were low in the Training and Testing cohorts, the small size of the Testing cohort (n = 25 patients) suggests the need to validate the approach in larger independent cohorts. Addressing these limitations could further enhance the performance of deep-learning models.

In conclusion, our study provides a comprehensive analysis of MRI radiomics profiles for STS, revealing relationships with patient outcomes, semantic radiological features, and the SARCULATOR nomogram. Moreover, it highlights the synergistic potential of radiomics and transcriptomic data to refine the prognostication of STS patients, opening avenues for the development of a radiomic-transcriptomic prognostic signature for sarcoma patients.

Methods

Study design

This single-center study was approved by the institutional review board of Bergonié (Sarcoma Reference Center of Nouvelle-Aquitaine, Bordeaux, France). Written informed consent was waived owing to the retrospective nature of the study. The study was conducted in agreement with good clinical practice and applicable laws. All research procedures and protocols adhered to the principles set forth in the Declaration of Helsinki.

Patients were identified from the prospectively maintained surgical database of our sarcoma reference center. We included all consecutive patients between May 2008 and May 2020 who presented with a newly diagnosed locally advanced STS, with histopathological confirmation by a senior pathologist with expertise in STS from our institution (F.L.L., J.M.C., and R.P.), with available pre-treatment MRI including a gadolinium-chelate injection, and who were treated with curative intent including surgery.

We excluded patients with atypical lipomatous tumors, metastases at initial staging (i.e., on chest CT scan), and patients whose pre-treatment MRI did not include at least one T1-weighted imaging (WI), one T2-WI, and one fat-suppressed (FS) CE T1-WI (CE-T1-WI).

Figure 1 shows the flow chart. Figure 5 shows the overall study workflow.

Histologic and clinical data collection

Data collection from medical records comprised the patient’s age at diagnosis, sex, World Health Organization Performance status (WHO-PS), the tumor depth, location, longest diameter (LD), pre-treatment histological type, and FNCLCC grade (either on biopsy or on entire specimen)²⁶, and the initial therapeutic management, namely radiotherapy or anthracycline-based chemotherapy (categorized as none, neoadjuvant or adjuvant) always combined with curative surgery, and the surgical margins (categorized as R0 versus R1–R2).

Moreover, the 10-year predicted probability of OS (Pr-OS) according to the SARCULATOR nomogram for extremity and trunk wall STS was calculated for each patient using the free application¹⁹. Patients were then divided into three categories of 10-year Pr-OS: low (≤51%), intermediate (>51% and ≤66%), and high (>66%), as previously described in prior studies from the same authors^27,28. The corresponding new categorical variable was named SARCULATOR.
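The three-level categorization can be expressed as a short function. This is only a sketch of the thresholds quoted above; the function name is hypothetical, and the Pr-OS value itself is assumed to come from the SARCULATOR application.

```python
def sarculator_group(pr_os_percent):
    """Map a 10-year predicted OS (in %) to the three categories used in the text."""
    if not 0 <= pr_os_percent <= 100:
        raise ValueError("Pr-OS must be a percentage between 0 and 100")
    if pr_os_percent <= 51:
        return "low"
    if pr_os_percent <= 66:
        return "intermediate"
    return "high"

print([sarculator_group(p) for p in (40, 51, 60, 66, 80)])
# ['low', 'low', 'intermediate', 'intermediate', 'high']
```

Note that a low Pr-OS corresponds to the least favorable predicted outcome.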

Follow-ups consisted of clinical examinations and chest radiographs every 3 months for 2 years, then every 6 months for 5 years, and then annually, with complementary local MRI and chest CT-scan in case of doubtful findings. The main outcome was MFS. MFS, local relapse-free survival (LFS), and overall survival (OS) corresponded to the time (in months) elapsed from surgery to metastatic relapse, local relapse, and death related to disease (or last follow-up), respectively. Patients without events during the study period were censored. All relapses were histologically proven.

MRI acquisition

The pre-treatment MRI examinations were acquired on different 1.5-Tesla MR-systems with adjustments of the coils, field-of-views, and matrices depending on tumor size, location, and depth. The protocols systematically included at least one T1-WI prior to contrast injection, one T2-WI, and one FS CE-T1-WI. Various methods were accepted for FS, i.e., fat saturation, Dixon, subtraction, fluid-sensitive, and short tau inversion-recovery sequences²⁹. The ranges of repetition time and echo time were 500–700 and 10–15 ms for T1-WI and 2400–4500 and 70–130 ms for T2-WI, respectively. The ranges of in-plane resolution and thickness were 0.75 × 0.75–1.4 × 1.4 mm² and 1–7 mm, respectively.

Conventional radiological analysis

The conventional (or semantic) radiological analysis was performed in consensus by two senior radiologists with expertise in STS (A.C. and M.K.) on a picture archiving and communication system (Enterprise Imaging, Agfa Healthcare, Mortsel, Belgium). They reproduced the same analysis as in the prognostic study by Crombé et al.⁴ and reported whether at least 2 out of the 3 following semantic radiological features were present: (i) heterogeneous signal intensity (SI) on T2-WI ≥ 50% (i.e., when ≥50% of the tumor volume showed areas with high, intermediate and low SI), (ii) presence of an area compatible with necrosis (i.e., defined as high fluid-like SI on T2-WI without enhancement on T1-WI after gadolinium chelate injection), and (iii) presence of peritumoral enhancement (i.e., defined as contrast enhancement beyond the apparent tumor borders without mass effect). The resulting semantic radiophenotype (named ‘high risk’) was a significant predictor of lower MFS and OS in that study. Conversely, if one or fewer of these three radiological features was seen, the tumor was categorized as ‘low risk’.
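The "at least 2 out of 3" rule can be written down explicitly. The sketch below only encodes the decision rule; the three boolean inputs are assumed to come from the radiologists' consensus reading, and the function name is hypothetical.

```python
def semantic_radiophenotype(heterogeneous_t2, necrosis, peritumoral_enhancement):
    """Return 'high risk' when at least two of the three semantic features are present."""
    present = sum([bool(heterogeneous_t2), bool(necrosis), bool(peritumoral_enhancement)])
    return "high risk" if present >= 2 else "low risk"

print(semantic_radiophenotype(True, True, False))   # high risk
print(semantic_radiophenotype(True, False, False))  # low risk
```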

Radiomics groups

Handcrafted radiomics pipeline

Since the sequences were obtained on different MR-systems and given the lack of standardized units for conventional MRI, a 3-step post-processing pipeline was applied to the T1-WI, T2-WI, and CE-T1-WI sequences in order to homogenize the imaging dataset using the ITK library (https://github.com/InsightSoftwareConsortium/ITK). First, a bilinear interpolation was applied to resample the voxel size to a common resolution of 1 × 1 × 4 mm³. Second, non-uniform SIs due to magnetic field heterogeneity were corrected using N4-ITK bias field correction³⁰. Third, SIs were homogenized with simple ITK histogram-matching and ranged between −10,000 and +10,000³¹. Afterward, the MRIs were uploaded to the LIFEx freeware (version 4.70, Saclay, France), compliant with the guidelines from the Image Biomarker Standardisation Initiative^32,33. One senior radiologist manually segmented the entire tumor volumes of interest (VOIs), slice-by-slice, on the CE-T1-WI and then propagated this VOI on the T1-WI and T2-WI with adjustments of the segmentation boundaries if needed. Afterward, the SIs were discretized into 256 gray-levels and 59 texture h-RFs were extracted per sequence, namely 13 first-order texture h-RFs and 46 second-order texture h-RFs (21 from the gray-level co-occurrence matrix [GLCM] using 1, 2 and 4 pixel distance to neighbors; 11 from the gray-level run length matrix [GLRLM], 3 from the neighborhood gray-level difference matrix [NGLDM], and 11 from the gray-level zone length matrix [GLZLM]), in addition to 4 shape features from CE-T1-WI. The precise definitions and formulas for all RFs can be found at: https://www.lifexsoft.org/index.php/resources/19-texture/radiomic-features.

The same procedure (from segmentation to RF extraction) was repeated on 30 randomly selected patients to estimate the reproducibility of the RFs according to the intra-class correlation coefficient (ICC). We only selected the RFs with ICC ≥ 0.90 for the remaining analyses, namely 46 texture h-RFs from T1-WI, 52 texture h-RFs from T2-WI, 52 texture h-RFs from CE-T1-WI, and 2 shape h-RFs (i.e., n = 152 h-RFs, listed in Supplementary Table ST9). The consensus clustering algorithm was applied to the robust h-RFs from the entire cohort. After center-scaling those h-RFs, each clustering was resampled 10,000 times, leaving out 40% of the samples at each iteration, based on hierarchical clustering using the Pearson distance and the average link³⁴.
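
The reproducibility filter can be sketched as follows. The ICC form is not stated in this section, so the example assumes ICC(3,1) (two-way mixed effects, consistency, single measurement); the feature names and values are hypothetical.

```python
def icc_consistency(ratings):
    """ICC(3,1) for a list of rows, one per patient, each holding k repeated measurements."""
    n, k = len(ratings), len(ratings[0])
    grand = sum(map(sum, ratings)) / (n * k)
    row_means = [sum(r) / k for r in ratings]
    col_means = [sum(r[j] for r in ratings) / n for j in range(k)]
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for r in ratings for x in r)
    ss_err = ss_total - ss_rows - ss_cols
    ms_rows = ss_rows / (n - 1)
    ms_err = ss_err / ((n - 1) * (k - 1))
    return (ms_rows - ms_err) / (ms_rows + (k - 1) * ms_err)


def robust_features(measurements, threshold=0.90):
    """Keep the names of features whose ICC reaches the threshold."""
    return [name for name, ratings in measurements.items()
            if icc_consistency(ratings) >= threshold]


# Hypothetical features measured twice (two segmentations) on three patients.
measurements = {
    "feature_a": [[1, 2], [2, 3], [3, 4]],   # constant offset: fully consistent
    "feature_b": [[1, 3], [2, 1], [3, 2]],   # patient ranking changes between sessions
}
kept = robust_features(measurements)
```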

Deep radiomics pipeline

For each patient, the mask resulting from the tumor segmentation was propagated to the initial fat-suppressed CE-T1-WI to remove the background surrounding the tumor, with manual adjustments made if necessary. Subsequently, the slice at the tumor’s midpoint was selected for further analysis. Two types of neural networks were built: a convolutional autoencoder neural network (CAE) and a half-supervised convolutional autoencoder neural network (HSCAE), which included a prognostic loss function. Details on data preprocessing, augmentation, model architecture, loss function, optimization, performance evaluation, and d-RF extraction for each network are provided in Supplementary Method M1^35,36,37,38. Overall, the output of a convolutional autoencoder neural network is an image that is the best ‘replicate’ of the input image. To achieve this, the network is organized into two sequential sub-networks. The first is an ‘encoder network’, similar to a classical convolutional neural network used for supervised classification of images, in which the input image is progressively reduced to a vector of ‘latent features’. The second is a ‘decoder network’, which takes the ‘latent features’ and progressively reconstructs the input image by deconvolution. The fidelity of the reconstruction was optimized using the mean squared error (MSE) between the input and output data. We trained the CAE and HSCAE models using a leave-10%-out cross-validation technique on a randomly selected subset of 200 patients (Training cohort), repeating the process 100 times. The resulting vector of ‘latent features’ (i.e., 1024 d-RFs per image) was subjected to unsupervised hierarchical clustering (with the Pearson distance and average link, after center-scaling the d-RFs) to assign each patient to a cluster (CAE and HSCAE grouping). Additional methodological details, notably about artefactual morphological augmentation techniques, are given in Supplementary Method M1.
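
The resampling scheme and the reconstruction criterion can be sketched without a deep-learning framework. The example below assumes that each of the 100 repeats holds out a random 10% of the 200 training patients; the actual splitting strategy is described in Supplementary Method M1 and may differ. Function names and the seed are hypothetical.

```python
import random


def mse(x, x_hat):
    """Mean squared error between an input and its reconstruction (flattened pixels)."""
    return sum((a - b) ** 2 for a, b in zip(x, x_hat)) / len(x)


def repeated_leave_fraction_out(ids, fraction=0.10, repeats=100, seed=0):
    """Yield (train_ids, held_out_ids) pairs, holding out a random fraction per repeat."""
    rng = random.Random(seed)
    n_out = round(len(ids) * fraction)
    for _ in range(repeats):
        held_out = set(rng.sample(ids, n_out))
        train = [i for i in ids if i not in held_out]
        yield train, sorted(held_out)


# 200 training patients, 100 repeats, 20 patients held out each time.
splits = list(repeated_leave_fraction_out(list(range(200))))
```

In each repeat, a model would be fitted on the 180 training patients and its reconstruction MSE checked on the 20 held-out patients.
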
For the 25 remaining patients, who formed the independent test set (Testing cohort), we verified the quality of the reconstructions provided by CAE and HSCAE according to MSE. Afterward, their CAE and HSCAE clusters were identified using the centroid method, which consists of calculating the Pearson distance between each new observation and the centroid of each cluster and assigning the observation to the cluster with the shortest distance.
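
A minimal sketch of the centroid method follows, taking the Pearson distance as 1 minus the Pearson correlation coefficient (a common definition, assumed here). The cluster labels and the three-feature vectors are hypothetical stand-ins for the 1024 d-RFs.

```python
from math import sqrt


def pearson_distance(u, v):
    """1 - Pearson correlation; undefined if either vector has zero variance."""
    n = len(u)
    mu, mv = sum(u) / n, sum(v) / n
    cov = sum((a - mu) * (b - mv) for a, b in zip(u, v))
    su = sqrt(sum((a - mu) ** 2 for a in u))
    sv = sqrt(sum((b - mv) ** 2 for b in v))
    return 1.0 - cov / (su * sv)


def centroid(rows):
    """Feature-wise mean of the training observations in one cluster."""
    return [sum(col) / len(rows) for col in zip(*rows)]


def assign_cluster(x, clusters):
    """Return the label of the cluster whose centroid is closest to x."""
    centroids = {label: centroid(rows) for label, rows in clusters.items()}
    return min(centroids, key=lambda label: pearson_distance(x, centroids[label]))


# Hypothetical training clusters and one new (test) observation.
clusters = {"A": [[1, 2, 3], [2, 3, 5]],
            "B": [[3, 2, 1], [5, 3, 2]]}
assigned = assign_cluster([0, 1, 2.5], clusters)
```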

Gene expression analysis

We included in the analysis all patients with good-quality frozen material or paraffin-embedded tissue obtained before treatment and contemporary to the baseline MRI used for radiomics. After whole-RNA sequencing, the produced RNA sequences were quality-controlled and aligned to the transcriptome. Gene expression was then estimated by the counts of high-quality sequences aligned per gene. The ComBat harmonization method was applied to correct batch effects³⁹. Finally, gene expression counts were normalized using the voom method (Supplementary Method M2)⁴⁰. Unsupervised transcriptomics grouping was similarly built on RNA-sequencing data with hierarchical consensus clustering.
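
For orientation, the log-counts-per-million transform at the core of voom⁴⁰ is log2((count + 0.5) / (library size + 1) × 10⁶). The sketch below shows only this transform for one sample; it omits the precision weights that voom also estimates, and the counts are made up.

```python
from math import log2


def voom_log_cpm(counts):
    """log2 counts per million for one sample, with the offsets used by voom."""
    lib_size = sum(counts)
    return [log2((c + 0.5) / (lib_size + 1.0) * 1e6) for c in counts]


# Hypothetical sample with three genes and a library size of 999,999 reads.
log_cpm = voom_log_cpm([0, 499999, 500000])
```

The 0.5 offset keeps genes with zero counts finite on the log scale.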

Statistical analyses

Statistical analyses were also performed with R (v4.1.0). All tests were two-tailed. A P-value < 0.05 was deemed significant.

Understanding the ‘omics’ groups

Associations between the h-RFs group, d-RFs CAE and HSCAE groups, transcriptomics groups, histologic types, histologic grade, SARCULATOR groups, and semantic radiophenotypes (i.e., categorical variables) were tested with Chi-square tests. Associations with the SARCULATOR Pr-OS and tumor size were investigated using the unpaired t-test or the Mann–Whitney test, depending on the result of the Shapiro–Wilk normality test.
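
As a reminder of what the Chi-square test computes, the sketch below derives the Pearson statistic and its degrees of freedom from a contingency table. It applies no continuity correction and does not compute the P-value, which requires the Chi-square distribution; the table values are hypothetical.

```python
def chi_square_statistic(table):
    """Pearson chi-square statistic and degrees of freedom for an r x c table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / total
            statistic += (observed - expected) ** 2 / expected
    dof = (len(table) - 1) * (len(table[0]) - 1)
    return statistic, dof


# Hypothetical cross-tabulation of a radiomics group (rows) against grade (columns).
stat, dof = chi_square_statistic([[10, 20], [20, 10]])
```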

Prognostic value of the ‘omics’ group alone and in combinations

The Kaplan–Meier curves for MFS, depending on the radiomics and transcriptomics groups, were drawn, and the differences in survival were tested with the log-rank test. Univariable and multivariable Cox regressions were performed to estimate the hazard ratio (HR) with a 95% confidence interval (CI) of each group. Multivariable models comprised the following covariables involved in the SARCULATOR nomograms and accounting for patient management: age (continuous), histologic type (according to the SARCULATOR categorization, with myxoid/round cells liposarcoma as reference), histologic grade (I and II [reference] versus III), tumor size (continuous), chemotherapy (no [reference] versus yes), radiotherapy (no versus yes [reference]), and surgical margins (R0 [reference] versus R1 and R2). To identify the most relevant deep-RF grouping among the three we developed, a stepwise backward Cox regression (minimizing the Akaike information criterion [AIC]) including all d-RF clusters and the covariables was built. Lastly, to evaluate the complementarity of radiomics groups and transcriptomics groups to predict patient outcome, the Harrell concordance indices (c-indices) of the best d-RF grouping alone, the h-RF grouping alone, the transcriptomics grouping alone, and their combination (with and without interaction term) were evaluated in 5-fold cross-validation and compared using a bootstrapped distribution of their difference over 1000 random replicates of the study population. The same approach was applied to evaluate the potential synergy between the SARCULATOR, transcriptomics, and d-RF groups. Patients with any missing value among the input variables of the multivariable Cox regressions were removed from the analyses.
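
The concordance index used to compare the models can be sketched as below. This is a plain quadratic-time version of Harrell's c-index that ignores pairs with tied event times, which is a simplifying assumption; the survival data are hypothetical.

```python
def harrell_c_index(times, events, risk_scores):
    """Fraction of comparable patient pairs whose predicted risks are correctly ordered.

    A pair (i, j) is comparable when patient i had the event and did so
    strictly before patient j's event or censoring time.
    """
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        for j in range(n):
            if events[i] and times[i] < times[j]:
                comparable += 1
                if risk_scores[i] > risk_scores[j]:
                    concordant += 1.0
                elif risk_scores[i] == risk_scores[j]:
                    concordant += 0.5  # tied predictions count as half
    return concordant / comparable


# Hypothetical follow-up times (months), event indicators, and predicted risks.
times = [1, 2, 3, 4]
events = [1, 1, 0, 1]
c_perfect = harrell_c_index(times, events, [4, 3, 2, 1])
c_reversed = harrell_c_index(times, events, [1, 2, 3, 4])
```

A value of 0.5 corresponds to random ordering and 1.0 to perfect ordering. Recomputing the difference between two models' c-indices on bootstrap replicates of the cohort gives the distribution used for the comparison described above.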

Gene-expression profiling of the prognostic radiomics groups

Differential gene expression (DGE) and geneset enrichment analyses were performed between the groups of the radiomics-transcriptomics grouping with the highest association with patient MFS. Exploratory DGE was performed by t-test calculation per gene. To discriminate significantly up- and down-regulated genes, the fold-change threshold was set to 2. The P-value cut-off was adjusted with the Benjamini–Hochberg procedure. We then assessed geneset enrichment in biological pathways based on the Broad Institute’s Molecular Signatures Database and the CIBERSORT LM22 immuno-genesets (Supplementary Method M3)^41,42. Lastly, the prediction analysis for microarrays (PAM) method was applied to provide class prediction from gene expression profiling based on an enhancement of the simple nearest prototype (centroid) classifier (Supplementary Method M3)^43,44,45.
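
The gene selection step can be sketched as a Benjamini–Hochberg adjustment followed by a fold-change filter. The sketch assumes that the fold-change threshold of 2 applies in both directions (|log2 fold change| ≥ 1) and that adjusted P-values are compared with 0.05; the gene names and values are hypothetical, and the per-gene t-tests are taken as already computed.

```python
from math import log2


def benjamini_hochberg(p_values):
    """Benjamini-Hochberg adjusted P-values, returned in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest to the smallest P-value to enforce monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


def select_de_genes(log2_fc, p_values, fold_change=2.0, alpha=0.05):
    """Split significant genes into up- and down-regulated lists."""
    genes = list(log2_fc)
    adjusted = benjamini_hochberg([p_values[g] for g in genes])
    cut = log2(fold_change)
    up = [g for g, q in zip(genes, adjusted) if q < alpha and log2_fc[g] >= cut]
    down = [g for g, q in zip(genes, adjusted) if q < alpha and log2_fc[g] <= -cut]
    return up, down


# Hypothetical per-gene log2 fold changes and raw t-test P-values.
log2_fc = {"G1": 1.5, "G2": -2.0, "G3": 0.2, "G4": 3.0}
p_raw = {"G1": 0.001, "G2": 0.002, "G3": 0.0005, "G4": 0.5}
up, down = select_de_genes(log2_fc, p_raw)
```

Here G3 is significant but changes too little, and G4 changes strongly but is not significant, so neither is selected.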

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The raw and processed data generated in this study have been deposited in NCBI’s Gene Expression Omnibus (GEO) and are accessible through GEO Series accession number GSE262937 for the RNA-seq experiment. The differential gene expression analyses and pathway analyses are available in the supplementary materials. The radiomics datasets and raw MRIs used and/or analyzed during the current study are available from the corresponding author on reasonable request. Any additional results can be obtained from the corresponding author.

Code availability

The underlying code for this study is not publicly available but may be made available to qualified researchers on reasonable request from the corresponding author. The RStudio and Conda environments and the packages with their versions used for the analyses are detailed in Supplementary Table ST10.

References

1. Fletcher, C. D. M. et al. WHO Classification of Soft Tissue and Bone Tumours. 5th edn (International Agency for Research on Cancer, IARC Press, Lyon, France, 2020).
2. Gronchi, A. et al. Soft tissue and visceral sarcomas: ESMO-EURACAN-GENTURIS Clinical Practice Guidelines for diagnosis, treatment and follow-up. Ann. Oncol. S0923-S7534, 02184-0 (2021).
3. Zhao, F. et al. Can MR imaging be used to predict tumor grade in soft-tissue sarcoma? Radiology 272, 192–201 (2014).
4. Crombé, A. et al. Soft-tissue sarcomas: assessment of MRI features correlating with histologic grade and patient outcome. Radiology 291, 710–721 (2019).
5. Lambin, P. et al. Radiomics: the bridge between medical imaging and personalized medicine. Nat. Rev. Clin. Oncol. 14, 749–762 (2017).
6. Martin-Gonzalez, P. et al. Integrative radiogenomics for virtual biopsy and treatment monitoring in ovarian cancer. Insights Imaging 11, 94 (2020).
7. Peeken, J. C. et al. Tumor grading of soft tissue sarcomas using MRI-based radiomics. EBioMedicine 48, 332–340 (2019).
8. Yan, R. et al. Magnetic resonance imaging-based radiomics nomogram for prediction of the histopathological grade of soft tissue sarcomas: a two-center study. J. Magn. Reson. Imaging 53, 1683–1696 (2021).
9. Crombé, A. et al. T2-based MRI delta-radiomics improve response prediction in soft-tissue sarcomas treated by neoadjuvant chemotherapy. J. Magn. Reson. Imaging 50, 497–510 (2019).
10. Crombé, A. et al. High-grade soft-tissue sarcomas: can optimizing dynamic contrast-enhanced MRI postprocessing improve prognostic radiomics models? J. Magn. Reson. Imaging 52, 282–297 (2020).
11. Peeken, J. C. et al. MRI-based delta-radiomics predicts pathologic complete response in high-grade soft-tissue sarcoma patients treated with neoadjuvant therapy. Radiother. Oncol. J. Eur. Soc. Ther. Radiol. Oncol. 164, 73–82 (2021).
12. Crombé, A. et al. Can radiomics improve the prediction of metastatic relapse of myxoid/round cell liposarcomas? Eur. Radiol. 30, 2413–2424 (2020).
13. Yang, Y., Ma, X., Wang, Y. & Ding, X. Prognosis prediction of extremity and trunk wall soft-tissue sarcomas treated with surgical resection with radiomic analysis based on random survival forest. Update. Surg. 74, 355–365 (2022).
14. Peeken, J. C. et al. Prognostic assessment in high-grade soft-tissue sarcoma patients: a comparison of semantic image analysis and radiomics. Cancers 13, 1929 (2021).
15. Crombé, A. et al. Distinct patterns of the natural evolution of soft tissue sarcomas on pre-treatment MRIs captured with delta-radiomics correlate with gene expression profiles. Eur. Radiol. 33, 1205–1218 (2023).
16. Merry, E., Thway, K., Jones, R. L. & Huang, P. H. Predictive and prognostic transcriptomic biomarkers in soft tissue sarcomas. NPJ Precis. Oncol. 5, 17 (2021).
17. Fradet, G. et al. Prediction of lipomatous soft tissue malignancy on MRI: comparison between machine learning applied to radiomics and deep learning. Eur. Radiol. Exp. 6, 41 (2022).
18. Yang, Y., Zhou, Y., Zhou, C., Zhang, X. & Ma, X. MRI-based computer-aided diagnostic model to predict tumor grading and clinical outcomes in patients with soft tissue sarcoma. J. Magn. Reson. Imaging 56, 1733–1745 (2022).
19. Callegaro, D. et al. Development and external validation of two nomograms to predict overall survival and occurrence of distant metastases in adults after surgical resection of localised soft-tissue sarcomas of the extremities: a retrospective analysis. Lancet Oncol. 17, 671–680 (2016).
20. Crombé, A. et al. Radiomics and artificial intelligence for soft-tissue sarcomas: current status and perspectives. Diagn. Interv. Imaging 104, 567–583 (2023).
21. Hu, Y. et al. A contrast-enhanced MRI-based nomogram to identify lung metastasis in soft-tissue sarcoma: a multi-centre study. Med. Phys. 50, 2961–2970 (2022).
22. Aggerholm-Pedersen, N. et al. A prognostic profile of hypoxia-induced genes for localised high-grade soft tissue sarcoma. Br. J. Cancer 115, 1096–1104 (2016).
23. Chibon, F. et al. Validated prediction of clinical outcome in sarcomas and multiple types of cancer on the basis of a gene expression signature related to genome complexity. Nat. Med. 16, 781–787 (2010).
24. Crombé, A. et al. Gene expression profiling improves prognostication by nomogram in patients with soft-tissue sarcomas. Cancer Commun. (Lond.) 42, 563–566 (2022).
25. Katzman, J. L. et al. DeepSurv: personalized treatment recommender system using a Cox proportional hazards deep neural network. BMC Med. Res. Methodol. 18, 24 (2018).
26. Trojani, M. et al. Soft-tissue sarcomas of adults; study of pathological prognostic variables and definition of a histopathological grading system. Int. J. Cancer 33, 37–42 (1984).
27. Callegaro, D. et al. Soft tissue sarcoma nomograms and their incorporation into practice. Cancer 123, 2802–2820 (2017).
28. Pasquali, S. et al. High-risk soft tissue sarcomas treated with perioperative chemotherapy: improving prognostic classification in a randomised clinical trial. Eur. J. Cancer Oxf. Engl. 93, 28–36 (2018).
29. Delfaut, E. M. et al. Fat suppression in MR imaging: techniques and pitfalls. RadioGraphics 19, 373–382 (1999).
30. Tustison, N. J. et al. N4ITK: improved N3 bias correction. IEEE Trans. Med. Imaging 29, 1310–1320 (2010).
31. Nyúl, L. G. & Udupa, J. K. On standardizing the MR image intensity scale. Magn. Reson. Med. 42, 1072–1081 (1999).
32. Nioche, C. et al. LIFEx: a freeware for radiomic feature calculation in multimodality imaging to accelerate advances in the characterization of tumor heterogeneity. Cancer Res. 78, 4786–4789 (2018).
33. Zwanenburg, A. et al. The image biomarker standardization initiative: standardized quantitative radiomics for high-throughput image-based phenotyping. Radiology 295, 328–338 (2020).
34. Wilkerson, M. D. & Hayes, D. N. ConsensusClusterPlus: a class discovery tool with confidence assessments and item tracking. Bioinforma. Oxf. Engl. 26, 1572–1573 (2010).
35. Ioffe, S. & Szegedy, C. Batch normalization: accelerating deep network training by reducing internal covariate shift. https://doi.org/10.48550/arXiv.1502.03167 (2015).
36. Glorot, X. & Bengio, Y. Understanding the difficulty of training deep feedforward neural networks. J. Mach. Learn. Res. 9, 249–256 (2010).
37. Kingma, D. P. & Ba, J. Adam: a method for stochastic optimization. https://doi.org/10.48550/arXiv.1412.6980 (2017).
38. Selvaraju, R. R. et al. Grad-CAM: visual explanations from deep networks via gradient-based localization. Int. J. Comput. Vis. 128, 336–359 (2020).
39. Johnson, W. E., Li, C. & Rabinovic, A. Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostat. Oxf. Engl. 8, 118–127 (2007).
40. Law, C. et al. voom: precision weights unlock linear model analysis tools for RNA-seq read counts. Genome Biol. 15, R29 (2014).
41. Subramanian, A. et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl Acad. Sci. USA 102, 15545–15550 (2005).
42. Newman, A. M. et al. Robust enumeration of cell subsets from tissue expression profiles. Nat. Methods 12, 453–457 (2015).
43. Tibshirani, R., Hastie, T., Narasimhan, B. & Chu, G. Diagnosis of multiple cancer types by shrunken centroids of gene expression. Proc. Natl Acad. Sci. USA 99, 6567–6572 (2002).
44. Ritchie, M. E. et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 43, e47 (2015).
45. Atanas, K. et al. The ConsensusPathDB interaction database: 2013 update. Nucleic Acids Res. 41, d793–d800 (2013).

Acknowledgements

This work received support from the French government, managed by the National Research Agency, under the France 2030 program with the reference ANR 21 RHUS 0010.

Author information

These authors contributed equally: Amandine Crombé, Carlo Lucchesi.

Authors and Affiliations

Department of Oncologic Imaging, Bergonié Institute, F-33076, Bordeaux, France
Amandine Crombé & Michèle Kind
Department of Radiology, Pellegrin University Hospital, F-33076, Bordeaux, France
Amandine Crombé
Bordeaux Institute of Oncology, BRIC U1312, Sarcotarget team, INSERM, University of Bordeaux, Institut Bergonié, F-33000, Bordeaux, France
Amandine Crombé, Mariella Spalato-Ceruso, Maud Toulmonde, Vanessa Chaire, Audrey Michot, Jean-Michel Coindre, François Le Loarer & Antoine Italiano
Department of Bioinformatics, Bergonié Institute, F-33076, Bordeaux, France
Carlo Lucchesi, Frédéric Bertolo & Aurélien Bourdon
Department of Medical Oncology, Bergonié Institute, F-33076, Bordeaux, France
Mariella Spalato-Ceruso, Maud Toulmonde & Antoine Italiano
Department of Pathology, Bergonié Institute, F-33076, Bordeaux, France
Vanessa Chaire, Jean-Michel Coindre, Raul Perret & François Le Loarer
Department of Oncologic Surgery, Bergonié Institute, F-33076, Bordeaux, France
Audrey Michot

Contributions

A.C.: Conceptualization, methodology, software, validation, formal analysis, resources, data curation, writing—original draft, writing—review and editing, visualization. C.L.: Conceptualization, methodology, software, validation, formal analysis, writing—original draft, writing—review and editing, visualization. F.B.: Methodology, software, validation, formal analysis, writing—original draft, writing—review and editing. M.K.: Validation, formal analysis, resources, data curation, writing—review and editing. M.S.C.: Validation, formal analysis, data curation, writing—review and editing. M.T.: Validation, formal analysis, data curation, writing—review and editing. V.C.: Validation, formal analysis, resources, data curation, writing—review and editing. A.M.: Validation, formal analysis, data curation, writing—review and editing. J.M.C.: Validation, formal analysis, resources, data curation, writing—review and editing. R.P.: Validation, formal analysis, resources, data curation, writing—review and editing. F.L.L.: Validation, formal analysis, resources, data curation, writing—review and editing. A.B.: Methodology, software, validation, formal analysis, data curation, writing—review and editing. A.I.: Conceptualization, methodology, validation, formal analysis, resources, writing—review and editing, supervision, project administration.

Corresponding author

Correspondence to Amandine Crombé.

Ethics declarations

Competing interests

A.I.: Research grant: AMGEN, AstraZeneca, Bayer, BMS, Merck, MSD, Novartis, CHUGAI, Parthenon, Roche; Advisory board: Bayer, BMS, Merck, MSD, Novartis, CHUGAI, Parthenon, Roche. The other authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Materials

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Crombé, A., Lucchesi, C., Bertolo, F. et al. Integration of pre-treatment computational radiomics, deep radiomics, and transcriptomics enhances soft-tissue sarcoma patient prognosis. npj Precis. Onc. 8, 129 (2024). https://doi.org/10.1038/s41698-024-00616-8

Received: 04 December 2023
Accepted: 17 May 2024
Published: 07 June 2024
DOI: https://doi.org/10.1038/s41698-024-00616-8
