Main

Sarcomas, a heterogeneous group of tumors, account for about 200 000 cancers worldwide each year (about 1% of all human malignant tumors); however, they represent a disproportionately high number of all pediatric malignant tumors (15%).1, 2 Sarcomas comprise more than 50 subtypes broadly classified into two groups: bone and soft tissue sarcomas.3 Even within a specific subtype, sarcomas are highly heterogeneous, making them diagnostically and therapeutically challenging. A few have a benign soft tumor tissue stage, such as schwannoma, which may or may not progress to malignant sarcoma. Several sarcoma subtypes are characterized by chromosomal translocations resulting in tumor-specific fusion transcripts or DNA copy number changes that are widely used as diagnostic markers.4, 5 However, most sarcomas lack such a clearly elucidated genetic signature.

Recent advances have taken place in small RNA profiling, particularly for microRNAs (miRNAs), demonstrating that miRNAs have critical roles in the majority of canonical cellular signaling networks; dysregulation of miRNAs has now been implicated in many cancers, including breast cancer, colon cancer, gastric cancer, lung cancer, and sarcomas.6, 7 Therefore, any comprehensive analysis of oncogenic or tumor suppressor signaling will need to include both messenger RNAs (mRNAs) and miRNAs. Understanding their roles in sarcoma will lead to new therapeutic targets and diagnostic biomarkers, opening the door to a greater understanding of the molecular mechanisms of all cancers.

In this study, we analyzed the mRNA expression profiles of 35 primary human tissue samples representing 13 different sarcoma subtypes, along with schwannoma, normal bone and muscle tissue. Our aim was to identify known and novel mRNAs that were significantly and uniquely upregulated in specific sarcoma subtypes. To generate the expression profiles, we used RNA-Sequencing (RNA-Seq), which allows for the generation of a superior expression data set without the concerns of probe-selection bias and other potential technical issues associated with microarray chip experiments. Then, we combined our RNA-Seq data with miRNA expression profiles taken from the Sarcoma microRNA Expression Database (S-MED), which is the most comprehensive sarcoma miRNA data set published to date; it contains miRNA expression profiles from more than 300 sarcoma primary tumor tissue samples, representing 22 different sarcoma subtypes.8 By integrating our RNA-Seq data with S-MED data, we aimed to identify key regulating miRNAs that could provide suitable upstream targets for more effective treatment of sarcomas.

MATERIALS AND METHODS

Tumor Samples

We collected 35 snap-frozen human patient tissue samples comprising 13 sarcoma subtypes, along with schwannoma, and normal bone and muscle tissues through the tissue procurement facility at the Masonic Cancer Center, University of Minnesota. To our knowledge, none of the normal tissue samples were patient matched. The samples were stored at −80 °C until used. The University of Minnesota institutional review board approved this study. For each sarcoma subtype, we selected representative sarcoma samples per the patient’s clinical diagnosis and our comprehensive miRNA expression data profiles. Then, for further RNA-Seq analysis, we selected samples representing each sarcoma subtype per their location in the dendrogram we generated (which was based on 300 sarcoma samples).8

mRNA RNA-Seq Expression Profiling

To perform high-resolution quantification of mRNA expression using RNA-Seq, we used at least 1 μg of each pool of total RNA according to the mirVana miRNA Isolation Kit (Ambion, Austin, TX, USA). We sequenced mRNA libraries (four samples per lane, with 30–40 million paired-end reads in each sample) on a High Seq 2000 (Illumina, San Diego, CA, USA). To quality check raw-sequence FASTQ files, we used FastQC (http://www.bioinformatics.babraham.ac.uk/projects/fastqc/). Then, to align those FASTQ files to the Human UCSC hg19 Assembly genome, we used the TopHat2 algorithm.9 Next, to quantify the aligned reads against the reference human genome hg19 transcript annotations, we used Cufflinks and Cuffdiff, according to the standard Fragments Per Kilobase of transcript per Million mapped reads (FPKM) methods.10

Data Analyses

For statistical analysis and clustering, we used the Partek Genomics Suite software package (Partek, St Louis, MO, USA). We performed a series of two-group t-tests to compare each sarcoma subtype with at least two unique patient samples versus all remaining samples (sarcoma and normal tissue). Before comparison analysis and clustering, we filtered extremely low and invariant genes out of the data sets. Significance cutoff filters, unless otherwise specified, were set at P<0.001 and an expression change of at least threefold. No transcripts in our chondrosarcoma analysis passed both significance filters, so we did not perform additional analysis on that gene set. To provide a more comprehensive reference pool, we included osteosarcoma, leiomyosarcoma, and rhabdomyosarcoma samples, but did not further analyze them, given specific concerns about heterogeneity and small sample size. Previously, our group already reported on a larger study that included gene and miRNA network analysis of osteosarcoma, MPNST, neurofibroma, and rhabdomyosarcoma.11, 12, 13

For functional analysis, upstream regulator analysis, and miRNA targeting analysis, we used Ingenuity Pathway Analysis (IPA) software (Qiagen, Redwood City, CA, USA). IPA uses a right-tailed Fisher’s exact test to calculate a P-value corresponding to the probability that a biologic function not relevant to the input data set is falsely identified as relevant. A Benjamini–Hochberg false discovery rate of 0.05 was used to correct such P-values. Network diagrams were generated by combining significant results derived from functional analysis, upstream regulator analysis, and miRNA targeting analysis for each sarcoma subtype.

RESULTS

Novel Overexpressed Gene Candidates

The findings of our mRNA expression analysis across the 35 tissue samples after unsupervised hierarchical clustering are highlighted in the heatmap (Figure 1a). Visual inspection easily identified some unique ‘hot spots’ (Figure 1b). To identify computationally all mRNAs that were significantly upregulated in each subtype (in comparison with the rest of the samples), we used a two-group t-test. Significantly upregulated genes and their deduced upstream regulators included both known players, and, interestingly, one or more novel candidates that have not been previously reported to be associated with sarcoma. The highly overexpressed genes in each sarcoma subtype, per their expression ratios, were as follows:

Figure 1
figure 1

(a) Heatmap shows mRNA expression across the 35 tissue samples after unsupervised hierarchical clustering of Fragments Per Kilobase of transcript per Million mapped reads (FPKM) data. Red bar indicates region expanded in (b) showing representative clusters of differentially overexpressed genes in schwannoma and angiosarcoma samples.

Angiosarcoma

We found that 173 genes were significantly (by at least threefold, P<0.001) upregulated in our angiosarcoma samples, as compared with our non-angiosarcoma samples (Table 1 and Supplementary Table 1). The two different angiosarcoma patient samples showed distinct patterns of gene expression—but with some common elements, most notably FLT4. In addition to genes previously associated with cancer formation and progression, such as CTLA4, CCL1, MADCAM1, and NTS, we identified significantly upregulated genes not previously associated with angiosarcoma such as HTN3, GPR182, and SCGB3A1 (HIN1). The most significantly upregulated transcript was for D21S2088E, a newly annotated long noncoding RNA (lncRNA) initially described by Reymond et al.14 as a transcript with no obvious open reading frame specifically expressed only in the testes, uterus, fetal brain, and spinal cord.

Table 1 Top 10 uniquely upregulated genes in our sarcoma samples (genes in bold have not been previously associated with sarcoma; full gene lists and mean expression values are available in Supplementary Tables 1–6)

The most significantly associated biologic functions for this gene set were typical hallmarks of cancer processes, including increases in cell migration, angiogenesis, cell invasion, and inflammatory response. The genes in this set shared multiple common upstream transcriptional regulators, most significantly TNF, VEGFA, and NKX2-3.

Epithelioid sarcoma

Our analysis identified 343 genes significantly (by at least threefold, P<0.001) upregulated in the epithelioid sarcoma samples versus the non-epithelioid sarcoma samples (Table 1 and Supplementary Table 2). All top 10 differentially expressed genes, including cystatin D (CST5), ARHGAP36, Cadherin 10 (CDH10), TMEM215, and IGSF1, have not been previously associated with epithelioid sarcoma but many have been previously associated with other cancers.

Highly associated functions for this set of upregulated genes include increased cell migration and movement, increased cell proliferation/decreased cell death, and increased angiogenesis, all hallmarks of cancer. Common upstream regulators for this set of genes include NFE2L2, TGFB1, KMT2D, and PRL. PRL itself was significantly upregulated (by about 50-fold) in the epithelioid sarcoma samples.

Ewing’s sarcoma

This comparison identified 77 genes significantly (≥3-fold, P-value <0.001) upregulated in the Ewing’s sarcoma samples versus the non-Ewing’s sarcoma samples (Table 1 and Supplementary Table 3). The most significantly differentially regulated transcripts included two lncRNAs, PCGEM1 and MAPT-AS1. LIPI and NKX2-2 upregulation have previously been reported to be highly specific markers for Ewing’s sarcoma, whereas the remaining transcripts have not been previously reported to be associated with any sarcoma subtype.

Significantly associated biologic functions include increases in cell proliferation, cell stimulation, mobilization of Ca2+, and, interestingly, skin development. The most significant common upstream regulators were KLF4, BDNF, and SOX2. SOX2 was upregulated (by about 20-fold) in Ewing’s sarcoma compared with the non-Ewing’s samples.

Fibromatosis

Our comparative analysis identified 82 genes significantly (by at least threefold, P<0.001) upregulated in the fibromatosis samples (Table 1 and Supplementary Table 4). Of the top 10 upregulated genes, three of them—ADAM12, POSTN, and PLA2G2A—have been previously associated with a different sarcoma subtype. SERPINB11 was the most highly differentially upregulated gene by a very large margin in this particular comparison analysis.

The most significantly enriched set of genes in this set are those involved in cellular adhesion and attachment, specifically metallopeptidases and their regulators. Highly associated biologic functions include regulation of fibroblast attachment, connective tissue disorders, tumor cell migration, and invasion. The most significant common upstream regulators were the TGFB1, TGFB3, and IL1B regulatory network, along with TP53, HTT, and TNF.

Schwannoma

In this comparison, we found a total of 357 genes significantly upregulated (by at least threefold, P<0.001) in the schwannoma samples (Table 1 and Supplementary Table 5). Of the top five upregulated genes, three of them—GFAP, GAP43, and SOX10—have previously been experimentally confirmed to be overexpressed in schwannomas. Novel genes included GJB6, ANGPTL7, and GRIA4.

We found strong activating associations with many different immune and inflammatory response processes, such as leukocyte activation. Interestingly, one of the top associated disease processes was rheumatic disease, including strong activating associations with both arthropathy and arthritis. Typical cancer hallmark processes such as cell movement, recruitment, and proliferation were also highly associated with this gene set. The top common upstream regulators include TNF, TGFB1, IFNG, IL1B, EGF, and IL6. Both IL1B and IL6 were significantly overexpressed (by about 20-fold) in our schwannoma samples.

Synovial sarcoma

Our analysis identified 38 genes significantly (by at least threefold, P<0.001) upregulated in the synovial sarcoma samples versus the non-synovial sarcoma samples (Table 1 and Supplementary Table 6). The most significantly differentially upregulated gene was one that is still only annotated as an open reading frame, C5orf58, but which is predicted by RefSeq to be protein coding. Of the 38 significantly upregulated genes, five were homeobox transcription regulators (HOXC4, HOXC9, HOXC10, HOXC11, and HOXC12). HOTAIR, the HOX transcript antisense RNA, was also significantly upregulated (by about 13-fold).

The most significantly associated biologic functions included neuronal, synapse, and epithelial cell development. The HOXC genes are associated with numerous skeletal system developmental processes. FGF11, FGF19, and FZD10 are all members of the epithelial–mesenchymal transition canonical pathway. Common transcriptional regulators included DLX1, DLX2, SOX2, and FGF8. DLX1 was significantly overexpressed (by about 10-fold) in the synovial sarcoma samples.

Upstream Transcription Factor Analysis

In addition to looking at the significant upstream regulators in each individual sarcoma data set, we also examined whether there were common upstream transcription factors that could be relevant across multiple sarcoma subtypes. In this analysis, the 10 most upregulated genes from each sarcoma were grouped and used as a superset for the upstream analysis. The top predicted transcription factors were well-known regulators of cellular growth and proliferation such as TNF, IFNG, TGFB1, and IL1B (Table 2).

Table 2 Predicted common upstream regulators

Integrated miRNA Targeting Analysis

As signaling networks are regulated not only by genes but also by miRNAs, we performed an integrated miRNA targeting analysis using miRNA expression data from SMED. In general, miRNAs act by downregulating their target genes, thus one would expect that when the target genes are upregulated the targeting miRNAs are downregulated. Therefore, we looked for miRNAs predicted to target the upregulated genes that were significantly downregulated (by at least twofold, P<0.05) in the sarcoma data set from S-MED (Table 3).

Table 3 Predicted miRNA regulators of upregulated genes (only miRNAs predicted to target three or more significantly upregulated genes are listed; for angiosarcoma, only the top 10 miRNAs are listed)

DISCUSSION

Sarcomas are a diverse collection of rare cancers with limited resources for research, both financially and with respect to availability of human patient tissues. In addition, many sarcoma subtypes themselves are highly heterogeneous and may have both benign and malignant stages. These obstacles provide a significant challenge in efforts to identify diagnostic biomarkers and therapeutic targets for these poorly understood tumors.

Nonetheless, our study demonstrates that, even with a small set of diverse samples, it is possible to perform a useful comparative analysis. We identified novel, uniquely expressed RNA molecules. Our findings should inform future preclinical studies, which may eventually lead to novel therapeutic targets and treatments. Definitive validation will require larger association studies and functional experiments. For each of the upregulated gene sets that we focused on, we looked for evidence that they were being upregulated either due to an increase in transcription factor activity or a downregulation of a common regulatory miRNA.

In each analyzed sarcoma subtype, we were able to identify both known sarcoma players and novel candidates—beyond known translocation events—for potential therapeutic targets or diagnostic biomarkers. To better understand the transcription regulation occurring in each sarcoma subtype, we looked for common transcriptional regulators that were predicted to be activated for each sarcoma data set. Each set of upregulated genes shared a set of common upstream regulators, some of which were significantly associated with multiple sarcoma subtypes. Activation of transcription factors can occur by multiple mechanisms. We observed, in several instances, both the identification of a common transcription factor activity and significant upregulation of the transcription factor itself. Integration of upstream regulators, both protein transcription factors and predicted targeting miRNAs, may provide a window into critical oncogenic networks that could be targeted therapeutically (Figure 2).

Figure 2
figure 2

Interaction networks involving significantly upregulated genes in our (a) angiosarcoma, (b) epithelioid sarcoma, (c) Ewing’s sarcoma, (d) fibromatosis, (e) schwannoma, and (f) synovial sarcoma datasets. Red nodes are upregulated genes. Blue nodes are significantly enriched upstream activating transcriptional regulators in each of our sarcoma datasets; bold indicates transcription factors found to be significant in multiple sarcoma datasets. Red and blue striped nodes are identified upstream transcriptional regulators that were also significantly upregulated in the sarcoma dataset. Green nodes are miRNAs predicted to target key upregulated genes and/or transcriptional regulators that were significantly downregulated in sarcoma; bold indicates miRNAs predicted to target multiple upregulated genes in the sarcoma gene set. Gray nodes are compounds/agonists that effect the upregulated genes in this network. Orange nodes are key biologic and disease processes associated with the total set of upregulated genes for each sarcoma subtype.

Known players identified in our analysis included previously validated biomarkers for specific sarcomas. For example, the NKX2-2 gene is a known target of the Ewing’s-specific fusion protein EWS-FLI-1 and an established immunohistologic marker for Ewing’s sarcoma cells.15 NKX2-2 was significantly upregulated in our Ewing’s sarcoma samples, along with LIPI, which was recently identified as a highly specific antigen abnormally expressed in Ewing’s sarcomas (and is therefore a strong candidate for immunotherapy: it could be used for targeting therapies specifically to sarcoma cells).16 FLT4, a marker for lymphatic vessels, was previously reported to be upregulated in sarcoma, including angiosarcoma, as we also found in our study.17, 18 Previous studies have reported amplification of FLT4 and MYC in radiation-induced secondary angiosarcomas, but not in other radiation-associated sarcomas. This study also found that FLT4 expression was 5.3-fold higher in secondary radiation-induced angiosarcomas compared with primary angiosarcomas.17, 18 Our angiosarcoma samples were derived from spontaneous sarcomas (not radiation induced) but still showed significantly higher FLT4 expression compared with the other samples. It should also be noted that these tumor samples may have included additional cells such as activated tumor-invading lymphocytes, which could potentially account for some of the observed CTLA4 expression.19

In our study, we additionally identified other previously established markers such as GFAP, GAP43, S100B, FABP7, and the SOX10 transcription factor. In schwannoma, GFAP is a known marker for nerve sheath differentiation and has been established as a biomarker for certain nerve sheath tumors.20, 21, 22 GAP43, S100B, FABP7, and the SOX10 transcription factors are all involved in neuronal development and were previously reported to be positively expressed in nerve-derived tumors such as schwannomas.23, 24, 25, 26, 27 Our reidentification of such known biomarkers adds to our confidence in our study’s novel differential expression results.

Many of the top upregulated genes were previously reported to be associated with various cancers, but ours is the first report associating them with sarcomas. CCL1, the natural cognate ligand of CCR8 that we identified in our angiosarcoma comparison, is known to be involved in cancer-related inflammation, malignant progression, and control of tumor cell entry into the lymph node.28 Just recently, ANGPTL7 was reported to be a hypoxia-driven proangiogenetic factor upregulated in cancer cells that can induce malignant behavior in human prostate cancer cells: it could be an interesting candidate to profile across many schwannomas, to determine if its expression levels correlate with malignant progression.29, 30 Many of the upregulated genes that we identified in our study have been functionally characterized as part of the inflammatory response, a hallmark enabling characteristic that can support multiple cancer capabilities.31 MADCAM1, another highly upregulated gene in our angiosarcoma samples, is associated with the onset and progression of multiple gastric inflammatory diseases and is also upregulated in melanoma cells.32, 33

Recent epigenetic studies have shown that just as expression patterns can be highly instructive, so, too, can methylation patterns. Although not examined in our study, several of the top results in our comparisons have been previously reported to be abnormally methylated in a variety of cancers. In particular, SCGBA31 (HIN-1) hypermethylation has been reported in a diverse set of cancers and has been shown to activate AKT signaling.34 GRIA4, which we found significantly upregulated in our schwannoma samples, is a target of the polycomb repressive complex 2 and is hypermethylated in follicular lymphoma.35

The top mRNA in our schwannoma comparison, GJB6 (connexin 30), whose primary pathologic association is with hearing loss, has not been previously reported in schwannoma tumors; however, some clinical reports have indicated a connection between hearing loss and schwannoma.36 GJB6 has been reported to be dysregulated in multiple other cancers but with differing directions of change that could improve its usefulness as a diagnostic marker.37, 38, 39 Interestingly, of the 10 most differentially upregulated genes in fibrosarcoma, three of them—ADAM12, POSTN, and PLA2G2A—have previously been identified in osteosarcoma, which is notoriously heterogeneous.40, 41, 42 The ZIC transcription factor family, two members of which were significantly upregulated in our synovial sarcoma samples, is normally involved in brain and neuronal development and is overexpressed in liposarcoma and meningioma relative to normal tissue.43, 44

Some of the top genes in our analysis, such as cystatin D (CST5) and ARHGAP36 in epithelioid sarcoma, have been previously reported to be dysregulated in cancer cells.45, 46 Yet, very little information has been published on the expression pattern or biologic function of others that we identified. TMEM215 is a transmembrane protein and therefore likely to be involved in cell signaling. SPATA22 was only recently identified as being required for meiosis in mammals and might have a proproliferative role in synovial sarcoma.47 Very little has been reported on the orphan G-protein-coupled receptor GPR182, other than that it is consistently upregulated in persistently propagated tumor-specific endothelial cells, and now in angiosarcoma as well.48 To date, no published works have described the function of FOXL2NB, FAM163B, or several of the other transcripts that still carry just the open reading frame designation, but are predicted to be protein coding. In our synovial sarcoma comparison, SERPINB11 was the most highly differentially upregulated transcript by more than an order of magnitude, primarily because of its near-absent expression in our non-synovial sarcoma samples. Of note, SERPINB11is classified as a polymorphic gene/pseudogene coding for a noninhibitory serine protease inhibitor and could be functioning as a serine protease inhibitor agonist.49 HTN3, a member of the antimicrobial histatin family, and ZNF469, traditionally associated with brittle cornea syndrome and keratoconus, have established functions or pathologic associations not immediately connected to cancer, inflammation, or cell growth.50 These apparently non-cancer-related genes could still be important genetic markers if they are signaling a translocation event or activation of a related signaling pathway, or they might have a yet-unrecognized role in sarcoma development or progression.

Recently, researchers have begun to explore the biologic significance and differential expression of noncoding RNAs beyond miRNAs in cancer development and progression.51 In our study, a number of the top hits across multiple sarcoma subtypes were lncRNAs, including D21S2088E in angiosarcoma and PCGEM1 in Ewing’s sarcoma, both of which were the most significantly upregulated transcripts relative to all of our other samples. PCGEM1 is normally expressed only in prostate tissue; significant upregulation of PCGEM1 is an established biomarker for prostate cancer.52 This potential overlap between Ewing’s sarcoma and prostate cancer renews the speculation that steroid receptors, including the androgen and progesterone receptors, may have a significant role in the development and progression of malignant skeletal tumors.53 The protein FUsed/Translocated in LipoSarcoma (FUS/TLS) binds to the androgen receptor and mediates androgen-dependent cell-cycle progression and prostate cancer growth.54, 55 A loss of FUS/TLS results in decreased cellular proliferation, impaired RNA processing, and increased expression of the closely related Ewing’s sarcoma protein EWS.56 Indeed, the transactivation domains of the TET family of RNA-binding proteins (such as FUS and EWS) may be interchangeable and FUS/ERG fusions have been reported in Ewing’s sarcoma patients.57

We also observed two long antisense RNAs of unknown biologic significance, MAPT-AS1 in Ewing’s sarcoma and DLX6-AS1 in synovial sarcoma. Expression of the MAPT-AS1 cis-antisense RNA correlates with expression of the MAPT gene and the progesterone receptor (PGR and TMEM26) and has been proposed as a predictive and prognostic marker for breast cancer.58 All of those lncRNAs, as well as other transcripts of unknown biologic function, could prove to be diagnostically useful biomarker panel members, even if their functional roles are not fully understood.

Transcripts that are significantly overexpressed in a particular sarcoma are not only useful as diagnostic biomarkers but also could indicate which chemotherapy regimen would be most efficacious for a particular patient. For example, our angiosarcoma samples showed significant overexpression of FLT4 and CTLA4, both of which are established therapeutic targets in other contexts. Sorafenib, efficacious against desmoid tumors, could be therapeutically beneficial in other FLT4-expressing sarcomas.59 CLTA4, a critical immunoregulatory gene, helps maintain tolerance to self-antigens by elevating the T-cell activation threshold.60 An anti-CTLA4 antibody approved to treat melanoma, ipilimumab, functions in part by releasing this immunity checkpoint inhibitor, allowing the patient’s cytotoxic T lymphocytes to mount a more robust antitumor immune response—a response that could also be used in sarcoma therapy.61

Moreover, three of the other sarcoma subtypes in our study also contained a known drug target in their top 10 differentially upregulated gene lists: PLA2G2A, upregulated in our fibromatosis samples, is a member of the prostaglandin family and can be downregulated by indomethacin, a nonsteroidal anti-inflammatory drug that targets COX1 and COX2, which are involved in prostaglandin synthesis.62 GRIA4, upregulated in our schwannoma samples, is one of the AMPA (α-amino-3-hydroxy-5-methyl-4-isoxazole propionate)-sensitive glutamate receptors; it could be regulated by AMPA agonists such as the drug tezampanel.63 HOXC11, upregulated in our synovial sarcoma samples, is a marker of poor prognosis in patients with renal clear-cell carcinoma.64 Treatment with the Src/Abl inhibitor dasatinib prevents recruitment of the HOXC11 transcription factor to S100β, an established biomarker of cutaneous melanoma progression.65 Patients with sarcomas with elevated levels of any of those genes could derive some additional benefit by adding a targeted therapeutic medication to their current treatment regimen.

Misdiagnosis of sarcoma subtype is still a significant barrier to effective treatment. Identifying and validating biomarkers for differential diagnosis will help ensure that patients receive the most effective treatments in a timely manner. Distinguishing between benign and malignant tumors will also be highly useful in determining the optimal treatment. As with all therapeutics, the potential benefits of any targeted drug regimen will need to be balanced against potential adverse events. Biomarkers that could predict sensitivity/resistance to various chemotherapy drugs would also be very valuable, particularly if they were broadly applicable to many tumor types.

We acknowledge that, given the heterogeneous nature of sarcomas (even within subtypes), the novel candidate genes we identified might not be relatively overexpressed across all samples. The number of samples we subjected to RNA-Seq was too small to identify a comprehensive biomarker signature for each sarcoma subtype, yet in our comparison analysis, we were able to identify both known sarcoma-specific markers and novel candidates. By integrating our RNA-Seq results with our previous miRNA profiling work covering those samples, we were able to identify deregulated miRNA–gene networks in each sarcoma subtype and the key predicted regulatory miRNAs. The overexpressed genes predicted to be targeted by those miRNAs represent multiple signaling pathways, supporting the notion that a single miRNA can regulate multiple pathways.

Such critical regulatory miRNAs should be further investigated to gain deeper insights into sarcoma pathobiology, which could be relevant in other cancers as well—particularly as many of the upregulated genes identified in our study have been strongly associated with higher-frequency cancers (such as breast, lung, and prostate cancer). In conclusion, our analysis is a proof-of-concept study demonstrating that high-throughput sequencing approaches and bioinformatic analysis of limited representative sarcoma samples can successfully identify novel candidates for biomarker and therapeutic targets.