Abstract
Background
Circulating nucleic acids in blood plasma form an attractive, minimally invasive resource to study human health and disease. In this study, we aimed to identify cell-free RNA alterations that can distinguish cancer patients from cancer-free individuals.
Methods
We first performed mRNA capture sequencing on 266 blood plasma samples from cancer patients and controls, including a discovery set of 208 donors across 25 cancer types and a replication set of 58 donors across three cancer types. We first conducted group-level comparisons and then compared individual patient profiles to a reference control population in a one-versus-many approach. This approach was further evaluated in independent cohorts: a prostate cancer plasma cohort (n = 180), a non-malignant disease plasma cohort (n = 125), a lymphoma plasma cohort (n = 65), and a bladder cancer urine cohort (n = 24), each including both patients and controls.
Results
Here we show that cancer patients exhibit both cancer type-specific and general cell-free RNA alterations. However, differentially abundant RNAs vary widely among patients and across cohorts, hampering robust biomarker identification. By comparing individual patient profiles to control populations, we identify so-called biomarker tail genes, which strongly deviate from controls. The number of these genes per sample distinguishes cancer patients from control samples. Independent cohorts also confirm the potential of this approach.
Conclusions
Our findings demonstrate substantial heterogeneity in cell-free RNA alterations among cancer patients and propose that patient-specific changes can be exploited for classification.
Plain language summary
RNAs that circulate in biofluids can provide information about a person’s health. In this study, we examined such extracellular RNAs in blood plasma from people with and without cancer to see if specific patterns could help identify cancer. We analyzed over 600 samples from multiple cancer types and compared each patient’s RNA patterns to those of healthy individuals. We found that each patient shows unique changes, and that the number of strongly altered RNAs can distinguish cancer patients from healthy people. These findings suggest that studying individual RNA patterns could improve cancer detection and support personalized care.
Similar content being viewed by others
Data availability
Raw RNA-sequencing data are available under controlled access in the European Genome-phenome Archive (EGA) due to the presence of potentially identifiable genetic information and in accordance with participant informed consent and the European General Data Protection Regulation (GDPR). The pan-cancer and three-cancer cohort FASTQs can be found under study EGAS00001006755 (dataset EGAD00001009713). The prostate cancer cohort FASTQs can be found under study EGAS50000001265 (datasets EGAD50000001805). The non-malignant cohort FASTQs can be found under study and EGAS50000001264 (dataset EGAD50000001806). The lymphoma cohort FASTQs can be found under study EGAS00001007127 (dataset EGAD00001010259). The bladder cancer cohort FASTQs can be found under study EGAS00001003917 (dataset EGAD00001005439). Access to these controlled datasets is granted by the relevant Data Access Committee (DAC) following submission and approval of a data access request through the EGA portal. Researchers must provide a research proposal and agree to the terms of a Data Access Agreement restricting use to approved research purposes and prohibiting re-identification attempts. Requests are typically reviewed within 4–8 weeks. Differential abundance and gene set enrichment analysis results are available (open access) in a Zenodo repository (https://doi.org/10.5281/zenodo.7953707)40. An overview of tail genes per type and 50% consensus biomarker tail genes can be found in Supplementary Data 6. Source data for Figs. 1–7 can be found in Supplementary Data 7.
Code availability
Software versions used for data analysis were: bedtools v2.30.0; cutadapt v1.18; FASTQC v0.11.9; FusionCatcher v1.30; HTSeq v0.11.0; samtools v1.16.1; STAR v2.7.3; UMI-tools v1.0.0; and R v4.2.1. The following R packages were used for downstream analyses: biomarRt v2.62.0; caret v6.0.94; data.table v1.16.4; DESeq2 v1.36.0; EnhancedVolcano v1.24.0; eulerr v6.1.1; fgsea v1.22.0; GeneOverlap v1.32.0; ggpubr v0.6.0; gridExtra v2.3; here v1.0.1; pheatmap v1.0.12; pROC v1.18.0; rstatix v0.7.2; tidyverse v1.3.2. Custom R scripts used for differential abundance analysis, tail gene identification, and downstream statistical analyses are available at https://github.com/OncoRNALab/tailgenes and have been archived on Zenodo (v1.0.0) with https://doi.org/10.5281/zenodo.1866919941. The archived version corresponds to the code used for this study.
References
Pessoa, L. S., Heringer, M. & Ferrer, V. P. ctDNA as a cancer biomarker: a broad overview. Crit. Rev. Oncol. Hematol. 155, 103109 (2020).
Vorperian, S. K., Moufarrej, M. N. & Quake, S. R. Cell types of origin of the cell-free transcriptome. Nat. Biotechnol. 40, 855–861 (2022).
Moufarrej, M. N. et al. Early prediction of preeclampsia in pregnancy with cell-free RNA. Nature 602, 689–694 (2022).
Larson, M. H. et al. A comprehensive characterization of the cell-free transcriptome reveals tissue- and subtype-specific biomarkers for cancer detection. Nat. Commun. 12, 2357 (2021).
Chen, S. et al. Cancer type classification using plasma cell-free RNAs derived from human and microbes. eLife 11, e75181 (2022).
Roskams-Hieter, B. et al. Plasma cell-free RNA profiling distinguishes cancers from pre-malignant conditions in solid and hematologic malignancies. npj Precis. Oncol. 6, 1–11 (2022).
Hulstaert, E. et al. Charting extracellular transcriptomes in the human biofluid RNA atlas. Cell Rep. 33, 108552 (2020).
Hardwick, S. A. et al. Spliced synthetic genes as internal controls in RNA sequencing experiments. Nat. Methods 13, 792–798 (2016).
Hulstaert, E. et al. Messenger RNA capture sequencing of extracellular RNA from human biofluids using a comprehensive set of spike-in controls. STAR Protoc. 2, 100475 (2021).
Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).
Anders, S., Pyl, P. T. & Huber, W. HTSeq—a Python framework to work with high-throughput sequencing data. Bioinformatics 31, 166–169 (2015).
Tol, P. Colour schemes, Technical Note SRON/EPS/TN/09-002. (2012).
Mestdagh, P. et al. Evaluation of quantitative miRNA expression platforms in the microRNA quality control (miRQC) study. Nat. Methods 11, 809–815 (2014).
Nicorici, D. et al. FusionCatcher—a tool for finding somatic fusion genes in paired-end RNA-sequencing data. Preprint at https://doi.org/10.1101/011650 (2014).
Kim, P. & Zhou, X. FusionGDB: fusion gene annotation DataBase. Nucleic Acids Res. 47, D994–D1004 (2019).
Gao, Q. et al. Driver fusions and their implications in the development and treatment of human cancers. Cell Rep. 23, 227–238.e3 (2018).
Korotkevich, G. et al. Fast gene set enrichment analysis. Preprint at https://doi.org/10.1101/060012 (2021).
Liberzon, A. et al. The Molecular Signatures Database (MSigDB) hallmark gene set collection. Cell Syst. 1, 417–425 (2015).
Kanehisa, M. & Goto, S. KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Res. 28, 27–30 (2000).
Subramanian, A. et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl. Acad. Sci. USA 102, 15545–15550 (2005).
Huang, D. W., Sherman, B. T. & Lempicki, R. A. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat. Protoc. 4, 44–57 (2009).
Sherman, B. T. et al. DAVID: a web server for functional enrichment analysis and functional annotation of gene lists (2021 update). Nucleic Acids Res. 50, W216–W221 (2022).
Lorenzi, L. et al. The RNA Atlas expands the catalog of human non-coding RNAs. Nat. Biotechnol. 39, 1453–1465 (2021).
Everaert, C., Volders, P.-J., Morlion, A., Thas, O. & Mestdagh, P. SPECS: a non-parametric method to identify tissue-specific molecular features for unbalanced sample groups. BMC Bioinforma. 21, 58 (2020).
Lonsdale, J. et al. The Genotype-Tissue Expression (GTEx) project. Nat. Genet. 45, 580–585 (2013).
Uhlén, M. et al. Tissue-based map of the human proteome. Science 347, 1260419 (2015).
Robin, X. et al. pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinforma. 12, 77 (2011).
Bryzgunova, O. E., Konoshenko, M. Y. & Laktionov, P. P. Concentration of cell-free DNA in different tumor types. Expert Rev. Mol. Diagn. 21, 63–75 (2021).
Liquori, A. et al. Acute promyelocytic leukemia: a constellation of molecular events around a single PML-RARA fusion gene. Cancers 12, 624 (2020).
Khwaja, A. et al. Acute myeloid leukaemia. Nat. Rev. Dis. Prim. 2, 1–22 (2016).
Giannakeas, V. et al. Analysis of platelet count and new cancer diagnosis over a 10-year period. JAMA Netw. Open 5, e2141633 (2022).
Haemmerle, M., Stone, R. L., Menter, D. G., Afshar-Kharghan, V. & Sood, A. K. The platelet lifeline to cancer: challenges and opportunities. Cancer Cell 33, 965–983 (2018).
Sun, K. et al. Plasma DNA tissue mapping by genome-wide methylation sequencing for noninvasive prenatal, cancer, and transplantation assessments. Proc. Natl. Acad. Sci. USA 112, E5503–E5512 (2015).
Hulstaert, E. et al. Exploring the extracellular transcriptome in seminal plasma for non-invasive prostate cancer diagnosis. Preprint at https://doi.org/10.1101/2021.05.11.21256306 (2021).
Allen, B. M. et al. Systemic dysfunction and plasticity of the immune macroenvironment in cancer models. Nat. Med. 26, 1125–1134 (2020).
Hiam-Galvez, K. J., Allen, B. M. & Spitzer, M. H. Systemic immunity in cancer. Nat. Rev. Cancer 21, 345–359 (2021).
Xiong, S., Dong, L. & Cheng, L. Neutrophils in cancer carcinogenesis and metastasis. J. Hematol. Oncol. 14, 173 (2021).
Lugano, R., Ramachandran, M. & Dimberg, A. Tumor angiogenesis: causes, consequences, challenges and opportunities. Cell. Mol. Life Sci. 77, 1745–1770 (2020).
The exRNAQC Consortium et al. Blood collection tube and RNA purification method recommendations for extracellular RNA transcriptome profiling. Nat. Commun. 16, 4513 (2025).
Morlion, A. Differential abundance and gene set enrichment in plasma of cancer patients versus controls. Zenodo https://doi.org/10.5281/zenodo.12721611 (2024).
Morlion, A. OncoRNALab/tailgenes: code for ‘Patient-specific alterations in blood plasma cfRNA profiles enable accurate classification of cancer patients and controls’. Zenodo https://doi.org/10.5281/zenodo.18669199 (2026).
Acknowledgements
This work was supported by Ghent University (BOF; BOF19/DOC/228), Ghent University Industrial Research Fund (IOF; F2023/IOF-ConcepTT/068), Ghent University Hospital, Fund for Scientific Research Flanders (FWO; 11C1623N predoctoral fellowship to A.M., 11H7523N predoctoral fellowship to P.D., G0B2820N), the Belgian Foundation Against Cancer (STK; 2025-085 postdoctoral fellowship to A.M.), and Kom Op Tegen Kanker (Stand up to Cancer, the Flemish cancer society). The computational resources (Stevin Supercomputer Infrastructure) and services used in this work were provided by the VSC (Flemish Supercomputer Center), funded by Ghent University, FWO, and the Flemish Government—department EWI. We would like to thank Tim Mercer for providing the Sequin spikes, Eva Hulstaert and Francisco Avila Cobos for their support during the first phase of the project, Anneleen Decock for her feedback on the initial draft, and Scott Ailliet, Sofie Roelandt, and Deyna Keppens for their assistance with blood collection. We would also like to thank the Biostatistics Unit of the Faculty of Medicine and Health Sciences (Ghent University) for their feedback on the tail gene performance assessment.
Author information
Authors and Affiliations
Contributions
Conceptualization: A.Mo., C.E., J.V., and P.M.; Data curation: A.mo.; Formal analysis: A.Mo. and O.T.; Funding acquisition: A.Mo., J.V., J.V.D., and P.M.; Investigation: E.V., J.N., K.V., and N.J.R.; Methodology: A.Mo., J.V., and P.M.; Project administration: A.Mo., J.V., and P.M.; Resources: A.Ma., A.V., D.H., E.H., F.H., F.O., G.B., H.V., J.L., J.V., J.V.D., K.P., K.S., L.V., N.L., P.D., P.M., S.C., and V.S.; Software: A.Mo. and J.A.; Supervision: J.V. and P.M.; Validation: A.Mo. and O.T.; Visualization: A.mo.; Writing—original draft: A.Mo., J.V., and P.M.; Writing—review & editing: all authors.
Corresponding author
Ethics declarations
Competing interests
A.M., J.V., and P.M. are inventors on the international PCT application “Method to screen, diagnose or monitor treatment using increased numbers of deviating analytes” (WO2024200811; published 03/10/2024) covering the tail gene concept. They are also inventors on the European priority application “Blood plasma RNA profiles enable accurate diagnosis of prostate cancer” (EP26151329.5; filed 12/01/2026) covering specific prostate cancer biomarker tail genes from the prostate cancer cohort in this manuscript. Both applications are assigned to Ghent University.
Peer review
Peer review information
Communications Medicine thanks the anonymous reviewers for their contribution to the peer review of this work. A peer review file is available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Morlion, A., Decruyenaere, P., Schoofs, K. et al. Patient-specific alterations in blood plasma cfRNA profiles enable accurate classification of cancer patients and controls. Commun Med (2026). https://doi.org/10.1038/s43856-026-01507-8
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s43856-026-01507-8


