Background

The International Agency for Research on Cancer (IARC) recently predicted a significant increase in cancer incidence world-wide, expecting over 35 million new diagnoses in 2050Footnote 1,Footnote 2. The anticipated rise in cancer cases places great demands on the healthcare systems and underscores the need for enhanced strategies in cancer management. To meet this challenge, liquid biopsies offer a minimally invasive source for detection of a broad range of cancer biomarkers, including markers for screening, diagnosis, assessment of prognosis, detection of residual disease, early identification of recurrence, and prediction of treatment response [1] (Fig. 1). Tumor material, in the form of circulating tumor DNA (ctDNA), circulating tumor cells (CTCs) and exosomes, are shed by the tumors into various body fluids, including blood, urine, saliva, bile and seminal fluid [2], and show several advantages for biomarker discovery compared to tissue biopsies. While tissue biopsies offer a limited view of only one section of the tumor, liquid biopsies reflect both the entire tumor burden of a patient and the molecular cancer heterogeneity [1, 3]. Liquid biopsies are also typically less invasive, and enables repeated sampling, e.g. for monitoring treatment response and cancer progression [1, 4].

Fig. 1: Applications of liquid biopsy tests in cancer management.
Fig. 1: Applications of liquid biopsy tests in cancer management.
Full size image

Liquid biopsies offer a minimally invasive method with diverse applications in cancer care, including early screening, precise diagnosis, prognostic assessment, real-time disease monitoring, personalized treatment selection, and evaluation of treatment response.

DNA methylation refers to the addition of a methyl group to the 5’ position of cytosine, typically at CpG dinucleotides, resulting in 5-methylcytosine. This epigenetic modification regulates gene expression and chromatin structure without altering the underlying DNA sequence. DNA methylation is essential for processes such as genomic imprinting, X chromosome inactivation, transposon silencing, and ultimately cellular development and differentiation [5]. In cancer, DNA methylation patterns are frequently altered, with tumors typically displaying both genome-wide hypomethylation and hypermethylation of CpG rich gene promoters [6,7,8]. Promoter hypermethylation of key tumor suppressor genes is commonly associated with gene silencing, while global hypomethylation can induce chromosomal instability. Together, these events can disrupt normal growth pathways and drive malignant transformation.

DNA methylation alterations often emerge early in tumorigenesis and remain stable throughout tumor evolution [7, 9, 10], making cancer-specific DNA methylation patterns highly relevant as biomarkers. The inherent stability of the DNA double helix – arising from complementary base pairing and its helical conformation – provides additional protection compared to single-stranded nucleic acid-based biomarkers. However, the rapid clearance of circulating cell-free DNA (cfDNA), with estimated half-lives ranging from minutes up to a few hours [11, 12], represents a challenge for blood-based biomarker analyses. In this setting, DNA methylation biomarkers offer an advantage, as DNA methylation seems to impact cfDNA fragmentation [13]. Nucleosome interactions help to protect methylated DNA from nuclease degradation [14, 15], resulting in a relative enrichment of methylated DNA fragments within the cfDNA pool [16]. Collectively, these features contribute to the high potential of DNA methylation-based biomarkers, including their enhanced resistance to degradation during sample collection, storage, and processing, especially compared to more labile molecules such as RNA [17].

Various methods exist for analysis of DNA methylation and have been comprehensively reviewed [18,19,20]. Whole-genome bisulfite sequencing (WGBS) and reduced representation bisulfite sequencing (RRBS) are widely used for biomarker discovery, providing broad methylome coverage through bisulfite-based chemical conversion. Enzymatic methyl-sequencing (EM-seq), along with emerging third generation sequencing technologies such as nanopore and single-molecule real-time sequencing, offers comprehensive methylation profiling without chemical conversion, thereby better preserving DNA integrity. This is particularly promising for liquid biopsy analyses, where DNA quantity is often limited. Microarrays and enrichment-based techniques, including methylated DNA immunoprecipitation sequencing (MeDIP-seq), support both biomarker discovery and clinical validation by balancing profiling breadth with cost and throughput. However, these methods provide less detailed methylation information compared to whole-genome sequencing methods. Targeted methods, such as quantitative real-time PCR (qPCR) and digital PCR (dPCR), offer highly sensitive, locus-specific analysis, making them particularly suited for clinical validation.

Despite their potential, DNA methylation-based cancer biomarkers have seen limited adoption in routine clinical use. Searching PubMed with the MeSH terms “Biomarkers” and “DNA Methylation” together with the term “cancer” returns 6,191 publications since 1996. Yet, this vast amount of studies is not reflected in the number of DNA methylation biomarkers in clinical use, underscoring the challenges that remain in developing robust, high performance DNA methylation-based biomarkers. Several factors affect the successful clinical implementation of DNA methylation biomarkers (Fig. 2). Low levels of tumor material in liquid biopsies require advanced technologies for biomarker discovery and validation, including optimized discovery workflows and standardized targeted analyses. In addition, the choice of liquid biopsy source, selection of appropriate control groups, in both discovery and validation phases, sufficient independent validation, and large-scale clinical studies to demonstrate clinical utility, are all important factors that impact the chance of succeeding with clinical translation. These aspects are addressed in the following review, mainly focusing on diagnostic DNA methylation biomarkers. Other challenges, such as regulatory and economic barriers, are discussed elsewhere [21].

Fig. 2: Key elements for successful clinical implementation of DNA methylation biomarkers in liquid biopsies.
Fig. 2: Key elements for successful clinical implementation of DNA methylation biomarkers in liquid biopsies.
Full size image

Successful clinical implementation requires careful selection of an appropriate liquid biopsy source, paired with an optimized biomarker discovery approach and standardized targeted analyses. Emphasizing standardization, optimization and quality control across all stages increase the chances of developing robust biomarker tests that can be validated in clinical trials, demonstrating clinical utility and enabling eventual clinical implementation.

Liquid biopsy source

Blood

Blood is one of the most frequently used sources for liquid biopsy analyses [22]. The circulatory system reaches virtually every tissue in the body, allowing blood to serve as a reservoir for cancer-specific material shed from tumors, regardless of their anatomic location. Both ctDNA, CTCs and extracellular vesicles can be released into the blood stream as tumors grow and evolve [23]. Blood is also easily accessible through minimally invasive procedures, making it attractive from a patient perspective [24]. However, detection of circulating cancer biomarkers is challenging due to the high degree of dilution of tumor-derived signals within the total blood volume of an average adult (4–5 L). As a result, these signals are often present at extremely low concentrations, making them inherently challenging to detect. ctDNA is also highly unstable and degrades rapidly [25]. In addition, blood contains a variety of other components, including white blood cells and cfDNA fragments originating from various healthy tissues. This complex background not only dilutes the tumor-derived signal but may also contribute to false positive biomarker signals, thereby challenging the specificity of targeted analyses.

For blood-based analyses of cancer biomarkers, plasma is frequently used. While serum has a higher concentration of total cfDNA (including both normal cfDNA and ctDNA), plasma is usually enriched for ctDNA, and has less contamination of genomic DNA from lysed cells [3]. The stability of ctDNA is also higher in plasma [26]. Interestingly, while plasma ctDNA is mostly used for analyzing cancer-specific molecular features, such as mutations and DNA methylation alterations, the quantitative level of cfDNA in plasma has in numerous studies been proposed as a promising biomarker per se for various aspects of cancer management [27]. In breast cancer, cfDNA concentrations were found to be significantly higher in cancer patients compared to both healthy controls and individuals with benign tumors, indicating a diagnostic potential for cfDNA [28]. However, clinical utility requires more than a statistically significant difference between cancer and control groups. For a biomarker to be clinically useful, there should be minimal overlap in concentration ranges between groups—a criterion that often presents a challenge for using cfDNA concentration as a stand-alone biomarker [29]. Several reports have further explored how cfDNA and ctDNA yields vary across tumor types and how this variability influences the implementation of DNA methylation-based diagnostic biomarkers. The variability in overall cfDNA among patients with the same cancer type is often larger than the variability seen between different cancer types [30, 31]. In contrast, the fraction of ctDNA tends to differ more consistently between cancer types and stages [32]. This is highly relevant for DNA methylation-based diagnostics, since biomarker sensitivity will be limited not by total cfDNA abundance, but by the proportion of ctDNA present in the sample. At low ctDNA fractions, which are commonly seen in early-stage disease and e.g. cancers of the central nervous system [32], the robustness of DNA methylation-based detection will decrease.

Technological advances, including the development of highly sensitive methods, have facilitated the use of blood as a liquid biopsy source. Accordingly, numerous blood-based DNA methylation tests for cancer management have been proposed [33, 34]. Still, only a few tests have achieved FDA-approval (including Epi proColon and Shield for detection of colorectal cancer) or the designation “FDA Breakthrough Device” (e.g., the two multi-cancer tests Grail’s Galleri and OverC MCDBT; Table 1). Thus, the use of blood as a liquid biopsy source demonstrates promising potential but also presents clear challenges—where a focus on quality and optimization at each step could greatly improve the prospects for successful clinical implementation of blood-based DNA methylation biomarkers.

Table 1 Examples of DNA methylation-based liquid biopsy tests for cancer management that are FDA-approved, have received FDA breakthrough device designation, are CE-marked and/or are commercially available (All these websites were accessed on 28.03.2025).

Local liquid biopsy sources

Depending on their anatomical location, various cancer types are likely to shed material into nearby body fluids other than blood, including urine, saliva, cerebrospinal fluid, bile, stool, pleural effusions, peritoneal fluid and seminal fluid [2, 35]. In contrast to the systemic nature of blood, local body fluids often offer distinct advantages, including a higher biomarker concentration and reduced background noise from other tissues (Fig. 3). Thus, for certain cancer types, using a local liquid biopsy source could increase biomarker precision and reliability.

Fig. 3: Choosing the optimal liquid biopsy source - systemic vs local.
Fig. 3: Choosing the optimal liquid biopsy source - systemic vs local.
Full size image

Blood as a systemic liquid biopsy source, circulates throughout the body, collecting tumor materials irrespective of anatomical location. In contrast, local liquid biopsy sources often provide distinct advantages, including higher target concentrations and reduced background signals. For urological cancers, urine serves as an ideal local liquid biopsy source, offering non-invasive collection and potentially higher biomarker sensitivity compared to blood. The illustrated examples demonstrate these comparative advantages. *Reference to example 1 [36] and example 2 [40].

For urological cancers such as bladder-, prostate- and kidney cancer, urine is a natural liquid biopsy source due to its proximity to the urinary tract. Urine samples can be obtained non-invasively, minimizing patient discomfort and supporting better compliance with testing protocols. Urine is particularly effective for bladder cancer management, as most tumors are in direct contact with the urine, resulting in higher concentrations of tumor-derived biomarkers compared with blood. This leads to greater accuracy for urine-based tests (Fig. 3). A relevant example is detection of TERT mutations in matched urine and blood samples from bladder cancer patients, where the sensitivity was 87% in urine versus 7% in plasma [36]. Table 1 highlights the strong representation of urine-based tests for bladder cancer among commercially available and FDA-designated breakthrough devices. In comparison, fewer urine-based DNA methylation biomarker tests exist for prostate and renal cancer [37], likely due to lower amount of CTCs and ctDNA shed into the urine from these cancers [38, 39]. Nevertheless, a significantly higher biomarker sensitivity in urine compared to plasma has also been reported in prostate cancer [40].

For biliary tract cancers including cholangiocarcinoma, bile has emerged as a promising liquid biopsy source [41]. Studies indicate that bile samples often outperform plasma in detecting tumor-related somatic mutations [42], providing more sensitive and specific clinical information. Similar findings have been reported for stool and cerebrospinal fluid, which offer superior performance compared to plasma in detecting cancer-specific DNA methylation biomarkers in early-stage colorectal cancer [43] and brain tumors [44]. While some local liquid biopsy sources, such as urine and saliva, can be collected non-invasively, others - including bile, cerebrospinal fluid, pleural fluid and peritoneal fluid - typically require more invasive procedures. This could in principle limit their utility. However, many of these fluids are routinely obtained during diagnostic or therapeutic procedures, so their use for biomarker testing does not involve additional invasiveness or risk. For instance, bile can be obtained during endoscopic retrograde cholangiopancreatography (ERCP), a procedure frequently used in the diagnostic evaluation of suspected cholangiocarcinoma [45].

Conclusion

Blood plasma is the most widely used liquid biopsy source and shows considerable clinical promise across cancer types and applications. However, for some cancer types, local body fluids may offer advantages such as higher concentration of tumor-derived signals and a reduced background noise, which can significantly improve the accuracy and clinical utility of biomarker tests.

Discovery of DNA methylation biomarkers

The STARD (Standards for Reporting of Diagnostic Accuracy Studies) guidelines were developed to improve the completeness and transparency of diagnostic accuracy study reports [46], including those reporting new cancer biomarkers. These guidelines provide comprehensive recommendations covering essential aspects such as study design, results presentation, and the intended clinical application. An effective discovery strategy is necessary for identifying biomarkers with high diagnostic accuracy. This strategy should involve careful selection of appropriate liquid biopsy sources, optimized pre-analytical procedures, inclusion of relevant control groups and state-of-the-art high-throughput technologies. Additionally, it should incorporate an appropriate bioinformatics workflow to ensure robust data generation, accurate analysis and reliable interpretation, thereby increasing the likelihood of developing clinically relevant biomarkers.

Clinical material

To increase the chances of succeeding with a clinical liquid biopsy test, an appropriate sample type should be selected already in the biomarker discovery phase. To date, tissue samples have been most frequently used for biomarker discovery and have resulted in highly accurate cancer biomarkers that have subsequently been detected in liquid biopsies (Table 2). Incorporating liquid biopsy samples directly into the discovery phase may offer distinct advantages, allowing identification of biomarkers that are inherently detectable in biofluids, potentially reducing the number of steps from discovery to clinical application. An increasing number of methylation-based high-throughput methods are using cfDNA directly as starting material and suggest that biomarkers identified using this approach have robust performance (Table 2). As a proof of concept, Hlady et al. have demonstrated that biomarker candidates identified directly from cfDNA performed better than biomarkers primarily identified from tissue samples [47]. Efforts such as the Circulating Cell-free Genome Atlas (CCGA) study have also successfully contributed to the discovery of biomarker panels for detection and prediction of tissue of origin of several cancer types at high specificity [48,49,50].

Table 2 Examples of biomarker discovery studies using DNA methylation genome-wide approaches.

Pre-analytical steps

The performance and reliability of a biomarker test ultimately depend on the quantity and quality of DNA used in the analysis, both during the discovery phase and in subsequent targeted analyses. Thus, optimizing pre-analytical steps is critical - in particular for liquid biopsy material, where target DNA is often present in low quantities, prone to degradation and surrounded by a complex background of other cells and DNA fragments [23]. Indeed, cfDNA degradation and contamination with normal leukocyte DNA are major pre-analytical variables that can compromise the analytical performance of a blood-based test [3]. Factors such as collection tube type, processing time, storage temperature, centrifugation protocol and DNA purification method all significantly influence the quality and quantity of cfDNA available for downstream analyses [3]. For urine samples, time to processing and storage temperature are especially important. While sedimentation protocols have traditionally been the gold standard for urine processing, alternative solutions – such as stabilizing buffers [51] and urine filtration systems [52] – have been developed to simplify sample handling and preservation during transportation and storage. For bile, phenol-chloroform extraction has been shown to yield higher levels of amplifiable DNA compared to other isolation kits [41]. Efforts to standardize pre-analytical variables are ongoing, with several consortia - including the CANCER-ID project [53], the European Liquid Biopsy Society (ELBS; www.elbs.eu), and BLOODPAC (https://www.bloodpac.org/) [54, 55] - making important contributions to improve the clinical translation of liquid biopsy biomarkers.

To detect DNA methylation in downstream analyses, a DNA pre-treatment step is frequently required. Bisulfite conversion is considered the gold standard for this purpose, and various commercial kits are available, each with differences in DNA recovery and conversion efficiency [56]. Studies have also reported variability in DNA yield and recovery when combining different bisulfite conversion- and cfDNA isolation kits for plasma DNA methylation analysis [57]. Despite its status as the gold standard, bisulfite conversion is a harsh treatment, resulting in considerable DNA degradation, which is particularly challenging for liquid biopsy material where DNA is already limited. Recently, enzymatic conversion using TET2 and APOBEC enzymes (EM-seq; New England BioLabs) has been presented as more gentle and promising alternative to bisulfite conversion [58]. This approach yields longer DNA fragments compared to bisulfite conversion, but results in lower average cfDNA recovery (38% vs. 68%) [59]. As a result, bisulfite conversion seems to remain the preferred method for PCR-based analyses of DNA methylation biomarkers in liquid biopsies, while enzymatic conversion may be better suited for sequencing-based approaches that require longer DNA fragments.

Methylomic-based biomarker discovery

Discovery of novel biomarker candidates has benefitted from advances in high-throughput technologies and streamlined bioinformatic workflows. Microarray-based profiling and genome-wide methylation sequencing are widely used for this purpose (Table 2). DNA methylation arrays have an affordable price and streamlined bioinformatic workflows. Differentially methylated CpG sites between tumors and controls have been identified from arrays by numerous studies across malignancies [47, 60,61,62,63,64,65,66,67,68] and may constitute promising biomarker candidates for cancer detection. In particular, the commercially available EarlyTect BCD test for bladder cancer detection resulted from an array-based biomarker discovery [60] (Tables 1 and 2). While microarrays cover a set of selected CpG sites, methylome sequencing offers broader genome coverage [69], which facilitates the design and development of more precise and potentially more accurate biomarker assays for subsequent targeted validation. The broader coverage also allows the identification of sequences with relevant methylation patterns in previously unexplored regions, independent of their genomic location or potential function. A commonly used strategy to identify new DNA methylation biomarkers has focused on functionally relevant locations, such as CpG islands close to transcription start sites, where DNA methylation status will likely affect gene expression. However, recent studies have shown that highly performing DNA methylation biomarkers are not restricted to genomic regions with known biological function [70, 71]. This suggests that expanding the search beyond traditionally explored regions can reveal novel high-performing biomarkers, potentially overlooked by conventional strategies.

Several promising discovery studies have used WGBS to analyze plasma cfDNA [49, 50, 71]. Among these initiatives, the Grail’s Galleri test has emerged as a commercially available tool for early detection of multiple cancers (Tables 1 and 2). This blood-based test uses next-generation sequencing (NGS) data combined with a machine learning classifier to identify DNA methylation patterns in cfDNA, potentially detecting over 50 cancer types before symptoms appear. In parallel, enrichment-based techniques have proven to be cost-effective methods for genome-wide methylation profiling. One example is MeDIP combined with fragment sequencing, which has been successfully used for detecting cfDNA methylation in multiple types of cancers [72]. With RRBS, the amount of sequencing required is lowered compared to WGBS, while it still captures most genomic regions affected by DNA methylation [73, 74]. There is also considerable optimism tied to third generation sequencing technology, such as nanopore sequencing [75], which enables advanced, real-time detection of DNA methylation without the need for bisulfite conversion. This technology is expected to overcome several challenges associated with analyses of cfDNA in liquid biopsies, including issues related to DNA fragment interruption, GC bias, and degradation caused by bisulfite treatment. As a proof-of-principle, Lau and colleagues successfully used nanopore sequencing to monitor longitudinal cfDNA methylation dynamics during treatment of cancer patients [76].

Optimized bioinformatic workflows for analysis of sequencing-based data may increase the likelihood of discovering suitable biomarkers. Differentially methylated regions (DMRs) between tumors and controls are considered promising biomarker candidates [70, 72, 77,78,79,80,81]. Compared to single CpG sites, DMRs increase the robustness of subsequent targeted validation analyses, as demonstrated by e.g. the BladMetrix biomarker panel, detecting bladder cancer in urine with a sensitivity of 92% and a specificity of 93% [70]. Although sequencing-based approaches offer numerous advantages, their high cost remains a barrier for using large sample cohorts in the biomarker discovery phase. This may partly explain why a lower number of liquid biopsy-based DNA methylation biomarker panels are so far developed from sequencing-based discoveries compared to the more cost-effective array-based methods (Table 2). However, continuous decrease in price and improved sequencing technology, combined with enhanced bioinformatics capability, are likely to encourage increased use of sequencing-based approaches for liquid biopsy-based biomarker discovery and their direct application in a clinical setting.

Control groups

Biomarker discovery approaches are commonly designed to detect candidates with DNA methylation levels significantly above a threshold established from control cases, thereby robustly separating cancer cases from controls. This distinction is particularly important in plasma or serum samples, where the signal-to-noise ratio can be high. Consequently, the selection of control groups directly affects the specificity of a biomarker test and should be guided by the intended clinical application of the test. Non-malignant tissue adjacent to cancers or samples from healthy individuals are frequently used as controls in the discovery phase (Table 2). However, samples from individuals presenting with symptoms similar to those of cancer patients may provide more clinically relevant controls. Such controls help ensure that the biomarker test can accurately distinguish cancer from other conditions with overlapping clinical presentations, thereby improving reliability. For example, Hlady et al. identified candidate biomarkers for hepatocellular carcinoma using plasma samples from patients with cirrhosis, a patient group at increased risk for this cancer [47]. Similarly, samples from patients with haematuria - a symptom of bladder cancer but also of other genitourinary conditions - are frequently used as controls in bladder cancer detection [37]. For some cancers, including control samples from patients with tumors in anatomically adjacent organs can be beneficial. This approach enables the test to discriminate not only between cancer and normal tissue (cancer specificity), but also between different types of malignancy (cancer type specificity). For instance, prostate and renal cancer patients serve as useful controls in studies aiming to detect bladder cancer from tumor DNA in urine [70]. Importantly, for cancers preceded by inflammatory lesions that increase the risk of tumor development, it is crucial to include individuals with such inflammation in the control group. This is exemplified by the use of samples from primary sclerosing cholangitis [82] or Barrett’s esophagus [83] for the discovery of biomarkers for cholangiocarcinoma and esophageal adenocarcinoma, respectively. Finally, as DNA methylation generally increases with age [84], age-matched controls are particularly important in DNA methylation studies.

An instructive example of the importance of proper control group selection is provided by the CxBladder test [85]. Although Cxbladder initially showed promise for non-invasive detection of bladder cancer and advanced to clinical implementation, an elevated rate of false positives ultimately lowered its positive predictive value [86], leading to the loss of Medicare reimbursement in the USFootnote 3. Retrospective analyses revealed that validation studies did not include patient groups with non-urothelial malignancies or inflammatory urinary tract conditions [85, 87]. This case illustrates how the absence of important controls can undermine clinical validity and hinder successful translation of otherwise promising biomarker assays.

Zero background biomarkers

The discovery of cancer biomarkers with zero background signals in control samples has emerged as a highly accurate approach for blood-based testing and has been successfully applied in several studies [88,89,90,91]. This strategy could be particularly advantageous for early-stage cancers, where the amount of ctDNA released into the bloodstream is minimal. Feber et al. identified biomarker candidates for bladder cancer showing no or very low methylation signals in normal urothelium, blood and non-cancer urine samples [91], while Draškovič et al. identified candidate DMRs for several cancer types that were unmethylated in cfDNA of healthy individuals [90]. Moreover, the commercially available ColveraTM test for detection of colorectal cancer recurrence also utilizes DNA methylation biomarkers with no background signal in plasma from healthy individuals [92]. A zero-background approach differs from the conventional strategy, which establishes a threshold based on background signals in the control group to dichotomize samples. Although commonly used across studies, the threshold-based approach may risk filtering out true but weak (low-level) cancer-derived signals, potentially leading to false negatives. Adapting discovery workflows to prioritize biomarker candidates with zero or minimal background signals in controls could mitigate interference during validation, thereby potentially enhancing sensitivity and accuracy for cancer detection.

Conclusion

Successful discovery of liquid biopsy DNA methylation biomarkers requires optimized study designs, careful sample type selection, rigorous pre-analytical procedures, and the use of advanced molecular and bioinformatic technologies. Integrating liquid biopsy material in the discovery phase, applying sensitive genome-wide approaches, and selecting clinically informed control groups all contribute to improved diagnostic accuracy and potential utility. Approaches that prioritize biomarkers with zero background signals may also improve the translation of promising candidates into robust clinical tests.

Targeted biomarker analysis

To successfully translate biomarker candidates from large-scale discoveries to suitable tests for clinical routine, targeted methods and assays that are sufficiently sensitive, robust, fast, and cost efficient are required [93].

Methods for targeted analysis

Among existing FDA-approved or commercially available DNA methylation liquid biopsy tests for cancer management, the majority rely on quantitative methylation-specific PCR (qMSP; Table 1). PCR-based methods have long been considered the gold standard for targeted analyses in DNA methylation biomarker research, due to their reliability and precision. Additionally, PCR platforms are widely available and well-established in diagnostic laboratories, making them a practical choice for clinical applications. Moreover, qMSP allows high-throughput analysis of large clinical sample series. However, the detection of extremely rare alterations within a complex background may require a more sensitive method, such as digital PCR (dPCR). The dPCR technology has been reported to have a significantly lower limit of detection (LOD); approximately 0.001% compared to around 0.1% for qPCR [94]. This increased sensitivity is achieved by the division of the sample into thousands of partitions, with a separate PCR reaction running inside each partition [95]. The partitioning significantly reduces competition from background DNA and increases the detection of rare events [95]. Moreover, recent advancements in dPCR technology allow for multiplexing, i.e. analyzing multiple targets simultaneously in one reaction. This capability is particularly valuable for liquid biopsy analysis, where the amount of target DNA often is limited. Although none of the FDA-approved DNA methylation tests currently used for cancer management are dPCR-based, this method shows promising potential for future clinical use [96].

While PCR-based targeted methods are restricted to analyzing a small number of biomarkers, NGS allows for the detection of larger biomarker panels, potentially increasing test accuracy [97]. Some of the most sensitive NGS-based methods, such as whole genome sequencing, have an estimated LOD of ~0.001%. This sensitivity outcompetes qPCR and is comparable to dPCR in detecting rare alterations [94]. NGS is widely used in biomarker discovery, but its application in routine clinical practice has so far been limited by high costs and complex data analysis. However, advancement in sequencing technology and bioinformatics has opened new possibilities for more targeted liquid biopsy NGS tests to receive FDA-approved and/or become commercially available for cancer management (Table 1). For example, the blood-based Shield test (Guardant Health), approved by the FDA in 2024 for early detection and screening of colorectal cancer, uses NGS to detect a multi-marker panel consisting of DNA methylation, mutations and fragmentation patterns (Table 1).

Importantly, the choice of methodology for biomarker discovery may influence the compatibility of downstream targeted analyses. Robust qMSP and dPCR assays typically require high CpG density in the target sequence to adequately differentiate between methylated and unmethylated sites, rendering them well-suited for biomarkers identified via methylome sequencing platforms. In contrast, translating microarray-derived biomarkers to qMSP or dPCR assays can be more challenging, as microarray probes often target regions with lower CpG content. Validation of microarray data could instead be conducted on single CpG sites using e.g. pyrosequencing. However, pyrosequencing offers lower resolution and sensitivity compared to dPCR and is thus less suited for liquid biopsy applications, which typically require high analytical performance.

Standardization of targeted analyses

Regardless of the targeted method, ensuring high quality and standardization at every step will improve the likelihood for a biomarker test to achieve clinical utility. First, the precise selection of CpG sites targeted by a biomarker assay is important, as it can significantly influence the observed DNA methylation frequency [98]. For example, studies of the MAL gene promoter in colorectal cancer have reported widely diverging methylation frequencies, ranging from 6% [99] to 80% [100]. MAL has a variable DNA methylation pattern in its promoter region and the inclusion of different CpG sites in various targeted PCR assays will lead to discrepant results [101]. Such discrepancies in methylation frequencies for the same gene in the same cancer type can reduce the clinical value of a biomarker test [3]. Second, a variety of parameters in the experimental pipeline for PCR-based DNA methylation analyses can impact the consistency of methylation values. Among these, the choice of reference for normalization, or internal control, is particularly critical [102]. Since cancer often is characterized by chromosomal aberrations, using single-copy genes as references for normalization may introduce bias in the end-results. Internal controls targeting multiple loci, like ALU sequences in qPCR [102, 103] or the 4Plex control in dPCR [104], have been shown to increase the consistency of methylation measurements. For dPCR specifically, another step that directly affects final methylation concentrations is partition classification, i.e. classifying droplets into positive and negative clusters. Various partition classification algorithms have been developed and compared [105] and should be carefully selected for each study. Of note, the PoDCall algorithm has been specifically designed for DNA methylation analyses on the dPCR platform, and has been shown to increase the robustness of such analyses [106].

Although various recommendations are found in the literature, there is still no consensus in the scientific community on a standardized pipeline for targeted methylation analyses. Nevertheless, an important step towards improving transparency and quality in PCR-based studies has been the development of the MIQE (The Minimum Information for Publication of Quantitative Real-Time PCR Experiments) guidelines [107,108,109]. These guidelines have been widely adopted, with several scientific journals requiring authors to submit a standardized MIQE table. The implementation of MIQE guidelines has significantly increased awareness of the importance of maintaining quality at every step of the experimental pipeline.

Control groups

Selecting appropriate control groups is crucial when developing a biomarker test for clinical use. As discussed in the biomarker discovery phase, the choice of controls depends on both the cancer type under investigation and the intended clinical application of the test. While the most essential controls are typically included during discovery, additional relevant control groups should also be incorporated in targeted analyses. These may comprise patients with benign or inflammatory conditions, cancer-free individuals at high cancer risk, or patients with other cancer types. Achieving high biomarker specificity often requires inclusion of multiple control groups in the study design, particularly for liquid biopsies, where target biomarkers often coexist with DNA fragments and cells from other tissues. Blood-based cancer biomarkers often perform well when compared to healthy controls but may struggle to distinguish cancer from non-cancerous diseases, highlighting the challenge of identifying reliable biomarkers for cancer detection [110]. For urine-based biomarkers, specificity against other urological cancers is natural. However, evaluating their performance against non-urological malignancies (such as lung, breast and colorectal cancer), is also important, as cfDNA from these cancer types can be present in urine [111].

Conclusion

The choice of targeted method may influence a biomarker test’s potential to achieve clinical utility. Various methods differ in sensitivity and suitability for routine use, each with advantages and limitations. Regardless of method, rigorous quality control, optimization and standardization throughout the pipeline are essential to generate robust and consistent methylation results that can be validated in clinical trials.

Demonstrating analytical validity, clinical validity and clinical utility

The National Academy of Medicine advises that tumor biomarkers should not be employed in general practice until they have demonstrated both analytical validity and clinical utility [112]. Analytical validity is defined as “the ability of an assay to detect and measure, with statistical significance, the presence of a biomarker of interest accurately, reproducibly, and reliably” [3]. Key measures for evaluating analytical validity include sensitivity, specificity, accuracy, and robustness [3], and these should be assessed using samples representative of real clinical settings. The required analytic performance depends on intended clinical use, such as general population screening, high-risk monitoring, prediction of treatment response, or detection of relapse. Higher sensitivity is typically prioritized for biomarkers used in high-risk populations, whereas higher specificity is essential for screening in the general population [113].

Clinical validity refers to the “ability of an assay to divide, with statistical significance, one population into two or more groups on the basis of outcome, such as presence of cancer and treatment response” [3]. While most biomarkers are initially measured as continuous variables, clinical decisions frequently rely on establishing thresholds or cut-off values [112]. The validation of these thresholds requires assessment of parameters such as sensitivity, specificity, negative predictive value (NPV), positive predictive value (PPV), and positive and negative likelihood ratios (LR+, LR−) [114]. Both retrospective and prospective patient sample series may be used for clinical validation, preferably using samples series that are blinded and independent of the discovery series. The sample size needed for robust clinical validation depends on several factors, including the biomarker’s intended use and the required test performance (e.g. accuracy, sensitivity, specificity) [114]. Also, included patients should preferentially have characteristics that mirrors the target population to ensure applicability. Many studies that demonstrate clinical validity of a biomarker, yet fail to establish its clinical utility, have relied on convenience samples collected without specific consideration for the biomarker’s intended use [112].

To establish clinical utility, a biomarker must possess high analytical and clinical validity, and demonstrate improved patient outcomes when used to guide clinical strategy [112]. If a new biomarker fails to improve patient outcomes and to prove value beyond existing tests and biomarkers already in clinical use, it may never reach clinical implementation [115]. The most reliable way to demonstrate clinical utility is through well-powered prospective clinical trials, conducted either as observational studies, where the test does not direct patient management, or as intervention studies, where the test directs patient management [112]. In rare cancers, validation through well-powered clinical trials may be challenging. In such cases, smaller or adaptive trials, potentially using surrogate endpoints may be appropriate [116, 117]. Additionally, clinical utility can be assessed through prospective-retrospective studies using archived specimens collected during previously conducted prospective trials. In such cases, it is recommended to include at least two data sets from independent clinical trials to strengthen the validity of the findings [118]. Irrespective of design, selecting a patient population that accurately reflects those intended for the clinical application of the liquid biopsy assay will likely increases the chance that the observed biomarker accuracy also applies to real-world scenarios.

Summary

Several aspects affect the chance of succeeding with moving biomarkers from bench to bedside, some of which have been discussed in this review. The type of liquid biopsy source could significantly affect the accuracy of a biomarker test. This decision should be guided by the principle that the sensitivity of an accurate biomarker is generally proportional to the quantity of ctDNA copies analyzed: the more copies analyzed, the higher the sensitivity [10]. In many cases, the concentration of ctDNA in nearby body fluids is higher than in blood. Consequently, for certain cancers, using local body fluids as a liquid biopsy source may offer superior performance over blood-based assays. It is also important to consider whether to use tissue samples or liquid biopsies directly in the discovery phase, as this could impact the accuracy of the final test. Biomarkers with high sensitivity in tissue samples do not necessarily show the same accuracy in liquid biopsies. Although successful liquid biopsy biomarkers have been identified from tissue, using liquid biopsies already in the discovery phase may increase the accuracy of the final biomarkers and maximize the chances of developing clinically relevant and impactful biomarkers.

Both sequencing- and array-based approaches have their limitations and advantages. The choice of method in the discovery phase depends on several factors, including available resources, cost and intended downstream analysis or validation methods. Compared to sequencing-based approaches, arrays allow for more samples to be analyzed at a lower cost and have traditionally been the preferred method. However, one should keep in mind that transferring array-based biomarkers to targeted analyses may be cumbersome, while biomarker candidates identified from sequencing-based approaches could be more easily developed into PCR-based assays for validation. Such PCR-based assays can further be quite straightforwardly implemented into clinical use. Additionally, the cost of sequencing has decreased in recent years and is likely to continue doing so in the coming years, facilitating the use of NGS approaches, or even third generation sequencing (e.g. nanopore sequencing), for both discovery and as targeted panels for direct clinical use. Regardless of the chosen method, selecting appropriate clinical cases and controls that accurately represent the target population for the intended liquid biomarker test, is of importance both in the discovery and validation phase. Failing to include relevant controls may lead to difficulties in replicating the original findings, resulting in reduced accuracy when the biomarker is tested or evaluated in the actual study population.

Finally, clinical validation in independent sample series and performing clinical studies to prove clinical utility is essential. These steps ensure that the biomarkers provide actionable information to improve patient management and outcomes. Ultimately, the goal is to develop biomarkers that not only show statistical significance in controlled studies but also have meaningful impact on clinical decision making and patient care. Altogether, this comprehensive approach, from careful method selection and sample preparation to rigorous validation and clinical utility demonstration, is key to successful transitioning of biomarkers from bench to bedside.