Introduction

Cancer remains a significant global public health challenge. The Global Cancer Observatory (GCO) estimates 20 million new cancer cases and 9.7 million deaths globally in 20221. Cancer-related deaths in low- and middle-income countries (LMICs) constitute 70% of the global annual mortality2. Delays and barriers in accessing cancer care are among the contributing factors that often lead to patients presenting with advanced-stage cancer2. The median delay from symptom appearance to therapeutic care for lung cancers is 43 days in India and 141 days in Turkey3. A meta-analysis of 12 LMICs revealed an average diagnostic delay of 3.3 months in breast cancer4. Delays in definitive diagnosis and treatment can cause patient anxiety, increase costs, and reduce the potential benefits of treatment5,6. Early cancer detection is widely recognised for significantly improving cure rates, increasing 5-year survival, and reducing treatment costs7. Studies suggest that detecting cancer before it reaches stage IV could potentially decrease cancer-related deaths by at least 15% within 5 years8.

However, the U.S. Preventive Services Task Force (USPSTF) currently endorses screening for only four cancer types: biennial screening mammography for breast cancer9, cervical cytology and/or primary high-risk human papillomavirus (hrHPV) testing for cervical cancer10, as for colorectal cancer, stool-based tests including the high-sensitivity guaiac faecal occult blood test (gFOBT), faecal immunochemical test (FIT), and stool DNA test, along with direct visualisation tests including colonoscopy, CT colonography and flexible sigmoidoscopy11. Additionally, low-dose computed tomography (LDCT) is endorsed for lung cancer screening12. However, the application of these technologies in LMICs faces limitations due to high costs, complexity, and the requirement for sophisticated infrastructure and laboratories13. Yet, effective early detection methods are still lacking for the majority of cancers. Thus, there is an urgent need for an affordable and accessible test that can detect multiple cancer types, thereby enabling more precise clinical assessments and earlier diagnoses.

The application of liquid biopsy in oncology has exploded in the past decade, particularly in cancer early detection, diagnosis, prognosis, and treatment14,15. Liquid biopsy has been explored for screening of specific cancer types, including lung16, liver17, colorectal18, and lymphoma19, as well as for multiple cancer types20. In our previous study13, we developed a blood-based test for multi-cancer early detection (MCED), named OncoSeek. By integrating a panel of seven selected protein tumour markers (PTMs) with individual clinical data, and enhanced by artificial intelligence (AI), OncoSeek offers a cost-effective and accessible solution. The multi-centre study, including a training cohort and two validation cohorts, showed OncoSeek’s satisfactory performance and robustness in MCED across nine common cancer types, achieving an overall sensitivity of 51.7% at 92.9% specificity. Study participants were recruited from SeekIn and Sun Yat-sen Memorial Hospital (SYSMH) in China, and Johns Hopkins University School of Medicine (JHUSM) in the USA. OncoSeek was performed on two quantification platforms, Roche Cobas e411/e601 and Bio-Rad Bio-Plex 200.

In this study, we aim to assess the robustness of OncoSeek across more cohorts, platforms, and populations. To achieve this, we integrated four additional validation cohorts, a case-control cohort of symptomatic individuals, a prospective blinded study, and two retrospective case-control cohorts conducted on two distinct platforms. These cohorts consist of the participants from diverse geographic locations, utilising various sample types and platforms for PTM analysis. Furthermore, the symptomatic patient’s cohort allows us to further explore the potential clinical value of OncoSeek in the early diagnosis of cancer.

Results

The consistency of OncoSeek across different laboratories

Considering the diverse origins of study participants and the multiple laboratories involved, variations in sample types, reagents, instruments, and operators were expected. To evaluate the consistency of the results across different centres, we conducted repetitive experiments on a randomly selected subset of samples. Five non-cancer plasma samples from SeekIn underwent the analysis of seven PTMs at SeekIn and Shenyou laboratories both using Roche Cobas e401 analysers. Furthermore, 13 cancer patients’ plasma and serum samples from the Sun Yat-sen Memorial Hospital were assessed for the seven PTMs at SeekIn and Sun Yat-sen Memorial Hospital utilising Roche Cobas e411 and e601 analysers, respectively. In addition to the differences in laboratory settings and technicians, this validation incorporated two sample types, serum and plasma, performed on two different Roche instruments, Cobas e411 and e601. The results demonstrated a high degree of consistency, as evidenced by the close alignment of all PTM results to the 45-degree dashed line in Fig. 1a, b. The robust linear correlation across different laboratories is underscored by a Pearson correlation coefficient reaching 0.99 (Fig. 1a) and 1.00 (Fig. 1b), indicating the assay’s reliability for both cancer patients and non-cancer individuals.

Fig. 1: The consistency of PTM results across different laboratories.
figure 1

The correlations of PTM results between SeekIn and Shenyou laboratories (a) and SeekIn and SYSMH laboratories (b). The dashed line represents the identity line (y = x). SYSMH, Sun Yat-sen Memorial Hospital.

Participants’ demographic characteristics

Demographic characteristics of the study participants, including gender, age, cancer types, and stages, are detailed in Table 1. Following the consolidation of all seven cohorts, the collective group is referred to as the “ALL cohort”. Within this ALL cohort, the median age is recorded at 53 years. The gender distribution is almost equal, with 7704 (50.9%) females and 7418 (49.1%) males. The ALL cohort encompasses a variety of cancer types, 18 cases of bile duct, 311 cases of breast, 8 cases of cervix, 604 cases of colorectum, 5 cases of endometrium, 22 cases of gallbladder, 22 cases of head and neck, 314 cases of liver, 770 cases of lung, 233 cases of lymphoma, 137 cases of oesophagus,106 cases of ovary, 153 cases of pancreas, 259 cases of stomach, and additional 67 cases categorised as other types of cancer (Table 1, Supplementary Table 1).

Table 1 Clinical and demographic characteristics of participants

The cancer detection performance of OncoSeek in four validation cohorts and the ALL cohort

In this study, we evaluated the effectiveness of OncoSeek in detecting cancer across four validation cohorts and the comprehensive ALL cohort. A receiver operating characteristic (ROC) curve was plotted to assess OncoSeek’s ability to classify cancer patients from non-cancer individuals (Fig. 2). The area under the curves (AUCs) for the HNCH, FSD, BGI, and PUSH cohorts were 0.883, 0.912, 0.822, and 0.825, respectively. These results were strikingly consistent with those from the three previously reported cohorts, which had AUCs of 0.826, 0.744, and 0.819, respectively13. After combining the seven cohorts into one (the ALL cohort), the AUC was determined to be 0.829. These similar results indicate that OncoSeek maintains excellent consistency across diverse cohorts.

Fig. 2: The performance of OncoSeek across various cohorts was assessed using the receiver operating characteristic (ROC) curve.
figure 2

The corresponding area under the curve (AUC) values for the five cohorts were illustrated in the figure. The dotted vertical lines on the ROC curves indicate the threshold for 90.0% specificity. HNCH Henan Cancer Hospital, FSD First Saúde, PUSH Peking University Shenzhen Hospital.

OncoSeek showed high sensitivity and specificity in both symptomatic and prospective blinded cohort, with HNCH and FSD sensitivities of 73.1% (95% CI: 70.0%–76.0%) and 72.2% (95% CI: 46.5%–90.3%), respectively, and specificities of 90.6% (95% CI: 87.9%–92.9%) and 93.6% (95% CI: 87.3%–97.4%), respectively. The BGI and PUSH cohorts, assessed on two distinct quantification platforms, displayed comparable sensitivities of 55.9% (95% CI: 37.9%–72.8%) and 59.7% (95% CI: 51.4%–67.7%), respectively, along with specificities of 95.0% (95% CI: 92.4%–96.8%) and 90.0% (95% CI: 89.0%–91.0%) (Table 2).

Table 2 The performance of OncoSeek in different cohorts

Following the integration of the seven cohorts into the ALL cohort, the study included a total of 15,122 participants, comprising 3029 cancer patients and 12,093 non-cancer individuals, from three countries, and performed on four different platforms. In this large cohort, OncoSeek showed a sensitivity of 58.4% (95% CI: 56.6%–60.1%) and a specificity of 92.0% (95% CI: 91.5%–92.5%) (Table 2). Sensitivities vary for different types of cancers, bile duct at 83.3%, gallbladder at 81.8%, endometrium at 80.0%, pancreas at 79.1%, cervix at 75.0%, ovary at 74.5%, lung at 66.1%, liver at 65.9%, head and neck at 59.1%, stomach at 57.9%, colorectum at 51.8%, oesophagus at 46.0%, lymphoma at 42.9%, and breast at 38.9% (Fig. 3a). These cancer types constitute a significant burden, representing over 60% of worldwide cancer cases and more than 72% of cancer-related mortalities1. These findings underscored OncoSeek’s potential as a reliable assay for multi-cancer early detection. Given that breast, cervical, colorectal, and lung cancers are recommended by the U.S. Preventive Services Task Force (USPSTF) for screening with established methods9,10,11,12, we evaluated OncoSeek’s efficacy in identifying ten additional cancer types (bile duct, endometrium, gallbladder, head and neck, liver, lymphoma, oesophagus, ovary, pancreas, and stomach). The assay sustained a specificity of 92% while achieving a sensitivity of 61.4%. Hence, OncoSeek may serve as a valuable adjunct for cancers lacking effective early detection methods. Additionally, sensitivity was observed to increase with advancing clinical stages. At stage I, the sensitivity was 42.8%, rising to 52.1% for stage II, 61.9% for stage III, and peaking at 79.7% for stage IV (Fig. 3b). Subsequently, we evaluated the performance of OncoSeek for nine cancer types with over 100 cancer patients each in the ALL cohort, including breast, colorectum, liver, lung, lymphoma, oesophagus, ovary, pancreas, and stomach. The sensitivities of cancer signal detection for these nine pre-specified cancers were further detailed by individual cancer type and clinical stage in Fig. 3c, Supplementary Table 1, Supplementary Fig. 1. To further compare the consistency of sensitivity and specificity for each cancer type, we assumed a rectangular stage distribution for each cancer to calculate stage-standardised sensitivity estimates. However, due to insufficient sample sizes for each specific neoplasia in the BGI, FSD, and PUSH cohorts, and missing stage information in the SYSMH cohort, we selected only three cohorts—SeekIn, JHUSM, and HNCH—for this analysis (Supplementary Table 2). Specificities were similar across these cohorts, with values of 90.0%, 88.9%, and 90.6% for SeekIn, JHUSM, and HNCH, respectively. Moreover, the Kruskal-Wallis test indicated that there were no statistically significant differences in the stage-standardised sensitivity estimates for each cancer type among these three cohorts.

Fig. 3: The sensitivity of OncoSeek in large-scale MCED study.
figure 3

a OncoSeek sensitivity across cancer types. The graph presents sensitivity on the y-axis for various cancer types depicted on the x-axis, with a focus on individual cancer types. The types were arranged in descending order of sensitivity, and the bars represented the 95% confidence interval. The sample size for each cancer type was noted in parentheses. b OncoSeek sensitivity by clinical stage. c OncoSeek sensitivity by stage in the nine pre-specified cancers. Sensitivity was illustrated on the y-axis, and categorised by the clinical stage of cancer on the x-axis. The bars denote the 95% confidence interval, and the sample size for each stage was provided in parentheses.

Predicting affected TOO

A key feature of MCED tests is their ability to identify the tissue of origin (TOO), crucial for directing targeted diagnostic approaches. We evaluated OncoSeek’s ability to identify the TOO for nine prevalent cancers in 1663 cancer cases across the seven cohorts, where cancer signals were detected and the cancer types were confirmed. The overall accuracy of the top two most possible organ systems in the true positives was 70.6% (95% CI: 68.4%–72.7%). Ovarian cancer showed the highest accuracy at 73.4%, followed by lung cancer at 61.5%. Pancreatic and breast cancers both achieved an accuracy of 60.3%. Liver cancer accuracy was 58.5%, lymphoma 51.0%, colorectal cancer 40.9%, and upper gastrointestinal (Upper GI) cancers, combined with oesophageal and gastric, were at 16.4% (Fig. 4).

Fig. 4: TOO accuracy by individual cancer type.
figure 4

The confusion matrices illustrate the accuracy of TOO prediction. The matrices display the concordance between the actual tumour origins on the x-axis and the OncoSeek model’s predictions on the y-axis. The colour scheme reflects the proportion of samples with predicted TOO. The analysis included 1663 individuals with confirmed cancer diagnoses and predicted as having cancer at 92.0% specificity. Upper GI upper gastrointestinal, TOO tissue of origin.

Discussion

Identifying cancer at an earlier stage can facilitate more effective treatments, enhancing the quality of life for cancer patients and the survival rates for many types of cancers21. In LMICs, early cancer detection faces numerous challenges, such as limited infrastructure, limited trained personnel, high equipment purchases and maintenance costs, and inadequate training of healthcare providers22,23. The existing cancer screening and treatment services in LMICs are often fragmented. This fragmentation can create barriers for patients, particularly those lacking the necessary knowledge or financial resources to navigate the steps between screening, follow-up, diagnosis, and treatment24. As a result, patients may experience delays in cancer diagnosis, resulting in increased treatment costs and higher mortality rates25,26. Therefore, providing reliable and affordable screening tests, and increasing the utilisation of routine cancer screenings would be beneficial in LMICs23.

In this large-scale clinical study, we introduced OncoSeek, a CE-IVD marked MCED test designed to detect multiple cancer types. By integrating blood-based protein tumour marker analysis with AI algorithms, OncoSeek offers a cost-effective solution at under $25 reagent cost per test, requiring no additional equipment, infrastructure, or technical personnel13. The affordability and accessibility make it more practical for the healthcare providers in LMICs. To further validate the performance of OncoSeek, we conducted the largest MCED clinical study to date, encompassing seven independent cohorts with a total of 15,122 participants. The participants were recruited from seven different centres across three countries, China, the United States, and Brazil. Throughout the study, we utilised two different sample types, plasma and serum, to perform OncoSeek testing on four distinct immunoassay platforms, Roche, Abbott, Bio-Rad, and GBI. As a result, OncoSeek’s performance in cancer detection was well-validated, achieving a sensitivity of 58.4%, a specificity of 92.0%, and overall accuracy of 70.6% in TOO. Sensitivity for detecting fourteen common cancer types varied from 38.9 to 83.3%, including bile duct, breast, cervix, colorectum, endometrium, gallbladder, head and neck, liver, lung, lymphoma, oesophagus, ovary, pancreas, and stomach. These cancers impose a significant burden, accounting for over 60% of global cancer cases and more than 72% of cancer-related mortalities1. The robustness of OncoSeek was further evidenced by its consistent performance across the seven cohorts. The AUCs were as follows: 0.826 for SeekIn, 0.744 for SYSMH, 0.819 for JHUSM, as previously reported13; and 0.825 for HNCH, 0.822 for FSD, 0.883 for BGI, and 0.912 for PUSH, respectively. Galleri, an MCED test utilising next-generation sequencing (NGS), achieved a sensitivity of 51.5% at a specificity of 99.5%20. When adjusted to the same specificity (99.5%), OncoSeek’s sensitivity was 31.0%. Beyond performance metrics, OncoSeek offers notable advantages in cost-effectiveness and turnaround time. While Galleri is priced at $949 in the United States, OncoSeek provides a significantly more affordable alternative (<$25 reagent cost). Additionally, OncoSeek delivers results within 1.5 h, making it substantially faster than NGS-based MCED tests like Galleri, which takes days to complete. To optimise specificity while maintaining adequate sensitivity, we proposed a two-step screening strategy27. OncoSeek serves as the initial screening tool, and the individuals testing positive undergo a secondary test, SeekInCare—an MCED test integrating the seven protein tumour markers with four cfDNA-based genomic features: copy number aberration (CNA), fragment size (FS), end motifs, and oncogenic viruses, using shallow whole-genome sequencing at 3× coverage19,28. This two-step approach enhances specificity from 91.0 to 99.3% while preserving a sensitivity of 51.5%, effectively balancing sensitivity and specificity.

Furthermore, OncoSeek demonstrated a sensitivity of 61.4% for 10 cancer types, which lack established USPSTF-endorsed screening tests, including bile duct, endometrium, gallbladder, head and neck, liver, lymphoma, oesophagus, ovary, pancreas, and stomach. Certain cancers, such as bile duct29, gallbladder30, and pancreas31 are particularly challenging due to the absence of robust detection methods. Their asymptomatic nature and elusive locations often lead to late-stage diagnosis, severely limiting treatment options and contributing to poor prognoses. OncoSeek can be utilised for early detection of these cancers, offering patients more treatment options. Conversely, in the cancer types such as breast, oesophagus, and lymphoma, OncoSeek exhibits a sensitivity below 50%. Nonetheless, existing methods, including biennial mammograms, an X-ray imaging procedure of the breasts, effectively manage breast cancer detection32. Oesophageal cancer, easily accessible via endoscopy, allows for early diagnosis and potential curative treatment33. OncoSeek’s sensitivity in different cancer types are influenced by the presence of specific PTMs and the aggressiveness of the cancer. For instance, liver cancer, indicated by elevated AFP levels34, results in high sensitivity due to its unique PTMs. In contrast, cancer types like oesophageal, which lack such specific PTMs, are detected less readily. Furthermore, cancer invasiveness correlates with the amount of cancer proteins and/or DNAs released into the bloodstream20. Highly invasive cancers, like pancreatic cancer, tend to release more PTMs, thus increasing the sensitivity. Conversely, cancers like breast cancer, which is less invasive especially when it is localised, are less detectable by the blood-based tests20. Additionally, OncoSeek exhibited a sensitivity of 73.1% and a specificity of 90.6% among symptomatic individuals, which was for the higher proportion of cancer with advanced stages. This is consistent with the findings of the SYMPLIFY study using Galleri, reporting 66.3% sensitivity and 98.4% specificity in symptomatic patients35. The results suggest that OncoSeek can effectively serve as an auxiliary diagnostic tool for the individuals with cancer-related symptoms as well.

Similar to the other MCED technologies20,36,37, OncoSeek follows a sequential process – first detecting the presence of a cancer signal and then assigning its TOO. Errors can occur at each step, accumulating to affect overall performance. For example, in lung cancer, the sensitivity for detecting a cancer signal was 66.1% (509/770). Yet, only 61.5% (313/509) of these positive cases were correctly classified as originating in the lung, resulting in an overall sensitivity of 40.6% (313/770) for lung cancer. This distinction is clinically important, as the cancer type is unknown at the time of screening. The sequential nature of the current MCED assays is therefore a defining feature and a key consideration when evaluating their real-world utility. Future improvements in TOO assignment have the potential to mitigate error propagation and enhance overall clinical performance.

This study, while valuable, had several limitations. Despite including a small prospective blinded cohort, the study is predominantly a case-control design. Therefore, a larger-scale prospective study is needed to provide more robust evidence supporting the clinical application of OncoSeek. Meanwhile, the sensitivity of OncoSeek was lower at stage I (42.8%) compared to stage IV (79.7%), indicating a limitation in detecting early-stage cancers. As suggested in the published prospective studies, the sensitivity for early-stage cancers could be even lower, potentially leading to missed detection of early cancer patients38.

In summary, OncoSeek exhibited excellent consistency across seven cohorts from three countries, on four platforms, and with two types of samples. This performance indicates that OncoSeek is a robust MCED test. Coupled with its affordability and accessibility, it is well-suited for multi-cancer early detection, particularly in LMICs. Moreover, the superior performance in symptomatic patients supports that it can be used for early cancer diagnosis in the future.

Methods

Study design and quantification of PTMs

This study has incorporated four additional validation cohorts on top of three cohorts from the previous study13. Initially, we included a cohort of symptomatic patients, aiming to assess the efficacy of OncoSeek in early cancer diagnosis. Symptomatic patients are outpatients who exhibit cancer-related symptoms such as abdominal pain, radiographic detection of masses or nodules, and other indicators that need to be evaluated for cancer diagnosis. We enroled 1424 participants comprised of 869 symptomatic patients and 555 non-cancer individuals from the Henan Cancer Hospital (HNCH) between January 2023 and August 2023. Serum samples were collected to quantify the levels of seven selected PTMs including AFP, CA125, CA15-3, CA19-9, CA72-4, CEA, and CYFRA21-1 using the Roche Cobas e601/e602 analyser (Roche Diagnostics GmbH, Mannheim, Germany)13. Subsequently, we expanded our study by including a prospective-blinded validation cohort of 128 participants from First Saúde, Brazil (FSD), which included 110 non-cancer individuals and 18 cancer patients. Among the cancer patients, some were diagnosed before blood sampling, while others were diagnosed afterward. All this information was blinded to us. The plasma samples of these individuals were collected and the seven PTMS were detected similarly using the Roche Cobas e601 analyser13. This study received approval from the Medical Ethics committee of Henan Cancer Hospital (2023-KY-0153). All participants provided written informed consent upon enrolment. The study was performed in accordance with the Declaration of Helsinki.

Further, to assess OncoSeek’s platform feasibility, we included two cohorts using two distinct platforms. One cohort included 450 participants (34 cancer patients and 416 non-cancer individuals) from BGI Clinical Laboratories (BGI), with PTMs assessed using plasma samples by an ELISA analyzer (BGI-GBI Biotech, Beijing, China), as detailed by Ji X et al.39. The other cohort, from Peking University Shenzhen Hospital (PUSH) between January 2017 and June 2018, included 3738 participants (149 cancer patients and 3589 non-cancer individuals). Peripheral blood samples were collected using serum collection tube (Improve Medical, Guangzhou, China). Serum was separated from the blood samples via centrifugation at 1300 × g for 10 min, conducted at room temperature (approximately 20 °C to 25 °C) within 4–6 h after collection. 1 ml serum was used to quantify the levels of six PTMs on the Abbott I2000 platform (Abbott Laboratories, Chicago, USA) using the commercially available reagent kits following the manufacturer’s instructions. The six PTMs included AFP, CA125, CA15-3, CA19-9, CEA, and CYFRA21-1. CA72-4 was excluded due to its unavailability on the Abbott I2000 platform.

OncoSeek algorithmic analysis

As reported in our previous study13, we integrated the levels of PTMs into the OncoSeek algorithm to calculate the probability of cancer (POC). We selected the POC value associated with a 90.0% specificity in our training cohort as the threshold for cancer signal detection13. A POC exceeding this threshold was indicative of detected cancer signals; conversely, a POC below the threshold suggested the absence of such signals. For cohorts with missing CA72-4 data, we use the average value of the healthy group from the training cohort, which was 4.38, instead of the missing values. Next, we utilised a Random Forest (RF) algorithm to predict the tissue of origin (TOO) for patients with a true positive diagnosis. The RF algorithm prioritised organs based on their prediction probabilities, with the top two being identified as potential sites of tumour origin13.

Statistical analysis

We performed all statistical analyses utilising R software, version 4.1.2, a widely recognised platform for statistical computing (https://www.r-project.org). The performance of the receiver operating characteristic (ROC) curve was assessed to differentiate between cancer patients and non-cancer individuals, employing the pROC package, version 1.18.0. The metrics of sensitivity, specificity, and predictive values were ascertained through the epiR package, version 2.0.40 (https://cran.r-project.org/web/packages/epiR/index.html), which is dedicated to epidemiological analysis. Additionally, the DeLong test was implemented to statistically compare the areas under the ROC curves (AUCs), providing a measure of prediction accuracy.