Introduction

Oral squamous cell carcinoma (OSCC) constitutes a predominant subtype of head and neck squamous cell carcinoma (HNSCC), with ~90% of cases being HPV-negative, occurring mainly in Asian populations1. Globally, OSCC accounted for 377,713 new cases and 117,757 deaths in 20212. Although the incidence of HPV-negative disease is decreasing, its prognosis is significantly worse than that of HPV-positive disease3.

This discrepancy is largely attributed to the absence of reliable biomarkers for HPV-negative OSCC, essential for developing treatment plans and monitoring therapeutic responses4. Currently, the diagnosis and surveillance of OSCC are primarily based on clinical manifestations and radiological examinations. However, the process of early detection of recurrent disease is significantly challenging, especially due to micro metastases in lymph nodes and fibrosis or inflammation caused by radiation therapy or surgery5. These factors can obscure the identification of residual or recurrent tumor tissue, leading to delays in accurate detection6.

Circulating tumour DNA (ctDNA), a subset of cfDNA, is released into the bloodstream from apoptotic or necrotic tumor cells and carries tumor-specific genetic alterations. Unlike traditional biopsies, ctDNA reflects tumor heterogeneity more accurately, enhancing the detection of mutations relevant to targeted therapies7. As a critical component of liquid biopsies, ctDNA offers high sensitivity and specificity for monitoring tumor burden, assessing treatment efficacy, and detecting relapse or metastasis, thereby enabling personalized treatment strategies8.

Previous studies have demonstrated the potential of liquid biopsy in HNSCC management9,10. Multiple studies have shown that ctDNA detection precedes the clinical manifestation of disease recurrence, offering opportunities for early intervention11,12. Moreover, integrating different sample types has emerged as a promising strategy to enhance detection sensitivity13. For instance, Wang et al. reported that combining oral rinse and plasma samples increased detection rates compared to single-sample testing, with detection patterns varying by clinical stage and tumor location14. That finding was further supported by Ahn et al., who demonstrated improved sensitivity when combining oral rinse and plasma analysis in HNSCC patients15. Those studies highlight the complementary nature of different liquid biopsy samples, yet comprehensive mutation profiling and systematic evaluation of their clinical utility in HPV-negative OSCC remain limited.

Results

Characteristic of the study cohort and experimental design

We enrolled 123 HPV-negative OSCC patients (65 males and 58 females; median age 58 years, range 38–78 years). Among these patients, 68 were diagnosed with early-stage (I/II) and 55 with advanced-stage (III/IV) disease. The primary tumors were predominantly located on the tongue and gingiva, accounting for 61.8% of the cases, followed by occurrences on the buccal mucosa and palate. To comprehensively profile mutations and evaluate their clinical utility, we designed a multi-phase study using a targeted 551-gene NGS panel (Supplementary Data 1). In the discovery phase, we analyzed all patients’ paired tissue, oral rinse, and plasma samples at baseline. Additionally, we conducted longitudinal monitoring in a subset of 30 patients to track disease progression. Baseline characteristics and treatment methods are detailed in Table 1, and the study design and workflow are illustrated in Fig. 1.

Fig. 1: The study design, sample collections, and patients’ responses.
figure 1

Detectable mutation refers to patients with any identifiable somatic mutation using our NGS panel.

Table 1 Patients clinical characteristics

Comprehensive mutation profiling across tissue and liquid biopsy samples

In tumor tissue analysis, we identified mutations in all 123 tissue samples, with a median of 10 mutations per sample (range: 2–21). A total of 1284 mutations involving 160 genes were identified in the gDNA of the tumor tissues. The top four highly mutated genes were TP53, TERT, MYC, and PIK3CA, with mutation frequencies of 60.1% (74/123), 50.4% (62/123), 43.1% (53/123), and 39.8% (49/123), respectively (Supplementary Data 2). To validate our findings in an independent cohort, we analyzed mutation data from TCGA-HNSCC database. Within the 508 HNSCC patients with available SNV data, we observed that TP53 mutations occurred at a frequency of 69.3% (352/508), comparable to our cohort (60.1%). However, we noted substantially lower frequencies for EP300 (7.5%, 38/508), IKZF1 (1.4%, 7/508), and particularly TERT mutations (0.6%, 3/508) compared to our findings. Analysis of CNV data revealed notably higher amplification frequencies in the TCGA cohort for PIK3CA (77.1%, 374/485), MYC (76.7%, 372/485), and EGFR (56.7%, 275/485) than in our study population. These differences may be partially attributed to the higher proportion of advanced-stage disease in the TCGA cohort (77.3%, 375/485), as well as potential variations in detection methodologies and population characteristics.

In liquid biopsy analysis, ctDNA detection rates varied between sample types, with mutations detected in 94.3% (116/123) of oral rinse samples and 80.5% (99/123) of plasma samples (p = 0.002). Interestingly, while this difference was significant overall, the detection rates became comparable in advanced clinical stages (stages III or IV) (92.7% [51/55] versus 89.1% [49/55], p = 0.742). Notably, the baseline detection rate of plasma ctDNA was significantly associated with tumor differentiation (p = 0.028), lymph node metastasis (p = 0.007), and clinical stage (p = 0.003), while oral rinse-derived ctDNA detection remained consistent across clinical characteristics (Table 1).

Regarding mutation profiles, we identified 471 somatic mutations involving 109 genes in oral rinse samples (median 2 mutations/sample, range: 0–12) and 430 mutations involving 114 genes in plasma samples (median 1 mutation/sample, range: 0–12). The most frequently mutated genes in oral rinse samples were TP53, TERT, MYC, and EP300 (38.7% [45/116], 30.2% [35/116], 23.3% [27/116], and 23.3% [27/116], respectively), while in plasma samples they were TP53, TERT, MYC, and APC (42.4% [42/99], 30.3% [30/99], 26.3% [26/99], and 21.2% [21/99], respectively). The complete mutation spectra of oral rinse and plasma cfDNA samples are shown in Fig. 2A, B, respectively (mutated genes present in at least two patients).

Fig. 2: Comprehensive mutational profiling of cell-free DNA from oral rinse and plasma samples.
figure 2

Heatmap illustrating the mutational profile of 116 oral rinse-derived cfDNA samples (A) and 99 plasma cfDNA samples (B). Each column represents a tumour sample, and each row represents a gene. Only genes mutated in more than two cases are shown. The upper bar indicates the number of mutations per patient, and the right bars show gene mutation frequencies across the cohort. Samples are clustered by pathological subtype. cfDNA cell-free DNA.

Comparative analysis of mutation profiles between tissue and liquid biopsy samples

Analysis of somatic mutations in the 99 patients with paired samples revealed distinct but overlapping mutation patterns. We identified 352 mutations shared between tissue and oral rinse, and 366 between tissue and plasma (Fig. 3A, B). Additionally, 97 mutations were unique to oral rinse-derived ctDNA and 62 to plasma ctDNA, with a total of 544 mutations present in either tissue and oral rinse or tissue and plasma (Fig. 3C). These findings suggest that oral rinse and plasma samples can serve as complementary sources for mutation detection may capture different aspects of tumor heterogeneity and can even be enriched in clones with metastatic potential.

Fig. 3: Concordance between oral rinse-derived cfDNA, plasma cfDNA, and tumour gDNA genotyping.
figure 3

AC Venn diagrams showing the number of mutations in oral rinse-derived cfDNA and tumour gDNA (A), plasma cfDNA and tumour gDNA (B), and those shared between tissue and oral rinse-derived or plasma cfDNA (C). D Bar graph showing the allele frequency of variants in tumour gDNA. Blue bars indicate mutations detected in both tumour gDNA and cfDNA (oral rinse-derived or plasma), while red bars show mutations detected only in tumour gDNA. E Correlation between tumour DNA content in oral rinse and plasma samples. cfDNA cell-free DNA, gDNA genomic DNA.

To further evaluate the genetic correlation and differences between oral rinse and plasma samples, we analyzed the impact of each mutation’s VAF on detection concordance. Using tumor gDNA genotyping as the gold standard, the overall sensitivity was 31.4% (95% Confidence Interval [CI]: 28.7%–34.3%) for oral rinse-derived cfDNA and 32.7% (95% CI: 30%–35.5%) for plasma cfDNA. Notably, combining both sample types significantly improved the detection sensitivity to 48.6% (95% CI: 45.6%–51.5%). Mutations identified in tumor gDNA but not in plasma or oral rinse-derived cfDNA typically had lower VAFs (mean VAF = 8.4%, Fig. 3D). When patients with low VAF mutations (VAF < 5%) were excluded from tumor biopsy, detection sensitivity increased to 36.5% (95% CI: 33.4%–39.7%) for oral rinse-derived cfDNA and 37.1% (95% CI: 34%–40.4%) for plasma cfDNA, with the combined sensitivity reaching 55.9% (95% CI: 52.6%–59.1%).

Overall, OSCC exhibits genetic heterogeneity, with slight differences in mutation occurrence between oral rinse and plasma. Compared to individual oral rinse or plasma testing, the sensitivity of detecting these biomarkers significantly increases when both oral rinse and plasma are tested simultaneously. Interestingly, we observed no significant correlation between tumor DNA content in oral rinse and plasma samples (correlation coefficient = 0.0468, p = 0.6458; Fig. 3E).

Identification of the features significantly associated with 1-year recurrence

We investigated the relationship between 1-year recurrence and both genetic and clinical characteristics. The cohort was randomly divided at a 2:1 ratio into training (n = 82) and validation (n = 41) sets to develop and validate the prediction model. Through differential mutation analysis between patients with and without 1-year recurrence, we identified seven SNV mutation genes (TP53, TERT, APC, FAT1, VEGFA, IKZF1, and EP300) and three CNV mutation genes (MYC, EGFR, and PIK3CA) with mutation frequencies greater than 10% (Fig. 4A; Supplementary Data 2). Stepwise logistic regression analysis further refined these to seven genes significantly associated with recurrence (TP53, TERT, IKZF1, EP300, MYC, EGFR, and PIK3CA) (Fig. 4B, C; Supplementary Data 3).

Fig. 4: Overview of clinical and mutational characteristics.
figure 4

A The most frequently mutated genes. B Significantly different SNVs included in the 1-year recurrence model. C Significantly different CNVs included in the 1-year recurrence model. D Differences in clinical characteristics based on 1-year recurrence status. p-values were calculated using Fisher’s exact test. The size of the white dots in (C) represents sample size. SNV single-nucleotide variant, CNV copy number variation.

Considering the potential importance of clinical factors in prognosis, we further compared one-year recurrence among patients grouped by various clinical factors, including gender, age, smoking status, alcohol consumption status, differentiation status, treatment modality, T stage, lymph node metastasis status, and clinical stage as detected by immunohistochemistry. The results showed that patients with advanced clinical stages (69.8% versus 35.9%, p = 0.004), lymph node metastasis (63.6% versus 33.0%, p = 0.018), and advanced T stage (T3, T4) (63.5% versus 36.7%, p = 0.023) had higher one-year recurrence rates. To avoid collinearity among clinical variables, we selected T stage and lymph node metastasis as independent clinical factors for model construction (Fig. 4D).

Machine learning integrates gene mutation status and clinical factors to build a tumour prediction model to predict patient outcomes

In summary, we identified 7 mutated genes and 2 clinical factors associated with 1-year recurrence in patients. The Random Forest model demonstrated not only the highest predictive performance but also the most stable results across both training and validation sets. While XGBoost and GBM showed excellent performance in the training set (AUC: 0.861, 95% CI: 0.781–0.942 and AUC: 0.866, 95% CI: 0.786–0.946, respectively), their performance slightly decreased in the validation set (AUC: 0.827, 95% CI: 0.698–0.957 and AUC: 0.837, 95% CI: 0.714–0.960, respectively). The Elastic Net achieved moderate results in its best feature combination (training set: AUC: 0.807, 95% CI: 0.707–0.906; validation set: AUC: 0.794, 95% CI: 0.659–0.929), but its overall performance was inferior to the other models (Supplementary Table 1). Based on its superior c-index and stability, we selected the Random Forest as our final prediction model. Further analysis revealed that the combined model integrating seven mutated genes and 2 clinical factors demonstrated excellent sensitivity and specificity in both the training cohort (AUC: 0.869, 95% CI: 0.798–0.950) and the validation cohort (AUC: 0.854, 95% CI: 0.740–0.967) (Fig. 5A). Using nomogram analysis based on multivariate logistic regression, we identified EGFR, PIK3CA, MYC, TP53, and TERT as the strongest predictors among all incorporated features, with EGFR mutation demonstrating the highest predictive weight (Fig. 5B).

Fig. 5: Predicting 1-year recurrence and PFS/OS by combining mutation and clinical characteristics.
figure 5

A ROC curve for the 1-year recurrence predictive model in the training and validation cohorts. B A nomogram based on stepwise logistic regression to predict 1-year recurrence. C Predicting PFS and OS by combining key mutation and clinical characteristics. D, E Kaplan–Meier curves comparing high- and low-risk groups for PFS (D) and OS (E). PFS progression-free survival, OS overall survival.

Since patients without 1-year recurrence showed better survival outcomes, we further evaluated the model’s long-term prognostic value. In the follow-up analysis of 113 patients (10 lost to follow-up; median follow-up time 846 days, range 117–1623 days), the model demonstrated good performance in predicting both progression-free survival (PFS, C-index = 0.753, 95% CI: 0.700–0.806, AIC = 576.6, BIC = 597.2) and overall survival (OS, C-index = 0.700, 95% CI: 0.630–0.771, AIC = 471.6, BIC = 490.7) (Fig. 5C). Risk stratification based on the model’s predictions effectively identified patient subgroups with significantly different outcomes: the low-risk group showed markedly better PFS (HR, 4.977; 95% CI, 2.976–8.322; p < 0.001) and OS (HR, 3.644; 95% CI, 2.031–6.538; p < 0.001) than did compared to the high-risk group (Fig. 5D, F). To validate these findings in an external cohort, we constructed a simplified model incorporating T stage, N stage, and three key CNV mutations using the TCGA-HNSCC dataset. This validation model also successfully stratified patients into groups with significantly different survival outcomes (HR = 1.326; 95% CI: 1.009–1.743; p = 0.043), confirming the prognostic utility of combining these clinical and molecular parameters across different patient populations (Supplementary Fig. 1).

Clinical Relevance of Pre-treatment ctDNA Analysis

To evaluate the clinical utility of liquid biopsy, we assessed both ctDNA status and concentration in paired oral rinse and plasma samples from 99 patients. Using our personalized seven-gene panel, samples with at least two detectable somatic variations were classified as ctDNA-positive16. Analysis revealed positive ctDNA in 44.4% (44/99) of oral rinse samples and 36.4% (36/99) of plasma samples (Figs. 6A and 7A).

Fig. 6: Mutation landscape of pre-treatment oral rinse-derived ctDNA.
figure 6

A Overview of oral rinse-derived ctDNA status, clinical characteristics, and 1-year recurrence at baseline. B, C Kaplan–Meier estimates of PFS (B) and OS (C) according to pre-treatment oral rinse-derived ctDNA status. ctDNA circulating tumour DNA, PFS progression-free survival, OS overall survival.

Fig. 7: Mutation landscape of pre-treatment plasma ctDNA.
figure 7

A Overview of plasma ctDNA status, clinical characteristics, and 1-year recurrence at baseline. B, C Kaplan–Meier estimates of PFS (B) and OS (C) according to pre-treatment plasma ctDNA status. ctDNA circulating tumour DNA, PFS progression-free survival, OS overall survival.

In univariate analysis, significant associations were observed between positive ctDNA status and both T stage (oral rinse: p = 0.003; plasma: p = 0.029) and lymph node metastasis (oral rinse: p = 0.035; plasma: p = 0.029). However, multiple linear regression modeling for both oral rinse ctDNA (R² = 0.113) and plasma ctDNA (R² = 0.082) status did not demonstrate strong associations between ctDNA status and these clinical features, although T stage remained significant in oral rinse ctDNA (p = 0.012). T stage and lymph node metastasis showed marginal significance in plasma ctDNA multivariate models (p = 0.093) (Supplementary Tables 2,3).

Notably, ctDNA status demonstrated substantial prognostic value. Both sample types significantly associated Positive ctDNA status with inferior survival outcomes. In oral rinse samples, ctDNA positivity predicted shorter PFS (HR, 2.978; 95% CI, 1.791–4.951; p < 0.001) and OS (HR, 1.980; 95% CI, 1.15–3.406; p = 0.014). Similarly, plasma ctDNA positivity was associated with decreased PFS (HR, 3.311; 95% CI, 1.999–5.485; p < 0.001) and OS (HR, 2.111; 95% CI, 1.241–3.591; p = 0.006) (Figs. 6B, C; and 7B, C).

In contrast, quantitative analysis of ctDNA concentration showed limited prognostic value. Pre-treatment ctDNA concentrations averaged 4.23 ng/mL (range 0.89–15.6 ng/mL) in oral rinse and 5.15 ng/mL (range 0.59–16.12 ng/mL) in plasma. Similarly, while univariate analysis revealed associations between ctDNA concentration and clinical features (lymph node metastasis with oral rinse-derived ctDNA, p = 0.012; tumor site and T stage with plasma ctDNA, p = 0.010 and p = 0.032, respectively), multiple linear regression modeling for both oral rinse (R² = 0.109) and plasma (R² = 0.212) ctDNA did not demonstrate strong associations between baseline ctDNA values and these clinical features, although lymph node metastasis remained significant in oral rinse (p = 0.006) and tumor site remained marginally significant in plasma (p = 0.029) multivariate models (Supplementary Tables 4, 5).

When patients were stratified using optimized concentration thresholds (3.59 ng/mL for oral rinse; 3.84 ng/mL for plasma), no significant survival differences were observed between high and low concentration groups in either PFS (oral rinse: HR, 1.343; 95% CI, 0.820–2.199, p = 0.242; plasma: HR, 1.039; 95% CI, 0.636–1.697; p = 0.88) or OS (oral rinse: HR, 1.259; 95% CI, 0.740–2.143, p = 0.396; plasma: HR, 0.868; 95% CI, 0.511–1.476; p = 0.602) (Supplementary Fig. 2).

Evaluation of pre-treatment ctDNA status and concentration in early recurrence prediction models

For ctDNA status integration, all models demonstrated robust predictive performance, with AUCs ranging from 0.825 to 0.857. Notably, plasma ctDNA status integration achieved the highest performance (AUC: 0.857; 95% CI: 0.782–0.933), while the tumor prediction model alone and the oral rinse-derived ctDNA integration model showed comparable results (AUC: 0.843; 95% CI: 0.764–0.923 and AUC: 0.825; 95% CI: 0.741–0.909, respectively) (Fig. 8A). However, when incorporating ctDNA concentration data, both plasma and oral rinse-derived ctDNA models (AUC = 0.818; 95% CI: 0.731–0.904 and AUC = 0.824; 95% CI: 0.739–0.908, respectively) showed slightly inferior performance compared to the original tumor prediction model (AUC = 0.830; 95% CI: 0.747–0.913) (Fig. 8B).

Fig. 8: Prediction of 1-year recurrence and prognosis using a combined model with ctDNA status.
figure 8

A 1-year recurrence prediction based on the combined model incorporating DNA mutations, clinical factors, and ctDNA status from oral rinse and plasma samples. B 1-year recurrence prediction using the combined model with ctDNA concentration from oral rinse and plasma samples. C, D Kaplan–Meier curves comparing high- and low-risk groups for PFS (C) and OS (D). ctDNA circulating tumour DNA, PFS progression-free survival, OS overall survival.

In survival outcome analysis, we evaluated the impact of integrating ctDNA status and concentration on model performance. For PFS prediction, the model incorporating plasma ctDNA status (C-index = 0.756; 95% CI: 0.698–0.815; AIC = 477.0; BIC = 498.6) outperformed the baseline model (C-index = 0.730; 95% CI: 0.670–0.790; AIC = 481.3; BIC = 500.7) with improved C-index and lower AIC values. The oral rinse-derived ctDNA status model (C-index = 0.735; 95% CI: 0.677–0.794; AIC = 482.8; BIC = 504.4) showed only slight improvement in C-index but with increased AIC and BIC values (Supplementary Table 6).

In comparison, models incorporating ctDNA concentration for PFS prediction showed similar but generally inferior performance to ctDNA status models. The plasma ctDNA concentration model (C-index = 0.736; 95% CI: 0.676–0.795; AIC = 483.1; BIC = 504.7) and oral rinse-derived ctDNA concentration model (C-index = 0.733; 95% CI: 0.673–0.794; AIC = 480.6; BIC = 502.2) both demonstrated modest improvements in C-index, but their AIC and BIC values suggested potential overfitting (Supplementary Table 6).

For OS prediction, all integrated models showed minimal changes compared to the baseline model (C-index = 0.676; 95% CI: 0.594–0.759; AIC = 409.1; BIC = 427.4). The plasma ctDNA status model (C-index = 0.678; 95% CI: 0.596–0.760; AIC = 410.0; BIC = 430.2) and plasma ctDNA concentration model (C-index = 0.683; 95% CI: 0.601–0.765; AIC = 411.0; BIC = 431.3) slightly improved C-index but increased both AIC and BIC values. The oral rinse-derived ctDNA status model (C-index = 0.670; 95% CI: 0.588–0.752; AIC = 410.8; BIC = 431.1) and oral rinse-derived ctDNA concentration model (C-index = 0.661; 95% CI: 0.573–0.748; AIC = 408.9; BIC = 429.2) showed C-index values slightly lower than or comparable to the baseline model (Supplementary Table 6).

Based on these findings, we further stratified patients into high-risk and low-risk groups based on the median risk score. The low-risk group had significantly longer PFS (HR = 3.014; 95% CI: 1.796–5.060; p < 0.001) and OS (HR = 2.977; 95% CI: 1.662–5.333; p < 0.001) than did the high-risk group (Fig. 8C, D).

Mutational profile association with clinicopathological features and impact of treatment on mutation spectrum

In the analysis of the 7-gene panel (TP53, TERT, IKZF1, EP300, MYC, EGFR and PIK3CA) mutation spectrum and clinicopathological features in 123 OSCC patients, we identified two key gene mutations significantly associated with specific clinical characteristics, with an additional gene mutation showing borderline significance. TP53 mutations were significantly increased in advanced T stage (75.0% vs 53.0%, p = 0.030), patients with lymph node metastasis (76.3% vs 52.9%, p = 0.017), and those with advanced clinical stage (78.2% vs 45.6%, p < 0.001). Similarly, EGFR mutations were closely associated with advanced T stage (42.5% vs 20.5%, p = 0.017) and advanced clinical stage (40.0% vs 17.6%, p = 0.008). Furthermore, EP300 mutations exhibited a higher frequency in advanced T stage (47.5% vs 28.9%, p = 0.068), approaching statistical significance.

To investigate the impact of treatment on the 7-gene panel mutation spectrum, we conducted longitudinal monitoring of 30 OSCC patients. Among these patients, 19 ultimately experienced disease recurrence, whilst 11 remained recurrence-free. Following surgical treatment, ctDNA levels in all patients decreased below the detection limit. In subsequent monitoring, ctDNA became detectable again post-treatment in recurrent patients. Comparing the mutation spectrum before and after treatment, we observed high conservation of mutations across the 7-gene panel after treatment. Specifically, in oral rinse analysis, only 3 patients (ID 36, 44 and 117) lacked baseline mutations after treatment; in plasma ctDNA analysis, only 2 patients showed alterations: patient 24 lacked the EGFR amplification present at baseline, whilst patient 118 had no detectable mutations after treatment. All remaining recurrent patients maintained identical 7-gene panel mutation profiles compared to pre-treatment.

To gain deeper insight into the specific effects of treatment on mutation profiles, we conducted a detailed analysis of tissue sample data from a representative recurrent patient. Before treatment, the patient’s primary mutations included a TP53 termination mutation (p.R342*), TERT promoter mutation (c.-124C > T), NOTCH1 splicing variant and FAT1 truncation mutations, with VAF ranging from 5.04–8.37%. At clinical recurrence, these mutations were not only preserved but exhibited significantly increased VAFs, with TP53 reaching 30.03% and TERT reaching 29.71%. More notably, several new key driver events emerged at clinical recurrence, including FGFR1, ZNF703 and NSD3 amplifications, CDKN2A/B deletion, and an additional ATG2A mutation. Chromosome-level changes also underwent significant transformation: multiple chromosomal region deletions present before treatment essentially disappeared at clinical recurrence, whilst new amplifications in the 17q region and deletions in the 9p21.3 region emerged. These findings provide further evidence that treatment may induce tumour clonal composition remodeling and the emergence of new driver events (Supplementary Table 7).

Surveillance

Although not the primary objective of this study, it is interesting to determine whether tumour DNA can be detected in the oral rinse or plasma of patients after tumour resection. To explore the role of ctDNA in disease monitoring, we conducted a longitudinal analysis of ctDNA in 30 patients as a preliminary exploration. Among the 19 patients who experienced recurrence, ctDNA was detected in oral rinse samples from 17 patients an average of 132 days (range: 38-259 days) before clinical recurrence, and in plasma samples from 15 patients an average of 120 days (range: 36-259 days) before clinical manifestation (Fig. 9A, C).

Fig. 9: Longitudinal monitoring of oral rinse-derived and plasma ctDNA.
figure 9

AD Changes in ctDNA status and concentration from T0 to T2 in the oral rinse recurrence cohort (A), oral rinse non-recurrence cohort (B), plasma recurrence cohort (C), and plasma non-recurrence cohort (D). E, F Monitoring of serial oral rinse-derived (E) and plasma (F) samples from 19 patients, showing ctDNA detection (red circle) and subsequent relapse (yellow triangle). ctDNA was detected at all pre-operative time points (purple square). The dashed line indicates the total follow-up duration. ctDNA circulating tumour DNA.T0, pre-treatment. T1, post-operation. T2 post-operation detectable ctDNA.

The pattern of ctDNA detection varied among patients: three patients showed detectable tumor DNA exclusively in oral rinse, one patient only in plasma, and nine patients demonstrated simultaneous detection in both sample types. Detailed temporal analysis revealed interesting variations in detection timing. For instance, in four patients (patients 6, 15, 48, and 93), oral rinse-derived ctDNA was detected at 84,38, 82, and 539 days post-baseline, with plasma ctDNA detection following 110, 38, 64, and 47 days later, respectively. Conversely, in patient 36, plasma ctDNA was detected first at 76 days post-baseline, followed by oral rinse-derived ctDNA 33 days later (Fig. 9E, F).

Importantly, among patients without disease progression (n = 11), ctDNA remained consistently undetectable during dynamic sampling (Fig. 9B, D), supporting its potential specificity as a monitoring tool.

Discussion

In this study, we used a 551-gene targeted sequencing approach to analyze tumor tissue, plasma, and oral rinse samples from 123 HPV-negative OSCC patients. Through this comprehensive approach, our analysis reveals the molecular characteristics of OSCC and provides a framework for clinical application of liquid biopsy.

HPV-negative OSCC is most common in Asia1, characterized primarily by TP53 mutations and cell cycle regulation abnormalities17, these molecular features resulting in poorer prognosis, thus requiring specific biomarkers for risk stratification. A thorough understanding of the mutation landscape is essential for developing personalized treatment and monitoring methods that can help improve the high recurrence rates and survival outcomes in these patients.

Our study revealed significantly higher ctDNA detection rates in oral rinse (94.3%) compared to plasma (80.5%), consistent with previous findings by Wang et al and Mes et al.14,18. This differential detection efficiency likely stems from fundamental biological mechanisms governing tumor DNA release. Due to direct anatomical proximity, localized shedding of tumor DNA into the oral cavity occurs through processes including direct cellular exfoliation, local microvasculature leakage, and inflammatory cell-mediated clearance. These mechanisms are particularly relevant in early-stage disease, where at this stage tumors have not yet acquired the vascular invasion capabilities that are necessary for efficient systemic DNA release19,20. Notably, we observed that detection rates equalized in advanced disease stages (92.7% vs 89.1%), suggesting that as tumors progress, they develop enhanced capability to shed DNA into both local and systemic circulation. This detection rate convergence can primarily be attributed to the biological evolution of advanced OSCC. Specifically, advanced OSCC typically exhibits enhanced invasiveness, increased angiogenesis, and elevated metastatic potential. Consequently, these characteristics collectively facilitate the release of tumor DNA into systemic circulation21,22.

Our comparative mutation analysis between sample types further established the complementary nature of these liquid biopsy approaches. The distinct yet overlapping mutation profiles between oral rinse and plasma samples likely represent different aspects of tumor spatial heterogeneity and clonal evolution23. Importantly, unique mutations in liquid biopsies not detected in primary tumor tissue may originate from minor subclones or metastatic deposits, reflecting the ability of liquid biopsies to capture tumor heterogeneity more comprehensively than single-site tissue sampling24. This biological heterogeneity provides the rationale for our finding that combined testing significantly improved detection sensitivity to 48.6%, which represents a substantial advance over previous single-sample approaches.11,25, Collectively, these findings emphasize the value of integrated multi-compartment liquid biopsy strategies for comprehensive mutation profiling in OSCC.

We identified seven key mutated genes (TP53, TERT, IKZF1, EP300, MYC, EGFR, and PIK3CA) and two clinical factors (T stage and lymph node metastasis) associated with recurrence risk, validating the model in the TCGA dataset. Among these, EGFR, PIK3CA, MYC, TP53, and TERT emerged as the most significant predictors, highlighting their critical role in OSCC progression. These findings align with existing literature and further elucidate the importance of these genes in the molecular pathological mechanisms of OSCC.

TP53 mutations are the most common genetic alterations in HPV-negative HNSCC, with frequencies ranging from 73% to 100% in published literature26 and 60.1% (74/123) in our cohort, profoundly influencing tumor biology and treatment response. The clinical significance of these mutations stems from their unique molecular pathological mechanisms: mutant p53 not only loses the tumor-suppressive functions of wild-type p53 but also acquires new abilities to actively promote cancer progression. Stein et al. emphasized in their review that this “gain-of-function” makes mutant p53 an important therapeutic target, especially considering that approximately 50% of human cancers carry p53 mutations. At the molecular level, mutant p53 interacts with various transcription factors and effector molecules, creating a molecular environment favorable for tumor growth, invasion, and metastasis27. This concept has been validated in clinical research. In a 7-year prospective multicenter study involving 560 HNSCC patients, Poeta et al. classified TP53 mutations as disruptive or non-disruptive based on the predicted degree of disturbance to protein structure. The results indicated that the presence of any TP53 mutation, compared to wild-type TP53, was associated with decreased overall survival17. Further deepening this finding, Caponio et al. developed a computational algorithm through systematic bioinformatic analysis to precisely classify TP53 mutations into high-risk or low-risk subtypes, confirming that this classification method could effectively predict survival outcomes in HNSCC patients27. Notably, in their follow-up study, they further validated the clinical relevance of this classification system, finding that patients carrying high-risk TP53 mutations showed significantly reduced treatment response, with 3.3 times higher odds of stable disease or progression compared to wild-type patients28. This body of evidence aligns closely with our research results, explaining why TP53 mutations demonstrate such high predictive weight in our prediction model.

In our study, the frequency of TERT promoter mutations (50.4%, 62/123) was significantly higher than that reported in the TCGA-HNSCC database (0.6%, 3/508). Indeed, this disparity may be closely associated with HPV expression. Concurrently, multiple studies have demonstrated that TERT promoter mutations are prevalent in HPV-negative OSCC patients and strongly correlate with adverse prognosis29,30. More importantly, TERT promoter mutations enhance telomerase expression by creating new ETS transcription factor binding sites, thereby conferring cells with unlimited replicative potential31. Moreover, TERT exerts non-telomeric functions through signaling pathways such as WNT/β-catenin, NF-κB, and MYC, further promoting tumor invasion, metastasis, and immune evasion32. Notably, in various cancers, including bladder neuroendocrine carcinoma, metastatic clear cell renal cell carcinoma, and metastatic non-clear cell renal cell carcinoma, research consistently reveals high mutation rates in TP53 and TERT33,34,35. Furthermore, studies suggest a potential synergistic effect between these gene mutations in tumor progression. Specifically, TP53 mutations may compromise cell cycle regulation and DNA repair mechanisms, while TERT promoter mutations enhance telomerase activity, enabling tumor cells to acquire immortality36,37,38.

Both in our study and in the external validation using TCGA data, EGFR, PIK3CA, and MYC exhibited notably high mutation frequencies. Furthermore, at the molecular signaling pathway level, EGFR, PIK3CA, and MYC collectively form an interconnected signaling network that regulates cellular proliferation and survival39,40,41. EGFR, functioning as an upstream regulator, modulates multiple downstream signaling cascades through its tyrosine kinase activity42,43. When constitutively activated by factors such as gene amplification, EGFR dimerize at the cell membrane, subsequently activating two critical downstream pathways: the MAPK (Ras-Raf-MEK-Erk) pathway and the PI3K/AKT/mTOR pathway encoded by PIK3CA44,45. These pathways promote cellular proliferation by upregulating key genes such as CCND1/cyclin D1, while simultaneously activating MYC transcription through phosphorylation and promotion of Mad1 degradation. As a downstream effector, MYC controls numerous genes involved in cellular growth and metabolism46.

Functional interactions exist among these three genes: EGFR activation enhances MYC expression, while PIK3CA mutations increase MYC protein stability by inhibiting its degradation47,48. PIK3CA mutations generate constitutively active kinase variants, leading to aberrant activation of the PI3K/AKT/mTOR signaling pathway49, which converges with and amplifies EGFR signaling39. This interactive mechanism elucidates why the co-occurrence of these genetic alterations has significant implications for tumor progression.

In oral diseases, research demonstrates that these molecular alterations are closely associated with disease progression and treatment response. Atrophic/erosive oral lichen planus exhibits significantly higher EGFR expression compared to control groups, suggesting a greater potential for malignant transformation50. PIK3CA mutations play a crucial role in the progression of oral potentially malignant disorders to invasive carcinomas, with evidence indicating that these mutations can emerge during early transformation51,52.

Regarding treatment response, EGFR mutations and chromosome 11q13 amplifications (encompassing CCND1, FGF3, FGF4, and FGF19) are significantly associated with decreased PFS in HNSCC patients receiving PD-1 inhibitor immunotherapy, serving as potential biomarkers for identifying patients unsuitable for immunotherapy53. However, although EGFR-targeted therapies show efficacy in a subset of HNSCC patients, PIK3CA mutations frequently cause primary or acquired resistance. This explains why strategies targeting these pathways in combination demonstrate superior efficacy in preclinical models54,55, providing a theoretical foundation for future molecularly-guided personalized treatment approaches.

Based on the comprehensive understanding of these functionally relevant driver gene alterations, researchers have integrated these molecular markers into predictive models, creating analytical tools capable of capturing the biological complexity of OSCC. This integrative approach significantly improves the accuracy of recurrence risk stratification (training cohort AUC: 0.869, validation cohort AUC: 0.854), establishing a foundation for precision medicine in OSCC based on molecular characteristics.

The integration of pre-treatment plasma ctDNA status significantly enhanced our model’s predictive capability, consistent with emerging evidence that ctDNA can capture aspects of tumor biology that may not be readily apparent in single-site tissue sampling56. Notably, while ctDNA positive status was significantly associated with adverse clinical outcomes, ctDNA concentration measurements demonstrated limited predictive value, reflecting their distinct biological implications—the former indicates the presence of driver mutations, while the latter primarily reflects overall tumor burden57,58. Our findings demonstrate concordance with the studies by Cui et al. and Dal Secco et al., providing supporting evidence for the potential utility of ctDNA in monitoring the recurrence of OSCC. Our longitudinal monitoring suggested that ctDNA detection may precede clinical recurrence by approximately four months (132 days in saliva samples and 120 days in plasma samples), which aligns with observations by Cui et al., who reported that ctDNA detection potentially anticipated recurrence by 4.4 months (average 2.4 vs. 6.8 months)59. Similarly, Dal Secco et al. indicated that ctDNA detection might identify clinical recurrence several months in advance60. Additionally, our research noted that combining plasma and saliva analysis may enhance detection sensitivity, while Cui et al. also observed that integrating saliva and plasma samples could potentially improve overall tumor monitoring performance. Collectively, these findings suggest that ctDNA monitoring might serve as a complementary approach to routine clinical surveillance, providing auxiliary information for follow-up management of oral cancer patients.

More importantly, we uncovered patterns of treatment-induced molecular evolution suggesting a two-phase model of recurrence: initial persistence of founding driver mutations followed by acquisition of additional alterations that enhance fitness and treatment resistance. The significant increase in VAF of key driver genes such as TP53 and TERT at recurrence (reaching 30.03% and 29.71%, respectively) indicates expansion of resistant clones, while newly acquired FGFR1, ZNF703, and NSD3 amplifications and CDKN2A/B deletion provide complementary oncogenic advantages. These molecular dynamics highlight the importance of adaptive treatment strategies based on sequential liquid biopsies that can identify resistance mechanisms before clinical recurrence.

Several limitations in our study warrant consideration. Although our cohort of 123 patients provided meaningful insights, a larger sample size would strengthen the generalizability of our findings. The 551-gene panel, while comprehensive, was not specifically designed for OSCC, potentially missing some disease-specific mutations. Technical factors, including sample quality and extraction methods, might have influenced ctDNA detection sensitivity, particularly in oral rinse samples. Additionally, our predictive models, though promising, require validation in independent cohorts. Future studies should focus on developing OSCC-specific gene panels, optimizing detection methods, and conducting multicenter validations with longer follow-up periods.

Methods

Patient enrollment

This prospective cohort study was conducted at the Department of Oral and Maxillofacial Surgery, the First Affiliated Hospital of Zhengzhou University, China, between January 2019 and March 2024. The study protocol was reviewed and approved by the Ethics Committee of the Scientific Research Project of the First Affiliated Hospital of Zhengzhou University (approval number: SB20190200). All research involving human participants was performed in accordance with the Declaration of Helsinki. Written informed consent was obtained from all participants prior to their enrollment in the study.

Patients were enrolled based on the following criteria: (1) age ≥18 years; (2) histologically confirmed HPV-negative primary OSCC; (3) no prior radiation therapy, chemotherapy, or surgery; (4) agreement to follow-up protocols; and (5) no history of synchronous or metachronous cancers.

All patients had pathological examinations to confirm the diagnosis. HPV status was determined by p16 immunohistochemistry, with negative p16 expression confirming HPV-negative status.

Tumor staging was performed according to the 8th edition of the American Joint Committee on Cancer TNM classification system61. Following National Comprehensive Cancer Network guidelines, patients with positive cervical lymph nodes, perineural invasion, or close surgical margins received postoperative adjuvant chemoradiotherapy62.

For external validation, we analyzed data from The Cancer Genome Atlas (TCGA)-HNSCC dataset (single-nucleotide variants (SNV) data: n = 508; copy number variations (CNV) data with survival information: n = 485).

Sample collection and DNA extraction

We collected paired tissue, oral rinse, and plasma samples from all patients at baseline before any treatment initiation. Longitudinal monitoring was performed in a subset of 30 patients, with samples collected at pre-defined time points: one-month post-surgery (before adjuvant therapy), and then every three months during follow-up until disease progression was confirmed by RECIST 1.1 criteria63.

Fresh tissue samples were split for hematoxylin and eosin (H&E) confirmation and genomic DNA extraction.

Oral rinse and plasma samples were collected in the morning after overnight fasting. For oral rinse collection, patients first rinsed their mouth with mouthwash or purified water to remove debris. Under unstimulated conditions, patients were instructed to rinse with 10 mL of 0.9% saline solution for 30 seconds, then expectorate into cfDNA collection tubes. This process was repeated twice, yielding a total volume of approximately 20 mL of oral rinse64. Plasma samples were collected using Apostle tubes. All samples were transported at 2–8 °C and subsequently stored at -80 °C until analysis.

For DNA extraction, oral rinse samples were first centrifuged (1600 × g, 15 minutes, 4 °C) to separate the supernatant and pellet fractions. The supernatant was processed for cfDNA extraction using QIAamp DNA Blood Mini Kit (QIAGEN, Hilden, Germany), while the pellet was used for genomic DNA (gDNA) extraction using QIAamp DNA FFPE Tissue Kit (QIAGEN, Hilden, Germany). Plasma cfDNA was isolated using MagMAX DNA Multi-Sample Ultra Kit (Thermo Fisher Scientific, Waltham, MA, USA). DNA quantification was performed on the Qubit fluorometer with dsDNA HS Assay Kit (Thermo Fisher Scientific, Waltham, MA, USA), and fragment analysis was conducted using Agilent 4200 TapeStation (Agilent Technologies, Santa Clara, CA, USA).

Library construction

Probe hybridization capture was employed for library construction. Initially, 15 ng to 200 ng of gDNA was fragmented into 200–350 bp fragments using a Covaris S2 instrument. For library construction, 20–60 ng of cfDNA was used. The fragmented DNA and cfDNA underwent end repair, A-tailing, and adaptor ligation using the KAPA HyperPlus DNA Library Preparation Kit (Roche Diagnostics, Indianapolis, Indiana, United States) and the VAHTS Universal DNA Library Prep Kit (Vazyme, Nanjing, Jiangsu Province, China), respectively. Unligated adaptors were removed via size selection with Agencourt AMPure XP beads (Beckman Coulter, Brea, California, United States). The ligated products were PCR amplified to construct the precapture library. The final library was quantified using the Qubit dsDNA HS Assay Kit (Thermo Fisher Scientific) on a Qubit fluorometer, and its quality was assessed with the Agilent 4200 TapeStation (Agilent Technologies). Commercial reagents and custom probes were used for library construction and hybridization capture.

Library sequencing and bioinformatics analysis

DNA libraries were sequenced on NovaSeq 6000 platform (Illumina, San Diego, CA, USA) with 150 bp paired-end reads. Raw sequencing data were processed through a standardized bioinformatics pipeline. Briefly, reads were quality-filtered using Fastp (v2.20.0) for adapter trimming and low-quality bases65. BWA-MEM algorithm (v0.7.17) performed alignment to the human reference genome hg19 GRCh37. PCR duplicates were removed using Dedup66.

Variant calling was performed using VarDict (v1.5.7), with variant filtering based on the 1000 Genomes Project (August 2015) and ExAC (v0.3) browser67,68. Variants were annotated using InterVar. CNVs were identified using CNVkit (v1.1), and fusions were detected using Factera (v1.4.4)69.

VAF was defined as the percentage of mutant reads among the total reads. ctDNA concentration was calculated in nanograms per milliliter (ng/mL), determined as the product of the total cfDNA concentration and the maximum VAF of somatic mutations70.

Model development and validation

For predictive model construction, the cohort was randomly divided into training (n = 82) and validation (n = 41) sets. Feature selection involved three steps: (1) differential mutation analysis identified genes with >10% mutation frequency, (2) stepwise logistic regression selected significant mutations, and (3) univariate analysis identified significant clinical factors.

Four different machine learning approaches: Random Forest, XGBoost, Gradient Boosting Machine and Elastic Net were performed. For each model type, we constructed seven feature combinations:(1) clinical features only, (2) SNV features only, (3) CNV features only, (4) combined SNV and CNV features, (5) combined SNV and clinical features, (6) combined CNV and clinical features, and (7) integrated SNV, CNV, and clinical features.

For external validation, we constructed a simplified prognostic model in the TCGA-HNSCC cohort, incorporating T stage, N stage, and key CNV mutations identified in our study. This model was evaluated for its ability to stratify patients by overall survival using Cox proportional hazards regression.

For model performance assessment, we employed different evaluation metrics based on the nature of the outcome. For binary outcomes (such as 1-year recurrence), model performance was assessed using receiver operating characteristic (ROC) curve analysis with area under the curve (AUC) as the primary metric. For time-dependent survival outcomes (progression-free survival (PFS) and overall survival (OS)), we employed Harrell’s concordance index (C-index), Akaike Information Criterion (AIC), and Bayesian Information Criterion (BIC) as evaluation metrics. Internal validation was performed through tenfold cross-validation, and external validation was conducted in the validation cohort.

Statistical analysis and model construction

Associations between categorical variables were assessed using Fisher’s exact test (with Chi-square test when expected frequencies ≥5). Continuous variables were compared using the Mann–Whitney U test for two-group comparisons and one-way ANOVA for multiple groups. Correlations were evaluated using Pearson correlation coefficient. Associations between clinical features and ctDNA status were analyzed through univariate analysis, followed by multivariate logistic regression with Nagelkerke’s R² for significant variables (p < 0.05). For analyzing associations between ctDNA concentrations and clinical parameters, univariate and multiple linear regression analyses were used.

For ctDNA concentration, Univariate and multiple regression analyses were used to analyze potential associations between ctDNA concentrations and clinical parameters [Box–Cox transformation was applied to normalize distributions when the Kolmogorov–Smirnov normality test failed; adjusted Exp(β) was reported]. Survival outcomes were analyzed using the Kaplan–Meier method and compared using log-rank tests. Hazard ratios (HR) were calculated using Cox proportional hazards models.

All the statistical analyses and plotting were performed using R software version 4.4 (https://www.r-project.org/). A p value < 0.05 was considered to indicate statistical significance for all analyses.