Introduction

Nasopharyngeal carcinomas (NPC) are rare head and neck malignancies which arise from nasopharynx epithelial mucosa1. NPC accounted for less than 1% (129,000 cases) of cancers globally in 2018; over 70% of NPC incidence is reported from East and Southeast Asia alone1,2,3. In the United States, NPC accounts for 2% of head and neck cancer burden, with an incidence rate of less than 1 per 100,000 person-years4,5. The striking geographic disparity suggests an etiology featuring a mix of genetics, Epstein Barr Virus (EBV) infection, dietary habits, lifestyle, and exposures to environmental risk factors6,7. Historically, NPC’s poor prognosis has largely been attributed to lack of awareness of symptoms and risk, although detection of early-stage NPC and appropriate targeted therapy has been shown to mitigate this outlook6,7,8. As a result, effective population screening using plasma EBV DNA has emerged a useful tool in reducing population NPC burden4,9.

An underappreciated viral risk factor for NPC may be human papillomavirus (HPV), which has otherwise been extensively studied in oropharyngeal carcinoma (OPC)10,11,12. Among prior investigations of HPV association with NPC, several single-site studies within the USA and United Kingdom detected significant prevalence of unique NPC-subtypes comprised of EBV-negative, HPV-positive (HPV(+)) infection status while observing prognostic differences among HPV(+) and HPV-negative (HPV(−)) NPC-subtype groups13,14,15. In addition, larger studies across national-spanning patient cohorts have observed associations between history of HPV infection and higher incidence of NPC, alongside poorer prognostic outcomes of increased tumor burden independent of EBV-infection status, age, race-ethnicity, and smoking behaviors16,17,18. Additionally, growing epidemiologic trends have showcased robust independent associations between HPV-status and primary sites outside of traditional OPC sites19.

Given these clinical associations of HPV with NPC and other head and neck cancers, social determinants of health (SDoH), such as age, race-ethnicity, sex, insurance, and income level, have also been implicated in several HPV-related investigations of OPC, as well as in non-HPV-focused NPC studies observing Asian and black race-ethnicity5,8,20,21,22,23,24. However, SDoH-focused investigations into NPC on HPV-status with more recent samplings of national populations (2010 and after) have yet to be performed. This is in spite of prior evidence on the propensity for HPV-infection to independently influence squamous cell cancer sites outside of OPC sites. Moreover, multivariable models accounting for a wider variety of SDoH to allow enhanced real-world generalizability, namely comprising both individual and community-level factors, have yet to be considered.

To address these shortcomings, our retrospective cohort study aims to utilize comprehensive multivariable models for examining differences in SDoH-contribution of mortality, preliminary staging, and treatment delay across HPV(+) and HPV(−) NPCs. In doing so, this study incorporates the Yost Index, a composite metric comprising seven differentially weighted census-level socioeconomic status factors, alongside numerous individual-level SDoH-variables to predict specific, adjusted relationships between certain SDoH-factors and HPV-differentiated NPC outcomes.

Methods

This retrospective cohort study follows the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) guidelines. Due to this study utilizing retrospective, deidentified public databases, the Northwestern University Institutional Review Board deems this research as non-human research and therefore this study is exempted from the approval process per their institutional policy.

Databases

The National Cancer Institute-Surveillance, Epidemiology, and End Results Program (SEER) database contains national datasets of patient variables, pathological characteristics, treatment modalities, and prognostic outcomes. This study utilizes the specially authorized incidence database for head and neck tumors with linked census tract-level socioeconomic status (SES) and rurality-urbanicity, spanning patients from 2010 to 2018 and 18 registries across the USA.

The Yost Index is a census tract-level, composite SES measure provided within the specialized SEER dataset aforementioned. It is comprised of seven census-level measures for each of the 72,158 USA-Census tracts: median household income, percent below 150% of poverty line, median house value, median rent, percent working class, percent unemployed, and education index level18. To maintain chronological consistency of these measures based on the date of tumor diagnosis, these socioeconomic status was based on averaged measures of the seven factors across various sets of American Community Survey (ACS)/USA Census 5-year estimates of which the year of diagnosis overlaps within the ACS estimate (i.e. a tumor diagnosed in 2012 will have their Yost Index score and quintile calculated from the averaged measures of the 2008–2012, 2009–2013, 2010–2014, 2011–2015, and 2012–2016 ACS 5-year estimates).

Population definitions

All cases of nasopharyngeal carcinomas were selected from International Classification of Disease for Oncology version 3.8 (ICD‐O-3) tumor site codes [C11.0–11.9] between 2010 and 2018. Disease or histological classifications of squamous cell carcinoma were determined by ICD-O-3 codes recategorized by SEER-determined labels. Inclusion criteria were adult patients (20 + years) with confirmed HPV(+) or HPV(−) status, which was determined by IHC for p16 expression per SEER-documentation.

Characteristics and outcome variable definitions

The individual-level sociodemographic/SDoH variables consisted of age (< 65 and 65+ years as the categories), biological sex, race-ethnicity (Non-Hispanic White, Black, Hispanic, Asian and Native Pacific Islander, Native American, and Unknown) on a per-patient basis. The census-level SDoH variables consisted of census-level Yost-Index-SES score and a composite urbanicity-rurality comprised of the two available SEER-variables. For the former, patients were split into two equivalently sized cohorts of “Lower SES,” which encompassed quintiles of “Lowest,” “Lower” and “Middle” SES, which comprised quintiles of “Higher” and “Highest” SES. For the latter, rurality status was determined when the patient either had a RUCA designation of “Non-Urban” or a URIC designation of “Mostly Rural” or “All Rural.”

Outcome variables consisted of the following: cancer/disease-specific and all-cause-related mortality was determined by SEER-categorized variables, alongside time points for three-year and five-year mortality. Advanced staging for malignant tumors was based on SEER-designated variables labeled as “Stage IV,” “distant [expansion],” or “distal [expansion]” and recoded under SEER’s most recent tumor staging data: American Joint Committee on Cancer, 6th Edition (AJCC-6) classifications. Delay in treatment after diagnosis was based on months after preliminary diagnosis with designated timepoints of 1-month and 3-month delays.

For the specific specialized dataset utilized, all sociodemographic factors available and permitted were utilized in multivariable models as independent and dependent variable components.

Statistical analysis

All multivariable models consisted of the following covariates: younger age (being younger than 65 years old upon diagnosis, reference level being 65 and older), male sex (reference being female sex), being of black race-ethnicity (reference not being of black race-ethnicity), being of Hispanic race-ethnicity (reference not being of Hispanic race-ethnicity), being of Asian or Pacific Islander race-ethnicity (reference not being of Asian or Pacific Islander race-ethnicity), Yost Index/”Lower Socioeconomic Status” (being in the lowest, lower, or middle SES quintiles, reference being in the higher, highest SES quintiles), and increased rurality (being of “rural” RUCA-designation or “all rural” and “mostly rural” URIC-designations; reference being “urban” RUCA-designation or “all urban” and “mostly urban” URIC-designations); stratifying variate of HPV(+) or HPV(−) status. Covariates were selected based on inclusion in previous studies demonstrating granular disparities amongst socioeconomics and demographics in head and neck cancers based on HPV status and nasopharyngeal cancer outcomes5,10,22,25,26,27.

Multivariable cox-proportional hazard models were used to evaluate factors contributing to quicker time-to-mortal event secondary to all causes. Multivariable logistic regression models were also used to delineate covariate effects on aspects of mortality not well-represented within cox-proportional models of overall mortality/survival, which principally focuses on the whole timespan of mortal events with time as a continuous covariate while censoring significant events that obscure covariate effects at specific timepoints28. Multivariable logistic regression models were also used to assess odds of having a late-presenting stage (based on SEER-recoded combined staging, reference level being diagnosed with Stage I-III) and 1 or 3-month-or-more delay-in-treatment after diagnosis (reference level being treatment receipt in less than 1 months or 3 months, respectively) based on prior literature standards for time cutoffs29,30. These regressions were performed as separate, multivariable models within HPV(+)and HPV(−) groups based on traditionally noted demographic differences among HPV-statused populations31,32. Two-sided p-values were calculated and p < 0.05 was considered statistically significant. All statistical analyses were performed using R version 4.2.1.

Results

Patient demographics

Among the 1222 patients with NPC that were identified, 820 (57.7%) were HPV(−) and 402 (28.3%) were HPV(+). The highest represented sociodemographic characteristics were 45–64 years of age (n = 669, 54.7%), males (n = 881 , 72.1%), and white race (n = 521, 42.6%). The highest represented census-level characteristics were Low-Middle Socioeconomic Status by Yost-Index Score (n = 582, 50.2%), “All Urban” rural-urbanicity (n = 888, 75.3%). The highest represented clinical characteristics include NPC-staging of I, II, or III per AJCC-6 overall TNM-classifications (n = 653, 54.6%). These characteristics stratified by HPV status are further described in Table 1.

Table 1 Patient sample clinicodemographic characteristics stratified by HPV status.

Mortality

Mortality was assessed using four outcomes: time to overall mortality, and whether or not a cancer-specific mortality, 3-year all-cause mortality, or 5-year all-cause mortality would occur across HPV status cohorts.

Predictors of time to overall mortality

In both HPV(+) and HPV(−) NPCs, younger age less than 65 years old showed decreased overall mortality risk (respectively, HR 0.56, 95%CI 0.38–0.81, p = 0.003; HR 0.57, 0.45–0.73, p < 0.001), while being male showed increased overall mortality risk (respectively, HR 1.71, 95%CI 1.11–2.64, p = 0.001; HR 1.42, 95%CI 1.07–1.87, p = 0.012).

Within only HPV(−) NPCs, being of Hispanic (HR 0.51, 95%CI 0.32–0.82, p = 0.002) and Asian or Pacific Islander race-ethnicity (0.44, 0.33–0.58, p < 0.001) showed associations of decreased overall mortality risk while low Yost-Index (i.e. poor socioeconomic status) showed associations of increased risk of quicker time-to-mortal event (1.44, 1.14–1.87, p = 0.003). These are tabulated in Table 2.

Table 2 Cox-proportional hazards of time-to-overall mortal event.

Effectors of cancer/disease-specific mortality

With HPV(−) status, younger age less than 65 years old (OR 3.24, 95%CI 1.68–6.37, p < 0.001) and Hispanic race (6.11, 1.14–114, p = 0.032) uniquely demonstrated increased odds of cancer-specific mortality. Also unique to HPV(−) status NPCs, male sex (0.30, 0.11–0.73, p = 0.006) demonstrated decreased odds of cancer-specific mortality. These results are tabulated in Table 3.

Table 3 Cancer specific mortality across HPV status.

Effectors of 3-year all-cause mortality

In both HPV(+) and HPV(−) cancers, younger age showed decreased odds of 3-year all-cause mortality (respectively OR 0.43, 95%CI 0.23–0.80, p = 0.007; 0.41, 0.28–0.61, p < 0.001). Unique to HPV(+), Male sex demonstrated increased odds of 3-year all-cause mortality (1.81, 1.01–3.31, p = 0.049). Unique to HPV(−), being of Asian or Pacific Islander race-ethnicity (0.36, 0.24–0.55, p < 0.001) and Hispanic race-ethnicity (0.40, 0.20–0.78, p = 0.007) showed decreased odds. Also only found within HPV(−) cohorts, low Yost-Index demonstrated increased odds of 3-year all-cause mortality (1.72, 1.17–2.53, p = 0.006). These results are tabulated in Table 4.

Table 4 3-year all-cause mortality across HPV status.

Effectors of 5-year all-cause mortality

Amongst both HPV(+) and HPV(−) NPCs, younger age (respectively OR 0.30, 95%CI 0.11–0.71, p = 0.005; 0.38, 0.23–0.62, p < 0.001) and being of Asian or Pacific Islander race-ethnicity (respectively 0.37, 0.17–0.78, p = 0.009; 0.37, 0.23–0.60, p < 0.001) showed decreased odds of 5-year all-cause mortality. Unique to HPV(−), being of Hispanic race-ethnicity demonstrated decreased odds (0.38, 0.18–0.81, p = 0.012) while low Yost-Index had demonstrated increased odds of 5-year all-cause mortality (1.71, 1.11–2.66, p = 0.016). These results are tabulated in Table 5.

Table 5 5-year all-cause mortality across HPV status.

Effectors of advanced preliminary staging

HPV(−) NPCs uniquely showed younger age (OR 1.48, 95%CI 1.07–2.05, p = 0.018) and Black race (1.87, 1.14–3.09, p = 0.013) demonstrated increased odds of advanced preliminary staging. These are tabulated in Table 6.

Table 6 Advanced preliminary staging across HPV status.

Effectors of delay of treatment from diagnosis

Delay-in-treatment from diagnosis was assessed as primary treatment occurring at or after 1 month and 3 months since diagnosis of NPC. For patients diagnosed with HPV(−) NPCs, low Yost-Index uniquely showed increased odds of having a 1-month delay of treatment initiation (1.61, 1.15–2.25, p = 0.005) that markedly increased with having a 3-month delay of treatment (1.97, 1.16–3.42, p = 0.012). For patients diagnosed with HPV(+) NPCs, younger age was associated with decreased odds of having a 1-month delay of treatment (0.47, 0.27–0.84, p = 0.012). These results are tabulated in Table 7.

Table 7 Delay of treatment from diagnosis across HPV status.

Discussion

To our knowledge, this is the first study to show differences in detrimental contributions of a multitude of individual- and community-level SDoH-factors for nasopharyngeal carcinoma stratified by HPV status in outcomes of mortality, advanced preliminary staging, and delay of treatment.

NPC prognosis, like in other head and neck cancers, has been shown to be associated with several predictors of mortality such as age above 65, advanced tumor staging, White race, and lack of insurance8,33,34,35,36. While HPV(+) is a protective factor in non-NPC head and neck cancer, the prognostic role of HPV in NPC remains unclear10,20. In an analysis of 517 patients from the SEER database, Wotman et al. found that cause-specific survival was not associated with HPV status, although intergroup ethnicity differences manifested significant differences25. Interestingly, they found that 34.8% of patients in their study cohort were positive for HPV(−) associated NPC where East Asians and individuals over the age of 25 were more likely to be HPV(−)25. Tham et al. conducted a meta-analysis of HPV significance in NPC with pooled hazard ratios demonstrating no association with overall survival, although subgroup analysis and consideration of the different study designs included make it difficult to draw any definitive conclusions37. Nevertheless, evidence from USA population studies uncovering the clinical phenomenon of HPV(+) as opposed to EBV(+) NPC as well as the endemic burden of HPV provide impetus to better understand prognostic significance38,39.

We found several meaningful implications of HPV status differentiation in cancer/disease-specific mortality, 3-year all-cause mortality, and 5-year all-cause mortality. Patients with HPV(−) NPC cancer-specific mortality were more likely to be Hispanic or less than 65 years old, and they were more likely to be Male. In a retrospective study of the National Cancer Data Base (NCDB), Verma et al. found an association between age as a continuous variable with HPV(+) NPC40. Furthermore, younger age fails to be uniquely predicted across HPV status for both 3- and 5-year all-cause mortality, but Hispanic race presents as negatively predicted by HPV(−) status, potentially indicating that Hispanic race is differentially associated with either cancer-specific or all-cause mortality. To our knowledge, there have been no substantive associations between Hispanic race and HPV status in NPC prognosis, warranting further exploration. 3-year all-cause mortality analysis demonstrates that HPV(−) is a factor associated negatively with Asian race as opposed to White race, contradicting other studies finding an association with HPV(+) and White patients25,37,41,42. However, these studies may be limited by specific subsets of endemic populations, as Kang et al. found p16 expression in more than half of South Korean patients with NPC43. Nevertheless, 5-year all-cause mortality analysis fails to uniquely demonstrate HPV(−) as negatively associated with Asian/Pacific Islander race-ethnicity.

Of the notable differences in effects, slower time to overall mortality per cox-proportional models was observed with NPC patients of Hispanic or Asian-Pacific Islander race-ethnicity across both HPV(+) and HPV(−) groups. However, these observed associations were only present within HPV(−) observed occurrences of 3-year mortality and only became present across both HPV-cohorts for observed occurrences of 5-year mortality. Furthermore, when looking at disease-specific mortality, among patients that did suffer mortal events regardless of when it occurred, Hispanic NPC-HPV(−) patients displayed uniquely higher odds in comparison to other NPC patients of other races/ethnicities.

For both 3- and 5- year all-cause but not cancer-specific mortality, HPV(−) was positively associated with low Yost index, suggesting that patients with HPV(−) NPC are more likely to experience higher risks of earlier mortality with lower composite SES. Given that HPV(−) demographics (e.g. male, older) have traditionally shown riskier individual behaviors, these unique aspects of earlier all-cause mortality but not cancer-specific mortality remain consistent with these prior observed relationships. Consistent with these interpretations, earlier 3-year timepoints among HPV(−) showcase significant but lower magnitude detrimental associations with Yost-Index and other sociodemographic factors compared to the effect sizes observed with later 5-year timepoints. Such increases from 3-year to 5-years showcase an expected outcome of the above outlined relationship, where riskier individual behaviors over longer periods of time would experience increased mortality risks.

Additionally, for patients with a delay of treatment, HPV(−) was also uniquely positively associated with low Yost-Index. The Yost Index has previously been utilized in a variety of cancer studies ranging from intracranial tumor metastases to head and neck cancers such as NPC where there is a consistent association between socioeconomic status and disease prognosis5,44,45. Furthermore, these studies have displayedimportant impacts of socioeconomic disadvantage with treatment modality, clinical course, and preliminary tumor staging upon diagnosis5,44,45,46,47. The results of the present study support these trends and suggest that HPV(−) NPC possesses unique detrimental associations with socioeconomic disadvantage underlying lack of access to care, inequality of resources, or higher risk behaviors and environmental exposures in the USA cancer population48,49,50,51. Furthermore, HPV(−) NPC showed uniquely positive associations ofyounger age and Black race with advanced tumor staging upon preliminary diagnosis, corresponding with the literature on social factors like minority race and HPV(+) status on favorable head and neck cancer clinical presentations10,11,20,34. As more data accumulates on how HPV status may mediate the association between socioeconomics and demographic risk, mortality, treatment modality, and clinical course, it is necessary to design screening, intervention, and future research inclusive of quality of life and psychological considerations as well52,53.

This study has several limitations. First, while this analysis demonstrates unique prediction of patient exposure characteristics given HPV status, it does not necessarily indicate translation of exposure into either HPV(−) or HPV(+) NPC. In addition, the majority of studies on HPV in NPC are retrospective database studies of varying designs, sample sizes, and patient populations. Therefore, it will be important to substantiate our results and those of the literature alongside longitudinal prospective analyses or larger retrospective analyses employing more specific patient selection and exposure definition criteria. Another limitation of this study is the use of HPV status based on SEER p16 status data, which may not be entirely concordant but was necessitated due to it being the only designated variable available for determining HPV-status. In addition, our HPV-stratified approach of sociodemographic analyses may inherently present biases from which excluded mechanisms associated with changes in HPV-status are not properly ascertained. It should be denoted that the present study’s approach should restrict the findings of significant associations solely within the selected HPV-cohorts themselves, rather than cross-comparing to assess differential effects comparisons. It is also important to note that use of the Yost index approximates patient disadvantage with their census tracts, a useful and validated methodology in population-based health services research. Boscoe et al. further substantiated the utility of the Yost index by comparing it with other similarly designed socioeconomic indices, demonstrating high face validity with a cautionary note for inherent uncertainties accounting for up to ± 8 percentiles54,55. Future research should look into proprietary datasets that would allow the linkage of heterogenous, community-level SDoH-variables that go beyond the categories represented in our current models. Finally, this study examines HPV status in isolation of EBV status, although this interplay is necessary to explore further in extrapolation to the larger United States population.

Using multivariable models assessing individual-level and census-level measures of social determinant factors, this national investigation of nasopharyngeal carcinomas displayed disparities in outcomes of mortality, advanced preliminary staging, and delay of treatment differentiated by confirmed HPV status. It was found that the role of HPV status of race and ethnicity in mortality highlights a differential effect in Hispanic patients for cancer-specific vs all-cause mortality while complicating our understanding of Asian race and mortality across HPV status in NPC. Furthermore, HPV(−) status was positively associated with low community-level socioeconomic status, as measured by the Yost index, in all-cause mortality and treatment delay and positively associated with younger age and Black race in advanced preliminary staging. Altogether, our findings demonstrate that HPV-status is an important effector in understanding the impact of social determinants on the NPC population, promoting more directable means of recognizing and establishing health strategies and risk mitigation for providers and policymakers alike in tackling SDoH-disparities of HPV-differentiated nasopharyngeal carcinoma.