Background & Summary

Cancer-therapy induced mucositis (CTOM) is a major complication of anti-cancer treatments and has been the focus of numerous studies to identify pathophysiological elements and derive appropriate responses1,2,3,4,5,6,7. Among the keystone drivers in CTOM development and severity are direct epithelial cytotoxicity, leading to apoptosis and atrophy, followed by dysfunction of the epithelial barrier with endotoxin and bacterial translocation through tight mucosal junctions, subsequent innate immune activation and up-regulation of the inflammatory reaction6. The host-microbe interactions and dynamic changes in resident microbial composition (bacteria and fungi) during development of oral CTOM has also been recognized8,9,10,11,12,13,14,15. The development of appropriate anti-microbial therapy is difficult16, hence the need to better understand the role of the oral microbiota in the development of CTOM. While most microbial species cause no harm under healthy conditions, the disruption of the delicate hemostasis between host defense and the commensal microbiome contribute to CTOM17. Next generation sequencing (NGS) approaches (amplicons and shotgun metagenomic sequencing) along with informatics and statistical advances have made it feasible to analyze the diversity and relative abundance of individual microbial taxa, as well as their physiological potential across a wide range of human populations and clinical studies18,19,20,21.

In this study we employed amplicon sequencing to examine the dynamics of bacterial and fungal communities prior, during and after therapy for oral squamous cell carcinoma. Longitudinal sampling captured the development of mucositis stages and the post treatment amelioration of symptoms in affected patients. The primers targeted the bacterial small subunit ribosomal RNA (16S rRNA) genes and the fungal ribosomal internal transcribed spacer (ITS), respectively.

Methods

Cohort study design

The cohort for the longitudinal study included nineteen patients with biopsy-confirmed oral or oropharyngeal squamous cell carcinoma (SCC) and eleven healthy control volunteers, most of which recruited from first-degree relatives of the enrolled cancer patients. The hospital oncology team treated all patients with resection surgery, chemo- or radiation therapy as deemed appropriate, and all participants had a dental evaluation prior to sample collection. Individual clinical information is summarized in Table 1 and further detailed in the Supplementary Table. Excluded from the study were patients with squamous cell carcinoma of larynx or hypopharynx. All patients received both chemotherapy (high dose Cisplatin, weekly Cisplatin, Carbo/Taxol or Carboplatin/ 5- fluorouracil) and radiotherapy (in tumour bed or neck), except for one patient who underwent radiation therapy only. Not included in the study were patients receiving immunotherapy or stereotactic body radiation. The study (recruitment, sample collection and data sharing) was approved by the University of Tennessee Health Science Center Knoxville IRB (study number 4652) and all participants provided written informed consent.

Table 1 Subjects and associated metadata.

Oral cavity squamous cell carcinoma subjects

Eight patients with oral cavity SCC (three males and five females, average age of 65) were enrolled (Table 1 and Supplementary Table S1). One patient was deemed medically unresectable, while the other patients were all treated with definitive surgery. Patients received adjuvant radiation therapy using intensity-modulated radiation therapy (IMRT) with 6MV photons to the oral cavity and nodal basins at 2Gray per fraction to 50 Gy. The primary site was treated using a sequential boost to 60 Gy in five patients and to 70 Gy in two patients with a positive margin. All but one patient received concurrent chemotherapy.

Oro-pharyngeal squamous cell carcinoma subjects

Eleven patients with SCC of the oropharynx (nine males and two females, median age of 60) were enrolled (Table 1). Among them, 9 were p16 positive (82%). All patients were treated with IMRT with standard fractionation. One patient with a positive margin after resection was treated to 66 Gy with all remaining oropharynx patients were treated to 70 Gy. All oropharyngeal patients received concurrent chemotherapy.

Galera ROMAN clinical trial

Five of the cancer patients (OM16, OM 20, OM22, OM24, and OM28) were concomitantly enrolled on the Galera ROMAN phase 3 clinical trial22 and randomized to placebo or Avasopasem infusion daily one hour prior to radiation therapy. All patients enrolled on the Galera trial were treated with standard of care surgery, chemotherapy, and radiation as determined by their physicians, and were not given other mouthwash formulations. The physicians in the study were blinded to treatment arms of the trial.

Control subjects

Cancer-free control subjects were recruited from relatives of enrolled patients and from the general population. They included 4 males and 7 females between the ages of 24–73 years (Table 1 and Supplementary Table S2). Exclusion criteria were recent (within the past month) administration of antibiotics.

Patient assessment

Patients were evaluated weekly for the development of oral mucositis based on the World Health Organization (WHO) Oral Mucositis Assessment Scale23 and the Radiation Therapy Oncology Group (RTOG) Mucositis Assessment Scale. Grade 0 mucositis was reserved for patients with no significant oral findings. Patients with soreness/erythema were classified as grade 1. The presence of ulcers with preservation of ability to eat solid food was described as grade 2, whereas ulcers that required liquid diet only was classified as grade 3. Grade 4 mucositis was diagnosed when alimentation was not possible. The patients’ diet was recorded each week as solid food, soft food, liquids only, or NPO (all nutrition provided through a feeding tube). The presence or absence of thrush was recorded each week along with the use of any antibiotics, steroids and mouthwash (see Supplementary Table S1 for associated metadata).

Sample collection and processing

Unstimulated whole saliva samples and mucosal swabs were collected to evaluate the oral cavity microbiome. Initial samples were collected 7–10 days prior to the initiation of radiation or chemo-radiation treatments. Additional samples were collected again during the third week of radiation therapy (12–15 Gy) at a time point expected to coincide with mild mucositis. A third round of samples was collected between a dose of 60–70 Gy to correspond with severe mucositis stage. A final sample was collected one month after completing therapy. Samples were collected from healthy control participants at similar four time points as the SCC cancer patients to examine normal variations occurring in the oral cavity microbiome of healthy participants over time. Some samples could not be collected from some subjects. A schema of the sampling strategy is presented in Fig. 1.

Fig. 1
figure 1

Longitudinal sampling design for control and cancer subjects. Saliva (S), oral mucosa swab samples from tumour (T) or normal site (N) were collected at weeks 0, 2, 7 and 11. The number indicate absence (0, no shading) or presence (shaded) of mucositis of various grades (1,2,3) and/or thrush (x). Administration of anti-fungal medication is indicated by the star symbols. The cartoons summarize the type of collected samples.

All subjects provided 1 ml of saliva by passive drool into a cryovial using a Saliva Collection Aid (Salimetrics LLC, State College, PA). However, due to xerostomia that develops during radiation therapy collection of saliva was not possible in some patients during the third week of therapy and in most patients during the final week of radiation. Sterile rayon-tipped swabs (Copan Diagnostics Inc, CA, USA) were used to sample the mucosa or site of mucositis within the oral cavity or oropharynx by rolling/rubbing the swab over the affected surface. In cancer patients, one sample was collected from the tumour site or resection bed at each time point and a second sample was collected from the contralateral oral cavity or oropharynx. In healthy volunteers only a single swab was collected from the entire buccal mucosa at each time point. Swabs were immediately preserved in 1 ml of S1 Lysis buffer (PureLink™ Microbiome DNA Purification Kit, Invitrogen, Thermo Fisher Scientific, Waltham, MA). Swabs and saliva were transferred to the University of Tennessee Medical Canter Biorepository and stored at −80 °C until further processing. All participants and retrieved specimens were assigned a unique code in the biorepository. Only de-identified specimens were used for DNA extraction, amplicon sequencing, and data analyses.

For DNA extraction we used the PureLink Microbiome DNA Purification Kit following the manufacturers’ protocol for saliva and buccal swaps. Either 200 µl of saliva mixed with 600 µl of S1 lysis buffer or 800 µl of lysis buffer from preserved swabs were used. DNA concentration was determined using the Qubit dsDNA BR Assay kit (Life Technologies, Thermo Scientific Inc, Waltham, MA) with a Qubit 4 Fluorometer (Invitrogen, Thermo Fisher Scientific, Waltham, MA). Extracted DNA was aliquoted and stored at −80 °C.

Amplicon library preparation and sequencing

To generate the amplicon libraries, we used the Zymo Quick-16S™ NGS Library Prep Kit (Zymo Research Corporation, Irvine, CA, USA) with the multi-step Real Time Polymerase Chain Reaction approach, following manufacturer’s instructions. In the first step, Targeted Sequence Amplification was performed using a forward and reverse primer mix that targets the V4 variable region of the 16S rRNA gene of Archaea and Bacteria (515FYM: 5′GTGYCAGCMGCCGCGGTAA; 515F_TM7:5′GTGCCAGCMGCCGCGGTCA; 515Propioni: 5′ GTGCCAGCAGCCGCGGTGA; 806 R: 5′GGACTACNVGGGTWTCTAAT; 806R_Propioni: 5′GGACTACCAGGGTATCTAAG, at a ratio 40:5:5:45:5). The primers were fused to Illumina adapter sequences (5′ TCGTCGGCAGCGTCAGATGTGTATAAGAGACAG for the forward primers and 5′ GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAG for the reverse primers), according to manufacturer specifications. For fungi, we used primers targeting the ITS2 region, ITSF:5′CATCGATGAAGAACGCAG and ITSR:5′ TCCTSCGCTTATTGATATGC, also fused to Illumina adapters. The reactions were set up according to kit’s instructions and the PCR was performed in a BioRad iCycler instrument with the following conditions: 95 °C for 10 min, then 32 cycles of 95 °C for 30 seconds, 55 °C for 30 seconds, and 72 °C for 3 min, followed by fluorescence read. After completion of target sequence amplification, enzymatic reaction cleanup was performed at 37 °C for 15 minutes, followed by inactivation at 95 °C for 10 min. Amplicon libraries were then barcoded using the ZymoBIOMICS Index Primer Set’s A, B, and C in a 5-cycle real time PCR reaction, following the kit protocol. Barcoded amplicons were pooled and purified using the Zymo Select-a-Size DNA Clean & Concentrator MagBead kit. Paired end sequencing (2 × 300 bp) of pooled amplicon libraries was performed on an Illumina MiSeq instrument (Illumina, San Diego, CA).

Sequence analyses

Forward and reverse sequence reads were demultiplexed on the Miseq instrument and assigned to the individual samples based on barcodes. The sequences were imported as paired fastq files into QIIME224. The DADA2 plugin was used to trim, denoise, pair, purge chimeras and select amplicon sequence variants (ASVs), using the command “qiime dada2 denoise-paired”. Taxonomy was assigned using a pre-trained Naive Bayes classifier based on the SILVA 16S rRNA database (v138)25 trimmed to the 515 F/806 R region or the Unite (ITS) database26 for fungal sequences. Unassigned sequences, mitochondrial, and chloroplast sequences were removed. Summaries of the most relatively abundant genera and species of bacteria and fungi, averaged by subject category, are shown in Fig. 2. Sequence variant based richness and Shannon diversity were calculated within QIIME2 and tested for phenotype and sample type with ANOVA and Tukey HSD. Beta diversity (using Bray-Curtis dissimilarity distances) was calculated with the Phyloseq package in R and tested for phenotype and sample type with Permanova.

Fig. 2
figure 2

Microbiome diversity in healthy controls and cancer subjects. The most relatively abundant bacteria and fungi identified in healthy and cancer subjects were averaged across all sample types and timepoints for healthy subjects. For cancer subjects, data was averaged for weeks 0 and 7, respectively.

Data Record

The amplicon sequence data and all the associated metadata have been deposited in the Sequence Reads Archive (SRA) of the National Canter for Biotechnology Information (NCBI) under the SRA Study SRP53177027 as part of BioProject PRJNA1158838. Metadata includes clinical information (cancer stage, mucositis, treatments) as well as patient/healthy controls demographics and ancillary data (nutrition, dentition, tobacco and alcohol use). That data is also provided in the Supplementary Table S1, S2.

Technical Validation

Technical validation steps were performed using standard best practices in microbial ecology. Negative control DNA extraction and amplicon sequencing were performed and supported the absence of reagents or handling-associated contamination. The forward and reverse fastq sequence reads were trimmed during the assembly process based on quality (>Q30). Error correction and chimera detection by the DADA software filtered out PCR and sequencing artifacts. The median number of raw sequences was 54,406 (16S) and 51,037 (ITS) per sample, respectively. Following read assembly and quality filtering, the datasets contained a median 35,562 (16S) and 22,794 (ITS) sequences per sample. The amplicons were assigned to bacterial and fungal taxa by applying the QIIME2 classify-sklearn algorithm to the reference SILVA and Unite databases. Across all samples and subjects, the 16S ASVs were assigned to 323 bacterial taxa and the ITS ASVs to 431 fungal taxa (Supplementary Table S3, S4). Microbial and fungal alpha diversity varied across sampling site but did not correlate significantly with subject health category or sampling time, although the diversity spread was higher in cancer subjects (Fig. 3). Permanova tests indicate that the community composition (beta diversity, for both bacterial and fungal) were different between the healthy and the cancer cohorts (p < 0.001) and, in the disease group, were also impacted by chemotherapy (p < 0.001), participation in the Galera trial (p < 0.001) the presence of mucositis (bacteria p < 0.001, fungi p < 0.005, respectively) or thrush (p < 0.001). No significant differences were associated with the type of cancer, sample collection site and lifestyle metadata.

Fig. 3
figure 3

Microbiome alpha diversity. Shannon alpha diversity was calculated for healthy control and cancer cohorts for each sampling timepoint.

Usage Notes

A limitation of our study is the small cohort of subjects that were sampled relative to the many variables associated with oral cancer, different treatments, secondary complications and outcomes. The data we presented here could inform future studies to further address the complexity of oral cancer-microbiome-mucositis connections.

The raw sequence data (forward and reverse fastq sequences) can be downloaded from the NCBI SRA database. The following scripts can be used in QIIME2 to perform read processing and taxonomic assignments using the 16S rRNA and ITS references available in the QIIME2 repository. The output qza files can serve as input for a variety of statistical and diversity analyses in conjunction with other similar datasets from similar studies.

16S reads processing

# 16S processing for two different runs

qiime tools import --type ‘SampleData[PairedEndSequencesWithQuality]’ \

--input-path /16Sreads/ \

--output-path 16S_demux.qza \

--input-format CasavaOneEightSingleLanePerSampleDirFmt

qiime demux summarize \

--i-data 16S_demux.qza \

--o-visualization 16S_demux.qzv

qiime dada2 denoise-paired --i-demultiplexed-seqs. 16S_demux.qza \

--p-trim-left-f 19 \

--p-trim-left-r 19 \

--p-trunc-len-f 220 \

--p-trunc-len-r 220 \

--o-table 16S-dada2table.qza \

--o-representative-sequences 16S-rep-seqs.qza \

--o-denoising-stats 16S-denoising-stats.qza \

qiime metadata tabulate \

--m-input-file 16S-denoising-stats.qza \

--o-visualization 16S-stats.qzv

qiime feature-classifier classify-sklearn \

--i-classifier silva-132-99-515-806-nb-classifier.qza \

--i-reads 16S-rep-seqs.qza \

--o-classification 16S-taxonomy.qza \

qiime metadata tabulate \

--m-input-file 16S-taxonomy.qza \

--o-visualization 16S-taxonomy.qzv

ITS reads processing

qiime tools import–type ‘SampleData[PairedEndSequencesWithQuality]’ \

--input-path /ITSreads/ \

--output-path ITS_demux.qza \

--input-format CasavaOneEightSingleLanePerSampleDirFmt

qiime demux summarize \

--i-data path ITS_demux.qza \

--o-visualization ITS_demux.qzv

qiime dada2 denoise-paired–i-demultiplexed-seqs ITS-demux.qza \

--p-trim-left-f 19 \

--p-trim-left-r 19 \

--p-trunc-len-f 220 \

--p-trunc-len-r 220 \

--o-table ITS-dada2table.qza \

--o-denoising-stats ITS-denoising-stats.qza \

qiime metadata tabulate \

--m-input-file ITS-denoising-stats.qza \

--o-visualization ITS-stats.qzv

qiime feature-classifier classify-sklearn \

--i-classifier unite-ver8-dynamic-classifier-07Nov2019.qza \

--i-reads ITS-rep-seqs.qza \

--o-classification ITS-taxonomy.qza \