Abstract
Metastasis causes most cancer-related deaths in colorectal carcinoma (CRC), and microbiome markers may have prognostic value. We hypothesized that primary tumor microbiomes predict distant metastases. We analyzed 5-year metastasis-free survival (MFS) in a retrospective cohort of 900 ORIEN CRC tumor microbiomes (RNAseq). ORIEN findings were validated on an independent cohort using 16S rDNA sequencing and pathobiont-specific qPCR. Microbiome alpha diversity was higher in primary tumors than metastases and positively correlated with metastasis risk. Microbiome beta diversity distinguished primary vs. metastasis and predicted 5-year MFS. High primary tumor abundance of B. fragilis and low F. nucleatum were associated with short MFS. Enterobacteriaceae, including E. coli, were enriched in metastases. qPCR identified increased enterotoxigenic B. fragilis and pks + E. coli detection in CRC metastasizers. Microbial co-occurrence analysis identified a 3-species clique that predicts metastasis (OR 1.9 [1.4–2.6]). Results suggest that primary tumor microbiomes and specific pathobionts are precision markers for metastasis risk.
Similar content being viewed by others
Introduction
Mortality in colorectal carcinoma (CRC) is mediated by metastasis, despite expanding multimodal treatment options1. Low-stage primary tumors often have excellent outcomes after surgical resection, but CRC can recur and/or metastasize several years after surgery. For example, locally advanced T-stage tumors (T2-3) without lymph node metastasis (N0) can be managed without adjuvant chemotherapy. However, follow-up is needed to identify the 10–20% of patients who will have distant metastases within 5 years.
The CRC microbiome is composed of intestinal commensal bacteria, and “pathobionts” that contribute to carcinogenesis and disease progression. Intestinal bacteria also colonize CRC distant metastases and modulate the immune microenvironment, such as tumoricidal natural killer cell activity2,3. The most extensive prior research of pathobionts describes Fusobacterium nucleatum ssp. animalis, which adheres to host enterocytes through Fap2 and stimulates Wnt signaling through FadA interactions with E-cadherin to promote carcinogenesis and the epithelial-to-mesenchymal transition4,5. In animal models, F. nucleatum increases the risk of metastasis and may directly contribute to lymphatic or hematogenous spread3,6. F. nucleatum colonizes 10–20% of CRC with a predilection for right-sided location7, and distant metastases3. Other pathobionts have been mechanistically linked to CRC carcinogenesis and progression, such as Escherichia coli strains with biosynthetic genes (pks locus) to produce colibactin, a host DNA alkylating genotoxin8. Pathobiont pks + E. coli promotes carcinogenesis in the setting of colitis8, and the signature mutagenic effects of colibactin are detectable in human CRC9,10. Recent advances toward colibactin synthesis inhibition highlight the promise of microbiome-targeted therapies for prevention and/or treatment of CRC11. Host DNA damage can also be inflicted by protein toxins, such as the type III secretion system effector UshA in Citrobacter rodentium and cytolethal distending toxin (CDT) in Campylobacter jejuni that promote carcinogenesis in mouse models12,13. Some strains of the prevalent intestinal colonizer Bacteroides fragilis produce fragilysin or “enterotoxin” (ETBF), which promotes carcinogenesis14. ETBF enhances CRC cell migration in culture, likely through a miR-139-3p-dependent pathway15. ETBF also promotes “stemness” of CRC in culture through TLR4-NFAT516.
Reflecting the importance of bacteria in CRC biology, tumor microbiomes have potential links to survival outcomes in prior research. A microbiome signature classifier with substantial contributions from Ruminococcus spp. was predictive of survival in several CRC cohorts17. However, whether the classifier provides prognostic value beyond other dominant factors like the pathologic stage remains to be investigated. Highly powered correlational studies using the Nurses’ Health Study (NHS) and the Health Professionals Follow-up Study (HPFS) cohorts and a Chinese stage III/IV CRC cohort have suggested that F. nucleatum abundance predicts CRC-specific mortality (OR 1.52 [1.04–2.39]), though other studies found no correlation18,19,20. Several known covariates with F. nucleatum colonization, such as right-sided anatomic location and microsatellite instability, may confound this type of single-species retrospective analysis21. At least two studies have suggested that high ETBF abundance correlates with advanced stage and shorter survival22,23. One mechanism by which bacteria may contribute to survival outcomes in CRC is interaction with chemotherapeutic agents. Prior research indicates that 5-fluorouracil (5-FU), a cornerstone of CRC chemotherapy since the 1980s, inhibits F. nucleatum growth, while tumor-resident E. coli can metabolize 5-FU to eliminate its anti-tumor activity24.
Specific tumor-associated pathobionts have established causal roles in colorectal carcinogenesis and progression. For several organisms, rigorous prior research has identified mechanisms such as host genome modification by genotoxins and stimulation of pro-carcinogenic signaling25. Knowledge gaps remain regarding the roles of human CRC microbiomes in disease progression, particularly metastasis risk. We hypothesized that metastasis-promoting bacteria are enriched in primary tumors of patients who eventually develop distant metastasis. We designed a retrospective cohort study using a large multi-institutional CRC population with available tumor RNAseq data. We validated the findings using 16S rDNA sequencing of an independent stage-matched validation CRC cohort.
Results
Descriptive statistics of independent discovery and validation cohorts
Uniformly tabulated clinicopathologic data from the ORIEN cohort showed younger age in the pre-existing metastasis group, modest differences in BMI (lowest in >5-year MFS), slightly more current smokers in the 0–5 yr MFS group, and differences in known vs. unknown performance status (Table 1). Since the primary tumor stage is a dominant predictor of MFS26, the higher pathologic and clinical stage parameters and death rates among the groups with metastasis were expected. Neoadjuvant therapy is more commonly indicated and given in the advanced stage and short MFS groups. The length of follow-up difference was expected due to study design, because only the >5-year MFS group was required to have at least 5 years of follow-up.
Clinicopathologic data from the UIHC cohort showed more frequent microsatellite instability in the >5 yr MFS group (Table 2). The M stage was specified by study design. Adjuvant chemotherapy was more frequently given to metastasizers, as expected. Since adjuvant therapy was given after biospecimen collection in all cases, it does not affect the primary tumor microbiome.
CRC tumor microbiome diversity predicts metastasis-free survival
We first asked whether bacterial communities colonizing primary tumors are related to the risk of metastasis. To address this question, we compared tumor microbiomes from patients with metastasis preceding the biospecimen collection (pre-existing metastasis), those that metastasized within 5 years, and those with more than 5 years of MFS. Primary tumor microbiome alpha diversity was slightly lower in patients with long-term MFS, having the same trend and borderline statistical significance with multiple metrics in the discovery and validation cohorts (Fig. 1A–C, 2-way ANOVA p 0.02–0.08). All analyses were stratified by site (metastasis vs. primary tumor) and metastases consistently showed lower alpha diversity. Tumor microbiome beta diversity analysis also showed significant differences between the MFS groups, as well as sites (Fig. 1D, E). All the above global diversity trends were found in both cohorts. The ORIEN discovery cohort has several clinicopathologic differences among the MFS groups that could confound the microbiome associations with disease progression. Since the cohort is large enough to power multivariate modeling, we asked whether the MFS-microbiome link is independent of several potential confounders using a permutational clustering approach implemented in adonis2 (Table 3). A statistically significant correlation between microbiome beta diversity and MFS persisted after adjustment for potential confounders. Since neoadjuvant therapy can alter tumor microbiomes, we performed the same modeling approach while excluding 151 neoadjuvant treated patients. The microbiome association with MFS remained significant (adjusted adonis p = 0.015). High microsatellite instability (MSI-H) can alter CRC biology, including metastasis outcomes and response to therapy27. We performed the same modeling while excluding 39 patients with MSI-H status, and the microbiome association with MFS remained significant (adjusted adonis p = 0.03). Within the MSI-H subset, a simplified analysis with adjustment only for primary/metastasis biospecimen site found no significant association between microbiome and MFS (adjusted adonis p = 0.4). This could reflect differences in MSI-H tumor microbiome relationships to metastasis outcomes, or inadequate statistical power (n 39).
A, B In the ORIEN cohort, alpha diversity by several metrics is significantly lower in distant metastases (met), and among primary tumors, modestly lower in patients with long term metastasis-free survival (MFS). C Similarly, low alpha diversity in primary tumors correlated with shorter MFS in the UIHC cohort. Metastases had lower Shannon indices that did not reach statistical significance. Microbiome beta diversity in both the ORIEN D and UIHC E cohorts is also distinguishable by site (metastasis vs. primary tumor) and MFS group. Larger symbols represent calculated centroids. The findings indicate that tumor microbiome diversity corresponds to MFS, and therefore may have prognostic value.
To identify specific taxa that distinguish metastasis risk, some of which may be causally related to disease progression and thus therapeutic targets, we applied LefSe28 linear discriminant analysis (Fig. 2). Among the most discriminating features in the ORIEN dataset were at least six known pathobiont-containing taxa that contribute to carcinogenesis through known mechanisms (Fig. 2A, blue text). Surprisingly, F. nucleatum was highest in primary tumors from patients with long-term metastasis-free survival. This may be explained by an inverse correlation between F. nucleatum abundance and stage group, i.e., F. nucleatum is most abundant in tumors with a shallow depth of invasion and a lack of lymph node metastases (Kruskal-Wallis p = 0.003). Analysis of the stage-matched UIHC validation cohort revealed different discriminating taxa, although some overlap with the discovery cohort was seen (Fig. 2B, red text). Thus, beta diversity differences according to MFS are strongly consistent, but the specific discriminating taxa differ between the two cohorts, which could be due to covariance with their distinct clinicopathologic features (Tables 1, 2).
A Linear discriminant analysis of the ORIEN cohort identified several distinguishing taxonomic groups that contain known CRC-related pathobionts (blue). The Enterobacteriaceae S. enterica and E. coli are enriched in distant metastases. B. fragilis and Peptostreptococcus spp. in primary tumors predicted metastasis within 5 years, while F. nucleatum associated with long-term metastasis-free survival (MFS). B Discriminating taxa in the stage-matched UIHC cohort were not the same, but overlapping taxonomic groups are highlighted in red. C Tissue RNAseq- and 16S-derived microbiome data does not distinguish pathobiont strains, so qPCR for F. nucleatum, bft1-3 from enterotoxigenic B. fragilis (ETBF), and cnf2 and clbB genes from the pks locus in colibactin-producing E. coli were performed on tumors and non-neoplastic margins. Detection rates were significantly different (Chi-squared p < 0.01) with ETBF and pks + E. coli more frequently detected in metastasizer biospecimens. F. nucleatum detection rates did not differ in the UIHC cohort by MFS group.
The 16S rDNA sequencing cannot resolve specific pathobiont strains, and the RNAseq data were of insufficient depth to confidently discriminate the closely related genomes. For example, 16S rDNA sequences do not distinguish enterotoxigenic from nonenterotoxigenic B. fragilis. We selected three of the most studied CRC-promoting pathobionts for strain-specific detection and quantitation by PCR: F. nucleatum, enterotoxigenic B. fragilis (ETBF), and pks + E. coli. qPCR performed on all non-metastasis biospecimens from the UIHC cohort (primers listed in Table S1) showed more frequent detection of both ETBF and pks + E. coli in metastasizers than patients with long-term MFS (Fig. 2C, 33–40% vs. 20–28%, Chi squared p < 0.01). Detection of F. nucleatum was not associated with MFS in the stage-matched UIHC cohort.
Microbial co-occurrence predicts metastasis-free survival
We identified a microbial clique of the simultaneous presence of Escherichia coli and the absence of a Pseudomonas sp. and Bacteroides fragilis in the training data (randomly selected 65% of the samples). Almost half (47.6%, n = 565) of the total study sample had this microbial clique. This microbial clique was strongly associated with increased odds of metastasis (OR [95% CI]: 1.91 [1.40, 2.59], p value < 0.0001) across the whole sample (Fig. 3). It was also strongly associated with increased odds of metastasis (OR [95% CI]:1.88 [1.11, 3.20]) in the test data (35% of the sample).
A A forest plots shows the association between the identified microbial clique and the odds of metastasis. B A violin plot shows the distribution of the outcome among patients with and without the clique.
Primary tumor microbiome capacities for carbohydrate metabolism and quinone biosynthesis reflect metastasis risk
Although specific MFS-associated taxa, other than pathobionts, did not show a high degree of concordance between the two cohorts, we reasoned that the functional capacities of the microbiome community may correspond to tumor microenvironment differences underlying metastasis risk. These functions may also highlight targets for future mechanistic studies and microbiome-targeted interventions. Using predicted gene function in the UIHC cohort, we examined correlations with MFS (Fig. 4A). The dominant trends were the enrichment of certain carbohydrate catabolism pathways and quinone biosynthesis pathways in tumors from patients with long-term MFS. Several enriched pathways are involved in the biosynthesis of menaquinones, which are components of the cell membrane that participate in electron transport. Menaquinones are prevalent, reversible redox components in anaerobes and most Gram-positive aerobes29. For the ORIEN cohort, we leveraged a large database of phenotype inferences, ProTraits30, and mapped ~1300 of the detected taxa in our biospecimens to derive predicted trait abundances. A comparison of the 0–5 yr and >5 yr MFS group microbiomes revealed a few traits significantly enriched in the long-term MFS group (Fig. 4B). Among these were several carbohydrate utilization traits and strict anaerobic preference, which correspond well to the metabolism and anaerobe-enriched menaquinone biosynthesis in the UIHC cohort.
A For the UIHC cohort, linear discriminant analysis was performed on 16S rDNA sequence-based pathway abundance prediction from Picrust2. Several patterns emerged as enriched in the >5-year MFS group: carbohydrate catabolism pathways (blue), quinone biosynthesis (brown), and some amino acid biosynthesis pathways (red). The most enriched pathways in metastasizer primary tumors were nucleic acid metabolism pathways. B For the marker gene-derived microbiome data in the ORIEN cohort, the identified species were mapped to the Protraits bacterial traits database. Among the ~1300 species with phenotypic data, few specific associations were found related to MFS: strict anaerobes, alpha galactosidase, and hydrogen sulfide producers were modestly enriched in the >5-year MFS group.
Metastasis microbiomes derive from the primary tumor and have reduced alpha diversity
CRC metastases are colonized by intestinal bacteria including pathobionts3. We assessed correlations between the microbiomes of primary tumors and metastases through paired biospecimens from individual patients. We focused on the subset of 103 ORIEN cohort participants and 50 UIHC cohort participants with paired primary tumor and metastasis biospecimens. Consistent with our observation that metastasis microbiomes in general have lower alpha diversities (Fig. 1A–C), alpha diversity was lower in metastases compared to the paired primary tumor from the same subject in both the ORIEN and UIHC cohorts (Fig. 5A–C). To assess beta diversity relationships between primary/metastasis pairs, we quantified UniFrac distances. In the ORIEN cohort, paired primary tumor and metastasis microbiomes within one subject were more closely related than unpaired comparisons (Fig. 5D). This pattern did not hold in the UIHC validation cohort, though statistical power was limited by the number of paired biospecimens (Fig. 5E). Overall, the findings suggest that similar bacterial populations colonize both primary tumors and metastases within an individual. However, a restricted set of taxa (lower alpha diversity) can colonize and persist at distant sites.
Paired primary tumor and metastasis biospecimens with microbiome data were available for 103 patients in the ORIEN cohort. A, B A paired analysis demonstrated lower alpha diversity in distant metastases, a finding validated in the stage-matched UIHC cohort (C). D ORIEN cohort paired metastasis and primary tumor microbiomes were closely related by beta diversity distance metrics in contrast with unpaired biospecimens, compatible with similar bacterial communities from the primary tumors colonizing distant sites. Inter-subject primary tumors were the most distant. E However, this pattern was not seen in the UIHC cohort, where primary tumor microbiomes were most similar.
We then examined differentially abundant taxa in the paired primary tumor and metastasis microbiomes. Several taxonomic groups were significantly depleted in metastases compared to primary tumors (Fig. 6A, B). The facultative anaerobe and pathobiont-containing Bacteroidaceae family and Fusobacterium genus, as well as the Bifidobacterium genus, were significantly less abundant in metastases from both cohorts. Enterobacteriaceae, including E. coli, had similar abundances at both sites. The species Fusobacterium periodonticum and a Bacillus genus ASV were significantly enriched in the metastases of the UIHC cohort. The findings suggest that some major microbiome families, predominantly anaerobes, are less fit for survival in distant metastasis microenvironments. ProTraits functional group analysis (Fig. 6C) identified strict anaerobicity and several carbohydrate and amino acid utilization traits as depleted in metastases. Aerobes, facultative anaerobes, and alcohol utilization traits had modest advantages in metastases. Analysis with PICRUSt2 in the UIHC cohort identified few pathways enriched in metastases (none in the primary tumors), with themes of nucleic acid metabolism, quinone biosynthesis, and peptidoglycan biosynthesis (Fig. 6D). The findings suggest that nutrient sources (carbohydrates and amino acids) and oxygen levels in distant metastasis microenvironments negatively select some of the taxa that colonize primary tumors.
A In the ORIEN cohort, relative abundance comparison to primary tumors showed metastasis depletion of Fusobacterium spp., Bacteroidaceae, and Bifidobacterium spp., among others, while Enterobacteriaceae and Bacillus spp. were unchanged or enriched. B A similar analysis in the UIHC cohort identified similar patterns, except that Fusobacterium was not significantly depleted. C The Protrait database was used for enrichment analysis of bacterial functions based on taxonomy and relative abundance in the ORIEN cohort. Anaerobic bacteria were depleted in metastases, as well as H2S-producers and bacteria with the capacity to utilize certain sugars. The findings may reflect key selective pressures in the metastasis microenvironment, which tends to be oxygenated and likely lacks the diet-derived carbon sources available to bacteria in the gut. D Picrust2 predicted pathway analysis of the UIHC cohort identified modest enrichment of nucleic acid metabolism and peptidoglycan-related pathways in metastasis microbiomes. No pathways were significantly enriched in primary tumors.
Discussion
The main finding of this study is that primary CRC tumor microbiome diversity, gene function capacities, and microbial cliques are predictive of metastasis-free survival. While most prior detailed mechanistic investigations have focused on the microbiome’s contributions to initiating carcinogenesis, we highlight that human tumor-resident bacteria also associate with disease progression and can help predict the critical clinical outcome of distant metastasis. Several pathobiont taxa, including pks + E. coli and ETBF, are known to promote carcinogenesis and, in limited studies, have been shown to promote epithelial-mesenchymal transition15,16,31, are among the bacteria enriched in primary tumors that progress to distant metastasis. In a recent global study of CRC mutation signatures, colibactin-associated mutations showed regional variability and enrichment in early-onset colorectal carcinoma10. Colibactin produced by pks + E. coli directly influences cancer cell genetics by promoting dsDNA32, which may contribute to metastasis risk25. Pks + E. coli influence cancer-related signaling pathways and microRNAs that are linked to epithelial-mesenchymal transition32,33. Indirect mechanisms of pathobionts promoting metastasis through effects on the tumor microenvironment have been described, such as pks + E. coli-mediated induction of cellular senescence, promotion of growth factor release, and tumor progression34. Colibactin-producing E. coli promote the emergence of cancer stem cell populations that are resistant to chemotherapy32. ETBF can promote CRC metastasis through microRNA and epigenetic modifications in cancer cells35. B. fragilis can promote chemoresistance through host Notch1 signaling, independent of enterotoxin36. The capacities of these pathobionts to mutagenize host cells, stimulate pro-proliferation and -invasion signaling pathways, promote chemoresistance, and manipulate tumor immune cells likely contribute to metastasis risk. Thus, some mechanistic insight for bacterial promotion of CRC metastasis already exists. However, further study is needed to fully understand these mechanisms and identify causal relationships between other bacteria and metastasis. Ongoing efforts to inhibit B. fragilis enterotoxin and colibactin biosynthetic enzymes highlight a promising opportunity to target pathobionts in CRC11,37,38.
F. nucleatum is one of the most highly studied CRC pathobionts that promotes metastasis in animal models through mechanisms of host cell signaling and non-coding RNA modulation events linked to the epithelial-mesenchymal transition3,39,40. Our study did not find associations between F. nucleatum abundance or qPCR detection in tumor microbiomes and MFS outcomes. In fact, initial linear discriminant analysis indicated higher F. nucleatum in the >5 yr MFS group (Fig. 2A). However, further analysis determined that locally less advanced tumors (low T stage) tended to have more F. nucleatum, confounding the association with long-term MFS. This issue did not affect the UIHC cohort because of stage matching. Thus, tumor F. nucleatum abundances and/or functions may change during primary tumor progression, making associations to metastasis outcomes complex. Specific sub-species Fusobacterium taxa (the Fna C2 clade) have been implicated as key colonizers of human CRC41, and this level of granular genetic diversity is not measured with our 16S metagenomics approach. Differences between human CRC biology and animal models of metastasis, such as intact tumor immunity and differing microbiome communities, could also underlie the lack of a detectable F. nucleatum and MFS association in our study.
Our findings indicate that primary tumor microbiomes, pathobiont qPCR markers and specific microbial cliques have prognostic value for metastasis risk, independent of the pathologic stage at resection42. The pathobiont-containing species E. coli and B. fragilis are central to the predictive microbiome features in three parts of the analysis: 1) linear discriminant analysis by MFS outcomes, 2) pathobiont strain detection, and 3) their inclusion in a highly MFS-predictive microbial clique. These features may be useful prognostic markers for determining post-resection surveillance and decisions to treat stage II CRC with adjuvant therapy. While tumor microbiome metagenomics is not part of current clinical practice, detection of specific metastasis risk-related pathobionts such as pks + E. coli and ETBF by qPCR and/or in situ hybridization in CRC pathology specimens is imminently feasible as a clinical test43. Further study is needed to develop these tests and determine their potential clinical utility as precision oncology markers.
Tumor microbiome profiling requires tissue biospecimens from invasive procedures, and fecal biospecimens are a potential alternative for prognostic tests. Our findings of metastasis outcome prediction by tumor microbiomes differ from a recent meta-analysis of 3741 fecal microbiomes, which found no alpha or beta diversity associations with clinical stage (including distant metastases)44. Comparative studies have demonstrated differences between CRC tumor microbiomes, adjacent normal mucosal microbiomes, and the fecal microbiome, likely related to differences in the respective microenvironments45,46,47. Thus, diversity of the microbiome localized to tumors may be more predictive for this particular outcome.
The second significant finding of this study is that CRC metastasis microbiomes have a composition that is related to the primary tumor but with lower alpha diversity and depletion of specific bacterial traits. Microbiome capacities for carbohydrate utilization and anaerobic electron transport in tumor microbiomes are associated with long-term metastasis-free survival. One potential explanation for lower anaerobe abundances in metastasizing tumors could be greater oxygen content (less hypoxia) within the tumor microenvironment that selects against anaerobes. Bacteria in beneficial tumor microbiomes (patients with long-term MFS) have a high capacity for specific carbohydrate utilization, indicating the potential for dietary interventions to influence metastasis risk. The tissue microenvironments in the liver and lungs likely exert selective pressures on the CRC-associated microbiome due to factors such as high oxygen tension, less diverse food-derived carbon sources, and distinct immune cell populations. Prior research indicates that bacteria within metastases can influence tumor growth and response to therapy3,6,48. While the prior research in this area has focused almost exclusively on F. nucleatum, we find that Fusobacteriaceae and several other pathobiont-containing taxonomic groups are depleted in metastases compared to the primary tumor from the same patient. Understanding the drivers of microbiome composition in distant metastases may provide insight into bacterial roles in disease progression and highlight potential interventions such as dietary changes and selective antibiotics directed at tumor microbiomes.
An outstanding goal in the field of tumor microbiome studies is intervention with microbiome-targeted therapies. Our study suggests several avenues for further investigation toward this long-term goal. First, antimicrobials or biosynthetic gene inhibitors directed at some of the known CRC carcinogenic pathobionts (colibactin-producing E. coli and ETBF)11,38 are likely to also favorably impact disease progression. The differential capacity for carbohydrate and amino acid metabolism in primary tumors that metastasize suggests that it may be possible to manipulate the tumor microbiome with dietary (prebiotic) interventions. Dietary patterns such as the sulfur microbial diet are known to modify both the fecal microbiome and the risk of CRC49, but little is currently known regarding dietary effects on tumor-resident bacteria and disease progression. Finally, while this study emphasizes pathobionts, tumor bacteria associated with long-term MFS, such as B. thetaiotaomicron and Prevotella spp. (Fig. 2B) are candidate probiotics that can be further studied for metastasis prevention. Although neither taxonomic group has been extensively studied in CRC animal models, prior research suggests that short-chain fatty acids produced by B. thetaiotaomicron and the presence of Prevotella may suppress tumor growth and/or enhance response to chemotherapeutics50,51,52.
All tumor microbiome studies work from low bacterial biomass specimens, which, together with non-sterile handling of pathology specimens, increases the risks of contamination by exogenous bacterial DNA compared to fecal microbiome studies. These risks are mediated by the inclusion of controls and bioinformatic decontamination approaches (see methods). Although some degree of contamination undoubtedly remains, it is expected to be biased toward the null hypothesis (high variability, no difference between groups) because the study design compares biospecimens handled in the same way and is not expected to have systematically different contamination risks. Perhaps the greatest strength of this study is the use of two independent cohorts with different methodologies (RNAseq and 16S rDNA metagenomics), yielding highly consistent findings. As a retrospective biospecimen study, the results are primarily correlative and, in isolation, do not allow conclusions about the causal roles of the microbiome in metastasis. However, we highlight several organisms with known mechanistic causal roles in CRC carcinogenesis and progression that also emerged in our study, supporting the conclusion that some of the correlations do have an underlying causal relationship.
Methods
Human subject cohort and study design
All human subjects research conforms to the standards of the Declaration of Helsinki. Two subject cohorts were constructed, an Oncology Research Information Exchange Network (ORIEN) discovery cohort consisting of 1003 colorectal carcinoma biospecimens from 900 patients (103 with paired primary tumor and metastasis), and a University of Iowa Health Care (UIHC) cohort of 100 patients with colorectal carcinoma biospecimens. ORIEN is an alliance of US cancer centers established in 2014 using a standardized protocol for prospective clinical and biospecimen data collection with patients’ informed consent. Eligible patients had a tumor RNAseq dataset from at least one CRC biospecimen (primary tumor, metastasis, or both) and complete 5-year survival data (either metastasis within 5 years or at least 5 years of metastasis-free follow-up). The primary outcome was metastasis-free survival in 3 categories: 1) metastasis pre-existing at the time of biospecimen collection, 2) metastasis 0–5 years after collection, and 3) no metastasis in >5 years of follow-up. From 1315 potential candidates, 900 remained after application of the inclusion criteria. All data used in this study are de-identified with coded individual subject indicators. For the ORIEN cohort, clinicopathologic data (Table 1) were combined with microbiome relative abundance data derived from primary tumor and metastasis RNAseq with the exotic2 pipeline for processing, contaminant removal, and normalization53. Briefly, it accepts unaligned reads from tumor RNA-seq human gene expression processing (hg38 reference genome aligned with Bowtie254) and re-aligns them to the T2T-CHM13v2.0 human reference with STAR55. Unaligned reads are aligned to a custom KrakenUniq database containing finished bacterial genomes, high-quality fungal and viral genomes, as well as the T2T human reference genome and the Univec contaminants database as additional filters. Contaminants are filtered using decontam56, correlating the RNA concentration to the library preparation step, and removing common contaminants in sequencing preparations57.
The UIHC cohort was assembled using a pathology database under a protocol approved by the University of Iowa Institutional Review Board (#202212095). Fifty patients with paired primary tumor and metastasis resections (cases) were selected. Controls with only a primary tumor resection biospecimen and at least 5-year metastasis-free survival were matched 1:1 with cases on pathologic T/N stage, age, sex, and exposure to neoadjuvant therapy. Patients with less than 5 years of follow-up and no known metastases were excluded. Eligible participants had formalin-fixed paraffin-embedded (FFPE) pathology biospecimens and slides available and complete clinicopathologic data (Table 2). Starting from 200 potential controls, 57 were excluded due to incomplete clinicopathologic data or follow-up, and 50 were selected for optimal matching to cases. Clinicopathologic data were assembled for this cohort (Table 2) using Iowa Pathology databases and the electronic health record (Epic). A board-certified gastrointestinal pathologist reviewed slides to confirm all pathologic features and select tissue at a non-neoplastic mucosal margin, the primary colon tumor, and distant metastases. Several features of this cohort were designed to complement the discovery cohort: 1) orthogonal methods for microbiome profiling using 16S rDNA sequencing and strain-specific qPCR from FFPE biospecimens, and 2) control of the known potential microbiome confounders of pathologic T/N stage and neoadjuvant therapy using matching.
UIHC cohort 16S metagenomic sequencing
The selected FFPE tissue was dissected from glass slides and total DNA was extracted (QIAgen FFPE Advanced UNG kit). 16S rDNA amplification, library generation, and sequencing were performed as previously described58. Briefly, the V3-V4 region of the 16S rRNA gene was amplified using primers 5’- TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGCCTACGGGNGGCWGCAG-3’ and 5’-GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGGACTACHVGGGTATCTAATCC-3’. Indexing and library construction were performed using a second round of PCR. Multiplexed samples were sequenced by paired-end sequencing on a MiSeq instrument (Illumina). The data were processed using QIIME 259. Sequences were demultiplexed, and quality profiles were visualized using demux and summarize functions. The DADA2 pipeline60 was used for sequence quality control and feature table generation. Phylogenetic trees were generated using the QIIME 2 phylogeny function. Alpha and beta diversity metrics with group significance statistics were calculated using the q2-diversity plugin in QIIME 2. Amplicon sequence variants were classified using a classifier trained on the Greengenes database (gg_12_8)61.
In contrast with fecal metagenomics, microbiome profiling from formalin-fixed paraffin-embedded tissue (FFPE) presents unique challenges. Despite this, prior research demonstrates that bacterial communities can be accurately profiled from this matrix62,63,64,65. Among the challenges are a predominance of human DNA extracted from tissue, DNA damage and cross-linking related to formalin fixation, and the potential for contaminating bacterial DNA during non-sterile specimen handling. Negative control samples included tissue-free FFPE blocks and reagents only. We applied several additional “decontamination” approaches: 1) exclusion of reads mapping to taxonomic groups that are known contaminants from FFPE processing at our institution, identified by a previous 10 subject intra-individual FFPE and fresh frozen tissue comparison, 2) statistical identification of likely contaminants with the decontam package in R56, and 3) exclusion of taxonomic groups that are not identified in at least 1% of high-quality fecal metagenomes from the Human Microbiome Project66 as measured with MetaPhlAn67. After decontamination, there was a median of 14,361 high-quality reads per sample (interquartile range 7048–23,975), comparable sequencing depth to previous studies of colon FFPE samples63.
Pathobiont gene quantitative PCR
Multiple qPCR primer pairs were designed for targeting F. nucleatum nusG and 16S rDNA, B. fragilis enterotoxin gene bft1, and E. coli pks locus genes clbB and cnf1. Total bacteria were quantified with primers targeting the V7 region of 16S rDNA, and human DNA using the actin gene. Primers were bioinformatically filtered for off-target effects using BLAST, NCBI databases, and a large collection of metagenome-assembled genomes (MAGs)68. qPCR was performed using SYBR green and thermal melt analysis to confirm the expected amplicon. Control reactions were performed on DNA extracted from pure pathobiont cultures. Pathobiont detection in biospecimens was defined as a CT value of less than 35 cycles and a thermal melt curve like pure culture controls. Detection rates were compared using chi-square testing. Relative quantities were calculated using the 2-ΔCt method69 with total bacteria (16S) as the referent. All qPCR primer sequences are in Supplementary Table 1.
Microbiome data analysis and statistics
The analysis was conducted in two primary stages. The first stage includes analyses with overall diversity measures, whereas the next stage includes identifying a group of microbes, or microbial cliques, associated with the 5-year metastasis-free survival (MFS) outcome. The alpha and beta diversity metrics to be analyzed were selected before the study. Alpha diversity was quantified with Shannon’s index, Simpson’s index, and Chao1. Beta diversity was quantified with unweighted UniFrac distance, Bray-Curtis dissimilarity, and principal component analyses of both metrics. For RNAseq-derived taxonomic abundances (ORIEN), a random taxonomy-based tree was used where required; for 16S rDNA data (UIHC), a tree was constructed from the alignment of sequence clusters. Statistical testing for alpha diversity differences by site and MFS was 2-way ANOVA. Beta diversity differences were identified by the adonis2 test in R for the ORIEN dataset and the PERMANOVA test in QIIME2 for the UIHC dataset. Differential taxon abundance testing by linear discriminant analysis (LDA) was performed with LefSe using a cutoff LDA score of >2.028. For paired primary tumor and metastasis biospecimens from the same subject, paired Wilcoxon tests were applied to alpha diversity metrics, and UniFrac distances were compared using Kruskal-Wallis tests. Gene function and bacterial trait analysis of the ORIEN dataset was performed by mapping ~1300 detected bacteria and their relative abundances to the Protraits database30. Statistical testing for trait enrichment was Mann-Whitney tests, corrected for multiple comparisons with a false discovery rate of 0.01. Gene function and pathway enrichment analysis of the UIHC 16S rDNA data was sequence-based using PICRUSt270 and LDA. Clinicopathologic data comparisons were Chi-square or Fisher’s exact tests for categorical data and Kruskal-Wallis or Student’s t-test for numerical data. Analysis outputs were visualized using R, QIIME 2 View (view.qiime2.org), and GraphPad Prism 9 (La Jolla, CA, USA).
To discover a microbial clique associated with the outcome with a combination of an interpretable machine learning tool in a regression-based framework called the microbial co-occurrence analysis (MiCA)71,72. Identifying a microbial clique consisting of multiple microbes is challenging because of the inherently large and computationally expensive combinatorial search space. MiCA utilizes a tree-based interpretable machine-learning tool to discover the most prevalent combination of microbes based on their relative abundance. Then, using a quantile-based threshold-finding algorithm creates a predictive rule associated with the outcome. Based on the relative abundances of the most frequently co-occurring microbes, this predictive rule intuitively creates a mathematical equivalence of a microbial clique. Lastly, we use a covariate-adjusted regression to estimate the effect of this microbial clique. Since machine-learning tools naturally tend to overfit, we trained the tree-based algorithm in MiCA using 65% of the data (n = 769). Then the remaining 35% of the data (n = 416) was used to validate the identified microbial clique. This two-stage procedure ensured the robustness of the identified microbial clique. The regression part of the MiCA algorithm incorporates multiple covariates, including age at diagnosis, reported sex, race, neoadjuvant therapy (yes/no), clinical stage group at biospecimen collection, and pathologic stage group. The dominant clique identified includes Escherichia coli, Bacteroides fragilis, and a Pseudomonas sp. ASV. The Kraken database and algorithm (the highest stringency parameter for taxon assignment was used) assigned this ASV as Pseudomonas yamanorum, which is neither a common gut commensal nor a common environmental contaminant. BLAST of several sequences from the taxonomic unit showed 100% identity to P. yamanorum genomes and >90% identity to at least five other Pseudomonas spp. Since we believe this could be an over-call of the species-level classification by Kraken, we refer to it as a Pseudomonas sp. in the results.
Data availability
Data from the Oncology Research Information Exchange Network (ORIEN) is accessible through project requests. All 16S rDNA sequencing data will be made available through NCBI SRA, BioProject PRJNA1244713 upon publication.
References
Rumpold, H. et al. Prediction of mortality in metastatic colorectal cancer in a real-life population: a multicenter explorative analysis. BMC Cancer 20, 1149 (2020).
Dupaul-Chicoine, J. et al. The Nlrp3 inflammasome suppresses colorectal cancer metastatic growth in the liver by promoting natural killer cell tumoricidal activity. Immunity 43, 751–763 (2015).
Bullman, S. et al. Analysis of Fusobacterium persistence and antibiotic response in colorectal cancer. Science 358, 1443–1448 (2017).
Rubinstein, M. R. et al. Fusobacterium nucleatum promotes colorectal carcinogenesis by modulating E-cadherin/beta-catenin signaling via its FadA adhesin. Cell Host Microbe 14, 195–206 (2013).
Abed, J. et al. Fap2 mediates Fusobacterium nucleatum colorectal adenocarcinoma enrichment by binding to tumor-expressed Gal-GalNAc. Cell Host Microbe 20, 215–225 (2016).
Kostic, A. D. et al. Fusobacterium nucleatum potentiates intestinal tumorigenesis and modulates the tumor-immune microenvironment. Cell Host Microbe 14, 207–215 (2013).
Borozan, I. et al. Molecular and pathology features of colorectal tumors and patient outcomes are associated with fusobacterium nucleatum and its subspecies animalis. Cancer Epidemiol. Biomark. Prev. 31, 210–220 (2022).
Arthur, J. C. et al. Intestinal inflammation targets cancer-inducing activity of the microbiota. Science 338, 120–123 (2012).
Pleguezuelos-Manzano, C. et al. Mutational signature in colorectal cancer caused by genotoxic pks(+) E. coli. Nature 580, 269–273 (2020).
Diaz-Gay, M. et al. Geographic and age variations in mutational processes in colorectal cancer. Nature. https://doi.org/10.1038/s41586-025-09025-8 (2025).
Volpe, M. R. et al. A small molecule inhibitor prevents gut bacterial genotoxin production. Nat. Chem. Biol. 19, 159–167 (2023).
Liu, Y. et al. Bacterial genotoxin accelerates transient infection-driven murine colon tumorigenesis. Cancer Discov. 12, 236–249 (2022).
He, Z. et al. Campylobacter jejuni promotes colorectal tumorigenesis through the action of cytolethal distending toxin. Gut 68, 289–300 (2019).
Chung, L. et al. Bacteroides fragilis toxin coordinates a pro-carcinogenic inflammatory cascade via targeting of colonic epithelial cells. Cell Host Microbe 23, 203–214 e205 (2018).
Wu, X. et al. Enterotoxigenic bacteroides fragilis (ETBF) enhances colorectal cancer cell proliferation and metastasis through HDAC3/miR-139-3p pathway. Biochem. Genet. https://doi.org/10.1007/s10528-023-10621-4 (2024).
Liu, Q. Q. et al. Enterotoxigenic Bacteroides fragilis induces the stemness in colorectal cancer via upregulating histone demethylase JMJD2B. Gut Microbes 12, 1788900 (2020).
Roelands, J. et al. An integrated tumor, immune and microbiome atlas of colon cancer. Nat. Med. 29, 1273–1286 (2023).
Mima, K. et al. Fusobacterium nucleatum in colorectal carcinoma tissue and patient prognosis. Gut 65, 1973–1980 (2016).
Mima, K. et al. Fusobacterium nucleatum and T Cells in Colorectal Carcinoma. JAMA Oncol. 1, 653–661 (2015).
Yan, X., Liu, L., Li, H., Qin, H. & Sun, Z. Clinical significance of Fusobacterium nucleatum, epithelial-mesenchymal transition, and cancer stem cell markers in stage III/IV colorectal cancer patients. Onco Targets Ther. 10, 5031–5046 (2017).
Mouradov, D. et al. Oncomicrobial community profiling identifies clinicomolecular and prognostic subtypes of colorectal cancer. Gastroenterology 165, 104–120 (2023).
Wei, Z. et al. Could gut microbiota serve as prognostic biomarker associated with colorectal cancer patients’ survival? A pilot study on relevant mechanism. Oncotarget 7, 46158–46172 (2016).
Boleij, A. et al. The Bacteroides fragilis toxin gene is prevalent in the colon mucosa of colorectal cancer patients. Clin. Infect. Dis. 60, 208–215 (2015).
LaCourse, K. D. et al. The cancer chemotherapeutic 5-fluorouracil is a potent Fusobacterium nucleatum inhibitor and its activity is modified by intratumoral microbiota. Cell Rep. 41, 111625 (2022).
Dougherty, M. W. & Jobin, C. Intestinal bacteria and colorectal cancer: etiology and treatment. Gut Microbes 15, 2185028 (2023).
Liu, C. et al. Distant metastasis pattern and prognostic prediction model of colorectal cancer patients based on big data mining. Front. Oncol. 12, 878805 (2022).
Saberzadeh-Ardestani, B. et al. Metastatic site and clinical outcome of patients with deficient mismatch repair metastatic colorectal cancer treated with an immune checkpoint inhibitor in the first-line setting. Eur. J. Cancer 196, 113433 (2024).
Segata, N. et al. Metagenomic biomarker discovery and explanation. Genome Biol. 12, R60 (2011).
Meganathan, R. Biosynthesis of menaquinone (vitamin K2) and ubiquinone (coenzyme Q): a perspective on enzymatic mechanisms. Vitam. Horm. 61, 173–218 (2001).
Brbic, M. et al. The landscape of microbial phenotypic traits and associated genes. Nucleic Acids Res 44, 10074–10090, https://doi.org/10.1093/nar/gkw964 (2016).
Bertocchi, A. et al. Gut vascular barrier impairment leads to intestinal bacteria dissemination and colorectal cancer metastasis to liver. Cancer Cell 39, 708–724 e711 (2021).
Dalmasso, G. et al. Colibactin-producing Escherichia coli enhance resistance to chemotherapeutic drugs by promoting epithelial to mesenchymal transition and cancer stem cell emergence. Gut Microbes 16, 2310215 (2024).
He, X. et al. Contribution of PKS+ Escherichia coli to colon carcinogenesis through the inhibition of exosomal miR-885-5p. Heliyon 10, e37346 (2024).
Dalmasso, G., Cougnoux, A., Delmas, J., Darfeuille-Michaud, A. & Bonnet, R. The bacterial genotoxin colibactin promotes colon tumor growth by modifying the tumor microenvironment. Gut Microbes 5, 675–680 (2014).
Wu, X. et al. Enterotoxigenic bacteroides fragilis (ETBF) enhances colorectal cancer cell proliferation and metastasis through HDAC3/miR-139-3p pathway. Biochem Genet 62, 3904–3919 (2024).
Ding, X. et al. Bacteroides fragilis promotes chemoresistance in colorectal cancer, and its elimination by phage VA7 restores chemosensitivity. Cell Host Microbe. https://doi.org/10.1016/j.chom.2025.05.004 (2025).
Jimenez-Alesanco, A. et al. Repositioning small molecule drugs as allosteric inhibitors of the BFT-3 toxin from enterotoxigenic Bacteroides fragilis. Protein Sci. 31, e4427 (2022).
Velilla, J. A. et al. Structural basis of colibactin activation by the ClbP peptidase. Nat. Chem. Biol. 19, 151–158 (2023).
Chen, Y. et al. Fusobacterium nucleatum Promotes Metastasis in Colorectal Cancer by Activating Autophagy Signaling via the Upregulation of CARD3 Expression. Theranostics 10, 323–339 (2020).
Yu, Y. et al. Fusobacterium nucleatum promotes colorectal cancer liver metastasis via miR-5692a/IL-8 axis by inducing epithelial-mesenchymal transition. J. Biomed. Sci. 32, 5 (2025).
Zepeda-Rivera, M. et al. A distinct Fusobacterium nucleatum clade dominates the colorectal cancer niche. Nature 628, 424–432 (2024).
Chen, K., Collins, G., Wang, H. & Toh, J. W. T. Pathological Features and Prognostication in Colorectal Cancer. Curr. Oncol. 28, 5356–5383 (2021).
Datorre, J. G. et al. Enhancing colorectal cancer screening with droplet digital PCR analysis of Fusobacterium nucleatum in fecal immunochemical test samples. Cancer Prev. Res. 17, 471–479 (2024).
Piccinno, G. et al. Pooled analysis of 3,741 stool metagenomes from 18 cohorts for cross-stage and strain-level reproducible microbial biomarkers of colorectal cancer. Nat. Med. 31, 2416–2429 (2025).
Masaadeh, A. H., Eletrebi, M., Parajuli, B., De Jager, N. & Bosch, D. E. Human colitis-associated colorectal carcinoma progression is accompanied by dysbiosis with enriched pathobionts. Gut Microbes 17, 2479774 (2025).
Adnan, D., Trinh, J. Q., Sharma, D., Alsayid, M. & Bishehsari, F. Early-onset colon cancer shows a distinct intestinal microbiome and a host-microbe interaction. Cancer Prev. Res. 17, 29–38 (2024).
Debelius, J. W. et al. The local tumor microbiome is associated with survival in late-stage colorectal cancer patients. Microbiol. Spectr. 11, e0506622 (2023).
Sakamoto, Y. et al. Relationship between Fusobacterium nucleatum and antitumor immunity in colorectal cancer liver metastasis. Cancer Sci. 112, 4470–4477 (2021).
Kato, I. & Sun, J. Microbiome and diet in colon cancer development and treatment. Cancer J. 29, 89–97 (2023).
Ryu, T. Y. et al. Human gut-microbiome-derived propionate coordinates proteasomal degradation via HECTD2 upregulation to target EHMT2 in colorectal cancer. ISME J. 16, 1205–1221 (2022).
Hou, X. Y. et al. Prevotella contributes to individual response of FOLFOX in colon cancer. Clin. Transl. Med. 11, e512 (2021).
Xu, Y. et al. Biglycan regulated colorectal cancer progress by modulating enteric neuron-derived IL-10 and abundance of Bacteroides thetaiotaomicron. iScience 26, 107515 (2023).
Hoyd, R. et al. Exogenous sequences in tumors and immune cells (exotic): a tool for estimating the microbe abundances in tumor RNA-seq data. Cancer Res. Commun. 3, 2375–2385 (2023).
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012).
Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).
Davis, N. M., Proctor, D. M., Holmes, S. P., Relman, D. A. & Callahan, B. J. Simple statistical identification and removal of contaminant sequences in marker-gene and metagenomics data. Microbiome 6, 226 (2018).
Salter, S. J. et al. Reagent and laboratory contamination can critically impact sequence-based microbiome analyses. BMC Biol. 12, 87 (2014).
Shahi, S. K., Zarei, K., Guseva, N. V. & Mangalam, A. K. Microbiota analysis using two-step PCR and next-generation 16S rRNA gene sequencing. J Vis Exp. https://doi.org/10.3791/59980 (2019).
Bolyen, E. et al. Reproducible, interactive, scalable and extensible microbiome data science using QIIME 2. Nat. Biotechnol. 37, 852–857 (2019).
Callahan, B. J. et al. DADA2: high-resolution sample inference from Illumina amplicon data. Nat. Methods 13, 581–583 (2016).
McDonald, D. et al. An improved Greengenes taxonomy with explicit ranks for ecological and evolutionary analyses of bacteria and archaea. ISME J. 6, 610–618 (2012).
Steiert, T. A. et al. A critical spotlight on the paradigms of FFPE-DNA sequencing. Nucleic Acids Res 51, 7143–7162 (2023).
Borgognone, A. et al. Performance of 16S metagenomic profiling in formalin-fixed paraffin-embedded versus fresh-frozen colorectal cancer tissues. Cancers 13, https://doi.org/10.3390/cancers13215421 (2021).
Cruz-Flores, R., Lopez-Carvallo, J. A., Caceres-Martinez, J. & Dhar, A. K. Microbiome analysis from formalin-fixed paraffin-embedded tissues: current challenges and future perspectives. J. Microbiol Methods 196, 106476 (2022).
Walker, S. P., Tangney, M. & Claesson, M. J. Sequence-based characterization of intratumoral bacteria-a guide to best practice. Front. Oncol. 10, 179 (2020).
Integrative, H. M. P. R. N. C. The integrative human microbiome project. Nature 569, 641–648 (2019).
Blanco-Miguez, A. et al. Extending and improving metagenomic taxonomic profiling with uncharacterized species using MetaPhlAn 4. Nat. Biotechnol. 41, 1633–1644 (2023).
Richardson, L. et al. MGnify: the microbiome sequence data analysis resource in 2023. Nucleic Acids Res 51, D753–D759 (2023).
Navidshad, B., Liang, J. B. & Jahromi, M. F. Correlation coefficients between different methods of expressing bacterial quantification using real time PCR. Int J. Mol. Sci. 13, 2119–2132 (2012).
Douglas, G. M. et al. PICRUSt2 for prediction of metagenome functions. Nat. Biotechnol. 38, 685–688 (2020).
Midya, V. et al. Prenatal lead exposure is associated with reduced abundance of beneficial gut microbial cliques in late childhood: an investigation using Microbial Co-Occurrence Analysis (MiCA). Environ. Sci. Technol. 57, 16800–16810 (2023).
Midya, V. et al. Prenatal metal exposures and childhood gut microbial signatures are associated with depression score in late childhood. Sci. Total Environ. 916, 170361 (2024).
Acknowledgements
We gratefully acknowledge the University of Iowa Holden Comprehensive Cancer Center’s Microbiome Core for their essential support in providing microbiome sequencing services. The Histology Research Laboratory at Iowa provided histology services. This work was supported by the Holden Comprehensive Cancer Center ACS-IRG from the American Cancer Society (DEB). DEB was supported by the NIH, K08AI159619. RK was supported by the American Cancer Society (IRG-21-141-46-IRG) and the National Cancer Institute (R25CA273964). DS is supported by the National Institute on Aging (K01AG070310), the American Lung Association (1046611), and the American Cancer Society (RSG-23-1023205).
Author information
Authors and Affiliations
Contributions
B.P.—investigation, writing (original draft and editing); V.M.—methodology, formal analysis, writing (original draft and editing), R.K.—methodology, investigation, writing (original draft and editing), visualization; N.D.J.—methodology, investigation, writing (original draft and editing), visualization, data curation; S.E.— methodology, writing (editing), resources; D.S.—methodology, software, resources, writing (editing); R.H.— methodology, investigation, writing (editing); D.E.B.—conceptualization, methodology, investigation, software, resources, data curation, writing (original draft and editing), visualization, supervision, project administration, funding acquisition. All other authors are members of the ORIEN Microbiome Research Interest Group, with at least one member from each institution that contributed colorectal cancer cases to the study. B.S., C.H.F.C., M.L.C., R.J.R., S.Y., M.R.R., A.A.T., D.P.M., A.M., T.J.B., R.W.L., H.H., M.N.I., S.H., C.M.U., J.L.R., G.R., C.D.S. – resources, writing (editing).
Corresponding author
Ethics declarations
Competing interests
RWL acknowledges clinical research funding to institution from ALX Oncology, Merck, Lilly, Guardant, EDDC, Boehringer Ingelheim, and AstraZeneca, consulting/advisory role (honoraria to institution) for Agenus, Boehringer Ingelheim, Merck, Natera, and consulting/advisory role (unpaid) for Kahr and EDDC. GR has advisory roles with AstraZeneca, Pfizer, and Bayer in the past 2 years that have ended.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Parajuli, B., Midya, V., Kiddle, R. et al. Primary tumor microbiomes predict distant metastasis of colorectal cancer. npj Precis. Onc. 9, 405 (2025). https://doi.org/10.1038/s41698-025-01188-x
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41698-025-01188-x








