Isolation, engineering and ecology of temperate phages from the human gut

Dahlman, Sofia; Avellaneda-Franco, Laura; Rutten, Emily L.; Gulliver, Emily L.; Solari, Sean; Chonwerawong, Michelle; Kett, Ciaren; Subedi, Dinesh; Young, Remy B.; Campbell, Nathan; Gould, Jodee A.; Bell, Jasmine D.; Docherty, Callum A. H.; Turkington, Christopher J. R.; Nezam-Abadi, Neda; Grasis, Juris A.; Lyras, Dena; Edwards, Robert A.; Forster, Samuel C.; Barr, Jeremy J.

doi:10.1038/s41586-025-09614-7

Article
Open access
Published: 15 October 2025

Nature volume 647, pages 698–705 (2025)

Abstract

Large-scale metagenomic and data-mining efforts have revealed an expansive diversity of bacteriophages (phages) within the human gut^1,2,3. However, functional understanding of phage–host interactions within this complex environment is limited, largely due to a lack of cultured isolates available for experimental validation. Here we characterize 134 inducible prophages originating from 252 human gut bacterial isolates using 10 different induction conditions to expand the experimentally validated temperate phage–host pairs originating from the human gut. Importantly, only 18% of computationally predicted prophages could be induced in pure cultures. Moreover, we construct a 78-member synthetic microbiome that, when co-cultured in the presence of human colonic cells (Caco2), led to the induction of 35% of phage species. Using cultured isolates, we demonstrate that human host-associated cellular products may act as induction agents, providing a possible link between gastrointestinal cell lysis and temperate phage populations^4,5. We provide key insights into prophage diversity and genetics, including a genetic pathway for domestication, finding that polylysogeny was common and resulted in coordinated prophage induction, and that differential induction can be influenced by divergent prophage integration sites. More broadly, our study highlights the importance of culture-based techniques, alongside experimental validation, genomics and computational prediction, to understand the biology and function of temperate phages in the human gut microbiome. These culture-based approaches will enable applications across synthetic biology, biotechnology and microbiome fields.

Main

The human gut microbiota consists of a plethora of microorganisms along with the viruses that infect them, including phages. These viruses are thought to shape the gut microbial community through predation, horizontal gene transfer and lysogenic conversion^6,7. Recent advances in computational mining of gut metagenomes have revealed an expansive collection of viral metagenome-assembled genomes and efforts cataloguing this diversity have led to the discovery of several important viral families^1,2,3,8,9,10. Moreover, lysogeny is common within the gut, with up to 90% of bacteria predicted to harbour prophages^11,12. However, the extent to which these prophages re-enter lytic replication remains unclear. For example, the inactivation of resident prophages represents a common strategy whereby the bacterial population can escape lysis while maintaining beneficial phage genes^13,14. Furthermore, initiation of lytic replication by resident prophages is complex, involving both host- and phage-specific cues^15,16. Within the gut, little is known about temperate phages and how they interact with our commensals.

Induction of human gut isolates

Advances in cultivation of the microbiota have enabled the isolation and archiving of previously ‘unculturable’ gut bacterial species^17,18, along with their phages. Here we use a collection of 252 human gut bacterial isolates (50 Actinomycetota, 1 Fusobacteriota, 51 Bacillota, 57 Pseudomonadota and 93 Bacteroidota) to computationally identify and experimentally validate inducible prophages (Fig. 1a and Supplementary Table 1). We began by exposing our bacterial isolate cultures to eight different induction agents and conditions, which included a standard medium control, well-known inducing agents such as mitomycin C (0.3 and 3 µg ml⁻¹) and hydrogen peroxide (0.5 mM), along with lesser-known induction conditions with potential relevance to the gut, including the sugar substitute Stevia (3.7 and 37 mg ml⁻¹) and two starvation conditions (50% carbon depletion and 100% short-chain fatty acid (SCFA) depletion)^{19,20,21,22,23}. After induction, the samples were processed for DNA extraction and 433 viral induction samples that passed our inclusion criteria were sequenced (Extended Data Fig. 1a,b and Supplementary Table 2). This resulted in the detection of 125 inducible gut prophages, representing 63 (23%) phage species (95% average nucleotide identity (ANI), over 85% alignment fraction (AF))²⁴ from 73 (29%) bacterial isolates (5 Actinomycetota, 1 Fusobacteriota, 10 Bacillota, 17 Pseudomonadota and 40 Bacteroidota) (Fig. 1b).
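The species definition used above (95% ANI over 85% AF) can be illustrated with a minimal sketch. It assumes that pairwise ANI and AF values have already been computed by an external tool, and it groups genomes by single linkage, which is an illustrative choice and not necessarily the clustering procedure used in the study. The function name, the record layout and the example values are all hypothetical.

```python
from typing import Dict, Iterable, List, Tuple

ANI_MIN = 95.0  # percent average nucleotide identity (threshold from the text)
AF_MIN = 85.0   # percent alignment fraction (threshold from the text)


def cluster_species(
    genomes: Iterable[str],
    pairs: Iterable[Tuple[str, str, float, float]],
) -> List[List[str]]:
    """Group genomes into species-level clusters by single linkage.

    `pairs` holds hypothetical (genome_a, genome_b, ani, af) records.
    """
    parent: Dict[str, str] = {g: g for g in genomes}

    def find(x: str) -> str:
        # Walk to the cluster root, shortening the path on the way.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b, ani, af in pairs:
        # Link two genomes only when both thresholds are met.
        if ani >= ANI_MIN and af >= AF_MIN:
            parent[find(a)] = find(b)

    clusters: Dict[str, List[str]] = {}
    for g in parent:
        clusters.setdefault(find(g), []).append(g)
    return sorted(sorted(members) for members in clusters.values())


# Invented example values, for illustration only.
example_pairs = [
    ("phiA", "phiB", 97.2, 91.0),  # passes both thresholds
    ("phiB", "phiC", 96.0, 60.0),  # ANI passes, AF fails
]
species = cluster_species(["phiA", "phiB", "phiC"], example_pairs)
```

In this toy input, phiA and phiB fall into one species, whereas phiC stays separate because its alignment fraction is too low even though its ANI is high.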

**Fig. 1: Gut prophages induced in pure and synthetic bacterial community cultures.**

Human cellular products induce prophages

To further expand on the prophage induction triggers tested above, we constructed a synthetic bacterial community based on a subset of our isolates (n = 78, 4 Actinomycetota, 1 Fusobacteriota, 4 Bacillota, 28 Pseudomonadota and 41 Bacteroidota; Supplementary Table 1). Using this community, we investigated the effects of bacterial co-culture, in which the competition for resources, production of microbial byproducts and quorum sensing may affect prophage induction, as well as community co-culture with a monolayer of human colonic epithelial cells (Caco2), to investigate human host-associated factors^5,16 (Extended Data Fig. 1c). In total, 29 phage species out of 162 (17%) were identified as induced in community co-culture using read mapping, yet, notably, 57 phage species (35%) were induced within the Caco2 co-culture, with a total of 22 phage species being newly identified as inducible across both experiments (Fig. 1c). However, there was a shift in the bacterial community composition within the Caco2 co-culture, dominated by Pseudomonadota and Bacteroidota, potentially leading to detection of prophages induced from isolates extinct in the community only co-culture²⁵ (Fig. 1d,e). In a complementary approach, we detected the induction of distinct prophage–host pairs by identifying unique k-mers within each prophage genome. Out of the 338 predicted prophages within the community, 150 contained unique k-mers, allowing for the detection of 43 (29%) induced prophages, 21 of which were detected only in the Caco2 co-culture (Fig. 1f).
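The unique k-mer approach described above can be sketched as follows. This is a simplified illustration under stated assumptions: the value of k, the hit threshold, the function names and the toy sequences are all hypothetical, and the sketch ignores reverse complements and sequencing errors, which a real analysis would need to handle.

```python
from collections import Counter
from typing import Dict, List, Set


def kmers(seq: str, k: int) -> Set[str]:
    """Return the set of all substrings of length k in `seq`."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def unique_kmers(genomes: Dict[str, str], k: int) -> Dict[str, Set[str]]:
    """Map each genome to the k-mers that occur in no other genome of the set."""
    per_genome = {name: kmers(seq, k) for name, seq in genomes.items()}
    # Count in how many genomes each k-mer is present.
    occurrence = Counter(km for kms in per_genome.values() for km in kms)
    return {
        name: {km for km in kms if occurrence[km] == 1}
        for name, kms in per_genome.items()
    }


def detect_induced(
    reads: List[str], unique: Dict[str, Set[str]], k: int, min_hits: int = 1
) -> List[str]:
    """List genomes whose unique k-mers are matched at least `min_hits` times."""
    hits: Counter = Counter()
    for read in reads:
        read_kmers = kmers(read, k)
        for name, markers in unique.items():
            hits[name] += len(read_kmers & markers)
    return sorted(name for name, n in hits.items() if n >= min_hits)


# Toy sequences (hypothetical): the two "prophages" differ only at the 3' end.
toy_genomes = {"phi1": "ACGTACGTTT", "phi2": "ACGTACGAAA"}
unique = unique_kmers(toy_genomes, k=4)
found = detect_induced(["TACGTTT"], unique, k=4)
```

A genome whose k-mers are all shared with another genome has an empty marker set and cannot be resolved this way, which mirrors why only a subset of the predicted prophages in the community could be tracked individually.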

Considering the increased prophage induction within the Caco2 co-cultured community, we wanted to investigate whether human cells or cell lysis products (independent of community effects) act as prophage induction triggers. To this end, we selected 32 bacterial isolates from the community (3 Actinomycetota, 1 Fusobacteriota, 2 Bacillota, 14 Pseudomonadota and 12 Bacteroidota; Supplementary Table 1) for pure culture induction assays using Caco2 cell monolayers, Caco2 cellular lysates and DMEM cell culture medium alone (Extended Data Fig. 1a). These conditions induced 29 prophages, with 25 observed within the Caco2 cell lysate condition and 14 in the Caco2 monolayer and DMEM cell medium (Fig. 1g). Importantly, nine of the induced prophages had not been previously detected in our bacterial isolate cultures using standard induction agents, indicating that human host-associated cellular products act as induction triggers. Taken together, 35 out of 146 (24%) prophages were found to be inducible across all conditions within these 32 bacterial isolates.

Only a fraction of gut prophages were induced

Consistent with previous reports of substantial lysogeny within the human gut¹¹, 237 out of 252 bacterial isolates (94%) were computationally predicted to contain high-quality prophage regions. However, across all 10 of our tested induction conditions, only 32% (80 out of 252) of isolates were induced and 18% (134 out of 736) of the high-quality prophage predictions, or 24% (68 out of 274) of high-quality prophage species, corresponded to experimentally inducible prophages in pure culture conditions (Fig. 1h,i and Supplementary Table 3). The highest concordance between inducible and predicted prophage regions was observed within Bacteroidota isolates, in which 80 predictions (27%) from 41 isolates (44%) were inducible. Comparatively, in Pseudomonadota, which had the highest number of predicted prophages (4.5 per isolate), just 12% of prophages were found to be inducible. Moreover, combining the synthetic communities, a total of 36% of prophage species were detected as induced, coinciding with recent reports from human gut metagenomes (8–36%)^26,27. Although our experimental approach does not provide comprehensive identification of all gut prophages due to factors including detection limits and potential unidentified induction conditions, it is likely that a substantial portion of predicted prophages within our dataset rarely undergo induction.

Taxonomy of induced gut temperate phage

We next looked to assign phage taxonomy to our induced temperate phage collection using a database comprising 9,920 phage reference genomes. Given the inherent challenges in assigning taxonomy to phages²⁸, we applied both a gene voting-based search and a gene sharing network method using vContact2 (refs. ^2,29) (Fig. 2a, Extended Data Fig. 3a and Supplementary Table 4) with taxonomy assigned to the highest taxonomic resolution shared between methods. The resulting classification assigned 133 phages to the Caudoviricetes class and one phage to the Faserviricetes class (Supplementary Table 3). In total, 26% (35 out of 134) of phages could be assigned to ICTV (International Committee on Taxonomy of Viruses) accepted taxa at the family level or lower. These belonged to previously reported phage taxa infecting Pseudomonadota (Bcepmuvirus, Punavirus, Uetakevirus and Peduoviridae), one Spbetavirus infecting Bacillota and 16 prophages belonging to the Winoviridae family infecting Bacteroidota. Although lacking ICTV classification, 30 genomes could be grouped into viral clusters (genus-subfamily level) together with previously described phages. Notably, 19 of these clustered with Hankyphage, a recently described virus thought to lysogenize several Bacteroides species³⁰. Further taxonomic classification grouped ten prophages at the species level with Hankyphage, whereas the remaining nine clustered into seven potential novel species, forming a putative novel genus that we name Hankyvirus after the original phage characterized (Extended Data Fig. 2a). Comparing the Hankyvirus species to bacterial genomes in the NCBI RefSeq database (95% ANI over 85% AF), we identified 52 host species originating from 9 genera and 5 families, indicating a broad host range of this genus (Extended Data Fig. 2b). Correspondingly, we find two Hankyvirus species induced within both Bacteroides and Phocaeicola isolates within our collection, providing experimental validation of these phages as actively replicating across these two host genera.
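Assigning taxonomy "to the highest taxonomic resolution shared between methods" amounts to taking the longest common prefix of two lineages. The sketch below assumes each method returns a lineage ordered from the broadest to the finest rank; the function name and the genus labels in the example are hypothetical and are not output from the tools used in the study.

```python
from typing import List, Sequence


def shared_lineage(a: Sequence[str], b: Sequence[str]) -> List[str]:
    """Return the longest common prefix of two lineages (broadest rank first)."""
    shared: List[str] = []
    for taxon_a, taxon_b in zip(a, b):
        # Stop at the first rank where the methods disagree or one is unassigned.
        if not taxon_a or taxon_a != taxon_b:
            break
        shared.append(taxon_a)
    return shared


# Hypothetical calls from two methods; the genus labels are placeholders.
by_gene_voting = ["Caudoviricetes", "Peduoviridae", "genus_A"]
by_gene_network = ["Caudoviricetes", "Peduoviridae", "genus_B"]
consensus = shared_lineage(by_gene_voting, by_gene_network)
```

Here the two methods agree down to the family, so the consensus stops at Peduoviridae and the conflicting genus calls are discarded.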

**Fig. 2: Taxonomy and prevalence of induced temperate phages within gut viromes.**

Inducible temperate phages are prevalent

We next sought to place our temperate phage genomes within the larger context of the human gut by comparing their prevalence to the reference genomes, including the Crassvirales order³¹. Approximately half of our inducible prophage species (28 out of 68) could be detected in gut viromes⁹ (n = 1,241; Fig. 2b and Supplementary Table 5). LoVEphage, a recently discovered Bacteroidota phage^9,10, was the most common, being detected in around 8% (97 out of 1,241) of the viromes and representing up to 64% of reads within one virome (Supplementary Table 3). Comparatively, the most abundant Crassvirales genome, belonging to the alpha/gamma family, was found in approximately 19% of the viromes investigated (Extended Data Fig. 3b). Three phages in our collection were species-level members of LoVEphage, induced from Bacteroides thetaiotaomicron, Phocaeicola dorei and Phocaeicola vulgatus hosts (Extended Data Fig. 2b). An additional eight phage species were detected in 2–5% of gut viromes (Supplementary Table 3). These included the four species within the Hankyvirus genus, one Uetakevirus infecting Escherichia coli and three previously uncharacterized Bacteroidota phages (Wilby, Saffi and Shia; Extended Data Fig. 3c).

DGRs are common within gut prophages

Discernible integrase or site-specific recombination genes, both of which are used as hallmark genes for a temperate lifestyle^32,33, were absent in 28% (19 out of 68) of our inducible phage species, including Hankyviruses. We found transposases in ten of these viruses, while the remaining nine lacked any discernible integration genes, illustrating the difficulty in assigning phage lifestyle based on genomic data alone. Diversity-generating retroelements (DGRs) are prevalent within the gut virome, and tail-targeting DGRs are known to enable rapid host switching in a Bordetella phage^34,35. We found DGRs in 19% (13 out of 68) of our inducible prophage species, the majority of which were seen in Bacteroidota phages, in which 43% (12 out of 28) of species encoded DGRs targeting known and genomically predicted tail proteins (Supplementary Table 3). Concordantly, we found eight Bacteroidota prophage species actively replicating across different bacterial species, three of which replicated across different bacterial genera (Fig. 1b (connecting lines)), highlighting the involvement of DGRs in phage host range expansion through diversification of tail proteins³⁴. More recently, bacterial DGRs were implicated in anti-phage defence mechanisms, and targeted engineering using DGRs accelerated evolution within both host receptor and the reciprocal phage binding domain^36,37. Notably, we found four prophage species containing DGRs that encoded a second variable region (VR) targeting genes distal from the reverse transcriptase cassette⁹ (Extended Data Fig. 3c). The second VR was found in proximity to counter-defence genes, such as DNA methyltransferase, indicating further involvement of DGRs in the phage–host arms race³⁴.

Differential gene enrichment patterns

The retention of cryptic prophages is known to provide the host with adaptive fitness advantages and has been shown to result in a bimodal size distribution of prophage genomes^13,14. Concordantly, we find bimodal length distributions of prophages across all host phyla within our collection (Extended Data Fig. 4a), with the early peak corresponding to sequences with low completeness scores (<50% complete, n = 1,236) and later peaks corresponding to high-quality predictions (>50% complete, n = 736) and experimentally inducible prophages (n = 134; Fig. 3a and Supplementary Table 6). To investigate differences in gene content between these groups, we performed gene enrichment analysis of annotated PHROG gene categories and found that small prophage genomes lacked essential phage genes (such as structural, head and packaging, and lysis genes) but were enriched in accessory genes and genes of unknown function³⁸ (Extended Data Fig. 4b). Similarly, when limiting the comparison to high-quality predictions, we found an enrichment of structural genes (head and packaging, connector, lysis and tail) within induced prophages, whereas non-induced predictions were enriched for accessory genes and genes of unknown function, indicating that a subset of high-quality predictions might be cryptic prophage-like elements or poor predictions (Fig. 3b).

**Fig. 3: Comparison of induced versus predicted prophages.**

We next sought to investigate potential genetic mechanisms leading to trapping of prophages within the host genome by comparing experimentally induced prophages to highly similar, non-induced prophages (95% ANI, 85% AF; Supplementary Table 7). To classify these non-induced prophages as putatively cryptic, we restricted the analysis to prophages that had been sequenced (but not induced) in the same condition(s) as their inducible counterparts, with the rationale that highly similar prophages should respond to the same induction triggers. This resulted in a total of 231 prophage pairs between 65 induced and 58 non-induced prophages. No significant changes were found in gene frequency (P > 0.05, Fisher’s exact test), indicating that, although gene loss may be characteristic of cryptic prophages, it is unlikely to be the initial cause of inactivation. Moreover, although we detected 201 horizontal gene transfer (HGT) and 65 insertion–deletion events, there was no significant difference in the number of total events when compared to a set of high sequence similarity induced prophage pairs (222 pairs, P = 0.46, Wilcoxon rank-sum test). Comparing host ANI between the induced and non-induced prophage pairs (Extended Data Fig. 4c,d), we found no association with induction (P = 0.6, Pearson’s correlation), suggesting that prophage inactivation was not driven by diversification of the host or integration into divergent non-permissive hosts.

Excision gene mutations trap prophages

To investigate whether non-induced prophages have an elevated number of mutations, we measured the ratio of non-synonymous to synonymous substitution rates (dN/dS) within the set of induced to non-induced prophage pairs, and their associated hosts. We found an overall elevated mutation rate in prophages (mean = 0.89, median = 0.18) compared with the host genome (mean = 0.16, median = 0.095, P < 2 × 10⁻¹⁶, Wilcoxon rank-sum test), but no significant difference between induced to induced or induced to non-induced prophage pairs or their host genomes (P = 0.99 and P = 0.62, respectively, Wilcoxon rank-sum test; Fig. 3c and Supplementary Table 8). Comparing gene substitution rates, we found 143 genes with elevated dN/dS rates (>1), indicating diversifying selection (Fig. 3d and Supplementary Table 9). Notably, the majority (110 out of 143) of these genes lacked a known function and 40% (57 out of 143) were associated with DGRs, highlighting an active and not yet deciphered role of DGRs within gut prophages. Importantly, within the non-induced prophages we found a significant increase in the dN/dS substitution rates in genes involved in integration and excision (P = 0.002, Wilcoxon rank-sum test), suggesting that non-functional mutations in these genes provide a pathway for the inactivation of prophages.

To test whether the inactivation of integration and excision genes could trap a prophage inside its host genome, we constructed a gene deletion mutant of the inducible prophage Pomma by knocking out its DNA transposition protein (ΦPomma ∆tran) within the Bacteroides faecis host strain CC01414 (Extended Data Fig. 4e and Supplementary Table 10). From our previous inductions, we found that prophage Pomma was selectively induced by hydrogen peroxide and Stevia (37 mg ml⁻¹). Using these two inducers, we compared the induction of wild-type ΦPomma versus the ∆tran mutant using quantitative PCR (qPCR) and sequencing of chloroform and DNase-treated supernatants. qPCR analysis showed a 3.5 and 2.6 log increase in ΦPomma concentration within the wild type versus ∆tran deletion mutant in samples treated with Stevia and hydrogen peroxide, respectively (Fig. 3e and Supplementary Table 11). Through sequencing, we observed clear induction in the wild-type strain, with 35- and 17-fold increased coverage over the bacterial background in the Stevia-treated and hydrogen-peroxide-treated samples, respectively, whereas no increase over the bacterial background was detected in the ∆tran mutant strain (Fig. 3f).

Phyla-specific cues may govern induction

To determine whether prophage induction was linked to phylogeny, we examined the induction response across our ten conditions and standard growth control (Fig. 4a and Supplementary Table 3). Combined, the two concentrations of mitomycin C induced the largest number of prophages (n = 70) and the most Pseudomonadota prophages (n = 17). Hydrogen peroxide induced 43 prophages, including the largest number of Bacteroidota prophages (n = 35). However, these well-known induction agents exhibited only marginally increased induction compared to spontaneous induction during standard growth conditions (n = 36). The Caco2 human cell induction conditions induced 29 prophages from 32 tested hosts, with Bacteroidota (n = 16) followed by Pseudomonadota (n = 9) showing the largest numbers of induced prophages. Considerable overlap was observed between prophage induction in standard media and induction agents (mitomycin C, n = 25; hydrogen peroxide, n = 15; Stevia, n = 19; carbon depletion, n = 9; SCFA depletion, n = 11; Caco2 induction conditions, n = 5). Comparing induction conditions across each phylum, the only significant difference observed was within the Pseudomonadota phylum in response to 3 µg ml⁻¹ mitomycin C (P = 0.024, Fisher’s exact test).

**Fig. 4: Comparison of induction agents and analysis of polylysogeny within gut isolates.**

Polylysogeny and host genetics influence induction

We next investigated polylysogeny across our collection and its influence on induction (Fig. 4b). Polylysogeny was most prevalent within the Bacteroidota isolates, in which 28 out of 41 (68%) of lysogens had more than one inducible prophage compared with 11 out of 38 isolates (29%) across the other phyla (P = 0.002, Fisher’s exact test). We then compared whether polylysogeny influenced induction, observing a positive correlation between the number of co-inhabiting inducible prophages and conditions leading to induction of each prophage (τ = 0.22, P = 0.002, Kendall’s rank correlation; Fig. 4c). Prophages residing in polylysogens (n = 90) were induced on average in 2.7 conditions compared with 2.1 conditions in single lysogens (n = 35, P = 0.03, Wilcoxon rank-sum test), suggesting that polylysogeny may promote simultaneous prophage induction and reduce stability within lysogens.
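The comparison of polylysogeny between Bacteroidota and the other phyla is a test on a 2 × 2 table (28 of 41 versus 11 of 38 lysogens with more than one inducible prophage). A minimal, dependency-free sketch of a two-sided Fisher's exact test is given below. The function name is hypothetical, and because the exact software, counts and two-sided convention used in the study are not stated here, the result should not be expected to reproduce the reported P value to the digit.

```python
from math import comb


def fisher_exact_two_sided(a: int, b: int, c: int, d: int) -> float:
    """Two-sided Fisher's exact P value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x: int) -> float:
        # Hypergeometric probability of observing x in the top-left cell.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    observed = prob(a)
    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    # Sum the probabilities of all tables no more likely than the observed one.
    return sum(
        p
        for p in (prob(x) for x in range(lowest, highest + 1))
        if p <= observed * (1 + 1e-9)
    )


# Rows: Bacteroidota, other phyla. Columns: polylysogen, single lysogen.
p_value = fisher_exact_two_sided(28, 41 - 28, 11, 38 - 11)
```

Summing over all tables that are no more probable than the observed one is one common definition of the two-sided test; other conventions exist and can give slightly different values.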

Finally, we investigated differential induction within polylysogens by measuring the abundance of phage DNA in the supernatants of five highly similar (99% ANI) Bacteroides caccae isolates harbouring the same two prophages (ΦWilby and ΦPomma; Supplementary Table 11). We identified an overall preferential induction of ΦWilby within standard medium (P = 0.006), but not in hydrogen-peroxide-treated samples (P = 0.9, Wilcoxon rank-sum test), with isolate CC01407 demonstrating the most marked difference between the two phages (P = 0.026, paired t-test; Fig. 4d). Calculating the ratio of ΦWilby over ΦPomma within each isolate, we found a significant variance of means between the isolates in both standard medium (P = 0.012) and hydrogen peroxide (P = 0.0008, analysis of variance (ANOVA)). These results implied that the host genetic background, even within highly similar isolates, may affect prophage induction. We previously identified phage ΦPomma as a transposable prophage that does not use site-specific integration, but randomly inserts into the host genome³⁹. To investigate the prophage integration sites within our isolates, we used long-read sequencing on the five B. caccae strains. Genomic analysis identified ΦWilby integrated into the same tRNA gene location, which is characteristic of site-specific integration; however, the transposable prophage ΦPomma was found in four different genomic locations within the five isolates (Fig. 4e), implicating the integration site as a possible driver for the observed differential induction within these isolates.

Discussion

The high microbial load within the human gut represents an optimal environment for temperate phages, as frequent interactions with their hosts provide ample opportunity for lysogeny¹². Concordantly, the majority of bacteria within the gut are predicted to be lysogens, with up to 90% of bacteria harbouring at least one prophage^11,12. However, the degree to which these prophages engage in lytic replication is poorly understood. Using our defined culture collection of 252 gut bacterial isolates, we predict that the majority harbour prophage-like elements (94%), but find that only a fraction of predicted prophages could be experimentally induced in pure culture (18%). Caveats to our approach include experimental cut-offs for detection and minimum amounts of DNA required for sequencing, which could exclude detection of low-level-inducing prophages. Moreover, there are probably biases towards induction and detection of Caudoviricetes prophages and, indeed, all but one prophage (from the Inoviridae family) belonged to this class. Moreover, considering little is known about prophage induction triggers within the gut, it is plausible that some of our isolates carry prophages that were not induced due to a lack of appropriate induction triggers. To address this, we constructed a synthetic microbiome community and co-cultured it with or without human cells to simulate the biologically relevant conditions within the human gut. Within the community co-culture, we estimate that around 29% of prophages were induced, with around half of these induced only in co-culture with human epithelial Caco2 cells. However, whether this induction was triggered by human cell factors or was the result of spontaneous induction mirroring the shift in host taxa was unclear^36,40. We therefore investigated human host factors in the absence of community effects, using 32 pure culture isolates exposed to Caco2 cell monolayers, Caco2 cellular lysates and DMEM cell culture medium alone. 
We observed a modest increase in induction with lysed Caco2 cellular products compared with cell culture medium or intact cells, suggesting that human cellular lysis products act as prophage induction triggers. This is in accordance with previous observations of temperate virion expansion found in patients with inflammatory bowel disease that are associated with increased inflammation and cell death⁴.

Recent advances have highlighted the complexities governing prophage induction within natural environments, ranging from SOS-independent induction triggers to interprophage competition and interference by defence mechanisms^{41,42,43,44,45,46,47,48}. Across our pure culture and community inductions, only a minority of predicted prophages were detected to undergo lytic replication. We therefore propose that, although the genetic pool of integrated prophages within the gut is large, only a fraction of these will readily undergo lytic replication. This is consistent with previous studies estimating low induction rates within the human gut and reduced lytic infection rates of temperate gut phages^25,49. Furthermore, we detect distinct gene enrichment patterns where non-induced prophage predictions encoded fewer structural and lysis-associated genes, indicating that a portion of high-quality predictions might consist of prophage remnants or poor prophage predictions. Moreover, non-induced predictions with high sequence similarity to experimentally induced prophages exhibited increased non-synonymous substitution rates in integrase and excision-related genes. Deletion of one of these genes in an active prophage led to complete abolishment of induction, providing evidence for a genetic pathway towards prophage domestication.

A considerable portion of our isolates (52%) with inducible prophages were polylysogens, harbouring more than one replicating prophage⁴⁸. We found a positive correlation between polylysogeny and successful prophage induction conditions, which is consistent with previous reports of phage anti-repressor proteins targeting non-cognate prophages leading to synchronized prophage induction⁵⁰. Finally, we show that induction of polylysogenic prophages varied between near identical isolates, which correlated with divergent prophage integration sites within the host genome. Thus, prophage induction is complex and influenced by growth condition, polylysogeny and prophage integration site. In summary, we demonstrate the feasibility of culture-based approaches to provide insights into temperate phage biology and their interactions within human-associated commensals, and provide a validated collection of phage–host pairs for future use in synthetic biology, microbiome and biotechnological advances.

Methods

Bacterial culture conditions

A culture collection of 252 bacterial isolates previously isolated and sequenced from human gut samples was used for prophage induction⁵¹. All bacterial culture work was performed in yeast-extract casitone fatty acid (YCFA) medium at 37 °C under anaerobic conditions (Whitley A95 anaerobic workstation) containing 10% carbon dioxide, 10% hydrogen and 80% nitrogen⁵². Each isolate was streaked onto YCFA agar plates and incubated for 24 h before a single colony was inoculated in 1 ml YCFA medium in a 96-well plate and incubated for 24 h. Frozen stocks of the 96-well master plates were maintained in glycerol suspension (25%, v/v) at −80 °C. Before each induction, 96-well plates containing 1 ml YCFA were inoculated from the frozen master plate and grown overnight.

Bacterial phylogeny and prophage prediction

A set of 40 single-copy marker genes was extracted from the 252 bacterial isolates using progenome-classifier⁵³ and translated into amino acid sequences using SeqKit⁵⁴ (v.2). The protein sequences were concatenated and aligned using MAFFT⁵⁵ (v.7.310) before gaps were trimmed with trimAl⁵⁶ (v.1.4.1). Maximum-likelihood trees were constructed using RAxML⁵⁷ (v.8.2.12) with the PROTGAMMALGF model and 100 bootstrap replicates, and visualized in iTOL⁵⁸. Bacterial clusters sharing 99% ANI were identified using dRep⁵⁹ (v.3.0.0) with the ‘-pa 0.9 -sa 0.99’ flags. Prophage regions were predicted using Virsorter⁶⁰, Vibrant⁶¹ (default settings) and VirFinder⁶² (minimum length 5 kb; 0.7 score; and P value 0.05). Completeness was predicted using CheckV⁶³ and contaminating bacterial regions were removed. Trimmed predictions were located within their cognate bacterial genome and overlapping predictions were merged using R IRanges⁶⁴ (v.2.28.0).
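The final step above, merging overlapping predictions from several tools into single prophage regions, can be sketched in a few lines. This is a Python analogue written for illustration, not the IRanges code used in the study; the function name and the coordinates are hypothetical. Unlike the default behaviour of some range libraries, this sketch merges only intervals that actually overlap and leaves directly adjacent ones separate.

```python
from typing import List, Tuple

Interval = Tuple[int, int]  # (start, end), inclusive coordinates on one contig


def merge_overlapping(intervals: List[Interval]) -> List[Interval]:
    """Collapse overlapping intervals into non-overlapping regions."""
    merged: List[Interval] = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend its end if needed.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# Hypothetical predictions from different tools on the same contig.
predictions = [(40_000, 55_000), (1_000, 42_000), (90_000, 120_000)]
regions = merge_overlapping(predictions)
```

The first two predictions overlap and become one region spanning 1,000 to 55,000, while the third remains a separate region.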

Pure culture prophage induction and sequencing

Prophages were induced by one of two methods. (1) Overnight starter cultures were diluted 1:50 in 1.5 ml standard YCFA medium and grown for 5 h before the addition of mitomycin C¹⁹ (0.3 or 3 μg ml⁻¹, M4287, Sigma-Aldrich) or hydrogen peroxide²⁰ (0.5 mM, H1009, Sigma-Aldrich). (2) Starter cultures were diluted 1:50 directly into standard YCFA medium, YCFA medium supplemented with Stevia²³ (3.7 or 37 mg ml⁻¹, SweetLeaf, organic Stevia leaf extract), carbon-depleted medium²² (YCFA medium with 50% reduced carbon source) or SCFA-depleted²¹ medium (YCFA medium without SCFAs). All cultures were then grown for 20–25 h followed by centrifugation at 4,000g for 30 min and 1 ml supernatants were collected. The supernatants were treated with 10 μg ml⁻¹ DNase I (DN25, Sigma-Aldrich) and 0.01 volume RNase A (R6148, Sigma-Aldrich) for 1 h at 37 °C. Viral particles were precipitated in 7% PEG 8000 and 0.3 M NaCl overnight at 4 °C, followed by centrifugation at 14,000g for 30 min, after which the pellets were dissolved in 50 μl TE buffer at 4 °C. Next, 20 μl of each sample was mixed with 5 μl loading dye containing 0.8% SDS and 60 mM EDTA and heated at 65 °C for 10 min. The samples were loaded onto 0.4% agarose gels and run in TAE for 1.5 h, followed by visualization of phage-sized (about 50 kb) DNA bands using Image Studio Lite (LI-COR Biosciences) with a sample-to-control well signal ratio cut-off of 0.03 (ref. ⁶⁵). Samples with suspected viral DNA were treated with 0.5% SDS and 100 μg ml⁻¹ proteinase K at 55 °C for 1 h followed by a 10 min inactivation at 65 °C. Phenol–chloroform–isoamyl alcohol (25:24:1) extraction was performed followed by sodium acetate (0.3 M final) and 70% ethanol precipitation with 0.4 mg ml⁻¹ glycogen overnight at 4 °C. The DNA quantity was validated using Qubit (Thermo Fisher Scientific), with a minimum of 2 ng µl⁻¹ required for sequencing. From these, a subset of samples was selected for sequencing as follows. First, all of the samples grown in standard YCFA medium or induced with mitomycin C were selected. Second, samples from at least one isolate within a bacterial cluster (99% ANI) grown in the remaining five induction conditions were selected, except for the Fusobacteriota isolate, which was sequenced only in standard medium and mitomycin C. For 17 out of 84 clusters, more than one isolate was sequenced in all conditions. Nextera-XT libraries were constructed and sequenced on either the Illumina NextSeq 2000 or Illumina NextSeq 550 system.

Human cell culture prophage induction

Human colonic epithelial immortalized cells (Caco2 TC7⁶⁶; genotype verified by the AGRF Human Cell Line Identification Service) were maintained in Dulbecco’s modified Eagle medium (DMEM, low glucose, GlutaMAX supplement, pyruvate) (Thermo Fisher Scientific), containing 10% FCS (Bovogen) in 5% CO₂ at 37 °C in 75 cm² flasks (Nunc, Thermo Fisher Scientific). Cells were routinely tested for the presence of mycoplasma contamination (MycoStrip, Invivogen) and were confirmed to be mycoplasma negative. For sonicated cell inductions, 4 × 10⁷ cells in 5 ml DMEM (without FCS) were sonicated on ice (5 cycles, 30 s on and off at 40% frequency). Efficient lysis of the sonicated Caco2 cells was confirmed by bright-field microscopy, and the sonicated cells were stored at −20 °C. Before the addition of bacterial cells, sonicated cells were thawed on ice and resuspended in pre-reduced DMEM to a final cell density of 5 × 10⁵ cells per ml in the Whitley A95 Anaerobic Workstation (Don Whitley Scientific) at 37 °C. Sonicated cells in DMEM (2 ml) were added to 6-well tissue culture plates to achieve a total of 1 × 10⁶ cells per well.

Two days before induction, 1 × 10⁶ Caco2 cells were seeded in 6-well tissue culture plates (Nunc, Thermo Fisher Scientific) in a final volume of 2 ml per well with DMEM containing 10% FCS and incubated for 48 h at 37 °C under 5% CO₂. Caco2 confluent cell layers were serum starved under anaerobic conditions for 2 h by replacing the cell medium with 2 ml pre-reduced DMEM (no FCS) per well in the Whitley A95 anaerobic workstation at 37 °C.

Individual working stocks of bacterial isolates were prepared from overnight bacterial cultures that were centrifuged at 4,000g for 10 min, resuspended to an optical density at 600 nm (OD₆₀₀) of 5.5, combined 1:1 with 150 μl 50% glycerol and stored as frozen glycerol stocks (25% final glycerol). Glycerol stocks of individual bacterial isolates were thawed and 8.5 µl of each isolate was added to 6-well tissue culture plates containing 2 ml pre-reduced DMEM medium only, 2 ml sonicated Caco2 cells (total cell number, 1 × 10⁶ cells per well) or confluent Caco2 cell layer. All cultures were grown for 24 h followed by centrifugation at 4,000g for 30 min after which 1.8 ml supernatants were collected, and viral DNA was extracted and sequenced on the Illumina NextSeq 2000 system as previously described.

Regions of interest

Reads were trimmed using Trimmomatic⁶⁷ (v.0.38) (SLIDINGWINDOW:4:25 MINLEN:100) and used to identify induced prophages using two approaches. First, high-quality prophage predictions (>50% completeness) were validated for induction as follows. Read coverage for each library was obtained on its corresponding genome using Bowtie2 (ref. ⁶⁸) (v.2.3.5) (default settings). Genome coverage in 100 bp increments was obtained using Samtools⁶⁹ (v.1.9) and Deeptools⁷⁰ (v.3.1.3), and the average modified z score, coverage fold increase and Cohen’s D of prophage regions were calculated as follows:

$$z\text{-score}_{\mathrm{ave}}=\mathrm{mean}\left(\frac{0.6745\times ({x}_{p}-\widetilde{x})}{\mathrm{median}\left(\left|{x}_{h}-\widetilde{x}\right|\right)}\right)$$

(1)

where z-score_ave is the average z score of the predicted region, x_p is 100 bp coverage increments of the phage region, x_h is 100 bp coverage increments of the host and $\tilde{x}$ is median coverage of the host and

$$\text{Cohen's }D=\frac{\mathrm{mean}({x}_{h})-\mathrm{mean}({x}_{p})}{\sqrt{\frac{{S}_{h}^{2}+{S}_{p}^{2}}{2}}}$$

(2)

where Cohen’s D is the effect size of prophage versus host coverage, and S_h and S_p are the s.d. of the host and phage coverage, respectively⁷¹. Regions with a minimum average modified z score of 3.5 or an average twofold coverage and Cohen’s D larger than 0.7 were retained. A custom Python script was then used to refine the start and stop positions of the prophage regions within each genome, removing flanking 100 bp increments with coverage less than 25% of the mean prophage coverage (code is available at https://doi.org/10.26180/29946902.v1). In a second approach, regions of increased coverage were identified without previous prophage predictions using hafeZ⁷² (v.1.0.2) (default settings with the -N -S flags). Some of the identified regions of interest were found to be split across several host contigs. To resolve these into full-length phage contigs, de novo assembly using metaviralSPAdes⁷³ (default settings) was performed and contigs overlapping with hafeZ predictions were retained. The resulting contigs were dereplicated at 99% ANI over 85% AF using scripts from the CheckV repository and the longest representative of each prophage was retained for further analysis.
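The coverage statistics and flank trimming described above can be illustrated with a minimal Python sketch. It is not the authors' script (that is available at the Figshare DOI given above). The function names, the use of the sample s.d., the definition of fold increase as a ratio of mean coverages, the reading of the median term in equation (1) as a median absolute deviation, the grouping of the retention rule and the use of the absolute value of Cohen's D are all assumptions made for illustration.

```python
import numpy as np

def coverage_stats(phage_cov, host_cov):
    """Return (average modified z score, fold increase, Cohen's D) for binned coverage."""
    x_p = np.asarray(phage_cov, dtype=float)  # 100 bp bins inside the predicted prophage
    x_h = np.asarray(host_cov, dtype=float)   # 100 bp bins across the host genome
    host_median = np.median(x_h)
    # Median absolute deviation of the host; a zero value would need special handling.
    mad = np.median(np.abs(x_h - host_median))
    z_ave = float(np.mean(0.6745 * (x_p - host_median) / mad))  # equation (1)
    fold = float(x_p.mean() / x_h.mean())  # assumed definition of fold increase
    pooled_sd = np.sqrt((x_h.var(ddof=1) + x_p.var(ddof=1)) / 2)
    cohens_d = float((x_h.mean() - x_p.mean()) / pooled_sd)  # equation (2), as printed
    return z_ave, fold, cohens_d

def is_retained(z_ave, fold, cohens_d):
    # Assumed reading of the rule: z >= 3.5, OR (fold >= 2 AND |D| > 0.7).
    return z_ave >= 3.5 or (fold >= 2.0 and abs(cohens_d) > 0.7)

def trim_flanks(phage_cov, min_fraction=0.25):
    """Return (start_bin, end_bin) after dropping low-coverage flanking bins."""
    x = np.asarray(phage_cov, dtype=float)
    # The threshold uses the mean of the untrimmed region (an assumption).
    keep = np.flatnonzero(x >= min_fraction * x.mean())
    if keep.size == 0:
        return None
    return int(keep[0]), int(keep[-1]) + 1  # half-open interval of bin indices

# Hypothetical coverage values, for illustration only.
host = [10, 12, 9, 11, 10, 8, 12, 10]
phage = [2, 50, 60, 55, 3]
z_ave, fold, d = coverage_stats(phage, host)
print(round(z_ave, 3), round(fold, 2), is_retained(z_ave, fold, d), trim_flanks(phage))
```

Because equation (2) subtracts the phage mean from the host mean, an induced prophage gives a negative D in this sketch, which is why the magnitude is compared with 0.7 here.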

Identification of induced temperate phages

Proteins from the resulting contigs were predicted and annotated using PROKKA⁷⁴ (v.1.14.6) (default settings, --hmms) with the PHROGS³⁸ database. Furthermore, all proteins were scanned against the HMM databases provided by Cenote-Taker2 (ref. ⁷⁵) using HMMER⁷⁶ (v.3.3.1) hmmscan (-E 1e-9). To remove potential fragmented protein hits against hallmark genes, a custom phage database was constructed of genomes from refs. ⁹,³¹ and the INphared database (December 2021 version) dereplicated at 95% ANI over 85% AF using CheckV scripts⁷⁷. The same HMM searches were performed on proteins from the database and half the average length of proteins in the middle 80% of the length distribution was calculated as a cut-off for each Caudoviricetes hallmark gene (265 amino acids for terminase large subunit, 245 amino acids for portal protein and 186 amino acids for major head protein). To identify any non-Caudoviricetes genomes, HMM searches for Microviridae, Tectiviridae and Inoviridae were performed as follows. Microviridae hallmark VP1 proteins from ref. ⁷⁸ were made into HMM profiles using MAFFT v.7.310 with the standard settings followed by TABAJARA (-t 0.5 -p 50 -w 15 -b 15 -mb 15 -m c -gs 20 -md 3 -cs yes -mb 20)⁷⁹. Multiple-sequence alignments of the double jelly-roll hallmark protein of Tectiviridae were obtained from a previous study⁸⁰ and turned into HMM profiles using HMMER v.3.3.1 hmmbuild. These and the Inoviridae morphogenesis protein family HMMs provided by ref. ⁸¹ were searched against all proteins using hmmscan (-E 1e-9 and score ≥ 30). Contigs containing at least one viral hallmark gene were retained.
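One possible reading of the hallmark-gene length cut-off is sketched below in Python. It assumes that the "middle 80%" means proteins whose lengths fall between the 10th and 90th percentiles. The function name and the example lengths are hypothetical and do not come from the study.

```python
import numpy as np

def hallmark_length_cutoff(lengths, lower=10, upper=90):
    """Half the mean length of proteins between the lower and upper percentiles."""
    lengths = np.asarray(lengths, dtype=float)
    lo, hi = np.percentile(lengths, [lower, upper])
    # Keep the central part of the length distribution, dropping fragments and outliers.
    middle = lengths[(lengths >= lo) & (lengths <= hi)]
    return 0.5 * float(middle.mean())

# Hypothetical lengths (amino acids) of database hits to one hallmark gene.
example_lengths = [100, 500, 520, 530, 540, 550, 560, 580, 600, 2000]
cutoff = hallmark_length_cutoff(example_lengths)
print(cutoff)
```

Under this reading, a protein shorter than the cut-off would be treated as a likely fragment and would not count as a hallmark gene hit.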

Synthetic microbiome community prophage induction

Working stocks of the synthetic microbiome community were prepared by combining 1.5 ml of overnight bacterial culture diluted to OD₆₀₀ 0.7, after which the community was centrifuged at 8,000g for 10 min, resuspended in 10 ml fresh YCFA and stored as frozen glycerol stocks (25% final glycerol). Human Caco2 confluent cell layers grown for 48 h in 175 cm² tissue culture flasks (Nunc) in DMEM containing 10% FCS with 5% CO₂ at 37 °C were transferred to anaerobic conditions (Whitley A95 anaerobic workstation) and the medium in each flask was replaced with 70 ml pre-warmed and pre-reduced (overnight) DMEM (without FCS). Next, 200 μl of the frozen community stock was added to each tissue culture flask containing a Caco2 cell layer in 70 ml DMEM as well as to culture bottles containing 200 ml pre-reduced YCFA medium, both in five replicates. All cultures were grown anaerobically with samples (14 ml) taken at 24, 48 and 72 h. Total metagenome DNA extraction was performed using the FastDNA SPIN Kit for Soil (MP Biomedicals) on 1 ml of each sample. The remainder of the sample was centrifuged at 3,000g for 30 min and the supernatant was collected and filtered through 0.45 μm syringe filters (Acrodisc, Pall) followed by incubation for 15 min with 0.1 volumes of chloroform at room temperature. The samples were then centrifuged at 4,000g for 15 min and 9 ml of the aqueous phase was collected and treated with 10 μg ml⁻¹ DNase I (DN25, Sigma-Aldrich) and 120 μl RNase A (R6148, Sigma-Aldrich) for 1 h at room temperature. Viral particles were precipitated using 7% PEG 8000 with overnight incubation at 4 °C, centrifuged at 12,000g for 1 h and resuspended in 100 μl TE buffer. Viral DNA was extracted and sequenced on the Illumina NextSeq 2000 system as previously described.

Synthetic microbiome community prophage detection

Reads from the synthetic microbiome community were trimmed using Trimmomatic⁶⁷ (v.0.38) (SLIDINGWINDOW:4:25 MINLEN:100), decontaminated of human reads (GCA_000001405.29) using Bowtie2 (v.2.3.5) and Samtools (v.1.19) and de-interleaved using bbmap (v.39.06). A database of community prophage genomes (high-quality predictions n = 338) and bacterial host genomes (n = 78, masked for prophage regions using bedtools (v.2.26.0)) was constructed⁷⁰. Decontaminated community reads from samples were aligned against the database using Bowtie2 (default flags) and coverage was obtained using Samtools ‘coverage’. Host abundance was calculated from the Samtools outputs of total DNA extracted sequence libraries.

In viral-enriched samples, prophage species were regarded as induced if at least one representative genome was covered by reads over ≥85% of its length with a twofold increase in coverage (depth) over the mean host genome coverage (sum coverage of host contigs normalized by length) in a minimum of three out of five replicates. For detection of individual prophage genomes, a custom metagenomic read classification database was built using KrakenUniq (v.1.0.4)⁸² containing prophage and prophage-masked host genomes. For database construction purposes, phage sequences were assigned the NCBI taxonomy IDs of their host bacteria. NCBI taxonomic data were downloaded using the ‘krakenuniq-download --db taxonomy’ command on 7 July 2024. The KrakenUniq database was constructed with the krakenuniq-build command using Jellyfish (v.1.1.12) for k-mer extraction, a k-mer size of 21 and the --taxids-for-genomes option. Paired-end reads were merged using the read_merger.pl script within KrakenUniq and subsequently classified using the krakenuniq command with default parameters. On the basis of data from pure isolate inductions, a cut-off of 0.25 k-mer coverage, 10 reads and 100 unique k-mers was selected for calling detection of phage, and a cut-off of 10 reads and 18,000 unique k-mers was used for calling detection of bacterial host genomes. Prophage genomes were regarded as induced if they had a twofold increase in kmerDuplicity over mean host genome kmerDuplicity (sum duplicity of host contigs normalized to length) in a minimum of three out of five replicates. Prophages from undetected isolates were regarded as induced if detected in a minimum of three replicates. To estimate the number of detectable prophages within the community (prophages with at least 100 unique k-mers), pairwise distances between all prophage and prophage-masked host genomes were calculated using Mash (v.2.2.2) with a k-mer size of 21 and a sketch size of 5,000 (ref. ⁸³).
Neighbour-joining was applied to the resulting distance matrix as implemented in RapidNJ (v.2.3.2) with default parameters⁸⁴. Using the resulting tree, a metagenomic classification database was constructed using Expam (v.1.2.2.5) with a k-mer size of 21, and the number of unique k-mers per genome was obtained using the CountUniqueKmers.py script (https://github.com/seansolari/expam/scripts/database/CountUniqueKmers.py)⁸⁵.
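The replicate-based induction rule and the KrakenUniq detection thresholds described in this section can be expressed as a short Python sketch. The thresholds come from the text. The data layout, the function names and the choice to require that the same representative genome passes in three or more replicates are assumptions.

```python
def replicate_passes(breadth, phage_depth, host_depth, min_breadth=0.85, min_fold=2.0):
    """True if one replicate shows >=85% breadth and >=2x depth over the mean host depth."""
    return breadth >= min_breadth and phage_depth >= min_fold * host_depth

def species_induced(genome_replicates, min_replicates=3):
    """genome_replicates maps a representative genome to its per-replicate
    (breadth, phage_depth, mean_host_depth) tuples."""
    for replicates in genome_replicates.values():
        n_pass = sum(replicate_passes(*rep) for rep in replicates)
        if n_pass >= min_replicates:
            return True  # one qualifying representative genome is enough
    return False

def phage_detected(kmer_coverage, n_reads, n_unique_kmers):
    """KrakenUniq detection thresholds quoted in the text for phage genomes."""
    return kmer_coverage >= 0.25 and n_reads >= 10 and n_unique_kmers >= 100

# Hypothetical values for one prophage species with two representative genomes.
example = {
    "genome_1": [(0.95, 40, 10), (0.90, 25, 10), (0.60, 50, 10), (0.88, 21, 10), (0.99, 15, 10)],
    "genome_2": [(0.50, 5, 10)] * 5,
}
print(species_induced(example), phage_detected(0.3, 25, 450))
```

In this example genome_1 passes in three of five replicates, so the species would be called induced under the assumed reading.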

Taxonomic annotation and DGR identification

Viral taxonomy was assigned based on a combination of the protein alignment method previously described² against the INphared database and genus-level clustering using vContact2²⁹ against phage genomes in the custom-made database used for hallmark gene searches. In cases in which the taxonomic assignments from the protein voting and genus-level clustering methods differed, the lowest common classification was assigned. Species-level dereplication was performed at 95% ANI over 85% AF using scripts from the CheckV repository. DGRs were identified using DGRscan⁸⁶ with the default settings, and remote VRs were identified by querying the template repeat using BLASTn (v.2.7.1+) (-dust no -perc_identity 75 -qcov_hsp_perc 50 -ungapped -word_size 4). DGR-positive genomes were defined as genomes that both encoded a reverse transcriptase gene and contained repeat regions.
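The rule of assigning the lowest common classification when two methods disagree can be sketched as follows. This is a minimal illustration that assumes each assignment is an ordered lineage from the highest rank downwards. The function name and the example lineages are hypothetical.

```python
def lowest_common_classification(lineage_a, lineage_b):
    """Return the shared prefix of two lineages ordered from highest to lowest rank."""
    common = []
    for rank_a, rank_b in zip(lineage_a, lineage_b):
        if rank_a != rank_b:
            break  # stop at the first rank where the two methods disagree
        common.append(rank_a)
    return common

# Hypothetical assignments from the protein voting and clustering methods.
protein_vote = ["Duplodnaviria", "Heunggongvirae", "Uroviricota", "Caudoviricetes", "Crassvirales"]
gene_sharing = ["Duplodnaviria", "Heunggongvirae", "Uroviricota", "Caudoviricetes", "Unclassified"]
consensus = lowest_common_classification(protein_vote, gene_sharing)
print(consensus[-1] if consensus else "unassigned")
```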

Metagenomic read mapping

The fractional abundance and prevalence of induced prophages within gut viromes were calculated as described previously⁹ using the 1,241 human gut viromes described therein. Reads from each virome were competitively aligned to the temperate phage species genomes together with the custom database previously described. The number of reads and read coverage were obtained using Samtools ‘coverage’ (v.1.9). The fractional abundance of a genome was calculated as follows:

$$\text{Fractional abundance}\,=\,\frac{{{\rm{reads}}}_{{\rm{genome}}}/{{\rm{length}}}_{{\rm{genome}}}}{{\text{Total reads}}_{{\rm{virome}}}/\mathrm{50,000}}$$

(3)

and the sum fractional abundance was normalized to 1 as previously described⁸⁷. A genome was counted as present within a virome if at least 70% of the genome length was covered by reads.
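Equation (3), the normalization to a sum of 1 and the 70% presence threshold can be worked through in a few lines of Python. This is a sketch under stated assumptions: the function names and input dictionaries are hypothetical, and normalization is applied across all genomes supplied for a single virome.

```python
def fractional_abundances(read_counts, genome_lengths, total_virome_reads, scale=50_000):
    """Apply equation (3) per genome, then rescale so the values sum to 1."""
    raw = {
        genome: (read_counts[genome] / genome_lengths[genome]) / (total_virome_reads / scale)
        for genome in read_counts
    }
    total = sum(raw.values())
    if total == 0:
        return raw  # nothing mapped; leave all values at zero
    return {genome: value / total for genome, value in raw.items()}

def is_present(covered_bases, genome_length, min_breadth=0.70):
    """A genome counts as present if at least 70% of its length is covered by reads."""
    return covered_bases / genome_length >= min_breadth

# Hypothetical read counts and genome lengths for one virome.
reads = {"phage_A": 1000, "phage_B": 500}
lengths = {"phage_A": 50_000, "phage_B": 25_000}
abundances = fractional_abundances(reads, lengths, total_virome_reads=1_000_000)
print(abundances, is_present(36_000, 50_000))
```

Both example genomes have the same read density (0.02 reads per bp), so each receives half of the normalized abundance even though phage_A recruits twice as many reads.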

Analysis of non-induced prophages

Proteins encoded by predicted prophages were predicted using PROKKA v.1.14.6 (default settings, --hmms) and annotated using the PHROG database. The total gene counts per PHROG category and the presence–absence of PHROG categories within each genome were obtained for induced, high-quality predictions (>50% completeness) and low-quality predictions (<50% completeness). The percentage gene frequency change of PHROG categories between induced, high-quality and low-quality predictions was calculated for total genes and presence–absence counts as follows:

$${\rm{Frequency}}\,{\rm{change}}( \% )=\frac{100\times ({f}_{{\rm{cry}}}-{f}_{{\rm{in}}})}{{f}_{{\rm{in}}}}$$

(4)

where f_cry and f_in are the gene frequencies in the high completeness prediction and induced prophage set, respectively.
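As a worked example of equation (4), the following Python sketch computes the percentage frequency change for a few categories. The category labels and frequencies are invented for illustration and are not results from the study.

```python
def frequency_change(f_cry, f_in):
    """Equation (4): percentage change in gene frequency relative to the induced set."""
    return 100.0 * (f_cry - f_in) / f_in

# Hypothetical per-category gene frequencies: (non-induced prediction set, induced set).
example_frequencies = {
    "category_1": (0.15, 0.10),
    "category_2": (0.05, 0.10),
    "category_3": (0.08, 0.08),
}
changes = {name: frequency_change(f_cry, f_in) for name, (f_cry, f_in) in example_frequencies.items()}
print(changes)
```

A positive value means the category is more frequent in the non-induced predictions than in the induced prophages, and a negative value means it is depleted.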

High-quality predictions were aligned to induced prophages using BLASTn (v.2.15.0+) and pairs with a minimum of 95% ANI over 85% AF (CheckV anicalc.py⁶³ script) were further filtered to only include hosts that had been sequenced in the same condition(s) as the induced prophage. The same search was performed to identify induced–induced prophage pairs. The number of HGT and insertion–deletion events between the pairs was calculated using the R IRanges (v.2.28.0) and splicejam (v.0.0.77) packages, where an HGT event was defined as a gap of at least 50 bp within the alignment present in both members of a pair and an insertion–deletion event was defined as a gap (minimum 50 bp) present in one member of a pair but not the other⁸⁸. Gaps involving the ends of either prophage were excluded. Host ANI of prophage pairs was calculated using fastANI⁸⁹ (v.1.33). dN/dS ratios between prophage pairs were calculated using dRep (compare --SkipMash --S_algorithm goANI) and the dnds_from_drep.py⁹⁰ script.

Inactivation of ΦPomma in B. faecis isolate CC01414

Deletion of the DNA transposition protein gene within ΦPomma in B. faecis isolate CC01414 was achieved using the CRISPR–Cas-based system described previously⁹¹. First, we redesigned pB025, which contains the FnCas12a system, with a sgRNA targeting the DNA transposition protein gene (gene location, base pairs 5610–6524) along with a repair template containing 1,000 bp of homologous DNA upstream and downstream of this gene (Supplementary Table 10). This plasmid (pB025_09) was transformed into competent E. coli S17 and grown aerobically in LB medium supplemented with 100 µg ml⁻¹ ampicillin. B. faecis was grown in brain–heart infusion (BHI) liquid medium supplemented with haemin, resazurin and vitamin K3 (menadione) under anaerobic conditions. Conjugation was performed under anaerobic conditions and B. faecis transconjugants were selected for with 200 µg ml⁻¹ gentamicin and 25 µg ml⁻¹ erythromycin. A deletion mutant was identified before anhydrotetracycline (aTc) induction, presumably due to leaky expression. This deletion mutant was verified by PCR and Sanger sequencing, confirming the clean deletion of the DNA transposition protein gene (CC01414 ∆tran).

Induction of ΦPomma in CC01414 wild type and deletion mutant (∆tran) was performed as described previously using hydrogen peroxide (0.5 mM) and stevia (37 mg ml⁻¹), with three separate induction reactions for each condition and isolate. Lysates were treated with 2% chloroform, centrifuged for 20 min at 4,000g at 4 °C and DNase treated; phage precipitation with PEG, DNA extraction and sequencing were performed as described previously. qPCR of DNA extracted from phage lysates was performed in technical triplicate using SYBR Green I Master Mix (Roche Diagnostics) on the Roche LightCycler 480 system, with each reaction containing 1 μM of each primer, 2 μl of DNA template and 1× SYBR Green I Master Kit, in a final reaction volume of 20 μl. Cycle parameters were as follows: initial denaturation at 95 °C for 10 min; followed by 45 cycles of 95 °C for 20 s, 62 °C for 20 s, and 72 °C for 30 s. Primers were designed using Primer-BLAST and no cross reactivity to bacterial background was detected (https://www.ncbi.nlm.nih.gov/tools/primer-blast). The standard curve was produced with a gBlock sequence from IDT containing the sequence targeted by ΦPomma primers.

Differential prophage induction qPCR

Isolates were streaked onto YCFA plates and grown for 24 h. Three separate colonies from each isolate were inoculated into 1 ml YCFA broth and grown overnight. Overnight cultures were diluted 1:50 into 1.5 ml standard YCFA medium and hydrogen peroxide was added after 5 h of growth. All cultures were grown for an additional 20 h, and lysates were treated with 2% chloroform, centrifuged for 20 min at 4,000g at 4 °C and frozen at −80 °C until analysis was performed. qPCR was performed as previously described using 5 μl of DNA template, an annealing step of 60 °C for 30 s and an elongation step of 30 s. qPCR primer pairs were custom designed using Primer3 (https://primer3.org/). In silico PCR amplification (http://insilico.ehu.eus/user_seqs/PCR/) did not show cross reactivity of primers to the rest of the bacterial genome. Standard curves for primer efficiency analysis were generated by tenfold dilution in PCR-grade H₂O. Samples were diluted tenfold and qPCR was performed in triplicate. The efficiency of each primer was calculated as in equation (5) and corrected ΔC_t values were calculated as in equation (6):

$${\rm{Efficiency}}={10}^{-1/{\rm{slope}}}$$

(5)

$$\Delta {C}_{{\rm{t}}}=\frac{{\text{Efficiency}}_{x}^{{C}_{{\rm{t}},x}}}{{\text{Efficiency}}_{y}^{{C}_{{\rm{t}},y}}}$$

(6)
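The two qPCR calculations can be illustrated with a minimal Python sketch. It assumes that the slope in equation (5) comes from a linear fit of C_t against log₁₀ of the template dilution, and it reads equation (6) as a ratio in which each primer efficiency is raised to the power of its own C_t value. The function names and the standard-curve values are hypothetical.

```python
import numpy as np

def primer_efficiency(log10_dilution, ct_values):
    """Equation (5): efficiency = 10^(-1/slope) from a standard-curve fit."""
    slope, _intercept = np.polyfit(log10_dilution, ct_values, 1)
    return float(10 ** (-1.0 / slope))

def corrected_ct_ratio(eff_x, ct_x, eff_y, ct_y):
    """Equation (6), assumed reading: efficiency-corrected ratio for primers x and y."""
    return eff_x ** ct_x / eff_y ** ct_y

# Hypothetical tenfold dilution series with perfect doubling per cycle.
log10_dilution = np.array([0.0, -1.0, -2.0, -3.0])
ct_values = 20.0 + 3.321928 * np.array([0.0, 1.0, 2.0, 3.0])
efficiency = primer_efficiency(log10_dilution, ct_values)
ratio = corrected_ct_ratio(2.0, 20.0, 2.0, 22.0)
print(round(efficiency, 3), ratio)
```

An efficiency of 2 corresponds to perfect doubling of the amplicon in each cycle, which gives a standard-curve slope of about −3.32 for a tenfold dilution series.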

Long-read sequencing

Isolates were streaked onto YCFA plates and grown for 24 h. Single colonies were grown overnight in 40 ml of YCFA medium, pelleted by centrifugation at 4,000g for 10 min and washed four times in 1 ml of PBS. DNA was extracted using the Monarch HMW DNA extraction kit (New England Biolabs) according to the Gram-positive bacteria protocol, with modifications. Cells were lysed in 300 μl of STET buffer (8% sucrose, 5% Triton X-100, 50 mM EDTA, 50 mM Tris pH 8) containing 10 mg ml⁻¹ lysozyme, 300 μl of HMW gDNA tissue lysis buffer and 20 μl of proteinase K and incubated at 56 °C for 10 min. The lysates were treated with 10 μl of RNase A at 56 °C for 5 min followed by 300 μl of protein separation solution. The samples were mixed by inversion for 2 min then centrifuged at 4 °C for 20 min at 16,000g. The supernatants were collected and 550 μl of isopropanol was added to 800 μl of supernatant. The samples were inverted for 5 min, or until DNA was precipitated, and DNA was pelleted by centrifugation at 4 °C for 10 min at 12,000g. The resulting pellet was washed twice with 500 μl of gDNA wash buffer and resuspended in nuclease-free water. Library preparation and Oxford Nanopore MinION sequencing were performed using either the Oxford Nanopore ligation sequencing kit (SQK-LSK109) with native barcoding expansion kit (EXP-NBD114) (CC01407, CC01390, CC01401 and CC01405) or the rapid barcoding kit 96 (SQK-RBK110.96, CC01404). The resulting long reads were hybrid assembled with Illumina short reads into closed genomes using dragonfly (v.1.0.14) (CC01407, CC01390, CC01401 and CC01405) or into a near-complete genome using Unicycler⁹² (v.0.4.7) (CC01404) with subsequent scaffolding using RagTag⁹³ with the CC01407 genome as reference.

Statistical analysis and visualization

Significance of PHROG gene categories was calculated with Fisher’s exact test (two sided) and P values were adjusted with the Hochberg method using the R base stats (R v.4.1.3) and rstatix (v.0.7.0) packages. Pearson’s correlation between host ANI and phage pair inducibility, as well as Kendall’s rank correlation between the number of prophages within lysogens and prophage inducibility, was calculated and plotted using the R ggpubr (v.0.4.0) package. Significance of horizontal gene transfers and dN/dS data was calculated with the Wilcoxon rank-sum test (two sided) and adjusted by the Hochberg method using the R ggpubr (v.0.4.0) and rstatix (v.0.7.0) packages. Normality of qPCR fold change between induced prophages in polylysogens was assessed with the Shapiro–Wilk test and significance was tested with a paired t-test (two sided) using rstatix (v.0.7.0); preferential induction was tested with the Wilcoxon rank-sum test (two sided), and variance of means between isolates was calculated with ANOVA using the R base stats (R v.4.1.3) package. Genome maps were visualized using the R gggenomes (v.0.9.9.9000) package, and genome read coverage was visualized using R ggplot2 (v.3.5.1).
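For readers unfamiliar with the Hochberg adjustment named above, the step-up procedure can be written in a few lines of Python. The analyses in this study were run in R, so this is only an illustrative re-implementation of the standard procedure, with a hypothetical function name and example P values.

```python
def hochberg_adjust(p_values):
    """Hochberg step-up adjusted P values, returned in the input order."""
    m = len(p_values)
    # Visit P values from largest to smallest.
    order = sorted(range(m), key=lambda i: p_values[i], reverse=True)
    adjusted = [0.0] * m
    running_min = 1.0
    for rank, idx in enumerate(order):
        # The largest P value is multiplied by 1, the next by 2, and so on.
        candidate = (rank + 1) * p_values[idx]
        running_min = min(running_min, candidate)  # enforce monotonicity, cap at 1
        adjusted[idx] = running_min
    return adjusted

# Hypothetical raw P values from three tests.
adjusted = hochberg_adjust([0.01, 0.04, 0.03])
print(adjusted)
```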

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

All data for this study have been deposited in the European Nucleotide Archive (ENA) at EMBL-EBI under project number PRJEB64565 and accession numbers are provided in Supplementary Tables 1, 3 and 12. Bacterial isolates are available through the Australian Microbiome Culture Collection (AusMiCC). Metadata of viromes used in this study along with accession numbers are provided in Supplementary Table 5. Bioinformatic scripts and figure data are available at Figshare⁹⁴ (https://doi.org/10.26180/29946902.v1). Illustrations were made using Inkscape (https://inkscape.org).

Code availability

This study did not generate any new code. Analysis scripts are available at Figshare⁹⁴ (https://doi.org/10.26180/29946902.v1).

References

Camarillo-Guerrero, L. F., Almeida, A., Rangel-Pineros, G., Finn, R. D. & Lawley, T. D. Massive expansion of human gut bacteriophage diversity. Cell 184, 1098–1109 (2021).
Nayfach, S. et al. Metagenomic compendium of 189,680 DNA viruses from the human gut microbiome. Nat. Microbiol. 6, 960–970 (2021).
Gregory, A. C. et al. The Gut Virome Database reveals age-dependent patterns of virome diversity in the human gut. Cell Host Microbe 28, 724–740 (2020).
Clooney, A. G. et al. Whole-virome analysis sheds light on viral dark matter in inflammatory bowel disease. Cell Host Microbe 26, 764–778 (2019).
Gogokhia, L. et al. Expansion of bacteriophages is linked to aggravated intestinal inflammation and colitis. Cell Host Microbe 25, 285–299 (2019).
Rodriguez-Valera, F. et al. Explaining microbial population genomics through phage predation. Nat. Rev. Microbiol. 7, 828–836 (2009).
Chevallereau, A., Pons, B. J., van Houte, S. & Westra, E. R. Interactions between bacterial and phage communities in natural environments. Nat. Rev. Microbiol. 20, 49–62 (2022).
Zuo, T. et al. Human-gut-DNA virome variations across geography, ethnicity, and urbanization. Cell Host Microbe 28, 741–751 (2020).
Benler, S. et al. Thousands of previously unknown phages discovered in whole-community human gut metagenomes. Microbiome 9, 78 (2021).
Van Espen, L. et al. A previously undescribed highly prevalent phage identified in a Danish enteric virome catalog. mSystems 6, e0038221 (2021).
Govier, T. & Verwoerd, W. The promise and pitfalls of prophages. Preprint at bioRxiv https://doi.org/10.1101/2023.04.20.537752 (2023).
Anthenelli, M. et al. Phage and bacteria diversification through a prophage acquisition ratchet. Preprint at bioRxiv https://doi.org/10.1101/2020.04.08.028340 (2020).
Bobay, L. M., Touchon, M. & Rocha, E. P. C. Pervasive domestication of defective prophages by bacteria. Proc. Natl Acad. Sci. USA 111, 12127–12132 (2014).
Wang, X. et al. Cryptic prophages help bacteria cope with adverse environments. Nat. Commun. 1, 147–149 (2010).
Erez, Z. et al. Communication between viruses guides lysis-lysogeny decisions. Nature 541, 488–493 (2017).
Silpe, J. E., Duddy, O. P. & Bassler, B. L. Natural and synthetic inhibitors of a phage-encoded quorum-sensing receptor affect phage–host dynamics in mixed bacterial communities. Proc. Natl Acad. Sci. USA 119, e2217813119 (2022).
Browne, H. P. et al. Culturing of ‘unculturable’ human microbiota reveals novel taxa and extensive sporulation. Nature 533, 543–546 (2016).
Forster, S. C. et al. A human gut bacterial genome and culture collection for improved metagenomic analyses. Nat. Biotechnol. 37, 186–192 (2019).
Otsuji, N., Sekiguchi, M., Iijima, T. & Takagi, Y. Induction of phage formation in the lysogenic Escherichia coli K-12 by mitomycin C. Nature 184, 1079–1080 (1959).
Łoś, J. M., Łoś, M., Wȩgrzyn, A. & Wȩgrzyn, G. Hydrogen peroxide-mediated induction of the Shiga toxin-converting lambdoid prophage ST2-8624 in Escherichia coli O157:H7. FEMS Immunol. Med. Microbiol. 58, 322–329 (2010).
Oh, J.-H. et al. Dietary fructose and microbiota-derived short-chain fatty acids promote bacteriophage production in the gut symbiont Lactobacillus reuteri. Cell Host Microbe 25, 273–284 (2019).
Morris, R. M., Cain, K. R., Hvorecny, K. L. & Kollman, J. M. Lysogenic host–virus interactions in SAR11 marine bacteria. Nat. Microbiol. 5, 1011–1015 (2020).
Boling, L. et al. Dietary prophage inducers and antimicrobials: toward landscaping the human gut microbiome. Gut Microbes 11, 721–734 (2020).
Roux, S. et al. Minimum information about an uncultivated virus genome (MIUVIG). Nat. Biotechnol. 37, 29–37 (2019).
Lopez, J. A. et al. Abundance measurements reveal the balance between lysis and lysogeny in the human gut microbiome. Preprint at bioRxiv https://doi.org/10.1101/2024.09.27.614587 (2024).
Sutcliffe, S. G., Reyes, A. & Maurice, C. F. Bacteriophages playing nice: lysogenic bacteriophage replication stable in the human gut microbiota. iScience 26, 106007 (2023).
Shalon, D. et al. Profiling the human intestinal environment under physiological conditions. Nature 617, 581–591 (2023).
Adriaenssens, E. M. Phage diversity in the human gut microbiome: a taxonomist’s perspective. mSystems 6, e0079921 (2021).
Bin Jang, H. et al. Taxonomic assignment of uncultivated prokaryotic virus genomes is enabled by gene-sharing networks. Nat. Biotechnol. 37, 632–639 (2019).
Benler, S. et al. A diversity-generating retroelement encoded by a globally ubiquitous Bacteroides phage. Microbiome 6, 191 (2018).
Yutin, N. et al. Analysis of metagenome-assembled viral genomes from the human gut reveals diverse putative CrAss-like phages with unique genomic features. Nat. Commun. 12, 1044 (2021).
Reyes, A. et al. Viruses in the faecal microbiota of monozygotic twins and their mothers. Nature 466, 334–338 (2010).
Shkoporov, A. N. et al. The human gut virome is highly diverse, stable, and individual specific. Cell Host Microbe 26, 527–541 (2019).
Liu, M. et al. Reverse transcriptase-mediated tropism switching in Bordetella bacteriophage. Science 295, 2091–2094 (2002).
Roux, S. et al. Ecology and molecular targets of hypermutation in the global microbiome. Nat. Commun. 12, 3076 (2021).
Laurenceau, R. et al. Harnessing diversity generating retroelements for in vivo targeted hyper-mutagenesis. Preprint at bioRxiv https://doi.org/10.1101/2025.03.24.644984 (2025).
Doré, H. et al. Targeted hypermutation of putative antigen sensors in multicellular bacteria. Proc. Natl Acad. Sci. USA 121, e2316469121 (2024).
Terzian, P. et al. PHROG: families of prokaryotic virus proteins clustered using remote homology. NAR Genom. Bioinform. 3, lqab067 (2021).
O’Brien, S., Kümmerli, R., Paterson, S., Winstanley, C. & Brockhurst, M. A. Transposable temperate phages promote the evolution of divergent social strategies in Pseudomonas aeruginosa populations. Proc. R. Soc. B 286, 20191794 (2019).
Moreno-Gallego, J. L. et al. Virome diversity correlates with intestinal microbiome diversity in adult monozygotic twins. Cell Host Microbe 25, 261–272 (2019).
Silpe, J. E., Duddy, O. P. & Bassler, B. L. Induction mechanisms and strategies underlying interprophage competition during polylysogeny. PLoS Pathog. 19, e1011363 (2023).
Refardt, D. Within-host competition determines reproductive success of temperate bacteriophages. ISME J. 5, 1451–1460 (2011).
Azulay, G. et al. A dual-function phage regulator controls the response of cohabiting phage elements via regulation of the bacterial SOS response. Cell Rep. 39, 110723 (2022).
Guo, Y. et al. Control of lysogeny and antiphage defense by a prophage-encoded kinase-phosphatase module. Nat. Commun. 15, 7244 (2024).
Song, S. et al. CRISPR-Cas controls cryptic prophages. Int. J. Mol. Sci. 23, 16195 (2022).
Edgar, R. & Qimron, U. The Escherichia coli CRISPR system protects from λ lysogenization, lysogens, and prophage induction. J. Bacteriol. 192, 6291–6294 (2010).
Silpe, J. E. & Bassler, B. L. A host-produced quorum-sensing autoinducer controls a phage lysis-lysogeny decision. Cell 176, 268–280 (2019).
Silpe, J. E. et al. Small protein modules dictate prophage fates during polylysogeny. Nature 620, 625–633 (2023).
Mathieu, A. et al. Virulent coliphages in 1-year-old children fecal samples are fewer, but more infectious than temperate coliphages. Nat. Commun. 11, 378 (2020).
Lemire, S., Figueroa-Bossi, N. & Bossi, L. Bacteriophage crosstalk: coordination of prophage induction by trans-acting antirepressors. PLoS Genet. 7, e1002149 (2011).
D’Adamo, G. L. et al. Bacterial clade-specific analysis identifies distinct epithelial responses in inflammatory bowel disease. Cell Rep. Med. 4, 101124 (2023).
Stewart, C. S., Hold, G. L., Duncan, S. H., Flint, H. J. & Harmsen, H. J. M. Growth requirements and fermentation products of Fusobacterium prausnitzii, and a proposal to reclassify it as Faecalibacterium prausnitzii gen. nov., comb. nov. Int. J. Syst. Evol. Microbiol. 52, 2141–2146 (2002).
Mende, D. R. et al. ProGenomes: a resource for consistent functional and taxonomic annotations of prokaryotic genomes. Nucleic Acids Res. 45, D529–D534 (2017).
Shen, W., Le, S., Li, Y. & Hu, F. SeqKit: a cross-platform and ultrafast toolkit for FASTA/Q file manipulation. PLoS ONE 11, e0163962 (2016).
Katoh, K. & Standley, D. M. MAFFT multiple sequence alignment software version 7: Improvements in performance and usability. Mol. Biol. Evol. 30, 772–780 (2013).
Capella-Gutiérrez, S., Silla-Martínez, J. M. & Gabaldón, T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics 25, 1972–1973 (2009).
Stamatakis, A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30, 1312–1313 (2014).
Letunic, I. & Bork, P. Interactive tree of life (iTOL) v5: an online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 49, W293–W296 (2021).
Olm, M. R., Brown, C. T., Brooks, B. & Banfield, J. F. DRep: a tool for fast and accurate genomic comparisons that enables improved genome recovery from metagenomes through de-replication. ISME J. 11, 2864–2868 (2017).
Roux, S., Enault, F., Hurwitz, B. L. & Sullivan, M. B. VirSorter: mining viral signal from microbial genomic data. PeerJ 3, e985 (2015).
Kieft, K., Zhou, Z. & Anantharaman, K. VIBRANT: automated recovery, annotation and curation of microbial viruses, and evaluation of viral community function from genomic sequences. Microbiome 8, 90 (2020).
Ren, J., Ahlgren, N. A., Lu, Y. Y., Fuhrman, J. A. & Sun, F. VirFinder: a novel k-mer based tool for identifying viral sequences from assembled metagenomic data. Microbiome 5, 69 (2017).
Nayfach, S. et al. CheckV assesses the quality and completeness of metagenome-assembled viral genomes. Nat. Biotechnol. 39, 578–585 (2021).
Lawrence, M. et al. Software for computing and annotating genomic ranges. PLoS Comput. Biol. 9, e1003118 (2013).
Alexeeva, S., Guerra Martínez, J. A., Spus, M. & Smid, E. J. Spontaneously induced prophages are abundant in a naturally evolved bacterial starter culture and deliver competitive advantage to the host. BMC Microbiol. 18, 120 (2018).
Chantret, I. et al. Differential expression of sucrase-isomaltase in clones isolated from early and late passages of the cell line caco-2: evidence for glucose-dependent negative regulation. J. Cell Sci. 107, 213–225 (1994).
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012).
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).
Kieft, K. & Anantharaman, K. Deciphering active prophages from metagenomes. mSystems 7, e00084-22 (2022).
Turkington, C. J. R., Abadi, N. N., Edwards, R. A. & Grasis, J. A. hafeZ: active prophage identification through read mapping. Preprint at bioRxiv https://doi.org/10.1101/2021.07.21.453177 (2021).
Antipov, D., Raiko, M., Lapidus, A. & Pevzner, P. A. Metaviral SPAdes: assembly of viruses from metagenomic data. Bioinformatics 36, 4126–4129 (2020).
Seemann, T. Prokka: rapid prokaryotic genome annotation. Bioinformatics 30, 2068–2069 (2014).
Tisza, M. J., Belford, A. K., Dominguez-Huerta, G., Bolduc, B. & Buck, C. B. Cenote-Taker 2 democratizes virus discovery and sequence annotation. Virus Evol. 7, veaa100 (2021).
Eddy, S. R. Accelerated profile HMM searches. PLoS Comput. Biol. 7, e1002195 (2011).
Cook, R. et al. INfrastructure for a PHAge REference Database: identification of large-scale biases in the current collection of cultured phage genomes. Phage 2, 214–223 (2021).
Wang, H. et al. Gut virome of mammals and birds reveals high genetic diversity of the family Microviridae. Virus Evol. 5, vez013 (2019).
Ibrahim, B. et al. Bioinformatics meets virology: the European Virus Bioinformatics Center’s second annual meeting. Viruses 10, 256 (2018).
Yutin, N., Bäckström, D., Ettema, T. J. G., Krupovic, M. & Koonin, E. V. Vast diversity of prokaryotic virus genomes encoding double jelly-roll major capsid proteins uncovered by genomic and metagenomic sequence analysis. Virol. J. 15, 67 (2018).
Roux, S. et al. Cryptic inoviruses revealed as pervasive in bacteria and archaea across Earth’s biomes. Nat. Microbiol. 4, 1895–1906 (2019).
Breitwieser, F. P., Baker, D. N. & Salzberg, S. L. KrakenUniq: confident and fast metagenomics classification using unique k-mer counts. Genome Biol. 19, 198 (2018).
Ondov, B. D. et al. Mash: fast genome and metagenome distance estimation using MinHash. Genome Biol. 17, 132 (2016).
Simonsen, M., Mailund, T. & Pedersen, C. N. S. in Algorithms in Bioinformatics (eds Crandall, K. A. & Lagergren, J.) 113–122 (Springer, 2008).
Solari, S. M., Young, R. B., Marcelino, V. R. & Forster, S. C. Expam—high-resolution analysis of metagenomes using distance trees. Bioinformatics 38, 4814–4816 (2022).
Ye, Y. Identification of diversity-generating retroelements in human microbiomes. Int. J. Mol. Sci. 15, 14234–14246 (2014).
Cobián Güemes, A. G. et al. Viruses as winners in the game of life. Annu. Rev. Virol. 3, 197–214 (2016).
O’Donnell, S. & Fischer, G. MUM&Co: accurate detection of all SV types through whole-genome alignment. Bioinformatics 36, 3242–3243 (2020).
Jain, C., Rodriguez-R, L. M., Phillippy, A. M., Konstantinidis, K. T. & Aluru, S. High throughput ANI analysis of 90 K prokaryotic genomes reveals clear species boundaries. Nat. Commun. 9, 5114 (2018).
Olm, M. R. et al. Consistent metagenome-derived metrics verify and delineate bacterial species boundaries. mSystems 5, e00731-19 (2020).
Zheng, L. et al. CRISPR/Cas-based genome editing for human gut commensal Bacteroides species. ACS Synth. Biol. 11, 464–472 (2022).
Wick, R. R., Judd, L. M., Gorrie, C. L. & Holt, K. E. Unicycler: resolving bacterial genome assemblies from short and long sequencing reads. PLoS Comput. Biol. 13, e1005595 (2017).
Alonge, M. et al. Automated assembly scaffolding using RagTag elevates a new tomato system for high-throughput genome editing. Genome Biol. 23, 258 (2022).
Dahlman, S. et al. Data and code for ‘Isolation, engineering and ecological dynamics of temperate phages from the human gut’. Figshare https://doi.org/10.26180/29946902.v1 (2025).

Acknowledgements

This work was supported by an Australian Research Council (ARC) Discovery Project grant (DP210103296). S.D. and L.A.-F. were supported by Monash University Postgraduate Research Scholarships funding their doctoral studies; R.A.E. by an award from the National Institutes of Health (NIH NIDDK; RC2DK116713) and awards from the ARC (DP220102915, DP250103825 and FL250100019); S.C.F. by a CSL Centenary Fellowship; and J.J.B. by a National Health and Medical Research Council (NHMRC) Investigator Grant Leadership Level 1 (NHMRC; 2026130) and the Monash University Research Talent Accelerator (RTA) 2023 program. We acknowledge the staff at the Hudson Genomics Facility, Monash eResearch Team and the Victorian State Government Operational Infrastructure Scheme for support.

Funding

Open access funding provided by Monash University.

Author information

These authors contributed equally: Samuel C. Forster, Jeremy J. Barr

Authors and Affiliations

School of Biological Sciences, Monash University, Melbourne, Victoria, Australia
Sofia Dahlman, Laura Avellaneda-Franco, Ciaren Kett, Dinesh Subedi, Nathan Campbell & Jeremy J. Barr
Centre for Innate Immunity and Infectious Disease, Hudson Institute of Medical Research, Melbourne, Victoria, Australia
Emily L. Rutten, Emily L. Gulliver, Sean Solari, Michelle Chonwerawong, Remy B. Young, Jodee A. Gould, Jasmine D. Bell, Callum A. H. Docherty & Samuel C. Forster
Department of Molecular and Translational Sciences, Monash University, Melbourne, Victoria, Australia
Emily L. Rutten, Emily L. Gulliver, Sean Solari, Michelle Chonwerawong, Remy B. Young, Jodee A. Gould, Jasmine D. Bell, Callum A. H. Docherty & Samuel C. Forster
Centre to Impact AMR, Monash University, Melbourne, Victoria, Australia
Dinesh Subedi
School of Optometry and Vision Science, UNSW Medicine, University of New South Wales, Sydney, New South Wales, Australia
Dinesh Subedi
APC Microbiome Ireland & School of Microbiology, University College Cork, Cork, Ireland
Christopher J. R. Turkington & Neda Nezam-Abadi
Department of Molecular and Cell Biology, University of California, Merced, CA, USA
Juris A. Grasis
Monash Biomedicine Discovery Institute Department of Microbiology, Monash University, Melbourne, Victoria, Australia
Dena Lyras
College of Science and Engineering, Flinders University, Adelaide, South Australia, Australia
Robert A. Edwards

Contributions

J.J.B. and S.C.F. conceived and designed the study. R.A.E., D.L. and J. A. Grasis contributed ideas and expertise. S.D. performed in vitro inductions, molecular work, sequencing, informatic analyses and most of the data preparation. L.A.-F. assisted with informatic analyses, data interpretation, molecular work and sequencing. C.K. performed DNA extractions. D.S. performed qPCR assays. R.B.Y., J. A. Gould and E.L.R. assisted with molecular work and sequencing. E.L.R., E.L.G. and R.B.Y. isolated and assisted with the cultured bacterial isolates. N.C. assisted in Bacteroides gene editing. C.J.R.T. and N.N.-A. assisted with identification of induced prophages. E.L.R., E.L.G., J.J.B., C.A.H.D. and M.C. performed synthetic microbiome co-culture and pure-culture inductions. S.S. assisted in design of analysis methods for synthetic microbiome inductions. S.D., S.C.F. and J.J.B. wrote the paper. J.J.B. supervised all aspects of the work and all of the authors approved the final manuscript.

Corresponding author

Correspondence to Jeremy J. Barr.

Ethics declarations

Competing interests

S.C.F., S.S. and R.B.Y. are advisors to or employees of Biomebank. The other authors declare no competing interests.

Peer review

Peer review information

Nature thanks Benjamin Adler, Martha Clokie, Mart Krupovic and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data figures and tables

Extended Data Fig. 1 Methods overview of single-isolate and community induction.

a, Schematic of bacterial single-isolate inductions. Isolates were grown for 20–25 h in standard media, Mitomycin C (0.3 and 3 μg/mL), hydrogen peroxide (0.5 mM), Stevia (3.7 and 37 μg/mL), carbon-depleted or SCFA-depleted media (n = 2,016). Samples were centrifuged, DNase treated and PEG concentrated, and phage-sized DNA bands were detected on agarose gels. DNA was extracted from samples with phage-sized bands (n = 859), and samples with DNA concentrations >2 ng/µL were chosen for sequencing based on condition and bacterial cluster (n = 452), 19 of which failed library preparation. Second, a subset of isolates (n = 32) was induced in DMEM medium with or without sonicated or intact human colonic epithelial cells (Caco2) and DNA was extracted (n = 96), 3 of which failed library preparation. Prophages were predicted using VirSorter, VIBRANT and VirFinder (n = 1972), and predictions >50% complete were trimmed for bacterial flanking regions (n = 736). Prophage reads (n = 526) were aligned to the host genome, and predictions with >2-fold coverage and Cohen’s d > 0.7 or mean z-score >3.5 were retained. Further, reads were investigated for induction using hafeZ. De novo assembly using metaviralSPAdes was used to resolve predictions spanning host contigs. Predictions were dereplicated at 99% ANI and 85% AF, retaining the longest representative with at least one viral hallmark gene. b, Phylogenetic tree of bacterial isolates with heatmap depicting sequenced phage-enriched samples (shown in blue); Actinomycetota yellow (20/409), Fusobacteriota black (6/8), Bacillota blue (37/414), Pseudomonadota red (216/498) and Bacteroidota teal (247/786). c, Schematic of the synthetic community inductions. Isolates (n = 78) were co-cultured with or without a Caco2 cell monolayer for 24, 48 and 72 h. Host abundance was calculated on total metagenome DNA using read mapping. Prophage induction was detected in phage-enriched DNA at species level using read mapping, as well as on individual prophages using KrakenUniq.
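
The coverage-based retention rule in panel a can be illustrated with a minimal sketch. This is not the authors' code: the function names, the comparison of per-base read depth inside the predicted prophage against depth over the rest of the host genome, the pooled-variance form of Cohen's d, and the reading of the rule as "(fold > 2 and d > 0.7) or mean z-score > 3.5" are all assumptions made for illustration.

```python
from math import sqrt
from statistics import mean, stdev


def cohens_d(a, b):
    """Cohen's d between two samples, using a pooled standard deviation."""
    na, nb = len(a), len(b)
    va, vb = stdev(a) ** 2, stdev(b) ** 2
    pooled = sqrt(((na - 1) * va + (nb - 1) * vb) / (na + nb - 2))
    return (mean(a) - mean(b)) / pooled


def is_induced(prophage_depth, host_depth, min_fold=2.0, min_d=0.7, min_z=3.5):
    """Apply the legend's thresholds to per-base depths (hypothetical helper).

    prophage_depth: read depth at positions inside the predicted prophage.
    host_depth: read depth at positions in the remaining host genome.
    Assumes host_depth has non-zero mean and variance.
    """
    host_mean, host_sd = mean(host_depth), stdev(host_depth)
    fold = mean(prophage_depth) / host_mean
    d = cohens_d(prophage_depth, host_depth)
    # Mean z-score of prophage positions relative to the host background.
    mean_z = mean((x - host_mean) / host_sd for x in prophage_depth)
    # Assumed grouping of the three criteria; the legend is ambiguous.
    return (fold > min_fold and d > min_d) or mean_z > min_z
```

Under these assumptions, a prophage region whose depth clearly exceeds the host background is retained, while a region at background depth is not.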

Extended Data Fig. 2 Hankyvirus genus overview and host range of Hankyvirus and LoVEphage.

a, Annotated genome maps of the eight Hankyvirus genus phage species. Genomes are scaled by length with the ruler displayed at the bottom of the figure. Genes are coloured by PHROG categories and unknown genes are coloured in grey. DGR template and variable repeats are denoted with black lines. b, Host range of Hankyvirus and LoVEphage species induced in this study. Light green denotes hosts found in the NCBI RefSeq bacterial genome database; dark green denotes hosts in which the phage was actively induced in this study.

Extended Data Fig. 3 Phage species diversity and genomes with double variable repeat DGRs.

a, Gene-sharing network between induced prophage species (solid circles, n = 68) and database representatives (9920). Representatives are shown translucent and coloured by host phylum where applicable, otherwise in grey, except Crassvirales genomes from Yutin et al. 2021, which are highlighted with solid black-bordered grey circles³¹. b, Mean fractional abundance and prevalence of Caudoviricetes phages within gut viromes (1241) shown as in Fig. 2b, but highlighting Crassvirales genomes in large solid grey circles. c, Annotated genome maps of four phage species encoding a second variable repeat (VR) distal from the RT cassette. Genes are coloured by PHROG categories and unknown genes are in grey. DGR template and variable repeats are denoted with black lines.

Extended Data Fig. 4 Comparison of induced versus predicted prophage sequences and host nucleotide identity.

a, Length distribution of all predicted prophages in this study, separated by host phylum. Fusobacteriota were excluded as there was only a single isolate. A bimodal length distribution was observed for all phyla, with an initial peak at around 8 kb followed by a second peak at around 37 kb. b, Frequency of PHROG gene categories across induced (green, n = 134), high-quality (>50% completeness, orange, n = 736) and low-quality (<50% completeness, blue, n = 1236) predictions. Significant P values, calculated using two-sided Fisher’s exact test and adjusted using the Hochberg method, are shown above the bars. c, Host ANI comparisons of induced to induced (n = 222) and induced to non-induced (n = 231) isolates with high-similarity prophage pairs. Comparisons are coloured by genus and shaped by NCBI taxon. Host pairs with less than 80% similarity are grouped separately as too divergent for a reliable ANI score. d, Density of ANI comparisons of all isolates within the dataset, separated by phylum and coloured as in panel a. e, PCR amplification with phage-specific primers surrounding the targeted gene (DNA transposition protein), visualized by agarose gel electrophoresis (single experiment) alongside GeneRuler 1 kb DNA ladder, showing deletion of a ~900 bp fragment in the øPomma ∆tran mutant (gene product size, 915 bp).
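
The statistics named in panel b (two-sided Fisher's exact test with Hochberg adjustment) can be sketched as follows. This is an illustrative pure-Python implementation, not the authors' analysis code: the function names are hypothetical, the two-sided P value is taken as the sum of all table probabilities no larger than the observed one (a common convention, assumed here), and any counts passed in would need to come from the real gene-category tables.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact P value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of x counts in the top-left cell.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_obs = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    # Sum tables at least as extreme as the observed one (small tolerance).
    total = sum(p for p in map(prob, range(lo, hi + 1)) if p <= p_obs * (1 + 1e-9))
    return min(1.0, total)


def hochberg_adjust(pvalues):
    """Hochberg step-up adjusted P values, returned in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i], reverse=True)
    adjusted = [0.0] * m
    running_min = 1.0
    for rank, idx in enumerate(order):
        # The largest P value is multiplied by 1, the next by 2, and so on.
        running_min = min(running_min, pvalues[idx] * (rank + 1))
        adjusted[idx] = running_min
    return adjusted
```

In practice, each PHROG category would contribute one 2x2 table (category present or absent, in one prediction group versus another), and the resulting P values would be adjusted together.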

Supplementary information

Supplementary Information

The legends for Supplementary Tables 1–12.

Reporting Summary

Supplementary Tables 1–12

Supplementary Tables 1–12.

Peer Review File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Dahlman, S., Avellaneda-Franco, L., Rutten, E.L. et al. Isolation, engineering and ecology of temperate phages from the human gut. Nature 647, 698–705 (2025). https://doi.org/10.1038/s41586-025-09614-7

Received: 10 September 2023
Accepted: 10 September 2025
Published: 15 October 2025
Version of record: 15 October 2025
Issue date: 20 November 2025
DOI: https://doi.org/10.1038/s41586-025-09614-7
