Introduction

The organization of cellular processes is essential for life. Depending on the species and context, this organization can be achieved by strategies such as subcellular compartmentalization, cytoskeletal structures, and phase separation. Viruses are no exception to this rule1. Nucleus-forming bacteriophages have been recently described as a group of bacterial viruses that form a proteinaceous replication compartment with many functional similarities to the eukaryotic nucleus2,3. The phage nucleus houses and protects phage DNA2,3,4,5,6; it is the site of DNA replication, recombination, and transcription2,3,7; and it exhibits selective import of proteins and export of mRNA2,3,8,9. In addition to the subcellular organization provided by the phage nucleus, some of these phages also assemble structures of mature virions known as bouquets10, and most currently characterized nucleus-forming phages encode a tubulin (PhuZ) that can perform diverse roles such as positioning the phage nucleus within the host cell, delivering capsids to the phage nucleus for genome packaging, and rotating the phage nucleus to distribute capsids evenly across its surface2,3,11,12,13,14,15,16,17.

All currently characterized and putative nucleus-forming phages belong to one monophyletic clade that makes up the viral family Chimalliviridae13. This family shares a core genome13,18,19 of 68 unique genes, including chimallin (ChmA) which makes up the phage nucleus shell6,20,21 and two multisubunit RNA polymerases (msRNAP)7,22,23,24. The two msRNAPs are the virion RNA polymerase (vRNAP), which is encapsidated along with the DNA, and the nonvirion RNA polymerase (nvRNAP), which localizes to the phage nucleus22,25. At the earliest stage of infection, the phage injects its DNA into the host cell with several proteins, including the vRNAP, that are required for early infection25,26, into a membrane-bound early infection intermediate6,27,28,29 called the EPI (early phage infection) vesicle29. In Pseudomonas-infecting chimalliviruses, genome degradation is observed immediately after infection 3,11,16,28. During these first stages of infection, inside the EPI vesicle, the vRNAP transcribes early genes. Later, the nvRNAP transcribes mid to late genes in the phage nucleus7,24.

Studying diverse chimalliviruses reveals novel insights into the variations in the nucleus-based phage replication mechanism. PhiKZ-like Pseudomonas phages PhiKZ30, 201phi2-131, and PhiPA332 were the first phages where the nucleus-based replication mechanism was characterized2,3. This intricate replication strategy was initially discovered while exploring the phage-encoded tubulin PhuZ, which makes triple-stranded filaments that position the viral replication compartment during 201phi2-1 infections11,14,16. Since then, many different variations of this replication strategy have been observed. For example, unlike the PhiKZ-like phages, Serratia phage PCH45 does not degrade the host chromosome4,33. Erwinia phage RAY also does not degrade the chromosome and makes a five-stranded tubulin filament13 rather than a three-stranded one14. Escherichia phage Goslar makes a vortex of PhuZ filaments rather than a bipolar spindle and centers its phage nucleus in a bulge in the host cell rather than at midcell12. However, one commonality between all of these phages is that they use tubulin-based PhuZ filaments to help organize their infections. Yet despite its high conservation, both dominant negative PhuZ mutants2,3,11,12,13,15,16,34 and genetic knockouts35 show that PhuZ is not essential for the replication of nucleus-based phages: The expression of dominant negative PhuZ only has a ~ 50% decrease in infection efficiency11. This suggests that, while PhuZ is not essential for nucleus-based phage replication, it does improve infection efficacy, at least for the PhiKZ-like phages.

PhuZ is not only non-essential for the replication of nucleus-based phages, but it is also not part of the conserved core genome of the Chimalliviridae family13. In fact, a recent publication has featured a nucleus-forming phage that does not encode a PhuZ homolog (Vibrio phage Ariel, also known as phiKT1028), much like its close relative Vibrio phage pTD136. However, Ariel and pTD1 are part of a clade that includes PhuZ-encoding Vibrio phages Aphrodite1 and Eric (also known as phiKT1019)36.

In addition to a handful of phages like Ariel that lack PhuZ but belong to clades where most of the members encode a PhuZ homolog, an entire clade of chimalliviruses exists where none of the members have PhuZ homologs (Fig. 1). This raises questions about how the organization and dynamics of chimallivirus infections without PhuZ differ from those with PhuZ: If PhuZ normally positions and rotates the phage nucleus in PhuZ-encoding chimalliviruses, do PhuZ-less phages use a different method for nucleus positioning and rotation, or are one or both of these functions dispensable? Do PhuZ-less phages still form a phage nucleus that reorganizes the host cell during infection but have evolved not to depend on a tubulin-based filament, or do they no longer make a phage nucleus, opting to use a different replication strategy from previously characterized chimalliviruses?

Figure 1
figure 1

Phylogenetic trees of chimalliviruses. (A) A whole-genome phylogenetic tree made using VICTOR45 of representative chimalliviruses, highlighted in rainbow colors according to predicted genus assignment (Fig. S1), plus 10 outgroup phages that do not encode chimallin. Phages encoding a PhuZ homolog are marked with orange circles, and phages without PhuZ homologs are marked with blue X marks. The only clade where every member lacks PhuZ (besides singleton Vibrio phage Yong XC31) is highlighted in blue and labeled “Phuzzywuzzyviruses”. The color-coded clades seen in this whole genome-based tree contain the same members as a protein tree based on major capsid protein homologs (Fig. S2). (B) A whole genome phylogenetic tree made using VICTOR45 of all currently known members of the phuzzywuzzyvirus clade. For both trees, bootstrap values are displayed on the branches. More information on all phage genomes used in these trees can be found in Table S1.

In order to address these questions, we studied the PhuZ-less chimallivirus vB_EamM_Asesino (here termed simply Asesino). Asesino belongs to the clade of chimalliviruses where all members lack PhuZ (Fig. 1A,B). The host of Asesino is Erwinia amylovora, an agriculturally important plant pathogen that infects plants of the Rosaceae family, including apple and pear trees37. E. amylovora is a good model for studying the variations in nucleus-based phage replication since many diverse chimalliviruses across five different clades have been isolated that infect this host38,39,40 (Fig. 1A). By studying Asesino, we set out to determine how nucleus-based phage replication in the absence of a PhuZ homolog compares to previously characterized nucleus-forming phages.

Results

Phylogenetics

Analyzing the conservation of PhuZ across Chimalliviridae members reveals a distinct clade within the family where every member lacks PhuZ (Fig. 1A, blue). All known members of this clade are shown in Fig. 1B. Twenty-three of these phage infect Erwinia, and the remaining thirteen phage infect Salmonella, Proteus, Escherichia, Enterobacter, Vibrio, or Cronobacter (Fig. 1B). Based on the branch lengths in the whole genome tree, these phage appear to be descended from a common ancestor that lost its PhuZ homolog and subsequently evolved to infect a range of bacteria (Fig. 1A). Erwinia phage Asesino is part of this clade of PhuZ-less phages, hereafter referred to as phuzzywuzzyviruses due to their apparent evolutionary loss of PhuZ. In contrast to phuzzywuzzyvirus Asesino, previously characterized Vibrio phage Ariel (also known as phiKT1028) groups with a different clade of chimalliviruses, the Tidunavirus genus, and appears to have lost its PhuZ homolog more recently since it has closer relatives, such as Vibrio phage Aphrodite1, that encode PhuZ (Fig. 1A, light red). The multiple instances of nucleus-forming phages that appear to have lost PhuZ (Fig. 1A, blue X marks), including the puzzywuzzyvirus and Tidunavirus clades, suggest that it is adaptive to lose PhuZ in some circumstances.

Asesino phage nucleus formation

To determine if Asesino forms a phage nucleus, we infected E. amylovora with Asesino, stained the DNA with DAPI, and imaged the infected cells at 75 min post-infection (mpi) using fluorescence microscopy. We observed a bright central zone of concentrated DNA in infected cells consistent with formation of a phage nucleus. (Fig. 2A, magenta). To confirm that this concentrated mass of DNA is a phage nucleus made by Asesino, we tagged its ChmA homolog (gp039) with GFP on the N-terminus and expressed it during infection. GFP-ChmA formed a ring enclosing the bright DAPI-stained region, confirming the presence of a chimallin-based phage nucleus (Fig. 2B, cyan). Similar to previously characterized Erwinia phage RAY13 and Serratia phage PCH454, Asesino does not noticeably degrade the host DNA, leaving some DAPI staining outside the phage nucleus. This extranuclear DAPI staining co-localizes with GFP-tagged histone-like protein H-NS, which coats bacterial DNA, further suggesting that it is host DNA (Fig. 2C).

Figure 2
figure 2

Asesino makes a nucleus during its infection cycle. (A) DAPI staining shows a bright circular region consistent with a phage nucleus, as well as extranuclear DNA consistent with undegraded bacterial DNA. (B) Asesino’s chimallin homolog (gp039) tagged with GFP encloses the bright DAPI signal. (C) E. amylovora H-NS tagged with GFP localizes with the extranuclear DNA during infection, supporting the idea that this is undegraded host DNA. For all panels, yellow is FM4–64 (membrane stain), magenta is DAPI (DNA stain), cyan is GFP, and grayscale is brightfield. Scale bars represent 1 μm.

Infection organization

We next assessed whether Asesino organizes its infection similarly to previously characterized nucleus-forming phages. The phage-encoded metabolic enzyme thymidylate kinase (gp117) which is predicted to contribute to the synthesis of nucleotide pools localizes to the cytoplasm (Fig. 3A), whereas the nvRNAP β′ subunit 1 (gp040), which is required for transcription of mid to late phage genes7, and the phage-encoded RecA-like recombination protein (UvsX, gp238) localize to the phage nucleus (Fig. 3B,C). Virion structural proteins (major capsid protein gp091 and tail protein gp166) localize between the phage nucleus and the bacterial nucleoids (Fig. 3D,E), suggesting that mature viral particles accumulate in the space between the phage nucleus and the host chromosome. These results are similar to Erwinia phage RAY, which forms a phage nucleus but does not appear to assemble its mature viral particles into bouquets13.

Figure 3
figure 3

Protein localization in Asesino is consistent with other nucleus-forming phages. (A) Thymidylate kinase is localized in the cytoplasm. (B) A nonvirion RNA polymerase subunit and (C) RecA homolog UvsX are localized in the nucleus. (D) The major capsid protein and (E) a tail protein localize around the phage nucleus. For all panels, yellow is FM4-64 (membrane stain), magenta is DAPI (DNA stain), cyan is GFP, and grayscale is brightfield. Scale bars represent 1 μm.

Phage nucleus dynamics in the absence of PhuZ

Since Asesino makes a phage nucleus, we next determined if the effects we normally attribute to PhuZ were indeed missing during its replication cycle, or if they were somehow supplemented by other genes. To do this, we analyzed nucleus rotation dynamics and positioning. Since Erwinia phage RAY nucleus rotation was previously observed occurring most often at 60 mpi, and the RAY and Asesino infection cycles both take approximately 120 min, we observed Asesino infections at 60 mpi in time-lapse microscopy to see if any infected cells had rotating phage nuclei. However, nucleus rotation was not observed during Asesino infection (n = 46), supporting the idea that PhuZ is necessary for phage nucleus rotation.

Asesino phage nucleus positioning was quantified and compared to Erwinia phage RAY since both phages infect the same host, and neither degrades the host genome. In RAY, it was shown that lack of host genome degradation influences phage nucleus positioning, likely because the bacterial nucleoids take up space and occlude regions of the cell near the poles, preventing the phage nucleus from reaching those spaces even when PhuZ activity was inhibited using a dominant negative mutant13. Asesino appears not to position its nucleus as actively as RAY: When only one bacterial nucleoid is present, Asesino nucleus positioning is not biased toward midcell (Fig. 4A). However, similar to RAY, it appears that the presence of bacterial nucleoids aid in positioning Asesino’s nucleus closer to midcell when there is one on each side (Fig. 4B). It is interesting to note that 43% of uninfected E. amylovora cells appear to have two nucleoids that are visible due to the cell actively dividing (n = 109) while during Asesino infection 59% of cells appear to have two nucleoids (n = 113) and during RAY infection 65% of cells appear to have two nucleoids (n = 108). While this increased number of bacterial cells with two nucleoids during infection could be due to Asesino and RAY preferentially infecting actively dividing cells that have already replicated their chromosomes, it could also mean that infected cells with two nucleoids still only have one chromosome copy that is connected by strands of DNA that are too small to be visible by DAPI staining.

Figure 4
figure 4

Phage nucleus positioning in Asesino compared to RAY. (A) When only one bacterial nucleoid is present, the Asesino phage nucleus has a bias away from midcell positioning, with most phage nuclei being found between the pole and midcell (20–30% of the cell length, blue, n = 46), but the RAY phage nucleus has a slight bias toward midcell positioning (light green, n = 38). (B) When two bacterial nucleoids are present, both the Asesino (blue, n = 67) and RAY (light green, n = 70) phage nuclei are positioned at or near midcell, with Asesino having a slightly wider distribution.

Discussion

Even before the discovery of the phage nucleus, PhuZ had been shown to perform an important role in positioning phage DNA during the infection cycle of nucleus-forming phages11,14,16. Yet upon further investigation, it was discovered to be an accessory factor that is not essential for the nucleus-based phage replication cycle11,13,34,35. In studies using dominant negative or null mutants of PhuZ, the phage nucleus becomes lodged adjacent to the cell pole, occluding a significant fraction of the surface area available for capsid docking and DNA packaging which is proposed to be the reason for decreased virion production11,35,41. How, then, do the many chimalliviruses that lack a tubulin homolog entirely (Fig. 1A,B) avoid the problem of large portions of the phage nucleus surface area becoming inaccessible by apposition to the membrane? Here we studied the Erwinia-infecting phuzzywuzzyvirus Asesino in order to gain a better understanding of PhuZ-less chimallivirus replication.

We found that Asesino forms a nucleus-like structure that compartmentalizes phage DNA away from the cytoplasm and partitions proteins into two distinct compartments, similar to other nucleus-forming phages2,3,4,12,13,36. During Asesino infection, the bacterial chromosome is not degraded. This contrasts with Vibrio phage Ariel, the only other PhuZ-less phage whose replication mechanism has been studied to date, in which phage replication is accompanied by the degradation of bacterial DNA and the swelling of host cells into spheroids36. Cell swelling and bacterial genome degradation could be a mechanism for increasing diffusion around the phage nucleus. However, our results suggest that Asesino does not cause such drastic changes in cell shape, nor does it noticeably degrade the host DNA (Fig. 2). The Asesino phage nucleus is preferentially localized near midcell when there are two bacterial nucleoids present (Fig. 4B), and near the one-quarter cell position when only one nucleoid is present (Fig. 4A). This is similar to RAY phage nucleus positioning when a dominant negative PhuZ mutant is expressed13. By leaving the host DNA intact, the Asesino phage nucleus competes for space with bacterial nucleoids, which appear to act as obstacles that help center the Asesino nucleus when two nucleoids are present (Fig. 4B), as has been proposed for Erwinia phage RAY13. Additionally, it has been proposed that polysomes can be involved in physically separating DNA in bacterial cells42,43. A similar mechanism could be used by Asesino, where actively translating ribosomes cluster around the phage nucleus and act as physical buffers that center it between the host nucleoids.

We propose that each phage has evolved its own unique proteins and mechanisms for optimizing phage production and specializing on the microenvironment within the bacterial strain it infects. For instance, PhiKZ-like phages inject their genomes at the cell pole and cause the Pseudomonas cell to swell at midcell2,3 (Fig. 5A). This allows the phage nucleus to grow larger, replicate more phage genomes, and therefore produce more phage particles. In these phages, the PhuZ spindle positions the phage nucleus at midcell, where the bulge forms. In the absence of a tubulin spindle, the phage nucleus forms and grows at the cell pole, where a large fraction of its surface would be occluded, limiting exchange of molecules into and out of the phage nucleus and restricting the surface area accessible for capsid docking, as happens for PhiKZ-like phages with dominant negative or null mutants of PhuZ2,11,15,34,35 (Fig. 5B). It is tempting to speculate that centering the phage nucleus and bulging at midcell co-evolved to optimize progeny production in these phages. In the case of the PhuZ-less phage Ariel, it has adapted to its own unique challenge of replicating in Vibrio parahaemolyticus without a PhuZ spindle by degrading the host chromosome and causing the cell to swell and become spheroidal36 (Fig. 5C). A cytoskeletal filament would not be required to find the middle of an isotropic cell, thus alleviating the selective pressure for positioning: It is impossible for the phage nucleus to get lodged in a low-diffusion environment such as the cell pole when no cell pole exists. In the case of Erwinia phage Asesino, the host cell remains anisotropic, does not exhibit significant cell swelling, and does not appear to contain a cellular region that would be more conducive for phage replication, although it does appear to leave the host DNA intact (Fig. 5D), which could aid in positioning its phage nucleus13 (Fig. 4), as discussed above. Thus, Asesino may have evolved these traits to benefit its replication efficiency and best take advantage of its host in response to selection pressures that we currently do not understand.

Figure 5
figure 5

Infection models. Diagrams comparing (A) a PhiKZ-like Pseudomonas phage 201phi2-1 infected cell2; (B) a 201phi2-1 infected cell expressing a catalytically dead, dominant negative PhuZ mutant2; (C) a Vibrio phage Ariel infected cell36; and (D) an Erwinia phage Asesino infected cell.

While the phage nucleus provides protection against DNA-targeting host defenses by physically excluding harmful nucleases4,5,8, nucleus-forming phages are not exempt from arms races with their hosts4,33,35,44 and are under constant selective pressure from other types of host defense systems, including RNA-targeting CRISPR systems which target conserved genes. Since phage are incredibly adaptable and evolve quickly, it is not surprising to find clades of phage where non-essential conserved genes such as PhuZ have been lost. Depending on the circumstances, loss of PhuZ may be advantageous (e.g. to avoid Cas13 targeting) or disadvantageous (e.g. random loss through genetic drift that slightly decreases replication efficiency). Once lost, it may be more difficult for nucleus-forming phages to reacquire genes, compared to canonically-replicating phages, since the phage nucleus can act as a barrier that limits genetic exchange34.

While all chimalliviruses characterized to date share the nucleus-based phage replication strategy, each has its own unique variation on this theme. PhuZ-encoding chimalliviruses, such as the PhiKZ-like phages, as well as PhuZ-less chimalliviruses, such as Vibrio phage Ariel and Erwinia phage Asesino, all use unique methods for ensuring efficacious phage replication in their respective hosts (Fig. 5). While each strategy employed by chimalliviruses allows them to specialize on their specific host in their own unique ways, it is remarkable that at the same time, they share the core conserved nucleus-based phage replication mechanism.

Methods

Phylogenetic analyses

Whole genome phylogenetic trees were made by running VICTOR on amino acid data using the D6 formula45 and annotated in Illustrator (Adobe) based on VICTOR annotations and the presence of PhuZ based on NCBI PSI-BLAST results. Protein based phylogenetic trees were made using Clustal Omega46 and annotated in iTOL47. We selected representative chimalliviruses for constructing our phylogenetic trees because there are over 100 publically available chimallivirus genomes to date, and using all of these sequences would make this tree unreadable. Therefore, representatives were chosen to improve the clarity of this figure. To choose these representatives, first chimallin homologs were found using NCBI PSI-BLAST. All phage genomes that contained chimallin homologs were analyzed using ViPTree to confirm that they cluster with known nucleus-forming phages. Since they all cluster within this group (the Chimalliviridae family), we used VICTOR to compare the genomes. For any group of phages that VICTOR predicted to belong to the same species, one representative was chosen and the other genomes were discarded. However, as VICTOR predictions do not always agree with published taxonomy, we looked through the discarded genomes and added back any that were officially classified by the ICTV as separate species. This resulted in fewer than 100 phage genomes, which we then used as the representative chimalliviruses in Fig. 1A. Information on the phages used in Fig. 1 can be found in Table S1.

Bacterial growth and phage propagation conditions

Erwinia amylovora ATCC 29780 was generally grown on LB plates at room temperature or in rotating liquid LB cultures at 30 °C. Erwinia phage Asesino (vB_EamM_Asesino) and Erwinia phage RAY (vB_EamM_RAY) were propagated by infecting 0.5 mL of dense E. amylovora from an overnight culture grown in NBSYE (13 g/L Difco Nutrient Broth, 2.5 g/L Bacto-Yeast Extract, 5 g/L sucrose) with 10 μL of tenfold serial dilutions of phage lysate (original lysates are 109–1011 pfu/mL). After waiting 15 min for the phage to adsorb, the infected cells were mixed with 4.5 mL molten (~ 55 ℃) NBSYE 0.5% top agar and poured over an NBSYE 1.6% agar plate. These plates were incubated overnight at room temperature. The following day, fresh phage lysates were collected by soaking the plates exhibiting web lysis with 5 mL phage buffer (10 mM Tris pH 7.5, 10 mM MgSO4, 4 g/L NaCl, 1 mM CaCl2) for 5 h, collecting the liquid by aspiration, pelleting the cell debris by centrifugation (3220 rcf for 10 min), and filtering the clarified phage lysate through a 0.45 μm filter. Phage lysates were stored at 4 °C.

GFP fusions

GFP-tagged proteins were designed using GFPmut18. ChmA (gp039), nvRNAP β′1 (gp040), MCP (gp091), and UvsX (gp238) were tagged on the N-terminus, and TMK (gp117), a tail protein (gp166), and H-NS (host histone-like nucleoid structuring protein) were tagged on the C-terminus. These GFP constructs were inserted into pHERD-30T and transformed into E. amylovora via electroporation. Electroporation was done in 2 mm cuvettes using an electric pulse of 2.5 kV voltage with 25 μF capacitance and 200 Ω resistance. Cells were recovered in 1 mL SOC (20 g/L Bacto tryptone, 5 g/L Bacto yeast extract, 0.58 g/L NaCl, 0.19 g/L KCl, 10 mM MgSO4, 10 mM MgCl2, 20 mM glucose) at 30 °C for 2 h before being plated on LB plates with 15 μg/mL gentamicin sulfate at 30 °C overnight to select for transformants.

Fluorescence microscopy

Fluorescently labeled phage proteins were tagged with GFPmut1 and expressed from the pHERD-30T plasmid under the inducible control of the AraBAD promoter. 15 μg/mL gentamicin sulfate was used for selection prior to microscopy. To prepare bacterial cells for microscopy, imaging pads were prepared (1% agarose, 25% LB, and 0–0.2% arabinose depending on the particular construct imaged) in welled slides and inoculated with E. amylovora. The slides were incubated at 30 °C in humid chambers for 3 h for the bacteria to begin actively growing. They were then moved to room temperature and infected with 10 μL undiluted phage lysate (109–1011 pfu/mL). 8 μL of dye mix (12.5 μg/mL FM4–64, 25 μg/mL DAPI, 25% LB) was added to the microscopy pads immediately prior to imaging. Slides were imaged using a DeltaVision Elite deconvolution microscope (Applied Precision) and deconvolved using the aggressive algorithm in the DeltaVision softWoRx program (Applied Precision). All analysis was performed prior to deconvolution.

Nucleus rotation analysis

Time lapses were obtained between 60 and 90 mpi and observed to detect rotation, using the same fluorescence microscopy methods as above but with time-resolved imaging (4 s intervals for 2 min). Since no rotating phage nuclei were observed, no further analysis was performed to determine rotation dynamics.

Phage nucleus positioning

The length of the cell and the distance between the center of the phage nucleus and the cell pole was measured using FIJI. The position of the nucleus within the cell was calculated as the ratio of the distance from the phage nucleus to the pole over the length of the cell. The data were graphed as a relative frequency histogram using Prism (GraphPad).