Cryo-EM Structure of the relaxosome, a complex essential for bacterial mating and the spread of antibiotic resistance genes

Williams, Sunanda M.; Raffl, Sandra; Kienesberger, Sabine; Ilangovan, Aravindan; Zechner, Ellen L.; Waksman, Gabriel

doi:10.1038/s41467-025-60116-6

Download PDF

Article
Open access
Published: 27 May 2025

Cryo-EM Structure of the relaxosome, a complex essential for bacterial mating and the spread of antibiotic resistance genes

Nature Communications volume 16, Article number: 4906 (2025) Cite this article

5890 Accesses
3 Citations
2 Altmetric
Metrics details

Subjects

Abstract

Bacterial mating, or conjugation, was discovered nearly 80 years ago as a process transferring genes from one bacterial cell (the donor) to another (the recipient). It requires three key multiprotein complexes in the donor cell: a DNA-processing machinery called the relaxosome, a double-membrane spanning type 4 secretion system (T4SS), and an extracellular appendage termed pilus. While the near-atomic resolution structures of the T4SS and pilus are already known, that of the relaxosome has not been reported to date. Here, we describe the cryo-EM structure of the fully assembled relaxosome encoded by the paradigm F plasmid in two different states corresponding to distinct functional steps along the DNA processing reaction. By varying the structures of model DNAs we delineate conformational changes required to initiate conjugation. Mutational studies of the various protein-protein and protein-DNA interaction hubs suggest a complex sensitive to trigger signals, that could arise from cell-to-cell contacts with recipient cells.

Cryo-EM structure of a type IV secretion system

Article Open access 22 June 2022

Architecture of the outer-membrane core complex from a conjugative type IV secretion system

Article Open access 25 November 2021

Inner membrane components of the plasmid pKM101 type IV secretion system TraE and TraD are DNA-binding proteins

Article Open access 04 March 2025

Introduction

The nucleoprotein complex termed relaxosome contains the relaxase and other auxiliary transfer proteins, which assembles at the “origin of transfer” (oriT) region of a plasmid and prepares it for transfer to another bacterial cell^1,2. This process whereby plasmids spread among bacterial populations through cell-to-cell contact is termed bacterial conjugation, an important way to exchange genetic material and plays a crucial role in bacterial adaptation and evolution³. It is also one of the most effective means by which antibiotic resistance genes spread among bacterial populations, resulting in antimicrobial resistance (AMR)^4,5,6,7,8. AMR is considered one of the most severe global public health and development threats of this century⁹. The large family of F plasmids which we are focusing on here, are the main carriers of antibiotic resistance genes in E. coli and historically associated with resistance or ‘R factors’¹⁰.

All DNAs transferred via conjugation do so from an essential sequence called oriT, a stretch of DNA about 400 base pairs (bp) in the F plasmid^11,12,13. The F plasmid relaxosome complex contains, in addition to oriT, three plasmid-encoded proteins including the DNA-processing enzyme TraI (generally known as the “relaxase”) and 2 accessory proteins TraY and TraM, and IHF, a protein encoded by the bacterial genome (Supplementary Fig. 1a). To date, multiple binding sites for these proteins have been characterised on oriT (2 TraI binding sites (traI_helicase and traI_TE), 2 IHF binding sites (IHFa,b), 3 TraM binding sites (sbmA,B,C) and 2 TraY binding sites (sbyA,C); details in Supplementary Fig. 1b).

TraI is a multi-domain protein that consists of an N-terminal trans-esterase (TE) domain that contains its “relaxase” activity, followed by two helicase domains (one vestigial (VH), the other active (AH)), and a C-terminal domain (CTD) responsible for interactions with TraM (Supplementary Fig. 1c, d)^{14,15,16,17,18}. IHF, TraY, and TraM are non-catalytic proteins. IHF, a binary complex of two very similar proteins, IHFα and IHFβ, is known to induce a sharp U-turn bend on its binding site (Supplementary Fig. 1e)¹⁹. TraM is responsible for binding to both its cognate oriT regions and to the C-terminal tail of TraD (Supplementary Fig. 1f)^20,21, a T4SS protein that functions as recruitment platform for the relaxosome. TraY has DNA-bending ability and is known to stimulate TraI activity when bound to oriT²².

Once formed, the relaxosome is recruited to the T4SS²³ and the relaxosome-T4SS super-complex may rest quiescent until contacts between donor and recipient cells are established through the pilus²⁴, activating the relaxosome. Activation initiates a yet uncharacterised cascade of molecular events resulting in the nicking of one of the plasmid DNA strands (the T-strand to be transferred) at a site called “nic” (Supplementary Fig. 1b) by the TE activity of TraI. The TraI catalysed nicking is stimulated by the other proteins in the relaxosome^22,25. Nicking is followed by covalent attachment of TraI to the free 5’ phosphate resulting from the nicking reaction yielding a protein-ssDNA conjugate that, once unwound from its complementary strand (the R strand), is transported through the T4SS into the recipient cell^26,27,28,29. To achieve this, another molecule of TraI functions as a helicase, unwinding the dsDNA in the 5’ to 3’ direction^14,30,31. Once transfer is completed, both the transferred T-strand and the copy retained by the donor (the R-strand) undergo replication to duplex DNA thereby increasing the abundance of the transferred genome (and any antibiotic resistance genes it may carry) within a bacterial population³².

A fully functional assembly of the relaxosome is crucial for effective nicking and transfer of the plasmid. Mutation of a catalytic tyrosine in the TraI active site reduces plasmid transfer by ~ 5000-fold³³. oriT was shown to have functional domains which individually contribute to nicking and transfer functions³⁴. Deletions of oriT binding sites and/or in the absence of other relaxosome proteins, TraI can only bind to ssDNA and cannot initiate transfer from ds oriT³¹.

Molecular events controlling the start of conjugative gene transfer are yet to be understood. Some structural information is available on individual protein components of the relaxosome in gram-negative (Supplementary Fig. 1d–f)^14,19,35 and more recently in gram-positive bacteria^36,37,38,39. However, crucially, there is to date no information on the structure of a fully or even partially assembled relaxosome.

Results and Discussion

Assembly of the relaxosome and identification of DNase I footprints of protein components

We first confirmed the feasibility of purifying a fully-assembled relaxosome from purified components. The four proteins were mixed in molar excess of a purified DNA of 316 bp (oriT₃₁₆) encompassing all known binding sites for each protein and the resulting complex (oriT₃₁₆-R) was purified (Fig. 1a, b).

**Fig. 1: Biochemical characterisation of the F plasmid relaxosome.**

To map the footprint of the fully assembled relaxosome complex, the purified oriT₃₁₆-R complex was subjected to DNase I cleavage to hydrolyse regions of oriT that are not protected upon complex formation. We show here (Fig. 1b) that most protected footprint sequences map to regions of oriT₃₁₆ extending from the nic site to include traI_TE, IHFa, sbyC, sbyA and partial sbmC sites. A single outlier was identified encompassing IHFb and sbmB sites.

From these results, we derived a dsDNA of 170 bp termed ds_{-27_+143} in Figs. 2a, 1c and Supplementary Fig. 2 (the nomenclature of ss, ds, and ss/ds DNAs is explained in Fig. 1c). This DNA encompasses the protected region and additionally the traI_helicase site and the full sbmC site. Using ds_{-27_+143}, a relaxosome complex termed “ds_{-27_+143}-R” was reconstituted from purified components and stabilised by crosslinking.

**Fig. 2: Structures of the ds_{-27_+143}-R relaxosome, the ss_{-27_+8}ds_{+9_+143}-R relaxosome, and the fully-assembled ss_{-27_+8}ds_{+9_+143}-R relaxosome.**

Determination of the Cryo-EM structure of ds_{-27_+143}-R

The structure of the ds_{-27_+143}-R complex was derived from a “locally-refined” cryo-EM map with average resolution of 3.78 Å (Supplementary Figs. 3, 4a, g, 5a–d; Supplementary Table 1a, b; also see definition of “locally-refined” versus “global” in Supplementary Fig. 4). Cryo-EM is the most suitable structural biology method to use in this case as the structure is modular and cannot be crystallised. The density was of sufficient quality to build a model that includes (Fig. 2b): i- a large region of dsDNA from base pairs +12 to +94 (sequence in bold in Fig. 2a), ii- the IHF α and β chains, iii- a train of three TraY molecules and iv- two sub-domains of the VH domain of TraI, termed 2 A, and 2B/2B-like (labelled VH_{2A+2B/2B-like} in Fig. 2b; see details of the TraI sub-domain structure in Supplementary Fig. 1c, d)¹⁷.

Overall, the salient feature of this structure is an asymmetric U-shaped dsDNA hairpin, generated by IHF binding. The region of ordered dsDNA in the density stretches over 83 bp, 52 bp on one side of IHF containing the TraY-binding sites (termed the TraY-binding arm of the dsDNA hairpin in Fig. 2b; bp₊₄₃ to bp₊₉₄), and 31 bp on the other side (termed the nic arm of the dsDNA hairpin in Fig. 2b; bp₊₁₂ to bp₊₄₂). On the former arm, three bound TraY proteins are observed, two repeated directly (TraY1 and TraY2) and the other inverted (TraY3). This train of three TraY molecules induces longitudinal bending of the dsDNA along its long axis, changing its trajectory. On the nic arm, a combination of sequence-induced dsDNA bending⁴⁰ between bp₊₂₀ to bp₊₃₄ and dsDNA bending induced by VH_{2A+2B/2B-like} binding is observed (Supplementary Fig. 5f–h). Bends induced by IHF and accessory proteins may position the nic site for subsequent cleavage by TraI.

The VH_{2A+2B/2B-like} module interacts with the two arms of the dsDNA hairpin and also with IHF and TraY. This dense network of interactions is seen in all relaxosome structures presented here (see details below).

Towards reconstructing the entire relaxosome structure

We sought next to obtain a more complete relaxosome structure by experimenting with DNA sequences and ss/ds hybrid substrates. Previous X-ray crystallography of the TraI TE domain alone mapped its binding site to 11 nucleotides spanning nic (nucleotides -2 to +9 in Fig. 2c)^33,41. Therefore, we designed a heteroduplex DNA termed “ss_{-27_+8}ds_{+9_+143}”, containing a single-stranded (ss) T-strand region from −27 to +8 and a double-stranded (ds) region from +9 to +143 (Figs. 2c, 1c, Supplementary Fig. 2). The corresponding relaxosome complex is termed “ss_{-27_+8}ds_{+9_+143}-R”. This structure was built into a locally-refined cryo-EM map with an overall resolution of 3.45 Å (Supplementary Figs. 4b, 6a; Supplementary Table 1a, b).

The following differences were readily apparent compared to the ds_{-27_+143}-R structure (Fig. 2d): i- the TE domain of TraI was entirely observable and could be built in the density as well as the portion of the T-strand bound to it (nucleotides -2 to +9), ii- dsDNA from base pairs +10 to +93, VH_{2A+2B/2B-like}, both chains of IHF, as well as the three TraY molecules present in the ds_{-27_+143}-R structure were better defined in the density and improved models could therefore be derived, and iii- density for the NTD and 1 A domains of VH (VH_NTD+1A) exhibited clear secondary structural elements in which a main-chain model could be fitted in density.

Additional densities were observed in the “global” density map of the ss_{-27_+8}ds_{+9_+143}-R complex (Supplementary Figs. 4b, h). These additional densities may correspond to the AH and CTD domains of TraI or/and to TraM, a protein that is also part of the complex. To locate these domains/proteins within these densities, we solved the cryo-EM structures of the ss_{-27_+8}ds_{+9_+143}-R complex where i- a version of TraI lacking the AH and CTD domains was used (TraI_1-863; complex termed ss_{-27_+8}ds_{+9_+143}-RΔ_AH+CTD; Supplementary Figs. 6b, 4c, i) or ii- TraM was omitted in complex formation (complex termed ss_{-27_+8}ds_{+9_+143}-RΔ_TraM (Supplementary Figs. 6c, 4d, j). Results of global map superpositions for both ss_{-27_+8}ds_{+9_+143}-RΔ_AH+CTD and ss_{-27_+8}ds_{+9_+143}-RΔ_TraM with ss_{-27_+8}ds_{+9_+143}-R are shown in Fig. 3a, b, respectively. These identified the density for TraM and for TraI AH and CTD, the models^17,19,35 of which were subsequently fitted in the corresponding density. Further validation of TraM docking is detailed in Supplementary Fig. 9c. Together, these results provide the basis to reconstruct a fully-assembled relaxosome (Fig. 2e and Supplementary Data 1).

**Fig. 3: Use of maps where parts of the relaxosome have been deleted to generate the complete structure of the relaxosome.**

Structure of the fully-assembled relaxosome

The structure of the ss_{-27_+8}ds_{+9_+143}-R complex consists of successive but overlapping interaction centres analogous to railway hubs organised along a rigid track of DNA (Fig. 4a). Four hubs each combining protein-DNA and protein-protein interactions can be defined. Hub 1 made of IHF/TraI VH_{2A+2B/2B-like}/TraY1 and dsDNA between base pairs +13 and +74 (Fig. 4b, c and Supplementary Fig. 7a, b), Hub 2 formed by a “train” of three TraYs and dsDNA between base pairs +61 and +90 (Fig. 5a and Supplementary Fig. 8), Hub 3 made of TraI TE, TraI VH_2B/2B-like, and part of the nic arm of the DNA hairpin that includes the ssDNA between -2 and +9 and dsDNA between +10 and +15 (Fig. 5b, c and Supplementary Fig. 9a, b), and Hub 4 made of TraM and dsDNA between base pair +93 and +117 (Supplementary Fig. 9c, d).

**Fig. 4: Hub architecture of the ssDNA_{-27_+8}ds_{+9_+143}-R fully assembled relaxosome structure and overview of hub 1.**

**Fig. 5: Hub architecture of the ssDNA_{-27_+8}ds_{+9_+143}-R fully assembled relaxosome structure: overview of hubs 2 and 3, and footprint of relaxosome proteins on DNA.**

Hub 1 is the largest hub in terms of buried surface area ( ~ 8000 Å²). Interactions between IHF chains and between IHFαβ and its dsDNA-binding site (between bp+23 and bp+60; Fig. 4b) account for much of it (5300 Å²). The 160° bending induced by IHF binding is caused by the intercalation of two β-hairpins (β3 and β4) within the DNA’s minor groove (Fig. 4b)¹⁹. In the hub, IHFα reaches out to TraY1 (158 Å²) while both its chains interact with the TraI VH_2B/2B-like sub-domain ( ~ 400 Å²). Furthermore, TraY1 provides a binding interface to TraI VH_2B/2B-like (410 Å²) (Fig. 4b). These interaction areas are not individually extensive but their cumulative effect on TraI VH serves to calibrate TraI VH_2A domain against its dsDNA binding site on the TraY arm of the dsDNA hairpin between bp +61 and +74 (Fig. 4c, Supplementary Fig. 7b).

Hub 2, composed of the train of TraYs, is characterised by a large network of protein-protein interactions between TraY subunits and protein-DNA interactions that anchor the train to 3 binding sites on the dsDNA (Fig. 5a, Supplementary Fig. 8). TraY is an RHH (ribbon-helix-helix) family protein, which are usually dimers. In contrast, F TraY is a monomer containing all the secondary structures and tertiary fold of the RHH dimer in one single polypeptide. Overall, the three TraY molecules bury 2833 Å² of surface area upon interaction with their binding sites (see details in Supplementary Fig. 8). Interaction between TraY molecules is also extensive. Remarkably, TraY appears to be able to use very different surfaces to mediate protein-protein contacts (details in Supplementary Fig. 8).

In Hub 3, the TraI TE domain is observed bound to its previously characterised ssDNA-binding site between -2 and +9³³. However, in marked contrast with previous observations (Supplementary Fig. 9a, b), it is also seen to make extensive contacts with the dsDNA between +10 and +15 (Fig. 5b). The latter interface accounts for nearly half of the total TE-DNA interaction area in the relaxosome (1295 Å² TE-dsDNA versus 1365 Å² for TE-ssDNA interactions). Interactions details are described in Fig. 5b and Supplementary Fig. 9a, b. Interestingly, residues in the β9-αF loop (Fig. 5b) displaces the R-strand base (a C) of the CG + 9 base pair out of its pairing arrangement, resulting in only G₊₉ of the T-strand being visible in the electron density. Thus, the TraI TE domain is able to melt/unwind the +9 base pair, an observation that prompted us to ask whether this ability to melt/unwind dsDNA extends beyond +9 (see below). Finally, the TE domain provides a small but stabilising interface that anchors TraI VH_2B/2B-like sub-domain within interactions with the dsDNA near the ss/ds DNA junction (Fig. 5c).

Next, to assess the magnitude of the conformational change needed in the T-strand to go from unbound to TE domain-bound, we in silico extended the ds_{-27_+143} DNA to include nic and superimposed it onto the ss/dsDNA of the ss_{-27_+8}ds_{+9_+143}-R structure (Supplementary Fig. 9e). We observe that not only has the nic site moved 29 Å away from its unbound location, but also that the T-strand on TE-binding has undergone a U-turn, imposed by its interaction with the TE domain and notably residues in the β10-β11 region (Fig. 5b). Thus, oriT is subjected to two topological U-turns, one in its dsDNA region upon IHF-binding, and another one in its ssDNA region upon TE-binding.

TraM is made of two domains (Supplementary Fig. 1f): an N-terminal domain (NTD) that interacts with DNA and a C-terminal domain (CTD) responsible for tetramerization and known to interact with the coupling protein TraD^35,42. Thus, TraM can also be defined as a “hub” of protein-DNA and protein-protein interactions. This hub, Hub 4 (described in Supplementary Fig. 9c, d), is distinct from hubs 1-3 in that it not only mediates interactions between intra-relaxosome components but also with the T4SS⁴².

An intriguing observation is that core relaxosome assembly is independent of the presence or absence of TraM as evidenced by the ss_{-27_+8}ds_{+9_+143}-RΔ_TraM structure (Supplementary Fig. 4d, j) and the relative absence of interactions with other relaxosome proteins. Earlier deletion mutagenesis of the IHF, TraY and TraM binding sites (IHFa, sbyA/C and sbmA-C sites, respectively; Supplementary Fig. 1b) had shown that sbm sites were required for efficient DNA transfer, but not relaxosome formation, whereas IHFa and sbyA sites were required for both^34,43. On the other hand, although not essential for the reaction, TraM has been shown to stimulate TraI-mediated cleavage at oriT through interactions with the CTD of TraI^20,25. This is consistent with our findings that TraM locates close to the CTD of TraI in the fully assembled ss_{-27_+8}ds_{+9_+143}-R relaxosome structure (Fig. 2e). In the complete absence of TraM binding sites, the frequency of DNA transfer is diminished 10,000-fold consistent with a key role for Hub 4³⁴.

A striking feature of the ss_{-27_+8}ds_{+9_+143}-R structure is the very large DNA footprint made by the various proteins involved (Fig. 5d). Protein-DNA interactions involve a staggering 9022 Å² of surface area, with IHF and the TraI TE domain accounting for more than half of this total. When the TraM/DNA interaction is added, over 10,000 Å² of buried surface area are involved in assembling the relaxosome components on oriT.

Many features of the F plasmid relaxosome assembly presented here apply to other plasmid systems, due to the TE domain of relaxases from multiple incompatibility groups such as the IncN, IncW, IncQ and other F-like IncF plasmids (includes R1, R100 and pED208) sharing a conserved fold (see above) (Supplementary Fig. 10)³³. In R1 plasmid, the relaxase shares 92% identity with the F plasmid relaxase and has a similar domain organisation. TraY from R1 plasmid has a single RHH motif on a polypeptide and unlike F TraY forms a dimeric functional unit and may bind to the three subsites ATGT, ATTT (imperfect direct repeats) and ACAT (imperfect inverted repeat) on its oriT DNA. The consensus IHF binding sequence on R1 (5’ TGATTTGCTATTTGAATCATTAACTTA 3’) has a gap of 11 bp from the TraY1 binding site (ATGT), all of which points to a similar relaxosome organisation to that of F. Similarly, the F plasmid relaxosome proteins are able to bind and conjugatively transfer chimeric oriT sequences combining the F plasmid nic site with the binding sites for TraY, IHF and TraM of plasmid R100²⁰.

The IncW prototype R388 plasmid encodes a relaxase, TrwC, lacking the VH domain of F relaxases. It has a TE domain required to produce the nick, an AH containing the ATPase and helicase activities, and a CTD (Supplementary Fig. 10a–d). The R388 relaxosomes are composed of the oriT DNA, the relaxase TrwC, the accessory protein TrwA, analogous in function to F TraM with an N-terminal RHH domain and C-terminal domain that interacts with the coupling protein (termed TrwB in that system), and host protein IHF^44,45,46. The R388 relaxosome therefore would be a smaller-scale assembly without the analogous TraY train. The AH of TrwC (structurally superposable to VH of F TraI) could sit across the IHF bend in a similar fashion, helping to position the TE domain close to the ‘nic’ site (Supplementary Fig. 10e). DNA bending seems to be a recurrent theme in many relaxosome assemblies. An AlphaFold model of the RelSt3, a relaxase from the gram positive ICESt3/Tn916/ICEBs1 superfamily, along with putative relaxosome partners was proposed to bend DNA resulting in the catalytic domain of RelSt3 being positioned close to its nic site⁴⁷.

The TE and Helicase binding modes of TraI

In a previous study¹⁷, we characterised two forms for TraI (Supplementary Fig. 1c, d). The first form was observed when TraI was bound to the ssDNA sequence 5’ of the nic site (traI_TE in Fig. 1b). It is sensitive to mild-proteolysis i.e. it adopts a flexible and “open” structure. In contrast, the second form is protease-resistant, i.e. rigid and “closed”, and is observed only when TraI is bound to the ssDNA sequence 3’ of the nic site (traI_helicase in Fig. 1b). The structure of TraI bound to traI_helicase was solved¹⁷: in this structure, traI_helicase is bound through the helicase domains and this closed form of TraI was designated the TraI_Helicase form of TraI.

With the ss_{−27_+8}ds_{+9_+143}-R relaxosome structure, we capture a state of TraI where the protein engages with its TE-binding site i.e. the sequence 5’ of the nic site. It is also very flexible since the AH and CTD domains of TraI remain largely semi-unstructured. It therefore captures the open form characterised previously¹⁷. We name this form of TraI “TraI_TE”. In this conformational state, TraI has undergone a dramatic re-arrangement of its domain and subdomain structure compared to its helicase mode (Fig. 6a, b): i- while the AH domain was proximal to the TE domain in TraI_Helicase, in TraI_TE, it is the VH domain that is now proximal to that domain (Fig. 6a); ii- In TraI_TE, the VH domain itself has undergone a large re-arrangement of its sub-domains, with the NTD and the 2 A + 2B/2B-like module pivoting 47° and 95° degrees each relative to the 1 A sub-domain (Fig. 6b). Near atomic resolution was not achieved for the AH and CTD domains, nonetheless we were able to locate these domains within the relaxosome. Apparently, this region makes no specific and stabilising contacts with proteins or DNA nearby. Finally, when a model of the ss_{-27_+8}ds_{+9_+143}-R is placed onto the 2D-classes of ds_{-27_+143}-R, we observe that the TE domain maps to a region of the 2D-class that forms a fuzzy protrusion from the dsDNA (Fig. 6c), indicating that this region of ds_{-27_+143}-R, which has poor density and could not be interpreted, corresponds to the TraI TE domain. Thus, when in a dsDNA-bound relaxosome, TraI is in its TraI_TE mode.

Fig. 6: TraI conformational changes, requirement for bubble formation 3’ of *nic.*

TE binding requires the formation of a bubble at the helicase binding site

As mentioned above, in the ss_{-27_+8}ds_{+9_+143}-R structure, the R-strand base of the first double-stranded pair of the ss/dsDNA duplex (pair +9 in Fig. 5b) is not observed in the density due to its displacement by the β9-αF loop. Luo et al. determined that in vivo, DNA in the vicinity of the “nic” site showed increased sensitivity to KMnO4, indicating distortion/unwinding in this region⁴⁸. We therefore asked whether TraI TE can melt a more extended region of dsDNA.

Thus, we reconstituted relaxosome complexes with decreasing ssDNA length, progressively extending the dsDNA part by increments of 5 nucleotides towards and beyond the nic site (Fig. 1c and Supplementary Fig. 2). The DNA employed were: ss_{−27_+3}ds_{+4_+143}, ss_{-27_-3}ds_{-2_+143}, ss_{-27_-8}ds_{-7_+143} and ss_{-27_-13}ds_{-12_+143}. Cryo-EM datasets were collected for each complex. 2D classes and 3D maps were computed (see Methods and Supplementary Table 2). The ss_{-27_+3}ds_{+4_+143}-R and ss_{-27_-3}ds_{-2_+143}-R were solved to high resolution and are virtually identical to ss_{-27_+8}ds_{+9_+143}-R with DNA observed bound to a well-ordered TraI TE domain (Supplementary Fig. 4e, f, Supplementary Table 1a, b). For ss_{-27_-8}ds_{-7_+143}-R and ss_{-27_-13}ds_{-12_+143}- R, only 2D classes and initial low resolution 3D maps were obtained because it became clear early on in the processing that their structure was similar to that of ds_{−27_+143}-R, i.e. with a disordered TraI TE domain (Fig. 7a, b). Thus, in the ss_{-27_+3}ds_{+4_+143}-R and ss_{-27_-3}ds_{-2_+143}-R complexes, the TraI TE domain engages with its binding site while, in the ss_{-27_-8}ds_{-7_+143}-R and ss_{-27_-13}ds_{-12_+143}-R complexes, it does not. Together, these results indicated that TraI TE is unable to melt sequences beyond base pair -3. A corollary conclusion is that a single-strand of DNA is required before base pair -3 in order for TraI TE to melt and engage its binding site. To investigate this, we designed two experiments. Firstly, the relaxosome complex was reconstituted with a blunt end dsDNA of 115 nucleotides which include base pairs -2 to +9 (ds_{-2_+113}) but does not extend beyond base pair -2 (Fig. 1c and Supplementary Fig. 2). A cryo-EM data set similar in size as for the 4 previous complexes described above was collected. 2D classes and 3D reconstruction maps show an ordered TE domain similar to those obtained for ss_{-27_+8}ds_{+9_+143} (Fig.7a, b), indicating that a single-strand is not required to melt the TE-binding site, provided that the double strand does not extend beyond base pair -3. Secondly, we wondered whether a bubble immediately 3’ to the nic site would be sufficient for TraI TE to melt its DNA-binding site. To test this hypothesis, a dsDNA containing 15 bases (poly dT15) was inserted between position -17 and -3 as described in Fig. 1c or Supplementary Fig. 2. The DNA was extended on the 3’ side of the T-strand to stabilise the bubble. TraM and its binding site were not included. This DNA is referred to as ds_{-67_+113}(poly-dT15_{-17_-3}). A limited cryo-EM dataset as above was collected, 2D classes and a 3D map were obtained and shown to resemble those of ss_{-27_+8}ds_{+9_+143} indicating an ordered TE domain (Fig. 7a, b). Thus, a single strand bubble immediately 3’ of nic is required for the binding reaction to occur.

Fig. 7: Requirement for bubble formation 3’ of *nic.*

Validation of the relaxosome structure

To validate the structures presented here, we initiated an extensive mutational study, targeting residues in three hubs, Hub1-3 (Fig. 8a–d and Supplementary Table 3). All mutations were made in traI and analyzed for conjugative plasmid transfer activity. Firstly, we targeted residues interacting with dsDNA (Fig. 8a). Both interactions of VH_2B/2B-like residues with the nic arm of the dsDNA hairpin and of VH_2A residues with the TraY arm were tested by generating negatively charged substitutions. Double and triple mutants were made (Fig. 8a). We observed either no effect for residues of VH_2A or very large increases in conjugation rates for residues in VH_2B/2B-like (nearly 300-fold for S641E-S696E for example), pointing to an interaction hub that plays important roles. Very similar results are obtained when mutating some residues at the interface between TraI VH_2B/2B-like and TraY1 (Fig. 8b), between TraI VH_2B/2B-like and IHF (Fig. 8c), and between TraI VH_2B/2B-like and TraI TE (Fig. 8d). These results all emphasise the functional importance of the various hubs we have characterised in the structure. But perhaps more significantly, they point to the possibility that our structure represents a quiescent or pre-initiation state of the relaxosome, the activation of which can be triggered readily by changes in the structure, possibly involving disruption of the key hub interactions analysed here. The activation step is predicted to occur in response to signals such as pilus termination or the presence of a recipient cell^49,50. Our mutational study results indicate that changing any combination of residues disrupt a finely-tuned quiescent complex that can easily be triggered by extracellular signals.

**Fig. 8: Mutational study of the relaxosome and Mechanistic model of conjugation.**

Mechanism of conjugation

A myriad of biochemical studies have demonstrated the relevance of in vitro reconstitution work from purified components in providing mechanistic insights on biological processes involving macromolecular machines. Here, we present not only the structure of a relaxosome but also describe various DNA-bound states of this important complex. The structural characterisation of many more conformational states will be required to gain a full understanding of relaxosome function. Nevertheless, from the insights obtained here, a molecular-level mechanistic model for relaxosome recruitment and activation emerges (Fig. 8e). It starts with the formation of a quiescent relaxosome at oriT with TraI in its TraI_TE mode (step 1) and its subsequent recruitment to the T4SS (step 2) via the interactions of the coupling protein TraD with TraM and of TraI TSA and TSB modules with an unknown receptor^51,52. Once pilus biogenesis is completed and a recipient cell has contacted a donor cell, a signal is sent (step 3) that activates the relaxosome by generating an ssDNA bubble’ immediately 3’ to the nic site on the T-strand (step 4). This allows i- the TE domain of TraI_TE to access its binding site as in the structure of ss_{-27_+8}ds_{+9_+143}-R and ii- the loading of TraI_helicase¹⁴ (step 5). The former functions in its so-called TraI_TE mode when bound to traI_TE site (Supplementary Fig. 1b), proceeds to cleave nic and attach covalently to the 5’phosphate resulting from the nicking reaction, while the latter when bound 3’ of nic (traI_Helicase site in Supplementary Fig. 1b) initiates DNA unwinding (step 6). In a final step, the DNA is driven through the T4SS, through the pilus, and finally injected into the recipient cell (step 7).

Methods

Cloning, protein expression and purification

All strains, plasmids, constructs, and primer sequences used in this study are listed in Supplementary Table 4a–d, respectively.

A tagless construct of IHF was generated by PCR amplification excluding the tag from a vector where ihfA and ihfB genes were cloned in tandem in the MCS of pRSF-1b plasmid (Novagen), to generate pRSF-IHF. The traM and traY genes synthesised and cloned into the NcoI and HindIII sites of pET28a vector were purchased from GenScript. All TraI expression constructs used in this study were engineered from pTrc99a-TraI⁵³ harbouring wild-type traI gene cloned into the MCS. Mutagenesis following the In-Fusion cloning protocol was used to generate the single Y16F mutant TraI variant (TraI_Y16F). Expression construct for TraI_Y16F domain deletion variant TraI_ΔAH+CTD was generated by PCR amplification of pTrc99a-TraI, excluding the region of TraI between residue 835 to the C-terminus and annealing by In-Fusion seamless cloning technology (Takara Bio). All constructs were verified by DNA sequencing (Eurofins Scientific).

IHF was purified as a heterodimer with a protocol adapted from Nash et al.⁵⁴. Briefly, E. coli BL21 (DE3) cells (ThermoFisher) transformed with pRSF-IHF were grown in 2 L of LB media supplemented with 50 µg/ml kanamycin at 37 °C. Upon reaching an OD₆₀₀ of 0.7, expression was induced with IPTG at a final concentration of 1 mM and incubated overnight at 18 °C. The cells were harvested by centrifugation, resuspended in 20 mM Tris-HCl pH 7.5, 100 mM NaCl (buffer A), one protease inhibitor cocktail tablet (Roche) and lysed by sonication (10–15 cycles of 5 s ON and 10 s OFF using a Sonics Vibra Cell VCX 130 sonicator set at 30% amplitude). The lysed cells were centrifuged, and the resultant supernatant was subjected to stepwise precipitation with ammonium sulphate to 50% and 70% saturation. The 70% precipitate was dialysed in 20 mM Tris-HCl pH 7.5, 50 mM NaCl (buffer B) and applied to a 5 ml HiTrap Heparin column (GE Healthcare) equilibrated with the same buffer. A gradient elution was performed using 50mM-1M NaCl in 20 mM Tris-HCl pH7.5, with fractions containing IHF heterodimer then concentrated and subjected to size exclusion chromatography using a Superdex 75 Increase 10/300 column (GE Healthcare) with buffer A as the mobile phase. The fractions containing the protein were concentrated, flash frozen in liquid nitrogen and stored in −80 °C.

TraM was purified as follows. C41 (DE3) cells (Lucigen) harbouring pET28a-TraM plasmid were grown in 1 L LB media with 50 µg/ml kanamycin at 37 °C. On reaching an OD₆₀₀ of 0.7, protein was induced by IPTG at a final concentration of 1 mM and incubated further at 18 °C, overnight. The pelleted cells were resuspended in buffer A with a protease inhibitor tablet (Roche). Lysis was carried out by sonication as described for IHF and the clarified lysate was subjected to precipitation with ammonium sulphate to 40% and 100% saturation. The 100% pellet was dialysed in buffer B and applied to a 5 ml HiTrap Heparin column (GE Healthcare). Elution was carried out using a gradient of 50mM-1M NaCl in 20 mM Tris-HCl pH7.5 with fractions containing TraM, detected by SDS-PAGE, pooled and diluted until the conductivity was equivalent to buffer B. This was applied to a 6 ml resource Q column (Cytiva) equilibrated with buffer B. A linear gradient of 50mM-1M NaCl resulted in the elution of proteins and the desired fractions were further diluted with buffer B. This was applied to a 5 ml HiTrap Blue column (Cytiva) and gradient elution was carried out with 50mM-2M NaCl. The fractions having the protein were then concentrated and subjected to size exclusion chromatography using a Superdex 200 Increase 10/300 column (GE Healthcare) equilibrated with buffer A. Fractions containing purified TraM tetramers were concentrated, flash frozen in liquid nitrogen and stored in −80 °C.

TraY was expressed in C41 (DE3) cells transformed with pET28a-TraY and lysed by sonication following the steps described for TraM. The clarified lysate was then loaded directly onto a 5 ml HiTrap Q column (Cytiva) connected in tandem to a 6 ml resource S column (Cytiva) such that the flow-through from the Q column passes through resource S. The Q column was then disconnected and gradient elution with 100mM-1M NaCl in 20 mM Tris-HCl pH7.5 was carried out on the resource S column. The fractions containing TraY were concentrated and further fractionated on a Superdex 75 Increase gel filtration column equilibrated with buffer A. The purified fractions were further concentrated and stored in −80 °C after flash-freezing in liquid nitrogen.

TraI, TraI_Y16F and its truncation variant TraI_ΔAH+CTD were expressed in BL21 (DE3) cells (ThermoFisher) harbouring the corresponding pTrc99a clone, grown in 2 L of LB media supplemented with 100 µg/ml carbenicillin at 37 °C. Once the cells reached an OD₆₀₀ of 0.7, expression was induced with a final IPTG concentration of 1 mM and incubated overnight at 18 °C. The cells were harvested by centrifugation, resuspended in buffer A with one protease inhibitor tablet (Roche) and lysed by sonication as described above. The clarified cell lysate was subjected to ammonium sulphate precipitation (0.3 g/ml) with the resulting pellet dialysed in buffer B after resuspension in the same buffer. The solution after dialysis was applied onto a 5 ml HiTrap Heparin column (GE Healthcare) equilibrated with the same buffer. A linear gradient elution was performed using 50mM-1M NaCl in 20 mM Tris-HCl pH7.5 and the eluted fractions containing the protein diluted until conductivity was equal to buffer B. This was then applied to a 5 ml HiTrap Q column (GE Healthcare) equilibrated with buffer B. Gradient elution was carried out using 50mM-1M NaCl in 20 mM Tris-HCl pH7.5 and fractions having the protein were concentrated and subjected to size exclusion chromatography on a Superose 6 Increase column equilibrated with buffer A. The purified protein fractions were concentrated and stored in -80 °C after flash-freezing in liquid nitrogen.

DNA synthesis and purification

Synthesis of duplex oriT₃₁₆ DNA fragment was performed by large scale PCR amplification of the oriT region of F plasmid synthesised and cloned in pUC57 (pUC57-oriT₃₁₆), purchased from Genscript. A 5 ml PCR reaction containing 5 μg of template DNA, 0.5 μM final concentrations each of primers (ForiT_F, ForiT_R), 2X master mix of CloneAmp HiFi PCR Premix (Clonetech, USA) was carried out in a thermocycler with the following PCR conditions: 98°C for 2 min, and 30 cycles of 98°C for 10 s, 55°C for 5 s, 72°C for 5 s, and a final extension at 72°C for 10 min. The DNA was precipitated by addition of 1/10 volume of 3 M sodium acetate pH 5.2, 3 times volume of 100% ethanol and incubated o/n in -80 °C. Precipitated DNA was recovered by high-speed centrifugation, after which the supernatant was discarded. The pellet was dried at RT, washed with 100% isopropanol, further dried at RT and resuspended in 1 ml buffer A and a further step of purification was carried out on a Superose 6 Increase column equilibrated with buffer A. Fractions were assessed on a 4-20% TBE PAGE gel (ThermoFisher) after which the pure fractions were pooled giving a yield of 60-80 μg DNA, and stored in -20 °C.

For generation of the ds_{-27_+143} DNA by restriction digestion, a sequence containing five consecutive ds_{-27_+143} DNA fragments separated by six HindIII restriction sites, synthesised and cloned in pUC57-mini vector (pUC57mini- ds_{-27_+143}_5X), was purchased from Genscript. A large-scale restriction enzyme reaction mix containing 500 μg of plasmid, 10,000 units of HindIII-HF (NEB) and 1X CutSmart buffer was made up to a final volume of 12.5 ml with MilliQ water. The reaction was incubated at 37 °C for 1 h followed by heat inactivation at 80 °C for 20 mins and any aggregates were removed by centrifugation. The 12.5 ml mixture was concentrated to a volume of 100 μl in an Amicon Centricon Filter (Millipore) with a 3 kDa cut-off. The digested DNA was size-fractionated on a native-PAGE (5% TBE gel, 40% acrylamide/bis-acrylamide) using Mini Prep Cell (Bio-Rad). The fractions were inspected on a 4-20% TBE PAGE gel (ThermoFisher) and the ones containing ds_{-27_+143} DNA were desalted with buffer A and concentrated, in a 3 kDa cut-off Amicon Centricon Filter (Millipore), giving a yield of 50-60 μg DNA, and stored in -20 °C.

All the other DNA constructs used for relaxosome reconstitution were generated from PAGE purified Ultramer^TM Oligos purchased from IDT. The corresponding R- and T- strand oligos (Supplementary Table 4e) were mixed at 1:1 molar ratio at a final concentration of 10 μM in a volume of 100 μl and annealed by incubating at 98 °C for 5 min, lowering the temperature at a rate of 1 °C/min until reaching 24 °C, in Annealing Buffer (50 mM TrisHCl pH7.2, 50 mM NaCl, 1 mM EDTA, 1 mM MgCl₂). The annealed DNA was purified by electrophoresis though a native acrylamide gel column (12% TBE gel, 40% acrylamide/bis-acrylamide) using Mini Prep Cell (Bio-Rad). The fractions were analysed for purity on a 4-20% TBE PAGE gel (ThermoFisher) and the pure fractions were desalted with buffer A and concentrated, in a 3 kDa cut-off Amicon Centricon Filter (Millipore), yielding 60-80 μg of DNA. Correct annealing of all annealed DNA heteroduplexes were verified by assessing the length of the two constituent ssDNA strands using a 10% TBE-Urea denaturing gel (ThermoFisher). The ds_{-67_+113}(poly-dT15_{-17_-3}) was additionally verified to confirm the unpaired region by S1 nuclease (Thermofisher) digestion to release its release right and left duplexes of 115 and 50 bp, respectively. The purified oligos were stored in -20 °C.

Relaxosome DNaseI footprinting and sequencing of resulting fragments

The relaxosome complex was formed by mixing 0.25 μM oriT₃₁₆ DNA, molar excess of proteins at 8 μM each of IHF and TraM, 10 μM TraY, 5 μM TraI, 100 μl of 10X DNase I buffer (Roche) made up to a volume of 1 ml with Buffer A and was allowed to incubate for 30 min at RT. To this, 200 units of DNase I recombinant RNase-free (Roche) was added and further incubated for another 45 min at RT. Following this, an equal volume of premixed phenol:chloroform:isoamyl alcohol 25:24:1 (v/v) (ThermoFisher) was added, sample was mixed vigorously and centrifuged at 17,000 x g for 10 min. The upper aqueous phase containing DNA was transferred to a new tube, mixed with 1/10 volume of 3 M sodium acetate pH 5.2, 3 times volume of 100% ethanol and incubated o/n in -80 °C. Precipitated DNA was recovered by centrifugation at 17,000 x g for 30 min at 4 °C, after which the supernatant was discarded and the pellet was washed with 1 ml Isopropanol. The sample was centrifuged at 17,000 x g for 30 min at 4 °C, supernatant discarded and the pellet was dried at RT. The air-dried pellet was resuspended in 100 μl MilliQ and purified by 1.5% agarose gel electrophoresis followed by extraction of DNA fragments from the excised gel band with Nucleospin gel and PCR Clean-up kit (Takara).

To facilitate cloning of DNA fragments into pCR-BluntII-TOPO vector (ThermoFisher), the fragments were subjected to blunting and dephosphorylation. For the blunting reaction 2 × 20 μl reactions each containing 1 μg DNA, 100 μM of each dNTP (NEB), 1 Unit T4 DNA polymerase (NEB), 1 × 2.1 Buffer (NEB) were incubated at 12 °C for 15 mins followed by heat inactivation at 75 °C for 20 mins. The reaction was purified with Nucleospin gel and PCR Clean-up kit (Takara) following the manufacturer’s instructions for PCR clean-up. This was followed by dephosphorylation of the blunt ends, where 8 ×20 μl reaction volumes each containing 1 pmol DNA ends, 1 X CutSmart buffer (NEB) and 1 unit of Shrimp Alkaline Phosphatase (rSAP) (NEB) was incubated at 37 °C for 30 mins followed by heat inactivation at 65 °C for 5 mins. The reactions were pooled together and purified with Nucleospin gel and PCR Clean-up kit (Takara) according to the manufacturer’s instructions for PCR clean-up.

A footprint library was generated by cloning the modified fragments into a pCR-BluntII-TOPO vector following instructions in the Zero Blunt TOPO PCR cloning kit (Invitrogen). Briefly, 6 μl reactions where DNA was mixed to achieve 1:10 or 1:20 molar vector-to-insert ratio, with 1 μl salt solution provided with the kit, were incubated at RT for 5 min. 4 μl of the reaction was used to transform Top10 competent cells (Invitrogen), plasmids were isolated from resultant colonies and the insert sequence established by DNA sequencing (Eurofins Scientific) (Supplementary Table 4f).

Relaxosome complex formation, cross-linking and purification for cryoEM

A nickase inactive version of TraI harbouring Y16F mutation (TraI_Y16F) was used for all cryoEM complex formations⁵⁵ in order to prevent the forward reaction that might destabilise the complex. For cryoEM studies, complex formation was carried out as follows. To 0.25 μM oriT DNA supplemented with 20 mM MgCl₂, 20 times molar excess of purified proteins were added in the order of IHF, TraM and TraY followed by 5 times molar excess of TraI_Y16F or TraI_ΔAH+CTD in a volume of 1 ml in Buffer A. This mixture was allowed to incubate for 30 min at RT following which the unbound proteins were separated from the DNA-protein complex on a Superose 6 Increase 10/300 column (GE Healthcare), equilibrated with 20 mM Hepes pH 7.5, 100 mM NaCl (Buffer C). The peak fractions containing the relaxosome were pooled (1 ml total volume) to which Glutaraldehyde (Grade I, 25% in H₂O, Sigma) was added to a final concentration of 0.5% to increase complex stability under cryo conditions. The cross-linking was allowed to proceed for 5 mins at RT and was terminated by the addition of TrisHCl pH 7.5 to a final concentration of 100 mM. The sample was subjected to a further round of gel filtration on a Superose 6 Increase 10/300 column (GE Healthcare), equilibrated with Buffer C, the fractions were pooled and concentrated using 30 kDa MWCO Microcon centrifugal filters (Millipore). The sample was analysed on a 4-12% Bolt Bis-Tris Plus gel (Invitrogen) and the gel band corresponding to the cross-linked relaxosome were excised and constituent proteins identified by peptide mass fingerprinting where appropriate. The oriT DNA and protein components indicated in Supplementary Table 4g were used to form the different relaxosome complexes described in this study, but the protocol is as outlined above for each of the complexes.

Negative stain EM

To check the quality and concentration of samples, 4 μl of 0.05 mg/ml cross-linked relaxosome was applied on glow discharged formvar-coated copper grids (Agar Scientific), incubated at RT for 1 min. The excess sample was blotted, and the grid stained with 4 μl of 2% uranyl acetate for 30 s. Excess stain was removed by blotting and the grids were allowed to dry at RT. The grids were imaged on a Tecnai T10 transmission electron microscope (Thermo Fisher Scientific) operated at 100 keV.

CryoEM Grid preparation and data collection

For cryoEM analysis, 3 μl of the sample was applied to UltrAuFoil grids (Quantifoil, Germany; 2/2 200 mesh) that had been previously negatively glow discharged using PELCO Easiglow (Ted Pella, USA) and coated with a layer of graphene oxide⁵⁶. The grids were incubated after sample application for 30 s in the chamber of a Vitrobot Mark IV (Thermo Fisher Scientific, USA) at 4 °C and 95% humidity and excess sample was blotted and vitrified in liquid ethane. The data were collected either at the ISMB Birkbeck EM facility or at the eBIC National facility using a Titan Krios microscope (ThermoFisher, USA) operated at 300 keV. The Krios was equipped with a post-GIF direct electron detector (Gatan,USA) and a BioQuantum energy filter (Gatan, USA) with a slit width of 20 eV. Images were collected automatically using the EPU software (ThermoFisher, USA) in super resolution mode with the total dose fractionated over 50 frames. The data collection parameters used for the different datasets are listed in Supplementary Tables 1a and 2.

Image processing

The movies were first corrected for drift and dose-weighted using MotionCor2⁵⁷, followed by contrast transfer function (CTF) estimation using CTFFIND v4.1⁵⁸ within RELION⁵⁹. Micrographs with a CTF-fit score better than 8 Å were selected for further processing. Workflows for image processing are reported in Supplementary Figs. 3 and 6.

Image processing of ds_{-27_+143}-R

A total of 55,914 movies were collected for ds_{-27_+143}-R from three different sessions giving three datasets which were processed separately and combined at a later stage. Initially, a subset of 4000 images were selected from dataset 1, to generate an initial model to test data quality and create templates for particle picking. From the 4000 images, 953,598 particles picked by reference-free auto-picking (Laplacian-of-Gaussian) in RELION were extracted with a box size of 400 pixels (1.06 Å/pixel sampling) and subjected to several rounds of 2D classification in cryoSPARC⁶⁰. The resulting 374,358 particles were used for ab initio 3D classification followed by 3D refinement (cryoSPARC) of the best class (based on discernible features) to yield an initial 3D map (cryoSPARC). The map low pass filtered to 20 Å was used to obtain 30 randomized 2D projections, which were used as templates for reference-based Gautomatch v0.56⁶¹ particle picking for datasets 1-3.

A total of approximately 3.5, 6 and 10 million particles were picked with permissive picking parameters for datasets 1, 2 and 3, respectively. Particles in datasets 2 and 3 were further split into smaller subsets for downstream classification steps to speed up processing. Several rounds of 2D classification in cryoSPARC resulted in a few high resolution 2D classes in a few preferred orientations but showing clear secondary structure features, and a majority of low-resolution classes comprising broken and junk particles. Selecting only the high-resolution 2D classes resulted in maps with resolution anisotropy. Therefore, to avoid throwing away rare views the selection was kept liberal at the 2D classification stage, retaining 2D classes with any semblance of particle-like features and discarding only the ‘obvious’ junk classes. Instead, to tackle heterogeneity and clean up junk, we subjected the particles from 2D classification to rigorous rounds of ab initio 3D classification.

Best particles were selected from successive runs of ab initio classification where in each round the best 3D class was selected for the next round until an ab initio model with high-resolution features was obtained. The outcomes of datasets 1 and 2 were combined (556,418 particles) followed by 3D refinement in cryoSPARC resulting in a map of 5.97 Å. The map was used as an initial model for a round of alignment-free 3D classification in RELION with 6 classes and a regularisation parameter Tau of 16, to resolve further heterogeneity. Particles from the best two classes were combined with particles established from a similar workflow from dataset 3 and was refined to 4.31 Å resolution in cryoSPARC.

This consensus map of 337,238 particles showed disordered regions which disappeared when applying higher contour levels, indicating conformational heterogeneity. The density in the ordered region showing secondary structure details was subjected to local refinement in cryoSPARC with a mask excluding the disordered regions, yielding a map of 3.78 Å resolution. This map was used to build and refine models for oriT DNA, IHF, three molecules of TraY and TraI VH_2A+2B/2Blike. We could identify a poorly resolved region suggestive of 1 A of VH domain, but this region lacked secondary structure features and therefore no model was derived from it. The density in the disordered regions did not improve with further processing such as focused classification/refinement, particle subtraction etc.

Image processing of ss_{-27_+8}ds_{+9_+143-}R, ss_{-27_+8}ds_{+9_+143}-RΔ_TraM, ss_{−27_+8}ds_{+9_+143}-RΔ_AH+CTD

The overall strategy for the image processing of datasets with the ss/ds oriT heteroduplexes follows the one described above (Supplementary Fig. 6a–c). We were able to achieve high resolution maps with fewer images as compared to ds_{-27_+143}-R, since the samples seemed to have a higher degree of stability and homogeneity, likely due the binding of TraI TE which resulted in the arms of the DNA hairpin being less flexible. In the datasets of ss_{-27_+8}ds_{+9_+143}-R and ss_{-27_+8}ds_{+9_+143}-RΔ_TraM which were collected from regions of thicker ice, more particles in diverse orientations were observed as evidenced by 2D classes and the angular distribution calculated from cryoSPARC heat maps (Supplementary Fig. 4b, d), yielding better 3D reconstructions in both cases.

For ss_{-27_+8}ds_{+9_+143}R, a final set of 155,077 best particles were selected, resulting in a global refined map of 3.77 Å resolution. The ordered regions showing secondary structure details in the global map were further subjected to local refinement yielding a map of 3.45 Å resolution. This locally-refined map was then used to build and refine models of the TraI TE domain bound to ssDNA, in addition to oriT DNA, IHF, three molecules of TraY and TraI VH_2A+2B/2Blike. Upon inspecting the global map, secondary structural features were clearly recognisable for the NTD and 1 A subdomains of VH and was used to dock AlphaFold 2 models for the same (see below).

In the global map of ss_{-27_+8}ds_{+9_+143}R we could also identify two poorly resolved regions, one next to the TraI VH domain and another at the distal end of the TraY arm of the oriT DNA. By comparing the global map from ss_{-27_+8}ds_{+9_+143}-RΔ_AH+CTD (3.93 Å, 165,577 particles; Supplementary Fig. 4i) to the ss_{−27_+8}ds_{+9_+143}R map, we could assign this disordered region to the active helicase (AH) and C-terminal domains (CTD) of TraI. Similarly, the global ss_{-27_+8}ds_{+9_+143}-RΔ_TraM map (3.11 Å, 330,708 particles; Supplementary Fig. 4j) was used to assign the density in the distal end of the oriT DNA to TraM and its DNA binding region.

Image processing of ss_{-27_+3}ds_{+4_+143}-R and ss_{-27_-3}ds_{-2_+143}-R

To further explore if the relaxosome could lead to localised melting of oriT near the nic site to facilitate TE docking, we extended the 5’ end of the R-strand by five and ten nucleotides to yield ss_{-27_+3}ds_{+4_+143}-R and ss_{-27_-3}ds_{-2_+143}-R complexes, respectively. The data collection and processing were essentially similar to that of the previous ssds constructs with the final locally refined map resolved to 3.68 Å (159,080 particles; Supplementary Fig. 4e) for ss_{-27_+3}ds_{+4_+143}-R and 3.42 Å (177,743 particles; Supplementary Fig. 4f) for ss_{-27_-3}ds_{-2_+143}-R.

Image processing of ss_{-27_-8}ds_{-7_+143}-R, ss_{-27_-13}ds_{-12_+143}-R, ds_{-2_+113}-R, and dsR_{-67_+113}(poly-dT15_{-17_-3})

It was evident from the processing of earlier datasets that the docking of TE domain indicating localised melting of oriT could be visualised from high resolution 2D classes and was observable even in initial 3D maps obtained from ab initio models in cryoSPARC. Therefore, for the next constructs tested for oriT melting, the data processing was carried out only till we obtained initial 3D maps. Briefly, pre-processing of the datasets were carried out as above, followed by 2D classification and a round of ab initio classification with refinement of the best class yielding low resolution 3D maps. For comparison, initial 3D maps obtained during early processing steps of ds_{-27_+143}-R and ss_{-27_+8}ds_{+9_+143}-R at a similar resolution are shown (Fig. 7a, b). The data collection statistics, resolution and initial and final particles obtained during processing are given in Supplementary Table 2.

All maps used for model building were sharpened using DeepEMhancer⁶² and local resolution estimation was performed using cryoSPARC.

Model building and refinement

Atomic model building into the 3.78 Å ds_{-27_+143}-R locally refined map was initiated by fitting the crystal structure of the IHF-DNA module (PDB: 1IHF) using UCSF Chimera⁶³. Regular B-form DNA generated by Web 3DNA⁶⁴ was used as an initial model for the DNA arms and modified to fit the trajectory and helical pitch of the visible DNA density, using all-atom refine in COOT v0.9.8⁶⁵ with ProSMART restraints⁶⁶. Three copies of a model of TraY generated using AlphaFold2 (AF)⁶⁷ (Supplementary Fig. 5a–d) were fitted into the three TraY densities visible in the cryoEM map using UCSF Chimera. AlphaFold2 predicted model of F TraI (Supplementary Fig. 5a–d) was similar to the experimental structure of its homologue R1 TraI in its helicase mode. From the F TraI AF model, the VH_2A+2B/2Blike sub-domain structure was used for fitting into the corresponding density in the map. It should be noted that although the VH_2A+2B/2Blike and AH_2A+2B/2Blike superpose well, there are loops and secondary structure elements essential for ATP and ssDNA binding present in the latter, but absent in the VH_2A+2B/2Blike sub-domain (Supplementary Fig. 5e). The complete model was then subjected to a few rounds of iterative manual rebuilding in COOT and refinement in PHENIX⁶⁸.

The low resolution of the map compounded by motion of the DNA arms as predicted from the 3D variability analysis in cryoSPARC, led to some ambiguity in the DNA base density. Although individual bases could be resolved, confident assignment and differentiating between purines and pyrimidines in the oriT density remained challenging. To assign the DNA sequence, we decided to take advantage of the three known TraY binding sub-sites described by the Schildbach group⁶⁹. Therefore, starting from the three known TraY sites we manually updated the DNA sequence in COOT following the oriT DNA register. This assignment was only approximate, and the sequence was later reassigned guided by the ss/ds heteroduplex relaxosome maps which were of much higher quality in the DNA region (see below).

Further fitting of individual residues into the density were carried out by a combination of manual building using COOT and MDFF using ISOLDE⁷⁰. The model was inspected for correct rotamer/Ramachandran outliers and clashes in ISOLDE. Real-space refinement of the atomic model of ds_{-27_+143}-R was carried out in PHENIX with Ramachandran, secondary structure and nucleic-acid restraints applied.

To build the ss_{-27_+8}ds_{+9_+143}-R model into the corresponding 3.45 Å locally-refined map, the de novo built ds_{-27_+14}-R model was initially fitted into the density using UCSF Chimera. The crystal structure of the F TraI trans-esterase (TE) domain bound to its cognate ssDNA (PDB ID: 2A0I) was used as a model for fitting in the ordered TE density of the map. The higher resolution of the map and the stability of DNA arms resulted in an improved DNA density where the purines and pyrimidines were now clearly resolved. Therefore, the DNA sequence was accurately reassigned in the model manually using COOT. Individual protein and DNA residues were rebuilt into densities by iterative rounds of a combination of manual building in COOT and MDDF in ISOLDE. The model was inspected to correct rotamer/Ramachandran outliers and clashes using ISOLDE followed by refinement in PHENIX with restraints applied.

The completed atomic model for ss_{-27_+8}ds_{+9_+143}R without the TE-ssDNA was then used to further improve the ds_{-27_+14}-R model. After fitting into the ds_{-27_+14}-R map, it was rebuilt using COOT and ISOLDE with iterative rounds of real space refinement with restraints applied in PHENIX, to yield the final model for ds_{-27_+143}-R.

The ss_{-27_+8}ds_{+9_+143}-R model was also used as an initial model for the ss_{-27_+8}ds_{+9_+143}RΔ_AH+CTD, ss_{-27_+8}ds_{+9_+143}RΔ_TraM, ss_{-27_+3}ds_{+4_+143}R and ss_{-27_-3}ds_{-2_+143}R, rebuilt using COOT and ISOLDE with iterative rounds of real space refinement with restraints applied in PHENIX, to yield the final models for each of the complexes. These atomic models were nearly identical to ss_{-27_+8}ds_{+9_+143}-R with an average rms deviation of 0.738 Å for the Cα superposition.

For all the models, regions of poor Cα backbone density were subsequently deleted, and the side chains removed for areas with poor densities for side chains. The models were validated using Molprobity⁷¹. The statistics for model building, refinement and validation are reported in Supplementary Table 1b.

For fitting AF models of VH_NTD, VH_1A, AH, CTD and TraM (Supplementary Fig. 5a–d) to generate the ss_{-27_+8}ds_{+9_+143}-R global model, the unsharpened global ss_{-27_+8}ds_{+9_+143}-R 3.77 Å map was used. Initially, the refined ss_{-27_+8}ds_{+9_+143}-R structure was fitted in the global map. To generate a model for F TraM bound to DNA, an AF model of F TraM (Supplementary Fig. 5a–d) was first superposed onto the pED208 TraM:sbmA structure (PDB ID: 3ON0) with the CTDs as a guide. The NTDs of F TraM were adjusted to position on the NTDs of the DNA bound conformation of pED208 TraM to generate a model of F TraM bound to pED208 sbmA. This F TraM:pED208 sbmA model was docked as a rigid body into the map using UCSF chimera Fit in Map tool (cross-correlation 0.80 of map to model). The ends of the ss_{-27_+8}ds_{+9_+143}-R DNA proximal to TraM was extended by two base pairs, joined to pED208 sbmA DNA and the sequence was updated in the F TraM:pED208 sbmA model to match the oriT sequence by mutating DNA residues in COOT. From the AF generated model of F TraI, VH_NTD (residues 307-385) and VH_1A (residues 383-564) subdomains were separated and individually fitted in their densities in the map (correlation coefficients of 0.87 for VH_NTD and 0.89 for VH_1A). Next, Isolde was used to flexibly refine these models with adaptive distance restraints implemented. The models were then merged with the VH_2A+2B/B-like sub-domain to yield the complete VH domain.

For the AH domain, sub-domains of the F TraI AF model (Supplementary Fig. 5a–d) were fitted individually. The fitting was initiated with AH_1A for which the density exhibited partial secondary structure features and the AH_1A model (residues 934-1126) could be fitted with a correlation of 0.87. AH_NTD (residues 835-933) was fitted in its density (correlation coefficient of 0.91) and both the models were flexibly refined in Isolde with adaptive distance restraints. Next the CTD (residues 1460-1630) model was rigid body fitted in the map with a correlation of 0.84, and flexible fitting was carried out in Isolde with restraints applied. The AH_2A+2B/B-like model (residues 1127-1459) was manually fitted in the density as this region was fragmented and poorly defined. The fitting was done such that the N-terminus and C-terminus of the AH_2A+2B/B-like model was proximal to the C terminus of AH_1A model and the N terminus of CTD, respectively. AH_NTD, AH_1A and AH_2A+2B/B-like models were joined to give the full AH domain. This gave us the composite global model of a fully assembled relaxosome. The models thus assembled gave rise to minimal clashes in the structure and the fitting of sub-domains as described above resulted in the corresponding N and C termini being in proximity such that they could all be merged confidently. Note that the side chains for VH_NTD, VH_1A, AH and CTD were removed in the final model.

Interactions, analysis, visualisation and representation

Interactions were calculated using the PDBePisa server⁷². Domain movements were analysed using the DynDom program⁷³. CryoEM maps and models were visualised and figures prepared using UCSF Chimera⁶³, UCSF ChimeraX⁷⁴ and PyMOL⁷⁵.

Construction of p99I⁺ mutant derivatives

Double- or triple- site mutations in F traI were generated by cutting p99I⁺ with Eco0901 to remove a 1.4 kb fragment of TraI or in case of variant S73E W659 with KpnI/Eco0901 to remove 3 kb. Forward primer MutI fw and the reverse primer MutI rev were used to create all mutant alleles except mutant S73E W659E which required forward primer MutI17fw (Supplementary Table 4c). Mutant-specific primer pairs (Supplementary Table 4d) containing the mutations in their overlap region were used together with these forward/reverse pairs to generate fragments suitable for error free joining via seamless assembly of multiple DNA Fragments [NEBuilder HiFi DNA Assembly Kit (NEB)]. E. coli DH5α cells were transformed with the ligation products. Oligonucleotide synthesis and Sanger sequencing of the mutated variants was performed by Microsynth Austria.

Bacterial conjugation

Conjugative plasmid transfer was performed as described in ref. ⁷⁶. The conjugation deficient phenotype of E. coli MS411 [pOX38ΔtraI] donor cells was complemented in trans with wild-type [p99I⁺] or mutant traI alleles. Overnight cultures of donor strains were diluted to an optical density at 600 nm (OD₆₀₀) of 0.005 in 900 µl antibiotic-free LB medium and incubated for 1 h at 37 °C before 100 µl of recipient culture (OD₆₀₀ of 2) was added. Conjugation was allowed to proceed for 1 h at 37 °C and then stopped by vortexing for 1 min. Transconjugants were selected on plates containing 25 µg/ml Sm and 40 µg/ml Km. Conjugation frequencies were expressed as transconjugants per donor cell (Supplementary Table 3).

Statistical analysis

Statistical analysis was performed using GraphPad prism version 10. Comparisons of conjugation frequencies of wild-type TraI and mutated variants for each interface hub was performed by ordinary one-way ANOVA with Holm-Šídák’s multiple comparisons test; P < 0.05 was considered significant (Supplementary Table 3).

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

EM maps and atomic models were deposited to the Electron Microscopy Data Bank (EMDB) and Protein Data Bank (PDB) databases. Accession codes can be found in Supplementary Tables 1 a, b and 2. PDB codes for the various structures reported in this manuscript are 9F0X, 9F0Y, 9F0Z, 9F10, 9F11, 9F12 and the EMDB accession codes are EMD-50117, EMD-50118, EMD-50119, EMD-50120, EMD-50121, EMD-50122, EMD-50131, EMD-50132, EMD-50133, EMD-50098, EMD-50099, EMD-50102, EMD-50103, EMD-50104, EMD-50105 and EMD-53548. A model of the fully-assembled ss_{-27_+8}ds_{+9_+143}-R relaxosome and an associated ChimeraX session can be found with the Supplementary Data 1. All constructs (wild type and mutants) used in this study can be obtained on request to GW. Source data are provided with this paper.

References

Furste, J. P., Pansegrau, W., Ziegelin, G., Kroger, M. & Lanka, E. Conjugative transfer of promiscuous IncP plasmids: interaction of plasmid-encoded products with the transfer origin. Proc. Natl Acad. Sci. USA 86, 1771–1775 (1989).
Article ADS CAS PubMed PubMed Central Google Scholar
Zechner, E. L., Moncalian, G. & de la Cruz, F. Relaxases and plasmid transfer in gram-negative bacteria. Curr. Top. Microbiol Immunol. 413, 93–113 (2017).
CAS PubMed Google Scholar
Chen, C. Y., Fuqua, C., Jackson, C. R., Kadlec, K. & Top, E. M. Editorial: Plasmid transfer-mechanisms, ecology, evolution and applications. Front Microbiol 13, 993628 (2022).
Article PubMed PubMed Central Google Scholar
Winter, M., Buckling, A., Harms, K., Johnsen, P. J. & Vos, M. Antimicrobial resistance acquisition via natural transformation: context is everything. Curr. Opin. Microbiol 64, 133–138 (2021).
Article CAS PubMed Google Scholar
Colavecchio, A., Cadieux, B., Lo, A. & Goodridge, L. D. Bacteriophages contribute to the spread of antibiotic resistance genes among foodborne pathogens of the enterobacteriaceae family - A review. Front Microbiol 8, 1108 (2017).
Article PubMed PubMed Central Google Scholar
Waksman, G. From conjugation to T4S systems in gram-negative bacteria: a mechanistic biology perspective. EMBO Rep 20 https://doi.org/10.15252/embr.201847012 (2019).
Arnold, B. J., Huang, I. T. & Hanage, W. P. Horizontal gene transfer and adaptive evolution in bacteria. Nat. Rev. Microbiol 20, 206–218 (2022).
Article CAS PubMed Google Scholar
Graf, F. E., Palm, M., Warringer, J. & Farewell, A. Inhibiting conjugation as a tool in the fight against antibiotic resistance. Drug Dev. Res 80, 19–23 (2019).
Article CAS PubMed Google Scholar
Antimicrobial Resistance, C. Global burden of bacterial antimicrobial resistance in 2019: a systematic analysis. Lancet 399, 629-655 (2022).
Stephens, C. et al. F Plasmids are the major carriers of antibiotic resistance genes in human-associated commensal escherichia coli. mSphere 5 https://doi.org/10.1128/mSphere.00709-20 (2020).
Frost, L. S., Ippen-Ihler, K. & Skurray, R. A. Analysis of the sequence and gene products of the transfer region of the F sex factor. Microbiol Rev. 58, 162–210 (1994).
Article CAS PubMed PubMed Central Google Scholar
Liu, G. et al. oriTDB: a database of the origin-of-transfer regions of bacterial mobile genetic elements. Nucleic Acids Res https://doi.org/10.1093/nar/gkae869 (2024).
Ares-Arroyo, M., Coluzzi, C. & Rocha, E. P. C. Origins of transfer establish networks of functional dependencies for plasmid transfer by conjugation. Nucleic Acids Res 51, 3001–3016 (2023).
Article CAS PubMed Google Scholar
Ilangovan, A. et al. Cryo-EM Structure of a Relaxase Reveals the Molecular Basis of DNA Unwinding during Bacterial Conjugation. Cell 169, 708–721.e712 (2017).
Article CAS PubMed PubMed Central Google Scholar
Matson, S. W. & Ragonese, H. The F-plasmid TraI protein contains three functional domains required for conjugative DNA strand transfer. J. Bacteriol. 187, 697–706 (2005).
Article CAS PubMed PubMed Central Google Scholar
Guogas, L. M., Kennedy, S. A., Lee, J. H. & Redinbo, M. R. A novel fold in the TraI relaxase-helicase c-terminal domain is essential for conjugative DNA transfer. J. Mol. Biol. 386, 554–568 (2009).
Article CAS PubMed Google Scholar
Krishna, B. et al. (1)H, (13)C, (15)N resonance assignment of the C-terminal domain of the bifunctional enzyme TraI of plasmid R1. Biomol. NMR Assign. 13, 121–125 (2019).
Article CAS PubMed PubMed Central Google Scholar
Li, Y. G., Breidenstein, A., Berntsson, R. P. & Christie, P. J. Conjugative transfer of the IncN plasmid pKM101 is mediated by dynamic interactions between the TraK accessory factor and TraI relaxase. FEBS Lett https://doi.org/10.1002/1873-3468.15011 (2024).
Rice, P. A., Yang, S., Mizuuchi, K. & Nash, H. A. Crystal structure of an IHF-DNA complex: a protein-induced DNA U-turn. Cell 87, 1295–1306 (1996).
Article CAS PubMed Google Scholar
Fekete, R. A. & Frost, L. S. Characterizing the DNA contacts and cooperative binding of F plasmid TraM to its cognate sites at oriT. J. Biol. Chem. 277, 16705–16711 (2002).
Article CAS PubMed Google Scholar
Lu, J. & Frost, L. S. Mutations in the C-terminal region of TraM provide evidence for in vivo TraM-TraD interactions during F-plasmid conjugation. J. Bacteriol. 187, 4767–4773 (2005).
Article CAS PubMed PubMed Central Google Scholar
Nelson, W. C., Howard, M. T., Sherman, J. A. & Matson, S. W. The traY gene product and integration host factor stimulate Escherichia coli DNA helicase I-catalyzed nicking at the F plasmid oriT. J. Biol. Chem. 270, 28374–28380 (1995).
Article CAS PubMed Google Scholar
Mace, K. et al. Cryo-EM structure of a type IV secretion system. Nature 607, 191–196 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Costa, T. R. D. et al. Structure of the bacterial sex f pilus reveals an assembly of a stoichiometric protein-phospholipid complex. Cell 166, 1436–1444.e1410 (2016).
Article CAS PubMed PubMed Central Google Scholar
Ragonese, H., Haisch, D., Villareal, E., Choi, J. H. & Matson, S. W. The F plasmid-encoded TraM protein stimulates relaxosome-mediated cleavage at oriT through an interaction with TraI. Mol. Microbiol 63, 1173–1184 (2007).
Article CAS PubMed Google Scholar
Meir, A., Mace, K., Vegunta, Y., Williams, S. M. & Waksman, G. Substrate recruitment mechanism by gram-negative type III, IV, and VI bacterial injectisomes. Trends Microbiol 31, 916–932 (2023).
Article CAS PubMed Google Scholar
de la Cruz, F., Frost, L. S., Meyer, R. J. & Zechner, E. L. Conjugative DNA metabolism in Gram-negative bacteria. FEMS Microbiol Rev. 34, 18–40 (2010).
Article PubMed Google Scholar
Fraikin, N., Couturier, A. & Lesterlin, C. The winding journey of conjugative plasmids toward a novel host cell. Curr. Opin. Microbiol 78, 102449 (2024).
Article CAS PubMed Google Scholar
Ryan, M. E., Damke, P. P. & Shaffer, C. L. DNA Transport through the Dynamic Type IV Secretion System. Infect. Immun. 91, e0043622 (2023).
Article PubMed Google Scholar
Lahue, E. E. & Matson, S. W. Escherichia coli DNA helicase I catalyzes a unidirectional and highly processive unwinding reaction. J. Biol. Chem. 263, 3208–3215 (1988).
Article CAS PubMed Google Scholar
Dostal, L. & Schildbach, J. F. Single-stranded DNA binding by F TraI relaxase and helicase domains is coordinately regulated. J. Bacteriol. 192, 3620–3628 (2010).
Article CAS PubMed PubMed Central Google Scholar
Ilangovan, A., Connery, S. & Waksman, G. Structural biology of the Gram-negative bacterial conjugation systems. Trends Microbiol 23, 301–310 (2015).
Article CAS PubMed Google Scholar
Larkin, C. et al. Inter- and intramolecular determinants of the specificity of single-stranded DNA binding and cleavage by the F factor relaxase. Structure 13, 1533–1544 (2005).
Article CAS PubMed Google Scholar
Fu, Y. H., Tsai, M. M., Luo, Y. N. & Deonier, R. C. Deletion analysis of the F plasmid oriT locus. J. Bacteriol. 173, 1012–1020 (1991).
Article CAS PubMed PubMed Central Google Scholar
Wong, J. J., Lu, J., Edwards, R. A., Frost, L. S. & Glover, J. N. Structural basis of cooperative DNA recognition by the plasmid conjugation factor, TraM. Nucleic Acids Res 39, 6775–6788 (2011).
Article CAS PubMed PubMed Central Google Scholar
Crespo, I. et al. Structural and biochemical characterization of the relaxosome auxiliary proteins encoded on the Bacillus subtilis plasmid pLS20. Comput Struct. Biotechnol. J. 20, 757–765 (2022).
Article CAS PubMed PubMed Central Google Scholar
Rehman, S. et al. Enterococcal PcfF Is a ribbon-helix-helix protein that recruits the relaxase pcfg through binding and bending of the oriT sequence. Front Microbiol 10, 958 (2019).
Article PubMed PubMed Central Google Scholar
Traore, D. A. K. et al. Crystal structure of TcpK in complex with oriT DNA of the antibiotic resistance plasmid pCW3. Nat. Commun. 9, 3732 (2018).
Article ADS PubMed PubMed Central Google Scholar
Wisniewski, J. A. et al. TcpM: a novel relaxase that mediates transfer of large conjugative plasmids from Clostridium perfringens. Mol. Microbiol 99, 884–896 (2016).
Article CAS PubMed Google Scholar
Tsai, M. M., Fu, Y. H. & Deonier, R. C. Intrinsic bends and integration host factor binding at F plasmid oriT. J. Bacteriol. 172, 4603–4609 (1990).
Article CAS PubMed PubMed Central Google Scholar
Stern, J. C. & Schildbach, J. F. DNA recognition by F factor TraI36: highly sequence-specific binding of single-stranded DNA. Biochemistry 40, 11586–11595 (2001).
Article CAS PubMed Google Scholar
Lu, J. et al. Structural basis of specific TraD-TraM recognition during F plasmid-mediated bacterial conjugation. Mol. Microbiol 70, 89–99 (2008).
Article CAS PubMed Google Scholar
Howard, M. T., Nelson, W. C. & Matson, S. W. Stepwise assembly of a relaxosome at the F plasmid origin of transfer. J. Biol. Chem. 270, 28381–28386 (1995).
Article CAS PubMed Google Scholar
Llosa, M., Grandoso, G., Hernando, M. A. & de la Cruz, F. Functional domains in protein TrwC of plasmid R388: dissected DNA strand transferase and DNA helicase activities reconstitute protein function. J. Mol. Biol. 264, 56–67 (1996).
Article CAS PubMed Google Scholar
Llosa, M., Zunzunegui, S. & de la Cruz, F. Conjugative coupling proteins interact with cognate and heterologous VirB10-like proteins while exhibiting specificity for cognate relaxosomes. Proc. Natl Acad. Sci. USA 100, 10465–10470 (2003).
Article ADS CAS PubMed PubMed Central Google Scholar
Moncalian, G. & de la Cruz, F. DNA binding properties of protein TrwA, a possible structural variant of the Arc repressor superfamily. Biochim Biophys. Acta 1701, 15–23 (2004).
Article CAS PubMed Google Scholar
Laroussi, H. et al. Exploration of DNA processing features unravels novel properties of ICE conjugation in Gram-positive bacteria. Nucleic Acids Res 50, 8127–8142 (2022).
Article CAS PubMed PubMed Central Google Scholar
Luo, Y., Gao, Q. & Deonier, R. C. Boundaries of the nicking region for the F plasmid transfer origin, oriT. Mol. Microbiol 15, 829–837 (1995).
Article CAS PubMed Google Scholar
Goldlust, K. et al. The F pilus serves as a conduit for the DNA during conjugation between physically distant bacteria. Proc. Natl Acad. Sci. USA 120, e2310842120 (2023).
Article CAS PubMed PubMed Central Google Scholar
Hu, B., Khara, P. & Christie, P. J. Structural bases for F plasmid conjugation and F pilus biogenesis in Escherichia coli. Proc. Natl Acad. Sci. USA 116, 14222–14227 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Lang, S. et al. Molecular recognition determinants for type IV secretion of diverse families of conjugative relaxases. Mol. Microbiol 78, 1539–1555 (2010).
Article CAS PubMed Google Scholar
Redzej, A. et al. Structure of a translocation signal domain mediating conjugative transfer by type IV secretion systems. Mol. Microbiol 89, 324–333 (2013).
Article CAS PubMed PubMed Central Google Scholar
Haft, R. J. et al. General mutagenesis of F plasmid TraI reveals its role in conjugative regulation. J. Bacteriol. 188, 6346–6353 (2006).
Article CAS PubMed PubMed Central Google Scholar
Nash, H. A., Robertson, C. A., Flamm, E., Weisberg, R. A. & Miller, H. I. Overproduction of Escherichia coli integration host factor, a protein with nonidentical subunits. J. Bacteriol. 169, 4124–4127 (1987).
Article CAS PubMed PubMed Central Google Scholar
Dostal, L., Shao, S. & Schildbach, J. F. Tracking F plasmid TraI relaxase processing reactions provides insight into F plasmid transfer. Nucleic Acids Res 39, 2658–2670 (2011).
Article CAS PubMed Google Scholar
Cheng, K., Wilkinson, M., Chaban, Y. & Wigley, D. B. A conformational switch in response to Chi converts RecBCD from phage destruction to DNA repair. Nat. Struct. Mol. Biol. 27, 71–77 (2020).
Article CAS PubMed PubMed Central Google Scholar
Zheng, S. Q. et al. MotionCor2: anisotropic correction of beam-induced motion for improved cryo-electron microscopy. Nat. Methods 14, 331–332 (2017).
Article CAS PubMed PubMed Central Google Scholar
Rohou, A. & Grigorieff, N. CTFFIND4: Fast and accurate defocus estimation from electron micrographs. J. Struct. Biol. 192, 216–221 (2015).
Article PubMed PubMed Central Google Scholar
Zivanov, J. et al. A Bayesian approach to single-particle electron cryo-tomography in RELION-4.0. Elife11, https://doi.org/10.7554/eLife.83724 (2022).
Punjani, A., Rubinstein, J. L., Fleet, D. J. & Brubaker, M. A. cryoSPARC: algorithms for rapid unsupervised cryo-EM structure determination. Nat. Methods 14, 290–296 (2017).
Article CAS PubMed Google Scholar
Zhang, K. Fully automatic acccurate, convenient and extremely fast particle picking for EM. (2017).
Sanchez-Garcia, R. et al. DeepEMhancer: a deep learning solution for cryo-EM volume post-processing. Commun. Biol. 4, 874 (2021).
Article PubMed PubMed Central Google Scholar
Pettersen, E. F. et al. UCSF Chimera–a visualization system for exploratory research and analysis. J. Comput Chem. 25, 1605–1612 (2004).
Article CAS PubMed Google Scholar
Zheng, G., Lu, X. J. & Olson, W. K. Web 3DNA–a web server for the analysis, reconstruction, and visualization of three-dimensional nucleic-acid structures. Nucleic Acids Res 37, W240–W246 (2009).
Article CAS PubMed PubMed Central Google Scholar
Emsley, P., Lohkamp, B., Scott, W. G. & Cowtan, K. Features and development of Coot. Acta Crystallogr D. Biol. Crystallogr 66, 486–501 (2010).
Article ADS CAS PubMed PubMed Central Google Scholar
Nicholls, R. A., Fischer, M., McNicholas, S. & Murshudov, G. N. Conformation-independent structural comparison of macromolecules with ProSMART. Acta Crystallogr D. Biol. Crystallogr 70, 2487–2499 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Jumper, J. et al. Highly accurate protein structure prediction with AlphaFold. Nature 596, 583–589 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Liebschner, D. et al. Macromolecular structure determination using X-rays, neutrons and electrons: recent developments in Phenix. Acta Crystallogr D. Struct. Biol. 75, 861–877 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Lum, P. L., Rodgers, M. E. & Schildbach, J. F. TraY DNA recognition of its two F factor binding sites. J. Mol. Biol. 321, 563–578 (2002).
Article CAS PubMed Google Scholar
Croll, T. I. ISOLDE: a physically realistic environment for model building into low-resolution electron-density maps. Acta Crystallogr D. Struct. Biol. 74, 519–530 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Williams, C. J. et al. MolProbity: More and better reference data for improved all-atom structure validation. Protein Sci. 27, 293–315 (2018).
Article CAS PubMed Google Scholar
Krissinel, E. & Henrick, K. Inference of macromolecular assemblies from crystalline state. J. Mol. Biol. 372, 774–797 (2007).
Article CAS PubMed Google Scholar
Hayward, S. & Berendsen, H. J. Systematic analysis of domain motions in proteins from conformational change: new results on citrate synthase and T4 lysozyme. Proteins 30, 144–154 (1998).
Article CAS PubMed Google Scholar
Pettersen, E. F. et al. UCSF ChimeraX: Structure visualization for researchers, educators, and developers. Protein Sci. 30, 70–82 (2021).
Article CAS PubMed Google Scholar
DeLano, W. L. Pymol: an open-source molecular graphics tool. CCP4. Newsl. Protein Crystallogr. 40, 82–92 (2002).
Lang, S., Gruber, C. J., Raffl, S., Reisner, A. & Zechner, E. L. Common requirement for the relaxosome of plasmid R1 in multiple activities of the conjugative type IV secretion system. J. Bacteriol. 196, 2108–2121 (2014).
Article PubMed PubMed Central Google Scholar
Trokter, M. & Waksman, G. Translocation through the conjugative type IV secretion system requires unfolding of its protein substrate. J. Bacteriol. 200, https://doi.org/10.1128/JB.00615-17 (2018).
Cheng, Y., McNamara, D. E., Miley, M. J., Nash, R. P. & Redinbo, M. R. Functional characterization of the multidomain F plasmid TraI relaxase-helicase. J. Biol. Chem. 286, 12670–12682 (2011).
Article CAS PubMed PubMed Central Google Scholar
Schreiter, E. R. & Drennan, C. L. Ribbon-helix-helix transcription factors: variations on a theme. Nat. Rev. Microbiol 5, 710–720 (2007).
Article CAS PubMed Google Scholar
Williams, S. https://BioRender.com/tecqen1 (2025).

Download references

Acknowledgements

We would like to thank Dr. David Houldershaw for IT support and Drs. Natasha Lukoyanova and Shu Chen for collection of data on the in-house Krios. This work was supported by Wellcome grants 098302 and 217089 to GW and the Austrian Science Fund (FWF) [10.55776/DOC50 Molecular Metabolism] to ELZ. Cryo-EM data were collected at the ISMB EM facility at Birkbeck College, University of London with financial support from Wellcome Trust (202679/Z/16/Z and 206166/Z/17/Z) and at the UK national electron Bio-Imaging Centre (eBIC).

Author information

Authors and Affiliations

Institute of Structural and Molecular Biology, School of Natural Sciences, Birkbeck College, Malet Street, London, WC1E 7HX, UK
Sunanda M. Williams, Aravindan Ilangovan & Gabriel Waksman
Institute of Molecular Biosciences, University of Graz, BioTechMed-Graz, Humboldtstrasse 50, 8010, Graz, Austria
Sandra Raffl, Sabine Kienesberger & Ellen L. Zechner
Centre for Molecular Cell Biology, School of Biological and Behavioural Sciences, Queen Mary University of London, Newark Street, London, E1 2AT, UK
Aravindan Ilangovan
Institute of Structural and Molecular Biology, Division of Biosciences, Gower Street, University College London, London, WC1E 6BT, UK
Gabriel Waksman

Authors

Sunanda M. Williams
View author publications
Search author on:PubMed Google Scholar
Sandra Raffl
View author publications
Search author on:PubMed Google Scholar
Sabine Kienesberger
View author publications
Search author on:PubMed Google Scholar
Aravindan Ilangovan
View author publications
Search author on:PubMed Google Scholar
Ellen L. Zechner
View author publications
Search author on:PubMed Google Scholar
Gabriel Waksman
View author publications
Search author on:PubMed Google Scholar

Contributions

S.M.W. purified the DNA and protein components, established the conditions for relaxosome assembly and set up the DNaseI footprinting protocol. S.M.W. conducted NS and cryoEM sample preparations, data collection, processing, model building and refinement. SMW and GW carried out structural analyses. S.R., S.K., and E.L.Z. performed the mutational analysis of traI. AI was involved with relaxosome characterisation in the initial phase of the project. S.M.W., E.L.Z. and G.W. wrote the manuscript with inputs from SR. GW supervised the work.

Corresponding authors

Correspondence to Sunanda M. Williams or Gabriel Waksman.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Description of Additional Supplementary Files

Supplementary Data 1

Reporting Summary

Transparent Peer Review file

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Williams, S.M., Raffl, S., Kienesberger, S. et al. Cryo-EM Structure of the relaxosome, a complex essential for bacterial mating and the spread of antibiotic resistance genes. Nat Commun 16, 4906 (2025). https://doi.org/10.1038/s41467-025-60116-6

Download citation

Received: 19 December 2024
Accepted: 13 May 2025
Published: 27 May 2025
Version of record: 27 May 2025
DOI: https://doi.org/10.1038/s41467-025-60116-6