Prevalent chromosome fusion in Vibrio cholerae O1

Cuénod, Aline; Chac, Denise; Khan, Ashraful I.; Chowdhury, Fahima; Hyppa, Randy W.; Markiewicz, Susan M.; Rice, Amelia; Kholwadwala, Akhil; Calderwood, Stephen B.; Ryan, Edward T.; Harris, Jason B.; LaRocque, Regina C.; Bhuiyan, Taufiqur R.; Smith, Gerald R.; Qadri, Firdausi; Lypaczewski, Patrick; Weil, Ana A.; Shapiro, B. Jesse

doi:10.1038/s41467-025-60699-0


Article
Open access
Published: 01 July 2025


Nature Communications volume 16, Article number: 5830 (2025)



Abstract

Two circular chromosomes are a defining feature of the bacterial family Vibrionaceae, including the pathogen Vibrio cholerae, with rare reports of isolates with a single, fused chromosome. Here, we use long-read sequencing to analyse 467 V. cholerae O1 isolates from 47 cholera patients and household contacts in Bangladesh. We identify several independent chromosome fusion events that are likely transmissible within a household. Fusions occur in a 12 kilobase-pair homologous sequence shared between the two chromosomes and are stable for at least 200 generations under laboratory conditions. We find no detectable effect of fusion on V. cholerae growth, virulence factor expression, or biofilm formation. The factors promoting fusion, affecting chromosome stability, and subtle phenotypic or clinical consequences merit further investigation.

Introduction

Cholera is a waterborne infectious disease affecting millions of people yearly and causing outbreaks where sanitary infrastructure is inadequate¹. The ongoing seventh cholera pandemic is caused by a pathogenic lineage (7PET) of Vibrio cholerae O1 carrying four virulence-associated genomic islands: VPI-1, VPI-2, VSP-I and VSP-II^2,3,4. In Bangladesh, where cholera is endemic, the 7PET sublineage BD2 was dominant between 2009 and 2018, followed by BD1.2, which was responsible for a large outbreak in Dhaka in 2022⁵. V. cholerae typically carries two chromosomes: the larger ~3 megabase-pair (Mbp) chromosome 1 and the smaller ~1 Mbp chromosome 2. When chromosome 2 replication is impaired under laboratory conditions, the two chromosomes can fuse to restore cell division⁶. Out of thousands of sequenced genomes, only three V. cholerae with fused chromosomes have been reported to date from natural environments^6,7,8,9. These have typically been considered rare exceptions to the bipartite genome structure. However, due in part to limitations of short-read sequencing, the prevalence of chromosome fusion in V. cholerae remains unknown.

Here, we observe chromosome fusion in 58 out of 467 clinical V. cholerae isolates collected from 47 patients living in 21 households in Dhaka, Bangladesh. We identify multiple independent fusion events that appear to be stable enough to be transmitted within households. Fusions occur in a 12-kilobase-pair homologous sequence shared between the two chromosomes and are stable for 200 generations under laboratory conditions. We find no detectable effect of fusion on V. cholerae growth, virulence factor expression, or biofilm formation. More subtle phenotypic effects or clinical impacts of chromosome fusions remain to be investigated.

Results and discussion

Chromosome fusion is prevalent in clinical V. cholerae O1 isolates

We aimed to detect chromosome fusion in clinical V. cholerae isolates and identify potential fusion mechanisms. To do so, we used long-read nanopore sequencing of 467 V. cholerae isolates, collected between 2015 and 2018 from 47 patients (21 index cases and 26 household contacts) from 21 households in Dhaka, Bangladesh (Fig. 1A, Supplementary Data 1, Fig. S1). All isolates were identified as serotype O1. Of these, 409 genomes assembled into two circular chromosomes (approximately 3 and 1 Mbp, respectively) and 58 into a single 4 Mbp chromosome. All 58 single-chromosome genomes resulted from an apparent fusion of chromosomes 1 and 2. These fused chromosomes were identified in ten different people from five different households. These people include four index cases and six household contacts, two of whom experienced cholera symptoms (Supplementary Data 1).
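The classification described above can be illustrated with a minimal sketch, assuming an assembly is labelled from the number and approximate sizes of its circular contigs. The function name `classify_genome` and the 25% size tolerance are illustrative assumptions, not part of the study's pipeline; only the counts (409 and 58) come from the text.

```python
def classify_genome(contig_lengths_bp, tolerance=0.25):
    """Label an assembly as 'fused', 'two-chromosome', or 'unresolved'.

    contig_lengths_bp: lengths of the circular contigs in one assembly.
    tolerance: allowed relative deviation from the expected sizes
               (an assumed value, chosen only for illustration).
    """
    expected = {"chr1": 3_000_000, "chr2": 1_000_000, "fused": 4_000_000}

    def near(length, target):
        return abs(length - target) <= tolerance * target

    lengths = sorted(contig_lengths_bp, reverse=True)
    if len(lengths) == 1 and near(lengths[0], expected["fused"]):
        return "fused"
    if (len(lengths) == 2
            and near(lengths[0], expected["chr1"])
            and near(lengths[1], expected["chr2"])):
        return "two-chromosome"
    return "unresolved"


# Counts reported in the text.
counts = {"two-chromosome": 409, "fused": 58}
total = sum(counts.values())
fused_fraction = counts["fused"] / total
print(f"{counts['fused']}/{total} isolates fused = {fused_fraction:.1%}")
```

With the reported counts, fused genomes make up roughly one in eight of the sequenced isolates.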

**Fig. 1: *V. cholerae* with a fused chromosome identified in multiple patients and households.**

No phenotypic effect of chromosome fusion identified

Comparing closely related fused and non-fused isolates from the same patient (n = 2 pairs, 0 and 2 high-quality SNVs between genomes within a pair; Methods) we found no difference in growth or in expression of the key virulence factors cholera toxin (ctx) and toxin co-regulated pilus (tcp), or in their capability to form biofilms (Fig. S2). This does not exclude the possibility of subtle phenotypic differences that could be detected with larger sample sizes or with other assays.

Chromosome fusion is confirmed using Pulsed-Field Gel Electrophoresis

As independent verification of chromosome fusion, we subjected a subset of isolates to pulsed-field gel electrophoresis (PFGE). We included one putative fused-chromosome isolate per patient for which at least one fused chromosome was assembled (n = 10) and three putative non-fused-chromosome isolates for comparison. As expected, we detected one band at 4 Mbp for all putative fused-chromosome isolates and two bands at 3 and 1 Mbp for the non-fused isolates, corresponding to the known sizes of chromosomes 1 and 2, respectively (Fig. 1B). Some of the fused isolates have a weak, poorly defined band at about 1 Mbp in addition to the band at 4 Mbp, but the lack of a band at 3 Mbp in these isolates indicates that fusion did occur. We hypothesize that the weak band arises from fragmented DNA (<1 Mbp) that ran to this position in the pulsed-field gel. Chromosome fusion thus does not appear to be an artefact of sequencing or assembly.

Chromosome fusions occur at 12 Kbp homologous sequence

To understand the mechanism of fusion, we scanned the flanking regions on either side of the fusion site. We found that in fused chromosomes, the chromosome 2 sequence is flanked by a 12 kilobase-pair (Kbp) homologous sequence (HS1) oriented in the same direction on either side of the integrated chromosome 1 sequence. In many non-fused strains, HS1 appears twice: once on chromosome 1 and once on chromosome 2 (Fig. 2A). This suggests homologous recombination at HS1 as a potential fusion mechanism. To further support the observations of fusion in the assemblies, we screened the raw sequence data for reads spanning HS1 and its flanking regions, which were highly concordant with the results of the assemblies (Fig. S3). On chromosome 1, HS1 is located within VPI-2 and contains genes VC1788-VC1803 whereas on chromosome 2 HS1 occurs between VCA0451 and VCA0453. HS1 encodes multiple proteins linked to horizontal gene transfer (Fig. 2B). These include two ISVch4 elements of the IS3 family, which have previously been identified as fusion sites in V. cholerae strains which fused upon experimental deletion of the DNA adenine methylase Dam⁶ and in a naturally fused V. cholerae O1 isolate from a patient who travelled to Indonesia in 1997⁸. These are not the same recombination sequences or IS3 elements that have previously been described in two other naturally fused non-O1 strains^7,9, indicating multiple viable fusion mechanisms.

**Fig. 2: Chromosome fusion events occur in *V. cholerae* sublineages with a shared homologous sequence on each chromosome.**

Multiple independent fusion events appear to be stable enough to be transmitted

We next examined the phylogenetic distribution of fused and non-fused chromosomes. The phylogenetic clustering of patients within a household is consistent with previous studies^10,11 suggesting V. cholerae transmission within households (Fig. 2C), including instances of potential fused chromosome transmission in households P, I, and F (Fig. 2D). All fused chromosomes were part of the 7PET sublineage BD2 (Fig. 2C). Most BD2 genomes in our dataset contained two copies of HS1 (one on each chromosome), likely explaining their propensity for fusion. By contrast, other sublineages – notably IND1.3 – contained only one HS1 copy (Fig. 2C), preventing chromosome fusion through the same mechanism.

The phylogeny indicates five potential fusion events, defined as closely related genomes including at least one with a fused chromosome, collected from the same household. Four of these five events are likely independent because they are separated on the phylogeny by other well-supported clades with two chromosomes (Fig. 2C; ultrafast bootstrap approximation support >95). Event 1 is a single fusion event in a household contact, with no evidence of onward transmission (Figs. 2D, S4). In event 5, 7/10 isolates from the index case had a fused chromosome, but 0/10 isolates from the household contact were fused, and the contact's isolates appear to be only distantly related to the index case isolates (12–29 SNVs; Fig. S4), again suggesting a single fusion event and no onward transmission. In events 2–4, we observe fused chromosomes amongst the closely related genomes (typically 0–2 SNVs; Fig. S4) isolated from both index cases and household contacts, suggesting transmission of the fusion. The clade containing event 2 includes an index case (with 1 fused chromosome out of 9 genomes sequenced) and contact (5/10 fused) from household P, as well as non-fused chromosomes from household O. This observation is consistent with a variety of scenarios, including a single fusion event within household P, followed by transmission of a mixed fused/non-fused population within the household. Alternatively, only non-fused V. cholerae could have been transmitted, followed by independent fusion events in the index and contact within household P. The clades containing events 3 and 4 each involve a single household with one index and two contacts, all of which contained fused chromosomes (Fig. 2D). The isolates from these two households are closely related and separated by a poorly supported branch of non-fused isolates (ultrafast bootstrap approximation support <25). This is consistent with either independent fusion events in each household or a single fusion event stable enough to be transmitted between households. Within each household, these clades are also consistent with transmission of a mixed fused/non-fused population, or with 2–3 independent fusion events. A more parsimonious scenario would be a single fusion event in the index case, followed by fused-chromosome transmission to both contacts, and a single fission event in one contact. Although we cannot formally distinguish between these scenarios, it is clear that chromosome fusion occurs repeatedly and is stable enough to be transmitted within and possibly between households.

Chromosome fusion is stable under laboratory conditions

To further quantify the stability of chromosome fusion, we conducted a passaging experiment and found the fusion state to be stable over 200 generations under laboratory conditions (Supplementary Data 2, Fig. S5). This apparent stability could be explained if fused isolates are recombination-deficient and therefore unable to reverse the fusion event. However, this does not appear to be the case: genes involved in recombination (rec and mut genes) did not contain any fusion-specific mutations. To search for other mutations that could potentially stabilize chromosomal fusion, we used a genome-wide association study (Methods) but did not find any protein-coding variants significantly associated with fusion state. Nor did we identify any significant differences in the number of SNVs accumulated during the passage experiment between fused and unfused isolates (p-value > 0.05, Mann-Whitney U), which would have been expected had they differed in their recombination or DNA repair proficiency (Fig. S6). In conclusion, we find no evidence that fused isolates are recombination-deficient. Yet, we detected closely-related fused and non-fused isolates collected from the same household and patient, suggesting fusion/fission events can occur within patients, likely on time scales of a few days. How chromosome fusion is stabilized under laboratory conditions and whether fusion/fission events are more dynamic within patients remains to be examined.

Fused chromosomes identified in publicly available genome sequences

We further investigated the frequency of chromosome fusion more broadly across the order Vibrionales, in which a bipartite genome is considered a defining feature. We downloaded publicly available long-read sequences (n = 302, 251 of which passed our quality controls), 73.7% (185/251) of which were V. cholerae (Fig. S7A). Of the fully circular assemblies (n = 203), four assembled to one fused chromosome (Fig. S7B). One of these was a V. natriegens genome, whose chromosomes were lab-engineered to be fused¹². The remaining three were clinical V. cholerae isolates^13,14. Two of these, from the IND1.1 sublineage, contained two directly-oriented copies of HS1, as in our isolates. The third genome, which was related to IND2, also contained two copies of HS1 but in the opposite orientation, suggesting a local inversion of one HS1 copy. In a larger dataset of short-read sequenced V. cholerae genomes (n = 1223), we observe HS1 mainly to be duplicated in sub-lineage BD2 (Fig. S7C). These results and previous studies^7,8,9 suggest that fusion, while rarely observed, can occur in different Vibrio species and V. cholerae sublineages.

VSP-I could be another potential fusion site

We next asked if HS1 is unique, or if other potential fusion sites exist in the genome. For each circular, non-fused public genome (n = 199), we compared chromosome 2 against chromosome 1 for regions of homology. We identified two such regions longer than 10 Kbp in V. cholerae, one of which corresponded to HS1 and the other to VSP-I² (Fig. S8). Although VSP-I might serve as a potential fusion site, we currently lack evidence for this as none of the fused-chromosome genomes carries more than one VSP-I copy. Whether fusion at VSP-I takes place and is viable therefore remains unknown.

Defects in chromosome replication are unlikely to explain fusion

Several genes are known to be involved in V. cholerae chromosome replication, and these all appear to be present and intact in fused chromosomes. Both V. cholerae chromosomes encode two partitioning (parAB) genes involved in separating chromosomes to daughter cells¹⁵. We detect all four parAB genes in the fused chromosomes sequenced here (n = 58), with no mutations compared to par genes on non-fused chromosomes. Fused chromosomes also contain origins of replication from both chromosomes (ori1 and ori2) and crtS (Chr2 replication triggering Site). Previous studies showed that the two V. cholerae chromosomes fuse temporarily upon deletion of crtS^16,17. crtS is located closer to ori1 than ori2 is to ori1 in all fused chromosomes. This arrangement of loci has previously been associated with ori2 being active in a naturally fused V. cholerae chromosome¹⁸. Finally, chromosome fusion can be selected experimentally through depletion of the DNA adenine methylase Dam⁶. The dam gene was present in all genomes described here with no non-synonymous mutations detected; therefore, loss of dam function is unlikely to explain the observed fusions. Further experiments will be needed to understand how fused chromosomes replicate and whether fusion affects bacterial growth or other phenotypes under different conditions.

Conclusions

Together, our results show that chromosome fusion via homologous recombination is more prevalent and potentially more stable than previously thought. The clinical or phenotypic consequences of fusion appear to be minimal but remain to be comprehensively explored. This study reveals that chromosome fusion in clinical V. cholerae O1 occurs at an unprecedented scale and highlights the power of long-read sequencing to identify structural variation in bacterial genomes.

Methods

Ethical statement

The Ethical and Research Review committees of the ICDDR, B (approval number PR-11041) and the Institutional Review Boards of Massachusetts General Hospital, the University of Washington, and McGill University (A07-M43-21B (21-07-026)) approved the study. All adult subjects in the study provided written informed consent and the parents/guardians of children provided written informed consent.

V. cholerae isolate collection

Stool and rectal swabs were sampled from patients with cholera admitted to the International Centre for Diarrhoeal Disease Research, Bangladesh (ICDDR, B), Dhaka Hospital and from their household contacts, as described in prior studies¹⁹. Patients presenting to the hospital with severe acute diarrhoea and a stool culture positive for V. cholerae O1 were considered index patients. Persons who had shared the same cooking pot with the index patient for 3 or more days were considered household contacts and were enrolled within 6 h of the presentation of the index patient to the hospital. Rectal swabs were collected daily from household contacts during a 10-day period after presentation of the index case. Household contacts underwent daily clinical assessment of symptoms. Household contacts were defined as infected if any rectal swab culture was positive for V. cholerae O1. V. cholerae serotypes were determined using slide agglutination testing with polyvalent and specific antisera as in prior studies²⁰. We excluded patients below 2 years of age and above 60 years old or with major comorbid conditions^21,22. Rectal swabs and stool from the day of enrollment and follow-up time points were placed on ice immediately after collection and stored at −80 °C until DNA extraction.

V. cholerae isolates were cultured from stool samples from culture-positive participants. Stool samples were streaked directly onto tellurite taurocholate gelatin agar, a selective medium for V. cholerae, and incubated at 37 °C for 18–24 h. Ten colonies from each participant were selected and inoculated into Luria-Bertani (LB) broth (BD Difco) and grown at 37 °C overnight. Liquid cultures were used to make 30% glycerol stocks and stored at −80 °C. Frozen glycerol stocks were then shipped to the University of Washington.

V. cholerae DNA extraction

V. cholerae from frozen glycerol stocks were streaked onto LB agar plates and incubated at 30 °C for 24 h. One colony per plate was used for liquid culturing in LB broth and incubated at 30 °C for 18 h with agitation. DNA was extracted from saturated liquid cultures using the DNeasy Blood and Tissue 96-kit (Qiagen) according to the manufacturer’s protocol. Briefly, V. cholerae samples were pelleted and resuspended in Buffer ATL and treated with proteinase K at 56 °C for 30 min followed by RNase A (Qiagen) at room temperature for 5 min and Buffer AL for an additional incubation at 56 °C for 10 min. Samples were then treated with 100% EtOH and transferred to Qiagen DNA spin columns. DNA was washed using Buffer AW1 and Buffer AW2. Purified DNA was eluted in 100 µL 10 mM Tris-HCl and stored at −80 °C until ready for sequencing.

DNA sequencing

The extracted DNA was prepared for sequencing using the Nanopore Rapid Barcoding 96 v14 kit (Oxford Nanopore Technologies) with approximately 200 ng of purified DNA per sample to generate sequencing libraries. The libraries were sequenced on an R10.4.1 M PromethION flow cell. Raw sequencing data were basecalled and demultiplexed to FASTQ files using the Dorado basecaller integrated into MinKNOW v23.07.5 (Oxford Nanopore Technologies) using the model dna_r10.4.1_e8.2_400bps_sup@v4.2.0 with read splitting, adapter trimming, and barcode trimming enabled in the basecaller. Read data generated for this study are available on NCBI SRA (BioProject PRJNA1121190).

Sequence analysis of isolates collected for this study

Assembly, quality control and annotation

Reads of each sample were assembled using Flye^23,24 in --nano-hq and deterministic mode. We used dnaapler²⁵ to reorient the contigs such that all chromosomes start with the initiating codon of dnaA if present (chromosomes 1 and fused chromosomes) or with repA homologous sequences if dnaA was not present (chromosomes 2). We used ReCycled²⁶ to verify the circularity of all contigs. We used Kraken2²⁷ to identify potentially contaminated assemblies and screened these using SprayNPray²⁸, which assigns species membership per contig. All contigs that were not identified as V. cholerae were removed from the assemblies. These included 50 short contigs with repetitive sequences and no species membership assigned, likely arising from sequencing artifacts.

As a control, we sequenced and assembled the reference strain V. cholerae N16961, which assembled into two chromosomes, as expected. To further reduce the chances of misassembly, we manually assembled sequences of 13 isolates using Trycycler²⁹. This subset included one putative fused-chromosome isolate per patient for which at least one fused chromosome was assembled (n = 10) and three non-fused-chromosome isolates as references. We followed the Trycycler workflow. First, the reads of each sample were filtered using Filtlong³⁰ (minimum length 1000 bp and excluding the worst 5% of read bases) and subsampled to 12 files. For each isolate, three subsets each were assembled with Flye²³, Miniasm³¹ and Minipolish³², Raven^32,33 and Canu^32,33,34. All assembled contigs were clustered per sample, aligned, and manually curated. We considered contig clusters as valid if they (i) occurred in at least two assemblies and were of similar size, (ii) had no kilobase of sequence below the identity threshold of 25% when compared to the rest of the contig, and (iii) could be circularly assembled. For each valid contig cluster, a consensus was built; together, these consensus sequences form the assembly for each sample. The assemblies resulting from the Trycycler workflow confirmed the chromosome fusion state observed from the Flye assemblies. We used the Flye assemblies for all isolates in all subsequent analyses. We used Nanostat³⁵ to assess the quality of our assemblies. All assemblies were annotated using Bakta^36,37.

Phylogeny

We called variants of each of our isolates against the V. cholerae reference genome N16961 (GCF_001250235.2) using Medaka³⁸ with a minimal quality score threshold of 40 and excluded indels using vcftools³⁹. We identified variable sites from the resulting consensus sequences using snp-sites⁴⁰, constructed a maximum-likelihood phylogenetic tree using IQ-TREE (v2.3.6)⁴¹, and assessed branch support using the ultrafast bootstrap approximation⁴².

Sub-lineage assignment

We aimed to assign our isolates to one of the sub-lineages within the clonal lineage causing the current 7th pandemic (7PET). We selected one previously assigned and publicly available genome per sub-lineage⁵ as a reference and called variants of each of our sequences to each reference using the Medaka variant caller³⁸. Each isolate was assigned to the sub-lineage to whose reference the smallest number of high-quality variants was identified (using a minimal quality score threshold of 40).
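The assignment rule above amounts to picking the reference with the fewest high-quality variants. A minimal sketch follows, assuming the per-reference variant counts have already been tallied; the function name, the example counts, and the handling of ties (returned as `None`) are assumptions, since the text does not describe how ties were resolved.

```python
def assign_sublineage(variant_counts):
    """Return the sub-lineage whose reference differs by the fewest variants.

    variant_counts: dict mapping sub-lineage name -> number of high-quality
    variants (quality >= 40) called against that sub-lineage's reference.
    Ties return None instead of being broken arbitrarily (an assumption).
    """
    best = min(variant_counts.values())
    winners = [name for name, n in variant_counts.items() if n == best]
    return winners[0] if len(winners) == 1 else None


# Hypothetical counts for a single isolate, for illustration only.
example = {"BD2": 14, "BD1.2": 97, "IND1.3": 152}
print(assign_sublineage(example))
```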

Identification of HS1

To identify potential fusion sites, we used blastn⁴³, identifying sequences that are shared between chromosome 1 and chromosome 2. We detected a 12 Kbp-long sequence that was identical between chromosome 1 and chromosome 2 in most strains (HS1) (>99.6% sequence identity in assemblies with two copies of HS1). We screened for the occurrence of HS1 in all our genomes using blastn⁴³.
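As an illustration of this kind of screen, the sketch below filters blastn tabular output (`-outfmt 6`, with its standard twelve columns) for long, near-identical hits between the two chromosomes. The 10 kbp and 99% cut-offs, the function name, and the example rows are assumptions for illustration; the study's exact parameters are not given in the text.

```python
def long_shared_hits(blast_tsv, min_length=10_000, min_identity=99.0):
    """Keep long, high-identity hits from blastn tabular (-outfmt 6) text.

    Standard columns: qseqid sseqid pident length mismatch gapopen
    qstart qend sstart send evalue bitscore.
    A hit with sstart > send lies on the subject's minus strand.
    """
    hits = []
    for line in blast_tsv.strip().splitlines():
        fields = line.split("\t")
        pident, length = float(fields[2]), int(fields[3])
        sstart, send = int(fields[8]), int(fields[9])
        if length >= min_length and pident >= min_identity:
            hits.append({
                "query": fields[0],
                "subject": fields[1],
                "identity": pident,
                "length": length,
                "orientation": "direct" if sstart < send else "inverted",
            })
    return hits


# Hypothetical rows: chromosome 2 queried against chromosome 1.
example_tsv = (
    "chr2\tchr1\t99.70\t12000\t30\t6\t500000\t511999\t1900000\t1911999\t0.0\t21000\n"
    "chr2\tchr1\t98.10\t1250\t20\t3\t10\t1259\t2200500\t2199251\t0.0\t2100\n"
)
hits = long_shared_hits(example_tsv)
print(hits)
```

Reporting orientation is useful here because recombination between directly oriented copies is the configuration the text associates with fusion.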

To further substantiate the observed chromosome fusion, we next aimed to identify reads that span HS1 and are either flanked by chromosome 1 or chromosome 2 sequences on both sides (non-fused case) or are flanked by sequences of a different chromosome on each side (fused case). To do so, we mapped the reads of each sample to a fused-chromosome assembly (135Vc06) and a non-fused-chromosome assembly (135Vc05). From the BAM file, we extracted reads that span HS1 and an additional 500 bp on each side using samtools. We excluded secondary mappings and mappings with large insertions (>1000 bp).
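The read-level logic can be sketched as two small checks: whether an alignment covers HS1 plus 500 bp on each side, and whether the two flanks come from the same chromosome. This is a simplified illustration with hypothetical function names and coordinates; it assumes the flank origins have already been determined from the mappings and does not reproduce the samtools-based extraction itself.

```python
def spans_hs1(aln_start, aln_end, hs1_start, hs1_end, margin=500):
    """True if an alignment covers HS1 plus `margin` bp on both sides."""
    return aln_start <= hs1_start - margin and aln_end >= hs1_end + margin


def junction_type(left_flank, right_flank):
    """Classify a spanning read from the chromosome of origin of its flanks.

    Same chromosome on both sides -> non-fused copy of HS1.
    Different chromosomes         -> read crosses a fusion junction.
    """
    if left_flank == right_flank:
        return f"non-fused ({left_flank})"
    return "fused"


# Hypothetical coordinates: HS1 at 2,000-14,000 on the mapping reference.
print(spans_hs1(1_000, 20_000, 2_000, 14_000))   # covers both margins
print(spans_hs1(1_800, 20_000, 2_000, 14_000))   # left margin too short
print(junction_type("chr1", "chr2"))
```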

We excluded reads in which a sequencing adapter sequence was identified, as these could potentially be artifactual hybrid molecules. To do so, we converted the reads to FASTA files and screened for adapter sequences using blastn. All reads in which an adapter sequence with >95% coverage and identity was identified were excluded from further analysis.

Screening of genomic islands

We queried our genomes for the presence and location of the virulence-associated genomic islands VPI-1, VPI-2, VSP-I, and VSP-II. We screened our genomes for the known flanking regions⁴⁴ of these islands using blastn⁴³ and extracted the sequence between the flanking sides using bedtools getfasta^44,45.

Comparison of sequences involved in chromosome replication and recombination

par genes

V. cholerae typically encodes two sets of partitioning (par) genes (parAB on chromosome 1 and parAB2 on chromosome 2), which are involved in separating the chromosome molecules to daughter cells. As chromosome 2 can be lost upon depletion of parAB2, we hypothesised that parAB2 might be lost or mutated in strains with a fused chromosome. To test this, we downloaded the amino acid sequence of reference V. cholerae par proteins (QEO41389.1, QEO42567.1, AAF97006.1, and AAF97005.1 for ParA, ParB, ParA2, and ParB2, respectively) and screened their occurrence in our genomes using tblastn⁴³.

dam and crtS

To compare the dam gene sequence and the presence of the origins of replication on the fused chromosomes, we examined the Bakta annotation. We screened for crtS (Chr2 replication triggering Site)¹⁸ in our genomes using blastn⁴³.

mut and rec genes

To infer the phylogeny, we had called variants of each of our isolates to the V. cholerae reference genome N16961 (GCF_001250235.2) using Medaka³⁸ with a minimal quality score threshold of 40 (see above) and annotated these using snpEff⁴⁶. We then evaluated in R whether any SNVs were identified within the N16961 mut or rec genes. V. cholerae N16961 was annotated using Bakta.

Comparison to previously reported chromosome fusion

We extracted the sequences described by Xie et al.⁹ as directly involved in homologous recombination (VAA049_1594, VAA049_2432 from NCSV1 and VAB027_307, VAB027_1228 from NCSV2), as well as the IS3 elements (VAA049_2433 from NCSV1 and VAB027_276, VAB027_1254 from NCSV2), as amino acid sequences from the published fused-chromosome genomes (NCSV1 (NZ_CP010811.1) and NCSV2 (NZ_CP010812.1)). We compared these to HS1 using tblastn and found no matches with a sequence identity >60% or a query coverage >37%, suggesting that these are not the same sequences.

Genome-wide association study

Aiming to explain the stability of the fused chromosome under laboratory conditions, we used pyseer⁴⁷ (v1.3.12) to identify sequence variants associated with fused-chromosome genomes. We used unitigs (non-redundant sequence elements of variable lengths) as inputs, which we constructed via unitig-counter⁴⁸ (v1.1.0). We used random effects to correct for population structure (‘--lmm’ mode) using the phylogenetic tree as input (compiled from high-quality core SNVs, see above). The number of unique patterns (n = 1036) was used to determine a significance threshold (p-value < 4.83E−05). Significant unitigs were mapped against the Bakta genome annotations from all genomes (n = 467).
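The reported threshold is consistent with a Bonferroni correction over the number of unique unitig patterns. The short check below assumes a family-wise alpha of 0.05, which is not stated explicitly in the text but reproduces the reported value.

```python
alpha = 0.05            # assumed family-wise error rate
unique_patterns = 1036  # number of unique patterns reported in the text

threshold = alpha / unique_patterns
print(f"{threshold:.2E}")
```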

V. cholerae plug preparation and pulsed-field gel electrophoresis (PFGE)

V. cholerae isolates from frozen glycerol stocks were cultured as described above. V. cholerae plugs were created as previously described (CDC Pulsenet protocol)⁴⁹. Briefly, V. cholerae cells were inoculated into a cell suspension buffer (100 mM Tris, 100 mM EDTA, pH 8.0) to an optical density (OD) of 2.0. Plugs were cast using 0.5 mg/mL Proteinase K (ThermoFisher) and 1% SeaKem Gold agarose (Lonza) prepared in Tris-EDTA (TE) buffer (10 mM Tris, 1 mM EDTA, pH 8.0). Plugs were lysed in cell lysis buffer (50 mM Tris, 50 mM EDTA, pH 8.0 with 1% Sarcosine (Sigma) and 0.1 mg/mL Proteinase K), incubated at 55 °C for 30 min with shaking at 200 rpm. Plugs were then washed twice with sterile, distilled water and four times with TE buffer. Plugs were stored in TE buffer at 4 °C until PFGE was performed.

Gels, 21 cm long, were cast for PFGE using 0.8% SeaKem Agarose in 1X Tris acetate EDTA (TAE) buffer. PFGE was performed on a CHEF-DR III system (Bio-Rad) with a runtime of 60 h, at 2 V/cm with a 30 min switch time (initial and final) at a 106° angle, with 1X TAE running buffer chilled to 14 °C. The gel was then stained using SYBR Gold (Invitrogen) and visualised using an Amersham Typhoon 5 (Cytiva). Schizosaccharomyces pombe⁵⁰ DNA was used as a DNA marker.

Phenotypic comparison of fused and non-fused isolates

Isolate selection

To assess phenotypic effects of chromosome fusion, we chose two pairs of fused and non-fused isolates, each collected from the same participants (114 and 117) and with a minimal number of SNVs between isolate pairs. SNVs were called using clair3 using the non-fused strains 114Vc01 and 117Vc10 as reference and 114Vc03 and 117Vc01 as query, and a minimal quality score of 20, minimal read depth of 10 and minimal ALT allele frequency of 0.8.
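The three thresholds can be expressed as a single predicate applied to each variant call. The sketch below is illustrative only: the record layout and example values are hypothetical, and treating the thresholds as inclusive (>=) is an assumption.

```python
def passes_filters(qual, depth, alt_af,
                   min_qual=20, min_depth=10, min_alt_af=0.8):
    """True if a variant call meets all three thresholds from the text.

    Thresholds are treated as inclusive, which is an assumption.
    """
    return qual >= min_qual and depth >= min_depth and alt_af >= min_alt_af


# Hypothetical calls for one isolate pair, for illustration only.
calls = [
    {"pos": 10512, "qual": 31.2, "depth": 48, "alt_af": 0.97},
    {"pos": 88210, "qual": 12.0, "depth": 52, "alt_af": 0.95},   # low quality
    {"pos": 240077, "qual": 25.4, "depth": 45, "alt_af": 0.41},  # low ALT frequency
]
high_quality = [c for c in calls
                if passes_filters(c["qual"], c["depth"], c["alt_af"])]
print([c["pos"] for c in high_quality])
```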

qPCR measurements of ctxA and tcpA

V. cholerae virulence-inducing conditions

Liquid cultures of V. cholerae were obtained in LB broth, 37 °C with agitation. Saturated cultures were then diluted to OD = 0.01 into 10 mL AKI media containing 1.5% peptone (HIMEDIA), 0.5% sodium chloride (Fisher Bioreagents), 0.4% yeast extract (Fisher), and 0.3% sodium bicarbonate (Fisher Chemical). Diluted cultures were incubated stationary at 37 °C for 4 h and then switched into a 250 mL flask with shaking at 250 rpm for 3 h. After the incubations, the cultures were centrifuged at 4,000 x g for 10 min at 4 °C. Pellets were resuspended in 1 mL TRIzol (Invitrogen) and stored at 4 °C for RNA extraction.

RNA extraction and Real-time PCR (RT-PCR)

RNA was extracted from samples in TRIzol using chloroform (Fisher Chemical), ethanol (Fisher Chemical), and the RNeasy kit (QIAGEN) according to the manufacturer’s protocol. Extracted RNA was treated with TURBO DNase (Invitrogen) to remove DNA. cDNA synthesis was performed on 1 µg of RNA using High Capacity cDNA reverse transcriptase (Applied Biosystems) and the resulting cDNA was diluted 1:3. RT-PCR was performed to measure ctxA and tcpA expression. The primers used were (CtxA-F) 5′-TTGGAGCATTCCCACAACCC-3′, (CtxA-R) 5′-GCTCCAGCAGCAGATGGTTA-3′ – amplicon 109 bp⁵¹, (TcpA-F) 5′-CGCTGAGACCACACCCATA-3′, (TcpA-R) 5′-GAAGAAGTTTGTAAAAGAAGAACACG-3′ – amplicon 103 bp⁵², (groEL-F) 5′-ATGATGTTGCCCACGCTAGA-3′, and (groEL-R) 5′-GGTTATCGCTGCGGTAGAAG-3′ – amplicon 117 bp⁵². GroEL was used as a reference housekeeping gene. RT-PCR was performed using a 10 µL reaction with SYBR green (Invitrogen) with 0.3 µM of specific primer sets and 2 µL of cDNA. PCR amplification was conducted on the QuantStudio3 (ThermoFisher Scientific) with the following conditions: 95 °C for 5 min, 40 cycles of 95 °C for 5 s, 58 °C for 10 s, and 72 °C for 15 s, and a final melting temperature analysis of PCR products. Each RT-PCR run included a no-template and water negative control. Each sample was tested in duplicate. Relative expression was calculated according to the Livak method⁵³.
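The Livak method computes relative expression as 2^(−ΔΔCt), where each ΔCt is the target Ct minus the reference-gene Ct. A minimal sketch follows, assuming groEL as the reference gene (as in the text) and averaging technical duplicates; the function names and all Ct values are hypothetical.

```python
from statistics import mean


def relative_expression(ct_target_sample, ct_ref_sample,
                        ct_target_control, ct_ref_control):
    """Fold change by the 2^(-ddCt) (Livak) method.

    Each argument is a list of technical-replicate Ct values.
    'sample' is the condition of interest (e.g. a fused isolate) and
    'control' is the comparator (e.g. its non-fused counterpart).
    """
    d_sample = mean(ct_target_sample) - mean(ct_ref_sample)
    d_control = mean(ct_target_control) - mean(ct_ref_control)
    return 2 ** -(d_sample - d_control)


# Hypothetical duplicate Ct values: ctxA as target, groEL as reference.
fold = relative_expression(
    ct_target_sample=[22.1, 21.9], ct_ref_sample=[18.0, 18.0],
    ct_target_control=[24.0, 24.0], ct_ref_control=[18.0, 18.0],
)
print(fold)
```

A fold change near 1 would indicate no difference between the two isolates under this calculation.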

Biofilm crystal violet assay

Liquid cultures of V. cholerae were generated from single colonies in LB broth at 37 °C with agitation. Saturated cultures were normalized to OD = 0.01 in 200 µL in 96-well flat-bottom clear polystyrene microplates, sealed with a gas-permeable sealing film (BrandTech). Plates were incubated at 30 °C for 24 h. Crystal violet staining was performed as previously described^11,54. Briefly, bacterial cultures were removed, and plates were washed with distilled water by submerging the plates three times. Adherent biofilm was stained using 0.1% crystal violet (Fisher Chemical) at room temperature for 15 min. Excess crystal violet was removed, and the plate was air-dried for 3 h. The crystal violet stain was then dissolved in 95% ethanol for 15 min, and the absorbance was measured at 550 nm using a Synergy HT plate reader (Biotek) with Gen5 software (Agilent BioTek).

Growth curves

Growth curves with 96-well plate

Liquid cultures of V. cholerae were obtained in LB broth at 37 °C with agitation. Cultures were then diluted to OD = 0.01 in 200 µl into 96-well flat-bottom clear polystyrene microplates, sealed with a gas-permeable sealing film (BrandTech). Plates were incubated at 37 °C for 20 h in the Stratus plate reader (Cerillo) with continuous OD600 measurements every 20 min.

Growth curves in roller drum

Liquid cultures of V. cholerae were obtained in LB broth at 37 °C with agitation. Cultures were then diluted to OD = 0.01 in 10 mL LB broth and incubated in a roller drum at 37 °C for 24 h. OD600 was measured using 1 mL of culture in a cuvette (Fisherbrand) in a Cell Density Meter (Transilluminator) at 0, 2, 4, 6, 8, 10, and 24 h.

Continued passaging of V. cholerae

Experimental setup

V. cholerae was continuously passaged as previously described¹⁸. Briefly, V. cholerae isolates were plated on LB agar and incubated at 30 °C for 72 h. On Day 1, one CFU was inoculated into 10 mL LB broth in a 15 mL snap-cap polystyrene tube and incubated at 37 °C for 24 h in a roller drum. The following day, saturated cultures were diluted 1:10000 into fresh LB broth, and this procedure was repeated daily. On Day 5, liquid cultures were streaked on LB agar and incubated at 30 °C for 72 h. This process was repeated for 20 days. Glycerol stocks using 20% glycerol (Fisher Bioreagents) in LB broth were periodically saved at −80 °C, and liquid cultures were streaked on agar to check for contamination twice a week.
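The number of generations covered by such a passaging scheme can be approximated from the dilution factor. The sketch below assumes that each 1:10000 dilution is followed by regrowth to the same saturation density, so that one passage corresponds to log2(10000) doublings; the number of passages is passed in as a parameter because the exact count of dilution steps is an assumption here.

```python
import math


def generations_per_passage(dilution_factor):
    """Doublings needed to regrow to the pre-dilution density."""
    return math.log2(dilution_factor)


def total_generations(dilution_factor, n_passages):
    """Approximate cumulative generations over n_passages daily dilutions."""
    return n_passages * generations_per_passage(dilution_factor)


per_day = generations_per_passage(10_000)
print(f"Generations per 1:10000 passage: {per_day:.1f}")
print(f"Generations over 20 passages: {total_generations(10_000, 20):.0f}")
```

Under these assumptions, about 13.3 generations elapse per day, so 16 or more daily dilutions exceed 200 generations.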

Sequence analysis

Single colonies were obtained from the glycerol stocks collected on days 0, 3, 8, 11, 16, and 20 of the passaging experiment. V. cholerae isolates were cultured in LB broth and DNA was extracted as described above. DNA was prepared for long-read whole-genome sequencing and sequenced on an R10.4.1 M PromethION as described above. Genomes with an estimated read coverage of less than 10X were excluded from further analysis (Supplementary Data 2).

Reads of each sample were assembled using Flye²³ in --nano-hq and deterministic mode. We evaluated how many chromosomes were assembled and counted reads that mapped to HS1 in its fused-chromosome environment (chromosome 1-HS1-chromosome 2 and chromosome 2-HS1-chromosome 1) and in its non-fused environment (chromosome 1-HS1-chromosome 1 and chromosome 2-HS1-chromosome 2) (see ‘Identification of HS1’).

We called SNV in each isolate relative to its source isolate (114Vc01, 114Vc03, 117Vc01, and 117Vc10) using clair3⁵⁵, requiring a minimal read depth of 10, a minimal allele frequency of 0.8, and a minimal quality of 40. We used snpEff⁴⁶ to annotate the identified SNV.
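The three SNV thresholds can be expressed as a simple filter. The following Python sketch is illustrative only: the Variant record and its field names are hypothetical and do not reflect the output format of any particular variant caller, and the example variants are invented.

```python
from dataclasses import dataclass


@dataclass
class Variant:
    """Hypothetical minimal record for one called SNV."""
    position: int
    depth: int
    allele_frequency: float
    quality: float


def passes_filters(variant, min_depth=10, min_af=0.8, min_quality=40):
    """Apply the depth, allele-frequency, and quality thresholds."""
    return (
        variant.depth >= min_depth
        and variant.allele_frequency >= min_af
        and variant.quality >= min_quality
    )


# Invented example calls (not data from the study).
calls = [
    Variant(position=1200, depth=35, allele_frequency=0.97, quality=52.0),
    Variant(position=5400, depth=8, allele_frequency=0.95, quality=60.0),
    Variant(position=9100, depth=40, allele_frequency=0.55, quality=45.0),
]
retained = [v for v in calls if passes_filters(v)]
print([v.position for v in retained])
```

Only the first invented call is retained; the second fails the depth threshold and the third fails the allele-frequency threshold.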

Sequence analysis of publicly available sequences

Selection of publicly available sequences

To contextualise the sequences acquired for this study, we used two different datasets, which we compiled from publicly available sequences: (a) long-read sequenced genomes of the order Vibrionales and (b) short-read sequenced genomes of the species V. cholerae.

Dataset (a) consisted of all Vibrionales genomic sequences that were acquired using Oxford Nanopore Technology (ONT) or PacBio and available from NCBI SRA on the 19th of December 2023 (n = 302).

We compiled dataset (b) to reflect the diversity within clinical V. cholerae and to focus on our study period (2015-2018) and geographic region of isolation (Bangladesh) for comparison. To do so, we downloaded the following sets of short reads: (b.i) Clinical V. cholerae overview: from NCBI Pathogens, we selected V. cholerae isolates that were collected between the 1st of January 2000 and the 18th of January 2024 and included a maximum of three samples per year and country (n = 790, 457 of which could be successfully downloaded using NCBI Batch Entrez).

(b.ii) Diversity within V. cholerae O1: We downloaded a previously compiled selection of V. cholerae O1⁵⁶ (all O1 isolates from that study).

(b.iii) 7PET diversity from South and East Asia: From a previously compiled set of 7PET genomes⁵⁷, we included strains that were isolated between 2000 and 2018 in Eastern Asia, Southeast Asia, and South Asia. Moreover, we included the sequences acquired in that study⁵⁷ and in another study conducted in Bangladesh around our study period⁵⁸.
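The cap of three samples per year and country used for subset (b.i) can be sketched as follows. How the three samples were chosen within each group is not stated, so keeping the first three encountered is an assumption of this example, as are the record fields and accession-like identifiers.

```python
from collections import defaultdict


def cap_per_year_and_country(records, max_per_group=3):
    """Keep at most max_per_group records per (year, country) pair.

    Records are kept in input order; which samples are retained within a
    group is an assumption of this sketch.
    """
    counts = defaultdict(int)
    kept = []
    for record in records:
        key = (record["year"], record["country"])
        if counts[key] < max_per_group:
            counts[key] += 1
            kept.append(record)
    return kept


# Invented metadata (not records from the study).
metadata = [
    {"id": "s1", "year": 2016, "country": "Bangladesh"},
    {"id": "s2", "year": 2016, "country": "Bangladesh"},
    {"id": "s3", "year": 2016, "country": "Bangladesh"},
    {"id": "s4", "year": 2016, "country": "Bangladesh"},
    {"id": "s5", "year": 2017, "country": "Bangladesh"},
    {"id": "s6", "year": 2016, "country": "India"},
]
selected = cap_per_year_and_country(metadata)
print([r["id"] for r in selected])
```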

Assembly, quality control and annotation

Long-read sequences were assembled using Flye^23,24 (--nano-raw mode) and the species was identified using GTDB-Tk⁵⁹. We controlled the quality of the assemblies using Kraken2²⁷ / Bracken⁶⁰, as well as NanoStat³⁵. We excluded sequences which were not identified as part of the order Vibrionales, where >10% of the contigs were not assigned to the most abundant genus (suggesting contamination), whose total assembly length was smaller than 3.5 or larger than 7 Mbp, or which had an average read depth below 5X, resulting in 251 genomes of 24 different species.

Short reads were trimmed using Trimmomatic⁶¹ and were assembled using SPAdes⁶² via Unicycler⁶³. We assessed the quality of the resulting assemblies using QUAST⁶⁴ and used MetaPhlAn2³⁶ to screen for possible contamination. We excluded sequences that had a MetaPhlAn purity below 95%, an average read depth below 20X, an average read quality below 75, a resulting assembly shorter than 3.65 or longer than 4.5 Mbp, or an N50 value below 2000. This resulted in a set of high-quality V. cholerae short-read genomes (n = 1,223). The accession numbers of all publicly available sequences analysed in this study can be found in Supplementary Data 3.
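The short-read exclusion criteria amount to a conjunction of thresholds, sketched below in Python. The function and parameter names are hypothetical, the N50 threshold is assumed to be in base pairs, and the example values are invented; in practice the inputs would be collected from the respective tool reports.

```python
def passes_short_read_qc(
    purity_pct,
    mean_depth,
    mean_read_quality,
    assembly_length_bp,
    n50_bp,
):
    """Return True if a short-read genome meets all inclusion thresholds."""
    return (
        purity_pct >= 95
        and mean_depth >= 20
        and mean_read_quality >= 75
        and 3_650_000 <= assembly_length_bp <= 4_500_000
        and n50_bp >= 2000
    )


# Invented example genomes (not data from the study).
good = dict(purity_pct=99.2, mean_depth=85, mean_read_quality=90,
            assembly_length_bp=4_050_000, n50_bp=250_000)
too_long = dict(good, assembly_length_bp=5_200_000)

print(passes_short_read_qc(**good))
print(passes_short_read_qc(**too_long))
```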

Phylogeny

For the Vibrionales long reads, we built an SNV-based phylogeny, as described above. Briefly, we called variants of each sample against the V. cholerae reference genome N16961 (GCF_001250235.2) using Medaka³⁸ (minimal quality threshold 40), identified varying sites of the resulting consensus sequence using snp-sites⁴⁰, and built a phylogenetic tree using RAxML (v8.2.12, model GTRCAT)⁶⁵.

We built a core genome for the V. cholerae short reads using Panaroo⁶⁶ and aligned it using MAFFT⁶⁷. Variant sites were identified using snp-sites⁴⁰, and a phylogenetic tree was constructed using FastTree⁶⁸.

(Sub)-Lineage assignments

We used ‘Is it 7PET’⁶⁹ to identify which publicly available V. cholerae sequences belonged to the 7PET lineage. To identify the sub-lineages within 7PET, we counted the SNV between each genome and one reference per sub-lineage, and assigned the sub-lineage with the fewest SNV, similarly to the approach described above. SNV were called using Freebayes⁷⁰ via snippy⁷¹ (minimum read depth = 10, minimum fraction = 0.7, and minimum quality = 100).
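The nearest-reference assignment described above can be sketched in a few lines of Python. The SNV counts below are invented, and the alphabetical tie-break is an assumption of this example, since the handling of ties is not described.

```python
def assign_sublineage(snv_counts):
    """Return the sub-lineage whose reference has the fewest SNV.

    snv_counts maps a sub-lineage name to the number of SNV between the
    query genome and that sub-lineage's reference. Ties are broken
    alphabetically (an assumption of this sketch).
    """
    if not snv_counts:
        raise ValueError("no sub-lineage references were compared")
    return min(sorted(snv_counts), key=lambda name: snv_counts[name])


# Invented SNV counts for one query genome (not data from the study).
example_counts = {"BD1.2": 41, "BD2": 7}
print(assign_sublineage(example_counts))
```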

Copy number estimation of potential fusion sites

We used blastn⁴³ to identify the presence and location of the potential fusion sites HS1 and VSP-I in the publicly available V. cholerae genomes (>95% identity and >90% coverage). We used bwa⁷² to align the reads of each isolate to its respective assembly. From the resulting BAM files, we retrieved the average read depth of the genome and of the specific potential fusion sites using samtools depth⁷³.
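One way to turn these depth values into a copy-number estimate is to divide the mean depth across the site by the mean depth across the genome. The Python sketch below assumes that ratio as the estimator and parses the three-column, tab-separated output of samtools depth for a single BAM file (reference name, 1-based position, depth). The contig name, coordinates, and depth values are invented. Note that samtools depth omits zero-coverage positions unless the -a option is given, which would inflate means computed this way.

```python
def parse_depth(lines):
    """Parse 'contig<TAB>position<TAB>depth' lines into tuples."""
    rows = []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        contig, position, depth = line.split("\t")[:3]
        rows.append((contig, int(position), int(depth)))
    return rows


def estimate_copy_number(rows, contig, start, end):
    """Mean depth within [start, end] on contig divided by genome mean depth."""
    genome_depths = [depth for _, _, depth in rows]
    site_depths = [
        depth for name, position, depth in rows
        if name == contig and start <= position <= end
    ]
    if not genome_depths or not site_depths:
        raise ValueError("no depth values for genome or site")
    genome_mean = sum(genome_depths) / len(genome_depths)
    site_mean = sum(site_depths) / len(site_depths)
    return site_mean / genome_mean


# Invented depth table: positions 5-6 stand in for a potential fusion site.
example_lines = [
    "contig_1\t1\t20",
    "contig_1\t2\t20",
    "contig_1\t3\t20",
    "contig_1\t4\t20",
    "contig_1\t5\t40",
    "contig_1\t6\t40",
]
depth_rows = parse_depth(example_lines)
print(round(estimate_copy_number(depth_rows, "contig_1", 5, 6), 2))
```

With real data, the site coordinates would come from the blastn hits on each assembly.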

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

Sequence data generated for this study (raw reads and assemblies) have been deposited at NCBI under the accession code PRJNA1121190. All sequence analysis data generated, which are required to reproduce the figures and analysis presented in this study, are provided in the Source Data file and via the Open Science Framework (https://osf.io/xyfvg/). Source data are provided with this paper.

Code availability

Software code used to analyse the data presented in this study is available on GitHub (https://github.com/acuenod111/Single_chromosome_Vc; https://doi.org/10.5281/zenodo.15298067) and the files to reproduce the analysis and figures can be accessed via the Open Science Framework (https://osf.io/xyfvg/).

References

1. Kanungo, S. et al. Cholera. Lancet 399, 1429–1440 (2022).
2. Dziejman, M. et al. Comparative genomic analysis of Vibrio cholerae: genes that correlate with cholera endemic and pandemic disease. Proc. Natl Acad. Sci. USA. 99, 1556–1561 (2002).
3. Karaolis, D. K. et al. A Vibrio cholerae pathogenicity island associated with epidemic and pandemic strains. Proc. Natl Acad. Sci. USA. 95, 3134–3139 (1998).
4. Jermyn, W. S. & Boyd, E. F. Characterization of a novel Vibrio pathogenicity island (VPI-2) encoding neuraminidase (nanH) among toxigenic Vibrio cholerae isolates. Microbiology 148, 3681–3693 (2002).
5. Monir, M. M. et al. Genomic attributes of Vibrio cholerae O1 responsible for 2022 massive cholera outbreak in Bangladesh. Nat. Commun. 14, 1154 (2023).
6. Val, M. E. et al. Fuse or die: How to survive the loss of Dam in Vibrio cholerae. Mol. Microbiol. 91, 665–678 (2014).
7. Johnson, S. L. et al. Complete Genome Assemblies for Two Single-Chromosome Vibrio cholerae Isolates, Strains 1154-74 (Serogroup O49) and 10432-62 (Serogroup O27). Genome Announc. 3, e00462-15 (2015).
8. Yamamoto, S. et al. Single Circular Chromosome Identified from the Genome Sequence of the Vibrio cholerae O1 bv. El Tor Ogawa Strain V060002. Genome Announc. 6, e00564-18 (2018).
9. Xie, G. et al. Exception to the rule: genomic characterization of naturally occurring unusual Vibrio cholerae strains with a single Chromosome. Int. J. Genom. 2017, 8724304 (2017).
10. Domman, D. et al. Defining endemic cholera at three levels of spatiotemporal resolution within Bangladesh. Nat. Genet. 50, 951–955 (2018).
11. Levade, I. et al. Vibrio cholerae genomic diversity within and between patients. Microb. Genom. 3, e000142 (2017).
12. Ramming, L. et al. Rationally designed chromosome fusion does not prevent rapid growth of Vibrio natriegens. Commun. Biol. 7, 1–10 (2024).
13. Fuesslin, V. et al. Prediction of antibiotic susceptibility profiles of Vibrio cholerae isolates from whole genome Illumina and nanopore sequencing data: CholerAegon. Front. Microbiol. 13, 909692 (2022).
14. Antibiotic resistance in Vibrio cholerae El Tor strains isolated during cholera complications in Siberia and the Far East of Russia. Infect. Genet. Evol. 78, 104096 (2020).
15. Yamaichi, Y., Fogel, M. A. & Waldor, M. K. par genes and the pathology of chromosome loss in Vibrio cholerae. Proc. Natl Acad. Sci. USA. 104, 630–635 (2007).
16. Val, M.-E. et al. A checkpoint control orchestrates the replication of the two chromosomes of Vibrio cholerae. Sci. Adv. 2, e1501914 (2016).
17. Niault, T. et al. Dynamic transitions of initiator binding coordinate the replication of the two chromosomes in Vibrio cholerae. Nat. Commun. 16, 1–16 (2025).
18. Bruhn, M. et al. Functionality of two origins of replication in Vibrio cholerae strains with a single Chromosome. Front. Microbiol. 9, 427674 (2018).
19. Midani, F. S. et al. Human Gut microbiota predicts susceptibility to Vibrio cholerae infection. J. Infect. Dis. 218, 645–653 (2018).
20. Qadri, F. et al. Comparison of immune responses in patients infected with Vibrio cholerae O139 and O1. Infect. Immun. 65, 3571–3576 (1997).
21. Harris, J. B. et al. Susceptibility to Vibrio cholerae infection in a cohort of household contacts of patients with Cholera in Bangladesh. PLoS Negl. Trop. Dis. 2, e221 (2008).
22. Weil, A. A. et al. Clinical outcomes in household contacts of patients with cholera in Bangladesh. Clin. Infect. Dis. 49, 1473–1479 (2009).
23. Kolmogorov, M., Yuan, J., Lin, Y. & Pevzner, P. A. Assembly of long, error-prone reads using repeat graphs. Nat. Biotechnol. 37, 540–546 (2019).
24. Lypaczewski, P. et al. Vibrio cholerae O1 experiences mild bottlenecks through the gastrointestinal tract in some but not all cholera patients. Microbiol. Spectr. 12, e00785–24 (2024).
25. GitHub - gbouras13/dnaapler: Reorients assembled microbial sequences. GitHub https://github.com/gbouras13/dnaapler.
26. Somerville, V., Schmid, M., Dreier, M. & Engel, P. ReCycled: A Tool to Reset the Start of Circular Bacterial Chromosomes. bioRxiv 2025.04.07.647662 https://doi.org/10.1101/2025.04.07.647662 (2025).
27. Wood, D. E., Lu, J. & Langmead, B. Improved metagenomic analysis with Kraken 2. Genome Biol. 20, 257 (2019).
28. Garber, A. I. et al. SprayNPray: user-friendly taxonomic profiling of genome and metagenome contigs. BMC Genom. 23, 202 (2022).
29. Wick, R. R. et al. Trycycler: consensus long-read assemblies for bacterial genomes. Genome Biol. 22, 266 (2021).
30. GitHub - rrwick/Filtlong: quality filtering tool for long reads. GitHub https://github.com/rrwick/Filtlong.
31. Li, H. Minimap and miniasm: fast mapping and de novo assembly for noisy long sequences. Bioinformatics 32, 2103–2110 (2016).
32. Wick, R. R. & Holt, K. E. Benchmarking of long-read assemblers for prokaryote whole genome sequencing. F1000Res. 8, 2138 (2019).
33. Vaser, R. & Šikić, M. Time- and memory-efficient genome assembly with Raven. Nat. Comput Sci. 1, 332–336 (2021).
34. Koren, S. et al. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res. 27, 722–736 (2017).
35. De Coster, W., D’Hert, S., Schultz, D. T., Cruts, M. & Van Broeckhoven, C. NanoPack: visualizing and processing long-read sequencing data. Bioinformatics 34, 2666–2669 (2018).
36. Truong, D. T. et al. MetaPhlAn2 for enhanced metagenomic taxonomic profiling. Nat. Methods 12, 902–903 (2015).
37. Schwengers, O. et al. Bakta: rapid and standardized annotation of bacterial genomes via alignment-free sequence identification. Microb. Genom. 7, 000685 (2021).
38. GitHub - nanoporetech/medaka: Sequence correction provided by ONT Research. GitHub https://github.com/nanoporetech/medaka.
39. Danecek, P. et al. The variant call format and VCFtools. Bioinformatics 27, 2156 (2011).
40. Page, A. J. et al. Rapid efficient extraction of SNPs from multi-FASTA alignments. Microb. Genom. 2, e000056 (2016).
41. Minh, B. Q. et al. IQ-TREE 2: New models and efficient methods for phylogenetic inference in the genomic era. Mol. Biol. Evol. 37, 1530–1534 (2020).
42. Minh, B. Q., Nguyen, M. A. T. & von Haeseler, A. Ultrafast approximation for phylogenetic bootstrap. Mol. Biol. Evol. 30, 1188–1195 (2013).
43. Camacho, C. et al. BLAST+: architecture and applications. BMC Bioinforma. 10, 1–9 (2009).
44. O’Hara, B. J., Alam, M. & Ng, W.-L. The Vibrio cholerae Seventh Pandemic Islands act in tandem to defend against a circulating phage. PLoS Genet. 18, e1010250 (2022).
45. Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).
46. Cingolani, P. et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly. (Austin) 6, 80–92 (2012).
47. Lees, J. A., Galardini, M., Bentley, S. D., Weiser, J. N. & Corander, J. pyseer: a comprehensive tool for microbial pangenome-wide association studies. Bioinformatics 34, 4310–4312 (2018).
48. Jaillard, M. et al. A fast and agnostic method for bacterial genome-wide association studies: Bridging the gap between k-mers and genetic events. PLoS Genet. 14, e1007758 (2018).
49. Cooper, K. L. F. et al. Development and validation of a PulseNet standardized pulsed-field gel electrophoresis protocol for subtyping of Vibrio cholerae. Foodborne Pathog. Dis. 3, 51–58 (2006).
50. Hyppa, R. W. & Smith, G. R. Using Schizosaccharomyces pombe meiosis to analyze DNA recombination intermediates. Methods Mol. Biol. 557, 235–252 (2009).
51. Zhao, W., Caro, F., Robins, W. & Mekalanos, J. J. Antagonism toward the intestinal microbiota and its effect on Vibrio cholerae virulence. Science https://doi.org/10.1126/science.aap8775 (2018).
52. Fykse, E. M., Skogan, G., Davies, W., Olsen, J. S. & Blatny, J. M. Detection of Vibrio cholerae by real-time nucleic acid sequence-based amplification. Appl Environ. Microbiol. 73, 1457–1466 (2007).
53. Livak, K. J. & Schmittgen, T. D. Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) Method. Methods 25, 402–408 (2001).
54. O’Toole, G. A. Microtiter dish biofilm formation assay. J. Vis. Exp. https://doi.org/10.3791/2437 (2011).
55. Zheng, Z. et al. Symphonizing pileup and full-alignment for deep learning-based long-read variant calling. Nat. Comput Sci. 2, 797–803 (2022).
56. Ramamurthy, T. et al. Vibrio cholerae O139 genomes provide a clue to why it may have failed to usher in the eighth cholera pandemic. Nat. Commun. 13, 3864 (2022).
57. Taylor-Brown, A. et al. Genomic epidemiology of Vibrio cholerae during a mass vaccination campaign of displaced communities in Bangladesh. Nat. Commun. 14, 3773 (2023).
58. Monir, M. M. et al. Genomic characteristics of recently recognized Vibrio cholerae El Tor lineages associated with Cholera in Bangladesh, 1991 to 2017. Microbiol Spectr. 10, e0039122 (2022).
59. Chaumeil, P.-A., Mussig, A. J., Hugenholtz, P. & Parks, D. H. GTDB-Tk v2: memory friendly classification with the genome taxonomy database. Bioinformatics 38, 5315–5316 (2022).
60. GitHub - jenniferlu717/Bracken: Bracken (Bayesian Reestimation of Abundance with KrakEN) is a highly accurate statistical method that computes the abundance of species in DNA sequences from a metagenomics sample. GitHub https://github.com/jenniferlu717/Bracken.
61. Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
62. Bankevich, A. et al. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J. Comput. Biol. 19, 455–477 (2012).
63. Wick, R. R., Judd, L. M., Gorrie, C. L. & Holt, K. E. Unicycler: Resolving bacterial genome assemblies from short and long sequencing reads. PLoS Comput. Biol. 13, e1005595 (2017).
64. Gurevich, A., Saveliev, V., Vyahhi, N. & Tesler, G. QUAST: quality assessment tool for genome assemblies. Bioinformatics 29, 1072–1075 (2013).
65. Stamatakis, A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30, 1312–1313 (2014).
66. Tonkin-Hill, G. et al. Producing polished prokaryotic pangenomes with the Panaroo pipeline. Genome Biol. 21, 180 (2020).
67. Katoh, K. & Standley, D. M. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 30, 772–780 (2013).
68. Price, M. N., Dehal, P. S. & Arkin, A. P. FastTree 2–approximately maximum-likelihood trees for large alignments. PLoS One 5, e9490 (2010).
69. GitHub - amberjoybarton/is-it-7pet: Identifying whether a cholera sample belongs to the seventh pandemic El tor (7PET) sub-lineage based on sequencing data or an assembly. GitHub https://github.com/amberjoybarton/is-it-7pet.
70. Garrison, E. & Marth, G. Haplotype-based variant detection from short-read sequencing. (2012).
71. GitHub - tseemann/snippy: Rapid haploid variant calling and core genome alignment. GitHub https://github.com/tseemann/snippy.
72. Li, H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. (2013).
73. Danecek, P. et al. Twelve years of SAMtools and BCFtools. Gigascience 10, giab008 (2021).

Acknowledgements

We are grateful to the people of Dhaka, where our study was undertaken, to the field, laboratory, and data management staff, who provided a tremendous effort to make the study successful, and to the people who provided valuable support in our study. We also acknowledge Matthew Doucette for sharing protocols. AC was supported by a Postdoc. Mobility Fellowship from the Swiss National Science Foundation (P500PB_214356). BJS and PL were supported by a Canadian Institutes for Health Research (CIHR) Project Grant and Fellowship, respectively. This work was also supported by a grant from the U.S. National Institutes of Health/NIAID AI106878 (ETR, FQ), K08AI123494 (AAW), and T32HD007233 (DC). RWH and GRS were supported by research grant R35 GM118120 from the National Institutes of Health of the United States of America.

Author information

Authors and Affiliations

Department of Microbiology and Immunology, McGill University, Montréal, QC, Canada
Aline Cuénod, Patrick Lypaczewski & B. Jesse Shapiro
Department of Medicine, University of Washington, Seattle, WA, USA
Denise Chac, Susan M. Markiewicz, Amelia Rice & Ana A. Weil
International Centre for Diarrhoeal Disease Research, Bangladesh, (ICDDR, B), Dhaka, Bangladesh
Ashraful I. Khan, Fahima Chowdhury, Taufiqur R. Bhuiyan & Firdausi Qadri
Division of Basic Sciences, Fred Hutchinson Cancer Center, Seattle, WA, USA
Randy W. Hyppa & Gerald R. Smith
Department of Biology, McGill University, Montréal, QC, Canada
Akhil Kholwadwala
Division of Infectious Diseases, Massachusetts General Hospital, Boston, MA, USA
Stephen B. Calderwood, Edward T. Ryan & Regina C. LaRocque
Department of Medicine, Harvard Medical School, Boston, MA, USA
Stephen B. Calderwood, Edward T. Ryan, Jason B. Harris & Regina C. LaRocque
Department of Immunology and Infectious Diseases, Harvard T.H. Chan School of Public Health, Boston, MA, USA
Edward T. Ryan
Department of Pediatrics, Harvard Medical School, Boston, MA, USA
Jason B. Harris
Division of Global Health, Massachusetts General Hospital for Children, Boston, MA, USA
Jason B. Harris
Department of Global Health, University of Washington, Seattle, WA, USA
Ana A. Weil
McGill Genome Centre, McGill University, Montréal, QC, Canada
B. Jesse Shapiro
McGill Centre for Microbiome Research, McGill University, Montréal, QC, Canada
B. Jesse Shapiro

Contributions

A.I.K., F.C., S.B.C., E.T.R., J.B.H., R.C.L., T.R.B., and F.Q. conducted the study in Dhaka, Bangladesh, and collected the patient samples. D.C. extracted the DNA for all samples. D.C. and A.R. conducted the in vitro experiments. P.L. developed the sequencing workflow and compiled the initial assemblies. P.L. and A.K. sequenced all samples. A.C. conducted all other bioinformatic analysis and generated the figures. D.C., R.W.H., G.R.S., and S.M.M. designed and performed the PFGE assay. A.A.W., B.J.S., and P.L. supervised the project. E.T.R., F.Q., D.C., G.R.S., B.J.S., A.A.W., P.L., and A.C. acquired funding for this project. A.C. and B.J.S. wrote the original manuscript. D.C., A.I.K., F.C., R.W.H., S.M.M., A.R., A.K., S.B.C., E.T.R., J.B.H., R.C.L., T.R.B., G.R.S., F.Q., P.L., and A.A.W. reviewed and approved the manuscript.

Corresponding authors

Correspondence to Aline Cuénod, Patrick Lypaczewski, Ana A. Weil or B. Jesse Shapiro.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Shanmuga Sozhamannan and the other anonymous reviewers for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Description of Additional Supplementary Files

Supplementary Dataset 1

Supplementary Dataset 2

Supplementary Dataset 3

Reporting Summary

Transparent Peer Review file

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Cuénod, A., Chac, D., Khan, A.I. et al. Prevalent chromosome fusion in Vibrio cholerae O1. Nat Commun 16, 5830 (2025). https://doi.org/10.1038/s41467-025-60699-0

Received: 27 June 2024
Accepted: 02 June 2025
Published: 01 July 2025
DOI: https://doi.org/10.1038/s41467-025-60699-0
