Abstract
Listeria monocytogenes (Lm) is a highly pathogenic bacterium that can cause listeriosis, a relatively rare food-borne infectious disease that affects farm, domestic, wild animals and humans as well. The infected livestock is the frequent sources of Lm. Vaccination is one of the methods of controlling listeriosis in target farm animals to prevent Lm-associated food contamination. Here we report the complete sequence of the Lm strain AUF attenuated from a fully-virulent Lm strain by ultraviolet irradiation, successfully used since the 1960s as a live whole-cell veterinary vaccine. The de novo assembled genome consists of a circular chromosome of 2,942,932 bp length, including more than 2,800 CDSs, 17 pseudogenes, 5 antibiotic resistance genes, and 56/92 virulence genes. Two wild Lm strains, the EGD and the 10403S that is also used in cancer Immunotherapy, were the closest homologs for the Lm strain AUF. Although all three strains belonged to different sequence types (ST), namely ST12, ST85, and ST1538, they were placed in the same genetic lineage II, CC7.
Design Type(s) | sequence analysis objective • sequence assembly objective• sequence annotation objective • genotyping by high throughput sequencing design |
Measurement Type(s) | whole genome sequencing • genome assembly • sequence annotation |
Technology Type(s) | DNA sequencing • whole genome sequencing |
Factor Type(s) | Organism Strain • live whole cell vaccine genome |
Sample Characteristic(s) | Listeria monocytogenes |
Similar content being viewed by others
Background & Summary
Listeria monocytogenes (Lm), a highly pathogenic bacterium, is a well-known causative agent of listeriosis. This infection has been proven to be one of the important life-threatening food-borne infections for many species, such as humans, wildlife and farm animals, wild birds, and poultry, being a major concern for both veterinary and public health worldwide1,2,3,4,5,6,7. Lm pathogen was first recognized as a pathogen in the 1920s during an outbreak in laboratory animals, such as rabbits1,2,8. However, research interest in Listeria has risen significantly since the 1980s, when the critical role of Lm as the etiologic agent responsible for sporadic cases and numerous outbreaks of human listeriosis was established. These episodes were strictly associated with the consumption of contaminated foods, mainly in Western Europe, Canada, the USA, and Africa2,3,6,9,10,11,12,13,14,15. Listeriosis is a typical primary zoonotic infection that can be also transmitted vertically in infected pregnant animals and humans, crossing the blood-brain and placental barriers and leading to an invasion into the central nervous system and fetus infection2,3,5,8,9,14,16,17,18,19,20,21,22. This ubiquitous Gram-positive bacterium can produce in infected mammals a systemic infection leading to bacteremia with serious complications, such as septicemia, meningitis and other central nervous systems (CNS) pathology. Moreover, the pathogen can cause severe focal infections and adverse maternal–neonatal outcomes, including abortion, stillbirth, and preterm birth. Overall, neonatal listeriosis accounts for more than 20% of fatal cases2,3,5,8,19,20,21,22. Pregnant women, the elderly, and immunocompromised individuals have the highest risk of developing complications and mortality from listeriosis2,3,5,8,13,19,20.
Infected farm animals are one of the sources of Lm5,6,7,10. Fecal shedding of Lm was found in dairy cattle (46.3%), beef cattle (30.6%) and sheep herds (14.2%)10. Vaccination is considered one of the methods of choice to control listeriosis in target farm animals to prevent the spread of this infection and Lm-associated food contamination, especially for meat and dairy products4,6,10,17,18,23,24. Nevertheless, inactivated, live avirulent, or fully virulent Lm strains have been reported to fail in inducing the protective immune response against animal listeriosis6.
The attenuated Lm strain AUF has been successfully used for almost 50 years in some regions of the former USSR as a live whole-cell vaccine (LWCV) against listeriosis in farm animals. The Lm strain AUF derives from the fully virulent Lm strain ‘A’ isolated in 1965 from the pathological material of the brain of an ewe with neurolisteriosis23,24. The attenuation was done by a combination of 17 repeated exposures of the original strain ‘A’ to Ultraviolet radiation (UVR)23,24. The selected Lm strain AUF demonstrated low virulence in animal models (outbred white mice and rabbits) and pronounced immunogenicity in target farm animals (sheep, goats, cows, and pigs)23,24. Since the late 1960s, the commercial LWCV based on the Lm strain AUF has been manufactured for the prevention of listeriosis in animal husbandry. However, the genome of the Lm strain AUF has not been sequenced. It is critical to identify the possible genetic markers contributing to the attenuation and residual virulence of this particular Lm strain, which has been used as a commercial LWCV for a long time.
In order to elucidate the basic genomic features of this unique strain, two platforms - Illumina HiSeq 2500 (Illumina Inc., USA) and Nanopore MinION (Oxford Nanopore, UK), intended for short-read and long-read sequencing strategies, respectively, were used in this study in parallel. After quality filtering following the trimming of ‘raw’ massive data, the de novo hybrid assembly of the complete genome sequence of the Lm strain AUF represented by a single chromosome 2.94 Gb in size, was successfully obtained. Structural and functional analysis of the Lm strain AUF genome25 showed that the assembled complete Lm strain AUF chromosome contained 2,963 genes, including 2,874 CDSs, 17 pseudogenes, 89 RNA genes, 6 rRNA (5 S, 16 S, 23 S), 67 tRNA and 4 non-coding RNA, 5 antibiotic resistance genes (fosX, mprF, lin, norB and sul), 56 of 92 genes associated with Lm virulence26,27 and immunogenicity factors26,27, including such key virulence factors as inlA and inlB which encoded internalin A (InlA) and internalin B (InlB) responsible for binding the host cell receptors E-cadherin and Met, respectively; six genes of the pathogenicity island LIPI-1, plcA, hly, mpl, actA, plcB and prfA, which encoded the transcriptional regulator positive regulator factor A (PrfA) controlling the expression of both inlAB locus and LIPI-128,29,30,31,32 etc.). Additionally, the five-gene stress survival islet (SSI-1, lmo0444, lmo0445, lmo0446, lmo0447 and lmo0448), which is known to confer resistance to environmental stress, such as low pH, high osmolarity, bile and nisin33, was annotated in the Lm strain AUF suggesting the possible contribution of these genes to the adaptation of the Lm strain AUF to the relevant conditions. Unfortunately, the culture and sequence of the parental Lm strain ‘A’ isolated more than 50 years ago, is not currently available. In fact, analyses for mutations that might have been induced by the UVR exposure can therefore not be accurately confirmed but rather inferred from closely related fully virulent Lm strains. For this reason, the whole-genome sequence of the Lm strain AUF was compared with nine assembled Lm strains available in GenBank, which had been earlier isolated from different animals with listeriosis, including the referent fully virulent Lm standard strains, the Lm strain FDAARGOS_607, and the Lm strain EGD-e strains approved by the US Food and Drug Administration and the European Consortium, respectively34,35,36,37. Additionally, the genomes of the Lm strain 10403S and the Lm strain EGD, which have utilized together with the Lm strain EGD-e to analyse the Lm virulence38, were selected. We found two groups of loci, which were identified in: (i) both the Lm strain AUF and fully virulent reference strains (virulence (56/73, 76.7%), antibiotic resistance (5/5, 100%), SSI-1 (5/5, 100%), Lm Genomic Islands (2/34, 5.9%), motility (n = 29/31, 93.5%); (ii) only in the fully virulent Lm strains, but not in the Lm strain AUF (virulence (17/73, 23.3%), cadmium resistance (n = 2/2, 100%), SSI-2 (n = 2/2, 100%), Lm Genomic islands (n = 32/34, 94,1%), motility (n = 2/31, 6,5%). Moreover, marked polymorphisms were instantly detected in both nucleotide sequences and allelic profiles of the Lm strain AUF major virulence genes, such as pfrA, hly, inlA and inlB which were present in other fully and weakly virulent Lm strains. Importantly, the Lm strain AUF was not absolutely genetically close (3,690 SNPs) to the Lm strain EGD; the Lm strain AUF demonstrated no point mutation in the transcriptional regulator prfA typical for the Lm strain EGD compared with the Lm strain EGD-e as G145S38. Surprisingly, the Lm strain AUF was genetically close (148 SNPs) to the Lm strain 10403S, which was successfully utilized for the development of attenuated Lm-based cancer vaccine vectors, exploring the Lm unique life cycle and ability to induce robust Cytotoxic T lymphocytes (CTLs) immune responses39,40,41,42,43,44. Nevertheless, contrary to the Lm vectors used in the Lm cancer vaccination platforms which contained truncated virulence genes prfA, actA and plcB39,40,41,42,43,44, both Lm strains, the Lm strain AUF and the Lm strain 10403S, demonstrated the presence of the intact variants of the relevant genes. The Lm strain AUF was markedly distinct from other Lm reference strains, such as the Lm strain EGD-e (31,385 SNPs), the Lm strain NTSN (141,219 SNPs), the Lm strain FDAARGOS_607 (144,871 SNPs), the Lm strain UKVDL9 (178,505 SNPs), the Lm strain UKVDL4 (178,552 SNPs), the Lm strain UKVDL7 (180,314 SNPs), the Lm strain 4/52-1953 (182,819 SNPs) and the Lm strain FSL-J1-158 (196,422 SNPs).
The data presented here is the first report highlighting the genome loci which could be either directly or indirectly involved in the attenuation (LIPI-3, srtB, agrC, vip, gltB, gltA, aut_IVb, metal resistance genes and Lm Genomic Islands-associated genes) and residual virulence (LIPI-1, plcA, hly, mpl, inlA, inlB, prfA, hly, actA and plcB) of the single Lm strain AUF with a long history of application as an effective veterinary LWCV against listeriosis. Our data could also serve as the basis for unravelling the mechanisms of virulence and pathogenicity in Lm. The results of this study will be useful for improving our knowledge of bacterial vaccinology overall and developing of a new generation of vaccines against listeriosis in farm animals. This is critical for animal health and welfare, food safety, and human public health worldwide.
Methods
Bacterial strain
The Lm strain AUF was obtained from the Collection of Microorganisms of the Department for Microbiology and Biotechnology, Saratov State University of Genetics, Biotechnology and Engineering named after N.I. Vavilov, Saratov, Russia. The Lm strain AUF was stored in lyophilized form. The Lm strain AUF was routinely cultivated on Tryptone Soy Yeast Extract Agar (TSYE Agar) (Merck, EU) overnight prior to the experiments as described45,46.
DNA Extraction for short & long reads sequencing
Genomic DNA was extracted from the lysate of the Lm strain AUF culture, using a commercial DNA DNeasy Blood & Tissue Kit (Qiagen, Germany) according to the manufacturer’s instructions as described45,46. The final DNA concentration was measured using a spectrophotometer from BioRad (Bio-Rad, USA).
Whole genome sequencing
The isolated DNA from the Lm strain AUF strain was used in two parallel sequencing platforms, Illumina HiSeq 2500 (Illumina Inc., USA) and Nanopore MinION (Oxford Nanopore, UK). For this purpose, the preparation of DNA libraries was conducted using either Nextera XT DNA Library Preparation Kit (Illumina Inc., USA) or Nanopore Kit SQK-LSK109 (Oxford Nanopore, UK), respectively, as we described recently45,46. In the first case, the DNA was sequenced commercially as 250-bp single-Read (Illumina Inc., USA) at Genoanalytica LLC (https://www.genoanalytica.ru/, Moscow, Russia). A FLO-MIN-106 R9.4 Flow cell (Oxford Nanopore Technologies, Oxford, UK) was routinely used to perform sequencing with the MinION and the MinKNOW software as recommended (https://nanoporetech.com/).
Genome assembling and annotation
Unicycler v0.4.9 with default parameters (https://github.com/rrwick/Unicycler) was used to generate a hybrid high-quality de novo assembly of the Lm strain AUF strain genome47. The annotation of chromosome was performed by the NCBI Prokaryotic Genome Annotation Pipeline (PGAP) with default parameters on the submission portal of NCBI GenBank (https://www.ncbi.nlm.nih.gov/genome/annotation_prok/)48. Construction of the phylogenetic tree based on the Lm strain AUF and other Listeria spp. strain whole genomes was conducted with the online tool REALPHY 1.13 using default parameters (https://realphy.unibas.ch/realphy/)49 and MEGA-7 software50 by the maximum likelihood method for the generation of the phylogenetic tree based on the internalin genes. Antibiotic resistance genes were identified using CARD (https://card.mcmaster.ca/analyze/rgi) and the Antibiotic Resistance scheme of the Institute Pasteur Bacterial Isolate Genome Sequence Database (BIGSdb Version 1.42.0) (https://bigsdb.pasteur.fr/listeria/). Comparative analysis of the CDSs in the Lm strain AUF strain relative to the Lm reference strains was performed using «Proteome Comparison Service» tool on The Bacterial and Viral Bioinformatics Resource Center (BV-BRC) platform51. Visualization of the linear map of the Lm strain AUF genome was generated using the online tool Proksee (https://proksee.ca/)52. Genes encoding virulence factors in the Lm strain AUF and other Lm strains were found using the BIGSdb-Lm database (https://bigsdb.pasteur.fr/listeria/). Identification of allele profiles of the Lm strain AUF and reference Lm strains whole genome and Multilocus sequence typing (MLST) was also performed using the BIGSdb-Lm database (https://bigsdb.pasteur.fr/listeria/). Comparative analysis of pseudogenes in the Lm strain AUF and other Lm strains was performed using BLAST (https://blast.ncbi.nlm.nih.gov/Blast.cgi).
A panel of Lm reference strains
We compared the annotated completed genomes of ten Lm strains available in the NCBI GenBank (https://www.ncbi.nlm.nih.gov/). The relevant whole genome sequences were downloaded: the Lm strain FDAARGOS_60753, the Lm strain EGD-e54, the Lm strain EGD55, the Lm strain FSL-J1-15856, the Lm strain NTSN57, the Lm strain UKVDL958, the Lm strain UKVDL459, the Lm strain 4/52-195360, the Lm strain UKVDL761 and the Lm strain 10403S62. Additionally, the genomes of 12 strains of other Listeria spp. were used for the construction of a phylogenetic tree in order to demonstrate the phylogenetic relationships between the Lm strain AUF and other Listeria, such as: the Listeria grayi strain NCTC1081263, the Listeria valentina strain CLIP 2019/0064264, the Listeria kieliensis strain Kiel-L165, the Listeria floridensis strain FSL S10-118766, the Listeria fleischmannii strain 199167, the Listeria rustica strain FSL W9-058568, the Listeria newyorkensis strain CMB19106369, the Listeria cornellensis strain FSL F6-096970, the Listeria rocourtiae strain CECT 7972 Ga0244616_10171, the Listeria booriae strain FSL A5-028172, the Listeria riparia strain FSL S10-120470, the Listeria phage strain A11873.
Data Records
The completed whole genome sequence of the Lm strain AUF has been deposited in GenBank with the accession number CP048400.147. All of the reads for the Lm strain AUF genome have been deposited in the NCBI Sequence Read Archive under the accession numbers SRR25210281 for Oxford Nanopore data74 and SRR25180708 for Illumina HySeq 2500 data)75.
Technical Validation
Before starting the genome assembly, raw reads were prepared using an automated AfterQC script to remove low-quality reads and Illumina adapters in default settings for short reads, as well as to remove adapters using Filtlong v0.2.1 software. To filter long reads by quality obtained on the Oxford Nanopore platform, we used the Filtlong script (https://github.com/rrwick/Filtlong) with the filter parameters --min_length 1000 --keep_percent 90. Trimming and searching for Nanopore adapter sequences in long reads was performed using the Porechop script according to the recommended parameters (https://github.com/rrwick/Porechop). The de novo assembly using both short and long reads, obtained from both platforms, Illumina and Nanopore MinION, generated the Lm strain AUF circular chromosome of 2,942,932 bp in size with a GC content of 37.98%.
To clarify the taxonomic affiliation of the assembled Lm strain AUF genome to other Listeria spp. representatives, we constructed a phylogenetic tree based on the whole genomes available in the NCBI GenBank (https://www.ncbi.nlm.nih.gov/). For this purpose, we selected the reference genomes of 15 different Listeria spp. and 9 Lm strains associated with listeriosis in animals. The Lm strain AUF formed a separate branch with representatives of the Lm only, but not with other Listeria spp. reference bacteria strains, which proved its genetic affiliation to the indicated Lm species (Fig. 1). Importantly, the Lm strain AUF visualized in a single clade with the fully virulent Lm strain EGD, which was isolated in 1924 in the United Kingdom from a guinea pig infected with biomaterial derived from a rabbit with listeriosis during the disease outbreak in laboratory animals1,2,8. The data obtained recognized the EGD as the closest homolog to the Lm strain AUF, in contrast to other Lm representatives, including both the Lm strain EGD-e and the Lm strain FDAARGOS_607 (Fig. 1). The Lm strain EGD-e was assigned as the second homolog for the Lm strain AUF located in the same cluster with the Lm strain AUF and the Lm strain EGD strains (Fig. 1). The Lm strain AUF was also distinguished from the Lm 4/52-1953, had also been isolated on the territory of the former USSR, although a decade earlier46. In fact, recently phylogenetic analysis based on whole genomes of Lm strains (n = 257) available in the NCBI GenBank (https://www.ncbi.nlm.nih.gov/) showed that these two Lm strains belonged to different Clusters, the Lm strain AUF to the Cluster II, represented by genetic lineages I and II, whereas the Lm strain 4/52-1953 - to the Cluster I, formed by genetic lineage III only46. Interestingly, the Lm strain AUF was phylogenetically closer to the Lm strain 10403S derived from the clinical sample of a human with listeriosis46. Both strains, the Lm strain AUF and the Lm strain 10403S, formed a single Cluster II46. The prfA-, or actA- or plcB- defective derivatives of the Lm strain 10403S were reported to be safe live attenuated Lm vectors for the development of cancer immunotherapy vaccines39,40,41,42,43,44.
Phylogenetic analysis of the Lm strain AUF using the reference whole genome sequences of Lm and other Listeria spp. available in NCBI GenBank (https://ncbi.nlm.nih.gov/). The phylogenetic tree was built using REALPHY 1.13 (https://realphy.unibas.ch/realphy/) and visualized with MEGA-730.
Through MLST, based on the sequences of seven housekeeping genes (abcZ, bglA, cat, dap, dat, ldh and lhkA), we found that the Lm strain AUF together with the closest homologs, the fully virulent reference strains, the Lm strain EGD, the Lm strain EGD-e and the Lm strain 10403S, belonged to the same genetic lineage II, although these strains sourced from different hosts, such as sheep, rabbit and human, respectively (Table 1). Moreover, the Lm strain AUF, the Lm strain EGD and the Lm strain 10403S corresponded to the identical clonal complex (CC) CC7, contrary to the Lm strain EGD-e which was assigned to CC9. Nevertheless, the Lm strain AUF, the Lm strain EGD and the Lm strain 10403S related to the different sequence types (ST), ST1538, ST12 and ST85, respectively, although all three strains demonstrated six of seven identical alleles (abcZ, bglA, cat, dap, dat, and lhkA) with polymorphism in only a single allele, ldh (Table 1). We found only one individual allele, lhkA, which was identical for the Lm strain AUF, the Lm strain EGD, the Lm strain 10403S and the Lm strain EGD-e. No identical alleles were revealed between the Lm strain AUF and two ovine fully virulent Lm strains belonging to the genetic lineage I, the Lm FDAARGOS_607 and the Lm strain NTSN, and the ovine, bovine, porcine, equine and caprine Lm strains, the Lm strain UKVDL9, the Lm strain UKVDL4, the Lm strain 4/52-1953, the Lm strain UKVDL7, the Lm strain FSL-J1-158, the Lm strain the Lm strain UKVDL7 and the Lm strain of the genetic lineages III-IV belonging to different STs, ST1194, ST1069, ST201, ST1140 and ST563 of clonal complexes ST1194, ST1069, CC69, CC1070 and ST563 (Table 1). Comparative analysis of the coding regions using BLASTP51 proved the marked discrimination of the Lm strain AUF genome from the other ten representative Lm strains (Fig. 2). As expected, the Lm strain AUF showed the highest homology of coding regions with the Lm strain 10403S, the Lm strain EGD, a few somewhat lower one with the Lm strain EGD-e one, and much lower one with other Lm reference strains compared.
Visualization of the alignment of the coding regions of the Lm strain AUF versus ten reference Lm strains using «Proteome Comparison Service» tool on The Bacterial and Viral Bioinformatics Resource Center (BV-BRC) platform31. Protein sequence identity is determined on a colorimetric scale, where purple/blue colors correspond to a higher percentage of identity than orange/red colored regions (based on the Best Bidirectional Hits and Unidirectional best hit comparison algorithms). The white areas represent the absence of coding regions annotated at that location in the chromosome sequence of the respective strain.
Comparative analysis of the whole genomes of the Lm strain AUF versus nine Lm reference strains of zoonotic source on the BV-BRC platform (https://www.bv–brc.org/) showed 100% homology in more than 90% CDSs in the Lm strain EGD strain, approximately 50% in the Lm strain EGD-e and only 12–14% in other Lm genomes (Table 2). Similarly, the highest homology in CDSs ranging from 20–79% demonstrated in the majority of Lm strains except both closest homologs. Also, approximately 200 CDSs were present in only the Lm strain AUF and absent in all other strains except the Lm strain EGD. About 100 CDSs were annotated in the Lm strain AUF but not in the Lm strain EGD76. These Lm strain AUF-specific genes were located on the Lm strain AUF chromosome as several compact regions, which were visualized as extended insertions in the Lm strain AUF compared with other reference Lm strains used, including both closest homologs, the Lm strain EGD and the Lm strain EGD-e, and the Lm strain 10403S as well (Fig. 2). The majority of these Lm strain AUF-specific genes encoded uncharacterized hypothetical proteins, Listeria CRISPR-associated proteins (Cas1/Cas2/Cas9/Csn2)77, phage proteins, Type I restriction-modification system components, and in contrast to some Lm strains, GNAT family Acetyltransferase, Gp45 protein, 4-hydroxy-2-oxoglutarate aldolase (EC 4.1.3.16), 2-dehydro-3-deoxyphosphogluconate aldolase (EC 4.1.2.14), transcriptional regulator, ADP-ribosyl glycohydrolase, putative EsaC protein analog (Listeria type 3), cassette chromosome recombinase B, and mobile element proteins. Bacteriophage A118 genes which have been known as Listeria phage A118 (NCBI Acc. number: NC_003216.1), and initially hosted in the Lm strain EGD-e78,79,80,81, were successfully detected in the Lm strain AUF and all other Lm strains used in the current study except the Lm strain EGD76. Comparison of the whole genomes of the Lm strain AUF with the Lm strain 10403S sourced from human sample using the similar BV-BRC platform (https://www.bv–brc.org/) demonstrated 100% homology for about 93% CDSs, 80 – 99% identity for slightly more than 3% CDSs and less than 1% in CDSs ranging from 20-79% (Table 2). Only 93 CDSs were annotated in the Lm strain AUF. These were absent in the Lm strain 10403S76. Basically, there were the same Lm strain AUF-specific genes which encoded uncharacterized hypothetical proteins and phage proteins76.
Additionally, we found (Table 3) in the Lm strain AUF approximately 60% (56/92) of genes that have been recently recognized as those involved in Lm virulence26. This amount was 7-10% less than that revealed in both fully virulent Lm strains of the genetic lineage I, the Lm strain NTSN (62/92, 67.4%) and the Lm strain FDAARGOS_607 (64/92, 69.6%). However, the number of virulence genes in the Lm strain AUF was certainly higher than that in the Lm strains of the genetic lineages III-IV, up to 1.3 - 3.9 times, for instance, in the Lm strain UKVDL7 (14/92, 15.2%) and the Lm strain 4/52-1953 (41/92, 44.6%), respectively. In fact, the Lm strain AUF demonstrated almost similar characteristics in this regard compared with the other two Lm strains of zoonotic origin, the Lm strain EGD and the Lm strain EGD-e (57/92, 61.9%), and with the Lm human strain 10403S (57/92, 61.9%), also belonging to the genetic lineage II.
Importantly, 14 specific loci were identified in both Lm strains of the lineage I, the Lm strain FDAARGOS_607 and the Lm strain NTSN (Table 1), but not in the Lm strain AUF25. There were the following genes: of LIPI-3, the additional sub-lineage pathogenicity island encoding listeriolysin S (LLS), a hemolytic toxin, a bacteriocin31,32 (LIPI3_llsB (LMOf2365_1116), LIPI3_llsD (LMOf2365_1118), LIPI3_llsY (LMOf2365_1117), LIPI3_llsG (LMOf2365_1113), LIPI3_llsH (LMOf2365_1114), LIPI3_llsP (LMOf2365_1119), LIPI3_llsX (LMOf2365_1115), LIPI3_llsA (LMOf2365_1112a); srtB (lmo2181), encoded the sortase B peptidoglycan-anchored protein induced in low iron conditions and involved with other sortases in many aspects of pathogenesis, from biofilm formation, adhesion, and immune suppression to bacterial loads and lethality82,83; agrC (lmo0050), responsible for the expression of AgrC, an integral membrane protein, a member of the class 10 receptor histidine protein kinases which are identified in Lm and involved in a quorum sensing system for the regulation of virulence (internalization, toxins) and biofilm84,85,86,87; vip (lmo0320), encoded vegetative insecticidal proteins (VIPs), toxins, during the vegetative growth phase by different bacterial species88; gltB (LMOf2365_2741), which encodes the major subunit of the glutamate synthase; gltA (LMOf2365_2740), citrate synthase gene, a critical mediator of site-specific fitness of pathogens during infection due to its influence on metabolic flexibility89; aut_IVb (LMOF2365_RS00075), which encodes autolysin, a member of a group of bacterial hydrolases involved in Lm invasion90,91,92. Less difference was noted between the Lm strain AUF and the Lm strain EGD. The latter strain contained only two additional loci, srtB (lmo2181) and agrC (lmo0050) which were absent in the Lm strain AUF, but were found in the fully virulent strains, such as the Lm strain EGD-e, the Lm strain NTSN, and the Lm strain FDAARGOS_607. We identified three loci, srtB (lmo2181), agrC (lmo0050), and vip (lmo0320) in the Lm strain EGD-e and in the Lm strain 10403S but not in the Lm strain AUF. The Lm strain NTSN had 13 additional loci which were not found in the Lm strain AUF. The same loci were present in the Lm strain FDAARGOS_607, except for aut_IVb (LMOF2365_RS00075), and absent in the Lm strain AUF. Only two specific loci (agrA (lmo0051) and comK (LMOf2365_2303)) were revealed in the Lm strain AUF, being absent in the Lm strain 10403S25. The locus prfA (lmo0200), which encodes the transcriptional factor PfrA, the master regulator of Lm major virulence genes39,93,94,95, was identified in the Lm strain AUF similarly with other Lm reference strains25. Remarkably, the Lm strain AUF PrfA demonstrated no 2 amino acid changes typical for the Lm strain EGD (Serine → Glycine, S145G; Tyrosine → Cysteine, Y229C), resulting in the constitutive overexpression in the Lm strain EGD of several major virulence genes39 although the marked prfA nucleotide sequence polymorphism96. Multiple sequence alignment of the Lm strains AUF PrfA protein versus other Lm reference strains showed its identity with those in the Lm strains, the Lm strain EGD-e, the Lm strain FDAAGROS_607, the Lm strain NTSN, the Lm strain UKVDL4 and the Lm strain 10403S97, thus supposing the possible expression in the Lm strain AUF of the PrfA-regulated virulence genes comparable to the Lm strain EGD-e and other virulent Lm strains as well. The locus hly (lmo0202) encoded the pore-forming toxin listeriolysin-O (LLO), allowing Lm to escapes the phagosome to avoid lysosomal killing43,98 was also identified in the Lm strain AUF and other Lm reference strains96,99. The alignment of the Lm strain AUF LLO protein was almost identical to those in the Lm strain EGD, the Lm strain EGD-e and the Lm strain 10403S, although differed from all the other Lm strains by a single amino acid substitution of one Serine to Lysine, S523K. Also, there were 3 other amino acid substitutions, namely: (i) Threonine to Asparagine, T309N in the Lm strain AUF and other strains except the Lm strain FSL-J1-158; (ii) Asparagine, N in position 31 in the Lm strain AUF, the Lm strain EGD, the Lm strain EGD-e, the Lm strain 10403S, the Lm strain 4/52-1953, the Lm strain FDAAGROS_607 and the Lm strain NTSN instead of either Histidine, H in the Lm strain UKVDL4, the Lm strain UKVDL7 and the Lm strain UKVDL9 or Glutamine, Q in the Lm strain FSL-J1-158; (iii) Serine, S in the position 35 in the Lm strain AUF, the Lm strain EGD, the Lm strain EGD-e, the Lm strain 10403S, the Lm strain 4/52-1953 and the Lm strain FSL-J1-158 instead of Leucine, L in the Lm strain FDAAGROS_607, the Lm strain NTSN, the Lm strain UKVDL4, the Lm strain UKVDL7 and the Lm strain UKVDL999.
Also, 6 specific loci (agrA (lmo0051), ami (lmo2558), aut (lmo1076), inlG (lmo0262), tagB (lmo1088), and inlL (LMON_RS10535), were found only in the Lm strain AUF and were not identified in the Lm strain FDAARGOS_607, while only a single locus, inlC (lmo1786) was absent in the Lm strain EGD (locus identified in the Lm strain FDAARGOS_607). We found in the Lm strain AUF two loci, inlD (LMON_RS01345) and comK (LMOf2365_2303) that were not recognized in the Lm strain EGD-e (both of them were also identified in the Lm strain FDAARGOS_607). At least seven loci, prsA2 (lmo2219), fbpA (lmo1829), ami (lmo2558), aut (lmo1076), inlG (lmo0262), inlL (LMON_RS10535), and tagB (lmo1088) were undetectable in the Lm strain NTSN strain while present in the Lm strain AUF25. There were 12 out of 56 (21.5%) identical loci present in the Lm strain AUF and the Lm strain FSL-J1-158 of the genetic lineage IV, and from 13 out of 56 (23.2%) to 39 out of 56 (69.6%) in the Lm strains of the genetic lineage III. Apparently, either all or some of these loci could be potentially either directly or indirectly involved in the mechanisms of residual virulence of the Lm strain AUF.
Furthermore, 14 genes encoding biosynthesis of members of the internalin multigene family100, internalins (inlA (lmo0433), inlB (lmo0434), inlC (lmo1786), inlC2 (LMON_RS01340), inlD (LMON_RS01345), inlE (lmo0264), inlF (lmo0409), inlG, inlH (lmo0263), inlI (lmo0333), inlJ (lmo2821), inlK (lmo1290), inll and inlP (lmo2470) were found in the Lm strain AUF and the majority of the Lm strains belonging to the genetic lineages I-II, while rarely detected in the representatives of the genetic lineages III-IV (Table 4). Comparative analysis of the main internalin genetic characteristics showed that the majority of genes and the relevant allelic profiles of the Lm strain AUF were almost identical to those found in the Lm strain EGD and the Lm strain 10403S strains. The inlC was present in the Lm strain AUF and absent in the Lm strain EGD. The difference between the Lm strain AUF and the Lm strain 10403S was found only in the alleles for the inlB. Only a single identical allele (inlG) was present in both the Lm strain AUF and the Lm reference strain EGD-e. No identical allele profiles for these genes were found between the Lm strain AUF and both Lm strains of the genetic lineage I, the Lm strain FDAARGOS_607, and the Lm strain NTSN. The Lm strain AUF, likewise other Lm strains of the genetic lineage II, the Lm strain 10403S, the Lm strain EGD, and the Lm reference strain EGD-e, showed a certain diversity in internalin gene compositions versus the Lm strains of the genetic lineages III-IV, similarly to other reports100,101,102. No internalin genes, including inlA and inlB, in the form of either pseudogenes or truncated variants were present in the Lm strain AUF and the Lm strain 10403S, the Lm strain EGD, and the Lm strain EGD-e unlike the Lm reference strains of both these genetic lineages (Table 4). However, there was a marked polymorphism in the multiple sequence alignment of inlA96,103 and inlB96,104. The Lm strain AUF together with the Lm strain EGD and the Lm strain 10403S differed from other Lm reference strains by the presence of 3 major amino acid substitutions in InlA protein as T51A (Alanine instead of Threonine), S187N (Asparagine instead of Serine), and A594P (Proline instead of Alanine). Further, we found in the same protein of the Lm strain AUF and 3 other phylogenetically close Lm strains, the Lm strain EGD, the Lm strain EGD-e and the Lm strain 10403S at least 10 amino acid substitutions, such as R3K (Lysine instead of Arginine), S142T (Threonine instead of Serine), N474S (Serine instead of Asparagine), S476P (Proline instead of Serine), Y530H (Histidine instead of Tyrosine), T648S (Serine instead of Threonine), T664A (Alanine instead of Threonine), N738D (Aspartate instead of Asparagine), I781L (Leucine instead of Isoleucine) and V790M (Methionine instead of Valine)97,103. More differences were found for inlB alignments. The InlB amino acid sequences of the Lm strain AUF and both the Lm strain EGD and the Lm strain 10403S had a single amino acid substitution in position 396 resulting in a change of Alanine to Threonine (A396T). The Lm strain AUF demonstrated 23 either single or double substitutions as amino acid changes in the InlB identical to those found in three genetically related Lm strains, the Lm strain EGD, the Lm strain EGD-e and the Lm strain 10403S, which were not identified among other Lm reference strains96,104. There were such changes as: L176I (Isoleucine instead of Leucine), S205A (Alanine instead of Serine), S246P (Proline instead of Serine), T/M251S (Serine instead of Threonine/Methionine), I262T (Threonine instead of Isoleucine), I291T (Threonine instead of Isoleucine), S373N (Asparagine instead of Serine), M387V (Valine instead of Methionine), E446K (Lysine instead of Glutamate), I479M (Methionine instead Isoleucine), I483R (Arginine instead Isoleucine), P486S (Serine instead Proline), A489S (Serine instead of Alanine), TL501-502KH (Lysine-Histidine instead of Threonine-Leucine), D533G (Glycine instead of Aspartate), I555K (Lysine instead of Isoleucine), IQ558-559TR (Threonine-Arginine instead of Isoleucine-Glutamine), GN568-569AG (Alanine-Glycine instead of Glycine-Asparagine), V578A (Alanine instead of Valine), S580N (Asparagine instead of Serine), W584R (Arginine instead of Tryptophan), T594K (Lysine instead of Threonine), RT599-600CQ (Cysteine-Glutamine instead of Arginine-Threonine). As expected, phylogenetically, all the internalins of the Lm strain AUF formed a single branch with only the Lm strain EGD and the Lm strain 10403S (Fig. 3a,b,e,f,h,j,k,l), being additionally clustered with the Lm strain EGD-e (Fig. 3c,d) but not with other Lm strains (Fig. 3a-l). Moreover, in the Lm strain AUF some of the internalins, inlA, inlB, inlC2, inlD, inlE and inlG were found as internalin gene clusters105, inlAB and inlC2DEG, while others, inlC, inlF, inlH, inlI, inlJ, inlK, inlL and inlP were located on the chromosome outside any cluster (Fig. 4.1) similarly with the majority of the Lm reference strains (Fig. 4.2, 4.3, 4.4, 4.5, 4.6, 4.7, 4.8, 4.9, 4.10, 4.11). Two identical internalin gene clusters were identified in the Lm strain EGD (Fig. 4.3) and in the Lm strain 10403S (Fig. 4.11). The internalin gene cluster inlAB relevant to the Lm strain AUF was also revealed in the Lm strain EGD-e and almost all the Lm reference strains (Fig. 4.4, 4.5, 4.7, 4.9, 4.10) except the Lm strain UKVDL7 (Fig. 4.8). The second internalin cluster inlC2DEG identified in the Lm strain AUF, the Lm strain EGD and in the Lm strain 10403S was only partially presented in other Lm reference strains as either inlC2DE in the Lm strain NTSN (Fig. 4.5) or inlDH in the Lm strain FDAARGOS_607 (Fig. 4.4), or inlEH in the Lm strain UKVDL4 (Fig. 4.7), or the Lm strain UKVDL7 (Fig. 4.8), or inlDEH in the Lm strain UKVDL9 (Fig. 4.9). The position of both internalin gene clusters was on the relevant Lm chromosome region on about 2.0 Mbp (Fig. 4.1), while for the Lm strain EGD-e, the Lm strain EGD, the Lm strain NTSN, the Lm strain UKVDL4, the Lm strain UKVDL9 and the Lm strain 10403S these loci were found within 0.3 – 0.5 Mbp (Fig. 4.2, 4.3, 4.5, 4.7, 4.9, 4.11) or about 0.9 – 1.1. Mbp for the Lm strain FDAARGOS_607 (Fig. 4.4), or 0.5 Mbp for the Lm strain FSL-J1-158 and the Lm strain 4/52-1953 (Fig. 4.6, 4.10). Probably, some of these genes can encode for the products potentially involved in the “residual virulence” of the Lm strain AUF, resulting in occasional vaccine-related adverse effects in vaccinated animals.
Phylogenetic analysis of the internalins in the Lm strain AUF and the reference Lm strains. (a) the Lm internalin A. (b) the Lm internalin B. (c) the Lm internalin C. (d) the Lm internalin C2. (e) the Lm internalin D. (f) the Lm internalin E. (g) the Lm internalin F. (h) the Lm internalin H. (i) the Lm internalin I. (j) the Lm internalin J. (k) the Lm internalin K. (l) the Lm internalin P. Phylogenetic trees were constructed using MEGA-7 (https://www.megasoftware.net/). The Bootstrap value is 100.
Linear maps of chromosomes of 10 Lm strains indicating virulence genes, including internalins. (1) The Lm strain AUF. (2) The Lm strain EGD-e. (3) The Lm strain EGD. (4) The Lm strain FDAARGOS_607. (5) The Lm strain NTSN. (6) The Lm strain FSL-J1-158. (7) The Lm strain UKVDL4. (8) The Lm strain UKVDL7. (9) The Lm strain UKVDL9. (10) The Lm strain 4/52-1953. (11) The Lm strain 10403S. Vertical bars reflect CDSs.
Five antibiotic resistance genes were annotated in the Lm strain AUF related to Listeria antibiotic resistance (fosX (lmo1702), fosfomycins106; mprF (lmo1695), cationic antimicrobial peptides107; lin (lmo0919), lincosamides108; norB (lmo2818), quinolones109; and sul (lmo0224), sulfonamides109), which were similar to those existing in two fully virulent Lm strains of the lineage I, the Lm strain FDAARGOS_607 and the Lm strain NTSN, and both homologs, the Lm strain EGD, the Lm strain EGD-e and the Lm strain 10403S (the lineage II), as well as in the Lm strain 4/52-1953 isolate of the genetic lineage III (Table 3). Four of the same five genes, fosX, mprF, lin and sul, were found in the Lm strain UKVDL4. Only some of these genes were annotated in other Lm strains of the lineages III-IV, the Lm strain UKVDL7 (n = 1), the Lm strain UKVDL9 (n = 2), and the Lm strain FSL-J1-158 (n = 1)25.
No metal resistance genes were found in the Lm strain AUF using BIGSdb-Lm database (https://bigsdb.pasteur.fr/listeria/)25. Similarly, no relevant genes were revealed in the majority of other Lm reference strains while cadmium resistance genes, cadA and cadC 110,111, were identified in the Lm strain EGD-e and the Lm strain FDAARGOS_607 (Table 3)25. We found no significant difference in the number of genes involved in motility in the Lm strain AUF compared with other Lm strains. However, the number of Lm Genomic Islands-associated genes found in the Lm strain AUF (n = 2) was markedly (18 times) lesser those present exclusively in the Lm strain FDAAGROS_607 (n = 36).
Quantitatively, the number of genes of SSI-1 (n = 5) in the Lm strain AUF strain was the same only with the closest homologs, the Lm strain EGD, the Lm strain EGD-e and the Lm strain 10403S, and exceeded those which were annotated for other reference strains independently from their genetic lineage, including both fully virulent strains, the Lm strain FDAARGOS_607, and the Lm strain NTSN, by 2.5 - 5 times. In contrast to the Lm strain AUF, only a single gene, SSI1_lmo0444 (lmo0444), was found in the Lm strain FDAARGOS_607, the Lm strain 4/52-1953, the Lm strain UKVDL7, the Lm strain NTSN and the Lm strain FSL-J1-158. Two genes of the SSI-2, associated in Lm with a tolerance to alkaline and oxidative stresses, SSI2_lin0464 (lin0464) and SSI2_lin0465 (lin0465)112 and absent in the Lm strain AUF, were revealed in the Lm strain UKVDL4 and the Lm strain UKVDL925.
Importantly, we found pronounced polymorphisms in the majority of the alleles of the genes responsible for antibiotic resistance and motility, SSI-1 and determination of Lm Genomic Islands in the Lm strain AUF compared with the other Lm strains used25. The Lm strain AUF demonstrated almost total identity exclusively with only the Lm strain EGD and the Lm strain 10403S but not with other Lm strains independently from their genetic lineage, ST, and CC.
In order to examine the specific contribution of UVR to the inactivation of the Lm strain AUF genes, we compared all the 17 relevant pseudogenes found in the genome of this strain with other Lm genomes available in GenBank using the BLAST resource (https://blast.ncbi.nlm.nih.gov/Blast.cgi). We identified only four putative genes, No. 2 (locus tag: GZH80_03675), No. 4 (locus tag: GZH80_06950), No. 6 (locus tag: GZH80_08750), and No. 13 (GZH80_11730), which were assigned in the Lm strain AUF as pseudogenes, while in the majority of other Lm strains (up to 99-100 of 100 strains available) were annotated as genes encoding functional protein products113. These proteins were the following: (i) 100 out 100 Lm strains: tetratricopeptide repeat protein, sensor histidine kinase; and amino acid permease, and (ii) 99 out of 100 Lm strains: a hypothetical protein which was found as the pseudogene with 100% identity to the Lm strain AUF in only a single Lm strain 3453. The relevant genes demonstrated homology on the level of 92.87 – 99.93% versus the Lm strain AUF as a result of the presence of either frameshift mutations in genes No. 2, 6, and 13, or missing C-terminus in the gene No. 4. The products expressed by these genes are involved in bacterial pathogenesis by controlling the expression of virulence, biofilm formation, protein-protein interactions, antimicrobial resistance, quorum sensing and signaling27,114,115,116. The relevant mutants in wild Lm strains have been reported to be defective for intracellular growth and cell-to-cell spread and were severely attenuated for virulence in a mouse model114. At the same time, when the genome of the Lm strain AUF was compared with the genomes of its closest homolog, the Lm strain EGD, we found only a single pseudogene (1/17 representation in the Lm strain AUF), No. 9 (LMON_0171), with possible expression of only a truncated hypothetical protein113. Furthermore, in the genome of the Lm strain 10403S four of 17 pseudogenes related to those in the Lm strain AUF78 due to frameshift mutations were annotated, such as: No. 7 (LMRG_02848), No. 11 (LMRG_00154), No. 12 (LMRG_00322) and No. 15 (LMRG_02951). We believe some of these genes could be damaged by UVR resulting in the UVR-induced DNA mutations leading to the Lm strain AUF attenuation.
Usage Notes
Currently, we present the complete genome sequence and the gene annotations for the Lm strain AUF, which has been used for decades as a live whole-cell veterinary vaccine. We compared the Lm strain AUF genome with the whole genomes of reference fully virulent Lm strains, of the Lm isolates derived from animals with listeriosis, and the Lm strain used for the development of live attenuated vaccine vectors against cancer. We believe that the data obtained will be useful to unravel the mechanisms of attenuation and virulence in Listeria and other pathogenic microorganisms. However, it is very difficult to make objective conclusions on the Lm strain AUF attenuation in the absence of the wild parental Lm strain ‘A’. Nevertheless, we hope that these data will be valuable as the basic platform for development of the effective and safe new-generation vaccine(s) with improved characteristics for prophylaxis of listeriosis in animals worldwide .
Code availability
Software used in the generation or processing of our data is stated in the Methods section. Detailed information including versions of software and database are provided in Table 5.
References
Seeliger, H. P. R. Listeriosis-history and actual developments. Infection 16, S80–S84 (1988).
Lecuit, M. Listeria monocytogenes, a model in infection biology. Cellular microbiology 22, e13186 (2020).
Farber, J. M. & Peterkin, P. Listeria monocytogenes, a food-borne pathogen. Microbiological reviews 55, 476–511 (1991).
Dhama, K. et al. Listeriosis in animals, its public health significance (food-borne zoonosis) and advances in diagnosis and control: a comprehensive review. Vet Q 35, 211–235 (2015).
Koopmans, M. M., Brouwer, M. C., Vázquez-Boland, J. A. & van de Beek, D. Human Listeriosis. Clin Microbiol Rev 36, e00060192023 (2023).
Luo, X. & Cai, X. A combined use of autolysin p60 and listeriolysin O antigens induces high protective immune responses against Listeria monocytogenes infection. Curr Microbiol 65, 813–8 (2012).
Félix, B. et al. A European-wide dataset to uncover adaptive traits of Listeria monocytogenes to diverse ecological niches. Sci Data 9, 190 (2022).
Murray, E. G. D., Webb, R. E. & Swami, M. B. R. A disease of rabbits characterised by a large mononuclear leucocytosis, caused by a hitherto undescribed bacillus Bacterium monocytogenes (n.sp.). J Pathol Bacteriol 29, 407–439 (1926).
Williams, E. N., Van Doren, J. M., Leonard, C. L. & Datta, A. R. Prevalence of Listeria monocytogenes, Salmonella spp., Shiga toxin-producing Escherichia coli, and Campylobacter spp. in raw milk in the United States between 2000 and 2019: A systematic review and meta-analysis. J Food Prot 86, 100014 (2023).
Esteban, J. I., Oporto, B., Aduriz, G., Juste, R. A. & Hurtado, A. Faecal shedding and strain diversity of Listeria monocytogenes in healthy ruminants and swine in Northern Spain. BMC Vet Res 5, 2 (2009).
Gaulin, C., Ramsay, D. & Bekal, S. Widespread listeriosis outbreak attributable to pasteurized cheese, which led to extensive cross-contamination affecting cheese retailers, Quebec, Canada, 2008. J Food Prot 75, 71–78 (2012).
Jackson, K. A. et al. Multistate outbreak of Listeria monocytogenes associated with Mexican-style cheese made from pasteurized milk among pregnant, Hispanic women. J Food Prot 74, 949–953 (2011).
Johnsen, B. O., Lingaas, E., Torfoss, D., Strom, E. H. & Nordoy, I. A large outbreak of Listeria monocytogenes infection with short incubation period in a tertiary care hospital. J Infect 61, 465–470 (2010).
Altissimi, C., Noé-Nordberg, C., Ranucci, D. & Paulsen, P. Presence of Foodborne Bacteria in Wild Boar and Wild Boar Meat-A Literature Survey for the Period 2012-2022. Foods 12, 1689 (2023).
Thomas, J. et al. Outbreak of Listeriosis in South Africa Associated with Processed Meat. N Engl J Med 382, 632–643 (2020).
Bevilacqua, A. et al. Microbiological Risk Assessment in Foods: Background and Tools, with a Focus on Risk Ranger. Foods 12, 1483 (2023).
Ali, S. & Alsayeqh, A. F. Review of major meat-borne zoonotic bacterial pathogens. Front Public Health 10, 1045599 (2022).
Jibo, G. G. et al. A systematic review and meta-analysis of the prevalence of Listeria monocytogenes in South-East Asia; a one-health approach of human-animal-food-environment. One Health 15, 100417 (2022).
Huang, C., Lu, T. L. & Yang, Y. Mortality risk factors related to listeriosis - A meta-analysis. J Infect Public Health 16, 771–783 (2023).
Osek, J. & Wieczorek, K. Listeria monocytogenes-How This Pathogen Uses Its Virulence Mechanisms to Infect the Hosts. Pathogens 11, 1491 (2022).
Pogreba-Brown, K. et al. Complications Associated with Foodborne Listeriosis: A Scoping Review. Foodborne Pathog Dis 19, 725–743 (2022).
Tan, W., Wang, G., Pan, Z., Yin, Y. & Jiao, X. Complete Genome Sequence of Listeria monocytogenes NTSN, a Serovar 4b and Animal Source Strain. Genome Announc 3, e01403–14 (2015).
Kichemazova, N. V., Khizhnyakova, M. A., Lyapina, A. M., Kolosova, A. A. & Feodorova, V. A. Vaccine prophylaxis of listeriosis in farm animals. J. Veterinariya 3, 18–25 (2023).
Selivanov, A. V., Kotylev, O. A. & Sedov, N. K. Listeriosis vaccine from strain AUF listeriae. Veterinariia 12, 38–39 (1974).
Feodorova, V. et al. Supplementary Table 1. figshare https://doi.org/10.6084/m9.figshare.24298378.v6 (2024).
Moura, A. et al. Whole genome-based population biology and epidemiological surveillance of Listeria monocytogenes. Nature microbiology 2, 1–10 (2016).
Blatch, G. L. & Lässle, M. The tetratricopeptide repeat: a structural motif mediating protein-protein interactions. Bioessays 21, 932–9 (1999).
Dussurget, O. New insights into determinants of Listeria monocytogenes virulence. Int Rev Cell Mol Biol 270, 1–38 (2008).
Gelbíčová, T., Kolácková, I., Pantucek, R. & Karpíšková, R. A novel mutation leading to a premature stop codon in inlA of Listeria monocytogenes isolated from neonatal listeriosis. New Microbiol 38, 293–296 (2015).
Maury, M. M. et al. Uncovering Listeria monocytogenes hypervirulence by harnessing its biodiversity. Nat Genet 8, 308–313 (2016).
Quereda, J. J., Meza-Torres, J., Cossart, P. & Pizarro-Cerdá, J. Listeriolysin S: A bacteriocin from epidemic Listeria monocytogenes strains that targets the gut microbiota. Gut Microbes 8, 384–391 (2017). Erratum for: Addendum to, Quereda, J. J. et al. Bacteriocin from epidemic Listeria strains alters the host intestinal microbiota to favor infection. Proc Nat Acad Sci USA 113, 5706–11 (2016).
Lakicevic, B. Z., Den Besten, H. M. W. & De Biase, D. Landscape of Stress Response and Virulence Genes Among Listeria monocytogenes Strains. Front Microbiol 12, 738470 (2022).
Ryan, S., Begley, M., Hill, C. & Gahan, C. G. A five-gene stress survival islet (SSI-1) that contributes to the growth of Listeria monocytogenes in suboptimal conditions. J Appl Microbiol 109, 984–995 (2010).
Sichtig, H. et al. FDA-ARGOS is a database with public quality-controlled reference genomes for diagnostic use and regulatory science. Nat Commun 10, 3313 (2019).
Database for Reference Grade Microbial Sequences (FDA-ARGOS) https://www.fda.gov/medical-devices/science-and-research-medical-devices/database-reference-grade-microbial-sequences-fda-argos (2023).
Reference genome ASM19603v1 European Consortium. Strain: EGD-e. https://www.ncbi.nlm.nih.gov/datasets/taxonomy/169963/ (2003).
Genome assembly ASM19603v1, reference https://www.ncbi.nlm.nih.gov/datasets/genome/GCF_000196035.1/.
Bécavin, C. et al. Comparison of widely used Listeria monocytogenes strains EGD, 10403S, and EGD-e highlights genomic variations underlying differences in pathogenicity. mBio 5, e00969–14 (2014).
Maciag, P. C., Radulovic, S. & Rothman, J. The first clinical use of a live-attenuated Listeria monocytogenes vaccine: A Phase I safety study of Lm-LLO-E7 in patients with advanced carcinoma of the cervix. Vaccine 27, 3975–3983 (2009).
Flickinger, J. C. Jr., Rodeck, U. & Snook, A. E. Listeria monocytogenes as a Vector for Cancer Immunotherapy: Current Understanding and Progress. Vaccines (Basel) 6, 48 (2018).
Gunn, G. R. et al. Two Listeria monocytogenes vaccine vectors that express different molecular forms of human papilloma virus-16 (HPV-16) E7 induce qualitatively different T cell immunity that correlates with their ability to induce regression of established tumors immortalized by HPV-16. J Immunol 167, 6471–9 (2001).
Gilley, R. P. & Dube, P. H. Checkpoint blockade inhibitors enhances the effectiveness of a Listeria monocytogenes-based melanoma vaccine. Oncotarget 11, 740–754 (2020).
Oladejo, M., Paterson, Y. & Wood, L. M. Clinical Experience and Recent Advances in the Development of Listeria-Based Tumor Immunotherapies. Front Immunol 12, 642316 (2021).
Wood, L. M. & Paterson, Y. Attenuated Listeria monocytogenes: a powerful and versatile vector for the future of tumor immunotherapy. Front Cell Infect Microbiol 4, 51 (2014).
Bespalova, T. Y. et al. Novel Sequence Types of Listeria monocytogenes of Different Origin Obtained in the Republic of Serbia. Microorganisms 9, 1289 (2021).
Zaitsev, S. S., Khizhnyakova, M. A. & Feodorova, V. A. Retrospective Investigation of the Whole Genome of the Hypovirulent Listeria monocytogenes Strain of ST201, CC69, Lineage III, Isolated from a Piglet with Fatal Neurolisteriosis. Microorganisms 10, 1442 (2022).
Zaitsev, S., Khizhnyakova, M. A. & Feodorova, V. A. Data of de novo genome assembly of the Listeria monocytogenes vaccine strain AUF. GenBank https://www.ncbi.nlm.nih.gov/nuccore/CP048400 (2020).
Li, W. et al. RefSeq: expanding the Prokaryotic Genome Annotation Pipeline reach with protein family model curation. Nucleic Acids Res 49, D1020–D1028 (2021).
Bertels, F., Silander, O. K., Pachkov, M., Rainey, P. B. & van Nimwegen, E. Automated reconstruction of whole-genome phylogenies from short-sequence reads. Mol Biol Evol 31, 1077–88 (2014).
Kumar, S., Stecher, G. & Tamura, K. MEGA7: Molecular Evolutionary Genetics Analysis Version 7.0 for Bigger Datasets. Mol Biol Evol 33, 1870–4 (2016).
Olson, R. D. et al. Introducing the Bacterial and Viral Bioinformatics Resource Center (BV-BRC): a resource combining PATRIC, IRD and ViPR. Nucleic Acids Res 51, D678–D689 (2023).
Grant, J. R. et al. Proksee: in-depth characterization and visualization of bacterial genomes Nucleic Acids Research, gkad326, https://doi.org/10.1093/nar/gkad326 2023.
Mason, C. et al. FDA dAtabase for Regulatory Grade micrObial Sequences (FDA-ARGOS): Supporting development and validation of Infectious Disease Dx tests. GenBank https://www.ncbi.nlm.nih.gov/nuccore/CP041014.1 (2019).
Glaser, P. et al. Comparative genomics of Listeria species. GenBank https://www.ncbi.nlm.nih.gov/nuccore/AL591824 (2001).
Becavin, C. et al. Comparison of widely used Listeria monocytogenes strains EGD, 10403S, and EGD-e highlights genomic variations underlying differences in pathogenicity. GenBank https://www.ncbi.nlm.nih.gov/nuccore/HG421741.1 (2014).
Bechtel, T. & Gibbons, J. Genome sequence of three persistent L. monocytogenes strains isolated from a food processing facility and a livestock outbreak. GenBank https://www.ncbi.nlm.nih.gov/nuccore/CP090057.1 (2022).
Tan, W., Wang, G., Pan, Z., Yin, Y. & Jiao, X. Complete Genome Sequence of Listeria monocytogenes NTSN, a Serovar 4b and Animal Source Strain. GenBank https://www.ncbi.nlm.nih.gov/nuccore/CP009897.1 (2015).
Senay, T. E. et al. Neurotropic Lineage III Strains of Listeria monocytogenes Disseminate to the Brain without Reaching High Titer in the Blood. GenBank https://www.ncbi.nlm.nih.gov/nuccore/CP065028.1 (2020).
Senay, T. E. et al. Neurotropic Lineage III Strains of Listeria monocytogenes Disseminate to the Brain without Reaching High Titer in the Blood. GenBank https://www.ncbi.nlm.nih.gov/nuccore/CP076644.1 (2021).
Zaitsev, S., Khizhnyakova, M. A., Filonova, N. & Feodorova, V. A. Data of de novo genome assembly of the Listeria monocytogenes strain of zoonotic origin. GenBank https://www.ncbi.nlm.nih.gov/nuccore/CP048401.1 (2020).
Senay, T. E. et al. Neurotropic Lineage III Strains of Listeria monocytogenes Disseminate to the Brain without Reaching High Titer in the Blood. GenBank https://www.ncbi.nlm.nih.gov/nuccore/CP076669.1 (2020).
Borowsky, M. et al. The Genome Sequence of Listeria monocytogenes strain 10403S. GenBank https://www.ncbi.nlm.nih.gov/nuccore/NC_017544.1 (2010).
Wellcome Trust Sanger Institute. GenBank https://www.ncbi.nlm.nih.gov/nuccore/LR134483.1 (2018).
Quereda, J. J. et al. Listeria valentina sp. nov., isolated from a water trough and the faeces of healthy sheep. GenBank https://www.ncbi.nlm.nih.gov/nuccore/JAATJW010000100 (2020).
Schardt, J., Mueller-Herbst, S., Scherer, S. & Huptas, C. Direct Submission. GenBank https://www.ncbi.nlm.nih.gov/nuccore/LARY01000001 (2015).
Den Bakker, H. C. et al. Listeria floridensis sp. nov., Listeria aquatica sp. nov., Listeria cornellensis sp. nov., Listeria riparia sp. nov. and Listeria grandensis sp. nov., from agricultural and natural environments. GenBank https://www.ncbi.nlm.nih.gov/nuccore/NZ_AODF00000000 (2014).
Chiara, M. et al. Draft genome sequences of L. newyorkensis and L. fleischmannii clarify phylogenetic relationships within the genus Listeria and provide additional insights into the evolution of pathogenicity in Listeria sensu strictu. GenBank https://www.ncbi.nlm.nih.gov/nuccore/KQ130608 (2015).
Weller, D. et al. Listeria ohnekaius sp. nov. and Listeria portnoyii sp. nov. isolated from non-agricultural and natural environments. GenBank https://www.ncbi.nlm.nih.gov/nuccore/NZ_JABJVM000000000.1 (2023).
Bell, G., Cornelius, A. & Taylor, W. Listeria newyorkensis isolated from tuaki (NZ cockles). GenBank https://www.ncbi.nlm.nih.gov/nuccore/NZ_CP113980.1 (2023).
Den Bakker, H. C. et al. Listeria floridensis sp. nov., Listeria aquatica sp. nov., Listeria cornellensis sp. nov., Listeria riparia sp. nov. and Listeria grandensis sp. nov., from agricultural and natural environments. GenBank https://www.ncbi.nlm.nih.gov/nuccore/NZ_AODE00000000.1 (2014).
Whitman, W. Genomic Encyclopedia of Type Strains, Phase III (KMG-III): the genomes of soil and plant-associated and newly described type strains. GenBank https://www.ncbi.nlm.nih.gov/nuccore/NZ_SNZK01000001 (2019).
Weller, D., Andrus, A., Wiedmann, M. & den Bakker, H. C. Listeria booriae sp. nov. and Listeria newyorkensis sp. nov., from food processing environments in the USA. GenBank https://www.ncbi.nlm.nih.gov/nuccore/NZ_JNFA00000000.1 (2015).
Loessner, M. J., Inman, R. B., Lauer, P. & Calendar, R. Complete nucleotide sequence, molecular analysis and genome structure of bacteriophage A118 of Listeria monocytogenes: implications for phage evolution. GenBank https://www.ncbi.nlm.nih.gov/nuccore/NC_003216.1 (2000).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR25210281 (2023).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR25180708 (2023).
Feodorova, V. et al. Supplementary Table 2. figshare https://doi.org/10.6084/m9.figshare.24299374.v1 (2023).
Espinoza-Mellado, M. D. R. & Vilchis-Rangel, R. E. Review of CRISPR-Cas Systems in Listeria Species: Current Knowledge and Perspectives. Int J Microbiol 2022, 9829770 (2022).
Mihara, T. et al. Linking virus genomes with host taxonomy. Viruses 8, 66 (2016).
Loessner, M. J., Inman, R. B., Lauer, P. & Calendar, R. Complete nucleotide sequence, molecular analysis and genome structure of bacteriophage A118 of Listeria monocytogenes: implications for phage evolution. Mol Microbiol 35, 324–40 (2000).
Loessner, M. J., Wendlinger, G. & Scherer, S. Heterogeneous endolysins in Listeria monocytogenes bacteriophages: a new class of enzymes and evidence for conserved holin genes within the siphoviral lysis cassettes. Mol Microbiol 16, 1231–41 (1995).
Hu, M. et al. A Meta-Analysis and Systematic Review of Listeria monocytogenes Response to Sanitizer Treatments. Foods 12, 154 (2022).
McCafferty, D. G. & Melvin, J. A. Handbook of Proteolytic Enzymes (Academic Press is an imprint of Elsevier, 2013).
Klebba, P. E., Charbit, A., Xiao, Q., Jiang, X. & Newton, S. M. Mechanisms of iron and haem transport by Listeria monocytogenes. Mol Membr Biol 29, 69–86 (2012).
Gray, B., Hall, P. & Gresham, H. Targeting agr- and agr-Like quorum sensing systems for development of common therapeutics to treat multiple gram-positive bacterial infections. Sensors (Basel) 13, 5130–66 (2013).
Autret, N., Raynaud, C., Dubail, I., Charbit, A. & Berche, P. Identification of the agr locus of Listeria monocytogenes: Role in bacterial virulence. Infec Immun 71, 4463–4471 (2003).
Rieu, A., Weidmann, S., Garmyn, D., Piveteau, P. & Guzzo, J. Agr system of Listeria monocytogenes EGD-e: Role in adherence and differential expression pattern. Appl Environ Microbiol 73, 6125–6133 (2007).
Riedel, C. U. et al. AgrD-dependent quorum sensing affects biofilm formation, invasion, virulence and global gene expression profiles in Listeria monocytogenes. Mol Microbiol 71, 1177–1189 (2009).
Gupta, V.K. & Jindal, V. in Integrated Pest Management Current Concepts and Ecological Perspective (ed. Abrol D. P.) Ch. 16 (Elsevier, 2014).
Vornhagen, J. et al. The Klebsiella pneumoniae citrate synthase gene, gltA, influences site specific fitness during infection. PLoS Pathog 15, e1008010 (2019).
Cabanes, D., Dussurget, O., Dehoux, P. & Cossart, P. Auto, a surface associated autolysin of Listeria monocytogenes required for entry into eukaryotic cells and virulence. Mol Microbiol 51, 1601–1614 (2004).
Wang, L. & Lin, M. A novel cell wall-anchored peptidoglycan hydrolase (autolysin), IspC, essential for Listeria monocytogenes virulence: Genetic and proteomic analysis. Microbiol 154, 1900–1913 (2008).
Leimeister-Wächter, M., Haffner, C., Domann, E., Goebel, W. & Chakraborty, T. Identification of a gene that positively regulates expression of listeriolysin, the major virulence factor of Listeria monocytogenes. Proc Natl Acad Sci USA 87, 8336–8340 (1990).
Mengaud, J. et al. Pleiotropic control of Listeria monocytogenes virulence factors by a gene that is autoregulated. Mol Microbiol 5, 2273–2283 (1991).
De las Heras, A., Cain, R. J., Bielecka, M. K. & Vázquez-Boland, J. A. Regulation of Listeria virulence: PrfA master and commander. Curr Opin Microbiol 14, 118–127 (2011).
Feodorova, V. et al. Supplementary Table 4. figshare https://doi.org/10.6084/m9.figshare.24298378.v1 (2023).
Feodorova, V. et al. Alignment of the nucleotide (A) and amino acid (B) sequences of the prfA gene derived from the Listeria monocytogenes strains AUF, EGD-e, EGD, NTSN, UKVDL4, UKVDL9, UKVDL7, 4/52-1953, FDAARGOS 607 and FSL-J1-158, respectively. figshare https://doi.org/10.6084/m9.figshare.24796788.v5 (2024).
Portnoy, D. A., Jacks, P. S. & Hinrichs, D. J. Role of hemolysin for the intracellular growth of Listeria monocytogenes. J Exp Med 167, 1459–1471 (1988).
Feodorova, V. et al. Alignment of the nucleotide (A) and amino acid (B) sequences of the hly gene derived from the Listeria monocytogenes strains AUF, EGD, EGD-e, FDAARGOS 607, FSL-J1-158, UKVDL7 and 4/52-1953. figshare https://doi.org/10.6084/m9.figshare.24796863.v6 (2024).
Ireton, K., Mortuza, R., Gyanwali, G. C., Gianfelice, A. & Hussain, M. Role of internalin proteins in the pathogenesis of Listeria monocytogenes. Mol Microbiol 116, 1407–1419 (2021).
Tsai, Y. H., Orsi, R. H., Nightingale, K. K. & Wiedmann, M. Listeria monocytogenes internalins are highly diverse and evolved by recombination and positive selection. Infect Genet Evol 6, 378–89 (2006).
Rychli, K. et al. Listeria monocytogenes Isolated from Illegally Imported Food Products into the European Union Harbor Different Virulence Factor Variants. Genes (Basel) 9, 428 (2018).
Feodorova, V. et al. Alignment of the nucleotide (A) and amino acid (B) sequences of the inlA gene derived from the Listeria monocytogenes strains AUF, EGD, EGD-e, FDAARGOS 607, NTSN, UKVDL4, 4/52-1953 and UKVDL7. figshare https://doi.org/10.6084/m9.figshare.24796872.v4 (2024).
Feodorova, V. et al. Alignment of the nucleotide (A) and amino acid (B) sequences of the inlB gene derived from the Listeria monocytogenes strains AUF, EGD, EGD-e, FDAARGOS 607, NTSN, UKVDL4, 4/52-1953 and FSL-J1-158. figshare https://doi.org/10.6084/m9.figshare.24796878.v4 (2024).
Raffelsbauer, D. et al. The gene cluster inlC2DE of Listeria monocytogenes contains additional new internalin genes and is important for virulence in mice. Mol Gen Genet 260, 144–58 (1998).
Fillgrove, K. L., Pakhomova, S., Schaab, M. R., Newcomer, M. E. & Armstrong, R. N. Structure and mechanism of the genomically encoded fosfomycin resistance protein, FosX, from Listeria monocytogenes. Biochemistry 46, 8110–8120 (2007).
Thedieck, K. et al. The MprF protein is required for lysinylation of phospholipids in listerial membranes and confers resistance to cationic antimicrobial peptides (CAMPs) on Listeria monocytogenes. Mol Microbiol 62, 1325–1339 (2006).
Dar, D. et al. Term-seq reveals abundant ribo-regulation of antibiotics resistance in bacteria. Science 352, aad9822 (2016).
Haubert, L., Kremer, F. S. & da Silva, W. P. Whole-genome sequencing identification of a multidrug-resistant Listeria monocytogenes serotype 1/2a isolated from fresh mixed sausage in southern Brazil. Infect Genet Evol 65, 127–130 (2018).
Lebrun, M., Audurier, A. & Cossart, P. Plasmid-borne cadmium resistance genes in Listeria monocytogenes are similar to cadA and cadC of Staphylococcus aureus and are induced by cadmium. J Bacteriol 176, 3040–3048 (1994).
Parsons, C., Lee, S. & Kathariou, S. Heavy Metal Resistance Determinants of the Foodborne Pathogen Listeria monocytogenes. Genes (Basel) 24, 11 (2018).
Harter, E. et al. Stress Survival Islet 2, Predominantly Present in Listeria Monocytogenes Strains of Sequence Type 121, Is Involved in the Alkaline and Oxidative Stress Responses. Appl Environ Microbiol 83, e00827–17 (2017).
Feodorova, V. et al. Supplementary Table 3. figshare https://doi.org/10.6084/m9.figshare.24299377.v4 (2024).
Sun, A. N., Camilli, A. & Portnoy, D. A. Isolation of Listeria monocytogenes small-plaquemutants defective for intracellular growth and cell-to-cell spread. Infect Immun 58, 3770–3778 (1990).
Vaval, T. D. M., Xayarath, B. & Freitag, N. E. Two Permeases Associated with the Multifunctional CtaP Cysteine Transport System in Listeria monocytogenes Play Distinct Roles in Pathogenesis. Microbiol Spectr 11, e0331722 (2023).
Ishii, E. & Eguchi, Y. Diversity in Sensing and Signaling of Bacterial Sensor Histidine Kinases. Biomolecules 11, 1524 (2021).
Glaser, P. et al. Comparative genomics of Listeria species. Science 294, 849–52 (2001).
Toledo-Arana, A. et al. The Listeria transcriptional landscape from saprophytism to virulence. Nature 459, 950–6 (2009).
Chatterjee, S. S. et al. Intracellular gene expression profile of Listeria monocytogenes. Infect Immun 74, 1323–38 (2006).
Den Bakker, H., Bowen, B., Rodriguez-Rivera, L. & Wiedmann, M. FSL J1-208, a Virulent Uncommon Phylogenetic Lineage IV Listeria monocytogenes Strain with a Small Chromosome Size and a Putative Virulence Plasmid Carrying Internalin-Like Genes. Applied and environmental microbiology 78, 1876–89 (2012).
Senay, T. E. et al. Neurotropic Lineage III Strains of Listeria monocytogenes Disseminate to the Brain without Reaching High Titer in the Blood. mSphere 5, e00871–20 (2020).
Acknowledgements
The reported study was funded by Russian Science Foundation, the project number 22-16-00165. We would like to thank Dr. Daria G. Oglodina for her technical assistance in the manuscript preparation.
Author information
Authors and Affiliations
Contributions
V.A.F. provided funding, designed and supervised the study, analysed the data, and wrote the manuscript, M.A.K. optimized and performed the sequencing, S.S.Z., and M.S.L. performed the bioinformatic analyses and prepared graphical visualization of the data obtained, Y.V.S. contributed in computational analysis, A.D.Z., and O.S.L. provided material support, and edited the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Feodorova, V.A., Zaitsev, S.S., Khizhnyakova, M.A. et al. Complete genome of the Listeria monocytogenes strain AUF, used as a live listeriosis veterinary vaccine. Sci Data 11, 643 (2024). https://doi.org/10.1038/s41597-024-03440-8
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41597-024-03440-8






