Introduction

Antimicrobial resistance (AMR) has been described as a silent pandemic, emerging as one of the most critical public health challenges of the 21st century, causing millions of deaths annually and posing a significant economic threat1,2,3,4.

Studies have shown that the discharge of wastewater containing antibiotic residues, antibiotic resistant bacteria (ARB), antibiotic resistance genes (ARGs), and mobile genetic elements (MGEs) is a significant contributor to the spread of AMR5,6. In particular, effluents originating from hospitals consist of a complex mixture of hazardous contaminants. Such contaminants include pharmaceutically active compounds, hormones, heavy metals, disinfectants, radioisotopes, X-ray contrast media, and pathogenic microbes, along with antibiotic residues and associated resistance genes7,8,9. Antibiotics administered to patients, once excreted, subsequently enter the hospital wastewater effluents, resulting in a heightened risk of transmitting antimicrobial resistance from the microbiota of hospitalized patients in clinical settings to downstream environments10,11.

Although recently developed and approved drugs have shown promise for the treatment of life-threatening infections caused by multidrug resistant (MDR) microorganisms, emerging resistance to these novel molecules has recently been reported12. This is in part due to the misuse of antibiotics, which has compromised their effectiveness, ultimately exacerbating the spread of drug-resistant infections5,13. Bacteria can acquire antibiotic resistance through two primary mechanisms: (1) spontaneous mutations of chromosomal genes (resulting in the production of ARGs) and (2) acquisition of ARGs from other strains of the same or different species via horizontal gene transfer (HGT). The latter has been shown to significantly contribute to the rapid spread of resistance. Exchange of genetic information between two organisms through HGT primarily involves conjugation via plasmids, transduction mediated by bacteriophages, and natural transformation facilitated by extracellular DNA released by dead ARB14,15. An acquired resistome through HGT involves MGEs such as plasmids, transposons, and integrons, which are responsible for transferring resistance16.

The spread of ARGs is strongly influenced by a range of socio-economic factors, including limited access to quality healthcare, inadequate wastewater treatment, weak regulatory governance, and overcrowded living conditions17,18. These challenges are particularly pronounced in developing countries such as Lebanon. Lebanon presents an interesting case in assessing ARG dissemination due to its unique demographic pressures as it hosts the highest number of refugees per capita and per square kilometer globally. Additionally, its large expatriate population with frequent returns for visits, far exceeds the number of residing citizens, leading to periodic surges in population density19,20. These dynamics have placed significant strain on the country’s already fragile healthcare and economic infrastructure21, potentially accelerating the spread of ARGs.

A recent review article discussing the status of carbapenem-resistant Enterobacterales (CRE) in Lebanon highlighted the emergence of CRE pathogens in hospital isolates. These primarily included Escherichia coli and Klebsiella pneumoniae, with numerous reported cases over the years attributed to several factors, including the Syrian refugee crisis, water contamination, and antimicrobial misuse22. With respect to hospital wastewater specifically, other work has detected occurrence of NDM-1-producing Enterobacter cloacae, in addition to extended-spectrum beta-lactamase (ESBL) producing isolates of E. coli and Klebsiella spp.23. Still, no studies to date have investigated the antibiotic resistome, mobilome, or microbiome of Lebanese hospital wastewaters using molecular methods on untreated effluent samples.

Globally, a wide range of work has investigated the occurrence of ARGs in hospital wastewater environments. Such studies have commonly utilized both quantitative methods (e.g., qPCR) and high-throughput methods (e.g., shotgun metagenomics)24,25,26. Published research using qPCR, in particular, has been valuable for determining potential co-occurrences of ARGs with MGEs and/or establishing correlations between ARGs and bacterial groups11,27,28. Other work has utilized qPCR to assess the impacts that on-site hospital wastewater treatment can have on ARG and bacterial removal rates29,30. Still, most studies that have employed microbial analysis in combination with qPCR for ARG detection have focused on a small number of hospital sampling sites (i.e., n ≤ 3), with only one such study assessing 10 or more hospitals’ wastewaters31. Such larger-scale campaigns are, however, valuable for gaining a more universal perspective on the state of antibiotic resistance spread within a region/country.

The present study is the first to comprehensively characterize the resistome (ARGs), mobilome (MGEs), and microbial community structure in hospital wastewaters across Lebanon, and to integrate this data with wastewater physicochemical properties. This combined approach provides novel insights into the co-occurrence patterns among ARGs, MGEs, and microbial communities, while also identifying potentially influencing wastewater effluent attributes.

Materials and methods

Sample collection and processing

A total of 13 wastewater samples were collected from Lebanese hospitals in sterilized amber glass 1-L bottles (PP28, LBG, Labbox, Spain), and designated sample IDs as HWW1 through HWW13. Information regarding the sampled hospitals (including region, capacity, etc.) is provided in Table S1 of the supplementary information (SI). After collection, the samples were transported to the laboratory under aseptic conditions in refrigerated containers and processed within 12 h. The bottles were mixed to ensure sample homogeneity prior to any analysis or filtration-based concentration. A vacuum microfiltration system was used to filter 100 mL of each wastewater sample through a 0.22 μm mixed cellulose ester (MCE) membrane filter (Microlab Scientific) to be used for subsequent DNA extraction.

Total genomic DNA was extracted from the membrane filters using the DNeasy PowerWater Kit (QIAGEN, Germany) according to manufacturer’s instructions. The DNA was eluted in 100 µL of TE buffer solution (Tris-EDTA, 1X Solution, pH 7.4, Sigma-Aldrich) and passed through the spin filter twice to maximize yield. Purity and quality of the extracted DNA were confirmed by measuring the A260:A280 ratio using a Nanodrop ND-1000 spectrophotometer (Thermo Scientific, USA). DNA samples were then stored at -20 °C until further analysis.

Methods for physicochemical characterization

The collected wastewater samples were initially tested and analyzed for several water quality parameters. pH was measured using a bench pH meter (WPA CD 500, UK), and electrical conductivity was measured using a conductivity meter (Sartorius-PT-20, USA). Chemical Oxygen Demand (COD) was tested following the USEPA digestion method and measured using method 8000 of a Hach DR3900 spectrophotometer (HACH, USA). Total phosphorus (TP), total nitrogen (TN), total Kjeldahl nitrogen (TKN), ammonia-nitrogen (NH3-N), and sulfate were measured using the DR 3900 spectrophotometer according to methods 8190, 10,072, 10,242, 10,031, and 10,227, respectively.

Total organic carbon (TOC) was calculated as the difference between total carbon (TC) and inorganic carbon (IC) measured with the TOC-L/SSM5000A analyzer (Shimadzu, Japan). For TOC analysis, 1 mL of homogenous samples was weighed in a ceramic boat, covered with ceramic fiber, then transferred into the Solid Sample Module (SSM-5000 A) at an oxidizing furnace temperature of 900 °C for determination of TC content. The IC content was transformed to CO2 at a temperature of 200 °C using concentrated phosphoric acid. The SSM was equipped with purified oxygen at 200 kPa and the carrier gas flow was set at 0.5 L/min.

Microbial community characterization

DNA extracted from all samples was sent to Novogene Bioinformatics Technology Co., Ltd. (Beijing, China) for 16 S rRNA gene amplicon sequencing using the Illumina NovaSeq 6000 sequencing platform. PCR amplification of the V3-V4 region of the 16 S rRNA gene was performed using the primer set 341 F (5ʹ-CCTAYGGGRBGCASCAG-3ʹ) and 806R (5ʹ-GGACTACNNGGGTATCTAAT-3ʹ), resulting in a 470 bp fragment, followed by library preparation to generate 250 bp paired-end raw reads.

The Galaxy Mothur Toolset was employed for bioinformatics data processing and analysis, with the Silva 132 reference database used for alignment and the Ribosomal Database Project (RDP) 16 S rRNA training set No. 19 utilized for classification32. The VSEARCH algorithm was applied to filter chimeric sequences. Sequences were clustered into operational taxonomic units (OTUs). Alpha diversity parameters were calculated using Mothur. The dataset was subsampled to an even depth across the samples, using the lowest number of sequence reads detected. The representative sequences obtained were compared with all 16 S rRNA sequences available from the National Center for Biotechnology Information (NCBI) using the nucleotide Basic Local Alignment Search Tool (BLAST). A similarity of ≥ 97% was considered the threshold for identification at the species level.

ARG and MGE assessment by high-throughput quantitative PCR (HT-qPCR)

To assess abundances of ARGs and MGEs, DNA samples were sent for high-throughput quantification at the Resistomap Oy Laboratory (Helsinki, Finland) using the SmartChip™ Real-Time PCR system (Takara Bio, CA, USA). A total of 96 validated primer sets targeting major classes of ARGs and MGEs were employed. These included 79 resistance genes that confer resistance to sulfonamides (2), quinolones (2), aminoglycosides (12), phenicol (8), macrolide-lincosamide-streptogramin B (MLSB) (8), multidrug resistance (MDR) (9), tetracyclines (9), β-lactams (18), vancomycin (2), trimethoprim (3), colistin, and others. Additionally, 16 primer sets targeted genes representing MGEs including integrons, plasmids, transposons, and insertion sequences. The 16 S rRNA gene was also quantified as a basis for determining normalized gene abundances. The primer sequences selected for this study were based on those previously developed33. Detailed information regarding primer sets used is provided in Tables S2 and S3 of the SI.

Amplification for each primer set was conducted in triplicate using the conditions described by Wang et al. (2014)34. This involved an initial enzyme activation step at 95 °C for 10 min, followed by 40 cycles of denaturation at 95 ℃ for 30 s and annealing at 60 ℃ for 30 s. To ensure reaction specificity and prevent false positives, particularly for low-abundance genes, no-template control (NTC) was included to monitor potential contamination. Additionally, melting curve analysis and PCR efficiency assessment were performed on all the samples for each primer set. Amplicons with non-specific melting curves and multiple peaks, as indicated by the slope of melting profile, were considered false positives. Reactions with amplification efficiency outside the range of 1.8 to 2.2 were excluded from further analysis. To ensure reproducible detection, a threshold cycle (Ct) of 27 was used as the detection limit, and only amplifications observed in all replicates were considered positive35. The mean Ct of three technical replicates for each qPCR reaction was used to calculate the ΔCt values. Genes quantified in only one of the three technical replicates were excluded from the analysis. These quality-control measures collectively ensured reliable detection and high reproducibility of qPCR. The 2−ΔCt method was used to calculate abundances of detected genes relative to the 16 S rRNA gene in each sample. The results are expressed as gene copies per 16 S rRNA gene36.

Statistical analysis and data visualization

Heatmaps were generated using the “ComplexHeatmap” (v2.21.1) package in RStudio (version 2024.09.0-375). Spearman correlations between microbes and genetic elements were calculated using the “Hmisc” package (v5.1-3) in RStudio. Spearman’s rank correlation coefficients (ρ) were also calculated to determine correlations between measured water quality parameters and different bacterial groups, ARGs, and MGEs. Values of individual variables were ranked, and the Spearman coefficient was calculated using the Correlation function in Microsoft Excel. It should be noted that Spearman’s correlation provides insight into potential associations rather than direct functional relationships.

The association between the hospital wastewater microbiome and the resistome-mobilome was examined using Procrustes analysis performed on Bray-Curtis distance matrices created from both the microbial matrix and the normalized abundance matrix of ARG-MGEs. Principal coordinates analyses (PCoA) of both matrices were conducted and utilized for the Procrustes analysis. The first two PCoA axes of each matrix were used to calculate the symmetric Procrustes correlation coefficients, the measure of fit (sum of squared distances), the p-value, and the Procrustes residuals. The Procrustes correlations were visualized using “ggplot2” (v3.5.1). Microbiome and resistome clustering were also performed on PCoA coordinates using the k-means function from the “stats” package (v4.4.2). The optimal number of clusters was determined using the elbow method. A Permutational Analysis of Variance (PERMANOVA) was conducted using the adonis2 function with 999 permutations to assess the significance of both clusters and geographic regions. PERMANOVA and Procrustes analysis were conducted using the “vegan” package (v2.6-10) in RStudio. Network analysis was performed to investigate the co-occurrence patterns of genetic elements (ARGs and MGEs) and microbes. The correlation network was visualized using the interactive platform Gephi (0.10.1)37 under the Fruchterman-Reingold layout algorithm38.

Results and discussion

Water quality-associated parameters

The physicochemical characterization of the hospital wastewater samples is presented in Table 1. Sample HWW7 had the highest values for multiple parameters (TP 97.6 mg/L, TN 224 mg/L, COD 1137 mg/L, NH3-N 175.6 mg/L, and TOC 614.3 mg/L). Organic matter content in terms of COD varied widely across samples (between 14 and 1137 mg/L), with the Bekaa region having the highest average values at 437 ± 188 mg COD/L. These findings fall within a range of values similar to those previously reported for hospital wastewaters39,40. Sample conductivity ranged between 2.1 and 1928 µS/cm, representing the largest variability among all parameters. Overall, differences in physicochemical parameters between samples did not appear to be definitively affected by a hospital’s region or size.

Table 1 Physicochemical characterization of hospital wastewater samples.

Spearman coefficients calculated between parameters showed that TP, TN, TKN, sulfate, and NH3-N were all positively and significantly correlated for all 13 samples (Table S4). This was exemplified by the simultaneously highest and lowest measured parameters in HWW7 and HWW13, respectively. COD was also correlated with TN, NH3-N, and TOC. These observations confirm that hospital wastewater nutrient components such as TP, TN, TKN, and NH3-N exhibit more closely associated fluctuation patterns compared to sulfate or TOC. Ammonium and phosphorus concentrations in hospital wastewaters have been shown to have the most direct role in shaping their associated microbial communities41.

Microbial community profiles

Bacillota, Pseudomonadota, Bacteroidota, Campylobacterota, and Actinomycetota were the most dominant phyla in the hospital wastewater samples; this is reflective of previous findings42,43. A total of 90 microbial groups with relative abundances of over 1% in at least one sample were identified at the genus or species level. Common gut-associated microbes and potentially pathogenic genera, including Arcobacter, Enterococcus, Acinetobacter, Escherichia, and Aeromonas, were dominant in most samples (Figure S1).

Nonetheless, each of the hospital wastewaters assessed in this study exhibited a relatively distinct microbial composition, with notable differences in dominant groups and their relative abundance levels (Fig. 1). Arcobacter aquimarinus and Acinetobacter johnsonii were only dominant in HWW1 (respective relative abundances of 16.7% and 9.7%). Likewise, Megamonas funiformis was only predominant in HWW2 (58.7%) and Aeromonas veronii in HWW10 (25.2%). Aliarcobacter cryaerophilus was also highly abundant in both HWW1 and HWW10 (21.7% and 22.3%, respectively). Segatella copri and Leyella stercorea were most abundant in HWW7, HWW11, and HWW13, all of which are located in the North region. Particularly, Leyella stercorea was nearly three times higher in HWW7 and coincided with the highest recorded values of several physicochemical parameters (Table 1). Also, Parabacteroides chartae, unclassified Azonexaceae, and unclassified Rhodocyclales were exclusively significantly detected in HWW8 (8.6%, 11.9% and 11.7%, respectively). This is indicative of HWW8’s especially unique microbial profile. Incidentally, this was the only governmental hospital (Table S1), suggesting that patient type and medical treatment administered play an important role in shaping the microbial communities of discharged wastewater.

Fig. 1
figure 1

Relative abundances of microbial groups with ≥ 5% in at least one sample across all hospital wastewaters. The values have been normalized to total 100% within each sample, based solely on the displayed groups.

Alpha diversity indices were calculated to assess community richness and diversity (Table S6). This analysis showed the highest overall diversity (Shannon Index, H’) in samples HWW6 (Beirut, H’ = 4.95) and HWW9 (Mount Lebanon, H’ = 4.81), followed by similar values in the Bekaa and North region samples (average H’ of 3.65 ± 0.65 and 3.64 ± 0.47, respectively). HWW6, the only hospital located in the capital (Beirut), had a significantly higher richness value (2395) as compared to other samples. Richness was shown to significantly correlate with certain microbial groups, such as Arcobacter aquimarinus (ρ = -0.596, Table S5), indicating that other opportunistic pathogens may also be influenced in such samples. These trends suggest a certain level of geographic influence on microbial community structure; with urban hospitals (Beirut and Mount Lebanon) likely treating a more heterogeneous patient population compared to the North region, other selective pressures in the North may have had a greater influence (Table S1).

Assessment of ARGs and MGEs

Among the 95 targeted genes (excluding the 16 S rRNA gene), the total number detected per hospital wastewater sample ranged from 86 to 89 (shown in Table S7). The primary mechanisms of antibiotic resistance were antibiotic deactivation and cellular protection (mechanisms shown in Table S2). In contrast, ARGs associated with efflux pumps contributed the least to resistance in the hospital wastewaters. 62 of the analyzed genes (48 ARGs and 14 MGEs) were detected at a normalized abundance greater than 0.01 gene copies per 16 S rRNA gene copy in at least one sample (Fig. 2). Numerical values of normalized abundance of genes in all wastewater samples are provided in Supplementary Material 1.

Fig. 2
figure 2

Heatmap of normalized abundances of ARGs and MGEs (log-scale) detected using high-throughput qPCR in all samples. Genes are categorized by class, and only those with a normalized abundance of ≥ 0.01 in at least one sample are displayed. The abundances of all detected ARGs and MGEs are available in the (Supplementary Material 1).

Absolute abundances of both ARGs and MGEs were highest in HWW10, with a total of 7.44 × 1010 copies/100 mL, and lowest in HWW13 (Supplementary Material 2, Figure S2A). Calculated Spearman’s rank correlation coefficients indicated that classes including phenicol, quinolone, tetracycline, and trimethoprim were significantly positively correlated with each of TP, TN, TKN, and NH3-N (Table S5). Similarly, HWW7, having the highest measured concentrations of several physicochemical parameters, also had the highest overall abundance of tetracycline-associated ARGs (Figure S2B). These observations align with previous research work that also indicated that the phosphorus and nitrogen compounds in raw hospital wastewater have played an important role in shaping the ARG profiles41.

The heatmap showing normalized ARG abundances per 16 S rRNA gene (Fig. 2) revealed that the ARGs detected at the highest concentrations in the hospital wastewaters confer resistance to a majority of clinically important antibiotic classes, including the MLSB, β-lactams, aminoglycosides, tetracyclines, and sulfonamides, among others44. Such antibiotics are known to be extensively administered to hospitalized patients, which inevitably contributes to heightened levels of their associated resistance genes44,45,46. Conversely, ARGs associated with phenicol and trimethoprim had the lowest normalized abundance among all samples.

Figure 2 shows that genes encoding for integrons, along with the plasmid replication initiation protein repA, were consistently prevalent as MGEs in all samples (normalized abundance values ranging from 0.07 to 1). Interestingly, these gene types were also positively correlated with each of TN and TKN (Table S5). A high abundance of the repA gene indicates an increased likelihood of resistance transfer through plasmids. Class 1 and class 3 integrons (intI1 and intI3) have been frequently and extensively reported in wastewaters36,47,48 and are recognized as universal biomarkers for anthropogenic pollution49. Both of these integrons often carry multiple resistance genes and are commonly mobile on plasmids and transposons; this ultimately compounds the threat of their associated ARGs50. Exemplifying this, several samples displayed a high normalized abundance of the quaternary ammonium compound resistance gene protein (qacEdelta1), reaching up to 0.31 copies per 16 S rRNA. This gene is commonly embedded in class 1 integrons51, which likely explains its co-occurrence with intI1 in samples HWW5, HWW8, and HWW10.

The normalized abundance of intI2 was consistently low across all samples, ranging from 4.92 × 10− 5 to 5.54 × 10− 3 (Supplementary Material 1). Similar results were also recently reported by others52. The integron integrase encoded for by intI2 is inactive, resulting in a non-functional protein53,54.

The transposase-coding gene tnpA_1, commonly associated with the insertion sequence IS26 that plays an important role in the dissemination of resistance in Gram-negative bacteria55,56, exhibited especially high normalized abundances in many instances (reaching up to 0.36). Other MGEs, including the insertion sequences IS6100 and IS613, as well as other transposase-coding genes, were also prevalent among samples (values ranging from 0.02 to 0.31). Coincidentally, tnpA_1, tnpA_3, tnpA_6, and tnpA_7 transposase-coding genes were all positively correlated with the total phosphorus measured in the wastewater samples.

Sulfonamide resistance genes sul1 and sul2 were found to be highly prevalent across all samples (respective average normalized abundances of 0.46 and 0.23). Notably, the Guiana extended-spectrum β-lactamase gene (blaGES) was found in high abundance in five of the thirteen hospitals (0.16–0.74 copies per 16 S rRNA), four of which fall in the North region. Additional high-prevalence ARGs identified in the majority of wastewater samples included β-lactams (blaOXA10, cfxA, and blaTEM), tetracyclines (tet(O), tet(G), tet(Q), and tet(X)), aminoglycosides (aac(3)-iid_iia, aac6-aph2, spcN, aacA/aphD, aadA1, strB, and aph(3’’)-ia), MLSB (ermF, mphA, ereA, and ermB), multidrug (copA), vancomycin (vanA), and bacitracin resistance (bacA).

In the context of Lebanon, recent work has also reported high abundances of transposase genes in co-occurrence with a diverse range of ARGs in estuaries and rivers across the country57. It remains to be seen, though, whether certain combinations/ratios of physicochemical markers (such as TP, TN, and NH3-N) might in the future be utilizable as indicators of elevated waterborne risk associated with antibiotic resistance and its mobility.

Analysis of clinically relevant ARGs

According to the WHO AWaRe (Access, Watch and Reserve) classification58, the Reserve group of antibiotics include those that are designated as a last-resort option for treating infections caused by MDR organisms. From the genes targeted in this study, genes conferring resistance to Reserve group antibiotics were identified using the Comprehensive Antibiotic Resistance Database (CARD)59. Specific examples are mcr1 and mcr2 (colistin resistance), tet(X) (tigecycline resistance), and tet(M), tet(36), tet(W), tet(Q), tet(S), and tet(O) (minocycline resistance). The absolute copy numbers of these ARGs varied significantly across samples (Supplementary Material 2), with the highest levels observed in HWW1 (Bekaa), HWW7, HWW10, and HWW11 (the latter three located in the North region), potentially indicating the presence of localized resistance reservoirs.

Absolute abundances of the mcr1 gene ranged from 4.81 × 105 to 6.4 × 107 copies/100 mL; in contrast, mcr2 was only sporadically detected in three of the samples (ranging from 4.09 × 104 to 3.22 × 105 copies/100 mL). The mcr1 gene was first reported in Lebanon in an E. coli strain isolated from human clinical samples in 201960. One year later, another variant, mcr-8.1, was identified in a K. pneumoniae isolate obtained from a urine sample in Lebanon61. In general, elevated levels of these ARGs in specific hospital wastewaters underscores the potential for heightened risk of dissemination to the environment under specific circumstances and/or in specific regions.

The semi-synthetic lipoglycopeptide antibiotics dalbavancin and telavancin, both classified as Reserve group antibiotics, are considered ineffective against vancomycin-resistant enterococci (VRE) that carry the vanA gene62. vanA was detected at high concentrations in certain samples, reaching 6.03 × 108 copies/100 mL in HWW10, and was also found to be correlated with both TN (\(\:\rho\:\)=0.61) and TKN (\(\:\rho\:\)=0.621). A nationwide surveillance study conducted across 16 tertiary care centers in Lebanon from 2011 to 2013 reported a prevalence of 1% for VRE based on antimicrobial susceptibility profiles63. However, the incidence of VRE infections increased to 5.9% in 201864, and it still appears to be frequently detected in Lebanese hospitals currently65. The release of vanA from hospitals through their wastewaters raises specific concerns, especially given that vancomycin is the last-resort antibiotic for treating infections caused by Gram-positive bacteria such as methicillin-resistant Staphylococcus aureus66.

Other concerns associated with proliferation of carbapenem-resistant Enterobacterales (CRE) are often attributable to the genes responsible for production of carbapenemase. Different \(\:{\upbeta\:}\)-lactamase classes (A, B, C, and D) are known to produce different carbapenemases (e.g. Klebsiella pneumoniae carbapenemase – KPC, metallo-β-lactamases – MBLs, New-Delhi-metallo-β-lactamases – NDM, and oxacillinase (OXA)-48-like β-lactamases)67,68. Among these, blaKPC, blaNDM, and blaOXA−48 were consistently detected in all 13 hospital wastewater samples, with highest measured absolute abundances of 1.04 × 108, 4.25 × 107, and 1.17 × 108 copies/100 mL, respectively. Except for blaKPC, they were also found to be positively correlated with each of TP, TN, TKN, and NH3-N (Table S5). The first report of an NDM-1-producing K. pneumoniae in Lebanon was associated with an Iraqi expatriate in 201269,70. Three variants of NDM (NDM-1, NDM-4, and NDM-6) have since been identified in CRE, E. cloacae, E. coli, and K. pneumoniae, isolated from pathological specimens obtained from hospitalized patients in Lebanon. Of these, some E. coli isolates were found to co-harbor blaNDM−4 and mcr-1 genes71. Given their significance as last-resort antibiotics, novel variants of carbapenems such as meropenem-vaborbactam and imipenem-cilastatin are being considered for treatment of CRE infections, but research on implementation of such alternatives is ongoing67,72.

Correlation analysis between the Microbiome and resistome-mobilome

Clusters observed in Bray-Curtis-based PCoA plots were identified and related to variations in both the microbiome and resistome-mobilome of all hospital wastewater samples (Fig. 3-B). While alpha diversity patterns suggested potential regional variation, beta diversity analyses showed that geographic region alone had no significant effect on either (P > 0.05). However, nested PERMANOVA (regions within clusters) revealed statistically significant effects of regional variation on microbial community composition (R² = 0.97, P = 0.001) and resistome-mobilome profiles (R² = 0.85, P = 0.006), indicating localized geographic structuring within clusters. The previously noted higher ARG levels in certain North region hospitals reflect hospital-level variation rather than a broad regional differences. This region-based clustering was evident in the PCoA plots, which showed the grouping of Northern hospitals HWW7, HWW11, and HWW13 based on their microbiome, as well as HWW5, HWW8, and HWW10 based on their resistome-mobilome. Further, samples representing hospitals in the Beirut, Bekaa, and Mount Lebanon regions all showed notable similarity based on their microbiome PCoA, with the only exception being HWW1, which was clustered with HWW8 and HWW10 from the Northern region).

Procrustes analysis was conducted to elucidate the extent to which microbial community composition influenced the resistome-mobilome (Fig. 3-A). A significant correlation between the resistome-mobilome and the microbiome was observed (Procrustes sum of squares: M2 = 0.4665, Correlation: ρ = 0.7304, Significance: P = 0.002, Permutation: free, Number of permutations: 999). These findings highlight the influence of the microbial community on the resistome-mobilome in hospital wastewaters in Lebanon. Given that Procrustes analysis only provides the overall correlation between ARGs, MGEs, and the microbial communities, a Spearman’s correlation analysis was performed to identify specific relationships.

Fig. 3
figure 3

(A) Procrustes analysis of correlations between the microbiome and the resistome-mobilome. Arrows are shown between microbiome and resistome-mobilome coordinates for each HWW sample. Red triangles represent microbiome and blue squares represent resistome-mobilome. (B) Principal Coordinates Analysis (PCoA) of microbiome and resistome-mobilome composition based on Bray-Curtis dissimilarity matrices. Points represent samples, colored by statistically significant k-means clusters (k = 3) and shaped by geographic region, with dashed ellipses indicating 95% confidence intervals (for cluster with ≥ 3 samples). R² values and P-values from PERMANOVA are shown on respective plots. Axis labels indicate the percentage of variance.

Co-occurrence patterns of ARGs and MGEs

MGEs such as transposons and integrons commonly carry ARGs (one or multiple) that are highly prone to horizontal transfer between cells50,73. Due to this, previous works have extensively demonstrated that MGEs and ARGs are commonly co-occurring in various environmental settings74,75. To visualize specific relationships in the hospital wastewaters of this study, a heatmap showing the Spearman correlation coefficients between MGEs and ARGs in all samples was generated (Fig. 4). This analysis showed numerous significant correlations. intI1 was found to influence the most resistance genes (20 positive interactions), followed by the insertion sequence IS6100 (16 positive interactions). Transposons also co-occurred with several ARGs (e.g., aadA1, tet(M), tet(Q)) across multiple classes of antibiotics75,76.

MGEs such as intI1, IS6100, tnpA_1, tnpA_2, and tnpA_3 were significantly linked to a common array of ARGs, including sul1, qacEdelta1, cmlA, blaKPC, blaOXA10, and blaOXA1/blaOXA3044,55,56,77,78. The multidrug resistance gene copA was highly correlated to intI1 and intI3, while both tolC_1 and tolC_2 were significantly associated with IncI1_repI1, suggesting a potential plasmid-mediated dissemination of drug efflux-associated resistance79. Transposase encoding genes tnpA_6 and tnpA_7 displayed significant correlation with the MLSB resistance gene ermB, which has been previously reported80,81. More alarmingly, several MGEs showed significant positive correlations with clinically important ARGs such as vanA and vanB. Vancomycin resistance genes were found to co-occur with intI1, intI3, and IncI1_repI1, suggesting a considerable role for plasmids in the HGT of these ARGs. IncI1, one of the two most common plasmid types associated with the dissemination of antimicrobial resistance genes encoding for third-generation cephalosporins, was significantly correlated with aph(3’’)-ia, floR, and blaCTX−M, as has been previously reported82. Tetracycline resistance genes that are linked to resistance to Reserve group antibiotics, such as tet(Q), tet(M), tet(X), and tet(W) also showed significant positive correlations with tnpA_1, tnpA_5, Tp614, IS613, and repA.

Notable correlations were also observed among the MGEs themselves (Table S8), which may suggest simultaneous transfer of different types of MGEs. Transposition typically occurs when a transposase enzyme recognizes an insertion sequence in the same cell, and integrons are mobilized when present on plasmids or transposons. For instance, intI1 significantly correlated with repA (\(\:\rho\:\)=0.643), tnpA_2 (\(\:\rho\:\)=0.797), and tnpA_3 (\(\:\rho\:\)=0.731). intI1 is commonly located on a plasmid that contains repA, which is necessary for the plasmid replication process73,83. Further, tnpA_1 and tnpA_2, both of which are associated with insertion sequences of the IS6 group, were shown to correlate with IS6100 (belonging to the same family)55,56,84. Abundances of transposons were also previously found to be correlated with occurrence of class 1 integrons74. Overall, the high abundance of MGEs in this study, and their co-occurrence with a wide range of ARGs, underscores the critical need to improve understanding of MGE dynamics so as to limit the dissemination of antibiotic resistance through HGT.

Fig. 4
figure 4

Heatmap showing Spearman correlation coefficients between the normalized abundances of ARGs and MGEs in all hospital wastewater samples. The color gradient within the rectangular box indicates the strength of the Spearman correlation coefficient. Asterisks represent statistical significance correlations (* \(\:\rho\:\) ≥ 0.55, P < 0.05; ** \(\:\rho\:\) ≥ 0.68, P < 0.01).

Network analysis of relationships between microbial communities and ARGs/MGEs

The co-occurrence patterns between microbial genera and genetic elements (ARGs and MGEs) in samples were also examined by network analysis (Fig. 5). The network consisted of 91 nodes (representing 45 microbial taxa and 46 ARGs/MGEs) and 150 edges built from significant positive correlations (ρ > 0.55 and P < 0.05) and a modularity of 0.585. The phylum Pseudomonadota showed the highest number of positive interactions with resistance genes (72). Unclassified Rhodocyclales and Thauera phenylacetica were strongly associated with shared MGEs and ARGs, such as intI1, tnpA_2, tnpA_3, IS6100, qacEdelta1, and blaOXA1/blaOXA30, all of which were correlated (Fig. 4)85,86. These findings suggest that these genetic elements are co-transferred within the same bacterial groups in the wastewater.

Several potentially pathogenic genera, including Arcobacter, Enterococcus, Acinetobacter, Clostridium, Pseudomonas, Escherichia, Klebsiella, Enterobacter, and Aeromonas (among others, Table S9), demonstrated multiple associations with ARGs and MGEs (Figure S3 and Fig. 5), suggesting that these bacteria may serve as significant vectors for waterborne antibiotic resistance arising from clinical settings. Acinetobacter johnsonii was found to be correlated with the floR gene (conferring florfenicol resistance). The presence of the floR gene in Acinetobacter spp. of human origin has been recently reported87. A. johnsonii was also significantly correlated to the MDR gene acrA88.

E. coli were positively associated with genes encoding for multidrug efflux system proteins (acrA and mdtL)89 and MLSB resistance gene (ermX). Some resistance genes were shared among opportunistic pathogens belonging to the phylum Bacillota, such as ermX in Clostridium saudiense, Streptococcus pasteurianus, and Lactobacillus delbrueckii. This finding suggests the presence of common resistance mechanisms among these closely related taxa. In addition, both tet(W) and ermX were significantly linked with genera within Bacillota (including Blautia, Dialister, Mediterraneibacter, Anaerobutyricum, Streptococcus, and Dorea)41 (Figure S3). These gut microbiome-associated groups were found to co-occur in several hospital wastewater samples (Fig. 1).

Aliarcobacter cryaerophilus (Campylobacterota) was found to be a potential host for ARGs associated with phenicol ( catB3 and cmlA…) and vanA90.

Within the Bacteroidota phylum, genera belonging to the Prevotellaceae family, along with Leyella, Segatella, and Massiliprevotella, were positively correlated with common ARGs (tet(Q), ermF, and cfxA) and MGEs (Tp614)91. Similarly, Parabacteroides chartae and Bacteroides graminisolvens were both significantly correlated to tnpA_2 and IS6100, as well as to ARGs associated with these MGEs such as cmlA, qacEdelta1, sul1, blaVEB, and blaOXA1/blaOXA30.

vanA and vanB were found to be associated with multiple Gram-negative microbial genera (an exception being Enterococcus malodoratus) (Figure S3). These ARGs and Gram-negative microbes co-occurred with several MGEs, particularly intI1, IS6100, tnpA_2, and tnpA_3. Such findings suggest a potential for HGT of vancomycin resistance beyond their common enterococcal hosts. Although Gram-negative bacteria are intrinsically resistant to vancomycin, recently developed vancomycin conjugates have demonstrated promising efficacy through disruption of the membrane structure and integrity, enhancing cell permeability92,93. Alarmingly, these results suggest that Gram-negative bacteria harboring vanA and vanB genes could already have the potential to be resistant to these conjugates once clinically approved.

Beyond this, correlations observed in this analysis between mcr2 and Agathobacter rectalis (as well as between several tet genes and other non-pathogenic gut microbes) (Figure S3), raises concerns regarding the potential of clinically-influenced wastewater to serve as a reservoir for the rapid dissemination of resistance to Reserve antibiotics into downstream environments94.

Further, Streptococcus minor, unclassified Actinomycetaceae, and Propionibacteriaceae were also linked to ermB and tnpA_6, suggesting that tnpA_6 may play a role in the mobilization and dissemination of ermB within these bacterial hosts (Figure S3)95,96. These findings indicate that specific microbial genera found in hospital wastewaters can harbor multiple ARGs and MGEs that increase the risk of ARG dissemination into non-clinical environments.

Fig. 5
figure 5

Co-occurrence network analysis of the ARGs-MGEs and microbial taxa present at a normalized abundance of ≥ 2% in at least one hospital wastewater sample (Figs. 2 and S1). Node colors differentiate ARGs, MGEs, and phyla types. Node size is proportional to the number of connections (degree). Edges represent strong and statistically significant positive correlations (Spearman’s ρ > 0.55 and P < 0.05), with edge width proportional to correlation strength.

Conclusion

Hospital wastewater is known to serve as a unique reservoir for antibiotic resistant bacteria and their associated resistance genes. Given the wide range of potential influences on MGE-mediated HGT of ARGs in such environments, acute challenges with understanding (and ultimately limiting) the spread of resistance remain. The work presented here utilized a unique combination of high-throughput qPCR, molecular-based microbial community profiling, and physicochemical water quality characterization to gain globally-relevant insight into trends in ARG co-occurrence with a wide range of significant factors. This study is also the first comprehensive assessment of the antibiotic resistome, mobilome, and microbiome of hospital wastewaters in Lebanon.

Specific relationships between ARGs, MGEs, bacterial groups, and wastewater characteristics were identified in this work. The 13 hospital wastewaters surveyed were found to have widely varying microbial communities, which directly influenced their associated resistomes. ARGs conferring resistance to WHO-classified Reserve group antibiotics were prevalent and appeared to be influenced by hospital region and wastewater characteristics that included nitrogen and phosphorus content. MGEs were widespread across all samples; genes encoding for class 1 and class 3 integrons, as well as specific transposase encoding genes and insertion sequences, showed very high 16 S rRNA gene-normalized abundances. Many of the dominant MGEs and ARGs correlated with individual bacterial groups and physicochemical characteristics, suggesting that tailored water quality parameter-based treatment approaches for hospital wastewaters could be effective at reducing HGT-based spread of ARGs from hospitals. Future research should focus on expanding surveillance to include emerging resistance genes, particularly those that confer resistance to Reserve antibiotics. Additionally, research evaluating the effectiveness of wastewater treatment technologies in reducing the load of ARGs, MGEs, and associated microbial hosts in hospital effluents is needed.