Similarity of drug targets to human microbiome metaproteome promotes pharmacological promiscuity

Beaudoin, Christopher A.; Norget, Shannon; Omran, Ziad; Hala, Sharif; Daqeeq, Abdullah H.; Burnet, Philip W. J.; Blundell, Tom L.; van Tonder, Andries J.

doi:10.1038/s41397-025-00367-0

Article
Open access
Published: 17 April 2025

Christopher A. Beaudoin ORCID: orcid.org/0000-0002-0232-0281¹,
Shannon Norget²,
Ziad Omran³,
Sharif Hala⁴,⁵,
Abdullah H. Daqeeq⁶,
Philip W. J. Burnet⁷,
Tom L. Blundell⁸ &
…
Andries J. van Tonder ORCID: orcid.org/0000-0002-4380-5250⁹

The Pharmacogenomics Journal volume 25, Article number: 9 (2025)

Abstract

Similarity between candidate drug targets and human proteins is commonly assessed to minimize the occurrence of side effects. Although numerous drugs have been found to disrupt the health of the human microbiome, no comprehensive comparison between established drug targets and the human microbiome metaproteome has yet been conducted. Therefore, herein, sequence and structure alignments between human and pathogen drug targets and representative human gut, oral, and vaginal microbiome metaproteomes were performed. Both human and pathogen drug targets were found to be similar in sequence, function, structure, and drug binding capacity to proteins in diverse pathogenic and non-pathogenic bacteria from all three microbiomes. The gut metaproteome was identified as particularly susceptible overall to off-target effects. Certain symptoms, such as infections and immune disorders, may be more common among drugs that non-selectively target host microbiota. These findings suggest that similarities between human microbiome metaproteomes and drug target candidates should be routinely checked.

Introduction

Drug discovery campaign success has improved throughout the past three decades [1]. In recent years, an average of approximately 50 drugs per year have been approved by the US Food and Drug Administration Center for Drug Evaluation and Research for therapeutic use, which is an increase compared to the historic average of 34 drugs approved per year since 1993 [2]. Nevertheless, developing novel and safe drugs poses several challenges and requires significant time and resources to validate clinical efficacy, toxicity, sensitivity, and specificity [3, 4]. To expedite the process for discovering new drugs, several experimental and computational approaches have been developed to enable high throughput small molecule screening campaigns [5, 6]. Furthermore, significant advances in genome sequencing technologies [7,8,9] and protein structure determination methodologies (e.g. X-ray crystallography [10], cryo-EM [11]) have provided profound insights into drug binding sites and inhibitory mechanisms for various human and pathogen proteins. In combination with in vitro and in vivo functional and phenotypic screens [12], these data have allowed for the generation of numerous sets of principles that may inform selection of protein targets for the rational design of therapeutic small molecules [13]. The rational selection of a new drug target for human diseases and microbial pathogens requires several considerations: binding pocket features (e.g. depth, electrostatics), binding site conservation, gene expression level, and number of interacting pathways [14]. In all cases, the drug should be highly specific to one or a few targets in order to minimize off-target effects [15]. Checking the homology of the drug target to the human proteome is crucial to ensure that the functions of essential human proteins are not modulated [16]. Additional considerations in drug target selection may further enhance the success and safety of drug screening campaigns.

The human microbiome comprises all organisms living on the surface or within the human body [17]. The number of unique genes in the human microbiome has been estimated to be 100–150 times higher than the number of human genes [18, 19]. The abundance and functional features of microbial partners have been reported to be important for the maintenance of health at specific body sites [20]. Microbiota are also sensitive to perturbations resulting from changes in diet, location, circadian rhythm, and numerous other endogenous and exogenous factors [21]. Notably, although the effects of antibiotics on the human microbiome have been extensively surveyed, numerous human (non-antibiotic) drugs have been discovered to both affect microbial health and be metabolized by the microbiome [22, 23]. Such microbial metabolism may change the target-binding properties and off-target effects of a drug [19]. Although the composition of microbial species in a healthy microbiome differs between individuals from different genetic and geographic backgrounds, gene function in the metagenomes has been shown to be largely consistent, suggesting that gene function may be more indicative of the roles that microbes play in host-microbe interactions [20]. Therefore, novel insights into drug target selection parameters that can be modulated to minimize off-target effects on the microbiome may provide safer treatment options.

Several studies have previously proposed methodologies for predicting the non-selective targeting of microbiota, e.g. using machine learning models that incorporate drug chemical properties and microbial genome features [24, 25]. However, to date, no comprehensive comparison of the sequence homology and structural features of the drug binding pockets for human and pathogen drug targets with proteins in the human microbiome has been performed. Therefore, using sequence- and structure-based bioinformatics, the similarity and, thus, drug binding potential for 1346 FDA-approved drugs that collectively target 739 human and pathogen drug targets [26] were assessed on representative metaproteomes of the human gut, oral, and vaginal microbiomes [27]. In summary, both human and pathogen drug targets were found to be highly similar in sequence, structure, function, and drug-binding capacity to microbiome proteins. These results highlight the utility of checking sequence and structural homology of the phylogenetically diverse organisms in the microbiome to candidate drug targets for binding site selections. Further validation of microbiome drug target promiscuity may shed light on the etiology of unintended clinical outcomes.

Results

Sequence and functional similarities between drug targets and microbiome metaproteomes

Protein sequence homology has been widely used to infer structure and function of proteins. An amino acid sequence identity of over 30% has been suggested to be indicative of a common evolutionary ancestry and, thus, a similar structure, while a sequence identity above 40–60% has been suggested to be indicative of shared function [28,29,30,31,32]. Protein sequence homology has also been used to infer promiscuity of ligand binding [33]. Recently, Santos et al. reviewed the mechanism of action of FDA-approved drugs and their corresponding protein targets [26]. Additionally, the MGnify database [27, 34] has made available comprehensive datasets on the human gut, oral, and vaginal metagenome-assembled proteomes. The protein sequences correspond to 289,026, 1225, and 618 nonredundant metagenome-assembled genomes (MAGs) in the gut, oral, and vaginal microbiomes, respectively. Utilizing these combined data, the drug target protein sequences were mapped to the gut, oral, and vaginal metaproteome sequences to find similarities in amino acid sequence and function. BlastP was used to perform global alignments between the drug target and metaproteome sequences [35]. Sequence similarity between drug target and microbiome sequences may reveal microbiome species that are most likely to be non-selectively targeted.
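The identity thresholds described above can be illustrated with a minimal sketch. It assumes a pairwise alignment has already been produced (gaps written as "-") and computes identity over the columns where both sequences have a residue; other denominators are in common use, so this is one convention rather than the definition used in the study. The function names and the choice of 40% as the lower bound of the "shared function" range are assumptions made for illustration.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over aligned columns in which neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == "-" or b == "-":
            continue  # skip gap columns
        compared += 1
        if a == b:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


def homology_tier(identity: float) -> str:
    """Label an identity value using illustrative 30% and 40% cutoffs."""
    if identity > 40.0:
        return "possible shared function"
    if identity > 30.0:
        return "possible shared ancestry and structure"
    return "below threshold"
```

For example, `homology_tier(percent_identity("ACDE-F", "ACDQGF"))` compares five ungapped columns, four of which match.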

Among the total 737 drug target sequences, 126 total (77 human and 51 pathogen) were mapped with above a 30% global sequence identity to metaproteome sequences. Alignments of the microbiome metaproteomes and drug target sequences reveal differences in sequence identity distributions with respect to targeted organism (Fig. 1A). The pathogen drug target sequences reported higher average sequence identity than those of the human targets to the metaproteome sequences in all three microbiome datasets. As expected, no human targets were found to be identical to any metaproteome sequences, while 174, 22, and 20 unique metaproteome sequences were found to be identical to pathogen targets in the gut, oral, and vaginal microbiome, respectively. Notably, the average sequence identity and number of identical matches between the pathogen drug targets and the gut metaproteome (average: 70.4%), in particular, were much higher than oral (48%) and vaginal (46.3%) metaproteomes. Sequence identity distributions comparing the human targets and the three metaproteomes were all similar.

**Fig. 1: Sequence similarity between drug target and metaproteome sequences.**

More putative off-target species were found in common between the gut and oral metaproteomes than either of the two with the vaginal microbiome. Interestingly, 96% of the potential unintended target species were found to be specific to each microbiome, and only five species were found in common between all three microbiomes (Fig. 1B). The primary affected phyla in the gut and oral microbiomes for both human and pathogen drug targets were Proteobacteria, Firmicutes, Bacteroidota, and Actinobacteriota, while those in the vaginal microbiome largely comprised Bacteroidota, Bacillota, and Actinomycetota (Fig. 1C). Although the sequence identity distribution profiles between the three metaproteomes and human drug target sequences were similar, the number of metaproteome sequences found to be potentially targeted in the gut microbiome (19,369) is much higher than the oral (6980) and vaginal (4601) microbiomes. The number of species affected by drugs for pathogen infections followed a similar trend to the human drugs across the three microbiomes: gut: 35,695, oral: 23,168, vaginal: 18,343. The highest number of off-target species was reported for the human targets (2243), followed by those of Escherichia coli (1146), Mycobacterium tuberculosis (1060), Clostridioides difficile (990), and Staphylococcus aureus (863) (Fig. 1D). In summary, phylogenetically diverse bacteria may be susceptible to drugs irrespective of the targeted organism.
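The overlap comparison between microbiomes amounts to set arithmetic on the species lists. The sketch below is not the authors' code; it assumes each microbiome's putative off-target species are available as a Python set, and the function name and dictionary keys are hypothetical.

```python
def overlap_summary(gut: set, oral: set, vaginal: set) -> dict:
    """Count pairwise and three-way overlaps and the fraction of site-specific species."""
    union = gut | oral | vaginal
    # Species that appear in exactly one of the three microbiomes.
    site_specific = (
        (gut - oral - vaginal)
        | (oral - gut - vaginal)
        | (vaginal - gut - oral)
    )
    return {
        "gut_oral": len(gut & oral),
        "gut_vaginal": len(gut & vaginal),
        "oral_vaginal": len(oral & vaginal),
        "all_three": len(gut & oral & vaginal),
        "site_specific_fraction": len(site_specific) / len(union) if union else 0.0,
    }
```

Note that the pairwise counts here include species shared by all three sets; subtracting `all_three` gives the counts for species shared by exactly two microbiomes.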

Comparing the functions between mapped sequences may provide insights into which classes of drug targets might be differentially targeting metaproteomes. First, the functions of the drug targets were compared with the annotated functions of the metaproteome sequences, for which the data were available, to determine functional overlap. A sequence identity cutoff of over 50% was chosen based on previous findings, to reduce analytic constraints, and to validate sequence function comparisons [28, 29]. Among human targets, alcohol dehydrogenase mapped only to S-(hydroxymethyl)-glutathione dehydrogenase, and the other metaproteome functional annotations all mapped to identical or similar functions (Fig. 2A). The human target peptidyl-prolyl cis-trans isomerase protein sequence mapped to hypothetical proteins, peptidyl-prolyl cis-trans isomerases, and FK506-binding proteins in the metaproteomes. The annotated functions of the metaproteomes mapped to identical or similar functions for all pathogen targets (Fig. 2B). The dihydrofolate reductase, 30S ribosomal protein S9, and 50S ribosomal protein L3 were similar in identity to hypothetical proteins; and dihydrofolate reductase mapped to IS1595 family transposases in the metaproteomes. Overall, the functional similarities between the mapped drug target and metaproteome sequences are evident. Therefore, an analysis of the species affected by drug target class may shed light on which drugs are most pharmacologically promiscuous.

**Fig. 2: Relationship between target function and non-selectively targeted microbiome protein function.**

The phyla non-specifically targeted by the human drugs are varied among the targeted proteins. As shown in Fig. 3A, B, more phyla that represent the microbiome metaproteomes were found to be mapped to the human targets (24) than the pathogen targets (17). Most human and pathogen drug targets were found to be similar to Firmicutes, Proteobacteria, Bacteroidota, and Actinobacteriota, while five human drug targets (3-oxo-5-alpha-steroid 4-dehydrogenase, carbamoyl-phosphate synthase, neprilysin, dipeptidyl peptidase, sodium/glucose cotransporter 2) were highly if not exclusively similar to Bacteroidota. Proteobacteria are highly enriched in four pathogen drug target classes (D-alanyl-D-alanine carboxypeptidase, beta-lactamase, peptidoglycan synthase, and penicillin-binding proteins). The four pathogen drug targets (DNA gyrase, DNA topoisomerase, DNA-directed RNA polymerase, and ATP synthase) with the most similar metaproteome sequences were found to have nearly or more than double the number of matches of all other human or pathogen targets, thus highlighting the increased potential for off-target effects. The sequence and function similarities between drug targets and the metaproteome proteins provide novel insights into the potential for drug promiscuity among the human microbiome. Additional investigation into the conservation of residues and structural features that are critical for drug binding may further clarify the capacity for off-target protein-ligand interactions.

**Fig. 3: Relationship between target function and non-selectively targeted microbiome proteins.**

Structural and molecular similarities between drug targets and microbiome proteins

Protein-ligand interactions are largely determined by the structural and electrostatic complementarity between the drug and binding pocket. Drug promiscuity has been attributed to similarity between drug binding pockets in different target proteins [36]. Therefore, examining the differences in drug binding pockets between the drug target and metaproteome protein structures may lend more insights into off-target interactions with the human microbiome. All experimentally-determined protein structures resolved with the exact small molecule from the drug target dataset were extracted from the RCSB Protein Data Bank (PDB) [37]. In total, four human and three pathogen drug-bound target structures were available for further analysis (Table 1). As shown in Fig. 4A, the average sequence identity for the four human targets (41.8%) was lower than that of the three pathogen targets (54.0%). To practically test the drug promiscuity between phylogenetically distant organisms, the metaproteome sequence with the lowest identity to the drug target sequence was selected. Homology modelling of the microbiome protein structures was performed using the resolved drug target protein structures as templates. Subsequently, the corresponding drugs were docked into the metaproteome models based on the poses in the experimentally-determined drug target structures. The predicted affinity and protein-ligand interactions were generated and compared between the drugs with the microbiome and drug target proteins.

Table 1 Predicted protein-drug affinities for drug target and microbiome proteins.

Human targets

In descending order of average sequence identity to the metaproteome sequences, the human targets consisted of peptidyl-prolyl cis-trans isomerase FKBP1A, sodium/glucose cotransporter 2 (SLC5A2), acetylcholinesterase (ACHE), and carbonic anhydrase 2 (CA2). Inspection of the interactions between rapamycin and the FKBP1A models revealed three single amino acid differences in the selected microbiome protein drug binding pocket (Fig. 4B). Notably, these differences correspond to one residue insertion and to nonsynonymous variants, which may partially explain the decrease in predicted affinity of the microbiome FKBP1A to rapamycin compared to the human FKBP1A (Table 1). Interestingly, although SLC5A2 was second highest among the four human targets in average sequence identity, there were numerous variations in binding pocket residues, and the predicted affinity of empagliflozin with the microbiome SLC5A2 was approximately half of that with the human SLC5A2. The virtual docking of tacrine to the selected microbiome protein similar in sequence to human ACHE revealed a notable difference in the binding pocket: a tryptophan in the human ACHE exhibits pi stacking with tacrine, while this tryptophan is missing in the microbiome protein (Fig. 4C). Of note, this and all other microbiome proteins that mapped to the human ACHE were predicted to be para-nitrobenzyl esterases. The non-overlapping putative function likely explains the differences in active site residues and, potentially, drug binding. Although the microbiome CA2 sequences were the lowest in sequence identity to the human CA2, the predicted affinities were almost identical and only 2 out of 15 binding site residues were different (Fig. 4D). Remarkably, the human targets that mapped to the metaproteome sequences with the highest and lowest sequence similarities were found to have the most conserved binding pocket residues, while the binding pockets of the middle two were less likely to share structural features that enable high affinity interactions with the same drugs.
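A residue-level comparison of binding pockets like the one described above can be sketched as follows. This is an illustrative approach, not the procedure used in the study: it assumes a pairwise alignment of the target and microbiome sequences (gaps as "-") and a list of pocket positions numbered along the ungapped target sequence, starting at 1. The function name and inputs are hypothetical. Insertions in the microbiome protein that fall between pocket residues are not reported by this sketch.

```python
def pocket_differences(aligned_target: str, aligned_homolog: str, pocket_positions):
    """List (target_position, target_residue, homolog_residue) for pocket residues that differ."""
    if len(aligned_target) != len(aligned_homolog):
        raise ValueError("aligned sequences must have equal length")
    wanted = set(pocket_positions)
    differences = []
    position = 0  # running residue number along the ungapped target sequence
    for target_res, homolog_res in zip(aligned_target, aligned_homolog):
        if target_res == "-":
            continue  # insertion in the homolog; no target numbering to report
        position += 1
        if position in wanted and target_res != homolog_res:
            # homolog_res is "-" when the pocket residue is deleted in the homolog
            differences.append((position, target_res, homolog_res))
    return differences
```

Dividing the length of the returned list by the number of pocket positions gives a fraction comparable to the "2 out of 15" style of summary used in the text.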

Pathogen targets

In decreasing order of average sequence identity to the metaproteome sequences, the pathogen drug targets included DNA-directed RNA polymerase subunits (rpoA, rpoB, rpoC, rpoZ), dihydrofolate reductase (folA), and probable arabinosyltransferase A (embA). The available structure of the target DNA-directed RNA polymerase (PDB: 7l7b) shows that fidaxomicin is bound to three subunits (rpoB, rpoC, and rpoD). Therefore, the protein assembly was modelled using the sequences from the same MAG. Among the three subunits, a comparable predicted affinity and only four total single residue variants were discovered between the target (Clostridioides difficile) DNA-directed RNA polymerase and that of the human microbiome (Fig. 4E). Comparing the binding pockets of the microbiome and target (Mycobacterium tuberculosis) embA revealed only one single amino acid difference, although several residues surrounding and leading into the binding pocket were not identical. Two binding pocket residues were different between the target (Escherichia coli) and microbiome folA proteins (Fig. 4F). The predicted binding affinities for all three pathogen targets were comparable to their modelled microbiome counterparts. Overall, the pathogen targets were found to contain fewer structural and functional changes in the binding pockets between species even at approximately 30% global sequence identity. These data support the notion that drugs targeting pathogens may bind more promiscuously than human targets to microbiota proteins.

Clinical implications of drug promiscuity in the human microbiome

Nearly every aspect of human physiology (e.g. weight management, cardiovascular health, gut-brain axis) has been linked to the maintenance of microbiome composition and functional capacity [38]. The off-target effects of both antibiotics and non-antibiotic drugs on the human microbiome have been linked to various and diverse clinical manifestations [39]. Therefore, the data above may be used to better understand the effect of non-selective drug targeting on (1) the non-pathogenic bacteria in the gut, oral, and vaginal microbiomes and (2) the presence of side effects when comparing drugs that do or do not target proteins similar to the microbiome metaproteomes. Bartlett et al. [40] curated a list of 1513 bacterial species that have been recorded to lead to infection, even considering opportunistic infections. Additionally, the SIDER Side Effect Resource [41] presents all recorded side effects, as defined in the MedDRA [42], associated with the use of thousands of drugs. A better understanding of the clinical implications may help untangle the complexity of pharmacological promiscuity and guide clinical decision making for personalized cases based on microbiome data.

Non-selectively targeted pathogenic and non-pathogenic bacteria

Among the pathogenic species listed by Bartlett et al., 285 and 313 were found to be non-selectively targeted by human and pathogen drug targets, respectively (Fig. 5A). A total of 1876 and 1987 other bacterial species – referred to as non-pathogenic bacteria – were mapped to the human and pathogen drug targets, respectively. Interestingly, the numbers of affected species for the human and pathogen drug targets were similar across the gut, oral, and vaginal microbiomes. Notably, more than half of the affected species are non-pathogenic, although the proportions are different for the gut microbiome than the oral and vaginal. The disproportionate number of non-pathogenic species affected in the gut microbiome provides additional data that suggest that the gut microbiome may be more susceptible to non-selective targeting than the oral and vaginal microbiomes. The number of shared affected non-pathogenic bacteria between the microbiomes shows a similar trend to the overall data, wherein the gut and oral microbiomes have more overlapping off-target species (Fig. 5B). In summary, diverse and nearly equal numbers of non-pathogenic species that may be non-selectively targeted were found for both human and pathogen drug targets. These data draw attention to the necessity for target specificity in drug design, discovery, and development.

**Fig. 5: Clinical associations with drug target-metaproteome sequence identity.**

Differential side effects of drugs based on non-selective targeting of microbiome proteins

The 126 and 611 drug targets that did and did not map to the microbiome metaproteome sequences were assigned as microbiome-affecting and non-microbiome-affecting, respectively. Upon filtering the SIDER datasets for side effects corresponding to the drug and drug target, 314 symptoms were unique to the targets that mapped to microbiome proteins, while 2248 were shared (Fig. 5C). Among the unique microbiome-affecting symptoms, 42 (13%) were related directly to infection and inflammation, such as fungal and bacterial infections, while the others were distributed among the remaining 25 high-level MedDRA System classifications. Notably, the symptoms of wound complications and surgical site reaction – potentially affecting the skin microbiome – were also found exclusively among drugs that non-selectively target microbiome proteins. Additionally, an analysis of the difference in prevalence among the shared symptoms may also reveal the specific effects of off-target effects on the microbiome. When comparing the percentage of prevalence of shared symptoms among drug targets that are and are not similar to the metaproteome sequences, certain MedDRA Systems seem to be particularly enriched (Fig. 5D). Interestingly, several Infection (12 / 48), Blood and lymphatic (11), and Immune (5) symptoms were at least 10% more prevalent in microbiome-affecting targets. The prevalence of Cardiac (5) and Psychiatric (7) symptoms was particularly enriched for drugs that do not non-selectively target microbiome proteins. These data suggest that specific symptoms or organ system conditions may be more affected by drugs that non-selectively target the human microbiome.
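The comparison of unique and shared symptoms can be sketched as below. This is a simplified illustration rather than the analysis pipeline used in the study: it assumes each group is a mapping from a target identifier to the set of side effects linked to it, defines prevalence as the percentage of targets in a group that carry a symptom, and reads "at least 10% more prevalent" as a difference of at least 10 percentage points. All names and the threshold interpretation are assumptions.

```python
from collections import Counter


def prevalence(symptoms_by_target: dict) -> dict:
    """Percentage of targets in a group that are linked to each symptom."""
    n_targets = len(symptoms_by_target)
    if n_targets == 0:
        return {}
    counts = Counter(
        symptom
        for symptoms in symptoms_by_target.values()
        for symptom in set(symptoms)  # count each symptom once per target
    )
    return {symptom: 100.0 * count / n_targets for symptom, count in counts.items()}


def compare_groups(affecting: dict, non_affecting: dict, min_diff: float = 10.0):
    """Return (unique symptoms, shared symptoms, shared symptoms enriched in `affecting`)."""
    p_aff = prevalence(affecting)
    p_non = prevalence(non_affecting)
    unique = set(p_aff) - set(p_non)
    shared = set(p_aff) & set(p_non)
    enriched = {
        s: p_aff[s] - p_non[s]
        for s in shared
        if p_aff[s] - p_non[s] >= min_diff
    }
    return unique, shared, enriched
```

Swapping the two arguments gives the symptoms enriched in the non-microbiome-affecting group instead.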

Discussion

The effects of numerous drugs on the human microbiome have been widely shown in various studies [22, 43,44,45,46,47]. Considering that different surfaces on the human body are inhabited by phylogenetically diverse microbes, there may be structural and functional overlap between current drug targets and microbial proteins. However, to date, no study has assessed the similarity between established drug targets and the proteins expressed by bacteria in different human microbiomes. Therefore, herein, sequence and structure bioinformatics methodologies were utilized to investigate how sequence identity between drug targets and microbiome proteins relates to structural features and, thus, drug promiscuity. These data were then combined with symptoms related to drugs and drug targets to gain insight into potential clinical outcomes associated with non-selective targeting of microbiome bacteria.

The average sequence identity between pathogen drug targets and the metaproteome of the gut microbiome, in particular, was notably higher than that of the other drug target and microbiome associations. The significantly higher diversity of gut microbes compared to other human microbiomes has been noted previously [20], which, in combination with the findings in this study, suggests that a higher diversity may increase the likelihood of off-target drug effects. Furthermore, a higher number of non-selectively targeted species were found in common between the gut and oral microbiomes, which is not surprising considering their physical connection. The overarching differences in composition of off-targeted bacterial phyla between the gut/oral and vaginal microbiome metaproteomes highlight the diversity of species potentially targeted by drugs regardless of the target organism. The widely affected targets warrant further investigation into how drugs directed at similar proteins may be affecting microbiome health.

These data suggest that a sequence identity of 30% may be used to determine similarity between drug targets of diverse phylogenetic ancestry. Notably, the pathogen target sequences were found to be more correctly mapped to microbiome protein sequences with identical functional annotations than the mappings of human target sequences. All three pathogen targets that were investigated using protein and drug structure analyses revealed consistent features to microbiome proteins, whereas the human drug targets were more dissimilar. Importantly, however, the human drug target with the lowest sequence identity (of the four with structural information) to the microbiome protein reported the most stable protein-ligand interaction compared to the other human drug targets. Incorporating structural information into drug target similarity is, therefore, crucial to accurately assess how drugs may interact with proteins that are homologous to the target. Variations in residues outside of the binding site may also affect binding site dynamics, and thus may guide drug design efforts. Of note, several hypothetical proteins were found to be similar to the drug targets, which may add putative functional information for these genes. The sequence, structural, and functional similarities between established drug targets and proteins in the human microbiome, as exhibited in this study, suggest that such checks should be routinely considered during target selection for drug discovery campaigns.

These results support previous findings that the microbes from various human micro-environments may be influenced by drugs taken via different routes of entry and, thus, have widespread effects on host-microbiome interactions [48]. An increase in the prevalence of side effects classified under immune system and blood disorders and infections was specifically linked to drugs that were found to non-selectively target microbiome species. These findings are interesting since drug-induced perturbations of the human microbiome have been shown to result in the proliferation of opportunistic bacterial pathogens [49, 50]. Microbiome diversity forms the basis for maintaining protection against colonization by opportunistic pathogens [51]. Therefore, promiscuous drugs may disrupt the healthy microbiome diversity – as evidenced by the large number of phylogenetically-distinct non-pathogenic microbiome species with similar protein sequences to both human and pathogen drug targets – and give rise to micro-environments suitable for opportunistic pathogens [52]. A drug-induced expansion of opportunistic pathogens would support a recent report stating that bacteria in the patient microbiome are common causative agents in surgical site infections, as also seen in the side effect associations explored in this study [53].

Importantly, the reported associated side effects could be a result of other numerous and complex interactions that occur either solely in the human body or, perhaps, partly in connection with microbiota [54]. Considering that the potential off-target effects analyzed in this study were only assessed in the context of similar drug targets and not the vast protein space that may encompass the true landscape of drug promiscuity, further investigation is required to determine the side effects that are uniquely or differentially prevalent for certain drugs and drug targets [55]. Furthermore, drugs may be altered through biotransformation in microbial communities and, thus, affect downstream processes unrelated to the original target protein [56]. Bacteria non-selectively targeted may also be under evolutionary pressure to develop genetic mutations that nullify the effects of the drug, which may be transmitted horizontally to other microbiome bacteria and, thus, promote the spread of antimicrobial resistance mechanisms [57]. The impact of diet on both the microbiota and their combined effect on drug metabolism and off-target binding are also to be considered in the analysis of side effects [58]. Nevertheless, the preliminary clinical associations from this study suggest that certain side effects may be more prevalent when non-pathogenic microbiota are targeted.

Several factors have been suggested to help guide the selection of both human and pathogen targets for drug discovery efforts. In addition to the commonly performed sequence homology check between targets and human protein sequences, the data presented in this study point to the relevance of including a protein sequence identity screen for unintended microbiome targets. Artificial intelligence and machine learning approaches may also help further design drugs that are selective for specific targets and minimize off-target microbiome and human effects [59, 60]. Alternatively, such data may be leveraged to target specific bacteria to reverse disease states that may be caused by phylogenetically related or distinct pathogens [61, 62]. Future work that compares the homologous protein sequences and structures of commensal microbes may reveal features in drug binding pockets or on epitopes that are unique to pathogens, thus accelerating the search for novel antimicrobials [63,64,65,66]. Notably, many of these factors can be applied when considering genes or proteins for diagnostic or prognostic measures as well [67]. In summary, the non-selective targeting of microbiome proteins may be minimized by ensuring that the intended protein target sequence lacks identity to metaproteome sequences. Further assessment of drug promiscuity among non-pathogenic microbes may inform how sequence, structural, and functional information can be utilized to minimize off-target effects.

Methods

Data collection and analysis

The drug target protein UniProt [68] accessions were obtained from Santos et al. [26] and the protein sequences from UniProt. Only UniProt “Reviewed” drug target accessions were selected. The Gut v2.0.2, Oral v1.0.1, and Vaginal v1.0 microbiome metaproteome sequences, predicted functions, and phylogenetic lineages were downloaded from the MGnify database [27, 34]. BlastP v2.6.0+ was used to map the drug target sequences to the microbiome metaproteome sequences with an e-value cut-off of 10⁻⁶ [35]. Drug names and drug chemical characteristics, e.g. pKa, were obtained from PubChem [69] and DrugBank [70]. Drug side effects were retrieved from the SIDER 4.1 Side Effect Resource [41, 71], and symptoms and organ system classifications were derived from MedDRA [42]. Protein and ligand structures were obtained from PDB [37]. Plots were generated using R v4.3.3 [72].
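Filtering of BlastP hits by identity and e-value can be sketched as below. The sketch assumes the hits were written in the default 12-column BLAST tabular format (`-outfmt 6`); the function name, the example rows, and both default thresholds are placeholders for illustration and are not taken from the study. The `pident` column reports identity over the aligned region of each hit, which may differ from an identity computed over full-length sequences.

```python
import csv
import io

# Default column order of BLAST tabular output (-outfmt 6).
BLAST6_COLUMNS = [
    "qseqid", "sseqid", "pident", "length", "mismatch", "gapopen",
    "qstart", "qend", "sstart", "send", "evalue", "bitscore",
]


def filter_hits(handle, min_identity=30.0, max_evalue=1e-6):
    """Keep (query, subject, identity) for hits passing both placeholder thresholds."""
    reader = csv.DictReader(handle, fieldnames=BLAST6_COLUMNS, delimiter="\t")
    kept = []
    for row in reader:
        identity = float(row["pident"])
        if identity > min_identity and float(row["evalue"]) <= max_evalue:
            kept.append((row["qseqid"], row["sseqid"], identity))
    return kept


# Hypothetical example rows; the identifiers are invented for illustration.
EXAMPLE = (
    "targetA\tgut_protein_1\t45.2\t180\t90\t3\t1\t180\t2\t178\t1e-50\t150\n"
    "targetA\tgut_protein_2\t28.0\t175\t120\t4\t1\t175\t5\t176\t1e-10\t60\n"
    "targetB\toral_protein_9\t62.5\t300\t110\t2\t1\t300\t1\t299\t0.5\t30\n"
)
hits = filter_hits(io.StringIO(EXAMPLE))
```

In this toy input only the first row is retained: the second falls below the identity cutoff and the third exceeds the e-value cutoff.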

Protein structure modelling and ligand docking

Template-based protein structure homology modelling was performed using MODELLER v10.5 [73] with molecular dynamics-level optimization and refinement [74, 75]. MAFFT was used to align protein structure sequences [76]. Structural templates for the microbiome protein modelling are listed in the “PDB” column of Table 1. In the case of tacrine, a hydrogen was added to the secondary amine based on its pKa value of 9.85 [77] using the attach function in PyMOL v2.5 [78]. CB-Dock2 [79, 80] and MODELLER were used to perform template-based ligand docking (using the PDB structures in Table 1 as templates) of the drugs to the microbiome proteins. The drugs were also re-docked to the target protein as a control for affinity predictions. Protein-drug affinities and interactions were calculated using PRODIGY [81]. PyMOL was used to visualize protein and drug structures.
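The rationale for protonating tacrine can be illustrated with the Henderson–Hasselbalch relationship. The short sketch below estimates the fraction of a basic site that is protonated at a given pH; the pKa of 9.85 is taken from the text, whereas the pH of 7.4 is an assumption made here for illustration and is not stated in the Methods.

```python
def protonated_fraction(pka: float, ph: float) -> float:
    """Fraction of a basic site present in its protonated (conjugate acid) form.

    Derived from the Henderson-Hasselbalch equation:
        pH = pKa + log10([B] / [BH+])
    """
    return 1.0 / (1.0 + 10.0 ** (ph - pka))


# pKa from the text; pH 7.4 is an assumed, illustrative value.
fraction = protonated_fraction(pka=9.85, ph=7.4)
print(f"Protonated fraction at pH 7.4: {fraction:.4f}")
```

Under these assumptions more than 99% of the site is protonated, which is consistent with modelling the drug in its protonated form before docking.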

Data availability

All data are available upon request.

References

Yildirim O, Gottwald M, Schüler P, Michel MC. Opportunities and challenges for drug development: public–private partnerships, adaptive designs and big data. Front Pharmacol. 2016;7:461. https://doi.org/10.3389/fphar.2016.00461.
Mullard A. 2022 FDA approvals. Nat Rev Drug Discov. 2023;22:83–8.
Bano I, Butt UD, Mohsan SAH. Chapter 25 - new challenges in drug discovery. In: Das S, Thomas S, Das PP (eds). Novel platforms for drug delivery applications. Sawston: Woodhead Publishing; 2023. pp 619–43.
Tautermann CS. Current and future challenges in modern drug discovery. Methods Mol Biol. 2020;2114:1–17.
Radoux CJ, Olsson TSG, Pitt WR, Groom CR, Blundell TL. Identifying interactions that determine fragment binding at protein hotspots. J Med Chem. 2016;59:4314–25.
Thomas SE, Collins P, James RH, Mendes V, Charoensutthivarakul S, Radoux C, et al. Structure-guided fragment-based drug discovery at the synchrotron: screening binding sites and correlations with hotspot mapping. Philos Trans A Math Phys Eng Sci. 2019;377:20180422.
Pandurangan AP, Ascher DB, Thomas SE, Blundell TL. Genomes, structural biology and drug discovery: combating the impacts of mutations in genetic disease and antibiotic resistance. Biochem Soc Trans. 2017;45:303–11.
Waman VP, Vedithi SC, Thomas SE, Bannerman BP, Munir A, Skwark MJ, et al. Mycobacterial genomics and structural bioinformatics: opportunities and challenges in drug discovery. Emerg Microbes Infect. 2019;8:109–18.
Alsulami AF, Thomas SE, Jamasb AR, Beaudoin CA, Moghul I, Bannerman B, et al. SARS-CoV-2 3D database: understanding the coronavirus proteome and evaluating possible drug targets. Brief Bioinform. 2021;22:769–80.
Blundell TL. A personal history of using crystals and crystallography to understand biology and advanced drug discovery. Crystals. 2020;10:676.
Liang S, Thomas SE, Chaplin AK, Hardwick SW, Chirgadze DY, Blundell TL. Structural insights into inhibitor regulation of the DNA repair protein DNA-PKcs. Nature. 2022;601:643–8.
Moffat JG, Vincent F, Lee JA, Eder J, Prunotto M. Opportunities and challenges in phenotypic drug discovery: an industry perspective. Nat Rev Drug Discov. 2017;16:531–43.
Gashaw I, Ellinghaus P, Sommer A, Asadullah K. What makes a good drug target? Drug Discov Today. 2011;16:1037–43.
Agüero F, Al-Lazikani B, Aslett M, Berriman M, Buckner FS, Campbell RK, et al. Genomic-scale prioritization of drug targets: the TDR targets database. Nat Rev Drug Discov. 2008;7:900–7.
MacDonald ML, Lamerdin J, Owens S, Keon BH, Bilter GK, Shang Z, et al. Identifying off-target effects and hidden phenotypes of drugs in human cells. Nat Chem Biol. 2006;2:329–37.
Omeershffudin UNM, Kumar S. Antibiotic resistance in Neisseria gonorrhoeae: broad-spectrum drug target identification using subtractive genomics. Genomics Inform. 2023;21:e5.
Lloyd-Price J, Abu-Ali G, Huttenhower C. The healthy human microbiome. Genome Med. 2016;8:51.
Qin J, Li R, Raes J, Arumugam M, Burgdorf KS, Manichanh C, et al. A human gut microbial gene catalogue established by metagenomic sequencing. Nature. 2010;464:59–65.
Javdan B, Lopez JG, Chankhamjon P, Lee Y-CJ, Hull R, Wu Q, et al. Personalized mapping of drug metabolism by the human gut microbiome. Cell. 2020;181:1661–1679.e22.
Huttenhower C, Gevers D, Knight R, Abubucker S, Badger JH, Chinwalla AT, et al. Structure, function and diversity of the healthy human microbiome. Nature. 2012;486:207–14.
Hasan N, Yang H. Factors affecting the composition of the gut microbiota, and its modulation. PeerJ. 2019;7:e7502.
Zimmermann M, Zimmermann-Kogadeeva M, Wegmann R, Goodman AL. Separating host and microbiome contributions to drug pharmacokinetics and toxicity. Science. 2019;363:eaat9931.
Weersma RK, Zhernakova A, Fu J. Interaction between drugs and the gut microbiome. Gut. 2020;69:1510–9.
Algavi YM, Borenstein E. A data-driven approach for predicting the impact of drugs on the human microbiome. Nat Commun. 2023;14:3614.
McCoubrey LE, Gaisford S, Orlu M, Basit AW. Predicting drug-microbiome interactions with machine learning. Biotechnol Adv. 2022;54:107797.
Santos R, Ursu O, Gaulton A, Bento AP, Donadi RS, Bologa CG, et al. A comprehensive map of molecular drug targets. Nat Rev Drug Discov. 2017;16:19–34.
Gurbich TA, Almeida A, Beracochea M, Burdett T, Burgin J, Cochrane G, et al. MGnify genomes: a resource for biome-specific microbial genome catalogues. J Mol Biol. 2023;435:168016.
Pearson WR. An introduction to sequence similarity (“Homology”) searching. Curr Protoc Bioinformatics. 2013;3:3.1.1–8. https://doi.org/10.1002/0471250953.bi0301s42.
Tian W, Skolnick J. How well is enzyme function conserved as a function of pairwise sequence identity? J Mol Biol. 2003;333:863–82.
Pearson WR. Comparison of methods for searching protein sequence databases. Protein Sci. 1995;4:1145–60.
Pearson WR. Effective protein sequence comparison. Methods Enzymol. 1996;266:227–58.
Collins JF, Coulson AF, Lyall A. The significance of protein sequence similarities. Comput Appl Biosci. 1988;4:67–71.
Gupta MN, Alam A, Hasnain SE. Protein promiscuity in drug discovery, drug-repurposing and antibiotic resistance. Biochimie. 2020;175:50–7.
Richardson L, Allen B, Baldi G, Beracochea M, Bileschi ML, Burdett T, et al. MGnify: the microbiome sequence data analysis resource in 2023. Nucleic Acids Res. 2023;51:D753–D759.
Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, et al. BLAST+: architecture and applications. BMC Bioinformatics. 2009;10:421.
Haupt VJ, Daminelli S, Schroeder M. Drug promiscuity in PDB: protein binding site similarity is key. PLoS ONE. 2013;8:e65894.
Burley SK, Bhikadiya C, Bi C, Bittrich S, Chao H, Chen L, et al. RCSB Protein Data Bank (RCSB.org): delivery of experimentally-determined PDB structures alongside one million computed structure models of proteins from artificial intelligence/machine learning. Nucleic Acids Res. 2023;51:D488–D508.
Ahrodia T, Das S, Bakshi S, Das B. Chapter three - structure, functions, and diversity of the healthy human microbiome. In: Das B, Singh V (eds). Progress in molecular biology and translational science. London: Academic Press; 2022. pp 53–82.
Maier L, Pruteanu M, Kuhn M, Zeller G, Telzerow A, Anderson EE, et al. Extensive impact of non-antibiotic drugs on human gut bacteria. Nature. 2018;555:623–8.
Bartlett A, Padfield D, Lear L, Bendall R, Vos M. A comprehensive list of bacterial pathogens infecting humans. Microbiology. 2022;168:001269.
Kuhn M, Letunic I, Jensen LJ, Bork P. The SIDER database of drugs and side effects. Nucleic Acids Res. 2016;44:D1075–1079.
Brown EG, Wood L, Wood S. The medical dictionary for regulatory activities (MedDRA). Drug Saf. 1999;20:109–17.
Vich Vila A, Collij V, Sanna S, Sinha T, Imhann F, Bourgonje AR, et al. Impact of commonly used drugs on the composition and metabolic function of the gut microbiota. Nat Commun. 2020;11:362.
Wan Y, Zuo T. Interplays between drugs and the gut microbiome. Gastroenterol Rep. 2022;10:goac009.
Forslund SK, Chakaroun R, Zimmermann-Kogadeeva M, Markó L, Aron-Wisnewsky J, Nielsen T, et al. Combinatorial, additive and dose-dependent drug–microbiome associations. Nature. 2021;600:500–5.
Zimmermann M, Patil KR, Typas A, Maier L. Towards a mechanistic understanding of reciprocal drug–microbiome interactions. Mol Syst Biol. 2021;17:e10116.
Whang A, Nagpal R, Yadav H. Bi-directional drug-microbiome interactions of anti-diabetics. EBioMedicine. 2019;39:591–602.
Xue L, Ding Y, Qin Q, Liu L, Ding X, Zhou Y, et al. Assessment of the impact of intravenous antibiotics treatment on gut microbiota in patients: clinical data from pre-and post-cardiac surgery. Front Cell Infect Microbiol. 2022;12:1043971.
van Winkelhoff AJ, Rams TE, Slots J. Systemic antibiotic therapy in periodontics. Periodontol 2000. 1996;10:45–78.
Querido SMR, Back-Brito GN, dos Santos SSF, Leão MVP, Koga-Ito CY, Jorge AOC. Opportunistic microorganisms in patients undergoing antibiotic therapy for pulmonary tuberculosis. Braz J Microbiol. 2011;42:1321–8.
Spragge F, Bakkeren E, Jahn MT, Araujo EBN, Pearson CF, Wang X, et al. Microbiome diversity protects against pathogens by nutrient blocking. Science. 2023;382:eadj3502.
Dey P, Ray Chaudhuri S. The opportunistic nature of gut commensal microbiota. Crit Rev Microbiol. 2023;49:739–63.
Long DR, Bryson-Cahn C, Waalkes A, Holmes EA, Penewit K, Tavolaro C, et al. Contribution of the patient microbiome to surgical site infection and antibiotic prophylaxis failure in spine surgery. Sci Transl Med. 2024;16:eadk8222.
Birer C, Wright ES. Capturing the complex interplay between drugs and the intestinal microbiome. Clin Pharmacol Ther. 2019;106:501–4.
Bolz SN, Schroeder M. Promiscuity in drug discovery on the verge of the structural revolution: recent advances and future chances. Expert Opin Drug Discov. 2023;18:973–85.
Pant A, Maiti TK, Mahajan D, Das B. Human gut microbiota and drug metabolism. Microb Ecol. 2022;86:97–111.
Gillings MR. Evolutionary consequences of antibiotic use for the resistome, mobilome and microbial pangenome. Front Microbiol. 2013;4:4. https://doi.org/10.3389/fmicb.2013.00004.
Sharma A, Buschmann MM, Gilbert JA. Pharmacomicrobiomics: the holy grail to variability in drug response? Clin Pharmacol Ther. 2019;106:317–28.
Jiménez-Luna J, Grisoni F, Weskamp N, Schneider G. Artificial intelligence in drug discovery: recent advances and future perspectives. Expert Opin Drug Discov. 2021;16:949–59.
Gaudelet T, Day B, Jamasb AR, Soman J, Regep C, Liu G, et al. Utilizing graph machine learning within drug discovery and development. Brief Bioinform. 2021;22:bbab159.
Jia W, Li H, Zhao L, Nicholson JK. Gut microbiota: a potential new territory for drug targeting. Nat Rev Drug Discov. 2008;7:123–9.
Ratiner K, Ciocan D, Abdeen SK, Elinav E. Utilization of the microbiome in personalized medicine. Nat Rev Microbiol. 2024;22:291–308.
Skwark MJ, Torres PHM, Copoiu L, Bannerman B, Floto RA, Blundell TL. Mabellini: a genome-wide database for understanding the structural proteome and evaluating prospective antimicrobial targets of the emerging pathogen Mycobacterium abscessus. Database. 2019;2019:baz113.
Beaudoin CA, Blundell TL. Antigenic structural similarity as a predictor for antibody cross-reactivity. J Bacteriol Parasitol. 2021;0:1–3.
Beaudoin CA, Bartas M, Volná A, Pečinka P, Blundell TL. Are there hidden genes in DNA/RNA vaccines? Front Immunol. 2022;13:801915.
Beaudoin C. Evolution of targets at the host-pathogen interface. Thesis, University of Cambridge, 2024.
Schmalenberg M, Beaudoin C, Bulst L, Steubl D, Luppa PB. Magnetic bead fluorescent immunoassay for the rapid detection of the novel inflammation marker YKL40 at the point-of-care. J Immunol Methods. 2015;427:36–41.
The UniProt Consortium. UniProt: the universal protein knowledgebase in 2023. Nucleic Acids Res. 2023;51:D523–D531.
Kim S, Chen J, Cheng T, Gindulyte A, He J, He S, et al. PubChem 2023 update. Nucleic Acids Res. 2023;51:D1373–D1380.
Knox C, Wilson M, Klinger CM, Franklin M, Oler E, Wilson A, et al. DrugBank 6.0: the DrugBank knowledgebase for 2024. Nucleic Acids Res. 2024;52:D1265–D1275.
Kuhn M, Campillos M, Letunic I, Jensen LJ, Bork P. A side effect resource to capture phenotypic effects of drugs. Mol Syst Biol. 2010;6:343.
R Core Team. R: a language and environment for statistical computing. Vienna: R Foundation for Statistical Computing; 2023. https://www.R-project.org/.
Webb B, Sali A. Protein structure modeling with MODELLER. Methods Mol Biol. 2017;1654:39–54.
Sali A, Blundell TL. Comparative protein modelling by satisfaction of spatial restraints. J Mol Biol. 1993;234:779–815.
Beaudoin CA, Jamasb AR, Alsulami AF, Copoiu L, van Tonder AJ, Hala S, et al. Predicted structural mimicry of spike receptor-binding motifs from highly pathogenic human coronaviruses. Comput Struct Biotechnol J. 2021;19:3938–53.
Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 2013;30:772–80.
Mitra S, Muni M, Shawon NJ, Das R, Emran TB, Sharma R, et al. Tacrine derivatives in neurological disorders: focus on molecular mechanisms and neurotherapeutic potential. Oxid Med Cell Longev. 2022;2022:e7252882.
Schrödinger, LLC, DeLano W. PyMOL. 2020. http://www.pymol.org/pymol.
Liu Y, Yang X, Gan J, Chen S, Xiao Z-X, Cao Y. CB-Dock2: improved protein–ligand blind docking by integrating cavity detection, docking and homologous template fitting. Nucleic Acids Res. 2022;50:W159–W164.
Yang X, Liu Y, Gan J, Xiao Z-X, Cao Y. FitDock: protein–ligand docking by template fitting. Brief Bioinform. 2022;23:bbac087.
Xue LC, Rodrigues JP, Kastritis PL, Bonvin AM, Vangone A. PRODIGY: a web server for predicting the binding affinity of protein-protein complexes. Bioinformatics. 2016;32:3676–8.

Acknowledgements

CAB was supported by Antibiotic Research UK (ANTSRG 01/2019-PHZJ/687).

Author information

Authors and Affiliations

Department of Biochemistry, University of Cambridge, Cambridge, UK
Christopher A. Beaudoin
Department of Psychology, Health & Technology, University of Twente, Enschede, the Netherlands
Shannon Norget
King Abdullah International Medical Research Center, King Saud Bin Abdelaziz University for Health Sciences, Jeddah, Saudi Arabia
Ziad Omran
Biothreat Department, Public Health Laboratory, Public Health Authority, Riyadh, Saudi Arabia
Sharif Hala
Pathogen Genomics Laboratory, Biological and Environmental Sciences and Engineering, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
Sharif Hala
Department of Anesthesia, International Medical Center, Jeddah, Kingdom of Saudi Arabia
Abdullah H. Daqeeq
Department of Psychiatry, University of Oxford, Oxford, UK
Philip W. J. Burnet
Victor Phillip Dahdaleh Heart and Lung Research Institute, Biomedical Campus, Trumpington, Cambridge, UK
Tom L. Blundell
Department of Veterinary Medicine, University of Cambridge, Cambridge, UK
Andries J. van Tonder

Contributions

CAB: Conceptualization, Methodology, Validation, Formal analysis, Investigation, Writing - Original Draft, Writing - Review & Editing, Funding acquisition. SN: Conceptualization, Methodology, Validation, Formal analysis, Investigation, Writing - Original Draft, Writing - Review & Editing. ZO: Methodology, Validation, Formal analysis, Investigation, Writing - Original Draft, Writing - Review & Editing. SH: Methodology, Validation, Formal analysis, Investigation. AHD: Methodology, Validation, Formal analysis. PWJB: Supervision, Project administration, Investigation, Writing - Original Draft, Writing - Review & Editing. TLB: Conceptualization, Supervision, Project administration, Funding acquisition. AJvT: Conceptualization, Methodology, Supervision, Project administration.

Corresponding authors

Correspondence to Christopher A. Beaudoin or Andries J. van Tonder.

Ethics declarations

Competing interests

The authors declare no competing interests.

Ethics approval and consent to participate

No new animal or human data was generated in this study. All methods were performed in accordance with the relevant guidelines and regulations.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Beaudoin, C.A., Norget, S., Omran, Z. et al. Similarity of drug targets to human microbiome metaproteome promotes pharmacological promiscuity. Pharmacogenomics J 25, 9 (2025). https://doi.org/10.1038/s41397-025-00367-0

Received: 15 July 2024
Revised: 27 February 2025
Accepted: 24 March 2025
Published: 17 April 2025
Version of record: 17 April 2025
DOI: https://doi.org/10.1038/s41397-025-00367-0