Introduction

ASD is a neurodevelopmental condition characterized by early-onset social communication deficits and restricted, repetitive patterns of behavior or interests1,2. Over the past decade, the prevalence of ASD has shown a gradual increase, currently affecting approximately 1% of the global population3. While the precise etiology remains unclear, traditional research has primarily focused on the interplay between genetic variations, synaptic dysfunction, and environmental factors1,4,5. Notably, accumulating evidence suggests that the microbiota-gut-brain axis plays a crucial role in the pathophysiology of ASD4,6.

The gut microbiota plays a pivotal role in bidirectional communication between the gut and the brain, suggesting its potential involvement in neurodevelopment, neurotransmission, and behavioral regulation, thereby influencing the pathogenesis of various neurodevelopmental, psychiatric, and neurological disorders4. Growing evidence indicates that gut dysbiosis significantly contributes to the development of ASD7,8,9. Clinical studies have found that children with ASD are often accompanied by disturbances in the composition of the intestinal flora (e.g., abnormal Firmicutes/Bacteroidetes ratio), increased intestinal barrier permeability, and changes in serum levels of microbial metabolites10,11. Animal studies have demonstrated that fecal microbiota transplantation from ASD patients can induce social behavior deficits in mice, while probiotic interventions or microbiota modulation may ameliorate certain ASD-like symptoms12. Notably, a clinical trial utilizing microbiota transfer therapy reported symptomatic improvement in 18 children with ASD13,14.These findings highlight the promising therapeutic potential of gut microbiota modulation in ASD. However, the precise mechanisms underlying the microbiota’s therapeutic effects require further investigation.

Network pharmacology, an interdisciplinary approach integrating systems biology and pharmacology15, enables comprehensive analysis through target identification, protein-protein interaction (PPI) network construction, and molecular docking validation16,17,18. Recent network pharmacology studies have successfully identified potential therapeutic metabolites derived from gut microbiota for related disorders19,20. Therefore, we propose to employ network pharmacology methodologies to systematically investigate the molecular mechanisms underlying ASD-gut microbiota interactions, providing novel insights for future ASD-microbiota research. The workflow is represented in Fig. 1.

Fig. 1
figure 1

The workflow of this study.

Methods

Selection of gut microbial metabolites and targets

Using the gutMGene v2.0 database (accessed February 2025)21(http://bio-annotation.cn/gutmgene), we retrieved information on human gut microbiota, gut metabolites, and their associated targets. The SMILES representations of metabolites were obtained from PubChem (https://pubchem.ncbi.nlm.nih.gov/)22. To identify target genes for each metabolite, we utilized the Swiss Target Prediction (STP) database (accessed February 2025) (https://www.swisstargetprediction.ch/)23 and the Similarity Ensemble Approach (SEA) database (accessed February 2025) (https://sea.bkslab.org/)24. Finally, the Evenn online tool (http://www.ehbio.com/)25 was employed to integrate and intersect the metabolite target genes identified from both databases, yielding the final set of gut metabolite-associated target genes. All data were sourced from database versions released before March 2025.

Search for targets of ASD

We accessed the GeneCards database26 (https://www.genecards.org/) and the Online Mendelian Inheritance in Man (OMIM) database27 (https://www.omim.org/) on February 22, 2025, using “autism spectrum disorder” as the keyword to identify ASD genes. For GeneCards, we set a relevance score of ≥ 10 as the screening threshold. The two datasets were merged into a union set, and duplicate genes were removed to establish the ASD gene set. Next, we intersected the ASD gene set with the gut metabolite targets and further cross-referenced the overlapping genes with human intestinal targets in the gutMGene database to identify potential therapeutic targets for ASD based on gut metabolites. All data were sourced from database versions released before March 2025.

PPI network construction and hub gene screening

Protein–protein interaction (PPI) analysis of the 51 ASD-related host genes of gut microbial metabolites (AHGGMM) was performed using the STRING database (https://string-db.org/)28,29. To identify hub genes within the PPI network, we utilized the CytoHubba plugin in Cytoscape30, applying four topological analysis algorithms: Degree, Edge Percolated Component (EPC), Maximal Clique Centrality (MCC), and Maximum Neighborhood Component (MNC). The Degree algorithm reflects the centrality of a node based on its number of direct connections. The EPC algorithm evaluates the stability of a node within the network, with higher EPC scores indicating inclusion in more robust subnetworks. MCC is a highly sensitive and accurate method for detecting key nodes in complex networks. MNC highlights nodes that occupy central positions in densely connected local regions31. The highest-scoring targets identified across all four algorithms were then selected as the most valuable protein-coding targets for ASD32.

Gene ontology (GO) and kyoto encyclopaedia of genes and genomes (KEGG) enrichment analysis

GO analysis characterizes gene sets through cellular components (CC), molecular functions (MF) and biological processes (BP) to uncover biological significance33. KEGG, an integrated database of genomic and chemical information, provides metabolic pathway maps for understanding gene functions34,35,36. KEGG enrichment reveals gene set involvement in metabolic pathways. We used the Sangerbox online tool37 (http://www.sangerbox.com/) to perform GO and KEGG enrichment analyses and visualize the results for the 51 overlapping genes. A threshold of P < 0.05 was considered statistically significant for the enrichment results.

Microbiota-metabolites-targets-signaling (MMTS) pathways network analysis

To comprehensively elucidate the relationships among gut microbiota, metabolites, the core targets (AKT1 and IL6), and their signaling pathways, we first utilized the gutMGene database to identify gut microbial communities and metabolites directly associated with the core targets AKT1 and IL6. We then further queried the gutMGene database for additional metabolites or microbes associated with these gut microbial and metabolites, thereby indirectly identifying potential gut-derived regulators of AKT1 and IL6. KEGG pathway enrichment analysis was performed on the final set of target genes, and the KEGG pathways involving AKT1 and IL6 were used to construct the MMTS network. The MMTS network was visualized using Cytoscape (version 3.10.3).

Prediction of drug similarity and toxicity parameters of metabolites

The drug-likeness is determined based on the Lipinski’s rule of five: (1) molecular weight < 500; (2) lipid-water partition coefficient < 5; (3) hydrogen bond acceptor count < 5; (4) hydrogen bond donor count < 5; (5) polar surface area < 140. Metabolites meeting Lipinski’s Rule of Five criteria were selected for further analysis through the SwissADME platform38. Thus, we confirmed the six parameters by using ADMETlab 3.0 platform39: hERG blockers obstruct potassium channels40, cause human hepatotoxicity41, ames mutagenicity42, skin sensitization43, Lethal Dose 50 (LD50) of acute toxicity44, and Drug Induced Liver Injury (DILI)45.

The docking testing of metabolites and targets for molecular

To evaluate the interaction and binding of intestinal metabolites with the core targets AKT1 and IL6, we obtained the protein structure files of the core targets AKT1 (PDB ID: 3O96) and IL6 (PDB ID: 4O9H) from the PDB46 database, and removed co-crystalline ligands, ions, and water molecules from the protein structures of the core targets using PyMOL software. PyMOL software was used to remove co-crystalline ligands, ions and water molecules from the protein structure of the core targets, and polar hydrogen and Kollman charges were added using AutoDock. Core metabolites were obtained as sdf format files from Pubchem and pre-docking processed using Chembio3D Ultra. The core metabolites were then analyzed to determine the binding pockets of the core target structures for molecular docking using AutoDock Vina47. The center coordinates of the two key targets are: AKT1: x = − 15.877, y = 9.362, z = 37.115 and IL-6: x = 11.214, y = 33.473, z = 11.159. The docking site was set in cubic box (x = 40 Å, y = 40 Å, and z = 40 Å) in a central point of each target. and the binding energies were calculated and finally combined with the Protein-Ligand Interaction Profiler online tool (https://plip-tool.biotec.tu-dresden.de/plip-web/plip/index) with PyMOL software to visualize them.

Results

Identification of ASD-Related host genes of gut microbial metabolites

We first retrieved 278 gut microbial metabolites and 238 human intestinal targets from the gutMGene v2.0 database. Using SEA and STP databases, we predicted the targets of the 278 metabolites and identified 1323 and 1040 targets, respectively. Taking the intersection of these two datasets yielded 755 gut microbial metabolite target genes (Fig. 2A). To identify ASD-related genes, we queried the GeneCards and OMIM databases with the term “autism spectrum disorder.” After merging and removing duplicates, a total of 4,576 ASD-related genes were obtained. Intersecting these with the 755 gut microbial metabolite targets resulted in 399 genes associated with both gut microbial metabolites and ASD (Fig. 2B). Further comparison of these 399 genes with the 238 host intestinal targets revealed 51 common genes, which were defined as ASD-related host genes of gut microbial metabolites (AHGGMM) (Fig. 2C) (Supplementary Table S1S3).

Fig. 2
figure 2

(A) The 755 gut microbial metabolite target genes. (B) The 399 overlapping genes (ASD-related gut microbial metabolite target genes) between the 755 gut metabolite target genes and the 4576 ASD genes. (C) The 51 overlapping genes (ASD-related host genes of gut microbial metabolites) between the 399 ASD-related gut microbial metabolite target genes and the 238 host genes.

Enrichment analysis

GO and KEGG enrichment analyses were performed on the 51 AHGGMM, identifying a total of 122 signaling pathways, 210 biological processes (BP), 72 molecular functions (MF), and 24 cellular components (CC). GO enrichment analysis showed that the most significantly enriched BPs were response to stress, cellular response to chemical stimulus, and response to organic substance (Fig. 3A), suggesting the involvement of these genes in host adaptive responses modulated by gut microbial metabolites. The top enriched CC terms were protein-containing complex, nuclear part, and nucleoplasm (Fig. 3B), indicating a predominant localization of these genes in nuclear and protein complexes. For MF, the top terms included enzyme binding, transcription factor binding, and RNA polymerase II transcription factor binding (Fig. 3C), highlighting their regulatory roles in transcriptional processes. KEGG pathway analysis revealed that these genes were primarily enriched in signaling pathways such as the AGE-RAGE signaling pathway in diabetic complications, IL-17 signaling pathway, and PI3K/Akt signaling pathway. In addition, enrichment in human disease-related pathways, including pathways in cancer and Yersinia infection, indirectly reflected the strong association of these genes with inflammatory processes (Fig. 3D).

Fig. 3
figure 3

GO and KEGG enrichment analysis. (A). The BP analysis of 51 AHGGMM. (B). The CC analysis of 51 AHGGMM. (C) The MF analysis of 51 AHGGMM. (D) KEGG pathway analysis. AHGGMM, ASD-related host genes of gut microbial metabolites.

Protein–protein interaction network analysis

We subjected the 51 AHGGMM obtained to PPI network analysis, the total of 114 nodes and 144 edges were identified in the network (Fig. 4A). The top 10 genes in the four algorithms of “Degree”, “EPC”, “MCC” and “MNC” were ranked as hub genes, among which AKT1 and IL6 proteins had the highest scores in all four algorithms (Fig. 4B,E). Indicating that they play a pivotal role in the gene network of intestinal flora in the regulation of ASD. Therefore, we chose AKT1 and IL-6 as the core targets of ASD regulation by gut flora metabolites in our subsequent studies. (Table 1)

Fig. 4
figure 4

(A) The PPI network from STRING. (BE) Four algorithms for calculating hub genes in CytoHubba. (B) Degree algorithm. (C) Edge Percolated Component (EPC) algorithm, (D) Maximal Clique Centrality (MCC) algorithm, and (E) Maximum Neighborhood Component (MNC) algorithm.

Table 1 The top 10 genes of the four algorithms*.

The MMTS network analysis

We queried the gutMGene database to identify gut microbes and metabolites associated with the two core targets, AKT1 and IL6. The results showed that IL6 was directly associated with six gut microbes and five gut-derived metabolites, whereas AKT1 was associated with three metabolites: vancomycin, glycocholic acid, and indole. We further explored other metabolites or microbes functionally linked to these directly associated factors in the gutMGene database, thereby indirectly identifying gut-derived factors with potential regulatory effects on AKT1 and IL6. KEGG pathway enrichment analysis of 51 AHGGMM revealed enrichment in 122 signaling pathways, including 28 pathways involving AKT1 and 10 pathways involving IL6. Based on these findings, we constructed a MMTS network centered on the two core targets, AKT1 and IL6, comprising 72 gut microbes, 12 gut metabolites, and 30 signaling pathways (Fig. 5). (Supplementary Table S1S3).

Fig. 5
figure 5

The MMTS network. The orange nodes represent the two core genes, the pink nodes represent gut microbes, the blue nodes represent gut metabolites, and the green nodes represent signaling pathways. The edges indicate the associations between two nodes. hsa04151: PI3K-Akt signaling pathway, hsa04933: AGE-RAGE signaling pathway in diabetic complications, hsa04625: C-type lectin receptor signaling pathway, hsa04066: HIF-1 signaling pathway, hsa04620: Toll-like receptor signaling pathway, hsa04668: TNF signaling pathway, hsa04926: Relaxin signaling pathway, hsa04010: MAPK signaling pathway, hsa04660: T cell receptor signaling pathway, hsa04012: ErbB signaling pathway, hsa04722: Neurotrophin signaling pathway, hsa04919: Thyroid hormone signaling pathway, hsa04071: Sphingolipid signaling pathway, hsa04915: Estrogen signaling pathway, hsa04024: cAMP signaling pathway, hsa04014: Ras signaling pathway, hsa04370: VEGF signaling pathway, hsa04917: Prolactin signaling pathway, hsa04662: B cell receptor signaling pathway, hsa04068: FoxO signaling pathway, hsa04022: cGMP-PKG signaling pathway, hsa04062: Chemokine signaling pathway, hsa04910:Insulin signaling pathway, hsa04371: Apelin signaling pathway, hsa04072: Phospholipase D signaling pathway hsa04630: JAK-STAT signaling pathway, hsa04664: Fc epsilon RI signaling pathway, hsa04920: Adipocytokine signaling pathway, hsa04657: IL-17 signaling pathway, hsa04621: NOD-like receptor signaling pathway.

Prediction of drug similarity and toxicity parameters of metabolites

We further evaluated the drug-like properties and toxicity of the major metabolites. Among these eight metabolites, vancomycin violated Lipinski’s rule of five in its ADME parameters, particularly due to its topological polar surface area, which does not meet the characteristics of orally active drugs. The remaining Indole, glycocholic acid, 3- Indole, glycocholic acid, 3-Indolepropionic acid, acetate, butyrate, trimethylamine oxide, propionate, metabolites are all in accordance with the Lipinski’s five rules (Table 2). Toxicity testing results (Table 3) showed that acetate, butyrate, trimethylamine oxide, and propionate were positive for H-HT, suggesting potential hepatotoxicity.

Table 2 The physicochemical properties of the metabolites from key metabolites.
Table 3 Toxicological properties of the metabolites from the key metabolites.

The docking testing of metabolites and targets for molecular

We evaluated the binding affinities of AKT1 with two metabolites, glycocholic acid and indole, as well as IL6 with five metabolites, including 3-indolepropionic acid, acetate, butyrate, propionate, and trimethylamine oxide, in co-crystallized complexes. For AKT1 (PDB ID: 3O96), glycocholic acid exhibited the highest binding energy (− 10.2 kcal/mol), followed by indole (− 5.9 kcal/mol). Glycocholic acid promoted the formation of the glycocholic acid–AKT1 complex (Fig. 6) through hydrogen bonding interactions with GLN79, VAL83, ASP292, and GLY294, as well as hydrophobic interactions with GLN79, TRP80, THR82, VAL270, TYR272, and ARG273. In contrast, for IL6 (PDB ID: 4O9H), 3-indolepropionic acid showed the strongest binding affinity, with a binding energy of − 4.9 kcal/mol (Table 4).

Fig. 6
figure 6

The molecular docking test on core targets-metabolites pairs. (A) Indole-AKT1, (B) Glycocholic acid-AKT1.

Table 4 Molecular docking results of metabolites and core targets.

Discussion

In recent years, the complex interactions between gut microbial metabolites and the pathogenesis of ASD have attracted increasing research attention, with the gut-brain axis emerging as a crucial regulatory mechanism in ASD48. While existing studies have demonstrated the therapeutic potential of gut microbiota in ASD treatment11,49, the specific metabolites involved and their molecular targets remain unclear. Recent advances in network-based systems pharmacology have enabled the investigation of relationships between gut microbial metabolites and disease mechanisms50. Our study systematically elucidates the molecular mechanisms underlying ASD regulation by gut microbial metabolites through an integrated approach combining multi-database analysis, network pharmacology, and molecular docking.

During the data collection process, we collected microbial and disease gene targets through three different platforms, gutMGene21, Genecard26 and OMIM27 three different platforms collected microbial and disease gene targets, and 51 key targets were obtained after analysis. GO enrichment analysis of these target genes revealed that ASD-related targets were significantly enriched in processes such as stress response and chemical stimulus response, emphasizing the role of gut-derived metabolites in regulating host adaptation to environmental challenges. And KEGG pathway analysis further identified PI3K/Akt and IL-17 as the core pathways of ASD.

We then performed a PPI network analysis, in which we used four algorithms to identify AKT1 and IL6 as key targets of gut microbial metabolites regulating ASD. Notably, AKT1 is a core component of the PI3K/Akt pathway, and impairment of its PI3K/Akt signaling pathway may lead to defective synaptic pruning and abnormal neuronal connectivity to affect ASD51. Its downstream AKT-mTOR signaling pathway is a common therapeutic target in ASD52. AKT1 is a core member of the AKT/PKB (protein kinase B) signaling pathway, which plays an important role in the regulation of neural development, synaptic plasticity and metabolic homeostasis53,54,55. In addition, AKT1 plays a key role in the PI3K-AKT-mTOR signaling pathway, which is involved in the regulation of a variety of biological processes56. Abnormalities in the mTOR signaling pathway are considered to be one of the potential pathogenic mechanisms of ASD, and a large body of evidence suggests that the mTOR inhibitor rapamycin is effective in alleviating social dysfunction in a variety of animal models of ASD52,57,58,59. Dysregulation of AKT1 is associated with neurodevelopmental disorders including ASD. In addition, the expression level of AKT1 in the amygdala correlates with emotional memory learning, suggesting that it may be involved in social interaction disorders in ASD patients60.

IL6, a pro-inflammatory cytokine, has also been identified as another core target. In the IL-17 signaling pathway, transactivation of the IL-6 receptor promotes the release of IL-17 A, and IL-6 is an essential cytokine for the polarization of Th0 cells into Th17 cells, which makes it involved in ASD expression61. IL-6 levels were significantly elevated in frozen brain tissue from ASD patients compared to healthy subjects62. In a study of ASD, Weili Wu et al. found that IL-6 activation in the maternal placenta is a necessary mediator that affects fetal brain development and impairs their ability to explore socially63. Similarly, in patients with ASD, elevated levels of IL6 are associated with neuroinflammation, microglia activation, and behavioral abnormalities61. Recent studies have demonstrated that IL-6 enhances activation of the PI3K-AKT/mTOR-GSK-3β pathway in hippocampal neurons of ASD mice by upregulating GRPR64. In summary, AKT1 and IL6 play important roles in the pathogenesis of autism, and the present study found that intestinal microbial metabolites could exert therapeutic effects on autism mainly by affecting AKT1 through network pharmacological analysis.

In MMTS network analysis, we identified three metabolites (indole, glycylcholic acid, and vancomycin) associated with AKT1, and five metabolites (3-indolepropionic acid, acetic acid, butyric acid, trimethylamine oxide, and propionic acid) associated with IL6. We hypothesized that these eight metabolites may activate the related key genes (AKT1 and IL6) through different mechanisms, which in turn are directly or indirectly involved in the pathological process of ASD through signaling pathways. Through research reports, we found that the levels of the metabolite TMAO were associated with cardiovascular diseases65, NAFLD66,67 and metabolic disorders (e.g., type 2 diabetes mellitus)68, etc. In addition, a recent study reported that TMAO was observed in the urine of children with ASD69,70. In contrast, butyrate, acetate, and propionate—classified as short-chain fatty acids (SCFAs)—have been reported to alleviate ASD-related symptoms71,72 and reduce inflammation73. 3-indolylpropionic acid and indoles are neuroprotective and enhance intestinal barrier function, helping to reduce neuroinflammation and decrease intestinal permeability, thereby alleviating ASD symptoms74,75. Based on the above findings, we hypothesize that these metabolites may be potentially involved in the pathogenesis of ASD. Their key genes may affect the development of ASD by activating specific signaling pathways. This hypothesis needs to be further verified in future studies.

In this study, we further identified AKT1 and IL6 as key targets within the network of ASD-related host genes influenced by gut microbial metabolites through PPI network analysis. These targets are likely to play significant roles in the regulation of autism spectrum disorder (ASD) onset and progression by gut metabolites. Correlation analysis revealed direct associations between AKT1/IL6 and eight metabolites. To further assess the potential regulatory capacity of these metabolites, we employed molecular docking to predict their binding strength to the two target genes. The results demonstrated that glycodeoxycholic acid, produced by Escherichia coli, Prevotella, Wickerhamomyces, and Akkermansia, exhibited the strongest binding affinity to AKT1 (− 10.2 kcal/mol), indicating its potential as a key regulatory metabolite for AKT1 function. Notably, Akkermansia, recognized as a probiotic genus, exerts functions that enhance intestinal barrier integrity and modulate the immune system76,77. Previous studies have reported that Akkermansia ameliorates aberrant behaviors and intestinal dysfunction in mouse models of autism78,79. Consistent with the results of studies showing that the bile acid derivative glycocholic acid can attenuate gastrointestinal tract via metabolic and inflammatory pathways, its effects on ASD should be investigated and it could be very interesting80. The molecular docking results not only provide a structural basis for the predicted microbe–host interactions but also lay the foundation for subsequent screening of candidate therapeutic agents. In conclusion, integrating network pharmacology with molecular docking not only enhances the reliability of target prediction but also provides a theoretical framework for developing microbiota-based intervention strategies for ASD. This finding provides new clues to the mechanism of association between intestinal metabolites and ASD, and future studies could focus on their potential role in the development of ASD.

Furthermore, network pharmacology leverages extensive omics and public databases, enabling it to capture and visualize multidimensional interactions between metabolites and multiple host targets/pathways. This approach overcomes the limitations inherent in traditional single-target research and aligns better with the complex pathophysiology of ASD. Through topological network analysis, it also facilitates the identification of critical regulatory nodes (AKT1 and IL6) from numerous potential targets and predicts potential relationships among metabolites, targets, and pathways. This provides a theoretical basis for subsequent experimental validation81. These advantages underscore the utility of network pharmacology in revealing mechanisms underlying gut microbiota-host interactions in ASD.

Future perspectives

Although this study elucidates the potential mechanisms by which gut microbial metabolites regulate ASD-associated pathways, several challenges and opportunities remain. Firstly, the identified core genes and metabolites require experimental validation in cellular and animal models to confirm their roles in ASD pathogenesis. Secondly, longitudinal clinical studies are needed to evaluate whether modulating specific gut microbiota or supplementing targeted metabolites can ameliorate ASD symptoms. Furthermore, the potential hepatotoxicity risks associated with certain metabolites warrant careful evaluation in therapeutic applications. Future research should also explore how inter-individual variations in gut microbial composition and metabolism impact treatment efficacy. Integrating multi-omics data, including metagenomics, metabolomics, and transcriptomics, holds promise for refining our understanding of the gut-brain axis in ASD. These efforts will ultimately advance the development of precise and safe gut microbiota-based management strategies for ASD.

Conclusion

This study employed a network pharmacology approach to systematically identify and analyze the potential regulatory mechanisms of gut microbial metabolites in autism spectrum disorder (ASD). We identified a total of 51 ASD-associated host genes targeted by gut microbial metabolites, which were enriched in pathways such as PI3K/Akt and IL-17. Further protein-protein interaction (PPI) network analysis highlighted AKT1 and IL6 as core hub genes. Centered on these two key targets, we constructed a Microbe-Metabolite-Target-Signaling (MMTS) network, revealing complex interactions among gut microbiota, metabolites, and host signaling pathways in ASD pathogenesis. Molecular docking results further validated the high binding affinity between key metabolites and these core targets, suggesting their potential functional significance. Collectively, this study provides novel insights into the molecular interactions between gut microbiota and the host in ASD, and proposes AKT1 and IL6 as potential targets for microbiota-based interventions.