Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Advertisement

Scientific Reports
  • View all journals
  • Search
  • My Account Login
  • Content Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • RSS feed
  1. nature
  2. scientific reports
  3. articles
  4. article
Enhanced drug disease association prediction through multimodal data integration and meta path guided global local feature fusion
Download PDF
Download PDF
  • Article
  • Open access
  • Published: 25 February 2026

Enhanced drug disease association prediction through multimodal data integration and meta path guided global local feature fusion

  • Shengnan Wu1,2,
  • Wen Wang1,2,
  • Huizhi Jiao1,2,
  • Danhong Dong1,2,
  • Kexin Zhang1,2 &
  • …
  • Xuechen Luo1 

Scientific Reports , Article number:  (2026) Cite this article

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

  • Computational models
  • Data mining
  • Databases
  • Diseases
  • Machine learning

Abstract

Accurately predicting Drug-Disease Associations (DDAs) is of great significance for drug repurposing and new drug development. Although existing methods have promoted the development of this field to a certain extent, most of them are still limited to single-modal data and cannot fully characterize the complex features of drugs, diseases, and genes. At the same time, many methods only focus on either local neighborhoods or global structures during feature extraction, lacking the organic combination of the two, which limits the accuracy and generalization of predictions. To address this, this paper proposes MedPathEx, a drug-disease association prediction method that combines multi-modal data integration and local-global feature learning. Specifically, we first construct a drug-gene-disease heterogeneous network and fuse multi-modal attributes such as drug chemical structures, ATC classifications, side effects, disease phenotypes and semantic information, as well as gene function annotations to generate more comprehensive node representations. Subsequently, we use graph convolutional networks to extract the attribute features of nodes themselves, capture local semantic relationships through meta-path modeling with a multi-head attention mechanism, and introduce a global attention mechanism to extract overall topological patterns, thereby achieving “micro-macro complementary” feature learning. Finally, by fusing node attributes and structural features, MedPathEx obtains a more discriminative comprehensive representation for the prediction of potential DDAs. Experimental results show that MedPathEx outperforms existing methods in key indicators such as AUC, AP, and F1. Moreover, it successfully identifies new candidate drugs in cases of coronary artery disease and hypertension, demonstrating its great potential in practical applications.

Data availability

All datasets used in this study are publicly available from their respective official sources: BioSNAP ([https://snap.stanford.edu/biodata/](https:/snap.stanford.edu/biodata)),CTD ([https://ctdbase.org/](https:/ctdbase.org)),PharmGKB ([https://www.pharmgkb.org/](https:/www.pharmgkb.org)),DrugBank([https://go.drugbank.com/](https:/go.drugbank.com)),ChEMBL([https://www.ebi.ac.uk/chembl/](https:/www.ebi.ac.uk/chembl)),SIDER(https://sideeffects.embl.de/),OMIM(https://www.omim.org/),MeSH ([https://www.ncbi.nlm.nih.gov/mesh/](https:/www.ncbi.nlm.nih.gov/mesh)),Gene Ontology Consortium (http://geneontology.org/).

Code Availability

The implementation of MedPathEx and the preprocessed data is available at https://github.com/wwiswwlucky/MedPathEx.

References

  1. Wang, Y., Song, J., Dai, Q. & Duan, X. Hierarchical negative sampling based graph contrastive learning approach for Drug-Disease association prediction. IEEE J. Biomedical Health Inf. 28, 3146–3157 (2024).

    Google Scholar 

  2. Pasrija, P., Jha, P., Upadhyaya, P., Khan, M. S. & Chopra, M. Machine learning and artificial intelligence: A paradigm shift in big Data-Driven drug design and discovery. Curr. Top. Med. Chem. 22, 1692–1727 (2022).

    Google Scholar 

  3. Xiang, Z., Yunqiu, Z., Shaodan, S. & Liman, Z. Research on drug knowledge discovery method fusing meta-path features of heterogeneous knowledge networks: taking the prediction of drug–target relations as an example. Data Anal. Knowl. Discov. 8, 125–135 (2024).

    Google Scholar 

  4. Gu, Y., Zheng, S., Yin, Q., Jiang, R. & Li, J. R. E. D. D. A. Integrating multiple biological relations to heterogeneous graph neural network for drug-disease association prediction. Comput. Biol. Med. 150, 106127 (2022).

    Google Scholar 

  5. Jarada, T. N., Rokne, J. G. & Alhajj, R. A review of computational drug repositioning: strategies, approaches, opportunities, challenges, and directions. J. Cheminform. 12, 46 (2020).

    Google Scholar 

  6. Pan, X. et al. Deep learning for drug repurposing: Methods, databases, and applications. WIREs Comput. Mol. Sci. 12, e1597 (2022).

    Google Scholar 

  7. Luo, H. et al. Biomedical data and computational models for drug repositioning: a comprehensive review. Brief. Bioinform. 22, 1604–1619 (2020).

    Google Scholar 

  8. Kim, Y. et al. Drug-Disease Association Prediction Using Heterogeneous Networks for Computational Drug Repositioning. Biomolecules 12, 1497 (2022).

    Google Scholar 

  9. Liu, H., Song, Y., Guan, J., Luo, L. & Zhuang, Z. Inferring new indications for approved drugs via random walk on drug-disease heterogenous networks. BMC Bioinform. 17, 539 (2016).

    Google Scholar 

  10. Wu, G., Liu, J. & Yue, X. Prediction of drug-disease associations based on ensemble meta paths and singular value decomposition. BMC Bioinform. 20, 134 (2019).

    Google Scholar 

  11. Jamali, A. A., Tan, Y., Kusalik, A. & Wu, F. X. NTD-DR: nonnegative tensor decomposition for drug repositioning. PLOS ONE. 17, 1–18 (2022).

    Google Scholar 

  12. Wu, P. et al. Integrating gene expression and clinical data to identify drug repurposing candidates for hyperlipidemia and hypertension. Nat. Commun. 13, 46 (2022).

    Google Scholar 

  13. Huang, S., Wang, M., Zheng, X., Chen, J. & Tang, C. Hierarchical and dynamic graph attention network for Drug-Disease association prediction. IEEE J. Biomedical Health Inf. 28, 2416–2427 (2024).

    Google Scholar 

  14. Zhang, M. L. et al. RLFDDA: a meta-path based graph representation learning model for drug–disease association prediction. BMC Bioinform. 23, 516 (2022).

    Google Scholar 

  15. Sun, X., Jia, X., Lu, Z., Tang, J. & Li, M. Drug repositioning with adaptive graph convolutional networks. Bioinformatics 40, btad748 (2024).

    Google Scholar 

  16. Zhao, B. W. et al. Fusing higher and Lower-Order biological information for drug repositioning via graph representation learning. IEEE Trans. Emerg. Top. Comput. 12, 163–176 (2024).

    Google Scholar 

  17. Muzio, G., O’Bray, L. & Borgwardt, K. Biological network analysis with deep learning. Brief. Bioinform. 22, 1515–1530 (2020).

    Google Scholar 

  18. Shi, C., Li, Y., Zhang, J., Sun, Y. & Yu, P. S. A survey of heterogeneous information network analysis. IEEE Trans. Knowl. Data Eng. 29, 17–37 (2017).

    Google Scholar 

  19. Jiao, Q. et al. Springer International Publishing,. NSAP: A neighborhood subgraph aggregation method for drug–disease association prediction. In Proceedings of the International Conference on Intelligent Computing, 79–91 (2022).

  20. Zhou, R. et al. NEDD: a network embedding based method for predicting drug-disease associations. BMC Bioinform. 21, 387 (2020).

    Google Scholar 

  21. Fu, X., Zhang, J., Meng, Z. & King, I. M. A. G. N. N. Metapath Aggregated Graph Neural Network for Heterogeneous Graph Embedding. In Proceedings of The Web Conference 2020 (WWW ’20), 2331–2341 (Association for Computing Machinery, 2020).

  22. Zitnik, M., Sosić, R., Maheshwari, S. & Leskovec, J. BioSNAP Datasets: Stanford Biomedical Network Dataset Collection. (2018). Available at: http://snap.stanford.edu/biodata

  23. Davis, A. P. et al. Comparative toxicogenomics database (CTD): update 2023. Nucleic Acids Res. 51, D1257–D1262 (2023).

    Google Scholar 

  24. Thorn, C. F., Klein, T. E. & Altman, R. B. PharmGKB: the pharmacogenomics knowledge base. In Pharmacogenomics (eds. Innocenti, F. & van Schaik, R.) Methods Mol. Biol. 1015, 311–320 (2013).

  25. Mendez, D. et al. ChEMBL: towards direct deposition of bioassay data. Nucleic Acids Res. 47, D930–D940 (2019).

    Google Scholar 

  26. Knox, C. et al. DrugBank 6.0: the drugbank knowledgebase for 2024. Nucleic Acids Res. 52, D1265–D1275 (2024).

    Google Scholar 

  27. Jaccard, P. Nouvelles recherches Sur La distribution Florale. Bull. Soc. Vaud Sci. Nat. 44, 223–270 (1908).

    Google Scholar 

  28. Kuhn, M., Letunic, I., Jensen, L. J. & Bork, P. The SIDER database of drugs and side effects. Nucleic Acids Res. 44, D1075–D1079 (2016).

    Google Scholar 

  29. van Driel, M. A., Bruggeman, J., Vriend, G., Brunner, H. G. & Leunissen, J. A. M. A text-mining analysis of the human phenome. Eur. J. Hum. Genet. 14, 535–542 (2006).

    Google Scholar 

  30. Yu, G. Using meshes for mesh term enrichment and semantic analyses. Bioinformatics 34, 3766–3767 (2018).

    Google Scholar 

  31. Yu, G. et al. GOSemSim: an R package for measuring semantic similarity among GO terms and gene products. Bioinformatics 26, 976–978 (2010).

    Google Scholar 

  32. Zhang, S., Tong, H., Xu, J. & Maciejewski, R. Graph convolutional networks: a comprehensive review. Comput. Social Networks. 6, 1–23 (2019).

    Google Scholar 

  33. Breiman, L. Random forests. Mach. Learn. 45, 5–32 (2001).

    Google Scholar 

  34. Hearst, M. A., Dumais, S. T., Osuna, E., Platt, J. & Scholkopf, B. Support vector machines. IEEE Intell. Syst. Their Appl. 13, 18–28 (1998).

    Google Scholar 

  35. Kruse, R., Mostaghim, S., Borgelt, C., Braune, C. & Steinbrecher, M. Multi-layer perceptrons. In Computational Intelligence: A Methodological Introduction 53–124 (Springer International Publishing, 2022). https://doi.org/10.1007/978-3-030-42227-1_5.

    Google Scholar 

  36. Wang, X. et al. Heterogeneous graph attention network. In Proceedings of The Web Conference 2019 (WWW ’19), 2022–2032 (2019).

  37. Meng, Y. et al. A weighted bilinear neural collaborative filtering approach for drug repositioning. Brief. Bioinform. 23, bbab581 (2022).

  38. Li, Y. et al. A comparative benchmarking and evaluation framework for heterogeneous network-based drug repositioning methods. Brief. Bioinform. 25, 172 (2024).

    Google Scholar 

  39. Brown, J. C., Gerhardt, T. E. & Kwon, E. Risk factors for coronary artery disease (StatPearls [Internet]. StatPearls Publishing, 2023).

    Google Scholar 

  40. Tackling, G. & Borhade, M. B. Hypertensive heart disease. in StatPearls [Internet].StatPearls publishing (2023).

  41. Alfaddagh, A. et al. Inflammation and cardiovascular disease: from mechanisms to therapeutics. Am. J. Prev. Cardiol. 4, 100130 (2020).

    Google Scholar 

  42. McIver, L. A. & Siddique, M. S. Atorvastatin. StatPearls [Internet]. StatPearls Publishing, Treasure Island (FL) (2024).

    Google Scholar 

  43. Pireddu, R. et al. Pyridylthiazole-based Ureas as inhibitors of Rho associated protein kinases (ROCK1 and 2). Medchemcomm 3, 699–709 (2012).

    Google Scholar 

  44. Nohria, A. et al. Rho kinase Inhibition improves endothelial function in human subjects with coronary artery disease. Circ. Res. 99, 1426–1432 (2006).

    Google Scholar 

  45. Mohri, M., Shimokawa, H., Hirakawa, Y., Masumoto, A. & Takeshita, A. Rho-kinase Inhibition with intracoronary fasudil prevents myocardial ischemia in patients with coronary microvascular spasm. J. Am. Coll. Cardiol. 41, 15–19 (2003).

    Google Scholar 

  46. Loscalzo, J. Molecular interaction networks and drug development: novel approach to drug target identification and drug repositioning. FASEB J. 37, e22660 (2023).

    Google Scholar 

  47. Panwar, V. et al. Multifaceted role of mTOR (mammalian target of rapamycin) signaling pathway in human health and disease. Signal. Transduct. Target. Ther. 8, 375 (2023).

    Google Scholar 

  48. Chanan-Khan, A. A. et al. Pomalidomide: the new Immunomodulatory agent for the treatment of multiple myeloma. Blood Cancer J. 3, e143 (2013).

    Google Scholar 

  49. Chamberlain, P. P. et al. Evolution of Cereblon-Mediated protein degradation as a therapeutic modality. ACS Med. Chem. Lett. 10, 1592–1602 (2019).

    Google Scholar 

  50. Wiysonge, C. S., Volmink, J. & Opie, L. H. Beta-blockers and the treatment of hypertension: it is time to move on. Cardiovasc. J. Afr. 18, 351–352 (2007).

    Google Scholar 

  51. Knowlton, A. A. & Lee, A. R. Estrogen and the cardiovascular system. Pharmacol. Ther. 135, 54–70 (2012).

    Google Scholar 

  52. Ni, X. et al. β–estradiol alleviates hypertension– and Concanavalin A–mediated inflammatory responses via modulation of connexins in peripheral blood lymphocytes. Mol. Med. Rep. 19, 3743–3755 (2019).

    Google Scholar 

  53. Qiu, T. et al. Comorbidity of anxiety and hypertension: common risk factors and potential mechanisms. Int. J. Hypertens. 9619388 (2023).

Download references

Acknowledgements

The authors thank the referees for suggestions that helped improve the paper substantially.

Funding

This research received no external funding.

Author information

Authors and Affiliations

  1. School of Management, Shanxi Medical University, Jinzhong, 030600, Shanxi, China

    Shengnan Wu, Wen Wang, Huizhi Jiao, Danhong Dong, Kexin Zhang & Xuechen Luo

  2. Shanxi Key Laboratory of AI-Enhanced Clinical Decision Support, JinZhong, 030600, Shanxi, China

    Shengnan Wu, Wen Wang, Huizhi Jiao, Danhong Dong & Kexin Zhang

Authors
  1. Shengnan Wu
    View author publications

    Search author on:PubMed Google Scholar

  2. Wen Wang
    View author publications

    Search author on:PubMed Google Scholar

  3. Huizhi Jiao
    View author publications

    Search author on:PubMed Google Scholar

  4. Danhong Dong
    View author publications

    Search author on:PubMed Google Scholar

  5. Kexin Zhang
    View author publications

    Search author on:PubMed Google Scholar

  6. Xuechen Luo
    View author publications

    Search author on:PubMed Google Scholar

Contributions

Shengnan Wu and Wen Wang produced the main ideas, and did the modeling, computation and analysis and also wrote the manuscript. Huizhi Jiao, Danhong Dong, Kexin Zhang and Xuechen Luo provided supervision and effective scientific advice and related ideas, research design guidance, and added value to the article through editing and contributing completions. All authors contributed to the article and approved the submitted version.

Corresponding author

Correspondence to Shengnan Wu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary Material 1

Supplementary Material 2

Supplementary Material 3

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wu, S., Wang, W., Jiao, H. et al. Enhanced drug disease association prediction through multimodal data integration and meta path guided global local feature fusion. Sci Rep (2026). https://doi.org/10.1038/s41598-026-36223-9

Download citation

  • Received: 06 March 2025

  • Accepted: 10 January 2026

  • Published: 25 February 2026

  • DOI: https://doi.org/10.1038/s41598-026-36223-9

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Keywords

  • Drug-disease prediction
  • Heterogeneous network
  • Multi-modal data
  • Meta-path
  • Feature fusion
Download PDF

Advertisement

Explore content

  • Research articles
  • News & Comment
  • Collections
  • Subjects
  • Follow us on Facebook
  • Follow us on X
  • Sign up for alerts
  • RSS feed

About the journal

  • About Scientific Reports
  • Contact
  • Journal policies
  • Guide to referees
  • Calls for Papers
  • Editor's Choice
  • Journal highlights
  • Open Access Fees and Funding

Publish with us

  • For authors
  • Language editing services
  • Open access funding
  • Submit manuscript

Search

Advanced search

Quick links

  • Explore articles by subject
  • Find a job
  • Guide to authors
  • Editorial policies

Scientific Reports (Sci Rep)

ISSN 2045-2322 (online)

nature.com sitemap

About Nature Portfolio

  • About us
  • Press releases
  • Press office
  • Contact us

Discover content

  • Journals A-Z
  • Articles by subject
  • protocols.io
  • Nature Index

Publishing policies

  • Nature portfolio policies
  • Open access

Author & Researcher services

  • Reprints & permissions
  • Research data
  • Language editing
  • Scientific editing
  • Nature Masterclasses
  • Research Solutions

Libraries & institutions

  • Librarian service & tools
  • Librarian portal
  • Open research
  • Recommend to library

Advertising & partnerships

  • Advertising
  • Partnerships & Services
  • Media kits
  • Branded content

Professional development

  • Nature Awards
  • Nature Careers
  • Nature Conferences

Regional websites

  • Nature Africa
  • Nature China
  • Nature India
  • Nature Japan
  • Nature Middle East
  • Privacy Policy
  • Use of cookies
  • Legal notice
  • Accessibility statement
  • Terms & Conditions
  • Your US state privacy rights
Springer Nature

© 2026 Springer Nature Limited

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing