Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Advertisement

Scientific Data
  • View all journals
  • Search
  • My Account Login
  • Content Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • RSS feed
  1. nature
  2. scientific data
  3. data descriptors
  4. article
An in-depth transcriptomic atlas deciphering traditional Chinese medicine mechanisms and disease associations
Download PDF
Download PDF
  • Data Descriptor
  • Open access
  • Published: 05 March 2026

An in-depth transcriptomic atlas deciphering traditional Chinese medicine mechanisms and disease associations

  • Hongying Zhao1 na1,
  • Peiqi Ben1 na1,
  • Zhimiao Liu1 na1,
  • Marui Guan1,
  • Lin Lin1,
  • Dongchen Han2,
  • Jincheng Guo2 &
  • …
  • Li Wang1 

Scientific Data , Article number:  (2026) Cite this article

  • 720 Accesses

  • Metrics details

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Abstract

Transcriptomic profiling of Traditional Chinese Medicine (TCM) perturbations is essential for elucidating the molecular mechanisms of therapeutic interventions. Although data from TCM treatment experiments are scattered across public repositories, a comprehensive, harmonized dataset remains unavailable due to heterogeneous experimental designs and inconsistent metadata. Here, we present a curated, harmonized resource comprising 362 human gene expression profiles derived from 27 TCMs and 137 TCM-derived ingredients spanning 26 human disease contexts, re-processed via a unified bioinformatics pipeline. This atlas captures TCM-induced genome-wide alterations in both protein-coding genes and long non-coding RNAs. We confirmed the dataset’s biological fidelity by validating the high reproducibility of the dataset, the enrichment of known pharmacological targets, and recapitulated the well-established therapeutic associations between TCM and disease treatment. This standardized dataset serves as a foundational resource for researchers to systematically investigate therapeutic mechanisms and predict clinical indications of TCM.

Similar content being viewed by others

TCMEval-SDT: a benchmark dataset for syndrome differentiation thought of traditional Chinese medicine

Article Open access 13 March 2025

Integrating transcriptomic data with a novel drug efficacy prediction model for TCM active compound discovery

Article Open access 05 March 2025

The current situation and factors influencing the use of traditional Chinese medicine therapies among patients with chronic disease in china: a cross-sectional study

Article Open access 22 August 2025

Data availability

The harmonized gene-level expression matrices and associated metadata generated in this study are publicly available at Figshare: https://doi.org/10.6084/m9.figshare.31094347.

Code availability

The bioinformatic analyses were conducted using R statistical software (version 4.4.3). No custom algorithms or software were developed for this study; all analyses utilized standard functions from publicly available R packages. Data cleaning and manipulation were performed using dplyr (v1.1.4) and stringr (v1.6.0). Differential expression analysis was conducted using limma (v3.62.2). GO and KEGG pathway enrichment analyses, as well as Gene Set Enrichment Analysis (GSEA), were performed using clusterProfiler (v4.14.0). Visualizations were generated using ggplot2 (v4.0.0) and enrichplot (v1.26.1).

References

  1. Wang, J., Wong, Y.-K. & Liao, F. What has traditional Chinese medicine delivered for modern medicine? Expert Reviews in Molecular Medicine 20, e4 (2018).

    Google Scholar 

  2. Zhang, X., Qiu, H., Li, C., Cai, P. & Qi, F. The positive role of traditional Chinese medicine as an adjunctive therapy for cancer. Bioscience trends 15, 283–298 (2021).

    Google Scholar 

  3. Gao, S. et al. Novel Natural Carrier‐Free Self‐Assembled Nanoparticles for Treatment of Ulcerative Colitis by Balancing Immune Microenvironment and Intestinal Barrier. Advanced healthcare materials 12, 2301826 (2023).

    Google Scholar 

  4. Guo, W. et al. Aloperine Suppresses Cancer Progression by Interacting with VPS4A to Inhibit Autophagosome‐lysosome Fusion in NSCLC. Advanced Science 11, 2308307 (2024).

    Google Scholar 

  5. Liu, X. et al. Natural medicines of targeted rheumatoid arthritis and its action mechanism. Frontiers in Immunology 13, 945129 (2022).

    Google Scholar 

  6. Huang, Z. et al. Artesunate inhibits the cell growth in colorectal cancer by promoting ROS-dependent cell senescence and autophagy. Cells 11, 2472 (2022).

    Google Scholar 

  7. Li, H. et al. Modulation the crosstalk between tumor-associated macrophages and non-small cell lung cancer to inhibit tumor migration and invasion by ginsenoside Rh2. BMC cancer 18, 579 (2018).

    Google Scholar 

  8. Guo, C. et al. Novel Chinese angelica polysaccharide biomimetic nanomedicine to curcumin delivery for hepatocellular carcinoma treatment and immunomodulatory effect. Phytomedicine 80, 153356 (2021).

    Google Scholar 

  9. Li, J. et al. Purification, structural characterization, and immunomodulatory activity of the polysaccharides from Ganoderma lucidum. International journal of biological macromolecules 143, 806–813 (2020).

    Google Scholar 

  10. Wang, K. et al. Inhibition of inflammation by berberine: Molecular mechanism and network pharmacology analysis. Phytomedicine 128, 155258 (2024).

    Google Scholar 

  11. Barrett, T. et al. NCBI GEO: archive for functional genomics data sets—update. Nucleic acids research 41, D991–D995 (2012).

    Google Scholar 

  12. Zhao, H. et al. So3D: a comprehensive three-dimensional spatial omics resource for decoding tissue architecture in physiology and disease. Nucleic Acids Research 54, D1281–D1290 (2026).

    Google Scholar 

  13. Zhao, H. et al. LncTarD 2.0: an updated comprehensive database for experimentally-supported functional lncRNA–target regulations in human diseases. Nucleic acids research 51, D199–D207 (2023).

    Google Scholar 

  14. Zhao, H. et al. An in-depth transcriptomic atlas deciphering traditional Chinese medicine mechanisms and disease associations. figshare https://doi.org/10.6084/m9.figshare.31094347.v6 (2026).

  15. Cancer Genome Atlas Research Network, J. The cancer genome atlas pan-cancer analysis project. Nat. Genet 45, 1113–1120 (2013).

    Google Scholar 

  16. Liang, Y. et al. Astragali Radix-Curcumae Rhizoma normalizes tumor blood vessels by HIF-1α to anti-tumor metastasis in colon cancer. Phytomedicine 140, 156562 (2025).

    Google Scholar 

  17. Li, X. et al. Gypenoside‐induced apoptosis via the PI3K/AKT/mTOR signaling pathway in bladder cancer. BioMed Research International 2022, 9304552 (2022).

    Google Scholar 

  18. Long, F., Wang, P., Ma, Y., Zhang, X. & Wang, T. Chemopreventive effects of atractylenolide-III on mammary tumorigenesis via activation of the Nrf2/ARE pathway through autophagic degradation of Keap1. Biomedicine & Pharmacotherapy 176, 116852 (2024).

    Google Scholar 

  19. Xu, H. et al. Atractylenolide‐1 affects glycolysis/gluconeogenesis by downregulating the expression of TPI1 and GPI to inhibit the proliferation and invasion of human triple‐negative breast cancer cells. Phytotherapy Research 37, 820–833 (2023).

    Google Scholar 

  20. Nguyen, L. T. H., Nguyen, N. P. K., Tran, K. N., Shin, H.-M. & Yang, I.-J. Network Pharmacology and Experimental Validation to Investigate the Antidepressant Potential of Atractylodes lancea (Thunb.) DC. Life 12, 1925 (2022).

    Google Scholar 

Download references

Acknowledgements

We are grateful to all contributors to this study and acknowledge the funding sources that provided financial support. This work was supported by the National Natural Science Foundation of China (62372144, 62573169, 62572155) and Outstanding Youth Foundation of Heilongjiang Province of China (YQ2023F004).

Author information

Author notes
  1. These authors contributed equally: Hongying Zhao, Peiqi Ben, Zhimiao Liu.

Authors and Affiliations

  1. College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, China

    Hongying Zhao, Peiqi Ben, Zhimiao Liu, Marui Guan, Lin Lin & Li Wang

  2. School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing, 100029, China

    Dongchen Han & Jincheng Guo

Authors
  1. Hongying Zhao
    View author publications

    Search author on:PubMed Google Scholar

  2. Peiqi Ben
    View author publications

    Search author on:PubMed Google Scholar

  3. Zhimiao Liu
    View author publications

    Search author on:PubMed Google Scholar

  4. Marui Guan
    View author publications

    Search author on:PubMed Google Scholar

  5. Lin Lin
    View author publications

    Search author on:PubMed Google Scholar

  6. Dongchen Han
    View author publications

    Search author on:PubMed Google Scholar

  7. Jincheng Guo
    View author publications

    Search author on:PubMed Google Scholar

  8. Li Wang
    View author publications

    Search author on:PubMed Google Scholar

Contributions

H.Y.Z.: Study conception and design, methodological design, data processing and analysis, manuscript drafting and revision. P.Q.B: Data processing and analysis, figure and table visualization, manuscript drafting and revision. Z.M.L.: Data collection and collation. M.R.G.: Data processing. L.L.: Data collection and collation. D.C.H.: Data processing. J.C.G.: Study conception and design. L.W.: Study conception and design, methodological design, resource provision and support, supervision and guidance.All authors read, reviewed, and approved the final manuscript.

Corresponding authors

Correspondence to Hongying Zhao, Jincheng Guo or Li Wang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhao, H., Ben, P., Liu, Z. et al. An in-depth transcriptomic atlas deciphering traditional Chinese medicine mechanisms and disease associations. Sci Data (2026). https://doi.org/10.1038/s41597-026-06988-9

Download citation

  • Received: 13 November 2025

  • Accepted: 28 February 2026

  • Published: 05 March 2026

  • DOI: https://doi.org/10.1038/s41597-026-06988-9

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Download PDF

Advertisement

Explore content

  • Research articles
  • News & Comment
  • Collections
  • Follow us on X
  • Sign up for alerts
  • RSS feed

About the journal

  • Aims and scope
  • Editors & Editorial Board
  • Journal Metrics
  • Policies
  • Open Access Fees and Funding
  • Calls for Papers
  • Contact

Publish with us

  • Submission Guidelines
  • Language editing services
  • Open access funding
  • Submit manuscript

Search

Advanced search

Quick links

  • Explore articles by subject
  • Find a job
  • Guide to authors
  • Editorial policies

Scientific Data (Sci Data)

ISSN 2052-4463 (online)

nature.com footer links

About Nature Portfolio

  • About us
  • Press releases
  • Press office
  • Contact us

Discover content

  • Journals A-Z
  • Articles by subject
  • protocols.io
  • Nature Index

Publishing policies

  • Nature portfolio policies
  • Open access

Author & Researcher services

  • Reprints & permissions
  • Research data
  • Language editing
  • Scientific editing
  • Nature Masterclasses
  • Research Solutions

Libraries & institutions

  • Librarian service & tools
  • Librarian portal
  • Open research
  • Recommend to library

Advertising & partnerships

  • Advertising
  • Partnerships & Services
  • Media kits
  • Branded content

Professional development

  • Nature Awards
  • Nature Careers
  • Nature Conferences

Regional websites

  • Nature Africa
  • Nature China
  • Nature India
  • Nature Japan
  • Nature Middle East
  • Privacy Policy
  • Use of cookies
  • Legal notice
  • Accessibility statement
  • Terms & Conditions
  • Your US state privacy rights
Springer Nature

© 2026 Springer Nature Limited

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing