Abstract
Transcriptomic profiling of Traditional Chinese Medicine (TCM) perturbations is essential for elucidating the molecular mechanisms of therapeutic interventions. Although data from TCM treatment experiments are scattered across public repositories, a comprehensive, harmonized dataset remains unavailable due to heterogeneous experimental designs and inconsistent metadata. Here, we present a curated, harmonized resource comprising 362 human gene expression profiles derived from 27 TCMs and 137 TCM-derived ingredients spanning 26 human disease contexts, re-processed via a unified bioinformatics pipeline. This atlas captures TCM-induced genome-wide alterations in both protein-coding genes and long non-coding RNAs. We confirmed the dataset’s biological fidelity by validating the high reproducibility of the dataset, the enrichment of known pharmacological targets, and recapitulated the well-established therapeutic associations between TCM and disease treatment. This standardized dataset serves as a foundational resource for researchers to systematically investigate therapeutic mechanisms and predict clinical indications of TCM.
Similar content being viewed by others
Data availability
The harmonized gene-level expression matrices and associated metadata generated in this study are publicly available at Figshare: https://doi.org/10.6084/m9.figshare.31094347.
Code availability
The bioinformatic analyses were conducted using R statistical software (version 4.4.3). No custom algorithms or software were developed for this study; all analyses utilized standard functions from publicly available R packages. Data cleaning and manipulation were performed using dplyr (v1.1.4) and stringr (v1.6.0). Differential expression analysis was conducted using limma (v3.62.2). GO and KEGG pathway enrichment analyses, as well as Gene Set Enrichment Analysis (GSEA), were performed using clusterProfiler (v4.14.0). Visualizations were generated using ggplot2 (v4.0.0) and enrichplot (v1.26.1).
References
Wang, J., Wong, Y.-K. & Liao, F. What has traditional Chinese medicine delivered for modern medicine? Expert Reviews in Molecular Medicine 20, e4 (2018).
Zhang, X., Qiu, H., Li, C., Cai, P. & Qi, F. The positive role of traditional Chinese medicine as an adjunctive therapy for cancer. Bioscience trends 15, 283–298 (2021).
Gao, S. et al. Novel Natural Carrier‐Free Self‐Assembled Nanoparticles for Treatment of Ulcerative Colitis by Balancing Immune Microenvironment and Intestinal Barrier. Advanced healthcare materials 12, 2301826 (2023).
Guo, W. et al. Aloperine Suppresses Cancer Progression by Interacting with VPS4A to Inhibit Autophagosome‐lysosome Fusion in NSCLC. Advanced Science 11, 2308307 (2024).
Liu, X. et al. Natural medicines of targeted rheumatoid arthritis and its action mechanism. Frontiers in Immunology 13, 945129 (2022).
Huang, Z. et al. Artesunate inhibits the cell growth in colorectal cancer by promoting ROS-dependent cell senescence and autophagy. Cells 11, 2472 (2022).
Li, H. et al. Modulation the crosstalk between tumor-associated macrophages and non-small cell lung cancer to inhibit tumor migration and invasion by ginsenoside Rh2. BMC cancer 18, 579 (2018).
Guo, C. et al. Novel Chinese angelica polysaccharide biomimetic nanomedicine to curcumin delivery for hepatocellular carcinoma treatment and immunomodulatory effect. Phytomedicine 80, 153356 (2021).
Li, J. et al. Purification, structural characterization, and immunomodulatory activity of the polysaccharides from Ganoderma lucidum. International journal of biological macromolecules 143, 806–813 (2020).
Wang, K. et al. Inhibition of inflammation by berberine: Molecular mechanism and network pharmacology analysis. Phytomedicine 128, 155258 (2024).
Barrett, T. et al. NCBI GEO: archive for functional genomics data sets—update. Nucleic acids research 41, D991–D995 (2012).
Zhao, H. et al. So3D: a comprehensive three-dimensional spatial omics resource for decoding tissue architecture in physiology and disease. Nucleic Acids Research 54, D1281–D1290 (2026).
Zhao, H. et al. LncTarD 2.0: an updated comprehensive database for experimentally-supported functional lncRNA–target regulations in human diseases. Nucleic acids research 51, D199–D207 (2023).
Zhao, H. et al. An in-depth transcriptomic atlas deciphering traditional Chinese medicine mechanisms and disease associations. figshare https://doi.org/10.6084/m9.figshare.31094347.v6 (2026).
Cancer Genome Atlas Research Network, J. The cancer genome atlas pan-cancer analysis project. Nat. Genet 45, 1113–1120 (2013).
Liang, Y. et al. Astragali Radix-Curcumae Rhizoma normalizes tumor blood vessels by HIF-1α to anti-tumor metastasis in colon cancer. Phytomedicine 140, 156562 (2025).
Li, X. et al. Gypenoside‐induced apoptosis via the PI3K/AKT/mTOR signaling pathway in bladder cancer. BioMed Research International 2022, 9304552 (2022).
Long, F., Wang, P., Ma, Y., Zhang, X. & Wang, T. Chemopreventive effects of atractylenolide-III on mammary tumorigenesis via activation of the Nrf2/ARE pathway through autophagic degradation of Keap1. Biomedicine & Pharmacotherapy 176, 116852 (2024).
Xu, H. et al. Atractylenolide‐1 affects glycolysis/gluconeogenesis by downregulating the expression of TPI1 and GPI to inhibit the proliferation and invasion of human triple‐negative breast cancer cells. Phytotherapy Research 37, 820–833 (2023).
Nguyen, L. T. H., Nguyen, N. P. K., Tran, K. N., Shin, H.-M. & Yang, I.-J. Network Pharmacology and Experimental Validation to Investigate the Antidepressant Potential of Atractylodes lancea (Thunb.) DC. Life 12, 1925 (2022).
Acknowledgements
We are grateful to all contributors to this study and acknowledge the funding sources that provided financial support. This work was supported by the National Natural Science Foundation of China (62372144, 62573169, 62572155) and Outstanding Youth Foundation of Heilongjiang Province of China (YQ2023F004).
Author information
Authors and Affiliations
Contributions
H.Y.Z.: Study conception and design, methodological design, data processing and analysis, manuscript drafting and revision. P.Q.B: Data processing and analysis, figure and table visualization, manuscript drafting and revision. Z.M.L.: Data collection and collation. M.R.G.: Data processing. L.L.: Data collection and collation. D.C.H.: Data processing. J.C.G.: Study conception and design. L.W.: Study conception and design, methodological design, resource provision and support, supervision and guidance.All authors read, reviewed, and approved the final manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Zhao, H., Ben, P., Liu, Z. et al. An in-depth transcriptomic atlas deciphering traditional Chinese medicine mechanisms and disease associations. Sci Data (2026). https://doi.org/10.1038/s41597-026-06988-9
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41597-026-06988-9


