Abstract
Tea, derived from the leaves of Camellia sinensis, is a globally consumed beverage with considerable nutritional and economic value. Specific cultivars exhibit a striking purple leaf coloration due to anthocyanin accumulation, yet the molecular mechanisms governing this trait remain incompletely understood. In this study, we identified a sense-intronic long non-coding RNA, Cs_lncRNA.18443.6, that is co-expressed with CsUFGT (UDP-glucose: flavonoid 3-O-glucosyltransferase) and is predicted to act in cis on this gene. Together with the transcription factor CsMYB12, these components form a hypothesized three-tier regulatory module that contributes to anthocyanin accumulation in purple tea leaves. CsUFGT emerges as a potential regulatory hub in anthocyanin biosynthesis. Weighted gene co-expression network analysis (WGCNA), combined with the construction of a competing endogenous RNA network construction reveals Cs_lncRNA.18443.6 as a cis-acting lncRNA associated with CsUFGT expression. This association was supported by RNA fluorescence in situ hybridization (FISH), transient expression assays in transgenic tobacco, and RT-qPCR analysis. Dual-luciferase reporter assays provided preliminary evidence that Cs_lncRNA.18443.6 influences CsUFGT transcription by affecting CsMYB12-dependent promoter activation. These findings uncover a previously uncharacterized lncRNA association with anthocyanin biosynthesis and offer new hypotheses and provide candidate targets for the molecular breeding of anthocyanin-enriched tea cultivars.
Similar content being viewed by others
Data availability
All the data used in this research have been deposited to the National Center for Biotechnology Information (NCBI) under the BioProject accession number PRJNA1019822 and BioSample: SAMN37497780. All Supplementary Tables are found in Supplementary Data 1. The numerical source data (Figs. 6, 8d, h–k) for the graphs can be found in Supplementary Data 2. Supplementary Fig. 1, Data 1 and 2 are available online.
Code availability
The main code sources for the prediction of lncRNA used in this study are available at Zenodo: https://zenodo.org/records/10976644.
References
Xia, E. H. et al. The tea tree genome provides insights into tea flavor and independent evolution of caffeine biosynthesis. Mol. Plant 10, 866–877 (2017).
Liao, Y., Smyth, G. & Shi, W. FeatureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics 30, 923 (2014).
Zeng, L. T., Watanabe, N. & Yang, Z. Y. Understanding the biosyntheses and stress response mechanisms of aroma compounds in tea (Camellia sinensis) to safely and effectively improve tea aroma. Crit. Rev. Food Sci. Nutr. 59, 2321–2334 (2019).
Grand View Research. Tea Market Size and Outlook: Horizon Databooks. https://www.grandviewresearch.com/horizon/outlook/tea-market-size/global (Grand View Research, Inc., 2024).
Li, G. T. et al. Research progress of Yunnan endemic tea variety-zijuan. China Tea 35, 10–12 (2013).
Li, X. X. et al. Anthocyanin metabolism and its differential regulation in purple tea (Camellia sinensis). Plant Physiol. Biochem. 201, 107875 (2023).
Wang, K. R., Liang, Y. R., Li, M. & Zhang, L. J. Progress in the development of germplasm resources for albino and purple tea cultivars. China Tea Processing 3, 5–8 (2015).
Ji, Q. Y., Liu, Y., Zhou, H. J., Shao, J. N. & He, W. Z. Biochemical component analysis and germplasm selection of tea tree germplasm resources with reddish violet leaves. Jiangsu Agric. Sci. 50, 110–116 (2022).
Mozos, I. et al. Effects of anthocyanins on vascular health. Biomolecules 11, 811 (2021).
Nistor, M. et al. Anthocyanins as key phytochemicals acting for the prevention of metabolic diseases: an overview. Molecules 27, 4254 (2022).
Zhang, W. et al. Genome assembly of wild tea tree DASZ reveals pedigree and selection history of tea varieties. Nat. Commun. 11, 3719 (2020).
Ferrer, J. L., Austin, M. B., Stewart, C. Jr & Noel, J. P. Structure and function of enzymes involved in the biosynthesis of phenylpropanoids. Plant Physiol. Biochem. 46, 356–370 (2008).
Kanehisa, M., Furumichi, M., Sato, Y., Ishiguro-Watanabe, M. & Tanabe, M. KEGG: integrating viruses and cellular organisms. Nucleic Acids Res. 49, D545–D551 (2021).
Lai, Y. S. et al. The dark-purple tea cultivar ‘ziyan’ accumulates a large amount of delphinidin-related anthocyanins. J. Agric. Food Chem. 64, 2719–2726 (2016).
Kumari, M. et al. Regulation of color transition in purple tea (Camellia sinensis). Planta 251, 35 (2020).
Li, F. et al. Integrated transcriptome and metabolome provide insights into flavonoid biosynthesis in ‘P113’, a new purple tea of Camellia tachangensis. Beverage Plant Res. 3, 3 (2023). 14.
Amaral, P. P., Clark, M. B., Gascoigne, D. K., Dinger, M. E. & Mattick, J. S. lncRNAdb: a reference database for long noncoding RNAs. Nucleic Acids Res. 39, D146–D151 (2011).
Zhou, C. Z. et al. Hidden players in the regulation of secondary metabolism in tea plant: focus on non-coding RNAs. Beverage Plant Res. 2, 19 (2022).
Palos, K., Yu, L., Railey, C. E., Nelson Dittrich, A. C. & Nelson, A. D. L. Linking discoveries, mechanisms, and technologies to develop a clearer perspective on plant long noncoding RNAs. Plant Cell 35, 1762–1786 (2023).
Rinn, J. L. & Chang, H. Y. Genome regulation by long noncoding RNAs. Annu. Rev. Biochem. 81, 145–166 (2012).
Ma, L., Bajic, V. B. & Zhang, Z. On the classification of long non-coding RNAs. RNA Biol. 10, 925–933 (2013).
Chen, S., Zhou, Y., Chen, Y. & Gu, J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34, i884–i890 (2018).
Zhang, X. et al. Haplotype-resolved genome assembly provides insights into evolutionary history of the tea plant Camellia sinensis. Nat. Genet. 53, 1250–1259 (2021).
Kim, D., Paggi, J. M., Park, C., Bennett, C. & Salzberg, S. L. Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nat. Biotechnol. 37, 907–915 (2019).
Pertea, M. et al. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat. Biotechnol. 33, 290–295 (2015).
Sun, L. et al. Utilizing sequence intrinsic composition to classify protein-coding and long non-coding transcripts. Nucleic Acids Res. 41, e166 (2013).
Li, A., Zhang, J. & Zhou, Z. PLEK: a tool for predicting long non-coding RNAs and messenger RNAs based on an improved k-mer scheme. BMC Bioinforma. 15, 311 (2014).
Kang, Y. J. et al. CPC2: a fast and accurate coding potential calculator based on sequence intrinsic features. Nucleic Acids Res. 45, W12–W16 (2017).
Wucher, V. et al. FEELnc: a tool for long non-coding RNA annotation and its application to the dog transcriptome. Nucleic Acids Res. 45, e57 (2017).
Buchfink, B., Reuter, K. & Drost, H. G. Sensitive protein alignments at tree-of-life scale using DIAMOND. Nat. Methods 18, 366–368 (2021).
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 550 (2014).
Huerta-Cepas, J. et al. eggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses. Nucleic Acids Res. 47, D309–D314 (2019).
Ma, X. et al. TarHunter, a tool for predicting conserved microRNA targets and target mimics in plants. Bioinformatics 34, 1574–1576 (2018).
Langfelder, P. & Horvath, S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinforma. 9, 559 (2008).
Shannon, P. et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504 (2003).
Sun et al. Reference genes for real-time fluorescence quantitative PCR in Camellia sinensis. Chin. Bull. Bot. 45, 579–587 (2010).
Livak, K. J. & Schmittgen, T. D. Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) Method. Methods 25, 402–408 (2001).
Hartley, J. L., Temple, G. F. & Brasch, M. A. DNA cloning using in vitro site-specific recombination. Genome Res. 10, 1788–1795 (2000).
Cui, L. et al. Identification of UDP-glycosyltransferases involved in the biosynthesis of astringent taste compounds in tea (Camellia sinensis). J. Exp. Bot. 67, 2285–2297 (2016).
He, X. J. et al. Isolation and characterization of key genes that promote flavonoid accumulation in purple-leaf tea (Camellia sinensis L. Sci. Rep. 8, 130 (2018).
Shen, J. et al. Metabolic analyses reveal different mechanisms of leaf color change in two purple-leaf tea plant (Camellia sinensis L.) cultivars. Horticulture Res. 5, 7 (2018).
Cai, J. et al. Integrative analysis of metabolomics and transcriptomics reveals molecular mechanisms of anthocyanin metabolism in the Zikui tea plant (Camellia sinensis cv. Zikui). Int. J. Mol. Sci. 23, 4780 (2022).
Liu, S. J. et al. CRISPRi-based genome-scale identification of functional long noncoding RNA loci in human cells. Science 355, aah7111 (2017).
Varshney, D. et al. Tissue specific long non-coding RNAs are involved in aroma formation of black tea. Ind. Crops Products 133, 79–89 (2019).
Zhu, C. et al. Transcriptome and phytochemical analyses provide new insights into long non-coding RNAs modulating characteristic secondary metabolites of oolong tea (Camellia sinensis) in solar-withering. Front. Plant Sci. 10, 1638 (2019).
Griffiths-Jones, S., Bateman, A., Marshall, M., Khanna, A. & Eddy, S. R. Rfam: an RNA family database. Nucleic Acids Res. 31, 439–441 (2003).
Finn, R. D. et al. Pfam: the protein families database. Nucleic Acids Res. 42, D222–D230 (2014).
Liu, J. et al. Genome-wide analysis uncovers regulation of long intergenic noncoding RNAs in Arabidopsis. Plant Cell 24, 4333–4345 (2012).
Mattick, J. S. et al. Long non-coding RNAs: definitions, functions, challenges and recommendations. Nat. Rev. Mol. Cell Biol. 24, 430–447 (2023).
Gil, N. & Ulitsky, I. Regulation of gene expression by cis-acting long non-coding RNAs. Nat. Rev. Genet. 21, 102–117 (2020).
Zhou, C. Z. et al. Establishment of an efficient in planta transformation method for Camellia sinensis. Biotechnol. Bull. 38, 263–265 (2022).
Ulitsky, I. Evolution to the rescue: using comparative genomics to understand long non-coding RNAs. Nat. Rev. Genet. 17, 601–614 (2016).
Acknowledgements
This work was supported by the National Guidance Foundation for Local Science and Technology Development of China (2023-009), the National Natural Science Foundation of China (32260086), China Scholarship Council. Grant (No. 202108525029) and the Cultivation Project of Guizhou University (Gzu.2020 No.65).
Author information
Authors and Affiliations
Contributions
S.N. and B.X. conceived the project and designed the study; Q.S. and L.Z. collected and raised the plants; Q.L. and Y.Y. sampled the materials; B.X., Q.L., and Y.Y. performed the formal experiments; B.W., Q.L., and Y.Y. performed the transgenic tobacco experiments; B.X. and L.Z. performed the bioinformatic data analysis; B.X., Q.L., and Y.Y. performed the formal analysis and data curation; B.X. and Q.L. designed and visualized the tables and figures; B.X. and L.Z. wrote the first manuscript; Q.C. provided suggestions and facilities and edited and revised the manuscript; S.N., Q.C., and B.X. supervised the entire project; All authors read and approved the final manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Communications Biology thanks Zhimin Qiu and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editors: Kaliya Georgieva.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Xiong, B., Zhang, L., Li, Q. et al. A long noncoding RNA modulates anthocyanin biosynthesis in Camellia sinensis. Commun Biol (2026). https://doi.org/10.1038/s42003-026-09785-7
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s42003-026-09785-7


