Abstract
Bioactive natural C-glycosides are rare and chemical C-glycosylation faces challenges while enzymatic C-glycosylation catalyzed by C-glycosyltransferases provides an alternative way. However, only a small number of C-glycosyltransferases have been found, and most of the discovered C-glycosyltransferases prefer to glycosylate phenols with an acyl side chain. Here, a promiscuous C-glycosyltransferase, AbCGT, which is capable of C-glycosylating scaffolds lacking acyl groups, is identified from Aloe barbadensis. Based on the substrate promiscuity of AbCGT, 16 C-glycosides with inhibitory activity against sodium-dependent glucose transporters 2 are chemo-enzymatically synthesized. The C-glycoside 46a shows hypoglycemic activity in diabetic mice and is biosynthesized with a cumulative yield on the 3.95āgāLā1 scale. In addition, the key residues involved in the catalytic selectivity of AbCGT are explored. These findings suggest that AbCGT is a powerful tool in the synthesis of lead compounds for drug discovery and an example for engineering the catalytic selectivity of C-glycosyltransferases.
Similar content being viewed by others
Introduction
Sugar moieties attached to aglycons through O-, S-, N-, and C-glycosidic bonds are important for the pharmacological properties and physiological activities of many bioactive compounds1,2,3. Compared with O-, S-, N-glycosides, C-glycosides not only maintain the pharmacological properties but also are remarkably more stable, as their CāC bonds are resistant to glycosidase or acid hydrolysis4,5,6,7. The most widely known bioactive C-glycoside is dapagliflozin, the first global hypoglycemic drug as a sodium-dependent glucose cotransporter 2 (SGLT2) inhibitor. Dapagliflozin was developed from the natural O-glycoside phlorizin as the lead compound. Compared with phlorizin, the C-glycoside dapagliflozin exhibits not only higher inhibitory activity but also better selectivity toward SGLT2 and stability in vivo8,9. However, natural C-glycosides are comparatively rare, and chemical C-glycosylation is restricted by challenges, including poor regio- and stereoselectivity as well as the tedious protection and deprotection of functional groups10,11,12. Although a palladium-catalyzed C-glycosylation method was recently developed13, the efficient and environment-friendly synthesis of bioactive C-glycosides remains challenging. By contrast, enzymatic C-glycosylation catalyzed by specific C-glycosyltransferases (CGTs) can alleviate these disadvantages and is considered to be a prospective approach to synthesize C-glycosides14,15,16.
During the past decades, microbial CGTs have attracted considerable interest, and great progress has been achieved in identification of CGTs from bacteria17,18,19,20,21. However, studies on plant CGTs were rather limited until the year of 200922,23. OsCGT was the first flavonoid CGT identified from Oryza sativa, which initiated the exploration of plant CGTs. From 2009 to 2020, about ten plant CGTs were identified15,23,24,25,26,27,28,29,30,31. Recently, a highly promiscuous CGT (GgCGT) with C-, O-, N-, and S-glycosylation activity was identified from Glycyrrhiza glabra32. GgCGT could efficiently catalyze C-glycosylation of substrates containing a flopropione unit. However, these discovered CGTs function primarily on C-glycosylating flavonoids23,24,25,26,27,28,29,30 and other phenols with acyl groups15,31,32. In our previous work, although the substrate scopes of the mutated variants of a Mangifera indica CGT were expanded, the structural diversity of substrates is still limited, for example, an acyl substituent of phenolic acceptors is necessary for C-glycosylation31. Therefore, mining CGTs with catalytic stereospecificity and promiscuity toward other structurally different substrates and further establishing a green and economical biosynthesis route for bioactive C-glycosides are significant for drug design and discovery.
Aloe barbadensis is a traditional Chinese medicinal herb, and two types of bioactive C-glycosides, namely, aloin and aloesin (rare C-glycoside derivatives of anthraquinones and chromones, respectively), have been isolated from its leaves33,34,35. However, little is known about the corresponding CGTs, which prompted us to search for CGTs with specific catalytic properties. Herein, we report the identification of a CGT, designated AbCGT, from A. barbadensis. The substrate promiscuity and catalytic specificity of AbCGT were systematically investigated. Most importantly, the unusual catalytic property of AbCGT was further used to design and economically synthesize a series of unnatural C-glycosides with the aim of finding SGLT2 inhibitors.
Results
Cloning and identification of the C-glycosyltransferase gene
The large amount of C-glycosides in Aloe barbadensis suggest the high expression level of the corresponding CGT genes. According to the transcriptome data, 30 candidate CGT genes (AbGT1ā30) with high expression levels were selected. Subsequently, a neighbor-joining phylogenetic tree was constructed to analyze the evolutionary relationships of AbGT1ā30 with GTs from different phyla (Supplementary Fig.Ā 1). Intriguingly, AbGT27 and other plant CGTs were grouped into a single clade, which was clearly divergent from the OGTs. Thus, the full-length cDNA of AbGT27 was cloned and expressed in Transetta (DE3) Escherichia coli. The catalytic activity of AbGT27 was screened with phloracetophene (11) and phloretin (17), which are commonly accepted substrates for plant CGTs, and uridine 5ā²-diphosphoglucose (UDP-Glc). AbGT27 exhibited high glycosylation activity (conversion rates 100%) to phloracetophene (11) and phloretin (17) generating 11aa and 17aa, respectively (Supplementary Figs.Ā 2ā3). C-glycosylation can be distinguished from other types of glycosylation by MS/MS. The characteristic MS/MS fragment ions of C-glucosides are [MāHā120]ā and [MāHā90]ā, while that of O-, S-, and N-glucoside is [MāHā162]ā/[Mā+āHā162]+23,36,37. Therefore, the fragment ions, [MāHā240]ā and [MāHā180]ā, of 11aa and 17aa suggest the production of di-C-glucosides. AbGT27 was also able to catalyze mono-C-glucosylation. The ratio of mono- and di-C-glycosylated products was positively correlated with the molar ratio of the sugar acceptor and donor. If insufficient sugar donors (such as acceptor:donorā=ā1:0.5) were provided, only mono-C-glycoside (17a) was generated (Supplementary Fig.Ā 4). However, when excess sugar donors (such as acceptor:donorā=ā1:2.5) were provided, the acceptor (17) was di-glycosylated, and only di-C-glycoside (17aa, in 100% yield) and its spontaneously oxidative derivative (17o) were generated. When the molar ratio of the sugar acceptor and donor was 1:1, both mono- and di-C-glycoside were observed with 52 and 22% yields, respectively. Therefore, mono- and di-C-glycosylated products of phloracetophene (11) and phloretin (17) were prepared with different ratios of substrates, and their structures were further identified as 3-C-β-D-glucosides (11a and 17a) and 3,5-di-C-β-D-glucosides (11aa and 17aa) by MS, 1H and 13C NMR, respectively. This catalytic property can be used to generate C-glycosides with one or two C-sugar moieties by adjusting the ratio of acceptor and donor substrates.
The cDNA sequence of AbGT27 (1413ābp, GenBank accession number MN747045) contained an ORF encoding 470 amino acids and showed the highest identity (56%) to an unidentified GT from Phoenix dactylifera38. AbGT27 was purified to >90% homogeneity, and its native function still remains unknown as its failure to accept hypothetical substrates (s1ās6) designed based on the proposed biosynthetic pathways of aloin and aloesin (Supplementary Figs.Ā 5ā6). Therefore, AbGT27 was tentatively assigned as a CGT designated AbCGT.
Generally, most of the known GTs, including OGTs and CGTs, exhibit the highest catalytic activity at pH 7.0ā9.0 and lower catalytic activity at pH >10.015,23,24,25,26,27,28,29,30,31,32. However, the maximum catalytic activity of AbCGT was observed at pH 11.0 (Supplementary Fig.Ā 7a). C-glycosylation catalyzed by CGTs is achieved through direct nucleophilic displacement at the anomeric carbon by an aromatic carbanion, suggesting that the C-glycosylation is a FriedelāCrafts-like reaction39. Formation of the CāC-glycosidic bond is positively correlated with the electronic density of the aromatic carbon. At an alkaline environment (pH 11.0), the electronic density of the aromatic carbon is further improved due to the salification of aromatic hydroxyls. Therefore, with the increasing pH of the reaction buffer from 6.0 to 11.0, the yield of mono-glycosylated product (17a) was firstly increasing and then decreasing due to the second glycosylation generating the di-glycosylated product (17aa). This is the only CGT ever known with maximum catalytic activity in such an alkaline environment. For the effects of temperatures, the highest conversion rate was observed at 45ā°C (Supplementary Fig.Ā 7b). Mn2+, Mg2+, Ca2+, and Ba2+ greatly enhanced the catalytic activity of AbCGT (Supplementary Fig.Ā 7c). This enhanced catalytic activity was observed with short reaction time (10āmin), as longer-time incubation resulted in the same conversion rates. The kinetic parameters of AbCGT with phloracetophene (11) and phloretin (17) were also determined (Supplementary Fig.Ā 8). The high catalytic efficiency of AbCGT was represented by the high Kcat values for phloracetophene (144āminā1) and phloretin (120āminā1).
Investigation of the substrate promiscuity of AbCGT
Plant CGTs usually recognize only substrates with the basic core structure of acyl phloroglucinol, which possesses three hydroxyl groups and one acyl group15,23,24,25,26,29,31,32. Thus, to explore the effects of the number and position of phenolic hydroxyl groups on the catalytic activity of AbCGT, an acceptor library containing acyl phenols (1ā3), acyl resorcinols (4ā9), acyl phloroglucinols (10ā19 and 23), phloroglucinol derivatives (20ā22) without basic acyl groups, and coumarins (24ā26) was used for the reactions (Fig.Ā 1). To probe the sugar tolerance of AbCGT, six different NDP sugars were also used as sugar donors (Fig.Ā 1).
a Percent conversion of each aglycon acceptor with AbCGT. The aglycon acceptors and sugar donors are listed based on the structural scaffolds shown in part b and c. The red, blue, light blue, green, and black columns represent conversion rates of O-, mono-C-, di-C-, S-, and N-glycosylation reactions, respectively. b The structures of the aglycon acceptors and corresponding glycosylated products prepared from the scale-up enzymatic reactions. c The structures of sugar donors used in this work. The sugar donor promiscuity was tested with phloretin (17) as the acceptor. ND not detected. Experiments were performed at pH 7.4 and 30ā°C for up to 12āh. Conversion rates represent meanā±āSD of three independent replicates (nā=ā3).
From the first-pass analysis with HPLC-UV/ESIMS, AbCGT exhibited O-glycosylation activity to acyl phenols with one hydroxyl (1ā3), C-glycosylation activity to acyl phloroglucinol with three hydroxyls (11) and mixed O-/C-glycosylation activities to acyl resorcinols with two hydroxyls (5ā7) (Fig.Ā 1 and Supplementary Figs.Ā 9ā14). Similarly, this change in catalytic reaction patterns depending on the hydroxyl groups was also observed with acceptors 8ā10, 12ā19, and 23 (Supplementary Figs.Ā 15ā24). As mentioned above, the formation of the CāC-glycosidic bond is positively correlated with the electronic density of the aromatic carbon, which can be increased by the number of substituted hydroxyl(s). Therefore, the O-glycosylation activity (acyl phenols) was changed to C-glycosylation activity (acyl phloroglucinols) through the stage of O-/C-glycosylation activity (acyl resorcinols). For 2,4-dihydroxyacetophenone (6) with two phenolic hydroxyl groups, AbCGT showed both C- and O-glycosylation activity simultaneously. However, when the H-6 of 6 was replaced by a methyl group (electron-donating group) (9), AbCGT showed specific C-glycosylation activity with the high conversion rate. The methyl substitution may increase the electron density of the aromatic ring for nucleophilic attack in C-glycosidic bond formation. Logically, the O-methylation of phenolic hydroxyl groups (13ā15) decreased the glycosylation activity of AbCGT as the electron-donating ability of the hydroxymethyl was weaker than that of the hydroxyl. According to the structures of the substrates and isolated products, the C-glycosylation catalyzed by AbCGT needs at least two hydroxyl groups in the aromatic rings of acceptors and occurs in the ortho-position of one phenolic hydroxy. AbCGT seems to have no regioselectivity in O-glycosylation of the small molecules (1ā3). The O-glycosylation can occur in any hydroxyl groups of the aromatic ring.
Rare di-C-glycosides generally possess various bioactivities, and some have been used as clinical drugs in China, such as hydroxysafflor yellow A40. When excess sugar donors were provided, AbCGT showed high di-C-glycosylating activity with conversion rates of 100% to the acceptors (10, 11, and 16ā19) with the unit of phloroglucinol. Therefore, AbCGT is a prospective enzymatic tool for the synthesis of rare bioactive di-C-glycosides.
In addition, according to the LCMS data, AbCGT catalyzes 7-hydroxy-4-methylcoumarin (24), 7-mercapto-4-methylcoumarin (25) and 7-amino-4-methylcoumarin (26) to generate their corresponding O-, S-, and N-glucosides, indicating that AbCGT is a versatile GT with C-, O-, N-, and S-glycosylation activities (Supplementary Figs.Ā 25ā27). Besides the aglycon promiscuity, AbCGT also possesses a broad substrate spectrum with diverse sugar donors. When phloretin (17) was used as an acceptor, AbCGT accepted five sugar donors, including UDP-α-D-glucose, TDP-α-D-glucose, UDP-α-D-galactose, UDP-α-D-xylose, and UDP-α-D-glucuronic acid, to generate the corresponding C-glycosides, respectively (Supplementary Figs.Ā 28ā32 and Supplementary TableĀ 1). Therefore, besides the C-glucosides (17a and 17aa), the C-galactoside (17b) and C-xyloside (17c) were also prepared and their structures were identified as phloretin-3ā²-C-β-D-galactoside (17b) and phloretin-3ā²-C-β-D-xyloside (17c) by MS, 1H and 13C NMR, respectively. Biological assays revealed that C-glucoside 17a with only one glucosyl moiety exhibited higher SGLT2 inhibitory activity than those of other C-glycosides (17aa, 17b, and 17c) (Supplementary TableĀ 2). Therefore, UDP-Glc is selected as the sugar donor in the following application of AbCGT in the synthesis of C-glycosides.
Acyl groups, as electron-donating and potential positioning groups, are necessary for the substrates of known plant CGTs15,23,24,25,26,27,28,29,30,31,32. Among the substrates of acyl resorcinols, which should be both C- and O-glycosylated according to the above catalytic characteristics of AbCGT, there was an exception that 3,5-dihydroxyacetophenone (4) was only O-glycosylated (Supplementary Fig.Ā 33). In comparison to other acyl resorcinols (5ā9), the acyl group of 3,5-dihydroxyacetophenone (4) is not in ortho-position to the hydroxyl groups. Therefore, we initially believed that the C-glycosylation activity of AbCGT was also dependent on the acyl group, similar to other known CGTs. However, to our surprise, AbCGT exhibited catalytic activity when phloroglucinol derivatives (20ā22) without acyl groups attached to the aromatic rings were used as substrates (Supplementary Figs.Ā 34ā36). To further confirm the enzymatic reactions, the unnatural products of aglycons 20 and 21 were isolated from scaled-up reactions and characterized as 3-C-β-D-glucosides (20a and 21a) by MS, 1H and 13C NMR. AbCGT exhibited 100% conversion rates to 20 and 21, suggesting its substrate tolerance to unusual substrates.
Design and application of AbCGT in chemoenzymatic synthesis of potential SGLT2 inhibitors
The O-glycoside phlorizin is a SGLT1/2 inhibitor41. However, due to the poor selectivity for SGLT2, metabolic instability and substantial side effects, phlorizin failed in the clinical treatment of diabetes. Based on the prototypic phlorizin, various SGLT2 inhibitors have been developed by changing the types of glycosidic bonds and replacing the acyl groups with saturated carbon atoms42. In addition, biological assays revealed that products 20a and 21a without acyl groups displayed inhibitory activity toward SGLT2 (TableĀ 1). Therefore, to explore the catalytic activity of AbCGT in recognizing unusual substrates without acyl groups and to further apply AbCGT in antidiabetic drug discovery, diverse aglycons of potential SGLT2 inhibitors were designed (Fig.Ā 2a). Firstly, three commercially available C-/N-substituted resorcinols (27ā29) were prepared. Then we designed and chemically synthesized 25 potential substrates (30ā54). Functional groups, including aliphatic carbon chains (30ā33), saturated carbon cycles (34ā37), carbon chain-substituted benzenes (38ā41), and N- and S-heterocycles (42ā45), were chemically attached to phloroglucinol by a one-step reaction43. Furthermore, various derivatives (46ā54) of 1-benzylbenzene-2,4,6-triol (38) and 1-phenethylbenzene-2,4,6-triol (39) were chemically synthesized to investigate the effects of different substituents in the B ring on the pharmacological activity. The introduction of sugar moieties to these acceptors was performed with purified AbCGT and whole-cell system, respectively (Fig.Ā 2b, c).
a The redesigned structures of the substrates and corresponding glycosylated products. b Conversion rate of each substrate with purified AbCGT. c Conversion rate of each substrate with whole-cell system. *Not reported C-glycosides previously. ND not detected. Experiments were performed at 30ā°C in M9 medium for 12āh with shaking (200ārpm), and the conversion rates represent meanā±āSD of three independent replicates (nā=ā3).
The substrate tolerance and catalytic efficiency of purified AbCGT were emerged by the capability of recognizing all types of chemically synthesized unusual substrates, and 85% (24 of 28) of the substrates were completely C-glycosylated to their corresponding C-glycosides (Fig.Ā 2b and Supplementary Figs.Ā 37ā55). For the acceptors with a basic unit of phloroglucinol, AbCGT exhibited 100% conversion rates, except for aglycon 45 with a bulky substituent group. Reasonably, AbCGT can also C-glycosylate all derivatives (46ā54) of 1-benzylbenzene-2,4,6-triol (38) and 1-phenethylbenzene-2,4,6-triol (39) with 100% conversion rates. To the best of our knowledge, no known CGT has such high efficiency to acceptors without acyl groups. Subsequently, scaled-up reactions were performed, and 19 enzymatic C-glycosides (20a, 21a, 28a, 30a, 38a, 39a, 42aā44a, 46aā54a, and 47aa) were prepared. The structures, including the position of the introduced sugar moiety and the anomeric stereochemistry, were elucidated by MS, 1H and 13C NMR spectroscopic analyses. All C-glycosylation sites of the enzymatic products were regioselective at C-3/3ā². Large anomeric proton-coupling constants indicated the stereo-specific formation of β-anomers and an inversion mechanism for AbCGT, as UDP-Glc possesses an α-anomer (Supplementary TableĀ 3). Among the 19 prepared C-glycosides, 18 (20a, 21a, 28a, 38a, 39a, 42aā44a, 46aā54a, and 47aa) were not reported previously. Therefore, potential SGLT2 inhibitors were redesigned and efficiently synthesized by applying AbCGT in a chemoenzymatic method.
Evaluation of the SGLT2 inhibitory activity of C-glycosides in vitro
The SGLT2 inhibitory activity of the 16 designed C-glycosides (20a, 21a, 38a, 39a, 42aā44a, and 46aā54a) and their 16 corresponding aglycons was firstly evaluated at the cellular level. All C-glycosides exhibited inhibitory activity against SGLT2 (Fig.Ā 3). Furthermore, five (44a, 46a, 47a, 49a, and 50a) of these designed C-glycosides showed inhibitory rates >96% at a concentration of 10ā5 M, and their IC50 values against SGLT2 were 1.74ā4.89āĆā10ā7āM (TableĀ 1). Notably, the five C-glycosides also exhibited high selectivity in inhibiting SGLT2, as their IC50 values against SGLT1 were much higher (ā„10ā5āM) than those against SGLT2 (~10ā7āM). It is worth mentioning that all the aglycons exhibited no inhibitory activity or much lower inhibitory activity than the corresponding C-glycosides (Fig.Ā 3). Therefore, the C-linked sugar moiety not only generated structurally different compounds but also afforded aglycons with SGLT2 inhibitory activity and selectivity. The C-glycosylated products (38a and 46aā50a) of 1-benzylbenzene-2,4,6-triols showed higher SGLT2 inhibitory activity than those (39a and 51aā54a) of 1-phenethylbenzene-2,4,6-triols. These findings demonstrated that the benzyl group is a better substituent than the ethylbenzene group for bioactivity. Different substituents in the aromatic ring of the benzyl group (B ring) also resulted in different degrees of pharmacological activity. Replacing the B ring of 38a with a thiophene group decreased the SGLT2 inhibitory activity, while halogen, phenyl, and alkyl group substitution enhanced the inhibitory activity (Fig.Ā 3). The methyl substitution in the B ring (46a, IC50 1.74āĆā10ā7āM) increased the inhibitory activity more than 17 times compared to that of the unsubstituted glycoside (38a, IC50 3.09āĆā10ā6āM).
The shade of red filled circles represents the SGLT2 inhibition level with aglycons and C-glycosides at 10āμM. The depth of red color represents the SGLT2 inhibition rate. Experiments were performed in triplicate.
Establishment, optimization, and utilization of a biocatalytic system for the production of SGLT2 inhibitors
The potent SGLT2 inhibitory activity of C-glycoside 46a inspired us to further evaluate in vivo activity as the candidate drug hit. Although AbCGT efficiently synthesized bioactive C-glycosides with UDP-Glc (~400 USD per gram) as the sugar donor at a laboratory scale, for large-scale preparation, the high cost of the sugar donors and the instability of the catalyst greatly limit the practical application of GTs. Thus, an engineered E. coli strain harboring its native biosynthetic pathway of UDP-Glc and pET28a-AbCGT was developed as a catalyst. As shown in Fig.Ā 2c, this engineered strain economically glycosylated most of the potential acceptors without adding expensive UDP-Glc. Therefore, a chemo-biosynthetic approach combining a simple one-step chemical method for the synthesis of aglycons and an efficient biocatalytic method for C-glycosylation was successfully established for the total synthesis of 46a.
To economically obtain 46a with yield in grams scale, the cell density of cultures and input concentration of substrate 46 of the biocatalytic system were optimized. At a cell density of OD600ā=ā13.0, the whole-cell biocatalyst glycosylated 300āmgāLā1 of acceptor 46 to produce 450āmgāLā1 of C-glycoside 46a with a conversion rate of 90% (Fig.Ā 4a, b). Aglycon 46 was chemically synthesized by a one-step method with an isolated yield of 65%, and the regio- and stereoselective introduction of the C-sugar moiety was accomplished by a biocatalyst with an isolated yield of 80%. Thus, the total synthesis of SGLT2 inhibitor 46a was accomplished with the isolated yield of 52%. Generally, in vitro enzymes are often unstable as one-time catalysts. In the whole-cell system, however, the AbCGT enzyme is located inside the cell, which provides a relatively stable environment. Thus, the reactions were scaled up to 500āmL to investigate the recyclability and stability of the whole-cell biocatalyst. The biocatalyst was reused more than ten times, which means one batch of whole-cell catalyst could catalyze more than ten batches of acceptor 46 (Fig.Ā 4c). Cumulatively, 3.95āgāLā1 of target C-glycoside 46a was generated by whole cells with an average conversion rate of 75%. The recyclability and stability indicated E. coli-AbCGT is a potential biocatalyst for the synthesis of C-glycosides for industrial applications.
a The effects of cell density to the catalytic activity of E. coli-AbCGT. b The effects of substrate concentration to the conversion rate of aglycon 46 and production of C-glycoside 46a. c Recycling utilization of engineered cells with aglycon 46 (final concentration 0.3āg/L/round) for ten rounds in production of C-glycoside 46a. Reactions were performed at 25ā°C in M9 medium for 12āh with shaking (180ārpm). The total yield of 46a and the conversion rate of 46 for each round are indicated. The relative activity (a) and conversion rates (b) represent meanā±āSD of three independent replicates (nā=ā3).
Evaluation of the pharmacological activity of C-glycosides in vivo
With sufficient amount of 46a in hand, its hypoglycemic activity was evaluated in three mouse models, including normal ICR mice, alloxan-induced diabetic mellitus (DM) mice, and T2DM KKAy mice. The oral glucose-tolerance test (OGTT) showed that 100āmgākgā1 46a significantly lowered the peak value of blood glucose after glucose loading in ICR mice (Fig.Ā 5a, b). In alloxan-induced DM mice, 46a exhibited obviously hypoglycemic activity 0āāā8āh after oral administration (Fig.Ā 5c, d). In KKAy mice, the blood glucose level was decreased at 1āh after 46a administration and was maintained at a lower level for 24āh compared to the control (solvent-treated) group (Fig.Ā 5e, f). Given the SGLT2 inhibitory activity of 46a in vitro, the hypoglycemic mechanism may have involved the prevention of renal glucose reabsorption with elevated urinary glucose excretion (Fig.Ā 5g). These hypoglycemic activities suggested that C-glycoside 46a is a promising drug lead for treating diabetes.
a, b show 46a enhanced glucose tolerance in ICR mice. a Summary data for changes of blood glucose after glucose loading in control (nā=ā8). Dapagliflozin-treated (nā=ā8), and 46a-treated (nā=ā7) mice in OGTT. b Area under the blood glucose-time curve (AUC) in OGTT. c, d indicate the hypoglycemic effects of 46a (single administration) in alloxan-induced diabetic mellitus (DM) mice. c Summary data for changes in blood glucose in DM (nā=ā8). Dapagliflozin-treated (nā=ā8), and 46a-treated (nā=ā8) mice; d AUC for 0ā8āh, 8ā24āh, 24ā48āh, respectively. eā g exhibit 46a (single administration) promoted urinary glucose excretion and lowered the blood glucose in T2DM KKAy mice. e Summary data for changes of blood glucose in KKAy (nā=ā6). Dapagliflozin-treated (nā=ā7), and 46a-treated (nā=ā7) mice; (f), AUC for 0ā8āh, 8ā24āh, 24ā48āh, respectively; g glucose in urine at 4āh and 6āh (nā=ā4). The solvent was used as the blank control, and dapagliflozin was used as positive control. Data are represented as meanā±āSEM. Significance was determined using one-way ANOVA with Dunnettā²s multiple comparisons test. Exact P values (Pā<ā0.05) are provided in the corresponding figures, respectively.
Exploration of the active sites involved in the C-/O-glycosylation activity
CGTs and OGTs share similar structures but different functions30. The related key residues determining reaction patterns are still unknown. Finding the switch that controls the C- and O-glycosylation activity of GTs is significant to enzymatic glycosylation mechanism and protein engineering. As mentioned above, AbCGT showed both C- and O-glycosylation activity toward 2-hydroxynaringenin (23) (Fig.Ā 1 and Supplementary Fig.Ā 24), suggesting two different relative positions of substrates in the binding pocket, that provided an example to explore the key amino acids residues involved in C- and O-glycosylation. Homology modeling was performed with the template GgCGT (PDB entry, 6L5R.1), a di-C-glycosyltransferase32. It is reported that Val200 of UGT85H2, an (iso)flavonoid UGT from Medicago truncatula, is interacting with the 4ā²-hydroxyl group of the substrate kaempferol44,45. Mutating this neutral residue to that with charged polar side chain could greatly influence the enzyme activity45. A similar key amino acid residues Val183, which interacts with the 4ā²-hydroxyl group of 2-hydroxynaringenin (23), was found in AbCGT based on molecular docking and sequence alignment (Fig.Ā 6a, b). Therefore, saturation mutagenesis was performed on Val183 of AbCGT.
a Structure homology modeling of AbCGT. The crystal structure of GgCGT (PDB entry, 6L5R.1) was used as a template for homology modeling. 2-hydroxynaringenin was docked into the putative binding pocket of AbCGT. H19, F89, F90, F194, and V183 are labeled. b Sequence alignment of AbCGT and UGT85H2. The conversed amino acid residues (H19 and V183) in the binding pocket are labeled with red triangles. c Percent conversion of 2-hydroxynaringenin by AbCGT and AbCGT variants. d Exploring the catalytic specificity of variants V183D, V183E and V183P in C-/O-glycosylation. The red, blue columns represent conversion rates of O- and C-glycosylation reactions, respectively. Reactions were performed at pH 7.4 and 30ā°C for 12āh. The conversion rates represent meanā±āSD of three independent replicates (nā=ā3).
When Val183 was mutated to residues of Asp, Glu and Pro, respectively, the C-glycosylation selectivity was greatly improved with 2-hydroxynaringenin (23) (Fig.Ā 6c and Supplementary Figs.Ā 56ā57). Furthermore, the O-glycosylation activity of these variants was decreased with all acceptors (1ā7, 13, 14, 22, and 23) those can be O- or both C- and O-glycosylated by wild-type AbCGT (Fig.Ā 6d). The affinity of the variants for 2-hydroxynaringenin (23) is higher than that of wild-type AbCGT, while the enzyme efficiency (Kcat/Km) of the variants is lower than that of wild-type AbCGT (Supplementary Fig.Ā 58). It seems that the mutations (V183D V183E and V183P) in AbCGT affected both the catalytic selectivity and efficiency of the enzyme. The V183D and V183E variants showed relatively high specific C-glycosylation activity to 2-hydroxynaringenin (23). The potential mechanism might be the binding sites of AbCGT not tightly capturing the acceptor, thereby resulting in flexible substrate orientation. Thus, both O- and C-glycosylation sites of 2-hydroxynaringenin (23) could be positioning to UDP-Glc. The mutant residue with a charged polar side chain may form a stronger interaction with the acceptor, which further fixes the orientation of the acceptor45. However, this hypothesis cannot explain the enhanced catalytic selectivity of the V183D and V183E variants to smaller acceptors without the 4ā²-hydroxyl group. The work on the structural biology of AbCGT is in progress and will help to elucidate the catalytic mechanism. Even so, investigation of the key amino acid residues of AbCGT involved in the control of C-/O-glycosylation provides hints for the further engineering of other promiscuous CGTs and insights into the switch between the C- and O-glycosylation activities of GTs.
Discussion
In this study, we explored and applied the substrate promiscuity of AbCGT, a CGT from A. barbadensis. As a natural plant CGT, AbCGT exhibited amazing catalytic activity and promiscuity toward unnatural acceptors, especially toward those without acyl side chains which are usually good potential SGLT2 inhibitors. Chemical synthesis of this type of C-glycosides including clinical gliflozins, faces challenges involving the protection and deprotection of hydroxyls as well as poor stereoselectivity and complex synthetic routes. To overcome these obstacles, AbCGT was developed as a tool for stereoselective C-glycosylation with high efficiency. Combining enzymatic C-glycosylation with the chemical approach for synthesizing aglycons, bioactive C-glycosides were totally synthesized by two simple steps, and a chemo-biocatalysis route for the synthesis of C-glycosides was constructed. In this route, common glucose was used as the sugar donor for the second step, and whole-cell biocatalyst can be reused for times making the cost greatly decreasedĀ and lending potential to this route for industrial application. Based on this route, diverse bioactive C-glycosides were designed and chemo-enzymatically synthesized. This type of C-glycosides possessed SGLT2 inhibitory activity and showed a significant therapeutic effect on diabetic mice. In addition, some SGLT2 inhibitors such as 44a even exhibited anti-influenza A (H1N1) activity (IC50 ~10ā6āM, Supplementary TableĀ 4), which suggested the wide application of AbCGT in the synthesis of bioactive glycosides other than SGLT2 inhibitors. In this study, the natural AbCGT breaks through the dependency of the acyl group in CGT catalyzing C-glycosylation, and diverse SGLT2 inhibitors were successfully chem-biosynthesized.
AbCGT is a versatile GT that simultaneously possesses C-, O-, S-, and N-glycosylation activities. Increasing numbers of CGTs have been found to possess multifunctional characteristics in forming glycosidic bonds. For examples, UGT708A6 from Zea mays exhibits both C- and O-glycosylation activities24, MiCGT from M. indica showed C-, O-, and N-glycosylation activities15, and GgCGT from G. glabra generates C-, O-, N-, and S-glycosides32. The multifunctional characteristics hint the GTs catalyzing C-, O-, N-, and S-glycosylation is determined by substrate structures and enzyme-binding sites. Furthermore, the recently reported crystal structures of TcCGT1 and GgCGT, which are similar to the structures of OGTs, also supports this hypothesis30,32. The catalytic types may be determined by several key active sites. For multifunctional AbCGT, one possible binding site involved in C- and O-glycosylation was identified, and the C-glycosylation specificity of AbCGT was improved by rational design. In addition, progress in the engineering of AbCGT to improve the C-glycosylation specificity on aromatic aglycons with two hydroxyls was achieved in this work, and the door of enhancing the C-glycosylation activity of CGTs was opened. These findings not only provide a way to artificially manipulate the substrate promiscuity of CGTs but also shed light on the mechanisms of C- and O-glycosylation. Further studies on screening or engineering CGTs to recognize aromatic aglycons with only one hydroxyl or even no hydroxyl will be very challenging but significant for CGTs to progress from the laboratory to industry for the synthesis of clinical C-glycoside drugs and biological C-glycosides.
Methods
General
Chemicals and reagents including commercially available aglycons (1ā15, 17, 19, and 23ā29) and sugar donors were purchased from Sigma-Aldrich (St. Louis, USA), J&K Scientific Ltd (Beijing, China), InnoChem Science & Technology Co., Ltd (Beijing, China), BioBioPha (Kunming, China), Beijing BG Biotech Co., Ltd (Beijing, China), and Beijing Adhoc International Technologies Co., Ltd (Beijing, China). Aglycons 16, 18, and 30 were chemically synthesized in our previous work15, and aglycons 20ā22 and 31ā54 were synthesized in this work. KOD-Plus neo DNA polymerase was purchased from Toyobo Biotech Co., Ltd. (Shanghai, China). Primers were synthesized, and DNA was sequenced by Sangon Biotech Co., Ltd. (Shanghai, China). Restriction enzymes were purchased from Takara Biotechnology Co., Ltd. (Dalian, China). The analyses of substrate specificity and the determinations of the conversion rates were performed on an Agilent 1260 series HPLC system (Agilent Technologies, Germany) coupled with an LCQ Fleet ion trap mass spectrometer (Thermo Electron Corp., USA) equipped with an electrospray ionization (ESI) source. For quantification, three parallel assays were routinely performed. HRESIMS data were measured with an Agilent Technologies 6520 Accurate Mass Q-TOF LC/MS spectrometer. The 1H and 13C NMR data of prepared aglycons and C-glycosylated products were obtained by Bruker AVIIIHD-600 spectrometer.
Plant materials
Fresh leaves of Aloe barbadensis were collected, immediately frozen with liquid nitrogen, and stored at ā80ā°C until use.
Molecular cloning of AbGTs
The total RNA of A. barbadensis leaves was prepared using the E.Z.N.A.TM Plant RNA Kit (Omega Bio-Tek, USA) and reverse-transcribed to cDNA with SmartScribe reverse transcriptase (Clontech, USA) following the manufacturerās instructions. Full-length cDNAs of the AbGTs were amplified by PCR using gene-specific primer pairs.
Expression and purification of AbCGT
The AbCGT gene was amplified using the AbCGT-F and AbCGT-R primer pair (Supplementary TableĀ 5). After verification of the sequence, the coding region of AbCGT was then amplified using the AbCGT-FpET28a and AbCGT-RpET28a primer pair and inserted into pET28a according to the ClonExpressĀ® II One Step Cloning Kit (Vazyme Biotech, China). The recombinant plasmid was then transformed into Transetta (DE3) E. coli (TransGen Biotech, China) for heterologous expression. Luria-Bertani (LB) medium (400āmL) containing 50āμgāmLā1 kanamycin and 34āμgāmLā1 chloromycetin was inoculated with 1āmL of an overnight culture of recombinant E. coli cells. Cells were grown at 37ā°C with shaking (200ārpm) until OD600 nm reached 0.4ā0.6. The recombinant N-terminal His6-AbCGT expression was achieved by induction with 0.1āmM Isopropyl β-D-thiogalactoside (IPTG) for 12āh at 18ā°C with shaking (200ārpm). Cells were harvested by centrifugation at 8000āĆāg for 10āmin at 4ā°C. Pellets were resuspended in 40āmL of chilled binding buffer (20āmM phosphate buffer, 0.5āM NaCl, 20āmM imidazole, pH 7.4) containing 1āmM phenylmethylsulfonyl fluoride (PMSF). Cells were disrupted by sonication in an ice bath, and the cell debris was removed by centrifugation at 10,000āĆāg and 4ā°C for 30āmin. The soluble fraction was passed through a 0.22-μm syringe filter unit and applied to 1āmL Ni-NTA resin (GE, USA) loaded in a column, which was pre-equilibrated with binding buffer. The resin was subsequently washed with 10āmL binding buffer under a flow rate of 1āmLāminā1 at 4ā°C. After washing with 5āmL washing buffer (20āmM phosphate buffer, 0.5āM NaCl, and 100āmM imidazole; pH 7.4), elution was performed with 5āmL of elution buffer (20āmM phosphate buffer, 0.5āM NaCl, and 200āmM imidazole; pH 7.4). Finally, the recombinant protein was desalted in the desalting buffer (50āmM Tris-HCl buffer, 50āmM NaCl, 1āmM DTT, and 1% glycerol; pH 7.4) by concentration and dilution using an Amicon Ultra-30K (Millipore, USA). Protein purity was confirmed by SDS-PAGE, and protein concentration was determined by the Protein Quantitative Kit (Bradford) (TransGen Biotech, China). The approximate protein yield of AbCGT (50.8ākDa) was 6.9āmgāLā1.
Glycosyltransferase activity assays
The reaction mixture contained 0.5āmM sugar donors (1.0āmM for di-glycosylation), 0.25āmM aglycons, 50āmM Tris-HCl (pH 7.4), and 50ā100āμg of purified AbCGT in a final volume of 100āμL. Due to the instability of some acceptors in the buffer of pH 11.0, the reactions were performed under the physiological conditions (pH 7.4, 30ā°C) to compare the catalytic activities of AbGT27 with different substrates under the same conditions. The reactions were performed at 30ā°C for up to 12āh and terminated by the addition of 200āμL of ice-cold methanol. Subsequently, samples were centrifuged at 15,000āĆāg for 30āmin to collect the supernatant, and aliquots were analyzed by HPLC-UV/ESIMS. HPLC analysis was performed on a Shiseido CAPCELL PAK C18 MG III column (250āmmāĆā4.6āmm I.D., 5āμm, Shiseido Co., Ltd., Japan) at a flow rate of 1āmLāminā1. The mobile phase was a gradient elution of solvents A (0.1% formic acid aqueous solution) and B (methanol). The gradient programs were used for the analysis of the reactions (Supplementary TableĀ 6). The conversion rates of the enzyme reactions were calculated from peak areas of glycosylated products and substrates as analyzed by HPLC. To facilitate inferential statistical analysis, three parallel assays were routinely performed; the meansā±āSD from triplicate analyses are reported here.
Effects of pH, temperature, and divalent metal ions in enzymatic reactions
To determine the optimal pH, the enzymatic reaction was performed at 30ā°C in various reaction buffers with pH values in the ranges of 5.0ā6.0 (citric acidāsodium citrate buffer), 6.0ā7.0 (Na2HPO4āNaH2PO4 buffer), 7.0ā9.0 (Tris-HCl buffer) and 9.0ā12.0 (Na2CO3āNaHCO3 buffer). To assay for the optimal reaction temperature, the reaction mixtures were incubated at pH 11.0 and different temperatures (15ā60ā°C). To test the necessity of divalent metal ions for AbCGT, BaCl2, CaCl2, CoCl2, CuCl2, FeCl2, MgCl2, MnCl2, ZnCl2, and EDTA were used individually at a final concentration of 5āmM. The reactions were performed with phloretin (17, 0.25āmM) as the acceptor and UDP-Glc (1.0āmM) as the sugar donor and 5āμg of purified AbCGT in a final volume of 100āμL for 10āmin. For quantification, three parallel assays were routinely carried out.
Determination of kinetic parameters of AbCGT
Assays were performed in a final volume of 100āμL 50āmM Tris-HCl (pH 7.4) at 30ā°C, and contained purified recombinant AbCGT (1.5āμg) and saturating UDP-Glc (5āmM) while varying phloracetophene (11) and phloretin (17) concentrations (5ā500āμM), respectively. The reactions terminated by the addition of 200āμL of ice-cold methanol. Subsequently, samples were centrifuged at 15,000āĆāg for 30āmin to collect the supernatant, and aliquots were analyzed by HPLC as described above. All experiments were performed in triplicate. The value of Km was calculated with the method of LineweaverāBurk plot.
Preparation of potential acceptors of AbCGT by chemical synthesis
In brief, a solution of phloroglucinol (2.0āmmol) and K2CO3 or Cs2CO3 (1.0āmmol) in DMF (1.2āmL) was stirred vigorously at 0ā°C for 10āmin. Various bromine substituents (1.0āmmol) in DMF (1.2āmL) were added in a dropwise manner. The reaction mixture was stirred at 0ā°C for 12āh and then poured into water (6āmL) and extracted with EtOAc (3āmLāĆā2). The combined organic phases were washed with brine (3āmL) and water (3āmL) successively. The organic layer was then dried (Na2SO4) and concentrated, and the residue was purified by semi-HPLC. The detailed experimental process and yields of the prepared aglycons are shown in theĀ Supplementary Information.
Scale-up enzymatic reactions
Generally, 10ā30āμmol of aglycon was dissolved in 1āmL of DMSO and diluted with buffer solution (50āmM Tris HCl, pH 7.4; total volume 25āmL). UDP-Glc (20ā60āμmol) was added along with 20āmg of purified AbCGT. Due to the high cost of UDP-Glc, excess purified AbCGT was used to make sure the full use of UDP-Glc in vitro enzymatic reactions. The reactions were performed at 30ā°C for 12āh followed by extraction five times with 5āĆā50āmL of ethyl acetate. The organic phase was pooled and evaporated to dryness under reduced pressure. For phloracetophene (11) and phloretin (17), the reaction mixtures were directly evaporated to dryness under reduced pressure. The residue was then dissolved in 1.5āmL of methanol and purified by reverse-phase semi-preparative HPLC. The obtained products were analyzed by MS and NMR.
Glycosylation assays catalyzed by whole-cell biocatalyst E. coli-AbCGT
The induced cells of E. coli-AbCGT were harvested by centrifugation at 3000āĆāg for 5āmin at 4ā°C and washed with ddH2O. Pellets were resuspended by M9 minimal medium containing 2% glucose to appropriate cell density (OD600ā=ā3.0). Prepared whole-cell biocatalyst of E. coli-AbCGT (500āμL) was then added to 2-mL tubes containing various acceptors at a concentration of 0.25āmM. The mixture was incubated at 30ā°C for 12āh with shaking (200ārpm). The reactions were analyzed using GT activity assays as described above. For quantification, three parallel assays were routinely carried out.
Optimization of cell density and substrate concentration in whole-cell biocatalysis
To study the optimal cell density, the reaction was performed in various cell densities with OD600 values in the range of 1.0ā15.0 and a substrate concentration of 150āmgāLā1. To assay for the optimal substrate concentration, the reactions were performed with 50ā500āmgāLā1 of aglycon 46 at OD600ā=ā13.0. For quantification, three parallel assays were routinely carried out.
Investigation of the recyclability and stability of the whole-cell biocatalyst
For the first round of catalytic reaction, 500āmL of engineered cell biocatalyst of E. coli-AbCGT (OD600ā=ā13.0) was mixed with 150āmg of 46 and incubated at 25ā°C for 12āh with shaking (180ārpm). Cells were then collected by centrifugation at 3000āĆāg for 5āmin and washed with M9 minimal medium. Pellets were resuspended with M9 minimal medium and reused in the second round as described in the first round. In total, ten rounds of reactions were performed to evaluate the recyclability and stability of the engineered cell biocatalyst in consecutively catalyzing reactions. The concentrations of 46 and 46a were obtained according to their corresponding standard curves, and three parallel assays were routinely carried out.
Inhibitory activity and selectivity assay against SGLT2 for C-glycosides in vitro
Human full-length SGLT2 cDNA (NM_003041.3) was synthesized by Invitrogen Co., Ltd (Shanghai, China) and amplified with a forward primer (5ā²-CCGCTCGAGGCCACCATGGACAGTAGCACCTGGAGC-3ā²) and reverse primer (5ā²-CCGGAATTCTCAGGCAAAATATGCATGGCAAAAG-3ā²). The PCR products were cloned into the retrovirus vector pMSCVpuro at the Xho I and EcoR I sites generating pMSCVpuro-SGLT2, and the sequences were confirmed by Sanger sequencing. To produce retrovirus, a mixture containing the plasmid pMSCVpuro-SGLT2 and the packing plasmids (Gag-Pol and pVSV-G) were transfected into 293T cells with Neofect (Neobiotech Co Ltd, Korea). Culture medium was changed after 16āh post transfection. The supernatant containing retrovirus was collected after 48āh post transfection and filtered through a 0.45-μm PVDF filter membrane. For generating the cells stably expressing SGLT2, HEK293 cells were transduced with retrovirus. After 48āh post infection, puromycin (2āμgāmLā1) was used to screen the positive clone cell lines for 2 days, and the SGLT2 expression status was validated by western blot analysis46. To evaluate the selectivity of compounds, pMSCVpuro-SGLT1 stably transfected HEK293 cell line was also constructed in the same way. The full-length SGLT1 cDNA was obtained by PCR with forward primer: 5ā²-CCGCTCGAGGCCACCATGGACAGTAGCACCTGGAGC-3ā² and reverse primer: 5ā²-CCGGAATTCTCAGGCAAAATATGCATGGCAAAAG-3ā².
D-glucose transport was measured by performing 100āµM 1-[N-(7-Nitrobenz-2-oxa-1,3-diazol-4-yl)amino]-1-deoxy-D-glucose (1-NBDG) uptakes in cells over-expressing SGLT1 or SGLT2. Cells were seeded in 24-well plates (10,0000 cells/well), grown to 90ā95% confluency, and then the transport assays were performed. Cells were incubated in transport Na+ buffer containing 120āmM NaCl, 4.7āmM KCl, 1.2āmM MgCl2, 2.2āmM CaCl2, 10āmM HEPES, pH 7.4 with various concentrations of C-glycosides. For nonspecific transport determination, cells were incubated in Na+-free buffer containing 140āmM choline chloride, 4.7āmM KCl, 1.2āmM MgCl2, 2.2āmM CaCl2, 10āmM HEPES, pH 7.4. After 4āh-incubation at 37ā°C, cells were washed twice with ice-cold stop buffer (Na+-free buffer containing 0.5āmM phlorizin), incubated at room temperature in cell lysis buffer (0.1āmM NaOH), and then neutralized with 0.1āmM HCl. Fluorescence intensity at 485/535ānm was then measured to determine the inhibitory activity of SGLT2 for C-glycosides. All experiments were repeated in triplicate. The IC50 values were determined using nonlinear regression with Prism 5 software.
Evaluation of the hypoglycemic effect of 46a in vivo
The hypoglycemic effect of 46a was firstly evaluated by oral glucose-tolerance test (OGTT) in mice with normal blood glucose levels. Male Institute of Cancer Research (ICR) mice weighing 24ā26āg were obtained from Beijing Vital River Laboratory Animal Technology Co., Ltd (Beijing, China) and divided into three groups (nā=ā7ā8). Briefly, ICR mice were orally administered vehicle or test compounds (5āmgākgā1 dapagliflozin and 100āmgākgā1 46a). After 2āh of fasting, blood samples were collected from the tails for determination of baseline values of blood glucose (0āmin). The mice were then orally administered glucose (2āgākgā1 BW), and blood samples were collected at 15, 30, 60, and 120āmin. A glucose oxidase (GOD) assay was used for blood glucose-level measurement. The values of area under the glucose-time curve (AUC) were calculated.
Type 1 diabetes mellitus (T1DM) mice were established by intravenous injection of alloxan tetrahydrate (68āmgākgā1 BW). Mice with serum glucose >200āmgādLā1 72āh after alloxan injection were considered diabetic and randomly divided into three groups (nā=ā8) as follows: the alloxan-induced DM group, positive drug group (dapagliflozin, 5āmgākgā1 BW), and 46a treatment group (100āmgākgā1 BW). Nonfasting blood glucose was kinetically monitored at 1, 2, 4, 6, 8, 24, and 48āh after treatment with 46a. AUC values during 0ā8āh, 8ā24āh, and 24ā48āh were calculated.
To evaluate the hypoglycemic effect of 46a in type 2 diabetes mellitus (T2DM), female KKAy mice weighing 42ā48āg were obtained from the Animal Center of the Institute of Laboratory Animal Sciences, Chinese Academy of Medical Sciences and Peking Union Medical College. The model mice were divided into the following three groups (nā=ā6ā7): model control KKAy, dapagliflozin (5āmgākgā1 BW), and 46a (100āmgākgā1 BW). Blood samples were collected at 1, 2, 4, 6, 8, 24, and 48āh after treatment. AUC values during 0ā8āh, 8ā24āh, and 24ā48āh were calculated. The glucose levels in urine 4āh and 6āh after administration were also determined using a Biosen C-Line Glucose analyzer (EKF Diagnostics, Germany).
All mice were housed in a temperature- and humidity-controlled room and allowed free access to standard chow diet for 1 week before the study. All experiments were complied with all relevant ethical regulations and approved by the Institutional Animal Care and Use Committee of the Institute of Materia Medica, Chinese Academy of Medical Sciences.
Homology modeling and molecular docking
To build the 3D structure of AbCGT binding with 2-hydroxynaringenin (23), homology modeling was performed using SWISS-MODEL (https://swissmodel.expasy.org/) based on the GgCGT crystal structure (PDB code 6L5R.1)32. Molecular docking between AbCGT and 2-hydroxynaringenin was investigated using AutoDockTools-1.5.6. All docking calculations were restricted to the predicted binding pocket by defining the active site with residue His19, F89, F90, and F194.
Site-saturation mutagenesis of AbCGT
Site-saturation mutagenesis of AbCGT at residue 183 was performed by PCR using pET28a-AbCGT as a template and the corresponding degenerate primers, which are listed in Supplementary TableĀ 5. PCR products were purified and transformed into Trans1-T1 (DE3) E. coli. The sequences of pET28a-AbCGT mutants were confirmed by sequencing, and transformed into Transetta (DE3) E. coli for heterologous expression following protein purification.
Reporting summary
Further information on research design is available in theĀ Nature Research Reporting Summary linked to this article.
Data availability
The authors declare that all data supporting the findings of this study are available within the paper and itsĀ Supplementary Information Files, including experimental details, characterization data, and 1H and 13C NMR spectra (Supplementary Figs.Ā 59ā166) of all prepared compounds. All the data are available from the authors upon reasonable request. The gene sequence of the AbCGT is deposited in GenBank, accession no. MN747045.Ā Source data are provided with this paper.
References
Thibodeaux, C. J., MelanƧon, C. E. & Liu, H. Natural-product sugar biosynthesis and enzymatic glycodiversification. Angew. Chem., Int. Ed. 47, 9814ā9859 (2008).
Gantt, R. W., Peltier-Pain, P. & Thorson, J. S. Enzymatic methods for glyco(diversification/randomization) of drugs and small molecules. Nat. Prod. Rep. 28, 1811ā1853 (2011).
Cobucci-Ponzano, B. & Moracci, M. Glycosynthases as tools for the production of glycan analogs of natural products. Nat. Prod. Rep. 29, 697ā709 (2012).
Thibodeaux, C. J., MelanƧon, C. E. & Liu, H. Unusual sugar biosynthesis and natural product glycodiversification. Nature 446, 1008ā1016 (2007).
HƤrle, J. et al. Rational design of an aryl-C-glycoside catalyst from a natural product O-glycosyltransferase. Chem. Biol. 18, 520ā530 (2011).
Bililign, T., Griffith, B. R. & Thorson, J. S. Structure, activity, synthesis and biosynthesis of aryl-C-glycosides. Nat. Prod. Rep. 22, 742ā760 (2005).
Lalitha, K., Muthusamy, K., Prasad, Y. S., Vemula, P. K. & Nagarajan, S. Recent developments in β-C-glycosides: synthesis and applications. Carbohydr. Res. 402, 158ā171 (2015).
Jesus, A. R. et al. Targeting type 2 diabetes with C-glucosyl dihydrochalcones as selective sodium glucose co-transporter 2 (SGLT2) inhibitors: synthesis and biological evaluation. J. Med. Chem. 60, 568ā579 (2017).
Han, S. et al. Dapagliflozin, a selective SGLT2 inhibitor, improves glucose homeostasis in normal and diabetic rats. Diabetes 57, 1723ā1729 (2008).
Yang, Y. & Yu, B. Recent advances in the chemical synthesis of C-glycosides. Chem. Rev. 117, 12281ā12356 (2017).
Lemaire, S. et al. Stereoselective C-glycosylation reactions with arylzinc reagents. Org. Lett. 14, 1480ā1483 (2012).
Satoh, H. & Manabe, S. Design of chemical glycosyl donors: does changing ring conformation influence selectivity/reactivity? Chem. Soc. Rev. 42, 4297ā4309 (2013).
Wang, Q. et al. Palladium-catalysed C-H glycosylation for synthesis of C-aryl glycosides. Nat. Catal. 2, 793ā800 (2019).
Hoffmeister, D., DrƤger, G., Ichinose, K., Rohr, J. & Bechthold, A. The C-glycosyltransferase UrdGT2 is unselective toward D-and L-configured nucleotide-bound rhodinoses. J. Am. Chem. Soc. 125, 4678ā4679 (2003).
Chen, D. et al. Probing the catalytic promiscuity of a regioā and stereospecific Cāglycosyltransferase from Mangifera indica. Angew. Chem. Int. Ed. 54, 12678ā12682 (2015).
Luzhetskyy, A., Mendez, C., Salas, J. A. & Bechthold, A. Glycosyltransferases, important tools for drug design. Curr. Trends Med. Chem. 8, 680ā709 (2008).
Fischbach, M. A., Lin, H., Liu, D. R. & Walsh, C. T. In vitro characterization of IroB, a pathogen-associated C-glycosyltransferase. Proc. Natl Acad. Sci. USA 102, 571ā576 (2005).
Dürr, C. et al. The glycosyltransferase UrdGT2 catalyzes both Cā and Oāglycosidic sugar transfers. Angew. Chem. Int. Ed. 43, 2962ā2965 (2004).
Liu, T., Kharel, M. K., Fischer, C., McCormick, A. & Rohr, J. Inactivation of gilGT, encoding a C-glycosyltransferase, and gilOIII, encoding a P450 enzyme, allows the details of the late biosynthetic pathway to gilvocarcin V to be delineated. ChemBioChem 7, 1070ā1077 (2006).
Tam, H. K. et al. Structural characterization of Oā and Cāglycosylating variants of the landomycin glycosyltransferase LanGT2. Angew. Chem. Int. Ed. 54, 2811ā2815 (2015).
Li, L., Wang, P. & Tang, Y. C-glycosylation of anhydrotetracycline scaffold with SsfS6 from the SF2575 biosynthetic pathway. J. Antibiot. 67, 65ā70 (2014).
GandiaāHerrero, F. et al. Detoxification of the explosive 2, 4, 6-trinitrotoluene in Arabidopsis: discovery of bifunctional O- and C-glucosyltransferases. Plant J. 56, 963ā974 (2008).
Brazier-Hicks, M. et al. The C-glycosylation of flavonoids in cereals. J. Biol. Chem. 284, 17926ā17934 (2009).
Ferreyra, M. L. F. et al. Identification of a bifunctional maize C- and O-glucosyltransferase. J. Biol. Chem. 288, 31678ā31688 (2013).
Nagatomo, Y. et al. Purification, molecular cloning and functional characterization of flavonoid C-glucosyltransferases from Fagopyrum esculentum M. (buckwheat) cotyledon. Plant J. 80, 437ā448 (2014).
Hirade, Y. et al. Identification and functional analysis of 2-hydroxyflavanone C-glucosyltransferase in soybean (Glycine max). FEBS Lett. 589, 1778ā1786 (2015).
Sasaki, N. et al. Identification of the glucosyltransferase that mediates direct flavone C-glucosylation in Gentiana triflora. FEBS Lett. 589, 182ā187 (2015).
Wang, X., Li, C., Zhou, C., Li, J. & Zhang, Y. Molecular characterization of the C-glucosylation for puerarin biosynthesis in Pueraria lobata. Plant J. 90, 535ā546 (2017).
Ito, T., Fujimoto, S., Suito, F., Shimosaka, M. & Taguchi, G. C-Glycosyltransferases catalyzing the formation of di-C-glucosyl flavonoids in citrus plants. Plant J. 91, 187ā198 (2017).
He, J. et al. Molecular characterization and structural basis of a promiscuous C-glycosyltransferase from Trollius chinensis. Angew. Chem. Int. Ed. 58, 11513ā11520 (2019).
Chen, D. et al. Probing and engineering key residues for bis-C-glycosylation and promiscuity of a C-glycosyltransferase. ACS Catal. 8, 4917ā4927 (2018).
Zhang, M. et al. Functional characterization and structural basis of an efficient di-C-glycosyltransferase from Glycyrrhiza glabra. J. Am. Chem. Soc. 142, 3506ā3512 (2020).
Hutter, J. A. et al. Antiinflammatory C-glucosyl chromone from Aloe barbadensis. J. Nat. Prod. 59, 541ā543 (1996).
Surjushe, A., Vasani, R. & Saple, D. G. Aloe vera: a short review. Indian J. Dermatol. 53, 163ā166 (2008).
Irshad, S., Butt, M. & Younus, H. In-vitro antibacterial activity of Aloe barbadensis Miller (Aloe vera). Int. Res. J. Pharm. 1, 59ā64 (2011).
Kazuno, S., Yanagida, M., Shindo, N. & Murayama, K. Mass spectrometric identification and quantification of glycosyl flavonoids, including dihydrochalcones with neutral loss scan mode. Anal. Biochem. 347, 182ā192 (2005).
Ferreres, F., Silva, B. M., Andrade, P. B., Seabra, R. M. & Ferreira, M. A. Approach to the study of C-glycosyl flavones by ion trap HPLC-PAD-ESI/MS/MS: application to seeds of quince (Cydonia oblonga). Phytochem. Anal. 14, 352ā359 (2003).
Hong, Y. J., Tomas-Barberan, F. A., Kader, A. A. & Mitchell, A. E. The flavonoid glycosides and procyanidin composition of Deglet Noor dates (Phoenix dactylifera). J. Agric. Food Chem. 54, 2405ā2411 (2006).
Gutmann, A. & Nidetzky, B. Switching between O- and C-glycosyltransferase through exchange of active-site motif. Angew. Chem., Int. Ed. 51, 12879ā12883 (2012).
Li, L. et al. The value of safflower yellow injection for the treatment of acute cerebral infarction: a randomized controlled trial. J. Evid.-Based Complementary Altern. Med. 2015, ID 478793 (2015).
Ehrenkranz, J. R. L., Lewis, N. G., Kahn, C. R. & Roth, J. Phlorizin: a review. Diabetes/Metab. Res. Rev. 21, 31ā38 (2005).
Meng, W. et al. Discovery of dapagliflozin: a potent, selective renal sodium-dependent glucose cotransporter 2 (SGLT2) inhibitor for the treatment of type 2 diabetes. J. Med. Chem. 51, 1145ā1149 (2008).
Zhao, Y. et al. Facile synthesis of acacetin and its derivatives. Bioorg. Med. Chem. Lett. 26, 3577ā3580 (2016).
Li, L. et al. Crystal structure of Medicago truncatula UGT85H2āinsights into the structural basis of a multifunctional (iso) flavonoid glycosyltransferase. J. Mol. Biol. 370, 951ā963 (2007).
Modolo, L. V., Escamilla-TreviƱo, L. L., Dixon, R. A. & Wang, X. Single amino acid mutations of Medicago glycosyltransferase UGT85H2 enhance activity and impart reversibility. FEBS Lett. 583, 2131ā2135 (2009).
Zhang, X. L., Wang, Y. N. & Ye, F. Development of a method to study the activity and selectivity of SGLT2 inhibitors. Acta Pharm. Sin. 52, 897ā903 (2017).
Acknowledgements
We acknowledge the National Natural Science Foundation of China (81602999 to K.X.), the Drug Innovation Major Project (2018ZX09711001-001-006 to J.D. and 2018ZX09711001-003-005 to F.Y.), the CAMS Innovation Fund for Medical Sciences (CIFMS-2016-I2M-3-012 to J.D. and F.Y.), PUMC Disciplinary Development of Synthetic Biology (201920100801 to J.D.), and the National Infrastructure of Microbial Resources (NIMR-2017-3 to J.D.).
Author information
Authors and Affiliations
Contributions
K.X., J.D., and F.Y. conceived this project. K.X. conducted the cloning and expression of the GTs gene, protein purification, biochemical kinetics, chemo-biocatalysis, and isolation and structural determination of substrates and products. S.S. contributed to isolate the enzymatic products. X.Z. and F.Y. contributed to bioassay of the SGLT2 inhibitory activity of C-glycosides in vitro and in vivo. K.X. performed the homology modeling, molecular docking, and saturation mutation. All authors discussed the results and wrote the paper.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Peer review information Nature Communications thanks Ren-Wang Jiangm Ben Schumann and Shanteri Singh for their contribution to the peer review of this work. Peer review reports are available.
Publisherās note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Source data
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the articleās Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the articleās Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Xie, K., Zhang, X., Sui, S. et al. Exploring and applying the substrate promiscuity of a C-glycosyltransferase in the chemo-enzymatic synthesis of bioactive C-glycosides. Nat Commun 11, 5162 (2020). https://doi.org/10.1038/s41467-020-18990-9
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41467-020-18990-9
This article is cited by
-
UGT708S6 from Dendrobium catenatum, catalyzes the formation of flavonoid C-glycosides
BMC Biotechnology (2024)
-
Unravelling and reconstructing the biosynthetic pathway of bergenin
Nature Communications (2024)
-
Creating glycoside diversity through stereoselective carboboration of glycals
Nature Communications (2024)
-
Advances in glycosyltransferase-mediated glycodiversification of small molecules
3 Biotech (2024)
-
Engineering Escherichia coli for efficient and economic production of C-glycosylflavonoids by deleting YhhW and regulating pH
Bioprocess and Biosystems Engineering (2023)








