Abstract
N-acyl tyrosines, a prominent class of N-acyl amino acid biomolecules, are produced by selected species in at least three bacterial phyla: Pseudomonadota, Actinomycetota and Planctomycetota. Long-chain N-acyl tyrosines with a characteristic 2,3-dehydrotyrosine core structure and additional taxon-specific chemical modifications were previously reported under the names thalassotalic acids, kyonggic acids and stieleriacines. However, the underlying pathway for their biosynthesis in the different bacterial taxa remains largely unexplored. Here, we focused on the identification of biosynthetic enzymes in the two known stieleriacine-producing planctomycetal strains of the eponymous genus Stieleria. Comparative genome analyses of stieleriacine-, thalassotalic acid- and kyonggic acid producers suggest a common pathway for N-acyl dehydrotyrosine biosynthesis based on conserved genes encoding a putative adenylyltransferase/cyclase, nitroreductase and the hallmark protein N-acyl amino acid synthase (NasY). The targeted deletion of three predicted nasY genes in Stieleria neptunia indicates that one of the three encoded enzymes predominantly produces stieleriacines. We also confirmed the absolute structure of stieleriacine C by synthesis of its epimer and structural derivatives, which serve as the basis for the future investigation of the biological function of N-acyl tyrosines.

Similar content being viewed by others
Introduction
N-acyl amino acids (NAAs) consist of two components, a fatty acid and an amino acid linked by an amide bond, and are widely distributed across various organisms, tissues, and bacterial membranes1,2,3. While simple in their structural components, their structural variability in terms of saturation degree and carbon number renders this class of signaling molecule highly diverse exhibiting species- or context-specific biological functions4.
In humans, NAAs influence, e.g., immune homeostasis, building of fat mass levels, regulation of energy expenditure related to obesity, and affect other processes such as pain, memory, and insulin levels5. One of the structurally most simple NAAs are glycine lipids (GlyLs), which were initially characterized as cytolipin in the gliding bacterium Cytophaga johnsonae (recommended name Flavobacterium johnsoniae) (Fig. 1)6. Since their discovery, GlyLs and their related glycine-serine dipeptido-lipids, known as flavolipins (FLs), have been identified in numerous members of the phylum Bacteroidota, including species associated with the gut and oral microbiomes7,8,9. NAA-derived products from human commensal bacteria are hypothesized to influence G-protein-coupled receptors via chemical mimicry of eukaryotic metabolites10. They have also been shown to affect host behavior, notably enhancing motivation for physical activity through fatty acid amide-dependent activation of the endocannabinoid receptor CB1 and subsequent stimulation of TRPV1 (Transient Receptor Potential Vanilloid 1)-sensory neurons in the host periphery11. Another example of an acylated polar amino acid is volicitin, which consists of a glutamate head group that is connected to a triple-unsaturated fatty acid via an amide linkage and is found in insect saliva and responsible for induction of chemical defenses in plants upon insect grazing12.
Several studies showed that marine bacteria are an important source of structural variants of NAAs13. Aliphatic NAAs (1-3) isolated from the marine γ-proteobacterium Pseudoalteromonas sp. P1-9 were found to have moderate antimicrobial activity14. Tyrosine-derived NAAs, termed thalassotalic acids, have been isolated from the γ-proteobacterium Thalassotalea sp. PP2-45915. A related family of N-acylated tyrosines, called stieleriacines A-C, has been identified in the planctomycete Stieleria maiorica Mal15T. Stieleriacines D and E, that differ from stieleriacines A-C in fatty acid chain length and tyrosine aromatic ring methylation patterns, have been isolated from Stieleria neptunia Enr13T (Fig. 1)16,17. In S. maiorica Mal15T, stieleriacines appear to play a role in microbial interactions by reducing its lag phase and modulating biofilm formation of competing bacteria; thereby influencing microbial community composition strategically. Beyond their ecological role, stieleriacines share notable structural features with NAAs identified through the heterologous expression of environmental DNA in Escherichia coli18,19. Only recently, a third group of structurally related tyrosine and phenylalanine derivatives named kyonggic acids (4-7) have been isolated from Massilia spp.20, while kyonggic acid congeners were described before from heterologous expression of environmental DNA21,22. The three compound families, thalassotalic acids, stieleriacines and kyonggic acids are structurally closely related, but show key differences in acyl chain length and aromatic ring substitution pattern23. In addition, thalassotalic acids A-C, stieleriacines A, B, D and E as well as kyonggic acids 4 and 5 exhibit a double bond at the 2,3-position of the tyrosine moiety and thus belong to the 2,3-dehydroamino acids often found in ribosomally and non-ribosomally-synthesized peptides24. In contrast, kyonggic acids 6 and 7 and stieleriacine C represent saturated derivatives, and could be either representatives of biosynthetic precursors or side products thereof.
NAAs are likely biosynthesized from ligation of an acyl carrier protein (ACP)-activated fatty acid thioester (acyl-ACP) presumably derived directly from primary metabolism, and an amino acid, which can be either a canonical or non-canonical amino acid harboring further modifications2. Dedicated N-acyl amino acid synthases (NASs) catalyze the ligation of the fatty acid to the amino group of the amino acid derivative, forming an amide bond in the resulting product25,26. As an example, in Bacteroides thetaiotaomicron, the biosynthesis of GlyL begins with the N-acylation of glycine with a primary β-hydroxy fatty acid through the N-acyltransferase encoded by the glsB gene27. This reaction produces a mono-acylated amine, such as N-acyl-β-hydroxy-palmitoyl glycine (commendamide), which then undergoes O-acylation (esterification) with a secondary fatty acid, catalyzed by an O-acyltransferase encoded by the glsA gene, yielding a mature diacylated amino acid lipid. While the biosynthesis of many NAAs has been studied, the underlying biosynthetic pathways for the modified NAAs such as thalassotalic acids (Thalassotalea sp.; phylum Pseudomonadota), kyonggic acids (Massilia spp.; phylum Pseudomonadota), and stieleriacines (Stieleria spp.; phylum Planctomycetota) remain to be elucidated.
We initiated our study with a comparative genomics approach to search for biosynthetic gene clusters (BGCs) potentially involved in the formation of the three groups of N-acyl tyrosines. The analysis was guided by genes encoding putative NASs, the hallmark proteins for N-acyl amino acid biosynthesis. The computational analysis was complemented by wet lab studies that included the construction of knock-out mutants in the stieleriacine D- and E-producing S. neptunia Enr13T and by chemical total synthesis of epi-stieleriacine C and non-natural derivatives.
Results and discussion
Comparative genomics of thalassotalic acid, stieleriacine and kyonggic acid producers
For the identification of putative N-acyl phenylalanine/tyrosine-associated BGCs, the genomes of the thalassotalic acids-producer Thalassotalea sp. PP2-459 (NCBI acc. no. GCF_001913705.1), the two stieleriacine-producing planctomycetal strains S. neptunia Enr13T (acc. no. GCF_007754155.1) and S. maiorica Mal15T (acc. no. GCF_008035925.1) as well as the kyonggic acids producer Massilia kyonggiensis TSA1T ( = JCM 19189) (acc. no. GCF_024756235.1) were comparatively analyzed (Supplementary Table 1).
First, genomes were mined for putative NAS-encoding homologous gene sequences, while in a second step genes coding for modifying enzymes, such as O- or C-methyltransferases and oxidoreductases likely responsible for introduction of the double bond (Fig. 2), were taken into consideration. S. maiorica Mal15T that produces stieleriacines A-C bearing a meta-C-methylation on the aromatic ring was expected to encode a dedicated C-methyltransferase, while S. neptunia Enr13T that produces stieleriacines D and E was hypothesized to encode an O-methyltransferase homolog. For comparison, a close relative of M. kyonggiensis, Massilia umbonata LP01T (acc. no. GCF_005280315.1) was included since it produces the saturated kyonggic acid derivatives 6 and 7 that resemble stieleriacine C (10)20 and thus the producer strain was expected to lack the hypothesized nitro-/oxidoreductase involved in the double bond formation.
Our comparative analysis of six genomes revealed several clusters, in which either one, three or four putative NAS-encoding genes are co-located within a certain proximity with putative tailoring genes (Fig. 3) based on our biosynthetic proposal (Fig. 2). The genomes of M. kyonggiensis (producer of kyonggic acids) and Thalassotalea sp. PP2-459 (producer of thalassotalic acids) each contain a gene cluster comprising a single putative NAS-encoding gene co-located with a gene encoding a ThiF family protein (RefSeq locus tags NX783_RS15590 and Bl291_02100, respectively), as well as a gene encoding a putative nitroreductase family protein (NX783_RS15595 and Bl291_02095) (Fig. 2). The nitroreductase family proteins are known to catalyze desaturation reactions, suggesting a functional link to the encoded NAS. This potential relationship is further supported by the operon-like organization of the genes, indicated by short intergenic regions of less than 10 base pairs or even overlapping open reading frames in both, Thalassotalea sp. PP2-459 and M. kyonggiensis. The encoded nitroreductase family proteins show similarity to TyzC (Uniprot entry P95233), which catalyzes the O2-dependent desaturation of N-acyl tyrosine in the C12:0-tyrazolone biosynthesis pathway of Mycobacterium tuberculosis28. The detected ThiF-like adenylyltransferase/cyclase family proteins show similarity to TyzB (Uniprot entry P95234), which catalyzes the ATP-dependent cyclization of N-acyl tyrosine/2,3-dehydrotyrosine yielding the heterocyclic tyrazolones in M. tuberculosis28.
Only genes putatively involved in the biosynthesis of tyrazolones, thalassotalic acids, kyonggic acids, and stieleriacines are shown. The analysis is based on the RefSeq-annotated genomes available from NCBI (locus tags are provided on the arrows). For simplicity, the length and variability of the acyl chain (R) is omitted.
Homologous genes encoding ThiF- and nitroreductase family proteins were also identified within the predicted BGCs of the two Stieleria species known to produce stieleriacines (Fig. 3). In contrast, these two genes are absent in the candidate BGCs of M. umbonata (only producing saturated kyonggic acids), reinforcing the association between their presence and the biosynthesis of compounds containing the 2,3-double bond in the tyrosine moiety via a O2-dependent desaturation. However, the detailed role and biochemistry of ThiF family proteins in the biosynthesis of stieleriacines, kyonggic acids or thalassotalic acids remains enigmatic as yet no acyl-oxazolone derivatives of these three compound families (as present in M. tuberculosis) were observed.
A closer examination of possible BGC candidates in Stieleria spp. revealed one region in S. neptunia (O-methylated stieleriacines) that entails one NAS-encoding gene homolog in proximity to three genes encoding modifying enzymes: a methyltransferase (locus Enr13x_31210), a ThiF family protein (locus Enr13x_31220) and a nitroreductase family protein (locus Enr13x_31230). In contrast, the genome of S. maiorica (C-methylated stieleriacines) harbors a longer region that entails four putative NAS genes within the proximity of the necessary accessory genes, encoding again a methyltransferase (locus Mal15_37250), a ThiF-family protein (locus Mal15_37260), and a nitroreductase family protein (locus Mal15_37310).
To further investigate the biosynthesis of stieleriacines in Stieleria spp., we analyzed the potential substrate specificity of the identified putative biosynthetic enzymes by first constructing a phylogenetic tree based on NAS protein sequences from the genomes under study. However, the resulting clustering primarily reflected the phylogenetic relationships among the strains rather than functional differences, suggesting that redundant or overlapping activities in strains containing multiple NAS enzymes could not be ruled out at this stage (Fig. 4A). We therefore focused on the distinguishing feature of C- versus O-methylation in stieleriacines. To investigate this, a second phylogenetic tree was constructed using sequences of characterized C- and O-methyltransferases (Fig. 4B), along with those located in close proximity to the identified nasY genes. For S. maiorica, the methyltransferase protein sequence clustered with the tyrosine-C3-methyltransferase SfmM2 (saframycin BGC) of Streptomyces lavendulae29, and is due to the related functions hypothesized to be involved in the C-methylation of the stieleriacine precursor (Fig. 2).
A Tree showing the clustering of candidate N-acyl amino acid synthases from Stieleria spp., Thalassotalea sp. PP2-459 and Massilia spp. Since not all proteins were listed in the UniProt database, the NCBI protein accession numbers are shown in brackets instead. B Tree showing the clustering of candidate methyltransferases from S. neptunia and S. maiorica with characterized methyltransferases (in bold). The UniProt entry numbers for all analyzed sequences are shown in brackets. Bootstrap values from 1000 re-samplings are shown on the branches (in %). The tree scale indicates the number of substitutions per amino acid position.
In case of S. neptunia, the candidate methyltransferase encoded in the putative BGC (locus Enr13x_31210) clustered unexpectedly together with the ubiquinione C-methyltransferase UbiE of E. coli and thus was considered a less likely candidate for the biosynthesis of O-methylated stieleriacines. A broader manual targeted genome analysis uncovered an additional alternative candidate (locus Enr13x_12160), which clustered next to a homolog of a tyrosine O-methyltransferase MfnG (marformycin BGC, Streptomyces drozdowiczii)23, and was thus considered a more likely candidate for the O-methylation of the stieleriacine precursor in S. neptunia.
Overall, our findings are consistent with previous reports showing that, in the phylum Planctomycetota, biosynthetic pathways for secondary metabolites are often not encoded by clustered genes. This limits the predictive power of current algorithms, which are primarily trained on well-characterized clusters from distantly related taxa, and thus requires manual curation of datasets and in-depth molecular biological studies for validation.
Analysis of stieleriacine production in wild type and deletion mutants of S. neptunia
Given that the most significant variable in the biosynthetic pathway was the localization and nature of the putative NAS enzymes, we sought to first spearhead the identification of the most likely functional candidate through gene knockout experiments in the genetically tractable S. neptunia. The type strain Enr13T encodes three putative NAS candidates referred to as NasY1, 2 and 3 in the following sections. Single-gene deletion mutants were generated using a double homologous recombination approach. This method replaced the coding region of the targeted nasY gene with a chloramphenicol resistance gene, which served as a positive selection marker. Mutants were named S. neptunia ΔnasY1 (deletion of locus Enr13x_30590), S. neptunia ΔnasY2 (locus Enr13x_31280; in the BGC) and S. neptunia ΔnasY3 (locus Enr13x_41680). The deletion mutants were subsequently analyzed for stieleriacine production using a semi-targeted high-resolution tandem mass spectrometry (MS/MS) analysis and comparison to wild-type production30. Notably, in addition six previously undetected stieleriacine derivatives were identified in the culture broth of wild-type S. neptunia, which clustered within the molecular network of stieleriacines (Fig. 5A). Based on a detailed analysis of their HRMS/MS fragmentation patterns, we were able to propose preliminary structures for these detectable derivatives; however, precise determination of their absolute structures was not yet possible due to their low production levels.
A Molecular network of stieleriacine-related molecular ion features, showing in addition to stieleriacine D and E, six additional mass features of stieleriacine derivatives. Structures of stieleriacine F, H, and I were predicted based on MS/MS fragmentation analysis. Extracted ion chromatograms of B stieleriacine D (m/z 458.326 [M + H]+) and C stieleriacine E (m/z 460.342 [M + H]+) of S. neptunia Enr13T wild type (WT; black), ∆nasY1 (orange), ∆nasY2 (blue) and ∆nasY3 (green).
Extracted ion chromatogram analysis targeting the molecular features of all detectable stieleriacines in the ∆nasY mutants revealed that ∆nasY2 and ∆nasY3 maintained production levels comparable to the wild type (Fig. 5A, B). In contrast, the ∆nasY1 mutant showed a significantly reduced stieleriacine production, with levels of all derivatives approaching the detection limit (Supplementary Figs. 1–8). As we expect that residual production observed in ∆nasY1 may be attributed to genomic redundancy due to the presence of multiple nasY copies, these findings led to the conclusion that NasY1 (locus Enr13x_30590) is the primary enzyme responsible for stieleriacine biosynthesis. These results suggest, once again, that the co-localization of biosynthetic genes in Planctomycetota appears to be more the exception than the rule and future knock-out studies for each individual gene candidate of a biosynthetic pathway using the herein established procedure will be required to fully elucidate the biosynthetic principle in Planctomycetota.
Synthesis of N-acyl tyrosine derivatives and epi-stieleriacine C
To complement future biosynthetic studies on the stieleriacine/kyonggic acid biosynthesis and circumvent the labor-intensive process of isolation from natural sources, we also decided to synthesize a small, yet representative library of NAA derivatives featuring the stieleriacine core structure, including the first synthesis of stieleriacine derivative C as a reliable substitute for the future quantification of stieleriacines31. As starting materials for the amino acid building blocks, commercially available H-Tyr-OMe*HCl, H-Tyr(Me)-OMe*HCl, and synthesized H-Tyr(3-Me)-OMe*HCl were selected. The use of methyl esters was intended to prevent unwanted side reactions that could arise if starting directly from the amino acids. The compounds were acylated using three fatty acids: lauric acid, E-dodec-2-enoic acid, and E-hexadec-2-enoic acid for which two different coupling methods were applied. The first method followed a one-pot procedure described by Johansson et al.31, utilizing 1,1’-Carbonyldiimidazole (CDI) to activate lauric acid. The second method employed coupling reagents for activating unsaturated fatty acids during the coupling step. Using H-Tyr-OMe*HCl as the starting material afforded amides 18–20 in 60–91% yield, whereas H-Tyr(Me)-OMe*HCl gave amides 24–26 in 82–94% yield. Employing H-Tyr(3-Me)-OMe*HCl furnished amides 31–33 in 72–94% yield.
The resulting amides were directly subjected to saponification, achieving nearly quantitative yields of 21–23 and 27–29, while only moderate yields were obtained for 33–35 for yet unknown reasons. However, any attempt to synthesize dehydrotyrosine-derivatives including e.g. stieleriacines A and B from these and other precursors failed despite several different synthetic procedures tested32. Overall, the synthesis of 21 N-acyl amino acids derivatives was achieved, including a stieleriacine C stereoisomer (34).
Comparison of the NMR data for isolated stieleriacine C with the synthetic product 34 and related derivatives revealed strong agreement in chemical shifts (1H, 13C) and coupling patterns, confirming the identity of the isolated compound (Supplementary Table 2). However, a significant discrepancy was observed in the optical rotation values between isolated stieleriacine C (\({[{{{\rm{\alpha }}}}]}_{{{{\rm{D}}}}}^{21}\) = −28 (c = 1 in MeOH)) and the synthesized compound (\({[{{{\rm{\alpha }}}}]}_{{{{\rm{D}}}}}^{20}\) = +28.3 (c = 0.15 in MeOH)). This mismatch suggests an inverted stereocenter at C-2 for the natural product, and the synthesis of epi-stieleriacine C. Comparison of the analytical data of kyonggic acid C with synthetic product 18 revealed strong agreement in chemical shifts (1H, 13C) and coupling patterns, confirming the identity of the isolated compound. In addition, comparison of optical rotation values between isolated kyonggic acid derivative 6 and 7 (\({[{{{\rm{\alpha }}}}]}_{{{{\rm{D}}}}}^{20}\) =+3°) and structurally related synthetic compounds (21–23) (Fig. 6) including previously reported N-myristoyl l-tyrosine exhibited almost exclusively positive values. Thus, we concluded that the natural stieleriacine C is likely composed of a D-amino acid, while kyonggic acids are composed of L-amino acids. At this stage, it remains speculative whether stieleriacine C is derived from L- or D-amino acids. While L-amino acids are generally more abundant in the cellular environment, the biosynthesis of stieleriacine C would require a yet enigmatic additional epimerization step, for example through a reversible oxireductase-mediated transformation (Fig. 2), or alternatively the formation of a yet unidentified tyrazolone-like intermediate, as reported for Mycobacterium tuberculosis (Fig. 3).
Bioactivity
Previous studies have demonstrated that NAAs, including stieleriacines, exhibit a broad range of bioactivities including antimicrobial activity against human opportunistic bacterial pathogens, while N-acylated tyrosine derivatives have been shown to act as inhibitors of tyrosinase20. Selected compounds were tested for antimicrobial activity against a panel of test strains in a standardized disk diffusion assay (Supplementary Table S3). The results revealed that methyl ester derivatives (e.g. 18–20, 24–26) were generally inactive against the panel of test strain, while the respective acid derivatives (21–23, 27–29) showed growth inhibitory activity against Staphylococcus aureus 134/94 (MRSA), Enterococcus faecalis 1528 (VRSA) and Mycobacterium vaccae 10670, although bacterial colonies were observed within the inhibition zones indicating emergence of resistance. It is worth noting that the presence of a para-methoxy group or a free para-hydroxy group had no significant effect on the antimicrobial properties.
Conclusion
The phylum Planctomycetota still poses significant challenges for targeted genome mining efforts as functional biosynthetic components are often dispersed across the genome rather than clustered in canonical biosynthetic gene clusters. Thus, manual genome mining paired with targeted knock-out studies was essential to gain first insights into the biosynthesis of N-acyl tyrosine-derivatives: stieleriacines.
This study provides first concepts on the targeted knock-out of the major NAS-encoding gene of stieleriacine biosynthesis and verifies the predominantly responsible enzyme candidate for stieleriacines in S. neptunia Enr13T. Our molecular biological studies also pave the way for more in-depth molecular biological and biochemical studies of additional enzymes related to the biosynthesis of this compound class in S. neptunia Enr13T specifically, and more generally for the broader natural product family and members of this unexplored phylum Planctomycetota. Moreover, our synthetic approach enabled not only the synthesis of a dedicated compound library that can now be used as internal standards for quantification in future enzymatic and ecological assays but also the determination of the absolute structure of stieleriacine C and its unsual D-amino acid configuration through the synthesis of its epimer.
Methods
Genome analyses and construction of phylogenetic trees
GenBank- or RefSeq-annotated genomes of the analyzed producer strains were downloaded from NCBI (accession numbers provided in the results section) and BGCs were analyzed with antiSMASH v.833 In case that no NAS-encoding (nasY) gene was predicted by antiSMASH, the genome was annotated with eggnog-mapper v2.1.1234 and the gene was identified by manual inspection of the re-annotated genome supported by protein blast analyses. For the construction of phylogenetic trees, protein sequences were aligned with ClustalW and trees were reconstructed with Fasttree 2.2 using the JTT + CAT model35. The trees were visualized with iTOL v636. The accession numbers of the used protein sequences are shown in brackets in the trees.
Cultivation
E. coli TOP10 was used for cloning of plasmids harboring homology regions of S. neptunia and was cultivated in LB medium at 37 °C. Wild-type S. neptunia Enr13T and deletion mutants were cultivated in baffled flasks in M1H NAG ASW medium at 28 °C with shaking37. For cultivation of the deletion mutants, the medium was supplemented with 50 mg/L chloramphenicol, whereas 34 mg/L was used for E. coli during plasmid constructions.
Construction of gene deletion mutants
Gene deletion mutants in S. neptunia Enr13T were constructed using a homologous recombination strategy enforcing two simultaneous crossing-over events based on a previously published protocol38. The expected deletion was checked by PCR-based amplification of the inserted region (chloramphenicol resistance gene) with primers binding outside of the up- and downstream homology regions used for the recombination.
Extraction and UHPLC-ESI-HRMS analysis
After cultivation in three replicates, supernatant and biomass of S. neptunia Enr13T cultures were separated by centrifugation (4500 rpm, 4 °C, 30 min) (Supplementary Notes 1 and 2). The cell pellet was extracted with MeOH, while the culture supernatant was extracted with EtOAc. Organic fractions were combined and dried in vacuo. UHPLC-HESI-HRMS measurements of each sample were carried out on a Vanquish Flex UHPLC system (Thermo Scientific) combined with an Orbitrap Exploris 120 mass spectrometer (Thermo Scientific) equipped with a heated electrospray ionization (HESI) source. Metabolites were separated using reverse phase liquid chromatography at 40 °C using a Kinetex C18 column (50 × 2.1 mm, particle size 1.7 µm, 100 Å, Phenomenex) preceded by a C18 SecurityGuardTM ULTRA guard cartridge (2.1 mm, Phenomenex). Mobile phases consisted of H2O + 0.1% formic acid (buffer A) and acetonitrile + 0.1% formic acid (buffer B). Five µL sample concentrated at 200 µg/mL was injected into a gradient as follows: 0–1 min, 5% B; 1–10 min, 97% B; 10–12 min, 97% B; 12–13 min, 5% B; 13–15 min, 5% B at a constant flow rate of 0.3 mL/min. Data-dependent acquisition of MS2 spectra was performed in positive mode. MS1 full scans were recorded at m/z 150–1500 with a resolving power of 60,000 at m/z 200. Up to four MS2 spectra per MS1 survey scan were recorded with a resolving power of 30,000 at m/z 200 (see Supplementary Note 1 and 2).
Molecular network analysis
A molecular network was created using the Global Natural Products Social Molecular Networking (GNPS) platform30,39. The precursor ion mass tolerance was set to 0.02 Da and an MS/MS fragment ion tolerance of 0.02 Da, while edges were filtered to have a cosine score above 0.8 and more than six peaks. Spectral networks were visualized using Cytoscape 3.10.240.
Organic synthetic methods
For organic synthetic methods and analytical dataset, see Supplementary Note 3, Supplementary Table 2 and Supplementary Figs. 10–41).
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
HRMS/MS data have been deposited on the MassIVE server (MSV000098273; https://doi.org/10.25345/C5F766K86). NMR files deposited on Zenodo (https://doi.org/10.5281/zenodo.15737373).
References
Battista, N., Bari, M. & Bisogno, T. N-acyl amino acids: metabolism, molecular targets, and role in biological processes. Biomolecules 9, 822 (2019).
Bhandari, S., Bisht, K. S. & Merkler, D. J. The biosynthesis and metabolism of the N-acylated aromatic amino acids: N-acylphenylalanine, N-acyltyrosine, N-acyltryptophan, and N-acylhistidine. Front. Mol. Biosci. 8, 2021 (2022).
Liu, S. et al. Aminolipids in bacterial membranes and the natural environment. ISME J. 18, https://doi.org/10.1093/ismejo/wrae229 (2024).
Bhandari, S., Bisht, K. S. & Merkler, D. J. The biosynthesis and metabolism of the N-acylated aromatic amino acids: N-acylphenylalanine, N-acyltyrosine, N-acyltryptophan, and N-acylhistidine. Front. Mol. Biosci. 8, 801749 (2022).
Mannochio-Russo, H. et al. The microbiome diversifies long- to short-chain fatty acid-derived N-acyl lipids. Cell https://doi.org/10.1016/j.cell.2025.05.015 (2025).
Kawazoe, R., Okuyama, H., Reichardt, W. & Sasaki, S. Phospholipids and a novel glycine-containing lipoamino acid in Cytophaga johnsonae Stanier strain C21. J. Bacteriol. 173, 5470–5475 (1991).
Shiozaki, M. et al. Revised structure and synthesis of flavolipin. Tetrahedron 54, 11861–11876 (1998).
Gomi, K., Kawasaki, K., Kawai, Y., Shiozaki, M. & Nishijima, M. Toll-like receptor 4-MD-2 complex mediates the signal transduction induced by flavolipin, an amino acid-containing lipid unique to Flavobacterium meningosepticum. J. Immunol. 168, 2939–2943 (2002).
Chang, F.-Y. et al. Gut-inhabiting Clostridia build human GPCR ligands by conjugating neurotransmitters with diet- and human-derived fatty acids. Nat. Microbiol. 6, 792–805 (2021).
Ryan, E., Joyce, S. A. & Clarke, D. J. Membrane lipids from gut microbiome-associated bacteria as structural and signalling molecules. Microbiology 169, 001315 (2023).
Cohen, J. A. et al. Cutaneous TRPV1 neurons trigger protective innate type 17 anticipatory immunity. Cell 178, 919–932.e14 (2019).
Turlings, T. C. J., Alborn, H. T., Loughrin, J. H. & Tumlinson, J. H. Volicitin, an elicitor of maize volatiles in oral secretion of spodoptera exigua: isolation and bioactivity. J. Chem. Ecol. 26, 189–202 (2000).
MacIntyre, L. W., Charles, M. J., Haltli, B. A., Marchbank, D. H. & Kerr, R. G. An ichip-domesticated sponge bacterium produces an N-acyltyrosine bearing an α-methyl substituent. Org. Lett. 21, 7768–7771 (2019).
Guo, H. et al. Natural products and morphogenic activity of γ-Proteobacteria associated with the marine hydroid polyp Hydractinia echinata. Bioorg. Med. Chem. 25, 6088–6097 (2017).
Deering, R. W. et al. N-acyl dehydrotyrosines, tyrosinase inhibitors from the marine bacterium Thalassotalea sp. PP2-459. J. Nat. Prod. 79, 447–450 (2016).
Kallscheuer, N. et al. The planctomycete Stieleria maiorica Mal15T employs stieleriacines to alter the species composition in marine biofilms. Commun. Biol. 3, 303 (2020).
Sandargo, B. et al. Stieleriacines, N-acyl dehydrotyrosines from the marine planctomycete Stieleria neptunia sp. nov. Front. Microbiol. 11, 1408 (2020).
Brady, S. F., Chao, C. J. & Clardy, J. New natural product families from an environmental DNA (eDNA) gene cluster. J. Am. Chem. Soc. 124, 9968–9969 (2002).
Brady, S. F. & Clardy, J. N-acyl derivatives of arginine and tryptophan isolated from environmental DNA expressed in Escherichia coli. Org. Lett. 7, 3613–3616 (2005).
Steinmetz, T., Lindig, A., Lütz, S. & Nett, M. Molecular networking-guided discovery of kyonggic acids in Massilia spp. Eur. J. Org. Chem. 27, e202400017 (2024).
Lee, C.-M. et al. Characterization of a novel antibacterial N-acyl amino acid synthase from soil metagenome. J. Biotechnol. 294, 19–25 (2019).
Brady, S. F. & Clardy, J. Long-chain N-Acyl amino acid antibiotics isolated from heterologously expressed environmental DNA. J. Am. Chem. Soc. 122, 12903–12904 (2000).
Liu, J. et al. Biosynthesis of the anti-infective marformycins featuring pre-NRPS assembly line N-formylation and O-methylation and post-assembly line C-hydroxylation chemistries. Org. Lett. 17, 1509–1512 (2015).
Siodłak, D. α,β-Dehydroamino acids in naturally occurring peptides. Amino Acids 47, 1–17 (2015).
Haeger, G., Wirges, J., Bongaerts, J., Schörken, U. & Siegert, P. Perspectives of aminoacylases in biocatalytic synthesis of N-acyl-amino acids surfactants. Appl. Microbiol. Biotechnol. 108, 495 (2024).
Kua, G. K. B., Nguyen, G. K. T. & Li, Z. Enzymatic strategies for the biosynthesis of N-acyl amino acid amides. Chembiochem 25, e202300672 (2024).
Lynch, A., Tammireddy Seshu, R., Doherty Mary, K., Whitfield Phillip, D. & Clarke David, J. The glycine lipids of Bacteroides thetaiotaomicron are important for fitness during growth in vivo and in vitro. Appl. Environ. Microbiol. 85, e02157–18 (2019).
Grigg, J. C. et al. Deciphering the biosynthesis of a novel lipid in Mycobacterium tuberculosis expands the known roles of the nitroreductase superfamily. J. Biol. Chem. 299, 104924 (2023).
Li, L. et al. Characterization of the saframycin A gene cluster from Streptomyces lavendulae NRRL 11002 revealing a nonribosomal peptide synthetase system for assembling the unusual tetrapeptidyl skeleton in an iterative manner. J. Bacteriol. 190, 251–263 (2008).
Wang, M. et al. Sharing and community curation of mass spectrometry data with Global Natural Products Social Molecular Networking. Nat. Biotechnol. 34, 828–837 (2016).
Johansson, S. J. et al. A convenient protocol for the synthesis of fatty acid amides. Synlett 30, 213–217 (2019).
Schulz, J. M., Lanovoi, H. T., Ames, A. M., McKegg, P. C. & Patrone, J. D. Concise modular synthesis of thalassotalic acids A-C. J. Nat. Prod. 82, 1045–1048 (2019).
Blin, K. et al. antiSMASH 8.0: extended gene cluster detection capabilities and analyses of chemistry, enzymology, and regulation. Nucleic Acids Res. gkaf334, https://doi.org/10.1093/nar/gkaf334 (2025).
Cantalapiedra, C. P., Hernández-Plaza, A., Letunic, I., Bork, P. & Huerta-Cepas, J. eggNOG-mapper v2: functional annotation, orthology assignments, and domain prediction at the metagenomic scale. Mol. Biol. Evol. 38, 5825–5829 (2021).
Price, M. N., Dehal, P. S. & Arkin, A. P. FastTree 2—approximately maximum-likelihood trees for large alignments. PLoS ONE 5, e9490 (2010).
Letunic, I. & Bork, P. Interactive Tree of Life (iTOL) v6: recent updates to the phylogenetic tree display and annotation tool. Nucleic Acids Res. 52, W78–W82 (2024).
Boersma, A. S. et al. Alienimonas californiensis gen. nov. sp. nov., a novel Planctomycete isolated from the kelp forest in Monterey Bay. Antonie van Leeuwenhoek 113, 1751–1766 (2020).
Haufschild, T. et al. Novel tools for genomic modification and heterologous gene expression in the phylum Planctomycetota. Appl. Microbiol. Biotechnol. 109, 79 (2025).
Leao, T. F. et al. Quick-start infrastructure for untargeted metabolomics analysis in GNPS. Nat. Metab. 3, 880–882 (2021).
Shannon, P. et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504 (2003).
Acknowledgements
The authors would like to thank Huijuan Guo for initial work on this study. This work was funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Project-ID 239748522—CRC 1127 ChemBioSys. C.B. greatly acknowledges funding from the European Union’s Horizon 2020 research and innovation program (ERC Grant number: 802736, MORPHEUS).
Funding
Open Access funding enabled and organized by Projekt DEAL.
Author information
Authors and Affiliations
Contributions
Ma.Sa. performed the synthesis of the NAAs and characterization, My.St. cultivated the strains and contributed to the genome analyses, S.B. performed the comparative analysis of knock-out strains, N.K. constructed the deletion mutants, analyzed the genomes for suitable BGCs and performed the comparative genomics, C.J. and C.B. supervised the study and have written the manuscript. All authors designed figures and contributed to the manuscript text and agree with the submitted version.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests. Dr. Huijuan Guo, named in the acknowledgements, is a Senior Editor for Communications Chemistry, but was not involved in the editorial review of, or the decision to publish this article.
Peer review
Peer review information
Communications Chemistry thanks Giang Kien Truc Nguyen and the other, anonymous, reviewers for their contribution to the peer review of this work. A peer review file is available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Sauer, M., Staack, M., Balluff, S. et al. Investigations into the biosynthesis of stieleriacines and related N-acyl tyrosines by comparative genomics, knock-out studies and total synthesis of epi-stieleriacine C. Commun Chem 8, 283 (2025). https://doi.org/10.1038/s42004-025-01654-4
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s42004-025-01654-4