Multi-omics profiling reveals atypical sugar utilization and a key membrane composition regulator in Streptococcus pneumoniae

de Bakker, Vincent; Liu, Xue; Tang, Jonah; Barbisan, Matthew; Baker, Jonathon L.; Veening, Jan-Willem

doi:10.1038/s41467-025-66611-0

Article
Open access
Published: 21 November 2025

Nature Communications volume 16, Article number: 10429 (2025)

Abstract

The human body comprises many different microenvironments, each with their own challenges for microorganisms to overcome in order to survive and possibly cause infection. The human pathogen Streptococcus pneumoniae is notoriously flexible in this regard, and can adapt to a wide range of host niches, including the nasopharynx, lungs, and cerebrospinal fluid. However, the molecular and genetic foundations of this ability remain largely uncharted. In this work, we demonstrate that niche adaptation imposes genome-wide changes on multiple levels, including gene essentiality, expression and membrane lipid composition, by using infection-mimicking growth conditions. In general, we show that gene expression and fitness profiling couple orthogonal sets of genes to environmental stimuli. For instance, import (manLMN) and catabolism (nagAB) genes are required, but not differentially expressed during growth on N-acetylglucosamine (GlcNAc), opposite to the pattern of other amino sugar metabolism pathways. Surprisingly, we find that pneumococci do not necessarily prefer glucose over GlcNAc and that uptake of GlcNAc in the absence of subsequent catabolism is toxic. Moreover, we identify a previously overlooked fatty acid saturation regulator, FasR, controlling membrane composition, rendering it important during heat stress. As nutrient availability and temperature fluctuations are distinctive facets of infection environments, these findings may inform anti-infective strategies.

Introduction

The human body is the natural habitat of many microorganisms. Various parts of the body impose distinct environmental conditions, such as differences in nutrient availability, surrounding temperature, or host immune responses. Nevertheless, some organisms are highly adaptable and manage to survive and grow in multiple human niches. Among these is the bacterium Streptococcus pneumoniae (the pneumococcus), a commensal of the human nasopharynx that can cause severe disease states upon invasion of other niches, such as pneumonia in the lungs, or meningitis in the cerebrospinal fluid^1,2,3. As such, pneumococci are the main cause of lower respiratory tract infections worldwide, and are associated with most deaths in children under five years old^4,5.

This high degree of niche flexibility is genomically well reflected by the large number of carbohydrates the pneumococcus can use as a carbon source⁶. Indeed, this obligate fermenter has an arsenal of 28 transport systems to consume at least 32 different carbon substrates⁷. Many of these systems display a two-way redundancy: substrates can often enter the cell through multiple importers, and many transporters can import multiple substrates⁷. Of these transporters, 21 are phosphoenolpyruvate–carbohydrate phosphotransferase systems (PTS), which canonically also function in carbon catabolite repression (CCR): the classic process through which bacteria shut down the import and metabolism of other carbon sources in the presence of a preferred one, conventionally glucose^8,9. Indeed, pneumococci readily take up glucose when provided while growing on galactose, mannose, or N-acetylglucosamine (GlcNAc)¹⁰. Of these sugars, galactose is consumed last, and while GlcNAc is used first, mannose can be taken up simultaneously¹¹. Mannose and GlcNAc are both taken up through the PTS ManLMN, which also imports glucose and appears to be a key component of pneumococcal CCR^7,12. Central to this process is the catabolite control protein A (CcpA), which interacts with a PTS component to transcriptionally regulate carbon metabolism genes and has been shown to affect transcript levels of up to 19% of pneumococcal genes¹³. Given this dominant regulatory role, it is unsurprising that ccpA and manLMN are important for virulence, highlighting the importance of metabolic regulatory systems to maintain fitness in changing host niches^14,15,16,17.

This is also true for lipid metabolism, where FabT controls expression of the type II fatty acid synthesis (FASII) genes in S. pneumoniae¹⁸. An important exception is fabM, required for the production of unsaturated fatty acids, whose regulation remains obscure^19,20. On top of de novo biosynthesis, pneumococci are also capable of exogenous (e.g., host-derived) fatty acid acquisition through the FakAB system^21,22,23. Whether synthesized or imported, fatty acids differ in acyl chain length and saturation degree, and can be incorporated into the plasma membrane as key building blocks of phospholipids²⁴. So, the bacteria can modulate their membrane composition by regulating relative intracellular availability of different fatty acid types, in turn affecting membrane properties such as viscosity or thickness²⁵. Since these properties are also affected by environmental factors, like temperature, the regulation of fatty acid metabolism to maintain membrane homeostasis is a crucial niche adaptation strategy²⁶. Indeed, the textbook example of this phenomenon is homeoviscous adaptation, in which cells adjust the saturated to unsaturated fatty acid (SFA:UFA) ratio in the membrane to counteract temperature-mediated changes in fluidity²⁷.

Given the hyperadaptive lifestyle of S. pneumoniae, a significant part of its genome may be expected to be dedicated to such niche adaptation mechanisms. However, many pneumococcal genes remain of unknown function (as much as 14% of all genes for strain D39V, NCBI accession CP027540^28,29), in part exacerbated by its extensive pangenome³⁰. We have previously examined transcriptomes in several infection-relevant growth conditions to infer gene function and probe genotype-phenotype links at scale³¹. Although insightful, we note that transcriptional changes are generally only moderately reflected on the protein level^32,33,34,35, and are not necessarily informative on gene essentiality^{36,37,38,39,40,41}. The latter has previously been quantified on a genome-wide level using Tn-seq and CRISPRi-seq approaches, providing gene function insights as well^42,43. However, genome-wide Tn-seq coupled to RNA-seq studies in S. pneumoniae indeed showed poor links between transcriptional stress responses and gene essentiality³⁶.

Here, we address these shortcomings by measuring gene expression on both the transcriptomic and proteomic levels, and by interrogating both fitness loss and gain effects of nearly all S. pneumoniae D39V genes using CRISPRi-seq⁴⁴. Since this includes baseline essential genes (in contrast to Tn-seq), it allows us to draw comprehensive comparisons between expression levels of all genes and their impact on the bacterium’s fitness, and link these to distinct growth conditions. In this way, we show that niche adaptation happens on all these regulatory levels, and that expression and fitness indeed provide orthogonal, complementary sets of information on the cell state. Specifically, we show that these data can shed light on concrete molecular processes by delving into an unexpected requirement for N-acetylglucosamine catabolism, even in the presence of glucose, and a hitherto overlooked putative membrane composition regulator. Together, these results provide a systems-level perspective of pneumococcal niche adaptation and lay the groundwork for extensive gene function studies.

Results

Infection-relevant conditions impose distinct fitness landscapes

Previously, we described pneumococcal transcriptomic profiles for a set of 22 infection-relevant growth conditions, based on chemically defined media³¹. Here, we set out to assess how these differences in expression relate to the importance of those genes for growth in each environment. To this end, we generated genome-wide fitness profiles using CRISPRi-seq with an IPTG-inducible dCas9-sgRNA library in a subset of growth conditions that previously showed clearly distinct transcriptomes⁴³.

The selected conditions aim to simulate the host environment as encountered by pneumococci in the nasopharynx (as in colonization), lungs (pneumonia), blood (bacteremia) or cerebrospinal fluid (meningitis), mostly in terms of temperature, acidity, and nutrient availability (Supplementary Table 1)³¹. We also included the commonly used complex media THY and C+Y, the latter both with and without chemically inducing competence by the addition of synthetic competence-stimulating peptide 1 (CSP-1), whose transcriptional responses have been well characterized^45,46,47.

We observed clear condition-specific fitness landscapes among all conditions tested (Fig. 1a), with 362 operons (corresponding to 615 genes) causing conditional fitness effects (∆|log₂FC| > 1, P_adj < 0.05, Fig. 1b), and a core essentialome of 139 operons (259 genes) (log₂FC < −1, P_adj < 0.05, Supplementary Fig. 1a, Supplementary Data 1). We made these essentiality calls easily accessible through our recently updated online genome browser PneumoBrowse 2 (https://veeninglab.com/pneumobrowse)²⁹. Apart from the CRISPRi induction effect itself (PC1), the largest source of fitness variation was associated with the difference between complex and defined media (PC2, Fig. 1a). Indeed, most of the genes driving this distinction had metabolic functions, such as members of the shikimate pathway, enabling aromatic amino acid biosynthesis, and the AmiACDEF oligopeptide permease system (Fig. 1b). Specifically, the former were exclusively essential in the chemically defined media, whereas the latter showed the opposite profile, suggesting that bacteria scavenge these amino acids in the form of peptides under nutrient-rich conditions, as previously reported⁴⁸. These patterns were not surprising, since nutrient availability constitutes the biggest difference between the growth conditions used here (Supplementary Table 1). In particular, the presence of one key nutrient appeared to cause clear, conditional growth phenotypes: the amino sugar N-acetylglucosamine (GlcNAc, Fig. 1b). We next sought to validate the corresponding gene hits.
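The thresholds quoted above can be illustrated with a minimal sketch. It assumes that essentiality corresponds to significant sgRNA depletion (log₂FC below −1 with P_adj < 0.05) and uses a simplified rule: an operon is "core essential" if it passes in every condition and "conditionally essential" if it passes in only some. The conditional calls in the text are based on differences in |log₂FC| between conditions, so this is not the statistical procedure used for Fig. 1b. The class `FitnessCall`, the function names, and all numbers are hypothetical.

```python
from dataclasses import dataclass

LFC_CUTOFF = 1.0    # |log2FC| threshold quoted in the text
PADJ_CUTOFF = 0.05  # adjusted P value threshold quoted in the text


@dataclass(frozen=True)
class FitnessCall:
    """Hypothetical per-operon, per-condition CRISPRi-seq result."""
    log2fc: float  # induced vs. uninduced sgRNA abundance
    padj: float    # multiple-testing-adjusted P value


def is_essential(call: FitnessCall) -> bool:
    """Significant fitness loss: sgRNA depleted more than two-fold."""
    return call.log2fc < -LFC_CUTOFF and call.padj < PADJ_CUTOFF


def classify_operon(calls: dict[str, FitnessCall]) -> str:
    """Label an operon from its calls across conditions (simplified rule)."""
    flags = [is_essential(c) for c in calls.values()]
    if all(flags):
        return "core essential"
    if any(flags):
        return "conditionally essential"
    return "dispensable"


# Made-up numbers for illustration only; not taken from Supplementary Data 1.
example = {
    "C+Y": FitnessCall(log2fc=-0.2, padj=0.80),
    "NMC": FitnessCall(log2fc=-3.1, padj=0.001),
    "LMC": FitnessCall(log2fc=-2.7, padj=0.004),
}
print(classify_operon(example))  # conditionally essential
```

An operon with this made-up profile would resemble the nagA/nagB pattern described in the following section: dispensable in complex medium but required where GlcNAc is the only sugar.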

**Fig. 1: Genome-wide fitness profiles by CRISPRi-seq.**

GlcNAc catabolism is essential upon import despite glucose availability

The genes nagA and nagB encode the two proteins responsible for the breakdown of GlcNAc, feeding into glycolysis via D-fructose 6-phosphate (Fig. 1b)^11,49. Accordingly, both genes were exclusively essential in the nasopharynx- and lung-mimicking conditions (NMC/LMC), where this was the only sugar added to the medium (Fig. 1b, Supplementary Table 1). To test if we could indeed change the fitness status of these genes from dispensable to essential in C+Y as well, we replaced its regular sugars, glucose and sucrose, with GlcNAc. Although we could confirm the effect even in this complex medium with growth curves of knockout and aTc-inducible complementation strains, we noticed these knockout strains also displayed a severe growth defect on an equimolar mix of both GlcNAc and glucose (Fig. 2a). This growth defect was reproducible and dependent on GlcNAc concentration (Supplementary Fig. 1b). This result was surprising, because glucose is commonly believed to be the preferred carbon source in S. pneumoniae, which boasts a classic CcpA-based carbon catabolite repression system^10,13. In addition, neither nagA nor nagB has any known role in glucose uptake or catabolism. So, in the presence of sufficiently high glucose levels, as in this 1:1 molar sugar mix, we would expect all strains to grow well, including the mutants.

**Fig. 2: N-acetylglucosamine (GlcNAc) catabolism genes *nagA* and *nagB* are essential on a sugar mix with glucose (Glc).**

We therefore decided to measure changes in the glucose concentration in the media during exponential growth using glucose oxidase assays, as a proxy for glucose uptake. Whereas all strains readily depleted the medium of all glucose when no GlcNAc was added, the ∆nagA and ∆nagB mutants failed to do so in the same time window when both sugars were present (Fig. 2a). Moreover, even in wild-type (WT) bacteria glucose uptake was slower when grown on the sugar mix compared to the glucose-only medium, suggesting GlcNAc presence inhibits glucose import (Fig. 2a). We validated this result by repeating the assay measuring every 20 min instead of every 2 h, and found a consistent 2-h delay in complete glucose depletion in the presence of GlcNAc (Fig. 2b). Despite this considerable slowdown in glucose uptake, supplementation with GlcNAc did not impact the WT growth rate (Supplementary Fig. 1c). This implies that the bacteria take up GlcNAc even in the presence of glucose, making up for the apparent reduced sugar import to fuel growth. These results suggest that GlcNAc competes for import with glucose, which might therefore not necessarily be the preferred carbon source for S. pneumoniae.

Since GlcNAc versus glucose uptake seemed competitive, and ∆nagA/∆nagB mutants did not appear to import glucose, we hypothesized that their growth defect on the sugar mix was likely caused by the inability to break down GlcNAc after import. Limiting intracellular GlcNAc concentrations by knocking out its importer should then alleviate the growth defect. The main GlcNAc importer has previously been reported to be the phosphotransferase system ManLMN, which was indeed conditionally, albeit not exclusively, essential in NMC and LMC in our CRISPRi-seq assay (Fig. 1b)^7,50. Additionally, manLMN deletion caused a strong fitness defect on a GlcNAc-only medium, further confirming its importance in GlcNAc consumption (Fig. 2c). However, on the sugar mix, manLMN deletion drastically increased ∆nagA and ∆nagB viability, in line with our hypothesis (Fig. 2c). Moreover, we performed a suppressor screen by plating both ∆nagA and ∆nagB mutants on agar containing both sugars. We isolated 20 colonies, 10 for each parent strain, all of which phenocopied the partial rescue of the double knockout strains in liquid (Supplementary Fig. 1d). Sequencing results of 10 of these isolates, five for each parent strain, revealed that all of them were mutated in the manLMN locus, indeed suggesting selection for GlcNAc import reduction (Fig. 2d).

Taken together, our results reveal an unusual sugar preference in S. pneumoniae and imply the presence of toxic intermediates in the GlcNAc catabolic pathway. Since all intermediates are sugar phosphates (Fig. 2e), and the toxicity is seen in both the ∆nagA and ∆nagB mutants (Fig. 2a, c), we speculate this could be a form of sugar-phosphate stress, as has been described for other species and sugars⁵¹.

Heat stress imposes specific genetic requirements

Although metabolic effects dominated the conditional fitness profiles by separating complex media from defined ones, the FEVER condition appeared more distinct (Fig. 1a, b). Since bacteria in this condition were grown at an elevated temperature of 40 °C, we sought to explore whether this could explain FEVER-specific effects. To this end, we compared it with growth in CSFMC: the same medium, but at 37 °C (Supplementary Table 1). Although growth was approximately three times slower at 40 °C compared to 37 °C, we ensured all of these samples were grown for the same pooled average of seven generations to allow the CRISPRi strains to compete for an overall equal number of cell divisions (Supplementary Table 1, Supplementary Data 1).
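As a worked illustration of the generation matching described above: the number of generations follows from the fold increase in cell number, n = log₂(N_end / N_start), so seven generations correspond to a 128-fold expansion regardless of how long it takes to get there. The sketch below assumes optical density is proportional to cell number, which is a simplification; the function names and OD values are hypothetical and not taken from Supplementary Table 1.

```python
import math


def generations(od_start: float, od_end: float) -> float:
    """Number of doublings, assuming OD scales linearly with cell number."""
    if od_start <= 0 or od_end <= 0:
        raise ValueError("optical densities must be positive")
    return math.log2(od_end / od_start)


def fold_increase(n_generations: float) -> float:
    """Fold expansion of the population after n doublings."""
    return 2.0 ** n_generations


def pooled_average(per_sample_generations: list[float]) -> float:
    """Mean number of generations across the samples of one condition."""
    if not per_sample_generations:
        raise ValueError("need at least one sample")
    return sum(per_sample_generations) / len(per_sample_generations)


# Hypothetical OD readings for two growth passages of one sample.
passages = [(0.005, 0.08), (0.01, 0.08)]
total = sum(generations(start, end) for start, end in passages)
print(round(total, 2), fold_increase(7))
```

Matching on generations rather than on elapsed time means that a slower-growing culture is simply grown for longer, so that depleted sgRNAs have had the same number of divisions in which to drop out.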

Strikingly, we uniquely observed fitness gain effects for the knockdown of certain genes in FEVER (Fig. 1b). Among the strongest of these was arginine metabolism regulator argR1 (Figs. 1b and 3a). ArgR1 functions as a dimer with AhrC, whose gene indeed had a similar effect on fitness⁵². Since these two genes are located more than 780 kbp apart, this cannot be due to CRISPRi polar effects. Most other strong fitness gains were achieved through repression of genes neighboring ahrC (xseA, xseB, ipsA-spv_1064, recN), which we therefore expect to be due to polarity (Fig. 3a, Supplementary Data 1). In addition, arginine is the precursor for polyamine biosynthesis in S. pneumoniae, and most of the genes involved in this pathway also showed mild fitness gain effects upon repression, of which four significantly so (log₂FC > 1, P_adj < 0.05, Figs. 1b and 3a, Supplementary Data 1)⁵³. Knockdown of these genes thus likely increases intracellular arginine levels, an effect also associated with ahrC and argR1 repression⁵². Together, these results suggest arginine retention is favorable for S. pneumoniae at higher temperatures, although this remains to be investigated more deeply.

**Fig. 3: *spv_0647* is important during heat stress and functionally related to genes modulating fatty acid saturation levels.**

As expected, repression of genes encoding known heat-shock proteins such as groEL/ES, ftsH, and the hrcA-grpE-dnaK-spv_2171-dnaJ operon resulted in strong fitness losses (Fig. 3a, Supplementary Data 1)⁵⁴. The strongest FEVER-specific fitness loss was, however, caused by a two-gene operon hitherto not associated with heat stress: comEB-spv_0647 (Fig. 3a). We first validated this result with an operon-based deletion and complementation strain, and subsequently confirmed that deletion of spv_0647, but not comEB, causes growth defects at higher temperatures (Supplementary Fig. 2a).

SPV_0647 is critical for membrane homeostasis at high temperatures

Since spv_0647 encodes a hypothetical, putative transcriptional regulator (TetR family), we made a clean knockout strain and performed RNA-seq comparing its transcriptome to that of WT at two different temperatures, in order to uncover its potential regulon (Supplementary Data 2). To limit toxic side effects, and because the growth defect was already visible at 37 °C, we opted to draw the comparison between this temperature and 30 °C, where the growth defect of the mutant was virtually absent (Supplementary Fig. 2a).

Indeed, the transcriptomes diverged substantially at 37 °C, confirming a temperature-dependent effect (Fig. 3b). While many more genes were differentially expressed at 37 °C, spv_0647 mRNA was always depleted in the mutant regardless of temperature, as expected (Fig. 3c). Similarly, multiple genes involved in fatty acid metabolism were also consistently downregulated, most notably fabM, fakB3, and fatty acid biosynthesis cofactor biotin transporter bioY (Fig. 3c).

We argued that dysregulation of fatty acid metabolism could alter membrane composition, affecting properties such as its permeability and fluidity. As temperature changes themselves are also known to affect these properties, we hypothesized that the growth defect of the mutant at higher temperatures is the result of an interaction between heat stress and fatty acid dysregulation, compromising membrane integrity. In turn, membrane weakening would likely disrupt many downstream processes, especially those involving transmembrane transport or membrane-anchored proteins. Changes in, for instance, protein localization or in- and efflux of nutrients and signaling molecules could stimulate or inhibit sensing pathways, potentially offering an explanation for the massive, global transcriptome divergence we observed at 37 °C, including pyrimidine biosynthesis (PyrR), bacteriocin production (Blp), and multiple sugar metabolism loci (maltosaccharide, cellobiose, beta-glucoside; Fig. 3c).

Fatty acid biosynthesis in S. pneumoniae is controlled by FabT, regulating expression of the FASII operon¹⁸. However, the first gene of the locus, fabM, is not controlled by FabT¹⁹. FabM balances substrate availability for the production of unsaturated versus saturated fatty acids, and a mutant cannot synthesize unsaturated variants^20,55. Although pneumococci can only make mono-unsaturated fatty acids, they are able to take up exogenous poly-unsaturated fatty acids through binding of FakB3 and subsequent phosphorylation by FakA, making these species also available for membrane incorporation^{21,23,56,57,58}. As such, downregulation of either fabM or fakB3 should yield relatively higher saturated to unsaturated fatty acid ratios (SFA:UFA) in the membrane, which is indeed known to be a major factor affecting membrane properties like fluidity^26,27. Moreover, fakB3 is located directly upstream of spv_0647 on the chromosome, in antisense orientation (Fig. 3d). Since the genes encoding transcriptional regulators are often situated adjacent to the genes they control, it is tempting to speculate on a potential fakB3-regulating role for SPV_0647. Of note, transcriptional repression of neither fakA/B3 nor fabM resulted in significantly decreased fitness at 40 °C (Fig. 3a), implying these genes are not individually responsible for the heat-sensitive phenotype.

To get more clues regarding SPV_0647 function, we compared its predicted folding to determined crystal structures in the RCSB Protein Data Bank (PDB) using FoldSeek⁵⁹. Strikingly, we found that despite strong sequence dissimilarities, the protein is predicted to have a structure similar to multiple known TetR-like lipid metabolism transcriptional regulators, including activators (Table 1, Supplementary Fig. 2b, Supplementary Data 3). These hits point to a role of SPV_0647 in fatty acid regulation.

Table 1 FoldSeek relevant top hits with similar folding to the predicted SPV_0647 structure

We next revisited our recently published S. pneumoniae D39V genome-wide genetic interaction data generated by dual CRISPRi-seq⁶⁰, and found negative interactions of the spv_0647-comEB operon with fakA and fabM (Fig. 3e). Strikingly, this effect was not observed for the other two sgRNAs targeting the directly downstream fabT and fabK in the FASII locus. This implies the interaction only involves fabM. Furthermore, asp23 displayed a similar negative interaction, which is likely a polar effect, given its location directly upstream of fakA without intermediate terminator²⁸. The negative interaction with spv_1145, a dNTP triphosphohydrolase, is likely due to the polar knockdown of comEB, a dCMP deaminase. Lastly, the genetic interaction with spv_1294 implies a potential role for the encoded hypothetical protein in either DNA or fatty acid metabolism. These results suggest spv_0647 might have a function in FakA- and FabM-related processes, influencing fatty acid saturation levels in the membrane.

FasR (SPV_0647) controls fatty acid saturation balance mediated by FabM

To test whether the growth phenotype was brought about by fabM or fakB3 downregulation (Fig. 3c), we tried to rescue the bacteria by overexpression of either or both genes in the ∆spv_0647 mutant. fakB3 overexpression improved growth marginally at best, which was not surprising, as the growth medium did not contain an explicit excess of polyunsaturated fatty acids. However, fabM overexpression clearly restored growth (Fig. 4a). This implies that generating a surplus of (substrate for) unsaturated fatty acids rescues the mutant, which might thus lack sufficient levels of these fatty acid species.

**Fig. 4: *fasR* (*spv_0647*) affects heat resistance by modulating membrane composition.**

Finally, we aimed to assess such changes in the composition of the membrane itself. Using gas chromatography–mass spectrometry of fatty acid methyl esters (GC–FAME), we detected eight fatty acid species in the membranes of WT, mutant, complementation and overexpression strains grown to mid-exponential phase at 30, 37, and 40 °C (Supplementary Fig. 2c, Supplementary Data 4), and found that temperature affected the obtained composition profiles in a strain-dependent fashion (compositional ANOVA P < 0.05, Fig. 4b, Supplementary Fig. 2c). Indeed, they were similar at 30 °C but diverged at higher temperatures (Supplementary Fig. 2d), matching the growth and transcriptome phenotypes observed before (Supplementary Fig. 2a, Fig. 3b). Temperature increase was generally associated with a slight decrease in average acyl chain length, but specifically in the ∆spv_0647 mutant with an inability to maintain unsaturated fatty acid levels (Fig. 4c), in line with fabM downregulation (Fig. 3c). This phenotype could not only be rescued by spv_0647 complementation, but also by fabM overexpression (Fig. 4b, c, Supplementary Fig. 2c). These results correspond exactly to the growth phenotypes we observed for the same strains (Fig. 4a).
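To make the two summary quantities in this paragraph concrete, the sketch below computes the SFA:UFA ratio and the abundance-weighted mean acyl chain length from a fatty acid profile. It assumes species are labeled in the common lipid shorthand "C<carbons>:<double bonds>" (for example C16:0 or C18:1) and that abundances are relative amounts on a shared scale. The function names and the example profile are hypothetical, the numbers are not from Supplementary Data 4, and this simple ratio is not the compositional ANOVA referred to above.

```python
def parse_species(name: str) -> tuple[int, int]:
    """Split lipid shorthand such as 'C18:1' into (carbons, double bonds)."""
    carbons, double_bonds = name.upper().lstrip("C").split(":")
    return int(carbons), int(double_bonds)


def summarize(profile: dict[str, float]) -> dict[str, float]:
    """Summarize a fatty acid profile given as {species: relative abundance}."""
    total = sum(profile.values())
    if total <= 0:
        raise ValueError("profile must contain positive abundances")
    # Saturated species have zero double bonds; everything else is unsaturated.
    sfa = sum(v for k, v in profile.items() if parse_species(k)[1] == 0)
    ufa = total - sfa
    mean_length = sum(parse_species(k)[0] * v for k, v in profile.items()) / total
    return {
        "sfa_fraction": sfa / total,
        "ufa_fraction": ufa / total,
        "sfa_to_ufa": sfa / ufa if ufa > 0 else float("inf"),
        "mean_chain_length": mean_length,
    }


# Made-up profile for illustration only.
profile = {"C16:0": 40.0, "C16:1": 20.0, "C18:0": 10.0, "C18:1": 30.0}
summary = summarize(profile)
print(summary["sfa_to_ufa"], summary["mean_chain_length"])
```

Under this definition, a strain that cannot maintain unsaturated fatty acid levels at higher temperature would show a rising `sfa_to_ufa` value, which is the direction of change described for the ∆spv_0647 mutant.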

Together, these findings support a model in which SPV_0647 confers heat resistance by modulating the SFA:UFA balance in the membrane through positive regulation of fabM and possibly fakB3. We have therefore renamed SPV_0647 to FasR, for fatty acid saturation regulator.

Expression differences do not reflect fitness effects

We next wanted to know to what extent these and other differential fitness requirements translate to expression patterns. To this effect, we measured both the transcriptome and proteome of S. pneumoniae D39V WT grown in six of the infection-mimicking conditions in which CRISPRi-seq was performed (Supplementary Table 1). Using RNA-seq and quantitative, label-free LC–MS, we detected 2146 different RNA species and 870 proteins, of which we retained 2140 and 736, respectively, following standard normalization and imputation methods (Supplementary Data 5). Small, lowly abundant, and membrane proteins were underrepresented in the proteomics measurements (Supplementary Fig. 3a–c). Normalized quantifications of these data were also made accessible in PneumoBrowse 2²⁹.

In general, transcriptome-proteome correlations resembled those reported before for other organisms^32,35. Briefly, transcript and protein levels correlated moderately (R² = 0.37–0.51) within growth conditions (Supplementary Fig. 3d), and protein levels tended to increase with transcript levels of individual genes across conditions (Supplementary Fig. 3e). Intuitively, this effect was strongest for genes of which both transcript and protein products were differentially enriched between at least two growth conditions, indicating tighter mRNA-protein regulation (Supplementary Fig. 3f).
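For readers who want to reproduce this kind of within-condition comparison on their own data, the sketch below computes R² as the squared Pearson correlation between paired transcript and protein abundances, which equals the coefficient of determination of a simple linear regression. Whether the reported values were computed in exactly this way, or on which scale (for example log-transformed abundances), is not stated in this section, so that part is an assumption. The function names and input values are hypothetical.

```python
import math


def pearson_r(x: list[float], y: list[float]) -> float:
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for constant input")
    return sxy / math.sqrt(sxx * syy)


def r_squared(x: list[float], y: list[float]) -> float:
    """Squared Pearson correlation (R^2 of a simple linear fit)."""
    return pearson_r(x, y) ** 2


# Made-up log-scale abundances for four genes in one growth condition.
transcripts = [1.0, 2.0, 3.0, 4.0]
proteins = [1.0, 3.0, 2.0, 4.0]
print(round(r_squared(transcripts, proteins), 2))  # 0.64
```

Only genes detected on both levels can enter such a comparison, so the smaller proteomics set determines how many gene pairs are available.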

Moreover, dimension reduction by multi-omics factor analysis (MOFA)⁶¹ showed reproducible, integrated, condition-specific proteo-transcriptomic landscapes, while accounting for both the paired nature of the data and non-coding transcripts (Fig. 5a). Its first two dimensions (factors) explained most of the variance in either data set (Supplementary Fig. 4a), correlated well with the most highly differentially expressed genes (Fig. 5b), and revealed three main patterns.

**Fig. 5: Major differential expression patterns and orthogonality with fitness profiles.**

Firstly, natural competence was clearly activated in the CSP, FEVER, and CSFMC conditions, distinguishing them from the others (Fig. 5a, c). For these latter two, this can likely be attributed to the relatively high pH in these media (Supplementary Table 1)⁶². The late competence (ComX-regulated) protein response also appeared to be lagged in these two conditions compared to CSP, suggesting slower competence activation (Fig. 5c). Secondly, genes in the rtg and vp1 loci were strongly upregulated in the defined media (Fig. 5c, Supplementary Fig. 4b). These loci both encode a double cell–cell communication loop, where Rgg regulators control glycine–glycine peptide expression^63,64. It is likely that these quorum-sensing systems were not activated in the complex media due to competition for uptake by the AmiACDEF oligopeptide permease system between the Rgg peptide pheromones and medium-derived peptides^64,65,66. Although upregulated, the rtg and vp1 systems were not important for growth in the defined media, further supported by the amiACDEF importer genes being essential in the complex media, but not in the defined ones (Figs. 1b and 5d). Lastly, genes responsible for the uptake and metabolism of sucrose and amino sugars were indeed associated with the media containing these types of sugars: C+Y/CSP and LMC/NMC, respectively (Fig. 5c, Supplementary Table 1).

Although the sucrose import operon scrAK was upregulated in the corresponding media, our CRISPRi-seq assay indicated it was not essential for growth, which can be explained by the additional presence of glucose (Supplementary Table 1). In contrast, the sucrose catabolism operon scrBR was both upregulated and conditionally essential (Fig. 5d). These results imply again the toxicity of phosphotransferase sugar import without degradation, as for GlcNAc. Indeed, recent work from our laboratory showed this toxicity can be alleviated by simultaneous knockdown of scrA and scrB, inhibiting sucrose import and, with that, likely sugar-phosphate toxicity⁶⁰. These results also imply that, as we showed for GlcNAc, pneumococci import sucrose even in the presence of glucose, at least in sufficient amounts to be toxic in the absence of downstream catabolism.

Strikingly, nagA, nagB, and manLMN were not among the many amino sugar metabolism genes upregulated in NMC and LMC. Instead, these mostly comprised genes involved in uptake (nanP, satABC) and metabolism (nanA, nanB, nanE-1, nanK, nanE-2) of other amino sugars not present in the media (Fig. 5d). This observation of orthogonality between gene essentiality and expression, whether referring to the transcriptome or proteome, was dominant across the whole genome and all tested conditions (Fig. 5d, Supplementary Fig. 4c). We did find exceptions to this trend, e.g., the example of scrB given above, or the conditional upregulation and essentiality of the HrcA-regulated heat-shock protein-encoding operon at 40 °C, the latter of which was not observed on the proteome level presumably due to the short time between heat exposure and sample processing (5 min) (Fig. 5d). fasR however followed the general trend: despite its importance for growth at 40 °C, it was not differentially expressed, suggesting its basal expression levels are sufficient for this survival phenotype (Fig. 5d). Indeed, our results indicated that differentially expressed genes were rarely differentially essential and vice versa (Supplementary Fig. 4c), which fits observations made by others in the same and other organisms^{36,37,38,39,40,41}.
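One simple way to quantify this kind of orthogonality is to measure how much the set of differentially expressed genes overlaps with the set of differentially essential genes. The sketch below uses the Jaccard index for that purpose; it is an illustrative choice and not the analysis behind Supplementary Fig. 4c. The gene memberships follow the qualitative description in the text for the GlcNAc- and sucrose-containing media, with manLMN treated as a single operon label; they are not read from the data tables.

```python
def overlap_summary(diff_expressed: set[str], diff_essential: set[str]) -> dict:
    """Partition two gene sets and report their Jaccard index."""
    both = diff_expressed & diff_essential
    union = diff_expressed | diff_essential
    return {
        "both": sorted(both),
        "expression_only": sorted(diff_expressed - diff_essential),
        "fitness_only": sorted(diff_essential - diff_expressed),
        # Jaccard index: 0 means disjoint sets, 1 means identical sets.
        "jaccard": len(both) / len(union) if union else 0.0,
    }


# Illustrative memberships based on the qualitative description in the text.
diff_expressed = {"nanA", "nanB", "nanP", "scrB"}
diff_essential = {"nagA", "nagB", "manLMN", "scrB"}

result = overlap_summary(diff_expressed, diff_essential)
print(result["both"], round(result["jaccard"], 3))
```

A Jaccard index near zero across conditions would correspond to the pattern described here, in which differentially expressed genes are rarely differentially essential and vice versa.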

Discussion

In this work, we investigated how pneumococci adapt to different environmental settings in terms of genome-wide gene fitness effects and expression, both on the transcript and protein level. This allowed us to not just re-assess known relationships between these regulatory layers in S. pneumoniae, but also to work out specific molecular responses to concrete stimuli.

Global patterns roughly corroborate biology as it is known in other organisms: transcriptomes correlate moderately with proteomes^32,33,34,35, and either appear almost entirely statistically independent from genome-wide fitness impact^{36,37,38,39,40,41}. However, Jensen and colleagues noted this does not necessarily imply biological independence: genes relating to the same general pathways could be either differentially essential or differentially expressed, and as such still coordinated³⁶. We see one such example, where nagA, nagB, and manLMN are required for growth on GlcNAc, but not differentially regulated, whereas other amino sugar metabolism genes are, while they are not essential. The data suggest that basal expression levels mostly suffice for bacterial survival in distinct environments, rendering differential expression of essential genes generally redundant. Conversely, differential expression does not signal a changed need for a functional gene copy per se: for instance, a gene can be upregulated in one growth condition compared to another, but essential in both. Moreover, expression-fitness correlations could be masked by genetic redundancy, which could be addressed by multiplexed knockdown approaches such as dual CRISPRi-seq^60,67. In addition, it is important to acknowledge that the selective pressures that shaped the regulation of certain genes in response to specific cues in the natural habitat might be absent in our artificial laboratory conditions, whereas those cues can still be present, potentially leading to differential expression without a difference in essentiality. Our data indicate that expression and fitness effects provide almost completely mutually complementary information on how bacteria adapt to different environments. 
This in turn makes the case for multi-omics approaches to understand how bacteria deal with their surroundings and stresses, including, for instance, antibiotic pressures, with potentially important implications for the treatment of infections. It also suggests that differential expression assays by themselves are not necessarily the best tool to uncover potential therapeutic targets, as they might not point to essentiality.

An environmental factor that is inherently intertwined with infection and disease is ambient temperature. As S. pneumoniae moves from the nasopharynx (32 °C) to other parts of the body (37 °C), where it can bring about fever (>38 °C), it faces considerable temperature shifts⁶⁸. This is known to affect the cells in a myriad of ways, including membrane properties such as fluidity and permeability. Bacteria are known to counter these effects through homeoviscous adaptation, i.e., by adjusting the relative levels of different fatty acid types in the membrane^26,27. Previous studies indicated that such adaptation by S. pneumoniae in response to temperature changes was independent of the fatty acid biosynthesis master regulator FabT¹⁹. Here, we report that spv_0647 encodes a transcriptional regulator that enables pneumococci to maintain proper saturated:unsaturated fatty acid (SFA:UFA) balance, critical during heat stress, by mediating transcription of fabM, and potentially fakB3 and other genes. As such, we named this gene fasR, for fatty acid saturation regulator. In the absence of fasR, SFA:UFA ratios increase at higher temperatures, which is conventionally assumed to allow survival at higher temperatures. Notwithstanding, we observed a growth defect, which could specifically be rescued by restoring SFA:UFA balance either by complementation or fabM overexpression, presumably increasing UFA biosynthesis substrate levels. Of note, such overexpression did not increase relative UFA levels at 30 °C, implying FabM is normally saturating, in line with previous reports¹⁹. These results lead us to hypothesize that maintaining SFA:UFA balance, rather than increasing this ratio, confers heat resistance in pneumococci, and that fasR and fabM play critical roles in that process. 
Alternatively, fatty acid acyl chain length could also play an important role in maintaining membrane integrity at varying temperatures, which was indeed also affected in the mutant and restored in the complementation and overexpression mutants. Since we did not find putative binding sites for FasR in the promoter regions of differentially expressed genes using motif enrichment analyses with the bioinformatic MEME suite⁶⁹, future research may focus on establishing DNA binding sites and sequences using, for example, EMSA and ChIP-seq. Regulation by FasR is likely modulated by a FasR ligand, and although multiple structurally similar proteins have been shown to sense saturated or unsaturated fatty acids by direct binding (Table 1), this remains to be elucidated for FasR. In addition, it could be insightful to examine lipid head groups, which might also influence membrane properties but are missed by GC-FAME, the lipid analysis technique used here^25,26. Metabolomics techniques could also provide a separate, additional layer of information, as it would be possible to gauge, for instance, the abundance of lipid intermediates, further narrowing down the enzymes potentially regulated by FasR. Moreover, most of the observed differences between the growth conditions tested here were metabolic in nature and could potentially be better understood in the light of metabolomic profiles.

We elaborated on one such instance here, showing that N-acetylglucosamine (GlcNAc) degradation after uptake is essential for pneumococci, and corroborating this is the case for sucrose as well⁶⁰. As both sugars are phosphorylated upon import, we hypothesize these are instances of sugar-phosphate stress⁵¹. Moreover, these toxicities are seen in the presence of glucose, suggesting simultaneous uptake of these sugars. Indeed, we showed a slowdown in glucose uptake in the presence of GlcNAc while the growth rate of WT cells was unaffected. Although this challenges the dogma of CcpA-based glucose preference in S. pneumoniae^10,13, it may not be illogical from an evolutionary perspective, as glycans such as GlcNAc are far more available for scavenging in its natural niche, the human nasopharynx, than glucose is^6,70,71.

Although we provide a broad overview of the large, systems-level data compendium presented here, we have only worked out a few processes that stood out in detail. Many more biological insights may be concealed in these data, and we encourage the community to use them to their advantage. To facilitate such efforts, we have also integrated our genome-wide fitness, transcript, and protein data sets with our recently renewed genome browser PneumoBrowse 2 (https://veeninglab.com/pneumobrowse)²⁹. On top of that, we highlighted some of these avenues to be explored, such as potential sugar-phosphate toxicities or the potential fitness gain upon arginine retention during heat stress. Despite the fact that heat itself has been shown to influence CRISPRi efficiency, we clearly do retrieve the standard core essentialome in our high-temperature growth condition and are therefore confident regarding data quality⁷².

Our findings on GlcNAc metabolism and membrane homeostasis contribute to our knowledge of gene function and the biology behind environmental adaptation. We note that many pneumococcal genes remain of unknown function, and that adaptation is a complex phenotype, brought about on multiple, interacting regulatory levels. S. pneumoniae is notoriously versatile in terms of niche adaptation, as it can occupy many micro-environments in the human body^1,2,3. A deeper understanding of the biology underpinning this capacity, in the pneumococcus as well as other microorganisms, therefore, ultimately also yields a deeper understanding of human health and disease.

Methods

Bacterial strains and growth conditions

Streptococcus pneumoniae D39V serotype 2 and derivatives were routinely grown at 37 °C on Columbia agar plates with 2.5–5% (v/v) defibrinated sheep blood (CBA, Thermo Scientific) at 5% CO₂, or in sealed 5 mL culture tubes with C + Y liquid medium (pH 6.8) without shaking, supplemented with 0.5 μg mL⁻¹ erythromycin, 0.5 μg mL⁻¹ tetracycline, or 50 ng mL⁻¹ anhydrotetracycline when appropriate. Liquid cultures were routinely inoculated at 100× dilutions from pre-cultures cultivated from frozen, isogenic stock cultures (16% glycerol). Strains used in this study are listed in Supplementary Table 2. Other growth conditions and media were prepared as described by Aprianto and colleagues (2018)³¹, with the only adaptation of continuous culturing at 40 °C in the FEVER condition for the CRISPRi-seq assay. Growth assays for sugar preference were performed in a C + Y liquid or agar plate background without added sugars, supplemented with 9.4 mM glucose, 9.4 or 0.94 mM N-acetyl-D-glucosamine (Sigma-Aldrich, A3286), or both sugars as appropriate.

Mutant strain construction

Donor DNA constructs carrying the insert of interest with ~1000 bp flanking regions homologous to the insertion site were produced with a one-pot Golden Gate assembly strategy using Type IIS restriction enzymes BsaI, Esp3I, or SapI (New England Biolabs)⁷³. Restriction sites were introduced via the primers during PCR amplification. The oligonucleotides and restriction enzymes used are listed in Supplementary Table 3. We used the PT5-3 variant of the P_tet promoter as characterized by Sorg and colleagues (2020)⁷⁴.

Subsequent transformation with the donor DNA was performed as described previously⁴⁴. Briefly, pneumococci were cultured at 37 °C to the early exponential phase (OD595 ~ 0.1) followed by the addition of 0.1 μg mL⁻¹ competence-stimulating peptide 1 (CSP-1) and growth for another 12 min to activate competence. 100 μL activated culture was mixed with donor DNA at a concentration of 1 ng μL⁻¹ and further cultured at 30 °C for 20 min. 900 μL fresh C + Y was added, and the culture was cultivated at 37 °C for another 1.5 h for the transformants to recover. The culture was plated and incubated overnight as described above. Colonies were re-streaked on plates and again incubated overnight. Transformant colonies were picked and grown in liquid C + Y until OD595 ~ 0.3, and stocked in 14–20% glycerol. Genotypes of mutant strains were confirmed by Sanger sequencing (Microsynth).

Growth curves

Pre-cultures were grown in C + Y to OD595 ~ 0.1 at 37 °C, diluted 100× in the appropriate medium, and loaded into flat-bottom 96-well plates at 250 μL per well. 50 ng mL⁻¹ anhydrotetracycline or 1 mM IPTG was added to both pre-cultures and dilutions where appropriate. Experiments were always performed in triplicate (three separate pre-cultures), unless specifically stated in the figure. For each replicate, a representative curve was chosen out of three technical replicates (wells within the same plate), on the basis of the most frequent appearance between the other two technical replicates across all time points. Blanks were added to each plate to assess potential contamination. In experiments where glucose levels were also measured, the number of technical replicates equaled the number of glucose measurement time points to allow subsampling for that purpose. Optical density was measured every 10 min at 595 nm in a plate reader (Tecan MPlex, F200 or M200 series). In temperature variation experiments, three identical plates were prepared and measured simultaneously in three different plate readers, each set at a different temperature (30, 37, or 40 °C). Plates were sealed with parafilm in this case to avoid excessive evaporation. Raw OD values were normalized per well to the theoretical start OD of 0.001 by subtraction, and lower values were also set to this theoretical minimum.
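The per-well normalization described above can be sketched as follows. This is a minimal illustration, not the authors' script: it assumes that the value subtracted from each well is chosen so that the first reading equals the theoretical start OD of 0.001, and the function name `normalize_od` is hypothetical.

```python
THEORETICAL_START_OD = 0.001  # theoretical start OD stated in the text


def normalize_od(raw_od, start_od=THEORETICAL_START_OD):
    """Shift one well's OD readings so the first equals start_od, then floor at start_od."""
    if not raw_od:
        return []
    # Assumption: the first reading of the well defines its background offset.
    offset = raw_od[0] - start_od
    # Subtract the offset; anything that ends up below the minimum is set to it.
    return [max(value - offset, start_od) for value in raw_od]


# Toy readings for a single well (hypothetical values).
well = [0.085, 0.084, 0.090, 0.120, 0.300]
normalized = normalize_od(well)
```

With these toy readings, the second time point falls below the first and is therefore set to 0.001, which keeps every curve strictly positive for plotting on a logarithmic axis.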

Glucose oxidation assays

Samples were obtained by pausing the plate reader during growth curve assays at time points indicated in the figures, transferring the contents of technical replicates (200 μL) to microtubes, and resuming OD measurements as fast as possible. Microtubes were immediately spun down on a tabletop mini centrifuge for 3 min to pellet the cells, after which 150 μL supernatant was transferred to a new microtube. The first time point sample (0 h) was taken directly from the growth curve pre-culture and treated in the same way. Samples were snap-frozen using liquid nitrogen and stored at −80 °C until glucose measurements.

Glucose concentrations were measured with a Glucose (GO) Assay Kit (Sigma-Aldrich GAGO20) according to the instructions of the manufacturer, except for the total sample volumes. We scaled all volumes down so that samples were 150 μL instead of 5 mL, allowing for higher-throughput measurements using flat-bottom 96-well plates in a Tecan plate reader, instead of cuvettes.

Proteo-transcriptomics

Wild-type cells were pre-cultured in each respective growth medium to OD600 ~ 0.1 and diluted to OD600 ~ 0.05. CSP cultures were supplemented with 0.1 μg mL⁻¹ CSP-1 for 20 min, and FEVER cultures were transferred to 40 °C for 5 min. Each of three replicates was split into two subsamples, one of which was subjected to RNA-seq and the other to quantitative LC–MS.

Total RNA was extracted and cDNA libraries were constructed as before, without rRNA depletion³¹. Libraries were sequenced on an Illumina NextSeq machine at GeneCore, EMBL Heidelberg. Read quality was checked with FastQC (v0.11.5, https://www.bioinformatics.babraham.ac.uk/projects/fastqc/) before and after trimming off TruSeq3 adapters, leading and trailing bases below a phred score of 3, cutting regions if average phred scores went below 20 in a sliding window of 5 bases, and only keeping reads with a minimum length of 50 bases using Trimmomatic (v0.36)⁷⁵. Reads were aligned to the S. pneumoniae D39V reference genome (CP027540) using STAR (v2.5.3a)⁷⁶ and reverse strand transcripts were counted with featureCounts (Subread v1.5.3)⁷⁷, using multi-mapping, overlap, and fractional count modes, as before³¹. Downstream analyses were done with DESeq2 (v1.34.0)⁷⁸ in R (v4.1.1), where differential expression was tested against an absolute log₂ fold change of 1 at an alpha of 0.05, and counts were normalized for Principal Component Analysis with a blind rlog transformation⁷⁸. Transcripts per million (TPM) were calculated per sample with n genes for each gene i as: \(\mathrm{TPM}_{i} = \frac{c_{i} \cdot 10^{6}}{l_{i} \cdot \sum_{j=1}^{n} \frac{c_{j}}{l_{j}}}\), where c and l represent the raw transcript counts and lengths, respectively.
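As an illustration of the TPM definition above, the following minimal sketch computes TPM values for one sample from raw counts and gene lengths. The function name `tpm` and the toy numbers are hypothetical; the original analysis was done in R.

```python
def tpm(counts, lengths):
    """Transcripts per million for one sample: c_i * 1e6 / (l_i * sum_j(c_j / l_j))."""
    if len(counts) != len(lengths):
        raise ValueError("counts and lengths must have the same number of genes")
    # Length-normalized counts c_j / l_j for every gene j.
    rates = [c / l for c, l in zip(counts, lengths)]
    total = sum(rates)
    if total == 0:
        raise ValueError("all counts are zero; TPM is undefined")
    return [rate * 1e6 / total for rate in rates]


# Toy sample with three genes (hypothetical counts and lengths in bases).
values = tpm(counts=[100, 200, 300], lengths=[1000, 2000, 1000])
```

In this toy sample, the first two genes have the same read density (0.1 reads per base) and thus the same TPM, despite a twofold difference in raw counts.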

Cells in the proteomics subsamples were lysed with a bead beater and further treated for LC–MS at the Proteomics—Mass Spectrometry Service Facility, University of Groningen. Briefly, sample volumes were reduced by freeze drying, protein concentrations were determined with a BCA assay (Thermo, 23252), and samples were reconstituted in a 100 mM ammonium bicarbonate buffer. Alkylation of 100 μg protein was achieved by adding iodoacetamide to a final concentration of 40 mM and incubation for 45 min at room temperature, in the dark. Samples were diluted 2× in 100 mM ammonium bicarbonate, and overnight digestion was performed at 37 °C, 400 rpm with mass spectrometry grade trypsin (Promega, V5280) using a 1:50 trypsin:protein (μg:μg) ratio. The reaction was stopped by adding trifluoroacetic acid to a final concentration of 1%. Pierce® C18 tips (Thermo, 87784) were used for sample cleanup by solid phase extraction according to the manufacturer’s instructions. The eluted fraction was dried under vacuum and reconstituted with 20 μL 2% acetonitrile and 0.1% formic acid (FA). Peptide separation was performed with 2 μL peptide sample using a nano-flow chromatography system (Thermo, EASY nLC II) equipped with a reversed phase HPLC column (75 μm, 15 cm) packed in-house with C18 resin (Dr. Maisch, ReproSil-Pur C18–AQ, 3 μm resin) using a linear gradient from 95% solvent A (0.1% FA, 2% acetonitrile) and 5% solvent B (99.9% acetonitrile, 0.1% FA) to 28% solvent B over 90 min at a flow rate of 200 nL min⁻¹. The total MS time was 120 min. The peptide and peptide fragment masses were determined by an electrospray ionization mass spectrometer (Thermo, LTQ Orbitrap XL).

Peptides were mapped to the S. pneumoniae D39V (CP027540) protein fasta file and quantified as both label-free quantification (LFQ) and intensity-based absolute quantification (iBAQ) values with MaxQuant⁷⁹, using a false discovery rate (FDR) cutoff of 0.01. Downstream analyses were performed with R package DEP (v1.16.0)⁸⁰ using LFQ values as input, where a protein was considered differentially enriched if its absolute log₂-fold change was significantly >1, with an FDR-adjusted P-value below 0.05. We retained only proteins that were detected in at least two out of three replicates per condition, normalized with a variance-stabilizing transformation, and imputed missing values with the MinProb method as implemented in DEP, since values were not missing at random, but biased towards lower intensities (Supplementary Fig. 3). Gene Ontology (GO) term enrichment analysis was carried out using R package clusterProfiler (v4.2.2)⁸¹. For visualization purposes, we normalized for library size bias per sample as proteins per million (PrPM): \(\mathrm{PrPM}_{i} = \frac{\mathrm{iBAQ}_{i} \cdot 10^{6}}{\sum_{j=1}^{n} \mathrm{iBAQ}_{j}}\), akin to TPM as described above.
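The PrPM normalization can be sketched in the same way as TPM. Unlike TPM, it has no explicit length term, because iBAQ values are already normalized by the number of theoretically observable peptides per protein. The function name `prpm` and the toy intensities below are hypothetical.

```python
def prpm(ibaq):
    """Proteins per million for one sample: iBAQ_i * 1e6 / sum_j(iBAQ_j)."""
    total = sum(ibaq)
    if total == 0:
        raise ValueError("all iBAQ values are zero; PrPM is undefined")
    return [value * 1e6 / total for value in ibaq]


# Toy sample with three proteins (hypothetical iBAQ intensities).
values = prpm([2e6, 6e6, 2e6])
```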

We used the blind rlog and variance-stabilized quantifications of transcripts and proteins with non-zero variance across samples as input for Multi-Omics Factor Analysis using the R package MOFA2 (v1.4.0)⁶¹ with default settings.

RNA-seq Δspv_0647

Wild-type (VL1) and Δspv_0647 (VL6297) strains (Supplementary Table 2) were pre-cultured in C + Y medium at 30 and 37 °C until OD595 ~ 0.3. Pre-cultures were diluted to OD595 ~ 0.01 in quadruplicates and grown at 30 °C and 37 °C until OD595 ~ 0.3–0.4. Cells were pelleted by centrifugation (4 °C, 10,000×g, 5 min), supernatants removed, and pellets were stored at −80 °C after snap-freezing with liquid nitrogen.

Total RNA was isolated with a High Pure RNA Isolation Kit (Roche, 11828665001) as before, with minor adaptations³¹. Briefly, the pellets were resuspended in 400 μL Tris–EDTA buffer and transferred to tubes with 50 μL SDS 10%, 500 μL phenol–CHCl₃, and glass beads. Cells were lysed with a bead beater (3× 45 s with 45 s breaks) and pelleted by centrifugation (4 °C, 21,000×g, 15 min). 300 μL of the aqueous phase was mixed into 400 μL lysis/binding buffer. The samples were loaded onto columns and centrifuged (8000×g, 30 s). 100 μL DNase mix (90 μL buffer, 10 μL DNase I) was loaded onto the column filter and incubated for 1 h at room temperature. Samples were washed once with wash buffer I (500 μL) and twice with wash buffer II (first time 500 μL, second time 200 μL) by centrifugation (8000×g, 30 s). Samples were eluted in 50 μL elution buffer and incubated for 10 min at room temperature. Sample quality was checked by NanoDrop and Fragment Analyzer (Agilent Technologies), and samples were stored at −80 °C. Samples had RNA Quality Numbers (RQN) between 5.7 and 7.8.

Per strain-temperature combination, the three samples with minimal potential gDNA contamination and the highest RQN were selected for cDNA library preparation and sequencing at the Genomic Technologies Facility, University of Lausanne. Briefly, RNA-seq libraries were prepared from 100 ng of total RNA with the Illumina Stranded mRNA Prep reagents (Illumina) using a unique dual indexing strategy and following the official protocols. The polyA selection step was replaced by an rRNA depletion step with RiboCop for Bacteria, mixed bacterial samples, and reagents (Lexogen). Libraries were quantified by a fluorometric method (Qubit, Life Technologies), and their quality was assessed on a Fragment Analyzer (Agilent Technologies). Sequencing was performed on an Illumina NovaSeq 6000 for 100 cycles, single read. Sequencing data were demultiplexed using the bcl2fastq2 Conversion Software (v2.20, Illumina).

Read quality was checked with FastQC (v0.11.9) and MultiQC (v1.15)⁸² before and after trimming off leading and trailing bases below a phred score of 3, cutting regions if average phred scores went below 20 in a sliding window of 5 bases, and only keeping reads with a minimum length of 50 bases using Trimmomatic (v0.36)⁷⁵. Reads were aligned to the S. pneumoniae D39V reference genome (CP027540) using bowtie2 (v2.4.5)⁸³, using soft-clipping (with the “--local” option) to account for any remaining partial adapters.

Transcript counts were extracted with featureCounts (v2.0.6)⁷⁷ and downstream analyses were performed with DESeq2⁷⁸ in R, as described for the proteo-transcriptome experiment.

CRISPRi-seq

An IPTG-inducible genome-wide CRISPRi library was used as described before^43,44. Briefly, the library was pre-cultured from frozen stock (16% glycerol in the respective media, OD595 = 0.1) by 100× dilution in the respective growth conditions (Supplementary Table 1) to OD595 ~ 0.1, followed by 100× dilution supplemented with 1 mM IPTG for CRISPRi induction and CSP-1 (0.1 μg mL⁻¹) when appropriate, to yield quadruplicates for each growth condition with and without IPTG. Samples were grown to OD595 ~ 0.1 once (LMC, NMC, CSFMC, FEVER; 4–5 mL culture in 5 mL culture tubes) or twice (C + Y, CSP, THY, BMC; 10 mL in 50 mL conical tubes) by 100× back-dilution, corresponding to 7–14 generations of exponential, sample-wide growth (Supplementary Data 1). Pellets were harvested on ice, centrifuging once (7 generations) or twice (14 generations) (4 °C, 15 min at 4000×g and 5 min at 12,000×g, respectively), discarding supernatant and resuspending in PBS. gDNA isolation, library preparation, and sequencing were done as described in our published protocols⁴⁴. CRISPRi-induced NMC samples 54 and 55 got mixed up during gDNA isolation and were henceforth regarded as technical replicates.
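The relationship between the 100× back-dilutions and the stated generation numbers can be checked with a short calculation. This sketch assumes that regrowth to the same optical density after a 100× dilution corresponds to a 100-fold expansion of the population; the function name is hypothetical.

```python
import math


def generations_per_round(dilution_factor):
    """Doublings needed to regrow to the starting density after one dilution."""
    return math.log2(dilution_factor)


exact = generations_per_round(100)  # about 6.64 doublings per 100x dilution
nominal_once = math.ceil(exact)     # rounded up to a whole generation
nominal_twice = 2 * nominal_once    # two successive rounds of regrowth
```

One round of 100× dilution and regrowth thus corresponds to roughly 6.6 doublings, consistent with the nominal 7 generations quoted above, and two rounds with the nominal 14.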

Samples were split over two sequencing runs on an Illumina MiniSeq system with our published custom sequencing protocol⁴⁴. sgRNA counts were extracted with 2FAST2Q (v2.5.2)⁸⁴ using default settings (minimal phred score 30, one mismatch allowed). Differential fitness analyses were performed with DESeq2 (v1.34.0)⁷⁸ in R (v4.1.1), testing against a minimal (difference in) absolute log₂-fold change of 1 at an alpha of 0.05 for statistical significance. We collapsed the induced NMC technical replicates into one biological replicate with the DESeq2 function “collapseReplicates()”.

Given growth at a rate of 2ⁿ, fitness quantifications on a log₂FC should scale linearly across conditions. We observed this was not the case between conditions grown for a different sample-wide number of generations, implying a strong library composition bias and rendering DESeq2-based interaction effects uninformative. We corrected this bias by estimating generation numbers per CRISPRi strain in each induced sample, linearizing them over all samples with a LOESS transformation, and recomputing corresponding sgRNA counts. These were rounded and used as DESeq2 input. Sample-wide CRISPRi induction generation numbers (0, 7, or 14) were scaled and centered with the built-in R function scale(), and together with growth condition and their interaction term were used as explanatory variables in the DESeq2 design formula. So, default DESeq2 methods were applied after we normalized for the non-linear generation effect.

Specifically, for each of k sgRNAs in a sample, the raw count c was first normalized to c* to correct for library size bias: \(c_{t1}^{*} = \log_{2}\left(1 + \frac{10^{6} \cdot c}{\sum_{i=1}^{k} c_{i}}\right)\), where t1 indicates this concerns counts after growth (as measured). We used uninduced sample counts as a proxy for the relative starting distribution of CRISPRi strains per sample prior to induction, i.e., at t0. Given bacterial growth occurs at a rate of \(2^{m}\), so that in general \(c_{t1} = c_{t0} \cdot 2^{m}\), the relative counts per strain at t0 are here: \(c_{t0} = \frac{c_{t1}^{*}}{2^{m}}\), where in our case m is 7 or 14. To obtain a more robust estimate, we averaged \(c_{t0}\) per strain across each set of four uninduced replicate samples, yielding \(\bar{c}_{t0}\). This represents an estimate of the relative starting counts per CRISPRi strain, or sgRNA, for all samples within a given growth condition. Following the same standard growth equation as above, the relative counts after growth should then equal \(c_{t1}^{*} = \bar{c}_{t0} \cdot 2^{n}\), and so we estimated the strain-wise generation numbers in each induced sample as: \(n = \log_{2}\left(\frac{0.5 + c_{t1}^{*}}{\bar{c}_{t0}}\right)\), where a pseudocount of 0.5 was added to avoid zeroes. Together, this yields the generation number matrix N, where rows represent CRISPRi strains, and columns represent induced samples of all growth conditions. Generation number estimates were then linearized with the cyclicloess method of the normalizeBetweenArrays() function from the limma R package (v3.50.1)⁸⁵, with N as input and otherwise default parameters.
The resulting matrix columns represent the corrected generation number estimates n* for each strain per induced sample, which we then used to estimate relative strain counts after growth per induced sample following the same standard growth dynamics described above: \(\hat{c}_{t1} = \hat{c}_{t0} \cdot 2^{n^{*}}\), where \(\hat{c}_{t0} = \frac{c}{2^{m}}\). Lastly, we scaled these corrected counts to their original library size as: \(\hat{c}_{t1}^{*} = \hat{c}_{t1} \cdot \frac{\sum_{i=1}^{k} c_{i}}{\sum_{i=1}^{k} \hat{c}_{t1,i}}\). These normalized sgRNA counts for the induced samples were subsequently used as DESeq2 input for downstream analyses, together with the original sgRNA counts of the uninduced samples.
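The individual steps of this correction can be sketched as below. This is a simplified illustration under stated assumptions, not the authors' implementation: the cyclic LOESS linearization (performed with limma in R) is not reproduced, so the raw generation estimates are passed on unchanged as a placeholder for n*. All function names and toy counts are hypothetical, and only two uninduced replicates are shown instead of four.

```python
import math


def log_cpm(counts):
    """c*_t1 = log2(1 + 1e6 * c / sum(c)) for every sgRNA in one sample."""
    total = sum(counts)
    return [math.log2(1 + 1e6 * c / total) for c in counts]


def mean_start_counts(uninduced_samples, m):
    """Average c_t0 = c*_t1 / 2**m per sgRNA across uninduced replicates."""
    per_sample = [[v / 2 ** m for v in log_cpm(s)] for s in uninduced_samples]
    return [sum(vals) / len(vals) for vals in zip(*per_sample)]


def generation_numbers(induced_counts, start_means):
    """n = log2((0.5 + c*_t1) / mean c_t0); requires non-zero start_means."""
    return [math.log2((0.5 + c) / s)
            for c, s in zip(log_cpm(induced_counts), start_means)]


def corrected_counts(induced_counts, n_star, m):
    """Recompute counts from n* and rescale to the original library size."""
    grown = [(c / 2 ** m) * 2 ** n for c, n in zip(induced_counts, n_star)]
    scale = sum(induced_counts) / sum(grown)
    return [round(g * scale) for g in grown]  # rounded for use as count input


# Toy data: four sgRNAs, two uninduced replicates, one induced sample, m = 7.
uninduced = [[100, 200, 300, 400], [120, 180, 310, 390]]
induced = [10, 220, 330, 440]
start = mean_start_counts(uninduced, m=7)
n = generation_numbers(induced, start)
n_star = n  # placeholder: the cyclic LOESS step is NOT applied in this sketch
corrected = corrected_counts(induced, n_star, m=7)
```

When every corrected generation number equals m, the procedure returns the original counts, which serves as a quick sanity check that the back-calculation and rescaling steps are mutually consistent.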

Foldseek

Foldseek⁵⁹ was accessed through the online submission portal (https://search.foldseek.com) and run with UniProt accession number A0A0H2ZQ31, for spv_0647. Only matches with crystallized protein structures from the PDB100 database were considered for this work.

Dual CRISPRi-seq analysis

Fitness scores (log₂FC values) for every unique pair of 869 sgRNAs were obtained from Dénéréaz and colleagues⁶⁰. Scores of sgRNAs in combination with themselves (twice the same sgRNA in the same CRISPRi strain) were used as baseline fitness estimates of the corresponding sgRNA, as in the original study.

GC-FAME

Strains were pre-cultured at 30, 37, or 40 °C with or without 50 ng mL⁻¹ anhydrotetracycline and 1 mM IPTG, as appropriate, to an OD595 ~ 0.3 and concentrated 10× by centrifugation (3 min, 8000×g). Quadruplicate samples were grown from these pre-cultures in the same growth conditions, as appropriate, in volumes of 45 mL per sample to OD595 ~ 0.2–0.3. Samples were kept at their respective growth temperatures throughout the experiment. Cells were pelleted by centrifugation (15 min, 1968×g), resuspended in ~1 mL of the remaining medium after discarding supernatant, transferred to 2 mL screw-cap tubes, and centrifuged again (5 min, 20,238×g). Cells were washed by supernatant removal, resuspension in 1 mL PBS, and centrifugation (5 min, 20,238×g). Supernatant was removed again, and pellets were stored at −80 °C following snap-freezing with liquid nitrogen.

For derivatization of fatty acid methyl esters, frozen cell pellets were resuspended and then vortexed in 100 μL concentrated sulfuric acid (96%) and 200 μL methanol. Samples were then incubated in boiling water for 5 min and allowed to cool to room temperature. 300 μL of dichloromethane was added and the samples were then vortexed and centrifuged for 1 min at 15,000×g. The organic (bottom) layer was transferred to a new tube containing a pinch of Na₂SO₄ (to remove any remaining water), mixed by vortexing, and centrifuged for 1 min at 15,000×g. The supernatant was then transferred to an HPLC tube and stored at 4 °C until being loaded on the GC–MS. 1 μL of sample was injected into an Agilent 7890 Gas Chromatograph equipped with an Agilent G3903-63011 column, an Agilent 5977A Mass Detector, and an Agilent 7693 Autoinjector. The carrier gas was helium, and the column oven temperature program was the following: 150 °C for 0.5 min; ramp temperature 25 °C/min to 230 °C, then hold for 1 min; ramp temperature 5 °C/min to 245 °C, then hold for 1 min. Spectra from 50 to 500 m/z were collected after a 4 min solvent delay. The ion source temperature was 230 °C.
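The total duration of the oven temperature program follows directly from the stated holds and ramp rates. The sketch below assumes no additional equilibration or post-run time; the function name and argument layout are hypothetical.

```python
def oven_program_minutes(start_temp, initial_hold, ramps):
    """Total run time for an initial hold followed by (rate, target, hold) ramps.

    rate is in degrees C per minute, target in degrees C, hold in minutes.
    """
    total = initial_hold
    temp = start_temp
    for rate, target, hold in ramps:
        total += abs(target - temp) / rate + hold  # ramp time plus hold time
        temp = target
    return total


# Program from the text: 150 C for 0.5 min; 25 C/min to 230 C, hold 1 min;
# 5 C/min to 245 C, hold 1 min.
run_time = oven_program_minutes(150, 0.5, [(25, 230, 1), (5, 245, 1)])
solvent_delay = 4.0
acquisition_time = run_time - solvent_delay
```

Under these assumptions, the program takes 0.5 + 3.2 + 1 + 3 + 1 = 8.7 min, leaving 4.7 min of spectral acquisition after the 4 min solvent delay.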

Raw data were exported as CDF files, which were filtered to remove empty scans and converted to .mzML files using MZmine 4 (mzio.io)⁸⁶. Spectral alignment, deconvolution, and relative abundance analyses were performed using MS-Hub as part of the Global Natural Products Social Molecular Networking (GNPS) GC–MS EI Data Analysis pipeline⁸⁷. Peaks representing methylated fatty acids were identified based on comparison of spectra and retention times to those obtained using the TraceCERT 37-component fatty acid methyl ester standard (Sigma Aldrich) with the same instrumentation protocol. For each sample, peak intensities were summed per unique fatty acid. These values were standardized to sum to 100 per sample, and analyzed with the R package compositions (v2.0.8) to account for the particular biases and inherent complexities of compositional data⁸⁸. Specifically, we used the acomp() function to transform the data for principal component analysis, and the ilr transformation in combination with a compositional (mlm) ANOVA for hypothesis testing.
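To illustrate why compositional data call for a dedicated transformation, the sketch below closes a vector of summed peak intensities to 100 and maps it to log-ratio coordinates. It is not the compositions package implementation: the isometric log-ratio (ilr) coordinates are computed here with pivot coordinates, which is one valid orthonormal basis and may differ from the basis used by the R package. Zero intensities are not handled, since their logarithm is undefined. All function names and intensities are hypothetical.

```python
import math


def closure(values, total=100.0):
    """Rescale positive values so that they sum to `total`."""
    s = sum(values)
    return [v * total / s for v in values]


def clr(composition):
    """Centered log-ratio: log of each part minus the mean log of all parts."""
    logs = [math.log(v) for v in composition]
    mean_log = sum(logs) / len(logs)
    return [lv - mean_log for lv in logs]


def ilr_pivot(composition):
    """Isometric log-ratio coordinates (pivot basis); D parts give D - 1 values."""
    logs = [math.log(v) for v in composition]
    coords = []
    for i in range(len(logs) - 1):
        rest = logs[i + 1:]
        k = len(rest)
        # Contrast of part i against the geometric mean of the remaining parts.
        coords.append(math.sqrt(k / (k + 1)) * (logs[i] - sum(rest) / k))
    return coords


# Toy sample: summed peak intensities for four fatty acids.
peaks = [120.0, 60.0, 15.0, 5.0]
composition = closure(peaks)          # parts now sum to 100
coordinates = ilr_pivot(composition)  # three unconstrained coordinates
```

The transformed coordinates are free of the sum-to-100 constraint, so standard multivariate methods such as ANOVA can be applied to them.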

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

For the proteo-transcriptomic profiling, raw RNA-seq data are available on SRA, accession number PRJNA527271, and LC–MS data on the UCSD MassIVE repository, accession number MSV000097932. Raw CRISPRi-seq data are available on SRA, accession number PRJNA1262882. RNA-seq data for the mutant experiment can be found on SRA, accession number PRJNA1262992, and GC-FAME data on the UCSD MassIVE repository, accession number MSV000097619. Source data are provided with this paper.

References

Narciso, A. R., Dookie, R., Nannapaneni, P., Normark, S. & Henriques-Normark, B. Streptococcus pneumoniae epidemiology, pathogenesis and control. Nat. Rev. Microbiol. https://doi.org/10.1038/s41579-024-01116-z (2024).
Weiser, J. N., Ferreira, D. M. & Paton, J. C. Streptococcus pneumoniae: transmission, colonization and invasion. Nat. Rev. Microbiol. 16, 355–367 (2018).
Henriques-Normark, B. & Tuomanen, E. I. The pneumococcus: epidemiology, microbiology, and pathogenesis. Cold Spring Harb. Perspect. Med. 3, a010215 (2013).
Troeger, C. et al. Estimates of the global, regional, and national morbidity, mortality, and aetiologies of lower respiratory infections in 195 countries, 1990–2016: a systematic analysis for the Global Burden of Disease Study 2016. Lancet Infect. Dis. 18, 1191–1210 (2018).
Ikuta, K. S. et al. Global mortality associated with 33 bacterial pathogens in 2019: a systematic analysis for the Global Burden of Disease Study 2019. Lancet 400, 2221–2248 (2022).
Minhas, V., Paton, J. C. & Trappetti, C. Sickly sweet—how sugar utilization impacts pneumococcal disease progression. Trends Microbiol. 29, 768–771 (2021).
Bidossi, A. et al. A functional genomics approach to establish the complement of carbohydrate transporters in Streptococcus pneumoniae. PLoS ONE 7, e33320 (2012).
Deutscher, J., Francke, C. & Postma, P. W. How phosphotransferase system-related protein phosphorylation regulates carbohydrate metabolism in bacteria. Microbiol. Mol. Biol. Rev. 70, 939–1031 (2006).
Görke, B. & Stülke, J. Carbon catabolite repression in bacteria: many ways to make the most out of nutrients. Nat. Rev. Microbiol. 6, 613–624 (2008).
Paixão, L. et al. Transcriptional and metabolic effects of glucose on Streptococcus pneumoniae sugar metabolism. Front. Microbiol. 6, 1041 (2015).
Paixão, L. et al. Host glycan sugar-specific pathways in Streptococcus pneumonia: galactose as a key sugar in colonisation and infection. PLoS ONE 10, e0121042 (2015).
Fleming, E. & Camilli, A. ManLMN is a glucose transporter and central metabolic regulator in Streptococcus pneumoniae. Mol. Microbiol. 102, 467–487 (2016).
Carvalho, S. M., Kloosterman, T. G., Kuipers, O. P. & Neves, A. R. CcpA ensures optimal metabolic fitness of Streptococcus pneumoniae. PLoS ONE 6, e26707 (2011).
Woo, J. K. K., Zimnicka, A. M., Federle, M. J. & Freitag, N. E. Novel motif associated with carbon catabolite repression in two major Gram-positive pathogen virulence regulatory proteins. Microbiol. Spectr. 12, e00485–24 (2024).
Giammarinaro, P. & Paton, J. C. Role of RegM, a homologue of the catabolite repressor protein CcpA, in the virulence of Streptococcus pneumoniae. Infect. Immun. 70, 5454–5461 (2002).
Iyer, R., Baliga, N. S. & Camilli, A. Catabolite control protein A (CcpA) contributes to virulence and regulation of sugar metabolism in Streptococcus pneumoniae. J. Bacteriol. 187, 8340–8349 (2005).
Im, H. et al. Anatomical site-specific carbohydrate availability impacts Streptococcus pneumoniae virulence and fitness during colonization and disease. Infect. Immun. 90, e00451–21 (2022).
Lambert, C., Poyart, C., Gruss, A. & Fouet, A. FabT, a bacterial transcriptional repressor that limits futile fatty acid biosynthesis. Microbiol. Mol. Biol. Rev. 86, e00029–22 (2022).
Lu, Y.-J. & Rock, C. O. Transcriptional regulation of fatty acid biosynthesis in Streptococcus pneumoniae: regulation of fatty acid composition in S. pneumoniae. Mol. Microbiol. 59, 551–566 (2006).
Marrakchi, H., Choi, K.-H. & Rock, C. O. A new mechanism for anaerobic unsaturated fatty acid formation in Streptococcus pneumoniae. J. Biol. Chem. 277, 44809–44816 (2002).
Gullett, J. M., Cuypers, M. G., Frank, M. W., White, S. W. & Rock, C. O. A fatty acid-binding protein of Streptococcus pneumoniae facilitates the acquisition of host polyunsaturated fatty acids. J. Biol. Chem. 294, 16416–16428 (2019).
Parsons, J. B. et al. Identification of a two-component fatty acid kinase responsible for host fatty acid incorporation by Staphylococcus aureus. Proc. Natl. Acad. Sci. USA 111, 10532–10537 (2014).
Waters, J. K. & Eijkelkamp, B. A. Bacterial acquisition of host fatty acids has far-reaching implications on virulence. Microbiol. Mol. Biol. Rev. 88, e00126–24 (2024).
Parsons, J. B., Frank, M. W., Subramanian, C., Saenkham, P. & Rock, C. O. Metabolic basis for the differential susceptibility of Gram-positive pathogens to fatty acid synthesis inhibitors. Proc. Natl. Acad. Sci. USA 108, 15378–15383 (2011).
Zhang, Y.-M. & Rock, C. O. Membrane lipid homeostasis in bacteria. Nat. Rev. Microbiol. 6, 222–233 (2008).
Renne, M. F. & Ernst, R. Membrane homeostasis beyond fluidity: control of membrane compressibility. Trends Biochem. Sci. S0968000423002074 https://doi.org/10.1016/j.tibs.2023.08.004 (2023).
Sinensky, M. Homeoviscous adaptation—a homeostatic process that regulates the viscosity of membrane lipids in Escherichia coli. Proc. Natl. Acad. Sci. USA 71, 522–525 (1974).
Slager, J., Aprianto, R. & Veening, J.-W. Deep genome annotation of the opportunistic human pathogen Streptococcus pneumoniae D39. Nucleic Acids Res. 46, 9971–9989 (2018).
Janssen, A. B. et al. PneumoBrowse 2: an integrated visual platform for curated genome annotation and multiomics data analysis of Streptococcus pneumoniae. Nucleic Acids Res. 53, D839–D851 (2025).
Hiller, N. L. & Sá-Leão, R. Puzzling over the pneumococcal pangenome. Front. Microbiol. 9, 2580 (2018).
Aprianto, R., Slager, J., Holsappel, S. & Veening, J.-W. High-resolution analysis of the pneumococcal transcriptome under a wide range of infection-relevant conditions. Nucleic Acids Res. 46, 9990–10006 (2018).
Buccitelli, C. & Selbach, M. mRNAs, proteins and the emerging principles of gene expression control. Nat. Rev. Genet. 21, 630–644 (2020).
Liu, Y., Beyer, A. & Aebersold, R. On the dependency of cellular protein levels on mRNA abundance. Cell 165, 535–550 (2016).
Lu, P., Vogel, C., Wang, R., Yao, X. & Marcotte, E. M. Absolute protein expression profiling estimates the relative contributions of transcriptional and translational regulation. Nat. Biotechnol. 25, 117–124 (2007).
Vogel, C. & Marcotte, E. M. Insights into the regulation of protein abundance from proteomic and transcriptomic analyses. Nat. Rev. Genet. 13, 227–232 (2012).
Jensen, P. A., Zhu, Z. & Van Opijnen, T. Antibiotics disrupt coordination between transcriptional and phenotypic stress responses in pathogenic bacteria. Cell Rep. 20, 1705–1716 (2017).
Powell, J. E., Leonard, S. P., Kwong, W. K., Engel, P. & Moran, N. A. Genome-wide screen identifies host colonization determinants in a bacterial gut symbiont. Proc. Natl. Acad. Sci. USA 113, 13887–13892 (2016).
Turner, K. H., Everett, J., Trivedi, U., Rumbaugh, K. P. & Whiteley, M. Requirements for Pseudomonas aeruginosa acute burn and chronic surgical wound infection. PLoS Genet. 10, e1004518 (2014).
Deutschbauer, A. et al. Evidence-based annotation of gene function in Shewanella oneidensis MR-1 using genome-wide fitness profiling across 121 conditions. PLoS Genet. 7, e1002385 (2011).
Giaever, G. et al. Functional profiling of the Saccharomyces cerevisiae genome. Nature 418, 387–391 (2002).
Price, M. N. et al. Indirect and suboptimal control of gene expression is widespread in bacteria. Mol. Syst. Biol. 9, 660 (2013).
Van Opijnen, T. & Camilli, A. A fine scale phenotype–genotype virulence map of a bacterial pathogen. Genome Res. 22, 2541–2551 (2012).
Liu, X. et al. Exploration of bacterial bottlenecks and Streptococcus pneumoniae pathogenesis by CRISPRi-Seq. Cell Host Microbe 29, 107–120.e6 (2021).
De Bakker, V., Liu, X., Bravo, A. M. & Veening, J.-W. CRISPRi-seq for genome-wide fitness quantification in bacteria. Nat. Protoc. 17, 252–281 (2022).
Slager, J., Aprianto, R. & Veening, J.-W. Refining the pneumococcal competence regulon by RNA sequencing. J. Bacteriol. 201, e00780–18 (2019).
Peterson, S. N. et al. Identification of competence pheromone responsive genes in Streptococcus pneumoniae by use of DNA microarrays: pneumococcal competence and gene expression. Mol. Microbiol. 51, 1051–1070 (2004).
Dagkessamanskaia, A. et al. Interconnection of competence, stress and CiaR regulons in Streptococcus pneumoniae: competence triggers stationary phase autolysis of ciaR mutant cells: competence, stress and autolysis in S. pneumoniae. Mol. Microbiol. 51, 1071–1086 (2004).
Härtel, T. et al. Characterization of central carbon metabolism of Streptococcus pneumoniae by isotopologue profiling. J. Biol. Chem. 287, 4260–4274 (2012).
Afzal, M., Shafeeq, S., Manzoor, I., Henriques-Normark, B. & Kuipers, O. P. N-acetylglucosamine-mediated expression of nagA and nagB in Streptococcus pneumoniae. Front. Cell. Infect. Microbiol. 6, 158 (2016).
Moye, Z. D., Burne, R. A. & Zeng, L. Uptake and metabolism of N-acetylglucosamine and glucosamine by Streptococcus mutans. Appl. Environ. Microbiol. 80, 5053–5067 (2014).
Boulanger, E. F., Sabag-Daigle, A., Thirugnanasambantham, P., Gopalan, V. & Ahmer, B. M. M. Sugar-phosphate toxicities. Microbiol. Mol. Biol. Rev. 85, e00123–21 (2021).
Kloosterman, T. G. & Kuipers, O. P. Regulation of arginine acquisition and virulence gene expression in the human pathogen Streptococcus pneumoniae by transcription regulators ArgR1 and AhrC. J. Biol. Chem. 286, 44594–44605 (2011).
Nanduri, B. & Swiatlo, E. The expansive effects of polyamines on the metabolism and virulence of Streptococcus pneumoniae. Pneumonia 13, 4 (2021).
Roncarati, D. & Scarlato, V. Regulation of heat-shock genes in bacteria: from signal sensing to gene expression output. FEMS Microbiol. Rev. 41, 549–574 (2017).
Fozo, E. M. & Quivey, R. G. The fabM gene product of Streptococcus mutans is responsible for the synthesis of monounsaturated fatty acids and is necessary for survival at low pH. J. Bacteriol. 186, 4152–4158 (2004).
Eijkelkamp, B. A. et al. Arachidonic acid stress impacts pneumococcal fatty acid homeostasis. Front. Microbiol. 9, 813 (2018).
Reithuber, E. et al. The bactericidal fatty acid mimetic 2CCA-1 selectively targets pneumococcal extracellular polyunsaturated fatty acid metabolism. mBio 11, e03027–20 (2020).
Shi, Y. et al. Structure and mechanism for streptococcal fatty acid kinase (Fak) system dedicated to host fatty acid scavenging. Sci. Adv. 8, eabq3944 (2022).
Van Kempen, M. et al. Fast and accurate protein structure search with Foldseek. Nat. Biotechnol. 42, 243–246 (2024).
Dénéréaz, J. et al. Dual CRISPRi-seq for genome-wide genetic interaction studies identifies key genes involved in the pneumococcal cell cycle. Cell Syst. 101408 https://doi.org/10.1016/j.cels.2025.101408 (2025).
Argelaguet, R. et al. Multi-Omics Factor Analysis—a framework for unsupervised integration of multi-omics data sets. Mol. Syst. Biol. 14, e8124 (2018).
Moreno-Gámez, S. et al. Quorum sensing integrates environmental cues, cell density and cell history to control bacterial competence. Nat. Commun. 8, 854 (2017).
Cuevas, R. A. et al. A novel streptococcal cell–cell communication peptide promotes pneumococcal virulence and biofilm formation. Mol. Microbiol. 105, 554–571 (2017).
Wang, C. Y., Medlin, J. S., Nguyen, D. R., Disbennett, W. M. & Dawid, S. Molecular determinants of substrate selectivity of a pneumococcal Rgg-regulated peptidase-containing ABC transporter. mBio 11, e02502–e02519 (2020).
Fleuchot, B. et al. Rgg proteins associated with internalized small hydrophobic peptides: a new quorum-sensing mechanism in streptococci. Mol. Microbiol. 80, 1102–1119 (2011).
Aggarwal, S. D., Yesilkaya, H., Dawid, S. & Hiller, N. L. The pneumococcal social network. PLoS Pathog. 16, e1008931 (2020).
Koo, B.-M. et al. Comprehensive genetic interaction analysis of the Bacillus subtilis envelope using double-CRISPRi. Cell Syst. 101406 https://doi.org/10.1016/j.cels.2025.101406 (2025).
Chung, S.-K. & Na, Y. Dynamic characteristics of heat capacity of the human nasal cavity during a respiratory cycle. Respir. Physiol. Neurobiol. 290, 103674 (2021).
Bailey, T. L., Johnson, J., Grant, C. E. & Noble, W. S. The MEME suite. Nucleic Acids Res. 43, W39–W49 (2015).
Philips, B. J., Meguer, J.-X., Redman, J. & Baker, E. H. Factors determining the appearance of glucose in upper and lower respiratory tract secretions. Intensiv. Care Med. 29, 2204–2210 (2003).
Shelburne, S. A., Davenport, M. T., Keith, D. B. & Musser, J. M. The role of complex carbohydrate catabolism in the pathogenesis of invasive streptococci. Trends Microbiol. 16, 318–325 (2008).
Vigouroux, A., Oldewurtel, E., Cui, L., Bikard, D. & Van Teeffelen, S. Tuning dCas9’s ability to block transcription enables robust, noiseless knockdown of bacterial genes. Mol. Syst. Biol. 14, e7899 (2018).
Engler, C., Gruetzner, R., Kandzia, R. & Marillonnet, S. Golden gate shuffling: a one-pot DNA shuffling method based on Type IIs restriction enzymes. PLoS ONE 4, e5553 (2009).
Sorg, R. A., Gallay, C., Van Maele, L., Sirard, J.-C. & Veening, J.-W. Synthetic gene-regulatory networks in the opportunistic human pathogen Streptococcus pneumoniae. Proc. Natl. Acad. Sci. USA 117, 27608–27619 (2020).
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).
Liao, Y., Smyth, G. K. & Shi, W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics 30, 923–930 (2014).
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 550 (2014).
Cox, J. & Mann, M. MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification. Nat. Biotechnol. 26, 1367–1372 (2008).
Zhang, X. et al. Proteome-wide identification of ubiquitin interactions using UbIA-MS. Nat. Protoc. 13, 530–550 (2018).
Yu, G., Wang, L.-G., Han, Y. & He, Q.-Y. clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS J. Integr. Biol. 16, 284–287 (2012).
Ewels, P., Magnusson, M., Lundin, S. & Käller, M. MultiQC: summarize analysis results for multiple tools and samples in a single report. Bioinformatics 32, 3047–3048 (2016).
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012).
Bravo, A. M., Typas, A. & Veening, J.-W. 2FAST2Q: a general-purpose sequence search and counting program for FASTQ files. PeerJ 10, e14041 (2022).
Ritchie, M. E. et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 43, e47 (2015).
Heuckeroth, S. et al. Reproducible mass spectrometry data processing and compound annotation in MZmine 3. Nat. Protoc. 19, 2597–2641 (2024).
Aksenov, A. A. et al. Auto-deconvolution and molecular networking of gas chromatography–mass spectrometry data. Nat. Biotechnol. 39, 169–173 (2021).
Van Den Boogaart, K. G. & Tolosana-Delgado, R. Analyzing Compositional Data with R (Springer, 2013).
Frénois, F., Engohang-Ndong, J., Locht, C., Baulard, A. R. & Villeret, V. Structure of EthR in a ligand bound conformation reveals therapeutic perspectives against tuberculosis. Mol. Cell 16, 301–307 (2004).
Christen, S. et al. Regulation of the Dha operon of Lactococcus lactis. J. Biol. Chem. 281, 23129–23137 (2006).
Lara, J. et al. Mycobacterium tuberculosis FasR senses long fatty acyl-CoA through a tunnel and a hydrophobic transmission spine. Nat. Commun. 11, 3703 (2020).
Zhang, Y., Zhu, K., Frank, M. W. & Rock, C. O. A Pseudomonas aeruginosa transcription factor that senses fatty acid structure. Mol. Microbiol. 66, 622–632 (2007).
Beggs, G. A. et al. Structural, biochemical, and in vivo characterization of MtrR-mediated resistance to innate antimicrobials by the human pathogen Neisseria gonorrhoeae. J. Bacteriol. 201, e00401–e00419 (2019).
Delmar, J. A. et al. Structural basis for the regulation of the MmpL transporters of Mycobacterium tuberculosis. J. Biol. Chem. 290, 28559–28574 (2015).
Fujihashi, M. et al. Structural characterization of a ligand-bound form of Bacillus subtilis FadR involved in the regulation of fatty acid degradation. Proteins Struct. Funct. Bioinform. 82, 1301–1310 (2014).
Yu, W.-L. et al. Structural insights into the substrate specificity of a 6-phospho-β-glucosidase BglA-2 from Streptococcus pneumoniae TIGR4. J. Biol. Chem. 288, 14949–14958 (2013).
Nieto, C., Espinosa, M. & Puyet, A. The Maltose/Maltodextrin regulon of Streptococcus pneumoniae. J. Biol. Chem. 272, 30860–30865 (1997).
De Saizieu, A. et al. Microarray-based identification of a novel Streptococcus pneumoniae regulon controlled by an autoinduced peptide. J. Bacteriol. 182, 4696–4703 (2000).

Acknowledgements

We would like to thank Andrew Quinn for advice on the use of glucose oxidation assays, Florian Bock for practical biochemistry guidance, Johann Mignolet and Julien Dénéréaz for providing genetic constructs, Axel Janssen for integrating our data with PneumoBrowse 2, Kathrin Fröhlich for bringing the phenomenon of sugar-phosphate stress to our attention, and James Sáenz for valuable insights regarding membrane properties. Moreover, we thank Rieza Aprianto for proteo-transcriptomic sample preparation, and Johan Hekelaar at the Proteomics Facility of the University of Groningen for the LC–MS quantifications and his help in proteomics analyses. RNA-seq for the proteo-transcriptome experiment was performed at GeneCore, EMBL, Heidelberg. Library preparation and RNA-seq of the spv_0647 mutant and WT strain were performed at the Lausanne Genomic Technologies Facility, University of Lausanne, Switzerland. This work was supported by Swiss National Science Foundation (SNSF) PostDoc Mobility fellowship P500PB_225439 (V.d.B.), ERC consolidator grant 771534, SNSF grants 310030_192517, 310030_200792, NCCR ‘AntiResist’ 51NF40_180541 (J.W.V.), and NIH/NIDCR R00-029228 (J.L.B.).

Author information

Authors and Affiliations

Department of Fundamental Microbiology, Faculty of Biology and Medicine, University of Lausanne, Lausanne, Switzerland
Vincent de Bakker, Xue Liu & Jan-Willem Veening
Department of Microbiology, Harvard Medical School, Boston, MA, USA
Vincent de Bakker
Department of Pathogen Biology, Base for International Science and Technology Cooperation: Carson Cancer Stem Cell Vaccines R&D Center, International Cancer Center, Shenzhen University Health Science Center, Shenzhen, China
Xue Liu
Department of Biomaterial & Biomedical Sciences, School of Dentistry, Oregon Health & Science University, Portland, OR, USA
Jonah Tang, Matthew Barbisan & Jonathon L. Baker

Contributions

V.d.B. performed experiments, analyzed data, and wrote the manuscript. X.L. supervised experiments. J.T. and M.B. performed experiments and analyzed data. J.L.B. supervised experiments, analyzed data, and edited the manuscript. J.W.V. supervised the study and edited the manuscript.

Corresponding author

Correspondence to Jan-Willem Veening.

Ethics declarations

Competing interests

J.W.V. is a scientific advisory board member at i-Seq Biotechnology. The remaining authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Suzanne Dawid and the other, anonymous, reviewers for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information (PDF)
Supplementary Data 1 (XLSX)
Supplementary Data 2 (XLSX)
Supplementary Data 3 (XLSX)
Supplementary Data 4 (XLSX)
Supplementary Data 5 (XLSX)
Description of Additional Supplementary Files (PDF)
Reporting Summary (PDF)
Transparent Peer Review file (PDF)

Source data

Source Data (XLSX)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

de Bakker, V., Liu, X., Tang, J. et al. Multi-omics profiling reveals atypical sugar utilization and a key membrane composition regulator in Streptococcus pneumoniae. Nat Commun 16, 10429 (2025). https://doi.org/10.1038/s41467-025-66611-0

Received: 05 June 2025
Accepted: 11 November 2025
Published: 21 November 2025
Version of record: 25 November 2025
DOI: https://doi.org/10.1038/s41467-025-66611-0