Introduction

The intestinal microbiome plays a fundamental role in human health, especially during infancy, contributing to child nutrition and development, protecting against pathogens, and educating the immune system1. The composition of the infant gut microbiome changes dynamically during the first year of life2,3, being clearly different from the adult gut microbiome as it transitions to a mature state after the introduction of solid foods2,4.

Human milk is the gold standard for infant nutrition and provides a mixture of nutrients and bioactive components that support proper infant growth5,6. One hallmark of human breast milk is a high concentration of human milk oligosaccharides (HMOs)7, which, among other functions, shape the infant gut microbiome8,9. HMOs are a family of linear and branched unconjugated glycans composed of a lactose core with linear chains varying from 3 to 15 units. HMOs are composed of five monosaccharides: glucose, galactose, fucose, N-acetylglucosamine (GlcNAc), and N-acetylneuraminic acid (NeuAc; sialic acid)6,10. The proportions of fucosylated, sialylated, and non-fucosylated neutral HMOs in breast milk are 35–50%, 12–14%, and 42–55% respectively11. Some of the most abundant and representative HMOs are 2´-fucosyllactose (2´FL), lacto-N-tetraose (LNT), and 3´-sialyllactose (3´SL)11. Breastfeeding often enriches Bifidobacterium spp., though this varies across infants and populations5,12,13. These bacteria are beneficial for the host through diverse mechanisms14. The ecological success of these microorganisms in the infant gut may be related to the overrepresentation of several carbohydrate utilization genes in their genomes, participating in the transport and metabolism of different types of HMOs15,16,17,18. Individual members of other taxa, such as Lachnospiraceae and Bacteroides, have also been described for their adaptation to HMO utilization19,20,21, while certain Lactobacillus strains exhibit limited growth on specific HMO substructures22.

The most studied species capable of metabolizing HMOs are Bifidobacterium bifidum, Bifidobacterium longum subsp. infantis (B. infantis), Bifidobacterium longum subsp. longum (B. longum) and Bifidobacterium breve, among others23,24. For example, B. infantis targets most HMOs and possesses multiple genes associated with transport via ABC transporters and enzymes that metabolize them intracellularly23,25. In contrast, B. bifidum can also grow on all types of HMOs but by an extracellular mechanism characterized by multiple membrane glycosyl hydrolases capable of breaking the glycosidic bonds in HMOs and transporting a few molecules such as lactose and lacto-N-biose26,27,28. B. breve and B. longum generally consume only certain HMO types, such as LNT, by an intracellular mechanism10,29. These different utilization strategies could have ecological implications, as an extracellular mechanism could favor carbohydrate cross-feeding, for example, of NeuAc or fucose30. HMO fermentation usually results in high concentrations of acetate and lactate, which could also participate in cross-feeding interactions with butyrate and propionate producers30,31. In addition, Bifidobacterium and other species with similar nutritional niches targeting the same HMOs are expected to compete for some of these HMOs.

The role of individual species in the gut microbiome should be considered in the context of the collective activity of numerous microorganisms and their metabolites32. Metabolic interactions are highly frequent in the human gut microbiome, and metabolites participate in complex networks of cross-feeding or exploitative competition32,33. Understanding complex microbial ecosystems is crucial for predicting the functional state of the human gut microbiome and developing effective microbiome modulation strategies, for example, using HMOs34,35,36,37. Synthetic microbial communities assembled from host-derived strains have received considerable attention for the characterization of the ecological and metabolic features of the human gut microbiome38,39,40. Common approaches in microbe-microbe interactions combine 16S rRNA gene-based composition analysis with metabolite measurements, mathematical modeling, and metatranscriptomics. These methods collectively enable the examination of community dynamics, stability, inter-species metabolic interactions, and trophic roles within the microbiome. Controlled in vitro models can provide insights into microbe-specific temporal dynamics, niche overlap, and trophic interaction networks, shedding light on the coexistence of essential intestinal microbes in a competitive and dynamic ecosystem41,42,43,44.

This study aimed to determine functional and ecological roles within a synthetic microbial consortium, including important microorganisms with different mechanisms of HMO utilization, using a species dropout approach.

Results

In this study, we assembled a synthetic consortium composed of seven infant gut microbes to determine their ecological roles during the consumption of HMO (Fig. 1). These microbes display different mechanisms of HMO utilization: some are well known for their ability to use multiple HMOs either intra or extracellularly (B. infantis, B. bifidum) or are partial HMO-utilizers (B. breve, Bacteroides thetaiotaomicron). Other species can use HMO-derived monosaccharides, such as fucose, sialic acid, GlcNAc, or lactose (Escherichia coli, Pediococcus acidilactici, Lachnoclostridium symbiosum). Table 1 presents additional functional and genomic information on these microorganisms. Synthetic consortia were cultured in batch bioreactors with a mixture of LNT, 2´FL, and 3´SL, and fermentations were assessed by growth kinetics, relative abundance of species, HMO consumption, acid production, and changes in community gene expression. Specific dropouts are indicated by a Δ notation (e.g., ΔBBIF), indicating that the aforementioned species was omitted from the bioreactor (Table 1).

Fig. 1: Schematic summary of approaches used in this study.
Fig. 1: Schematic summary of approaches used in this study.
Full size image

Graphical abstract of the study. A biological replication of a synthetic consortium of seven infant gut microbes using three HMOs was established. Specific bacterial species were selectively removed, and the system was characterized by culture growth, metabolite production, and HMO utilization. Metatranscriptome analyses were obtained for each condition. Multivariate analyses and correlation networks were obtained. Finally, a mathematical model consisting of a system of ordinary differential equations was constructed to quantify the degradation rates of complex sugars and enable comparisons across different experiments. These comprehensive analyses led to the development of a model describing interactions among the consortium’s diverse bacterial members. Figures were retrieved from BioRender.

Table 1 Culture, functional, and genomic information of the microorganisms used in this study

Growth kinetics

We first evaluated how individual species dropouts affected growth parameters such as growth rate, total biomass, and lag time (Fig. 2). For the ALL consortium, the lag phase lasted 7.8 h, and µmax was 0.76 h-1. Growth rates were significantly reduced in the absence of B. bifidum, B. breve, and P. acidilactici, indicating these bacteria might contribute to the system’s combined growth (Fig. 2). Absence of P. acidilactici significantly increased lag phase. ΔBBIF had a lower growth rate and the shortest lag phase duration (Fig. 2). ΔBINF and ΔBTHE showed similar kinetic parameters to ALL, with an exponential phase starting earlier at 4 h. These results suggest that the absence of certain bacteria reduces community growth rate and total biomass, and some dropouts could accelerate the start of the exponential phase.

Fig. 2: Growth kinetics parameters in species dropout experiments.
Fig. 2: Growth kinetics parameters in species dropout experiments.
Full size image

Growth curves from the dropout experiments (A) and statistical analysis of their parameters (B). Culture growth was quantified by measuring OD600max. Growth curves were adjusted to the Gompertz-Zwietering growth model to determine its parameters for each experiment. The parameters µmax associated with the maximum growth rate, OD600max associated with the maximum growth achieved, and Tlag associated with lag phase time were analyzed. Statistical significance is reported based on adjusted p-values, and the color code indicates the experiment to which it differs. For instance, for the Tlag parameter, two blue asterisks are observed over the ΔPACI experiment, indicating a significant difference compared to the ΔBBIF experiment (0.001 < ** < 0.01 < * < 0.05).

Relative abundance of species

The microbial composition of the bioreactors was determined during the exponential and stationary growth phases by qPCR. This information could reveal how the absence of certain microbes allows others to expand their niche or affect the whole system due to their functional relevance. In the ALL consortium, B. breve dominated during the exponential phase, and B. bifidum during the stationary phase (Fig. 3). Both bacteria maintained a significant abundance across all conditions evaluated. The representation of B. bifidum increased earlier in the fermentation in the absence of B. infantis or B. thetaiotaomicron (ΔBINF or ΔBTHE; Fig. 3), suggesting competition. In contrast, the only condition in which B. infantis was dominant was in ΔBBIF. In this condition, a higher representation of E. coli and L. symbiosum was found compared to ALL. In aggregate, the relative abundance analysis suggests possible scenarios of resource competition between dominant HMO users such as B. bifidum, B. infantis, or B. thetaiotaomicron. When controlling for differences in initial cell counts (log10 fold change in absolute abundances; Supplementary Table 1), B. bifidum was able to increase up to 100 times its relative abundance in most dropouts, and L. symbiosum increased 600 times in ΔBBIF from its initial abundance.

Fig. 3: Relative abundance of the strains used in this study.
Fig. 3: Relative abundance of the strains used in this study.
Full size image

Relative abundance of species present in each experiment at three time points: initial time (0 h), exponential growth phase (10 h), and stationary growth phase (18 h). The relative abundance was calculated using specific primers for each species by qPCR to determine the quantity of cells present. Subsequently, counting and normalization were performed, allowing the relative amount of each to be calculated.

HMO consumption and metabolite production analysis

It is likely that the absence of certain species changes HMO consumption patterns of this community, for example consumption times, preference for some HMO, or generation of degradation products. Similarly, short-chain fatty acids (SCFAs) could be differentially produced due to changes in HMO consumption or species abundances. Different patterns of HMO consumption were observed upon species dropouts (Fig. 4). Lactose was found to be present in one of the HMOs (3´SL), which was confirmed by TLC. The ALL condition showed a reduction in all HMO at 12 h, which resulted in fucose accumulation at the end of the experiment (Fig. 4). A gradual increase in lactose was observed up to 10 h, which can be explained by the extracellular degradation of 2´FL and 3´SL. In ΔBBIF, no free monosaccharides were found, and a gradual lactose increase was not observed. This is consistent with B. bifidum extracellular HMO hydrolysis and the release of degradation products (Fig. 4).

Fig. 4: Consumption of HMOs and derived carbohydrates for each species dropout experiment.
Fig. 4: Consumption of HMOs and derived carbohydrates for each species dropout experiment.
Full size image

Thin layer chromatography (TLC) of carbohydrates of interest from experiments cultivated in mZMB with a representative mix of HMOs as carbon source. Fucose (Fuc), galactose (Gal), glucose (Glc), lactose (Lac), 2´-fucosyllactose (2’FL), 3´-sialyllactose (3’SL), and lacto-N-tetraose (LNT) were used as standards. Samples were taken every 2 h. TLCs were performed in duplicates, and the representative replicate is shown.

Interestingly, ΔBINF and ΔBTHE displayed accelerated degradations of all HMOs (up to 4–6 h) and an increase in monosaccharide release (fucose, galactose, and glucose), which were eventually consumed at 10–12 h (Fig. 4). This faster consumption is consistent with the HMO extracellular utilization mechanism of B. bifidum and suggests less competition, allowing B. bifidum to dominate earlier the consortium (Fig. 3). The absence of B. breve, P. acidilactici, or L. symbiosum caused a delay in monosaccharide removal that lasted until the end of the fermentation (Fig. 4). Finally, sialic acid concentrations in ALL bioreactors peaked at the exponential phase (Supplementary Fig 1), decreasing later, suggesting that some species consumed this monosaccharide. Dropouts of B. breve, P. acidilactici, and especially L. symbiosum resulted in the accumulation of sialic acid, suggesting they are responsible for NeuAc consumption.

Acetate and lactate were present at high concentrations as end products (Supplementary Fig 2). ΔBBIF resulted in the highest acetate concentration, and ΔBINF resulted in the highest lactate concentration. The absence of B. breve resulted in high succinate production by the consortium (Supplementary Fig 2), which correlates with an overabundance of B. thetaiotaomicron.

Correlation analysis of bioreactor data

Next, data from bioreactor kinetics, relative abundance, carbohydrate consumption, and acid production was used to determine if some variables of these systems showed non-evident correlations. Principal component analysis and a correlation matrix were calculated (Fig. 5). PCA indicated that the combined characteristics of ΔBTHE and ΔBINF dropouts were very similar. The ALL and ΔECOL bioreactors were unique and not grouped with other dropouts in the exponential phase (Fig. 5A), and ΔBBRE did not group with other dropouts in the stationary phase (Fig. 5B). Figure 5C shows that at the exponential phase, B. bifidum abundance correlated significantly with NeuAc and lactate concentrations, suggesting it contributes to their accumulation. In this phase, the concentrations of the three HMOs correlated positively, indicating that they are consumed at a similar rate (Fig. 5C). At the stationary phase, B. thetaiotaomicron correlated with succinate, indicating it contributes to its production, and B. bifidum with biomass production, which is consistent with its dominance across most conditions. The concentration of simple carbohydrates such as lactose, glucose, and lactose, instead of intact HMO, correlated positively at the stationary phase. This is consistent with the extensive release of simple carbohydrates after HMO degradation (Fig. 5B).

Fig. 5: Multivariate analyses performed in this study.
Fig. 5: Multivariate analyses performed in this study.
Full size image

Multivariate analyses of bioreactor data. Panels display principal component analyses for exponential (A) and stationary (B) phases (10 and 18 hours, respectively), along with correlation network construction for exponential (C) and stationary (D) phases. These analyses were based on measurements of fucose, galactose, glucose, lactose, 2’FL, 3’SL, LNT, acetate, succinate, lactate, butyrate, culture absorbance, relative species abundance, and the Shannon coefficient to quantify culture diversity. The principal component analysis (A and B) shows the first two components explaining the most variability in the data (PC1 and PC2). Correlation networks (C and D) were constructed based on the Spearman correlation coefficient, with significant interactions filtered at a 5% significance level and an absolute magnitude greater than 0.6. Burgundy-colored edges denote a positive correlation, while blue edges indicate a negative correlation. The thickness of the edge reflects the absolute magnitude of the correlation, with a thicker edge indicating a stronger correlation. The size of the node represents its degree, i.e., the number of edges connected to the node; a larger node implies a higher number of connections. Circular nodes correspond to system metabolites, while rhombus-shaped nodes are related to biomass.

Metatranscriptomic changes upon species dropouts

We later aimed to determine global changes in gene expression in these synthetic consortia in exponential phase upon species dropouts, by metatranscriptomic RNA-seq (Supplementary Fig 3 and Fig. 6). Global gene expression during exponential phase of the ALL consortium was used as a reference, and Supplementary Table 2 presents the statistics of this analysis. After quality filtering, reads aligning with specific genes were counted and normalized. A PCA of the normalized counts matrix was used to compare replicates and determine metatranscriptional variations upon species dropouts (Supplementary Fig 3). ΔECOL had the most distinct metatranscriptome compared to other samples, followed by ΔBBIF. A heatmap of global gene expression across dropouts showed a large expansion in gene expression of B. infantis and B. breve in the absence of B. bifidum, suggesting a negative interaction between these bacteria. E. coli appeared the most responsive bacteria to any dropout (Fig. 6). In turn, B. infantis, B. breve, B. bifidum and B. thetaiotaomicron showed important changes in gene expression upon the absence of E. coli (Fig. 6). Finally, B. thetaiotaomicron presented several transcriptional changes when B. breve was not present, and P. acidilactici repressed several genes when B. infantis, B. thetaiotaomicron or L. symbiosum were left out (Fig. 6).

Fig. 6: Hierarchical clustering analysis performed in this study.
Fig. 6: Hierarchical clustering analysis performed in this study.
Full size image

Hierarchical clustering analysis was conducted using differential gene expression data for each bacterium within each experiment. The reference expression dataset consisted of the experiment containing ALL consortium. In the resulting heatmap, columns represent individual experiments, while rows correspond to specific differentially expressed genes. Genes are grouped according to their respective bacteria. The color intensity in the table reflects the expression change, quantified as Log2FC, with burgundy denoting overexpression and blue indicating repression.

We next examined specific functional changes in the metatranscriptomes upon species dropouts. The most consistent changes mapped to genes related to the utilization of HMO-derived monosaccharides such as fucose, sialic acid, GlcNAc, and galactose, central metabolism and certain physiological functions (Supplementary Information). Two general patterns emerged: niche expansion under competitive release and dependence on cross-feeding partners.

In ΔBBIF, B. infantis and B. breve markedly upregulated several HMO transport and degradation genes (LNB/GNB clusters, type 2 HMO utilization cluster, fucosidases, sialidases), together with fermentative pathways for GlcNAc, galactose, fucose, and NeuAc. Similar induction patterns were observed in ΔBINF, where B. bifidum expanded expression of LNB, galactose, GlcNAc, and central metabolism genes. Their expansion in the absence of the other is consistent with competition and niche overlap, with direct competition for the same HMO-derived carbohydrates (Fig. 7).

Fig. 7: Differential expression of specific strains in the synthetic consortium.
Fig. 7: Differential expression of specific strains in the synthetic consortium.
Full size image

Visualization of differential expression of relevant bacteria in the specified consortium. In the upper panel of each panel, volcano plots with burgundy indicate overexpression, blue indicates repression, and gray denotes non-significant changes, with each point representing a gene and the shape of the point corresponding to the microorganism on the right. Expression thresholds are marked at -2 and 2 for Log2FC (fold change) and 0.01 for the adjusted p-value. The number of differentially expressed genes assigned to a specific bacterium is shown in the lower panel below the volcano plot. A ΔBBIF vs ALL; B ΔBBRE vs ALL; C ΔBINF vs ALL; D ΔBTHE vs ALL; E ΔECOL vs ALL; F ΔPACI vs ALL.

In contrast, E. coli repressed genes for fucose, NeuAc, and galactose utilization in ΔBBIF, ΔBINF, and ΔBTHE (Fig. 7 and Supplementary Information). This suggests that its ability to metabolize these sugars depends on the presence of other bacteria. P. acidilactici also downregulated NeuAc, GlcNAc and galactose transport upon these three dropouts, further supporting the idea that some community members rely on external activity to access specific substrates. A similar pattern emerged in ΔLSYM, where E. coli repressed NeuAc and central metabolism genes and P. acidilactici again showed broad repression. These results highlight that, beyond competition, cross-feeding and facilitation play a major role in structuring the consortium.

Beyond sugar metabolism, in ΔBBIF and ΔBINF E. coli repressed multiple respiratory modules, including NADH dehydrogenase complexes, cytochrome oxidases, and fumarate reductase, together with several flagellar biosynthesis genes (Fig. 7 and Supplementary Information). In parallel, genes for nitrate and nitrite reductases, lactate dehydrogenase, and quinone oxidoreductases were induced, pointing to a switch toward alternative anaerobic pathways and fermentative metabolism. This coordinated reduction of high-energy respiration and motility functions, coupled with activation of lower-energy routes, indicates a broad downshift in energy metabolism. These results suggest that in these conditions E. coli not only loses access to certain HMO-derived monosaccharides, but also enters a distinct physiological state characterized by reduced respiratory capacity, altered redox balance, and motility repression, potentially reflecting energy conservation. In other deletions, E. coli showed induction of arginine and GABA metabolism (ΔPACI, ΔLSYM), pointing to context-dependent reprogramming that extends beyond HMO utilization.

Other deletions revealed strong competitive release within the carbohydrate niches. In ΔECOL, B. breve markedly increased transcription of HMO utilization modules (NeuAc, fucose, GlcNAc, LNB/GNB cluster) and central metabolism genes, highlighting its overlap with E. coli (Fig. 7 and Supplementary Information). Similar patterns of niche expansion were observed for B. bifidum in ΔBINF and ΔBTHE, where galactose and GlcNAc pathways were strongly induced. B. thetaiotaomicron also expanded its range in ΔBBRE, upregulating fucose and GlcNAc genes in the absence of B. breve. Together, these responses highlight the extent of shared metabolic niches and the rapid compensatory capacity of Bifidobacterium and Bacteroides when competitors are absent.

Mathematical modeling of ecological interactions in a synthetic consortium

To further understand the dynamics of HMO degradation, we developed a system of ordinary differential equations derived from the main HMO hydrolysis reactions as in previous works (Supplementary Information45,46). The model describes mass balances of HMO degradation and their impact on community growth.

Parameter sensitivity analysis showed that the extracellular hydrolysis constants of LNT, 2’FL, 3’SL, and lactose (k1–k4) were consistently the most influential (Fig. 8), underscoring the central role of HMO breakdown in shaping consortium dynamics. The model reproduced overall growth trends and sugar consumption patterns observed experimentally, and simulations revealed that species deletions altered HMO degradation parameters. Removal of B. bifidum markedly reduced degradation rates across all HMOs, supporting its role as a key carbohydrate provider. In contrast, drop-outs of B. infantis and B. thetaiotaomicron accelerated HMO consumption, reflecting niche expansion and enhanced activity of B. bifidum in their absence.

Fig. 8: Statistical analysis of complex oligosaccharides degradation parameters in the mathematical model.
Fig. 8: Statistical analysis of complex oligosaccharides degradation parameters in the mathematical model.
Full size image

Statistical analysis of HMO degradation parameters in the mathematical model. The degradation rate of LNT is represented by parameter k1 in the model (A), the degradation rate of 2’FL is defined by parameter k2 (B), the degradation rate of 3’SL is represented by parameter k3 (C), and the degradation rate of lactose is represented by parameter k4 (D). The statistical significance shown corresponds to adjusted p-values less than 0.0001. In the figure’s subtitle, the following values are indicated in the given order: the Chi-square statistic, the associated p-value for the statistical test, the estimated coefficient of determination of the ANOVA model, the 99% confidence interval for this coefficient value, and the total number of observations in the test. The median value of the parameter is indicated in the distributions. The experiment name is listed on the X-axis, along with the number of values in the distribution and the parameter’s value in the adjusted model.

Discussion

Species dropouts is a valuable approach for identifying emergent roles of microbes within complex communities38. Synthetic gut microbiome consortia help determine the metabolic role of individual microorganisms and how unique strains affect gut microbiome structure and function. This approach has also been used to identify keystone species, defined as those whose disappearance causes a dramatic effect on the entire community47.

Bifidobacteria are frequent and important members of the early-life microbiota, particularly in breastfed infants in certain cohorts, and may contribute significantly to shaping the infant gut ecosystem13. HMO utilization is a crucial adaptation of bifidobacteria to the host48. B. infantis and B. bifidum share and compete for the same nutritional niche, but they are characterized by different HMO consumption mechanisms. Interestingly, both microorganisms did not adapt or modify their nutritional preferences in the presence of their competitor and did not engage in any cross-feeding interaction. The consumption mechanisms of these two archetypical bacteria also have important differences. B. infantis devotes a significant number of ABC transporters, multiple glycosyl hydrolases, and multiple metabolic pathways for consuming HMO and its constituents49. This strategy is energy-demanding, at least compared with B. bifidum extracellular degradation. B. bifidum has a more restricted preference for carbohydrates, mainly lactose and LNB, not metabolizing fucose or sialic acid27,50. We hypothesize that the predominance of B. bifidum over other competitors, especially B. infantis is due to its targeted strategy that probably gives this microorganism a higher growth rate on HMO under similar conditions.

Our results suggest that in this system, B. bifidum is a keystone species. It plays a central role in degrading HMOs, affecting community growth kinetics, and providing critical resources for the rest of the consortium. B. bifidum facilitates cross-feeding, leaving partially degraded HMO products51. Other studies have demonstrated a syntrophic relationship with B. breve, for example sharing fucose or sialic acid, which is supported by their predicted co-occurrence30,48,52. Its degradation activities can also foster the growth of other bacteria such as Blautia wexlerae53. Here, we also observed that L. symbiosum consumed NeuAc released by B. bifidum, and E. coli exploited multiple degradation products. Importantly, B. bifidum growth temporarily increased lactose concentration in the medium, probably derived from 2’FL and 3’SL. Most gut microbes, either commensal (such as LAB) or pathogenic bacteria, can access lactose. In contrast, B. infantis seems to avoid carbohydrate cross-feeding interactions with other bacteria, but acetate, lactate, and succinate produced by B. infantis have been involved in SCFA cross-feeding, for example, with butyrate- and propionate-producing bacteria54. The contrasting ecological consequences may have critical consequences for microbiota assembly and infant health.

Carbohydrate utilization was a dominant ecological force, driving transcriptional reprogramming upon dropouts. A recurrent observation was niche expansion: most microorganisms broadened substrate use in the absence of a competitor, revealing strong competition in this simplified model. Although B. bifidum showed a fitness advantage over B. infantis in this in vitro setting, this result contrasts with in vivo observations of some studies55. These differences likely reflect model limitations and underscore the importance of host and environmental factors in shaping microbial colonization in the infant gut.

Niche expansion was also reflected by the increased gene expression of central metabolic pathways and the activation of HMO-related metabolic pathways upon species dropouts. Microorganisms in the synthetic consortium in this study fell into two categories: those able to access complex HMO and those using only degradation products (Fig. 8). A strong competition between E. coli and B. breve was reported here, where evidence suggested they compete for similar nutrients in the presence of B. bifidum. Bacterial communities in the human gut form trophic interactions56, with specialized roles in the degradation of complex polysaccharides, simple sugars, or the use of SCFA to release inorganic substrates such as CO2 or H240.

Dropout experiments partially mimicked the impact of microbial extinctions in complex ecosystems such as the gut microbiome. Beneficial microbes are decreasing in modern societies5, and some Bifidobacterium species have shown a marked decline in prevalence, particularly in industrialized populations55. Infants in industrialized countries show a significant reduction in B. infantis and gene clusters associated with HMO utilization57. This phenomenon has been partly attributed to the shortening of breastfeeding in Western societies. The lack of HMO genes in their metagenomes correlates with increased inflammation and immune dysregulation, which might be related to alterations in SCFA production patterns in the infant’s gut30. In turn, B. breve appears to be the most dominant species in industrialized infants13,57, despite being a sub-optimal HMO consumer. B. breve and B. longum have been characterized as sub-optimal HMO users in gnotobiotic mice, being outcompeted by efficient HMO consumers58. However, B. infantis has been associated with reduced asthma risk in antibiotic-exposed infants, while B. breve has shown beneficial effects on intestinal barrier maturation and immune modulation14,59. We hypothesize that a microbiome dominated by sub-optimal HMO consumers such as B. breve will not use the entire potential of HMO, rendering some HMOs unconsumed and being lost in infant feces. Interestingly, HMO administration permits the engraftment of B. infantis in adults60. This symbiotic combination increased butyrate and butyrate-producing bacteria abundance in vitro and in vivo, as well as propionate-producing bacteria in subjects undergoing antibiotic-induced dysbiosis, thereby reducing inflammatory markers54.

One important limitation of this study is the assumption that all dropouts follow a similar trajectory and that priority effects are not relevant. In microbial ecology, priority effects relate to the impact of order and the impact of species arrival in the structure of a community61. Dissimilar initial abundances (Fig. 3) might result in different final microbial compositions upon each dropout, and even in different rates or mechanisms of HMO utilization, limiting the observations of this study. Interestingly, HMO reduce significantly the impact of priority effects in vitro and in vivo62, suggesting that the deterministic force of HMO utilization overrides any initial condition of the community. Similarly, studies on priority effects on bifidobacteria show that B. breve benefits from fucose released from other microbes to become a dominant species63.

In addition to these ecological considerations, there are methodological limitations inherent to our experimental setup. Although each species was inoculated at OD600 = 1.0, we acknowledge that OD600 does not equate to equivalent cell numbers due to differences in cell size and optical properties, which may contribute to variability between inocula. Furthermore, traces of E. coli DNA were detected in the ΔECO condition by qPCR at Time 0, potentially due to residual DNA in the bioreactor system or minor cross-contamination during inoculation. However, this signal was less than 2% of total genomic DNA, and ΔECO exhibited distinct metabolic and transcriptomic profiles, suggesting minimal impact on overall community dynamics. Two strains used in this study (B. breve and P. acidilactici) are laboratory isolates under characterization for HMO utilization with no major previous history of use, whereas all other members of the consortium were well-established type strains. While strain-level variation may influence ecological interactions in vivo, we believe this simplified consortium captured key metabolic behaviors of the infant gut microbiome. The simplified nature of our synthetic consortium, the use of a single strain per species, and bioreactor-specific constraints (e.g., possible biofilm formation) also limit extrapolation of these results to the highly complex and dynamic environment of the infant gut.

In summary, these findings reveal emergent ecological roles within a synthetic infant gut community. B. bifidum acts as a keystone species by degrading HMOs extracellularly and releasing simple carbohydrates that support cross-feeding with other species. In contrast, species such as B. infantis and B. thetaiotaomicron compete for similar niches and primarily rely on intracellular HMO utilization. B. breve exhibits cooperative traits by releasing fucose, lactose, and galactose, while E. coli shows strong competition for HMO-derived substrates. P. acidilactici and L. symbiosum appear to be important in the consumption of lactose and NeuAc, respectively. At the community level, HMO degradation shaped competition, niche expansion, and metabolic acceleration, providing mechanistic insights into colonization dynamics in the infant gut.

Methods

Bacterial strains and culture media

The strains used in this study were obtained from BEI Resources, ATCC, and UC Davis Culture Collection (Table 1). For routine experiments, microorganisms were cultured in their respective media (Table 1). Luria-Bertani medium (LB; Becton, Dickinson, Franklin Lakes, NJ) was used directly, while reinforced clostridium broth (RCM; Becton, Dickinson, Franklin Lakes, NJ) and Man-Rogosa-Sharpe broth (MRS; Becton, Dickinson, Franklin Lakes, NJ) were supplemented with 0.05% w/v of L-cysteine-HCl (Sigma-Aldrich, St. Louis, MO, USA). The cultures were incubated at 37 °C for 24–48 h in an anaerobic jar (Anaerocult; Merck, Darmstadt, Germany) with anaerobic packs (Gaspak EM; Becton-Dickinson, Franklin Lakes, NJ, USA).

Bioreactor setup and operation

Eight batch experiments were conducted in duplicate using two 250 mL bioreactors connected to a MyControl system under anaerobic conditions (Mini-bio Applikon Biotechnology, The Netherlands). The first experiment involved inoculation with the complete consortium (ALL). The remaining experiments replicated the conditions of the whole consortium but with one species out each time. Therefore, each bioreactor was inoculated with six strains (Table 1). For simplicity, consortia where a given species was omitted from inoculation are referred to using the Δ notation (e.g., ΔBBIF), indicating a ‘species dropout’ condition rather than an active deletion from an established community.

Before inoculation, microorganisms were cultured in modified ZMB medium64 supplemented with lactose (20 g/L) at 37 °C for 24–48 h under anaerobic conditions. Microorganisms were inoculated at an OD600 of 1.0 and were previously washed in fresh reduced mZMB without a carbon source. The bioreactors contained the same mZMB formulation and were supplemented with three HMOs at 2% (45% LNT, 40% 2´FL, and 15% 3´SL) (Glycom, Denmark). These values corresponded to approximate HMO concentrations in breast milk reported by Conze et al.65. The anaerobic environment inside the bioreactors was maintained using nitrogen (99.99% purity), which was injected at the beginning of the fermentation and was continuously monitored. The temperature was set to 37 °C, and the mixture was stirred at 90 rpm. The pH was maintained constant throughout the fermentation at 5.5 using an automatic injection of NaOH and HCl 3 M. Samples were taken every two hours up to 24 h and immediately centrifuged at 8000 rpm for 3 min. Pellets were stored at -80 °C and subsequently used for DNA extraction and determination of relative abundance by qPCR. In addition, another pellet was resuspended in 1 mL of RNA later (Sigma, St. Louis, MO, USA). The supernatants were used for metabolite analysis. The growth curves of the experiments were adjusted to the Gompertz-Zwietering66 equation using R software, obtaining the growth parameters for each replicate. Statistical analysis of growth parameters was performed in the R package stats.

DNA extraction

Genomic DNA was extracted using a modified phenol-chloroform isoamyl protocol67. Briefly, cell pellets were lysed with lysozyme and incubated at 37 °C for 30 min (Amresco, Toronto, Canada). The suspensions were purified using phenol: chloroform: isoamyl-alcohol 25:24:1 pH 8 and sterile acid-washed-glass beads (Sigma, St. Louis, MO, USA). The cells were disrupted using Disruptor Genie for 6 min (Scientific Industries, Bohemia, NY, USA) and centrifuged for phase separation. The supernatants were purified using chloroform: isoamyl alcohol 24:1 and centrifuged for phase separation. The DNA was precipitated using isopropanol and sodium acetate 3 M. The precipitated DNA was pelleted by centrifugation, washed twice with cold ethanol, and dried by ethanol evaporation. DNA concentrations were calculated by measuring absorbance at 260 nm using an Infinite M200 PRO spectrophotometer (Tecan)51.

Carbohydrate consumption

The total carbohydrate consumption was analyzed by Thin Layer Chromatography (TLC) using a silica gel plate F 60 (Merck, Germany). Galactose, glucose, fucose, lactose, 2´FL, 3´SL, and LNT were used as standards. 1-butanol/ethanol/water 1,75:1,25:1 v/v was used as the mobile phase, and 1% α-naphthol, 0.5% sulphuric acid (5% solution) in ethanol as solvent was used as revealing solution34. 1 µL of each sample was diluted in 2 µL ultrapure water, applied to the plate, dried, and run, followed by a 10 s incubation in an oven68. Numerical values were estimated using ImageJ software based on band intensity69. Sialic acid concentrations were determined using an enzymatic kit (Sigma Aldrich, Germany) according to the manufacturer’s instructions.

High-Performance Liquid Chromatography (HPLC)

Acetic, butyric, lactic, propionic, and succinic acids from each supernatant obtained from the bioreactor experiments were quantified using a Lachrom L-700 HPLC system (Hitachi, Japan), using an Aminex HPX87H Ion exclusion column (300 mm × 7.8 mm, Bio-Rad) and an isocratic mobile phase with 5 mM H2SO470. 30 µL of supernatant were injected at a flux of 0.45 mL/min at 35 °C for 35 min. Standard curves were obtained using nine dilutions of 30 to 0.155 g/L of each acid in HPLC-grade water. 24 conditions were tested in duplicates.

Determination of relative bacterial abundance

Relative and absolute bacterial abundance were determined by qPCR using a set of species-specific primers based on the unique genes present in each microorganism (Supplementary Table 3). Amplification and detection were carried out using StepOnePlus equipment (Applied Biosystems, USA) in 96-well optical plates (MicroAmp Fast Optical, Thermo Fisher, USA). Briefly, 96-well plates were filled with a mixture containing 5 mL of PowerUp SYBR Master Mix or Fast SYBR Green Master Mix (Applied Biosystems, USA), 0.3 mM of each primer (0.3 mL each), 4.4 mL nuclease-free water (IDT, USA), and 1 mL of DNA (diluted to 10 ng/mL). DNA samples were amplified with an initial hold at 50 °C for 2 min and a polymerase activation step at 95 °C for 15 min for PowerUp SYBR Master Mix or 95 °C for 20 s for Fast SYBR Green Master Mix, followed by 40 cycles of denaturation at 95 °C for 10 s, annealing and elongation at 60 °C for 30 s. A melting curve was generated to verify a single amplification peak by increasing the temperature from 65 °C to 95 °C. Reactions were performed in triplicate, and threshold cycle (CT) values were converted into genome copy numbers per mL according to a previously described procedure71.

RNA extraction and RNA gene sequencing

Pellets from 1 mL of each sample in the exponential phase were resuspended in 2 mL of RNA-later solution and stored at -80 °C. Subsequently, RNA was extracted with 1 mL of lysis buffer from the RNeasy Kit (Qiagen, MD) containing 1% β-mercaptoethanol (added before buffer use). The suspension was centrifuged to separate the aqueous phase from the organic phase. The aqueous layer was precipitated in ethanol, and RNA was re-dissolved in 400 μL lysis buffer and further purified using RNeasy columns (Qiagen, MD) according to the manufacturer’s instructions. The integrity and quantity of isolated total RNA samples were determined using a high-resolution electrophoresis system (Agilent 2100 Bioanalyzer, Agilent, CA). After quality verification, samples were depleted of rRNA using the QIAseq stranded total rRNA kit (Qiagen, MD). Libraries were prepared using the QIAseq Stranded RNA Library (Qiagen, MD) and sequenced on a NextSeq 500 Illumina at the Unidad de Genomica y Bioinformatica at the University of Santiago of Chile (Santiago, Chile).

Multivariate data analysis

Data from the exponential and stationary phases (10 and 18 h, respectively) were used for multivariate analysis. We considered the concentration of metabolites, biomass, relative abundance, and Shannon diversity coefficient72. The results were analyzed using the R software73. The prcomp function was used for principal component analysis (PCA), and the number of clusters was determined using the factoextra package74. Spearman’s correlation was calculated for network construction and interactions with a significance level greater than 5% were included. Cytoscape was used for network visualization75.

Metatranscriptomic analysis

A Jupyter notebook76 was generated using the Bash kernel, detailing the step-by-step procedures. The paired library quality was assessed using FastQC77 and MultiQC78, followed by quality filtering and trimming using Trimmomatic79 and VSEARCH80. The mRNA reads were separated from the rRNA reads using SortMeRNA81. Reference genomes were extracted from the Bacterial and Viral Bioinformatics Resource Center (BV-BRC) database82. The following genomes were used: Bifidobacterium bifidum JCM 1254 (Genome ID:398514.7), Bifidobacterium breve DSM 20213 = JCM 1192 (Genome ID:518634.20), Bifidobacterium longum subsp. infantis ATCC 15697 = JCM 1222 (Genome ID:391904.5), Bacteroides thetaiotaomicron VPI-5482 (Genome ID:226186.12), Escherichia coli str. K-12 substr. MG1655 (Genome ID:511145.12), Lachnoclostridium symbiosum WAL-14673 (Genome ID:742741.3), and Pediococcus acidilactici strain PMC65 (Genome ID:1254.353). HISAT283 was used for alignment, and file manipulation was performed using Samtools84. HTSeq85 was used to generate the count matrix. PCA was conducted on the count matrix with data transformed using variance stabilization (VST) from the DESeq2 package86. For PCA, the furthest replicate was excluded, and the analysis was performed using the two closest replicates.

Taxonomic composition was estimated using SAMSA287 to identify potential outlier replicates. One replicate per treatment was removed due to compositional discrepancies confirmed by principal component analysis. Normalization of metatranscriptomic counts was performed using the taxon-specific scaling approach described by Klingenberg et al.88. Briefly, raw counts were partitioned by organism, and for each taxon a matrix was constructed that included both treatment (Δ) and control (ALL) replicates. Size factors and dispersions were estimated across these taxon-specific datasets using DESeq286, ensuring that normalization reflected microorganism transcriptional activity rather than compositional differences in community structure. Differential expression analysis was performed using the DESeq2 package with the lfcShrink function89. A Benjamini-Hochberg adjusted p-value ≤ 0.01 and a log2 fold change ≥ 2 were used as filtering criteria. Volcano plots were generated with ggplot2 using the significant DESeq2 results. For each experiment, log2 fold-change was plotted against the –log10 adjusted P-value, with capped y-values to facilitate visualization of highly significant genes.

Dynamic model

The mathematical model was based on the mass balance of the system using a system of ordinary differential equations. This model is a proprietary development that uses modified Monod kinetics for community growth (Supplementary Information). The model contained 10 differential equations, 1 algebraic equation, and 22 parameters. MATLAB software was used for modeling and parameter fitting, employing a cooperative enhanced scatter search from the MEIGO package90. The parameter-fitting procedure was as follows. A primary fit was performed with five iterations for each set of experimental data, and the set with the lowest sum of squares error (SSE) was selected. Subsequently, an identifiability analysis was conducted for each set of experimental data to determine a set of highly sensitive parameters called “core parameters” and a set of less sensitive parameters called “shell parameters”. Once the parameter classification was determined, the core parameters were adjusted using 20 iterations, the obtained values were stored, and the set with the lowest SSE was selected. Finally, the shell parameters were adjusted using ten iterations, storing the values found and selecting the set with the lowest SSE. For statistical analysis, the values obtained for each iteration were collected, and an analysis of variance using the Kruskal-Wallis test was performed for paired comparisons. All the codes used can be found on the following Github page: https://github.com/taproschle/msc_taproschle.