Modeling host-associating microbes under selection

Bansept, Florence; Obeng, Nancy; Schulenburg, Hinrich; Traulsen, Arne

doi:10.1038/s41396-021-01039-0

Download PDF

Article
Open access
Published: 22 June 2021

Modeling host-associating microbes under selection

The ISME Journal volume 15, pages 3648–3656 (2021)Cite this article

5284 Accesses
14 Altmetric
Metrics details

Subjects

Abstract

The concept of fitness is often reduced to a single component, such as the replication rate in a given habitat. For species with multi-step life cycles, this can be an unjustified oversimplification, as every step of the life cycle can contribute to the overall reproductive success in a specific way. In particular, this applies to microbes that spend part of their life cycles associated to a host. In this case, there is a selection pressure not only on the replication rates, but also on the phenotypic traits associated to migrating from the external environment to the host and vice-versa (i.e., the migration rates). Here, we investigate a simple model of a microbial lineage living, replicating, migrating and competing in and between two compartments: a host and an environment. We perform a sensitivity analysis on the overall growth rate to determine the selection gradient experienced by the microbial lineage. We focus on the direction of selection at each point of the phenotypic space, defining an optimal way for the microbial lineage to increase its fitness. We show that microbes can adapt to the two-compartment life cycle through either changes in replication or migration rates, depending on the initial values of the traits, the initial distribution across the two compartments, the intensity of competition, and the time scales involved in the life cycle versus the time scale of adaptation (which determines the adequate probing time to measure fitness). Overall, our model provides a conceptual framework to study the selection on microbes experiencing a host-associated life cycle.

Natural selection for imprecise vertical transmission in host–microbiota systems

Article 23 December 2021

Applying evolutionary theory to understand host–microbiome evolution

Article 08 September 2025

Host dispersal relaxes selective pressures in rafting microbiomes and triggers successional changes

Article Open access 30 December 2024

Introduction

Fitness is a central concept in evolutionary biology, of particular importance for the theory of natural selection. Fitness measures how well a phenotype performs in terms of reproductive success, i.e., in terms of its ability to survive and reproduce. Natural selection, acting through reproduction and inheritance of the phenotypic traits, then leads to an increase in the population of the genotypes producing high fitness phenotypes [1].

In any system, fitness emerges mechanistically from birth and death events [2]. However, when it comes to the study of particular experimental systems or models, the question of how to measure fitness is often delicate, and fitness is often defined from the outset, as a phenomenological parameter. For practical reasons, fitness is often quantified under controlled laboratory conditions, using different proxies such as a net replication rate measured over a limited period of time, or a proportion of habitats successfully colonized. But none of these fitness components alone provides a holistic view of what fitness encompasses in natural conditions. Indeed, in nature, individual lineages within a species are often subject to multi-step life cycles, during which they transition across different habitats (e.g., hosts and environments), which may each favor distinct life-history characteristics. Some of the steps of these life cycles allow for offspring production, others may be developmental, or may concern migration or dispersal to the appropriate environments, or mate finding – in the case of sexual reproduction (see for example [3] for multi-step life cycles in animals). Fitness of an individual lineage is thus a multivariate function of all the life-history traits characterizing its life cycle, and in particular, its reproduction rates within the habitats and, importantly, transmission across habitats.

The development of methods to take into account life cycles in the assessment of fitness has proven important in a variety of contexts. Historically, age-structured models have been developed to study human demography [4]. In the context of species conservation, or, at the other end of the spectrum, pest management, the focus has been on finding the “Achilles heels” of species life cycles to design efficient strategies to act upon them, in order to shape and preserve biodiversity [4]. This idea has further been developed theoretically, within the conceptual framework of metapopulation dynamics [5, 6]. Moreover, life cycle characteristics are also central to the study of the onset of multicellularity, to understand why and how group replication can be selected for [7, 8].

The question of how life cycle components contribute to fitness is of particular relevance for the study of microbial communities that associate with hosts (i.e., host-associated microbiota). Intricate life cycles are common in nature, where microbes can for example use hosts as vectors between different habitats [9, 10]. Having a living host as a habitat adds complexity to the assessment of fitness, given that the presence of the microbes may impact the host fitness and vice-versa. Research has often been biased towards the host perspective, and has focused on how microbes can contribute to host fitness by extending the host functional repertoire, e.g., performing digestive or immune tasks [11,12,13]. An exception is epidemiology and parasitology, that have specifically addressed the impact of the host fitness on the pathogen, in the form of trade-offs between transmission and within-host virulence [14,15,16,17]. But what about commensal relationships, where bacteria do not have a negative impact on the host fitness? In this context, what are the factors that determine fitness of a microbial lineage?

Here, we focus on a primary aspect of the impact of a host on the overall fitness of a microbial lineage, in that it provides the microbe with an alternative habitat, where growth conditions are potentially different from an environmental habitat. We propose a framework to assess the selection gradient acting upon the life-history traits of microbes undergoing a biphasic life cycle, in which they alternate between phases of host association and free-living environmental phases. Biphasic life cycles are likely to be at the origin of host-microbiota associations and are still widespread in current associations [18, 19]. We propose that the overall fitness for a microbial lineage during such a biphasic life cycle needs to integrate evolutionary success across the different steps of the life cycle. It is therefore shaped by reproductive rates in both of the habitats and additionally by the migration rates between the habitats. The gradient of selection determines the direction in the phenotypic space that evolution is expected to follow to maximize overall fitness. Our general aim is to provide a tool to compare the relative importance of the different life-history traits of a microbial lineage, starting only from the equations that describe the population dynamics experienced throughout the life cycle. We explore a simple continuous-time two-compartment model that allows microbes to migrate between a host and an environment. We use the method of sensitivity analysis [4] to infer how strongly the overall growth rate depends on the traits we are considering. In the baseline version of the model, we consider unconstrained growth. Subsequently, we extend our framework to include population size constraints. We define the local direction of the selection gradient as the optimal strategy for a microbial lineage to adapt to its life cycle, starting from the local values of the traits. We show the existence of defined regions of different optimal strategies in the phenotypic space in which it is either more beneficial to optimize growth or migration. The boundaries of these regions are driven by modeling assumptions such as competition, and the probing time chosen to measure fitness.

Model

We focus on a single commensal microbial type and ask how the overall growth rate across its life cycle is affected by its life-history traits. We consider a simple biphasic life cycle, with two compartments corresponding to communicating habitats: a host and an environment. Let us write n_H(t) for the number of host-associated microbes at a given time, and n_E(t) for the number of environmental ones. We define the life-history traits of the microbial lineage as the rates at which individual microbes reproduce and die in each compartment, compete, and migrate from one compartment to another (Fig. 1A). The microbes reproduce clonally, and the net replication rates in the environment and within the host are r_E and r_H, respectively. They could encompass both offspring production and death, and thus could be negative. The migration rates from the host to the environment and from the environment to the host are m_E and m_H, respectively. We start with exponential growth. We later introduce intra-specific competition for space of intensity k_ij experienced by the microbes of compartment i due to the abundance of microbes in the compartment j. We assume that the number of microbes is large enough to be described by differential equations and assume that all rates introduced above are constant.

$$\left\{ {\begin{array}{*{20}{c}} {\frac{{\partial n_H}}{{\partial t}} = r_Hn_H + m_Hn_E - m_En_H - k_{HE}n_Hn_E - k_{HH}n_H^2} \\ {\frac{{\partial n_E}}{{\partial t}} = r_En_E + m_En_H - m_Hn_E - k_{EH}n_En_H - k_{EE}n_{E.}^2} \end{array}} \right.$$

(1)

**Fig. 1: Optimal strategies in the baseline model (no competition).**

In the following, we first consider unconstrained growth, where there is no competition (k_EE = k_HH = k_EH = k_HE = 0), before adding global competition (k_EE = k_HH = k_EH = k_HE = k), competition limited to one of the compartments (k_EH = k_HE = 0 and k_EE ≠ 0 or k_HH ≠ 0), and finally, equal competition in each of the compartments (k_EH = k_HE = 0 and k_EE = k_HH = k). While in nature it is likely that none of the k_ij vanishes and that a wide range of values are possible, the study of these limit cases gives powerful insights into what is to be expected in a wide range of situations.

Results

Baseline model: no competition

We start by assuming no competition and consider unconstrained growth in each of the two compartments. In this case, the equations describing our model become linear and can be rewritten in matrix form [4] as

$$\left( {\begin{array}{*{20}{c}} {\frac{{\partial n_{H}}}{{\partial t}}} \\ {\frac{{\partial n_{E}}}{{\partial t}}} \end{array}} \right) = \underbrace{\left( {\begin{array}{*{20}{c}} {r_{H} - m_{E}} & {m_{H}} \\ {m_{E}} & {r_{E} - m_{H}} \end{array}} \right)}_{{\mathrm{projection}}\, {\mathrm{matrix}}}\left( {\begin{array}{*{20}{c}} {n_{H}} \\ {n_{E}}\end{array}} \right)$$

(2)

The dominant eigenvalue λ of the above-defined projection matrix gives the asymptotic overall growth rate of the considered microbial lineage. This quantity is an appropriate measure of fitness [4] insofar as it measures reproductive as well as transmission success and recapitulates the effects of all the life-history traits (r_E, r_H, m_E, and m_H, also defining the phenotype in our model). Overall microbial fitness is thus integrated across the different steps of the life cycle, thereby considering the reproductive rates (i.e., replication rates) within each of the compartments and importantly transmission rates (i.e., migration rates) across the compartments. The dominant right eigenvector represents the stable distribution of microbes in the two compartments, and the number of microbes in each of the compartments grows exponentially with rate λ. The value of λ can be calculated at each point of the phenotypic space defined by the ranges of possible values that could be taken by the life-history traits r_E, r_H, m_E, and m_H. The dependence of λ on these traits tells us at which points of the phenotypic space fitness is maximized and how it can be increased at all other points.

From the projection matrix, we calculate the dominant eigenvalue as

$$\lambda = \frac{1}{2}\left(\sqrt {\left( {r_E + r_H - m_E - m_H} \right)^2 {\,}-{\,} 4\left( {r_Er_H - r_Em_E - r_Hm_H} \right)} + r_E +r_H - m_E - m_H \right).$$

(3)

Note that if microbes replicate at the same rate in the host and in the environment, i.e., if r_E = r_H = r, λ simplifies to r, regardless of the migration rates m_H and m_E. When there is an asymmetry between the two replication rates however, which is very likely to be the case in nature, then the migration rates also affect the overall growth rate. In the following sections, we study this effect compared to the effect of the replication rates. We arbitrarily set r_H ≤ r_E, and r_E > 0 – otherwise the lineage goes extinct. In biological terms, this corresponds to the situation where the microbial lineage is initially more adapted to the environment than to the host and thus grows faster in the environment. But mathematically, in this model, host and environment are symmetrical, i.e., they only differ by the rates defined above. Thus, the chosen direction of this inequality does not carry any strong meaning, and there is no loss of generality in making this choice. In particular, one can access the opposite biological situation where microbes replicate faster in the host than in the environment – as is the case for viruses, that can only replicate in the host (r_H > 0) but decay in the environment (r_E < 0) – by a single switch of the index E and H.

Let us first study the case where the migration rates from and towards the environment are equal, i.e., m_E = m_H = m > 0. Setting r_E = 1 to scale time (and thus, measuring all other rates in units of the replication rate of the microbe in the environment), λ reduces to

$${\uplambda}_{sym} = \frac{1}{2}\left( {1 + r_H - 2m + \sqrt {\left( {1 - r_H} \right)^2 {\,}+ {\,}4m^2} } \right)$$

(4)

For any fixed positive value of m, λ_sym is a strictly increasing function of r_H, which reflects the fact that increasing r_H allows for additional growth within the host. We will limit ourselves to the study of r_H ≥ −1, which ensures a positive value for λ_sym. For any fixed value of r_H, λ_sym is a decreasing function of m, which reflects the fact that for increasing m, microbes are increasingly lost towards the host, where growth is slower than in the environment. Figure 1C shows the value of λ_sym on the reduced phenotypic space defined by r_H and m. The maximum possible value for λ is 1 (in units of r_E). This value is achieved either by increasing the ratio of replication rates between host and environment, so that the replication rates in both compartments are identical (strategy I), or by reducing migration between host and environment, and in particular, by reducing m_H (strategy II). This second strategy allows microbes to spend a longer time in the environment on average. Note however, that this strategy is limited, since setting m to zero decouples the two compartments completely, in which case the microbial lineage is no longer subject to a multi-step life cycle.

How strong is the selection on these traits? This question can be approached by inferring how strongly the overall growth rate depends on the traits we are considering. One standard approach to measure this is sensitivity analysis [4]. One defines the sensitivity of the overall growth rate λ achieved by the phenotype described by the vector x = (x₁,…, x_N) in the trait space to its ith life-history trait as

$$s_{\mathrm{i}}\left( {\mathbf{x}} \right) = \left. {\frac{{\partial {\uplambda}}}{{\partial {\mathrm{x}}_{\mathrm{i}}}}} \right|_{\mathbf{x}}$$

(5)

This quantity gives the change in the value of λ that results from a small increment of the trait i. It is a local property that can be calculated for each point ${\mathbf{x}}$ of the trait space. The vector of the sensitivities at point ${\mathbf{x}}$ gives the direction of the selection gradient on the fitness landscape. In other words, to achieve efficient phenotypic adaptation, the lineage should move in the trait space following the direction of this gradient.

If the lineage can invest in phenotypic adaptation only by tuning one of its life-history traits at a time, then it should act upon the trait that has the largest (absolute) sensitivity at the current position of the lineage in the trait space. In our model, in all generic cases (i.e., when m > 0), the largest sensitivity is always associated to the increase of the trait r_E, the replication rate in the fast-growing compartment. However, we assume that the considered microbial lineage is initially fully adapted to the environment, so that it has reached its evolutionary limit, and we can essentially ignore the sensitivity to r_E throughout the manuscript to focus on the sensitivity to the other traits. This reasoning allows to divide the trait space into regions of distinct optimal strategies, as shown in Fig. 1C. In the regime of high migration rates (i.e., when the switch between the compartments is so rapid that the microbial lineage is almost experiencing a habitat having average properties between the host and the environment), strategy I (increasing r_H) becomes almost always optimal, except for small replication ratios, where there is almost no replication in the host. In summary, migration rates are important when replication in the host is slow compared to the environment, and when migration itself is slow. These conclusions remain qualitatively unchanged with asymmetric migration rates, although a third optimal strategy (increasing m_E) appears for an intermediate region of the traits space when the asymmetry is important (see electronic Supplementary Material (ESM) section 1 and Supplementary Fig. S1).

Model with global competition between all microbes

In the baseline model, there are no constraints on growth. In nature, however, microbes do face limits to their growth. Since the equations above are linear and can only give rise to exponential growth or exponential decay, they can only describe the microbial dynamics over a limited period of time. In order to account for saturation and competition during growth, we thus need to introduce non-linear terms to the equations (1). The study of this kind of systems often focus on long-term dynamics, yet it can be of high practical relevance to study the transient optimal strategies, as shorter timescales are often relevant in the real world – whether it be due to experimental constraints or to ecological disturbances and perturbations [20]. Since we are going to consider some out-of equilibrium dynamics, in particular in the section with competition limited to one of the compartments, and because we are also interested in transient properties, we will adopt a numerical approach based on the number of microbes [21, 22].

In this section, we study the case of a microbial lineage constrained by global competition occurring at rate k = k_HH = k_EE = k_EH = k_HE. This situation could correspond to a host-associated microbe living in direct contact with an external environment, e.g., on the surface of an organism. Alternatively, what we call the “environment” in our model could represent another host compartment in direct contact with the other, like the gut lumen and the colonic crypts. In that case, microbes living in association with the host are in direct contact with those in the environment and can mutually impact each other’s growth. This is of particular relevance if microbes living in both compartments rely on and are limited by the same nutrients for growth.

From the microbial abundances in the two compartments obtained by numerically solving the equations, one can build a proxy for the overall growth rate of the microbial lineage. To remain consistent with the previous section, we define

$$\varLambda \left( {\mathbf{x}} \right) = \frac{1}{{t_{max}}}\log \left( {\frac{{n_E\left( {t_{max}} \right) + n_H\left( {t_{max}} \right)}}{{n_E\left( 0 \right) + n_H\left( 0 \right)}}} \right)$$

(6)

i.e., the effective exponential growth rate of the microbial lineage over a chosen period of time [0, t_max]. Figure 2A provides a graphical explanation for the expression of Λ. There are indeed several fundamental differences between the effective exponential growth rate Λ in a non-linear system and the asymptotic growth rate λ in a linear system, the dominant eigenvalue of the projection matrix as defined in the baseline model. First, Λ provides a measure of growth for the whole lineage, but is not an asymptotic growth rate (as compared to λ in the baseline model): in the case of global saturation, replication stops when the carrying capacity is reached, and the asymptotic growth rate for the whole lineage would thus be zero. Therefore, the choice of the probing time t_max has an impact on Λ, as shown in Fig. 2A. Second, the choice of the exact form of Λ now implies biological assumptions on the selection pressure experienced by the microbial lineage: choosing the effective exponential growth rate over the whole lineage as we do implies that selection is acting on both compartments evenly. There may be some situations in which the microbes in one of the compartments only are artificially selected for (e.g., as part of the protocol of an evolution experiment). In such cases, it would make sense to define Λ as the effective exponential growth rate over just this compartment. This may lead to different conclusions, in particular at the transient scale. One must thus adapt Λ to the specifics of the modeled system. In addition, the choice of t_max itself has a biological meaning, and should in particular not exceed the time upon which the dynamics of the system are accurately described by the set of equations. This may also be determined by experimental times.

**Fig. 2: Optimal strategies in the model with global competition.**

We now calculate the sensitivity of Λ in the direction of the trait i at the point x of the phenotypic space as

$$S_i = \frac{{\varLambda \left( {x_1,x_2, \ldots ,x_{i - 1},x_i + \delta x_i,x_{i + 1}, \ldots ,x_N} \right) - \varLambda \left( {x_1,x_2, \ldots ,x_N} \right)}}{{\delta x_i}}$$

(7)

with δx_i the discretization interval, and N the number of traits defining a phenotype x.

For this numerical approach, additional choices need to be made. First, the trait space needs to be discretized. Then, to calculate Eq. (7), one needs to choose a set of initial conditions and a probing time at which to measure the microbial abundances, as exposed in detail for the linear case in [20]. Finally, we need to choose the discretization interval δx_i. In the following, we always choose δx_i sufficiently small for convergence, i.e., so that it does not significantly impact the numerical values of the sensitivities, and focus on the choices of the other parameters (probing time and initial conditions) and the influence of the competition intensity k. One strategy to explore the possible impact of initial conditions is to use “stage biased vectors” [20], i.e., extreme initial distributions of microbes across the two compartments. This corresponds to initial conditions where microbes either exist only in the host or only in the environment.

In Fig. 2B, we show how the contour lines delimiting the two optimal strategies change with the final time t_max chosen to measure the overall growth rate and with the intensity of competition k, for a mixed initial condition (n_E(0) = 0.5, n_H(0) = 0.5), and Supplementary Fig. S2 shows how this is modified with stage biased vectors. In all cases, with sufficiently long t_max, the contours converge to the contour plot of the baseline model shown in the previous section. This is expected, since competition here affects all the microbes in the same way, so that the equilibrium distribution is the same as the asymptotic distribution of the baseline model (given by the dominant eigenvector). Mathematically, global competition can be seen as a modification of the baseline projection matrix by subtracting an identity matrix times a scalar depending on time. This does neither affect the eigenvectors nor the dependence of the dominant eigenvalue on the traits.

In the case where all the microbes are initially in the environment (Supplementary Fig. S2A), there is no transient effect and whichever t_max is chosen, all the contour lines collapse to the limit of the baseline case. In the case where all the microbes are initially in the host (Supplementary Fig. S2B), a third optimal strategy transiently appears (increasing m_E) and remains at long times around m = 0. In this unfavorable condition (m = 0 and an initially empty environment), increasing the microbial flux towards the environment becomes more important than limiting the flux of microbes leaving it (which is nonexistent when m = 0).

Finally, we observe that the intensity of competition has only a small effect on the contours (Fig. 2B and S2B), but increasing k appears to slightly accelerate convergence to the baseline contour. By limiting growth in the host compartment – when it is initially relatively more populated than in the asymptotic distribution – competition facilitates the convergence to the baseline asymptotic distribution, where most of the microbes live in the environment.

Model with competition within one of the compartments only

In this section we consider competition happening inside one of the compartments only (i.e., k_EH = k_HE = 0 and k_EE ≠ 0 or k_HH ≠ 0). We will start by considering competition in the host only (the slow-replicating compartment). In a second step we also look at the case with competition limited to the environment. One should bear in mind that it also covers the case of competition limited to a host where replication is faster than in the environment (r_H > r_E), provided a switch of the H and E index.

In the case where competition is limited to only one of the compartments, we do not expect an equilibrium to exist for all traits combination of the phenotypic space. If migration is not sufficiently important, the number of microbes in the unconstrained compartment keeps increasing exponentially faster than the number of microbes in the constrained compartment, which contribution to the whole lineage thus becomes rapidly negligible. At sufficiently high migration rates however, an equilibrium is expected, because microbes switch habitats sufficiently rapidly for competition to be globally effective, although it directly affects only one of the compartments.

Competition in the host only (slow-replicating compartment)

When there is competition in the host only, there is no (positive) equilibrium for all m_H < r_E = 1 (Fig. 3B). In this case, replication inside the host should have less importance for the lineage because the number of microbes associated to the host becomes negligible compared to the ones present in the environment. In this region of the phenotypic space we thus expect the sensitivity of Λ to the parameter r_H to tend to zero with increasing probing times t_max or intensity of competition k = k_HH, whatever be the other parameters (initial conditions, intensity of competition). When migration out of the environment is sufficiently important for an equilibrium to exist, we can derive the expression of the number of microbes at equilibrium analytically and perform a sensitivity analysis to determine the limit of the contour line separating the regions of optimality of the different strategies.

Figure 3 verifies these verbal arguments. As expected, for a fixed t_max, we recover the shape of the fitness landscape of the baseline model for small values of k = k_HH. When increasing k, the values of Λ become smaller overall: growth is slower due to competition. For small k values, the contour delimiting strategy I from II is close to the baseline limit: the effect of competition is negligible. With increasing values of k, strategy I (increasing r_H) sees its area of optimality reduced out of the m_H < r_E = 1 region, until the contour converges to the limit of equal sensitivities of the number of microbes at equilibrium (Fig. 3A, B, and Supplementary Fig. S3).

**Fig. 3: Optimal strategies in the model with competition in the host only.**

When initially the microbes are in the host only (Supplementary Fig. S3A, C), we can again observe the appearance of the third strategy (increasing m_E), around m = 0. Indeed, when m = m_E = m_H = 0 initially, decreasing m_H (strategy II) has no effect, while increasing m_E will allow the colonization of the initially empty environment.

Finally, the impact of increasing the probing time t_max at fixed k is similar in every way to increasing the competition intensity k at fixed t_max (Supplementary Fig. S3B, C).

Competition in the environment only (fast-replicating compartment)

When there is competition in the environment only, there is no (positive) equilibrium for all m_E < r_H. In this region of the phenotypic space, the number of microbes in the environment becomes substantially smaller than the number present in the host after some time. As a consequence, strategy I (increasing the replication rate within the host) becomes more important, so that we see its area of optimality extend, see Supplementary Fig. S4. For a fixed t_max, with a small value of k we recover the shape of the fitness landscape from the baseline model with no competition, but increasing k shifts the contour line to lower r_H until the strategy II (decreasing m_H) disappears from the m_E < r_H region and the delimitation of the strategies approaches the contour of equal sensitivities of the number of microbes at equilibrium, calculated analytically. Remarkably, we also observe the appearance of a fourth optimal strategy around m = 0, increasing m_H. Intuitively, initial conditions where all the microbes are initially located in the (fast-replicating) environment are less favorable when there is competition in the environment, so that migration towards the host (where growth remains unconstrained) becomes more important when the migration rates are initially small. Similar to the previous case, when initially microbes are in the host only (Supplementary Fig. S5A, C), the third strategy (increasing m_E) prevails around m = 0. As before, the impact of increasing the probing time t_max at fixed k is similar in every way to increasing the competition intensity k at fixed t_max (Supplementary Fig. S5B, C).

Competition of equal intensity within each compartment

When there is competition of equal intensity in the host and the environment (i.e., k_EH = k_HE = 0 and k_EE = k_HH = k), we observe very similar results to the previous section, with competition in the environment only (see Fig. 4 and Supplementary Fig. S6): increasing k or increasing t_max leads to the disappearance, at long times, of the area of optimality of strategy II (decreasing m_H), except for a distinct region of small r_H and intermediate m, predicted by the contour of equal sensitivities of the number of microbes at equilibrium. Strategy IV (increasing m_H) is optimal around m = 0. This implies that the effect of competition in the fast-replicating compartment has a dominating effect on the overall growth rate.

**Fig. 4: Optimal strategies in the model with equal intensity of competition within each compartment.**

Discussion

Out in the wild, microbial lineages are often subject to multi-step life cycles, where they alternate between at least two habitats. Each of the steps of these life cycles can contribute to the overall reproductive success. In general, microbial fitness is thus more complex than the common approximation of growth yield used in the lab. This is particularly true for microbes with life cycles that involve a host-associated phase and a free-living phase, as commonly observed for many host-associated microbiota members [19]. In this case, selection should favor traits which ensure both high reproductive rates within each habitat, but also successful transmission between them. A framework to study fitness in all its complexity is needed in the field of microbiota studies, which could benefit from some of the mathematical tools first introduced in demography, as the ones used in this work. Here, we investigate a model of a microbial lineage living, replicating, migrating, and competing in and between two compartments: a host – assumed to be, throughout the paper, a compartment where replication is slower – and an environment. To analyze the selection gradient experienced by the microbial lineage going through this biphasic life cycle – with phases in the environment and phases in the host – we perform sensitivity analysis. We focus on the leading direction of the selection gradient at each point of the phenotypic space, thereby defining an optimal strategy for the microbial lineage to maximize its fitness.

We show that in the case of unconstrained exponential growth in both the compartments, there are two optimal strategies: increasing the replication rate in the host compared to the environment (strategy I), and decreasing the migration rate to the host (strategy II) to maximize the time spent in the fast-replicating compartment. The first strategy is optimal at initially high within-host replication rates and high migration rates, while the second strategy is optimal at initially small migration rates and small within-host replication rates.

Next, we extend the model to a scenario where microbial growth is limited by competition. We start with global competition, a case which could describe competition for a resource homogeneously shared between the host and the environment. Biologically, this corresponds to communities of microbes that are associated with hosts, but have extensive contact with the environment, as the skin or other epithelial microbiota for example [23, 24]. In this case, we show that apart from a transient effect, the optimality of the strategies is conserved from the case without competition. With competition in the host only (the slow-replicating compartment), at longer probing times, or at higher competition intensities, the strategy I (increasing the ratio of replication rates) is disfavored when migration out of the environment is slower than replication in the environment, i.e., where there is no equilibrium. Strategy II (decreasing migration to the host) thus increases its area of optimality. Inversely, with competition in the environment only (the fast-replicating compartment), or with competition of equal intensity within the host and within the environment, the strategy II is disfavored when migration out of the host is slower than replication in the host, leaving strategy I as the only optimal strategy in this region of the parameter space. Unsurprisingly, this suggests that competition within the fast-replicating compartment dominates the effect on the selection gradient.

While this analysis provides crucial information on the selection gradient that shapes microbial adaptation to life cycles involving host association, it does not take into account the evolvability of the traits themselves. Although the selection gradient is a good indicator of the expected evolutionary path in the phenotypic space, the underlying genotype/phenotype mapping does not always allow for this path to be taken [25,26,27,28], and the outcome of evolution may thus be different. The discrete nature, the non-additivity and non-linearity of genetic information, as well as the existence of costs, trade-offs and evolutionary constraints may prevent the predicted continuous change on the phenotypic trait. In addition, using sensitivities is built on the assumption that adaptation generates additive changes in life-history traits. Although this is a common assumption, different choices are sometimes made. For example, multiplicative changes of the traits are assumed in elasticity analysis [4, 21, 27, 29], which presents the advantage of manipulating only proportional changes and thus non-dimensional quantities, but deals poorly with traits that can take the value of zero. These fundamental assumptions can sometimes result in different inferred selection gradients, as was shown for example in the context of age-classified populations [30].

Stepping back, we can evaluate the predictions of our model in the light of biological observations. Evolution experiments where microbial lineages are serially passaged through a host and an environment are of particular interest here, to assess the response to selection resulting from biphasic life cycles. The key role of microbial immigration during the initial adaptation to their zebrafish host has for example been highlighted in [31]. In Drosophila [32] and in C. elegans [33], experimental selection towards host association resulted in adaptive changes in microbial life history with a direct impact on host fitness. In detail, in the first case, there is evolution towards by-product mutualism, and in the second, which concerns an initially pathogenic population, evolution towards less virulence and an increased carrying capacity.

Conceptually, using an integrative, overall growth rate as a measure of fitness across the life cycle provides a complementary insight to invasion fitness approaches [34, 35] developed to analyze such evolution experiments, for example in [36, 37]. While invasion fitness analysis relies on assessing the long term chances of successful invasion of an established population at equilibrium by a new mutant strain of defined traits values, sensitivity analysis of the overall growth rate provides a systematic framework that can be applied to out-of-equilibrium systems, and provides information on shorter time scales. Both frameworks rely on different proxies to assess a fitness capturing its different components - in one case, the frequency of patches where the microbe is present, and in the other, the overall growth rate, but both frameworks converge on the key role of migration between compartments. In fact, in many common cases like global competition, the long-term predictions of invasion fitness are recovered with the sensitivity analysis of the effective growth rate by setting t_max sufficiently large [21].

In future work, our framework could be extended in different directions to capture additional characteristics of microbial life cycles in host association. The first extension could be to increase the number of compartments. While the question of fluctuating environments has been studied before, in discrete times or in a different context [8, 21], in our context it may be profitable to consider and include host population dynamics. This would notably allow us to include microbial traits that affect host fitness in our analysis. A second direction could be to include non-homogeneities and stochasticity. A first step could be to introduce several interacting taxa with different life-history traits, and assess how the presence of additional taxa potentially modifies the selection on the taxon of focus. Secondly, our deterministic description is valid only if the number of microbes is sufficiently large at all times and can only describe the average selection gradient experienced by the lineage. Introducing stochasticity would crucially allow the study of differentiation, which may play an essential role in the response to multi-step life cycles which include replication in several steps. Differentiation, in the form of speciation, phenotypic plasticity, or bet-hedging is indeed observed in evolution experiments and natural microbial populations [38,39,40,41,42,43]. It is also observed in host-associated populations [44] and may thus be expected in evolution experiments that include a host-association phase. In a stochastic setting with mutation of the life-history traits, it could be important to also incorporate other mechanisms of transfer of genetic information, such as horizontal gene transfer and recombination, which could decelerate or even prevent differentiation [45, 46]. Finally, a key aspect that we have so far excluded is spatiality. Effects of spatiality on the selection gradient are known for example in a simple Petri dish system, where the existence of an optimal expansion speed for a given habitat size is shown [47, 48]. Generally, hosts are highly structured habitats with variation in nutrients and chemical and physical gradients shaping for example the gut [49,50,51], which may also favor differentiation. The introduction of several compartments or sub-compartments within the hosts could represent a first step in this direction.

In conclusion, the framework we introduce here with a minimal model provides a basis to study the consequences of habitat switching for microbes, and will allow to explore additional aspects of host association in the future. It meets the need to conceptualize fitness as a holistic measure that captures all the aspects of microbial life cycles. With the development of this framework, we aim to contribute to a better understanding of the mutual benefits that microbes and hosts can retrieve from such associations.

Data availability

The Mathematica files to produce the figures are available at https://github.com/flobansept/microbes_life_history_selection.

References

Lewontin RC. The units of selection. Ann Rev Ecol Syst. 1970;1:1–18.
Article Google Scholar
Doebeli M, Ispolatov Y, Simon B. Towards a mechanistic foundation of evolutionary theory. Shou W, Herausgeber. eLife. 2017;6:e23804.
Article PubMed PubMed Central Google Scholar
Moran NA. Adaptation and constraint in the complex life cycles of animals. Annu. Rev. Ecol. Syst. 1994;25:573–600.
Article Google Scholar
Caswell H. Matrix population models. 2nd Aufl. Sunderland MA: Sinauer Associates; 2001.
Hanski I. Metapopulation dynamics. Nature. 1998;396:41–9.
Article CAS Google Scholar
Andow DA, Kareiva PM, Levin SA, Okubo A. Spread of invading organisms. Landscape Ecol. 1990;4:177–88.
Article Google Scholar
Pichugin Y, Peña J, Rainey P, Traulsen A. Fragmentation modes and the evolution of life cycles. PLoS Comput. Biol. 2017;13:e1005860.
Article PubMed PubMed Central Google Scholar
Pichugin Y, Park H, Traulsen A. Evolution of simple multicellular life cycles in dynamic environments. J R Soc Interface. 2019;16:154.
Article Google Scholar
Goodrich‐Blair H, Clarke DJ. Mutualism and pathogenesis in Xenorhabdus and Photorhabdus: two roads to the same destination. Mol Microbiol. 2007;64:260–8.
Article PubMed Google Scholar
Ciche TA, Darby C, Ehlers R-U, Forst S, Goodrich-Blair H. Dangerous liaisons: the symbiosis of entomopathogenic nematodes and bacteria. Biological Control. 2006;38:22–46.
Article Google Scholar
Hrček J, Parker BJ, McLean AHC, Simon J-C, Mann CM, Godfray HCJ. Hosts do not simply outsource pathogen resistance to protective symbionts. Evolution. 2018;72:1488–99.
Article Google Scholar
Consuegra J, Grenier T, Baa-Puyoulet P, Rahioui I, Akherraz H, Gervais H, et al. Drosophila-associated bacteria differentially shape the nutritional requirements of their host during juvenile growth. PLOS Biol. 2020;18:e3000681.
Article CAS PubMed PubMed Central Google Scholar
Zimmermann J, Obeng N, Yang W, Pees B, Petersen C, Waschina S, et al. The functional repertoire contained within the native microbiota of the model nematode Caenorhabditis elegans. ISME J. 2020;14:26–38.
Article CAS PubMed Google Scholar
Combes C. Fitness of parasites: pathology and selection. Int J Parasitol. 1997;27:1–10.
Article CAS PubMed Google Scholar
Gandon S. Evolution of multihost parasites. Evolution. 2004;58:455–69.
PubMed Google Scholar
Brown SP, Cornforth DM, Mideo N. Evolution of virulence in opportunistic pathogens: generalism, plasticity, and control. Trends Microbiol. 2012;20:336–42.
Article CAS PubMed PubMed Central Google Scholar
Park M, Loverdo C, Schreiber SJ, Lloyd-Smith JO. Multiple scales of selection influence the evolutionary emergence of novel pathogens. Philos Trans R Soc Lond B Biol Sci. 2013;368:20120333. https://doi.org/10.1098/rstb.2012.0333.
Sieber M, Traulsen A, Schulenburg H, Douglas AE. On the evolutionary origins of host-microbe associations. Proc Natl Acad Sci. 2021;118:e2016487118.
Article CAS PubMed PubMed Central Google Scholar
Obeng N, Bansept F, Sieber M, Traulsen A, Schulenburg H. Evolution of microbiota-host associations: the microbe’s perspective. Trends Microbiol. 20212:S0966-842X(21)00041-X. https://doi.org/10.1016/j.tim.2021.02.005.
Stott I, Townley S, Hodgson DJ. A framework for studying transient dynamics of population projection matrix models. Ecol Lett. 2011;14:959–70.
Article PubMed Google Scholar
Grant A, Benton TG. Elasticity analysis for density-dependent populations in stochastic environments. Ecology. 2000;81:680–93.
Article Google Scholar
Grant A. Selection pressures on vital rates in density-dependent populations. Proc Biol Sci. 1997;264:303–6.
Article PubMed Central Google Scholar
Chen YE, Fischbach MA, Belkaid Y. Skin microbiota–host interactions. Nature. 2018;553:427–36.
Article CAS PubMed PubMed Central Google Scholar
Fraune S, Bosch TC. Long-term maintenance of species-specific bacterial microbiota in the basal metazoan Hydra. Proc Natl Acad Sci USA. 2007;104:13146–51.
Article CAS PubMed PubMed Central Google Scholar
Orr HA. The genetic theory of adaptation: a brief history. Nat Rev Genet. 2005;6:119–27.
Article CAS PubMed Google Scholar
Lande R. A quantitative genetic theory of life history evolution. Ecology. 1982;63:607–15.
Article Google Scholar
Tienderen PHV. Elasticities and the link between demographic and evolutionary dynamics. Ecology. 2000;81:666–79.
Article Google Scholar
Houle D. Comparing evolvability and variability of quantitative traits. Genetics. 1992;130:195–204.
Article CAS PubMed PubMed Central Google Scholar
Benton T, Grant A. Elasticity analysis as an important tool in evolutionary and population ecology. Trends Ecol Evol. 1999;14:467–71.
Article CAS PubMed Google Scholar
Baudisch A. Hamilton’s indicators of the force of selection. Proc Natl Acad Sci USA. 2005;102:8263–8.
Article CAS PubMed PubMed Central Google Scholar
Robinson CD, Klein HS, Murphy KD, Parthasarathy R, Guillemin K, Bohannan BJM. Experimental bacterial adaptation to the zebrafish gut reveals a primary role for immigration. PLOS Biol. 2018;16:e2006893.
Article CAS PubMed PubMed Central Google Scholar
Martino ME, Joncour P, Leenay R, Gervais H, Shah M, Hughes S, et al. Bacterial adaptation to the host’s diet is a key evolutionary force shaping drosophila-lactobacillus symbiosis. Cell Host Microbe. 2018;24:109–119.e6.
Article CAS PubMed PubMed Central Google Scholar
Jansen G, Crummenerl LL, Gilbert F, Mohr T, Pfefferkorn R, Thänert R. Evolutionary transition from pathogenicity to commensalism: global regulator mutations mediate fitness gains through virulence attenuation. Mol Biol Evol. 2015;32:2883–96.
Article CAS PubMed PubMed Central Google Scholar
Hurford A, Cownden D, Day T. Next-generation tools for evolutionary invasion analyses. J R Soc Interface. 2010;7:561–71.
Article PubMed Google Scholar
Nguyen PL, van Baalen M. On the difficult evolutionary transition from the free-living lifestyle to obligate symbiosis. PLOS One. 2020;15:e0235811.
Article CAS PubMed PubMed Central Google Scholar
Miller ET, Svanbäck R, Bohannan BJM. Microbiomes as Metacommunities: Understanding Host-Associated Microbes through Metacommunity Ecology. Trends Ecol Evol. 2018;33:926–35. https://doi.org/10.1016/j.tree.2018.09.002.
Miller ET, Bohannan BJM. Life between patches: incorporating microbiome biology alters the predictions of metacommunity models. Front Ecol Evol. 2019;7:276. https://doi.org/10.3389/fevo.2019.00276.
Rainey PB, Travisano M. Adaptive radiation in a heterogeneous environment. Nature. 1998;394:69–72.
Article CAS PubMed Google Scholar
Beaumont HJE, Gallie J, Kost C, Ferguson GC, Rainey PB. Experimental evolution of bet hedging. Nature. 2009;462:90–3.
Article CAS PubMed Google Scholar
Zhang X-X, Rainey PB. Bet hedging in the underworld. Genome Biol. 2010;11:137.
Article PubMed PubMed Central Google Scholar
Medina I, Langmore NE. Coevolution is linked with phenotypic diversification but not speciation in avian brood parasites. Proc R Soc B. 2015;282:20152056.
Article PubMed PubMed Central Google Scholar
Xue B, Leibler S. Evolutionary learning of adaptation to varying environments through a transgenerational feedback. Proc Natl Acad Sci USA. 2016;113:11266–71.
Article CAS PubMed PubMed Central Google Scholar
Moreno-Gámez S, Kiviet DJ, Vulin C, Schlegel S, Schlegel K, van Doorn GS, et al. Wide lag time distributions break a trade-off between reproduction and survival in bacteria. Proc Natl Acad Sci USA. 2020;117:18729–36.
Article PubMed PubMed Central Google Scholar
Ashish A, Paterson S, Mowat E, Fothergill JL, Walshaw MJ, Winstanley C. Extensive diversification is a common feature of Pseudomonas aeruginosa populations during respiratory infections in cystic fibrosis. J Cystic Fibros. 2013;12:790–3.
Article Google Scholar
Fraser C, Hanage WP, Spratt BG. Recombination and the nature of bacterial speciation. Science. 2007;315:476–80.
Article CAS PubMed PubMed Central Google Scholar
Garud NR, Pollard KS. Population genetics in the human microbiome. Trends Genet. 2020;36:53–67.
Article CAS PubMed Google Scholar
Liu W, Cremer J, Li D, Hwa T, Liu C. An evolutionarily stable strategy to colonize spatially extended habitats. Nature. 2019;575:664–8.
Article CAS PubMed PubMed Central Google Scholar
Mattingly H, Emonet T. A rule from bacteria to balance growth and expansion. Nature. 2019;575:602–3.
Article CAS PubMed Google Scholar
Schlomann BH. Stationary moments, diffusion limits, and extinction times for logistic growth with random catastrophes. J Theor Biol. 2018;454:154–63.
Article PubMed PubMed Central Google Scholar
Schlomann BH, Wiles TJ, Wall ES, Guillemin K, Parthasarathy R. Bacterial cohesion predicts spatial distribution in the larval zebrafish intestine. Biophys J. 2018;115:2271–7.
Article CAS PubMed PubMed Central Google Scholar
Donaldson GP, Lee SM, Mazmanian SK. Gut biogeography of the bacterial microbiota. Nat Rev Microbiol. 2016;14:20–32.
Article CAS PubMed Google Scholar

Download references

Acknowledgements

The authors thank the Evolutionary Theory Department in the MPI Ploen for useful feedback and discussions, Stefano Giaimo, Román Zapién-Campos, and Claude Loverdo for careful reading of an earlier version of the manuscript, and two anonymous reviewers for useful comments and suggestions. All authors acknowledge funding and support from the CRC 1182: Origins and Functions of Metaorganisms, project A4.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

Max-Planck-Institute for Evolutionary Biology, Ploen, Germany
Florence Bansept, Hinrich Schulenburg & Arne Traulsen
Department of Evolutionary Ecology and Genetics, University of Kiel, Kiel, Germany
Nancy Obeng & Hinrich Schulenburg

Authors

Florence Bansept
View author publications
Search author on:PubMed Google Scholar
Nancy Obeng
View author publications
Search author on:PubMed Google Scholar
Hinrich Schulenburg
View author publications
Search author on:PubMed Google Scholar
Arne Traulsen
View author publications
Search author on:PubMed Google Scholar

Corresponding author

Correspondence to Florence Bansept.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary material

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Bansept, F., Obeng, N., Schulenburg, H. et al. Modeling host-associating microbes under selection. ISME J 15, 3648–3656 (2021). https://doi.org/10.1038/s41396-021-01039-0

Download citation

Received: 22 January 2021
Revised: 28 May 2021
Accepted: 09 June 2021
Published: 22 June 2021
Version of record: 22 June 2021
Issue date: December 2021
DOI: https://doi.org/10.1038/s41396-021-01039-0

This article is cited by

Bacterial c-di-GMP has a key role in establishing host–microbe symbiosis
- Nancy Obeng
- Anna Czerwinski
- Hinrich Schulenburg
Nature Microbiology (2023)
Determinants of associations between codon and amino acid usage patterns of microbial communities and the environment inferred based on a cross-biome metagenomic analysis
- Arup Panda
- Tamir Tuller
npj Biofilms and Microbiomes (2023)