Selective sweep probabilities in spatially expanding populations

Stein, Alexander; Bostock, Kate; Kizhuttil, Ramanarayanan; Bak, Maciej; Noble, Robert

doi:10.1038/s41467-026-69363-7

Article
Open access
Published: 11 February 2026

Nature Communications volume 17, Article number: 2181 (2026)

Abstract

Evolution during range expansions shapes biological systems from microbial communities and tumours to invasive species. A fundamental question is whether, when a beneficial mutation arises during a range expansion, it will evade clonal interference and sweep to fixation. However, most theoretical investigations of range expansions have considered regimes in which selective sweeps are effectively impossible, while studies of selective sweeps have assumed constant population size or ignored spatial structure. Here we use mathematical modelling and analysis to investigate selective sweep probabilities and timings in biologically relevant scenarios, including the case in which mutants can displace a slowly spreading wildtype. Assuming constant expansion speed, we find surprisingly simple approximate and exact expressions for sweep probabilities in one, two and three dimensions, which are independent of mutation rate. Agent-based simulations confirm that our predictions are accurate for the spatial Moran process and remain informative when mutation effects on fitness are random and multiplicative. We further compare and synthesise our results with those obtained for alternative growth laws. Parameterised for human tumours, our model predicts that selective sweeps are rare except during early solid tumour growth, thus providing a general, pan-cancer explanation for findings from recent sequencing studies.

Introduction

Range expansion—the spatial spread of populations into new regions—is ubiquitous across biological scales and alters the course of evolution in distinct, often profound ways that remain incompletely understood¹. Among cell populations, evolution during range expansions determines the development and spatial heterogeneity of biofilms², tumours³, mosaicism⁴ and normal tissue⁵. At the species level, range expansions influenced human evolution⁶ and are of growing importance as climate change forces organisms into new habitats^7,8. Prior theoretical and experimental investigations of evolution during range expansion have considered the case in which the wildtype population spreads into new territory much faster than any mutant can displace the wildtype^9,10,11,12. In this scenario, which is typical of microbial colonies growing in vitro, mutants essentially expand only at the population boundary and selective sweeps are precluded. The alternative case of slow range expansion and strong selection has been unexplored but is widely plausible. Consider, for example, an invasive species that is rapidly adapting to its new environment while gradually displacing a resident competitor¹³, or a bacterial colony whose growth is slowed by sub-inhibitory antibiotic treatment.

Cancer provides an especially strong motivation for investigating the likelihood of selective sweeps during range expansion. Having acquired the necessary driver mutations to escape homeostasis, solid tumours continue to accumulate neutral and driver (that is, cell-fitness-enhancing) mutations as they grow and invade surrounding tissue^14,15. The mode of this evolutionary process has sparked debate^16,17. An early, highly influential model based on colorectal cancer genetics posited a linear mode of evolution in which cancers acquire mutations through sequential selective sweeps¹⁸. Later studies demonstrated intra-tumour heterogeneity with respect to both neutral and driver mutations, suggesting that most cancers undergo branching evolution before they are detected^19,20,21,22,23. In some cases, the extent of heterogeneity has been shown to predict clinical outcomes^24,25,26. A possible explanation that reconciles these observations is that linear evolution is restricted to the very early stages of solid tumour evolution and mutations with very strong selective advantage, yet a systematic study of this general, pan-cancer hypothesis is still lacking^16,17,27. Mathematical modelling offers a way to investigate the early stages of tumour evolution, which are typically impossible to observe in the clinic^28,29. We^26,30 and others^31,32,33,34 have used models with alternative spatial structures and modes of cell dispersal to examine how spatial structure influences tumour evolutionary mode and the extent of intratumour heterogeneity. However, with notable exceptions^35,36,37,38, theoretical investigations have ignored spatial structure^39,40,41 or relied on agent-based models^26,30,31,32,33,34 whose results can be difficult to interpret, provide only limited explanatory insights and are not readily generalisable.

Here we use mathematical analysis to explain why beneficial mutations typically fixate only—if at all—in the very early stages of range expansions, even when mutants can displace the wildtype faster than the wildtype expands its range. By solving our model in the canonical case of constant radial expansion speed, we derive exact and simple approximate expressions for sweep probabilities in one, two and three dimensions. We confirm the accuracy and robustness of our analytical results using extensive agent-based simulations of a spatial Moran process and we compare outcomes for alternative growth models. We discuss how these findings shed light on the nature of evolution in range expansions in general and cancer development in particular.

Results

A macroscopic model of evolution during range expansion

Our macroscopic model is designed to test whether clonal interference alone can prevent selective sweeps and to obtain upper bounds on selective sweep probabilities during range expansions. We consider a wildtype population that starts expanding spherically at time t = 0, such that its radius x_wt grows at speed c_wt. Focusing on selective sweeps, we consider only advantageous mutations, which we assume spread within the wildtype at speed c_m > c_wt (Fig. 1). Mutations occur at per-capita rate $\widetilde{\mu }$ with each surviving genetic drift with probability ρ. In our analytic model, it is enough to consider the compound parameter $\mu=\rho \widetilde{\mu }$. For brevity, we will refer to μ as the mutation rate unless otherwise mentioned.

**Fig. 1: Illustration of the macroscopic model (not to scale) showing the two possible fates of the first surviving mutant (blue) within the wildtype population (orange).**

For mathematical and biological reasons (see ‘Discussion’), we focus on a model with constant radial expansion speeds; later, we compare these results with those that pertain to alternative growth models. Various models link propagation speeds to fitness values and migration rates. The most prominent formula is obtained from a reaction-diffusion equation associated with Fisher⁴² and Kolmogorov⁴³: $c=2\sqrt{aD}$, where D is the diffusion coefficient, which captures migration, and a is the difference in local proliferation rates between the spreading population and the population it displaces. More recent studies have sought to refine and generalise this result^44,45,46,47. Our main results apply to any model that generates approximately constant speeds.

The first surviving mutation achieves a selective sweep only if it reaches every part of the wildtype expansion front before a second mutant of equal or greater fitness arises within the wildtype (Fig. 1). Otherwise, the outcome is clonal interference or possibly a soft selective sweep if the competing mutations are sufficiently similar⁴⁸. For simplicity, in our macroscopic model, we neglect mutants with fitness values between those of the wildtype and the first mutant, which would slow the expansion of the first mutant and so reduce rather than nullify the selective sweep probability. Neither do we investigate the case of a yet-fitter mutant that arises from the first mutant and achieves a selective sweep. Later, we will examine the effects of relaxing these assumptions.

The unconditional sweep probability is derived in four steps. First, we introduce the random variable X, the radius of the wildtype population when the first mutant arises and we compute its probability density f_X(x). Second, we introduce the random variable Y, the distance between the wildtype and mutant origins and we calculate its probability density conditioned on X, namely f_Y(y∣X = x). Third, we derive an expression for the conditional sweep probability Pr(sweep∣X = x, Y = y). Finally, we marginalise out X and Y to obtain the unconditional sweep probability Pr(sweep). We focus on the three-dimensional case; analogous results in one and two dimensions are presented in the SI Text Sections 5 and 6.

The following result (proved in SI Text Section 1) will be useful in various parts of subsequent derivations.

Claim 1

The number of mutations that arise and survive during the time interval [0, t] is Poisson distributed, so that the probability of exactly k such mutations is

$${P}_{k}=\frac{{e}^{-\lambda }{\lambda }^{k}}{k!}\,{{{\rm{with}}}}\,\lambda=\mu \int _{0}^{t }N(s)\,ds,$$

(1)

where N(s) is the population size at time s. In particular, the probability that no successful mutant arises is P₀ = e^−λ.

Although it is commonly assumed that mutations are coupled to divisions, it is straightforward to translate between per-capita and per-division mutation rates (see SI Text Section 2).
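Claim 1 can be illustrated with a minimal numerical sketch. It assumes the spherical wildtype growth law introduced in the next subsection and purely illustrative parameter values; the function names are hypothetical and are not taken from the original analysis.

```python
import math

def expected_mutants(mu, population_size, t, steps=10_000):
    """lambda = mu * integral_0^t N(s) ds, using the composite trapezoid rule."""
    h = t / steps
    total = 0.5 * (population_size(0.0) + population_size(t))
    total += sum(population_size(i * h) for i in range(1, steps))
    return mu * h * total

def prob_k_mutants(k, lam):
    """Poisson probability P_k = exp(-lambda) * lambda**k / k!."""
    return math.exp(-lam) * lam ** k / math.factorial(k)

# Illustrative (assumed) values: compound mutation rate, wildtype speed, elapsed time.
mu, c_wt, t = 1e-5, 1.0, 20.0

def ball_size(s):
    """Population size of a ball whose radius grows at constant speed c_wt."""
    return 4.0 / 3.0 * math.pi * (c_wt * s) ** 3

lam = expected_mutants(mu, ball_size, t)
p0 = prob_k_mutants(0, lam)  # probability that no surviving mutant has arisen by time t
print(f"lambda = {lam:.4f}, P_0 = {p0:.4f}")
```

For this growth law the integral has the closed form λ = μπc_wt³t⁴/3, which the numerical value should reproduce.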

Arrival time of the first mutant

In the absence of mutants, the wildtype population in three dimensions grows as ${N}_{{{{\rm{wt}}}}}=\frac{4}{3}\pi {x}_{{{{\rm{wt}}}}}^{3}=\frac{4}{3}\pi {({c}_{{{{\rm{wt}}}}}t)}^{3}$. Applying Claim 1, we obtain the probability that no mutations arise in the time interval [0, t],

$${P}_{0}={e}^{-\mu {\int }_{0}^{t}\frac{4}{3}\pi {({c}_{{{{\rm{wt}}}}}s)}^{3}ds}={e}^{-{\left(\frac{t}{\kappa }\right)}^{4}},$$

(2)

in terms of a characteristic duration $\kappa=\root{{4}}\of{\frac{3}{\mu \pi {c}_{{{{\rm{wt}}}}}^{3}}}$. We identify 1 − P₀ as the cumulative distribution function for the arrival time T of the first surviving mutant. The probability density of T is then

$${f}_{T}(t)=\frac{d(1-{P}_{0})}{dt}=\frac{4{t}^{3}}{{\kappa }^{4}}{e}^{-{\left(\frac{t}{\kappa }\right)}^{4}},$$

(3)

which is the Weibull distribution with shape parameter 4 and scale parameter κ. Substituting x = c_wt t, we find that f_X(x), the probability density of the radius of the wildtype population X at the time the first surviving mutant arises, is the Weibull distribution with shape parameter 4 and scale parameter $\theta=\root{{4}}\of{\frac{3{c}_{{{{\rm{wt}}}}}}{\pi \mu }}$. It follows that ${\mathbb{E}}[X]\approx 0.91\,\theta$ and Var[X] ≈ 0.065 θ². Analogous calculations in one and two dimensions yield similar Weibull distributions (SI Text Sections 5 and 6 and Fig. 2A–C).
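The Weibull moments quoted above can be checked with a short sketch that uses the standard formulas E[X] = θΓ(1 + 1/k) and E[X²] = θ²Γ(1 + 2/k) for shape k. The parameter values are illustrative assumptions and the function names are hypothetical.

```python
import math

def kappa(mu, c_wt):
    """Characteristic duration: scale of the arrival time T of the first surviving mutant."""
    return (3.0 / (mu * math.pi * c_wt ** 3)) ** 0.25

def theta(mu, c_wt):
    """Characteristic length: scale of the wildtype radius X at that time."""
    return (3.0 * c_wt / (math.pi * mu)) ** 0.25

def weibull_mean_var(scale, shape=4.0):
    """Mean and variance of a Weibull distribution with the given scale and shape."""
    m1 = scale * math.gamma(1.0 + 1.0 / shape)
    m2 = scale ** 2 * math.gamma(1.0 + 2.0 / shape)
    return m1, m2 - m1 ** 2

mu, c_wt = 1e-5, 0.5  # illustrative (assumed) values
th = theta(mu, c_wt)
mean_x, var_x = weibull_mean_var(th)
print(mean_x / th, var_x / th ** 2)  # dimensionless factors multiplying theta and theta**2
```

Because X = c_wt T, the two scales are related by θ = c_wt κ, which serves as a consistency check on the two expressions.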

**Fig. 2: Analytical and numerical solutions of the macroscopic model.**

Location of the first mutant

Next, we compute the probability density for the distance Y between the first surviving mutant and the centre of the wildtype population. Since mutants arise in proportion to the number of wildtype cells, we have f_Y(y∣X = x) dy ∝ D(y), where D(y) is the number of cells at distance y. D(y) corresponds to the infinitesimal shell, D(y) = 4πy² dy 1_[0, x](y), where the last term is an indicator function that defines the boundary of the wildtype population. After normalisation, we obtain

$${f}_{Y}(y| X=x)=\frac{3{y}^{2}}{{x}^{3}}{{{{\bf{1}}}}}_{[0,x]}(y).$$

(4)

We calculate the unconditional probability density of Y by marginalising out X,

$${f}_{Y}(y)=\int _{0}^{\infty }{f}_{Y}(y| X=x){f}_{X}(x)\,dx=\frac{3{y}^{2}}{{\theta }^{3}}\Gamma \left(\frac{1}{4},\frac{{y}^{4}}{{\theta }^{4}}\right),$$

(5)

where $\Gamma (a,z)={\int }_{z}^{\infty }{t}^{a-1}{e}^{-t}\,dt$ is the upper incomplete gamma function. We then find ${\mathbb{E}}[Y]\approx 0.68\,\theta$ and Var[Y] ≈ 0.070 θ². Similar results pertain in one and two dimensions (SI Text Sections 5 and 6; Fig. 2A–C).
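The moments of Y can also be recovered by direct sampling, which avoids evaluating the incomplete gamma function. The sketch below is a Monte Carlo illustration rather than the method used in the analysis: it draws X from the Weibull distribution and then draws Y from eqn. (4) by inverting its cumulative distribution (y/x)³. The sample size and seed are arbitrary assumptions.

```python
import random
import statistics

def sample_first_mutant(theta, n, seed=0):
    """Draw n pairs (X, Y): wildtype radius and distance of the first mutant from the centre."""
    rng = random.Random(seed)
    xs, ys = [], []
    for _ in range(n):
        x = rng.weibullvariate(theta, 4.0)    # scale theta, shape 4
        y = x * rng.random() ** (1.0 / 3.0)   # inverse-CDF sample from 3y^2/x^3 on [0, x]
        xs.append(x)
        ys.append(y)
    return xs, ys

xs, ys = sample_first_mutant(theta=1.0, n=200_000)
mean_y = statistics.fmean(ys)
var_y = sum((y - mean_y) ** 2 for y in ys) / len(ys)
print(mean_y, var_y)  # in units of theta and theta**2
```

With θ = 1 the sample mean and variance should lie close to the quoted values of 0.68 and 0.070, up to sampling error.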

Conditional sweep probability

To compute the sweep probability, we first need an expression for the remaining wildtype population, N_wt. Therefore, we introduce time measure τ, with τ = 0 when the first mutant arises. Recall that t = 0 at the origin of the wildtype population, so t = τ + x/c_wt. Once the mutant population starts expanding, we have ${N}_{{{{\rm{wt}}}}}(\tau )={\widetilde{N}}_{{{{\rm{wt}}}}}(\tau )-\Delta (\tau )$, where Δ(τ) is the number of wildtype cells that the mutant has replaced and ${\widetilde{N}}_{{{{\rm{wt}}}}}(\tau )=\frac{4}{3}\pi {x}_{{{{\rm{wt}}}}}^{3}(\tau )$ is the wildtype population size had there been no mutant. While the mutant is surrounded by the wildtype, we have $\Delta (\tau )={\Delta }_{1}(\tau )=\frac{4}{3}\pi {x}_{{{{\rm{m}}}}}^{3}(\tau )$. After the mutant breaches the wildtype boundary, Δ(τ) is given by the intersection formula of two balls, which we denote Δ₂(τ) (see SI Text Section 3 for the explicit formula). Together, we have

$$\begin{aligned}{N}_{{{{\rm{wt}}}}}(\tau ) &= ({\widetilde{N}}_{{{{\rm{wt}}}}}(\tau )-{\Delta }_{1}(\tau )){{{{\bf{1}}}}}_{[0,{\tau }_{1}]}(\tau )\\ &\quad +({\widetilde{N}}_{{{{\rm{wt}}}}}(\tau )-{\Delta }_{2}(\tau )){{{{\bf{1}}}}}_{[{\tau }_{1},{\tau }_{2}]}(\tau ),\end{aligned}$$

(6)

where τ₁ is the time at which the mutant reaches the wildtype boundary and τ₂ is the time at which the mutant has entirely replaced the wildtype. Noting that

$${\tau }_{1}{c}_{{{{\rm{m}}}}}=(x-y)+{\tau }_{1}{c}_{{{{\rm{wt}}}}}\Rightarrow {\tau }_{1}=\frac{x-y}{{c}_{{{{\rm{m}}}}}-{c}_{{{{\rm{wt}}}}}}$$

(7)

and

$${\tau }_{2}{c}_{{{{\rm{m}}}}}=(x+y)+{\tau }_{2}{c}_{{{{\rm{wt}}}}}\Rightarrow {\tau }_{2}=\frac{x+y}{{c}_{{{{\rm{m}}}}}-{c}_{{{{\rm{wt}}}}}},$$

(8)

we express N_wt(τ) in terms of x and y (see SI Text Section 3). We then apply Claim 1 to obtain the conditional sweep probability,

$$\Pr ({{{\rm{sweep}}}}| X=x,Y=y)={e}^{-\mu {\int }_{0}^{\infty }{N}_{{{{\rm{wt}}}}}(\tau )d\tau }.$$

(9)

Although this integral can be solved analytically (SI Text Section 3), the resulting expression is complicated and not very enlightening and we therefore seek simpler approximations. An especially fruitful approach is to assume y = 0, so that f_Y(y∣X = x) = δ(y), where δ is the Dirac delta function. This yields an upper bound on the conditional sweep probability because, due to geometrical symmetry, the time required for a mutant to sweep (τ₂ in eqn. (8)) is minimal when y = 0. With this approximation, eqn. (9) simplifies drastically as ${\tau }_{1}={\tau }_{2}=\frac{x}{{c}_{{{{\rm{m}}}}}-{c}_{{{{\rm{wt}}}}}}$ and we do not need to integrate over Δ₂(τ). The analytic solution is

$$\Pr ({{{\rm{sweep}}}}| X=x,Y=0)={e}^{-{\left(\frac{x}{\alpha }\right)}^{4}},$$

(10)

where $\alpha=\root{{4}}\of{\frac{3{({c}_{{{{\rm{m}}}}}-{c}_{{{{\rm{wt}}}}})}^{3}}{\pi \mu \left({c}_{{{{\rm{wt}}}}}^{2}-3{c}_{{{{\rm{wt}}}}}{c}_{{{{\rm{m}}}}}+3{c}_{{{{\rm{m}}}}}^{2}\right)}}$ is a characteristic length. Despite being based on a seemingly crude approximation, eqn. (10) is remarkably close to the exact conditional sweep probability (Fig. 2D). The corresponding approximations in one and two dimensions (SI Text Sections 5 and 6) are likewise simple and useful.
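Eqn. (10) can be verified numerically. With y = 0 the remaining wildtype population is the shell between the mutant ball and the wildtype ball, N_wt(τ) = (4/3)π[(x + c_wt τ)³ − (c_m τ)³] for 0 ≤ τ ≤ x/(c_m − c_wt). The sketch below integrates this expression with the trapezoid rule and compares the result with the closed form; the parameter values are illustrative assumptions and the function names are hypothetical.

```python
import math

def alpha(mu, c_wt, c_m):
    """Characteristic length appearing in eqn. (10)."""
    d = c_m - c_wt
    return (3.0 * d ** 3
            / (math.pi * mu * (c_wt ** 2 - 3.0 * c_wt * c_m + 3.0 * c_m ** 2))) ** 0.25

def sweep_given_x_closed(x, mu, c_wt, c_m):
    """Closed form of Pr(sweep | X = x, Y = 0)."""
    return math.exp(-(x / alpha(mu, c_wt, c_m)) ** 4)

def sweep_given_x_numeric(x, mu, c_wt, c_m, steps=20_000):
    """exp(-mu * integral of N_wt) for a mutant arising at the centre (y = 0)."""
    tau_end = x / (c_m - c_wt)  # tau_1 = tau_2 when y = 0
    h = tau_end / steps

    def n_wt(tau):
        return 4.0 / 3.0 * math.pi * ((x + c_wt * tau) ** 3 - (c_m * tau) ** 3)

    integral = h * (0.5 * (n_wt(0.0) + n_wt(tau_end))
                    + sum(n_wt(i * h) for i in range(1, steps)))
    return math.exp(-mu * integral)

mu, c_wt, c_m, x = 1e-5, 1.0, 2.0, 10.0  # illustrative (assumed) values
p_closed = sweep_given_x_closed(x, mu, c_wt, c_m)
p_numeric = sweep_given_x_numeric(x, mu, c_wt, c_m)
print(p_closed, p_numeric)
```

The two values should agree to within the discretisation error of the trapezoid rule.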

Unconditional sweep probability

We now have all the necessary ingredients to compute the unconditional sweep probability by marginalising out X and Y from the conditional sweep probability,

$$\begin{aligned}\Pr ({{{\rm{sweep}}}}) &=\int _{0}^{\infty }\int _{0}^{\infty }\Pr ({{{\rm{sweep}}}}| X=x,Y=y)\,{f}_{Y}(y| X=x)\,{f}_{X}(x)\,dy\,dx\\ &=\int _{0}^{\infty }\int _{0}^{x}{e}^{-\mu \int _{0}^{\infty }{N}_{{{{\rm{wt}}}}}(\tau )d\tau }\frac{3{y}^{2}}{{x}^{3}}\frac{4{x}^{3}{e}^{-{x}^{4}/{\theta }^{4}}}{{\theta }^{4}}\,dy\,dx.\end{aligned}$$

(11)

As before, we use the approximation y = 0 to obtain the strikingly simple result

$$\begin{array}{rcl}\Pr ({{{\rm{sweep}}}}) & \le & {\int }_{0}^{\infty }\,\Pr ({{{\rm{sweep}}}}| X=x,Y=0)\,{f}_{X}(x)\,dx\\ &=& {\int }_{0}^{\infty }\,{e}^{-{\left(\frac{x}{\alpha }\right)}^{4}}\frac{4{x}^{3}{e}^{-{x}^{4}/{\theta }^{4}}}{{\theta }^{4}}\,dx\hfill\\ &=& {\left(\frac{{c}_{{{{\rm{m}}}}}-{c}_{{{{\rm{wt}}}}}}{{c}_{{{{\rm{m}}}}}}\right)}^{3}.\hfill\end{array}$$

(12)

This result generalises to analogous one- and two-dimensional models, with the exponent 3 replaced by the respective spatial dimension (SI Text Sections 5 and 6). The approximate expressions are close to the exact solution in one dimension and to numerical evaluations of the integral in two and three dimensions (Fig. 2E). In the one-dimensional case, we can moreover obtain an even better approximation by setting ${y}^{2}={\mathbb{E}}[{Y}^{2}]={x}^{2}/3$ or (better still⁴⁹) y² = 0.28125x² instead of y = 0 and hence shed light on why the upper bound is close to the exact solution (SI Text Section 5). The sweep probability is independent of the mutation rate, not only in these approximations but also in the general case of eqn. (11). This follows from a more general result (SI Text Section 4). Eqn. (12) also leads to a simple upper bound on the probability of multiple sequential sweeps (SI Text Section 10).
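The last step of eqn. (12) follows because the integral evaluates to α⁴/(α⁴ + θ⁴), in which μ cancels and the remaining ratio of speeds reduces to ((c_m − c_wt)/c_m)³. The sketch below checks this numerically for several mutation rates; the speeds and rates are illustrative assumptions and the function names are hypothetical.

```python
import math

def theta(mu, c_wt):
    return (3.0 * c_wt / (math.pi * mu)) ** 0.25

def alpha(mu, c_wt, c_m):
    d = c_m - c_wt
    return (3.0 * d ** 3
            / (math.pi * mu * (c_wt ** 2 - 3.0 * c_wt * c_m + 3.0 * c_m ** 2))) ** 0.25

def sweep_bound_numeric(mu, c_wt, c_m, steps=20_000):
    """Trapezoid-rule evaluation of the y = 0 integral in eqn. (12)."""
    th, al = theta(mu, c_wt), alpha(mu, c_wt, c_m)
    scale = (th ** -4 + al ** -4) ** -0.25  # combined decay length of the integrand
    upper = 4.0 * scale                     # the integrand is negligible beyond this point
    h = upper / steps

    def integrand(x):
        return math.exp(-(x / al) ** 4) * 4.0 * x ** 3 / th ** 4 * math.exp(-(x / th) ** 4)

    total = 0.5 * (integrand(0.0) + integrand(upper))
    total += sum(integrand(i * h) for i in range(1, steps))
    return h * total

def sweep_bound_closed(c_wt, c_m, dim=3):
    """Closed-form upper bound ((c_m - c_wt) / c_m) ** dim."""
    return ((c_m - c_wt) / c_m) ** dim

c_wt, c_m = 1.0, 2.0  # illustrative (assumed) speeds
results = {mu: sweep_bound_numeric(mu, c_wt, c_m) for mu in (1e-7, 1e-5, 1e-3)}
print(results, sweep_bound_closed(c_wt, c_m))
```

Varying μ over four orders of magnitude should leave the numerical value unchanged, in line with the stated independence of the mutation rate.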

When the expanding wildtype must displace a resident competitor (as in our agent-based simulations, to follow), the sweep probability can be approximated in terms of evolutionary parameters. For example, if we take the speed predictions from the Fisher-Kolmogorov-Petrovsky-Piscounov (FKPP) equation ${c}_{{{{\rm{wt}}}}}=2\sqrt{D{a}_{{{{\rm{wt}}}}}}$ and ${c}_{{{{\rm{m}}}}}=2\sqrt{D{a}_{{{{\rm{m}}}}}}$ with ${a}_{{{{\rm{wt}}}}}={r}_{{{{\rm{wt}}}}}-{r}_{{{{\rm{re}}}}}$ and a_m = r_m − r_wt (see ‘Methods’) and insert them into eqn. (12) then we obtain

$$\Pr ({{{\rm{sweep}}}})\approx {\left(1-\sqrt{\frac{{r}_{{{{\rm{wt}}}}}-{r}_{{{{\rm{re}}}}}}{{r}_{{{{\rm{m}}}}}-{r}_{{{{\rm{wt}}}}}}}\right)}^{d},$$

(13)

where r_m, r_wt and ${r}_{{{{\rm{re}}}}}$ are the proliferation rates of the mutant invader, wildtype invader and resident populations, respectively and d is the spatial dimension.
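Eqn. (13) translates directly into a small helper. The sketch below assumes the FKPP speed relation with a common diffusion coefficient D, which cancels in the ratio c_wt/c_m, and it rejects parameter combinations outside the regime c_m > c_wt considered in the model. The function name and example rates are hypothetical.

```python
import math

def sweep_probability_fkpp(r_m, r_wt, r_re, dim):
    """Approximate sweep probability of eqn. (13) from local proliferation rates."""
    a_wt = r_wt - r_re  # advantage of the wildtype invader over the resident
    a_m = r_m - r_wt    # advantage of the mutant over the wildtype
    if a_wt <= 0.0 or a_m <= a_wt:
        # Outside the modelled regime: requires r_wt > r_re and c_m > c_wt.
        raise ValueError("requires r_m - r_wt > r_wt - r_re > 0")
    return (1.0 - math.sqrt(a_wt / a_m)) ** dim

# Illustrative (assumed) rates: a_wt = 0.1 and a_m = 0.4, so c_wt / c_m = 0.5.
p_2d = sweep_probability_fkpp(r_m=1.5, r_wt=1.1, r_re=1.0, dim=2)
p_3d = sweep_probability_fkpp(r_m=1.5, r_wt=1.1, r_re=1.0, dim=3)
print(p_2d, p_3d)
```

In this example the mutant's advantage over the wildtype is four times the wildtype's advantage over the resident, which corresponds to a twofold difference in speed.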

Conditional arrival time of the first mutant

In biological systems where it is infeasible to track evolutionary dynamics, selective sweeps must be inferred from subsequent genetic data. For example, we might observe a public mutation in a tumour and ask when this mutation occurred. We can use our model to obtain the probability distribution of the radius X at the time the first mutant arose, given that we observe a selective sweep, by applying Bayes’ theorem,

$${f}_{X}(x| {{{\rm{sweep}}}})=\frac{\Pr ({{{\rm{sweep}}}}| X=x){f}_{X}(x)}{\Pr ({{{\rm{sweep}}}})}.$$

(14)

Using the approximation y = 0, we obtain the Weibull distribution

$${f}_{X}(x| {{{\rm{sweep}}}})\approx \frac{4{x}^{3}}{{\beta }^{3}{\theta }^{4}}{e}^{-\frac{{x}^{4}}{{\beta }^{3}{\theta }^{4}}},$$

(15)

where $\beta=\frac{{c}_{{{{\rm{m}}}}}-{c}_{{{{\rm{wt}}}}}}{{c}_{{{{\rm{m}}}}}}$. This approximation and the corresponding results in one and two dimensions are close to the exact solutions (Fig. 2F).
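Eqn. (15) is a Weibull distribution with shape 4 and scale β^(3/4)θ, so conditioning on a sweep shifts the expected radius by the factor β^(3/4). A brief sketch under the y = 0 approximation, with hypothetical function names and an assumed tenfold speed ratio:

```python
import math

def conditional_scale(theta, c_wt, c_m):
    """Scale parameter of f_X(x | sweep) in eqn. (15): beta**(3/4) * theta."""
    beta = (c_m - c_wt) / c_m
    return beta ** 0.75 * theta

def conditional_mean_radius(theta, c_wt, c_m):
    """E[X | sweep] for a Weibull distribution with shape 4."""
    return conditional_scale(theta, c_wt, c_m) * math.gamma(1.25)

# Illustrative case: the mutant spreads ten times faster than the wildtype expands.
ratio = conditional_mean_radius(theta=1.0, c_wt=1.0, c_m=10.0)
print(ratio)  # E[X | sweep] in units of theta
```

Since β < 1, the conditional mean is always smaller than the unconditional mean of about 0.91 θ: sweeps are associated with mutants that arise early.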

Sweep probabilities in alternative growth models

To examine the robustness of our findings to variations in the model assumptions, we compare them to results obtained from alternative growth models. Instead of permitting mutants to grow within the wildtype population, one might instead assume that dispersal is restricted to the wildtype population boundary. In this case, a complete sweep is impossible and we instead ask whether the first arising mutant envelops the wildtype. Interestingly, this envelopment probability is independent of the mutation rate and takes values close to the sweep probabilities of our main model³⁶ (Fig. 3A and SI Text Section 7). Although our focus is on expanding populations, we also applied our methods to compute the sweep probability in constant populations (SI Text Section 7), obtaining a result that depends on both mutation rate and population size (Fig. 3), in agreement with prior analyses^50,51.

**Fig. 3: Sweep probabilities for alternative growth models.**

We have also investigated sweep probabilities in non-spatial models. In the case of exponential growth with no competition, we find that a single mutant is unlikely to become dominant unless its exponential growth rate is several times larger than that of the wildtype and that the sweep probability is relatively insensitive to the mutation rate (Fig. 3B and SI Text Section 7). In the case of constant population size and a logistically growing mutant, the sweep probability is sensitive to both population size and mutation rate (Fig. 3B and SI Text Section 7). For a non-spatial population of constant size, the sweep probability decreases as the mutation rate increases, as in the spatial case. In summary, we find that the main conclusions drawn from our primary model also hold for alternative models of range expansions (exponential or boundary growth) but not for non-growing or tightly bounded populations.

Validation of our macroscopic model using agent-based simulations

To gauge the robustness of our theoretical predictions to the effects of stochastic growth and discretization, we measure the frequency of selective sweeps in a two-dimensional agent-based model. We suppose that the wildtype population invades a habitat initially occupied by a resident competitor, which is a plausible biological scenario for both invasive species¹³ and invasive tumours^26,30. Our agent-based model has resident, wildtype invader and mutant invader populations with local proliferation rates ${r}_{{{{\rm{re}}}}}$, r_wt and r_m,i. Localised competition between wildtype and resident slows the wildtype expansion and creates potential for selective sweeps. For consistency with our macroscopic model, we parametrised the simulations so that all mutations have equal effect (r_m,i = r_m for all i). We implemented this model using the demon agent-based modelling framework⁵² within the warlock computational workflow⁵³, which facilitates running large numbers of simulations on a high-performance computing cluster. We have previously applied the same framework to studying cancer evolution^30,53. Further model details are given in Methods.

The agent-based simulations provide useful insights in addition to the macroscopic model because, although the general setup is the same, they differ in several ways. Space in the simulations is divided into discrete patches; the times between birth and dispersal events are exponentially distributed random variables constituting another source of stochasticity; population boundaries are rough, not smooth; the expansion wave front is typically not sharp and changes shape as the wave progresses; and the mutant will have increased propagation speed when competing with the resident population. Hence, we would not expect perfect agreement between the results of the two models.

Our agent-based model approximately resembles a spatial death-birth Moran process (also known as the stepping stone model)^38,54. Expansion speeds in the spatial Moran process can in turn be approximated using the FKPP equation^46,47, which predicts that the mutant will expand within the wildtype with constant radial expansion speed c_m dependent on the difference in their local proliferation rates, a_m = r_m − r_wt (refs. 42, 43, 45). Analogously, we have a constant expansion speed of the wildtype c_wt dependent on ${a}_{{{{\rm{wt}}}}}={r}_{{{{\rm{wt}}}}}-{r}_{{{{\rm{re}}}}}$. To compare the results of our discrete-space simulations to our continuous-space macroscopic model, we measured the propagation speeds of the wildtype within the resident and of the mutant within the wildtype (see ‘Methods’). Further investigations of propagation speeds in this model are the subject of a manuscript in preparation.

Given the considerable differences between the models, the probability density functions resulting from the macroscopic and microscopic models are reassuringly consistent. The radius at the time the first surviving mutant arises is slightly lower in the simulations than in our analytical model (Fig. 4A). Similarly, the distribution for the location of the first surviving mutant coincides well except for a small offset of the mean (Fig. 4B). Such offsets are expected due to discretization effects and the fact that, in the simulations, the propagation front needs to be established before the expansion can proceed. The sweep probabilities in simulations are nevertheless very close to our analytical predictions and change very little when varying the mutation rate over orders of magnitude, confirming our prediction that the sweep probability is independent of the mutation rate (Fig. 4C and Supplementary Figs. S1–S4).

**Fig. 4: Agent-based simulation results versus macroscopic model predictions.**

Random fitness effects

Since the effects of mutations are, in reality, not equal but random, we next used our agent-based model to test whether our predictions remain informative when mutation fitness effects are drawn from an exponential distribution, while still assuming that no mutant can accumulate more than one mutation. Because the random fitness effect determines not only a mutant’s expansion speed but also its probability of evading stochastic extinction, it is more appropriate to compare results in terms of the mean fitness effect $\widetilde{s}$ of the contending mutations that evade stochastic extinction, rather than the mean fitness of all generated mutations^55,56 (see ‘Methods’). In terms of this $\widetilde{s}$ (Fig. 5C) or an alternative summary statistic (Supplementary Fig. S5; SI Text Section 9), sweep probabilities in random-fitness-effect simulations are close to the predictions obtained under the assumption of fixed effects. Although it is possible for a fitter second or subsequent mutant to arise in the wildtype population and achieve a selective sweep by replacing the wildtype and all less-fit competitor mutants, in our simulations this scenario contributes very little to the total sweep probability.

**Fig. 5: Results of models with random and cumulative mutation effects.**

Multiplicative fitness effects

Next, we examined the consequences of allowing individuals to acquire multiple mutations with multiplicative fitness effects. The accumulation of beneficial mutations has two opposing effects on the sweep probability. By acquiring further mutations, a mutant lineage can become fitter, expand faster and hence increase its probability of achieving a selective sweep. On the other hand, competitor lineages can also become fitter and more likely to prevent a sweep. By grouping simulations that have the same distribution of mutation fitness effects, we find that the first factor outweighs the second, so that sweep probabilities are higher in the case of cumulative mutation effects (Fig. 5C). This result makes intuitive sense because the lineage closest to achieving a sweep represents the largest target for further fitness-enhancing mutations. Allowing the accumulation of beneficial mutations also increases the median time at which sweeps finish, in particular when fitness advantages are weak (Fig. 5D and Supplementary Figs. S11–S14). Nevertheless, the conclusion remains that, even when mutations have multiplicative effects, selective sweeps are infrequent unless contending mutations are on average strongly beneficial, in which case sweeps occur early.

When the number of mutations that can accumulate is unlimited, we find that, for every distribution of fitness effects, a single selective sweep is a more common outcome than two sweeps and two sweeps occur more often than three (Fig. 5E). Only rarely do we observe more than three sweeps. This conclusion is robust to the caveat that, for small s values, we likely underestimate the frequency of multiple sweeps because some would complete after we terminate the simulations. By plotting sweep probability against the mean fitness increase over the course of the simulation, we find that sweeps are more often due to a single large-effect mutation than several mutations with small effects (Supplementary Fig. S5).

Application to cancer

Our findings have especially interesting implications for understanding cancer evolution, in which case we consider a wildtype tumour that evolves while invading a resident population of normal cells. The macroscopic model then has only three parameters: the mutation rate conditional on survival, μ, and the wildtype and mutant propagation speeds, c_wt and c_m. To estimate c_wt for human tumours, we follow a similar procedure to prior studies^36,57. Consider a tumour that grows to a volume of V between 1 and 10 cm³ in time T between 5 and 20 years. The propagation speed can then be estimated as

$$\widetilde{c}=\frac{r}{T}=\frac{\root{{3}}\of{\frac{3}{4\pi }V}}{T},$$

(16)

which equates to between 1 and 40 μm per day. Given that the diameter of a typical cancer cell is l ≈ 20 μm⁵⁸ and the generation time (cell cycle time) is τ_G ≈ 4 days⁵⁹, we can switch units to obtain $c=\widetilde{c}\times {\tau }_{G}/l$, which is between 0.15 and 7 cell diameters per generation. The rate of acquiring advantageous (driver) mutations is usually estimated at around $\widetilde{\mu }=1{0}^{-5}$ per cell per generation^57,60. For the survival probability ρ, we assume values between 0.09 and 0.5, in agreement with inferred values in colorectal tumours⁶¹. Together, these parameter values lead to a typical length scale $\theta=\root{{4}}\of{\frac{3{c}_{{{{\rm{wt}}}}}}{\pi \mu }}\approx 10\,{{{\rm{to}}}}\,50\,{{{\rm{cells}}}}$. The expected values of X and Y are then ${\mathbb{E}}[X]=9$ to 45 cell diameters and ${\mathbb{E}}[Y]=7$ to 35 cell diameters. It follows that a tumour, having acquired sufficient mutations to grow, will likely gain further driver mutations already during early development.
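The characteristic length scale can be recomputed from the quoted parameter ranges. The sketch below takes the extremes of c_wt (0.15 to 7 cell diameters per generation), ρ (0.09 to 0.5) and the driver mutation rate 10⁻⁵ per cell per generation as stated in the text; combining the extremes in this way is an assumption made for illustration, and the variable names are hypothetical.

```python
import math

def theta(mu, c_wt):
    """Characteristic length (in cell diameters) for compound mutation rate mu."""
    return (3.0 * c_wt / (math.pi * mu)) ** 0.25

MU_TILDE = 1e-5            # driver mutations per cell per generation (from the text)
RHO_RANGE = (0.09, 0.5)    # survival probability range (from the text)
C_WT_RANGE = (0.15, 7.0)   # cell diameters per generation (from the text)

# Evaluate theta at every combination of the extreme parameter values.
thetas = [theta(rho * MU_TILDE, c) for rho in RHO_RANGE for c in C_WT_RANGE]
theta_min, theta_max = min(thetas), max(thetas)

mean_x_factor = math.gamma(1.25)         # E[X] / theta for a Weibull with shape 4
mean_y_factor = 0.75 * math.gamma(1.25)  # E[Y] / theta, since E[Y | X = x] = 3x/4
print(theta_min, theta_max)
print(mean_x_factor * theta_min, mean_x_factor * theta_max)
print(mean_y_factor * theta_min, mean_y_factor * theta_max)
```

The resulting values of θ fall near the ends of the rounded range of 10 to 50 cell diameters quoted above.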

By eqn. (12), if a mutant expands 10% faster within a tumour than the tumour expands into surrounding tissue, then the sweep probability is predicted to be less than ${\left(1-\frac{1}{1.1}\right)}^{3}\approx 0.00075$. If the mutant propagates twice as fast as the wildtype, we have Pr(sweep) < 0.13 and in the extreme case where c_m is ten times c_wt, we have Pr(sweep) < 0.73. In the latter case, the expected population radius when the sweep began, given that a sweep occurred, is approximately 50 cells, corresponding to a population size N of 400,000 cells. The time for the sweep to be completed is then τ₂ ≈ 40 generations, at a population size of approximately 800,000 cells. This result is relatively robust because the expected values ${\mathbb{E}}[X]$, ${\mathbb{E}}[Y]$ and ${\mathbb{E}}[X| {{{\rm{sweep}}}}]$ are proportional to the characteristic length θ, which varies with the fourth root of μ and c_wt. Hence, in three dimensions, changing μ or c_wt by a factor of 10 changes our estimates of the radius and time by only a factor of $10^{1/4}\approx 1.8$ and estimates of the population size by a factor of $10^{3/4}\approx 5.6$. In summary, our macroscopic model (assuming equal mutation effects and no accumulation of mutations) predicts that selective sweeps during a clonal expansion are rare unless mutations are very strongly beneficial, in which case sweeps begin and end early in the expansion. This general conclusion also holds for alternative growth models (Fig. 3).
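The bounds quoted in this paragraph follow from eqn. (12) written in terms of the speed ratio c_m/c_wt. A minimal sketch of the arithmetic, with a hypothetical function name:

```python
def sweep_upper_bound(speed_ratio, dim=3):
    """Upper bound (1 - c_wt / c_m) ** dim, with speed_ratio = c_m / c_wt > 1."""
    if speed_ratio <= 1.0:
        raise ValueError("the model assumes c_m > c_wt")
    return (1.0 - 1.0 / speed_ratio) ** dim

# Speed ratios discussed in the text: 10% faster, twice as fast, ten times as fast.
bounds = {ratio: sweep_upper_bound(ratio) for ratio in (1.1, 2.0, 10.0)}

# Sensitivity of length and population-size estimates to a tenfold change in mu or c_wt.
length_factor = 10.0 ** 0.25
size_factor = 10.0 ** 0.75

print(bounds)
print(length_factor, size_factor)
```

Passing `dim=1` or `dim=2` gives the corresponding bounds for the one- and two-dimensional models.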

Discussion

Here, we have used mathematical modelling and analysis to determine the expected frequency of selective sweeps. We find that this frequency is generally expected to be low, even for mutations with a strong selective advantage. Moreover, when the wildtype and mutant radial expansion speeds are constant, the sweep probability can be expressed solely in terms of those speeds, which can in turn be related, through the FKPP equation or other standard models, to the selection coefficient, dispersal rates and other basic parameters. Our analytical predictions remain informative even when mutation fitness effects are random and multiplicative.

Why is the sweep probability independent of the mutation rate? An intuitive explanation is that if the mutation rate is higher, then on the one hand, the first advantageous mutation is likely to arise in a smaller population (meaning that it has less distance to travel to achieve a sweep) but, on the other hand, competing mutations will also tend to arise sooner than in the case of a lower mutation rate. These two effects exactly cancel out under the assumption of constant radial growth speeds. In alternative growth models, the two effects are unequal, resulting in either a positive or a negative effect of mutation rate on sweep probability (SI Text Section 8).

We make three arguments to justify our focus on this particular model of range expansions. First, our model corresponds to the continuum approximation of standard mathematical models of range expansions and spatial population genetics: the spatial Moran process (or stepping stone model⁵⁴) and the biased voter model (which is equivalent to an Eden growth model extended to allow local dispersal and competition throughout the population)⁴⁵. These models are well understood, intuitive and easy to parameterise. Second, because our model is relatively permissive to selective sweeps, it provides useful upper bounds for selective sweep probabilities in more complex scenarios, as further explained below. Our third justification is that, in much the same way as the Moran and Wright-Fisher processes are the most useful, tractable models of evolution in constant-sized, non-spatial populations, so the constant-radial-speed model yields the clearest results for range expansions. Haldane’s famous rule of thumb is that the fixation probability of a weakly beneficial allele in a large, non-spatial population of constant size is approximately proportional to its relative advantage in terms of proliferation rate⁶². Here, we have obtained the comparably simple result that the probability of a strongly beneficial allele achieving a selective sweep in an expanding population is approximately equal to its relative advantage in terms of radial expansion speed, raised to the power of the spatial dimension. For instance, if the radial expansion speed at which a mutant spreads within the wildtype population is twice the speed at which the wildtype expands, then the probability of this mutant achieving a selective sweep can be approximated simply as (1−1/2)² = 1/4 in two dimensions and 1/8 in three dimensions.

Some alternative models have been considered previously. Antal and colleagues³⁶ used a macroscopic model similar to ours to investigate the case in which mutations arise only at the boundary of a range expansion. Given that selective sweeps are then impossible, the interesting outcome is when the mutant envelops the wildtype. Ralph & Coop⁵⁰ and Martens and colleagues^35,51 instead considered constant-sized populations and found that selective sweeps are likely only if the population width is much smaller than a characteristic length scale, which depends on the mutation rate, the dispersal rate, the effective local population density and the strength of selection. In SI Text Section 7 we compare our results to the findings of these and other prior studies, including a well-known result of Gerrish and Lenski for non-spatial populations⁵⁵. The general conclusions are that selective sweeps are predicted to be rare except in small populations and that our model provides a useful upper bound on the sweep probability in range expansions.

A selective sweep can occur only if the rate of spread of an advantageous mutation exceeds the expansion speed of the wildtype population. This scenario is plausible, for example, in a biofilm growing slowly under antibiotic stress, in which case our findings predict the evolutionary dynamics of antibiotic resistance. Our results apply equally to an invasive species that is still adapting to its new conditions and whose range expansion is slowed by the need to modify its environment (niche construction) or by interspecific interactions¹³. If the invader must outcompete a resident, then the sweep probability can be approximated via the FKPP solution in terms of proliferation rates as in eqn. (13). The dimensional exponent in this result implies that selective sweeps are most likely to occur in species invading essentially linear habitats, such as coastlines.

Parametrised for human solid tumours, our macroscopic model indicates that selective sweeps during clonal expansions are restricted to strong driver mutations during the early stages of expansion. Late-arising drivers can become locally abundant but are unlikely to become fixed. There are several reasons to expect this general conclusion to hold even when driver mutation effects are random and multiplicative. First, an extension of our mathematical analysis suggests that the increase in sweep probability due to the accumulation of beneficial mutations is not much larger in three dimensions than in our two-dimensional simulations (Supplementary Fig. S17). Second, selective sweeps will be less frequent when the wildtype radial expansion speed increases over time (as is typical in the middle stage of tumour progression) or when mutant expansion is restricted to the tumour boundary (which may be the case in some tumours^30,33,63). Third, whereas our models assume a homogeneous microenvironment, tumours typically contain regions of hypoxia, necrosis and connective tissue that slow or prevent cell dispersal and hence impede selective sweeps. Fourth, due to microenvironmental heterogeneity and niche construction, fitness landscapes are likely to vary such that a mutation that is beneficial in one tumour region may be neutral or deleterious in another⁶⁴.

Our prediction that selective sweeps are rare during later tumour expansion, even if driver mutations continue to accumulate, is consistent with recent findings of the Pan-Cancer Analysis of Whole Genomes consortium. Analysing 2658 human tumour genomes, the consortium detected on average four clonal driver mutations per tumour^21,22, while also finding that more than 95% of tumours exhibit at least one subclonal expansion⁶⁵. Although only 11% of subclonal expansions could be explained by single-nucleotide variants that are known cancer drivers, the consortium found evidence of positive selection across subclones and cancer types. They therefore concluded that tumours may harbour additional subclonal expansions (corresponding to incomplete sweeps) driven by copy number aberrations, structural variants, epigenetic alterations, or single-nucleotide variants that have yet to be identified as drivers⁶⁵.

How then do tumours acquire multiple clonal driver mutations? Driver mutations can sometimes be classified into early and late events by leveraging point mutations on top of copy-number gains^22,23. However, because a selective sweep resets the common ancestor, it has proven difficult to determine whether clonal drivers are mostly acquired before or during tumour expansion. According to our models, it is highly unlikely that more than two selective sweeps will occur during a tumour’s final growth phase. Multiple clonal driver mutations are therefore better explained by multi-stage models of tumour initiation and progression (reviewed in ref. ⁶⁶), in which growth repeatedly stalls due to constraints such as hypoxia, immune control and physical barriers. Driver mutations enable subclones to escape these constraints and invade new territory, each time purging genetic diversity so that the final, prolonged expansion originates from a single highly transformed cell. This episodic model is conventional^39,67 and has been particularly well characterised in recent studies of colorectal cancer^68,69 and breast cancer⁷⁰. Our results suggest that once a tumour has entered its final growth phase and is more than a cubic millimetre in volume, even extremely strong drivers are highly unlikely to become clonal and will instead contribute to genetic heterogeneity and possibly to parallel evolution⁷¹.

The mathematical approach we have developed here can potentially be extended to investigate the dynamics of mutant population sizes, to understand better how intratumour heterogeneity relates to cancer treatment outcomes^17,26 and to develop more effective cancer treatment strategies^72,73. To obtain more precise predictions rather than upper bounds, it will be important to examine how microenvironmental heterogeneity, immune responses and phenotypic plasticity inhibit clonal expansions in each cancer type. Our predictions also motivate further investigation of selective sweeps in biofilms and other experimental systems².

Methods

Numerical integration

We performed numerical integration using the MATLAB function ‘trapz’. For values of x close to 0 and beyond 2θ, f_X(x) is small and its contribution to the integrals is negligible. Hence, to avoid numerical errors without compromising precision, we set the lower and upper bounds of integration to 0.001θ and 3θ when integrating over f_X(x). Similarly, we set 0.001θ as the lower bound of the integration over y. We set interval widths to 0.001θ for x ≤ 0.5 and 0.01θ for x > 0.5.
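
The integration grid described above can be sketched as follows. This is a minimal Python illustration, not the MATLAB code used in the study. Because the expression for f_X(x) is not reproduced in this section, a Weibull density with shape 4 and scale θ stands in for it purely as a placeholder, and the change of interval width is assumed to occur at x = 0.5θ.

```python
import math

THETA = 1.0  # characteristic length; all lengths below are in units of theta

def f_x(x, theta=THETA):
    """Placeholder density (Weibull, shape 4); NOT the paper's f_X."""
    z = x / theta
    return 4.0 * z ** 3 / theta * math.exp(-z ** 4)

def trapezoid(xs, ys):
    """Composite trapezoidal rule on a possibly non-uniform grid."""
    return sum(0.5 * (ys[k] + ys[k + 1]) * (xs[k + 1] - xs[k])
               for k in range(len(xs) - 1))

# Fine steps (0.001 theta) from 0.001 theta to 0.5 theta, then coarse steps
# (0.01 theta) up to the upper bound 3 theta. Integer counters avoid float drift.
grid = [0.001 * k * THETA for k in range(1, 501)]
grid += [(0.5 + 0.01 * k) * THETA for k in range(1, 251)]

density = [f_x(x) for x in grid]
total_mass = trapezoid(grid, density)
mean_x = trapezoid(grid, [x * fx for x, fx in zip(grid, density)])

print(f"integral of f_X = {total_mass:.6f}, mean = {mean_x:.6f} theta")
```

For this placeholder density, the truncated integral should be very close to 1, which is the sense in which the omitted ranges contribute negligibly.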

Agent-based simulations

We ran agent-based simulations using the Warlock automated computational workflow for the demon modelling framework^30,53. Individuals in this agent-based model are subdivided into well-mixed demes on a regular two-dimensional grid. The demes have identical carrying capacity K and are initially filled with residents, except that a single wildtype invader is introduced at the centre of the grid. At each time step, an individual is chosen at random, with probability weighted by fitness, to be replaced by two offspring. Each offspring then either migrates, with probability m, to a neighbouring deme in a randomly chosen direction or remains in its parent deme. Local density dependence is implemented by imposing a very high death rate whenever a deme is above the carrying capacity. Mutation is coupled to wildtype reproduction. Motivated by an invasive cancer cell population spreading in non-invasive healthy tissue, we do not permit resident individuals to disperse. We can account for this asymmetry by adapting the conversion from fitness advantage into expansion speeds (to be published in a later study). Further model details have been published previously^30,53. All simulations were performed using the Hyperion cluster at City St George’s, University of London. Example graphical outputs of the simulations, plotted using the ggmuller R package⁷⁴, are included in Supplementary Figs. S7 and S8.
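
To make the update rule concrete, the following toy sketch implements a single birth event on a small grid of demes. It is not the demon implementation: mutation is omitted, the "very high death rate" above carrying capacity is replaced by immediate random culling, offspring that would leave the grid stay in the parent deme, and all names (`make_grid`, `birth_event`, the type labels) are hypothetical.

```python
import random

def make_grid(size, K):
    """Grid of demes filled with residents, plus one wildtype at the centre."""
    grid = [[{"resident": K} for _ in range(size)] for _ in range(size)]
    c = size // 2
    grid[c][c] = {"resident": K - 1, "wildtype": 1}
    return grid

def birth_event(grid, fitness, K, m, rng):
    """Apply one fitness-weighted birth event to the grid, in place."""
    size = len(grid)
    # Candidate parent classes: (row, col, type) with at least one individual.
    classes = [(i, j, t) for i in range(size) for j in range(size)
               for t, n in grid[i][j].items() if n > 0]
    weights = [grid[i][j][t] * fitness[t] for i, j, t in classes]
    i, j, t = rng.choices(classes, weights=weights, k=1)[0]
    grid[i][j][t] -= 1  # the parent is replaced by two offspring
    touched = set()
    for _ in range(2):
        di, dj = 0, 0
        # Residents never disperse; other types migrate with probability m.
        if t != "resident" and rng.random() < m:
            di, dj = rng.choice([(1, 0), (-1, 0), (0, 1), (0, -1)])
        ni, nj = i + di, j + dj
        if not (0 <= ni < size and 0 <= nj < size):
            ni, nj = i, j  # toy boundary rule: stay in the parent deme
        grid[ni][nj][t] = grid[ni][nj].get(t, 0) + 1
        touched.add((ni, nj))
    # Density dependence (simplified): cull random individuals above capacity.
    for a, b in touched:
        deme = grid[a][b]
        while sum(deme.values()) > K:
            types = [x for x in deme if deme[x] > 0]
            victim = rng.choices(types, weights=[deme[x] for x in types], k=1)[0]
            deme[victim] -= 1

rng = random.Random(1)
grid = make_grid(size=5, K=16)
fitness = {"resident": 0.91, "wildtype": 1.0}
for _ in range(2000):
    birth_event(grid, fitness, K=16, m=0.05, rng=rng)
deme_sizes = [sum(d.values()) for row in grid for d in row]
print(max(deme_sizes), min(deme_sizes))
```

The sketch is intended only to show how fitness-weighted reproduction, local migration and a carrying capacity fit together; quantitative results require the published framework.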

For all simulations, we set the deme size to K = 16 and the migration probability m = 0.05. For such a small m, the probability of surviving drift can be estimated as the probability of becoming locally fixed in one deme. Since the within-deme dynamics approximate a Moran process, this probability is $\rho=\frac{1-{r}_{{{{\rm{wt}}}}}/{r}_{{{{\rm{m}}}}}}{1-{\left({r}_{{{{\rm{wt}}}}}/{r}_{{{{\rm{m}}}}}\right)}^{K}}$, where r_wt and r_m are the proliferation rates⁷⁵. We measured propagation speeds by simulating the expansion of a wildtype into a large resident population, or of a mutant into a large wildtype population, in the absence of mutation and then applying linear regression to the growth curve of the effective radius, defined as the square root of the population size divided by π.
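
The speed measurement described above amounts to fitting a straight line to the effective radius √(N/π) as a function of time. A minimal sketch follows, using synthetic, noise-free data; the function names and the example trajectory (including the initial radius of 5) are illustrative assumptions, not simulation output.

```python
import math

def effective_radius(population_size):
    """Radius of a disc whose area equals the population size."""
    return math.sqrt(population_size / math.pi)

def fit_speed(times, population_sizes):
    """Ordinary least-squares slope of effective radius against time."""
    radii = [effective_radius(n) for n in population_sizes]
    t_mean = sum(times) / len(times)
    r_mean = sum(radii) / len(radii)
    cov = sum((t - t_mean) * (r - r_mean) for t, r in zip(times, radii))
    var = sum((t - t_mean) ** 2 for t in times)
    return cov / var

# Synthetic disc growing at a constant radial speed (value chosen for illustration).
true_speed = 0.15
times = list(range(0, 2001, 100))
sizes = [math.pi * (5.0 + true_speed * t) ** 2 for t in times]

estimated_speed = fit_speed(times, sizes)
print(f"estimated speed = {estimated_speed:.4f}")
```

With real simulation output, the fit would normally be restricted to the time window in which the radius grows linearly.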

For each parameter set, we took the mean speed from ten replicates, for which the standard deviation was consistently below 2% of the mean. We set the proliferation rates to ${r}_{{{{\rm{re}}}}}=0.91$ and r_wt = 1.0 for the resident and wildtype, respectively, leading to ${a}_{{{{\rm{wt}}}}}={r}_{{{{\rm{wt}}}}}-{r}_{{{{\rm{re}}}}}=0.09$. The measured wildtype speed was then c_wt ≈ 0.15. In simulations with fixed mutation effects, we varied the mutant proliferation rate r_m from 1.1 to 3.0, so that a_m = r_m − r_wt ranged from 0.1 to 2.0. The measured mutant speeds are presented in Supplementary Table S3.

To implement random mutation effects, we multiplied the fitness effect of each mutation by a factor X drawn from an exponential distribution with mean 1. To prevent birth rates from becoming implausibly large, we assumed diminishing-returns epistasis associated with maximum birth rate M = 10. The birth rate after i mutations was then

$${r}_{{\rm{m}},i}=\min \left\{{r}_{{\rm{m}},i-1}\left(1+sX\left(1-\frac{{r}_{{\rm{m}},i-1}}{M}\right)\right),M\right\},$$

(17)

where parameter s > 0 determines the mean mutation effect.
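
As an illustration of eqn. (17), the sketch below iterates the update for a single lineage. The starting rate r_0 = 1.0 (the wildtype proliferation rate) and the function names are assumptions made for this example.

```python
import random

def next_birth_rate(r_prev, s, x, max_rate=10.0):
    """Birth rate after one more mutation, following eqn. (17)."""
    return min(r_prev * (1.0 + s * x * (1.0 - r_prev / max_rate)), max_rate)

def accumulate(n_mutations, s, rng, r0=1.0, max_rate=10.0):
    """Birth rates along a lineage that acquires n_mutations in sequence."""
    rates = [r0]
    for _ in range(n_mutations):
        x = rng.expovariate(1.0)  # X drawn from an exponential distribution, mean 1
        rates.append(next_birth_rate(rates[-1], s, x, max_rate))
    return rates

rng = random.Random(0)
lineage = accumulate(n_mutations=50, s=0.5, rng=rng)
print([round(r, 3) for r in lineage[:6]])
```

Because the factor 1 − r/M shrinks as r approaches M, successive mutations yield progressively smaller gains and the birth rate never exceeds M.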

Computation of sweep probabilities in simulations with fixed mutation effects

We ran a batch of 1000 simulations for each combination of mutation rate $\widetilde{\mu }\in \{1{0}^{-4},1{0}^{-5},1{0}^{-6}\}$ and mutant proliferation rate r_m ∈ {1.1, 1.2, 1.3, 1.4, 1.5, 2.0, 2.5, 3.0}. Resident and wildtype proliferation rates were in all cases ${r}_{{{{\rm{re}}}}}=0.91$ and r_wt = 1.0. Simulations were stopped at a population size of 1,000,000 or at 2000 generations (whichever was reached first). We called a sweep when all individuals at the end of the simulation shared a mutation.
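
The sweep criterion can be expressed compactly. In the sketch below, each individual's genotype is represented as a set of mutation identifiers; this representation and the function names are assumptions for illustration and do not reflect the simulation output format.

```python
def has_swept(genotypes):
    """True if every individual carries at least one mutation in common."""
    if not genotypes:
        return False
    shared = set(genotypes[0])
    for g in genotypes[1:]:
        shared &= set(g)
        if not shared:
            return False
    return bool(shared)

def sweep_probability(final_populations):
    """Fraction of simulations in a batch whose final population has swept."""
    outcomes = [has_swept(pop) for pop in final_populations]
    return sum(outcomes) / len(outcomes)

# Four tiny hypothetical end states: two with a shared mutation, two without.
batch = [
    [{1}, {1, 2}, {1, 3}],   # mutation 1 is shared by everyone
    [set(), {4}],            # one individual is unmutated
    [{5}, {6}],              # two distinct mutant clones
    [{7, 8}, {7}],           # mutation 7 is shared by everyone
]
estimate = sweep_probability(batch)
print(estimate)
```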

To investigate the extent to which we failed to detect selective sweeps that would have completed after the simulation stop time, we examined the population sizes when the detected sweeps finished. For every set of parameter values for which sweeps were abundant enough to analyse, the population radius at the time of sweep completion exhibited a bell-shaped distribution that decayed steeply before the maximum possible radius $\sqrt{1{0}^{6}/\pi }$, indicating that we detected the vast majority of sweeps. In the remaining cases ($\widetilde{\mu }=1{0}^{-5}$ and r_m = 1.1; $\widetilde{\mu }=1{0}^{-6}$ and r_m = 1.1 or 1.2), sweeps were sufficiently rare that even relatively severe undercounting would not affect our conclusions.

Computation of sweep probabilities in simulations with random mutation effects

For each variant of the random-mutation-effects model (permitting the accumulation of up to one, two, three or unlimited mutations), we ran eleven batches of 1000 simulations, with fitness effects calculated according to eqn. (17) and s ∈ {0.05, 0.075, 0.1, 0.15, 0.2, 0.3, 0.4, 0.5, 1, 1.5, 2}. As before, simulations were stopped when they reached a population size of 1,000,000 or 2000 generations.

The establishment probability ρ is lower for weakly beneficial than for highly beneficial mutations, which implies that the effective mutation rate $\mu=\rho \widetilde{\mu }$ is lower in the former case. For example, in a Moran process with K = 16, approximately 94% of nearly neutral mutations succumb to stochastic extinction before they can contribute to the macroscopic evolutionary dynamics, compared to only 50% of mutations that confer a fitness effect s = 1. To account for this difference in effective mutation rates, we calculated, in the at-most-one-mutation model, the mean effect $\widetilde{s}$ of mutations that reached an abundance of at least 10 individuals (see Supplementary Fig. S16 for the distributions). We then compared the sweep probability for each random-effects model with that of the equal-effects model with ${a}_{{{{\rm{m}}}}}=\widetilde{s}$. As before, we transformed a_m into c_m according to Supplementary Table S3.
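
The two percentages quoted above follow from the Moran fixation probability given earlier in the Methods. The sketch below evaluates it in the neutral limit and for s = 1, taking r_m = 2 r_wt for the latter (an assumption that ignores the diminishing-returns correction); the function name and the example value of the raw mutation rate are illustrative.

```python
def establishment_probability(r_wt, r_m, K):
    """Moran fixation probability of a single mutant in a deme of K individuals."""
    if r_m == r_wt:
        return 1.0 / K  # neutral limit of the formula
    ratio = r_wt / r_m
    return (1.0 - ratio) / (1.0 - ratio ** K)

K = 16
r_wt = 1.0
rho_neutral = establishment_probability(r_wt, r_wt, K)
rho_strong = establishment_probability(r_wt, 2.0 * r_wt, K)  # s = 1, assumed r_m = 2

# Effective mutation rate mu = rho * mu_tilde for an example raw rate.
mu_tilde = 1e-5
mu_effective = {"neutral": rho_neutral * mu_tilde, "s=1": rho_strong * mu_tilde}

print(f"extinction (neutral) = {1 - rho_neutral:.4f}")
print(f"extinction (s = 1)   = {1 - rho_strong:.4f}")
```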

To investigate whether our finite-time simulations omit sweeps that would occur if simulations were run for longer times, we investigated the population radii at sweep completion. Population size rather than number of generations was the dominant stopping criterion except when s = 0.05 in all models and s = 0.075 in the at-most-one-mutation model (Supplementary Fig. S9). In all cases, the radius distributions are bell-shaped and indicate that, for all but the smallest fitness effects, most selective sweeps finished long before the simulation stop time (Supplementary Figs. S11–S14). To estimate the extent of undercounting, we fitted right-truncated gamma distributions to the radius distributions, with the truncation at $\sqrt{1,000,000/\pi }\approx 564$ corresponding to the population size at which simulations were stopped. Using a maximum likelihood approach, we then estimated the scale and shape parameters of each gamma distribution. By evaluating the probability in the tail of the non-truncated gamma distribution beyond the truncation point, we estimated the fraction of sweeps omitted in each batch of simulations and adjusted the sweep probabilities in Fig. 5C accordingly. The adjustments are negligible except in the case of small fitness effects, for which sweep probabilities are in any case low. In tests using artificially truncated distributions from simulations with large s values (Supplementary Fig. S15), we find that this method typically overestimates the proportion of missing sweeps. The adjusted values can therefore be regarded as upper bounds.
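
The tail-correction step can be sketched as follows, given shape and scale parameters that have already been fitted. The maximum likelihood fit itself is not shown, the fitted values used here are hypothetical, and dividing the observed probability by the detected fraction is an assumed form of the adjustment. The gamma distribution function is evaluated with a standard series for the regularised lower incomplete gamma function so that the example needs only the standard library.

```python
import math

def gamma_cdf(x, shape, scale):
    """Gamma CDF via the series for the regularised lower incomplete gamma function."""
    if x <= 0:
        return 0.0
    z = x / scale
    term = 1.0 / shape
    total = term
    n = 0
    while term > 1e-17 * total:
        n += 1
        term *= z / (shape + n)
        total += term
    return total * math.exp(shape * math.log(z) - z - math.lgamma(shape))

def adjust_sweep_probability(observed, shape, scale, truncation):
    """Return (adjusted probability, estimated fraction of sweeps omitted)."""
    detected = gamma_cdf(truncation, shape, scale)
    return observed / detected, 1.0 - detected

# Truncation radius corresponding to the stopping size of 1,000,000 individuals.
truncation_radius = math.sqrt(1_000_000 / math.pi)

# Hypothetical fitted parameters and observed sweep probability, for illustration.
adjusted, omitted = adjust_sweep_probability(
    observed=0.02, shape=9.0, scale=40.0, truncation=truncation_radius)
print(f"omitted fraction = {omitted:.3f}, adjusted probability = {adjusted:.4f}")
```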

Data availability

Configuration files and simulated data can be downloaded from https://doi.org/10.5281/zenodo.10775383.

Code availability

Supporting Mathematica scripts and R code to generate the figures can be found at https://doi.org/10.5281/zenodo.18246214. Simulations were run using the workflow warlock, which is based on the demon simulation framework. The code can be found at https://doi.org/10.5281/zenodo.7435093 and https://github.com/robjohnnoble/demon_model respectively.

References

Excoffier, L., Foll, M. & Petit, R. J. Genetic consequences of range expansions. Annu. Rev. Ecol. Evol. Syst. 40, 481–501 (2009).
Hallatschek, O., Hersen, P., Ramanathan, S. & Nelson, D. R. Genetic drift at expanding frontiers promotes gene segregation. Proc. Natl. Acad. Sci. USA 104, 19926–19930 (2007).
Seferbekova, Z., Lomakin, A., Yates, L. R. & Gerstung, M. Spatial biology of cancer evolution. Nat. Rev. Genet. 24, 295–313 (2023).
Biesecker, L. G. & Spinner, N. B. A genomic view of mosaicism and human disease. Nat. Rev. Genet. 14, 307–320 (2013).
Martincorena, I. et al. High burden and pervasive positive selection of somatic mutations in normal human skin. Science 348, 880–886 (2015).
Cavalli-Sforza, L. L., Menozzi, P. & Piazza, A. Demic expansions and human evolution. Science 259, 639–646 (1993).
Pecl, G. T. et al. Biodiversity redistribution under climate change: impacts on ecosystems and human well-being. Science 355, eaai9214 (2017).
Moran, E. V. & Alexander, J. M. Evolutionary responses to global change: lessons from invasive species. Ecol. Lett. 17, 637–649 (2014).
Hallatschek, O. & Nelson, D. R. Life at the front of an expanding population. Evolution 64, 193–206 (2010).
Korolev, K. S. et al. Selective sweeps in growing microbial colonies. Phys. Biol. 9, 026008 (2012).
Fusco, D., Gralka, M., Kayser, J., Anderson, A. & Hallatschek, O. Excess of mutational jackpot events in expanding populations revealed by spatial Luria–Delbrück experiments. Nat. Commun. 7, 12760 (2016).
Aif, S., Appold, N., Kampman, L., Hallatschek, O. & Kayser, J. Evolutionary rescue of resistant mutants is governed by a balance between radial expansion and selection in compact populations. Nat. Commun. 13, 7916 (2022).
Svenning, J.-C. et al. The influence of interspecific interactions on species range expansion rates. Ecography 37, 1198–1209 (2014).
Hanahan, D. & Weinberg, R. A. The hallmarks of cancer. Cell 100, 57–70 (2000).
Hanahan, D. & Weinberg, R. A. Hallmarks of cancer: the next generation. Cell 144, 646–674 (2011).
Davis, A., Gao, R. & Navin, N. Tumor evolution: linear, branching, neutral or punctuated? Biochim. Biophys. Acta Rev. Cancer 1867, 151–161 (2017).
Turajlic, S., Sottoriva, A., Graham, T. & Swanton, C. Resolving genetic heterogeneity in cancer. Nat. Rev. Genet. 20, 404–416 (2019).
Fearon, E. R. & Vogelstein, B. A genetic model for colorectal tumorigenesis. Cell 61, 759–767 (1990).
Gerlinger, M. et al. Intratumor heterogeneity and branched evolution revealed by multiregion sequencing. N. Engl. J. Med. 366, 883–892 (2012).
Liu, X. et al. Tumor phylogeography reveals block-shaped spatial heterogeneity and the mode of evolution in hepatocellular carcinoma. Nat. Commun. 15, 3169 (2024).
ICGC/TCGA Pan-Cancer Analysis of Whole Genomes Consortium Pan-cancer analysis of whole genomes. Nature 578, 82–93 (2020).
Gerstung, M. et al. The evolutionary history of 2,658 cancers. Nature 578, 122–128 (2020).
Gopal, P. et al. Clonal selection confers distinct evolutionary trajectories in BRAF-driven cancers. Nat. Commun. 10, 5143 (2019).
Andor, N. et al. Pan-cancer analysis of the extent and consequences of intratumor heterogeneity. Nat. Med. 22, 105–113 (2016).
McGranahan, N. & Swanton, C. Clonal heterogeneity and tumor evolution: past, present, and the future. Cell 168, 613–628 (2017).
Noble, R., Burley, J. T., Le Sueur, C. & Hochberg, M. E. When, why and how tumour clonal diversity predicts survival. Evolut. Appl. 13, 1558–1568 (2020).
Vendramin, R., Litchfield, K. & Swanton, C. Cancer evolution: Darwin and beyond. EMBO J. 40, e108389 (2021).
Beerenwinkel, N., Schwarz, R. F., Gerstung, M. & Markowetz, F. Cancer evolution: mathematical models and computational inference. Syst. Biol. 64, e1–e25 (2015).
Colyer, B., Bak, M., Basanta, D. & Noble, R. A seven-step guide to spatial, agent-based modelling of tumour evolution. Evolut. Appl. 17, e13687 (2024).
Noble, R. et al. Spatial structure governs the mode of tumour evolution. Nat. Ecol. Evol. 6, 207–217 (2022).
Sun, R. et al. Between-region genetic divergence reflects the mode and tempo of tumor evolution. Nat. Genet. 49, 1015–1024 (2017).
West, J., Schenck, R. O., Gatenbee, C., Robertson-Tessi, M. & Anderson, A. R. A. Normal tissue architecture determines the evolutionary course of cancer. Nat. Commun. 12, 2060 (2021).
Fu, X. et al. Spatial patterns of tumour growth impact clonal diversification in a computational model and the TRACERx renal study. Nat. Ecol. Evol. 6, 88–102 (2022).
Waclaw, B. et al. A spatial model predicts that dispersal and cell turnover limit intratumour heterogeneity. Nature 525, 261–264 (2015).
Martens, E. A., Kostadinov, R., Maley, C. C. & Hallatschek, O. Spatial structure increases the waiting time for cancer. N. J. Phys. 13, 115014 (2011).
Antal, T., Krapivsky, P. L. & Nowak, M. A. Spatial evolution of tumors with successive driver mutations. Phys. Rev. E 92, 022705 (2015).
Paterson, C., Nowak, M. A. & Waclaw, B. An exactly solvable, spatial model of mutation accumulation in cancer. Sci. Rep. 6, 39511 (2016).
Durrett, R., Foo, J. & Leder, K. Spatial Moran models, II: cancer initiation in spatially structured tissue. J. Math. Biol. 72, 1369–1400 (2016).
Armitage, P. & Doll, R. A two-stage theory of carcinogenesis in relation to the age distribution of human cancer. Br. J. Cancer 11, 161 (1957).
Paterson, C., Clevers, H. & Bozic, I. Mathematical model of colorectal cancer initiation. Proc. Natl. Acad. Sci. USA 117, 20681–20688 (2020).
Nicholson, M. D., Cheek, D. & Antal, T. Sequential mutations in exponentially growing populations. PLoS Comput. Biol. 19, e1011289 (2023).
Fisher, R. A. The wave of advance of advantageous genes. Ann. Eugen. 7, 355–369 (1937).
Kolmogorov, A. N., Petrovsky, I. G. & Piskunov, N. S. A study of the diffusion equation with increase in the amount of substance, and its application to a biological problem. (Bulletin of Moscow State University, 1937).
Brunet, E. & Derrida, B. Shift in the velocity of a front due to a cutoff. Phys. Rev. E 56, 2597 (1997).
Murray, J. D. Mathematical Biology: I: An Introduction, Vol. 17 (Springer, 2002).
Hallatschek, O. & Korolev, K. S. Fisher waves in the strong noise limit. Phys. Rev. Lett. 103, 108103 (2009).
Houchmandzadeh, B. & Vallade, M. Fisher waves: an individual-based stochastic model. Phys. Rev. E 96, 012414 (2017).
Hermisson, J. & Pennings, P. S. Soft sweeps and beyond: understanding the patterns and probabilities of selection footprints under rapid adaptation. Methods Ecol. Evol. 8, 700–716 (2017).
Lyons, R. Another contender in the arctangent race. IEEE Signal Process. Mag. 21, 109–110 (2004).
Ralph, P. & Coop, G. Parallel adaptation: one or many waves of advance of an advantageous allele? Genetics 186, 647–668 (2010).
Martens, E. A. & Hallatschek, O. Interfering waves of adaptation promote spatial mixing. Genetics 189, 1045–1060 (2011).
Noble, R. Demon: deme-based oncology model. https://github.com/robjohnnoble/demon_model (2023).
Bak, M., Colyer, B., Manojlović, V. & Noble, R. Warlock: an automated computational workflow for simulating spatially structured tumour evolution. Preprint at https://doi.org/10.48550/arXiv.2301.07808 (2023).
Kimura, M. & Weiss, G. H. The stepping stone model of population structure and the decrease of genetic correlation with distance. Genetics 49, 561 (1964).
Gerrish, P. J. & Lenski, R. E. The fate of competing beneficial mutations in an asexual population. Genetica 102, 127–144 (1998).
Bataillon, T. & Bailey, S. F. Effects of new mutations on fitness: insights from models and data. Ann. N. Y. Acad. Sci. 1320, 76–92 (2014).
Bozic, I. et al. Accumulation of driver and passenger mutations during tumor progression. Proc. Natl. Acad. Sci. USA 107, 18545–18550 (2010).
Shashni, B. et al. Size-based differentiation of cancer and normal cells by a particle size analyzer assisted by a cell-recognition PC software. Biol. Pharm. Bull. 41, 487–503 (2018).
Jones, S. et al. Comparative lesion sequencing provides insights into tumor evolution. Proc. Natl. Acad. Sci. USA 105, 4283–4288 (2008).
Williams, M. J. et al. Quantification of subclonal selection in cancer from bulk sequencing data. Nat. Genet. 50, 895–903 (2018).
Werner, B. et al. Measuring single-cell divisions in human tissues from multi-region sequencing data. Nat. Commun. 11, 1035 (2020).
Haldane, J. B. S. A mathematical theory of natural and artificial selection. Math. Proc. Camb. Philos. Soc. 23, 607–615 (1927).
Lewinsohn, M. A., Bedford, T., Müller, N. F. & Feder, A. F. State-dependent evolutionary models reveal modes of solid tumour growth. Nat. Ecol. Evol. 7, 581–596 (2023).
Yang, K. R. et al. Niche inheritance: a cooperative pathway to enhance cancer cell fitness through ecosystem engineering. J. Cell. Biochem. 115, 1478–1485 (2014).
Dentro, S. C. et al. Characterizing genetic intra-tumor heterogeneity across 2,658 human cancer genomes. Cell 184, 2239–2254 (2021).
Frank, S. A. Dynamics of cancer: incidence, inheritance, and evolution. (Princeton University Press, 2018).
Vogelstein, B. et al. Cancer genome landscapes. Science 339, 1546–1558 (2013).
Ryser, M. D. et al. Minimal barriers to invasion during human colorectal tumor growth. Nat. Commun. 11, 1280 (2020).
Cross, W. et al. The evolutionary landscape of colorectal tumorigenesis. Nat. Ecol. Evol. 2, 1661–1672 (2018).
Lomakin, A. et al. Spatial genomics maps the structure, nature and evolution of cancer clones. Nature 611, 594–602 (2022).
Frankell, A. M. et al. The evolution of lung cancer and impact of subclonal selection in TRACERx. Nature 616, 525–533 (2023).
West, J. et al. A survey of open questions in adaptive therapy: bridging mathematics and clinical translation. eLife 12, e84263 (2023).
Viossat, Y. & Noble, R. A theoretical analysis of tumour containment. Nat. Ecol. Evol. 5, 826–835 (2021).
Noble, R. ggmuller: Create Müller plots of evolutionary dynamics. R package version 0.5.6 (2023).
Nowak, M. A. Evolutionary dynamics: exploring the equations of life. (Harvard University Press, 2006).


Acknowledgements

We thank Jonas Demeulemeester and Maxime Tarabichi for guidance on the interpretation of recent cancer sequencing studies, and Guillaume Martin for helpful discussions about mutation fitness effect distributions. We are grateful for the use of City St George’s Hyperion cluster to run the many simulations integral to this study. A.S. was supported by the European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie EvoGamesPlus grant agreement no. 955708, and by UK Research and Innovation (UKRI) under grant no. MR/V02342X/1. M.B. was supported by an award from the City of St George’s, University of London Research Pump-priming Fund. R.N. was supported by the National Cancer Institute of the National Institutes of Health under Award Number U54CA217376. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

Author information

Authors and Affiliations

Centre for Cancer Evolution, Barts Cancer Institute, Queen Mary University of London, London, UK
Alexander Stein
Department of Physics, ETH Zurich, Zürich, Switzerland
Alexander Stein
Department of Mathematics, City St George’s, University of London, London, UK
Kate Bostock, Maciej Bak & Robert Noble
Department of Physics, Indian Institute of Science Education and Research, Kolkata, India
Ramanarayanan Kizhuttil
Department of Ecology, Behavior & Evolution, UC San Diego, San Diego, CA, USA
Ramanarayanan Kizhuttil

Contributions

R.N. conceived the research question and supervised the project. R.N. and A.S. designed the research. A.S. and R.K. carried out the mathematical analysis. M.B. and K.B. performed agent-based simulations. A.S. and K.B. analysed simulation results. All authors wrote and approved the manuscript.

Corresponding authors

Correspondence to Alexander Stein or Robert Noble.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Diana Fusco and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Transparent Peer Review file

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Stein, A., Bostock, K., Kizhuttil, R. et al. Selective sweep probabilities in spatially expanding populations. Nat Commun 17, 2181 (2026). https://doi.org/10.1038/s41467-026-69363-7

Received: 17 June 2024
Accepted: 29 January 2026
Published: 11 February 2026
Version of record: 04 March 2026
DOI: https://doi.org/10.1038/s41467-026-69363-7