Constraints to gene flow increase the risk of genome erosion in the Ngorongoro Crater lion population

Dussex, Nicolas; Jansson, Ingela; van der Valk, Tom; Packer, Craig; Norman, Anita; Kissui, Bernard M.; E. Mjingo, Ernest; Spong, Göran

doi:10.1038/s42003-025-07986-0

Download PDF

Article
Open access
Published: 21 April 2025

Constraints to gene flow increase the risk of genome erosion in the Ngorongoro Crater lion population

Communications Biology volume 8, Article number: 640 (2025) Cite this article

3278 Accesses
1 Citations
6 Altmetric
Metrics details

Subjects

Abstract

Small, isolated populations are at greater risk of genome erosion than larger populations. Successful conservation efforts may lead to demographic recovery and mitigate the negative genetic effects of bottlenecks. However, constrained gene flow can hamper genomic recovery. Here, we use population genomic analyses and forward simulations to assess the genomic impacts of near extinction in the isolated Ngorongoro Crater lion (Panthera leo) sub-population. We show that 200 years of quasi-isolation and the recent epizootic in 1962 resulted in a two-fold increase in inbreeding and an excess in the frequency of highly deleterious mutations relative to other populations of the Greater Serengeti. There was little evidence for purging of genetic load. Furthermore, forward simulations indicate that higher gene flow from outside of the Crater is needed to prevent future genomic erosion in the population, with a minimum of one to five effective male migrants per decade required to reduce the risk of long-term inbreeding depression and reduction in genetic diversity. Our results suggest that in spite of a rapid post-epizootic demographic recovery since the 1970s, continued isolation of the population driven by habitat fragmentation and potentially male territoriality, exacerbate the effects of genome erosion.

Genomic insights into the conservation status of the world’s last remaining Sumatran rhinoceros populations

Article Open access 26 April 2021

Population genomics and conservation management of a declining tropical rodent

Article Open access 04 March 2021

Species-wide genomics of kākāpō provides tools to accelerate recovery

Article 28 August 2023

Introduction

Increasing anthropogenic activities over the past centuries have had a strong impact on ecosystem health and diversity and are often characterised by severe wildlife population declines. Such declines have negative genetic consequences referred to as genome erosion (i.e., loss of genetic variation, increase in genetic load, mismatch between adaptations and environment¹) which can increase the risk of extinction by trapping species into an extinction vortex².

The rate of demographic decline and recovery as well as duration of a bottleneck will have a strong influence on the magnitude of genome erosion^1,3. For instance, only a small portion of genome-wide diversity may be lost if the decline is followed by quick demographic rebound, whereas a sustained bottleneck will likely result in a significant loss of diversity. Similarly, the amount of genetic load may not substantially change and thus impact individual fitness in future generations if there is a quick population recovery. In contrast, long-term declines characterised by gradual increases in inbreeding may facilitate a reduction in deleterious variation through purifying selection, in a process referred to as purging, whereas rapid declines may induce an increase in the frequency of deleterious variants^3,4, although both processes can occur in either types of declines due to stochastic effects.

There has been a strong focus on examining the effect of the intensity of population bottlenecks or founder events on genome erosion processes (e.g. ref. ^5,6,7,8,9). However, even after a population recovers or stabilizes from such events, genome erosion can continue if the population remains isolated^1,3. Habitat fragmentation associated with human activities reduces gene flow among subpopulations, increases genetic drift and inbreeding, and impacts genetic load¹⁰. High territoriality and competition from local individuals could also limit mating opportunities of immigrants and exacerbate this isolation¹¹. For instance, in polygamous species where only few males contribute genetic diversity to future generations, genomic recovery (e.g. reduction in inbreeding) can be relatively slow, and the random effect of genetic drift (i.e., loss of diversity or fixation of deleterious alleles) can be particularly strong as shown in the lek-breeding kākāpō (Strigops habroptilus)⁶.

As a case in point, the Ngorongoro Crater (hereafter referred to as the Crater) lion population in Tanzania, which now comprises ~60 lions (~5 prides), seems to function as an ecologically isolated population. The Crater offers high prey abundance and minimal human threats whereas the surrounding landscape is a multi-use area shared with pastoralists and their livestock, where natural prey densities are lower and more variable, and where human-lion conflicts occur. Thus, opportunities for pride establishment outside the Crater are limited^12,13,14. Importantly, even though there are no geographical barriers to dispersal into the Crater¹⁴, immigration of breeding males from the adjacent Serengeti plains has been infrequent, with only one breeding male successfully establishing in the Crater between 1965 and 2013^13,15. Furthermore, intensive monitoring through individual identification indicates a high degree of territoriality of local resident males and little opportunities for immigrant males to mate and establish¹³. For instance, 80% of resident males were born in the Crater, whereas in the rest of the Serengeti National Park only 33% of resident males were born locally. Moreover, a higher proportion (i.e., 7 vs <1%) of males became resident in their natal pride in the crater compared to the in Serengeti. Lastly, tenure is more successfully retained in the Crater, with males siring nearly ten times as many cubs in the Crater relative to the rest of the Serengeti¹³. Consequently, reduced reproduction opportunities of immigrant males from outside of the Crater cause reduced gene flow and to the genetic isolation of the population. This may be counteracted by inbreeding avoidance, as it has been found that mating among related pride members are rare^16,17.

Consistent with an estimated 85% reduction in the range of African lions since 1500 AD¹⁸ primarily driven by increasing anthropogenic pressure on the landscape, the Crater population experienced several bottlenecks associated with landscape fragmentation. Exponential human population growth and land use changes (i.e., agriculture and pastoralism¹⁹), retaliatory and ritual lion hunts^20,21, colonial-era trophy hunting²² as well as periodic rinderpest epizootics between 1890-1962 reducing ungulate populations in the Greater Serengeti Ecosystem (GSE) exacerbated this population fragmentation over the past ~200 years. Moreover, the population experienced a severe decline in 1962 induced by an epizootic outbreak causing pronounced lethargy and skin lesions followed by death, after which 15 adults and juvenile survivors founded the current population^12,13. After a period of recovery, the population declined in the late 90 s for unknown causes²³ and again in 2001, from a combination of tick-borne disease and canine distemper virus (CDV)²⁴, with the number of lions stabilising at 50 individuals over the past ~30 years. In spite of the immigration of a few breeding males in 1964–1965 and between 2013 and 2018, the Crater population is still characterised by low diversity and inbreeding depression. For instance, there is evidence for increase in levels of sperm abnormality and low spermiogenesis^{12,13,25,26,27} as well as higher levels of cub mortality in the first few weeks of life¹³. These observations seem consistent with a population harboring a significant genetic load. However, little is known about the impact of population declines and isolation on genome-wide variation of Crater lions and about the distribution of the deleterious variation in the population.

When temporal data are not available, comparative approaches can be useful for genomic studies in endangered species as they can reveal contrasted patterns of genome erosion, reflecting distinct population histories or degrees of connectivity (e.g., Ibex²⁸; Svalbard reindeer²⁹; Indian tigers³⁰). Here, we analyse 15 newly-sequenced lion genomes from Tanzania (the wider Ngorongoro Conservation Area including the Crater and the adjacent Serengeti plains) and compare them to 5 published lion genomes from Tanzania, Botswana and South Africa to examine the genomic consequences of the recent population decline and isolation on the Crater population. Genomic data produced in this study suggest that the recent decline induced an increase in genome erosion and realised load in the Crater population while simulations suggest that reduced male immigration could have also potentially contributed to the observed pattern. Simulations also reveal that a minimum of one to five effective male migrants per decade would be required to avoid substantial reduction in genome-wide variation and increases in inbreeding and genetic load.

Results and Discussion

Population structure, gene flow and past demography

We analysed 20 lion genomes to reconstruct the population history of the Crater lions and to examine the consequences of isolation and recent population decline on genome-wide diversity and genetic load of the population (Fig. 1a).

**Fig. 1: Sampling, population structure and gene flow.**

We first examined the population structure to determine genetic relationships among African lion populations and assess whether the Crater population is distinct from other African populations. Overall, there was little evidence for admixture with adjacent Serengeti plains lions (Fig. 1b, c). While there was strong support (i.e., lowest Cross Validation error) for two genetic clusters (K = 2), considering three clusters (K = 3) supported a clear division among Crater, Greater Serengeti (i.e., comprising Tanzania, Ndutu, Endulen) and Selous/Botswana/South African lions (Fig. 1b, c). We also note that two males of unknown natal origin (Cra_05, Cra_07) and one Crater-born female (Cra_06; Supplementary Data 1) show slightly distinct ancestry.

Estimates of the divergence time between the Crater and Greater Serengeti suggests that gene flow was reduced c. 200 y. BP (Mean: 178; 95% HPD: 41- 380; Supplementary Fig. 1). The timing of this divergence is consistent with the recent bottleneck and increase in habitat fragmentation of the past 200 y. BP which both contribute to the genetic isolation of the population.

The demographic reconstruction based on the Pairwise Sequentially Markovian Coalescent (PSMC) showed a long-term decline over the past ~1 Ma BP and shared demographic history among lion populations (Fig. 2a), as previously shown by de Manuel et al. ³¹ for other lion populations. This decline is consistent with drying and cooling conditions of the first 1 M of the Pleistocene, while the Eemian period (c. 130,000 to 115,000 y. BP) characterised by warmer and wetter conditions and expansion of savannas and forests showed a plateau (Fig. 2a). Subsequently this decline continued with the next cycle of cooling³² until the Holocene.

The recent past demography (i.e., ~200 generations or ~1000 y. BP³³) indicated a population increase c. 1200-1000 y. BP, coinciding with a prolonged dry period in Eastern and Equatorial Africa²², followed by a decline c. 600 y. BP, coinciding with wetter conditions of the Little Ice Age³⁴ (Fig. 2b). While speculative, this suggests that denser woodland may have been potentially less favourable to lions. Population census size estimates since the early 1960s reflect the effects of the 1962 epizootic and the decline of 2001 caused by CDV, with population size ranging between 10 and 124 individuals (Fig. 2c).

Genetic diversity and load

Heterozygosity and inbreeding estimates revealed that the Crater population has the lowest amount of variation and is ~1.6 times as inbred compared to the larger neighbouring Serengeti and Selous populations (F_ROH-Crater = 0.37 \(\pm\)0.031, F_{ROH-Serengeti} = 0.22 \(\pm\)0.020, F_ROH-Selous = 0.23 \(\pm\) 0.029; Fig. 3, S2; Supplementary Data 4). Furthermore, there is on average ~60% (range: 51-66%) of the F_ROH coefficient comprising Runs of Homozygosity (ROH) \(\ge \) 2Mb, consistent with recent inbreeding events. When using a recombination rate estimated in felids³⁵ and a generation time of 5 years for lions³¹, inbreeding events characterised by ROH \(\ge \) 2Mb and ROH \(\ge\)10 Mb date to the past ~110 and ~30 years, respectively (Supplementary Data 5). These signatures of inbreeding are thus consistent with the recent history of population decline resulting from habitat fragmentation over the past two centuries in the region (Fig. 2c).

**Fig. 3: Heterozygosity and inbreeding.**

F_ROH estimates for the newly-sequenced Serengeti and Selous lions in our dataset are consistent with previous estimates for Tanzanian lions (F_ROH\(\simeq\)0.2; Supplementary Data 4)³¹.

We estimated genetic load by annotating variants in coding regions using Snpeff³⁶. When considering the frequency of deleterious variation among populations, the R_xy ratio³⁷ showed an excess in High impact (i.e., premature stop codons) variants and a slight deficit for Moderate impact (i.e., non-disruptive variants that might change protein effectiveness) variants in Crater lions relative to Serengeti and Selous lions (Fig. 4a). While the total load estimate for High and Moderate impact variants was not significantly different among populations, the High impact load was higher in the Crater population relative to other populations (Fig. 4b; Supplementary Data 4).

Since most deleterious mutations are partially recessive and since heterozygous ones are hidden from selection, homozygous mutations are the most informative of the negative fitness effects, also referred to as the realised load^1,3. Consistent with the higher inbreeding in the Crater population, we found significantly higher realised load for both High and Moderate impact variants relative to the more outbred populations (i.e., Serengeti and Selous; Figs. 4c, S3). Furthermore, the numbers of heterozygous variants for both High and Moderate categories are significantly lower in Crater lions relative to the Serengeti population (Supplementary Fig. 3), suggesting that those may have been lost through a combination genetic drift directly associated with the recent bottlenecks or through some early purging effect^1,3. Nevertheless, the higher total load in several Crater lions relative to other populations (i.e., Serengeti, Selous and South Africa) suggests that there has not been sufficient time for selection to purge highly deleterious variation and that the Crater population is most likely in the early stages of exposure to genome erosion following the ~200 years isolation from the Serengeti population and the 1962 epizootic.

As theory, empirical data and simulations indicate, the early stage of a decline is characterised by an increase in inbreeding and realised load and thus of inbreeding depression^3,4. Lethal or highly deleterious alleles should be reduced in frequency relatively early during a bottleneck, while a number of moderate impact variants with a lower selection coefficient and thus less individual effect on fitness are more likely to drift to fixation during a bottleneck and still contribute to inbreeding depression³. The pattern observed in Crater lions is similar to that of Grauer’s gorilla (Gorilla beringei graueri), which experienced severe population declines over the past century³⁸. However, the random effect of drift is particularly evident in Crater lions, with mostly High impact variants drifting to high frequencies, whereas a portion of the Moderate impact variants may have been lost during the bottleneck. Alternatively, it is possible that the Serengeti and Selous lions lost more of their High impact variants through a combination of purging or drift, which would cause the same observed difference. Finally, we note that the South African genome (i.e. previously identified as genetically grouping with South African lions³¹) and most likely from a zoo, shows one of the lowest total load, consistent with a scenario of purging, but also has high inbreeding and realised load.

Based on prior evidence of inbreeding depression, sperm abnormality and low sperm counts in Crater lions^12,25,26,27, as well as higher cub mortality in the first weeks of life¹³, we examined the gene functions of genes carrying High and Moderate impact alleles (Supplementary Data 6) using the Mouse Genome Informatics database³⁹. Among genes carrying High impact variants (i.e., premature stop codons) at higher frequency in the Crater population relative to Serengeti and Selous lions, we found a number of genes associated with sperm morphology and male fertility (e.g., AURKC, TSHB, TLR6, CWF19L2), nervous system (e.g., ARSG, FGD4) and immunity (e.g., TLR6; Supplementary Data 7). Among genes carrying Moderate impact variants (i.e., Missense) at high frequency, we again found functions associated with sperm morphology and male fertility (e.g., NUTM1, CCDC87, CEP250, PRSS55, TSPYL1) and additionally, with cardiovascular system (e.g., LAMA4, KLK5, GAS2L3, PPP1R15B, OBSCN) and nervous system (e.g., LAMA1, KCNJ16, SCN1B; Supplementary Data 7).

Forward genome-informed simulations

To model the evolutionary trajectory of populations and to assess the plausibility of interpretation based on empirical data⁴⁰, we performed forward-in-time genomic simulations recapitulating the population history of Crater lions based on our demographic reconstructions and population monitoring data since the 1970s using SLiM 4.0^41,42 (Supplementary Data 8). Overall, our simulations were consistent with our empirical data (Fig. 5). We found a 2-fold increase in inbreeding (from F_ROH 0.14 to 0.27) and ~15% reduction in heterozygosity in the Crater population relative to the Greater Serengeti Ecosystem (GSE) over the past 200 years (Fig. 5a). This increase was gradual since the population split c. 200 y. BP. However, while the population was reduced to 9 females and one male after the 1962 epizootic, the immigration of reproducing males right after the bottleneck and rapid demographic recovery (Fig. 2c) induced a rapid ~20% reduction (from F_ROH ~ 0.25 to 0.2) in inbreeding until the 1970s when inbreeding started to gradually increase.

**Fig. 5: Genome-informed simulations recapitulating the recent population history of Crater Lion from 1820 to 2020.**

There was also a ~ 50% increase in realised load relative to the GSE population (Fig. 5a), consistent with increasing inbreeding and with the estimated realised load based on empirical data (Fig. 4c). Moreover, the masked load showed a temporal decrease of <1% over 200 years as it is gradually converted to realised load via the effects of inbreeding and drift since the time of the isolation of the Crater population. This decrease is consistent with the lower number of heterozygous High impact variants relative to GSE (Supplementary Fig. 3). However, the migration of seven males in 1964-1965 and between 2013 and 2018, led to a reduction in inbreeding and realised load and induced an increase in heterozygosity and masked load.

In contrast, the number of deleterious variants in any category did not vary substantially between the Crater and GSE. Nevertheless, there was an indication of reduction in Very Strongly deleterious variants resulting from the isolation of the population c. 200 y. BP (Fig. 5b). This is not entirely surprising since the most deleterious variants should be purged relatively early during a bottleneck through purifying selection as they have the most impact on fitness³. Furthermore, a small number of simulations showed increases in those variants following the epizootic of 1962 and the migration of seven males in 1964-1965 (Fig. 5b).

We ran our simulations over an additional 100 years period, until the year 2120 to assess the effect of male migration (i.e., 0 to 10 effective migrants per decade) on genome-wide variation. These simulations indicated that between one to five effective male migrants per decade would be required to remain within a 5% window of change in heterozygosity, inbreeding and load (Fig. 6) for a realistic carrying capacity K of 50 to 100 individuals for the Crater population. Below one effective migrant per decade, there would be a substantial risk of negative genetic effects, with >40% and >20% increase in inbreeding and realised load, respectively. There would also be a > 10% reduction in heterozygosity, especially for a K of 50 individuals. Consistent with theory³ and empirical data⁴³, low migration (i.e., M = 0-1) would induce a reduction in masked load whereas higher migration (i.e., M > 1) would lead to an increase through the introduction of new genetic variation. These effects would be stronger for smaller K values, where overall genetic diversity is likely to be lower than for larger K values. Thus, with little to no migration over 100 years, the masked load would be reduced through the effect of purging, whereas overall diversity would also be reduced (Fig. 6d).

**Fig. 6: Prediction of future changes in genome-wide variation over 100 years for various migration rates.**

Conservation implications

In spite of severe population declines driven by landscape fragmentation and an epizootic, the Ngorongoro Crater lion population has recovered quickly since the 1970s. Yet, the population shows evidence of inbreeding and inbreeding depression. Our simulations show that rapid demographic recovery and periodic effective migration can counteract the negative genetic effects of these bottlenecks and our empirical data provide an example of how continued geographical isolation driven by habitat fragmentation may negate the positive effects of population recovery by exacerbating genetic drift. While speculative, this drift effect may be affected by male territoriality that reduces the opportunities for immigration and the establishment of breeding status of males from outside the Crater, thereby exacerbating the effects of genome erosion. Taken together, our results supports theory, which posits that demographic increase and stability alone is not a panacea for preventing loss of genetic variation and that gene flow is required to enable the genomic recovery of small populations^44,45.

Our simulations and a growing number of empirical studies^6,28,30,38 show that a gradual increase in inbreeding could facilitate the reduction of some of the genetic load through purging. Thus, while gene flow from an outbred to an inbred population will contribute to a genetic rescue effect^44,45, it will also represent a risk of introducing deleterious variation. In a highly inbred population, this newly introduced deleterious variation could readily be expressed, reducing overall fitness and thus increasing the risk of population extinction. For instance, the inbred Isle Royale wolf⁴⁶ population experienced a rapid population decline after the effective migration of a single male. Consequently, fostering periodic and long-term gene flow into the Crater before the population becomes too inbred would reduce the risk of expression of deleterious variation and population collapse.

Given the logistical challenges of translocations and the possible behaviourally-mediated resistance of immigration into the small Crater population¹³, fostering long-term connectivity with the GSE would increase the chance of effective migration (i.e., mating) events or the temporary interactions of dispersing Serengeti males just outside the Crater with Crater-born females^15,47. Mitigation of human-lion conflicts in the connecting landscape between the Crater and Serengeti to maintain, and possibly further improve, dispersal opportunities¹⁵ thus likely represents the best long-term management option to reduce the risk of future genome erosion in Crater lions.

Our study improves our understanding of genome erosion in fragmented populations driven by human activities. Importantly, combining empirical genomic data with forward simulations provides deeper insights into the dynamics of genetic variation and genetic load⁴⁰ and of the threats of genome erosion in endangered populations.

Methods

Ethical statement

We have complied with all relevant ethical regulations for animal use. All research fieldwork and data collection were conducted in accordance with the Tanzania Wildlife Research Institute’s regulations. It was carried out under the yearly renewed research permits granted to our research project titled “Balancing Pastoralist Livelihoods and Wildlife Management in Ngorongoro”, and to each individual researcher, by the Tanzania Commission for Science and Technology (COSTECH; Dar es Salaam, Tanzania; rclearance@costech.or.tz) and Tanzania Wildlife Research Institute (TAWIRI; Arusha, Tanzania; researchclearance@tawiri.or.tz).

Data collection and sequencing

We obtained tissue samples for 15 wild lions from the Ngorongoro Conservation area (NCA; n = 13), and Selous (n = 2; Supplementary Data 1). The NCA lion samples were collected in the Crater (n = 10), Ndutu (n = 2; Serengeti plains, on border of Serengeti National park), and Endulen (n = 1; between the Crater and Serengeti, from a lion born in Ndutu). Samples from lion were collected using biopsy darting, or by extracting a small piece of tissue from the ear using a biopsy punch on animals immobilised for other reasons, e.g. collaring, and from a dead lion. Samples selected for the WGS were selected from the more distantly related individuals (i.e., different prides, different parents), based on our observation data. Relatedness was estimated with vcftools⁴⁸ using the –relatedness flag and ranged between 0.09 and 0.16, indicating distant relationships among genomes. DNA extraction was performed using a DNeasy Blood & Tissue Kit (Qiagen, Hilden, Germany) on a Qiagen Symphony extraction robot. Genomic library preparation from modern DNA extracts was performed using a Illumina TruSeq DNA PCR-free Library Preparation Kit. Paired-end libraries were built with 150 bp size selection using 1000 ng sheared DNA input, at the Science for Life Laboratories (SciLifeLab), Stockholm, aiming for a minimum target coverage of ~15X. Libraries were sequenced using one Illumina HiSeq X lane using a 2x150bp setup.

Additionally, we obtained genomic data for 5 lions from Tanzania (n = 2), Botswana (n = 2) and South Africa (n = 1; PRJNA611920³¹). Based on the PCA clustering results, we grouped our samples for downstream analyses such as: Crater (n = 10), Serengeti (n = 5), Selous (n = 2), Botswana (n = 2), South Africa (n = 1).

Genome data mapping

We processed raw genomic data using the GenErode bioinformatics pipeline⁴⁹. Adapter trimming was done with fastp v0.22.0⁵⁰ and reads were mapped to the chromosome-level Lion assembly (P.leo_Ple1_pat1.1; GCF_018350215.1) using BWA-MEM v0.7.17⁵¹. Read were then sorted using SAMtools v1.12⁵², duplicates removed with picard MarkDuplicates v2.26.6 (http://broadinstitute.github.io/picard/), and reads realigned around indels using GATK IndelRealigner v3.4.0⁵³.

We called variant using the mpileup command of bcftools v1.8⁵⁴, filtering out variants using a minimum depth of coverage (DP4) of ~1/3 (i.e., 5X) of the average depth of coverage, and base quality QV ≥ 30. Indels and SNPs within 5 bp of indels were removed. We also filtered out SNPs in heterozygous state that were not in an allelic balance (i.e., number of reads displaying the reference allele/depth) of < 0.2 and > 0.8 in order to avoid biases caused by contamination, mapping or sequencing error. After merging all individual vcf files, we excluded the sex chromosomes (NC_056697.1, NC_028302.1) and masked repeats with BEDtools v2.27.1⁵⁵.

We retained 20 genomes for the Principal Component Analysis (PCA) without filtering for missing data and obtained 4,560,409 SNPs. For estimates of heterozygosity, inbreeding and genetic load, we only retained the 16 genomes with the highest coverage (i.e., \(\ge\)14X; Supplementary Data 1). After filtering for missing data with bcftools we retained 3,308,190 SNPs.

Population structure and past demography

We first performed a PCA in PLINK v2⁵⁶. We then estimated individual-based ancestry to infer the number of genetic clusters with ADMIXTURE v1.3.0⁵⁷ for K = 1–6 and using the cross-validation error estimation (--cv option). We estimated past fluctuations in effective size (N_e) using two complementary approaches. First, we used the Pairwise Sequentially Markovian Coalescent (PSMC) v0.6.5⁵⁸. We excluded the X chromosome and generated consensus autosomal sequences for five high-coverage genomes with SAMtools mpileup⁵², using base and mapping quality filters (-Q 30, -q 30) and with the vcf2fq command from vcfutils.pl. We excluded sites with depth < 5X and excluded positions with more than two times the average coverage estimated for each genome (i.e. ~30-40X). To infer the TMRCA between each chromosome from each individual genome, we used the following parameters: Number of iterations, N = 25; Tmax, t = 15; atomic time interval, p = 64 (4 + 25*2 + 4 + 6, for each of which parameters are estimated with 28 free interval parameters). We used a substitution rate of 4.5e⁻⁹ substitutions/site/generation and a generation time of five years³¹. Secondly, we reconstructed the past demography of Crater lions over the past 200 generations using GONE³³ which estimates changes in N_e calculated as the geometric mean over 40 independent estimates from the observed spectrum of linkage disequilibrium (LD). We used autosomal chromosomes and the following parameters: PHASE = 2; cMMb=1.11³⁵; DIST = 1; NGEN = 2000; NBIN = 400; MAF = 0.0; ZERO = 1; maxNCHROM=85; maxNSP=50000; REPS = 40; threads = −99. We reduced the hc value from the default of 0.05 to 0.01 to avoid biases caused by recent immigration as suggested by Santiago et al. ³³. We performed 10 replicate runs for the population and assumed a generation time of five years³¹.

We also estimated population divergence time between the Crater (n = 10) and Serengeti (Ndutu, n = 2; Endulen, n = 1; Tanzania, n = 2) for nuclear data using SMC + + v1.15.2⁵⁹. We used a substitution rate of 4.5e⁻⁹ substitutions/site/generation³¹ and used the ‘estimation validation’ approach with –em-iterations 5000, and –thinning 1300, –regularization-penalty 6, –polarization-error of 0.5, –ftol of 1e-7, –c 1000000 and –xtol of 1e-7. We reconstructed the SFS for each population separately and the joint SFS between two populations using the smc + + vcf2smc command. We then used the smc + + split command to calculate the marginal estimate of joint demography. For this analysis, we performed 50 bootstrap replicates for each population and for 15 chromosomes to reduce computational time and estimate the mean and 5^th and 95^th percentiles. We also assumed a generation time of five years³¹.

Heterozygosity and inbreeding

We first used mlRho v2.7⁶⁰ to estimate the individual mutation rate (θ), which approximates the genome-wide heterozygosity measured as the number of heterozygous sites per 1,000 bp. We down-sampled each genome to the average coverage of the genome with lowest coverage (i.e., 14X), filtered out bases with quality (-Q) < 30, mapped sequencing reads with mapping quality (-q) <30 and positions with root-mean-squared mapping quality (MQ) < 30 from the historical and modern bam files. We also filtered out sites with depth <2X and higher across all our genomes to avoid false heterozygous sites.

We identified runs of homozygosity (ROH) using the sliding-window approach implemented in PLINK v2. We estimated inbreeding coefficients (F_ROH) by dividing the sum of all ROH by the size of the genome (autosomes only). We used the following parameters: a sliding window size of 100 SNPs (homozyg-window-snp 100); no more than 1 heterozygous site per window to assume a window as homozygous (homozyg-window-het 1); at least 5% of all windows including a given SNP to define the SNP as being in a homozygous segment (homozyg-window-threshold 0.05); a homozygous segment was defined as a ROH if the segment included ≥ 25 SNPs (homozyg-snp 25) and covered ≥ 100 kb (homozyg-kb 100); the minimum SNP density was one SNP per 50 kb (homozyg-density 50); and the maximum distance between two neighbouring SNPs was ≤ 1,000 kb (homozyg-gap 1,000). Finally, we set the value at 750 heterozygous sites within ROH (homozyg-het 750) in order to prevent sequencing errors to cut ROH. We statistically compared heterozygosity and F_ROH among populations using Wilcoxon signed-rank tests in R⁶¹ for ROH\(\ge\)100 kb (i.e., background relatedness) and ROH\(\ge \) 2Mb (i.e., recent inbreeding events).

Using the length of ROH, we also inferred the timing of inbreeding by solving g = 100/(2rL)⁶², where g corresponds to the number of generations back in time, L to the length of ROH in Mb, and r to the recombination rate. We used a r of 1.11 cM/Mb estimated in felids³⁵ and a generation time of 5 years³¹. Inferred times based on ROH lengths are shown in Supplementary Data 5.

Genetic load

To estimate genetic load, we first generated an ancestral felid genome using domestic cat (SRR1179888-SRR1179901), tiger (SRR13242485) and cheetah (SRR22273180) to polarise our multi-individual vcf. We mapped short reads to the lion assembly (GCF_018350215.1) as described in the ‘Genome data mapping’ section and subsampled each of these three genomes to a depth of 6X and merged them with samtools merge. We then used ANGSD v0.917⁶³ to generate a consensus ancestral genome using the doFasta 2 and doCounts 1 options. Next, we used a custom script, to polarise the vcf to the ancestral allele (see Code availability section).

We then annotated synonymous and non-synonymous variants in coding regions using SnpEff v4.3³⁶. We removed gene models with in-frame STOP codons, missing START and terminal STOP codons (-J option) and genes labelled as pseudogenes (--no-pseudo option) with Cufflinks v2.2.1⁶⁴ and obtained a total of 19,491 genes. We identified three categories of variants: a) Synonymous; b) Moderate: non-disruptive variants that might change protein effectiveness; c) High: variants assumed to have high (disruptive) impact on protein, probably causing protein truncation, loss of function or triggering nonsense mediated decay and including stop gained codons, splice donor variant and splice acceptor as well as start codon lost³⁶. We also excluded intergenic (-no-intergenic) and intron (-no-intron) variants. For each variant category, we recorded the number of homozygous and heterozygous variants and summed the total number of variants. We estimated the total load and corrected for potential mapping biases arising from different sample types (i.e., batch effects associated with different datasets and unequal distance to outgroup) by calculating the ratio of deleterious variants (High and Moderate impact) to Synonymous SNPs, as previously described in Xue et al. ³⁷. We also calculated the individual realised load (i.e., total number of homozygous variants of category i divided by twice the total number of segregating sites for category i) as described in Mathur & DeWoody⁶⁵. We compared the differences in load among populations using Wilcoxon signed-rank tests in R.

To take into account the frequency of variants in each population, we also calculated the R_xy ratio for High and Moderate impact variants comparing the Crater (n = 10) vs Serengeti/Selous (n = 5) populations following Xue et al. ³⁷. We used a random number intergenic SNPs corresponding to the number of each impact type for standardisation, which makes R_xy robust against sampling effects and population substructure³⁷. We then estimated allele frequencies for intergenic, High and Moderate impact variants using PLINK and used custom scripts to calculate the R_xy. An R_xy equal to 1 corresponds to no change in frequency between two populations, whereas R_xy < 1 or >1 corresponds to a deficit/decrease or an excess/increase in frequency in population x (i.e., Crater) relative to population y (i.e., Serengeti/Selous), respectively. We used a jack-knife procedure in R to estimate the variance in the R_xy ratio.

Gene ontology

To identify putative genes associated with lower fitness in Crater lions and carrying High and Moderate impact variants, we used the Mouse Genome Informatics database (www.informatics.jax.org³⁹) as well as literature searches to manually retrieve gene ontologies and mammalian phenotype information for each candidate gene.

Population genomic simulations

The recent demographic history of the Crater lion population is well documented. However, it is crucial to assess whether our empirical results are consistent with the recent history of the population and assess the plausibility of our results interpretations. We thus performed forward genomic simulations in SLiM 4.0^41,42 to assess the potential effects of variable gene flow on the genomic variation of the Crater population from the 1950s until the present day and to predict the effects of future effective migration on genome-wide variation over the following 100 years.

We used a non-Wright-Fisher model (nonWF) which allows for overlapping generations and where each cycle in the simulation corresponds to a year. Also, the probability of an individual surviving from one year to the next is given by its absolute fitness, which ranges from 0 to 1 and which is determined by its genetic composition. Population size N is an emergent parameter controlled by carrying capacity (K) and is the outcome of a stochastic process of reproduction and viability selection. If N > K, the absolute fitness is rescaled downward by the ratio of K/N. Therefore, these models did not allow for population growth beyond K but instead allow for N to fluctuate around K.

To avoid the fitness of all individuals increasing to 1 in case of severe decline and to allow for viability selection and impacts of inbreeding depression, density-dependent selection was rescaled following Robinson et al. ⁶⁶ by drawing the new individual fitness as min(K/N, 1.0).

We created scenarios recapitulating the population history of the Greater Serengeti Ecosystem (GSE) and Crater lion based on the demographic reconstructions from the PSMC and the recent population history and monitoring data^12,13. To convert N_e into N (i.e., K), we used a conservative N_e/N_C = 0.1⁶⁷. After 400,000 years of burnin for a large ancestral population (K_Ancestral = 50,000), we modelled a K_{GSE-Ancestral} = 5000 between 5000 and 600 years Before Present (y. BP) and then modelled a decline a K_{Historical-Present-GSE} = 2000, between 600 y. BP and the present time¹². We then modelled a population split between the GSE population and the Crater population 200 y. BP (i.e., based on the SMC + + divergence estimate; Supplementary Fig. 1) followed by a decline to K_1950s = 100 in ~1950 (Supplementary Data 8). We then modelled recent population fluctuations based on the well-documented demographic history of the Crater lion until the 2020s (Supplementary Data 3; Fig. 2c). From a population of ~100 in the 1950s, the population declined to nine adult females and one male due to an epizootic in 1962¹². The following year, the population numbered ~15 lions which are the founders of the current population. We also modelled the arrival of breeding males: seven in 1964 and 1965, one in 1993, a coalition of four males in 2013, one male in 2015 and a coalition of two males in 2018. For each time interval, we set K based on the census data (Supplementary Data 3, 8; Fig. 1c). We then ran the simulations for an additional 100 years period until the year 2120 to assess the future effects of gene flow by simulating 0, 1, 5 or 10 effective migrants (M) per decade and for K values of 50, 100, 150 or 200.

Reproduction was modelled with the simplifying assumptions of a first age at reproduction of 4 for males and females and last year of reproduction at 10 and 14 years old for males and females, respectively. We assumed a harem-like (polygamous) reproduction system with 69% and 78% of males allowed to reproduce for the GSE and Crater population, respectively¹³. Breeding occurred every two years¹³. We assumed a maximum of two litters (i.e., mating with up to two different females) per male per breeding cycle allowing for 40% and 60% of males mating with one and two females respectively to reflect the higher reproductive success of some males in a coalition (Supplementary Data 8). Assuming a litter size of 2.4 cubs (min=1; max=3)¹³, we used offspring number probability values (i.e., weights) for each mating event of 0.2, 0.2 and 0.6 for 1, 2 and 3 cubs respectively. The cub sex ratio was set at 0.5 The model assumed a maximum longevity of 16 and 14 years for females and males, respectively and different age-specific mortality for each sex (Supplementary Data 8). We assumed an adult sex-ratio (4-14+ years old) of 2.75 (SD = 0.83) for the Crater population and of 2.65 (SD = 0.42) for the GSE population. When simulating gene flow, we assumed that females were philopatric¹³ and only allowed males to immigrate and mate with local individuals during the year of arrival. In subsequent years, these males were joined to the pool of mating individuals with the same probability of mating as resident males.

We simulated 10 chromosomes containing each 500 genes of 1750 bp long following Robinson et al. ⁶⁶. We randomly generated deleterious (non-synonymous) mutations in exonic regions at a ratio of 2.31:1 to neutral (synonymous) mutations⁶⁸. For selection coefficients (s) of non-synonymous mutations we used distributions based on estimates in humans⁶⁹ to model very strongly, strongly, moderately and weakly deleterious mutations as well as lethal mutations and using a gamma distribution a mean s = −0.01314833 and shape = 0.186 (Supplementary Data 8). For dominance coefficients (h), we assumed an inverse relationship between h and s^70,71 with h = 0.0 for very strongly deleterious mutations (s < -0.1), h = 0.01 for strongly deleterious mutations (-0.1 ≤ s < -0.01), h = 0.1 for moderately deleterious mutations (-0.01 ≤ s < -0.001), and h = 0.4 for weakly deleterious mutations (s > -0.001). For neutral and lethal mutations s was set to a fixed value of 0 and -1, respectively. We used a mutation rate of 1e⁻⁹ mutations/site/year (i.e., 4.5e⁻⁹ mutations/site/generation³¹) and assumed a generation time of five years³¹. For recombination rate, we assumed no recombination within genes, a rate of 1e⁻³ between genes, and free recombination between chromosomes.

Since we only simulated 5000 genes, we rescaled estimates of realised and masked load by a factor of 4 so that our estimates would correspond to a complete lion genome with ~20,000 protein coding genes (GCF_018350215.1). For the realised load, we used the exponential of the simulated value to the power of 4, since genetic load is multiplied across sites. For the masked load, we multiplied the simulated values by 4, since inbreeding load is summed across sites.

Summary statistics included population size (N), mean heterozygosity, mean F_ROH ( >0.1 Mb), the mean number of each non-synonymous mutation category (i.e., weakly to very highly deleterious deleterious) as well as mean realised load (i.e., reduction in fitness due to segregating and fixed deleterious mutations calculated multiplicatively across sites; 1 - estimated Fitness) and mean masked load (i.e., the quantity of recessive deleterious variation concealed in heterozygotes, measured as the sum of diploid lethal equivalents) as estimated in Kyriazis et al. ⁷. We estimated all statistics based on a sample of 15 individuals every 2 years. For all K-M combinations we ran a total of 80 replicates each starting with a different seed. We then plotted indices of diversity through time, changes in the number of deleterious alleles and the 5% change in genetic indices over 100 years in R.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

Resequencing data (ENA BioProject): PRJEB80542 (this study), PRJNA611920, PRJNA182708, PRJNA16726, PRJNA854353, PRJNA684344. The provided accession codes are also available at NCBI. See Supplementary Data 4 for estimates of Heterozygosity, inbreeding and genetic load.

Code availability

Code for data processing and analysis, and simulation are deposited to Github: https://github.com/ndussex/Crater_lion_genomics.

References

Bertorelle, G. et al. Genetic load: genomic estimates and applications in non-model animals. Nat. Rev. Genet. 23, 492–503 (2022).
Article CAS PubMed Google Scholar
Lynch, M., Conery, J. & Burger, R. Mutation Accumulation and the Extinction of Small Populations. Am. Nat. 146, 489–518 (1995).
Article Google Scholar
Dussex, N., Morales, H. E., Grossen, C., Dalén, L. & van Oosterhout, C. Purging and accumulation of genetic load in conservation. Trends Ecol. Evol. 38, 961–969 (2023).
Article PubMed Google Scholar
Hedrick, P. W. & Garcia-Dorado, A. Understanding inbreeding depression, purging, and genetic rescue. Trends Ecol. Evol. 31, 940–952 (2016).
Article PubMed Google Scholar
von Seth, J. et al. Genomic trajectories of a near-extinction event in the Chatham Island black robin. BMC Genomics 23, (2022). 747.
Article Google Scholar
Dussex, N. et al. Population genomics of the critically endangered kākāpō. Cell Genomics 1, 100002 (2021).
Article CAS PubMed PubMed Central Google Scholar
Kyriazis, C. C. et al. Genomic underpinnings of population persistence in isle royale moose. Mol. Biol. Evol. 40, msad021 (2023).
Article CAS PubMed PubMed Central Google Scholar
Cavill, E. L. et al. When birds of a feather flock together: Severe genomic erosion and the implications for genetic rescue in an endangered island passerine. Evol. Appl. 17, e13739 (2024).
Article PubMed PubMed Central Google Scholar
Wang, X., Peischl, S. & Heckel, G. Demographic history and genomic consequences of 10,000 generations of isolation in a wild mammal. Curr. Biol. 33, 2051–2062.e4 (2023).
Article CAS PubMed Google Scholar
Pinto, A. V., Hansson, B., Patramanis, I., Morales, H. E. & van Oosterhout, C. The impact of habitat loss and population fragmentation on genomic erosion. Conserv. Genet. 25, 49–57 (2024).
Article CAS Google Scholar
Dobson, F. S. & Jones, W. T. Multiple Causes of Dispersal. Am. Nat. 126, 855–858 (1985).
Article Google Scholar
Packer, C. et al. Case study of a population bottleneck: Lions of the Ngorongoro Crater. Conserv. Biol. 5, 219–230 (1991).
Article Google Scholar
Packer, C. The Lion: Behavior, Ecology, and Conservation of an Iconic Species. (Princeton University Press, 2023).
Jansson, I. et al. Coexistence from a lion’s perspective: movements and habitat selection by African lions (Panthera leo) across a multi-use landscape. Plos One (2024).
Parsons, A. W. et al. The benefits of inclusive conservation for connectivity of lions across the Ngorongoro Conservation Area, Tanzania. Conservation, Science and Practice in press, (2025).
Pusey, A. E. & Packer, C. The evolution of sex-biased dispersal in lions. Behaviour 101, 275–310 (1987).
Article Google Scholar
Spong, G., Stone, J., Creel, S. & Björklund, M. Genetic structure of lions (Panthera leo L.) in the Selous Game Reserve: implications for the evolution of sociality: Genetic structure and social groups. J. Evol. Biol. 15, 945–953 (2002).
Article Google Scholar
Morrison, J. C., Sechrest, W., Dinerstein, E., Wilcove, D. S. & Lamoreux, J. F. Persistence of large mammal faunas as indicators of global human impacts. J. Mammal. 88, 1363–1380 (2007).
Article Google Scholar
Marchant, R. et al. Drivers and trajectories of land cover change in East Africa: Human and environmental interactions from 6000 years ago to present. Earth Sci. Rev. 178, 322–378 (2018).
Article Google Scholar
Hazzah, L., Borgerhoff Mulder, M. & Frank, L. Lions and Warriors: Social factors underlying declining African lion populations and the effect of incentive-based management in Kenya. Biol. Conserv. 142, 2428–2437 (2009).
Article Google Scholar
Ikanda, D. & Packer, C. Ritual vs. retaliatory killing of African lions in the Ngorongoro Conservation Area, Tanzania. Endanger. Species Res. (2008) https://doi.org/10.3354/esr0n120.
Sinclair, A. R. E., Dobson, A., Mduma, S. A. R. & Metzger, K. L. 2. Shaping the Serengeti Ecosystem. in Serengeti IV 11–30 (University of Chicago Press, 2015).
Kissui, B. M. & Packer, C. Top-down population regulation of a top predator: lions in the Ngorongoro Crater. Proc. Biol. Sci. 271, 1867–1874 (2004).
Article PubMed PubMed Central Google Scholar
Munson, L. et al. Climate extremes promote fatal co-infections during canine distemper epidemics in African lions. PLoS One 3, e2545 (2008).
Article PubMed PubMed Central Google Scholar
Smitz, N. et al. A genome-wide data assessment of the African lion (Panthera leo) population genetic structure and diversity in Tanzania. PLoS One 13, e0205395 (2018).
Article PubMed PubMed Central Google Scholar
Munson, L. et al. Genetic diversity affects testicular morphology in free-ranging lions (Panthera leo) of the Serengeti Plains and Ngorongoro Crater. J. Reprod. Fertil. 108, 11–15 (1996).
Article CAS PubMed Google Scholar
Wildt, D. E. et al. Reproductive and genetic consequences of founding isolated lion populations. Nature 329, 328–331 (1987).
Article PubMed Central Google Scholar
Grossen, C., Guillaume, F., Keller, L. F. & Croll, D. Purging of highly deleterious mutations through severe bottlenecks in Alpine ibex. Nat. Commun. 11, 1001 (2020).
Article CAS PubMed PubMed Central Google Scholar
Dussex, N. et al. Adaptation to the High-Arctic island environment despite long-term reduced genetic variation in Svalbard reindeer. iScience 26, 107811 (2023).
Article PubMed PubMed Central Google Scholar
Khan, A. et al. Genomic evidence for inbreeding depression and purging of deleterious genetic variation in Indian tigers. Proc. Natl. Acad. Sci. USA118, e2023018118 (2021).
Article CAS PubMed PubMed Central Google Scholar
de Manuel, M. et al. The evolutionary history of extinct and living lions. Proc. Natl. Acad. Sci. USA117, 10927–10934 (2020).
Article PubMed PubMed Central Google Scholar
Bertola, L. D. et al. Phylogeographic Patterns in Africa and High Resolution Delineation of Genetic Clades in the Lion (Panthera leo). Sci. Rep. 6, (2016). 30807.
Article CAS PubMed PubMed Central Google Scholar
Santiago, E. et al. Recent Demographic History Inferred by High-Resolution Analysis of Linkage Disequilibrium. Mol. Biol. Evol. 37, 3642–3653 (2020).
Article CAS PubMed Google Scholar
Verschuren, D., Laird, K. R. & Cumming, B. F. Rainfall and drought in equatorial east Africa during the past 1,100 years. Nature 403, 410–414 (2000).
Article CAS PubMed Google Scholar
Li, G., Figueiró, H. V., Eizirik, E. & Murphy, W. J. Recombination-Aware Phylogenomics Reveals the Structured Genomic Landscape of Hybridizing Cat Species. Mol. Biol. Evol. 36, 2111–2126 (2019).
Article CAS PubMed PubMed Central Google Scholar
Cingolani, P. et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly 6, 80–92 (2012).
Article CAS PubMed PubMed Central Google Scholar
Xue, Y. et al. Mountain gorilla genomes reveal the impact of long-term population decline and inbreeding. Science 348, 242–245 (2015).
Article CAS PubMed PubMed Central Google Scholar
van der Valk, T., Díez-Del-Molino, D., Marques-Bonet, T., Guschanski, K. & Dalén, L. Historical genomes reveal the genomic consequences of recent population decline in eastern gorillas. Curr. Biol. 29, 165–170.e6 (2019).
Article PubMed Google Scholar
Blake, J. A. et al. The Mouse Genome Database: integration of and access to knowledge about the laboratory mouse. Nucleic Acids Res. 42, D810–D817 (2014).
Article CAS PubMed Google Scholar
Kyriazis, C. C., Robinson, J. A. & Lohmueller, K. E. Using Computational Simulations to Model Deleterious Variation and Genetic Load in Natural Populations. Am. Naturalist 202, 737–752 (2023).
Article Google Scholar
Haller, B. C. & Messer, P. W. SLiM 3: Forward Genetic Simulations Beyond the Wright-Fisher Model. Mol. Biol. Evol. 36, 632–637 (2019).
Article CAS PubMed PubMed Central Google Scholar
Haller, B. C. & Messer, P. W. SLiM 4: Multispecies Eco-Evolutionary Modeling. Am. Nat. 201, E127–E139 (2023).
Article PubMed PubMed Central Google Scholar
Smeds, L. & Ellegren, H. From high masked to high realized genetic load in inbred Scandinavian wolves. Mol. Ecol. 32, 1567–1580 (2023).
Article PubMed Google Scholar
Frankham, R. Genetic rescue of small inbred populations: meta-analysis reveals large and consistent benefits of gene flow. Mol. Ecol. 24, 2610–2618 (2015).
Article PubMed Google Scholar
Whiteley, A. R., Fitzpatrick, S. W., Funk, W. C. & Tallmon, D. A. Genetic rescue to the rescue. Trends Ecol. Evol. 30, 42–49 (2015).
Article PubMed Google Scholar
Robinson, J. A. et al. Genomic signatures of extensive inbreeding in Isle Royale wolves, a population on the threshold of extinction. Sci. Adv. 5, eaau0757 (2019).
Article PubMed PubMed Central Google Scholar
Jansson, I. Connectivity lions (Panthera leo) across pastoralist landscape: Coexistence, conflict, collective action lion conservation Ngorongoro. (Swedish University of Agricultural Sciences, Umeå, Sweden, Tanzania, 2024).
Danecek, P. et al. The variant call format and VCFtools. Bioinformatics 27, 2156–2158 (2011).
Article CAS PubMed PubMed Central Google Scholar
Kutschera, V. E. et al. GenErode: a bioinformatics pipeline to investigate genome erosion in endangered and extinct species. BMC bioinformatics 23, 228 (2022).
Chen, S., Zhou, Y., Chen, Y. & Gu, J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34, i884–i890 (2018).
Article PubMed PubMed Central Google Scholar
Li, H. & Durbin, R. Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics 26, 589–595 (2010).
Article PubMed PubMed Central Google Scholar
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Article PubMed PubMed Central Google Scholar
McKenna, A. et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 20, 1297–1303 (2010).
Article CAS PubMed PubMed Central Google Scholar
Li, H. A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics 27, 2987–2993 (2011).
Article CAS PubMed PubMed Central Google Scholar
Quinlan, A. R. BEDTools: The Swiss-Army Tool for Genome Feature Analysis. Curr. Protoc. Bioinforma. 47, 11.12.1–34 (2014).
Google Scholar
Chang, C. C., et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 4, 7 (2015).
Article PubMed PubMed Central Google Scholar
Alexander, D. H., Novembre, J. & Lange, K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res 19, 1655–1664 (2009).
Article CAS PubMed PubMed Central Google Scholar
Li, H. & Durbin, R. Inference of human population history from individual whole-genome sequences. Nature 475, 493–496 (2011).
Article CAS PubMed PubMed Central Google Scholar
Terhorst, J., Kamm, J. A. & Song, Y. S. Robust and scalable inference of population history from hundreds of unphased whole genomes. Nat. Genet. 49, 303–309 (2017).
Article CAS PubMed Google Scholar
Haubold, B., Pfaffelhuber, P. & Lynch, M. mlRho - a program for estimating the population mutation and recombination rates from shotgun-sequenced diploid genomes. Mol. Ecol. 19, 277–284 (2010).
Article PubMed PubMed Central Google Scholar
R. Core Team. R: A Language and Environment for Statistical Computing. (R Foundation for Statistical Computing, Vienna, Austria, 2020).
Thompson, E. A. Identity by descent: variation in meiosis, across genomes, and in populations. Genetics 194, 301–326 (2013).
Article CAS PubMed PubMed Central Google Scholar
Korneliussen, T. S., Albrechtsen, A. & Nielsen, R. ANGSD: Analysis of Next Generation Sequencing Data. BMC Bioinforma. 15, 356 (2014).
Article Google Scholar
Trapnell, C., et al. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat. Protoc. 7, 562–578 (2012).
Article CAS PubMed PubMed Central Google Scholar
Mathur, S. & DeWoody, J. A. Genetic load has potential in large populations but is realized in small inbred populations. Evol. Appl. 14, 1540–1557 (2021).
Article CAS PubMed PubMed Central Google Scholar
Robinson, J. A. et al. The critically endangered vaquita is not doomed to extinction by inbreeding depression. Science 376, 635–639 (2022).
Article CAS PubMed PubMed Central Google Scholar
Frankham, R. Effective population size/adult population size ratios in wildlife: a review. Genet. Res. 89, 491–503 (1995).
Article Google Scholar
Huber, C. D., Kim, B. Y., Marsden, C. D. & Lohmueller, K. E. Determining the factors driving selective effects of new nonsynonymous mutations. Proc. Natl. Acad. Sci. USA114, 4465–4470 (2017).
Article CAS PubMed PubMed Central Google Scholar
Kim, B. Y., Huber, C. D. & Lohmueller, K. E. Inference of the Distribution of Selection Coefficients for New Nonsynonymous Mutations Using Large Samples. Genetics 206, 345–361 (2017).
Article PubMed PubMed Central Google Scholar
Agrawal, A. F. & Whitlock, M. C. Inferences about the distribution of dominance drawn from yeast gene knockout data. Genetics 187, 553–566 (2011).
Article CAS PubMed PubMed Central Google Scholar
Huber, C. D., Durvasula, A., Hancock, A. M. & Lohmueller, K. E. Gene expression drives the evolution of dominance. Nat. Commun. 9, 2750 (2018).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank the Government of Tanzania, TAWIRI, and the Ngorongoro Conservation Area Authority for their support, and especially acknowledge their veterinarian, late Dr. A.R. Nyaki. We also acknowledge sequencing support from the Swedish National Genomics Infrastructure (NGI) at the Science for Life Laboratory in Uppsala and Stockholm supported by the Swedish Research Council and the Knut and Alice Wallenberg Foundation. The computations and data handling was enabled by resources provided by the National Academic Infrastructure for Supercomputing in Sweden (NAISS) and the Swedish National Infrastructure for Computing (SNIC) partially funded by the Swedish Research Council through grant agreements no. 2022-06725 and no. 2018-05973. The sample collection was supported by the Swedish Research Council through grant no. 2014-03382, and by various funds raised by KopeLion Inc. We thank the two anonymous reviewers for the constructive comments and suggestions.

Funding

Open access funding provided by Swedish Museum of Natural History.

Author information

Authors and Affiliations

Department of Population Analysis and Monitoring, Swedish Museum of Natural History, SE-106 91, Stockholm, Sweden
Nicolas Dussex
Molecular Ecology Group, Department of Wildlife, Fish, and Environmental Studies, Swedish University of Agricultural Sciences, SE-901 83, Umeå, Sweden
Ingela Jansson, Anita Norman & Göran Spong
Centre for Palaeogenetics, Svante Arrhenius väg 20C, SE-106 91, Stockholm, Sweden
Tom van der Valk
Department of Bioinformatics and Genetics, Swedish Museum of Natural History, SE-106 91, Stockholm, Sweden
Tom van der Valk
Department of Ecology, Evolution and Behavior, University of Minnesota, MN 55108, St. Paul, MN, USA
Craig Packer
School for Field Studies, Centre for Wildlife Management Studies, Karatu, Tanzania
Bernard M. Kissui
Tanzania Wildlife Research Institute (TAWIRI), Arusha, Tanzania
Ernest E. Mjingo
Luke, FI 00790, Helsinki, Finland
Göran Spong

Authors

Nicolas Dussex
View author publications
Search author on:PubMed Google Scholar
Ingela Jansson
View author publications
Search author on:PubMed Google Scholar
Tom van der Valk
View author publications
Search author on:PubMed Google Scholar
Craig Packer
View author publications
Search author on:PubMed Google Scholar
Anita Norman
View author publications
Search author on:PubMed Google Scholar
Bernard M. Kissui
View author publications
Search author on:PubMed Google Scholar
Ernest E. Mjingo
View author publications
Search author on:PubMed Google Scholar
Göran Spong
View author publications
Search author on:PubMed Google Scholar

Contributions

G.S. and N.D. conceived and designed the study. I.J. collected the NCA samples. G.S. acquired and generated genomic data. N.D. analysed the data. G.S., N.D., I.J., T.v.d.V., B.M.K., E.E.M., A.N. and C.P. interpreted the data. G.S. provided funding. N.D. wrote the manuscript with input from the other authors.

Corresponding authors

Correspondence to Nicolas Dussex or Göran Spong.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Communications Biology thanks Benjamin N. Sacks and the other, anonymous, reviewer for their contribution to the peer review of this work. Primary Handling Editors: Pavel Flegontov and Michele Repetto. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Transparent Peer Review file

Supplementary Information

Description of Additional Supplementary Files

Supplementary Data 1-8

reporting summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Dussex, N., Jansson, I., van der Valk, T. et al. Constraints to gene flow increase the risk of genome erosion in the Ngorongoro Crater lion population. Commun Biol 8, 640 (2025). https://doi.org/10.1038/s42003-025-07986-0

Download citation

Received: 22 November 2024
Accepted: 21 March 2025
Published: 21 April 2025
DOI: https://doi.org/10.1038/s42003-025-07986-0