Main

Pathogens have been a constant threat to human health throughout our evolutionary history. Until around 1850, at least a quarter of all children died before the age of one, and around another quarter before turning 15. Infectious diseases are estimated to have been responsible for more than half of these deaths6. Larger disease outbreaks have profoundly affected human societies, sometimes devastatingly affecting entire civilizations7. Infectious diseases have left lasting impressions on human genomes, as selective pressures from pathogens have continuously shaped human genetic variation8,9. Where and when different human pathogens first emerged, how and why they spread, and how they affected human populations are important but largely unresolved questions.

During the Holocene (beginning roughly 12,000 years ago), the agricultural transition created larger and more sedentary communities, facilitating pathogen transmission and persistence within populations10. Simultaneously, the rise of animal husbandry and pastoralism are thought to have increased the risk of zoonoses3. Technological advances, such as horses and carts, increased both mobility and the risk of disease transmission between populations11. It has been proposed that these changes led to the so-called ‘first epidemiological transition’ characterized by increased infectious disease mortality3. However, direct evidence remains scarce, and the idea is debated12. Palaeopathological examinations of ancient skeletons offer insights into past infectious disease burden13, but are limited to the few diseases identifiable from the available tissue. Recent advances in ancient DNA (aDNA) techniques allow for the retrieval of direct genomic evidence of past microbial infections, enabling the reconstruction of complete ancient pathogen genomes. These studies have typically focused on specific pathogens and have provided surprising insights into the evolutionary history of the causative agents of some of the most historically important infectious diseases affecting humans, including plague (Yersinia pestis)14,15, tuberculosis (Mycobacterium tuberculosis)16,17, smallpox (variola virus)18,19, hepatitis B (hepatitis B virus, HBV)20,21,22 and others23,24,25,26,27,28. However, there is an unmet need to investigate the combined landscape of ancient bacteria, viruses and parasites that affected our ancestors across various regions and time periods. Here we use a new high-throughput computational workflow to screen for ancient microbial DNA and use our data to investigate long-standing questions in paleoepidemiology: when and where did important human pathogens arise? And what factors influenced their spatiotemporal distribution?

Ancient microbial DNA from 1,313 Eurasians

To understand the distribution of ancient pathogens, we developed an accurate and scalable workflow to identify ancient microbial DNA in shotgun-sequenced aDNA data (Extended Data Figs. 14 and Supplementary Information 1). The data (roughly 405 billion sequencing reads) are derived from 1,313 ancient individuals from western Eurasia (n = 1,015; 77%), central and north Asia (n = 265; 20%) and southeast Asia (n = 33; 3%), spanning a roughly 37,000-year period, from the Upper Palaeolithic to historical times (Fig. 1, Supplementary Table 1 and Supplementary Information 2). As burial practices varied across cultures and time, these samples represent a subset of groups within past societies. Nevertheless, the identified pathogens probably affected the broader population, as diseases spread easily in communities with poor sanitation and hygiene29. Initial metagenomic classification showed a large fraction of reads classified as soil-dwelling taxa including genera such as Streptomyces or Pseudomonas, reflecting a predominantly environmental source of microbial DNA. Further characterization using a topic model, however, suggested that microbial DNA in ancient tooth samples often derives from genera commonly associated with the human oral microbiome such as Actinomyces or Streptococcus (Extended Data Fig. 1d–g).

Fig. 1: Dataset overview.
figure 1

Spatiotemporal distribution and site contexts of the study samples. White squares in the geographic maps indicate locations of the full set of n = 1,313 study samples, whereas coloured circles highlight location and age of samples from the time period and region indicated in the respective panel. Bar plots show the number of samples obtained from different site type contexts in each region. ND, not determined.

We selected a set of 136 bacterial and protozoan genera (11,553 species total) containing human pathogenic species2 as well as 1,356 viral genera (259,979 species total) for further authentication and detection of ancient taxa. We found that ancient microbial DNA was widely detected, with 5,486 authenticated individual hits identified across 1,005 samples (Z-score for aDNA damage rate from metaDMG of greater than or equal to 1.5; Fig. 2a, Supplementary Table 2 and Extended Data Fig. 4). Of those, 3,384 hits were found among 214 known human pathogen species2, with the remaining 2,104 hits involving 278 other species. The highest numbers were observed in bacterial genera associated with the human oral microbiome, such as Actinomyces (380; 28.5% of samples) and Streptococcus (242; 18.1% of samples), or those commonly found in soil environments, such as Clostridium (252; 18.9% of samples) and Pseudomonas (111; 8.3% of samples).

Fig. 2: Overview and characteristics of detected ancient microbial DNA.
figure 2

a, Bar plot showing the total number of putative ancient microbial hits (overall detection rate in brackets) for bacterial, eukaryotic and viral (n ≥ 4) genera. Bar colour and shading distinguishes counts in the different aDNA damage categories. b,c, Distributions of ANI (b) and log10 fold change of mapped reads over median of reads classified at taxonomic rank of genus per sample (c) for individual species hits in selected example genera. Symbol colour indicates aDNA damage category.

We observed marked differences in the distributions of the genetic similarity of the ancient microbial sequences to their reference assemblies, both among genera and between species within a genus (Fig. 2b and Extended Data Fig. 5). High average nucleotide identity (ANI) indicates that ancient microbial sequences are closely related to a reference assembly in the modern database, and was observed in hits across all species from some genera (for example, Yersinia, Fig. 2b). In other genera, only a few hits had a closely related database reference assembly match. An example is the genus Mycobacterium, in which only hits of the leprosy-causing bacterium Mycobacterium leprae were highly similar to their reference assembly (ANI > 99%, Fig. 2b). Low ANI indicates that the ancient microbial DNA is only distantly related to the reference assembly, for example, because of aDNA damage, poor representation of the diversity of the genus in the database or false-positive classification of ancient microbial reads deriving from a related genus (Extended Data Fig. 3). Alternatively, ANI can also be reduced when reads mapped to a particular reference assembly originate from many closely related strains or species in a sample. To test for such mixtures, we quantified the rate of observing different alleles at two randomly sampled reads at nucleotide positions across the genomes of hits with read depths greater than or equal to one. We found a high multi-allele rate in many species associated with the human oral microbiome, such as Streptococcus sanguinis or Treponema denticola. Hits for these species also showed lower ANI, consistent with the expectation for mixtures of ancient microbial DNA (Extended Data Fig. 6a).

The rate of read mapping varied by orders of magnitude between species, from hits in species with high read recruitment, such as M. leprae (greater than 100-fold enrichment over the median number of classified reads across target genera) to hits at the lower limits of detection, for example, for the louse-borne pathogen Borrelia recurrentis (lowest read recruitment roughly 100-fold less than the median number of classified reads across target genera; Fig. 2c and Extended Data Fig. 5b). Ancient microbial DNA from species commonly found in soil, such as Clostridium botulinum, was detected at similar rates in tooth and bone samples. Conversely, species associated with the human oral microbiome (for example, Fusobacterium nucleatum, Streptococcus mutans and Porphyromonas gingivalis) or pathogenic infections (for example, Y. pestis and HBV) were significantly more frequently identified in tooth samples (Extended Data Fig. 6b). To further verify hits with low read numbers, we performed a BLASTn search for all reads of each hit with n ≤ 100 final reads (n = 712 hits total; Supplementary Table 3). Most hits showed a high proportion (greater than or equal to 80%) of reads assigned to the same species using BLASTn, and the species with the most top-ranked BLASTn hits generally matched the inferred hit species (Extended Data Fig. 7a,b).

Our results show that ancient microbial DNA isolated from human remains originates from complex mixtures of distinct endogenous and exogenous sources. The high detection rate, high read recruitment, lower ANI and evidence of mixtures in genera such as Clostridium or Pseudomonas (Fig. 2 and Extended Data Figs. 5 and 6) suggest that a substantial fraction of this ancient microbial metagenome derives from environmental sources, possibly associated with the ‘necrobiome’ involved in post-mortem putrefaction processes30,31 (Supplementary Information 3). By contrast, species from other frequently observed genera, including Actinomyces or Streptococcus, were predominantly identified from teeth and probably originated from the endogenous oral microbiome30. Species representing likely cases of pathogenic infections (for example, Y. pestis and M. leprae) were often characterized by higher ANI and/or low multi-allele rate, consistent with pathogen load predominantly originating from a single dominant strain.

The ancient Eurasian pathogen landscape

Our dataset provides a unique opportunity to investigate the origins and spatiotemporal distribution of human pathogens in Eurasia, expanding the known range of some ancient pathogenic species and identifying others not previously reported using paleogenomic data (Supplementary Tables 3 and 5).

Considering bacterial pathogens, we found widespread distribution of the plague-causing bacterium Y. pestis, consistent with previous studies15. We identified 42 putative cases of Y. pestis (35 newly reported, Extended Data Fig. 6e), corresponding to a detection rate of roughly 3% in our samples. These newly identified cases expand the spatial and temporal extent of ancient plague over previous results (Fig. 3). The earliest three cases were dated between around 5,700–5,300 calibrated years before present (cal. bp), across a broad geographic area ranging from western Russia (NEO168, 5,583–5,322 cal. bp), to central Asia (BOT2016, 5,582–5,318 cal. bp) and to Lake Baikal in Siberia32 (DA342, 5,745–5,474 cal. bp). This broad range of detection among individuals predating 5,000 cal. bp challenges previous interpretations that early plague strains represent only isolated zoonotic spillovers33. We replicated previously identified cases of plague in Late Neolithic and Bronze Age (LNBA) contexts across the Eurasian Steppe15 and identified many instances in which several individuals from the same burial context were infected (Afanasievo Gora, Russia; Kytmanovo, Russia; Kapan, Armenia; Arban 1, Russia) (Supplementary Table 2). These results indicate that the transmissibility and potential for local epidemic outbreaks for strains at those sites were probably higher than previously assumed33. Finally, 11 out of 42 cases were identified in late medieval and early modern period individuals (800–200 bp) from two cemeteries in Denmark (Aalborg, Randers), highlighting the high burden of plague during this time in Europe. All but one hit (NEO627, n = 84 reads total) showed expected coverage for the virulence plasmids pCD1 and pMT1, with hits before 2,500 years bp characterized by the previously reported absence of a 19 kilobase region on pMT1 containing the ymt gene15 (Extended Data Fig. 8c and Supplementary Information 4).

Fig. 3: Spatiotemporal distribution of selected ancient pathogens.
figure 3

Each panel shows geographic distribution (top) and timeline (bottom) for identified cases of the respective pathogen (indicated by coloured circles). Geographic locations and age distributions of all n = 1,313 study samples are shown in each panel using white squares.

Another bacterial pathogen frequently detected was the spirochaete bacterium B. recurrentis, the causative agent of louse-borne relapsing fever (LBRF), a disease with a mortality of 10–40% (Supplementary Information 5). Whereas previous paleogenomic evidence for LBRF is limited to a few cases from Scandinavia and Britain23,34, we report 34 new putative cases (2.5% detection rate; Extended Data Fig. 6e), with wide geographic distribution across Europe, central Asia and Siberia (Fig. 3). We detected the earliest case in a Neolithic farmer individual from Scandinavia (NEO29, Lohals, 5,647–5,471 cal. bp), suggesting that human body lice were already vectors for infectious disease during the Neolithic period, supported by phylogenetic analyses of B. recurrentis34. The highest detection rates were found during the Iron and Viking Ages. LBRF outbreaks were historically associated with crowded living conditions, poor personal hygiene and wet and cold seasons, but are rare today in most regions (Supplementary Information 5). Our results indicate that B. recurrentis infections exerted a substantial disease burden on past populations.

We also report new cases of other bacterial pathogens previously detected in paleogenomic data. The leprosy-causing bacterium M. leprae was identified in seven individuals (0.5% detection rate) from Scandinavia and only appeared from the Late Iron Age onwards (earliest case RISE174, 1,523–1,339 cal. bp). Because M. leprae can infect both red squirrels and humans35, and archaeological evidence demonstrates that fur trade from Scandinavia, including squirrel fur, increased substantially during the late Iron and Viking Ages36, our results support the suggestion that squirrel fur trade could have facilitated transmission37. Our findings are also consistent with the widespread distribution of leprosy in medieval Europe38. We further detected three putative cases of Treponema pallidum—subspecies of which are the causative agents of treponematoses such as yaws, and endemic and venereal syphilis—in three individuals from recent time periods (earliest case 101809T, Denmark, 600–500 bp; Extended Data Fig. 7). Two cases were identified in individuals from Borneo in southeast Asia (around 500–300 years bp), expanding the range of paleogenomic evidence for treponemal disease into this region.

Among the species newly reported using paleogenomic data, we identified 12 putative cases of Yersinia enterocolitica, the causative agent of yersiniosis, commonly contracted through consuming contaminated raw or undercooked meat (Fig. 3). The animal reservoirs for Y. enterocolitica include boars, deer, horses, cattle and sheep. As Y. enterocolitica rarely enters the bloodstream, our results probably underestimate the disease burden. This species includes some of the only identified putative zoonotic infections in individuals from Mesolithic hunter-gatherer contexts (NEO941, Denmark, 6,446–6,302 cal. bp). We also detected other members of the order Enterobacterales, transmitted through the faecal–oral route, including members of the genera Shigella, Salmonella and Escherichia (Supplementary Table 2). We report the earliest evidence for ancient leptospirosis (genus Leptospira) dating back to the Neolithic, 5,650–5,477 cal. bp (NEO46, Sweden; Leptospira borgpetersenii). Whereas earlier cases predominantly involved L. borgpetersenii (n = 5, 0.4% detection rate), most hits were Leptospira interrogans (n = 20, 1.5% detection rate), almost exclusively in Scandinavian contexts from the Viking Age onwards (Fig. 3). L. borgpetersenii is today primarily found in cattle, whereas L. interrogans is detected more broadly in both domestic and wild animals. Although the clinical manifestations are similar, with an untreated fatality rate of 1% today, transmission routes vary39. Although host-to-host transmission predominates for L. borgpetersenii, transmission by means of urine-contaminated environments dominates for L. interrogans. We also report two putative cases of Corynebacterium diphtheriae, the causative agent of diphtheria; the oldest of which dates back to the Mesolithic (Sidelkino, 11,336–11,181 cal. bp) (Supplementary Table 2).

Other diseases associated with animals and livestock, such as listeriosis (Listeria monocytogenes) and brucellosis (genus Brucella), could not be reliably identified. Another main human pathogen not identified in our dataset is M. tuberculosis, which causes tuberculosis. However, as the M. tuberculosis load in blood is typically low in immunocompetent patients without advanced disease40 and latent tuberculosis develops in 60% of cases and can persist for decades, it is, on the basis of current knowledge, unlikely to be readily identified using aDNA data from tooth and bone remains sampled for ancient human DNA.

Identifying eukaryotic pathogens is challenging as sequence contamination from other organisms frequently occurs in their large and often fragmented reference genomes41. An illustrative example in our dataset is the protozoan parasite Toxoplasma gondii, which we readily identified in hits with high ANI and aDNA damage but low support from coverage evenness statistics owing to reads mapping to short contigs representing human contamination (Extended Data Fig. 4a,b). Despite these challenges, we identified nine putative malaria infections across three different human-infecting species (Plasmodium vivax n = 5; Plasmodium malariae n = 3; Plasmodium falciparum n = 1; Extended Data Fig. 7 and Supplementary Table 2). The most widely detected parasite species was P. vivax, with the earliest evidence in a Bronze Age individual from central Europe (RISE564, 4,750–3,750 bp based on an archaeological context). Other cases include a medieval individual from central Asia (DA204, Kazakhstan; 1,053–1,025 cal. bp) and two Viking Age individuals from eastern Europe (VK224, 950–750 bp and VK253, 950–850 bp; Russia). The P. vivax malaria vector Anopheles atroparvus is at present widespread in Europe and nearby regions, including the Pontic Steppe, and our cases indicate this was also true in the past42. The single case of P. falciparum malaria was found in a sample from Armenia (NEO111; 463-0 cal. bp), where malaria was eliminated in the 1960s43.

Among DNA viral species, we found widespread infections with HBV (28 cases, 2.1% detection rate), consistent with previous studies20,21,22 (Extended Data Fig. 6e). Our newly reported HBV cases include individuals from Mesolithic (Kolyma River, n = 1) and Neolithic (Lake Baikal, n = 3) contexts in Siberia dating back to 9,906–9,665 cal. bp, expanding the spatiotemporal range of ancient HBV into those regions (Extended Data Fig. 7). We also report a putative ancient case (n = 1) of torque teno virus dating back roughly 7,000 years (NEO498, Ukraine; 7,161–6,950 cal. bp). Torque teno virus infects around 80% of the human population today and, although it is not associated with any particular disease, it replicates rapidly in immunocompromised individuals44. Other ancient virus hits included viruses not known to infect humans, such as ancient phage DNA (for example, Escherichia phage T4, Proteus virus Isfahan; Supplementary Table 2) and one putative case of an ancient insect virus (Invertebrate iridescent virus 31 (IIV-31)) in a tooth sample of a Viking Age individual from Sweden (VK30, Varnhem; 950–650 bp)45. The virus source is probably exogenous, potentially originating from aDNA of food sources in the tooth remains.

Coinfections with many pathogens can worsen disease progression and outcomes46 and they were probably an important morbidity factor in ancient human populations. Searching for individuals showing co-occurrence of distinct ancient microbial species, we identified 15 cases of putative coinfections in our dataset (Supplementary Table 2). A striking case was a Viking Age individual from Norway (VK388), in which we replicated previous results of infection with a probably smallpox-causing variola virus19 and furthermore found evidence of infection with the leprosy-causing bacterium M. leprae. Another case of possible coinfection with M. leprae was found in VK366, a Viking Age individual from Denmark, who also showed evidence for leptospirosis (L. interrogans). Among the 15 cases, 6 involved coinfections of HBV with non-viral pathogens (Y. pestis n = 3; B. recurrentis n = 2; P. malariae n = 1; Supplementary Table 2). This suggests that some of these cases involved chronic hepatitis, possibly reflecting HBV infection during infancy, when hepatitis becomes chronic in 90–95% of modern cases compared to only 2–6% in adult infections. An intriguing early case of a possible coinfection was found in a Mesolithic hunter-gatherer from Russia (Sidelkino, 11,336–11,181 cal. bp). This individual showed evidence of the respiratory pathogen C. diphtheriae and Helicobacter pylori, usually restricted to gastric infections; however, rare contemporary examples of bacteraemia have been reported for both47,48. Overall, our results show that coinfections can be detected using ancient metagenomic screening but are probably underestimated given methodological limitations such as differences in pathogen load, tissue availability and other factors affecting detectability of ancient microbial DNA.

Temporal dynamics and drivers of incidence

Understanding the factors affecting the dynamics of past epidemics is a main aim of paleoepidemiology. Our dataset allows us to address this question using direct molecular evidence for ancient pathogens across prehistory. To investigate changes in pathogen incidence over time, we performed Bayesian change-point detection and time series decomposition on two pathogens with high detection rates, Y. pestis (plague) and B. recurrentis (LBRF), using the detection rate of the respective pathogen as a proxy for its incidence (Methods). For plague, we inferred a gradual rise in detection rate starting from roughly 6,000 bp, about 1,000 years after the estimated time to the most recent common ancestor of now-known ancient strains (7,100 cal. bp)33. It reached a first peak around 5,000 bp across Europe and the Eurasian Steppe, coinciding with the emergence and early spread of the LNBA− strains, believed to have had limited flea-borne transmissibility15,49 (Fig. 4). Detection remained high with extra peaks for a roughly 3,000-year period, until an abrupt change around 2,800 bp led to a roughly 800-year period in which plague was only detected in one sample (VK522, Oland, Sweden 2,343–2,154 cal. bp). Starting at roughly 2,000 bp, plague reappeared in three samples from central Asia (DA92, DA101, DA104, Kazakhstan and Kyrgyzstan; Extended Data Fig. 7 and Supplementary Table 2), just before the first historically documented plague pandemic (Fig. 4). Another hiatus of roughly 600 years led to a rise and peak associated with the second plague pandemic roughly 600 bp (European late medieval cases, Denmark and previously published cases; Fig. 4). This pattern of change coincides with the extinction of the LNBA− strains roughly 2,700 bp (ref. 49) and the second Y. pestis diversification event starting roughly 3,700 bp, which gave rise to an extinct Bronze Age lineage (RT5, LNBA+)50 and present-day lineages; these had increased flea-mediated transmission adaptations favouring bubonic plague and led to all known later plague pandemics51. The adaptations included acquiring two plasmids: one with the ymt gene for survival in the flea midgut and another with the pla gene for invasiveness after transmission52. The lack of detection during both periods is also seen in publicly available ancient Y. pestis genomes from other Eurasian sites, suggesting that sampling bias is unlikely to substantially influence the observed dynamics.

Fig. 4: Bayesian time series decomposition of main epidemic pathogens.
figure 4

Panels show estimated trendlines and 95% credible interval for detection rates (top), probability distributions and locations (dotted lines) for change points (middle) and probability of trend slope (bottom) being positive (red), negative (blue) or zero (white), inferred using Bayesian change-point detection and time series decomposition. Tops of panels show temporal distributions of newly reported pathogen hits (blue circles) as well as previously published ancient pathogens (grey circles) from the respective species.

The inferred temporal dynamics of LBRF show a first peak in detection around 5,500 bp, slightly more recent than for plague, but with more sporadic occurrences and sharper peaks during the first roughly 2,000 years (Fig. 4). The geographic extent during the early period ranges from Scandinavia (NEO29, Denmark, 5,647–5,471 cal. bp) to the Altai mountains (RISE503, Russia, 3,677–3,461 cal. bp) (Fig. 3 and Supplementary Table 2). From roughly 2,800 bp, LBRF was detected more consistently, peaking around 2,000 years ago, predominantly in the Eurasian Steppe region (Fig. 3). This change from epidemic outbreaks to endemicity overlaps in time with the estimated emergence of a distinct B. recurrentis Iron Age clade34 (Supplementary Information 5). The period of high LBRF detection coincided with a time without detectable plague activity (Fig. 4), reinforcing that the absence of plague is not due to sample size limitations or poor DNA preservation. This opposing pattern is unlikely to result from any cross-immunity between Y. pestis and B. recurrentis but could plausibly, in part, be caused by population size decreases and behavioural and societal adjustments during plague epidemics. LBRF remained detectable until the end of the time series, particularly in Europe; the continued presence might have facilitated the emergence of a medieval B. recurrentis clade roughly 600 years bp (ref. 34) (Supplementary Information 5) (Figs. 3 and 4).

A striking feature shared in the temporal dynamics of plague and LBRF was the absence of detectable cases before roughly 6,000 bp, coinciding with a transition of individuals in predominantly hunter-gatherer contexts to those in farming or pastoralist cultural contexts. It has been proposed that this transition led to a higher risk of zoonotic disease transmission and facilitated the spread of both old and new pathogens3. Our dataset allows us to test this hypothesis using molecular evidence for infectious disease burden. To increase power to detect changes in the load of different pathogen types, we focused on grouped ancient microbial hit categories (Supplementary Table 4).

We found that species associated with the ancient oral microbiome showed the highest relative detection rate, accounting for up to 50% of ancient hits across various periods (Fig. 5a and Extended Data Fig. 9). Species in the ‘environmental’ classes of probably exogenous origins were also detected at consistent rates throughout time. Species in the ‘infection’ classes occurred at low detection rates throughout (mostly less than 10%). We found that species in the ‘zoonotic’ reservoir classes were not detected until around 6,500 bp (Fig. 5a). Using Bayesian time series decomposition, we inferred an overall increase in the detection rates of the zoonotic reservoir classes from roughly 6,000 bp, remaining at elevated levels until the medieval period (Fig. 5b and Extended Data Fig. 9a). Whereas species in the ‘anthroponotic’ reservoir classes also occur earlier (predominantly species with human-to-human transmission, Extended Data Fig. 10a), we observe increased detection rates from roughly 2,500 bp onwards (Fig. 5b and Extended Data Fig. 9). Our results thus provide direct evidence for an epidemiological transition of increased infectious disease burden after the onset of agriculture through to historical times.

Fig. 5: Time series of ancient microorganisms by microbial source.
figure 5

a, Timeline of relative detection rates in sliding windows of n = 21 temporally consecutive samples for different ancient microbial species classes. Coloured horizontal lines indicate the expected rates if species in all classes would be detected at equal rates based on the total number of distinct species in each class. b, Trendlines for detection rates inferred using Bayesian change-point detection and time series decomposition for ancient microbial species in the zoonotic (top) and anthroponotic (bottom) reservoir class.

We used Bayesian spatiotemporal modelling to investigate possible drivers of the observed ancient microbial incidences. We modelled the presence or absence of either individual microbial species or combined species groups using sets of putative covariates, including spatiotemporal variables (longitude, latitude and sample age), paleoclimatic variables (mean annual temperature and precipitation), human mobility and ancestry, sample material (tooth or other) and a proxy for ‘detectability’ (the number of human-classified reads). In the models for the zoonotic or anthroponotic infection species classes, sample age was an important predictor, consistently negatively associated with incidence, and high effect sizes in the individual species models for B. recurrentis and L. interrogans (Extended Data Fig. 11 and Supplementary Table 6). Longitude was another important factor in the infection classes; it was positively associated with incidence rates for the combined anthroponotic class, and in individual models for Y. pestis and B. recurrentis. The positive effect of longitude suggests a higher detection rate in the eastern part of our spatiotemporal range, where samples from the Eurasian Steppe predominate.

The increased infection incidence in Steppe populations could reflect an increased genetic susceptibility or a higher risk of acquiring diseases associated with the pastoralist lifestyle. The latter suggestion seems more plausible as continued exposure to selective pressures from certain infectious diseases probably would reduce susceptibility in these populations. Human ancestry showed small but consistent positive effects in some models, particularly the infection classes, for the Caucasus hunter-gatherers. Across all models, the incidence of ancient microorganisms was positively associated with teeth as sample material; the highest effect sizes were found in the oral microbiome and infection classes (Extended Data Fig. 11). Teeth preserved ancient oral microbiome and pathogen DNA better than petrous bones (the source of 86% of our samples), probably because of oral cavity exposure and better access to microbial DNA in the bloodstream53. These results support the notion that species detected in those classes are predominantly of endogenous origin.

Conclusions

During the Holocene, human lifestyles changed considerably as agriculture, animal husbandry and pastoralism became key practices but the impact on infectious disease incidence is debated. Our study represents a large-scale characterization of ancient pathogens across Eurasia, providing clear evidence that identifiable zoonotic pathogens emerged around 6,500 years ago and were consistently detected after 6,000 years ago. Although zoonotic cases probably existed before 6,500 years ago, the risk and extent of zoonotic transmission probably increased with the widespread adoption of husbandry practices and pastoralism. Today, zoonoses account for more than 60% of newly emerging infectious diseases54.

We observed some of the highest detection rates at roughly 5,000 bp, a time of substantial demographic changes in Europe due to the migration of Steppe pastoralists and the displacement of earlier populations4,5. Steppe pastoralists, through their long-term continuous exposure to animals, probably developed some immunity to certain zoonoses and their dispersals may have carried these diseases westwards and eastwards. Consequently, the genetic upheaval in Europe could have been facilitated by epidemic waves of zoonotic diseases causing population declines, with depopulated areas subsequently being repopulated by opportunistic settlers who intermixed with the remaining original population. This scenario would mirror the population decline of Indigenous people in the Americas following their exposure to diseases introduced by European colonists55,56. Our findings support the interpretation of increased pathogen pressure as a likely driver of positive selection on immune genes associated with the risk of multiple sclerosis in Steppe populations roughly 5,000 years ago57, and immune gene adaptations having occurred predominantly after the onset of the Bronze Age in Europe9.

Expanding our analyses to the broader pathogen landscape allowed us to infer and contrast incidence patterns between different species and types of pathogens to a greater extent than previously possible. If ancient pathogen DNA of a single species is not detected in a particular region or period, asserting whether this is due to low disease incidence or confounding factors such as differential DNA preservation between different periods and environments is challenging. Our analyses counter these limitations; we demonstrate that pathogens with known epidemic potential and high detection rates, such as Y. pestis (plague) and B. recurrentis (LBRF), show notable differences in their detection rate over time, suggesting that low detection rates in these cases represent an actual reduction in incidence. During the early period (roughly 5,700–2,700 years ago), the continuous detection of Y. pestis is suggestive of endemic disease. The succeeding pattern of distinct waves and periods without detection indicate epidemic outbreaks; these detection peaks match the historically described plague pandemics. This shift from endemic to epidemic is concurrent with important changes in the Y. pestis genome, particularly increased flea-transmissibility and pathogenicity15,50. The pattern for B. recurrentis is almost entirely the opposite, with narrow peaks and long periods without detection, suggesting local epidemics before roughly 2,700 years ago and consistent detection afterwards. This later endemicity of LBRF could be driven by changes in the bacterial genome and by human and environmental factors known to increase the risk of louse infestation34,58,59. Experimental studies have demonstrated that Y. pestis, like B. recurrentis, can infect body lice in the midgut and, sometimes, also the Pawlowsky gland, a putative salivary gland59. Body lice infected in the Pawlowsky gland can transmit Y. pestis in concentrations sufficient to initiate disease in humans, possibly contributing to transmission during plague outbreaks. Infected body lice have higher mortality than uninfected lice, and it remains unknown whether coinfection of body lice with Y. pestis and B. recurrentis is possible.

Our study has some important limitations. Whereas ancient shotgun metagenomic data offer direct evidence of past infections, their usefulness depends on having a high pathogen load and the right tissue samples. Our ancient tooth and bone samples are well suited to detect high-load bloodstream infections such as Y. pestis and B. recurrentis, but pathogens with lower loads or different tissue preferences are underrepresented. Moreover, differentiating ancient infections from those arising from environmental sources, such as the necrobiome, is challenging. Finally, our dataset lacks information on RNA viruses, therefore underestimating the zoonotic disease burden. However, the timing is probably accurate as the conditions favouring zoonotic transmission of RNA viruses are similar to those of other zoonotic pathogens60.

Our findings demonstrate how the nascent field of genomic paleoepidemiology can create a map of the spatial and temporal distribution of diverse human pathogens over millennia. This map will develop as more ancient specimens are investigated, as will our abilities to match their distribution with genetic, archaeological and environmental data. Our current map shows clear evidence that lifestyle changes in the Holocene led to an epidemiological transition, resulting in a greater burden of zoonotic infectious diseases. This transition profoundly affected human health and history throughout the millennia and continues to do so today.

Methods

Dataset

We compiled a dataset of aDNA shotgun-sequencing data from 1,313 ancient individuals previously sequenced for studies of human population history (references for previous publications describing laboratory procedures and sample and/or site descriptions in Supplementary Table 1). To facilitate ancient microbial DNA authentication, we excluded sequencing libraries subjected to uracil-DNA glycosylase (UDG) treatment that removes characteristic aDNA damage patterns from further analyses. Samples sequenced across several libraries were combined into single analysis units to maximize sensitivity for detection of ancient microbial DNA present in low abundance.

Ancient microbial DNA screening

We carried out screening for ancient microbial DNA using a computational workflow combining k-mer-based taxonomic classification, read mapping and aDNA authentication. We first performed taxonomic classification of the sequencing reads (minimum read length 30 base pairs (bp)) using KrakenUniq61, against a comprehensive database of complete bacterial, archaeal, viral, protozoan genomes in the RefSeq database (built with default parameters of k-mer size 31 and low-complexity sequences masked). To increase sensitivity for ancient viral DNA, we reran the classification on a viral-specific database of complete viral genomes and neighbour assemblies from RefSeq (https://www.ncbi.nlm.nih.gov/genome/viruses/about/assemblies/), using all reads classified as non-human from the previous run.

Following this initial metagenomic classification, a subset of genera was further processed in the genus-level read mapping and authentication stages. For bacterial pathogens, we selected genera with two or more established species of human pathogens from a recent survey of human bacterial pathogens2 (n = 125 genera). Genera with a single pathogenic species were not included to balance between including genera responsible for substantial human pathogenic burden and computational feasibility. We further included genera including human protozoan pathogens (n = 11 genera), as well as all viral genera (n = 1,356).

For each genus of interest showing more than or equal to 50 unique k-mers assigned, all sequencing reads classified were collected and aligned in parallel against a representative reference assembly for each individual species within the genus. We selected the assembly with the most unique k-mers assigned as the representative reference genome for each species in a particular sample. If no reads were assigned to any assembly of the species in KrakenUniq, we selected the first assembly for mapping. Read mapping against the selected assembly was carried out using bowtie2 (ref. 62), using the ‘very sensitive’ preset and allowing one mismatch in the seed (-N 1 option). Mapped BAM files were subjected to duplicate marking using ‘samtools markdup’63, and filtered for mapping quality MAPQ ≥ 20. The aDNA damage rates were estimated using metaDMG64.

Authentication of ancient microbial DNA

To authenticate ancient microbial DNA, we calculated sets of summary statistics quantifying expected molecular characteristics of true positive ancient microbial DNA hits65:

Similarity to the reference assembly

Summary statistics in this category measure how similar sequencing reads are to a particular reference assembly, with true positive hits expected to show higher similarity than false-positive hits. Summary statistics used include the following.

Average edit distance

This describes the average number of mismatches in sequencing reads mapped to a particular reference (lower means more similar to the reference).

ANI

The ANI is the average number of bases in a mapped sequencing read matching the reference assembly, normalized by the read length (higher means more similar to the reference).

Number of unique k-mers assigned

The number of unique k-mers that are assigned to a particular reference assembly from the KrakenUniq classification (higher means more similar to the reference).

aDNA characteristics

Summary statistics in the aDNA characteristics category measure the evidence for sequencing reads deriving from an aDNA source. Summary statistics used include the following.

Average read length

This describes the average length in base pairs of sequencing reads mapped to a particular reference (shorter means more likely to be ancient).

Terminal aDNA substitution rates

The terminal aDNA substitution rate is the frequency of C>T (G>A) substitutions observed at the 5′ (3′) terminal base across all sequencing reads mapped to a particular reference (higher means more likely to be ancient).

Bayesian D max

Bayesian Dmax is an estimator of the aDNA damage rate from metaDMG (higher means more likely to be ancient).

Bayesian Z

Bayesian Z is an estimator of the significance of evidence for the aDNA damage rate from metaDMG (higher means more likely to be ancient).

Evenness of genomic coverage

Summary statistics in this category measure how evenly mapped sequencing reads are distributed across a reference assembly. Summary statistics used include the following.

Average read depth

The average read depth is the average number of reads covering a base in the reference assembly.

Breadth of coverage

The breadth of coverage describes the fraction of the reference assembly that is covered by one or more sequencing reads.

Expected breadth of coverage

The breadth of coverage expected for a particular average read depth is calculated66 as

$$1-{{\rm{e}}}^{-({\rm{average\; read\; depth}})}$$

Ratio of observed to expected breadth of coverage

This is the ratio of the breadth of coverage observed in the mapping to the breadth of coverage expected given observed average read depth (higher means more even coverage).

Relative entropy of read start positions

This relative entropy is a measure for the information content of the genomic positions of mapped reads. To obtain this statistic, we calculate the frequency of read alignments with their start positions falling within windows along the reference assembly, using two different window sizes (100 and 1,000 bp). The obtained frequency vector is converted into Shannon information entropy, and normalized using the maximum entropy attainable if the same total number of reads were evenly distributed across the windows (higher means more even coverage).

Filtering of putative ancient microbial hits

From this initial screening, we then selected a subset of putative microbial ‘hits’ (sample–species combinations) for further downstream analysis based on a set of aDNA authentication summary statistics:

  • number of mapped reads greater than or equal to 20

  • 5′ C>T deamination rate greater than or equal to 0.01

  • 3′ G>A deamination rate greater than or equal to 0.01

  • ratio of observed to expected breadth of coverage greater than or equal to 0.8

  • relative entropy of read start positions greater than or equal to 0.9

  • ANI > 0.965

  • rank of number of unique k-mers assigned less than or equal to 2

For this initial filtered list of putative microbial hits, we ran metaDMG using the full Bayesian inference method to obtain Z scores measuring the strength of evidence for observing aDNA damage.

The final list of putative individual ancient microbial hits was then obtained using the filtering cutoffs

  • metaDMG Bayesian Dmax ≥ 0.05

  • metaDMG Bayesian Z ≥ 1.5

  • rank of number of unique k-mers assigned = 1

For authentication of viral species, we used the same filtering cutoffs described above, except for a lower ANI cutoff (greater than 0.95), as well as a lower cutoff for relative entropy of read start positions (greater than 0.7) for short viral genomes (fewer than 10 kilobases).

The result of this filtering is a single best-matching species hit for each sample and genus of interest (Supplementary Table 2). We note that this approach will miss potential cases in which aDNA from many species of the same genus are present in the sample. However, because of the considerable challenges involved in distinguishing this scenario from false positives due to cross-mapping of ancient reads from a single source of DNA to reference assemblies of a closely related species (for example, Y. pestis/Yersinia pseudotuberculosis), we opted for the conservative option of retaining only the best hit for each genus.

To further authenticate putative hits with low read counts (n ≤ 100 final reads), we carried out a BLASTn analysis. We extracted the reads for a species hit from the final filtered BAM files and queried them against the ‘nt’ database (downloaded 18 August 2024) using ‘blastn -task blastn -max_hsps 1’. For the reads of each putative ancient microbial hit, we then tabulated the number of times and proportion of the highest scoring BLAST hits matched either the genus or species inferred from our workflow (Supplementary Table 3).

Simulations of ancient microbial DNA

We simulated aDNA fragments from microbial reference genomes in silico using gargammel67. We chose nine species representing pathogens of interest, and for each selected an assembly not present in the pathogen screening workflow database:

For each reference genome, we simulated 5 million single-end sequencing reads (100 bp read length) with adapter sequences, with read length distribution and damage patterns from a mapDamage2 results of a previously published ancient pathogen genome (RISE509, Y. pestis15). The full-length simulated reads were then adapter-trimmed using AdapterRemoval68. To investigate the ability of the workflow to detect low abundance ancient microorganisms, we randomly down-sampled the full read set for each reference genome using seqtk (https://github.com/lh3/seqtk) (50, 100, 200, 500 reads; ten replicates each).

Topic model analysis

We carried out topic model analysis on taxonomic classification profiles for each sample using the R package fastTopics69 (https://github.com/stephenslab/fastTopics). We used the number of unique k-mers assigned to non-human genera from KrakenUniq as the observed count data for each sample, excluding genera with fewer than 50 unique k-mers assigned. The analysis was carried out for L = 2 and L = 3 topics to capture broad structures in the classification profiles.

Ancient microbial groups

For combined analyses, we grouped the ancient microbial hits into three categories on the basis of the likely source of the microbial DNA (Supplementary Table 4):

  1. 1.

    Environmental, to capture all hits derived from environmental sources including the necrobiome (labelled environment_background, environment_pathogen, to distinguish potential pathogenic species from non-pathogenic ones)

  2. 2.

    Oral microbiome, including both commensal and pathogenic species (microbiome_oral)

  3. 3.

    Probably pathogenic infections, further distinguished into different modes of transmission (infection_anthroponotic; infection_vector_borne; infection_zoonotic).

We define zoonotic pathogens here as those transmitted from animals to humans or which made such a host jump in our sampling time frame70.

Time series

To infer temporal dynamics of ancient microbial species, we calculated detection rates in a sliding window of k = 21 temporally consecutive samples across the entire timeline of the 1,266 samples with dating information. For individual species, the detection rate for each window corresponds to the proportion of the 21 samples in each window that were positive for the species of interest. For analyses of species combined in classes, we calculated the detection rate as the ratio of the total number of hits within a class in the window over the total number of possible hits across all species in a window (21 samples × 258 species across all classes). For individual species with n ≥ 20 hits or combined species classes, we further performed Bayesian change-point detection and time series decomposition (BEAST)71 implemented in the R package Rbeast (https://github.com/zhaokg/Rbeast), using the detection frequencies described above as response variables. Data for previously published Y. pestis and B. recurrentis genomes were obtained from AncientMetagenomeDir (release v.23.12, https://doi.org/10.5281/zenodo.10580647)72.

Spatiotemporal models of species incidence

To identify possible drivers of the observed spatiotemporal ancient microbial incidence, we combined the individual microbial species and the combined species groups with palaeoclimatic variables, human mobility estimates and kriged estimates of ancestry composition for Holocene West Eurasia. Palaeoclimatic reconstructions were accessed using the CHELSA-Trace21k data, which provide global monthly climatologies for temperature and precipitation at 30-arcsec spatial resolution in 100-year time steps for the past 21,000 years (ref. 73). To pair the microbial species and groups with the palaeoclimatic reconstructions, we took the average climatic value across all the time steps that fall within the microbial species and/or group age ± standard deviation at each of the sampling locations. Palaeoclimatic variables considered were annual mean temperature (BIO01) and annual precipitation (BIO12). Human mobility values were accessed from ref. 74 and roughly represent the distance in kilometres between the burial location of the ancient human individual and its putative ancestral origin on the basis of patterns of genetic similarity derived from multidimensional scaling (MDS) analysis. Microbial species and/or groups were paired with the mobility estimate of the ancient human individual that occurs closest in space and time. Kriged ancestry estimates were extracted from ref. 75, using the spatiotemporal ancestry kriging method from Racimo et al.76, and paired to the closest spatiotemporal location of the ancient human remain where the corresponding microbial species and/or groups were sampled.

To determine the influence of the covariates on the microbial incidence, we used a hierarchical Bayesian model implemented in the inlabru R package77,78, in which ancient microbial presence or absence follows a binomial distribution and the spatiotemporal variables (latitude, longitude and sample age), number of human-classified reads, sample material, palaeoclimatic variables, human mobility and human ancestry constitute the linear predictors. The sample material is a categorical variable indicating whether the material used for sequencing was a tooth or not (bone), which inlabru treats as a random effect variable. We followed the default inlabru priors, in which distributions are distributed as a Gaussian variable with mean μ and precision τ. The prior on the precision τ is a Gamma with parameters 1 and 0.00005. The mean is a linear combination of the covariates. By default, the prior on the intercept of the linear combination is a uniform distribution, whereas the priors on the coefficients are Gaussian with zero mean and precision 0.001. All covariates were normalized before the analyses. For each microbial species and group, we tested many models with different sets of covariates: (1) palaeoclimate + mobility + ancestry, (2) palaeoclimate + mobility, (3) palaeoclimate + ancestry, (4) only climate, (5) mobility + ancestry, (6) only mobility, (7) only ancestry and (8) no climate, mobility nor ancestry. Spatiotemporal variables, number of human-classified reads and sample material were included in all models. Because covariates were normalized, results indicate deviations from the mean. The effect size is interpreted in units of standard deviation. We used a criterion (DIC score) that prioritizes both fit and simplicity (low number of effective parameters) for evaluating the performance of hierarchical Bayesian models. Parameter distributions of the full model tended not to differ by much to those of the best-performing models under this score, when those parameters were included in the model. Results for both the full and best-performing models (that is, models with the lowest DIC score for each microbial species or combined species group) are shown in Extended Data Fig. 11. DIC scores as well as Watanabe–Akaike information criterion for all models we tested can be found in the Supplementary Table 6.

Geographic maps

All maps were created using the R project for statistical computing79, using the sf package80. Shape files for coastlines, rivers and lakes were obtained from Natural Earth (https://www.naturalearthdata.com) using the rnaturealearth package81. Elevation data for Fig. 1 were obtained from AWS Open Data Terrain Tiles (https://registry.opendata.aws/terrain-tiles) using the elevatr package82.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.