Abstract
The Eastern Italian Alps played a crucial bridging role between Mediterranean and Northern alpine populations since Prehistory. However, few prehistoric individuals from that region have been genomically analysed so far. Among them, the Iceman (Copper Age, 3368-3108 BC) showed a relatively high Anatolian-Neolithic-related ancestry and low Hunter-Gatherers (HGs)-related ancestry. To investigate how the genomic structure of alpine groups varied over time and to contextualize the Iceman, we analysed 47 alpine individuals dated from the Mesolithic (6380-6107 BC) to Middle Bronze Age (1601-1295 BC). The Mesolithic genome reveals genetic admixture between Western and Eastern HGs that occurred from ~13700 − 8300 BC. Most individuals from the Neolithic onwards present a genomic structure resembling that of the Iceman, supporting genetic continuity. Few individuals carry different ancestries, such as the Steppe-related ones appearing ~2400 BC. Finally, the study suggests local and non-local admixture events between HGs and Neolithic farmers from this alpine area.
Similar content being viewed by others
Introduction
The Eastern Italian Alps, including the Trentino-Alto Adige region (EIAlp) considered in this study, are an important area for expanding our knowledge of Prehistory and the major demographic events that occurred in Europe and contributed to shaping the gene pool of today’s European populations1.
The first event is associated with the Hunter Gatherers (HGs), including two major groups: Western Hunter-Gatherers (WHGs) related to the Villabruna cluster2 and the Eastern Hunter-Gatherers (EHGs) from present-day Russia and Ukraine3. The second regards the migrations of Early Neolithic (EN) farmers from Anatolia (from ~7000 BC)1,4,5 who admixed with local HGs groups, to varying degrees depending on the area1,6. The last one is instead related to the expansion of Pontic-Caspian Steppe pastoralists (from ~3000 BC)7,8.
Indeed, due to the presence of wide valleys (e.g. Adige Valley) and mountain passes (e.g. Passo del Brennero/Brennerpass), the EIAlp played a crucial role as a bridge between populations from northern and southern alpine territories, favouring cultural exchanges and contacts. However, the mountain environment may also have influenced such exchanges and interactions among the alpine groups.
The early population history of Europe was heavily influenced by the Last Glacial Maximum (LGM, 30,000–16,500 years ago) when the Alps were covered by thick glaciers and continental Europe was accessible from Italy, mainly from the Balkan Peninsula, and bridges emerging from the Adriatic Sea9. When the large Pleistocene glaciers retreated (~18,000 years ago), the EIAlp was entered by HG groups of the final phase of the Upper Palaeolithic, who possibly arrived from a more southern area (e.g. Po Plain region)10,11. During the temperate climate phase (Late Glacial interstage; ~14,500 and 12,500 years ago) numerous sites (e.g., open-air sites, rock shelters, caves) referable to the Epigravettian culture, were attested (e.g., Arco via Serafini, Riparo Dalmeri, Riparo Villabruna)12,13 and with the beginning of the Holocene, temperate climatic conditions favoured the settlement of Early (~9000 BC, Sauveterrian) and Late Mesolithic (ME) (~6500 BC, Castelnovian) HGs groups on the valley floors (Trentino area) with subsequent expansion into higher altitudes14,15,16.
In the EIAlp, transition from a ME subsistence economy (based on hunting and gathering) to a new Neolithic (NE) productive one (based on agriculture and animal husbandry) is attested between 5100 and 4800 BC (Trentino area)17,18.
Even if in Europe EN farmers migrations followed complex and multifaced dynamics in the various geographical areas19, in general, two major routes have been described20,21,22,23. One along the Mediterranean coastline, associated with the Impressed Ware culture, and another through the Balkans, which led to the formation of the Linearbandkeramik (LBK) culture in central Europe and partly contributed to the formation of the first NE groups in northeastern Italy19. In the latter area, including the EIAlp and the Po Valley south of the Alps, both NE economic and cultural innovations took root, resulting in various Padano-Alpine cultural groups (e.g. Fiorano, Vhó, Isolino, Gaban)19,24. The potteries of this period also show influences from the Impressed Ware ceramic group of central Italy25,26. From ~4800 BC the cultural traits of the “Gaban group”, which was named after the archaeological site (Trentino area), began to be gradually replaced by the diffusion of the Square-Mouthed Pottery Culture (Vasi a Bocca Quadrata, VBQ), distributed mostly in northern and central (Toscana) Italy19. From the mid-4th mill. BC, within the Copper Age (CA; ~3500–2200 BC), significant social changes occurred in the EIAlp with the formation of cultural groups distinguished mainly by different funerary practices such as burials in simple earthen pits with bodies placed on the left side (like Remedello group)27 as well as supine inhumations (Spilamberto group)28 and human remains manipulation and cremation (Grotticelle group)29.
During the Late CA (~2700–2400 BC) and the transition phase from the CA to the Early Bronze Age (EBA, ~2400–2200 BC), the exploitation of copper ores began in the Trentino-Alto Adige region30,31, continuing in the EBA with the Polada culture32,33.
In the Adige valley, contacts between groups from both sides of the Alps (from Northern–Central Europe, transalpine Middle Danubian area and the Po River Valley) are clearly documented by the discovery, in this valley, of numerous objects (e.g. Baltic amber beads) and weapons referable to those areas33,34,35.
Despite the wealth of archaeological information on the different cultures present in the EIAlp during Prehistory, very little is known about the genomic diversity and structure of alpine prehistoric individuals from this and surrounding alpine areas. In fact, only a few prehistoric individuals had been genomically analysed before this study. These involved the Late Upper Palaeolithic individual of Villabruna (~14,000 BP; Veneto region) that gives the name to the Villabruna genetic cluster2, and three CA individuals, including the Tyrolean Iceman, later referred to only as Iceman36,37,38. Furthermore, palaeogenomic data on prehistoric specimens from present-day Italy are still limited and only concern specimens recovered south of the Alps7,39,40,41,42,43,44,45,46,47.
A recent study based on a high coverage genome of the Iceman (3368–3108 BC)37, found that this individual carried a high percentage (90% ± 2.5%) of ancestry associated with EN farmers from Anatolia and low WHGs-related ancestry. The EN farmer-related ancestry has been found to be the highest among contemporaneous European individuals. This may indicate rather isolated alpine groups with limited gene flow from HG-ancestry-related populations or smaller population size of HG groups in this region during the 5th and 4th millennia BC37. Additionally, the study reported further information on some phenotypic traits of the Iceman, in addition to those previously described36. However, although the study provides important initial insights into the genetic history of prehistoric EIAlp, it is based only on a single individual, and its findings and interpretation need to be supported by genomic data from more specimens from this alpine territory.
Studies carried out in north-eastern Italy on CA and Bronze Age (BA) individuals7,41,42, show genomic affinity of CA individuals to EN farmers, similarly to the Iceman. In addition, these studies suggest that the genetic component related to the herders from the Steppe (Steppe-related ancestry) arrived in northern Italy at least by ~2000 BC42. Furthermore, no genomic data are available on individuals from the ME or NE periods from northern Italy, apart from one early ME individual (north-western Italy) for whom only mitochondrial DNA and low-resolution data are available46.
We successfully generated new paleogenomic data (shotgun and capture) from 47 prehistoric/protohistoric (hereafter referred to as prehistoric) alpine individuals from 17 archaeological sites from the EIAlp. Additionally, we report 34 new radiocarbon (14C) dates that, besides those already available, have resulted in a dataset that includes individuals from the Late ME (6380–6107 BC) to the Middle Bronze Age (MBA, 1601–1295 BC) (Fig. 1 and Supplementary Data 1). The data were used to understand how the genomic structure of alpine prehistoric individuals varies over time. This allows us to investigate the genetic impact related to migration events during the ME and NE, and the extent of admixture between NE farmers and local HGs, as well as the possible impact and timing of the arrival of Steppe-related ancestry in this area. Furthermore, by extending the genomic analyses to a larger number of specimens contemporary to the Iceman, we aim to offer a more comprehensive picture of the genomic structure of prehistoric alpine individuals during CA. Finally, we intend to provide an initial insight into the possible social practices of past alpine groups and give information on some phenotypic traits that have also been investigated in the Iceman.
A Geographic map of Italy and the Alpine arc (left) was performed by using ESRI’s ArcGIS Pro software, and a zoom-in (right) of the Trentino Alto-Adige region in the EIAlp with the location of the archaeological sites from this study (Iceman37 and Ora/Auer38). The number of sampled individuals from each site is near the symbol, coloured according to chronology. The first three letters of the site, also used in the sample identification number, are given in brackets. B Mean radiocarbon dates and calibrated intervals (cal. 2-sigma, 95%) for each dated individual (more details in Supplementary Data 1 and Supplementary Information Fig. S16).
In this work, we reveal that the ME alpine individual shows a genetic admixture between Western and Eastern HGs that occurred between 13,700 and 8300 BC. Furthermore, we found a substantial genetic shift from ME to NE and genetic continuity from NE (from at least ~4600–4400 BC) onwards, with a low and constant contribution from HGs. In addition, we report the estimated time of admixture between NE Anatolian farmers and ME (~6100–5100 BC), supporting both local and non-local admixture in the ElAlp. Our study also finds a low genetic impact related to the migration of herders from the Steppe on alpine groups, although it suggests that Steppe-related ancestry appeared in EIAlp earlier than in other Italian regions, including northern Italy. Finally, we found that the Iceman shows the same ancestral genomic model and similarity in phenotypic traits as the other individuals sampled from the CA and most of the other alpine individuals analysed, confirming and extending previous results. However, the Iceman differs from all other alpine individuals analysed in its maternal and paternal lineages, suggesting a slightly different genetic history.
Results
Alpine dataset
Paired-end genomic libraries of DNA extracts from 52 samples (Pars petrosa or teeth) were tested for their content of endogenous human DNA (HR) and authenticity using shotgun sequencing (Supplementary Data 2; Supplementary Information Text S3). The samples with HR > 1% were further enriched for 1240 K SNPs across the human genome, as well as for the Y-Chromosome and mitochondrial DNA (mtDNA) (Supplementary Data 3). Individuals (n = 5) who did not fulfill the quality criteria were excluded from the study (Methods and Supplementary Information Text S3). The final successfully analysed alpine genomes (45 capture and 2 shotgun), with low contamination and typical ancient DNA (aDNA) damage patterns, had a mean coverage from 0.012× to 0.587× while the number of SNPs on the 1240 K panel48 ranged from 33,981 to 1,046,042 SNPs sites (Supplementary Data 4). Available data from the other three alpine CA individuals (Iceman and individuals from Ora37,38) were included reaching a dataset of 50 individuals from the EIAlp which comprises: one Late ME, eight Middle NE (MN), sixteen CA, sixteen CA/EBA and nine from the EBA to the MBA (Supplementary Data 1 and Supplementary Data 4). The dataset includes a similar number of females and males, including several subadults (26 XX and 24 XY; Supplementary Data 4). Among the sixteen related individuals (based on full consistency between at least two of the applied methods; Supplementary Data 5; Supplementary Information Text S4), only the sample of the related pair, which showed the highest number of SNPs, was kept for population genetic analysis, yielding a final dataset of 41 unrelated alpine individuals (Supplementary Data 4 and 5).
Admixture events between WHGs and EHGs in the ME genome from the EIAlp
The Late ME male alpine individual (MAD01; 6380–6107 BC) differs from all other most recent alpine individuals in the Principal Component (PCA) and clustering analyses (Fig. 2A, and Supplementary Information Fig. S22). It shows a strong genomic affinity to WHGs and the Upper Palaeolithic individual from Villabruna, which gave its name to the Villabruna genetic cluster comprising most of the HGs from western Eurasia analysed so far2. Furthermore, a multidimensional scaling plot (MDS) on pairwise outgroup f3-statistics performed using the dataset of Posth and colleagues44, confirms that MAD01 falls together with WHG groups (Villabruna and Oberkassel clusters). Additionally, MAD01 shifts in the plot towards EHGs individuals from present-day Russia (Sidelkino) (Fig. 2B and Supplementary Data 11 and 12).
A PCA of ancient individuals from this study projected onto the variation of present-day individuals from Western Eurasia (refer to Supplementary Data 10). B MDS plot of Eurasian HGs individuals based on 1 − f3 (Mbuti, pop1, pop2) statistic. Comparative data from ref. 35. ANE Ancient North Eurasian (refer to Supplementary Data 11).
Our qpAdm analysis confirms that MAD01 can be best modelled (p-value = 0.145) as the admixture between the two sources of Villabruna (83.6 ± 3.7%) and EHGs-Sidelkino (16.4 ± 3.7%) rather than Oberkassel and EHGs although this model is also possible (p-value = 0.0335) (Supplementary Data 16). We then ran qpAdm using Villabruna and Fournol as sources (the latter ancestry is absent in Villabruna but not in Oberkassel), however the model was rejected (p-value = 0.00118) (Supplementary Data 16). The time of the admixture event in MAD01 between a group of WHGs and EHGs estimated by DATES ranged between 13735 BC and 8326 BC (Supplementary Information Text S7 and Fig. S26, Supplementary Data 20).
The analyses of the unilinear markers in this individual also showed that MAD01 carries the Y-Chromosomal I *(I2a1b1a2b) and the mtDNA U5* (U5b2b3) haplogroups, typically found in individuals with WHGs-related ancestry2,44.
High early anatolian farmers-related ancestry with few changes in the alpine genomes over time
The PCA plot (Fig. 2A, Supplementary Information Fig. S19A and Supplementary Data 10), shows that the majority of alpine individuals from MN to MBA have genomic affinity with each other and mostly with present-day Sardinians. In line with these results, pairwise-qpWave analysis revealed that most of our sampled individuals (excluding seven: ROM303, ROM307, ROM402, NOG302, LAS01, VOL01, and SIU01), share similar ancestries and can be grouped together (Fig. 3). Furthermore, in the PCA graph, most alpine genomes shift toward those from the EN farmers of Anatolia, with whom they share a similar genetic pattern. Specifically, the alpine individuals are in an intermediate position (zoom Fig. 2A and Supplementary Information Fig. S20A) between the NE individuals of Central and Balkan Europe and those of Spain, and the f3 and f4-statistics did not reveal specific affinities with any of these groups (Supplementary Information Text S6, Figs. S27 and S28).
When we formally tested the admixture models by qpAdm analyses (Fig. 4 and Supplementary Data 17), we found that most alpine individuals from MN to MBA (excluding the seven ones that deviate the most from the general pattern, as detailed in the subsequent paragraph), can be best modelled by a two-way model of admixture with high EN-Anatolian farmers and low HGs-related ancestry. Indeed, the EN-related ancestry ranged from 90.9 ± 2% (COR02, dated to CA/EBA, 2276–2041 BC) to 80.3 ± 2.2% (ROM401, relative dating corresponds to CA/EBA, no 14C data available) while local HG-related ancestry (MAD01) varied in these two individuals from 9.1 ± 2% to 19.7 ± 2.2%, respectively (Fig. 4 and Supplementary Data 17).
Each colour represents one of the seven sources used in the models. Number of individuals = n. Villabruna (n = 1), MAD01 (Madonna Bianca, n = 1), EHGs (Eastern Hunter Gatherers, n = 18), CHG (Caucasus Hunter Gatherers, n = 2), AnatoliaN (Anatolian Neolithic farmers, n = 22), IranN (Iranian Neolithic farmers, n = 4) and Steppe (Yamnaya, n = 10). The standard error for each source proportion is indicated by the error bars. No significant model could be found for one individual (ROM303). Haplogroups of the mitochondrial DNA (mtDNA) in all individuals and Y-Chromosome haplogroups in males are indicated on the right-hand side of the graph.
As a further step, in order to test whether the HGs-related component found in alpine MN individuals could be related to different groups of HGs rather than MAD01, an f4-statistical analysis on the form f4 (HG_test, MAD01; MN, Mbuti) was performed with different sources of proxy HGs (Supplementary Data 15). Most f4-statistics were negative (except for Vela_Spila from Croatia and Villabruna), but no test was significant (z-score between −2.8 and +1.069), indicating that no other HGs used for the analysis fit better than MAD01 as a source of HGs ancestry in alpine MN. Notably, the two tests with the strongest affinities with MAD01 (Zscore < -2.7) were the comparisons with ME from Iron Gates, practically excluding the Balkans as a region where NE individuals could have obtained substantial amounts of their HG ancestry before reaching the EIAlp (Supplementary Data 15).
The CA Iceman is after COR02 (CA/EBA from Grotte di Castelcorno, Trentino, Supplementary Information Text S1), the second alpine individual carrying the highest (90.5 ± 2%) and lowest (9.5 ± 2%) proportions of EN Anatolian and HGs-related ancestry, respectively. These values are very similar to those that were found in the previous study by Wang and collaborators37 (89.5 ± 2.5% and 10.5 ± 2.5%).
We additionally found that the proportions of HG-related ancestry in the alpine individuals from the different chronologies differ only slightly (Fig. 4 and Supplementary Data 17). Comparable percentages of HGs-related ancestry (MAD01) were also observed when qpAdm analyses were performed, grouping the alpine individuals based on their chronologies and taking into account differences in their ancestral components (Fig. 4, Supplementary Information Fig. S24 and Supplementary Data 19). A similar pattern with few changes of the HG-related component over time can be observed in Sardinia and Sicily, as suggested by our qpAdm analysis on other available prehistoric individuals from Italy (Supplementary Data 19). Focusing on the CA period in the alpine region and northern Italy (samples from Remedello and Broion), our qpAdm analysis additionally shows similar (low) proportions of HG-related ancestry (MAD01) in these regions (Supplementary Data 19 and Supplementary Information Fig. S25).
The time of the admixture events between WHG and Anatolian farmers in EIAlp groups or individuals was estimated with DATES (Supplementary Data 20), and the ancestral models with the two source populations used for the analysis were verified with qpAdm analysis (Supplementary Data 19 and Supplementary Information Text S7). The admixture time estimated in the alpine MNs ranged from 6073 to 5117 BC (Supplementary Information Fig. S26).
Different ancestries occasionally detected in the EIAlp from the CA-EBA, and a low genetic impact related to Steppe herders
Most of the alpine sampled individuals, across the various chronologies, display a similar genomic structure and ancestry (Figs. 3 and 4). However, seven of them deviate from this general pattern (Supplementary Information Fig. S23). The three CA-EBA (LAS01, VOL01, and ROM402), which shift in the PCA plot in the direction of genomic diversity of BA individuals with high Steppe-related ancestry (Yamnaya), and one MBA individual (SIU01), which behaves as an outlier in the plot (Fig. 2A). The latter cluster with BA individuals from central Europe and present-day western European populations (Supplementary Information Fig. S19A, Fig. S20C and Fig. S20D).
Overall, f4-statistics do not show clear genetic similarities between alpine CA-EBAs and specific groups (Supplementary Data 14 and 15; Supplementary Information Text S6 and Fig. S27). Using qpAdm, three-way best-fitting model (with HG-MAD01, EN from Anatolia and herders from the Steppe as sources) for the individual LAS01 (2402–2149 BC), VOL01 (relative dating corresponds to CA/EBA, no 14C data available) and for SIU01 (1601–1295 BC) was found. Instead, a four-way admixture model, with the addition of one component from Caucasian HGs (CHG; 11.1%), was found in one individual (ROM402) (2272–2041 BC) (Fig. 4 and Supplementary Information Fig. S24 and Supplementary Data 17 and 18). This individual had possibly a higher level of CHG-related ancestry than that included in the Steppe-related ancestry present in the genome of the population used as a source in the model (Steppe-herders).
The percentage of the Steppe-related ancestry in the EIAlp ranged from 19.7 ± 7.1% (ROM402) to a maximum of 36.1 ± 2.6% in SIU01, which explains the position of this outlier in the PCA plot (Fig. 2A). The Steppe-related component value found in SIU01 is higher than those estimated in BA individuals from Broion in northern Italy (29.5 ± 3.8) with a comparable age (1615–1431 BC) (Supplementary Data 19). Based on our qpAdm analyses, which include the ancient data available from Italy to date, this component was first identified in our CA-EBA individual LAS01 dated 2402–2149 BC, while the oldest individual (Bell-Beaker culture) carrying the Steppe-related component in northern Italy is dated to 2200–1930 BC41,49 (Supplementary Information Fig. S25, and Supplementary Data 19). The admixture time between local CA and Steppe-pastoralist-related ancestry (Yamnaya) in LAS01 estimated by DATES ranged between 2912 and 2219 BC (Supplementary Information Text S7 and Fig. S26, Supplementary Data 20).
The other three alpine individuals that differ from the general pattern include NOG302 (CA-EBA, 2202–2037 BC), ROM307 (CA-EBA, 2200–2035 BC), and ROM305 (EBA, 1878–1688 BC). Unlike the other four individuals, they show the most parsimonious and best fitting model (p-value between 0.066 and 0.224) when including the Iranian Neolithic (IN) as a source in addition to EN-Anatolia and HGs (Fig. 4 and Supplementary Data 17) with the IN-related ancestry ranged from 19.2 ± 4.9% (NOG302) to 21.5 ± 4.5% (ROM305). In fact, when running qpAdm on these three individuals with a model that includes the Steppe-related ancestry as a source instead of the IN-related ancestry, the three-way model either does not fit (p-value < 0.01) (NOG302 and ROM305) or fits with a lower p-value (0.012, ROM307). Additionally, based on our comparative qpAdm analysis (Supplementary Data 19), the IN-related component was not detected in any other prehistoric (CA and BA) individuals from northern Italy. On the other hand, it has been found in NE individuals from southern Italy who show this ancestry in addition to those related to HGs and EN farmers. Moreover, central Italian NE and CA individuals best fit with a two-way model (HG and EN), but also a three-way model is possible, which includes the IN-related ancestry in agreement with the analyses performed by40 (Supplementary Data 19).
Unilinear markers, kinship and phenotypic traits
In agreement with autosomal data, Y-chromosome haplogroup analysis shows very few changes in the EIAlp from MN to MBA, with the males analysed carrying the same haplogroup G2a2* associated with the early NE settlement of Europe48. Only two individuals, besides the ME one, had different haplogroups (ISE01, J2a2a and SIU01, R1b-P312) (Fig. 4, Supplementary Information Text S4 and Supplementary Data 3). Moreover, G2a2* is represented by only one sub-lineage (G2a2b2a1a1b*, L497) except for the Iceman, which carries a different one (G2a2a1a2a1a1b, G-Z6208, or formerly G2a2a1a2 L-91)36. Comparative analyses (AADR v54.1 dataset)50 show that the G2a-L497 has been found especially in EN (Linearbandkeramik, LBK) individuals from Germany and MN individuals from France22,51, while the Iceman’s lineage was more widespread. Indeed, it has been found in EN from Germany (LBK) and Spain, MN from France and Croatia, and also in more recent MBA individuals from Croatia22,52. CA individuals from North Italy (Broion) carried a different lineage of G2a* (G2a2b2b-F705 or former G2a3-F705)42, while the Y-Chromosomal haplogroup in the other CA northern Italian males (Remedello) is not available for comparison (low coverage data)7.
Among the alpine MN individuals, ISE01 (4445–4261 BC) is the only one with a different Y-Chromosomal haplogroup (J2a2a*). This haplogroup was not found in the comparative dataset and appears to be rare in ancient samples and mostly restricted to the eastern Mediterranean, Caucasus and Asia, where it may have originated by reaching western Europe during the NE48,53,54. ISE01 is the most recent among the MN analysed and belongs to an advanced phase of the VBQ culture (VBQ3) (Supplementary Information Text S1).
Instead, the Y-Chromosomal haplogroup R1b* of SIU01 is one of the principal haplogroup in the ancient Eurasian individuals but was found only in this alpine male (see Discussion).
Unlike alpine paternal lineages, maternal ones are composed of different macro mtDNA haplogroups (H, J, K, U, V, X), which are typically found in other prehistoric individuals from Europe (Fig. 4 and Supplementary Data 3). However, some alpine sub-lineages (e.g. H3k, H5d, J1c12b, V15, J1c + 1626 + 189) are very rare elsewhere or have not yet been observed according to the comparative dataset (AADR v54.1; samples with mtDNA “coverage >2× or published” and dated from ~5500 to 1200 BC)50. Our comparative analysis also shows that the mtDNA haplogroup of ISE01 (T1a) was described in some NE samples from south-eastern and central Europe and in more recent samples from the Middle East (Israel, Armenia)55,56,57 whereas that of SIU01 (X2b + 226) was found mainly in ancient individuals from central, northern and south-western Europe (MN to MBA)41,58,59,60. Finally, the K1f mtDNA haplogroup of the Iceman, never identified so far in modern and ancient individuals61,62, was also not found in any of the other alpine individuals analysed in this study.
Kinship analyses clearly detected close relationships (parent-child or siblings) in at least four cases, and in other two cases, a 2nd degree relationship. Kinships always involved individuals from the same archaeological site and were buried in multiple or single graves. For instance, using information from unilinear transmitted markers, one possible mother-child (CA; MEZ01/MEZ02) and father-child (CA; ORA01/ORA01)38 and two possible brother-sister pairs (NOG201/NOG202 from CA and VEL701/VEL705 from MN) were identified (Supplementary Information Text S4 and Fig. S17, Supplementary Data 5).
Furthermore, our ROH analysis did not reveal a high level of inbreeding resulting from union between first-degree cousins (sROH>20 over 50 cM) according to other studies63,64. Instead, it identified inbreeding stemming from a union between second-degree cousins in four individuals, including three males (COR01, NOG301, ROM308) and one adult female (SOL01). Additionally, ROH analysis also indicates a higher sum of runs of homozygosity (ROH > 4 cM = 87 cM) for our ME (MAD01) compared to other alpine sampled individuals, suggesting a smaller population size during ME in the EIAlp (Supplementary Information Text S5 and Fig. S18, Supplementary Data 9), as also described in other contexts63,65. Additionally, we found an increase in the effective population size (Ne) from MN (mean sROH[4-8] = 13.46 cM) to CA (mean sROH[4-8] = 4.38) cM followed by a slight decrease in Ne starting from the CA-EBA transition phase, with a similar trend for the following EBA and MBA periods (mean sROH[4-8] between 7.87 and 9.1 cM).
Regarding phenotypic traits analyses, full prediction (based on 41 SNPs) for hair, eye and skin colour was possible only in six individuals. These likely had brown eyes associated with dark brown to black hair colour (similarly to the Iceman) and skin from pale to intermediate skin colour (Supplementary Information Text S9, Supplementary Data 21). However, inconsistent results on Iceman’s skin colour compared to the previous study by37 based on high genome coverage, raise the doubt that the genome coverage in our study does not have sufficient power to correctly predict this complex trait (Supplementary Information Text S9).
In addition, some phenotypic traits that have been associated with a single or few SNPs and which have also been investigated in the Iceman were analysed in our study. For instance, the derived allele of the SNP rs4988235 in the MCM6 gene (MCM6/rs4988235) associated with lactose tolerance was absent in all prehistoric alpine individuals analysed. Moreover, the derived allele of the SNP PLRP2/rs475199566,67, possibly associated with an agricultural diet adaptation, was not detected in the alpine ME individual but was found from the MN onwards, with an increasing occurrence of 62.5% in the MN group and 75% in the CA group. In the later period, a decreasing occurrence was observed (CA-EBA; 50%), whereas the EBA groups only show the derived allele at this position. The analysis of a tag SNP (rs1495741)68 of the gene NAT2 whose “slow acetylator” variant has been hypothesised to be advantageous in agriculturalist populations69,70,71 found the derived allele at this position in all alpine groups and in the Late ME individual (Supplementary Information Text S9, Supplementary Data 22).
Discussion
We extended the genomic analyses to 47 new prehistoric individuals from the Iceman’s territory covering a time span of ~5000 years (from ME to the MBA), providing a more detailed picture of the genomic structure of prehistoric individuals from this alpine area over time. This allowed us to investigate the extent of major demographic events in the EIAlp that occurred in Western Eurasia during Prehistory.
The tooth sample of the ME individual (MAD01) was recovered from a rock shelter in the Adige Valley, and our radiocarbon dating (6380-6107 BC) places it in the last phase of the ME, characterised by the Castelnovian industry (Supplementary Information Text S1). In general, little is known about the last phase of the ME in Europe and the transition from this period to the NE65. In fact, MAD01 is one of the rare Late ME individuals from Italy and southern Europe genomically analysed to date, in addition to a few other individuals from Sicily44,47, making the data from our alpine ME individual particularly valuable. Moreover, the availability of genomic data from this local ME made it possible to use it in our genetic admixture models without resorting to samples outside of the territory. Genomically, MAD01 resembles typical WHGs (Villabruna) with a contribution from EHGs (16.4 ± 3.7%). The absence of Fournol-related ancestry in the alpine ME genome, which has been found in Upper Palaeolithic individuals from western and southwestern Europe44, suggests that alpine ME has a genetic link to the East and could be a representative of migrations from the South-East.
Regarding the EHGs-related ancestry found in MAD01, our finding agrees with a previous study, which has shown that ME individuals from Europe dated <7500 years ago often carried EHGs-related ancestry44. In addition, we found a rather early admixture time between WHGs and EHGs in the alpine ME (~13,700–8300 BC), one of the earliest in Europe according to Chintalapati et al.72, adding a piece to the timing of these admixture events in southern Europe.
From an archaeological point of view, transalpine cultural contacts during the Late ME in the EIAlp73 are well documented in northern Italy (Mondeval de Sora)16, central Switzerland, the Dinaric Alps and the Balkans74.
A substantial shift in ancestry related to migrations of EN farmers from Anatolia can be clearly detected in the genomes of our alpine individuals from MN (starting from at least ~4600–4400 BC, VEL102). These genomes carry on average ~87% of ancestry related to EN and ~13% of local HGs-related ancestry. It is not feasible to verify the presence of these ancestries in early farmers, as no human remains from EN have ever been recovered in this alpine area so far. With our study, we provide the first genomic data on NE individuals from northern Italy.
In our study, we also discovered that the HGs-related ancestry proportion remains constant in the EIAlp from the MN onwards, similar to other contexts in Italy (e.g. Sicily and Sardinia)39,45, but different from central Italy40.
From an archaeological perspective, it is still unclear how the transition from a hunting and gathering economy to a productive one took place in the EIAlp. Analysis of the material culture suggests a gradual process of acculturation of the ME groups17,24, with one example consisting of the finding of a small female deer antler figure (Venus of Gaban) with clear elements of both ME and NE traditions75,76, while the presence of a gap so far observed in the Gaban site between ME and NE underlines the discontinuity77,78. However, radiocarbon dates from other archaeological contexts in the EIAlp suggest continuity in the presence of ME groups at least until 5200–5100 BC79.
Overall, our study suggests (f4-statistics, qpAdm) a genetic contribution of local ME in alpine MN farmers, while there may have been some contribution from other HG groups before reaching the EIALp, but not from the Balkan area. Moreover, since according to the archaeological data the EN farmers reached EIAlp by ~5100 BC, our estimated time of admixture between ME and NE groups (~from 6100 to 5100 BC) could also support both local and non-local admixture in this alpine region.
The alpine genomes from the following CA phase (~3350–2300 BC) resemble those from the previous period. We could confirm that the Iceman has high (>90%) and low proportions (<10%) of ancestry related to EN and HGs, respectively. However, we also revealed that this ancestral model is not peculiar to this individual but is found in all the other twelve (not related) CA alpine individuals analysed in this study. Comparative analyses found similarly low values of HGs-related ancestry in northern Italy (Broion and Remedello) during CA, supporting previous findings37. Notably, in our study, we additionally uncovered that the same ancestral model persisted in most of the individuals from the following CA-EBA and EBA periods, including one of the two most recent alpine individuals of our dataset from the MBA (STE05; 1663–1511 BC). These results are similar to those found in ancient Sardinia, where, albeit with a more extreme pattern, a genetic continuity has been observed from the MN onwards45.
In the alpine territory, this finding is clearly visible at the Y-Chromosome level, since >90% of alpine prehistoric males of all chronologies (excluding the ME) carried the same G2a* paternal haplogroup, unlike those from other European regions. For instance, in southeastern Europe80 Y-Chromosomal haplogroups varied substantially within the same chronological phase and among chronologies. In this region, during the CA, the frequency of haplogroup G2a* was approximately 25% and several other lineages were present, while it has not been found in the most recent EBA individuals from the same areas. In central Europe (Czech Republic), the haplogroup G2a* was present in individuals from the NE together with other lineages and disappeared in individuals from EBA and BA, which carried most of the haplogroups R1b*, R1a* and I2*81. Moreover, in northern Italy, Y-Chromosomal lineage distribution varied more42 and in the Po Valley area, an individual dated to 2194–1939 BC, roughly contemporary with the alpine EBA males with the G2a* lineage, bore the R1b* haplogroup41.
G2a* was the dominant Y-Chromosomal haplogroup of the first NE farmers who migrated from Anatolia to Europe82,83. For instance, this haplogroup has been found in Early NE from Anatolia (Boncuklu, 8300-7800 BCE), in NE (Starčevo–Kőrös–Criș culture, 6000-4500 BCE) and MN individuals in Hungary and Croatia (Sopot culture, 5000 BCE) as well as in individuals associated with the LBK culture (5500–4500 BCE) in Germany22,52,55,56,84. From a paternal perspective, this haplogroup points to a genetic affinity between alpine MN and NE individuals from the Balkans and more northern areas.
Within the Haplogroup G* (with a frequency of <10% in Europe), the G2a-L497 lineage found in the EIAlp is the most frequent in present-day populations. It has been proposed that it originated in central Europe and reached Italy most likely by migratory flows from the north85. High frequencies of this lineage have been found in more isolated populations from the north-western and central Italy86 and in present-day individuals from more isolated alpine valleys in North Tyrol (Austria, G2a-L497 above 40%), closer to the alpine area under investigation87. This previous study also suggests an old settlement of this lineage in the Tyrolean Alps and additionally proposes that this territory may have played a significant role as a source or at least as a transit route for G2a-L497 lineage in Europe87. These findings can be supported by our aDNA study, which attests to a long-term presence of this paternal lineage in the alpine territory starting at least from the MN (~4600–4400 BC). Intriguingly, the Iceman carried a different Y-Chromosome lineage of G2a* (G2a2a1a2a1a1b, G-Z6208*) that is very rare in present-day populations of Europe (<1%), including those from North Tyrol (0.5%)87, apart from some areas such as Corsica and Sardinia36.
Starting from the transition phase from CA to EBA, in the EIAlp we detected a signal of lower gene flow from groups with different ancestry than those associated with ENs and HGs. Indeed, the Steppe-related ancestry was detected in a young female (2402–2149 BC, LAS01), associated with the formative aspect of the Polada culture and in another two female individuals from the same period and cultural horizon, which retain on average ~22% of the Steppe-related ancestry (Supplementary Information Text S1). Notably, based on currently available genetic data and radiocarbon dating from other ancient Italian individuals, we suggest that the Steppe-related component emerged in Italy in the EIAlp (at least from ~2400 BC), later in northern Italy and Sicily (~2200 BC), and then in central Italy (~1600 BCE). Nevertheless, analyses of more prehistoric Italian samples are still necessary to reconstruct the timing of this complex event.
The highest percentage of the Steppe-related ancestry (36.1 ± 2.6%) was found in the only male in our dataset, showing this ancestral component in his genome (SIU01; 1601–1295 BC). This individual additionally carried the Y-Chromosomal haplogroup R1b* (R1b-P312), also described in northern Italy (Broion)42 and that has been related with migrations of Bell Beaker males from central and western Europe41 and brought a rare mtDNA lineage detected in the same European areas58,59. The cranium of SIU01 has been recovered in the more northern alpine area of the EIAlp (Isarco/Eisack Valley) and belonged to an individual whose cultural attribution is not yet fully defined88 (Supplementary Information Text S1). Overall, our results on SIU01 suggest a possible different origin of this individual and a genetic connection with transalpine prehistoric groups from central and western Europe. On the other hand, the rare occurrence of the Y-Chromosomal haplogroup R1b* in the EIAlp compared to other European areas indicates very low genetic impact from males carrying the Steppe-related ancestry in the alpine groups, even if more BA and MBA samples from this alpine territory are necessary to confirm this result.
During the same CA-EBA period, but later than the appearance of the Steppe-related ancestry and independent of it, another ancestry related to IN farmers (average of ~20%), reached the EIAlp, starting at least from ~2200–2000 BC (NOG302). It has been shown that IN-farmers related ancestry spread from the Mediterranean (Aegean) to western and northwestern Europe with an increase starting from the Iron Age39,40. Based on the distribution of the IN-farmers related component in the EIAlp as well as in northern, central, and southern Italy, we suggest that this component reached the alpine region from the Italian areas south of the Alps and/or from the Aegean region39. Finally, around the same time, another component related to CHGs, but not nested in the Steppe-related ancestry, is detected in only one female individual (ROM402; 2272–2041 BC), which may have ancestors from the Eastern Mediterranean89.
In summary, our study, while highlighting occasional gene flow from groups of different ancestry and origin, mainly reveals limited genetic exchanges in the EIAlp from the MN to the MBA. This could indicate a relative isolation of the prehistoric alpine groups, similar to other more isolated areas in Italy (e.g. Sardinia), compared to those from other European regions (e.g. south-eastern or central Europe).
This picture seems to be broadly consistent with the overall cultural development pattern in the EIAlp. In fact, over the long period considered in this study, in this alpine area, there appears to have been a cultural evolution that evolved without interruptions or sudden changes, despite evident cultural contacts with even distant geographical areas. Indeed, the most noticeable characteristics of cultural changes seem to be more linked to the acquisition of new knowledge and technological skills (such as exploiting sources of copper procurement and making metal objects). Moreover, from an archaeological perspective, there is currently no clear evidence of cultural contacts with groups that have been related to the diffusion of the Steppe-related ancestry. It could be hypothesised that Bell Beaker groups may have played some role in the spread of this genomic component, but the cultural Bell Beaker phenomenon appears to be extremely rare in EIAlp90. The diffusion of this component could possibly also be linked to the spread of the statue steles, diffused throughout Europe and also found in the EIAlp91,92. However, so far, clear funerary attestations related to such cultural manifestations are mostly lacking in this territory.
This study additionally found marked differences between the two uniparentally transmitted markers, with the Y-Chromosome showing very low diversity compared to mtDNA. Such differences have been observed globally as the result of a post-Neolithic Y-Chromosome bottleneck that may have occurred in Europe around 3000–5000 BP93. This pattern could be explained by the practice of some cultural rules in past communities (e.g. patrilinearity and patrilocality). In the EIAlp, the observed genetic pattern could be explained by patrilocality, which is when females moved to the male’s birthplace where they had offspring, as has been observed in other prehistoric contexts in Europe59,94. In fact, the Y-Chromosomal lineage in the EIAlp suggests long-term and stable settlement for males. However, it cannot be excluded based on the available data that the males analysed came from another settlement in this alpine region. Additionally, most of the males resulted as not being closely genetically related and do not show strong signals of inbreeding, indicating a complex scenario in the EIAlp.
Regarding the phenotypic analysis, it suggests that, like the Iceman, all prehistoric alpine individuals were unable to digest milk after early childhood, a prevalent condition in ancient individuals and in modern populations, including farmers and pastoralist groups95. Moreover, based on the analyses of SNP rs1495741 in the gene NAT2, we suggest that all the alpine prehistoric individuals analysed probably had high concentrations of fatty acids of plant origin, associated with a predominantly plant-based diet, as also observed in the Iceman37. Although the presence of this variant in our ME individual might seem unexpected, our results agree with what has been observed in a previous study that showed no changes in the allele frequency of the derived allele of SNP rs1495741 over the past 10,000 years (HGs, EN and group with Steppe-related ancestry)67. Moreover, in general, the results in the alpine groups for the SNP rs471995 of the PRLP2 gene might suggest, in the EIAlp, from MN onwards, an adaptation to a diet based on cereals typical of farming communities. In fact, the variant of this SNP has been found to be relatively common in present-day populations with a cereal-based diet and more frequent in EN farmers than HGs, although no evidence of selection has been found66.
To further investigate social practices with possible extended kinships, phenotypic traits and additional analyses such as metagenomics in past alpine groups, deep shotgun sequencing of the sampled individuals can be performed with the available biological material.
Finally, our study also uncovers that, although the Iceman presents the same ancestral genomic model and a similarity in phenotypic traits to the other CA sampled individuals (and with most of the other alpine prehistoric individuals analysed), he differs from them in his maternal and paternal lineages. These results suggest a slightly different genetic history for the Iceman compared to the other CA alpine individuals. Although stable isotope analyses of Iceman’s teeth and bones allowed his origin to be pinpointed to a few valleys close to the discovery site (Schnals/Senales valleys)96 we have no information on the cultural group to which the Iceman may have belonged, as he was recovered without any distinctive material elements (e.g. pottery). Our findings leave open some questions about the genetic origin and cultural affiliation of this enigmatic individual.
Methods
Authorisations by local authorities
Authorisations with prescriptions were obtained for the study, sampling, and temporary transfer of the human osteological finds owned by the Autonomous Province of Trento (Prot. n. 156 15.03.2021) and the Autonomous Province of Bolzano/Bozen (Prot. Nr. 636137-25.09.2019), as well as for genetic analyses, in the context of the project ‘Genomic diversity of prehistoric individuals from the Iceman’s territory in the Eastern Italian Alps’ (see also Supplementary Information Text S1, Data availability section and the Reporting Summary). The human remains were collected from various Institutions scattered throughout the territory (after further authorisations by the specific Institution), which store them in their repositories. These include: MUSE of Trento, Castello del Buonconsiglio of Trento, Museo di Scienze e Archeologia- Fondazione Museo Civico of Rovereto, Ufficio beni archeologici of the Soprintendenza per i Beni Archeologici della Provincia Autonoma di Trento and the Ufficio beni archeologici of the Soprintendenza Archeologica della Provincia Autonoma di Bolzano/Bozen.
Molecular analyses and radiocarbon dating (14C)
Pars petrosa (PP; N = 43), which were isolated or still articulated with the skull, and tooth samples (N = 9) were selected for genetic investigation in the laboratories of the Institute for Mummy Studies of Eurac Research in Bolzano (Italy) (Supplementary Data 2). After detailed photographic documentation (before and after sampling), approximately 200 mg of powdered bone and dental root were collected. For the PP, a thin-tipped drill was used to create a small hole in the cortical bone that forms the structure of the cochlea (part of the osseous labyrinth of the inner ear)97. This approach minimised the invasiveness of the sampling, avoiding the cutting and the destruction of the bone and making it accessible for further (e.g. macroscopic) studies. Regarding the tooth sampling, the root was separated from the crown using a Dremel and then milled. Using the root, the pulp chamber was also included, which preserves microbial DNA, making DNA samples available for further investigations.
The sampling was conducted in a dedicated pre-PCR area of the aDNA laboratory following all stringent rules required for aDNA analyses. DNA samples were then extracted using a silica-based method98 and double-stranded genomic libraries were constructed99,100. These were then sent to an external company (Macrogen) for shotgun sequencing (HiSeq-X system, Illumina). The specifications of the reagents used, company names and catalogue numbers can be found in Supplementary Data 23.
The samples (with more than 1% of HR) were then enriched for ~1.24 million SNPs across the human genome48 as well as 49 K additional sites on the Y-Chromosome and the complete mitogenome using the in-solution target capture kit myBaits ® Expert Human Affinities—Prime Plus (Arbour Bioscience) (Supplementary Information Text S8).
Radiocarbon dating (14C) was carried out on 36 samples, and only two did not have sufficient preserved collagen for the analysis (Supplementary Data 1). The remaining samples were calibrated (sigma 2) with a 95% probability, except for one (ROM308), whose calibration accuracy did not exceed 68% (Supplementary Information Test S2 and Fig. S16).
The raw genetic data generated in this study (shotgun and capture) are available at the European Nucleotide Archive (ENA: https://www.ebi.ac.uk/ena/browser/home) with the accession number PRJEB70242.
Sequence read processing, alignment and damage patterns
The raw sequence data were processed by first trimming and merging reads using PEAR (v.0.9.10)101 with a minimum length of the assembled sequences and a minimum number of overlapping bases of 25 bp. Merged reads were then mapped to the Genome Reference Consortium Human Build 37 (hg19) and to the mtDNA reference genome (rCRS)102 using BWA (v.0.7.17)103. We then converted data into BAM files using SAMtools (v.1.16.1)104 with minimum mapping quality set to 30. Duplicates were removed with Dedup105, and mapDamage2 (v.2.2.1)106 was used to authenticate ancient DNA by assessing the damage patterns in all our samples. To evaluate the number of SNPs sites hitting the 1240 K panel48, vcf files were created by using BCFtools (v.1.16)107.
Contamination estimates
Contamination levels based on mtDNA data were estimated in all individuals using Schmutzi (v.1.5.6)108. For the male individuals only, estimates were based on X-Chromosome data using the method implemented in ANGSD (Analysis of Next Generation Sequencing Data; v.0.941)109. When the contamination level was above 5%, the PMDTools method (threshold set to 3) was applied to filter the sequences identified as genuinely ancient (v.0.60)110.
Genotyping
All bam files were genotyped by using pileupCaller (https://github.com/stschiff/sequenceTools) and pseudohaploidized by randomly calling an allele for each position of the 1240 K panel (1,233,013 SNPs)48. We removed indels and the transition sites to avoid taking possible false polymorphisms due to deamination into account in the analyses.
Molecular sex determination
The molecular sex of all individuals was assigned using the method described in ref. 111, which is based on the number of reads mapping to the non-recombining portion of the Y-Chromosome and X-Chromosome compared to the autosomal reads, and the method described in ref. 112 that analyzes only X-Chromosome linked markers.
Mitochondrial DNA and Y-Chromosome haplogroups assignment
Consensus mtDNA sequences were obtained for each individual using Schmutzi (v.1.5.6)108 with a minimum mapping quality of 30. The obtained fasta files were used for the mtDNA haplogroups assignment using HaploGrep 2 (v.2.4.0)113. Instead, Y-Chromosomal haplogroups were assigned to male individuals using Yleaf statistical software version 2.2114 on the bam files, with minimal base-quality of 30 (-q 30) and base-majority to determine an allele set to 90% (-b 90).
Run of homozygosity (ROH)
ROH values were inferred by using the software hapROH (v0.63)63. The default parameters and pseudo-haploid data as input were used. ROH was determined using the default genetic map of the software and 5008 haplotypes from the 1000 Genome as a reference panel115 and SNPs were called for each chromosome and individual (Supplementary Information Text S5).
Kinship analyses
Three different methods were applied (READ, TKGWV2 and KIN) to infer possible genetic relatedness between the alpine individuals analysed (Supplementary Information Text S4). For READ (Relationship Estimation from Ancient DNA (v.1)116, we use pseudohaploid genomes with only transversions for all our samples, and the normalisation has been done based on the median of all average P0s. For TKGWV2 method (Two-Sample Kinship and Genetic Relatedness with Haploid and Diploid Genomes (v. 2) we followed the pipeline of Fernandes and collabotarors117 (https://github.com/danimfernades/tkgwv2) to generate transposed PLINK text files for all pair of individuals and we used the allele frequencies file provided by the author, based on modern European CEU data from the 1000 Genomes project Phase 3. Lastly, for KIN (v.3.1.3), the software KINgaroo was used to generate the input files from the bam files previously generated118. For contaminated samples (>5%) filtered by using PMDTools, we did not utilise the contamination correction (-cnt:0) nor the estimation of the location of long runs of homozygosity.
Datasets
A dataset of genomic data from 996 present-day individuals from the Human Origins (HO) panel1,119 was compiled and merged with data from our alpine individuals. In addition, 1.341 West Eurasian ancient individuals were merged with published data from the Allen Ancient DNA Resource (AADR v54.1)50 and used for the PCA and ADMIXTURE analysis (Supplementary Data 10). We filtered ancient individuals in the dataset by removing those with less than 20,000 SNPs and a coverage <0.01. Among duplicated and related, only the individual with the highest number of SNPs was kept. Finally, individuals considered as QUESTIONABLE were not considered. The overlap between the 1240 K SNPs panel48 and the HO data1122 was 593,054 SNPs. After removing indels and transitions, we obtained a total number of 110,416 autosomal SNPs. A second dataset was obtained for qpWave, qpAdm and f-statistics analyses by merging our individuals with the 1240 K AADR dataset (v54.1) (1240 K SNPs)50 using the same set of individuals selected for building the dataset used for PCA and clustering (ADMIXTURE) analyses (without HO data, Supplementary Data 10), with additional 114 recently published ancient genomes44 (Supplementary Data 11).
Principal components analysis
Principal components analysis (PCA) was performed using the programme smartpca (v16000) (EIGENSOFT)120,121 with parameters shrinkmode: yes; lqsproject: yes, on the HO dataset. We performed analysis on modern West Eurasian individuals on which ancient genomes were projected. The period of all the individuals in the PCA was redefined based on the time periods for EIAlp (ME > ~5500 BC; NE: ~5500–3500 BC; CA: ~3500–2200 BC; CA/EBA: ~2900–1900 BC; EBA: ~2200–1500 BC; MBA: ~1600–1300 BC).
Clustering analysis
The software ADMIXTURE (v.1.3.0)122 was run after pruning, to remove SNPs that are in strong linkage disequilibrium, using the option –indep-pairwise 200 25 0.2 in PLINK (v.1.9)123. The software was run from K = 2 to K = 12, with 10 replicates with random seed values for each K. We used the option –cv (cross validation) to find the K value with the lowest errors (Supplementary Information Fig. S21). Results for K = 5 with the lowest cross-validation value were plotted using Pong (v.1.4.9)124 (Supplementary Information Fig. S22).
qpWave/qpAdm analyses
Pairwise-qpWave/qpAdm was run from ADMIXTOOLS (v.7.0.2)8,119 on the 1240 K SNPs dataset. The analysis was done with default parameters and allsnps: YES. We first run pairwise-qpWave to test pairwise clustering of our prehistoric alpine individuals and determine if those from the same period could be grouped (p-value > 0.01). We used a set of right population (outgroup) including, Mbuti.DG, Onge.SG, Papuan.DG, Han.DG, Karitiana.DG, Ethiopia_4500BP.SG, Russia_Ust_Ishim.DG, CHG, Italy_Villabruna, Spain_ElMiron, EHG, Iran_GanjDareh_N, Russia_MA1_HG.SG, Russia_Kostenki14.SG, Israel_Natufian, Czech_Vestonice, GoyetQ116-1, Hungary_Koros, Levant_N, Anatolia_N, Russia_Samara_EBA_Yamnaya. A heatmap based on the p-values calculated with pairwise-qpWave for each pairwise was generated using heatmap.2 function of the R-package gplots125 (Fig. 3 and Supplementary Information Fig. S23).
We then run qpAdm to estimate the ancestral model of prehistoric alpine individuals with the R package admixr (v.1.0.0)126. We used as right populations (outgroups): Mbuti.DG, Onge.SG, Papuan.DG, Han.DG, Karitiana.DG, Ethiopia_4500BP.SG, Russia_Ust_Ishim.DG, CHG, Italy_Villabruna, Spain_ElMiron, EHG, Iran_GanjDareh_N, Russia_MA1_HG.SG, Russia_Kostenki14.SG, Israel_Natufian, Czech_Vestonice, GoyetQ116-1, Hungary_Koros, Levant_N, Anatolia_N (Supplementary Data 16-19). When one of these populations was used as a source, it was removed from the right population.
For our ME individual (MAD01), we performed two-way admixture models using Italy_Villabruna, EHG, OKL, GoyetQ2 and Fournol as sources (Supplementary Data 16).
For all the other individuals from MN to MBA, we performed two-way and three-way models using different combination of sources: MAD01, Italy_Villabruna, Anatolia_N, EHG, Iran_GanjDareh_N (Iran_N) and Russia_Samara_EBA_Yamnaya (Steppe), or a four-way model (ROM402, Fig. 4) using MAD01, CHG (Caucasus HGs), Anatolia_N and Russia_Samara_EBA_Yamnaya (Steppe) (Supplementary Data 17 and 18).
We also performed the qpAdm analysis on groups defined based on the pairwise-qpWave/qpAdm analysis and chronologies (Supplementary Data 19). qpAdm analysis for 2-way models in the alpine groups or individuals (PrehAlps_MN, PrehAlps_CA, PrehAlps_CA-EBA, PrehAlps_EBA PrehAlps_STE05) was also done by using a group of WHGs (Hungary_Koros; Germany_Blatterhohle; Switzerland_Bichon.SG; Croatia_VelaSpila; Italy_OrienteC; Falkenstein; BerryAuBac; Iboussieres25-1; Iboussieres31-2; Rochedane; Drigge, MAD01), instead of MAD01 alone as a source of HG-related ancestry (Supplementary Data 19). We considered the most parsimonious model with a p-value > 0.01 to be most likely.
f-stastistics
All f-statistics were performed with the software ADMIXTOOLS (v.7.0.2)8,119 and the package admixr (v.1.0.0)126 in R with default parameters.
For the f4-statitsics, we used the parameter allsnps: YES. We computed a f4-statistics of the form f4(pop1, pop2; test, Mbuti) to trace possible genetic affinity between our MN individuals (test) and others NE individuals from Europe (pop1 and pop2) or between alpine MN (test) and different groups of HGs and MAD01 (pop1 and pop2) (Supplementary Data 15).
Outgroup f3-statistics analysis of the form of f3(Mbuti; pop1, pop2) was also performed (Supplementary Data 12). To visualise the genetic affinity of MAD01 with others HGs individuals we executed a MDS analysis based on the dissimilarity matrix of pairwise genetic difference values (1 − f3) and we then used the function cmdscale from the package stats (v.3.6.2, https://www.R-project.org) (Supplementary Information Fig. 2B). The genetic clusters were defined in the MDS based on the Extended Data Table 1 and the Fig1c from44. In addition, f3-statistics of the form: f3 (Mbuti; European pop, Alpine group) were performed to estimate possible genetic affinity between European populations and alpine groups, from MN to MBA, as defined by qpWave/qpAdm analysis (Supplementary Data 13 and Supplementary Information Fig. S24).
Estimates of admixture time (DATES)
We used the linkage-disequilibrium-based method DATES (v.4010)53,72 to estimate the timing of admixture between ancestral populations MAD01, Anatolia_N and Steppe in the alpine groups or individuals (defined on qpAdm analysis). We consider 28 years for one generation. To estimate the admixture time between WHG and EHG groups in the alpine ME (MAD01), Villabruna or a group of WHGs (Luxembourg_ Loschbourg.DG; Iraly_Villabruna; Spain_Labrana; Switzerland_Bichon.SG; Italy_OrienteC) was used as a source for WHGs (Supplementary Data 20), and the model was verified by qpAdm analysis (Supplementary Data 16). Instead, to estimate the admixture time between HGs and Anatolian farmers (Turkey_N) in alpine groups or individuals, we used MAD01 or a group of WHGs (Hungary_Koros; Germany_Blatterhohle; Switzerland_Bichon.SG; Croatia_VelaSpila; Italy_OrienteC; Falkenstein; BerryAuBac; Iboussieres25-1; Iboussieres31-2; Rochedane; Drigge, MAD01) as a source for WHGs, and the model was validated by qpAdm (Supplementary Data 19 and 20).
Phenotypic analysis
Hirisplex panel (https://hirisplex.erasmusmc.nl/) was used to attempt to predict eye, skin and hair colour in the alpine individuals. We also analyse specific variants that have been analysed in the Iceman37 and are associated with phenotypic traits of interest (Supplementary Data 21 and 22). Target regions were extracted with bcftool mpileup, and variant-calling was done with bcftools call (v.1.16)107. We filtered variants for a base quality of 30 and with a minimum coverage of 5 reads. The allelic frequency was calculated with PLINK (v1.90)123.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
The raw data (FASTQ files) generated in this study have been submitted to the European Nucleotide Archive (ENA: https://www.ebi.ac.uk/ena/browser/home) with the accession number PRJEB70242. Access codes to previously published data: AADR v54.1 database [https://reich.hms.harvard.edu/datasets] and dataset from [https://doi.org/10.1038/s41586-023-05726-0; ENA accession number PRJEB51862]. The osteological remains of the individuals sampled in this study are under the protection of the Autonomous Province of Bolzano/Bozen and the Autonomous Province of Trento and are preserved at the following local authorities (contact person and Director at the time of the study are listed in brackets): the MUSE of Trento (Alex Fontana and Michele Lanzinger), Fondazione Museo Civico di Rovereto (Maurizio Battisti and Alessandra Cattoi), Museo del Buonconsiglio di Trento (Morena Dallemule and Laura Dal Prà), Ufficio Archaeologico, Provincia Autonoma di Bolzano/Bozen, Italy (Catrin Marzoli), UMSt Soprintendenza per i beni e le attività culturali, Ufficio beni archeologici, Provincia Autonoma di Trento (Elisabetta Mottes and Franco Nicolis,) and the Department of Humanities, University of Trento (Annaluisa Pedrotti, Marco Gozz).
References
Lazaridis, I. et al. Ancient human genomes suggest three ancestral populations for present-day Europeans. Nature 513, 409–413 (2014).
Fu, Q. et al. The genetic history of Ice Age Europe. Nature 534, 200–205 (2016).
De Barros Damgaard, P. et al. The first horse herders and the impact of early Bronze Age steppe expansions into Asia. Science (1979) 360, eaar7711 (2018).
Skoglund, P. et al. Origins and genetic legacy of neolithic farmers and hunter-gatherers in Europe. Science (1979) 336, 466–469 (2012).
Hofmanová, Z. et al. Early farmers from across Europe directly descended from Neolithic Aegeans. Proc. Natl. Acad. Sci. USA 113, 6886–6891 (2016).
Günther, T. & Jakobsson, M. Genes mirror migrations and cultures in prehistoric Europe—a population genomic perspective. Curr. Opin. Genet Dev. 41, 115–123 (2016).
Allentoft, M. E. et al. Population genomics of Bronze Age Eurasia. Nature 522, 167–172 (2015).
Haak, W. et al. Massive migration from the steppe was a source for Indo-European languages in Europe. Nature 522, 207–211 (2015).
Peresani, M. et al. Hunter-gatherers across the great Adriatic-Po region during the Last Glacial Maximum: environmental and cultural dynamics. Quat. Int. 581–582, 128–163 (2021).
Angelucci, D. E. La valle dell’Adige: genesi e modificazione di una grande valle alpina come interazione tra dinamiche naturali e fattori antropici. in: Il fiume, le terre, l’immaginario: l’Adige come fenomeno storiografico complesso (ed Rovigo, V.) pp 9–43 (Osiride, Rovereto, 2016).
Angelucci, D. E. & Bassetti, M. Humans and their landscape from the Alpine Last Glacial Maximum to the Middle Holocene in Trentino: geoarcheological considerations. Preist. Alp. 44, 59–78 (2009).
Ravazzi, C., Peresani, M. & Pini, R. Il Tardoglaciale Nelle Alpi Italiane e in Pianura Padana. Evoluzione Stratigrafica, Storia Della Vegetazione e Del Popolamento Antropico. https://www.researchgate.net/publication/271515247 (2007).
Mottes, E. et al. Between the forest and the lake. The open-air Late Epigravettian site of Arco Via Serafini (Trento, Northern Italy). in IV Incontro Annuale di Preistoria e Protostoria. Ferrara, 7-8 febbraio 2018 (eds. Arzarello, M., Fontana, F., Peresani, M., Peretto, C. & Hohenstein, U. T.) pp 20–23 (Annali dell’Università degli Studi di Ferrara, Museologia Scientifica e Naturalistica, Ferrara, 2017).
Kompatscher, K. & Kompatscher, N. M. Mittelsteinzeitliche Fernverbindungen über den Alpenhauptkamm. in Das Mesolithikum-Projekt Ullafelsen (Teil 1). Mensch und Umwelt im Holozän Tirols (ed Schäfer, D.) vol 1, pp 205–241 (Innsbruck, 2011).
Bertola, S., Fontana, F. & Schäfer, D. Attraversare le Alpi 11.000 anni fa: il Mesolitico antico di alta quota nel settore orientale delle Alpi e il sito di Ullafelsen (Sellrain, Innsbruck, Austria). Riv. Sci. Preist. S1, 57–69 (2020).
Fontana, F. et al. A snapshot of late mesolithic life through death: An appraisal of the lithic and osseous grave goods from the Castelnovian burial of Mondeval de Sora (Dolomites, Italy). PLoS ONE 15, e0237573 (2020).
Cristiani, E., Pedrotti, A. & Gialanella, S. Tradition and innovation between the Mesolithic and Early Neolithic in the Adige Valley (Northeast Italy). New data from a functional and residues analyses of trapezes from Gaban rockshelter. Doc. Praehist. 36, 191–205 (2009).
Mottes, E. Vasi a Bocca Quadrata. Evoluzione Delle Conoscenze, Nuovi Approcci Interpretativi. (Provincia autonoma di Trento, Soprintendenza per i beni culturali, 2021).
Pessina, A. & Tiné Vincenzo. Archeologia Del Neolitico. L’Italia Tra VI e IV Millennio a.C. (Carocci, Rome, 2022).
Guilaine, J. The Neolithization of Mediterranean Europe: Mobility and interactions from the Near East to the Iberian peninsula. in: The Oxford Handbook of Neolithic Europe (eds Fowler, C., Harding, J. & Hofmann, D.) pp 81–98 (Oxford University Press, Oxford, 2015).
Barnett, W. K. Cardial pottery and the agricultural transition in Mediterranean Europe. in: Europe’s First Farmers (ed Douglas Price, T.) pp 93–116 (Cambridge University Press, 2000).
Rivollat, M. et al. Ancient genome-wide DNA from France highlights the complexity of interactions between Mesolithic hunter-gatherers and Neolithic farmers. Sci. Adv. 6, eaaz5344 (2020).
Olalde, I. et al. A common genetic origin for early farmers from Mediterranean Cardial and Central European LBK cultures. Mol. Biol. Evol. 32, 3132–3142 (2015).
Bagolini, B. & Biagi, P. Le più antiche facies ceramiche dell’ambiente Padano. Riv. Sci. Preist. 32, 219–233 (1977).
Bagolini, B. & Biagi, P. Balkan influences in the Neolithic of Northern Italy. Preist. Alp. 21, 49–57 (1985).
Pedrotti, A. Il Neolitico. in Storia del Trentino, I, Preistoria e Protostoria (eds Lanzinger, M., Marzatico, F. & Pedrotti, A.) pp 119–182 (Il Mulino, Bologna, 2001).
De Marinis, R. C. La necropoli di Remedello Sotto e l’età del Rame nella pianura padana a nord del Po. in: L’età del Rame: la pianura padana e le Alpi al tempo di Ötzi. (ed De Marinis, R. C.) pp 301–351 (Compagnia della stampa Massetti Rodella, Brescia, 2013).
Bagolini, B. Il Neolitico e l’età Del Rame. Ricerca a Spilamberto - S. Cesario 1977-1980. (Cassa di Risparmio di Vignola, Vignola, 1981).
Mottes, E. & Nicolis, F. Forme della ritualità funeraria tra età del Rame e antica età del Bronzo nel territorio della Valle dell’Adige (Trentino Alto Adige, Italia settentrionale). Nota di aggiornamento. in: Un lungo percorso di scienza: scritti in onore di Leone Fasani. Memorie del Museo civico di storia naturale di Verona. 2. Serie. Sezione Scienze dell’uomo (eds Martini, F. & Salzani, L.) pp 191–219 (Museo Civico di Storia Naturale di Verona, 2019).
Artioli, G. et al. Long-distance connections in the Copper Age: New evidence from the Alpine Iceman’s copper axe. PLoS ONE 12, e0179263 (2017).
Artioli, G., Angelini, I., Canovaro, C., Kaufmann, G. & Villa, I. M. Lead isotopes of prehistoric copper tools define metallurgical phases in Late Neolithic and Eneolithic Italy. Sci. Rep. 14, 4323 (2024).
Perini, R. L’età Del Bronzo Antico e Medio. (Il Mulino, Bologna, 2001).
Marzatico, F. Le Alpi Orientali: barriera e ponte. in Frattesina cinquant’anni dopo. Il Delta del Po tra Europa e Mediterraneo nei secoli attorno al 1000 a.C. pp 137–164 (Padusa, LVI, Rovigo, 2022).
David, W. L’Italia settentrionale tra il sud del centro Europa ed il Bacino Carpatico. Contatti transalpini nella prima metà del secondo millennio BC tra Pianura Padana e alto e medio Danubio. Riv. Sci. Preist. LXX 219–230 (2020).
Cardarelli, A. et al. The connection between the plains of the Po and the Danube during the Bronze Age seen through the spread of the ‘urnfield model’. Riv. Sci. Preist. LXX 231–243 (2020).
Keller, A. et al. New insights into the Tyrolean Iceman’s origin and phenotype as inferred by whole-genome sequencing. Nat. Commun. 3, 698 (2012).
Wang, K. et al. High-coverage genome of the Tyrolean Iceman reveals unusually high Anatolian farmer ancestry. Cell Genomics 3, 100377 (2023).
Paladin, A. et al. Archaeological questions and genetic answers: Male paternal kinship in a copper age multiple burial from the eastern Italian Alps. J. Archaeol. Sci. Rep. 50, 104103 (2023).
Fernandes, D. M. et al. The spread of steppe and Iranian-related ancestry in the islands of the western Mediterranean. Nat. Ecol. Evol. 4, 334–345 (2020).
Antonio, M. L. et al. Ancient Rome: a genetic crossroads of Europe and the Mediterranean. Science (1979) 366, 708–714 (2019).
Olalde, I. et al. The Beaker phenomenon and the genomic transformation of northwest Europe. Nature 555, 190–196 (2018).
Saupe, T. et al. Ancient genomes reveal structural shifts after the arrival of Steppe-related ancestry in the Italian Peninsula. Curr. Biol. 31, 2576–2591 (2021).
Bortolini, E. et al. Early Alpine occupation backdates westward human migration in Late Glacial Europe. Curr. Biol. 31, 2484–2493 (2021).
Posth, C. et al. Palaeogenomics of Upper Palaeolithic to Neolithic European hunter-gatherers. Nature 615, 117–126 (2023).
Marcus, J. H. et al. Genetic history from the Middle Neolithic to present on the Mediterranean island of Sardinia. Nat. Commun. 11, 939 (2020).
Hodgkins, J. et al. An infant burial from Arma Veirana in northwestern Italy provides insights into funerary practices and female personhood in early Mesolithic Europe. Sci. Rep. 11, 23735 (2021).
Yu, H. et al. Genomic and dietary discontinuities during the Mesolithic and Neolithic in Sicily. iScience 25, 104244 (2022).
Mathieson, I. et al. Genome-wide patterns of selection in 230 ancient Eurasians. Nature 528, 499–503 (2015).
Bernabò Brea, M. & Mazzieri Paola. Nuovi dati sul campaniforme in Emilia. in: L’etá del Rame. La pianura Padana e le Alpi al tempo di Ötzi (ed De Marinis, R. C.) pp 503–525 (Compagnia della Stamapa Massetti Rodella, Brescia, 2013).
Mallick, S. et al. The Allen Ancient DNA Resource (AADR) a curated compendium of ancient human genomes. Sci. Data 11, 182 (2024).
Childebayeva, A. et al. Population genetics and signatures of selection in early Neolithic European Farmers. Mol. Biol. Evol. 39 msac108 (2022).
Freilich, S. et al. Reconstructing genetic histories and social organisation in Neolithic and Bronze Age Croatia. Sci. Rep. 11, 16729 (2021).
Narasimhan, V. M. et al. The formation of human populations in South and Central Asia. Science (1979) 365, eaat7487 (2019).
Lazaridis, I. et al. Genomic insights into the origin of farming in the ancient Near East. Nature 536, 419–424 (2016).
Lipson, M. et al. Parallel palaeogenomic transects reveal complex genetic history of early European farmers. Nature 551, 368–372 (2017).
Mathieson, I. et al. The genomic history of southeastern Europe. Nature 555, 197–203 (2018).
Agranat-Tamir, L. et al. The genomic history of the Bronze Age Southern Levant. Cell 181, 1146–1157.e11 (2020).
Martiniano, R. et al. The population genomics of archaeological transition in west Iberia: Investigation of ancient substructure using imputation and haplotype-based methods. PLoS Genet. 13, e1006852 (2017).
Mittnik, A. et al. Kinship-based social inequality in Bronze Age Europe. Science (1979) 366, 731–734 (2019).
Olalde, I. et al. The genomic history of the Iberian Peninsula over the past 8000 years. Science (1979) 363, 1230–1234 (2019).
Ermini, L. et al. Complete mitochondrial genome sequence of the Tyrolean Iceman. Curr. Biol. 18, 1687–1693 (2008).
Coia, V. et al. Whole mitochondrial DNA sequencing in Alpine populations and the genetic history of the Neolithic Tyrolean Iceman. Sci. Rep. 6, 18932 (2016).
Ringbauer, H., Novembre, J. & Steinrücken, M. Parental relatedness through time revealed by runs of homozygosity in ancient DNA. Nat. Commun. 12, 5425 (2021).
Arzelier, A. et al. Neolithic genomic data from southern France showcase intensified interactions with hunter-gatherer communities. iScience 25, 105387 (2022).
Simões, L. G. et al. Genomic ancestry and social dynamics of the last hunter-gatherers of Atlantic France. Proc. Natl. Acad. Sci. USA 121, e2310545121 (2024).
Hancock, A. M. et al. Human adaptations to diet, subsistence, and ecoregion are due to subtle shifts in allele frequency. Proc. Natl. Acad. Sci. USA 107, 8924–8930 (2010).
Mathieson, S. & Mathieson, I. FADS1 and the timing of human adaptation to agriculture. Mol. Biol. Evol. 35, 2957–2970 (2018).
García-Closas, M. et al. A single nucleotide polymorphism tags variation in the arylamine N-acetyltransferase 2 phenotype in populations of European background. Pharmacogenet. Genomics 21, 231–236 (2011).
Luca, F. et al. Multiple advantageous amino acid variants in the NAT2 gene in human populations. PLoS ONE 3, e3136 (2008).
Magalon, H. et al. Population genetic diversity of the NAT2 gene supports a role of acetylation in human adaptation to farming in Central Asia. Eur. J. Hum. Genet. 16, 243–251 (2008).
Sabbagh, A., Darlu, P., Crouau-Roy, B. & Poloni, E. S. Arylamine N-acetyltransferase 2 (NAT2) genetic diversity and traditional subsistence: a worldwide population survey. PLoS ONE 6, e18507 (2011).
Chintalapati, M., Patterson, N. & Moorjani, P. The spatiotemporal patterns of major human admixture events during the European Holocene. Elife 11, e77625 (2022).
Bagolini, B. & Broglio, A. Il ruolo delle Alpi nei tempi preistorici (dal Paleolitico al Calcolitico). in: Studi di paletnologia in onore di Salvatore M. Puglisi (ed Liverani, M.) pp 663–705 (Università di Roma ‘La Sapienza’, Dipartimento di scienze storiche, archeologiche e antropologiche dell’antichità, Rome, 1985).
Cristiani, E. & Borić, D. Appearance and Function of Harpoons in Northeastern Italy. in: Hunter-Gatherers’ Tool-Kit A Functional Perspective (eds Gibaja, J. F., Marreiros, J., Mazzucco, N. & Clemente-Conte I.) pp 2–28 (Cambridge Scholars Publishing, 2020).
Guilaine, J. La Mer Partagée. La Méditerranée Avant l’écriture, 7000-2000 Avant Jésus-Christ. (Hachette, Paris, 1994).
Pedrotti, A. Il riparo Gaban (Trento) e la neolitizzazione della valle dell’Adige. in Antenate di Venere, 27.000-4000 a.C., Catalogo mostra (Milano, Castello Sforzesco - Sala Sforzesca, 5.12.2008-28.02.2010) (eds Kruta, V., Kruta Poppi, L., Licka, M. & Magni, E.) pp 39–47 (Skira, Milan, 2009).
Perrin, T. Nouvelles réflexions sur la transition Mésolithique récent – Néolithique ancien l’abri Gaban (Trento, Italie). Preist. Alp. 41, 89–146 (2005).
Starnini, E., Biagi, P. & Mazzucco, N. The beginning of the Neolithic in the Po Plain (northern Italy): problems and perspectives. Quat. Int. 470, 301–317 (2018).
Pearce, M. Rethinking the North Italian Early Neolithic. (Accordia Research Institute, University of London, London, 2013).
Penske, S. et al. Early contact between late farming and pastoralist societies in southeastern Europe. Nature 620, 358–365 (2023).
Papac, L. et al. Dynamic changes in genomic and social structures in third millennium BCE central Europe. Sci. Adv. 7, eabi6941 (2021).
Lacan, M. et al. Ancient DNA reveals male diffusion through the Neolithic Mediterranean route. Proc. Natl. Acad. Sci. USA 108, 9788–9791 (2011).
Haak, W. et al. Ancient DNA from European early Neolithic farmers reveals their near eastern affinities. PLoS Biol. 8, e1000536 (2010).
Feldman, M. et al. Late Pleistocene human genome suggests a local origin for the first farmers of central Anatolia. Nat. Commun. 10, 1218 (2019).
Rootsi, S. et al. Distinguishing the co-ancestries of haplogroup G Y-chromosomes in the populations of Europe and the Caucasus. Eur. J. Hum. Genet. 20, 1275–1282 (2012).
Grugni, V. et al. Reconstructing the genetic history of Italians: new insights from a male (Y-chromosome) perspective. Ann. Hum. Biol. 45, 44–56 (2018).
Berger, B. et al. High resolution mapping of y haplogroup G in Tyrol (Austria). Forensic Sci. Int. Genet. 7, 529–536 (2013).
Tecchiati, U. Sepolture e resti umani sparsi in abitati della preistoria e della protostoria dell’Italia settentrionale con particolare riferimento al Trentino-Alto Adige. Not. Archeol. Bergomensi 19, 49–63 (2011).
Jones, E. R. et al. Upper Palaeolithic genomes reveal deep roots of modern Eurasians. Nat. Commun. 6, 8912 (2015).
Nicolis, F. Il culto dei morti nell’antica e media età del Bronzo. in: Storia del Trentino. La preistoria e la protostoria (eds Lanzinger, M., Marzatico, F. & Pedrotti, A.) vol 1, pp 337–365 (Il Mulino, Bologna, 2001).
Casini, S., De Marinis, R. C. & Pedrotti, A. Statue-stele e massi incisi nell’Europa dell’età del rame. Not. Archeol. Bergomensi 3, 335 (1995).
Heyd, V., Kulcsár, G. & Preda Balanica, B. Yamnaya Interactions: Proceedings of the International Workshop held in Helsinki, 25–26 April 2019. in: Yamnaya Impact on Prehistoric Europe (Archaeolingua, Budapest, 2021).
Guyon, L., Guez, J., Toupance, B., Heyer, E. & Chaix, R. Patrilineal segmentary systems provide a peaceful explanation for the post-Neolithic Y-chromosome bottleneck. Nat. Commun. 15, 3243 (2024).
Chyleński, M. et al. Patrilocality and hunter-gatherer-related ancestry of populations in East-Central Europe during the Middle Bronze Age. Nat. Commun. 14, 4395 (2023).
Anguita-Ruiz, A., Aguilera, C. M. & Gil, Á Genetics of lactose intolerance: An updated review and online interactive world maps of phenotype and genotype frequencies. Nutrients 12, 2689 (2020).
Müller, W., Fricke, H., Halliday, A. N., McCulloch, M. T. & Wartho, J. A. Origin and migration of the Alpine Iceman. Science (1979) 302, 862–866 (2003).
Pinhasi, R. et al. Optimal ancient DNA yields from the inner ear part of the human petrous bone. PLoS ONE 10, 1–13 (2015).
Gamba, C. et al. Genome flux and stasis in a five millennium transect of European prehistory. Nat. Commun. 5, 1–9 (2014).
Kircher, M., Sawyer, S. & Meyer, M. Double indexing overcomes inaccuracies in multiplex sequencing on the Illumina platform. Nucleic Acids Res. 40, e3 (2012).
Meyer, M. & Kircher, M. Illumina sequencing library preparation for highly multiplexed target capture and sequencing. Cold Spring Harb. Protoc. 5, pdb.prot5448 (2010).
Zhang, J., Kobert, K., Flouri, T. & Stamatakis, A. PEAR: a fast and accurate Illumina Paired-End reAd mergeR. Bioinformatics 30, 614–620 (2014).
Andrews, R. M. et al. Reanalysis and revision of the Cambridge reference sequence for human mitochondrial DNA. Nat. Genet. 23, 147 (1999).
Li, H. & Durbin, R. Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics 26, 589–595 (2010).
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Peltzer, A. et al. EAGER: efficient ancient genome reconstruction. Genome Biol. 17, 60 (2016).
Jónsson, H., Ginolhac, A., Schubert, M., Johnson, P. L. F. & Orlando, L. MapDamage2.0: fast approximate Bayesian estimates of ancient DNA damage parameters. Bioinformatics 29, 1682–1684 (2013).
Danecek, P. et al. Twelve years of SAMtools and BCFtools. Gigascience 10, giab008 (2021).
Renaud, G., Slon, V., Duggan, A. T. & Kelso, J. Schmutzi: estimation of contamination and endogenous mitochondrial consensus calling for ancient DNA. Genome Biol. 16, 224 (2015).
Rasmussen, M. et al. An aboriginal Australian genome reveals separate human dispersals into Asia. Science (1979) 334, 94–98 (2011).
Skoglund, P. et al. Separating endogenous ancient DNA from modern day contamination in a Siberian Neandertal. Proc. Natl. Acad. Sci. USA 111, 2229–2234 (2014).
Skoglund, P., Storå, J., Götherström, A. & Jakobsson, M. Accurate sex identification of ancient human remains using DNA shotgun sequencing. J. Archaeol. Sci. 40, 4477–4482 (2013).
Mittnik, A., Wang, C. C., Svoboda, J. & Krause, J. A molecular approach to the sexing of the triple burial at the upper paleolithic site of Dolní Vêstonice. PLoS ONE 11, e0163019 (2016).
Weissensteiner, H. et al. HaploGrep 2: mitochondrial haplogroup classification in the era of high-throughput sequencing. Nucleic Acids Res. 44, W58–W63 (2016).
Ralf, A., Montiel González, D., Zhong, K. & Kayser, M. Yleaf: Software for Human Y-Chromosomal Haplogroup Inference from Next-Generation Sequencing Data. Mol. Biol. Evol. 35, 1291–1294 (2018).
The 1000 Genomes Project Consortium A global reference for human genetic variation. Nature 526, 68–74 (2015).
Kuhn, J. M. M., Jakobsson, M. & Günther, T. Estimating genetic kin relationships in prehistoric populations. PLoS ONE 13, e0195491 (2018).
Fernandes, D. M., Cheronet, O., Gelabert, P. & Pinhasi, R. TKGWV2: an ancient DNA relatedness pipeline for ultra-low coverage whole genome shotgun data. Sci. Rep. 11, 21262 (2021).
Popli, D., Peyrégne, S. & Peter, B. M. KIN: a method to infer relatedness from low-coverage ancient DNA. Genome Biol. 24, 10 (2023).
Patterson, N. et al. Ancient admixture in human history. Genetics 192, 1065–1093 (2012).
Patterson, N., Price, A. L. & Reich, D. Population structure and eigenanalysis. PLoS Genet. 2, e190 (2006).
Price, A. L. et al. Principal components analysis corrects for stratification in genome-wide association studies. Nat. Genet. 38, 904–909 (2006).
Alexander, D. H., Novembre, J. & Lange, K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 19, 1655–1664 (2009).
Chang, C. C. et al. Second-generation PLINK: Rising to the challenge of larger and richer datasets. Gigascience 4, 7 (2015).
Behr, A. A., Liu, K. Z., Liu-Fang, G., Nakka, P. & Ramachandran, S. Pong: Fast analysis and visualization of latent clusters in population genetic data. Bioinformatics 32, 2817–2823 (2016).
Warnes, G. et al. Gplots: Various R Programming Tools for Plotting Data. R package version 2 (2005).
Petr, M., Vernot, B. & Kelso, J. admixr—R package for reproducible analyses using ADMIXTOOLS. Bioinformatics 35, 3194–3195 (2019).
Acknowledgements
This research received funding from the “Research Südtirol/Alto Adige 2019” from the Autonomous Province of Bolzano/Bozen (Grant Agreement no. 16170/2020). Additional support was provided by the European Regional Development Fund 2014-2020_CALL-FESR 2017 Research and Innovation_Autonomous Province of Bolzano-South Tyrol (Project: FESR1078-MummyLabs). One-year postdoctoral scholarship to M.C. was funded by the Sven and Lilly Lawski Foundation. The authors thank the Fondazione Museo Civico di Rovereto, Museo del Buonconsiglio di Trento, the MUSE of Trento and, the Ufficio Archeologico, Autonomous Province of Bolzano/Bozen, Italy (Catrin Marzoli). We thank Katrin Renner (Eurac Research) for editing the map in Fig. 1 and Viviana Conti for editing Fig. S16 and for the English revision of the Supplementary Information. The computational results of this work have been achieved using the Life Science Compute Cluster (LiSC) of the University of Vienna. We are grateful to the Department of Innovation (Research University and Museums of the Autonomous Province of Bolzano/Bozen) for covering the Open Access publication costs.
Author information
Authors and Affiliations
Contributions
Conceived the study: V.C. with contribution from A.Pa. and A.Pe.; Analysed the data: M.C. with input and supervision from V.C. and T.G; Organised the sampling collection: A.Pa. with V.C. and A.Pe., Selected and sampled the human remains: A.Pa.; Provided archaeological context: A.Pe., A.Pa.; E.M. and F.N.; Performed the molecular analyses: S.A. and S.Z.; Wrote the original draft: M.C., A.Pa., A.Pe., V.C.; Final draft: V.C. with inputs from all coauthors; Supervision, Funding acquisition and Project Administration: V.C., Additional found resources: F.M.; A.Z.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. A peer review file is available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Croze, M., Paladin, A., Zingale, S. et al. Genomic diversity and structure of prehistoric alpine individuals from the Tyrolean Iceman’s territory. Nat Commun 16, 6431 (2025). https://doi.org/10.1038/s41467-025-61601-8
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41467-025-61601-8






