Introduction

Research of archaeological and historical textiles and skins has increased massively over the past two decades1,2. Although these carefully manufactured products, leather, fur, and textiles of animal or plant fibres, are derived from different raw materials and produced using different manufacturing methods3,4, Susanna Harris’ research has highlighted the close relationship between them and their shared functionality as they are ‘flexible, thin sheets that can be wrapped, shaped, and folded and are used to clothe, cover and contain’5,6,7.

Cloth-type materials, or cloth, were used by all societies, yet the way they were used varied according to cultural identity, preferences, and values7. The term ‘cloth cultures’ which we adopt here, signifies just that; while every society makes use of cloth-type materials, they do so in culturally specific ways. Moreover, cloth cultures include far more than just clothing. As the examples of this article will demonstrate, cloth was a fundamental aspect of human societies and was used for furnishing, storage, transportation by boat or sledge, as well as protection in the form of shields, sheats and shelters, and much more. Both raw materials and more or less finished products were traded over large distances from very early times onward and were valued greatly, which is also clearly evident from early written sources. Much later and mostly in Europe, the search for raw materials, as well as the urge to improve production techniques, were among the main drivers behind the development of the colonial enterprise, and the Industrial Revolution8,9,10. Today, cloth of natural raw materials remains fundamental in our society even after the invention of synthetic fabrics and other alternatives.

Despite cloth’s economical value and profound impact across cultures, its significance has often been overlooked. This is in part due to its poor chances of survival in the archaeological record and in part to a lack of understanding of its value, importance and original abundance in ancient societies. Lately, however, cloth-type materials have been increasingly recognised as an important means to study broad and complex themes such as society, identity, technology, economy and trade (e.g. refs. 11,12,13,14,15,16).

Central for such studies is the understanding of its entire chaîne opératoire, from the procurement of raw materials to the creation of the final product. Each step in this process varies depending on the type of cloth, and directly affects the properties of the final product17. Traditionally, animal skin and fibre have been studied using morphometric methods, most prominently visual and microscopic techniques (e.g. refs. 3,18,19). These can provide taxonomic identifications, as well as details on the composition and structure of skins and fibres like the colour, quality and preservation state20. These methods have their strength in often being non-invasive to minimally destructive and widely accessible. However, their application requires extensive training and knowledge about, amongst others, intra and interspecies variation. A recurring concern for their use in species identification has been the absence of objective, standardised methods and comprehensive modern reference collections encompassing various species and geographic regions. Efforts to address this issue, such as those undertaken by Skinner et al.21 represent notable advancements. Nevertheless, morphometric approaches face certain challenges in archaeological materials, particularly due to degradation and post-depositional processes. Manufacturing methods can also alter morphological traits, such as skin grain surfaces or hair scale patterns, thereby hindering optimal analysis.

In order to fully understand the meaning and role of cloth materials, it is important to gain as much information as possible from textiles and animal skins. The range of scientific methods that contribute to the study of archaeological cloth has expanded dramatically over the past decades. Today, it encompasses a wide and increasing array of biomolecular tools, of which the analysis of ancient DNA and proteins has received particular attention (Fig. 1). We have termed this new interdisciplinary field ‘clothomics’, as it applies diverse omics methodologies, such as genomics, proteomics, lipidomics, metabolomics and/or isotopic analysis, to expand the horizons of cloth research.

Fig. 1: Number of articles published in English each year investigating historical or archaeological fibres and skins, processed for clothing or furnishing, with ancient protein and/or DNA approaches based on a Google Scholar search.
figure 1

A full list of the publications can be found in Table S1.

In this review, we provide a practical step-by-step guide for scholars working between cloth research, conservation and heritage science, and entering this new field. In doing so, we address common questions about the use of DNA- and protein-based techniques in cloth materials, focusing on their current applications, different methodological approaches, limitations, and practical considerations, such as preservation, sampling and storage, that should be considered in the design of a clothomic study. Finally, we provide recommendations for future research and highlight the importance of integrating results with archaeological knowledge.

The review emphasises genomics and proteomics which are the most widely used omics methods in ancient cloth research and are within our expertise, while acknowledging the potential for future exploration of other techniques. Likewise, the focus is largely on animal-based materials, wool, fur, and skin, which have been more extensively researched. While various studies have been conducted on silk (e.g. refs. 22,23) the potential for applying these techniques to plant materials in the future remains open. Additionally, we point the readers to supplementary resources, including previous reviews focusing on textiles and skins24,25, peptide mass fingerprinting (PMF) of collagen (also called ZooMS or Zooarchaeology by Mass Spectrometry)26, palaeoproteomics27 and ancient DNA analyses28.

Applications

Initial attempts to retrieve ancient DNA on cloth during the early 2000s showed the difficulties of conducting genetic analysis in historical and modern leather, due to significant degradation occurring during the tanning process29,30. This was backed in 2014 by Brandt and colleagues who showed that DNA was not present in skin garments from Danish peat bogs, whereas there were promising prospects for protein recovery31. Proteomics had already been shown to be successful in Arctic utilitarian skin objects32. Additionally, Hollemeyer et al.33,34 applied proteomics on animal hair in the pioneering study of the clothing of Ötzi, a 5000-year-old mummy preserved in the Alps. The preservation of DNA on fibres was first investigated by Brandt et al.35 and Brandt36 on modern and archaeological fibres from Denmark and Greenland, but showed that DNA is generally very degraded in fibres from archaeological environments. With these early findings of limited DNA preservation in ancient and historical cloth, interest has turned towards the application of proteomics (Fig. 1).

Since these early studies, omics analyses have contributed to answer and enrich various problematics in cloth research (Fig. 2). In particular, they have been useful for species identification of common domestic animals like sheep, goat or cattle (e.g. refs. 37,38). More recently, regional domesticated and wild species have gained more attention. For example, Solazzo and Phipps39 determined that Precolumbian textiles from coastal Peru, dating as early as 300 BCE-100 CE, were made with fibres of mountain Viscacha, providing the first archaeological evidence of its use. In addition, this animal is geographically distributed in the highlands, which raises new questions about trade and cultural interactions in the region. Azémard et al.40 conducted the first attempt to differentiate four South American camelids (guanaco, vicuña, llama, alpaca) and identified proteomically the use of llama and alpaca fibres in Peru around 950–1350 CE, shedding more light into the diversity of animal resources used for fibre production in this area.

Fig. 2: Clothomics and its applications to a wide array of themes, all interconnected.
figure 2

The field examines various aspects, such as the choice of locally available natural resources, the set of animal husbandry and farming practices tailored to answer garment production demands, the contribution of hunting in economic activities, landscape use and mobility, as well as cultural exchanges and trade routes. It also sheds light on dress practices and identities of past individuals and communities, and craft technologies related to the functionality and material properties of cloth.

Clothomics can also shed light on modes of production that are usually invisible in the archaeological record, such as hunting or nomadic and semi-nomadic pastoralism and their relation to agricultural societies. Nomadic Scythians utilised species like red fox, felids and squirrels41, while during the Viking Age, seal hair38 and beaver fur were employed42. In ancient Egypt, the use of skin from gazelle has been evidenced21 and in the Norse and historical Arctic, species such as Arctic fox, Arctic hare, wolverine and various seal species have also been documented43,44.

In other contexts, studies have brought unexpected findings, such as the use of human skin by Scythian populations to cover some of their quivers, supporting the claim by Herodotus that these were made of human skin from their defeated enemies41.

But, certainly, one of the best-analysed ancient wardrobes belongs to the Ötzi man. Multiple proteomic and genetic analyses have been conducted over the years on the various clothing items and elements with which he was equipped33,34,45,46,47. These proteomic analyses successfully identified various animal sources, including two samples from his coat, one from his legging to sheep and one from his moccasins to cattle and in 2012 added goat, brown bear, chamois, red deer and a canid to the list of animal types used.

On the other hand, large-scale studies have shed light on animal use patterns and correlations with certain types of garment or material. A study of 109 samples of animal fibres from the Bronze Age-Iron Age in China identified a predominance of sheep and goat, followed by cattle, and the occasional use of human and camel hair48. Interestingly, the study found that textiles were made mainly of sheep wool, whereas furs or pelts were mostly goat. Another study conducted by Brandt et al.42 determined that fur clothing was produced exclusively from wild animals while leather used for accessories or grave furnishing was made of domesticated animals. In 2020, a paper by Brandt et al.49 used a protein-based method to identify 115 elements of shoes dating to the Danish Viking Age or mediaeval period. The identifications showed a particular pattern of some shoe elements consistently being made of the skin of a specific species arguing for at least some choices of material being made based on functionality of elements, and some being made by taste, fashion or affordability.

Besides species identification, clothomics also holds promise for the investigation of animal management strategies and the selection of specific types of animals for cloth production. Shirazi and colleagues50 carried out the first genetic analysis to determine sex ratios from an assemblage of archaeological leather. It demonstrated that 13th-century Indigenous populations from the American Plains preferentially used hides from female bison to produce leather moccasins. It is not clear if selection occurred during the hunt or if they were chosen among the hunted bison, but future comparisons with bone remains could provide additional insights into hunting strategies and the use of hides. Age could also be a selection criteria for leather, as the skin of young animals are softer and thinner. Exactly the use of calf skin has been demonstrated through proteomics by the recovery of foetal haemoglobin - a blood protein specific to very young animals31.

Deciding on your strategy for analysis

Which biomolecules can be analysed in archaeological cloth?

Proteins and DNA are the main biomolecules that can be studied in archaeological cloth. Proteins play a pivotal role in virtually all biological processes within a cell. Twenty different amino acids constitute the building blocks of proteins, which are synthesised through the transcription and translation of DNA sequences. This linear chain of amino acids represents the protein’s primary structure, but they can further fold into intricate 3D shapes, known as secondary and tertiary structures (Fig. 3). Within archaeological contexts, the most abundant proteins are collagen in the case of, for example, bone and skin (Box 1), keratin in tissues such as wool and hair (Box 2) and fibroin in silk (since the protein sericin is partially or completely removed during the production of threads24). For more information on the structure and characteristics of fibroin, we refer to previous articles22,24. Because amino acid sequence variations in collagen and keratin can be found between different biological species (reflecting genetic signals), proteomics has been widely used in archaeology for species determination.

Deoxyribonucleic acid (DNA) is the hereditary material of all known living organisms. It is structured as a double-stranded helix with sequences of nucleotides on each strand that store the genetic information as a ‘code’. The code is made up of a combination of four chemical bases: adenine (A), thymine (T), cytosine (C) and guanine (G). These bases pair with each other, so that A on one strand always pairs with T on the opposite strand, and C on one strand always pairs with G on the other. The pairs are often referred to as base pairs (bp). The complete DNA sequence is known as the genome and is divided into chromosomes and smaller segments called genes, which encode proteins. Nuclear DNA is located in the cell nucleus where they exist in two variants, one from each parent. A small part of the genome is found outside the nucleus in the cell’s energy-producing organelles, the mitochondria. This mitochondrial DNA (mtDNA) is inherited only through the maternal line, it is not recombined and it is highly variable, which enables reconstruction of maternal lineages (haplogroups) through time.

What factors affect molecular preservation in archaeological cloth and how does it impact analytical success?

When deciding on an analytical strategy for archaeological cloth, information on cloth biography, including its age, processing methods and burial environment, is of major importance to ensure the best results and success.

The manufacturing process is known to significantly impact molecular preservation. Previous research has proved that dyeing and mordanting of animal fibres, as well as tanning of animal skin, hamper the potential for DNA analysis29,35, whereas these have successfully yielded proteins31,39,49.

In terms of environmental factors, while certain contexts are favourable for the preservation of textiles and skins, they may still be harmful to DNA preservation. This is at least the case for waterlogged environments with low pH or acidic conditions, which, on the other hand, can still preserve proteins, as demonstrated by the successful analysis of leather from Danish peat bogs31. Similarly, animal fibres from several Danish Viking Age burials exhibited very poor endogenous DNA preservation, rendering any interpretation of DNA impossible, yet proteins were preserved in the samples42. Sinding et al.43 have, however, demonstrated that, in cold Arctic environments, it remains possible to extract mtDNA from fibres and this was also shown on the Ötzi´s clothing45. This is promising for at least some of the environments from which cloth is well-preserved such as cold and permafrozen sites of Greenland, Northern Scandinavia, and Mongolia. For other sites with well-preserved textiles, such as dry sites in North and South America and Asia, this potential has yet to be demonstrated. Theoretically, heat degrades both DNA and proteins, but studies conducted in arid or dry climates have experienced good results using proteomic methods in animal fibres from South America and Asia39,48 and also leather from Egypt51,52,53. Even mineralised fibres on copper have yielded successful peptide mass fingerprints38.

Despite these successful examples, proteins degrade over time and are also impacted by factors such as time, temperature, burial environment and manufacturing. Protein deamidation, the conversion of glutamine (Gln/Q) and asparagine (Asn/N) to glutamic (Glu/E) and aspartic (Asp/D) acids influenced by pH and temperature result in a mass shift of +0.984 Da24. Other frequent forms of degradation are hydrolysis and oxidation which can be accelerated by factors such as the above-mentioned and can lead to breakage of the protein chain and thus shorter peptides, which may be more difficult to identify54. Furthermore, it has been shown that KAPs are less abundant than ɑ-keratins in aged wool when exposed to increased microbial activity55, but can still survive under certain conditions. In leather, mainly collagen is preserved due to production processes like dehairing and tanning. Altogether, this could lead to a lower level of taxonomic identification if peptides with amino acid differences between species are completely or partially degraded37.

In the case of ancient DNA, only a fraction of the original DNA is preserved as it undergoes increasing fragmentation, and this results typically in fragments shorter than 100 bp56. Furthermore, aDNA molecules accumulate chemical modifications. The most prominent is deamination which occurs when cytosines are converted to uracils (and sequenced as thymines by the instrument) at the end of the fragments, resulting in increased C-to-T misincorporations. At the moment, nuclear DNA has not yet been successfully retrieved from ancient textiles. This may be due to the initial low amount of DNA in hair, combined with a more rapid degradation than mtDNA57. Hundreds to thousands of mtDNA copies exist in each cell which makes it easier to retrieve in archaeological materials, compared to nuclear DNA.

In conclusion, both DNA and proteins are affected by degradation, albeit to varying extents. Hence, the following section provides further guidance on when it is more appropriate to select proteins or DNA as an analytical strategy.

What are the opportunities and challenges of protein and DNA analysis respectively?

Proteins have been shown to be more resistant and survive longer than DNA. This is evidenced in remarkable findings from 6.5 and 3.8 million-year-old ostrich eggshells58,59 and a 3.4 million-year-old camel bone60. The binding of proteins to minerals has been described as one of the key factors contributing to their preservation58. While the mineral component of animal skin and fibre is minimal, the presence of metals like iron or copper in contact with textiles can result in the replacement of fibres by minerals and this has been shown to enhance protein preservation in fibre and also likely skin38. Other factors, such as the protein 3D structure, folding and cross-linking, may play a role in protein preservation in artefacts from harsh environments where DNA no longer persists, although this is still poorly understood61,62. Thus, as described above, in very old, mineralised, arid, and acidic environments and in materials that have undergone harsh manufacturing processes, the survival and, thus, analysis of proteins is superior to that of DNA.

In addition to this, proteins have other unique characteristics that render them particularly useful compared with DNA. Unlike the genome, the proteome (all proteins in an organism) is a dynamic entity comprising over 10,000 proteins that are differentially expressed over time and space. As a result, proteins can vary across different biological tissues, life developmental stages and health conditions. Moreover, once formed, proteins undergo post-translational modifications (PTMs) that alter their properties and function. Thus, while the genome remains static, the proteome is more complex and continually changes under different conditions, providing additional layers of information.

For example, different components like collagens and keratins can be discerned in skin and hair samples, and the presence of haemoglobin in animal skin has been used to identify the young age of an archaeological calfskin31. Studies of modern sheep wool have also demonstrated the ability to distinguish modern sheep breeds and detect differences associated with fibre characteristics, like curvature and colour, and nutrition (e.g. refs. 63,64). So far, tissue-specificity and stage-specific expression still need to be further studied in archaeological skin and textiles before their full potential can be evaluated.

Unlike the analysis of the structural protein collagen that has not evolved much over time, DNA offers greater variation in its sequence27 and thus also precision in terms of species determination. For example, sometimes it is not possible to determine the species with protein analysis, regardless of protein sequencing or PMF, but only the genus or family due to phylogenetic proximity as in the case of various African Bovidae65. Beyond identifying species, DNA can offer insights into the sex of the animal and fine-scale population structure. For example, it can be used to distinguish between domestic and wild variants, as well as infer relatedness between different populations or breeds. This may prove invaluable for understanding preferences for certain types of animals as a source of leather or fibre, and how these changed across various periods and regions. A notable example is the study of the York Gospels, where each parchment bifolium was identified with protein analysis, but further characterised with genetics in terms of sex and population diversity66. Thus, DNA analysis can help to shed light on demographic history or geographic provenance. Finally, examining genetic variants associated with wool traits and selection may reveal shifts in wool properties, such as quality or colour (e.g. ref. 67). Even in samples where nuclear DNA is poorly preserved, it might still be possible to retrieve mtDNA sequences due to its abundance in cells and, based on this, reconstruct maternal lines.

Overall, DNA provides additional information about population history and sex attribution. The greatest challenge is however its preservation. Proteins can offer opportunities for species identification in these cases and also age determination, among others.

How can you determine the preservation status of an archaeological sample?

Several screening methods have been employed in archaeology to assess molecular preservation prior to palaeo-genomic and -proteomic analysis, a process that is destructive, costly and time-consuming. Visual assessment of the material appearance, including parameters like flexibility or dryness/brittleness, can give an approximate idea of the degradation state, but are limited by their subjective nature and may not accurately reflect molecular preservation.

Collagen preservation can be tested using Fourier transform infrared spectroscopy in attenuated total reflectance mode (FTIR-ATR). For bones from arid environments, it has been shown that bones with a collagen content of less than 3% of the total weight, did not yield any peptides68. Another study presented different thresholds to distinguish between well and poorly preserved bone samples for DNA and protein analysis69. Portable FTIR-ATR devices have also the advantage that can been deployed in the field to predict ZooMS success70. Archaeological leather degradation has also been examined with FTIR-ATR71 whereas FTIR microspectroscopy in reflectance mode has been favoured for fragile excavated textiles due to its non-invasive and non-destructive nature72. Nevertheless, to our knowledge, fibre and leather degradation have not been compared directly to palaeo-genomic or -proteomic results. Other pre-screening methods for bones include portable near-infrared (NIR) spectroscopy73 and handheld Raman spectroscopy74,75, which can assess collagen preservation in the field and museums and could be applied to textiles or leather as well.

Thus, depending on the ‘value’ or rarity of the sample, its size, the opportunity to destructively sample it, and the questions asked, a strategy can be decided in a collaboration between heritage scientists, cloth researchers and conservators (Fig. 4).

Fig. 3: Structure of commonly recovered proteins in archaeological cloth, from macro- to micro-scale.
figure 3

a Collagen structure after26. b Keratin structure. Graphics: Signe Bjerregaard.

A guideline for proteomic and genomic users

What are the best practices to obtain, handle and store archaeological and historical samples?

Sampling and storage

Different sampling protocols are available for palaeoproteomics, from minimally destructive to non-invasive techniques. A non-invasive eraser method, initially developed for the analysis of parchment76, has, since its introduction, been tested on leather as well. However, its application to leather presents some challenges, as initial attempts have yielded varied results due to factors such as dirt accumulation and the presence of colourant coatings on the grain surface21,77. In addition, the flesh side may not be accessible and if the leather surface is degraded and friable, it risks damage during the sampling. It has been shown, for instance, that the use of the eraser can leave traces on bone surfaces78. The most reliable results have been obtained from leather fibres extracted from the edges or cross-sections within the dermis layer21,77. Other non-destructive methods used in bone artefacts include the analysis of the plastic storage bags or membrane boxes which have contained the objects in question79,80, but they usually provide lower taxonomic resolution and quality spectra compared to destructive analysis. Within the cultural heritage field, various films or gels have been applied to object surfaces81,82,83,84 but non-invasive methods remain largely untested on leather and wool substrates.

Generally, destructive sampling requires only a small sample size (<10 mg), with recent methods allowing to work with 1 mg or less of sample85. However, while minimal destruction should be attempted and is often required when the sample is fragile, small or rare, it is important to consider that this could result in a decreased protein recovery. Thus, it should be ensured that enough sample material has been taken to successfully retrieve proteins. In the case of wool, estimating the necessary material amount may be challenging as the weight can be highly variable across samples of similar dimensions, especially in field or museum settings, where precise analytical balances may not be available. Similarly, the weight of leather samples may vary depending if the sample is dry or wet. The weight can increase by at least 50% when wet, as usually happens in Northern European contexts, compared to dry samples that have been stored in museums for a long period86. It is preferable to sample leather when it is dry, to avoid the development of mould, especially during transportation.

Considering whether a single sample could be used for multiple analyses is also important. For example, the same protein extract employed for ZooMS can be reused for shotgun proteomics, without the need to take a new sample. Combined protein and dye extraction methods have also been tested, and offer great potential to minimise the number of samples87. However, enough subsamples should be taken to carry out other potential analyses, such as DNA analysis, C14 dating, microscopy or isotope analysis, in order to insert proteomics in the multi-proxy analysis of a single artefact. For DNA analysis, for example, around 10–200 mg of ancient wool is needed, which for most samples equals a thread of 2–3 cm in length35.

Regarding wool textiles, either fragments of fabric or individual threads can be taken. It is particularly important to note whether the sample is from the warp (vertical system) or weft (horizontal system) and why it was decided to sample only one or both systems. Ideally, samples should be retrieved from one thread and one weave direction, as the warp may be composed of different fibres than the weft. Also, wool could be sourced from different individuals or species and mixed during the processing of raw wool or the spinning, so taking multiple samples is recommended if possible. It is also recommended to avoid areas with dyes or colourants as these treatments could impact the results. To ensure traceability, marking the precise location on a photograph of the whole object and/or fragment and thoroughly documenting the whole sampling process is very important. Photographs taken before sampling are the most important, if one has to prioritise, as the sample will show the after-state. Collaboration between archaeologists, physical anthropologists and experts in cloth research, familiar with the material, burial, site, region or period is highly recommended, as they can assist in developing an appropriate sample strategy, selecting the best fragments, providing basic documentation and interpreting the results. Thus, their insights can significantly enhance the quality and relevance of the proteomic or genomic analysis.

Contamination

When determining the exact location for sampling, several aspects should be taken into consideration. Conservators often favour edges or loose pieces for sampling, but these areas may be more degraded or contaminated. In addition, sampling at places that show evidence of previous conservation treatments should be avoided. This is particularly relevant for samples excavated many years ago or preserved with treatments that are no longer used, because we know they are harmful to the specimens and also, in some cases, the individuals handling them. Here, access to records of previous treatment processes, cleaning methods or storage conditions is very useful. For example, the conservation of leather with polyethylene glycol (PEG) may be problematic for subsequent ZooMS analysis if its molecular weight is the same as the molecular weight of identifying peptide markers (see section 7). These peptides may be impossible to detect on the mass spectrum of the sample (Fig. 5). Thus, careful consideration of the sampling location is essential to ensure good results.

Fig. 4
figure 4

Decision tree for choosing the right sampling and analytical strategy.

To retrieve the samples, it is advisable to follow specific precautions to avoid contamination, since skin or other keratin proteins could mask proteins of interest. Nitrile gloves (not latex) should be worn during sampling and, if possible, accompanied by long sleeves and hairnets. In some cases where objects have been handled a lot, this may, however, have less impact on the total amount of contamination. The use of wool clothing should be avoided to prevent keratin contamination. Regarding the equipment, scissors or scalpels are suitable to cut the material, while tweezers can be employed to gently pull out and place the sample into individual plastic tubes or bags. If the sample is fragile or gets loose, it can be placed on a folded paper or aluminium foil beforehand. Depending on the object history, these may have been extensively handled over the years, preserved in situ on the body or bone remains or stored with other samples, so there is a possible risk of cross-contamination. Despite this, sterile tools should be utilised for sampling and thoroughly cleaned between each sample to avoid further cross-contamination. For this reason, samples should be taken as early as possible during the excavation, to avoid minimal handling. In the case of DNA, these precautions should be followed even more carefully as the risk of contamination is higher due to the involved Polymerase Chain Reaction (PCR) step, which can amplify even a few contaminant sequences to large amounts that will mask endogenous DNA signals.

Management of data and samples

The archaeological context is very important for future interpretation and traceability and samples should be given a unique and consistent identifier. The artefact number and provenance information should be included in the label so that it can be linked to the original artefact, museum catalogues, reports of the sites and burials, and excavation numbers.

Last but not least, it is important to establish agreements about how and what material (e.g. unused sample material or extracts) should be returned to museums or researchers, and how, where and how long they should be stored. Moreover, data management procedures should be outlined. This includes how data is kept, shared and published, including negative results to prevent future unnecessary destruction of samples88.

What are the most commonly employed genomic methods?

Most studies of archaeological cloth so far have used PCR-based approaches to retrieve target regions of interest, such as short mtDNA fragments. The advent of next-generation sequencing (NGS) has allowed the analysis of whole ancient genomes (mitochondrial and nuclear) by sequencing millions of untargeted DNA sequences or reads at the same time, particularly suitable for short damaged fragments, while reducing time and cost. During the last years NGS has also been applied to archaeological textiles or fur42,43,44,45,50.

NGS approaches involve several steps: sampling, extraction, library preparation, PCR amplification and sequencing (Fig. 6). This is by far the main route and although other ways are possible, optional steps are not covered here (see ref. 28).

Fig. 5: PEG peaks in a ZooMS spectrum of a leather sample from 11th-century Danish Viborg49.
figure 5

The peak (m/z 1201) is within the range of where one should expect to find the collagen peptide marker COL1α2 978–990, which for instance is important for distinguishing cattle (m/z 1192+1208) from sheep and goat (m/z 1180+1196) and horse (m/z 1182+1198), and could easily overlay and thus hinder their identification.

What are the differences between the different proteomic methods and how to choose between them?

There are two main routes for protein analysis on cloth and archaeological materials in general: peptide mass fingerprinting (PMF) and shotgun proteomics (Table 1). PMF focuses on one protein (the most dominant in a sample, i.e. collagen in skin and keratin in wool textiles and fur), while shotgun proteomics analyses all proteins contained in a sample. Although the extraction, digestion and purification of the sample can be similar, the following mass spectrometry (MS) differs. The first method employs single MS, such as MALDI-TOF (matrix-assisted laser desorption/ionisation–time-of-flight), which generates spectra indicating the intensity (abundance) of peptides of a certain mass/charge ratio (m/z). These spectra are then searched for peptide ‘markers’ or fingerprints, which are known peptides with differing m/z between species (see flow chart in e.g. ref. 26).

Table 1 Comparison of instruments and approaches used in studies of ancient cloth based on the following review papers26,27,28 as well as specific references for textiles35,108 and skin29,31,109

The effectiveness of species determination using PMF relies ultimately on the existence of markers unique to each species. For instance, sheep and goats can be distinguished by a single peptide marker (the mass of the peptide located at the position COL1ɑ2 757–789 is 3077.4 for goat and 3017.4 for sheep89). Similarly, in keratin PMF, separation of sheep and goats can be achieved based on the keratin K33a marker90. However, some species are so closely related to each other that they do not present any discernible amino acid differences in current PMF markers and often require further differentiation by liquid chromatography–tandem mass spectrometry (LC-MS/MS) analysis.

In contrast to PMF, with this method peptides are further separated into fragments, enabling the determination of the actual amino acid sequences (Fig. 7). This approach can provide evidence of degradation patterns and allows the analysis of complex mixtures of proteins. The resolution of shotgun proteomics analysis is thus higher (see examples in refs. 41,42), but the downstream analysis is also more complex and time-consuming. Whereas PMF can be conducted using open source software like mMass91 and comparing manually with reference lists (e.g. for collagen, a dataset is available through the University of York (https://docs.google.com/spreadsheets/d/1ipm9fFFyha8IEzRO2F5zVXIk0ldwYiWgX5pGqETzBco/edit)), shotgun proteomics requires more advanced software such as MaxQuant or PEAKS that conducts protein sequence searches against large publicly available databases such as NCBI or UniProt. The price of MS/MS analyses is higher due to the need for more advanced instrumentation, which may be done at another institution, and the longer analysis time26,27. Therefore, single MS is more often used for larger sample sets, although the final decision will ultimately depend on the specific research question(s).

Fig. 6: General workflow for DNA analysis.
figure 6

After samples are obtained, the cells are broken down and DNA is released. Extraction is followed by purification, where DNA is isolated. Next-generation sequencing requires DNA libraries to be built. Compared to PCR methods which target specific regions, libraries contain all the extracted DNA in a sample. Double-stranded library preparation is commonly used and involves the repair and fill-in of damaged DNA fragments, together with the addition of adaptors (short fragments of DNA) and “indexes”, which are unique short sequences that act like barcodes to differentiate samples when these are sequenced in parallel. Following library preparation, PCR amplification is necessary prior to sequencing, given that the amount of endogenous DNA from archaeological samples is typically small. PCR involves multiple cycles of temperature changes which causes strands of DNA to be replicated. Subsequently, the amplified libraries are pooled and sequenced and data is analysed with bioinformatic tools. The first step is to authenticate ancient DNA sequences and assess the degree of contamination. The potential of the data depends upon how well the genome is covered—the higher the coverage, the larger the analytical potential. Graphics: Signe Bjerregaard.

Fig. 7: General workflow for LC-MS/MS analysis.
figure 7

Proteins are extracted from the samples using various solutions, based on their chemical properties. Hair which contains cysteines requires a reduction and alkylation step to break disulfide bonds. Digestion is then performed with an enzyme, normally trypsin, which cleaves the protein into peptides. Before the peptides enter the mass spectrometer, they are separated by their hydrophobicity using liquid chromatography. These are then ionised and introduced successively to the mass spectrometer using electrospray ionisation (ESI). In tandem MS (or MS/MS), a first mass analyser is used to record peptide masses in a particular retention time (MS1). In data dependent acquisition (DDA), only the top intensity peptides from the first MS round are selected with a very narrow window, while lower abundance peptides are left out. This is the main acquisition method used in palaeoproteomics. The individual peptides (called precursor ions) are isolated and passed on to a collision cell (CID/HCD) for fragmentation. This process breaks the precursor ion into smaller fragment ions that are further analysed by a second mass analyser to generate m/z values of the fragments (MS2). The whole amino acid sequence of the peptide can be reconstructed by analysing the differences between masses, which correspond to the mass of one amino acid. Protein identification involves a database search that compares the MS2 spectrum against theoretically predicted spectra found on a chosen database to obtain the best match. Graphics: Signe Bjerregaard.

What do I need to take into account when budgeting this type of study?

Proteomic and genomic analyses can be undertaken in different ways: commercial analysis, research collaboration or hiring of trained personnel. Prices may vary between institutions based on their infrastructure, setups and how sample cost is calculated, i.e. including or not including salary. Currently, only a few institutions offer commercial analyses of ancient cloth materials. While facilities analysing modern materials are more abundant, these may, however, not be set up to handle specific challenges relating to ancient samples such as contamination. Using a commercial service is an easy way to have samples analysed, but if choosing this route, it is important to ensure that the results come with an interpretation. Otherwise, they may be hard to use without the necessary background.

In a research collaboration, an agreement on the economic contribution to the analysis and possibly salary has to be agreed upon. While this approach can foster extensive discussion on samples, context, results and interpretations, there are risks involved. People who are funded mainly by other projects may not be able to prioritise side projects, potentially leading to delays in established timelines. Finally, an alternative is to hire dedicated staff to carry out analyses. This ensures more security that results are delivered on time, but it is also a more expensive solution.

What are their current limitations?

The preservation of DNA is the main limitation of aDNA analysis. This may be overcome in the future by focusing on bone materials rather than cloth for evolutionary questions, method development and selection of samples from very favourable environments. Because of these challenging preservation conditions, here, we will focus on the current limitations of palaeoproteomic methods. While the application of palaeoproteomics to cloth materials has become more widespread, several challenges persist in both PMF and LC-MS/MS analyses regarding the recovery, detection and analysis of proteins26,27.

Recovery and detection methods

One area of particular importance is the recovery and detection of proteins. Various factors, including preservation conditions and chemical alterations occurring during manufacturing and post-depositional processes, have a profound impact on the efficacy of analytical protocols and mass spectrometry instrumentation. Keratins are structurally rigid and stable proteins and have numerous cystine disulfide linkages which can be hard to break down. In addition, KAPs tend to be underrepresented due to degradation processes such as dyeing and mordanting, which would require further investigations87. This is significant because KAPs present a broader diversity of protein types and sequences compared to keratins, characterised by repetitive regions92.

In the forthcoming years, we expect, for instance, to see the development and optimisation of extraction protocols tailored for keratins which will be able to increase the yield and characterisation of wool and fur42,85. Furthermore, the use of alternative enzymes for digestion or their combination has been shown to increase the number of peptides and proteins recovered, obtaining a larger proportion of the proteome and coverage93,94,95,96. For example, until recently, horses and donkeys remained indistinguishable, but the use of a different enzyme, chymotrypsin, enabled the identification of a new ZooMS marker96.

Finally, to overcome some of the limitations of PMF (lower confidence in protein identification) and LC-MS/MS (cost, time, complex data analysis) new methods such as species by proteome investigation (SPIN) have been developed for archaeological bones97. Contrary to the standard palaeoproteomic methods, SPIN uses data-independent acquisition (DIA). In DIA, peptides within a wide, moving m/z window are systematically selected for further fragmentation. This allows the identification of lower abundance peptides and a higher sample throughput, at the expense of more challenging data analysis and the inability to use database search. However, these methods remain untested on fibre and skin-based materials and warrant further research.

Analysis and interpretation of data

Following developments in protein extraction, we expect improvements within the data analysis for both PMF and LC-MS/MS. The success of the identification of proteins depends partly on having good reference databases. For shotgun proteomics, the process typically relies on comparisons with databases such as UniProt and NCBI, except when using de novo sequencing, which does not rely on existing databases. A major issue with these publicly available databases is that the representation of proteins varies massively between taxonomic groups. For example, a search on Uniprot (10/2024) shows 456 keratin protein entries for sheep (Ovis aries), 499 for goat (Capra hircus), 250 for cattle (Bos taurus) while only 153 for Bactrian camel (Camelus bactrianus) or 132 for alpaca (Vicugna Pacos). This can lead to a database-dependent identification bias as better-covered species may receive more hits than worse-covered ones. Thus, sequences from species that are absent in the database will give a closest hit to the most closely related species present in the database, which are not, in fact, the correct species. For instance, Brandt et al.42 demonstrated how a fur sample from the Danish Viking Age was almost misidentified due to an incomplete database. The sample initially matched to alpine marmot (Marmota marmota), until genetic data from NCBI was translated and identified it as Sciuridae, which was not present in the protein database. Another problem may be the specificity of proteins. Most keratin regions are conserved across different species, while only a few ones contain specific sequence variations. Thus, many peptides will be shared between multiple species. In the coming years, we, therefore, hope and expect protein databases to grow in terms of species and protein coverage through de novo sequencing or translation of genetic data.

For keratins, a specific challenge is an inconsistency in the keratin gene and protein names98 and PMF markers, which is hindering comparison between studies, although this is currently being addressed in the field. We are hoping in the future to see a common nomenclature as the one that has been suggested for collagen99 and also a centralised open-access database for PMF keratin marker references, as the one that is currently being curated by the University of York for collagen. This database is constantly growing and covers all domesticated animals commonly used for skin, as well as many wild species. However, no such central reference database exists for PMF of keratin, which poses a significant barrier to the study of archaeological fibres. Each group has their own internal dataset with keratin markers compiled from individual papers, which need to be constantly updated38,90,100. In addition, at present, keratin PMF markers have only been established for a limited number of mammalian species, highlighting the need for additional characterisation and expansion. For shotgun proteomic approaches, a targeted database for keratin-dominated tissues with compiled criteria for which specific proteins and taxa to include and a unified gene list would be extremely useful for the community.

The feasibility of big data studies depends on time. At the moment, PMF identifications require the manual analysis of spectra and is thus very time-consuming. The development of automated data analysis tools could help increase the number of samples that can be analysed. Some examples include the Bacollite package101 and SpecieScan102. Bacollite, initially developed for identifying parchment, currently contains only four species: sheep, goat, cattle and deer, but more species are currently being added. Conversely, SpecieScan includes all taxa from the online ZooMS database. Both tools are currently limited to bone collagen materials.

Palaeoproteomic approaches have focused mostly on determining the animal species by looking at the presence or absence of proteins in a sample, rather than comparing protein abundance between samples as in quantitative proteomic approaches103. This provides a ‘snapshot in time’ and could be useful for determining the animal age (proteins that are expressed differently during life) and different wool qualities, as observed in modern samples63,64. Nevertheless, these proteins (like KAPs or minor proteins) are not always detected in archaeological samples due to degradation and thus, recovered proteins may not accurately reflect the sample’s original protein composition and abundance.

Sharing of data

Finally, the field of palaeoproteomics has not had any agreed way of publishing spectra and uploading raw data according to FAIR principles. In fact, many early articles did not present this opportunity at all. Much data has not been made available and there is no common repository for data. This is changing and we are seeing platforms such as Zenodo increasing its content of PMF data. Protocols are shared through protocols.io and LC-MS/MS data is being uploaded on a variety of platforms including proteomeXchange, enhancing data accessibility and collaboration within the scientific community. We expect that this is a trend that will continue in the future and we hope and encourage that data will be uploaded in public data repositories with standardised metadata and databases used for LC-MS/MS, which are currently often lacking.

How do proteomics and genomics relate to other methods, approaches and findings in archaeology?

Being able to characterise textiles, fur and leather together with evidence from animal bone remains, textile tools, palaeoenvironmental studies and textual or visual sources is obviously of great importance. Ideally, these analyses will be combined in order to altogether give the strongest possible witness of the biography of cloth. Therefore, it is important to insert clothomics in the multi-proxy analysis of objects and phenomena, to answer not only the what, but also the when, where and how. We are observing with pleasure that several such studies are coming out and that the field seems to be moving in the direction of using species IDs as a starting point for larger discussions.

A good example of a study which integrates several approaches was done by Oras et al.104 on two child mummies. Here, the analysis of the burial textiles was combined with osteological analysis of the bodies, AMS radiocarbon dating, CT scans and ancient DNA analysis, among others, allowing multiple aspects of the bodies, the embalming and the death rituals to come together.

Proteomic analyses on textiles and animal skins from ancient Sudan are tracking the use of raw resources through time and space and how garments impacted economic production and different subsistence modes like hunting, nomadic pastoralism or agriculture105. Combined with microscopy, ancient DNA or isotope analysis, it will shed light on past Sudanese ecology, revealing interactions between humans, animals and their natural environment, and like this better understand dress practices and the building of past identities.

The same approach is used in Brandt’s continuing studies of fur in Viking Age graves42. Ancient protein and DNA analyses will be combined with isotope analyses to aim at provenancing non-local animal furs and, thus also, Viking Age trading networks. Together with evidence from literary sources and iconography, these will expand our understanding of whether exclusive and traded furs were worn by only a limited part of the population as a marker of status.

Conclusions and future perspectives

As cross-disciplinary researchers on cloth, we experience an increasing number of requests for analyses of samples and help in understanding the potentials of biomolecular analysis. Within the last years in Denmark, we have also seen several projects of archaeological cloth integrating biomolecular and/or other natural scientific aspects receive funding. Past problematics with visual species identification in cloth research have been answered or enriched by proteomics especially, which has been improved as a tool by the inclusion of more species in databases.

The field, therefore, seems optimistic, and we are so, too. The technical challenges outlined above are being addressed and improved by the communities; for proteomics, we are seeing initiatives for automatic identification of MS1 spectra which will enable working with larger datasets and in turn reveal more diversity than small-scale studies where specific objects are selected for analysis106. We have also seen the optimisation of protein recovery and MS2 analysis to ensure the identification of as many peptides as possible. Both of these will help provide more resolution of protein data.

So far, palaeoproteomics has been applied to specific case studies but not on a large scale. We see a new trend with, for instance, the Citizen science initiative Next Generation Lab (Denmark), where high school students conduct a large-scale analysis of mediaeval and Renaissance leather fragments from Copenhagen, that would otherwise be discarded107. Democratisation of such large datasets along with metadata, will also allow for a better understanding of protein and DNA degradation in various contexts and the impact of different manufacturing processes.

Ancient DNA analyses of textiles and skins have laid low for some years, probably based on discomforting results as presented by Brandt36. Nevertheless, we are experiencing a growing interest in the development and refining of sheep’s wool—also from an ancient DNA point of view. With improved technologies and favourable samples, we may see a positive development in this area of clothomics as well.

As the field expands, the data will need to be further integrated and contextualised with zooarchaeological data, burial context, environmental and landscape records, technical analysis of textiles and skins, historical texts, and visual representations. Archaeologists will undoubtedly become a key part of these future developments. If we succeed in establishing true interdisciplinary collaborations, we believe that clothomics will continue to provide a better understanding of human-animal interactions and the use of animal products beyond subsistence. This will put cloth production back at the heart of archaeological discussions on past societies and economies and answer complex themes such as identity, technology, economy and trade. We hope this article takes a step towards this goal.