Abstract
Genetic monitoring is an essential tool for managing threatened or commercially valuable wildlife species. Ideally, genetic monitoring relies on straightforward protocols for genotyping and data handling, accommodates varying sample volumes, applies to non-invasive samples such as scat and hair, produces reliable and repeatable genotypes, and is cost-effective. Moreover, the marker panel itself should be tailored to specific species and/or population management information needs. We describe the development of a microarray-based 96-SNP panel specifically designed for non-invasive genetic monitoring of the Finnish wolf population. The panel was developed to meet four specific needs for this population: identification of wolf genotypes from the non-target species (fox, raccoon dog, golden jackal), sexing, individual identification, and kinship resolution. We demonstrate the utility of the SNP panel using a subset of the genetic monitoring data collected in 2022–2023, including an overview of the data analysis workflow. The development and successful application of this SNP panel, along with a comparison with similar studies in the field, provide valuable insights for researchers and conservationists aiming to implement similar strategies for other species.
Similar content being viewed by others
Introduction
Population monitoring is essential for wildlife management, involving continuous spatio-temporal efforts to assess population trends over time, providing a more accurate understanding of population dynamics than one-time assessments. In Finland, the grey wolf (Canis lupus) population is closely monitored by the Natural Resources Institute Finland (Luke). This monitoring relies on non-invasive genetic sampling - primarily through scats - to track the population’s status and structure. These efforts include identifying breeding units for the annual census estimates, which require data on the number of unique individuals, their sex, and kinship relationships1,2.
The Finnish wolf population is currently estimated at approximately 400 individuals, with its main distribution in western Finland3. The wolf is classified as endangered in Finland according to the IUCN criteria4 and is therefore protected under current EU legislation. However, the species is also considered a game species under Finnish legislation (Hunting Act 615/1993), and lethal removal of certain individuals is permitted with special licenses, usually issued due to wolf-livestock conflicts. Northern Finland is dedicated to reindeer husbandry, which is widely practiced by the indigenous Sámi people. In this area, wolf hunting is permitted to prevent damage to the practice. Consequently, wolf territories do not become established in the area. In regions located outside of the reindeer husbandry area, a maximum number of licenses are issued annually by the Ministry of Agriculture and Forestry, while no fixed maximum is set for the reindeer husbandry area.
Since 2014, Luke’s wolf monitoring program has included the collection of non-invasive DNA samples, primarily sourced through citizen science, with professional sampling conducted in areas that would otherwise remain unsampled. Sampling has focused on areas where wolves have been observed, with geographic coverage expanding over the years to encompass all potential territories. When scat is collected by volunteers, it is not uncommon for samples from non-target canid species such as dogs (Canis familiaris), red foxes (Vulpes vulpes), and raccoon dogs (Nyctereutes procyonoides) to be included. This necessitates the identification and exclusion of non-target species from the data. Additionally, the golden jackal (Canis aureus), a newcomer to the region, has become a focal point of interest for detection5,6,7.
Genetic monitoring has traditionally relied on microsatellite markers, which are commonly used in wildlife management due to their reliability, high information content, and straightforward genotyping process with rapid turnaround from sample to genotype8. While microsatellites require little technical expertise for data handling, standardizing results among labs is challenging due to variations in scoring procedures and instrument performance9,10. Microsatellites can be used to genotype non-invasive low-quality DNA samples, such as scat and hair, which are commonly used in genetic monitoring (e.g.,11,12,13). However, their usage necessitates a multi-tube approach, resulting in significant costs14. During the annual Finnish wolf monitoring in 2021–2022, over a thousand scat samples were collected. The high sample volume incurred high genotyping costs, highlighting the need for a more cost-efficient genotyping method that can handle large sample volumes without compromising data quality.
The development of cost-effective, high-quality Next Generation Sequencing (NGS) technologies offers a promising alternative to microsatellites in genetic monitoring8,10,13,15. A targeted, highly reduced set of informative SNP markers is selected from NGS or array data, aiming to develop protocols for repeatable genotyping. Some genotyping protocols rely directly on NGS, such as GT-seq16 or Genotyping-by-Sequencing17. However, establishing analysis pipelines requires bioinformatic expertise and, ideally, prior experience with laboratory protocols (8 and references therein). Different sample volumes can be managed through various approaches, such as allocating sequencing flow cells across multiple projects, employing commercial sequencing services, or utilizing a range of sequencers with varying output capacities. However, these approaches do not offer the same level of simplicity or accessibility as microsatellite genotyping.
An SNP genotyping option that closely resembles the microsatellite genotyping experience is the use of microfluidic arrays. Several platforms have been developed by biotechnology companies, including Standard Biotools (formerly Fluidigm), Agena Bioscience (MassARRAY), and Amplifluor8. For example, the laboratory workflow for the Standard Biotools platforms is comparable in complexity to microsatellite genotyping and accommodates high throughput as well as some flexibility in sample volumes through the use of 96 SNPs by 96 sample arrays. Genotype scoring is straightforward and standardized with freely available software. Microfluidic arrays typically exhibit greater sensitivity than microsatellite genotyping, often yielding more and higher quality genotypes even from low-quality non-invasive samples8,18,19,20. Several different SNP panels have now been developed and described for microfluidic array platforms for a range of species and purposes, including carnivores [e.g.18,]19,]21,22], ungulates [e.g.23], birds [e.g.28], and fish [e.g.25].
In this study, we developed a cost-effective, multifunctional SNP panel and genotyping workflow specifically tailored to the needs of the Finnish wolf population. This panel was designed to (1) allow efficient differentiation of wolf samples from foxes, golden jackals and raccoon dogs, and flag samples potentially originating from dogs, or wolf-dog hybrids (2) include sex-chromosomal markers for sex identification, and (3) select highly heterozygous markers for reliable individual identification and (4) kinship analysis. We demonstrate how this SNP panel and microfluidic array workflow can provide accurate, repeatable data on individual wolves, enabling more effective and economical monitoring of the Finnish wolf population. Finally, we highlight how the new workflow is currently integrated into Luke’s annual monitoring practices, enhancing Finland’s capacity for data-driven wolf management.
Materials and methods
Identifying markers from the Illumina CanineHD BeadChip dataset
We utilized an Illumina CanineHD Whole-Genome BeadChip microarray (Illumina, Inc. San Diego, California, USA) dataset, which includes 173,662 SNPs. The SNPs in the Illumina CanineHD Whole-Genome BeadChip microarray were selected to maximize variation between modern dog breeds. Wolves are genetically close to dogs so wolves genotype successfully with this array, however the SNPs are inherently biased toward dog-specific variation and less representative for non-dog canids26. The dataset used here comprised 37 Finnish, 23 Russian, 10 Estonian, and 24 Scandinavian wolf samples, along with six Estonian golden jackal samples and 274 dogs representing 55 dog breeds (~ 5 individuals per breed), which also included two wolf-dog breeds: Saarloos Wolfdog (n = 5) and Czechoslovakian Wolfdog (n = 5). The Finnish wolf samples were almost exclusively collected from Eastern Finland and the reindeer husbandry area because that is where most of the derogation licenses are issued. All wolf and golden jackal samples, except two Finnish samples (hair and saliva) and one Scandinavian sample (blood), were tissue samples. All samples from dogs were blood. All samples, except for 19 Scandinavian wolves and six golden jackals, were previously employed in a wolf-dog hybridization study22.
Individuals with a genotyping success rate below 0.85 (five Russian wolves) and SNPs with a call rate below 90% were excluded from the dataset. Linkage disequilibrium (LD) was calculated in a window of 50 SNPs, shifting in steps of 5 SNPs, pruning one SNP from pairs with high LD (r2 > 0.2) using PLINK 1.927,28.
The data from all wolves and golden jackals were used to identify SNPs that distinguish between the two species. FST values were calculated for each locus by treating wolves and jackals as separate populations, using the diveRsity package29 in R 3.6.3. The SNPs with the highest FST (FST = 1) between the species were extracted, and five candidates were taken for panel testing. Additionally, five markers were selected from a wolf-dog hybrid detection SNP panel developed by Harmoinen et al.22 to enhance the ability to identify dog samples within the wolf monitoring data.
To identify markers with high minor allele frequencies (MAF), only Finnish wolf data was used, filtering out SNPs deviating from Hardy-Weinberg equilibrium (HWE, P < 0.001) using PLINK. Subsequently, MAFs were calculated for the remaining SNPs. Out of the 990 SNPs with the highest possible MAF (0.5), 145 were randomly chosen for testing purposes. The samples genotyped with the Illumina CanineHD BeadChip were biased towards eastern Finland (see above) with low representation of the inbred packs in southwestern Finland (see below). Therefore, SNPs with the highest possible MAF were targeted, assuming that it would increase the chance of obtaining markers with MAFs in the range 0.3–0.5 (suitable for kin discrimination) during optimization on samples obtained from the whole of Finland.
The X-chromosomal SNPs were extracted and those located in the pseudoautosomal region (PAR) of the X chromosome were removed, as SNPs in the PAR region undergo recombination between the sex chromosomes30. Subsequently, females and males were separated into their respective datasets and the minor allele frequency was calculated for both datasets. For the female dataset, 15 SNPs that exhibited the highest minor allele frequencies (MAF), ranging from 0.125 to 0.333, were selected. Conversely, for the male dataset, the SNPs with the lowest MAF (< 0.005) were chosen, since males, possessing only one X chromosome, are expected to show homozygosity at these loci, whereas females, with two copies of the X chromosome, are more likely to be heterozygous.
Test of markers used in the Scandinavian wolf monitoring SNP panel
Forty-eight SNPs from the Scandinavian wolf monitoring panel (SLU, NINA, unpublished) with the highest MAF were selected for testing. Additionally, for male sex determination, a Y-chromosomal marker from the same panel was included, which has already been demonstrated to work well for this purpose in the annual Scandinavian wolf monitoring (pers. com. Mikael Åkesson).
Primer design
Primer design was performed by Standard Biotools, using their D3 assay design tool (https://d3.standardbio.com) for the target SNPs and 200 bp of flanking sequence. For each SNP, one specific target amplification primer (STA), two allele-specific primers (ASPs), and one reverse locus-specific primer (LSP) were designed. In total, primers were ordered for 219 SNPs, including five for discriminating wolves and dogs, five for discriminating wolves and golden jackals, 193 high-MAF markers (including 48 from the Scandinavian SNP panel), 15 X-chromosomal markers, and one Y chromosomal marker.
Microfluidic genotyping and final selection of markers
Several classes of samples were selected to test the diverse marker categories. For species discrimination, tissue samples from various species were used: two golden jackals, five red foxes, three raccoon dogs, two brown bears (Ursus arctos), two Eurasian lynxes (Lynx lynx), and two moose (Alces alces). For the initial sex-chromosomal and high-MAF marker testing, 79 wolf tissue samples were used. All markers from all categories were further tested with 15 scat, 12 urine, 18 saliva, and two blood samples from wolves. The different types of sample material from wolves were used to ensure amplification success across a range of sample qualities. The different sample materials contain varying amounts and purity of DNA, which can result in marker dropouts19. Therefore, all scat samples, often with the lowest or most degraded DNA content, were genotyped in duplicate31. A positive control (tissue sample with a known genotype) and a negative control (no template in the reaction) were included in each run.
The final panel was genotyped on a set of 36 unrelated individuals selected from across Finland to test for deviations to the Hardy-Weinberg equilibrium and linkage disequilibrium and to calculate final population-representative minimum allele frequencies (MAF) and probability of identity (PID).
For genotyping, 96.96 Dynamic Array™ IFC plates were utilized (Integrated Fluidic Circuit plates, Standard BioTools Inc, formerly Fluidigm) with a pre-amplification protocol developed for Scandinavian wolves (SLU, NINA unpublished), as follows: 5 µl reactions with 2.6 µl of Qiagen multiplex master mix and 2 µl DNA (undiluted), using 28 cycles for pre-amplification (STA) and diluting 1 µl of the PCR product in 8 µl of ddH2O. The IFCs were amplified on the (Standard BioTools Inc) Juno platform, and the allele-specific fluorescence was measured by an endpoint reader (Biomark EP1 reader). The raw genotype data was analyzed with the SNP Genotyping Analysis Software v. 4.5.1 (https://www.standardbio.com/products/software), and scores were manually validated from the automatically scored scatter plots.
Markers were initially scored based on readability, and only well-amplifying markers with clear genotype clusters were selected. Assay reference libraries were constructed from the successful runs and incorporated in the subsequent scoring to facilitate genotype calling for the selected markers. Genotypes for species and sex markers were verified to ensure they conformed to the expected frequencies. MAFs were calculated, and significant deviations from LD (R and R2 p < 0.05) and HWE (p < 0.05) were tested using PLINK.
The marker set underwent refinement through several iterations of genotyping on different combinations of samples, following the analyses described above, until a final panel was selected. The final panel comprised two jackal-wolf markers, three dog-wolf markers, nine X-chromosomal markers, one Y-chromosomal marker, and 81 high-MAF markers (including 21 from the Scandinavian panel) (Supplementary Tables file: Tables 1 and 2).
Some test runs were conducted following the manufacturer’s manual with Juno 96.96 Genotyping IFC plates. In this setup, the pre-PCR step (STA) is incorporated on the same plate with PCR involving fluorescently labelled allele-specific markers . However, it was observed that the DNA concentrations in the scat samples were too low or the DNA was too degraded for this type of IFC. Consequently, the genotyping rate appeared lower (60%) than with the 96.96 Dynamic Arrays. Therefore, further testing was discontinued, and optimization efforts continued with the 96.96 Dynamic Arrays.
Performance of the final panel
Error rates
Wolf tissue samples with known Illumina CanineHD BeadChip genotypes (N = 15) were genotyped using the developed SNP panel, and the genotypes were compared for error rate estimation purposes. All markers, except the Y marker not included in the Illumina CanineHD BeadChip, were utilized (95 markers). To estimate genotyping error rates, the genotype based on the Illumina chip was assumed to be accurate. Only loci with non-missing Illumina chip genotypes were considered for the comparison. An allelic mismatch was documented in cases where the Illumina genotype was heterozygous and the designed SNP panel showed homozygotes, or vice versa. Mismatches were also noted if the genotypes were opposite homozygotes. Missing genotypes were recorded based on missing genotypes with the designed SNP panel, regardless of Illumina genotypes.
Identification of wolf genotypes
To test the data and develop the analysis for accurately identifying wolf genotypes from non-target species, the samples from several different species genotyped above were analyzed alongside the Illumina CanineHD BeadChip data of 37 Finnish wolves, 6 golden jackals, and 274 dogs. This analysis was performed using the snmf function and Principal Component Analysis (PCA) within the R package LEA32. Only autosomal markers of the panel (86 SNPs) were included.
The sNMF analysis was conducted with K (the number of populations) 1–10 with 10 replicates per K. The most likely number of populations (K) was determined by plotting the cross-entropy values for each K and examining both the greatest drop in entropy between subsequent K-values, and the K with the lowest entropy value. Individual ancestry coefficients (Q) were extracted from the best run with the most likely K and plotted.
Results from sNMF analysis were used to construct reference groups for the two RUBIAS assignment analyses33. First, dogs and wolves with sNMF Q-values above 0.85 (N = 299) were pooled into one reference group and all jackal, raccoon dog and fox genotypes were pooled into a second reference group. The Q-value cutoff was based on manual inspection of the Q-value distributions34, aiming to maximize assignment certainty while retaining within-species variation (Fig. 1). Individuals with Q-value below this threshold were excluded. The second assignment approach included extracting reference populations for only wolves and dogs from the sNMF analysis above (Q-values above 0.85), followed by a RUBIAS analysis as outlined above. The performance of these reference populations was assessed by self-assignment and a leave-one-out analysis in RUBIAS with default settings.
(A) The sNMF analysis for different species with the most likely number of clusters K = 3 using the 86 autosomal markers from the SNP panel. Species are marked with initials: C for raccoon dog, R for red fox, G for golden jackal, W for wolf, and D for dog. (B) PCA analysis of the same data.
Validation of sex markers
To validate the sex markers, Illumina genotyped samples used in pedigree analyses (N = 67) and a subset of samples from the Finnish wolf population where sex was known (N = 39) were employed (in total N = 87). Genetic sexing was conducted using the --check-sex function in PLINK.
HWE, MAF, linkage disequilibrium tests and probability of identity
Loci deviating from HWE and MAF were calculated using PLINK on unrelated individuals collected across Finland. The probability of identity (PID) and probability of identity between siblings (PIDsib) were computed using GenAlEx 6.535, utilizing the same individuals. All SNPs were included except the Y marker and one marker (TIGRP2P103821_rs9012738) with excessive missing data. A threshold of PID < 0.0001 was selected as a sufficiently low PID for wild populations19,36.
Pedigree analyses of inbred wolf family line
Inbred wolf packs comprising 67 individuals from southwestern Finland were genotyped to evaluate the panel’s resolution in sorting out the pedigrees of these packs, whose histories were known based on the field data and further supported by microsatellite data. Pedigree-based inbreeding coefficients were calculated, and pedigrees were drawn using Pedigraph v. 2.437.
Eighty-one SNPs from the developed panel (excluding sex markers and species-specific markers) and 17 microsatellites from Mäntyniemi et al.38 were used in separate COLONY 2.0.6.639 analyses. Results were compared for samples overlapping between the datasets (see Supplementary Material, Fig. 2). All individuals were considered as potential offspring, all females as potential mothers, and all males as potential fathers. Individuals with unknown sex were included as potential mothers and fathers. The analyses were performed three times for both marker sets (replicates) assuming 0.01 error rates, with a 0.9 probability that parents are in the data. The analysis allowed polygamy and inbreeding and employed the full likelihood model with very long runs and high precision. Offspring-parent relationships with a completely missing genotype in either marker type (due to limited sample availability or quality) were removed from the comparison.
A principal component analysis (PCA) for 233 samples representing a subset of wolf monitoring data from 2022–2023, analyzed alongside reference samples of wolves and dogs, hybrids, golden jackals, foxes and raccoon dogs.
Using the SNP panel in regular wolf monitoring
The final SNP panel was employed during the wolf monitoring season 2022–2023. A subset of 282 samples from this monitoring season was utilized to illustrate the performance of the SNP panel in the wolf monitoring program in Finland. The sample subset comprised various sample types: 188 scat, 74 urine, 16 tissue, and 4 hair samples, which originated from wolves, dogs, dog-wolf hybrids, and foxes. The complete set of analyses is described in Supplementary Materials file Sect. 1.
Results
Panel development
Out of the 219 SNP markers that were tested, a subset was discarded for the following reasons: 21 markers failed in amplification or had a low amplification rate in the tested primer combination, four markers showed amplification in the non-template control more than once, 32 markers were rejected due to poor readability of genotype clusters (e.g. clusters were too close to each other, weirdly placed or one cluster was missing), 23 high MAF markers exhibited a lower than acceptable MAF when tested with newly genotyped samples, 20 markers deviated from HWE, and five markers were in LD with another marker. Some markers were excluded due to more than one category of failure. The markers included in the final panel are listed in Supplementary Tables file: Table 1, along with assay IDs and primer sequences.
Error rate
Genotypes obtained with the developed SNP panel were consistent with the known genotypes. No allelic differences were detected out of the 1425 pairwise comparisons (15 samples x 95 markers). The average amplification success rate was 97% across both samples and loci, primarily affected by one X-chromosomal marker (BICF2P1098274) failing completely. It is important to note that this result applied only to tissue samples, as we did not have Illumina-generated genotypes from other types of sample material.
Species identification
Species within the same taxonomic family as wolves—such as dog (which amplified equally well as wolves, data not shown), golden jackal, red fox, and raccoon dog—were successfully amplified with the SNP panel, and their genotypes passed the data filtering process (Table 2). However, more distantly related species—brown bear, Eurasian lynx, and moose—exhibited significantly lower amplification success rates and did not pass the analysis stage.
The SNP panel demonstrated high resolution in distinguishing between species that passed the data filtering (Fig. 1). The largest drop in cross-entropy from the between sequential K-values occurred between K = 2 and K = 3 in the sNMF analysis, suggesting K = 3 to be most likely, whereas the lowest cross-entropy value was found for K = 4 (Supplementary Material file: Fig. 1). Individual ancestry coefficients (Q-values) were extracted from the best run for K = 3 and K = 4 and examined. For K = 3, raccoon dog, red fox, and golden jackal genotypes clustered together, and all had Q-values above 0.9 (Fig. 1A), suggesting very robust clustering. All dog genotypes, including wolf-dog breeds, formed another distinct cluster with Q-values above 0.66, with only 4% of Q-values below 0.9 (Supplementary Tables file: Table 3). Similarly, all wolf genotypes were assigned to their own cluster, with Q-values above 0.77, whereas 8% had Q-values below 0.9 (Supplementary Tables file: Table 3). For K = 4, results for golden jackals, foxes, raccoon dogs and wolves were highly similar, however, the dogs were split into two clusters, such that 71.2% of dogs belonged to dog cluster 1 (using a Q value cut-off of 0.85 for comparison), 13.1% belonged to dog cluster 2, and 15.7% dogs were split between these two clusters (Supplementary Tables file: Table 3). Therefore, K = 3 was considered the best fit to the data. The results from PCA analyses (Fig. 1B) were consistent with the sNMF results for K = 3. Red fox, raccoon dog, and golden jackal genotypes were grouped together in the first two components, clearly separated from wolf and dog genotype clusters. Although wolf and dog genotype clusters were closer together, they remained distinctive groups. The RUBIAS reference population performance tests showed 100% correct assignment to the reference populations, suggesting robust assignment.
Validation of sex markers
Sexing was accurate in 80% of samples with known sex. Males were successfully sexed using both types of sex-chromosomal markers (Supplementary Tables file: Table 4). The Y marker amplified successfully for all 50 male samples. However, the nine X-chromosomal markers showed ambiguous sex assignment for one sample, while the remaining samples were correctly assigned as males. Females presented more challenges in sex determination, as 35% (13/37) of the females were completely homozygous for the nine X-chromosomal markers and were assigned as males. Notably, 11 of these 13 homozygous females belonged to the inbred Western Finnish wolf families used in pedigree analyses. Additionally, three samples showed ambiguous sex. The Y-marker did not amplify for most of the females (34/37), but there were three samples where the Y-marker was amplified, likely due to contamination. Sexing parameters were derived for the regular wolf monitoring (see below), by adjusting the sexing parameters until there were no mis-called sexes between the SNP data and the known physiological sex. In cases where sex could not be reliably determined, sex was classified as uncertain.
MAF, HWE, LD and PID in the final panel
The average MAF from all high-MAF markers was 0.424, and the range was 0.292–0.500. None of the high-MAF loci were found to be linked or deviate from HWE. Based on the PID calculations, ten markers had enough power to distinguish unique individuals drawn randomly from a population, while 19 markers were required to identify unique individuals among siblings (Supplementary Tables file: Table 5).
Pedigree analyses
Inbreeding coefficients calculated from the pedigree of inbred Western Finnish wolf packs varied from 0 up to 0.2 (Supplementary Material file: Fig. 2). The elevated inbreeding coefficient observed in this pedigree can be traced back to the pairing of a grandfather with his granddaughter, which led to a significant number of offspring (38 known pups, not all included here) between 2015 and 2021. In four subsequent generations, some of their descendants mated within the family line, contributing to the highest inbreeding coefficients in the current (2022–2023) wolf packs.
We were able to reconstruct the known pedigree (Supplementary Materials file: Fig. 2) almost completely using either SNP or microsatellite data. Both marker types encountered issues when one of the parents had a missing genotype, resulting in unknown parentage or incorrect assignment of parent-offspring relationship. Out of the 99 parent-offspring relationships used in the comparison, all were assigned correctly, except one individual (Ind ID: W_494). With SNP data, this individual had unknown parents while microsatellite data correctly assigned it to its parents. This discrepancy was most likely caused by a mistake in the genotyping process, such as a wrongly assigned sample.
Assignment probabilities to the relationship categories varied between microsatellites and SNPs (Supplementary Tables file: Table 6). When considering only relationships with assignment probabilities exceeding 0.8 in all three replicates, 90% of the SNP assignments were accepted, whereas only 52% of the microsatellite assignments were accepted.
Regular Wolf monitoring workflow and results
Out of the 282 samples utilized in the wolf monitoring workflow, 83% met the analysis threshold: all the tissue samples (16/16), 88% of the scat samples (166/188), 66% of the urine samples (49/74), and half of the hair samples (2/4) (Supplementary Tables file: Table 7). Sex assignment was successful for 83% of the samples. Among the 166 scat samples genotyped in duplicate and passing the analysis thresholds, only eight (5%) exhibited genotype mismatch errors, resulting in an average error rate of 0.1. Considering that there are 96 markers in the panel, this equates to an error rate of 0.001 per marker. Mismatches were re-coded as missing data.
In the PCA (Fig. 2) and RUBIAS (Supplementary Tables file: Table 7) assignment, ten out of the 233 samples that passed quality filtering were identified as originating from non-target canid species—namely, the red fox, raccoon dog, and golden jackal. A subsequent RUBIAS assignment involving only dogs and wolves led to the misassignment rendering the results unreliable. One of the seven dogs (see below) was correctly classified as dog, and 14 wolves were correctly assigned as wolves. 74 wolf samples were incorrectly assigned as dogs, and the remaining 129 wolf samples and six dogs had the highest probability to be assigned to neither dog nor wolf.
However, a PCA analysis focusing solely on wolves and dogs indicated the presence of seven dogs in the sample set (Fig. 3). This finding was subsequently confirmed with the wolf-dog hybrid panel22 in Heikkinen et al.40. Notably, the PCA could not distinguish hybrids from wolves. Therefore, all putative wolves, dogs and wolf-dog hybrids identified with the monitoring SNP panel are also genotyped in parallel with the 93-SNP wolf-dog hybrid panel for accurate species identification (see Supplementary Material File: supplementary protocol ).
A principal component analysis (PCA) for 233 samples representing a subset of wolf monitoring data from 2022–2023, analyzed alongside reference samples of wolves and dogs and hybrids.
The remaining 223 samples (including dogs and putative hybrids, Supplementary Materials file supplementary protocol) corresponded to 140 unique individuals. An allele-match analysis established a mismatch threshold of 16 SNPs, indicating that 79 SNPs were necessary for robust individual identification (Supplementary Material File: Fig. 3). The highest number of scat samples collected from the same individual was nine (Ind ID: 2023_PK_15085), with eight yielding a 100% genotypic match and one showing a 92% match due to missing data, demonstrating the high repeatability of our scat genotyping method for individual identification (Supplementary Tables file: Table 7). All unique individuals encountered during a monitoring season were subject to kin analysis, and the results were compared to kinship data from previous years (breeding pairs, known sibships) and field observation records. In particular, unique individuals with single occurrences in the data, samples close to the analysis threshold, samples with undetermined or opposite sex called from the R-script sexing, and urine samples were scrutinized to rule out cross-contamination, and samples with suspected cross-contamination were discarded.
Discussion
This study presents the development of a custom-designed, multifunction SNP panel specifically tailored for annual genetic monitoring of wolves in Finland. It outlines our strategy to identify wolf genotypes from that of other species, sex determination, individual identification, and kinship analysis. Furthermore, it demonstrates the application of this panel in the ongoing genetic monitoring of the Finnish wolf population from 2022 to 2023 and beyond.
Our SNP panel effectively distinguished genetically distant species, such as lynx and moose, from wolves. This was evidenced by failed SNP amplification, leading to samples from these species not meeting the analysis thresholds. Similar results were observed with the 93-SNP wolf-dog hybrid SNP panel22, further confirming that prey species commonly consumed by wolves, such as moose, are sufficiently distantly related to canids preventing the risk of amplifying prey species from scat samples.
In contrast, samples from more closely related species such as foxes, raccoon dogs, and golden jackals passed the genotyping threshold and were clustered together in the PCA, and accurately identified in the assignment analysis. This indicates the panel’s ability to accurately differentiate between these species, despite their genetic proximity to wolves.
Wolves occasionally prey on other mammalian predators, particularly under conditions of starvation. In Estonia, raccoon dogs were estimated to constitute 10% of the wolf diet41, and wolves have also been observed preying on domestic dogs, albeit rarely42. Any potential cross-species contamination would be excluded using the same analysis protocols designed to prevent wolf-to-wolf contamination (see Supplementary Materials file Sect. 1). The panel’s ability to separate closely related species was achieved by incorporating specific autosomal markers selected to differentiate between species (dog vs. wolf, and golden jackal vs. wolf). This approach parallels previous studies in wildlife genetics, such as that of Buchalski et al.43, who successfully discriminated between mountain lions and bobcats using two diagnostic SNPs located in the mitochondrion. Similarly, Blåhed et al.44, differentiated closely related cervids using five mitochondrial SNPs, while Wehrenberg et al.45 discriminated among sympatric bovine species with five autosomal diagnostic markers.
The principal component analyses performed on all canids, and that on only wolves and dogs, were also successful in differentiating dogs from wolves, however the assignment analysis using only dogs and wolves failed. This was likely because the major allele frequency differences in the dataset were between foxes, raccoon dogs and golden jackals as one group, and wolves and dogs as a second group. After removing the more distant canids there was less power for assignment of the genetically close dogs and wolves. Moreover, ascertainment bias stemming from the use of the Illumina CanineHD BeadChip may have further compromised the separation of dogs and wolves with both sNMF and RUBIAS. The Illumina CanineHD BeadChip was developed to maximize polymorphism found among modern dog breeds. Funk and Kidd26 showed that although common variants shared across canines (including wolves, modern breed dogs, and village dogs) represent approximately 22% of the total variation among canine genomes, they account for 65% of the markers on the Illumina chip. Hence, the dogs included in the reference data have inflated genetic variation in the dataset compared to the wolves. This was evident in the sNMF analysis which showed that the wolf genetic cluster and the combined fox, raccoon dog and jackal genetic cluster were stable across K = 3 and K = 4, however, the dogs split into two more diffuse genetic clusters for K = 4, suggesting further substructure. Determining the most accurate K from assignment analyses is known to be challenging when there is unobserved ancient structure in the dataset46. This most likely also lead to the observed misassignment of wolves to the dog breeds using RUBIAS. It is possible that the RUBIAS analysis could be improved by defining two reference population for the dogs. This was not tested, since it does not solve the fundamental problems of ascertainment bias due to the marker selection, and because the PCA analysis nevertheless separate dogs from wolves reliably.
While the panel can separate pure dogs from wolves using PCA, it is important to note that it is intended to be used alongside a dedicated 93-SNP wolf-dog hybrid panel22. All new individuals identified with the monitoring SNP panel as potential wolves, dogs or wolf-dog hybrids are further genotyped with this dedicated SNP panel. By employing the 93-SNP wolf-dog hybrid panel alongside the monitoring panel, we enhance the resolution between wolves, dogs and different hybrid generations, enabling the detection of wolf backcrosses up to the third generation (fourth-generation hybrids)22. This comprehensive approach ensures accurate identification of dogs and hybrids and supports effective population management strategies for mitigating the risks posed by wolf-dog hybridization for the natural wolf population. Accurate hybrid identification is important in Finland since domestic wolf-dog hybrids are legislated to be nationally invasive alien species if hybridization has occurred within the last four generations (Act on managing the risks caused by alien species 704/2019; https://vieraslajit.fi/lajit/MX.5014936).
A crucial aspect of annual wolf monitoring is the identification of breeding units, i.e., reproducing family packs, necessitating accurate sex assignment to identify breeding females and males within a pack. Our results showed that sexing relied primarily on the Y-chromosomal marker, which exhibited 80% accuracy in the samples with known sex. The lower accuracy in determining the sex of females using X chromosomal markers was particularly severe in samples from inbred, homozygous females from Western Finland. Similarly, the SNP panel had around 80% success rate with sexing the field-collected samples. It is important to note that in Finnish wolf monitoring fieldwork, multiple samples are often collected from the same individual over the course of a monitoring year, and sometimes across multiple monitoring years. This practice increases the likelihood of accurately determining the sex of individuals, as it allows for cross-referencing and validation of sex assignment results across multiple samples and time points. As a result, the sex determination rate from field data tends to rise slightly when all samples from a monitoring season are analyzed, especially when results from multiple years are combined40. Sexing with the SNP panel increased the number of successfully sexed individuals by about 10% from the PCR and agarose electrophoresis method used previously (Finnish wolf monitoring reports: https://www.luke.fi/fi/luonnonvaratieto/tiedetta-ja-tietoa/susi/suden-kantaarviot-ja-seuranta#susien-kantaarviot). Removing one of the X-chromosome markers with a lower MAF value and instead adding another Y-chromosome marker to our current panel could increase the confidence in the sexing based on the Y-chromosome without too much impact on the sexing based on X-chromosomal markers.
Similar SNP panels developed for genetic monitoring have employed only Y-chromosomal markers for sex determination21,44,47. However, recent panels appear to target a combination of Y-chromosomal and X-chromosomal markers for positive confirmation of both sexes, not just males (e.g20,43,45,48. In our panel, the X-chromosomal markers serve an additional purpose by indicating cross-contamination, especially in urine samples collected from snow. When a territory-holding female wolf urinates, it is common for the territory-holding male to subsequently urinate over the same spot. These mixed samples often amplify well with the SNP panel, but the R-script employed for data analysis detects the conflicting sex indicators, thereby flagging these instances for further investigation. Additionally, since the included X-chromosomal markers exhibit variability, they contribute to individual identification, unlike the Y-chromosomal marker.
To effectively identify breeding units, we focused on high heterozygosity markers capable of distinguishing unique individuals and delineating kinships in relatedness analyses. Our panel included 81 high MAF SNPs for this purpose, utilizing all markers except the Y-marker for individual identification. The number of markers needed to reach PID and PIDsibs > 0.0001 was calculated to be 10 and 19, respectively, indicating the minimum number of markers required for assigning unique IDs. These numbers are similar to those reported by19,43,44,45,48,49.
However, in our analysis of the field data, with current methodology and thresholds, we have found that we can confidently resolve unique IDs with a minimum of 79 markers. This represents a fourfold difference compared to the number of markers calculated using PIDsib analysis for Finnish wolves. Ekblom et al.20 similarly reported a minimum number of ~ 75 markers for individual ID in wolverines, while Thavornkanlapachai et al.48 achieved successful individual identification in only 60% of their successfully genotyped samples using 40 loci, attributing this to stringent filtering of both loci and samples to ensure correct individual assignment. For the Finnish wolf population, factors such as inbreeding over several generations in certain areas50 and random missing data in scat samples likely contribute to this discrepancy. Therefore, if reliable individual identification is crucial, especially when dealing with poor-quality non-invasive samples or populations with potential inbreeding, we recommend multiplying PIDsib values by a factor of three or four to obtain a more realistic number of markers needed for individual identification.
The PID and PIDsib calculations model ideal populations and Waits et al.36 found that deviations from the Hardy-Weinberg Equilibrium in natural populations leads to an underestimation of the true number of microsatellite markers required for individual identification. They made the recommendation to multiply PID by a factor of three to arrive at a more realistic number. Our results suggest that the correction factor is greater for SNP markers, which may be due to the lower numbers of alleles that SNP data generate compared to microsatellites. Moreover, the allele-matching approach we employed for individual identification is based on pairwise genetic similarities and subsequent clustering, with no underlying genetic model51. The approach also accounts for missing data and allele errors when individual IDs are estimated, which may increase the numbers of SNPs needed, compared to the PID and PIDsib results.
We tested the SNP panel’s capability for kinship discrimination using one of the most inbred multigenerational wolf families in Southwestern Finland. Despite inbreeding coefficients reaching up to 0.2, we constructed an almost complete pedigree using the SNP panel.Microsatellite data yielded similar results in resolving the pedigree, although there was a discrepancy in assignment probabilities, with SNPs demonstrating better assignment power. Furthermore, only SNPs were used to genotype the latest generation with the highest inbreeding coefficients, hence it is uncertain how accurately microsatellites would have resolved kinships in this last cohort. Previous studies, such as Anderson and Garza52, have shown through simulations that 60–100 SNPs with high (0.3–0.5) MAF are generally adequate for accurate parentage assignment, and SNPs offer the advantage of lower error rates compared to microsatellites, contributing to improved assignment accuracy. Looking ahead, SNPs could play a pivotal role in constructing a reliable pedigree for the Finnish wolf population. They may be utilized in combination with microsatellite data as suggested by Giangregorio et al.53 to increase the analytical power and address the complexities associated with resolving kinships in populations.
Testing SNPs from the Scandinavian wolf monitoring panel revealed that the primers and protocols are indeed easily transferable between labs, but it also highlighted that the markers’ properties might not be transferable between populations (Table 18). While we initially included 48 markers with high MAFs in the Scandinavian wolf population, only 21 were retained in our final SNP panel. Similarly, Giangregorio et al.53 found that 45 out of 85 autosomal SNPs developed for Scandinavian bears47 were suitable for kinship analyses in Italian Alpine bears, while only 15 were suitable for the Apennine bears. With space for only 96 markers, these types of panels can become highly biased when markers are selected from limited data for a small focal population and selected because of their particular properties in the focal population (e.g. MAF above 0.3), both of which introduce ascertainment bias54. Furthermore, optimizing a 96-plex primer reaction is a complex task, and some primer combinations are less impacted by the low sample quality than others when using non-invasive samples. Hence it is not guaranteed that SNPs that perform well in one panel can be seamlessly integrated into another panel.
Microarray genotyping is often more sensitive than microsatellite genotyping on non-invasive samples such as scat and hair19,20. When applied to field samples, we found that the SNP panel had a pass rate of about 80%, only slightly higher than the microsatellite pass rate of 77–78%55. However, for scat samples, the 17 microsatellites used were amplified in five different primer multiplexes, genotyped in two pool-plexes, and scat samples were always repeated from three to six times50 (pers. comm. Meri Lindqvist). In comparison, SNP genotyping of samples required only a single 96-plex primer pool, and scat samples were genotyped in duplicate, resulting in greater sensitivity. Our cost per sample amounted to EUR 70/ sample with StandardBiotools (in 2024) using 96.96 genotyping arrays, compared to EUR 115/sample for microsatellites (in 2020); both costs include DNA extraction, but no VAT. Subsequent genotyping using the 93-SNP wolf-dog hybrid panel amounts to another EUR 30/ sample. High quality DNA samples are selected based on the results of the monitoring panel; consequently, duplicated genotyping with the 93-SNP wolf-dog hybrid panel is rarely required. Initial and/or additional costs include buying a genotyping system or access to it via a service lab, and initial optimization costs (see also13). Overall, this translates to significantly lower costs and less effort in generating equal or higher-quality genotype data compared to microsatellites.
Similar to other SNP panels relying on high-MAF markers for individual identification and kinships, our panel is unsuitable for general population genetic analyses43. Although, for example, genetic diversity and inbreeding estimates can be calculated and tracked through time, genome-wide patterns of genetic variation are not reflected in these estimates due to the low number of markers employed and the bias towards selecting markers with high heterozygosity.
In conclusion, this study demonstrates our approach to addressing the genetic monitoring requirements for the wolf population in Finland, ranging from the SNP panel design to monitoring implementation. Additionally, the studies referenced in the discussion illustrate how other researchers have addressed their genetic monitoring needs using similar or related approaches across various taxa. While many of the cited studies, and ours, utilize microfluidic arrays for genotyping, the findings have broad applicability to anyone designing SNP panels for genetic monitoring purposes, regardless of the technology employed.
Data availability
Data are available in the Dryad repository: https://doi.org/10.5061/dryad.80gb5mm1t.
References
Kojola, I., Heikkinen, S. & Holmala, K. Balancing costs and confidence: Volunteer-provided point observations GPS telemetry and the genetic monitoring of finland’s wolves. Mammal Res. 63, 415–423. https://doi.org/10.1007/s13364-018-0371-3 (2018).
Mäntyniemi, S., Helle, I. & Kojola, I. Assessment of the residential Finnish Wolf population combines DNA captures citizen observations and mortality data using a bayesian state-space model. Eur. J. Wildl. Res. 68, 70. https://doi.org/10.1007/s10344-022-01615-5 (2022).
Valtonen, M. et al. Susikanta Suomessa Maaliskuussa 2024. Luonnonvara- Ja Biotalouden Tutkimus 54/2024 (Luonnonvarakeskus, 2024). http://urn.fi/URN:ISBN:978-952-380-934-5.
Liukko, U. M. et al. Nisäkkäät. In The 2019 Red List of Finnish Species Hyvärinen, E., Juslén, A., Kemppainen, E., Uddström, A. & Liukko U.-M (eds). (Ministry of the Environment & Finnish Environment Institute, Helsinki 2019).
Bogdanowicz, W. et al. Species on the move: A genetic story of three golden jackals at the expansion front. Mamm. Biol. 105, 37–48. https://doi.org/10.1007/s42991-024-00452-0 (2024).
Kojola, I., Henttonen, H., Heikkinen, S. & Ranc, N. Golden Jackal expansion in northernmost europe: Records in Finland. Mammalian Biol. 104, 101–105. https://doi.org/10.1007/s42991-023-00382-3 (2024).
Viranta, S., Pihlström, H., Kvist, L., Harmoinen, J. & Aspi, J. The day of the Jackal was over: The first golden Jackal (Canis aureus) collected in Finland. Canid Biol. Conserv. 27, 7–14 (2024). http://www.canids.org/CBC/27/Golden_jackal_Finland.pdf
Carroll, E. L. et al. Genetic and genomic monitoring with minimally invasive sampling methods. Evol. Appl. 11, 1094–1119. https://doi.org/10.1111/eva.12600 (2018).
Baric, S., Monschein, S., Hofer, M., Grill, D. & Via, J. D. Comparability of genotyping data obtained by different procedures—An inter-laboratory survey. J. Hortic. Sci. Biotechnol. 83, 183–190. https://doi.org/10.1080/14620316.2008.11512368 (2008).
Hindrikson, M. et al. Wolf population genetics in europe: A systematic review meta-analysis and suggestions for conservation and management. Biol. Rev. Camb. Philos. Soc. 92, 1601–1629. https://doi.org/10.1111/brv.12298 (2017).
Åkesson, M. et al. Genetic rescue in a severely inbred Wolf population. Mol. Ecol. 25, 4745–4756. https://doi.org/10.1111/mec.13797 (2016).
Norman, A. J. & Spong, G. Single nucleotide polymorphism-based dispersal estimates using noninvasive sampling. Ecol. Evol. 5, 3056–3065. https://doi.org/10.1002/ece3.1588 (2015).
von Thaden, A. et al. Applying genomic data in wildlife monitoring: Development guidelines for genotyping degraded samples with reduced single nucleotide polymorphism panels. Mol. Ecol. Resour. 20, 662–680. https://doi.org/10.1111/1755-0998.13136 (2020).
Navidi, W., Arnheim, N. & Waterman, M. S. A multiple-tubes approach for accurate genotyping of very small DNA samples by using PCR—Statistical considerations. Am. J. Hum. Genet. 50, 347–359 (1992). PMID: 1734715; PMCID: PMC1682471.
de Groot, G. A. et al. Decades of population genetic research reveal the need for harmonization of molecular markers. The grey Wolf canis lupus as a case study. Mammal Rev. 46, 44–59. https://doi.org/10.1111/mam.12052 (2016).
Campbell, N. R., Harmon, S. A. & Narum, S. R. Genotyping-in-Thousands by sequencing (GT-seq): A cost effective SNP genotyping method based on custom amplicon sequencing. Mol. Ecol. Resour. 15, 855–867. https://doi.org/10.1111/1755-0998.12357 (2015).
Elshire, R. J. et al. A robust simple genotyping-by-sequencing (GBS) approach for high diversity species. PLoS ONE. 6, e19379. https://doi.org/10.1371/journal.pone.0019379 (2011).
Kraus, R. H. S. et al. A single-nucleotide polymorphism-based approach for rapid and cost-effective genetic Wolf monitoring in Europe based on noninvasively collected samples. Mol. Ecol. Resour. 15, 295–305. https://doi.org/10.1111/1755-0998.12307 (2015).
von Thaden, A. et al. Assessing SNP genotyping of noninvasively collected wildlife samples using microfluidic arrays. Sci. Rep. 7, 10768. https://doi.org/10.1038/s41598-017-10647-w (2017).
Ekblom, R. et al. Sample identification and pedigree reconstruction in Wolverine (Gulo gulo) using SNP genotyping of non-invasive samples. Conserv. Genet. Resour. 13, 261–274. https://doi.org/10.1007/s12686-021-01208-5 (2021).
Nussberger, B., Wandeler, P., Weber, D. & Keller, L. F. Monitoring introgression in European wildcats in the Swiss Jura. Conserv. Genet. 15, 1219–1230. https://doi.org/10.1007/s10592-014-0613-0 (2014).
Harmoinen, J. et al. Reliable wolf-dog hybrid detection in Europe using a reduced SNP panel developed for non-invasively collected samples. BMC Genom. 22, 473. https://doi.org/10.1186/s12864-021-07761-5 (2021).
Blåhed, I. M., Ericsson, G. & Spong, G. Noninvasive population assessment of moose (Alces alces) by SNP genotyping of fecal pellets. Eur. J. Wildl. Res. 65, 96. https://doi.org/10.1007/s10344-019-1337-8 (2019).
Ruegg, K. C. et al. Mapping migration in a Songbird using high-resolution genetic markers. Mol. Ecol. 23, 5726–5739. https://doi.org/10.1111/mec.12977 (2014).
Holman, L. E., Onoufriou, A., Hillestad, B. & Johnston, I. A. A workflow used to design low density SNP panels for parentage assignment and traceability in aquaculture species and its validation in Atlantic salmon. Aquaculture 476, 59–64. https://doi.org/10.1016/j.aquaculture.2017.04.001 (2017).
Funk, M. W. & Kidd, J. M. A Variant-Centric analysis of allele sharing in dogs and wolves. Genes 15, 1168. https://doi.org/10.3390/genes15091168 (2024).
Purcell, S. et al. PLINK: A toolset for whole-genome association and population-based linkage analysis. Am. J. Hum. Genet. 81, 559–575. https://doi.org/10.1086/519795 (2007).
Chang, C. C. et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 4, 7. https://doi.org/10.1186/s13742-015-0047-8 (2015).
Keenan, K. et al. An R package for the Estimation and exploration of population genetics parameters and their associated errors. Methods Ecol. Evol. 4, 782–788. https://doi.org/10.1111/2041-210X.12067 (2013).
Otto, S. P. et al. About PAR: the distinct evolutionary dynamics of the pseudoautosomal region. Trends Genet. 27, 358–367. https://doi.org/10.1016/j.tig.2011.05.001 (2011).
Ekblom, R. et al. Genome sequencing and conservation genomics in the Scandinavian Wolverine population. Conserv. Biol. 32, 1301–1312. https://doi.org/10.1111/cobi.13157 (2018).
Frichot, E. & François, O. LEA: An R package for landscape and ecological association studies. Methods Ecol. Evol. 6, 925–929. https://doi.org/10.1111/2041-210X.12382 (2015).
Moran, M. B. & Anderson, E. C. Bayesian inference from the conditional genetic stock identification model. Can. J. Fish. Aquat. Sci. 76, 551–560. https://doi.org/10.1139/cjfas-2018-0016 (2019).
Caniglia, R. et al. A standardized approach to empirically define reliable assignment thresholds and appropriate management categories in deeply introgressed populations. Sci. Rep. 10, 2862. https://doi.org/10.1038/s41598-020-59521-2 (2020).
Peakall, R. & Smouse, P. E. GenAlEx 6.5: genetic analysis in Excel. Population genetic software for teaching and research—An update. Bioinformatics 28, 2537–2539. https://doi.org/10.1093/bioinformatics/bts460 (2012).
Waits, L. P., Gordon, L. & Taberlet, P. Estimating the probability of identity among genotypes in natural populations: Cautions and guidelines. Mol. Ecol. 10, 249–256. https://doi.org/10.1046/j.1365-294X.2001.01185.x (2001).
Garbe, J. R. & Yang, D. Pedigraph: A Software Tool for the Graphing and Analysis of Large Complex pedigrees. Version 2.4 User Manual (Department of Animal Science, University of Minnesota, 2008). https://animalgene.umn.edu/pedigraph
Mäntyniemi, S. et al. Suomen Susikannan Suotuisan Suojelutason Viitearvojen Määrittäminen. Luonnonvara- Ja Biotalouden Tutkimus 80/2022 (Luonnonvarakeskus, 2022). http://urn.fi/URN:ISBN:978-952-380-514-9.
Jones, O. R. & Wang, J. COLONY: a program for parentage and Sibship inference from multilocus genotype data. Mol. Ecol. Resour. 10, 551–555. https://doi.org/10.1111/j.1755-0998.2009.02787.x (2010).
Heikkinen, S. et al. Susikanta Suomessa maaliskuussa 2023. Luonnonvara- ja biotalouden tutkimus 70/2023. Luonnonvarakeskus Helsinki. 120. (2023). http://urn.fi/URN:ISBN:978-952-380-744-0
Valdmann, H. & Saarma, U. Winter diet of Wolf (Canis lupus) after the outbreak of African swine fever and under the severely reduced densities of wild Boar (Sus scrofa). Hystrix It J. Mamm. 2, 154–156. https://doi.org/10.4404/hystrix-00298-2020 (2020).
Kojola, I. & Kuittinen, J. Wolf attacks on dogs in Finland. Wildl. Soc. Bull. 30, 498–501 (2002). http://www.jstor.org/stable/3784509
Buchalski, M. R. et al. Development of a 95 SNP panel to individually genotype mountain lions (Puma concolor) for microfluidic and other genotyping platforms. Conserv. Genet. Resour. 14, 147–150. https://doi.org/10.1007/s12686-022-01255-6 (2022).
Blåhed, I. M., Königsson, H., Ericsson, G. & Spong, G. Discovery of SNPs for individual identification by reduced representation sequencing of moose (Alces alces). PLoS ONE. 13, e0197364. https://doi.org/10.1371/journal.pone.0197364 (2018).
Wehrenberg, G., Tokarska, M., Cocchiararo, B. & Nowak, C. A reduced SNP panel optimised for non-invasive genetic assessment of a genetically impoverished conservation icon the European Bison. Sci. Rep. 14, 1875. https://doi.org/10.1038/s41598-024-51495-9 (2024).
Lawson, D. J., van Dorp, L. & Falush, D. A tutorial on how not to over-interpret STRUCTURE and ADMIXTURE bar plots. Nat. Commun. 9, 3258. https://doi.org/10.1038/s41467-018-05257-7 (2018).
Norman, A. J., Street, N. R. & De Spong, G. Novo SNP discovery in the Scandinavian brown bear (Ursus arctos). PLoS ONE. 8, e81012. https://doi.org/10.1371/journal.pone.0081012 (2013).
Thavornkanlapachai, R. et al. Species-specific SNP arrays for non-invasive genetic monitoring of a vulnerable Bat. Sci. Rep. 14, 1847. https://doi.org/10.1038/s41598-024-51461-5 (2024).
Förster, D. W. et al. Targeted resequencing of coding DNA sequences for SNP discovery in nonmodel species. Mol. Ecol. Resour. 18, 1356–1373. https://doi.org/10.1111/1755-0998.12924 (2018).
Granroth-Wilding, H. et al. Non-invasive genetic monitoring involving citizen science enables reconstruction of current pack dynamics in a re-establishing Wolf population. BMC Ecol. 17, 1–15. https://doi.org/10.1186/s12898-017-0154-8 (2017).
Galpern, P., Manseau, M., Hettinga, P., Smith, K. & Wilson, P. Allelematch: An R package for identifying unique multilocus genotypes where genotyping error and missing data May be present. Mol. Ecol. Resour. 12, 771–778. https://doi.org/10.1111/j.1755-0998.2012.03137.x (2012).
Anderson, E. C. & Garza, J. C. The power of single-nucleotide polymorphisms for large-scale parentage inference. Genetics 172, 2567–2582. https://doi.org/10.1534/genetics.105.048074 (2006).
Giangregorio, P. et al. Performance of SNP markers for parentage analysis in the Italian alpine brown bear using non-invasive samples. Nat. Conserv. 53, 105–123 (2023).
Lachance, J. & Tishkoff, S. A. SNP ascertainment bias in population genetic analyses: Why it is important, and how to correct it. BioEssays 35, 780–786. https://doi.org/10.1002/bies.201300014 (2013).
Valtonen, M. et al. Suomen Susikannan Suotuisan Suojelutason Viitearvojen määrittäminen: Väliraportti Syyskuu 2021. Luonnonvara- Ja Biotalouden Tutkimus 66/2021 (Luonnonvarakeskus, 2021). http://urn.fi/URN:ISBN:978-952-380-277-3.
Acknowledgements
We extend our gratitude to the dedicated individuals involved in coordinating and collecting wolf DNA samples, including: Samuli Heikkinen, Jaakko Alalantela, Antti Härkälä, Leo Korhonen, Reima Ovaskainen, Tapio Visuri, Pasi Aatsinki, Reijo Hautajärvi, Maria Honkakoski, Sami Hoppula, Jukka Lahti, Ville Lumberg, Jukka Siitari and Tapio Ylimartimo from Luke, and Mikko Jokinen, Jenni Pirinen and Mari Tikkunen from Wildlife Agency Finland, along with the numerous volunteers. Special thanks go to Harri Hiltunen, Anita Kenttälä, Noora Parre and Satu Tolvanen from Luke Taivalkoski Research Station for their efforts in preparing and sending tissue samples of hunted wolves. We also acknowledge the invaluable contributions of Oliver Bitz, Marja-Riitta Arajärvi, Sirpa Moisander, Sari Saviahde, Jonna Tabell, Tuula-Marjatta Hamama, Anneli Virta and Jouni Virta from Luke GenoLab Jokioinen for their work in the DNA laboratory. Furthermore, we are grateful to Øystein Flagstad for providing information on the Scandinavian wolf population, and we appreciate the cooperation of the Finnish Wildlife Agency in DNA collection efforts.
Funding
This study was possible through funding from the Finnish Ministry of Agriculture and Forestry, Natural Resources Institute Finland and the LIFE BOREALWOLF project (LIFE18 NAT/FI/000394). U.S. and M.H. were supported by research funding (grants PRG1209m PSG715 and TK215) from the Estonian Ministry of Education and Research.
Author information
Authors and Affiliations
Contributions
J.H., A-M.H., H.J., M.V., L.K., J.A. designed the study. J.H. and H.J. analyzed and interpreted data. M.V coordinated sample collection for Finnish wolf monitoring. U.S. and M.H. provided Estonian wolf and golden jackal samples. H.L. and E.S. provided the dog and M.Å. the Scandinavian wolf dataset genotyped with the Illumina CanineHD BeadChip. J.H., H.J., and lab personnel carried out the lab experiments, D.F. provided scripts, and J.H., H.J., M.V., L.K., J.A. wrote the manuscript. All authors edited and approved the final manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Harmoinen, J., Valtonen, M., Fischer, D. et al. Development of a cost-effective, multifunctional SNP panel and analysis workflow for Wolf monitoring in Finland. Sci Rep 15, 40816 (2025). https://doi.org/10.1038/s41598-025-24502-w
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41598-025-24502-w





