Evaluating regional heritability mapping methods for identifying QTLs in a wild population of Soay sheep

James, Caelinn; Pemberton, Josephine M.; Navarro, Pau; Knott, Sara

doi:10.1038/s41437-025-00770-0

Download PDF

Article
Open access
Published: 23 May 2025

Evaluating regional heritability mapping methods for identifying QTLs in a wild population of Soay sheep

Heredity volume 134, pages 374–386 (2025)Cite this article

2437 Accesses
2 Citations
2 Altmetric
Metrics details

Subjects

Abstract

The study of complex traits and their genetic underpinnings is crucial for understanding the evolutionary processes and mechanisms that shape natural populations. Regional heritability mapping (RHM) is a method for estimating the heritability of genomic segments that may contain both common and rare variants affecting a complex trait. This research is important because it advances our ability to detect genetic loci that contribute to phenotypic variation, even those that might be missed by traditional methods such as genome-wide association studies (GWAS). Here, we compare three RHM methods: SNP-RHM, which uses genomic relationship matrices (GRMs) based on SNP genotypes; Hap-RHM, which utilizes GRMs based on haplotypes; and SNHap-RHM, which integrates both SNP-based and haplotype-based GRMs jointly. These methods were applied to data from a wild population of sheep, focusing on the analysis of eleven polygenic traits. The results were compared with findings from previous GWAS to assess how RHM performed at identifying both known and novel associated loci. We found that while the inclusion of the regional matrix did not account for significant variation in all regions associated with trait variation as identified by GWAS, it did uncover several regions that were not previously linked to trait variation. This suggests that RHM methods can provide additional insights into the genetic architecture of complex traits, highlighting regions of the genome that may be overlooked by GWAS alone. This study underscores the importance of using complementary approaches to fully understand the genetic basis of complex traits in natural populations.

Investigating pedigree- and SNP-associated components of heritability in a wild population of Soay sheep

Article Open access 10 February 2024

Estimates of genomic heritability and genome-wide association studies for blood parameters in Akkaraman sheep

Article Open access 02 November 2022

Regional heritability mapping identifies several novel loci (STAT4, ULK4, and KCNH5) for primary biliary cholangitis in the Japanese population

Article Open access 09 April 2021

Introduction

Finding the exact causal variants influencing a polygenic trait is often challenging, due to both the number of variants involved and the low effect sizes of the causal variants (Boopathi 2013). Some approaches instead look for variants in linkage disequilibrium (LD) with the causal variants to identify regions of the genome in which the causal variants reside (these regions are referred to as quantitative trait loci—QTL) (Hirschhorn and Daly 2005). Whilst the variants uncovered during QTL identification may not directly affect the focal trait, they can serve as markers for the causal variants and help researchers to locate these casual variants or to understand the size of their effects. By using these approaches, researchers can gain important information about the genetic architecture of a trait, even if they cannot directly identify the causal variants without performing functional studies. Genome-wide association studies (GWAS) are commonly used to identify genotyped SNPs in LD with causal loci. However, GWAS have some limitations and challenges that prevent it from finding all the genetic factors that contribute to complex traits (Du et al. 2012). One of these limitations is the power of GWAS (Yang et al. 2010), which is the ability of GWAS to detect true SNP-trait associations. The power of GWAS depends on several factors, such as the sample size, the variant effect size and frequency and the LD between genotyped and causal SNPs (Yang et al. 2010).

To overcome the limitations of GWAS, especially when a trait is influenced by multiple independent effects and/or rare variants in a region, regional heritability mapping (RHM) methods have been developed (Nagamine et al. 2012; Shirali et al. 2018; Oppong et al. 2021). RHM is a technique that estimates the heritability of a trait that is explained by a specific region of the genome. To estimate the heritability of a region, RHM uses a genomic relationship matrix (GRM), which is a matrix that captures the genetic similarity between individuals based on the proportion of shared SNP genotypes in that region (Yang et al. 2010). RHM also corrects for the genetic similarity across the whole genome by fitting another GRM that includes all the SNPs in the genome (or a leave-one-chromosome-out (LOCO) GRM that excludes the chromosome where the region of interest is located) (Yang et al. 2010). By comparing the model fit to a null model that does not fit the regional GRM (rGRM), RHM can identify regions that contain causal variants for the trait and by using the variance estimate for the rGRM, RHM can estimate how much heritability that region contributes.

RHM can be performed using different types of rGRMs and region sizes, depending on the assumptions and goals of the analysis. There are three main types of RHM that have been proposed. The first type is SNP-RHM, which uses rGRMs that are based on the sharing of SNP alleles across a region. The regions are usually defined as windows that contain a fixed number of SNPs (Nagamine et al. 2012). SNP-RHM aims to identify regions with multiple SNPs that are in LD with the multiple causal variants that have too small an effect on the trait individually to be detected by GWAS. However, SNP-RHM only captures effects associated with genotyped SNPs and only captures additive variance. The second type is Hap-RHM, which uses rGRMs that are based on the sharing of haplotype alleles across a region. The regions are defined as haplotype blocks (Shirali et al. 2018). Hap-RHM aims to identify regions where the causal variant is in LD with the haplotype allele, but not necessarily with any specific genotyped SNPs, which allows for detection of variance that is not captured by genotyped SNPs. This method can capture the effect of rare causal variants due to rare haplotype alleles being more likely to be in LD with rare variants than individual, genotyped SNPs. In addition, haplotype effects may reflect the interaction effects of closely linked causal variants (epistasis). The third method, SNHap-RHM, simultaneously fits two rGRMs: one SNP-based and one haplotype-based, and defines regions as haplotype blocks (Oppong et al. 2021). This combines the advantages of both SNP-RHM and Hap-RHM to increase power to detect regions containing variants influencing the phenotype. On occasions where SNP-RHM and Hap-RHM can detect genetic variance in the same haplotype block, SNHap-RHM can also be used to give more insight into the underlying genetic architecture.

Here, we evaluate the three RHM methods for their ability to identify regions containing potentially causal loci in a sample of wild Soay sheep. In this study, we analysed 11 polygenic morphometric traits in the Soay sheep population using RHM. These traits include the same traits measured at different ages, as they are affected by different non-genetic factors (and potentially different genetic factors) and vary in heritability across different stages of life. Despite using various methods to search for the genetic variants that affect these traits, such as GWAS (Bérénos et al. 2015; James et al. 2022), genomic prediction (Ashraf et al. 2021) and chromosome partitioning (Bérénos et al. 2015), most of the genetic variation in these traits remains unexplained by the genotyped and imputed SNPs. Moreover, for some of these traits, there are no SNPs that show significant association with the trait variation to date.

This paper aims to evaluate the applicability of RHM methods for Soay sheep data, considering the smaller sample sizes, lower density SNP data and higher potential for missing data compared to human datasets. To do this, we compare RHM results with GWAS to assess RHM’s ability to recover known associations and identify new ones. Additionally, this paper uses the results of the RHM analyses to enhance the understanding of the genetic architecture of focal traits and identifying potential causal genes based on functional data.

Methods

Phenotypic data

The Soay sheep (Ovis aries) is a primitive breed of sheep that lives on the St. Kilda archipelago, a small group of islands off the west coast of Scotland. Since 1985, a long-term, individual-based study has been conducted on the population residing on the island of Hirta, the largest of the islands (Clutton-Brock and Pemberton 2003). Each individual is sampled for DNA analysis and ear-tagged when it is first captured (usually within 10 days of birth) so that it can be re-identified later. The study involves regular recaptures to measure various traits throughout an individual’s life, and collection and measurement of skeletal remains after death.

We focused on 11 age-specific morphometric traits which have been repeatedly analysed by different approaches and are known to be polygenic (Bérénos et al. 2015; Ashraf et al. 2021; Hunter et al. 2022; James et al. 2022) (see Table 1 for the number of individuals and records per trait). We analysed these traits separately by age class (neonate, lamb and adult). Birth weight was the only trait analysed in neonates, defined as individuals who were caught and weighed between two and 10 days after birth. In August, lambs (aged ~4 months) and adults were caught and measured for weight, foreleg length and hindleg length. Due to adults being recaptured across multiple years, the adult live traits included repeated measurements. Metacarpal length and jaw length were measured from the skeletons after death. We classified ‘lambs’ as individuals who had live trait data recorded in the August of their birth year, or who died before 14 months of age for post mortem measures. We classified ‘adults’ as individuals who had live trait data recorded at least 2 years after birth, or who died after 26 months of age for post mortem measures. Birth and August weights are recorded to the nearest 0.1 kg, whilst the length traits are measured to the nearest mm (Beraldi et al. 2007). We did not analyse yearlings due to low sample size.

Table 1 Number of individuals and records, fixed and random effects fitted in each trait and age class model during RHM pre-correction, alongside the LOCO GRM.

Full size table

Genetic data

8557 sheep have been genotyped on the Ovine SNP50 Illumina BeadChip, of which 38,130 SNPs are autosomal and polymorphic in the population. 188 individuals have additionally been genotyped on the Ovine Infinium HD SNP BeadChip which genotypes 600 K SNPs; these individuals were specifically selected to maximise the genetic diversity represented in the full population. This allowed for imputation of the remaining genotyped individuals to this higher density. AlphaImpute v1.98 (Hickey et al. 2012) was used for the imputation as it combines shared haplotype and pedigree information to increase imputation accuracy (see Stoffel et al. 2021 for details on our imputation). Genotypes with a probability of <0.99 were excluded, resulting in 419,281 autosomal SNPs remaining for 8557 individuals (4035 females, 4452 males). Cross-validation with the 50 K SNP genotype data gave a concordance rate of 0.995. Imputed genotype ‘hard’ calls were used instead of genotype probabilities in the analyses detailed in this manuscript. We have previously shown that imputation does not affect whole-genome heritability estimates for these traits in this population (James et al. 2022).

Locus positions for both sets of genetic data were based on the OAR_v3.1 genome assembly. Phased data is required for Hap-RHM and SNHap-RHM; genotypes were phased using SHAPEIT v4.2 (Delaneau et al. 2019).

Splitting the genome into regions

In this paper, we used three RHM methods: SNP-RHM, Hap-RHM and SNHap-RHM. To allow for direct comparisons of results, we used the same regions for each RHM method. Due to Hap-RHM and SNHap-RHM requiring regions to be defined as haplotype blocks, we used haplotype blocks for all three methods. Haplotype blocks were estimated with Plink v1.90’s—blocks command (Purcell et al. 2007; Purcell 2014) using the high-density imputed genotype data. All SNPs with a MAF higher than 0.01 were included when calculating haplotype block boundaries, and any gaps larger than 500 kb between consecutive SNPs were automatically considered to be haplotype block boundaries. Using a higher max kb threshold or lower MAF threshold did not alter the haplotype block boundaries estimated.

No haplotype block was allowed to have only one SNP, due to the SNP-based GRM and haplotype-based GRM being identical for such blocks, resulting in the SNP-RHM and Hap-RHM methods therefore being identical and SNHap-RHM fitting two identical (and thus confounding) GRMs. Any block containing only one SNP was therefore omitted from the analysis.

Blocks were determined using all 8557 individuals with imputed genotypes to ensure consistency across phenotypes.

Pre-correction of phenotypes

To perform the RHM analyses, we used DISSECT (Canela-Xandri et al. 2015) as it has the ability to generate the haplotype-based GRMs. However, pre-correction is a necessary step when performing RHM with DISSECT due to DISSECT being unable to fit the necessary fixed and random effects during the RHM step.

Prior to RHM, the traits were pre-corrected to account for genome-wide genetic diversity by fitting LOCO GRMs, which are constructed from all autosomes with the exception of one chromosome (Yang et al. 2014). The LOCO GRMs were computed using DISSECT and the VanRaden 2 method (VanRaden 2008), with the genetic relationship between individuals i and j is computed as:

$${A}_{{ij}}=\frac{1}{N}\mathop{\sum }\limits_{k=1}^{N}\frac{\left({s}_{{ik}}-2{p}_{k}\right)\left({s}_{{jk}}-2{p}_{k}\right)}{2{p}_{k}\left(1-{p}_{k}\right)}$$

where s_ik is the number of copies of the reference allele for SNP k of the individual i, p_k is the frequency of the reference allele for the SNP k, and N is the number of SNPs.

We also fitted fixed and non-genetic random effects during pre-correction (see Table 1 for a full list of fixed and non-genetic random effects fitted). Pre-correction for the non-repeated measures traits was performed in DISSECT (Canela-Xandri et al. 2015) using the following model:

$${\rm{y}}={\rm{X}}{\rm{\beta }}+\sum _{r}{{\rm{Z}}}_{r}{{\rm{u}}}_{r}+{\rm{W}}{g}_{{LOCO}}+{\rm{\varepsilon }}$$

where y is the vector of phenotypic values; X is a design matrix linking individual records with the vector of fixed effects β, Z_r is an incidence matrix that relates a random effect to the individual records; u_r is the associated vector of non-genetic random effects; g_LOCO is the vector of additive genetic random effects from all autosomes except for that containing the focal region with W the incidence matrix linking individual phenotypes with the genetic effect; and ε is the vector of residuals. It is assumed that g_LOCO ~ MVN(0, Mσ_gLOCO²), where σ_gLOCO² is the additive genetic variance explained by all autosomes except the excluded one, and M is the LOCO GRM.

As there are 26 autosomes in the sheep genome, we generated 26 LOCO GRMs—this in turn resulted in each trait having 26 pre-corrected phenotypes. The advantage of using LOCO GRMs over a single whole genome GRM (which would result in one pre-corrected phenotype per trait) is that when performing RHM, when a rGRM is fitted, we can use the pre-corrected phenotype that excluded the chromosome on which the region (the haplotype blocks generated in the previous subsection) is located, preventing the effect of the SNPs in that region from being fitted twice in the model.

The residual for each individual was then taken as the pre-corrected phenotype for RHM:

$${y}_{{pre}-{corrected}}={\rm{\varepsilon }}$$

Pre-correction for the three repeated measures traits (adult August weight, adult foreleg length and adult hindleg length) was performed using ASReml-R (version 4.1, Butler et al. 2017) using the same model as given above, and the mean of the residuals summed with the permanent environment effect for each individual was taken as the phenotype for RHM:

$${y}_{{pre}-{corrected}}=\overline{{pe}}+{\rm{\varepsilon }}$$

Regional heritability mapping

RHM was performed using DISSECT (Canela-Xandri et al. 2015), which generates both the SNP-based and haplotype-based rGRMs and simultaneously performs SNP-RHM, Hap-RHM and SNHap-RHM for each region all in one step.

SNP-RHM

SNP-RHM aims to identify regions with multiple SNPs that are in LD with the multiple causal variants that have too small an effect on the trait individually to be detected by GWAS by fitting a rGRM based on the sharing of SNP alleles across a region.

SNP-RHM is performed using the following model:

$${y}_{{pre}-{corrected}}={\rm{W}}{r}_{{SNP}}+{\rm{e}}$$

where y_{pre-corrected} is the vector of pre-corrected phenotypic values, r_SNP is the vector of individual additive genetic random effects from all SNPs contained within the focal haplotype block and e is the vector of residuals. It is assumed that r_SNP ~ MVN(0, Mσ_rSNP²), where σ_rSNP² is the additive genetic variance from all SNPs in the haplotype block and M is the GRM. The GRMs were computed using DISSECT (Canela-Xandri et al. 2015). The SNP-based GRMs were calculated using the same method as the LOCO GRMs, except they were constructed from the SNPs located in the focal haplotype block.

Hap-RHM

Hap-RHM aims to identify regions where the causal variant is in LD with the haplotype allele, but not necessarily with any specific genotyped SNPs. This allows for detection of variance that is not captured by genotyped SNPs and can capture the effect of rare causal variants due to rare haplotype alleles being more likely to be in LD with rare variants than individual, genotyped SNPs. In addition, haplotype effects may reflect the interaction effects of closely linked causal variants.

Hap-RHM is performed using the following model:

$${y}_{{pre}-{corrected}}={\rm{W}}{r}_{{Hap}}+{\rm{e}}$$

where y_{pre-corrected} is the vector of pre-corrected phenotypic values, r_Hap is the vector of individual additive genetic random effects from the haplotype alleles for the focal haplotype block and e is the vector of residuals. It is assumed that r_Hap ~ MVN(0, Hσ_rHap²), where σ_rHap² is the additive genetic variance from the haplotype alleles and H is the GRM. The haplotype-based GRMs were computed using DISSECT (Canela-Xandri et al. 2015), and the genetic relationship individuals i and j is calculated as follows:

$${H}_{{ij}}=\frac{1}{h}\mathop{\sum }\limits_{k=1}^{h}\frac{\left({d}_{{ik}}-2{p}_{k}\right)\left({d}_{{jk}}-2{p}_{k}\right)}{2{p}_{k}\left(1-{p}_{k}\right)}$$

where d_ik is the diplotype code (coded as the number of copies of haplotype k for individual i and takes the values 0, 1, and 2, pk is the frequency of haplotype k and h is the number of haplotypes in the region (see Oppong et al. 2021 for further information and examples).

SNHap-RHM

SNHap-RHM simultaneously fits both a SNP-based rGRM and a haplotype-based rGRM, which combines the advantages of both SNP-RHM and Hap-RHM to increase power to detect regions containing variants influencing the phenotype. On occasions where SNP-RHM and Hap-RHM can detect genetic variance in the same haplotype block, SNHap-RHM can also be used to give more insight into the underlying genetic architecture.

SNHap-RHM is performed using the following model:

$${y}_{{pre}-{corrected}}={\rm{W}}{r}_{{SNP}}+{\rm{W}}{r}_{{Hap}}+{\rm{e}}$$

where y_{pre-corrected} is the vector of pre-corrected phenotypic values, r_SNP is the vector of individual additive genetic random effects from all SNPs contained within the focal haplotype block and r_Hap is the vector of individual additive genetic random effects from the haplotype alleles for the focal haplotype block and e is the vector of residuals. It is assumed that r_SNP ~ MVN(0, Mσ_rSNP²) and r_Hap ~ MVN(0, Hσ_rHap²), where σ_rSNP² is the additive genetic variance from all SNPs in the haplotype block, σ_rHap² is the additive genetic variance from the haplotype alleles and M and H are the respective GRMs. The GRMs were computed using DISSECT (Canela-Xandri et al. 2015). The rGRMs were calculated as for SNP-RHM and Hap-RHM, respectively.

Null model and multiple testing

To test whether the regional heritability models explained significant variation for each region, we compared them against the null model:

$${y}_{{pre}-{corrected}}={\rm{e}}$$

using loglikelihood ratio testing (LRT).

We performed five comparisons; SNP-RHM, Hap-RHM and SNHap-RHM were all compared with the null model, and SNHap-RHM was additionally compared to each of SNP-RHM and Hap-RHM individually. LRTs were performed with 1 degree of freedom, with the exception of the comparison of SNHap-RHM to the null model, which was performed with 2 degrees of freedom. P values were calculated as 0.5× the p value of a chi-squared distribution with one degree of freedom for the 1 degree of freedom tests. For the 2 degrees of freedom tests, the p values were calculated as 0.25× the p value of a chi-squared distribution with two degrees of freedom plus 0.5× the p value of a chi-squared distribution with one degree of freedom (Self and Liang 1987). To account for multiple testing, model fit was considered to be significantly improved if the resulting p value was less than 1.04e⁻⁰⁶ (0.05 divided by 48,125, the total number of haplotype blocks).

Comparison with GWAS

To determine how well the different RHM methods detected previously discovered loci, we identified which haplotype blocks contained the top SNP from each peak significantly associated with phenotypic variation for each trait when performing GWAS. GWAS and conditional GWAS analysis has recently been performed using the high density genotype data (James et al. 2022), so we used the results from that analysis. The significance threshold used in the GWAS analysis was 1.03e⁻⁰⁶ (0.05/48,635) (James et al. 2022), which accounted for multiple testing using the SimpleM method (Gao et al. 2008). This method accounts for LD between markers in order to calculate the effective number of independent tests. We also compared the proportion of genetic variance explained by the regions for which model fit was significantly improved for each of the RHM methods against the proportion of genetic variance explained by previous GWAS results. In loci with multiple significant regions, only the region with highest heritability was included in the calculation.

Identification of candidate genes

We extracted a list of genes overlapping any haplotype block for which model fit was improved by at least one RHM model, using the R biomaRt package (Durinck et al. 2005; Durinck et al. 2009) from the OAR_v3.1 genome assembly. Each gene was then reviewed against the Ensembl (Howe et al. 2020) and NCBI Gene (Bethesda (MD): National Library of Medicine (US) 2004–2023) databases to examine expression and functional annotations. Human and mouse orthologues were also used to characterise gene function due to the high level of genetic annotation in these two species.

Results

Soay sheep haplotype blocks

Setting the maximum kb between any two variants within the same haplotype block to 500 Kb and the minimum minor allele frequency (MAF) for variants to be considered to 0.01 resulted in 48,125 haplotype blocks being estimated across the 26 Soay sheep autosomes. The maximum number of SNPs in a given haplotype block was 111, the minimum was 2 (as blocks with one SNP were omitted), and the average number of SNPs per haplotype was 8.19. 75% of haplotype blocks contained 10 or less SNPs, and 99% of blocks contained 50 or less. Additional information about the haplotype blocks can be found in the Supplementary Text.

Comparison of RHM

A summary of results for the RHM analyses are shown in Tables 2 and 3, whilst detailed results are shown in Supplementary Tables 1–10 For ease of reporting, we have grouped the traits into the following; birth weight and lamb August weight, lamb leg length traits, lamb jaw length, adult August weight, adult leg length traits, and adult jaw length. To assess whether RHM is capable of identifying both previously associated loci and novel loci in comparison to GWAS, we compared the results to James et al. (2022).

Table 2 Number of haplotype blocks for which inclusion of regional GRMs improved model fit.

Full size table

Table 3 Percentage of genetic variance explained for each trait by each RHM method, and previous GWAS analyses (James et al. 2022).

Full size table

Birth weight and lamb August weight

None of the RHM models significantly improved model fit for any haplotype blocks for either birth weight or lamb August weight, meaning that no regions of the genome were found to significantly explain additional genetic variance not accounted for during pre-correction (see ‘Methods’).

Lamb leg length traits

For lamb foreleg length and lamb hindleg length, Hap-RHM was the only model which significantly improved model fit in comparison to the null model. Improved model fit was shown for one haplotype block on chromosome 1 and one on chromosome 11 for lamb foreleg length (Fig. 1A), and one on chromosome 2 and chromosome 3 for lamb hindleg length (Fig. 1B) (Supplementary Tables 1–3). All four of these blocks are novel associations, as they do not contain SNPs that have previously been found to be associated with any lamb leg length trait. The regions for which Hap-RHM significantly improved model fit explained 6.84% of the total genetic variance for lamb foreleg length and 10.19% of the total genetic variance for lamb hindleg length (Table 3); in comparison, the independently associated SNPs from GWAS analyses explained 1.17% and 1.48% for each trait respectively. Seven genes overlapped with these four haplotype blocks; however, none have a clear link to leg length, skeletal size or growth (Supplementary Table 1).

For lamb metacarpal length, model fit was significantly improved for a total of 40 haplotype blocks across chromosomes 16 and 19 when using at least one RHM method. SNP-RHM improved model fit for all of these blocks, and 30 blocks also showed improved model fit for at least one other model, though no blocks showed increased model fit for SNHap-RHM when compared to SNP-RHM (Fig. 1C, Supplementary Tables 1 and 4).

On chromosome 16, there were 16 haplotype blocks that significantly improved model fit using SNP-RHM. Out of these, 14 blocks showed improved model fit using Hap-RHM, 10 of which also showed improved model fit by SNHap-RHM when compared to the null model (Fig. 1D, Supplementary Tables 1 and 4). On chromosome 19, there were 24 haplotype blocks for which SNP-RHM significantly improved model fit, of which 11 also showed increased model fit for Hap-RHM, 20 showed increased model fit for SNHap-RHM when compared to the null model, and four showed increased model fit for SNHap-RHM when compared to Hap-RHM. (Fig. 1D, Supplementary Tables 1 and 4).

In total, SNP-RHM explained 56.59% of the total genetic variance for lamb metacarpal length, Hap-RHM explained 13.64%, and SNHap-RHM explained 77.16% (Table 3). In comparison, previous GWAS results explained 5.14% of the total genetic variance.

Previous GWAS have found significant associations between lamb metacarpal length and SNPs on chromosomes 16 and 19; the haplotype block containing these SNPs on chromosome 16 showed significantly improved model fit for SNP-RHM, Hap-RHM and SNHap-RHM compared to the null model, whilst the block containing these SNPs on chromosome 19 showed significantly improved model fit for SNP-RHM, and SNHap-RHM when compared to the null model and to Hap-RHM.

165 genes overlapped the haplotype blocks for which model fit was significantly improved by at least one RHM method; of these, six had potential links to leg length and skeletal growth (Table 4, Supplementary Table 1).

Table 4 - Potential candidate genes for future analyses.

Full size table

Lamb jaw length

For lamb jaw length, model fit was significantly improved for five haplotype blocks for at least one RHM method; one each on chromosomes 3, 14 and 17 and two blocks on chromosome 13. Hap-RHM was shown to improve model fit for all five of these blocks, and model fit for the two blocks on chromosome 13 was also improved by SNHap-RHM when compared to SNP-RHM (Fig. 2, Supplementary Tables 1 and 5). All five of these blocks are novel associations; no previous associations have been found for lamb jaw length. In total, Hap-RHM explained 10.05% of the total genetic variance, whilst SNHap-RHM explained 1.79%.

50 genes overlapped these five blocks; only one of these genes has a potential association with jaw length (Table 4, Supplementary Table 1).

Adult August weight

For adult August weight, model fit was significantly improved for 84 haplotype blocks over 22 chromosomes (Fig. 3A, Supplementary Tables 1 and 6). For 83 of these blocks, model fit was improved when using Hap-RHM, of which 57 blocks also showed improved model fit when using SNHap-RHM when compared to SNP-RHM, and 33 of those blocks also showed improved model fit when using SNHap-RHM when compared to the null model. The final block only showed significant improvement in model fit for SNHap-RHM when compared to the null model (Fig. 3B, Supplementary Tables 1 and 6). None of these blocks overlapped with previous GWAS associations for adult August weight (Supplementary Table 11).

Hap-RHM explained 46.10% of the total genetic variance for adult August weight, whilst SNHap-RHM explained 63.33% (Table 3). In comparison, previous GWAS results explained 9.31% of the total genetic variance.

86 genes overlapped these 84 blocks; of these four had associations with body weight and obesity (Table 4, Supplementary Table 1).

Adult leg length traits

For adult foreleg length, model fit was significantly improved for six haplotype blocks; one each on chromosomes 1, 6, 11, 12, 23 and 26 (Fig. 4A, Supplementary Tables 1 and 7). For all six blocks, the models that improved model fit were Hap-RHM, SNHap-RHM when compared to the null model and SNHap-RHM when compared to SNP-RHM. All of these blocks were novel associations, as they did not contain SNPs that had previously been associated with leg length in prior analyses. Hap-RHM explained 4.13% of the total genetic variance for adult foreleg length, whilst SNHap-RHM explained 5.29% (Table 3). In comparison, previous GWAS results explained 9.49% of the total genetic variance.

For adult hindleg length, model fit was significantly improved for 27 haplotype blocks. 25 of these haplotype blocks showed significant model fit improvement for Hap-RHM, with 17 of them also showing significant model improvement for SNHap-RHM when compared to SNP-RHM, and another two showing significant model improvement for SNP-RHM. The remaining two blocks only showed significant model fit improvement for SNP-RHM (Fig. 4B, Supplementary Tables 1 and 8). The 27 haplotype blocks were distributed across 15 different chromosomes, with the four blocks for which SNP-RHM improved model fit all located on chromosome 16. These four blocks were the only blocks on chromosome 16 that showed improved model fit (Fig. 4C, Supplementary Tables 1 and 8). One of these blocks on chromosome 16 contained SNPs that had previously been associated with adult hindleg length when performing GWAS. The remaining blocks were all novel associations. SNP-RHM explained 0.17% of the total genetic variance for adult hindleg length, Hap-RHM explained 2.68% and SNHap-RHM 2.71% (Table 3). In comparison, previous GWAS results explained 5.24% of the total genetic variance.

For adult metacarpal length, model fit was significantly improved for a total of 19 haplotype blocks across chromosomes 16 and 19 when using at least one RHM method. SNP-RHM improved model fit for all of these blocks, and 18 blocks also showed improved model fit for at least one other model, though no blocks showed increased model fit for SNHap-RHM when compared to either SNP-RHM or Hap-RHM (Fig. 4D, Supplementary Tables 1 and 9).

On chromosome 16, there were 15 haplotype blocks that significantly improved model fit using SNP-RHM. Out of these, 12 blocks showed improved model fit using Hap-RHM and 12 showed improved model fit by SNHap-RHM when compared to the null model, though the blocks for which these two models improved model fit were not all the same (Fig. 4E, Supplementary Tables 1 and 9). On chromosome 19, there were 4 haplotype blocks significantly improved model fit, and all four blocks showed improved model fit for both SNP-RHM and SNHap-RHM when compared to the null model (Fig. 4E, Supplementary Tables 1 and 9). One of these blocks contained SNPs that had previously been associated with adult metacarpal length when performing GWAS. SNP-RHM explained a total of 27.09% of the total genetic variance for adult metacarpal length, whilst Hap-RHM explained 5.24% and SNHap-RHM 27.58% (Table 3); in comparison, GWAS results explained 8.03%.

83 genes overlapped all of the blocks that showed improved model fit across the three adult leg length traits. Of these, three had potential links to bone length and skeletal growth; one overlapping a block associated with both adult foreleg and hindleg length, one overlapping a block associated with adult hindleg length, and one overlapping a block associated with adult metacarpal length. The potential causative gene associated with adult metacarpal length also overlapped haplotype blocks showing improved model fit for lamb metacarpal length (Table 4, Supplementary Table 1).

Adult jaw length

For adult jaw length, model fit was significantly improved for six haplotype blocks for at least one RHM method; one each on chromosomes 1, 3, 11 and 18 and two blocks on chromosome 23. Hap-RHM was shown to improve model fit for all six of these blocks, whilst SNHap-RHM significantly improved model fit for the blocks on chromosomes 1 and 3, and one of the blocks on chromosome 23 when compared to SNP-RHM. The blocks on chromosomes 1 and 3 also showed improved model fit when using SNHap-RHM compared to the null model. (Fig. 5, Supplementary Tables 1 and 10). All six of these blocks are novel associations, as they do not contain SNPs that have previously been found to be associated with adult jaw length. Hap-RHM explained a total of 7.84% of the total genetic variance for adult jaw length, whilst SNHap-RHM explained 9.24% (Table 3). In comparison, previous GWAS results explained 2.39%.

Five genes overlapped these six blocks, though none had a clear association with jaw length or skeletal size (Supplementary Table 1).

Discussion

Summary of results

In total, there were 169 haplotype blocks for which model fit was improved for at least one trait by at least one RHM model. Novel block-trait associations were identified using at least one RHM method for all but four traits. In the case of birth weight and August lamb weight, RHM did not improve model fit for any haplotype blocks in comparison to the null model, whilst in the case of lamb and adult metacarpal length, RHM only improved model fit for previously identified QTL regions.

Across all haplotype blocks for which model fit was improved by at least one RHM method for at least one of the 11 focal traits, there are 351 genes overlapping these blocks. 91 of these genes are completely uncharacterised in sheep and classed as ‘novel genes’, and a further 14 genes are RNA genes (Supplementary Table 1). Of the 246 characterised protein coding genes, 13 genes had functional annotations that relate to the traits for which model fit was improved (Table 4). One of these genes is in a haplotype block associated with lamb jaw length, four in haplotype blocks associated with adult August weight, one associated with adult foreleg and adult hindleg length, one in a haplotype block associated only with adult hindleg length and eight in haplotype blocks associated with lamb metacarpal length (one of which was also associated with adult metacarpal length). One of these genes – PTH1R – was previously identified as a putative causal gene due to its functional data and proximity to top GWAS SNPs for multiple Soay sheep leg length measures (James et al. 2022).

Comparison of RHM models and previous studies

We found that Hap-RHM improved model fit more often than SNP-RHM. This is due in part to the fact that Hap-RHM significantly improved model fit for more traits than SNP-RHM; Hap-RHM improved model fit for at least one block for all traits (with the exception of birth weight and lamb August weight), whilst SNP-RHM only improved model fit for lamb and adult metacarpal lengths. Hap-RHM also improved model fit more often than SNHap-RHM when SNHap-RHM was compared to either the null model or either of the single rGRM models.

Of the 11 traits, lamb August weight and lamb jaw length were the only two to have no previously associated genetic loci (Bérénos et al. 2014; James et al. 2022). Of the traits for which GWAS has previously identified SNP-trait associations, RHM only significantly improved model fit for blocks containing SNPs previously associated with lamb metacarpal length, adult hindleg length and adult metacarpal length on chromosomes 16 and 19. The three RHM models explained a higher proportion of the total genetic variance for each trait in comparison to independent significant SNPs from previous GWAS analyses, with the exception of adult foreleg length and adult hindleg length. It is worth noting that RHM failed to find known associations between chromosome 16 and adult foreleg length, and chromosome 19 and both adult foreleg length and adult hindleg length.

SNP-RHM has previously been performed in a smaller sample of this same population, focusing on only adult morphometric traits (Bérénos et al. 2015). 37 K autosomal SNPs were split into 150 SNP windows with a 75 SNP overlap. When comparing the results of Bérénos et al. (2015) to our results for the same traits, we find six regions for which SNP-RHM improved model fit for Bérénos et al. (2015) and at least one RHM method improved model fit in our own analyses; two regions on chromosome 1 and one region on chromosome 6 were associated with adult August weight, one region on chromosome 6 was associated with adult hindleg length, a region on chromosome 16 associated with adult hindleg and metacarpal length, and a region on chromosome 19 associated with adult metacarpal length.

Novel block-trait associations were identified using at least one RHM method for all but four traits – birth weight, lamb August weight, lamb metacarpal length and adult metacarpal length. In the case of the former two traits, RHM did not improve model fit for any haplotype blocks in comparison to the null model, whilst in the case of the latter two, RHM only significantly improved model fit in the same regions as previously identified QTL for these traits.

Insights into genetic architecture

None of the haplotype blocks for which model fit was significantly improved for adult August weight were within 1 Mb of a previously identified GWAS association. Interestingly, neither SNP-RHM compared to the null model nor SNHap-RHM when compared to Hap-RHM improved model fit for any haplotype blocks for adult August weight. This suggests that the majority of genetic variance contributing to variation in adult August weight is not due to small effect causal variants in LD with genotyped SNPs, but instead due to rare SNPs in LD with rare haplotype alleles or due to multiple SNPs in the same region interacting epistatically. We have previously shown that family-associated non-additive genetic variance such as dominance and epistasis may be making up 37.1% of previous narrow-sense heritability estimations for this trait (James et al. 2023). This finding would be consistent with Hap-RHM detecting regions in which multiple variants are acting in an epistasic manner. We found four genes with functional data suggesting an association with adult August weight that overlapped with haplotype blocks for which model fit was significantly improved by at least one RHM method: LEPR, TBX15, SDCCAG8 and EPHX2. For the blocks overlapping these genes, model fit was significantly improved by the presence of the haplotype GRM; Hap-RHM significantly improved model fit for all of the overlapping blocks, and SNHap-RHM significantly improved model fit when compared to the null model for the block overlapping SDCCAG8 and to null model and SNP-RHM for the blocks overlapping LEPR and EPHX2. This may explain why these regions were not identified as being associated with adult August weight when performing GWAS (James et al. 2022), as the variance influencing adult August weight in those regions is likely due to specific haplotype alleles, rather than individual SNP effects.

The underlying causal variant on chromosome 16 influencing lamb metacarpal length is currently presumed to be the same variant influencing adult hindleg length and adult metacarpal length – the GWAS-significant SNPs on chromosome 16 for adult hindleg length and lamb metacarpal length are the same (James et al. 2022), adult hindleg and metacarpal length have been shown to have a genetic correlation of 0.827 (S.E. 0.232) (Bérénos et al. 2014), and SNP-leg trait associations in this region have been shown to be dependent on each other; when a SNP genotype from this region is fitted during conditional analyses, no new SNP associations appear in this region. We can therefore combine the RHM results for these three traits to characterise the architecture of genetic variance in this region. Whilst SNP-RHM significantly improved model fit for blocks on chromosome 16 that Hap-RHM did not, there were no blocks on chromosome 16 for which Hap-RHM improved model fit but SNP-RHM did not (Supplementary Tables 1, 4, 8 and 9). In fact, in the case of adult hindleg length, Hap-RHM did not improve model fit for any blocks on chromosome 16 (Supplementary Table 8). This suggests that the additive genetic variance being attributed to the rGRMs is due to individual SNP genotypes, rather than due to a specific haplotype allele. The haplotype block containing SNP s22142.1, (the SNP with the lowest p value for lamb metacarpal length and adult hindleg length when performing GWAS (James et al. 2022)) contains 17 SNPs and has 18 haplotype alleles in the population. The minor allele for s22142.1 appears in 3 haplotype alleles, with two of these haplotype alleles being relatively rare (each appearing on 17 chromosomes in the genotyped population).

The underlying causal variant on chromosome 19 influencing lamb metacarpal length is presumed to be the same variant influencing adult metacarpal length – whilst the SNP with the lowest p value when performing GWAS are different for these two traits, they still fall in the same haplotype block (Supplementary Table 11) and when the genotype of each of these SNPs is fitted during conditional analysis, no new SNP-trait associations appear (James et al. 2022). Similarly, we can combine the RHM results for both lamb metacarpal length and adult metacarpal length to characterise the underlying architecture. For both traits, model fit for the block containing the top GWAS SNPs was only significantly improved by SNP-RHM and SNHap-RHM when compared to the null model (and SNHap-RHM compared to Hap-RHM in the case of lamb metacarpal length). This suggests that this association is being driven by the SNP alleles in this region, rather than the haplotype alleles. The haplotype block containing the GWAS SNPs with the lowest p value has 37 SNPs and 52 haplotype alleles in the genotyped population. The minor alleles for each of these SNPs each appear in two haplotype alleles, with one haplotype allele containing both minor SNP alleles. The haplotype alleles each containing one of the minor SNP alleles for these SNPs were both rare in the population (appearing on one and 50 chromosomes in the population).

Limitations of RHM

We have previously shown that pre-correcting for fixed and random effects reduces power of GWAS to detect variant-trait associations (James et al. 2022), as fitting covariates during analyses correctly propagates error throughout the analysis, reducing the chance of false positive results and increasing power by disentangling potential correlations. Pre-correction may therefore explain why we did not see the RHM methods improving model fit for all of the haplotype blocks containing previously identified variants. Currently pre-correction is a necessary step when performing RHM with DISSECT due to DISSECT being unable to fit all of the necessary fixed and random effects during RHM, and it is not possible to extract the haplotype-based GRMs from DISSECT to perform the analyses with different software. It would be interesting to rerun this analysis when suitable software is developed for single-step RHM, to determine whether single-step RHM improved model fit for all haplotype blocks containing significant GWAS associations. In addition, pre-correction may reduce the power to identify significant regions due to correlations between the LOCO GRM and the rGRM (as they are describing relatedness between the same set of related individuals). Using a LOCO GRM to pre-correct rather than a whole genome GRM should help alleviate this to some extent (as the SNPs used in the rGRM are therefore not used in the LOCO GRM), however this does not fully exclude the underlying issue. However, the presence of significant regions in our results suggests that this is not an issue for every region, and may only affect a small number.

Previous analyses using Hap-RHM and SNHap-RHM have proposed using the location of recombination hotspots (Shirali et al. 2018; Oppong et al. 2021), however this was not available for our population. Instead, we estimated haplotype blocks using Plink (Purcell et al. 2007; Purcell 2014). This gave us a total of 48,125 haplotype blocks containing at least two SNPs across the sheep genome. Previously, we have calculated the number of independent tests when performing GWAS using this same SNP density to be 48,635 (James et al. (2022)). Given how close these two figures are (especially as we excluded any haplotype blocks that only contained a single SNP), we are confident in the haplotype blocks that were generated and that identifying recombination hotspots was not necessary for our population.

We chose to exclude haplotype blocks with only one SNP, as the SNP-based and haplotype-based rGRMs would be identical. However, this means that any variance being contributed to the trait by these SNPs has been missed from our analyses. Ideally, SNP density and distribution should be such that each haplotype block has at least two SNPs, so that all haplotype blocks can be included in the analysis.

One limitation of RHM is the challenge of determining whether the identified associations are genuine or simply false positives. This uncertainty can stem from various factors, such as statistical noise or random chance due to the testing of multiple regions. Additionally, the regions flagged as significant may contain complex genetic interactions or overlapping effects, making it difficult to pinpoint the true cause of the association. Therefore, further studies such as functional analyses are required to separate the true associations from any potential false positives.

Concluding remarks

Here, we have demonstrated that RHM methods are a useful tool for detecting regions that contribute genetic variation to traits in a wild population and complement other analyses such as GWAS. We found that Hap-RHM and SNHap-RHM improved model fit for more haplotype blocks than SNP-RHM, but all three can be used together to better characterise the underlying genetic architecture within a region. Using these methods, we detected multiple haplotype blocks that improved model fit with at least one RHM method. From these regions, we characterised the genetic regions influencing trait variation and identified 13 potential causal genes that have not previously been associated with variation in these traits in the Soay population.

Data availability

All scripts and data can be found at https://github.com/CaelinnJames/RegionalHeritabilityMapping_SoaySheep.

References

Abdulkarim B, Nicolino M, Igoillo-Esteve M, Daures M, Romero S, Philippi A et al. (2015) A Missense Mutation in PPP1R15B Causes a Syndrome Including Diabetes, Short Stature, and Microcephaly. Diabetes 64:3951–3962
Article PubMed PubMed Central CAS Google Scholar
Abousoliman I, Reyer H, Oster M, Murani E, Mohamed I, Wimmers K (2021) Genome-Wide Analysis for Early Growth-Related Traits of the Locally Adapted Egyptian Barki Sheep. Genes (Basel) 12
Ashraf B, Hunter DC, Bérénos C, Ellis PA, Johnston SE, Pilkington JG et al. (2021) Genomic prediction in the wild: a case study in Soay sheep. Mol Ecol 31(24):6541–6555
Article PubMed Google Scholar
Baujat G, Le Merrer M (2007) Ellis-Van Creveld syndrome. Orphanet J Rare Dis 2:27
Article PubMed PubMed Central Google Scholar
Beraldi D, McRae AF, Gratten J, Slate J, Visscher PM, Pemberton JM (2007) Mapping quantitative trait loci underlying fitness-related traits in a free-living sheep population. Evolution 61(6):1403–1416
Article PubMed Google Scholar
Bérénos C, Ellis PA, Pilkington JG, Pemberton JM (2014) Estimating quantitative genetic parameters in wild populations: a comparison of pedigree and genomic approaches. Mol Ecol 23(14):3434–3451
Article PubMed PubMed Central Google Scholar
Bérénos C, Ellis PA, Pilkington JG, Lee SH, Gratten J, Pemberton JM (2015) Heterogeneity of genetic architecture of body size traits in a free-living population. Mol Ecol 24(8):1810–1830
Article PubMed PubMed Central Google Scholar
Boopathi NM (2013) QTL Identification. Genetic mapping and marker assisted selection: basics, practice and benefits. Springer India, India, p 117–163
Butler DG, Cullis BR, Gilmour AR, Gogel BG, Thompson R (2017) ASReml-R reference manual version 4. Hemel Hempstead, HP1 1ES. VSN International Ltd, UK
Canela-Xandri O, Law A, Gray A, Woolliams JA, Tenesa A (2015) A new tool called DISSECT for analysing large genomic data sets using a Big Data approach. Nat Commun 6(1):10162
Article PubMed CAS Google Scholar
Chagnon YC, Chung WK, Pérusse L, Chagnon M, Leibel RL, Bouchard C (1999) Linkages and associations between the leptin receptor (LEPR) gene and human body composition in the Québec Family Study. Int J Obes 23(3):278–286. https://doi.org/10.1038/sj.ijo.0800809
Article CAS Google Scholar
Clutton-Brock TH, Pemberton JM (2003) Soay sheep: dynamics and selection in an island population. Cambridge University Press, Cambridge, (eds)
Book Google Scholar
Delaneau O, Zagury J-F, Robinson MR, Marchini JL, Dermitzakis ET (2019) Accurate, scalable and integrative haplotype estimation. Nat Commun 10(1):5436
Article PubMed PubMed Central Google Scholar
Diez-Roux G, Banfi S, Sultan M, Geffers L, Anand S, Rozado D et al. (2011) A high-resolution anatomical atlas of the transcriptome in the mouse embryo. PLoS Biol 9:e1000582
Article PubMed PubMed Central CAS Google Scholar
Du Y, Xie J, Chang W, Han Y, Cao G (2012) Genome-wide association studies: inherent limitations and future challenges. Front Med 6(4):444–450
Article PubMed Google Scholar
Duchatelet S, Ostergaard E, Cortes D, Lemainque A, Julier C (2005) Recessive mutations in PTHR1 cause contrasting skeletal dysplasias in Eiken and Blomstrand syndromes. Hum Mol Genet 14:1–5
Article PubMed CAS Google Scholar
Durinck S, Spellman PT, Birney E, Huber W (2009) Mapping identifiers for the integration of genomic datasets with the R/Bioconductor package biomaRt. Nat Protoc 4(8):1184–1191
Article PubMed PubMed Central CAS Google Scholar
Durinck S, Moreau Y, Kasprzyk A, Davis S, De Moor B, Brazma A et al. (2005) BioMart and Bioconductor: a powerful link between biological databases and microarray data analysis. Bioinformatics 21(16):3439–3440
Article PubMed CAS Google Scholar
Gao X, Starmer J, Martin ER (2008) A multiple testing correction method for genetic association studies using correlated single nucleotide polymorphisms. Genet Epidemiol 32(4):361–369
Article PubMed Google Scholar
Heid IM, Jackson AU, Randall JC, Winkler TW, Qi L, Steinthorsdottir V et al. (2010) Meta-analysis identifies 13 new loci associated with waist-hip ratio and reveals sexual dimorphism in the genetic basis of fat distribution. Nat Genet 42:949–960
Article PubMed PubMed Central CAS Google Scholar
Hickey JM, Kinghorn BP, Tier B, van der Werf JH, Cleveland MA (2012) A phasing and imputation method for pedigreed populations that results in a single-stage genomic evaluation. Genet Sel Evol 44(1):9
Article PubMed PubMed Central Google Scholar
Hirschhorn JN, Daly MJ (2005) Genome-wide association studies for common diseases and complex traits. Nat Rev Genet 6(2):95–108
Article PubMed CAS Google Scholar
Howe KL, Achuthan P, Allen J, Allen J, Alvarez-Jarreta J, Amode MR et al. (2020) Ensembl 2021. Nucleic Acids Res 49(D1):D884–D891
Article PubMed Central Google Scholar
Hunter DC, Ashraf B, Bérénos C, Ellis PA, Johnston SE, Wilson AJ et al. (2022) Using genomic prediction to detect microevolutionary change of a quantitative trait. Proc Biol Sci 289(1974):20220330
PubMed PubMed Central CAS Google Scholar
Israel D, Chua S Jr (2010) Leptin receptor modulation of adiposity and fertility Trends Endocrinol Metab 21:10–16
Article PubMed CAS Google Scholar
James C, Pemberton JM, Navarro P, Knott S (2022) The impact of SNP density on quantitative genetic analyses of body size traits in a wild population of Soay sheep. Ecol Evol 12(12):e9639
Article PubMed PubMed Central Google Scholar
James C, Pemberton JM, Navarro P, Knott S (2023) Investigating pedigree- and SNP-associated components of heritability in a wild population of Soay sheep. Heredity 132(4):202–210
Article Google Scholar
Khadir A, Kavalakatt S, Madhu D, Cherian P, Al-Mulla F, Abubaker J et al. (2020) Soluble Epoxide Hydrolase 2 Expression Is Elevated in Obese Humans and Decreased by Physical Activity. Int J Mol Sci 21
Kim MJ, Kim S, Kim Y, Jin EJ, Sonn JK (2012) Inhibition of RhoA but not ROCK induces chondrogenesis of chick limb mesenchymal cells. Biochem Biophys Res Commun 418:500–505
Article PubMed CAS Google Scholar
Kotake S, Nanke Y, Kawamoto M, Yago T, Udagawa N, Ichikawa N et al. (2009) T-cell leukemia translocation-associated gene (TCTA) protein is required for human osteoclastogenesis. Bone 45:627–639
Article PubMed CAS Google Scholar
Luderer HF, Bai S, Longmore GD (2008) The LIM protein LIMD1 influences osteoblast differentiation and function. Exp Cell Res 314:2884–2894
Article PubMed PubMed Central CAS Google Scholar
Luther JM, Brown NJ (2016) Epoxyeicosatrienoic acids and glucose homeostasis in mice and men. Prostaglandins Other Lipid Mediat 125:2–7
Article PubMed PubMed Central CAS Google Scholar
Macé T, González-García E, Foulquié D, Carrière F, Pradel J, Durand C et al. (2022) Genome-wide analyses reveal a strong association between LEPR gene variants and body fat reserves in ewes. BMC Genomics 23:412
Article PubMed PubMed Central Google Scholar
Min Ko J, Jung S, Seo J, Ho Shin C, Il Cheong H, Choi M et al. (2016) SOFT syndrome caused by compound heterozygous mutations of POC1A and its skeletal manifestation. J Hum Genet 61:561–564
Article CAS Google Scholar
Murgiano L, Jagannathan V, Benazzi C, Bolcato M, Brunetti B, Muscatello LV et al. (2014) Deletion in the EVC2 gene causes chondrodysplastic dwarfism in Tyrolean Grey cattle. PLoS One 9:e94861
Article PubMed PubMed Central Google Scholar
Nagamine Y, Pong-Wong R, Navarro P, Vitart V, Hayward C, Rudan I et al. (2012) Localising loci underlying complex trait variation using Regional Genomic Relationship Mapping. PloS ONE 7(10):e46501
Article PubMed PubMed Central CAS Google Scholar
NCBI Gene (2004–2023). https://www.ncbi.nlm.nih.gov/gene/
Negishi-Koga T, Shinohara M, Komatsu N, Bito H, Kodama T, Friedel RH et al. (2011) Suppression of bone formation by osteoclastic expression of semaphorin 4D. Nat Med 17:1473–1480
Article PubMed CAS Google Scholar
Oppong RF, Boutin T, Campbell A, McIntosh AM, Porteous D, Hayward C et al. (2021) SNP and haplotype regional heritability mapping (SNHap-RHM): joint mapping of common and rare variation affecting complex traits. Front Genet 12:791712
Article PubMed CAS Google Scholar
Pan DZ, Miao Z, Comenho C, Rajkumar S, Koka A, Lee SHT et al. (2021) Identification of TBX15 as an adipose master trans regulator of abdominal obesity genes. Genome Med 13:123
Article PubMed PubMed Central CAS Google Scholar
Plink v 1.90b4. (2014). http://pngu.mgh.harvard.edu/purcell/plink/
Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR, Bender D et al. (2007) PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 81(3):559–575
Article PubMed PubMed Central CAS Google Scholar
Qiu T, Xian L, Crane J, Wen C, Hilton M, Lu W et al. (2015) PTH receptor signaling in osteoblasts regulates endochondral vascularization in maintenance of postnatal growth plate. J Bone Min Res 30:309–317
Article CAS Google Scholar
Ros-Freixedes R, Gol S, Pena RN, Tor M, Ibáñez-Escriche N, Dekkers JC et al. (2016) Genome-Wide Association Study Singles Out SCD and LEPR as the Two Main Loci Influencing Intramuscular Fat Content and Fatty Acid Composition in Duroc Pigs. PLoS One 11:e0152496
Article PubMed PubMed Central Google Scholar
Schaefer E, Zaloszyc A, Lauer J, Durand M, Stutzmann F, Perdomo-Trujillo Y et al. (2011) Mutations in SDCCAG8/NPHP10 Cause Bardet-Biedl Syndrome and Are Associated with Penetrant Renal Disease and Absent Polydactyly. Mol Syndromol 1:273–281
Article PubMed PubMed Central CAS Google Scholar
Scherag A, Kleber M, Boes T, Kolbe AL, Ruth A, Grallert H et al. (2012) SDCCAG8 obesity alleles and reduced weight loss after a lifestyle intervention in overweight children and adolescents. Obes (Silver Spring) 20:466–470
Article CAS Google Scholar
Schipani E, Provot S (2003) PTHrP, PTH, and the PTH/PTHrP receptor in endochondral bone development. Birth Defects Res C Embryo Today 69:352–362
Article PubMed CAS Google Scholar
Self SG, Liang K-Y (1987) Asymptotic properties of maximum likelihood estimators and likelihood ratio tests under nonstandard conditions. J Am Stat Assoc 82(398):605–610
Article Google Scholar
Shirali M, Knott SA, Pong-Wong R, Navarro P, Haley CS (2018) Haplotype heritability mapping method uncovers missing heritability of complex traits. Sci Rep. 8(1):4982
Article PubMed PubMed Central Google Scholar
Solé E, Ros-Freixedes R, Tor M, Reixach J, Pena RN, Estany J (2021) Antagonistic maternal and direct effects of the leptin receptor gene on body weight in pigs. PLoS One 16:e0246198
Article PubMed PubMed Central Google Scholar
Stoffel MA, Johnston SE, Pilkington JG, Pemberton JM (2021) Genetic architecture and lifetime dynamics of inbreeding depression in a wild mammal. Nat Commun 12(1):2972
Article PubMed PubMed Central CAS Google Scholar
Sun W, Zhao X, Wang Z, Chu Y, Mao L, Lin S et al. (2019) Tbx15 is required for adipocyte browning induced by adrenergic signaling pathway. Mol Metab 28:48–57
Article PubMed PubMed Central CAS Google Scholar
VanRaden PM (2008) Efficient methods to compute genomic predictions. J Dairy Sci 91(11):4414–4423
Article PubMed CAS Google Scholar
Wang J, Xu C, Zhang J, Bao Y, Tang Y, Lv X et al. (2023) RhoA promotes osteoclastogenesis and regulates bone remodeling through mTOR-NFATc1 signaling. Mol Med 29:49
Article PubMed PubMed Central CAS Google Scholar
Yang J, Zaitlen NA, Goddard ME, Visscher PM, Price AL (2014) Advantages and pitfalls in the application of mixed-model association methods. Nat Genet 46(2):100–106
Article PubMed PubMed Central Google Scholar
Yang J, Benyamin B, McEvoy BP, Gordon S, Henders AK, Nyholt DR et al. (2010) Common SNPs explain a large proportion of the heritability for human height. Nat Genet 42(7):565–569
Article PubMed PubMed Central CAS Google Scholar
Ye X, Song G, Fan M, Shi L, Jabs EW, Huang S et al. (2006) A novel heterozygous deletion in the EVC2 gene causes Weyers acrofacial dysostosis. Hum Genet 119:199–205
Article PubMed Google Scholar
Yiannakouris N, Yannakoulia M, Melistas L, Chan JL, Klimis-Zacas D, Mantzoros CS (2001) The Q223R polymorphism of the leptin receptor gene is significantly associated with obesity and predicts a small percentage of body weight and body composition variability. J Clin Endocrinol Metab 86:4434–4439
Article PubMed CAS Google Scholar

Download references

Acknowledgements

We thank the National Trust for Scotland for permission to work on St Kilda and QinetiQ, Eurest and Kilda Cruises for logistics and other support on the island. We also thank all those who have been involved in the long-term project, including those who helped with field work on the island. We thank the Wellcome Trust Clinical Research Facility Genetics Core in Edinburgh for SNP genotyping. We thank Dr Eilidh Fummey for her hard work on implementing SNHap-RHM functionality into DISSECT.

Author information

Authors and Affiliations

Institute of Ecology and Evolution, School of Biological Sciences, The University of Edinburgh, Edinburgh, UK
Caelinn James, Josephine M. Pemberton & Sara Knott
Scotland’s Rural College (SRUC), The Roslin Institute Building, Midlothian, UK
Caelinn James
The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Midlothian, UK
Pau Navarro
MRC Human Genetics Unit, Institute of Genetics and Cancer, The University of Edinburgh, Edinburgh, UK
Pau Navarro

Authors

Caelinn James
View author publications
Search author on:PubMed Google Scholar
Josephine M. Pemberton
View author publications
Search author on:PubMed Google Scholar
Pau Navarro
View author publications
Search author on:PubMed Google Scholar
Sara Knott
View author publications
Search author on:PubMed Google Scholar

Contributions

CJ conducted analyses and drafted the manuscript. JMP, PN and SK helped with analyses and interpretations of results. All authors contributed to revisions.

Corresponding author

Correspondence to Caelinn James.

Ethics declarations

Competing interests

The authors declare no competing interests.

Research ethics statement

N/A

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Associate editor: Joram Mwacharo.

Supplementary information

Supplementary Material

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

James, C., Pemberton, J.M., Navarro, P. et al. Evaluating regional heritability mapping methods for identifying QTLs in a wild population of Soay sheep. Heredity 134, 374–386 (2025). https://doi.org/10.1038/s41437-025-00770-0

Download citation

Received: 07 June 2024
Revised: 08 May 2025
Accepted: 08 May 2025
Published: 23 May 2025
Version of record: 23 May 2025
Issue date: June 2025
DOI: https://doi.org/10.1038/s41437-025-00770-0

Subjects

Abstract

Similar content being viewed by others

Investigating pedigree- and SNP-associated components of heritability in a wild population of Soay sheep

Estimates of genomic heritability and genome-wide association studies for blood parameters in Akkaraman sheep

Regional heritability mapping identifies several novel loci (STAT4, ULK4, and KCNH5) for primary biliary cholangitis in the Japanese population

Introduction

Methods

Phenotypic data

Genetic data

Splitting the genome into regions

Pre-correction of phenotypes

Regional heritability mapping

SNP-RHM

Hap-RHM

SNHap-RHM

Null model and multiple testing

Comparison with GWAS

Identification of candidate genes

Results

Soay sheep haplotype blocks

Comparison of RHM

Birth weight and lamb August weight

Lamb leg length traits

Lamb jaw length

Adult August weight

Adult leg length traits

Adult jaw length

Discussion

Summary of results

Comparison of RHM models and previous studies

Insights into genetic architecture

Limitations of RHM

Concluding remarks

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Research ethics statement

Additional information

Supplementary information

Supplementary Material

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links