Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Review Article
  • Published:

Towards improved fine-mapping of candidate causal variants

Abstract

Fine-mapping in genome-wide association studies aims to identify potentially causal genetic variants among a set of candidate variants that are often highly correlated with each other owing to linkage disequilibrium. A variety of statistical approaches are used in fine-mapping, almost all of which are based on a multiple regression framework to model the relationship between genotype and phenotype, while accommodating specific assumptions about the distribution of variant effect sizes and using different inference algorithms. Owing to their modelling flexibility and the ease of making inferential statements, these approaches are predominantly Bayesian in nature. Recently, these approaches have been improved by refining modelling assumptions, integrating additional information, accommodating summary statistics, and developing scalable computational algorithms that improve computation efficiency and fine-mapping resolution.

This is a preview of subscription content, access via your institution

Access options

Buy this article

USD 39.95

Prices may be subject to local taxes which are calculated during checkout

Fig. 1: Difference in fine-mapping approaches illustrated in a simple scenario.
Fig. 2: A general workflow for fine-mapping analysis with summary statistics.
Fig. 3: Incorporating additional information enhances the power of fine-mapping.
Fig. 4: Analytic challenges in statistical fine-mapping.

Similar content being viewed by others

References

  1. Visscher, P. M. et al. 10 years of GWAS discovery: biology, function, and translation. Am. J. Hum. Genet. 101, 5–22 (2017).

    PubMed  PubMed Central  CAS  Google Scholar 

  2. Uffelmann, E. et al. Genome-wide association studies. Nat. Rev. Methods Primers 1, 59 (2021).

    CAS  Google Scholar 

  3. Sollis, E. et al. The NHGRI-EBI GWAS Catalog: knowledgebase and deposition resource. Nucleic Acids Res. 51, D977–D985 (2023).

    PubMed  CAS  Google Scholar 

  4. Loos, R. J. F. 15 years of genome-wide association studies and no signs of slowing down. Nat. Commun. 11, 5900 (2020).

    PubMed  PubMed Central  CAS  Google Scholar 

  5. Wall, J. D. & Pritchard, J. K. Haplotype blocks and linkage disequilibrium in the human genome. Nat. Rev. Genet. 4, 587–597 (2003).

    PubMed  CAS  Google Scholar 

  6. Spain, S. L. & Barrett, J. C. Strategies for fine-mapping complex traits. Hum. Mol. Genet. 24, R111–R119 (2015).

    PubMed  PubMed Central  CAS  Google Scholar 

  7. Schaid, D. J., Chen, W. & Larson, N. B. From genome-wide associations to candidate causal variants by statistical fine-mapping. Nat. Rev. Genet. 19, 491–504 (2018).

    PubMed  PubMed Central  CAS  Google Scholar 

  8. Broekema, R. V., Bakker, O. B. & Jonkers, I. H. A practical view of fine-mapping and gene prioritization in the post-genome-wide association era. Open Biol. 10, 190221 (2020).

    PubMed  PubMed Central  CAS  Google Scholar 

  9. Hutchinson, A., Asimit, J. & Wallace, C. Fine-mapping genetic associations. Hum. Mol. Genet. 29, R81–R88 (2020).

    PubMed  PubMed Central  Google Scholar 

  10. Wang, Q. S. & Huang, H. Methods for statistical fine-mapping and their applications to auto-immune diseases. Semin. Immunopathol. 44, 101–113 (2022).

    PubMed  PubMed Central  CAS  Google Scholar 

  11. Gao, B. & Zhou, X. MESuSiE enables scalable and powerful multi-ancestry fine-mapping of causal variants in genome-wide association studies. Nat. Genet. 56, 170–179 (2024). This paper proposes a multi-ancestry fine-mapping framework that explicitly models both shared and ancestry-specific causal variants, leading to improved accuracy and resolution of fine-mapping.

    PubMed  PubMed Central  CAS  Google Scholar 

  12. Yuan, K. et al. Fine-mapping across diverse ancestries drives the discovery of putative causal variants underlying human complex traits and diseases. Nat. Genet. 56, 1841–1850 (2024).

    PubMed  PubMed Central  CAS  Google Scholar 

  13. Kichaev, G. et al. Improved methods for multi-trait fine mapping of pleiotropic risk loci. Bioinformatics 33, 248–255 (2017).

    PubMed  CAS  Google Scholar 

  14. Zou, Y., Carbonetto, P., Xie, D., Wang, G. & Stephens, M. Fast and flexible joint fine-mapping of multiple traits via the sum of single effects model. Preprint at bioRxiv https://doi.org/10.1101/2023.04.14.536893 (2024).

  15. Kachuri, L. et al. Principles and methods for transferring polygenic risk scores across global populations. Nat. Rev. Genet. 25, 8–25 (2024).

    PubMed  CAS  Google Scholar 

  16. Wellcome Trust Case Control Consortium et al. Bayesian refinement of association signals for 14 loci in 3 common diseases. Nat. Genet. 44, 1294–1301 (2012).

    Google Scholar 

  17. Hormozdiari, F., Kostem, E., Kang, E. Y., Pasaniuc, B. & Eskin, E. Identifying causal variants at loci with multiple signals of association. Genetics 198, 497–508 (2014).

    PubMed  PubMed Central  CAS  Google Scholar 

  18. van de Bunt, M. et al. Evaluating the performance of fine-mapping strategies at common variant GWAS loci. PLoS Genet. 11, e1005535 (2015).

    PubMed  PubMed Central  Google Scholar 

  19. Wang, G., Sarkar, A., Carbonetto, P. & Stephens, M. A simple new approach to variable selection in regression, with application to genetic fine mapping. J. R. Stat. Soc. B 82, 1273–1300 (2020). This paper proposes a seminal statistical framework — the ‘sum of single effects’ model, called SuSiE — for fine-mapping that benefits both accuracy and computational efficiency.

    Google Scholar 

  20. Liu, L. et al. Conditional transcriptome-wide association study for fine-mapping candidate causal genes. Nat. Genet. 56, 348–356 (2024). This paper proposes a frequentist TWAS fine-mapping method that leverages the relatively small number of genes within each locus to systematically fine-map causal genes by conditioning on all other genes in the region.

    PubMed  CAS  Google Scholar 

  21. Yang, J. et al. Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits. Nat. Genet. 44, 369–375 (2012).

    PubMed  PubMed Central  CAS  Google Scholar 

  22. Keaton, J. M. et al. Genome-wide analysis in over 1 million individuals of European ancestry yields improved polygenic risk scores for blood pressure traits. Nat. Genet. 56, 778–791 (2024).

    PubMed  PubMed Central  CAS  Google Scholar 

  23. Weng, L. C. et al. The impact of common and rare genetic variants on bradyarrhythmia development. Nat. Genet. 57, 53–64 (2025).

    PubMed  PubMed Central  CAS  Google Scholar 

  24. Chen, W. et al. Fine mapping causal variants with an approximate Bayesian method using marginal test statistics. Genetics 200, 719–736 (2015).

    PubMed  PubMed Central  Google Scholar 

  25. Benner, C. et al. FINEMAP: efficient variable selection using summary data from genome-wide association studies. Bioinformatics 32, 1493–1501 (2016). This paper proposes a shotgun stochastic search algorithm for fine-mapping that substantially improves computational efficiency, enabling the exploration of configurations with more than a few causal variants.

    PubMed  PubMed Central  CAS  Google Scholar 

  26. Wen, X., Lee, Y., Luca, F. & Pique-Regi, R. Efficient integrative multi-SNP association analysis via deterministic approximation of posteriors. Am. J. Hum. Genet. 98, 1114–1129 (2016). This paper proposes a deterministic approximation of posteriors algorithm for fine-mapping that enables highly scalable and accurate identification of causal variants.

    PubMed  PubMed Central  CAS  Google Scholar 

  27. Stephens, M. & Balding, D. J. Bayesian statistical methods for genetic association studies. Nat. Rev. Genet. 10, 681–690 (2009).

    PubMed  CAS  Google Scholar 

  28. Weissbrod, O. et al. Functionally informed fine-mapping and polygenic localization of complex trait heritability. Nat. Genet. 52, 1355–1363 (2020). This paper proposes a statistical framework that leverages genome-wide functional annotations by coupling an extended version of stratified LD score regression with existing fine-mapping methods, leading to substantially improved fine-mapping power.

    PubMed  PubMed Central  CAS  Google Scholar 

  29. Yang, Z. K. et al. CARMA is a new Bayesian model for fine-mapping in genome-wide association meta-analyses. Nat. Genet. 55, 1057–1065 (2023).

    PubMed  CAS  Google Scholar 

  30. Zou, Y., Carbonetto, P., Wang, G. & Stephens, M. Fine-mapping from summary data with the “Sum of Single Effects” model. PLoS Genet. 18, e1010299 (2022). This paper systematically investigates summary statistics-based fine-mapping, presenting a generic strategy for extending methods to summary data, diagnostic tools for identifying inconsistencies and approaches for improving summary data consistency.

    PubMed  PubMed Central  CAS  Google Scholar 

  31. Wu, T. T., Chen, Y. F., Hastie, T., Sobel, E. & Lange, K. Genome-wide association analysis by lasso penalized logistic regression. Bioinformatics 25, 714–721 (2009).

    PubMed  PubMed Central  CAS  Google Scholar 

  32. Fisher, V., Sebastiani, P., Cupples, L. A. & Liu, C. T. ANNORE: genetic fine-mapping with functional annotation. Hum. Mol. Genet. 31, 32–40 (2022).

    CAS  Google Scholar 

  33. Chen, W. et al. Improved analyses of GWAS summary statistics by reducing data heterogeneity and errors. Nat. Commun. 12, 7117 (2021).

    PubMed  PubMed Central  CAS  Google Scholar 

  34. Kanai, M. et al. Meta-analysis fine-mapping is often miscalibrated at single-variant resolution. Cell Genom. 2, 100210 (2022). This paper demonstrates that the inter-cohort heterogeneity from multiple sources can impair the calibration of fine-mapping when using summary statistics from GWAS meta-analyses, and proposes a quality control method — SLALOM — to mitigate this issue.

    PubMed  PubMed Central  CAS  Google Scholar 

  35. Kichaev, G. et al. Integrating functional data to prioritize causal variants in statistical fine-mapping studies. PLoS Genet. 10, e1004722 (2014).

    PubMed  PubMed Central  Google Scholar 

  36. LaPierre, N. et al. Identifying causal variants by fine mapping across multiple studies. PLoS Genet. 17, e1009733 (2021).

    PubMed  PubMed Central  CAS  Google Scholar 

  37. Li, A. et al. mBAT-combo: a more powerful test to detect gene-trait associations from GWAS data. Am. J. Hum. Genet. 110, 30–43 (2023).

    PubMed  PubMed Central  CAS  Google Scholar 

  38. Pirinen, M., Donnelly, P. & Spencer, C. C. A. Efficient computation with a linear mixed model on large-scale data sets with applications to genetic studies. Ann. Appl. Stat. 7, 369–390 (2013).

    Google Scholar 

  39. Zhang, H., He, K., Li, Z., Tsoi, L. C. & Zhou, X. FABIO: TWAS fine-mapping to prioritize causal genes for binary traits. PLoS Genet. 20, e1011503 (2024).

    PubMed  PubMed Central  CAS  Google Scholar 

  40. Zhou, W. et al. Efficiently controlling for case-control imbalance and sample relatedness in large-scale genetic association studies. Nat. Genet. 50, 1335–1341 (2018).

    PubMed  PubMed Central  CAS  Google Scholar 

  41. Guan, Y. T. & Stephens, M. Bayesian variable selection regression for genome-wide association studies and other large-scale problems. Ann. Appl. Stat. 5, 1780–1815 (2011). This paper represents one of the earliest work that applies the Bayesian variable selection regression for fine-mapping and underlies many of the following developments.

    Google Scholar 

  42. Newcombe, P. J., Conti, D. V. & Richardson, S. JAM: a scalable Bayesian framework for joint analysis of marginal SNP effects. Genet. Epidemiol. 40, 188–201 (2016).

    PubMed  PubMed Central  Google Scholar 

  43. Wallace, C. et al. Dissection of a complex disease susceptibility region using a Bayesian stochastic search approach to fine mapping. PLoS Genet. 11, e1005272 (2015).

    PubMed  PubMed Central  Google Scholar 

  44. Li, X., Sham, P. C. & Zhang, Y. D. A Bayesian fine-mapping model using a continuous global-local shrinkage prior with applications in prostate cancer analysis. Am. J. Hum. Genet. 111, 213–226 (2024).

    PubMed  PubMed Central  CAS  Google Scholar 

  45. Karhunen, V., Launonen, I., Jarvelin, M. R., Sebert, S. & Sillanpaa, M. J. Genetic fine-mapping from summary data using a nonlocal prior improves the detection of multiple causal variants. Bioinformatics 39, btad396 (2023).

    PubMed  PubMed Central  CAS  Google Scholar 

  46. Wakefield, J. Bayes factors for genome-wide association studies: comparison with P-values. Genet. Epidemiol. 33, 79–86 (2009).

    PubMed  Google Scholar 

  47. Lee, Y., Luca, F., Pique-Regi, R. & Wen, X. Bayesian multi-SNP genetic association analysis: control of FDR and use of summary statistics. Preprint at bioRxiv https://doi.org/10.1101/316471 (2018).

  48. Flutre, T., Wen, X. Q., Pritchard, J. & Stephens, M. A statistical framework for joint eQTL analysis in multiple tissues. PLoS Genet. 9, e1003486 (2013).

    PubMed  PubMed Central  CAS  Google Scholar 

  49. Cui, R. et al. Improving fine-mapping by modeling infinitesimal effects. Nat. Genet. 56, 162–169 (2024). This paper proposes a fine-mapping method that relies on a SuSiE-based variational algorithm to fit BSLMM, which models both infinitesimal effects of all SNPs and large effects of a small subset of SNPs, to substantially reduce replication failure rate in real data.

    PubMed  CAS  Google Scholar 

  50. Zhang, W., Najafabadi, H. & Li, Y. SparsePro: an efficient fine-mapping method integrating summary statistics and functional annotations. PLoS Genet. 19, e1011104 (2023).

    PubMed  PubMed Central  Google Scholar 

  51. Lu, Z. et al. Improved multi-ancestry fine-mapping identifies cis-regulatory variants underlying molecular traits and disease risk. Preprint at medRxiv https://doi.org/10.1101/2024.04.15.24305836 (2024).

  52. Rossen, J. et al. MultiSuSiE improves multi-ancestry fine-mapping in all of us whole-genome sequencing data. Preprint at medRxiv https://doi.org/10.1101/2024.05.13.24307291 (2024).

  53. Zhang, X., Jiang, W. & Zhao, H. Integration of expression QTLs with fine mapping via SuSiE. PLoS Genet. 20, e1010929 (2024).

    PubMed  PubMed Central  CAS  Google Scholar 

  54. Zhao, S. et al. Adjusting for genetic confounders in transcriptome-wide association studies improves discovery of risk genes of complex traits. Nat. Genet. 56, 336–347 (2024). This paper proposes a TWAS fine-mapping method that adapts SuSiE to fine-map genetically regulated expression of genes while controlling for horizontal pleiotropic effects of SNPs.

    PubMed  PubMed Central  CAS  Google Scholar 

  55. Strober, B. J., Zhang, M. J., Amariuta, T., Rossen, J. & Price, A. L. Fine-mapping causal tissues and genes at disease-associated loci. Nat. Genet. 57, 42–52 (2025).

    PubMed  PubMed Central  CAS  Google Scholar 

  56. Akdeniz, B. C. et al. Finemap-MiXeR: a variational Bayesian approach for genetic finemapping. PLoS Genet. 20, e1011372 (2024).

    PubMed  PubMed Central  CAS  Google Scholar 

  57. Cai, M. et al. XMAP: cross-population fine-mapping by leveraging genetic diversity and accounting for confounding bias. Nat. Commun. 14, 6870 (2023).

    PubMed  PubMed Central  CAS  Google Scholar 

  58. Blei, D. M., Kucukelbir, A. & McAuliffe, J. D. Variational inference: a review for statisticians. J. Am. Stat. Assoc. 112, 859–877 (2017).

    CAS  Google Scholar 

  59. Carbonetto, P. & Stephens, M. Scalable variational inference for Bayesian variable selection in regression, and its accuracy in genetic association studies. Bayesian Anal. 7, 73–107 (2012).

    Google Scholar 

  60. Servin, B. & Stephens, M. Imputation-based analysis of association studies: candidate regions and quantitative traits. PLoS Genet. 3, e114 (2007).

    PubMed  PubMed Central  Google Scholar 

  61. Ma, Y. & Zhou, X. Genetic prediction of complex traits with polygenic scores: a statistical review. Trends Genet. 37, 995–1011 (2021).

    PubMed  PubMed Central  CAS  Google Scholar 

  62. Zhou, X., Carbonetto, P. & Stephens, M. Polygenic modeling with Bayesian sparse linear mixed models. PLoS Genet. 9, e1003264 (2013). This paper proposes BSLMM, a model that bridges the gap between sparse and infinitesimal genetic architectures to enable fine-mapping in the presence of a polygenic background, laying the groundwork for later methods such as SuSiE-inf and XMAP.

    PubMed  PubMed Central  CAS  Google Scholar 

  63. Zheng, Z. et al. Leveraging functional genomic annotations and genome coverage to improve polygenic prediction of complex traits within and between ancestries. Nat. Genet. 56, 767–777 (2024).

    PubMed  PubMed Central  CAS  Google Scholar 

  64. Wu, Y. et al. Genome-wide fine-mapping improves identification of causal variants. Preprint at medRxiv https://doi.org/10.1101/2024.07.18.24310667 (2024).

  65. Gjoka, A. & Cordell, H. J. Fine-mapping the results from genome-wide association studies of primary biliary cholangitis using SuSiE and h2-D2. Genet. Epidemiol. 49, e22592 (2025).

    PubMed  CAS  Google Scholar 

  66. The 1000 Genomes Project Consortium. A global reference for human genetic variation. Nature 526, 68–74 (2015).

    Google Scholar 

  67. The ENCODE Project Consortium. An integrated encyclopedia of DNA elements in the human genome. Nature 489, 57–74 (2012).

    PubMed Central  Google Scholar 

  68. Roadmap Epigenomics Consortium et al. Integrative analysis of 111 reference human epigenomes. Nature 518, 317–330 (2015).

    PubMed Central  Google Scholar 

  69. Kircher, M. et al. A general framework for estimating the relative pathogenicity of human genetic variants. Nat. Genet. 46, 310–315 (2014).

    PubMed  PubMed Central  CAS  Google Scholar 

  70. McLaren, W. et al. The Ensembl Variant Effect Predictor. Genome Biol. 17, 122 (2016).

    PubMed  PubMed Central  Google Scholar 

  71. Wang, K., Li, M. & Hakonarson, H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 38, e164 (2010).

    PubMed  PubMed Central  Google Scholar 

  72. Gazal, S. et al. Functional architecture of low-frequency variants highlights strength of negative selection across coding and non-coding annotations. Nat. Genet. 50, 1600–1607 (2018).

    PubMed  PubMed Central  CAS  Google Scholar 

  73. Gazal, S. et al. Linkage disequilibrium-dependent architecture of human complex traits shows action of negative selection. Nat. Genet. 49, 1421–1427 (2017).

    PubMed  PubMed Central  CAS  Google Scholar 

  74. Pickrell, J. K. Joint analysis of functional genomic data and genome-wide association studies of 18 human traits. Am. J. Hum. Genet. 94, 559–573 (2014).

    PubMed  PubMed Central  CAS  Google Scholar 

  75. Alenazi, A. A., Cox, A., Juarez, M., Lin, W. Y. & Walters, K. Bayesian variable selection using partially observed categorical prior information in fine-mapping association studies. Genet. Epidemiol. 43, 690–703 (2019).

    PubMed  Google Scholar 

  76. Jiang, J. et al. Functional annotation and Bayesian fine-mapping reveals candidate genes for important agronomic traits in Holstein bulls. Commun. Biol. 2, 212 (2019).

    PubMed  PubMed Central  Google Scholar 

  77. Wang, Q. S. et al. Leveraging supervised learning for functionally informed fine-mapping of cis-eQTLs identifies an additional 20,913 putative causal eQTLs. Nat. Commun. 12, 3394 (2021).

    PubMed  PubMed Central  CAS  Google Scholar 

  78. Yang, J., Fritsche, L. G., Zhou, X., Abecasis, G. & International Age-Related Macular Degeneration Genomics Consortium. A scalable Bayesian method for integrating functional information in genome-wide association studies. Am. J. Hum. Genet. 101, 404–416 (2017).

    PubMed  PubMed Central  CAS  Google Scholar 

  79. Do, C. B. & Batzoglou, S. What is the expectation maximization algorithm? Nat. Biotechnol. 26, 897–899 (2008).

    PubMed  CAS  Google Scholar 

  80. Kim, A. et al. Inferring causal cell types of human diseases and risk variants from candidate regulatory elements. Preprint at medRxiv https://doi.org/10.1101/2024.05.17.24307556 (2024).

  81. Finucane, H. K. et al. Partitioning heritability by functional annotation using genome-wide association summary statistics. Nat. Genet. 47, 1228–1235 (2015).

    PubMed  PubMed Central  CAS  Google Scholar 

  82. Albert, F. W. & Kruglyak, L. The role of regulatory variation in complex traits and disease. Nat. Rev. Genet. 16, 197–212 (2015).

    PubMed  CAS  Google Scholar 

  83. Bush, W. S., Oetjens, M. T. & Crawford, D. C. Unravelling the human genome-phenome relationship using phenome-wide association studies. Nat. Rev. Genet. 17, 129–145 (2016).

    PubMed  CAS  Google Scholar 

  84. van Rheenen, W., Peyrot, W. J., Schork, A. J., Lee, S. H. & Wray, N. R. Genetic correlations of polygenic disease traits: from theory to practice. Nat. Rev. Genet. 20, 567–581 (2019).

    PubMed  Google Scholar 

  85. Asimit, J. L. et al. Stochastic search and joint fine-mapping increases accuracy and identifies previously unreported associations in immune-mediated diseases. Nat. Commun. 10, 3216 (2019).

    PubMed  PubMed Central  Google Scholar 

  86. Hernandez, N. et al. The flashfm approach for fine-mapping multiple quantitative traits. Nat. Commun. 12, 6147 (2021).

    PubMed  PubMed Central  CAS  Google Scholar 

  87. Arvanitis, M., Tayeb, K., Strober, B. J. & Battle, A. Redefining tissue specificity of genetic regulation of gene expression in the presence of allelic heterogeneity. Am. J. Hum. Genet. 109, 223–239 (2022).

    PubMed  PubMed Central  CAS  Google Scholar 

  88. Xu, C., Ganesh, S. K. & Zhou, X. mtPGS: leverage multiple correlated traits for accurate polygenic score construction. Am. J. Hum. Genet. 110, 1673–1689 (2023).

    PubMed  PubMed Central  CAS  Google Scholar 

  89. Giambartolomei, C. et al. Bayesian test for colocalisation between pairs of genetic association studies using summary statistics. PLoS Genet. 10, e1004383 (2014).

    PubMed  PubMed Central  Google Scholar 

  90. Hormozdiari, F. et al. Colocalization of GWAS and eQTL signals detects target genes. Am. J. Hum. Genet. 99, 1245–1260 (2016).

    PubMed  PubMed Central  CAS  Google Scholar 

  91. Wallace, C. A more accurate method for colocalisation analysis allowing for multiple causal variants. PLoS Genet. 17, e1009440 (2021).

    PubMed  PubMed Central  CAS  Google Scholar 

  92. Bick, A. G. et al. Genomic data in the all of us research program. Nature 627, 340–346 (2024).

    Google Scholar 

  93. Nagai, A. et al. Overview of the BioBank Japan Project: study design and profile. J. Epidemiol. 27, S2–S8 (2017).

    PubMed  PubMed Central  Google Scholar 

  94. Bycroft, C. et al. The UK Biobank resource with deep phenotyping and genomic data. Nature 562, 203–209 (2018).

    PubMed  PubMed Central  CAS  Google Scholar 

  95. Zaitlen, N., Pasaniuc, B., Gur, T., Ziv, E. & Halperin, E. Leveraging genetic variability across populations for the identification of causal variants. Am. J. Hum. Genet. 86, 23–33 (2010).

    PubMed  PubMed Central  CAS  Google Scholar 

  96. Shi, H. et al. Localizing components of shared transethnic genetic architecture of complex traits from GWAS summary data. Am. J. Hum. Genet. 106, 805–817 (2020).

    PubMed  PubMed Central  CAS  Google Scholar 

  97. Kichaev, G. & Pasaniuc, B. Leveraging functional-annotation data in trans-ethnic fine-mapping studies. Am. J. Hum. Genet. 97, 260–271 (2015).

    PubMed  PubMed Central  CAS  Google Scholar 

  98. Wen, X., Luca, F. & Pique-Regi, R. Cross-population joint analysis of eQTLs: fine mapping and functional annotation. PLoS Genet. 11, e1005176 (2015).

    PubMed  PubMed Central  Google Scholar 

  99. Zhou, F. et al. Leveraging information between multiple population groups and traits improves fine-mapping resolution. Nat. Commun. 14, 7279 (2023).

    PubMed  PubMed Central  CAS  Google Scholar 

  100. The GTEx Consortium. The GTEx Consortium atlas of genetic regulatory effects across human tissues. Science 369, 1318–1330 (2020).

    PubMed Central  Google Scholar 

  101. Sun, B. B. et al. Genomic atlas of the human plasma proteome. Nature 558, 73–79 (2018).

    PubMed  PubMed Central  CAS  Google Scholar 

  102. McVicker, G. et al. Identification of genetic variants that affect histone modifications in human cells. Science 342, 747–749 (2013).

    PubMed  PubMed Central  CAS  Google Scholar 

  103. Shi, H. et al. Population-specific causal disease effect sizes in functionally important regions impacted by selection. Nat. Commun. 12, 1098 (2021).

    PubMed  PubMed Central  CAS  Google Scholar 

  104. Gamazon, E. R. et al. A gene-based association method for mapping traits using reference transcriptome data. Nat. Genet. 47, 1091–1098 (2015).

    PubMed  PubMed Central  CAS  Google Scholar 

  105. Gusev, A. et al. Integrative approaches for large-scale transcriptome-wide association studies. Nat. Genet. 48, 245–252 (2016).

    PubMed  PubMed Central  CAS  Google Scholar 

  106. Li, Z., Gao, B. & Zhou, X. An alternative framework for transcriptome-wide association studies to detect and decipher gene-trait associations. Preprint at bioRxiv https://doi.org/10.1101/2025.03.14.643391 (2025).

  107. Wainberg, M. et al. Opportunities and challenges for transcriptome-wide association studies. Nat. Genet. 51, 592–599 (2019).

    PubMed  PubMed Central  CAS  Google Scholar 

  108. Mancuso, N. et al. Probabilistic fine-mapping of transcriptome-wide association studies. Nat. Genet. 51, 675–682 (2019).

    PubMed  PubMed Central  CAS  Google Scholar 

  109. Lu, Z. et al. Multi-ancestry fine-mapping improves precision to identify causal genes in transcriptome-wide association studies. Am. J. Hum. Genet. 109, 1388–1404 (2022).

    PubMed  PubMed Central  CAS  Google Scholar 

  110. Hill, W. G., Goddard, M. E. & Visscher, P. M. Data and theory point to mainly additive genetic variance for complex traits. PLoS Genet. 4, e1000008 (2008).

    PubMed  PubMed Central  Google Scholar 

  111. Hivert, V. et al. Estimation of non-additive genetic variance in human complex traits from a large sample of unrelated individuals. Am. J. Hum. Genet. 108, 962 (2021).

    PubMed  PubMed Central  CAS  Google Scholar 

  112. Lenz, T. L. et al. Widespread non-additive and interaction effects within HLA loci modulate the risk of autoimmune diseases. Nat. Genet. 47, 1085–1090 (2015).

    PubMed  PubMed Central  CAS  Google Scholar 

  113. Goyette, P. et al. High-density mapping of the MHC identifies a shared role for HLA-DRB1*01:03 in inflammatory bowel diseases and heterozygous advantage in ulcerative colitis. Nat. Genet. 47, 172–179 (2015).

    PubMed  PubMed Central  CAS  Google Scholar 

  114. Conomos, M. P., Miller, M. B. & Thornton, T. A. Robust inference of population structure for ancestry prediction and correction of stratification in the presence of relatedness. Genet. Epidemiol. 39, 276–293 (2015).

    PubMed  PubMed Central  Google Scholar 

  115. Zhou, X. & Stephens, M. Genome-wide efficient mixed-model analysis for association studies. Nat. Genet. 44, 821–824 (2012).

    PubMed  PubMed Central  CAS  Google Scholar 

  116. Yang, Z. et al. Fine-mapping in admixed populations using CARMA-X, with applications to Latin American studies. Am. J. Hum. Genet. 112, 1215–1232 (2025).

    PubMed  PubMed Central  CAS  Google Scholar 

  117. Benner, C. et al. Prospects of fine-mapping trait-associated genomic regions by using summary statistics from genome-wide association studies. Am. J. Hum. Genet. 101, 539–551 (2017).

    PubMed  PubMed Central  CAS  Google Scholar 

  118. Zhang, W., Lu, T., Sladek, R., Dupuis, J. & Lettre, G. Robust fine-mapping in the presence of linkage disequilibrium mismatch. Preprint at bioRxiv https://doi.org/10.1101/2024.10.29.620968 (2024).

  119. Marouli, E. et al. Rare and low-frequency coding variants alter human adult height. Nature 542, 186–190 (2017).

    PubMed  PubMed Central  CAS  Google Scholar 

  120. Hawkes, G. et al. Whole-genome sequencing analysis identifies rare, large-effect noncoding variants and regulatory regions associated with circulating protein levels. Nat. Genet. 57, 626–634 (2025).

    PubMed  PubMed Central  CAS  Google Scholar 

  121. Tennessen, J. A. et al. Evolution and functional impact of rare coding variation from deep sequencing of human exomes. Science 337, 64–69 (2012).

    PubMed  PubMed Central  CAS  Google Scholar 

  122. Barton, A. R., Sherman, M. A., Mukamel, R. E. & Loh, P. R. Whole-exome imputation within UK Biobank powers rare coding variant association and fine-mapping analyses. Nat. Genet. 53, 1260–1269 (2021).

    PubMed  PubMed Central  CAS  Google Scholar 

  123. Li, B. & Leal, S. M. Methods for detecting associations with rare variants for common diseases: application to analysis of sequence data. Am. J. Hum. Genet. 83, 311–321 (2008).

    PubMed  PubMed Central  CAS  Google Scholar 

  124. Wu, M. C. et al. Rare-variant association testing for sequencing data with the sequence kernel association test. Am. J. Hum. Genet. 89, 82–93 (2011).

    PubMed  PubMed Central  CAS  Google Scholar 

  125. Sanderson, E. et al. Mendelian randomization. Nat. Rev. Methods Primers 2, 6 (2022).

    PubMed  PubMed Central  CAS  Google Scholar 

  126. Yuan, Z. et al. Likelihood-based Mendelian randomization analysis with automated instrument selection and horizontal pleiotropic modeling. Sci. Adv. 8, eabl5744 (2022).

    PubMed  PubMed Central  CAS  Google Scholar 

  127. Hou, K. et al. Causal effects on complex traits are similar for common variants across segments of different continental ancestries within admixed individuals. Nat. Genet. 55, 549–558 (2023).

    PubMed  PubMed Central  CAS  Google Scholar 

  128. Ding, Y. et al. Polygenic scoring accuracy varies across the genetic ancestry continuum. Nature 618, 774–781 (2023).

    PubMed  PubMed Central  CAS  Google Scholar 

  129. Hou, K. et al. Calibrated prediction intervals for polygenic scores across diverse contexts. Nat. Genet. 56, 1386–1396 (2024).

    PubMed  PubMed Central  CAS  Google Scholar 

  130. Herrera-Luis, E., Benke, K., Volk, H., Ladd-Acosta, C. & Wojcik, G. L. Gene–environment interactions in human health. Nat. Rev. Genet. 25, 768–784 (2024).

    PubMed  PubMed Central  CAS  Google Scholar 

  131. Benegas, G., Ye, C., Albors, C., Li, J. C. & Song, Y. S. Genomic language models: opportunities and challenges. Trends Genet. 41, 286–302 (2025).

    PubMed  CAS  Google Scholar 

  132. Klein, J. C. et al. A systematic evaluation of the design and context dependencies of massively parallel reporter assays. Nat. Methods 17, 1083–1091 (2020).

    PubMed  PubMed Central  CAS  Google Scholar 

  133. Porto, E. M., Komor, A. C., Slaymaker, I. M. & Yeo, G. W. Base editing: advances and therapeutic opportunities. Nat. Rev. Drug Discov. 19, 839–859 (2020).

    PubMed  PubMed Central  CAS  Google Scholar 

  134. Finucane, H. K. et al. Heritability enrichment of specifically expressed genes identifies disease-relevant tissues and cell types. Nat. Genet. 50, 621–629 (2018).

    PubMed  PubMed Central  CAS  Google Scholar 

  135. Zhou, X. A unified framework for variance component estimation with summary statistics in genome-wide association studies. Ann. Appl. Stat. 11, 2027–2051 (2017).

    PubMed  PubMed Central  Google Scholar 

  136. Kanai, M. et al. Insights from complex trait fine-mapping across diverse populations. Preprint at medRxiv https://doi.org/10.1101/2021.09.03.21262975 (2021).

  137. Morris, J. A. et al. Discovery of target genes and pathways at GWAS loci by pooled single-cell CRISPR screens. Science 380, eadh7699 (2023).

    PubMed  PubMed Central  CAS  Google Scholar 

  138. Lee, S. et al. Massively parallel reporter assay investigates shared genetic variants of eight psychiatric disorders. Cell 188, 1409–1424.e21 (2025).

    PubMed  CAS  Google Scholar 

Download references

Acknowledgements

This study was supported by the National Institutes of Health (NIH) Grants R01HG009124 and R01GM144960. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. The authors thank B. Gao for his help with interpreting the literature.

Author information

Authors and Affiliations

Authors

Contributions

The authors contributed equally to all aspects of the article.

Corresponding author

Correspondence to Xiang Zhou.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Reviews Genetics thanks Jian Zeng, Bogdan Pasaniuc and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Glossary

Bayes factors

Relative marginal likelihood of the observed data under one hypothesis compared with another, often quantifying the evidence in favour of an association versus no association.

Bayesian framework

A statistical framework that represents uncertainty in model parameters using probability distributions; it combines prior beliefs with observed data through Bayes’ theorem to compute posterior distributions.

Bernoulli distribution

A discrete probability distribution for binary data that describes the probability of an event with only two possible outcomes, coded as 1 for success and 0 for failure.

Causal configurations

Specific combinations of causal statuses across SNPs within a genomic locus.

Confounding factors

Variables that influence both the outcome and the explanatory variable, leading to spurious associations between outcome and explanatory variable themselves.

Density value

The value of the probability density function evaluated at a given point, capturing the relative possibility for a continuous random variable being near that point.

Double-exponential distribution

Also known as Laplace distribution, it is a continuous probability distribution that resembles the normal distribution but with a sharper peak at the centre and heavier tails to encourage sparsity.

Expectation-maximization algorithm

An iterative algorithm to find maximum likelihood estimates in models with latent or missing variables by alternating between expectation and maximization steps.

Expression quantitative trait loci

Genomic loci in which genetic variants are associated with gene expression levels.

First-order Taylor approximation

A linear approximation of a function based on its value and first-order derivative at a given point.

Generalized linear regression

A generalization of linear regression that relates the linear combination of explanatory variables to the outcome variable through a link function, allowing for different types of outcome variables (for example, counts, binary).

Global-local shrinkage prior

A type of Bayesian prior that incorporates a global parameter to impose overall shrinkage of SNP effect sizes towards zero, while using local parameters to allow SNP-specific adaptive shrinkage of individual effect sizes.

Kullback–Leibler divergence

A measure of the difference between two probability distributions, often used to assess how one distribution diverges from another distribution.

Linkage disequilibrium

(LD). The nonrandom association of alleles at different loci.

Logistic model

A type of generalized linear regression used for binary outcomes in which the log-odds of a binary outcome is modelled as a linear combination of explanatory variables.

Marginal P values

In genome-wide association studies, a marginal P value refers to the quantification of statistical evidence for an association between a single genetic variant and the phenotype, without accounting for the effects of other variants.

Meta-analysis

A statistical analysis that combines results from multiple studies to reach a single conclusion about a common research question.

Minor allele frequency

(MAF). The frequency of the less common allele at a genetic locus in a population.

Model space

The set of all possible causal configurations within the fine-mapping model.

Multiple linear regression

A regression analysis that models a continuous outcome variable as a linear combination of multiple explanatory variables.

Per-SNP heritability

The proportion of phenotypic variance explained by a single SNP.

Phenotype residuals

The portion of phenotypic variation that remains after accounting for the effects of known covariates in a statistical model; calculated as the difference between observed and model-predicted phenotype values.

Pleiotropy

A genetic phenomenon in which one gene affects multiple traits or diseases.

Poisson distribution

A discrete probability distribution for count data that expresses the probability of a given number of events occurring in a fixed interval of time.

Posterior distribution

The probability distribution of a model parameter given the observed data and prior information.

Posterior probability

The probability of an event given the observed data and prior information.

Probit model

A type of generalized linear regression used for binary outcomes in which the probability of success is modelled using the cumulative distribution function of the standard normal distribution.

Regression

A statistical analysis that estimates the relationship between an outcome variable and one or more explanatory variables.

Regularization

In the context of covariance matrix estimation, regularization refers to the technique of adjusting the sample covariance matrix, typically by adding a multiply of the identity matrix, to ensure invertibility and improve numerical stability.

Statistical power

The probability that a statistical test will correctly detect an effect if it truly exists.

Summary statistics

Effect size estimates and their standard errors from single-variant association analysis in genome-wide association studies, along with an SNP–SNP correlation matrix typically estimated from a reference panel.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, Z., Zhou, X. Towards improved fine-mapping of candidate causal variants. Nat Rev Genet 26, 847–861 (2025). https://doi.org/10.1038/s41576-025-00869-4

Download citation

  • Accepted:

  • Published:

  • Version of record:

  • Issue date:

  • DOI: https://doi.org/10.1038/s41576-025-00869-4

This article is cited by

Search

Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing