Abstract
The HapMap project has facilitated the selection of tagging single nucleotide polymorphisms (tagSNPs) for genome-wide association studies (GWAS) under the assumption that linkage disequilibrium (LD) in the HapMap populations is similar to the populations under investigation. Earlier reports support this assumption, although in most of these studies only a few loci were evaluated. We compared pair-wise LD and LD block structure across autosomes between the Dutch population and the CEU–HapMap reference panel. The impact of sampling distribution on the estimation of LD blocks was studied by bootstrapping. A high Pearson correlation (genome-wide; 0.93) between pair-wise r2 for the Dutch and the CEU populations was found, indicating that tagSNPs from the CEU–HapMap panel capture common variation in the Dutch population. However, some genomic regions exhibited, significantly lower correlation than the genome-wide estimate. This might decrease the validity of HapMap tagSNPs in these regions and the power of GWAS. The LD block structure differed considerably between the Dutch and CEU–HapMap populations. This was not explained by demographic differences between the CEU and Dutch samples, as testing for population stratification was not significant. We also found that sampling variation had a large effect on the estimation of LD blocks, as shown by the bootstrapping analysis. Thus, in small samples, most of the observed differences in LD blocks between populations are most likely the result of sampling variation. This poor concordance in LD block structure suggests that large samples are required for robust estimations of local LD block structure in populations.
Similar content being viewed by others
Log in or create a free account to read this content
Gain free access to this article, as well as selected content from this journal and more on nature.com
or
References
Zeggini E, Weedon MN, Lindgren CM et al: Replication of genome-wide association signals in UK samples reveals risk loci for type 2 diabetes. Science 2007; 316: 1336–1341.
Hinds DA, Stuve LL, Nilsen GB et al: Whole-genome patterns of common DNA variation in three human populations. Science 2005; 307: 1072–1079.
Gabriel SB, Schaffner SF, Nguyen H et al: The structure of haplotype blocks in the human genome. Science 2002; 296: 2225–2229.
International HapMap Consortium: A haplotype map of the human genome. Nature 2005; 437: 1299–1320.
Collins FS, Guyer MS, Chakravarti A : Variations on a theme: cataloging human DNA sequence variation. Science 1997; 278: 1580–1581.
Daly MJ, Rioux JD, Schaffner SF, Hudson TJ, Lander ES : High-resolution haplotype structure in the human genome. Nat Genet 2001; 29: 229–232.
The International HapMap Consortium: The International HapMap Project. Nature 2003; 426: 789–796.
Xing J, Witherspoon DJ, Watkins WS, Zhang Y, Tolpinrud W, Jorde LB : HapMap tagSNP transferability in multiple populations: general guidelines. Genomics 2008; 92: 41–51.
Ribas G, Gonzalez-Neira A, Salas A et al: Evaluating HapMap SNP data transferability in a large-scale genotyping project involving 175 cancer-associated genes. Hum Genet 2006; 118: 669–679.
Mueller JC, Lohmussaar E, Magi R et al: Linkage disequilibrium patterns and tagSNP transferability among European populations. Am J Hum Genet 2005; 76: 387–398.
de Bakker PI, Yelensky R, Pe’er I, Gabriel SB, Daly MJ, Altshuler D : Efficiency and power in genetic association studies. Nat Genet 2005; 37: 1217–1223.
Gonzalez-Neira A, Ke X, Lao O et al: The portability of tagSNPs across populations: a worldwide survey. Genome Res 2006; 16: 323–330.
Conrad DF, Jakobsson M, Coop G et al: A worldwide survey of haplotype variation and linkage disequilibrium in the human genome. Nat Genet 2006; 38: 1251–1260.
Montpetit A, Nelis M, Laflamme P et al: An evaluation of the performance of tag SNPs derived from HapMap in a caucasian population. PLoS Genet 2006; 2: e27.
Liu N, Sawyer SL, Mukherjee N et al: Haplotype block structures show significant variation among populations. Genet Epidemiol 2004; 27: 385–400.
Gu S, Pakstis AJ, Li H, Speed WC, Kidd JR, Kidd KK : Significant variation in haplotype block structure but conservation in tagSNP patterns among global populations. Eur J Hum Genet 2007; 15: 302–312.
Greenwood TA, Rana BK, Schork NJ : Human haplotype block sizes are negatively correlated with recombination rates. Genome Res 2004; 14: 1358–1361.
Osabe D, Tanahashi T, Nomura K et al: Evaluation of sample size effect on the identification of haplotype blocks. BMC Bioinformatics 2007; 8: 200.
Kayser M, Liu F, Janssens ACJW et al: Three genome-wide association studies and a linkage analysis identify HERC2 as a human iris color gene. Am J Hum Genet 2008; 82: 411–423.
van Es MA, van Vught PWJ, Blauw HM et al: Genetic variation in DPP6 is associated with susceptibility to amyotrophic lateral sclerosis. Nat Genet 2008; 40: 29–31.
Sulem P, Gudbjartsson DF, Stacey SN et al: Genetic determinants of hair, eye and skin pigmentation in Europeans. Nat Genet 2007; 39: 1443–1452.
GAIN Collaborative Research Group Manolio TA, Rodriguez LL, Brooks L et al: New models of collaboration in genome-wide association studies: the Genetic Association Information Network. Nat Genet 2007; 39: 1045–1051.
Boomsma DI, Willemsen G, Sullivan PF et al: Genome-wide association of major depression: description of samples for the GAIN Major Depressive Disorder Study: NTR and NESDA biobank projects. Eur J Hum Genet 2008; 16: 335–342.
Hemminger BM, Saelim B, Sullivan PF : TAMAL: an integrated approach to choosing SNPs for genetic studies of human complex traits. Bioinformatics 2006; 22: 626–627.
International HapMap Consortium, Frazer KA, Ballinger DG, Cox DR et al: A second generation human haplotype map of over 3.1 million SNPs. Nature 2007; 449: 851–861.
Wheeler DL, Barrett T, Benson DA et al: Database resources of the National Center for Biotechnology Information. Nucleic Acids Res 2008; 36: D13–D21.
Barrett JC, Fry B, Maller J, Daly MJ : Haploview: analysis and visualization of LD and haplotype maps. Bioinformatics 2005; 21: 263–265.
Sokal R, Rohlf F : Correlation; in Company WHFa (ed):: Biometry. NewYork, 2003, pp 555–608.
Purcell S, Neale B, Todd-Brown K et al: PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 2007; 81: 559–575.
Marchini J, Cutler D, Patterson N et al: A comparison of phasing algorithms for trios and unrelated individuals. Am J Hum Genet 2006; 78: 437–450.
Terwilliger JD, Hiekkalinna T : An utter refutation of the ‘Fundamental Theorem of the HapMap’. Eur J Hum Genet 2006; 14: 426–437.
Gu CC, Yu K, Rao DC : Characterization of LD structures and the utility of HapMap in genetic association studies. AdvGenet 2008; 60: 407–435.
Pardo LM, MacKay I, Oostra B, van Duijn CM, Aulchenko YS : The effect of genetic drift in a young genetically isolated population. Ann Hum Genet 2005; 69: 288–295.
Gu CC, Yu K, Boerwinkle E : Measuring marker information content by the ambiguity of block boundaries observed in dense SNP data. Ann Hum Genet 2007; 71: 127–140.
Teare MD, Dunning AM, Durocher F, Rennart G, Easton DF : Sampling distribution of summary linkage disequilibrium measures. Ann Hum Genet 2002; 66: 223–233.
Weiss KM, Clark AG : Linkage disequilibrium and the mapping of complex human traits. Trends Genet 2002; 18: 19–24.
Zeggini E, Rayner W, Morris AP et al: An evaluation of HapMap sample size and tagging SNP performance in large-scale empirical and simulated data sets. Nat Genet 2005; 37: 1320–1322.
Ioannidis JPA : Non-replication and inconsistency in the genome-wide association setting. Hum Hered 2007; 64: 203–213.
McCarthy MI, Abecasis GR, Cardon LR et al: Genome-wide association studies for complex traits: consensus, uncertainty and challenges. Nat Rev Genet 2008; 9: 356–369.
Acknowledgements
We would like to acknowledge support from NWO: genetic basis of anxiety and depression (904-61-090); resolving cause and effect in the association between exercise and well-being (904-61-193); twin-family database for behavior genomics studies (480-04-004); twin research focusing on behavior (400-05-717); Center for Medical Systems Biology (NWO Genomics); Spinozapremie (SPI 56-464-14192); NWO-VI016-065-318; Centre for Neurogenomics and Cognitive Research (CNCR-VU); genomewide analyses of European twin and population cohorts (EU/QLRT-2001-01254); genome scan for neuroticism (NIMH R01 MH059160); Geestkracht program of ZonMW (10-000-1002); matching funds from universities and mental health care institutes involved in NESDA (GGZ Buitenamstel-Geestgron84den, Rivierduinen, University Medical Center Groningen, GGZ Lentis, GGZ Friesland, GGZ Drenthe). Major funding for this project is from the Genetic Association Information Network of the Foundation for the US National Institutes of Health, a public–private partnership between the NIH and Pfizer Inc., Affymetrix Inc. and Abbott Laboratories. Genetic Cluster Computer is financially supported by the Netherlands Scientific Organization (NWO 480-05-003). Stichting Nationale Computerfaciliteiten –NCF (SH-104-08 Grant) is also acknowledged.
Author information
Authors and Affiliations
Corresponding author
Additional information
Supplementary Information accompanies the paper on European Journal of Human Genetics website (http://www.nature.com/ejhg)
Rights and permissions
About this article
Cite this article
Pardo, L., Bochdanovits, Z., de Geus, E. et al. Global similarity with local differences in linkage disequilibrium between the Dutch and HapMap–CEU populations. Eur J Hum Genet 17, 802–810 (2009). https://doi.org/10.1038/ejhg.2008.248
Received:
Revised:
Accepted:
Published:
Issue date:
DOI: https://doi.org/10.1038/ejhg.2008.248
Keywords
This article is cited by
-
A common polymorphism in the ABCB1 gene is associated with side effects of PGP-dependent antidepressants in a large naturalistic Dutch cohort
The Pharmacogenomics Journal (2016)
-
Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits
Nature Genetics (2012)
-
Joint reanalysis of 29 correlated SNPs supports the role of PCLO/Piccolo as a causal risk factor for major depressive disorder
Molecular Psychiatry (2009)


