Abstract
Haplotypes are now widely used in association studies between markers and disease susceptibility locus. However, when a large number of markers are considered, the number of possible haplotypes increases leading to two problems: an increased number of degrees of freedom that may result in a lack of power and the existence of rare haplotypes that may be difficult to take into account in the statistical analysis. In a recent paper, Durrant et al proposed a method, CLADHC, to group haplotypes based on distance matrices and showed that this could considerably increase the power of the association test as compared to either single-locus analysis or haplotype analysis without prior grouping. Although the authors considered different one-disease-locus susceptibility models in their simulations, they did not study the impact of the linkage disequilibrium (LD) pattern and of the susceptibility allele frequency on their conclusions. Here, we show, using haplotype data from five regions of the genome of different lengths and with different LD patterns, that, when a single disease susceptibility locus is simulated, the prior grouping of haplotypes based on the algorithm of Durrant et al does not increase the power of association testing except in very particular situations of LD patterns and allele frequencies.
Similar content being viewed by others
Log in or create a free account to read this content
Gain free access to this article, as well as selected content from this journal and more on nature.com
or
References
Akey J, Jin L, Xiong M : Haplotypes vs single marker linkage disequilibrium tests: what do we gain? Eur J Hum Genet 2001; 9: 291–300.
Zaykin DV, Westfall PH, Young SS, Karnoub MA, Wagner MJ, Ehm MG : Testing association of statistically inferred haplotypes with discrete and continuous traits in samples of unrelated individuals. Hum Hered 2002; 53: 79–91.
Zhang K, Calabrese P, Nordborg M, Sun F : Haplotype block structure and its applications to association studies: power and study designs. Am J Hum Genet 2002; 71: 1386–1394.
Long AD, Langley CH : The power of association studies to detect the contribution of candidate genetic loci to variation in complex traits. Genome Res 1999; 9: 720–731.
Kaplan N, Morris R : Issues concerning association studies for fine mapping a susceptibility gene for a complex disease. Genet Epidemiol 2001; 20: 432–457.
Roeder K, Bacanu SA, Sonpar V, Zhang X, Devlin B : Analysis of single-locus tests to detect gene/disease associations. Genet Epidemiol 2005; 28: 207–219.
Judson R, Stephens JC : Notes from the SNP vs. haplotype front. Pharmacogenomics 2001; 2: 7–10.
Bader JS : The relative power of SNPS and haplotype as genetic markers for association tests. Pharmacogenomics 2001; 2: 11–24.
Culverhouse R, Suarez BK, Lin J, Reich T : A perspective on epistasis: limits of models displaying no main effect. Am J Hum Genet 2002; 70: 461–471.
Jannot AS, Essioux L, Reese M, Clerget-Darpoux F : Improved use of SNP information to detect the role of genes. Genet Epidemiol 2003; 25: 158–167.
Templeton AR, Boerwinkle E, Sing CF : A cladistic analysis of phenotypic associations with haplotypes inferred from restriction endonuclease mapping. I. Basic theory and an analysis of alcohol dehydrogenase activity in Drosophila. Genetics 1987; 117: 343–351.
Durrant C, Zondervan KT, Cardon LR, Hunt S, Deloukas P, Morris AP : Linkage disequilibrium mapping via cladistic analysis of single-nucleotique polymorphism haplotypes. Am J Hum Genet 2004; 75: 35–43.
SeattleSNPs. NHLBI Program for Genomic Applications, UW-FHCRC, Seattle, WA. URL: http://pga.gs.washington.edu [accessed October 2004].
Stephens M, Smith NJ, Donnelly P : A new statistical method for haplotype reconstruction from population data. Am J Hum Genet 2001; 68: 978–989.
Stephens M, Donnelly P : A comparison of bayesian methods for haplotype reconstruction from population genotype data. Am J Hum Genet 2003; 73: 1162–1169.
Hugot JP, Chamaillard M, Zouali H et al: Association of NOD2 leucin-rich repeat variants with susceptibility to Crohn's disease. Nature 2001; 411: 599–603.
Daly MJ, Kruglyak L, Pratt S et al: Genehunter, version 2.1, 2001.
Abecasis GR, Cookson WO : GOLD-graphical overview of linkage disequilibrium. Bioinformatics 2000; 16: 182–183.
Seltman H, Roeder K, Devlin B : Transmission/Disequilibrium test meets measured haplotype analysis: family-based association analysis guided by evolution of haplotypes. Am J Hum Genet 2001; 68: 1250–1263.
Bardel C, Danjean V, Hugot JP, Darlu P, Genin E : On the use of haplotype phylogeny to detect disease susceptibility loci. BMC Genetics 2005; 6.
Acknowledgements
We thank Jean-Pierre Hugot, Habib Zouali and Suzanne Lesage from the foundation, Jean Dausset for providing us with Crohn data and Caroline Durrant for kindly providing us with the software CLADHC. We also thank the Runtime team from the LaBRI for letting us run a part of our simulations on their cluster and two anonymous reviewers for their helpful comments.
Author information
Authors and Affiliations
Corresponding author
Additional information
Supplementary Information accompanies the paper on European Journal of Human Genetics website (http://www.nature.com/ejhg)
Supplementary information
Rights and permissions
About this article
Cite this article
Bardel, C., Darlu, P. & Génin, E. Clustering of haplotypes based on phylogeny: how good a strategy for association testing?. Eur J Hum Genet 14, 202–206 (2006). https://doi.org/10.1038/sj.ejhg.5201501
Received:
Revised:
Accepted:
Published:
Issue date:
DOI: https://doi.org/10.1038/sj.ejhg.5201501
Keywords
This article is cited by
-
Haplotype based testing for a better understanding of the selective architecture
BMC Bioinformatics (2023)
-
Global haplotype partitioning for maximal associated SNP pairs
BMC Bioinformatics (2009)
-
Gains in power for exhaustive analyses of haplotypes using variable-sized sliding window strategy: a comparison of association-mapping strategies
European Journal of Human Genetics (2009)


