Cost-effective analysis of candidate genes using htSNPs: a staged approach

Lowe, C E; Cooper, J D; Chapman, J M; Barratt, B J; Twells, R C J; Green, E A; Savage, D A; Guja, C; Ionescu-Tîrgovişte, C; Tuomilehto-Wolf, E; Tuomilehto, J; Todd, J A; Clayton, D G

doi:10.1038/sj.gene.6364064

Brief Communication
Published: 18 March 2004

Cost-effective analysis of candidate genes using htSNPs: a staged approach

C E Lowe¹^na1,
J D Cooper¹^na1,
J M Chapman¹,
B J Barratt¹,
R C J Twells¹,
E A Green¹,
D A Savage²,
C Guja³,
C Ionescu-Tîrgovişte³,
E Tuomilehto-Wolf⁴,
J Tuomilehto^4,5,
J A Todd¹ &
…
D G Clayton¹

Genes & Immunity volume 5, pages 301–305 (2004)Cite this article

552 Accesses
46 Citations
Metrics details

Abstract

We have previously shown that the selection of haplotype tag single nucleotide polymorphisms (htSNPs) and their statistical analysis in a multi-locus transmission/disequilibrium test (TDT) results in a more cost-effective genotyping strategy in disease association studies of genes by minimising redundancy due to linkage disequilibrium between SNPs. Further savings can be achieved by the use of a two-stage genotyping strategy. This approach is illustrated here in conjunction with the multi-locus TDT in determining whether common alleles of the immune regulatory genes RANK and its ligand TRANCE (RANKL) are associated with type 1 diabetes (T1D). A saving of approximately 75% of potential genotyping reactions could be made with minimal loss of power. There was little evidence from our analysis for association between the TRANCE and RANK genes and T1D in the populations tested.

Access through your institution

Buy or subscribe

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on SpringerLink
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

Untangling the genetic link between type 1 and type 2 diabetes using functional genomics

Article Open access 06 July 2021

Functional characterization of T2D-associated SNP effects on baseline and ER stress-responsive β cell transcriptional activation

Article Open access 02 September 2021

Network-based analysis of key regulatory genes implicated in Type 2 Diabetes Mellitus and Recurrent Miscarriages in Turner Syndrome

Article Open access 21 May 2021

References

Service SK, Sandkuijl LA, Freimer NB . Cost-effective designs for linkage disequilibrium mapping of complex traits. Am J Hum Genet 2003; 72: 1213–1220.
Article CAS Google Scholar
Ioannidis JPA, Trikalinos TA, Ntzani EE, Contopoulos-Ioannidis DG . Genetic associations in large versus small studies: an empirical assessment. Lancet 2003; 361: 567–571.
Article Google Scholar
Johnson GC, Esposito L, Barratt BJ et al. Haplotype tagging for the identification of common disease genes. Nat Genet 2001; 29: 233–237.
Article CAS Google Scholar
Chapman JM, Cooper JD, Todd JA, Clayton DG . Detecting disease associations due to linkage disequilibrium using haplotype tags: a class of tests and the determinants of statistical power. Hum Heredity 2003; 56: 18–31.
Article Google Scholar
Aplenc R, Zhao H, Rebbeck TR, Propert KJ . Group sequential methods and sample size savings in biomarker–disease association studies. Genetics 2003; 163: 1215–1219.
CAS PubMed PubMed Central Google Scholar
Satagopan JM, Elston RC . Optimal two-stage genotyping in population-based association studies. Genet Epidemiol 2003; 25: 149–157.
Article Google Scholar
Zhang K, Deng M, Chen T, Waterman MS, Sun F . A dynamic programming algorithm for haplotype block partitioning. Proc Natl Acad Sci USA 2002; 99: 7335–7339.
Article CAS Google Scholar
Ke X, Cardon LR . Efficient selective screening of haplotype tag SNPs. Bioinformatics 2003; 19: 287–288.
Article CAS Google Scholar
Stram DO, Haiman CA, Hirschhorn JN et al. Choosing haplotype-tagging SNPs based on unphased genotype data using a preliminary sample of unrelated subjects with an example from the multiethnic cohort study. Hum Heredity 2003; 55: 27–36.
Article Google Scholar
Fan R, Knapp M . Genome association studies of complex diseases by case–control designs. Am J Hum Genet 2003; 72: 850–868.
Article CAS Google Scholar
Green EA, Choi Y, Flavell RA . Pancreatic lymph node-derived CD4(+)CD25(+) Treg cells: highly potent regulators of diabetes that require TRANCE–RANK signals. Immunity 2002; 16: 183–191.
Article CAS Google Scholar
Merriman TR, Twells RC, Merriman ME et al. Evidence by allelic association-dependent methods for a type 1 diabetes polygene (IDDM6) on chromosome 18q21. Hum Mol Genet 1997; 6: 1003–1010.
Article CAS Google Scholar
Vaidya B, Imrie H, Perros P et al. Evidence for a new Graves’ disease susceptibility locus at chromosome 18q21. Am J Hum Genet 2000; 66: 1710–1714.
Article CAS Google Scholar
Merriman TR, Cordell HJ, Eaves IA et al. Suggestive evidence for association of human chromosome 18q12–q21 and its orthologue on rat and mouse chromosome 18 with several autoimmune diseases. Diabetes 2001; 50: 184–194.
Article CAS Google Scholar
Jawaheer D, Seldin MF, Amos CI et al. Screening the genome for rheumatoid arthritis susceptibility genes. Arthritis Rheum 2003; 48: 906–916.
Article CAS Google Scholar
Chapman JM, Clayton DG . Detecting disease associations due to linkage disequilibrium using haplotype tags: technical addendum##http://www-gene.cimr.cam.ac.uk/clayton/tech_reports/chapman-clayton-2003.pdf.
Boos DB . On generalized score tests. Am Statistician 1992; 46: 327–333.
Google Scholar
StataCorp. Stata Statistical Software: Realease 8.0. Stata Corporation: College Station, TX, 2003.
R statistical language http://www.r-project.org/.

Download references

Acknowledgements

The Wellcome Trust and the Juvenile Diabetes Research Foundation International have funded this work. We thank Vin Everett, Geoff Dolman and Neil Walker for data management and the DNA team for sample preparation. Diabetes UK and the Human Biological Data Interchange are acknowledged for multiplex family collections. We also thank the Norwegian Study Group for Childhood Diabetes, Dag Undlien and Kjersti Ronningen for the collection and provision of Norwegian samples.

Author information

C E Lowe and J D Cooper: These authors contributed equally to this work

Authors and Affiliations

Juvenile Diabetes Research Foundation/Wellcome Trust Diabetes and Inflammation Laboratory, Cambridge Institute for Medical Research, University of Cambridge, Wellcome Trust/MRC Building, Cambridge, UK
C E Lowe, J D Cooper, J M Chapman, B J Barratt, R C J Twells, E A Green, J A Todd & D G Clayton
Department of Medical Genetics, Queen's University Belfast, Belfast City Hospital, Belfast, UK
D A Savage
Clinic of Diabetes, Institute of Diabetes, Nutrition and Metabolic Diseases ‘N Paulescu’, Bucharest, Romania
C Guja & C Ionescu-Tîrgovişte
Diabetes and Genetic Epidemiology Unit, National Public Health Institute, University of Helsinki, Helsinki, Finland
E Tuomilehto-Wolf & J Tuomilehto
Department of Public Health, University of Helsinki, Helsinki, Finland
J Tuomilehto

Authors

C E Lowe
View author publications
Search author on:PubMed Google Scholar
J D Cooper
View author publications
Search author on:PubMed Google Scholar
J M Chapman
View author publications
Search author on:PubMed Google Scholar
B J Barratt
View author publications
Search author on:PubMed Google Scholar
R C J Twells
View author publications
Search author on:PubMed Google Scholar
E A Green
View author publications
Search author on:PubMed Google Scholar
D A Savage
View author publications
Search author on:PubMed Google Scholar
C Guja
View author publications
Search author on:PubMed Google Scholar
C Ionescu-Tîrgovişte
View author publications
Search author on:PubMed Google Scholar
E Tuomilehto-Wolf
View author publications
Search author on:PubMed Google Scholar
J Tuomilehto
View author publications
Search author on:PubMed Google Scholar
J A Todd
View author publications
Search author on:PubMed Google Scholar
D G Clayton
View author publications
Search author on:PubMed Google Scholar

Corresponding author

Correspondence to D G Clayton.

Appendix A. Adjusting χ2 tests for stopping for futility in multi-stage association studies

We consider a study in which the test is carried out in a series of n stages, stopping for futility at each stage if the results, thus far, do not achieve a given nominal significance level. For a conventional frequentist interpretation, the nominal significance level at the end of the study should be corrected for the intermediate analyses.

χ² tests are generated by quadratic forms T=u^TV⁻¹u, where u is (asymptotically) multivariate normal with variance V. The test statistic T is then distributed as a non-central χ² distribution with df v, the rank of V and non-centrality parameter η=μ^TV⁻¹μ where μ=E(u). If the test is carried out in a series of n stages, involving proportions p₁,…,p_n of the total available sample, the score vector decomposes into independent contributions u=u₁+u₂+···+u_n, where

Writing u_[k], p_[k] for the partial sums

the test statistic carried out after stage k is

The distribution of T_k, conditional upon the history of previous results, u_1,,…,u_k−1 is that of p_kχ²/p[k], where χ² is a non-central χ² variate with v df and non-centrality parameter

We stop after stage k if T_k fails to exceed a critical value c_k. The probability of exceeding this critical value conditional upon reaching stage k is

This integral is intractable but may be approximated by simulation.

An accurate and efficient Monte Carlo method for calculating the overall probability of rejection is to simulate sequences of score vectors subject to the stopping rule described. The length of such sequences will vary from one to n. The probability of exceeding the test criterion after stage 2 conditional upon reaching stage 2 may then be calculated by averaging Pr(T₂>c₂∣u₁) over all simulated values of u₁. Similarly, the probability of exceeding the test criterion after stage 3 conditional upon reaching stage 3 may be calculated by averaging Pr(T₃>c₃∣u₁, u₂) over all simulated pairs of values (u₁, u₂). In this manner, the complete sequence of conditional probabilities can be estimated. When generating the sequences of score vectors, without loss of generality, we may take the v elements of u_i to be independent variates. The overall probability of rejecting the null hypothesis is given by the cumulative product

These calculations are implemented in the R language by the program Nstage.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Lowe, C., Cooper, J., Chapman, J. et al. Cost-effective analysis of candidate genes using htSNPs: a staged approach. Genes Immun 5, 301–305 (2004). https://doi.org/10.1038/sj.gene.6364064

Download citation

Received: 30 September 2003
Revised: 16 January 2004
Accepted: 28 January 2004
Published: 18 March 2004
Issue date: 01 June 2004
DOI: https://doi.org/10.1038/sj.gene.6364064

Keywords

This article is cited by

Association analysis of PRNP gene region with chronic wasting disease in Rocky Mountain elk
- Stephen N White
- Terry R Spraker
- Katherine I O'Rourke
BMC Research Notes (2010)
Robust associations of four new chromosome regions from genome-wide analyses of type 1 diabetes
- John A Todd
- Neil M Walker
- David G Clayton
Nature Genetics (2007)
Integrated analysis of genetic data with R
- Jing Zhao
- Qihua Tan
Human Genomics (2006)
Discovery, linkage disequilibrium and association analyses of polymorphisms of the immune complement inhibitor, decay-accelerating factor gene (DAF/CD55) in type 1 diabetes
- Hidenori Taniguchi
- Christopher E Lowe
- John A Todd
BMC Genetics (2006)
Detecting multiple associations in genome-wide studies
- Frank Dudbridge
- Arief Gusnanto
- Bobby PC Koeleman
Human Genomics (2006)

Cost-effective analysis of candidate genes using htSNPs: a staged approach

Abstract

Access options

Similar content being viewed by others

Untangling the genetic link between type 1 and type 2 diabetes using functional genomics

Functional characterization of T2D-associated SNP effects on baseline and ER stress-responsive β cell transcriptional activation

Network-based analysis of key regulatory genes implicated in Type 2 Diabetes Mellitus and Recurrent Miscarriages in Turner Syndrome

References

Acknowledgements