Bayesian statistical methods for genetic association studies (original) (raw)
References
Sellke, T., Bayarri, M. J. & Berger, J. O. Calibration of p values for testing precise null hypotheses. Am. Stat.55, 62–71 (2001). Article Google Scholar
Ioannidis, J. P. A. Effect of formal statistical significance on the credibility of observational associations. Am. J. Epidem.168, 374–383 (2008). Article Google Scholar
Ayres, K. L. & Balding, D. J. Measuring departures from Hardy–Weinberg: a Markov chain Monte Carlo method for estimating the inbreeding coefficient. Heredity80, 769–777 (1998). ArticlePubMed Google Scholar
Shoemaker, J. S., Painter, I. S. & Weir, B. S. Bayesian statistics in genetics — a guide for the uninitiated. Trends Genet.15, 354–358 (1999). ArticleCASPubMed Google Scholar
Beaumont, M. A. & Rannala, B. The Bayesian revolution in genetics. Nature Rev. Genet.5, 251–261 (2004). ArticleCASPubMed Google Scholar
Marjoram, P. & Tavare, S. Modern computational approaches for analysing molecular genetic variation data. Nature Rev. Genet.7, 759–770 (2006). ArticleCASPubMed Google Scholar
O'Hara, R. B., Cano, J. M., Ovaskainen, O., Teplitsky, C. & Alho, J. S. Bayesian approaches in evolutionary quantitative genetics. J. Evol. Biol.21, 949–957 (2008). ArticleCASPubMed Google Scholar
Wakefield, J. Bayesian methods for examining Hardy–Weinberg equilibrium. Biometrics 13 May 2009 (doi:10.1111/j.1541-0420.2009.01267.x). ArticlePubMedPubMed Central Google Scholar
Lunn, D. J., Whittaker, J. C. & Best, N. A Bayesian toolkit for genetic association studies. Genet. Epidem.30, 231–247 (2006). Article Google Scholar
Marchini, J., Howie, B., Myers, S., McVean, G. & Donnelly, P. A new multipoint method for genome-wide association studies by imputation of genotypes. Nature Genet.39, 906–913 (2007). The supplementary material of this article includes a review of frequentist tests and BFs for single-SNP association and a brief review of the Laplace approximation. In particular, it describes the Bayesian analysis methods implemented in the SNPTEST software. ArticleCASPubMed Google Scholar
Servin, B. & Stephens, M. Imputation-based analysis of association studies: candidate regions and quantitative traits. PLoS Genet.3, e114 (2007). This paper includes a description of several of the Bayesian analysis methods that are implemented in the BIMBAM software, including the Bayesian multi-SNP analysis methods that we used in this Review. ArticlePubMedPubMed Central Google Scholar
The Wellcome Trust Case Control Consortium. Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature447, 661–678 (2007). A landmark paper because of the size of the studies, the pioneering use of unphenotyped common controls for a range of diseases and the large number of novel genetic associations reported. The authors also advocate the use of Bayesian approaches for evaluating evidence of association, which was reported alongside traditionalp-values for the first time in a major study.
Wakefield, J. A Bayesian measure of the probability of false discovery in genetic epidemiology studies. Am. J. Hum. Genet.81, 208–227 (2007). ArticleCASPubMedPubMed Central Google Scholar
Hosking, F. J., Sterne, J. A. C., Smith, G. D. & Green, P. J. Inference from genome-wide association studies using a novel Markov model. Genet. Epidem.32, 497–504 (2008). Article Google Scholar
Verzilli, C. et al. Bayesian meta-analysis of genetic association studies with different sets of markers. Am. J. Hum. Genet.82, 859–872 (2008). ArticleCASPubMedPubMed Central Google Scholar
Fridley, B. L. Bayesian variable and model selection methods for genetic association studies. Genet. Epidem.33, 27–37 (2009). Article Google Scholar
Wakefield, J. Reporting and interpretation in genome-wide association studies. Intern. J. Epidem.37, 641–653 (2008). Article Google Scholar
Guan, Y. & Stephens, M. Practical issues in imputation-based association mapping. PLoS Genet.4, e1000279 (2008). This article includes a detailed discussion of the advantages of Bayesian methods over frequentist methods when assessing associations with imputed SNPs. ArticlePubMedPubMed Central Google Scholar
Balding, D. J. A tutorial on statistical methods for population association studies. Nature Rev. Genet.7, 781–791 (2006). This Review covers: preliminary analyses (of Hardy–Weinberg and linkage equilibria, inference of phase and missing genotypes); single-SNP tests of association for binary, continuous and ordinal outcomes; multi-SNP and haplotype analyses; and dealing with population stratification and multiple-testing issues, largely within the frequentist framework. ArticleCASPubMed Google Scholar
Jeffreys, H. Theory of Probability (Oxford Univ. Press, 1961). Google Scholar
Good, I. J. The Bayes/non-Bayes compromise: a brief review. J. Am. Stat. Assoc.87, 597–606 (1992). Article Google Scholar
Seaman, S. R. & Richardson, S. Equivalence of prospective and retrospective models in the Bayesian analysis of case–control studies, Biometrika91, 15–25 (2004). Article Google Scholar
Freidlin, B., Zheng, G., Li, Z. H. & Gastwirth, J. L. Trend tests for case–control studies of genetic markers: power, sample size and robustness. Hum. Hered.53, 146–152 (2002). ArticleCASPubMed Google Scholar
The SEARCH Collaborative Group. SLCO1B1 variants and statin-induced myopathy — a genomewide study. N. Engl. J. Med.359, 789–799 (2008).
Scott, L. J. et al. A genome-wide association study of type 2 diabetes in Finns detects multiple susceptibility variants. Science316, 1341–1345 (2009). Article Google Scholar
Tibshirani, R. Regression shrinkage and selection via the lasso. J. R. Stat. Soc. B58, 267–288 (1996). Google Scholar
Hoggart, C. J., Whittaker, J. C., De Iorio, M. & Balding, D. J. Simultaneous analysis of all SNPs in genome-wide and re-sequencing association studies. PLoS Genet.4, e1000130 (2008). ArticlePubMedPubMed Central Google Scholar
Kavvoura, F. K. & Ioannidis, J. P. A. Methods for meta-analysis in genetic association studies: a review of their potential and pitfalls. Hum. Genet.123, 1–14 (2008). ArticlePubMed Google Scholar
Van Houwelingen, H. & Lebrec, J. P. in Meta-analysis and Combining Information in Genetics and Genomics (eds Guerra, R. et al.) 49–66 (CRC Press, 2009). Google Scholar
Ioannidis, J. P., Patsopoulos, N. A. & Evangelou, E. Heterogeneity in meta-analyses of genome-wide association investigations. PLoS ONE2, e841 (2007). ArticlePubMedPubMed Central Google Scholar
Lunn, D. J., Thomas, A., Best, N. & Spiegelhalter, D. WinBUGS — a Bayesian modelling framework: concepts, structure, and extensibility. Stat. Comput.10, 325–337 (2000). Article Google Scholar
Thompson, J. R., Minelli, C., Abrams, K. R., Thakkinstian, A. & Attia, J. Combining information from related meta-analyses of genetic association studies. J. R. Stat. Soc. C57, 103–115 (2008). Article Google Scholar
Hoggart, C. J., Clark, T. G., De Iorio, M., Whittaker, J. C. & Balding, D. J. Genome-wide significance for dense SNP and resequencing data. Genet. Epidem.32, 179–185 (2008). Article Google Scholar
Veyrieras, J.-B. et al. High-resolution mapping of expression-QTLs yields insight into human gene regulation. PLoS Genet.4, e1000214 (2008). ArticlePubMedPubMed Central Google Scholar
Chen, R. et al. FitSNPs: highly differentially expressed genes are more likely to have variants associated with disease. Genome Biol.9, R170 (2008). ArticlePubMedPubMed Central Google Scholar
Tachmazidou, I., Andrew, T., Verzilli, C. J., Johnson, M. R. & De Iorio, M. Bayesian survival analysis in genetic association studies. Bioinformatics24, 2030–2036 (2008). ArticleCASPubMedPubMed Central Google Scholar
Benjamini, Y. & Hochberg, Y. Controlling the false discovery rate — a practical and powerful approach to multiple testing. J. R. Stat. Soc. B57, 289–300 (1995). Google Scholar
Storey, J. D. A direct approach to false discovery rates. J. R. Stat. Soc. B64, 479–498 (2002). Article Google Scholar
Wakefield, J. Bayes factors for genome-wide association studies: comparison with _P_-values. Genet. Epidem.33, 79–86 (2009). This is the last in a sequence of three single-author papers published by Wakefield in successive years. This paper uses the approximate BF introduced in Reference 14 to highlight what can be regarded as implicit assumptions in the use of standardp-values as the primary summaries of evidence for association. Article Google Scholar
Wang, W. Y. S., Barratt, B. J., Clayton, D. G. & Todd, J. A. Genome-wide association studies: theoretical and practical concerns. Nature Rev. Genet.6, 109–118 (2005). ArticleCASPubMed Google Scholar
Gorlov, I. P., Gorlova, O. Y., Sunyaev, S. R., Spitz, M. R. & Amos, C. I. Shifting paradigm of association studies: value of rare single-nucleotide polymorphisms. Am. J. Hum. Genet.82, 100–112 (2008). ArticleCASPubMedPubMed Central Google Scholar
Greenland, S. Multiple comparisons and association selection in general epidemiology. Intern. J. Epidem.37, 430–434 (2008). Article Google Scholar
Scheipl, F. & Kneib, T. Locally adaptive Bayesian P-splines with a normal-exponential-gamma prior. Comput. Stat. Data Anal.53, 3533–3552 (2009). Article Google Scholar
Reiner, A. P. et al. Polymorphisms of the HNF1A gene encoding hepatocyte nuclear factor-1α are associated with C-reactive protein. Am. J. Hum. Genet.82, 1193–1201 (2008). ArticleCASPubMedPubMed Central Google Scholar