Enriching the analysis of genomewide association studies with hierarchical modeling - PubMed (original) (raw)
. 2007 Aug;81(2):397-404.
doi: 10.1086/519794. Epub 2007 Jun 26.
Affiliations
- PMID: 17668389
- PMCID: PMC1950795
- DOI: 10.1086/519794
Enriching the analysis of genomewide association studies with hierarchical modeling
Gary K Chen et al. Am J Hum Genet. 2007 Aug.
Abstract
Genomewide association studies (GWAs) initially investigate hundreds of thousands of single-nucleotide polymorphisms (SNPs), and the most promising SNPs are further evaluated with additional subjects, for replication or a joint analysis. Deciding which SNPs merit follow-up is one of the most crucial aspects of these studies. We present here an approach for selecting the most-promising SNPs that incorporates into a hierarchical model both conventional results and other existing information about the SNPs. The model is developed for general use, its potential value is shown by application, and tools are provided for undertaking hierarchical modeling. By quantitatively harnessing all available information in GWAs, hierarchical modeling may more clearly distinguish true causal variants from noise.
Figures
Figure 1.
The smallest 500 −log10 P values estimated from ordinary linear regression of the CHI3L2 gene–expression phenotype on the genotypes of 57 CEU individuals across chromosome 1. The causal SNP rs755467 is shown at 111.48 Mb with a log10 (P value) of 7.29.
Figure 2.
A comparison of the smallest 500 −log10 P values from the CHI3L2 example with use of hierarchical models across three values of the SD parameter ρ. Larger values of ρ reduce the effect of reduction toward the second-stage mean at the region with strong prior evidence (i.e., linked region in center), whereas smaller values increase the reduction.
Figure 3.
A comparison of the smallest 500 −log10 P values estimated from ordinary linear regression (in red, as shown in fig. 1) and the hierarchical model, with
ρ=0.05
estimates superimposed in blue.
Figure 4.
Proportion of the top 500 SNPs located across windows centered at the causal variant for CHII3L2 gene expression for ordinary linear regression and for the hierarchical model. The _X_-axis denotes the distance from the causal SNP to either edge of a window.
Similar articles
- Increasing power of genome-wide association studies by collecting additional single-nucleotide polymorphisms.
Kostem E, Lozano JA, Eskin E. Kostem E, et al. Genetics. 2011 Jun;188(2):449-60. doi: 10.1534/genetics.111.128595. Epub 2011 Apr 5. Genetics. 2011. PMID: 21467568 Free PMC article. - ArchiLD: hierarchical visualization of linkage disequilibrium in human populations.
Melchiotti R, Rötzschke O, Poidinger M. Melchiotti R, et al. PLoS One. 2014 Jan 21;9(1):e86761. doi: 10.1371/journal.pone.0086761. eCollection 2014. PLoS One. 2014. PMID: 24466224 Free PMC article. - What has GWAS done for HLA and disease associations?
Kennedy AE, Ozbek U, Dorak MT. Kennedy AE, et al. Int J Immunogenet. 2017 Oct;44(5):195-211. doi: 10.1111/iji.12332. Int J Immunogenet. 2017. PMID: 28877428 Review. - Molecular genetic studies of complex phenotypes.
Marian AJ. Marian AJ. Transl Res. 2012 Feb;159(2):64-79. doi: 10.1016/j.trsl.2011.08.001. Epub 2011 Aug 31. Transl Res. 2012. PMID: 22243791 Free PMC article. Review.
Cited by
- A scalable Bayesian functional GWAS method accounting for multivariate quantitative functional annotations with applications for studying Alzheimer disease.
Chen J, Wang L, De Jager PL, Bennett DA, Buchman AS, Yang J. Chen J, et al. HGG Adv. 2022 Sep 17;3(4):100143. doi: 10.1016/j.xhgg.2022.100143. eCollection 2022 Oct 13. HGG Adv. 2022. PMID: 36204489 Free PMC article. - Estimating Genetic Similarity Matrices Using Phylogenies.
Wang S, Ge S, Colijn C, Biller P, Wang L, Elliott LT. Wang S, et al. J Comput Biol. 2021 Jun;28(6):587-600. doi: 10.1089/cmb.2020.0375. Epub 2021 Apr 29. J Comput Biol. 2021. PMID: 33926225 Free PMC article. - A Hierarchical Approach Using Marginal Summary Statistics for Multiple Intermediates in a Mendelian Randomization or Transcriptome Analysis.
Jiang L, Xu S, Mancuso N, Newcombe PJ, Conti DV. Jiang L, et al. Am J Epidemiol. 2021 Jun 1;190(6):1148-1158. doi: 10.1093/aje/kwaa287. Am J Epidemiol. 2021. PMID: 33404048 Free PMC article. - Population genetic models of GERP scores suggest pervasive turnover of constrained sites across mammalian evolution.
Huber CD, Kim BY, Lohmueller KE. Huber CD, et al. PLoS Genet. 2020 May 29;16(5):e1008827. doi: 10.1371/journal.pgen.1008827. eCollection 2020 May. PLoS Genet. 2020. PMID: 32469868 Free PMC article. - A multiple coefficient of determination-based method for parsing SNPs that correlate with mRNA expression.
Song F, Tao Y, Sun Y, Saffen D. Song F, et al. Sci Rep. 2019 Dec 27;9(1):20110. doi: 10.1038/s41598-019-56494-9. Sci Rep. 2019. PMID: 31882953 Free PMC article.
References
Web Resources
- Ensembl, http://www.ensembl.org/
- Gene Expression Omnibus, http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi (for phenotype data about 57 CEU individuals [accession number GSE2552])
- International HapMap Project, http://www.hapmap.org/downloads/index.html.en (for genotype and LD data about SNPs)
- J.S.W. lab, http://www.epibiostat.ucsf.edu/witte_lab/
- NCBI FTP, http://www.ncbi.nlm.nih.gov/Ftp/
References
- Morris C (1983) Parametric empirical Bayes inference: theory and applications. J Am Stat Assoc 78:47–6510.2307/2287098 - DOI
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources