Two-stage two-locus models in genome-wide association - PubMed (original) (raw)
Two-stage two-locus models in genome-wide association
David M Evans et al. PLoS Genet. 2006.
Abstract
Studies in model organisms suggest that epistasis may play an important role in the etiology of complex diseases and traits in humans. With the era of large-scale genome-wide association studies fast approaching, it is important to quantify whether it will be possible to detect interacting loci using realistic sample sizes in humans and to what extent undetected epistasis will adversely affect power to detect association when single-locus approaches are employed. We therefore investigated the power to detect association for an extensive range of two-locus quantitative trait models that incorporated varying degrees of epistasis. We compared the power to detect association using a single-locus model that ignored interaction effects, a full two-locus model that allowed for interactions, and, most important, two two-stage strategies whereby a subset of loci initially identified using single-locus tests were analyzed using the full two-locus model. Despite the penalty introduced by multiple testing, fitting the full two-locus model performed better than single-locus tests for many of the situations considered, particularly when compared with attempts to detect both individual loci. Using a two-stage strategy reduced the computational burden associated with performing an exhaustive two-locus search across the genome but was not as powerful as the exhaustive search when loci interacted. Two-stage approaches also increased the risk of missing interacting loci that contributed little effect at the margins. Based on our extensive simulations, our results suggest that an exhaustive search involving all pairwise combinations of markers across the genome might provide a useful complement to single-locus scans in identifying interacting loci that contribute to moderate proportions of the phenotypic variance.
Conflict of interest statement
Competing interests. The authors have declared that no competing interests exist.
Figures
Figure 1. Power to Detect Association for 1,500 Individuals where Both Loci Are Responsible for 5% of the Trait Variance
Power is displayed on the vertical axis, allele frequencies at both loci on the horizontal axes. Results are shown for an (A) additive model where there is no epistasis (MA), (B) a simple looking epistatic model (M27) in which an individual requires at least one copy of the increaser allele at both loci in order to increase the quantitative phenotype above baseline levels, and (C) a more exotic model where an individual has to be heterozygous at both loci in order to have a trait mean above baseline (M16).
Figure 2. Partitioning of the Variance for Four Quantitative Trait Models
(A) Expected variance components at locus 1, locus 2, and the epistatic variance component for model M27 in which an individual requires at least one copy of the increaser allele at both loci in order to increase the quantitative phenotype above baseline levels. Note in particular that when alleles “A” and “B” are common at both loci, the majority of genetic variance resides in the epistatic component. (B) The proportion of the total genetic variance which is due to statistical epistasis for three simple models. The vertical axis denotes the proportion of variance, whereas the horizontal axes denote allele frequencies at each locus. For each model, the epistatic variance component is appreciable for a sizeable portion of the space of possible allele frequencies.
Figure 3. Power Associated with the Two-Stage Strategies in the Case of Two Loci Responsible for 5% of the Trait Variance Generated under Model M27 in which an Individual Requires at Least One Copy of the Increaser Allele at Both Loci in Order to Increase the Quantitative Phenotype above Baseline Levels as Applied to 1,000 Individuals
Five different first-stage thresholds are shown. In the case of the Both Significant Two-Stage Strategy, there is a rapid decrease in power with increasingly stringent first-stage thresholds, except for a small proportion of the space of possible allele frequencies where there is a small gain in power. In the case of the Either Significant Strategy, power declines more slowly as the first-stage threshold becomes increasingly stringent.
Figure 4. Fifty-One Biallelic Two-locus Models Incorporating Varying Degrees of Epistasis
Each row indexes the genotype at the first locus and each column refers to the genotype at the second locus. The number in each cell is the trait mean for that genotype. Models range from the very simple (e.g., M1) to the more exotic (e.g., M170).
Similar articles
- Epistasis: obstacle or advantage for mapping complex traits?
Verhoeven KJ, Casella G, McIntyre LM. Verhoeven KJ, et al. PLoS One. 2010 Aug 26;5(8):e12264. doi: 10.1371/journal.pone.0012264. PLoS One. 2010. PMID: 20865037 Free PMC article. - Optimal two-stage strategy for detecting interacting genes in complex diseases.
Ionita I, Man M. Ionita I, et al. BMC Genet. 2006 Jun 15;7:39. doi: 10.1186/1471-2156-7-39. BMC Genet. 2006. PMID: 16776843 Free PMC article. - Identifying quantitative trait locus by genetic background interactions in association studies.
Jannink JL. Jannink JL. Genetics. 2007 May;176(1):553-61. doi: 10.1534/genetics.106.062992. Epub 2006 Dec 18. Genetics. 2007. PMID: 17179077 Free PMC article. - A genome-wide search strategy for identifying quantitative trait loci involved in reading and spelling disability (developmental dyslexia).
Fisher SE, Stein JF, Monaco AP. Fisher SE, et al. Eur Child Adolesc Psychiatry. 1999;8 Suppl 3:47-51. doi: 10.1007/pl00010694. Eur Child Adolesc Psychiatry. 1999. PMID: 10638370 Review. - Genome partitioning and whole-genome analysis.
Schork NJ. Schork NJ. Adv Genet. 2001;42:299-322. doi: 10.1016/s0065-2660(01)42030-x. Adv Genet. 2001. PMID: 11037329 Review.
Cited by
- Overcoming the "feast or famine" effect: improved interaction testing in genome-wide association studies.
Zhou H, McPeek MS. Zhou H, et al. bioRxiv [Preprint]. 2024 Oct 26:2024.02.13.580168. doi: 10.1101/2024.02.13.580168. bioRxiv. 2024. PMID: 38405994 Free PMC article. Preprint. - Genome-wide association and epistasis studies reveal the genetic basis of saline-alkali tolerance at the germination stage in rice.
Zhang G, Bi Z, Jiang J, Lu J, Li K, Bai D, Wang X, Zhao X, Li M, Zhao X, Wang W, Xu J, Li Z, Zhang F, Shi Y. Zhang G, et al. Front Plant Sci. 2023 May 11;14:1170641. doi: 10.3389/fpls.2023.1170641. eCollection 2023. Front Plant Sci. 2023. PMID: 37251777 Free PMC article. - Dissecting Complex Traits Using Omics Data: A Review on the Linear Mixed Models and Their Application in GWAS.
Alamin M, Sultana MH, Lou X, Jin W, Xu H. Alamin M, et al. Plants (Basel). 2022 Nov 28;11(23):3277. doi: 10.3390/plants11233277. Plants (Basel). 2022. PMID: 36501317 Free PMC article. Review. - Multifactor dimensionality reduction reveals the effect of interaction between ERAP1 and IFIH1 polymorphisms in psoriasis susceptibility genes.
Zhang C, Qin Q, Li Y, Zheng X, Chen W, Zhen Q, Li B, Wang W, Sun L. Zhang C, et al. Front Genet. 2022 Nov 8;13:1009589. doi: 10.3389/fgene.2022.1009589. eCollection 2022. Front Genet. 2022. PMID: 36425068 Free PMC article. - Evaluating the detection ability of a range of epistasis detection methods on simulated data for pure and impure epistatic models.
Russ D, Williams JA, Cardoso VR, Bravo-Merodio L, Pendleton SC, Aziz F, Acharjee A, Gkoutos GV. Russ D, et al. PLoS One. 2022 Feb 18;17(2):e0263390. doi: 10.1371/journal.pone.0263390. eCollection 2022. PLoS One. 2022. PMID: 35180244 Free PMC article.
References
- Mackay TF. The genetic architecture of quantitative traits. Annu Rev Genet. 2001;35:303–339. - PubMed
- Chesler EJ, Lu L, Shou S, Qu Y, Gu J, et al. Complex trait analysis of gene expression uncovers polygenic and pleiotropic networks that modulate nervous system function. Nat Genet. 2005;37:233–242. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources