A strategy to discover genes that carry multi-allelic or mono-allelic risk for common diseases: a cohort allelic sums test (CAST)
Mutat Res. 2007 Feb 3;615(1-2):28-56.
doi: 10.1016/j.mrfmmm.2006.09.003. Epub 2006 Nov 13.
- PMID: 17101154
Stephan Morgenthaler et al. Mutat Res. 2007.
Abstract
A method is described to discover whether a gene carries one or more allelic mutations that confer risk for any specified common disease. The method does not depend upon genetic linkage of risk-conferring mutations to high-frequency genetic markers such as single nucleotide polymorphisms. Instead, the sums of allelic mutation frequencies in case and control cohorts are determined, and a statistical test is applied to discover whether the difference in these sums is greater than would be expected by chance. A statistical model is presented that defines the ability of such tests to detect significant gene-disease relationships as a function of case and control cohort sizes and key confounding variables: zygosity and genicity, environmental risk factors, errors in diagnosis, limits to mutant detection, linkage of neutral and risk-conferring mutations, ethnic diversity in the general population, and the expectation that, among all exonic mutants in the human genome, more than 90% will be neutral with regard to any effect on disease risk. Means to test the null hypothesis for, and determine the statistical power of, each test are provided. For this "cohort allelic sums test" or "CAST", the statistical model and test are provided as an Excel program, CASTAT(c) at . Based on genetics, technology and statistics, a strategy of enumerating the mutant alleles carried in the exons and splice sites of the estimated approximately 25,000 human genes in case cohort samples of 10,000 persons for each of 100 common diseases is proposed and evaluated: a wide range of possible conditions of multi-allelic or mono-allelic and monogenic, multigenic or polygenic (including epistatic) risk are found to be detectable using the statistical criteria of 1 or 10 "false positive" gene associations per approximately 25,000 gene-disease pair-wise trials and a statistical power of >0.8.
Using estimates of the distribution of both neutral and gene-inactivating nondeleterious mutations in humans and the sensitivity of the test to multigenic or multicausal risk, it is estimated that about 80% of nullizygous, heterozygous and functionally dominant gene-common disease associations may be discovered. Limitations include relative insensitivity of CAST to about 60% of possible associations given homozygous (wild type) risk and, more rarely, other stochastic limits when the frequency of mutations in the case cohort approaches that of the control cohort and biases such as absence of genetic risk masked by risk derived from a shared cultural environment.
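The comparison described in the abstract can be illustrated with a minimal sketch. The abstract does not state which statistical test CASTAT applies, so the code below substitutes a one-sided two-proportion z-test as a stand-in; the function names, the example counts, and the control cohort size are all hypothetical. The per-test threshold follows the stated criterion of 1 or 10 expected false positives per approximately 25,000 gene-disease trials.

```python
import math


def per_test_alpha(expected_false_positives, n_trials=25_000):
    """Per-test significance level that yields the stated number of
    expected false positives across n_trials independent trials."""
    return expected_false_positives / n_trials


def allelic_sum_z_test(case_mutant, case_alleles, control_mutant, control_alleles):
    """One-sided two-proportion z-test (an assumed stand-in, not CASTAT).

    case_mutant / control_mutant: summed count of mutant alleles observed
    in the gene across the cohort.
    case_alleles / control_alleles: number of allele copies examined
    (2 per person for an autosomal gene).
    Returns (z, p_value) for the alternative "case frequency > control".
    """
    p_case = case_mutant / case_alleles
    p_control = control_mutant / control_alleles
    pooled = (case_mutant + control_mutant) / (case_alleles + control_alleles)
    variance = pooled * (1 - pooled) * (1 / case_alleles + 1 / control_alleles)
    if variance == 0:
        raise ValueError("no variation in the pooled sample; test undefined")
    z = (p_case - p_control) / math.sqrt(variance)
    # Upper-tail probability of the standard normal distribution.
    p_value = 0.5 * math.erfc(z / math.sqrt(2))
    return z, p_value


# Hypothetical example: 10,000 cases and 10,000 controls (20,000 alleles each).
alpha = per_test_alpha(1)  # 1 expected false positive per 25,000 trials
z, p = allelic_sum_z_test(300, 20_000, 200, 20_000)
print(f"alpha={alpha:.1e}, z={z:.2f}, p={p:.2e}, significant={p < alpha}")
```

With these invented counts the summed mutant frequency is 1.5% in cases against 1.0% in controls, and the difference clears the stricter of the two thresholds. A normal approximation of this kind becomes unreliable when mutant counts are very small, in which case an exact test would be the more cautious choice.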