Sequencing studies in human genetics: design and interpretation (original) (raw)
Hindorff, L. A. et al. Potential etiologic and functional implications of genome-wide association loci for human diseases and traits. Proc. Natl Acad. Sci. USA106, 9362–9367 (2009). CASPubMedPubMed Central Google Scholar
McCarthy, M. I. et al. Genome-wide association studies for complex traits: consensus, uncertainty and challenges. Nature Rev. Genet.9, 356–369 (2008). This influential Review compiles into one paper the basics of doing a GWAS, including best practice guidelines, such as controlling for population stratification. The Review also reinforces the universally followed guideline of 5 × 10−8as a threshold for significance in GWAS. ArticleCASPubMed Google Scholar
Hoggart, C. J., Clark, T. G., De Iorio, M., Whittaker, J. C. & Balding, D. J. Genome-wide significance for dense SNP and resequencing data. Genet. Epidemiol.32, 179–185 (2008). ArticlePubMed Google Scholar
Cirulli, E. T. & Goldstein, D. B. Uncovering the roles of rare variants in common disease through whole-genome sequencing. Nature Rev. Genet.11, 415–425 (2010). ArticleCASPubMed Google Scholar
Bamshad, M. J. et al. Exome sequencing as a tool for Mendelian disease gene discovery. Nature Rev. Genet.12, 745–755 (2011). ArticleCASPubMed Google Scholar
Meyerson, M., Gabriel, S. & Getz, G. Advances in understanding cancer genomes through second-generation sequencing. Nature Rev. Genet.11, 685–696 (2010). ArticleCASPubMed Google Scholar
Ding, L., Wendl, M. C., Koboldt, D. C. & Mardis, E. R. Analysis of next-generation genomic data in cancer: accomplishments and challenges. Hum. Mol. Genet.19, R188–R196 (2010). ArticleCASPubMedPubMed Central Google Scholar
Shendure, J. & Ji, H. Next-generation DNA sequencing. Nature Biotech.26, 1135–1145 (2008). ArticleCAS Google Scholar
Ajay, S. S., Parker, S. C., Abaan, H. O., Fajardo, K. V. & Margulies, E. H. Accurate and comprehensive sequencing of personal genomes. Genome Res.21, 1498–1505 (2011). ArticlePubMedPubMed Central Google Scholar
Genomes Project, C. A map of human genome variation from population-scale sequencing. Nature467, 1061–1073 (2010). ArticleCAS Google Scholar
Need, A. C. et al. Clinical application of exome sequencing in undiagnosed genetic conditions. J. Med. Genet.49, 353–361 (2012). This is the first study that estimates the 'success rate' of getting a genetic diagnosis through whole-exome sequencing of undiagnosed conditions in a real clinical setting considering 12 children with a broad range of severe childhood genetic conditions. The primary conclusion is that the success rate is remarkably high but depends in many cases on functional characterization of previously unidentified mutations in already known disease genes. ArticleCASPubMed Google Scholar
Heinzen, E. L. et al. Exome sequencing followed by large-scale genotyping fails to identify single rare variants of large effect in idiopathic generalized epilepsy. Am. J. Hum. Genet.91, 293–302 (2012). The largest epilepsy exome-sequencing study to date is reported in this paper. The results suggest high locus and allelic heterogeneity for both disorders, requiring larger sample sizes. ArticleCASPubMedPubMed Central Google Scholar
Need, A. C. et al. Exome sequencing followed by large-scale genotyping suggests a limited role for moderately rare risk factors of strong effect in schizophrenia. Am. J. Hum. Genet.91, 303–312 (2012). The largest schizophrenia exome-sequencing study to date is reported in this paper. The results suggest high locus and allelic heterogeneity for both disorders, requiring larger sample sizes. ArticleCASPubMedPubMed Central Google Scholar
Heinzen, E. L. et al. De novo mutations in ATP1A3 cause alternating hemiplegia of childhood. Nature Genet.44, 1030–1034 (2012). ArticleCASPubMed Google Scholar
Li, B. et al. A likelihood-based framework for variant calling and de novo mutation detection in families. PLoS Genet.8, e1002944 (2012). ArticleCASPubMedPubMed Central Google Scholar
Nielsen, R., Paul, J. S., Albrechtsen, A. & Song, Y. S. Genotype and SNP calling from next-generation sequencing data. Nature Rev. Genet.12, 443–451 (2011). CASPubMed Google Scholar
Flicek, P. & Birney, E. Sense from sequence reads: methods for alignment and assembly. Nature Methods6, S6–S12 (2009). ArticleCASPubMed Google Scholar
DePristo, M. A. et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nature Genet.43, 491–498 (2011). This paper describes what has become the most widely used variant-calling environment. ArticleCASPubMed Google Scholar
Lunter, G. & Goodson, M. Stampy: a statistical algorithm for sensitive and fast mapping of Illumina sequence reads. Genome Res.21, 936–939 (2011). ArticleCASPubMedPubMed Central Google Scholar
Meacham, L. R. et al. Diabetes mellitus in long-term survivors of childhood cancer. Increased risk associated with radiation therapy: a report for the childhood cancer survivor study. Arch. Intern. Med.169, 1381–1388 (2009). ArticlePubMedPubMed Central Google Scholar
McKenna, A. et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res.20, 1297–1303 (2010). ArticleCASPubMedPubMed Central Google Scholar
Neale, B. M. et al. Patterns and rates of exonic de novo mutations in autism spectrum disorders. Nature485, 242–245 (2012). This paper was one of the first to analyse a large number of patients with a common disease using a trio design. Importantly, the authors established a formal framework for assessing whether excessde novomutations are observed over expectation under the null hypothesis and found that autism genomes carry only modest excess of such mutations. ArticleCASPubMedPubMed Central Google Scholar
Conrad, D. F. et al. Variation in genome-wide mutation rates within and between human families. Nature Genet.43, 712–714 (2011). ArticleCASPubMed Google Scholar
Alkan, C. et al. Personalized copy number and segmental duplication maps using next-generation sequencing. Nature Genet.41, 1061–1067 (2009). ArticleCASPubMed Google Scholar
de Ligt, J. et al. Diagnostic exome sequencing in persons with severe intellectual disability. N. Engl. J. Med.367, 1921–1929 (2012). ArticleCASPubMed Google Scholar
Rauch, A. et al. Range of genetic mutations associated with severe non-syndromic sporadic intellectual disability: an exome sequencing study. Lancet380, 1674–1682 (2012). ArticleCASPubMed Google Scholar
Sanders, S. J. et al. De novo mutations revealed by whole-exome sequencing are strongly associated with autism. Nature485, 237–241 (2012). ArticleCASPubMedPubMed Central Google Scholar
O'Roak, B. J. et al. Sporadic autism exomes reveal a highly interconnected protein network of de novo mutations. Nature485, 246–250 (2012). ArticleCASPubMedPubMed Central Google Scholar
Saunders, C. J. et al. Rapid whole-genome sequencing for genetic disease diagnosis in neonatal intensive care units. Sci. Transl. Med.4, 154ra135 (2012). ArticleCASPubMedPubMed Central Google Scholar
Bell, C. J. et al. Carrier testing for severe childhood recessive diseases by next-generation sequencing. Sci. Transl. Med.3, 65ra4 (2011). ArticleCASPubMedPubMed Central Google Scholar
Kimura, M. The Neutral Theory of Molecular Evolution (Cambridge Press, 1983). Book Google Scholar
Sim, N. L. et al. SIFT web server: predicting effects of amino acid substitutions on proteins. Nucleic Acids Res.40, W452–W457 (2012). ArticleCASPubMedPubMed Central Google Scholar
Stone, E. A. & Sidow, A. Physicochemical constraint violation by missense substitutions mediates impairment of protein function and disease severity. Genome Res.15, 978–986 (2005). ArticleCASPubMedPubMed Central Google Scholar
Jordan, D. M., Ramensky, V. E. & Sunyaev, S. R. Human allelic variation: perspective from protein function, structure, and evolution. Curr. Opin. Struct. Biol.20, 342–350 (2010). ArticleCASPubMedPubMed Central Google Scholar
Schwarz, J. M., Rodelsperger, C., Schuelke, M. & Seelow, D. MutationTaster evaluates disease-causing potential of sequence alterations. Nature Methods7, 575–576 (2010). ArticleCASPubMed Google Scholar
Hicks, S., Wheeler, D. A., Plon, S. E. & Kimmel, M. Prediction of missense mutation functionality depends on both the algorithm and sequence alignment employed. Hum. Mutat.32, 661–668 (2011). ArticleCASPubMedPubMed Central Google Scholar
Cooper, G. M. & Shendure, J. Needles in stacks of needles: finding disease-causal variants in a wealth of genomic data. Nature Rev. Genet.12, 628–640 (2011). A comprehensive Review is presented here of the priors, such as evolutionary knowledge,in silicoprotein effect assessment and others, that can be used to prioritize variants on the basis of putative damaging impact scores. ArticleCASPubMed Google Scholar
Bustamante, C. D. et al. Natural selection on protein-coding genes in the human genome. Nature437, 1153–1157 (2005). ArticleCASPubMed Google Scholar
Asthana, S. et al. Widely distributed noncoding purifying selection in the human genome. Proc. Natl Acad. Sci. USA104, 12410–12415 (2007). ArticleCASPubMedPubMed Central Google Scholar
Stenson, P. D. et al. Human Gene Mutation Database (HGMD): 2003 update. Hum. Mutat.21, 577–581 (2003). ArticleCASPubMed Google Scholar
Morgenthaler, S. & Thilly, W. G. A strategy to discover genes that carry multi-allelic or mono-allelic risk for common diseases: a cohort allelic sums test (CAST). Mutat. Res.615, 28–56 (2007). ArticleCASPubMed Google Scholar
Li, B. & Leal, S. M. Methods for detecting associations with rare variants for common diseases: application to analysis of sequence data. Am. J. Hum. Genet.83, 311–321 (2008). ArticleCASPubMedPubMed Central Google Scholar
Madsen, B. E. & Browning, S. R. A groupwise association test for rare mutations using a weighted sum statistic. PLoS Genet.5, e1000384 (2009). ArticleCASPubMedPubMed Central Google Scholar
Price, A. L. et al. Pooled association tests for rare variants in exon-resequencing studies. Am. J. Hum. Genet.86, 832–838 (2010). ArticlePubMedPubMed Central Google Scholar
Wu, M. C. et al. Rare-variant association testing for sequencing data with the sequence kernel association test. Am. J. Hum. Genet.89, 82–93 (2011). ArticleCASPubMedPubMed Central Google Scholar
Lin, D. Y. & Tang, Z. Z. A general framework for detecting disease associations with rare variants in sequencing studies. Am. J. Hum. Genet.89, 354–367 (2011). ArticleCASPubMedPubMed Central Google Scholar
Basu, S. & Pan, W. Comparison of statistical tests for disease association with rare variants. Genet. Epidemiol.35, 606–619 (2011). ArticlePubMedPubMed Central Google Scholar
Bansal, V., Libiger, O., Torkamani, A. & Schork, N. J. Statistical analysis strategies for association studies involving rare variants. Nature Rev. Genet.11, 773–785 (2010). ArticleCASPubMed Google Scholar
Stitziel, N. O., Kiezun, A. & Sunyaev, S. Computational and statistical approaches to analyzing variants identified by exome sequencing. Genome Biol.12, 227 (2011). ArticlePubMedPubMed Central Google Scholar
Kiezun, A. et al. Exome sequencing and the genetic basis of complex traits. Nature Genet.44, 623–630 (2012). ArticleCASPubMed Google Scholar
Ladouceur, M., Dastani, Z., Aulchenko, Y. S., Greenwood, C. M. & Richards, J. B. The empirical power of rare variant association methods: results from Sanger sequencing in 1,998 individuals. PLoS Genet.8, e1002496 (2012). ArticleCASPubMedPubMed Central Google Scholar
Zhu, Q. et al. A genome-wide comparison of the functional properties of rare and common genetic variants in humans. Am. J. Hum. Genet.88, 458–468 (2011). ArticleCASPubMedPubMed Central Google Scholar
Tennessen, J. A. et al. Evolution and functional impact of rare coding variation from deep sequencing of human exomes. Science337, 64–69 (2012). ArticleCASPubMed Google Scholar
Harrison, P. J. & Weinberger, D. R. Schizophrenia genes, gene expression, and neuropathology: on the matter of their convergence. Mol. Psychiatry10, 40–68 (2005). ArticleCASPubMed Google Scholar
Prathikanti, S. & Weinberger, D. R. Psychiatric genetics—the new era: genetic research and some clinical implications. Br. Med. Bull. 73–74, 107–122 (2005).
Mutsuddi, M. et al. Analysis of high-resolution HapMap of DTNBP1 (Dysbindin) suggests no consistency between reported common variant associations and schizophrenia. Am. J. Hum. Genet.79, 903–909 (2006). ArticleCASPubMedPubMed Central Google Scholar
Hoefen, R. et al. In silico cardiac risk assessment in patients with long QT syndrome: type 1: clinical predictability of cardiac models. J. Am. Coll. Cardiol60, 2182–2191 (2012). ArticlePubMed Google Scholar
Berecki, G., Zegers, J. G., Wilders, R. & Van Ginneken, A. C. Cardiac channelopathies studied with the dynamic action potential-clamp technique. Methods Mol. Biol.403, 233–250 (2007). ArticleCASPubMed Google Scholar
Zareba, W., Moss, A. J. & le Cessie, S. Dispersion of ventricular repolarization and arrhythmic cardiac death in coronary artery disease. Am. J. Cardiol.74, 550–553 (1994). ArticleCASPubMed Google Scholar
Redfern, W. S. et al. Relationships between preclinical cardiac electrophysiology, clinical QT interval prolongation and torsade de pointes for a broad range of drugs: evidence for a provisional safety margin in drug development. Cardiovasc. Res.58, 32–45 (2003). ArticleCASPubMed Google Scholar
Di Ventura, B., Lemerle, C., Michalodimitrakis, K. & Serrano, L. From in vivo to in silico biology and back. Nature443, 527–533 (2006). ArticleCASPubMed Google Scholar
Reid, C. A. et al. Multiple molecular mechanisms for a single GABAA mutation in epilepsy. Neurology80, 1003–1008 (2013). This paper uses an animal model to provide remarkable resolution in dissecting how a single mutation can result in two distinct clinical manifestations with one seizure type resulting from haploinsufficiency and the other from a distinct gain of function. ArticleCASPubMedPubMed Central Google Scholar
Freimuth, J. et al. Epistatic interactions between Tgfb1 and genetic loci, Tgfbm2 and Tgfbm3, determine susceptibility to an asthmatic stimulus. Proc. Natl Acad. Sci. USA109, 18042–18047 (2012). ArticlePubMedPubMed Central Google Scholar
Lehner, B. Genotype to phenotype: lessons from model organisms for human genetics. Nature Rev Genet.14, 168–178 (2013). ArticleCASPubMed Google Scholar
Tiscornia, G., Vivas, E. L. & Izpisua Belmonte, J. C. Diseases in a dish: modeling human genetic disorders using induced pluripotent cells. Nature Med.17, 1570–1576 (2011). ArticleCASPubMed Google Scholar
Overington, J. P., Al-Lazikani, B. & Hopkins, A. L. How many drug targets are there? Nature Rev. Drug Discov.5, 993–996 (2006). ArticleCAS Google Scholar
Consortium, E. P. et al. An integrated encyclopedia of DNA elements in the human genome. Nature489, 57–74 (2012). ArticleCAS Google Scholar
Pruitt, K. D. et al. The consensus coding sequence (CCDS) project: identifying a common protein-coding gene set for the human and mouse genomes. Genome Res.19, 1316–1323 (2009). ArticleCASPubMedPubMed Central Google Scholar
Davydov, E. V. et al. Identifying a high fraction of the human genome to be under selective constraint using GERP++. PLoS Comput. Biol.6, e1001025 (2010). ArticleCASPubMedPubMed Central Google Scholar
Choi, J. W., Kang, D. K., Park, H., deMello, A. J. & Chang, S. I. High-throughput analysis of protein-protein interactions in picoliter-volume droplets using fluorescence polarization. Anal. Chem.84, 3849–3854 (2012). ArticleCASPubMed Google Scholar
Ghosh, S., Matsuoka, Y., Asai, Y., Hsin, K. Y. & Kitano, H. Software for systems biology: from tools to integrated platforms. Nature Rev. Genet.12, 821–832 (2011). ArticleCASPubMed Google Scholar
Owens, J. Determining druggability. Nature Rev. Drug Discov.6, 187 (2007). ArticleCAS Google Scholar
Marth, G. T. et al. A general approach to single-nucleotide polymorphism discovery. Nature Genet.23, 452–456 (1999). ArticleCASPubMed Google Scholar
Bruce, H. A. et al. Long tandem repeats as a form of genomic copy number variation: structure and length polymorphism of a chromosome 5p repeat in control and schizophrenia populations. Psychiatr. Genet.19, 64–71 (2009). ArticlePubMedPubMed Central Google Scholar