Exome sequencing and the genetic basis of complex traits - PubMed (original) (raw)
. 2012 May 29;44(6):623-30.
doi: 10.1038/ng.2303.
Kiran Garimella, Ron Do, Nathan O Stitziel, Benjamin M Neale, Paul J McLaren, Namrata Gupta, Pamela Sklar, Patrick F Sullivan, Jennifer L Moran, Christina M Hultman, Paul Lichtenstein, Patrik Magnusson, Thomas Lehner, Yin Yao Shugart, Alkes L Price, Paul I W de Bakker, Shaun M Purcell, Shamil R Sunyaev
Affiliations
- PMID: 22641211
- PMCID: PMC3727622
- DOI: 10.1038/ng.2303
Exome sequencing and the genetic basis of complex traits
Adam Kiezun et al. Nat Genet. 2012.
Abstract
Exome sequencing is emerging as a popular approach to study the effect of rare coding variants on complex phenotypes. The promise of exome sequencing is grounded in theoretical population genetics and in empirical successes of candidate gene sequencing studies. Many projects aimed at common diseases are underway, and their results are eagerly anticipated. In this Perspective, using exome sequencing data from 438 individuals, we discuss several aspects of exome sequencing studies that we view as particularly important. We review processing and quality control of raw sequence data, evaluate the statistical properties of exome sequencing studies, discuss rare variant burden tests to detect association to phenotypes, and demonstrate the importance of accounting for population stratification in the analysis of rare variants. We conclude that enthusiasm for exome sequencing studies of complex traits should be combined with the caution that thousands of samples may be required to reach sufficient statistical power.
Conflict of interest statement
Competing Interests: The authors declare that they have no competing financial interests.
Figures
Figure 1
Discovery of novel variants for increasing numbers of samples. For each functional class, the fold-increase over the number of variants in one sample for that class is plotted as a function of the number of samples in a sequencing experiment. For example, the number of nonsense variants discovered in 300 samples is 40 times greater than the average number discovered in a single sample while the number of synonymous variants is only 10 times greater (although the absolute number of nonsense variants is a relatively minor proportion of the total variation discovered); this effect is due to purifying selection. All classes of variants are discovered at rates exceeding what would be predicted under a neutral model of evolution in a population of constant size, an effect of population growth. The crossing between curves for synonymous variants and the theoretical prediction most likely is a signature of the out-of-Africa bottleneck. See Methods for additional details.
Figure 2
Association analysis. (a) Q-Q plot of association _p_-values under the null hypothesis. (b) Distributions of lowest _p_-values under whole-exome permutations. The histograms show the distributions of the lowest _p_-values across permutations for the T5 test. The red vertical line indicates the 0.05 exome-wide significance level for the most significant gene (i.e., the most significant gene is exome-wide significant if its _p_-value is lower that the level indicated by the red line).
Figure 2
Association analysis. (a) Q-Q plot of association _p_-values under the null hypothesis. (b) Distributions of lowest _p_-values under whole-exome permutations. The histograms show the distributions of the lowest _p_-values across permutations for the T5 test. The red vertical line indicates the 0.05 exome-wide significance level for the most significant gene (i.e., the most significant gene is exome-wide significant if its _p_-value is lower that the level indicated by the red line).
Figure 3
Extrapolation of gene burden results. Horizontal solid red line shows Bonferroni genome-wide significance threshold of P = 2.5 × 10−6. Horizontal dashed line shows the threshold derived from whole-exome permutations (Figure 2b). For larger sample sizes, the permutation threshold would be closer to the Bonferroni threshold, asymptotically approaching it as the sample sizes increase.
Similar articles
- Guidelines for Large-Scale Sequence-Based Complex Trait Association Studies: Lessons Learned from the NHLBI Exome Sequencing Project.
Auer PL, Reiner AP, Wang G, Kang HM, Abecasis GR, Altshuler D, Bamshad MJ, Nickerson DA, Tracy RP, Rich SS; NHLBI GO Exome Sequencing Project; Leal SM. Auer PL, et al. Am J Hum Genet. 2016 Oct 6;99(4):791-801. doi: 10.1016/j.ajhg.2016.08.012. Epub 2016 Sep 22. Am J Hum Genet. 2016. PMID: 27666372 Free PMC article. - Exome sequencing: capture and sequencing of all human coding regions for disease gene discovery.
Priya RR, Rajasimha HK, Brooks MJ, Swaroop A. Priya RR, et al. Methods Mol Biol. 2012;884:335-51. doi: 10.1007/978-1-61779-848-1_24. Methods Mol Biol. 2012. PMID: 22688718 - Characterization of ADME gene variation in 21 populations by exome sequencing.
Hovelson DH, Xue Z, Zawistowski M, Ehm MG, Harris EC, Stocker SL, Gross AS, Jang IJ, Ieiri I, Lee JE, Cardon LR, Chissoe SL, Abecasis G, Nelson MR. Hovelson DH, et al. Pharmacogenet Genomics. 2017 Mar;27(3):89-100. doi: 10.1097/FPC.0000000000000260. Pharmacogenet Genomics. 2017. PMID: 27984508 Free PMC article. - Case-only exome sequencing and complex disease susceptibility gene discovery: study design considerations.
Wu L, Schaid DJ, Sicotte H, Wieben ED, Li H, Petersen GM. Wu L, et al. J Med Genet. 2015 Jan;52(1):10-6. doi: 10.1136/jmedgenet-2014-102697. Epub 2014 Nov 4. J Med Genet. 2015. PMID: 25371537 Free PMC article. Review. - Exome sequencing as a tool for Mendelian disease gene discovery.
Bamshad MJ, Ng SB, Bigham AW, Tabor HK, Emond MJ, Nickerson DA, Shendure J. Bamshad MJ, et al. Nat Rev Genet. 2011 Sep 27;12(11):745-55. doi: 10.1038/nrg3031. Nat Rev Genet. 2011. PMID: 21946919 Review.
Cited by
- No evidence that ACE2 or TMPRSS2 drive population disparity in COVID risks.
Pearson NM, Novembre J. Pearson NM, et al. BMC Med. 2024 Aug 26;22(1):337. doi: 10.1186/s12916-024-03539-0. BMC Med. 2024. PMID: 39183295 Free PMC article. - A method to comprehensively identify germline SNVs, INDELs and CNVs from whole exome sequencing data of BRCA1/2 negative breast cancer patients.
Bianchi A, Zelli V, D'Angelo A, Di Matteo A, Scoccia G, Cannita K, Dimas AS, Glentis S, Zazzeroni F, Alesse E, Di Marco A, Tessitore A. Bianchi A, et al. NAR Genom Bioinform. 2024 Apr 17;6(2):lqae033. doi: 10.1093/nargab/lqae033. eCollection 2024 Jun. NAR Genom Bioinform. 2024. PMID: 38633426 Free PMC article. - Functional Neural Networks for High-Dimensional Genetic Data Analysis.
Zhang S, Zhou Y, Geng P, Lu Q. Zhang S, et al. IEEE/ACM Trans Comput Biol Bioinform. 2024 May-Jun;21(3):383-393. doi: 10.1109/TCBB.2024.3364614. Epub 2024 Jun 5. IEEE/ACM Trans Comput Biol Bioinform. 2024. PMID: 38507390 - DEVOUR: Deleterious Variants on Uncovered Regions in Whole-Exome Sequencing.
Türk E, Ayaz A, Yüksek A, Süzek BE. Türk E, et al. PeerJ. 2023 Sep 15;11:e16026. doi: 10.7717/peerj.16026. eCollection 2023. PeerJ. 2023. PMID: 37727687 Free PMC article. - Exome-wide screening identifies novel rare risk variants for bone mineral density.
He D, Pan C, Zhao Y, Wei W, Qin X, Cai Q, Shi S, Chu X, Zhang N, Jia Y, Wen Y, Cheng B, Liu H, Feng R, Zhang F, Xu P. He D, et al. Osteoporos Int. 2023 May;34(5):965-975. doi: 10.1007/s00198-023-06710-0. Epub 2023 Feb 28. Osteoporos Int. 2023. PMID: 36849660
References
- Fuller CW, et al. The challenges of sequencing by synthesis. Nature Biotechnology. 2009;27:1013–1023. - PubMed
- Rusk N, Kiermer V. Primer: Sequencing—the next generation. Nature Methods. 2008;5:15. - PubMed
- Metzker ML. Sequencing technologies the next generation. Nature Reviews Genetics. 2009;11:31–46. - PubMed
- Shendure J, Ji H. Next-generation DNA sequencing. Nature Biotechnology. 2008;26:1135–1145. - PubMed
- Clarke J, et al. Continuous base identification for single-molecule nanopore DNA sequencing. Nature Nanotechnology. 2009;4:265–270. - PubMed
Publication types
MeSH terms
Grants and funding
- AI069452/AI/NIAID NIH HHS/United States
- UM1 AI069556/AI/NIAID NIH HHS/United States
- U01 AI069511/AI/NIAID NIH HHS/United States
- AI069450/AI/NIAID NIH HHS/United States
- U01 AI069477/AI/NIAID NIH HHS/United States
- U01 AI069502/AI/NIAID NIH HHS/United States
- U01 AI069513/AI/NIAID NIH HHS/United States
- AI68634/AI/NIAID NIH HHS/United States
- AI69467/AI/NIAID NIH HHS/United States
- UM1 AI069423/AI/NIAID NIH HHS/United States
- AI069415/AI/NIAID NIH HHS/United States
- U01 AI069474/AI/NIAID NIH HHS/United States
- U01 AI069434/AI/NIAID NIH HHS/United States
- AI069556/AI/NIAID NIH HHS/United States
- UM1 AI069501/AI/NIAID NIH HHS/United States
- UM1 AI069472/AI/NIAID NIH HHS/United States
- U01 AI069467/AI/NIAID NIH HHS/United States
- U01 AI069423/AI/NIAID NIH HHS/United States
- UM1 AI069513/AI/NIAID NIH HHS/United States
- UL1 TR000445/TR/NCATS NIH HHS/United States
- AI069465/AI/NIAID NIH HHS/United States
- R01 MH089208/MH/NIMH NIH HHS/United States
- R01 MH095034/MH/NIMH NIH HHS/United States
- AI069419/AI/NIAID NIH HHS/United States
- AI069511/AI/NIAID NIH HHS/United States
- AI46370/AI/NIAID NIH HHS/United States
- AI077505/AI/NIAID NIH HHS/United States
- U01 AI027661/AI/NIAID NIH HHS/United States
- UM1 AI069424/AI/NIAID NIH HHS/United States
- R01 AI077505/AI/NIAID NIH HHS/United States
- RC2 MH089905/MH/NIMH NIH HHS/United States
- U01 AI069465/AI/NIAID NIH HHS/United States
- UM1 AI069434/AI/NIAID NIH HHS/United States
- UM1 AI069432/AI/NIAID NIH HHS/United States
- AI069501/AI/NIAID NIH HHS/United States
- 5R01 MH084676/MH/NIMH NIH HHS/United States
- AI069532/AI/NIAID NIH HHS/United States
- AI069474/AI/NIAID NIH HHS/United States
- AI069434/AI/NIAID NIH HHS/United States
- UM1 AI069495/AI/NIAID NIH HHS/United States
- U01 AI069484/AI/NIAID NIH HHS/United States
- UM1 AI069471/AI/NIAID NIH HHS/United States
- U01 AI069556/AI/NIAID NIH HHS/United States
- CAPMC/ CIHR/Canada
- UM1 AI069428/AI/NIAID NIH HHS/United States
- AI069513/AI/NIAID NIH HHS/United States
- AI069502/AI/NIAID NIH HHS/United States
- TL1 RR024978/RR/NCRR NIH HHS/United States
- U01 AI069532/AI/NIAID NIH HHS/United States
- U01 AI034835/AI/NIAID NIH HHS/United States
- AI069423/AI/NIAID NIH HHS/United States
- AI30914/AI/NIAID NIH HHS/United States
- U01 AI069428/AI/NIAID NIH HHS/United States
- KL2 RR024977/RR/NCRR NIH HHS/United States
- UM1 AI069415/AI/NIAID NIH HHS/United States
- AI27661/AI/NIAID NIH HHS/United States
- UM1 AI069484/AI/NIAID NIH HHS/United States
- AI069428/AI/NIAID NIH HHS/United States
- AI34853/AI/NIAID NIH HHS/United States
- UM1 AI068634/AI/NIAID NIH HHS/United States
- U01 AI069501/AI/NIAID NIH HHS/United States
- AI069495/AI/NIAID NIH HHS/United States
- UM1 AI069477/AI/NIAID NIH HHS/United States
- U01 AI069432/AI/NIAID NIH HHS/United States
- AI069484/AI/NIAID NIH HHS/United States
- AI068636/AI/NIAID NIH HHS/United States
- U01 AI069450/AI/NIAID NIH HHS/United States
- RR024975/RR/NCRR NIH HHS/United States
- U01 AI046370/AI/NIAID NIH HHS/United States
- U01 AI068636/AI/NIAID NIH HHS/United States
- UM1 AI069474/AI/NIAID NIH HHS/United States
- U01 AI034853/AI/NIAID NIH HHS/United States
- R37 AI028568/AI/NIAID NIH HHS/United States
- 1R01MH089208-01/MH/NIMH NIH HHS/United States
- KL2 TR000446/TR/NCATS NIH HHS/United States
- UM1 AI069452/AI/NIAID NIH HHS/United States
- AI069424/AI/NIAID NIH HHS/United States
- RC2MH089905/MH/NIMH NIH HHS/United States
- U01 AI069415/AI/NIAID NIH HHS/United States
- U01 AI069495/AI/NIAID NIH HHS/United States
- UM1 AI069502/AI/NIAID NIH HHS/United States
- U01 AI025859/AI/NIAID NIH HHS/United States
- UM1 AI069450/AI/NIAID NIH HHS/United States
- TL1 TR000447/TR/NCATS NIH HHS/United States
- AI34835/AI/NIAID NIH HHS/United States
- UM1 AI069532/AI/NIAID NIH HHS/United States
- R01 MH084676/MH/NIMH NIH HHS/United States
- R01 HG005827/HG/NHGRI NIH HHS/United States
- U01 AI069424/AI/NIAID NIH HHS/United States
- UM1 AI069465/AI/NIAID NIH HHS/United States
- AL32782/PHS HHS/United States
- U01 AI032782/AI/NIAID NIH HHS/United States
- UL1 RR024975/RR/NCRR NIH HHS/United States
- AI069472/AI/NIAID NIH HHS/United States
- UM1 AI069419/AI/NIAID NIH HHS/United States
- AI28568/AI/NIAID NIH HHS/United States
- AI069432/AI/NIAID NIH HHS/United States
- R01 AI028568/AI/NIAID NIH HHS/United States
- T32 HL007604/HL/NHLBI NIH HHS/United States
- U01 AI069419/AI/NIAID NIH HHS/United States
- R01 AI030914/AI/NIAID NIH HHS/United States
- AI25859/AI/NIAID NIH HHS/United States
- UM1 AI069511/AI/NIAID NIH HHS/United States
- U01 AI069452/AI/NIAID NIH HHS/United States
- UM1 AI069467/AI/NIAID NIH HHS/United States
- AI38844/AI/NIAID NIH HHS/United States
- AI069477/AI/NIAID NIH HHS/United States
- U01 AI038844/AI/NIAID NIH HHS/United States
- AI069471/AI/NIAID NIH HHS/United States
- T32-HL07604-25/HL/NHLBI NIH HHS/United States
- U01 AI069471/AI/NIAID NIH HHS/United States
- U01 AI069472/AI/NIAID NIH HHS/United States
- U01 AI068634/AI/NIAID NIH HHS/United States
- UM1 AI068636/AI/NIAID NIH HHS/United States
LinkOut - more resources
Full Text Sources