Exome sequencing and the genetic basis of complex traits - PubMed (original) (raw)

. 2012 May 29;44(6):623-30.

doi: 10.1038/ng.2303.

Kiran Garimella, Ron Do, Nathan O Stitziel, Benjamin M Neale, Paul J McLaren, Namrata Gupta, Pamela Sklar, Patrick F Sullivan, Jennifer L Moran, Christina M Hultman, Paul Lichtenstein, Patrik Magnusson, Thomas Lehner, Yin Yao Shugart, Alkes L Price, Paul I W de Bakker, Shaun M Purcell, Shamil R Sunyaev

Affiliations

PMID: 22641211
PMCID: PMC3727622
DOI: 10.1038/ng.2303

Exome sequencing and the genetic basis of complex traits

Adam Kiezun et al. Nat Genet. 2012.

Abstract

Exome sequencing is emerging as a popular approach to study the effect of rare coding variants on complex phenotypes. The promise of exome sequencing is grounded in theoretical population genetics and in empirical successes of candidate gene sequencing studies. Many projects aimed at common diseases are underway, and their results are eagerly anticipated. In this Perspective, using exome sequencing data from 438 individuals, we discuss several aspects of exome sequencing studies that we view as particularly important. We review processing and quality control of raw sequence data, evaluate the statistical properties of exome sequencing studies, discuss rare variant burden tests to detect association to phenotypes, and demonstrate the importance of accounting for population stratification in the analysis of rare variants. We conclude that enthusiasm for exome sequencing studies of complex traits should be combined with the caution that thousands of samples may be required to reach sufficient statistical power.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors declare that they have no competing financial interests.

Figures

Figure 1

Discovery of novel variants for increasing numbers of samples. For each functional class, the fold-increase over the number of variants in one sample for that class is plotted as a function of the number of samples in a sequencing experiment. For example, the number of nonsense variants discovered in 300 samples is 40 times greater than the average number discovered in a single sample while the number of synonymous variants is only 10 times greater (although the absolute number of nonsense variants is a relatively minor proportion of the total variation discovered); this effect is due to purifying selection. All classes of variants are discovered at rates exceeding what would be predicted under a neutral model of evolution in a population of constant size, an effect of population growth. The crossing between curves for synonymous variants and the theoretical prediction most likely is a signature of the out-of-Africa bottleneck. See Methods for additional details.

Figure 2

Association analysis. (a) Q-Q plot of association _p_-values under the null hypothesis. (b) Distributions of lowest _p_-values under whole-exome permutations. The histograms show the distributions of the lowest _p_-values across permutations for the T5 test. The red vertical line indicates the 0.05 exome-wide significance level for the most significant gene (i.e., the most significant gene is exome-wide significant if its _p_-value is lower that the level indicated by the red line).

Figure 2

Figure 3

Extrapolation of gene burden results. Horizontal solid red line shows Bonferroni genome-wide significance threshold of P = 2.5 × 10−6. Horizontal dashed line shows the threshold derived from whole-exome permutations (Figure 2b). For larger sample sizes, the permutation threshold would be closer to the Bonferroni threshold, asymptotically approaching it as the sample sizes increase.

Cited by

No evidence that ACE2 or TMPRSS2 drive population disparity in COVID risks.
Pearson NM, Novembre J. Pearson NM, et al. BMC Med. 2024 Aug 26;22(1):337. doi: 10.1186/s12916-024-03539-0. BMC Med. 2024. PMID: 39183295 Free PMC article.
A method to comprehensively identify germline SNVs, INDELs and CNVs from whole exome sequencing data of BRCA1/2 negative breast cancer patients.
Bianchi A, Zelli V, D'Angelo A, Di Matteo A, Scoccia G, Cannita K, Dimas AS, Glentis S, Zazzeroni F, Alesse E, Di Marco A, Tessitore A. Bianchi A, et al. NAR Genom Bioinform. 2024 Apr 17;6(2):lqae033. doi: 10.1093/nargab/lqae033. eCollection 2024 Jun. NAR Genom Bioinform. 2024. PMID: 38633426 Free PMC article.
Functional Neural Networks for High-Dimensional Genetic Data Analysis.
Zhang S, Zhou Y, Geng P, Lu Q. Zhang S, et al. IEEE/ACM Trans Comput Biol Bioinform. 2024 May-Jun;21(3):383-393. doi: 10.1109/TCBB.2024.3364614. Epub 2024 Jun 5. IEEE/ACM Trans Comput Biol Bioinform. 2024. PMID: 38507390
DEVOUR: Deleterious Variants on Uncovered Regions in Whole-Exome Sequencing.
Türk E, Ayaz A, Yüksek A, Süzek BE. Türk E, et al. PeerJ. 2023 Sep 15;11:e16026. doi: 10.7717/peerj.16026. eCollection 2023. PeerJ. 2023. PMID: 37727687 Free PMC article.
Exome-wide screening identifies novel rare risk variants for bone mineral density.
He D, Pan C, Zhao Y, Wei W, Qin X, Cai Q, Shi S, Chu X, Zhang N, Jia Y, Wen Y, Cheng B, Liu H, Feng R, Zhang F, Xu P. He D, et al. Osteoporos Int. 2023 May;34(5):965-975. doi: 10.1007/s00198-023-06710-0. Epub 2023 Feb 28. Osteoporos Int. 2023. PMID: 36849660

References

1. Fuller CW, et al. The challenges of sequencing by synthesis. Nature Biotechnology. 2009;27:1013–1023. - PubMed
1. Rusk N, Kiermer V. Primer: Sequencing—the next generation. Nature Methods. 2008;5:15. - PubMed
1. Metzker ML. Sequencing technologies the next generation. Nature Reviews Genetics. 2009;11:31–46. - PubMed
1. Shendure J, Ji H. Next-generation DNA sequencing. Nature Biotechnology. 2008;26:1135–1145. - PubMed
1. Clarke J, et al. Continuous base identification for single-molecule nanopore DNA sequencing. Nature Nanotechnology. 2009;4:265–270. - PubMed

Exome sequencing and the genetic basis of complex traits - PubMed (original) (raw)

Exome sequencing and the genetic basis of complex traits

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources