Estimation of effect size distribution from genome-wide association studies and implications for future discoveries - PubMed (original) (raw)

Estimation of effect size distribution from genome-wide association studies and implications for future discoveries

Ju-Hyun Park et al. Nat Genet. 2010 Jul.

Abstract

We report a set of tools to estimate the number of susceptibility loci and the distribution of their effect sizes for a trait on the basis of discoveries from existing genome-wide association studies (GWASs). We propose statistical power calculations for future GWASs using estimated distributions of effect sizes. Using reported GWAS findings for height, Crohn's disease and breast, prostate and colorectal (BPC) cancers, we determine that each of these traits is likely to harbor additional loci within the spectrum of low-penetrance common variants. These loci, which can be identified from sufficiently powerful GWASs, together could explain at least 15-20% of the known heritability of these traits. However, for BPC cancers, which have modest familial aggregation, our analysis suggests that risk models based on common variants alone will have modest discriminatory power (63.5% area under curve), even with new discoveries.

PubMed Disclaimer

Figures

Figure 1

Figure 1

Nonparametric estimates for distributions of effect sizes for susceptibility loci. (a) Curves based only on observed susceptibility loci; these curves are distorted because loci with larger effect sizes are more likely to have been detected. (b) Curves based on estimated susceptibility loci, representative of the population of all susceptibility loci. (c) Estimated nonparametric distributions after normalization over the common observed range for the three traits.

Figure 2

Figure 2

Receiver operating characteristic curves for genetic risk models. (a,b) Curves for Crohn’s disease (a) and BPC cancers (b). AUC is a measure of the discriminatory power of the risk model. Blue, a theoretical genetic risk model that explains all of the known familial risk of the trait. Green, a risk model that includes all of the susceptibility loci (142 for Crohn’s disease and 67 on average for BPC cancers) estimated to exist within the range of effect sizes seen in the current GWASs. Red, a risk model that includes only known susceptibility loci (~30 for Crohn’s disease and ~7 on average for each of the BPC cancers), which we used to estimate the distribution of effect sizes of these traits. Black, reference line corresponding to a model without discriminatory power in which cases have the same distribution of risk as controls.

Similar articles

Cited by

References

    1. Manolio TA, et al. Finding the missing heritability of complex diseases. Nature. 2009;461:747–753. - PMC - PubMed
    1. Hirschhorn JN. Genomewide association studies–illuminating biologic pathways. N. Engl. J. Med. 2009;360:1699–1701. - PubMed
    1. Goldstein DB. Common genetic variation and human traits. N. Engl. J. Med. 2009;360:1696–1698. - PubMed
    1. Kraft P, et al. Beyond odds ratios–communicating disease risk based on genetic profiles. Nat. Rev. Genet. 2009;10:264–269. - PubMed
    1. Pharoah PD, et al. Polygenic susceptibility to breast cancer and implications for prevention. Nat. Genet. 2002;31:33–36. - PubMed

Publication types

MeSH terms

LinkOut - more resources