Assessing and managing risk when sharing aggregate genetic variant data (original) (raw)
Hirschhorn, J. N. & Daly, M. J. Genome-wide association studies for common diseases and complex traits. Nature Rev. Genet.6, 95–108 (2005). ArticleCASPubMed Google Scholar
Klein, R. J. et al. Complement factor H polymorphism in age-related macular degeneration. Science308, 385–389 (2005). ArticleCASPubMed Google Scholar
Zhernakova, A. et al. Meta-analysis of genome-wide association studies in celiac disease and rheumatoid arthritis identifies fourteen non-HLA shared loci. PLoS Genet.7, e1002004 (2011). ArticleCASPubMed Google Scholar
Hollingworth, P. et al. Common variants at ABCA7, MS4A6A/MS4A4E, EPHA1, CD33 and CD2AP are associated with Alzheimer's disease. Nature Genet.43, 429–435 (2011). ArticleCASPubMed Google Scholar
Schunkert, H. et al. Large-scale association analysis identifies 13 new susceptibility loci for coronary artery disease. Nature Genet.43, 333–338 (2011). ArticleCASPubMed Google Scholar
Teslovich, T. M. et al. Biological, clinical and population relevance of 95 loci for blood lipids. Nature466, 707–713 (2010). ArticleCASPubMed Google Scholar
Kho, A. N. et al. Electronic medical records for genetic research: results of the eMERGE consortium. Sci. Transl. Med.3, 79re1 (2011). ArticlePubMed Google Scholar
Skol, A. D., Scott, L. J., Abecasis, G. R. & Boehnke, M. Joint analysis is more efficient than replication-based analysis for two-stage genome-wide association studies. Nature Genet.38, 209–213 (2006). ArticleCASPubMed Google Scholar
Marchini, J. & Howie, B. Genotype imputation for genome-wide association studies. Nature Rev. Genet.11, 499–511 (2010). CASPubMed Google Scholar
Durbin, R. M. et al. A map of human genome variation from population-scale sequencing. Nature467, 1061–1073 (2010). ArticleCASPubMed Google Scholar
Zheng, S. L. et al. Cumulative association of five genetic variants with prostate cancer. N. Engl. J. Med.358, 910–919 (2008). ArticleCASPubMed Google Scholar
Vacic, V. et al. Duplications of the neuropeptide receptor gene VIPR2 confer significant risk for schizophrenia. Nature471, 499–503 (2011). ArticleCASPubMed Google Scholar
Heeney, C., Hawkins, N., de Vries, J., Boddington, P. & Kaye, J. Assessing the privacy risks of data sharing in genomics. Public Health Genomics14, 17–25 (2011). ArticleCASPubMed Google Scholar
Church, G. et al. Public access to genome-wide data: five views on balancing research with privacy and protection. PLoS Genet.5, e1000665 (2009). ArticlePubMed Google Scholar
Preuss, M. et al. Design of the Coronary ARtery DIsease Genome-Wide Replication And Meta-Analysis (CARDIoGRAM) Study: a genome-wide association meta-analysis involving more than 22 000 cases and 60 000 controls. Circ. Cardiovasc. Genet.3, 475–483 (2010). ArticleCASPubMed Google Scholar
Speliotes, E. K. et al. Association analyses of 249,796 individuals reveal 18 new loci associated with body mass index. Nature Genet.42, 937–948 (2010). ArticleCASPubMed Google Scholar
Cornelis, M. C. et al. The gene, environment association studies consortium (GENEVA): maximizing the knowledge obtained from GWAS by collaboration across studies of multiple conditions. Genet. Epidemiol.34, 364–372 (2010). ArticlePubMed Google Scholar
The Psychiatric GWAS Consortium Steering Committee. A framework for interpreting genome-wide association studies of psychiatric disorders. Mol. Psychiatry14, 10–17 (2009).
Nelson, M. R. et al. The Population Reference Sample, POPRES: a resource for population, disease, and pharmacological genetics research. Am. J. Hum. Genet.83, 347–358 (2008). ArticleCASPubMed Google Scholar
The International HapMap Consortium. The International HapMap Project. Nature426, 789–796 (2003).
Sherry, S. T. et al. dbSNP: the NCBI database of genetic variation. Nucleic Acids Res.29, 308–311 (2001). ArticleCASPubMed Google Scholar
Mailman, M. D. et al. The NCBI dbGaP database of genotypes and phenotypes. Nature Genet.39, 1181–1186 (2007). ArticleCASPubMed Google Scholar
Yu, W., Gwinn, M., Clyne, M., Yesupriya, A. & Khoury, M. J. A navigator for human genome epidemiology. Nature Genet.40, 124–125 (2008). ArticleCASPubMed Google Scholar
Thorisson, G. A. et al. HGVbaseG2P: a central genetic association database. Nucleic Acids Res.37, D797–D802 (2009). ArticleCASPubMed Google Scholar
Hirakawa, M. et al. JSNP: a database of common gene variations in the Japanese population. Nucleic Acids Res.30, 158–162 (2002). ArticleCASPubMed Google Scholar
Hindorff, L. A. et al. PheGenI: an integrated resource for browsing genetic association data. Proc. of the 2011 AMIA Summit on Translational Bioinformatics[online], (2011). Google Scholar
Homer, N. et al. Resolving individuals contributing trace amounts of DNA to highly complex mixtures using high-density SNP genotyping microarrays. PLoS Genet.4, e1000167 (2008). ArticlePubMed Google Scholar
Sankararaman, S., Obozinski, G., Jordan, M. I. & Halperin, E. Genomic privacy and limits of individual detection in a pool. Nature Genet.41, 965–967 (2009). ArticleCASPubMed Google Scholar
Jacobs, K. B. et al. A new statistic and its power to infer membership in a genome-wide association study using genotype frequencies. Nature Genet.41, 1253–1257 (2009). ArticleCASPubMed Google Scholar
Neyman, J. & Pearson, E. On the problem of the most efficient tests of statistical hypotheses. Phil. Trans. R. Soc. Lond. A231, 289–337 (1933). Article Google Scholar
Braun, R., Rowe, W., Schaefer, C., Zhang, J. & Buetow, K. Needles in the haystack: identifying individuals present in pooled genomic data. PLoS Genet.5, e1000668 (2009). ArticlePubMed Google Scholar
Wang, R., Li, Y. F., Wang, X., Tang, H. & Zhou, X. Learning your identity and disease from research papers: information leaks in genome wide association study. Proc. of the 16th ACM Conf. on Computer and Communications Security, 534–544 (2009).
Visscher, P. M. & Hill, W. G. The limits of individual identification from sample allele frequencies: theory and statistical analysis. PLoS Genet.5, e1000628 (2009). ArticlePubMed Google Scholar
Clayton, D. On inferring presence of an individual in a mixture: a Bayesian approach. Biostatistics11, 661–673 (2010). ArticlePubMed Google Scholar
Sampson, J. & Zhao, H. Identifying individuals in a complex mixture of DNA with unknown ancestry. Stat. Appl. Genet. Mol. Biol.8, 37 (2009). Article Google Scholar
Krawczak, M., Goebel, J. W. & Cooper, D. N. Is the NIH policy for sharing GWAS data running the risk of being counterproductive? Investig. Genet.1, 3 (2010). ArticlePubMed Google Scholar
Haga, S. B. & O'Daniel, J. Public perspectives regarding data-sharing practices in genomics research. Public Health Genomics 24 Mar 2011 (doi:10.1159/000324705). ArticleCASPubMed Google Scholar
Malin, B., Karp, D. & Scheuermann, R. H. Technical and policy approaches to balancing patient privacy and data sharing in clinical and translational research. J. Investig. Med.58, 11–18 (2010). ArticlePubMed Google Scholar
Elias-Sonnenschein, L. S., Viechtbauer, W., Ramakers, I. H., Verhey, F. R. & Visser, P. J. Predictive value of _APOE_-ɛ4 allele for progression from MCI to AD-type dementia: a meta-analysis. J. Neurol. Neurosurg. Psychiatry 14 Apr 2011 (doi:10.1136/jnnp.2010.231555). Article Google Scholar
Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet.81, 559–575 (2007). ArticleCASPubMed Google Scholar