Genotype, haplotype and copy-number variation in worldwide human populations (original) (raw)

Accession codes

Primary accessions

Gene Expression Omnibus

Data deposits

The array data described in this paper are deposited in the Gene Expression Omnibus (http://www.ncbi.nlm.nih.gov/geo) under accession number GSE10331.

References

  1. The International Haplotype Map Consortium. A haplotype map of the human genome. Nature 437, 1299–1320 (2005)
  2. Hinds, D. A. et al. Whole-genome patterns of common DNA variation in three human populations. Science 307, 1072–1079 (2005)
    Article CAS ADS Google Scholar
  3. Redon, R. et al. Global variation in copy number in the human genome. Nature 444, 444–454 (2006)
    Article CAS ADS Google Scholar
  4. Cann, H. M. et al. A human genome diversity cell line panel. Science 296, 261–262 (2002)
    Article CAS Google Scholar
  5. Kalinowski, S. T. Counting alleles with rarefaction: private alleles and hierarchical sampling designs. Conserv. Genet. 5, 539–543 (2004)
    Article CAS Google Scholar
  6. Falush, D., Stephens, M. & Pritchard, J. K. Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies. Genetics 164, 1567–1587 (2003)
    CAS PubMed PubMed Central Google Scholar
  7. Bastos-Rodrigues, L., Pimenta, J. R. & Pena, S. D. J. The genetic structure of human populations studied through short insertion–deletion polymorphisms. Ann. Hum. Genet. 70, 658–665 (2006)
    Article Google Scholar
  8. Rosenberg, N. A. et al. Clines, clusters, and the effect of study design on the inference of human population structure. PLoS Genet. 1, e70 (2005)
    Article Google Scholar
  9. Rosenberg, N. A. et al. Genetic structure of human populations. Science 298, 2381–2385 (2002)
    Article CAS ADS Google Scholar
  10. Lawson Handley, L. J., Manica, A., Goudet, J. & Balloux, F. Going the distance: human population genetics in a clinal world. Trends Genet. 23, 432–439 (2007)
    Article Google Scholar
  11. Ramachandran, S. et al. Support from the relationship of genetic and geographic distance in human populations for a serial founder effect originating in Africa. Proc. Natl Acad. Sci. USA 102, 15942–15947 (2005)
    Article CAS ADS Google Scholar
  12. Sabatti, C. & Risch, N. Homozygosity and linkage disequilibrium. Genetics 160, 1707–1719 (2002)
    PubMed PubMed Central Google Scholar
  13. Conrad, D. F. et al. A worldwide survey of haplotype variation and linkage disequilibrium in the human genome. Nature Genet. 38, 1251–1260 (2006)
    Article CAS Google Scholar
  14. Gabriel, S. B. et al. The structure of haplotype blocks in the human genome. Science 296, 2225–2229 (2002)
    Article CAS ADS Google Scholar
  15. Reich, D. E. et al. Linkage disequilibrium in the human genome. Nature 411, 199–204 (2001)
    Article CAS ADS Google Scholar
  16. Tishkoff, S. A. & Kidd, K. K. Implications of biogeography of human populations for ‘race’ and medicine. Nature Genet. 36, S21–S27 (2004)
    Article CAS Google Scholar
  17. McVean, G. A. T. A genealogical interpretation of linkage disequilibrium. Genetics 162, 987–991 (2002)
    PubMed PubMed Central Google Scholar
  18. Bersaglieri, T. et al. Genetic signatures of strong recent positive selection at the lactase gene. Am. J. Hum. Genet. 74, 1111–1120 (2004)
    Article CAS Google Scholar
  19. Tishkoff, S. A. et al. Convergent adaptation of human lactase persistence in Africa and Europe. Nature Genet. 39, 31–40 (2007)
    Article CAS Google Scholar
  20. Wang, K. et al. PennCNV: an integrated hidden Markov model designed for high-resolution copy number variation detection in whole-genome SNP genotyping data. Genome Res. 17, 1665–1674 (2007)
    Article CAS Google Scholar
  21. Wong, K. K. et al. A comprehensive analysis of common copy-number variations in the human genome. Am. J. Hum. Genet. 80, 91–104 (2007)
    Article CAS Google Scholar
  22. Locke, D. P. et al. Linkage disequilibrium and heritability of copy-number polymorphisms within duplicated regions of the human genome. Am. J. Hum. Genet. 79, 275–290 (2006)
    Article CAS Google Scholar
  23. Sharp, A. J. et al. Segmental duplications and copy-number variation in the human genome. Am. J. Hum. Genet. 77, 78–88 (2005)
    Article CAS Google Scholar
  24. Scherer, S. W. et al. Challenges and standards in integrating surveys of structural variation. Nature Genet. 39, S7–S15 (2007)
    Article CAS Google Scholar
  25. Servin, B. & Stephens, M. Imputation-based analysis of association studies: candidate regions and quantitative traits. PLoS Genet. 3, e114 (2007)
    Article Google Scholar
  26. Need, A. C. & Goldstein, D. B. Genome-wide tagging for everyone. Nature Genet. 38, 1227–1228 (2006)
    Article CAS Google Scholar
  27. Eberle, M. A., Rieder, M. J., Kruglyak, L. & Nickerson, D. A. Allele frequency matching between SNPs reveals an excess of linkage disequilibrium in genic regions of the human genome. PLoS Genet. 2, e142 (2006)
    Article Google Scholar
  28. Scheet, P. & Stephens, M. A fast and flexible statistical model for large-scale population genotype data: applications to inferring missing genotypes and haplotypic phase. Am. J. Hum. Genet. 78, 629–644 (2006)
    Article CAS Google Scholar
  29. Jakobsson, M. & Rosenberg, N. A. CLUMPP: a cluster matching and permutation program for dealing with label switching and multimodality in analysis of population structure. Bioinformatics 23, 1801–1806 (2007)
    Article CAS Google Scholar
  30. Zhang, J., Feuk, L., Duggan, G. E., Khaja, R. & Scherer, S. W. Development of bioinformatics resources for display and analysis of copy number and other structural variants in the human genome. Cytogenet. Genome Res. 115, 205–214 (2006)
    Article CAS Google Scholar

Download references

Acknowledgements

We thank the Biological Resource Center at the Fondation Jean Dausset – CEPH for preparing HGDP–CEPH diversity panel DNA samples, and S. Chanock and A. Hutchinson for assistance with the DNAs. This work was supported in part by NIH grants, by a postdoctoral fellowship from the University of Michigan Center for Genetics in Health and Medicine, by grants from the Alfred P. Sloan Foundation and the Burroughs Wellcome Fund, by the National Center for Minority Health and Health Disparities, and by the Intramural Program of the National Institute on Aging. The study used the Biowulf Linux cluster at the National Institutes of Health (http://biowulf.nih.gov).

Author Contributions N.A.R. and A.B.S. wish to be regarded as joint last authors.

Author information

Author notes

  1. Mattias Jakobsson, Sonja W. Scholz and Paul Scheet: These authors contributed equally to this work.

Authors and Affiliations

  1. Center for Computational Medicine and Biology,,
    Mattias Jakobsson, Paul Scheet, Jenna M. VanLiere, Zachary A. Szpiech, James H. Degnan & Noah A. Rosenberg
  2. Department of Human Genetics,,
    Mattias Jakobsson, James H. Degnan & Noah A. Rosenberg
  3. Department of Biostatistics, University of Michigan, Ann Arbor, Michigan 48109, USA,
    Paul Scheet & Noah A. Rosenberg
  4. Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, Maryland 20892, USA,
    Sonja W. Scholz, J. Raphael Gibbs, Hon-Chung Fung, Rita Guerreiro, Jose M. Bras, Jennifer C. Schymick, Dena G. Hernandez, Bryan J. Traynor, Javier Simon-Sanchez, Mar Matarin, Angela Britton, Joyce van de Leemput, Ian Rafferty & Andrew B. Singleton
  5. Department of Molecular Neuroscience and Reta Lila Weston Institute of Neurological Studies, Institute of Neurology, University College London, Queen Square, London WC1N 3BG, UK,
    Sonja W. Scholz, J. Raphael Gibbs, Joyce van de Leemput & John A. Hardy
  6. Department of Neurology, Chang Gung Memorial Hospital and College of Medicine, Chang Gung University, Taipei 10591, Taiwan
    Hon-Chung Fung
  7. Department of Genetics, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA,
    Kai Wang & Maja Bucan
  8. Center for Neurosciences and Cell Biology, Faculty of Medicine, University of Coimbra, 3004-504 Coimbra, Portugal
    Rita Guerreiro & Jose M. Bras
  9. Department of Clinical Neurology, University of Oxford, John Radcliffe Hospital, Oxford OX3 9DU, UK
    Jennifer C. Schymick
  10. Neurogenetics Branch, National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, Maryland 20892, USA,
    Bryan J. Traynor
  11. Departamento de Genómica y Proteómica, Unidad de Genética Molecular, Instituto de Biomedicina de Valencia-CSIC, 46010, Valencia, Spain,
    Javier Simon-Sanchez
  12. Fondation Jean Dausset – Centre d’Étude du Polymorphisme Humain (CEPH), 27 rue Juliette Dodu, 75010 Paris, France,
    Howard M. Cann
  13. Center for Public Health Genomics, University of Virginia, Charlottesville, Virginia 22908, USA,
    Andrew B. Singleton

Authors

  1. Mattias Jakobsson
    You can also search for this author inPubMed Google Scholar
  2. Sonja W. Scholz
    You can also search for this author inPubMed Google Scholar
  3. Paul Scheet
    You can also search for this author inPubMed Google Scholar
  4. J. Raphael Gibbs
    You can also search for this author inPubMed Google Scholar
  5. Jenna M. VanLiere
    You can also search for this author inPubMed Google Scholar
  6. Hon-Chung Fung
    You can also search for this author inPubMed Google Scholar
  7. Zachary A. Szpiech
    You can also search for this author inPubMed Google Scholar
  8. James H. Degnan
    You can also search for this author inPubMed Google Scholar
  9. Kai Wang
    You can also search for this author inPubMed Google Scholar
  10. Rita Guerreiro
    You can also search for this author inPubMed Google Scholar
  11. Jose M. Bras
    You can also search for this author inPubMed Google Scholar
  12. Jennifer C. Schymick
    You can also search for this author inPubMed Google Scholar
  13. Dena G. Hernandez
    You can also search for this author inPubMed Google Scholar
  14. Bryan J. Traynor
    You can also search for this author inPubMed Google Scholar
  15. Javier Simon-Sanchez
    You can also search for this author inPubMed Google Scholar
  16. Mar Matarin
    You can also search for this author inPubMed Google Scholar
  17. Angela Britton
    You can also search for this author inPubMed Google Scholar
  18. Joyce van de Leemput
    You can also search for this author inPubMed Google Scholar
  19. Ian Rafferty
    You can also search for this author inPubMed Google Scholar
  20. Maja Bucan
    You can also search for this author inPubMed Google Scholar
  21. Howard M. Cann
    You can also search for this author inPubMed Google Scholar
  22. John A. Hardy
    You can also search for this author inPubMed Google Scholar
  23. Noah A. Rosenberg
    You can also search for this author inPubMed Google Scholar
  24. Andrew B. Singleton
    You can also search for this author inPubMed Google Scholar

Corresponding authors

Correspondence toNoah A. Rosenberg or Andrew B. Singleton.

Supplementary information

Supplementary Information

This file contains extensive Supplementary Information with Supplementary Notes, Supplementary Data, Supplementary Tables S1-S17, Supplementary Figures S1-S30 with Legends and additional references. (PDF 10195 kb)

Rights and permissions

About this article

Cite this article

Jakobsson, M., Scholz, S., Scheet, P. et al. Genotype, haplotype and copy-number variation in worldwide human populations.Nature 451, 998–1003 (2008). https://doi.org/10.1038/nature06742

Download citation