Whole population, genome-wide mapping of hidden relatedness - PubMed (original) (raw)
Comparative Study
Whole population, genome-wide mapping of hidden relatedness
Alexander Gusev et al. Genome Res. 2009 Feb.
Abstract
We present GERMLINE, a robust algorithm for identifying segmental sharing indicative of recent common ancestry between pairs of individuals. Unlike methods with comparable objectives, GERMLINE scales linearly with the number of samples, enabling analysis of whole-genome data in large cohorts. Our approach is based on a dictionary of haplotypes that is used to efficiently discover short exact matches between individuals. We then expand these matches using dynamic programming to identify long, nearly identical segmental sharing that is indicative of relatedness. We use GERMLINE to comprehensively survey hidden relatedness both in the HapMap as well as in a densely typed island population of 3000 individuals. We verify that GERMLINE is in concordance with other methods when they can process the data, and also facilitates analysis of larger scale studies. We bolster these results by demonstrating novel applications of precise analysis of hidden relatedness for (1) identification and resolution of phasing errors and (2) exposing polymorphic deletions that are otherwise challenging to detect. This finding is supported by concordance of detected deletions with other evidence from independent databases and statistical analyses of fluorescence intensity not used by GERMLINE.
Figures
Figure 1.
Expected and detected genome-wide sharing (Kosrae Cohort). (Blue, ▲) Expected genome-wide sharing; (green, ■) detected genome-wide sharing.
Figure 2.
PLINK metrics and share length for equally related pairs. Comparison of PLINK and _Z_1 values with GERMLINE share length for individuals of equal relationship coefficients. (Left) PLINK values; (middle) PLINK _Z_1 values; (right) GERMLINE share length (cM). Error bars, 99% CI.
Figure 3.
Candidate deletion fluorescence intensity. (Black line) Population average; (open circles) individuals identified as having the deletion; (blue bar at bottom) deletion identified by GERMLINE.
Figure 4.
Verified candidate deletion regions (top 200). (dbGV) Identified in Database of Genomic Variants; (CNAT) verified by Affymetrix Copy Number Analysis Tool; (Intensity) verified as deviation from population average intensity.
Similar articles
- Unexpected relationships and inbreeding in HapMap phase III populations.
Stevens EL, Baugher JD, Shirley MD, Frelin LP, Pevsner J. Stevens EL, et al. PLoS One. 2012;7(11):e49575. doi: 10.1371/journal.pone.0049575. Epub 2012 Nov 19. PLoS One. 2012. PMID: 23185369 Free PMC article. - LDMAP: the construction of high-resolution linkage disequilibrium maps of the human genome.
Kuo TY, Lau W, Collins AR. Kuo TY, et al. Methods Mol Biol. 2007;376:47-57. doi: 10.1007/978-1-59745-389-9_4. Methods Mol Biol. 2007. PMID: 17984537 - Inbreeding coefficient estimation with dense SNP data: comparison of strategies and application to HapMap III.
Gazal S, Sahbatou M, Perdry H, Letort S, Génin E, Leutenegger AL. Gazal S, et al. Hum Hered. 2014;77(1-4):49-62. doi: 10.1159/000358224. Epub 2014 Jul 29. Hum Hered. 2014. PMID: 25060269 - Navigating the HapMap.
Barnes MR. Barnes MR. Brief Bioinform. 2006 Sep;7(3):211-24. doi: 10.1093/bib/bbl021. Epub 2006 Jul 28. Brief Bioinform. 2006. PMID: 16877472 Review. - Computing genetic imprinting expressed by haplotypes.
Cheng Y, Berg A, Wu S, Li Y, Wu R. Cheng Y, et al. Methods Mol Biol. 2009;573:189-212. doi: 10.1007/978-1-60761-247-6_11. Methods Mol Biol. 2009. PMID: 19763929 Review.
Cited by
- Length distributions of identity by descent reveal fine-scale demographic history.
Palamara PF, Lencz T, Darvasi A, Pe'er I. Palamara PF, et al. Am J Hum Genet. 2012 Nov 2;91(5):809-22. doi: 10.1016/j.ajhg.2012.08.030. Epub 2012 Oct 25. Am J Hum Genet. 2012. PMID: 23103233 Free PMC article. - High mammographic density in women of Ashkenazi Jewish descent.
Caswell JL, Kerlikowske K, Shepherd JA, Cummings SR, Hu D, Huntsman S, Ziv E. Caswell JL, et al. Breast Cancer Res. 2013 May 13;15(3):R40. doi: 10.1186/bcr3424. Breast Cancer Res. 2013. PMID: 23668689 Free PMC article. - HaploShare: identification of extended haplotypes shared by cases and evaluation against controls.
Ying D, Sham PC, Smith DK, Zhang L, Lau YL, Yang W. Ying D, et al. Genome Biol. 2015 May 9;16(1):92. doi: 10.1186/s13059-015-0662-9. Genome Biol. 2015. PMID: 25956955 Free PMC article. - Identifying bipolar disorder susceptibility loci in a densely affected pedigree.
Collins AL, Kim Y, Szatkiewicz JP, Bloom RJ, Hilliard CE, Quackenbush CR, Meier S, Rivas F, Mayoral F, Cichon S, Nöthen MM, Rietschel M, Sullivan PF. Collins AL, et al. Mol Psychiatry. 2013 Dec;18(12):1245-6. doi: 10.1038/mp.2012.176. Epub 2012 Dec 18. Mol Psychiatry. 2013. PMID: 23247078 Free PMC article. No abstract available. - Fast and Accurate Shared Segment Detection and Relatedness Estimation in Un-phased Genetic Data via TRUFFLE.
Dimitromanolakis A, Paterson AD, Sun L. Dimitromanolakis A, et al. Am J Hum Genet. 2019 Jul 3;105(1):78-88. doi: 10.1016/j.ajhg.2019.05.007. Epub 2019 Jun 6. Am J Hum Genet. 2019. PMID: 31178127 Free PMC article.
References
- Altschul S.F., Gish W., Miller W., Myers E.W., Lipman D.J. Basic local alignment search tool. J. Mol. Biol. 1990;215:403–410. - PubMed
- Ayers K.L., Sabatti C., Lange K. Reconstructing ancestral haplotypes with a dictionary model. J. Comput. Biol. 2006;13:767–785. - PubMed
Publication types
MeSH terms
Grants and funding
- 5R01 DK60089/DK/NIDDK NIH HHS/United States
- R01 DK060089/DK/NIDDK NIH HHS/United States
- U54 CA121852/CA/NCI NIH HHS/United States
- Howard Hughes Medical Institute/United States
- 5 U54 CA121852/CA/NCI NIH HHS/United States
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials