Estimating kinship in admixed populations - PubMed (original) (raw)
Estimating kinship in admixed populations
Timothy Thornton et al. Am J Hum Genet. 2012.
Abstract
Genome-wide association studies (GWASs) are commonly used for the mapping of genetic loci that influence complex traits. A problem that is often encountered in both population-based and family-based GWASs is that of identifying cryptic relatedness and population stratification because it is well known that failure to appropriately account for both pedigree and population structure can lead to spurious association. A number of methods have been proposed for identifying relatives in samples from homogeneous populations. A strong assumption of population homogeneity, however, is often untenable, and many GWASs include samples from structured populations. Here, we consider the problem of estimating relatedness in structured populations with admixed ancestry. We propose a method, REAP (relatedness estimation in admixed populations), for robust estimation of identity by descent (IBD)-sharing probabilities and kinship coefficients in admixed populations. REAP appropriately accounts for population structure and ancestry-related assortative mating by using individual-specific allele frequencies at SNPs that are calculated on the basis of ancestry derived from whole-genome analysis. In simulation studies with related individuals and admixture from highly divergent populations, we demonstrate that REAP gives accurate IBD-sharing probabilities and kinship coefficients. We apply REAP to the Mexican Americans in Los Angeles, California (MXL) population sample of release 3 of phase III of the International Haplotype Map Project; in this sample, we identify third- and fourth-degree relatives who have not previously been reported. We also apply REAP to the African American and Hispanic samples from the Women's Health Initiative SNP Health Association Resource (WHI-SHARe) study, in which hundreds of pairs of cryptically related individuals have been identified.
Copyright © 2012 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Figures
Figure 1
Kinship Coefficients Plotted against Zero-IBD-Sharing Probabilities Estimated kinship coefficients plotted against zero-IBD-sharing-probability estimates for three population-structure settings. (A, C, and E) Scatter plots comparing the REAP kinship-coefficient estimator from Equation 3 with the REAP zero-IBD-sharing-probability estimator from Equation 4 for population-structure settings 1 (A), 2 (C), and 3 (E). (B, D, and F) Scatter plots comparing the homogeneous-population kinship-coefficient estimator from Equation 1 with the homogeneous-population zero-IBD-sharing-probability estimator from Equation 2 for population-structure settings 1 (B), 2 (D), and 3 (F). Zero-IBD-sharing-probability and kinship-coefficient estimates were calculated with 10,000 simulated random SNPs.
Figure 2
KING-Robust and REAP Kinship-Coefficient Histograms for Unrelated Pairs with Admixture (A and B) Histograms of kinship coefficients estimated with the KING-robust kinship-coefficient estimator (A) and the REAP kinship-coefficient estimator from Equation 3 (B) for all pairs of unrelated individuals in population-structure setting 2. The vertical line at 0 in each histogram represents the true kinship coefficient for all pairs. Kinship-coefficient estimates were calculated with 10,000 simulated random SNPs.
Figure 3
Individual-Ancestry Estimates for HapMap MXL Individual-ancestry estimates for 86 HapMap MXL sample individuals from a supervised structure analysis with the frappe software program. In the figure, each individual is represented by a vertical bar; European (HapMap CEU) and African (HapMap YRI) ancestry contributions are in blue and red, respectively, and Native American (HGDP samples from the Americas) ancestry contributions are in green.
Figure 4
REAP Kinship Coefficients versus Zero-IBD-Sharing Probabilities for HapMap MXL REAP kinship-coefficient estimates are plotted against REAP zero-IBD-sharing-probability estimates for the HapMap MXL sample. REAP estimates were calculated with the kinship-coefficient and zero-IBD-sharing-probability estimators from Equations 3 and 4, respectively. Relative pairs were classified on the basis of kinship-coefficient and zero-IBD-sharing-probability estimates.
Figure 5
Example of an Extended Pedigree Reconstructed with REAP in HapMap MXL REAP-inferred pedigree relationships for four HapMap-reported pedigrees from the MXL sample are given. HapMap-reported pedigree relationships are circled, and HapMap-reported pedigree identification numbers (M008, 2382, M011, and M012) are given in bold font in each of the circles.
Figure 6
Example of Two HapMap MXL Pedigrees Connected with REAP Pedigree relationships for two HapMap-reported pedigrees from the MXL sample are given. HapMap-reported pedigree relationships are circled, and HapMap-reported pedigree identification numbers (M007 and M032) are given in bold font in each of the circles.
Figure 7
REAP Kinship Coefficients versus Zero-IBD-Sharing Probabilities for WHI-SHARe (A and B) REAP kinship-coefficient estimates are plotted against REAP zero-IBD-sharing-probability estimates for the WHI-SHARe self-reported African Americans and self-reported Hispanics, respectively. REAP estimates were calculated with the kinship-coefficient and zero-IBD-sharing-probability estimators from Equations 3 and 4, respectively.
Similar articles
- Genome-wide Association Studies in Ancestrally Diverse Populations: Opportunities, Methods, Pitfalls, and Recommendations.
Peterson RE, Kuchenbaecker K, Walters RK, Chen CY, Popejoy AB, Periyasamy S, Lam M, Iyegbe C, Strawbridge RJ, Brick L, Carey CE, Martin AR, Meyers JL, Su J, Chen J, Edwards AC, Kalungi A, Koen N, Majara L, Schwarz E, Smoller JW, Stahl EA, Sullivan PF, Vassos E, Mowry B, Prieto ML, Cuellar-Barboza A, Bigdeli TB, Edenberg HJ, Huang H, Duncan LE. Peterson RE, et al. Cell. 2019 Oct 17;179(3):589-603. doi: 10.1016/j.cell.2019.08.051. Epub 2019 Oct 10. Cell. 2019. PMID: 31607513 Free PMC article. Review. - Detecting Heterogeneity in Population Structure Across the Genome in Admixed Populations.
McHugh C, Brown L, Thornton TA. McHugh C, et al. Genetics. 2016 Sep;204(1):43-56. doi: 10.1534/genetics.115.184184. Epub 2016 Jul 20. Genetics. 2016. PMID: 27440868 Free PMC article. - Genome-wide Significance Thresholds for Admixture Mapping Studies.
Grinde KE, Brown LA, Reiner AP, Thornton TA, Browning SR. Grinde KE, et al. Am J Hum Genet. 2019 Mar 7;104(3):454-465. doi: 10.1016/j.ajhg.2019.01.008. Epub 2019 Feb 14. Am J Hum Genet. 2019. PMID: 30773276 Free PMC article. - Inference of kinship using spatial distributions of SNPs for genome-wide association studies.
Lee H, Chen L. Lee H, et al. BMC Genomics. 2016 May 20;17:372. doi: 10.1186/s12864-016-2696-0. BMC Genomics. 2016. PMID: 27206321 Free PMC article. - New approaches to disease mapping in admixed populations.
Seldin MF, Pasaniuc B, Price AL. Seldin MF, et al. Nat Rev Genet. 2011 Jun 28;12(8):523-8. doi: 10.1038/nrg3002. Nat Rev Genet. 2011. PMID: 21709689 Free PMC article. Review.
Cited by
- Epigenetic aging differences between Wichí and Criollos from Argentina: Insights from genomic history and ecology.
Iannuzzi V, Sarno S, Sazzini M, Abondio P, Sala C, Bacalini MG, Gentilini D, Calzari L, Masciotta F, Garagnani P, Castellani G, Moretti E, Dasso MC, Sevini F, Franceschi ZA, Franceschi C, Pettener D, Luiselli D, Giuliani C. Iannuzzi V, et al. Evol Med Public Health. 2023 Oct 16;11(1):397-414. doi: 10.1093/emph/eoad034. eCollection 2023. Evol Med Public Health. 2023. PMID: 37954982 Free PMC article. - Estimating heritability and its enrichment in tissue-specific gene sets in admixed populations.
Luo Y, Li X, Wang X, Gazal S, Mercader JM; 23 and Me Research Team; SIGMA Type 2 Diabetes Consortium; Neale BM, Florez JC, Auton A, Price AL, Finucane HK, Raychaudhuri S. Luo Y, et al. Hum Mol Genet. 2021 Jul 28;30(16):1521-1534. doi: 10.1093/hmg/ddab130. Hum Mol Genet. 2021. PMID: 33987664 Free PMC article. - Genome-wide Association Studies in Ancestrally Diverse Populations: Opportunities, Methods, Pitfalls, and Recommendations.
Peterson RE, Kuchenbaecker K, Walters RK, Chen CY, Popejoy AB, Periyasamy S, Lam M, Iyegbe C, Strawbridge RJ, Brick L, Carey CE, Martin AR, Meyers JL, Su J, Chen J, Edwards AC, Kalungi A, Koen N, Majara L, Schwarz E, Smoller JW, Stahl EA, Sullivan PF, Vassos E, Mowry B, Prieto ML, Cuellar-Barboza A, Bigdeli TB, Edenberg HJ, Huang H, Duncan LE. Peterson RE, et al. Cell. 2019 Oct 17;179(3):589-603. doi: 10.1016/j.cell.2019.08.051. Epub 2019 Oct 10. Cell. 2019. PMID: 31607513 Free PMC article. Review. - High level of inbreeding in final phase of 1000 Genomes Project.
Gazal S, Sahbatou M, Babron MC, Génin E, Leutenegger AL. Gazal S, et al. Sci Rep. 2015 Dec 2;5:17453. doi: 10.1038/srep17453. Sci Rep. 2015. PMID: 26625947 Free PMC article. - Privacy-aware estimation of relatedness in admixed populations.
Wang S, Kim M, Li W, Jiang X, Chen H, Harmanci A. Wang S, et al. Brief Bioinform. 2022 Nov 19;23(6):bbac473. doi: 10.1093/bib/bbac473. Brief Bioinform. 2022. PMID: 36384083 Free PMC article.
References
Publication types
MeSH terms
Grants and funding
- 5K07CA136969/CA/NCI NIH HHS/United States
- N01WH42129-32/WH/WHI NIH HHS/United States
- GM073059/GM/NIGMS NIH HHS/United States
- N01WH32100-2/WH/WHI NIH HHS/United States
- R25 CA112355/CA/NCI NIH HHS/United States
- N01WH32122/WH/WHI NIH HHS/United States
- N01WH32105-6/WH/WHI NIH HHS/United States
- N01WH32111-13/WH/WHI NIH HHS/United States
- K07 CA136969/CA/NCI NIH HHS/United States
- N01WH22110/WH/WHI NIH HHS/United States
- N01WH24152/WH/WHI NIH HHS/United States
- N01WH32108-9/WH/WHI NIH HHS/United States
- R01 GM073059/GM/NIGMS NIH HHS/United States
- N02HL64278/HL/NHLBI NIH HHS/United States
- N01WH42107-26/WH/WHI NIH HHS/United States
- N01WH32118-32119/WH/WHI NIH HHS/United States
- K01 CA148958/CA/NCI NIH HHS/United States
- N01WH32115/WH/WHI NIH HHS/United States
- N01WH44221/WH/WHI NIH HHS/United States
LinkOut - more resources
Full Text Sources
Other Literature Sources