Genetics in geographically structured populations: defining, estimating and interpreting F(ST) - PubMed (original) (raw)
Review
Genetics in geographically structured populations: defining, estimating and interpreting F(ST)
Kent E Holsinger et al. Nat Rev Genet. 2009 Sep.
Abstract
Wright's F-statistics, and especially F(ST), provide important insights into the evolutionary processes that influence the structure of genetic variation within and among populations, and they are among the most widely used descriptive statistics in population and evolutionary genetics. Estimates of F(ST) can identify regions of the genome that have been the target of selection, and comparisons of F(ST) from different parts of the genome can provide insights into the demographic history of populations. For these reasons and others, F(ST) has a central role in population and evolutionary genetics and has wide applications in fields that range from disease association mapping to forensic science. This Review clarifies how F(ST) is defined, how it should be estimated, how it is related to similar statistics and how estimates of F(ST) should be interpreted.
Figures
Figure 1. Locus-specific estimates of _F_ST on human chromosome 7
Estimates are as inferred from the phase II HapMap data set. Horizontal bars indicate the locations of known genes. The red circles are posterior means for SNPs with estimates that are detectably different from the genomic background (purple circles). All ‘outliers’ show significantly more differentiation among the four populations in the sample than is consistent with the level of differentiation seen in the genomic background. The excess differentiation suggests that these SNPs are associated with genomic regions in which loci have been subject to diversifying selection among populations. CALU, calumenin; FSCN3, ascin homolog 3; GCC1, GRIP and coiled-coil domain containing 1; GRM8, glutamate receptor, metabotropic 8; LEP, leptin; SND1, staphylococcal nuclease and tudor domain containing 1. Figure is modified, with permission, from REF. © (2009) American Statistical Association.
Similar articles
- Assessing population structure: F(ST) and related measures.
Meirmans PG, Hedrick PW. Meirmans PG, et al. Mol Ecol Resour. 2011 Jan;11(1):5-18. doi: 10.1111/j.1755-0998.2010.02927.x. Epub 2010 Oct 26. Mol Ecol Resour. 2011. PMID: 21429096 Review. - Empirical Bayes inference of pairwise F(ST) and its distribution in the genome.
Kitada S, Kitakado T, Kishino H. Kitada S, et al. Genetics. 2007 Oct;177(2):861-73. doi: 10.1534/genetics.107.077263. Epub 2007 Jul 29. Genetics. 2007. PMID: 17660541 Free PMC article. - Differential confounding of rare and common variants in spatially structured populations.
Mathieson I, McVean G. Mathieson I, et al. Nat Genet. 2012 Feb 5;44(3):243-6. doi: 10.1038/ng.1074. Nat Genet. 2012. PMID: 22306651 Free PMC article. - Q(ST)-F(ST) comparisons: evolutionary and ecological insights from genomic heterogeneity.
Leinonen T, McCairns RJ, O'Hara RB, Merilä J. Leinonen T, et al. Nat Rev Genet. 2013 Mar;14(3):179-90. doi: 10.1038/nrg3395. Epub 2013 Feb 5. Nat Rev Genet. 2013. PMID: 23381120 Review.
Cited by
- Deciphering the role of FOXP4 in long COVID: exploring genetic associations, evolutionary conservation, and drug identification through bioinformatics analysis.
Gupta MK, Gouda G, Vadde R. Gupta MK, et al. Funct Integr Genomics. 2024 Sep 19;24(5):167. doi: 10.1007/s10142-024-01451-7. Funct Integr Genomics. 2024. PMID: 39298002 - Federated unsupervised random forest for privacy-preserving patient stratification.
Pfeifer B, Sirocchi C, Bloice MD, Kreuzthaler M, Urschler M. Pfeifer B, et al. Bioinformatics. 2024 Sep 1;40(Suppl 2):ii198-ii207. doi: 10.1093/bioinformatics/btae382. Bioinformatics. 2024. PMID: 39230698 Free PMC article. - Multiple ancestries and shared gene flow among modern livestock guarding dogs.
Coutinho-Lima D, Dreger DL, Doadrio I, Parker HG, Ghanavi HR, Frantz L, Larson G, Ostrander EA, Godinho R. Coutinho-Lima D, et al. iScience. 2024 Jun 28;27(8):110396. doi: 10.1016/j.isci.2024.110396. eCollection 2024 Aug 16. iScience. 2024. PMID: 39156647 Free PMC article. - Causal effects of omega-6 and LDL-C on androgenetic alopecia: A Mendelian randomization study.
Peilong L, Quanlin Z, Shuqing G. Peilong L, et al. Skin Res Technol. 2024 Aug;30(8):e70000. doi: 10.1111/srt.70000. Skin Res Technol. 2024. PMID: 39138832 Free PMC article. - Genetic insights of blood lipid metabolites on polycystic ovary syndrome risk: a bidirectional two-sample Mendelian randomization study.
Wang X, Han H, Shi X, Nie X, Zhu R, Jin J, Zhou H. Wang X, et al. Front Endocrinol (Lausanne). 2024 Jul 9;15:1391826. doi: 10.3389/fendo.2024.1391826. eCollection 2024. Front Endocrinol (Lausanne). 2024. PMID: 39045272 Free PMC article.
References
- Rosenberg NA, et al. Genetic structure of human populations. Science. 2002;298:2381–2385. - PubMed
- Li JZ, et al. Worldwide human relationships inferred from genome-wide patterns of variation. Science. 2008;319:1100–1104. - PubMed
- Wright S. The genetical structure of populations. Ann. Eugen. 1951;15:323–354. This paper develops the explicit framework for the analysis and interpretation of F-statistics in an evolutionary context.
- Malecot G. Les Mathématiques de l’Hérédié. Masson, Paris: 1948. This book develops a framework — equivalent to Wright’s F-statistics — for the analysis of genetic diversity in hierarchically structured populations.
- Wright S. Evolution in Mendelian populations. Genetics. 1931;16:97–159. A landmark paper in population genetics in which the effect of population size, mutation and migration on the abundance and distribution of genetic variation in populations is first quantitatively described.
Publication types
MeSH terms
Grants and funding
- R01 GM068449/GM/NIGMS NIH HHS/United States
- R01 GM075091/GM/NIGMS NIH HHS/United States
- 1 R01 GM 068449-01A1/GM/NIGMS NIH HHS/United States
- 1 R01 GM 075091/GM/NIGMS NIH HHS/United States
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous