Gene genealogies when the sample size exceeds the effective size of the population
John Wakeley et al. Mol Biol Evol. 2003 Feb.
Abstract
We study the properties of gene genealogies for large samples using a continuous approximation introduced by R. A. Fisher. We show that the major effect of large sample size, relative to the effective size of the population, is to increase the proportion of polymorphisms at which the mutant type is found in a single copy in the sample. We derive analytical expressions for the expected number of these singleton polymorphisms and for the total number of polymorphic, or segregating, sites that are valid even when the sample size is much greater than the effective size of the population. We use simulations to assess the accuracy of these predictions and to investigate other aspects of large-sample genealogies. Lastly, we apply our results to some data from Pacific oysters sampled from British Columbia. This illustrates that, when large samples are available, it is possible to estimate the mutation rate and the effective population size separately, in contrast to the case of small samples in which only the product of the mutation rate and the effective population size can be estimated.
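The contrast the abstract draws between small and large samples can be explored with a toy simulation. The sketch below is not the authors' method (they work with Fisher's continuous approximation, which is not reproduced here). It assumes a plain haploid Wright-Fisher population and traces sampled lineages back generation by generation, recording total branch length and external branch length, the part of the genealogy on which a mutation would appear as a singleton. For comparison it computes the standard-coalescent expectation, under which the expected number of segregating sites is theta * a_n with a_n = 1 + 1/2 + ... + 1/(n-1), the expected number of mutant singletons is theta, and the expected singleton fraction is therefore roughly 1/a_n. The function names, parameter values, and the restriction of the sample size to at most the population size are illustrative assumptions, not taken from the paper.

```python
import random


def harmonic(n):
    """a_n = sum_{i=1}^{n-1} 1/i, the factor in E[S] = theta * a_n."""
    return sum(1.0 / i for i in range(1, n))


def simulate_wf_branch_lengths(n, pop_size, rng):
    """Trace n sampled lineages back through a haploid Wright-Fisher population.

    Hypothetical helper for illustration. Returns (total_length,
    external_length), both measured in generations.
    """
    if not 2 <= n <= pop_size:
        raise ValueError("this toy model needs 2 <= n <= pop_size")
    # Each entry is the number of sampled copies descending from that lineage.
    lineages = [1] * n
    total = 0
    external = 0
    while len(lineages) > 1:
        # Every current lineage persists for one more generation.
        total += len(lineages)
        # A lineage with a single sampled descendant is an external branch.
        external += sum(1 for size in lineages if size == 1)
        # Each lineage picks a parent uniformly at random;
        # lineages that pick the same parent merge.
        parents = {}
        for size in lineages:
            parent = rng.randrange(pop_size)
            parents[parent] = parents.get(parent, 0) + size
        lineages = list(parents.values())
    return total, external


# Usage: compare the simulated external fraction with the coalescent value.
rng = random.Random(1)
n, pop_size, reps = 50, 50, 200  # assumed values, chosen only for speed
sum_total = 0
sum_external = 0
for _ in range(reps):
    t, e = simulate_wf_branch_lengths(n, pop_size, rng)
    sum_total += t
    sum_external += e
print("simulated external fraction:", sum_external / sum_total)
print("standard coalescent 1/a_n:  ", 1.0 / harmonic(n))
```

With a low mutation rate per lineage per generation, the expected number of singletons is proportional to the external length and the expected number of segregating sites to the total length, so the ratio of the two summed lengths approximates the expected singleton fraction. Varying `n` relative to `pop_size` shows how far the discrete-generation model drifts from the coalescent value. The mutation rate cancels in that ratio, so this sketch does not attempt the separate estimation of mutation rate and effective size described in the abstract.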
Similar articles
- Estimation of parameters in large offspring number models and ratios of coalescence times. Eldon B. Theor Popul Biol. 2011 Aug;80(1):16-28. doi: 10.1016/j.tpb.2011.04.002. Epub 2011 May 5. PMID: 21570995
- Single and simultaneous binary mergers in Wright-Fisher genealogies. Melfi A, Viswanath D. Theor Popul Biol. 2018 May;121:60-71. doi: 10.1016/j.tpb.2018.04.001. Epub 2018 Apr 12. PMID: 29655651
- A method for accurate inference of population size from serially sampled genealogies distorted by selection. O'Fallon BD. Mol Biol Evol. 2011 Nov;28(11):3171-81. doi: 10.1093/molbev/msr153. Epub 2011 Jun 16. PMID: 21680870. Free PMC article.
- Distortions in genealogies due to purifying selection. Nicolaisen LE, Desai MM. Mol Biol Evol. 2012 Nov;29(11):3589-600. doi: 10.1093/molbev/mss170. Epub 2012 Jun 22. PMID: 22729750
Cited by
- Scaling the discrete-time Wright-Fisher model to biobank-scale datasets. Spence JP, Zeng T, Mostafavi H, Pritchard JK. Genetics. 2023 Nov 1;225(3):iyad168. doi: 10.1093/genetics/iyad168. PMID: 37724741. Free PMC article.
- Scaling the Discrete-time Wright Fisher model to biobank-scale datasets. Spence JP, Zeng T, Mostafavi H, Pritchard JK. bioRxiv [Preprint]. 2023 May 22:2023.05.19.541517. doi: 10.1101/2023.05.19.541517. PMID: 37293115. Free PMC article. Updated. Preprint.
- Multiple Merger Genealogies in Outbreaks of Mycobacterium tuberculosis. Menardo F, Gagneux S, Freund F. Mol Biol Evol. 2021 Jan 4;38(1):290-306. doi: 10.1093/molbev/msaa179. PMID: 32667991. Free PMC article.
- Space is the Place: Effects of Continuous Spatial Structure on Analysis of Population Genetic Data. Battey CJ, Ralph PL, Kern AD. Genetics. 2020 May;215(1):193-214. doi: 10.1534/genetics.120.303143. Epub 2020 Mar 24. PMID: 32209569. Free PMC article.
- Inferring Demography and Selection in Organisms Characterized by Skewed Offspring Distributions. Sackman AM, Harris RB, Jensen JD. Genetics. 2019 Mar;211(3):1019-1028. doi: 10.1534/genetics.118.301684. Epub 2019 Jan 16. PMID: 30651284. Free PMC article.