Gene Genealogy and Variance of Interpopulational Nucleotide Differences (original) (raw)

Abstract

A mathematical theory is developed for computing the probability that m genes sampled from one population (species) and n genes sampled from another are derived from l genes that existed at the time of population splitting. The expected time of divergence between the two most closely related genes sampled from two different populations and the time of divergence (coalescence) of all genes sampled are studied by using this theory. It is shown that the time of divergence between the two most closely related genes can be used as an approximate estimate of the time of population splitting (T) only when Tt/(2 N) is small, where t and N are the number of generations and the effective population size, respectively. The variance of Nei and Li's estimate (d) of the number of net nucleotide differences between two populations is also studied. It is shown that the standard error ( sd) of d is larger than the mean when T is small (T << 1). In this case, _sd_ is reduced considerably by increasing sample size. When _T_ is large (_T_ > 1), however, a large proportion of the variance of d is caused by stochastic factors, and increase in the sample size does not help to reduce sd. To reduce the stochastic variance of d, one must use data from many independent unlinked gene loci.

Full Text

The Full Text of this article is available as a PDF (901.5 KB).

Selected References

These references are in PubMed. This may not be the complete list of references from this article.

  1. Bodmer M., Ashburner M. Conservation and change in the DNA sequences coding for alcohol dehydrogenase in sibling species of Drosophila. 1984 May 31-Jun 6Nature. 309(5967):425–430. doi: 10.1038/309425a0. [DOI] [PubMed] [Google Scholar]
  2. Griffiths R. C. Lines of descent in the diffusion approximation of neutral Wright-Fisher models. Theor Popul Biol. 1980 Feb;17(1):37–50. doi: 10.1016/0040-5809(80)90013-1. [DOI] [PubMed] [Google Scholar]
  3. Watterson G. A. On the number of segregating sites in genetical models without recombination. Theor Popul Biol. 1975 Apr;7(2):256–276. doi: 10.1016/0040-5809(75)90020-9. [DOI] [PubMed] [Google Scholar]