Inferring population history with DIY ABC: a user-friendly approach to approximate Bayesian computation - PubMed (original) (raw)

Inferring population history with DIY ABC: a user-friendly approach to approximate Bayesian computation

Jean-Marie Cornuet et al. Bioinformatics. 2008.

Abstract

Genetic data obtained on population samples convey information about their evolutionary history. Inference methods can extract part of this information but they require sophisticated statistical techniques that have been made available to the biologist community (through computer programs) only for simple and standard situations typically involving a small number of samples. We propose here a computer program (DIY ABC) for inference based on approximate Bayesian computation (ABC), in which scenarios can be customized by the user to fit many complex situations involving any number of populations and samples. Such scenarios involve any combination of population divergences, admixtures and population size changes. DIY ABC can be used to compare competing scenarios, estimate parameters for one or more scenarios and compute bias and precision measures for a given scenario and known values of parameters (the current version applies to unlinked microsatellite data). This article describes key methods used in the program and provides its main features. The analysis of one simulated and one real dataset, both with complex evolutionary scenarios, illustrates the main possibilities of DIY ABC.

Availability: The software DIY ABC is freely available at http://www.montpellier.inra.fr/CBGP/diyabc.

PubMed Disclaimer

Figures

Fig. 1.

Fig. 1.

First example: the three evolutionary scenarios. The dataset used as an example has been simulated according to scenario 1 (left). The parameter values were the following: all populations had an effective (diploid) size of 1000, the times of successive events (backward in time) were _t_1 = 10, _t_2 = 500, _t_3 = 10 000, _t_4 = 20 000 and _t_5 = 200 000, the two admixture rates were _r_1 = 0.6 and _r_2 = 0.4. Scenario 1 includes six populations, the four that have been sampled and two left parental populations in the admixture events. Scenario 2 and 3 include 5 and 4 populations, respectively. Samples 3 and 4 have been collected 2 and 4 generations earlier than the first two samples, hence their slightly upward locations on the graphs. Time is not at scale.

Fig. 2.

Fig. 2.

Second example: screenshot of the scenario used in the analysis of the Z. lateralis lateralis dataset. In 1830, Z. l. lateralis colonized the South Island of New Zealand (Pop 2) from Tasmania (Pop 1). In the following years, the population began expanding and dispersing, and reached the North Island by 1856 (Pop 3). Chatham Island (Pop 4) was colonized in 1856 from the South Island, and Norfolk Island (Pop 5) was colonized in 1904 from the North Island (historical information reviewed in Estoup and Clegg 2003). Sample collection times are 1997 for Tasmania (Sa 1), South and North Island of New Zealand (Sa 2 and Sa 3, respectively), Chatham Island (Sa 4) and Norfolk Island (Sa 5), 1994 for the second sample from Norfolk (Sa 6), and 1992 for the second sample from the North Island of New Zealand (Sa 7). Splitting events and sampling dates in years were translated in number of generations since the most recent sampling date by assuming a generation time of 3 years (Estoup and Clegg, 2003). We hence fixed t1, t2, t3 and t4 to 31, 47, 47 and 56 generations, respectively.

Fig. 3.

Fig. 3.

Second example: posterior distributions of the bottleneck severity (see definition in text) for the invasions of four Pacific Islands by Z.lateralis lateralis. The four discontinuous lines with small dashes, dots, dash-dots and long dashes correspond to Norfolk, Chatham, North Island and South Island of New Zealand, respectively. The continuous line corresponds to the prior distribution, which is identical for each island. This graph has been made with the locfit function of the R statistical package (Ihaka and Gentleman, 1996), using an option of DIYABC which saves the sample of the parameter values adjusted by the local linear regression (Beaumont et al., 2002).

Similar articles

Cited by

References

    1. Beaumont MA. Detecting population expansion and decline using microsatellites. Genetics. 1999;153:2013–2029. - PMC - PubMed
    1. Beaumont MA. Estimation of population growth or decline in genetically monitored populations. Genetics. 2003;164:1139–1160. - PMC - PubMed
    1. Beaumont MA. Joint determination of topology, divergence time and immigration in population trees. In: Matsumura S, Forster P, Renfrew C, editors. Simulations, Genetics and Human Prehistory. Cambridge: (McDonald Institute Monographs), McDonald Institute for Archaeological Research; 2008. pp. 134–154.
    1. Beaumont MA, et al. Approximate Bayesian computation in Population Genetics. Genetics. 2002;162:2025–2035. - PMC - PubMed
    1. Bertorelle G, Excoffier L. Inferring admixture proportion from molecular data. Mol. Biol. Evol. 1998;15:1298–1311. - PubMed

Publication types

MeSH terms

LinkOut - more resources