Cosi2: an efficient simulator of exact and approximate coalescent with selection
Ilya Shlyakhter et al. Bioinformatics. 2014.
Abstract
Motivation: Efficient simulation of population genetic samples under a given demographic model is a prerequisite for many analyses. Coalescent theory provides an efficient framework for such simulations, but simulating longer regions and higher recombination rates remains challenging. Simulators based on a Markovian approximation to the coalescent scale well, but do not support simulation of selection. Gene conversion is not supported by any published coalescent simulators that support selection.
Results: We describe cosi2, an efficient simulator that supports both exact and approximate coalescent simulation with positive selection. cosi2 improves on the speed of existing exact simulators, and permits further speedup in approximate mode while retaining support for selection. cosi2 supports a wide range of demographic scenarios, including recombination hot spots, gene conversion, population size changes, population structure and migration. cosi2 implements coalescent machinery efficiently by tracking only a small subset of the Ancestral Recombination Graph, sampling only relevant recombination events, and using augmented skip lists to represent tracked genetic segments. To preserve support for selection in approximate mode, the Markov approximation is implemented not by moving along the chromosome but by performing a standard backwards-in-time coalescent simulation while restricting coalescence to node pairs with overlapping or near-overlapping genetic material. We describe the algorithms used by cosi2 and present comparisons with existing selection simulators.
Availability and implementation: A free C++ implementation of cosi2 is available at http://broadinstitute.org/mpg/cosi2.
© The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
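The approximate mode described in the Results paragraph restricts coalescence to node pairs whose genetic material overlaps or nearly overlaps. The Python sketch below illustrates only that restriction and is not cosi2's implementation. It makes several assumptions: each node is a sorted list of disjoint half-open integer intervals in base pairs, `max_gap` is a hypothetical threshold for "near-overlapping", and eligible pairs are found by brute-force enumeration rather than with the augmented skip lists the abstract mentions. Waiting times, recombination, gene conversion, and selection are omitted.

```python
import random
from itertools import combinations


def gap_between(a, b):
    """Smallest distance between any segment of a and any segment of b.

    a and b are sorted lists of disjoint half-open intervals (start, end).
    Returns 0 if any two segments overlap or touch.
    """
    best = float("inf")
    i = j = 0
    while i < len(a) and j < len(b):
        (s1, e1), (s2, e2) = a[i], b[j]
        if s1 < e2 and s2 < e1:      # the two segments overlap
            return 0
        if e1 <= s2:                 # a[i] lies entirely left of b[j]
            best = min(best, s2 - e1)
            i += 1
        else:                        # b[j] lies entirely left of a[i]
            best = min(best, s1 - e2)
            j += 1
    return best


def eligible_pairs(nodes, max_gap):
    """Index pairs whose material overlaps or lies within max_gap (hypothetical threshold)."""
    return [(i, j) for i, j in combinations(range(len(nodes)), 2)
            if gap_between(nodes[i], nodes[j]) <= max_gap]


def merge_segments(a, b):
    """Union of two segment lists, fusing overlapping or touching intervals."""
    merged = []
    for s, e in sorted(a + b):
        if merged and s <= merged[-1][1]:
            merged[-1] = (merged[-1][0], max(merged[-1][1], e))
        else:
            merged.append((s, e))
    return merged


def coalesce_once(nodes, max_gap, rng):
    """Coalesce one uniformly chosen eligible pair; return None if no pair is eligible."""
    pairs = eligible_pairs(nodes, max_gap)
    if not pairs:
        return None
    i, j = rng.choice(pairs)
    ancestor = merge_segments(nodes[i], nodes[j])
    rest = [n for k, n in enumerate(nodes) if k not in (i, j)]
    return rest + [ancestor]
```

Calling `coalesce_once(nodes, max_gap, random.Random(seed))` repeatedly walks backwards in time one coalescence at a time. With `max_gap` large enough that every pair is eligible, the pair choice is unrestricted as in the exact coalescent; smaller values exclude pairs whose material lies far apart along the chromosome.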
Similar articles
- Critical assessment of coalescent simulators in modeling recombination hotspots in genomic sequences.
Yang T, Deng HW, Niu T. BMC Bioinformatics. 2014 Jan 3;15:3. doi: 10.1186/1471-2105-15-3. PMID: 24387001 Free PMC article.
- The Bacterial Sequential Markov Coalescent.
De Maio N, Wilson DJ. Genetics. 2017 May;206(1):333-343. doi: 10.1534/genetics.116.198796. Epub 2017 Mar 3. PMID: 28258183 Free PMC article.
- Markovian approximation to the finite loci coalescent with recombination along multiple sequences.
Hobolth A, Jensen JL. Theor Popul Biol. 2014 Dec;98:48-58. doi: 10.1016/j.tpb.2014.01.002. Epub 2014 Jan 28. PMID: 24486389
- Ancestral Population Genomics.
Dutheil JY, Hobolth A. Methods Mol Biol. 2019;1910:555-589. doi: 10.1007/978-1-4939-9074-0_18. PMID: 31278677 Review.
- An overview of population genetic data simulation.
Yuan X, Miller DJ, Zhang J, Herrington D, Wang Y. J Comput Biol. 2012 Jan;19(1):42-54. doi: 10.1089/cmb.2010.0188. Epub 2011 Dec 9. PMID: 22149682 Free PMC article. Review.
Cited by
- RETROSPECTIVE VARYING COEFFICIENT ASSOCIATION ANALYSIS OF LONGITUDINAL BINARY TRAITS: APPLICATION TO THE IDENTIFICATION OF GENETIC LOCI ASSOCIATED WITH HYPERTENSION.
Xu G, Amei A, Wu W, Liu Y, Shen L, Oh EC, Wang Z. Ann Appl Stat. 2024 Mar;18(1):487-505. doi: 10.1214/23-aoas1798. Epub 2024 Jan 31. PMID: 38577266 Free PMC article.
- Population genetic simulation: Benchmarking frameworks for non-standard models of natural selection.
Johnson OL, Tobler R, Schmidt JM, Huber CD. Mol Ecol Resour. 2024 Apr;24(3):e13930. doi: 10.1111/1755-0998.13930. Epub 2024 Jan 21. PMID: 38247258
- Excalibur: A new ensemble method based on an optimal combination of aggregation tests for rare-variant association testing for sequencing data.
Boutry S, Helaers R, Lenaerts T, Vikkula M. PLoS Comput Biol. 2023 Sep 14;19(9):e1011488. doi: 10.1371/journal.pcbi.1011488. eCollection 2023 Sep. PMID: 37708232 Free PMC article.
- Ultrafast genome-wide inference of pairwise coalescence times.
Schweiger R, Durbin R. Genome Res. 2023 Jul;33(7):1023-1031. doi: 10.1101/gr.277665.123. Epub 2023 Aug 10. PMID: 37562965 Free PMC article.
- Kernel-based genetic association analysis for microbiome phenotypes identifies host genetic drivers of beta-diversity.
Liu H, Ling W, Hua X, Moon JY, Williams-Nguyen JS, Zhan X, Plantinga AM, Zhao N, Zhang A, Knight R, Qi Q, Burk RD, Kaplan RC, Wu MC. Microbiome. 2023 Apr 20;11(1):80. doi: 10.1186/s40168-023-01530-0. PMID: 37081571 Free PMC article.