Accounting for bias from sequencing error in population genetic estimates - PubMed (original) (raw)
. 2008 Jan;25(1):199-206.
doi: 10.1093/molbev/msm239. Epub 2007 Nov 2.
Affiliations
- PMID: 17981928
- DOI: 10.1093/molbev/msm239
Accounting for bias from sequencing error in population genetic estimates
Philip L F Johnson et al. Mol Biol Evol. 2008 Jan.
Abstract
Sequencing error presents a significant challenge to population genetic analyses using low-coverage sequence in general and single-pass reads in particular. Bias in parameter estimates becomes severe when the level of polymorphism (signal) is low relative to the amount of error (noise). Choosing an arbitrary quality score cutoff yields biased estimates, particularly with newer, non-Sanger sequencing technologies that have different quality score distributions. We propose a rule of thumb to judge when a given threshold will lead to significant bias and suggest alternative approaches that reduce bias.
Similar articles
- How to infer reliable diploid genotypes from NGS or traditional sequence data: from basic probability to experimental optimization.
Chenuil A. Chenuil A. J Evol Biol. 2012 May;25(5):949-60. doi: 10.1111/j.1420-9101.2012.02488.x. Epub 2012 Mar 16. J Evol Biol. 2012. PMID: 22420488 - To what extent do microsatellite markers reflect genome-wide genetic diversity in natural populations?
Väli U, Einarsson A, Waits L, Ellegren H. Väli U, et al. Mol Ecol. 2008 Sep;17(17):3808-17. doi: 10.1111/j.1365-294X.2008.03876.x. Epub 2008 Jul 18. Mol Ecol. 2008. PMID: 18647238 - Impact and quantification of the sources of error in DNA pooling designs.
Jawaid A, Sham P. Jawaid A, et al. Ann Hum Genet. 2009 Jan;73(1):118-24. doi: 10.1111/j.1469-1809.2008.00486.x. Epub 2008 Oct 15. Ann Hum Genet. 2009. PMID: 18945289 - Genome sequence data: management, storage, and visualization.
Batley J, Edwards D. Batley J, et al. Biotechniques. 2009 Apr;46(5):333-4, 336. doi: 10.2144/000113134. Biotechniques. 2009. PMID: 19480628 Review. - Phylogenetic understanding of clonal populations in an era of whole genome sequencing.
Pearson T, Okinaka RT, Foster JT, Keim P. Pearson T, et al. Infect Genet Evol. 2009 Sep;9(5):1010-9. doi: 10.1016/j.meegid.2009.05.014. Epub 2009 May 27. Infect Genet Evol. 2009. PMID: 19477301 Review.
Cited by
- Empirical validation of pooled whole genome population re-sequencing in Drosophila melanogaster.
Zhu Y, Bergland AO, González J, Petrov DA. Zhu Y, et al. PLoS One. 2012;7(7):e41901. doi: 10.1371/journal.pone.0041901. Epub 2012 Jul 26. PLoS One. 2012. PMID: 22848651 Free PMC article. - jPopGen Suite: population genetic analysis of DNA polymorphism from nucleotide sequences with errors.
Liu X. Liu X. Methods Ecol Evol. 2012 Aug 1;3(4):624-627. doi: 10.1111/j.2041-210X.2012.00194.x. Epub 2012 Mar 2. Methods Ecol Evol. 2012. PMID: 22905315 Free PMC article. - Error and error mitigation in low-coverage genome assemblies.
Hubisz MJ, Lin MF, Kellis M, Siepel A. Hubisz MJ, et al. PLoS One. 2011 Feb 14;6(2):e17034. doi: 10.1371/journal.pone.0017034. PLoS One. 2011. PMID: 21340033 Free PMC article. - Association studies for next-generation sequencing.
Luo L, Boerwinkle E, Xiong M. Luo L, et al. Genome Res. 2011 Jul;21(7):1099-108. doi: 10.1101/gr.115998.110. Epub 2011 Apr 26. Genome Res. 2011. PMID: 21521787 Free PMC article. - Ancient structure in Africa unlikely to explain Neanderthal and non-African genetic similarity.
Yang MA, Malaspinas AS, Durand EY, Slatkin M. Yang MA, et al. Mol Biol Evol. 2012 Oct;29(10):2987-95. doi: 10.1093/molbev/mss117. Epub 2012 Apr 18. Mol Biol Evol. 2012. PMID: 22513287 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources