PoPoolation: a toolbox for population genetic analysis of next generation sequencing data from pooled individuals
Robert Kofler et al. PLoS One. 2011.
Abstract
Recent statistical analyses suggest that sequencing of pooled samples provides a cost-effective approach to determining genome-wide population genetic parameters. Here we introduce PoPoolation, a toolbox specifically designed for the population genetic analysis of sequence data from pooled individuals. PoPoolation calculates estimates of θ(Watterson), θ(π), and Tajima's D that account for the bias introduced by pooling and sequencing errors, and it also calculates divergence between species. Results of genome-wide analyses can be displayed graphically in a sliding window plot. PoPoolation is written in Perl and R and builds on commonly used data formats. Its source code can be downloaded from http://code.google.com/p/popoolation/. Furthermore, we evaluate the influence of mapping algorithms, sequencing errors, and read coverage on the accuracy of population genetic parameter estimates from pooled data.
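For orientation, the three statistics named in the abstract can be sketched in their classical textbook form, which assumes n individually sequenced chromosomes. This is not PoPoolation's code and does not include the corrections for pooling and sequencing errors that the toolbox applies. The function names and the input format (one minor-allele count per segregating site) are assumptions made for illustration.

```python
from math import sqrt


def theta_pi(minor_counts, n):
    """Classical nucleotide diversity: sum of per-site heterozygosity.

    minor_counts: minor-allele count k at each segregating site (0 < k < n).
    n: number of sampled chromosomes (assumed identical at every site).
    """
    return sum(2.0 * k * (n - k) / (n * (n - 1)) for k in minor_counts)


def theta_watterson(num_segregating, n):
    """Classical Watterson estimator: S divided by the harmonic number a1."""
    a1 = sum(1.0 / i for i in range(1, n))
    return num_segregating / a1


def tajimas_d(minor_counts, n):
    """Classical Tajima's D (Tajima 1989), uncorrected for pooling."""
    s = len(minor_counts)
    if s == 0:
        return float("nan")  # D is undefined without segregating sites
    a1 = sum(1.0 / i for i in range(1, n))
    a2 = sum(1.0 / i ** 2 for i in range(1, n))
    b1 = (n + 1) / (3.0 * (n - 1))
    b2 = 2.0 * (n * n + n + 3) / (9.0 * n * (n - 1))
    c1 = b1 - 1.0 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1 ** 2
    e1 = c1 / a1
    e2 = c2 / (a1 ** 2 + a2)
    variance = e1 * s + e2 * s * (s - 1)
    return (theta_pi(minor_counts, n) - theta_watterson(s, n)) / sqrt(variance)


if __name__ == "__main__":
    counts = [1, 2]  # hypothetical minor-allele counts at two sites
    print(theta_pi(counts, 4), theta_watterson(len(counts), 4), tajimas_d(counts, 4))
```

In pooled data the number of chromosomes actually sampled at a site is unknown and read coverage varies between sites, which is why estimators of the kind described in the abstract are needed in place of these uncorrected forms.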
Conflict of interest statement
Competing Interests: The authors have declared that no competing interests exist.
Figures
Figure 1. Outline of a population genetic analysis from pooled sequence data.
Sequencer figure from
Figure 2. Graphical output of polymorphism and divergence estimates using PoPoolation.
Sliding window analysis of θ(π) of a Portuguese D. melanogaster population on chromosome 3R (black line). The red line shows divergence (dxy) between D. melanogaster and D. simulans using the same window size and step size as for θ(π). Note that dxy is scaled by 1/10. Both lines are based on non-overlapping windows of 50 kb.
Figure 3. Sequencing errors in relation to coverage, minor allele count, and sequence quality.
PhiX sequences (74 bp) generated with an Illumina GAIIx sequencer were analyzed for sequencing error rate (number of mutated bases after quality filtering). The gray bar indicates the presence of a polymorphic site in the PhiX sequence, which results in a minimum sequencing error rate.
Figure 4. Improvement of the alignment for diverged regions using the PE-SW remap algorithm.
IGV screenshot of the mapping of pooled sequence reads in a highly divergent region of D. melanogaster. The upper panel shows an alignment of the PE reads without the PE-SW remap and the lower panel shows the same region with the PE-SW remap.
Figure 5. The influence of coverage and window size on the accuracy of the estimated θ(π).
The accuracy was measured as the mean standardized difference between θ(π) estimated for a given window size and its expectation.
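The windowing described for Figure 2 (equal window size and step size, giving non-overlapping 50 kb windows) can be illustrated with a minimal sketch. It is not taken from PoPoolation. The function name, the per-site input format, and the choice to divide each window total by the full window length are assumptions made for illustration; a real analysis would more plausibly divide by the number of sufficiently covered sites in the window.

```python
from bisect import bisect_left


def sliding_window_means(positions, values, window_size=50_000, step_size=50_000):
    """Average per-site values over windows along one chromosome.

    positions: sorted 1-based site coordinates.
    values: per-site statistic (for example a per-site diversity contribution).
    Returns a list of (start, end, mean) tuples; start and end are inclusive.
    The mean divides by window_size, an assumption that every site is covered.
    """
    if not positions:
        return []
    # Prefix sums let each window total be read off in O(log n).
    prefix = [0.0]
    for v in values:
        prefix.append(prefix[-1] + v)

    windows = []
    start = 1
    last = positions[-1]
    while start <= last:
        end = start + window_size - 1
        lo = bisect_left(positions, start)
        hi = bisect_left(positions, end + 1)
        total = prefix[hi] - prefix[lo]
        windows.append((start, end, total / window_size))
        start += step_size
    return windows


if __name__ == "__main__":
    # Hypothetical per-site values at three positions.
    for window in sliding_window_means([10, 20, 60_000], [1.0, 2.0, 5.0]):
        print(window)
```

Setting step_size smaller than window_size yields overlapping windows; the default values reproduce the non-overlapping layout stated in the caption.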
Similar articles
- SNP calling by sequencing pooled samples.
Raineri E, Ferretti L, Esteve-Codina A, Nevado B, Heath S, Pérez-Enciso M. BMC Bioinformatics. 2012 Sep 20;13:239. doi: 10.1186/1471-2105-13-239. PMID: 22992255. Free PMC article.
- The next generation of molecular markers from massively parallel sequencing of pooled DNA samples.
Futschik A, Schlötterer C. Genetics. 2010 Sep;186(1):207-18. doi: 10.1534/genetics.110.114397. Epub 2010 May 10. PMID: 20457880. Free PMC article.
- Population genomics from pool sequencing.
Ferretti L, Ramos-Onsins SE, Pérez-Enciso M. Mol Ecol. 2013 Nov;22(22):5561-76. doi: 10.1111/mec.12522. Epub 2013 Oct 28. PMID: 24102736.
- Model-based quality assessment and base-calling for second-generation sequencing data.
Bravo HC, Irizarry RA. Biometrics. 2010 Sep;66(3):665-74. doi: 10.1111/j.1541-0420.2009.01353.x. PMID: 19912177. Free PMC article. Review.
- Statistical challenges associated with detecting copy number variations with next-generation sequencing.
Teo SM, Pawitan Y, Ku CS, Chia KS, Salim A. Bioinformatics. 2012 Nov 1;28(21):2711-8. doi: 10.1093/bioinformatics/bts535. Epub 2012 Aug 31. PMID: 22942022. Review.
Cited by
- Genetic Variation in Jamaican Populations of the Coffee Berry Borer, Hypothenemus hampei.
Errbii M, Myrie A, Robinson D, Schultner E, Schrader L, Oettler J. Genome Biol Evol. 2024 Nov 1;16(11):evae217. doi: 10.1093/gbe/evae217. PMID: 39486017. Free PMC article.
- Transcriptome sequencing of Eucalyptus camaldulensis seedlings subjected to water stress reveals functional single nucleotide polymorphisms and genes under selection.
Thumma BR, Sharma N, Southerton SG. BMC Genomics. 2012 Aug 1;13:364. doi: 10.1186/1471-2164-13-364. PMID: 22853646. Free PMC article.
- Estimating the information value of polymorphic sites using pooled sequences.
Malde K. BMC Genomics. 2014;15 Suppl 6(Suppl 6):S20. doi: 10.1186/1471-2164-15-S6-S20. Epub 2014 Oct 17. PMID: 25571927. Free PMC article.
- Genetic linkage of distinct adaptive traits in sympatrically speciating crater lake cichlid fish.
Fruciano C, Franchini P, Kovacova V, Elmer KR, Henning F, Meyer A. Nat Commun. 2016 Sep 6;7:12736. doi: 10.1038/ncomms12736. PMID: 27597183. Free PMC article.
- Parental care shapes the evolution of molecular genetic variation.
Mashoodh R, Trowsdale AT, Manica A, Kilner RM. Evol Lett. 2023 Sep 5;7(6):379-388. doi: 10.1093/evlett/qrad039. eCollection 2023 Dec. PMID: 38045719. Free PMC article.