2SNP: scalable phasing based on 2-SNP haplotypes
Dumitru Brinza et al. Bioinformatics. 2006.
Abstract
The 2SNP software package implements a new, very fast, scalable algorithm for haplotype inference based on genotype statistics collected only for pairs of SNPs. The software can be used for comparatively accurate phasing of large numbers of long genome sequences, e.g. those obtained from DNA arrays. 2SNP takes a genotype matrix as input and outputs the corresponding haplotype matrix. On datasets across 79 regions from HapMap, 2SNP is several orders of magnitude faster than GERBIL and PHASE while matching them in quality, as measured by the number of correctly phased genotypes and by single-site and switching errors. For example, 2SNP requires 41 s on a 2 GHz Pentium 4 processor to phase 30 genotypes with 1381 SNPs (ENm010.7p15:2 data from HapMap), whereas GERBIL and PHASE require more than a week and make no fewer errors than 2SNP.
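The two ideas named in the abstract, pairwise genotype statistics and switching errors, can be illustrated with a minimal Python sketch. It is not the published 2SNP algorithm. It assumes a genotype encoding of 0 and 1 for the two homozygous states and 2 for a heterozygous site, and the function names `pair_haplotype_counts` and `switch_errors` are hypothetical. The first function only tallies the 2-SNP haplotypes that are unambiguous from the genotypes; the second counts phase switches between consecutive heterozygous sites.

```python
from collections import Counter
from itertools import combinations

HET = 2  # assumed encoding: 0/1 = homozygous alleles, 2 = heterozygous


def pair_haplotype_counts(genotypes):
    """Tally 2-SNP haplotypes that are unambiguous from genotypes alone.

    For a pair of sites (i, j) the two haplotypes are determined whenever
    at most one of the sites is heterozygous. Double heterozygotes are
    ambiguous and are skipped in this sketch.
    """
    counts = {}
    n_sites = len(genotypes[0])
    for i, j in combinations(range(n_sites), 2):
        tally = Counter()
        for g in genotypes:
            gi, gj = g[i], g[j]
            if gi == HET and gj == HET:
                continue  # phase cannot be read off directly
            alleles_i = (0, 1) if gi == HET else (gi, gi)
            alleles_j = (0, 1) if gj == HET else (gj, gj)
            tally[(alleles_i[0], alleles_j[0])] += 1
            tally[(alleles_i[1], alleles_j[1])] += 1
        counts[(i, j)] = tally
    return counts


def switch_errors(genotype, inferred, truth):
    """Count phase switches between consecutive heterozygous sites.

    `inferred` and `truth` are pairs of haplotypes (sequences of 0/1)
    that are both consistent with `genotype`.
    """
    het_sites = [k for k, g in enumerate(genotype) if g == HET]
    # True where the first inferred haplotype matches the first true one.
    agree = [inferred[0][k] == truth[0][k] for k in het_sites]
    return sum(a != b for a, b in zip(agree, agree[1:]))


# Toy genotype matrix: 3 individuals (rows) by 3 SNPs (columns).
genotypes = [
    [0, 2, 1],
    [2, 2, 0],
    [1, 0, 2],
]
counts = pair_haplotype_counts(genotypes)

# Toy phasing comparison for a single individual with 4 SNPs.
genotype = [2, 2, 0, 2]
truth = ([0, 0, 0, 0], [1, 1, 0, 1])
inferred = ([0, 1, 0, 1], [1, 0, 0, 0])
n_switches = switch_errors(genotype, inferred, truth)
```

Counting all site pairs is quadratic in the number of SNPs, which is acceptable for a toy example. How 2SNP itself resolves double heterozygotes and combines the pairwise statistics into full haplotypes is not described in the abstract and is deliberately left out here.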
Similar articles
- 2SNP: scalable phasing method for trios and unrelated individuals.
Brinza D, Zelikovsky A. IEEE/ACM Trans Comput Biol Bioinform. 2008 Apr-Jun;5(2):313-8. doi: 10.1109/TCBB.2007.1068. PMID: 18451440
- Dynamic model based algorithms for screening and genotyping over 100 K SNPs on oligonucleotide microarrays.
Di X, Matsuzaki H, Webster TA, Hubbell E, Liu G, Dong S, Bartell D, Huang J, Chiles R, Yang G, Shen MM, Kulp D, Kennedy GC, Mei R, Jones KW, Cawley S. Bioinformatics. 2005 May 1;21(9):1958-63. doi: 10.1093/bioinformatics/bti275. Epub 2005 Jan 18. PMID: 15657097
- Genotyping over 100,000 SNPs on a pair of oligonucleotide arrays.
Matsuzaki H, Dong S, Loi H, Di X, Liu G, Hubbell E, Law J, Berntsen T, Chadha M, Hui H, Yang G, Kennedy GC, Webster TA, Cawley S, Walsh PS, Jones KW, Fodor SP, Mei R. Nat Methods. 2004 Nov;1(2):109-11. doi: 10.1038/nmeth718. PMID: 15782172
- Genome resequencing and genetic variation.
Stratton M. Nat Biotechnol. 2008 Jan;26(1):65-6. doi: 10.1038/nbt0108-65. PMID: 18183021 Review. No abstract available.
- Navigating the HapMap.
Barnes MR. Brief Bioinform. 2006 Sep;7(3):211-24. doi: 10.1093/bib/bbl021. Epub 2006 Jul 28. PMID: 16877472 Review.
Cited by
- Inferring viral quasispecies spectra from 454 pyrosequencing reads.
Astrovskaya I, Tork B, Mangul S, Westbrooks K, Măndoiu I, Balfe P, Zelikovsky A. BMC Bioinformatics. 2011;12 Suppl 6(Suppl 6):S1. doi: 10.1186/1471-2105-12-S6-S1. Epub 2011 Jul 28. PMID: 21989211 Free PMC article.
- Shape-IT: new rapid and accurate algorithm for haplotype inference.
Delaneau O, Coulonges C, Zagury JF. BMC Bioinformatics. 2008 Dec 16;9:540. doi: 10.1186/1471-2105-9-540. PMID: 19087329 Free PMC article.
- Accelerating haplotype-based genome-wide association study using perfect phylogeny and phase-known reference data.
He Y, Li C, Amos CI, Xiong M, Ling H, Jin L. PLoS One. 2011;6(7):e22097. doi: 10.1371/journal.pone.0022097. Epub 2011 Jul 15. PMID: 21789217 Free PMC article.
- WinHAP2: an extremely fast haplotype phasing program for long genotype sequences.
Pan W, Zhao Y, Xu Y, Zhou F. BMC Bioinformatics. 2014 May 30;15:164. doi: 10.1186/1471-2105-15-164. PMID: 24884701 Free PMC article.
- WinHAP: an efficient haplotype phasing algorithm based on scalable sliding windows.
Xu Y, Cheng W, Nie P, Zhou F. PLoS One. 2012;7(8):e43163. doi: 10.1371/journal.pone.0043163. Epub 2012 Aug 14. PMID: 22905221 Free PMC article.