Automating sequence-based detection and genotyping of SNPs from diploid samples - PubMed (original) (raw)
doi: 10.1038/ng1746. Epub 2006 Feb 19.
Affiliations
- PMID: 16493422
- DOI: 10.1038/ng1746
Automating sequence-based detection and genotyping of SNPs from diploid samples
Matthew Stephens et al. Nat Genet. 2006 Mar.
Abstract
The detection of sequence variation, for which DNA sequencing has emerged as the most sensitive and automated approach, forms the basis of all genetic analysis. Here we describe and illustrate an algorithm that accurately detects and genotypes SNPs from fluorescence-based sequence data. Because the algorithm focuses particularly on detecting SNPs through the identification of heterozygous individuals, it is especially well suited to the detection of SNPs in diploid samples obtained after DNA amplification. It is substantially more accurate than existing approaches and, notably, provides a useful quantitative measure of its confidence in each potential SNP detected and in each genotype called. Calls assigned the highest confidence are sufficiently reliable to remove the need for manual review in several contexts. For example, for sequence data from 47-90 individuals sequenced on both the forward and reverse strands, the highest-confidence calls from our algorithm detected 93% of all SNPs and 100% of high-frequency SNPs, with no false positive SNPs identified and 99.9% genotyping accuracy. This algorithm is implemented in a software package, PolyPhred version 5.0, which is freely available for academic use.
Similar articles
- Large-scale genotyping of complex DNA.
Kennedy GC, Matsuzaki H, Dong S, Liu WM, Huang J, Liu G, Su X, Cao M, Chen W, Zhang J, Liu W, Yang G, Di X, Ryder T, He Z, Surti U, Phillips MS, Boyce-Jacino MT, Fodor SP, Jones KW. Kennedy GC, et al. Nat Biotechnol. 2003 Oct;21(10):1233-7. doi: 10.1038/nbt869. Epub 2003 Sep 7. Nat Biotechnol. 2003. PMID: 12960966 - A multi-array multi-SNP genotyping algorithm for Affymetrix SNP microarrays.
Xiao Y, Segal MR, Yang YH, Yeh RF. Xiao Y, et al. Bioinformatics. 2007 Jun 15;23(12):1459-67. doi: 10.1093/bioinformatics/btm131. Epub 2007 Apr 25. Bioinformatics. 2007. PMID: 17459966 - Dynamic variable selection in SNP genotype autocalling from APEX microarray data.
Podder M, Welch WJ, Zamar RH, Tebbutt SJ. Podder M, et al. BMC Bioinformatics. 2006 Nov 30;7:521. doi: 10.1186/1471-2105-7-521. BMC Bioinformatics. 2006. PMID: 17137502 Free PMC article. - Digital genotyping using molecular affinity and mass spectrometry.
Kim S, Ruparel HD, Gilliam TC, Ju J. Kim S, et al. Nat Rev Genet. 2003 Dec;4(12):1001-8. doi: 10.1038/nrg1230. Nat Rev Genet. 2003. PMID: 14631360 Review. - Single-nucleotide polymorphisms and lung disease: clinical implications.
Tebbutt SJ, James A, Paré PD. Tebbutt SJ, et al. Chest. 2007 Apr;131(4):1216-23. doi: 10.1378/chest.06-2252. Chest. 2007. PMID: 17426230 Review.
Cited by
- Familial CCM Genes Might Not Be Main Drivers for Pathogenesis of Sporadic CCMs-Genetic Similarity between Cancers and Vascular Malformations.
Zhang J, Croft J, Le A. Zhang J, et al. J Pers Med. 2023 Apr 17;13(4):673. doi: 10.3390/jpm13040673. J Pers Med. 2023. PMID: 37109059 Free PMC article. Review. - Loss of Crb2b-lf leads to anterior segment defects in old zebrafish.
Kujawski S, Crespo C, Luz M, Yuan M, Winkler S, Knust E. Kujawski S, et al. Biol Open. 2020 Feb 11;9(2):bio047555. doi: 10.1242/bio.047555. Biol Open. 2020. PMID: 31988089 Free PMC article. - The Genetic Polymorphism UGT1A4*3 Is Associated with Low Posaconazole Plasma Concentrations in Hematological Malignancy Patients Receiving the Oral Suspension.
Suh HJ, Yoon SH, Yu KS, Cho JY, Park SI, Lee E, Lee JO, Koh Y, Song KH, Choe PG, Kim ES, Bang SM, Kim HB, Kim I, Kim NJ, Song SH, Park WB, Oh MD. Suh HJ, et al. Antimicrob Agents Chemother. 2018 Jun 26;62(7):e02230-17. doi: 10.1128/AAC.02230-17. Print 2018 Jul. Antimicrob Agents Chemother. 2018. PMID: 29661871 Free PMC article. - The Crumbs_C isoform of Drosophila shows tissue- and stage-specific expression and prevents light-dependent retinal degeneration.
Spannl S, Kumichel A, Hebbar S, Kapp K, Gonzalez-Gaitan M, Winkler S, Blawid R, Jessberger G, Knust E. Spannl S, et al. Biol Open. 2017 Feb 15;6(2):165-175. doi: 10.1242/bio.020040. Biol Open. 2017. PMID: 28202468 Free PMC article. - Rare intronic variants of TCF7L2 arising by selective sweeps in an indigenous population from Mexico.
Acosta JL, Hernández-Mondragón AC, Correa-Acosta LC, Cazañas-Padilla SN, Chávez-Florencio B, Ramírez-Vega EY, Monge-Cázares T, Aguilar-Salinas CA, Tusié-Luna T, Del Bosque-Plata L. Acosta JL, et al. BMC Genet. 2016 May 26;17(1):68. doi: 10.1186/s12863-016-0372-7. BMC Genet. 2016. PMID: 27230431 Free PMC article.
Publication types
MeSH terms
Substances
Grants and funding
- 1R01HG/LM-02585/HG/NHGRI NIH HHS/United States
- ES-15478/ES/NIEHS NIH HHS/United States
- HL-66682/HL/NHLBI NIH HHS/United States
- T32 HG00035-06/HG/NHGRI NIH HHS/United States
LinkOut - more resources
Full Text Sources
Other Literature Sources