A SNP resource for human chromosome 22: extracting dense clusters of SNPs from the genomic sequence - PubMed (original) (raw)
Y Chen, S Hunt, L J Smink, A Hunt, K Rice, S Livingston, S Bumpstead, R Bruskiewich, P Sham, R Ganske, M Adams, K Kawasaki, N Shimizu, S Minoshima, B Roe, D Bentley, I Dunham
Affiliations
- PMID: 11156626
- PMCID: PMC311026
- DOI: 10.1101/gr.156901
A SNP resource for human chromosome 22: extracting dense clusters of SNPs from the genomic sequence
E Dawson et al. Genome Res. 2001 Jan.
Abstract
The recent publication of the complete sequence of human chromosome 22 provides a platform from which to investigate genomic sequence variation. We report the identification and characterization of 12,267 potential variants (SNPs and other small insertions/deletions) of human chromosome 22, discovered in the overlaps of 460 clones used for the chromosome sequencing. We found, on average, 1 potential variant every 1.07 kb and approximately 18% of the potential variants involve insertions/deletions. The SNPs have been positioned both relative to each other, and to genes, predicted genes, repeat sequences, other genetic markers, and the 2730 SNPs previously identified on the chromosome. A subset of the SNPs were verified experimentally using either PCR-RFLP or genomic Invader assays. These experiments confirmed 92% of the potential variants in a panel of 92 individuals. [Details of the SNPs and RFLP assays can be found at http://www.sanger.ac.uk and in dbSNP.]
Figures
Figure 1
(following page) Distribution of polymorphisms on human chromosome 22. An ideogram of chromosome 22 with a schematic representation of the Giemsa banding pattern is shown at left. Next, the region containing the finished sequence is expanded to show the SNP map. The SNP density (the number of candidate variants in consecutive 100-kb regions) is plotted (blue line) superimposed on a plot of GC density (green). GC content is calculated as a percentage of the sequence using a sliding 100-kb window moved in 50-kb increments. The first column to the right of the graph represents sequences color-coded as per the collaborating institutions that contributed to the sequence pale yellow bars drawn horizontally across the map represent current gaps in the completed sequence. The next column to the right (Var. Density) shows a gray scale coding the number of potential variations recorded in the given 100-kb region. Such potential variations may be SNP, insertion, or deletion polymorphisms relative to the published reference sequence. The next column represents these variations as line annotations with color coding to represent the position of the variation relative to genomic features such as exons and CpG islands (a color key is on the diagram). The last column represents the corresponding map of recently published TSC (The SNP Consortium) SNPs reported by our group. The very high density of the overlap variations described in this paper (the Variations column) is not evident from the diagram because of the limits of the resolution. This diagram can also be viewed as a link from
http://www.sanger.ac.uk/cgi-bin/humace/snp\_search
where a zoom facility allows regions to be enlarged to show the positions of individual SNPs relative to annotated exons and CpG islands.
Similar articles
- QualitySNP: a pipeline for detecting single nucleotide polymorphisms and insertions/deletions in EST data from diploid and polyploid species.
Tang J, Vosman B, Voorrips RE, van der Linden CG, Leunissen JA. Tang J, et al. BMC Bioinformatics. 2006 Oct 9;7:438. doi: 10.1186/1471-2105-7-438. BMC Bioinformatics. 2006. PMID: 17029635 Free PMC article. - Chromosomal regions containing high-density and ambiguously mapped putative single nucleotide polymorphisms (SNPs) correlate with segmental duplications in the human genome.
Estivill X, Cheung J, Pujana MA, Nakabayashi K, Scherer SW, Tsui LC. Estivill X, et al. Hum Mol Genet. 2002 Aug 15;11(17):1987-95. doi: 10.1093/hmg/11.17.1987. Hum Mol Genet. 2002. PMID: 12165560 - A cSNP map and database for human chromosome 21.
Deutsch S, Iseli C, Bucher P, Antonarakis SE, Scott HS. Deutsch S, et al. Genome Res. 2001 Feb;11(2):300-7. doi: 10.1101/gr.164901. Genome Res. 2001. PMID: 11157793 Free PMC article. - Tag SNP selection for association studies.
Stram DO. Stram DO. Genet Epidemiol. 2004 Dec;27(4):365-74. doi: 10.1002/gepi.20028. Genet Epidemiol. 2004. PMID: 15372618 Review. - High-density genotyping and linkage disequilibrium in the human genome using chromosome 22 as a model.
Remm M, Metspalu A. Remm M, et al. Curr Opin Chem Biol. 2002 Feb;6(1):24-30. doi: 10.1016/s1367-5931(01)00285-x. Curr Opin Chem Biol. 2002. PMID: 11827819 Review.
Cited by
- Human diallelic insertion/deletion polymorphisms.
Weber JL, David D, Heil J, Fan Y, Zhao C, Marth G. Weber JL, et al. Am J Hum Genet. 2002 Oct;71(4):854-62. doi: 10.1086/342727. Epub 2002 Sep 4. Am J Hum Genet. 2002. PMID: 12205564 Free PMC article. - Finishing the finished human chromosome 22 sequence.
Cole CG, McCann OT, Collins JE, Oliver K, Willey D, Gribble SM, Yang F, McLaren K, Rogers J, Ning Z, Beare DM, Dunham I. Cole CG, et al. Genome Biol. 2008;9(5):R78. doi: 10.1186/gb-2008-9-5-r78. Epub 2008 May 13. Genome Biol. 2008. PMID: 18477386 Free PMC article. - A genome wide survey of SNP variation reveals the genetic structure of sheep breeds.
Kijas JW, Townley D, Dalrymple BP, Heaton MP, Maddox JF, McGrath A, Wilson P, Ingersoll RG, McCulloch R, McWilliam S, Tang D, McEwan J, Cockett N, Oddy VH, Nicholas FW, Raadsma H; International Sheep Genomics Consortium. Kijas JW, et al. PLoS One. 2009;4(3):e4668. doi: 10.1371/journal.pone.0004668. Epub 2009 Mar 3. PLoS One. 2009. PMID: 19270757 Free PMC article. - The missing indels: an estimate of indel variation in a human genome and analysis of factors that impede detection.
Jiang Y, Turinsky AL, Brudno M. Jiang Y, et al. Nucleic Acids Res. 2015 Sep 3;43(15):7217-28. doi: 10.1093/nar/gkv677. Epub 2015 Jun 30. Nucleic Acids Res. 2015. PMID: 26130710 Free PMC article. - Direct micro-haplotyping by multiple double PCR amplifications of specific alleles (MD-PASA).
Eitan Y, Kashi Y. Eitan Y, et al. Nucleic Acids Res. 2002 Jun 15;30(12):e62. doi: 10.1093/nar/gnf062. Nucleic Acids Res. 2002. PMID: 12060700 Free PMC article.
References
- Altshuler D, Pollara VJ, Cowles C, Van Etten WJ, Baldwin J, Linton L, Lander ES. A human SNP map generated by reduced representation shotgun sequencing. Nature. 2000;407:513–516. - PubMed
- Averof M, Rokas A, Wolfe KH, Sharp PM. Evidence for a high frequency of simultaneous double-nucleotide substitutions. Science. 2000;287:1283–1286. - PubMed
- Buetow KH, Edmonson MN, Cassidy AB. Reliable identification of large numbers of candidate SNPs from public EST data. Nature Genet. 1999;21:323–325. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources