STACK: Sequence Tag Alignment and Consensus Knowledgebase
A Christoffels et al. Nucleic Acids Res. 2001.
Abstract
STACK is a tool for detection and visualisation of expressed transcript variation in the context of developmental and pathological states. The datasystem organizes and reconstructs human transcripts from available public data in the context of expression state. The expression state of a transcript can include developmental state, pathological association, site of expression and isoform of expressed transcript. STACK consensus transcripts are reconstructed from clusters that capture and reflect the growing evidence of transcript diversity. The comprehensive capture of transcript variants is achieved by the use of a novel clustering approach that is tolerant of sub-sequence diversity and does not rely on pairwise alignment. This is in contrast with other gene indexing projects. STACK is generated at least four times a year and represents the exhaustive processing of all publicly available human EST data extracted from GenBank. This processed information can be explored through 15 tissue-specific categories, a disease-related category and a whole-body index and is accessible via WWW at http://www.sanbi.ac.za/Dbases.html. STACK represents a broadly applicable resource, as it is the only reconstructed transcript database for which the tools for its generation are also broadly available (http://www.sanbi.ac.za/CODES).
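The abstract describes clustering that "does not rely on pairwise alignment". One common alignment-free idea is to compare sequences by the short words (k-mers) they contain. The sketch below is only an illustration of that idea, not the STACK implementation: the function names `word_counts` and `word_distance`, the default word length `k = 6`, and the use of whole-sequence counts rather than sliding windows are all assumptions made for the example.

```python
from collections import Counter


def word_counts(seq: str, k: int) -> Counter:
    """Count every overlapping word of length k in seq (case-insensitive)."""
    seq = seq.upper()
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def word_distance(a: str, b: str, k: int = 6) -> int:
    """Sum of squared differences in word counts between two sequences.

    Hypothetical helper: 0 means identical word composition; larger
    values mean the sequences share fewer words. No alignment is computed.
    """
    ca, cb = word_counts(a, k), word_counts(b, k)
    # Counter returns 0 for missing words, so the union of keys is safe.
    return sum((ca[w] - cb[w]) ** 2 for w in ca.keys() | cb.keys())


if __name__ == "__main__":
    est_a = "ACGTACGTTAGCCGATAGCTAGCTAGGATCC"
    est_b = "ACGTACGTTAGCCGATAGCTAGCTAGGTTCC"  # differs from est_a at one base
    est_c = "TTTTGGGGCCCCAAAATTTTGGGGCCCCAAA"  # unrelated composition
    print(word_distance(est_a, est_b), word_distance(est_a, est_c))
```

In a clustering setting, two ESTs would be placed in the same cluster when such a distance falls below a chosen threshold. The threshold and word length would need tuning on real data, and a windowed comparison would be needed to tolerate sub-sequence diversity within longer transcripts.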
Figures
Figure 1
Craw output for a whole-body index cluster displaying alternate gene isoforms of the fibulin gene. The blue box indicates the region capturing the fibulin-1B isoform whereas sequences capturing fibulin-1C are surrounded by a red box.
Figure 2
WebProbe, the STACK database extraction and viewing tool. The STACK tissue category is used as input to the ‘project name’ field that returns a list of all the clustered information.
Figure 3
An example linked entry for the olfactory tissue. The clusters contributing to the linked entry are displayed together with the ESTs comprising each cluster. Clicking a specific clusterID downloads the FASTA-formatted multi-sequence file. The hyperlinks to the right of the clusterID display detailed information about a cluster, such as phrap alignments, consensus sequence and assembly analysis information. UniGene links to each EST are provided if they exist.
Similar articles
- The Genome Sequence DataBase: towards an integrated functional genomics resource.
Skupski MP, Booker M, Farmer A, Harpold M, Huang W, Inman J, Kiphart D, Kodira C, Root S, Schilkey F, Schwertfeger J, Siepel A, Stamper D, Thayer N, Thompson R, Wortman J, Zhuang JJ, Harger C. Nucleic Acids Res. 1999 Jan 1;27(1):35-8. doi: 10.1093/nar/27.1.35. PMID: 9847136. Free PMC article.
- The TIGR Gene Indices: analysis of gene transcript sequences in highly sampled eukaryotic species.
Quackenbush J, Cho J, Lee D, Liang F, Holt I, Karamycheva S, Parvizi B, Pertea G, Sultana R, White J. Nucleic Acids Res. 2001 Jan 1;29(1):159-64. doi: 10.1093/nar/29.1.159. PMID: 11125077. Free PMC article.
- The TIGR gene indices: reconstruction and representation of expressed gene sequences.
Quackenbush J, Liang F, Holt I, Pertea G, Upton J. Nucleic Acids Res. 2000 Jan 1;28(1):141-5. doi: 10.1093/nar/28.1.141. PMID: 10592205. Free PMC article.
- A hitchhiker's guide to expressed sequence tag (EST) analysis.
Nagaraj SH, Gasser RB, Ranganathan S. Brief Bioinform. 2007 Jan;8(1):6-21. doi: 10.1093/bib/bbl015. Epub 2006 May 23. PMID: 16772268. Review.
- Database analysis and gene discovery in pharmacogenetics.
Board P, Tetlow N, Blackburn A, Chelvanayagam G. Clin Chem Lab Med. 2000 Sep;38(9):863-7. doi: 10.1515/CCLM.2000.125. PMID: 11097341. Review.
Cited by
- Cross-species transferability and mapping of genomic and cDNA SSRs in pines.
Chagné D, Chaumeil P, Ramboer A, Collada C, Guevara A, Cervera MT, Vendramin GG, Garcia V, Frigerio JM, Echt C, Richardson T, Plomion C. Theor Appl Genet. 2004 Oct;109(6):1204-14. doi: 10.1007/s00122-004-1683-z. Epub 2004 Sep 22. PMID: 15448894.
- EGassembler: online bioinformatics service for large-scale processing, clustering and assembling ESTs and genomic DNA fragments.
Masoudi-Nejad A, Tonomura K, Kawashima S, Moriya Y, Suzuki M, Itoh M, Kanehisa M, Endo T, Goto S. Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W459-62. doi: 10.1093/nar/gkl066. PMID: 16845049. Free PMC article.
- Finding signals that regulate alternative splicing in the post-genomic era.
Ladd AN, Cooper TA. Genome Biol. 2002 Oct 23;3(11):reviews0008. doi: 10.1186/gb-2002-3-11-reviews0008. Epub 2002 Oct 23. PMID: 12429065. Free PMC article. Review.
- Asymptotic behaviour and optimal word size for exact and approximate word matches between random sequences.
Forêt S, Kantorovitz MR, Burden CJ. BMC Bioinformatics. 2006 Dec 18;7 Suppl 5(Suppl 5):S21. doi: 10.1186/1471-2105-7-S5-S21. PMID: 17254306. Free PMC article.
- Diversity of aminopeptidases, derived from four lepidopteran gene duplications, and polycalins expressed in the midgut of Helicoverpa armigera: identification of proteins binding the delta-endotoxin, Cry1Ac of Bacillus thuringiensis.
Angelucci C, Barrett-Wilt GA, Hunt DF, Akhurst RJ, East PD, Gordon KH, Campbell PM. Insect Biochem Mol Biol. 2008 Jul;38(7):685-96. doi: 10.1016/j.ibmb.2008.03.010. Epub 2008 Apr 4. PMID: 18549954. Free PMC article.