The SYSTERS protein sequence cluster set - PubMed (original) (raw)
The SYSTERS protein sequence cluster set
A Krause et al. Nucleic Acids Res. 2000.
Abstract
The SYSTERS (short for SYSTEmatic Re-Searching) protein sequence cluster set consists of the classification of all sequences from SWISS-PROT and PIR into disjoint protein family clusters and hierarchically into superfamily and subfamily clusters. The cluster set can be searched with a sequence using the SSMAL search tool or a traditional database search tool like BLAST or FASTA. Additionally a multiple alignment is generated for each cluster and annotated with domain information from the Pfam database of protein domain families. A taxonomic overview of the organisms covered by a cluster is given based on the NCBI taxonomy. The cluster set is available for querying and browsing at http://www.dkfz-heidelberg. de/tbi/services/cluster/systersform
Figures
Figure 1
Overview of the SYSTERS Web server. As an example, the cluster set was searched with a query sequence (top left). A more detailed insight into the overlapping cluster O97 containing 17 sequences sorted into two subfamilies is given (middle right). Additionally, the taxonomic overview of the organisms covered by the cluster (middle left), the list of clusters contained in the corresponding superfamily (bottom left), and the domain composition of the sequences in the cluster (bottom right) is shown.
Similar articles
- WWW access to the SYSTERS protein sequence cluster set.
Krause A, Nicodème P, Bornberg-Bauer E, Rehmsmeier M, Vingron M. Krause A, et al. Bioinformatics. 1999 Mar;15(3):262-3. doi: 10.1093/bioinformatics/15.3.262. Bioinformatics. 1999. PMID: 10222416 - SYSTERS, GeneNest, SpliceNest: exploring sequence space from genome to protein.
Krause A, Haas SA, Coward E, Vingron M. Krause A, et al. Nucleic Acids Res. 2002 Jan 1;30(1):299-300. doi: 10.1093/nar/30.1.299. Nucleic Acids Res. 2002. PMID: 11752319 Free PMC article. - SSMAL: similarity searching with alignment graphs.
Nicodème P. Nicodème P. Bioinformatics. 1998;14(6):508-15. doi: 10.1093/bioinformatics/14.6.508. Bioinformatics. 1998. PMID: 9694989 - Bioinformatics in protein analysis.
Persson B. Persson B. EXS. 2000;88:215-31. doi: 10.1007/978-3-0348-8458-7_14. EXS. 2000. PMID: 10803381 Review. - Clustered sequence representation for fast homology search.
Cameron M, Bernstein Y, Williams HE. Cameron M, et al. J Comput Biol. 2007 Jun;14(5):594-614. doi: 10.1089/cmb.2007.R005. J Comput Biol. 2007. PMID: 17683263 Review.
Cited by
- Review on the Application of Machine Learning Algorithms in the Sequence Data Mining of DNA.
Yang A, Zhang W, Wang J, Yang K, Han Y, Zhang L. Yang A, et al. Front Bioeng Biotechnol. 2020 Sep 4;8:1032. doi: 10.3389/fbioe.2020.01032. eCollection 2020. Front Bioeng Biotechnol. 2020. PMID: 33015010 Free PMC article. Review. - Chitinase from Thermomyces lanuginosus SSBP and its biotechnological applications.
Khan FI, Bisetty K, Singh S, Permaul K, Hassan MI. Khan FI, et al. Extremophiles. 2015 Nov;19(6):1055-66. doi: 10.1007/s00792-015-0792-8. Extremophiles. 2015. PMID: 26462798 Review. - Towards New Drug Targets? Function Prediction of Putative Proteins of Neisseria meningitidis MC58 and Their Virulence Characterization.
Shahbaaz M, Bisetty K, Ahmad F, Hassan MI. Shahbaaz M, et al. OMICS. 2015 Jul;19(7):416-34. doi: 10.1089/omi.2015.0032. Epub 2015 Jun 15. OMICS. 2015. PMID: 26076386 Free PMC article. - Optimizing high performance computing workflow for protein functional annotation.
Stanberry L, Rekepalli B, Liu Y, Giblock P, Higdon R, Montague E, Broomall W, Kolker N, Kolker E. Stanberry L, et al. Concurr Comput. 2014 Sep 10;26(13):2112-2121. doi: 10.1002/cpe.3264. Concurr Comput. 2014. PMID: 25313296 Free PMC article. - kClust: fast and sensitive clustering of large protein sequence databases.
Hauser M, Mayer CE, Söding J. Hauser M, et al. BMC Bioinformatics. 2013 Aug 15;14:248. doi: 10.1186/1471-2105-14-248. BMC Bioinformatics. 2013. PMID: 23945046 Free PMC article.
References
- Altschul S.F., Gish,W., Miller,W., Myers,E.W. and Lipman,D.J. (1990) J. Mol. Biol., 215, 403–410. - PubMed
- Krause A. and Vingron,M. (1998) Bioinformatics, 14, 430–438. - PubMed
- Krause A., Nicodème,P., Bornberg-Bauer,E., Rehmsmeier,M. and Vingron,M. (1999) Bioinformatics, 15, 262–263. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Research Materials