The Universal Protein Resource (UniProt): an expanding universe of protein information - PubMed (original) (raw)

. 2006 Jan 1;34(Database issue):D187-91.

doi: 10.1093/nar/gkj161.

Rolf Apweiler, Amos Bairoch, Darren A Natale, Winona C Barker, Brigitte Boeckmann, Serenella Ferro, Elisabeth Gasteiger, Hongzhan Huang, Rodrigo Lopez, Michele Magrane, Maria J Martin, Raja Mazumder, Claire O'Donovan, Nicole Redaschi, Baris Suzek

Affiliations

The Universal Protein Resource (UniProt): an expanding universe of protein information

Cathy H Wu et al. Nucleic Acids Res. 2006.

Abstract

The Universal Protein Resource (UniProt) provides a central resource on protein sequences and functional annotation with three database components, each addressing a key need in protein bioinformatics. The UniProt Knowledgebase (UniProtKB), comprising the manually annotated UniProtKB/Swiss-Prot section and the automatically annotated UniProtKB/TrEMBL section, is the preeminent storehouse of protein annotation. The extensive cross-references, functional and feature annotations and literature-based evidence attribution enable scientists to analyse proteins and query across databases. The UniProt Reference Clusters (UniRef) speed similarity searches via sequence space compression by merging sequences that are 100% (UniRef100), 90% (UniRef90) or 50% (UniRef50) identical. Finally, the UniProt Archive (UniParc) stores all publicly available protein sequences, containing the history of sequence data with links to the source databases. UniProt databases continue to grow in size and in availability of information. Recent and upcoming changes to database contents, formats, controlled vocabularies and services are described. New download availability includes all major releases of UniProtKB, sequence collections by taxonomic division and complete proteomes. A bibliography mapping service has been added, and an ID mapping service will be available soon. UniProt databases can be accessed online at http://www.uniprot.org or downloaded at ftp://ftp.uniprot.org/pub/databases/.

PubMed Disclaimer

Figures

Figure 1

Figure 1

Overview of the major data sources of the UniProt databases.

Similar articles

Cited by

References

    1. Kretschmann E., Fleischmann W., Apweiler R. Automatic rule generation for protein annotation with the C4.5 data mining algorithm applied on SWISS-PROT. Bioinformatics. 2001;17:920–926. - PubMed
    1. Gattiker A., Michoud K., Rivoire C., Auchincloss A.H., Coudert E., Lima T., Kersey P., Pagni M., Sigrist C.J., Lachaize C., et al. Automated annotation of microbial proteomes in SWISS-PROT. Comput. Biol. Chem. 2003;27:49–58. - PubMed
    1. Wu C.H., Huang H., Yeh L.S., Barker W.C. Protein family classification and functional annotation. Comput. Biol. Chem. 2003;27:37–47. - PubMed
    1. Fleischmann W., Moller S., Gateau A., Apweiler R. A novel method for automatic functional annotation of proteins. Bioinformatics. 1999;15:228–233. - PubMed
    1. Holm L., Sander C. Dictionary of recurrent domains in protein structures. Proteins. 1998;33:88–96. - PubMed

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources