UniProt: a worldwide hub of protein knowledge

UniProt Consortium. Nucleic Acids Res. 2019.

Abstract

The UniProt Knowledgebase is a collection of sequences and annotations for over 120 million proteins across all branches of life. Detailed annotations extracted from the literature by expert curators have been collected for over half a million of these proteins. These annotations are supplemented by annotations provided by rule-based automated systems and by those imported from other resources. In this article, we describe significant updates that we have made to the resource over the last 2 years. We have greatly expanded the number of Reference Proteomes that we provide and, in particular, have focussed on increasing the number of viral Reference Proteomes. The UniProt website has been augmented with new data visualizations for the subcellular localization of proteins as well as their structure and interactions. UniProt resources are available under a CC-BY (4.0) license via the web at https://www.uniprot.org/.

Figures

Figure 1. Growth of UniProt sequences over the last decade.

Figure 2. Growth of the total number of Complete Proteomes and Reference Proteomes since 2015.

Figure 3. Functional annotation describing human METTL14 (UniProtKB Q9HCE5).

Figure 4. Growth of curated automatic annotation rules within the UniRule system.

Figure 5. Interaction matrix of the human Parkin protein.

Figure 6. The subcellular localization view of a UniProt entry (UniProtKB P35670).

Figure 7. The molecular structure of the Spike protein of the human SARS coronavirus (PDB ID: 1WNC) as shown in the ProtVista protein viewer. The 3D viewer is interactively connected with the sequence-level annotations in UniProt, e.g. domains, PTMs and mutations. Note that the user can select any of the structures that map to the protein entry.
