STRING v10: protein-protein interaction networks, integrated over the tree of life - PubMed (original) (raw)
. 2015 Jan;43(Database issue):D447-52.
doi: 10.1093/nar/gku1003. Epub 2014 Oct 28.
Andrea Franceschini 1, Stefan Wyder 1, Kristoffer Forslund 2, Davide Heller 1, Jaime Huerta-Cepas 2, Milan Simonovic 1, Alexander Roth 1, Alberto Santos 3, Kalliopi P Tsafou 3, Michael Kuhn 4, Peer Bork 5, Lars J Jensen 6, Christian von Mering 7
Affiliations
- PMID: 25352553
- PMCID: PMC4383874
- DOI: 10.1093/nar/gku1003
STRING v10: protein-protein interaction networks, integrated over the tree of life
Damian Szklarczyk et al. Nucleic Acids Res. 2015 Jan.
Abstract
The many functional partnerships and interactions that occur between proteins are at the core of cellular processing and their systematic characterization helps to provide context in molecular systems biology. However, known and predicted interactions are scattered over multiple resources, and the available data exhibit notable differences in terms of quality and completeness. The STRING database (http://string-db.org) aims to provide a critical assessment and integration of protein-protein interactions, including direct (physical) as well as indirect (functional) associations. The new version 10.0 of STRING covers more than 2000 organisms, which has necessitated novel, scalable algorithms for transferring interaction information between organisms. For this purpose, we have introduced hierarchical and self-consistent orthology annotations for all interacting proteins, grouping the proteins into families at various levels of phylogenetic resolution. Further improvements in version 10.0 include a completely redesigned prediction pipeline for inferring protein-protein associations from co-expression data, an API interface for the R computing environment and improved statistical analysis for enrichment tests in user-provided networks.
© The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Figures
Figure 1.
The STRING network view. Combined screenshots from the STRING website, which has been queried with a subset of proteins belonging to two different protein complexes in yeast (the COP9 signalosome, as well as the proteasome). Colored lines between the proteins indicate the various types of interaction evidence. Protein nodes which are enlarged indicate the availability of 3D protein structure information. Inset top right: for each protein, accessory information is available which includes annotations, cross-links and domain structures. Inset bottom right: the same network is shown after the addition of a user-configurable ‘payload’-dataset (26). In this case, the payload corresponds to color-coded protein abundance information, and reveals systematic differences in the expression strength of both complexes.
Figure 2.
Improved Co-expression analysis. STRING v10 features a completely re-designed pipeline for accessing and processing gene expression information. Left: overview of the individual steps; note that redundant expression experiments are now detected and pruned automatically. Right: improved benchmark performance of the resulting co-expression links, relative to the previous version of STRING, in four model organisms (ROC curves). The benchmark is based on the KEGG pathway maps; predicted interactions are considered to be true positives when both interacting proteins are annotated to the same KEGG map.
Figure 3.
Access to STRING from R/Bioconductor. Left: example session describing how to initialize a human protein network from the STRING database backend, and how to map a set of gene names against it. A subset of the proteins is then plotted as a STRING network (right), complete with auxiliary numerical payload-information highlighting some nodes of interest (red color halos).
Similar articles
- STRING 8--a global view on proteins and their functional interactions in 630 organisms.
Jensen LJ, Kuhn M, Stark M, Chaffron S, Creevey C, Muller J, Doerks T, Julien P, Roth A, Simonovic M, Bork P, von Mering C. Jensen LJ, et al. Nucleic Acids Res. 2009 Jan;37(Database issue):D412-6. doi: 10.1093/nar/gkn760. Epub 2008 Oct 21. Nucleic Acids Res. 2009. PMID: 18940858 Free PMC article. - The STRING database in 2017: quality-controlled protein-protein association networks, made broadly accessible.
Szklarczyk D, Morris JH, Cook H, Kuhn M, Wyder S, Simonovic M, Santos A, Doncheva NT, Roth A, Bork P, Jensen LJ, von Mering C. Szklarczyk D, et al. Nucleic Acids Res. 2017 Jan 4;45(D1):D362-D368. doi: 10.1093/nar/gkw937. Epub 2016 Oct 18. Nucleic Acids Res. 2017. PMID: 27924014 Free PMC article. - STRING 7--recent developments in the integration and prediction of protein interactions.
von Mering C, Jensen LJ, Kuhn M, Chaffron S, Doerks T, Krüger B, Snel B, Bork P. von Mering C, et al. Nucleic Acids Res. 2007 Jan;35(Database issue):D358-62. doi: 10.1093/nar/gkl825. Epub 2006 Nov 10. Nucleic Acids Res. 2007. PMID: 17098935 Free PMC article. - STRING v9.1: protein-protein interaction networks, with increased coverage and integration.
Franceschini A, Szklarczyk D, Frankild S, Kuhn M, Simonovic M, Roth A, Lin J, Minguez P, Bork P, von Mering C, Jensen LJ. Franceschini A, et al. Nucleic Acids Res. 2013 Jan;41(Database issue):D808-15. doi: 10.1093/nar/gks1094. Epub 2012 Nov 29. Nucleic Acids Res. 2013. PMID: 23203871 Free PMC article. - Merging in-silico and in vitro salivary protein complex partners using the STRING database: A tutorial.
Crosara KTB, Moffa EB, Xiao Y, Siqueira WL. Crosara KTB, et al. J Proteomics. 2018 Jan 16;171:87-94. doi: 10.1016/j.jprot.2017.08.002. Epub 2017 Aug 3. J Proteomics. 2018. PMID: 28782718 Review.
Cited by
- Protein-protein interaction network study of metallo-beta-lactamase-L1 present in Stenotrophomonas maltophilia and identification of potential drug targets.
Sreenithya KH, Sugumar S. Sreenithya KH, et al. In Silico Pharmacol. 2024 Oct 29;12(2):94. doi: 10.1007/s40203-024-00270-9. eCollection 2024. In Silico Pharmacol. 2024. PMID: 39479381 - Identification of a prognostic long noncoding RNA signature in lung squamous cell carcinoma: a population-based study with a mean follow-up of 3.5 years.
Zheng R, Zheng M, Wang M, Lu F, Hu M. Zheng R, et al. Arch Public Health. 2021 Apr 28;79(1):61. doi: 10.1186/s13690-021-00588-2. Arch Public Health. 2021. PMID: 33910626 Free PMC article. - Quantitative proteomic analysis of the microbial degradation of 3-aminobenzoic acid by Comamonas sp. QT12.
Zhao S, Pan C, Zhao J, Du H, Li M, Yu H, Chen X. Zhao S, et al. Sci Rep. 2022 Oct 20;12(1):17609. doi: 10.1038/s41598-022-17570-9. Sci Rep. 2022. PMID: 36266292 Free PMC article. - Integrated Microarray Analysis to Identify Genes and Small-Molecule Drugs Associated with Stroke Progression.
Cui S, Zhao Y, Huang M, Zhang H, Zhao W, Chen Z. Cui S, et al. Evid Based Complement Alternat Med. 2022 Sep 1;2022:7634509. doi: 10.1155/2022/7634509. eCollection 2022. Evid Based Complement Alternat Med. 2022. PMID: 36091596 Free PMC article. Retracted. - Genome-wide identification, characterization and expression analysis of the LIM transcription factor family in quinoa.
Zhu X, Wang B, Wang X, Zhang C, Wei X. Zhu X, et al. Physiol Mol Biol Plants. 2021 Apr;27(4):787-800. doi: 10.1007/s12298-021-00988-2. Epub 2021 Apr 13. Physiol Mol Biol Plants. 2021. PMID: 33967462 Free PMC article.
References
- Lee D., Redfern O., Orengo C. Predicting protein function from sequence and structure. Nat. Rev. Mol. Cell Biol. 2007;8:995–1005. - PubMed
- Ouzounis C.A., Coulson R.M., Enright A.J., Kunin V., Pereira-Leal J.B. Classification schemes for protein structure and function. Nat. Rev. Genet. 2003;4:508–519. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases
Miscellaneous