An improved prediction of catalytic residues in enzyme structures - PubMed (original) (raw)
. 2008 May;21(5):295-302.
doi: 10.1093/protein/gzn003. Epub 2008 Feb 20.
Affiliations
- PMID: 18287176
- DOI: 10.1093/protein/gzn003
An improved prediction of catalytic residues in enzyme structures
Yu-Rong Tang et al. Protein Eng Des Sel. 2008 May.
Abstract
The protein databases contain a huge number of function unknown proteins, including many proteins with newly determined 3D structures resulted from the Structural Genomics Projects. To accelerate experiment-based assignment of function, de novo prediction of protein functional sites, like active sites in enzymes, becomes increasingly important. Here, we attempted to improve the prediction of catalytic residues in enzyme structures by seeking and refining different encodings (i.e. residue properties) as well as employing new machine learning algorithms. In particular, considering that catalytic residues can often reveal specific network centrality when representing enzyme structure as a residue contact network, the corresponding measurement (i.e. closeness centrality) was used as one of the most important encodings in our new predictor. Meanwhile, a genetic algorithm integrated neural network (GANN) was also employed. Thanks to the above strategies, our GANN predictor demonstrated a high accuracy of 91.2% in the prediction of catalytic residues based on balanced datasets (i.e. the 1:1 ratio of catalytic to non-catalytic residues). When the GANN method was optimally applied to real enzyme structures, 73.9% of the tested structures had the active site correctly located. Compared with two existing methods, the proposed GANN method also demonstrated a better performance.
Similar articles
- GANNPhos: a new phosphorylation site predictor based on a genetic algorithm integrated neural network.
Tang YR, Chen YZ, Canchaya CA, Zhang Z. Tang YR, et al. Protein Eng Des Sel. 2007 Aug;20(8):405-12. doi: 10.1093/protein/gzm035. Epub 2007 Jul 24. Protein Eng Des Sel. 2007. PMID: 17652129 - Identification of catalytic residues from protein structure using support vector machine with sequence and structural features.
Pugalenthi G, Kumar KK, Suganthan PN, Gangal R. Pugalenthi G, et al. Biochem Biophys Res Commun. 2008 Mar 14;367(3):630-4. doi: 10.1016/j.bbrc.2008.01.038. Epub 2008 Jan 17. Biochem Biophys Res Commun. 2008. PMID: 18206645 - Computed protonation properties: unique capabilities for protein functional site prediction.
Murga LF, Wei Y, Ondrechen MJ. Murga LF, et al. Genome Inform. 2007;19:107-18. Genome Inform. 2007. PMID: 18546509 - Understanding nature's catalytic toolkit.
Gutteridge A, Thornton JM. Gutteridge A, et al. Trends Biochem Sci. 2005 Nov;30(11):622-9. doi: 10.1016/j.tibs.2005.09.006. Epub 2005 Oct 7. Trends Biochem Sci. 2005. PMID: 16214343 Review. - Searching for functional sites in protein structures.
Jones S, Thornton JM. Jones S, et al. Curr Opin Chem Biol. 2004 Feb;8(1):3-7. doi: 10.1016/j.cbpa.2003.11.001. Curr Opin Chem Biol. 2004. PMID: 15036149 Review.
Cited by
- Automatic prediction of catalytic residues by modeling residue structural neighborhood.
Cilia E, Passerini A. Cilia E, et al. BMC Bioinformatics. 2010 Mar 3;11:115. doi: 10.1186/1471-2105-11-115. BMC Bioinformatics. 2010. PMID: 20199672 Free PMC article. - An automatic method for assessing structural importance of amino acid positions.
Sadowski MI, Jones DT. Sadowski MI, et al. BMC Struct Biol. 2009 Mar 4;9:10. doi: 10.1186/1472-6807-9-10. BMC Struct Biol. 2009. PMID: 19261183 Free PMC article. - Sequence conservation in the prediction of catalytic sites.
Dou Y, Geng X, Gao H, Yang J, Zheng X, Wang J. Dou Y, et al. Protein J. 2011 Apr;30(4):229-39. doi: 10.1007/s10930-011-9324-2. Protein J. 2011. PMID: 21465136 - Networks of high mutual information define the structural proximity of catalytic sites: implications for catalytic residue identification.
Marino Buslje C, Teppa E, Di Doménico T, Delfino JM, Nielsen M. Marino Buslje C, et al. PLoS Comput Biol. 2010 Nov 4;6(11):e1000978. doi: 10.1371/journal.pcbi.1000978. PLoS Comput Biol. 2010. PMID: 21079665 Free PMC article. - Catalytic residues in hydrolases: analysis of methods designed for ligand-binding site prediction.
Prymula K, Jadczyk T, Roterman I. Prymula K, et al. J Comput Aided Mol Des. 2011 Feb;25(2):117-33. doi: 10.1007/s10822-010-9402-0. Epub 2010 Nov 21. J Comput Aided Mol Des. 2011. PMID: 21104192 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources