Large-scale prediction of disulphide bridges using kernel methods, two-dimensional recursive neural networks, and weighted graph matching - PubMed (original) (raw)
. 2006 Mar 15;62(3):617-29.
doi: 10.1002/prot.20787.
Affiliations
- PMID: 16320312
- DOI: 10.1002/prot.20787
Large-scale prediction of disulphide bridges using kernel methods, two-dimensional recursive neural networks, and weighted graph matching
Jianlin Cheng et al. Proteins. 2006.
Abstract
The formation of disulphide bridges between cysteines plays an important role in protein folding, structure, function, and evolution. Here, we develop new methods for predicting disulphide bridges in proteins. We first build a large curated data set of proteins containing disulphide bridges to extract relevant statistics. We then use kernel methods to predict whether a given protein chain contains intrachain disulphide bridges or not, and recursive neural networks to predict the bonding probabilities of each pair of cysteines in the chain. These probabilities in turn lead to an accurate estimation of the total number of disulphide bridges and to a weighted graph matching problem that can be addressed efficiently to infer the global disulphide bridge connectivity pattern. This approach can be applied both in situations where the bonded state of each cysteine is known, or in ab initio mode where the state is unknown. Furthermore, it can easily cope with chains containing an arbitrary number of disulphide bridges, overcoming one of the major limitations of previous approaches. It can classify individual cysteine residues as bonded or nonbonded with 87% specificity and 89% sensitivity. The estimate for the total number of bridges in each chain is correct 71% of the times, and within one from the true value over 94% of the times. The prediction of the overall disulphide connectivity pattern is exact in about 51% of the chains. In addition to using profiles in the input to leverage evolutionary information, including true (but not predicted) secondary structure and solvent accessibility information yields small but noticeable improvements. Finally, once the system is trained, predictions can be computed rapidly on a proteomic or protein-engineering scale. The disulphide bridge prediction server (DIpro), software, and datasets are available through www.igb.uci.edu/servers/psss.html.
(c) 2005 Wiley-Liss, Inc.
Similar articles
- Disulfide connectivity prediction using recursive neural networks and evolutionary information.
Vullo A, Frasconi P. Vullo A, et al. Bioinformatics. 2004 Mar 22;20(5):653-9. doi: 10.1093/bioinformatics/btg463. Epub 2004 Jan 22. Bioinformatics. 2004. PMID: 15033872 - Three-stage prediction of protein beta-sheets by neural networks, alignments and graph algorithms.
Cheng J, Baldi P. Cheng J, et al. Bioinformatics. 2005 Jun;21 Suppl 1:i75-84. doi: 10.1093/bioinformatics/bti1004. Bioinformatics. 2005. PMID: 15961501 - Cysteine separations profiles on protein sequences infer disulfide connectivity.
Zhao E, Liu HL, Tsai CH, Tsai HK, Chan CH, Kao CY. Zhao E, et al. Bioinformatics. 2005 Apr 15;21(8):1415-20. doi: 10.1093/bioinformatics/bti179. Epub 2004 Dec 7. Bioinformatics. 2005. PMID: 15585533 - Sequence comparison and protein structure prediction.
Dunbrack RL Jr. Dunbrack RL Jr. Curr Opin Struct Biol. 2006 Jun;16(3):374-84. doi: 10.1016/j.sbi.2006.05.006. Epub 2006 May 19. Curr Opin Struct Biol. 2006. PMID: 16713709 Review. - Disulphide bond formation in food protein aggregation and gelation.
Visschers RW, de Jongh HH. Visschers RW, et al. Biotechnol Adv. 2005 Jan;23(1):75-80. doi: 10.1016/j.biotechadv.2004.09.005. Biotechnol Adv. 2005. PMID: 15610968 Review.
Cited by
- Towards accurate residue-residue hydrophobic contact prediction for alpha helical proteins via integer linear optimization.
Rajgaria R, McAllister SR, Floudas CA. Rajgaria R, et al. Proteins. 2009 Mar;74(4):929-47. doi: 10.1002/prot.22202. Proteins. 2009. PMID: 18767158 Free PMC article. - DBCP: a web server for disulfide bonding connectivity pattern prediction without the prior knowledge of the bonding state of cysteines.
Lin HH, Tseng LY. Lin HH, et al. Nucleic Acids Res. 2010 Jul;38(Web Server issue):W503-7. doi: 10.1093/nar/gkq514. Epub 2010 Jun 8. Nucleic Acids Res. 2010. PMID: 20530534 Free PMC article. - BactPepDB: a database of predicted peptides from a exhaustive survey of complete prokaryote genomes.
Rey J, Deschavanne P, Tuffery P. Rey J, et al. Database (Oxford). 2014 Nov 6;2014:bau106. doi: 10.1093/database/bau106. Print 2014. Database (Oxford). 2014. PMID: 25377257 Free PMC article. - Alga-PrAS (Algal Protein Annotation Suite): A Database of Comprehensive Annotation in Algal Proteomes.
Kurotani A, Yamada Y, Sakurai T. Kurotani A, et al. Plant Cell Physiol. 2017 Jan 1;58(1):e6. doi: 10.1093/pcp/pcw212. Plant Cell Physiol. 2017. PMID: 28069893 Free PMC article. - A novel pathogenic variant of the LDLR gene in the Asian population and its clinical correlation with familial hypercholesterolemia.
Chahil JK, Lye SH, Bagali PG, Alex L. Chahil JK, et al. Mol Biol Rep. 2012 Jul;39(7):7831-8. doi: 10.1007/s11033-012-1626-8. Epub 2012 Apr 28. Mol Biol Rep. 2012. PMID: 22544571
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources