Improving fold recognition without folds - PubMed (original) (raw)
Improving fold recognition without folds
Dariusz Przybylski et al. J Mol Biol. 2004.
Abstract
The most reliable way to align two proteins of unknown structure is through sequence-profile and profile-profile alignment methods. If the structure for one of the two is known, fold recognition methods outperform purely sequence-based alignments. Here, we introduced a novel method that aligns generalised sequence and predicted structure profiles. Using predicted 1D structure (secondary structure and solvent accessibility) significantly improved over sequence-only methods, both in terms of correctly recognising pairs of proteins with different sequences and similar structures and in terms of correctly aligning the pairs. The scores obtained by our generalised scoring matrix followed an extreme value distribution; this yielded accurate estimates of the statistical significance of our alignments. We found that mistakes in 1D structure predictions correlated between proteins from different sequence-structure families. The impact of this surprising result was that our method succeeded in significantly out-performing sequence-only methods even without explicitly using structural information from any of the two. Since AGAPE also outperformed established methods that rely on 3D information, we made it available through. If we solved the problem of CPU-time required to apply AGAPE on millions of proteins, our results could also impact everyday database searches.
Similar articles
- A 3D-1D substitution matrix for protein fold recognition that includes predicted secondary structure of the sequence.
Rice DW, Eisenberg D. Rice DW, et al. J Mol Biol. 1997 Apr 11;267(4):1026-38. doi: 10.1006/jmbi.1997.0924. J Mol Biol. 1997. PMID: 9135128 - Protein structure mining using a structural alphabet.
Tyagi M, de Brevern AG, Srinivasan N, Offmann B. Tyagi M, et al. Proteins. 2008 May 1;71(2):920-37. doi: 10.1002/prot.21776. Proteins. 2008. PMID: 18004784 - Protein structure prediction by threading methods: evaluation of current techniques.
Lemer CM, Rooman MJ, Wodak SJ. Lemer CM, et al. Proteins. 1995 Nov;23(3):337-55. doi: 10.1002/prot.340230308. Proteins. 1995. PMID: 8710827 - Sequence comparison and protein structure prediction.
Dunbrack RL Jr. Dunbrack RL Jr. Curr Opin Struct Biol. 2006 Jun;16(3):374-84. doi: 10.1016/j.sbi.2006.05.006. Epub 2006 May 19. Curr Opin Struct Biol. 2006. PMID: 16713709 Review. - 3D-1D threading methods for protein fold recognition.
David R, Korenberg MJ, Hunter IW. David R, et al. Pharmacogenomics. 2000 Nov;1(4):445-55. doi: 10.1517/14622416.1.4.445. Pharmacogenomics. 2000. PMID: 11257928 Review.
Cited by
- Simple fold composition and modular architecture of the nuclear pore complex.
Devos D, Dokudovskaya S, Williams R, Alber F, Eswar N, Chait BT, Rout MP, Sali A. Devos D, et al. Proc Natl Acad Sci U S A. 2006 Feb 14;103(7):2172-7. doi: 10.1073/pnas.0506345103. Epub 2006 Feb 6. Proc Natl Acad Sci U S A. 2006. PMID: 16461911 Free PMC article. - LTHREADER: prediction of extracellular ligand-receptor interactions in cytokines using localized threading.
Pulim V, Bienkowska J, Berger B. Pulim V, et al. Protein Sci. 2008 Feb;17(2):279-92. doi: 10.1110/ps.073178108. Epub 2007 Dec 20. Protein Sci. 2008. PMID: 18096641 Free PMC article. - Cell cycle kinases predicted from conserved biophysical properties.
Wrzeszczynski KO, Rost B. Wrzeszczynski KO, et al. Proteins. 2009 Feb 15;74(3):655-68. doi: 10.1002/prot.22181. Proteins. 2009. PMID: 18704950 Free PMC article. - The PredictProtein server.
Rost B, Yachdav G, Liu J. Rost B, et al. Nucleic Acids Res. 2004 Jul 1;32(Web Server issue):W321-6. doi: 10.1093/nar/gkh377. Nucleic Acids Res. 2004. PMID: 15215403 Free PMC article. - PSS-3D1D: an improved 3D1D profile method of protein fold recognition for the annotation of twilight zone sequences.
Ganesan K, Parthasarathy S. Ganesan K, et al. J Struct Funct Genomics. 2011 Dec;12(4):181-9. doi: 10.1007/s10969-011-9119-x. Epub 2011 Dec 3. J Struct Funct Genomics. 2011. PMID: 22160493
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources