Comparison of sequence profiles. Strategies for structural predictions using sequence information - PubMed
Comparative Study
L Rychlewski et al. Protein Sci. 2000 Feb.
Abstract
Distant homologies between proteins are often discovered only after the three-dimensional structures of both proteins are solved. The sequence divergence for such proteins can be so large that simple comparison of their sequences fails to identify any similarity. A new generation of sensitive alignment tools uses averaged sequences of entire homologous families (profiles) to detect such homologies. Several algorithms, including the newest generation of BLAST algorithms and BASIC, an algorithm used in our group to assign fold predictions for proteins from several genomes, are compared to each other on a large set of structurally similar proteins with little sequence similarity. Proteins in the benchmark are classified according to the level of their similarity, which allows us to demonstrate that most of the improvement of the new algorithms is achieved for proteins with strong functional similarities, with almost no progress in recognizing distant fold similarities. It is also shown that the details of profile calculation strongly influence its sensitivity in recognizing distant homologies. The most important choice is how to include information from diverging members of the family: the profile must avoid generating false predictions while accounting for the entire sequence divergence within a family. PSI-BLAST takes a conservative approach, deriving a profile from core members of the family and providing a solid improvement with almost no false predictions. BASIC strives for better sensitivity by increasing the weight of divergent family members, paying the price in lower reliability. A new FFAS algorithm introduced here uses a new procedure for profile generation that takes into account all the relations within the family and matches BASIC sensitivity with PSI-BLAST-like reliability.
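The idea of a profile as a weighted average over a family, and the effect of increasing the weight of divergent members, can be illustrated with a small sketch. This is not the procedure used by PSI-BLAST, BASIC, or FFAS: the function names (`build_profile`, `column_score`), the toy alignment, the hand-picked weights, and the dot-product column score are all assumptions chosen only to make the concept concrete.

```python
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def build_profile(alignment, weights=None):
    """Return one frequency dict per alignment column.

    alignment: list of equal-length aligned sequences ('-' marks a gap).
    weights:   one non-negative weight per sequence (hypothetical values;
               real tools derive weights from the family itself).
    """
    if weights is None:
        weights = [1.0] * len(alignment)
    if len(weights) != len(alignment):
        raise ValueError("need exactly one weight per sequence")
    length = len(alignment[0])
    if any(len(seq) != length for seq in alignment):
        raise ValueError("all aligned sequences must have the same length")

    profile = []
    for col in range(length):
        totals = dict.fromkeys(AMINO_ACIDS, 0.0)
        for seq, weight in zip(alignment, weights):
            residue = seq[col]
            if residue in totals:  # gaps and unknown symbols are skipped
                totals[residue] += weight
        norm = sum(totals.values())
        profile.append(
            {aa: (value / norm if norm else 0.0) for aa, value in totals.items()}
        )
    return profile


def column_score(col_a, col_b):
    """Dot product of two frequency vectors (an assumed similarity measure)."""
    return sum(col_a[aa] * col_b[aa] for aa in AMINO_ACIDS)


# Toy family: the third sequence plays the role of a divergent member.
family = ["ACD", "ACE", "GCE"]
uniform = build_profile(family)
upweighted = build_profile(family, weights=[1.0, 1.0, 4.0])

# Frequency of 'G' in the first column under each weighting scheme.
print(uniform[0]["G"], upweighted[0]["G"])
# Similarity between the first columns of the two profiles.
print(column_score(uniform[0], upweighted[0]))
```

Raising the weight of the divergent sequence shifts the first column toward its residue, which is the kind of trade-off the abstract describes: more influence for distant members can help detect remote relatives, but it also moves the profile away from the family core.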
Similar articles
- Improving the quality of twilight-zone alignments.
Jaroszewski L, Rychlewski L, Godzik A. Protein Sci. 2000 Aug;9(8):1487-96. doi: 10.1110/ps.9.8.1487. PMID: 10975570. Free PMC article.
- Within the twilight zone: a sensitive profile-profile comparison tool based on information theory.
Yona G, Levitt M. J Mol Biol. 2002 Feb 1;315(5):1257-75. doi: 10.1006/jmbi.2001.5293. PMID: 11827492.
- Sensitive methods for determining the relatedness of proteins with limited sequence homology.
Argos P. Curr Opin Biotechnol. 1994 Aug;5(4):361-71. doi: 10.1016/0958-1669(94)90044-2. PMID: 7765168. Review.
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.
Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ. Nucleic Acids Res. 1997 Sep 1;25(17):3389-402. doi: 10.1093/nar/25.17.3389. PMID: 9254694. Free PMC article. Review.
Cited by
- TIM-Finder: a new method for identifying TIM-barrel proteins.
Si JN, Yan RX, Wang C, Zhang Z, Su XD. BMC Struct Biol. 2009 Dec 14;9:73. doi: 10.1186/1472-6807-9-73. PMID: 20003393. Free PMC article.
- Advances in homology protein structure modeling.
Xiang Z. Curr Protein Pept Sci. 2006 Jun;7(3):217-27. doi: 10.2174/138920306777452312. PMID: 16787261. Free PMC article. Review.
- Three-dimensional structure of the catalytic domain of the yeast beta-(1,3)-glucan transferase Gas1: a molecular modeling investigation.
Papaleo E, Fantucci P, Vai M, De Gioia L. J Mol Model. 2006 Jan;12(2):237-48. doi: 10.1007/s00894-005-0025-7. Epub 2005 Oct 21. PMID: 16240096.
- Structural genomics of the Thermotoga maritima proteome implemented in a high-throughput structure determination pipeline.
Lesley SA, Kuhn P, Godzik A, Deacon AM, Mathews I, Kreusch A, Spraggon G, Klock HE, McMullan D, Shin T, Vincent J, Robb A, Brinen LS, Miller MD, McPhillips TM, Miller MA, Scheibe D, Canaves JM, Guda C, Jaroszewski L, Selby TL, Elsliger MA, Wooley J, Taylor SS, Hodgson KO, Wilson IA, Schultz PG, Stevens RC. Proc Natl Acad Sci U S A. 2002 Sep 3;99(18):11664-9. doi: 10.1073/pnas.142413399. Epub 2002 Aug 22. PMID: 12193646. Free PMC article.
- Alignment of protein sequences by their profiles.
Marti-Renom MA, Madhusudhan MS, Sali A. Protein Sci. 2004 Apr;13(4):1071-87. doi: 10.1110/ps.03379804. PMID: 15044736. Free PMC article.