Comparison of sequence profiles. Strategies for structural predictions using sequence information - PubMed (original) (raw)
Comparative Study
Comparison of sequence profiles. Strategies for structural predictions using sequence information
L Rychlewski et al. Protein Sci. 2000 Feb.
Abstract
Distant homologies between proteins are often discovered only after three-dimensional structures of both proteins are solved. The sequence divergence for such proteins can be so large that simple comparison of their sequences fails to identify any similarity. New generation of sensitive alignment tools use averaged sequences of entire homologous families (profiles) to detect such homologies. Several algorithms, including the newest generation of BLAST algorithms and BASIC, an algorithm used in our group to assign fold predictions for proteins from several genomes, are compared to each other on the large set of structurally similar proteins with little sequence similarity. Proteins in the benchmark are classified according to the level of their similarity, which allows us to demonstrate that most of the improvement of the new algorithms is achieved for proteins with strong functional similarities, with almost no progress in recognizing distant fold similarities. It is also shown that details of profile calculation strongly influence its sensitivity in recognizing distant homologies. The most important choice is how to include information from diverging members of the family, avoiding generating false predictions, while accounting for entire sequence divergence within a family. PSI-BLAST takes a conservative approach, deriving a profile from core members of the family, providing a solid improvement without almost any false predictions. BASIC strives for better sensitivity by increasing the weight of divergent family members and paying the price in lower reliability. A new FFAS algorithm introduced here uses a new procedure for profile generation that takes into account all the relations within the family and matches BASIC sensitivity with PSI-BLAST like reliability.
Similar articles
- Improving the quality of twilight-zone alignments.
Jaroszewski L, Rychlewski L, Godzik A. Jaroszewski L, et al. Protein Sci. 2000 Aug;9(8):1487-96. doi: 10.1110/ps.9.8.1487. Protein Sci. 2000. PMID: 10975570 Free PMC article. - Within the twilight zone: a sensitive profile-profile comparison tool based on information theory.
Yona G, Levitt M. Yona G, et al. J Mol Biol. 2002 Feb 1;315(5):1257-75. doi: 10.1006/jmbi.2001.5293. J Mol Biol. 2002. PMID: 11827492 - Sensitive methods for determining the relatedness of proteins with limited sequence homology.
Argos P. Argos P. Curr Opin Biotechnol. 1994 Aug;5(4):361-71. doi: 10.1016/0958-1669(94)90044-2. Curr Opin Biotechnol. 1994. PMID: 7765168 Review. - Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.
Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ. Altschul SF, et al. Nucleic Acids Res. 1997 Sep 1;25(17):3389-402. doi: 10.1093/nar/25.17.3389. Nucleic Acids Res. 1997. PMID: 9254694 Free PMC article. Review.
Cited by
- FINDSITE(X): a structure-based, small molecule virtual screening approach with application to all identified human GPCRs.
Zhou H, Skolnick J. Zhou H, et al. Mol Pharm. 2012 Jun 4;9(6):1775-84. doi: 10.1021/mp3000716. Epub 2012 May 21. Mol Pharm. 2012. PMID: 22574683 Free PMC article. - A simple recipe for the non-expert bioinformaticist for building experimentally-testable hypotheses for proteins with no known homologs.
Zawaira A, Shibayama Y. Zawaira A, et al. J Struct Funct Genomics. 2012 Dec;13(4):185-200. doi: 10.1007/s10969-012-9141-7. Epub 2012 Sep 7. J Struct Funct Genomics. 2012. PMID: 22956349 Review. - AI-Driven Deep Learning Techniques in Protein Structure Prediction.
Chen L, Li Q, Nasif KFA, Xie Y, Deng B, Niu S, Pouriyeh S, Dai Z, Chen J, Xie CY. Chen L, et al. Int J Mol Sci. 2024 Aug 1;25(15):8426. doi: 10.3390/ijms25158426. Int J Mol Sci. 2024. PMID: 39125995 Free PMC article. Review. - mRNA:guanine-N7 cap methyltransferases: identification of novel members of the family, evolutionary analysis, homology modeling, and analysis of sequence-structure-function relationships.
Bujnicki JM, Feder M, Radlinska M, Rychlewski L. Bujnicki JM, et al. BMC Bioinformatics. 2001;2:2. doi: 10.1186/1471-2105-2-2. Epub 2001 Jun 22. BMC Bioinformatics. 2001. PMID: 11472630 Free PMC article. - Molecular modeling of the von Willebrand factor A2 Domain and the effects of associated type 2A von Willebrand disease mutations.
Sutherland JJ, O'Brien LA, Lillicrap D, Weaver DF. Sutherland JJ, et al. J Mol Model. 2004 Aug;10(4):259-70. doi: 10.1007/s00894-004-0194-9. Epub 2004 Aug 3. J Mol Model. 2004. PMID: 15322948
References
- J Mol Biol. 1970 Mar;48(3):443-53 - PubMed
- Proteins. 1999;Suppl 3:88-103 - PubMed
- J Mol Biol. 1990 Oct 5;215(3):403-10 - PubMed
- Nucleic Acids Res. 1993 Jul 1;21(13):3105-9 - PubMed
- J Mol Biol. 1994 Nov 4;243(4):574-8 - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials