COACH: profile-profile alignment of protein families using hidden Markov models - PubMed (original) (raw)
Comparative Study
. 2004 May 22;20(8):1309-18.
doi: 10.1093/bioinformatics/bth091. Epub 2004 Feb 12.
Affiliations
- PMID: 14962937
- DOI: 10.1093/bioinformatics/bth091
Comparative Study
COACH: profile-profile alignment of protein families using hidden Markov models
Robert C Edgar et al. Bioinformatics. 2004.
Abstract
Motivation: Alignments of two multiple-sequence alignments, or statistical models of such alignments (profiles), have important applications in computational biology. The increased amount of information in a profile versus a single sequence can lead to more accurate alignments and more sensitive homolog detection in database searches. Several profile-profile alignment methods have been proposed and have been shown to improve sensitivity and alignment quality compared with sequence-sequence methods (such as BLAST) and profile-sequence methods (e.g. PSI-BLAST). Here we present a new approach to profile-profile alignment we call Comparison of Alignments by Constructing Hidden Markov Models (HMMs) (COACH). COACH aligns two multiple sequence alignments by constructing a profile HMM from one alignment and aligning the other to that HMM.
Results: We compare the alignment accuracy of COACH with two recently published methods: Yona and Levitt's prof_sim and Sadreyev and Grishin's COMPASS. On two sets of reference alignments selected from the FSSP database, we find that COACH is able, on average, to produce alignments giving the best coverage or the fewest errors, depending on the chosen parameter settings.
Availability: COACH is freely available from www.drive5.com/lobster
Similar articles
- A comparison of scoring functions for protein sequence profile alignment.
Edgar RC, Sjölander K. Edgar RC, et al. Bioinformatics. 2004 May 22;20(8):1301-8. doi: 10.1093/bioinformatics/bth090. Epub 2004 Feb 12. Bioinformatics. 2004. PMID: 14962936 - Protein homology detection by HMM-HMM comparison.
Söding J. Söding J. Bioinformatics. 2005 Apr 1;21(7):951-60. doi: 10.1093/bioinformatics/bti125. Epub 2004 Nov 5. Bioinformatics. 2005. PMID: 15531603 - SATCHMO: sequence alignment and tree construction using hidden Markov models.
Edgar RC, Sjölander K. Edgar RC, et al. Bioinformatics. 2003 Jul 22;19(11):1404-11. doi: 10.1093/bioinformatics/btg158. Bioinformatics. 2003. PMID: 12874053 - Revisiting Evaluation of Multiple Sequence Alignment Methods.
Warnow T. Warnow T. Methods Mol Biol. 2021;2231:299-317. doi: 10.1007/978-1-0716-1036-7_17. Methods Mol Biol. 2021. PMID: 33289899 Review. - Phylogenomic inference of protein molecular function: advances and challenges.
Sjölander K. Sjölander K. Bioinformatics. 2004 Jan 22;20(2):170-9. doi: 10.1093/bioinformatics/bth021. Bioinformatics. 2004. PMID: 14734307 Review.
Cited by
- Automatic generation of bioinformatics tools for predicting protein-ligand binding sites.
Komiyama Y, Banno M, Ueki K, Saad G, Shimizu K. Komiyama Y, et al. Bioinformatics. 2016 Mar 15;32(6):901-7. doi: 10.1093/bioinformatics/btv593. Epub 2015 Nov 5. Bioinformatics. 2016. PMID: 26545824 Free PMC article. - De-DUFing the DUFs: Deciphering distant evolutionary relationships of Domains of Unknown Function using sensitive homology detection methods.
Mudgal R, Sandhya S, Chandra N, Srinivasan N. Mudgal R, et al. Biol Direct. 2015 Jul 31;10:38. doi: 10.1186/s13062-015-0069-2. Biol Direct. 2015. PMID: 26228684 Free PMC article. - MUSCLE: a multiple sequence alignment method with reduced time and space complexity.
Edgar RC. Edgar RC. BMC Bioinformatics. 2004 Aug 19;5:113. doi: 10.1186/1471-2105-5-113. BMC Bioinformatics. 2004. PMID: 15318951 Free PMC article. - MUSCLE: multiple sequence alignment with high accuracy and high throughput.
Edgar RC. Edgar RC. Nucleic Acids Res. 2004 Mar 19;32(5):1792-7. doi: 10.1093/nar/gkh340. Print 2004. Nucleic Acids Res. 2004. PMID: 15034147 Free PMC article. - Contrastive learning on protein embeddings enlightens midnight zone.
Heinzinger M, Littmann M, Sillitoe I, Bordin N, Orengo C, Rost B. Heinzinger M, et al. NAR Genom Bioinform. 2022 Jun 11;4(2):lqac043. doi: 10.1093/nargab/lqac043. eCollection 2022 Jun. NAR Genom Bioinform. 2022. PMID: 35702380 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials
Miscellaneous