Protein homology detection by HMM-HMM comparison - PubMed (original) (raw)
Comparative Study
. 2005 Apr 1;21(7):951-60.
doi: 10.1093/bioinformatics/bti125. Epub 2004 Nov 5.
Affiliations
- PMID: 15531603
- DOI: 10.1093/bioinformatics/bti125
Comparative Study
Protein homology detection by HMM-HMM comparison
Johannes Söding. Bioinformatics. 2005.
Erratum in
- Bioinformatics. 2005 May 1;21(9):2144
Abstract
Motivation: Protein homology detection and sequence alignment are at the basis of protein structure prediction, function prediction and evolution.
Results: We have generalized the alignment of protein sequences with a profile hidden Markov model (HMM) to the case of pairwise alignment of profile HMMs. We present a method for detecting distant homologous relationships between proteins based on this approach. The method (HHsearch) is benchmarked together with BLAST, PSI-BLAST, HMMER and the profile-profile comparison tools PROF_SIM and COMPASS, in an all-against-all comparison of a database of 3691 protein domains from SCOP 1.63 with pairwise sequence identities below 20%.Sensitivity: When the predicted secondary structure is included in the HMMs, HHsearch is able to detect between 2.7 and 4.2 times more homologs than PSI-BLAST or HMMER and between 1.44 and 1.9 times more than COMPASS or PROF_SIM for a rate of false positives of 10%. Approximately half of the improvement over the profile-profile comparison methods is attributable to the use of profile HMMs in place of simple profiles. Alignment quality: Higher sensitivity is mirrored by an increased alignment quality. HHsearch produced 1.2, 1.7 and 3.3 times more good alignments ('balanced' score >0.3) than the next best method (COMPASS), and 1.6, 2.9 and 9.4 times more than PSI-BLAST, at the family, superfamily and fold level, respectively.Speed: HHsearch scans a query of 200 residues against 3691 domains in 33 s on an AMD64 2GHz PC. This is 10 times faster than PROF_SIM and 17 times faster than COMPASS.
Similar articles
- COACH: profile-profile alignment of protein families using hidden Markov models.
Edgar RC, Sjölander K. Edgar RC, et al. Bioinformatics. 2004 May 22;20(8):1309-18. doi: 10.1093/bioinformatics/bth091. Epub 2004 Feb 12. Bioinformatics. 2004. PMID: 14962937 - HH-suite3 for fast remote homology detection and deep protein annotation.
Steinegger M, Meier M, Mirdita M, Vöhringer H, Haunsberger SJ, Söding J. Steinegger M, et al. BMC Bioinformatics. 2019 Sep 14;20(1):473. doi: 10.1186/s12859-019-3019-7. BMC Bioinformatics. 2019. PMID: 31521110 Free PMC article. - SVM-HUSTLE--an iterative semi-supervised machine learning approach for pairwise protein remote homology detection.
Shah AR, Oehmen CS, Webb-Robertson BJ. Shah AR, et al. Bioinformatics. 2008 Mar 15;24(6):783-90. doi: 10.1093/bioinformatics/btn028. Epub 2008 Feb 1. Bioinformatics. 2008. PMID: 18245127 - Five hierarchical levels of sequence-structure correlation in proteins.
Bystroff C, Shao Y, Yuan X. Bystroff C, et al. Appl Bioinformatics. 2004;3(2-3):97-104. doi: 10.2165/00822942-200403020-00004. Appl Bioinformatics. 2004. PMID: 15693735 Review. - Profile hidden Markov models.
Eddy SR. Eddy SR. Bioinformatics. 1998;14(9):755-63. doi: 10.1093/bioinformatics/14.9.755. Bioinformatics. 1998. PMID: 9918945 Review.
Cited by
- Janus: prediction and ranking of mutations required for functional interconversion of enzymes.
Addington TA, Mertz RW, Siegel JB, Thompson JM, Fisher AJ, Filkov V, Fleischman NM, Suen AA, Zhang C, Toney MD. Addington TA, et al. J Mol Biol. 2013 Apr 26;425(8):1378-89. doi: 10.1016/j.jmb.2013.01.034. Epub 2013 Feb 6. J Mol Biol. 2013. PMID: 23396064 Free PMC article. - Computational Design of a Novel VLP-Based Vaccine for Hepatitis B Virus.
Mobini S, Chizari M, Mafakher L, Rismani E, Rismani E. Mobini S, et al. Front Immunol. 2020 Aug 27;11:2074. doi: 10.3389/fimmu.2020.02074. eCollection 2020. Front Immunol. 2020. PMID: 33042118 Free PMC article. - Overview of computational vaccinology: vaccine development through information technology.
Vaishnav N, Gupta A, Paul S, John GJ. Vaishnav N, et al. J Appl Genet. 2015 Aug;56(3):381-91. doi: 10.1007/s13353-014-0265-2. Epub 2014 Dec 23. J Appl Genet. 2015. PMID: 25534541 Review. - learnMSA2: deep protein multiple alignments with large language and hidden Markov models.
Becker F, Stanke M. Becker F, et al. Bioinformatics. 2024 Sep 1;40(Suppl 2):ii79-ii86. doi: 10.1093/bioinformatics/btae381. Bioinformatics. 2024. PMID: 39230690 Free PMC article. - Understanding and identifying amino acid repeats.
Luo H, Nijveen H. Luo H, et al. Brief Bioinform. 2014 Jul;15(4):582-91. doi: 10.1093/bib/bbt003. Brief Bioinform. 2014. PMID: 23418055 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials