Profile analysis: detection of distantly related proteins - PubMed (original) (raw)
Comparative Study
Profile analysis: detection of distantly related proteins
M Gribskov et al. Proc Natl Acad Sci U S A. 1987 Jul.
Abstract
Profile analysis is a method for detecting distantly related proteins by sequence comparison. The basis for comparison is not only the customary Dayhoff mutational-distance matrix but also the results of structural studies and information implicit in the alignments of the sequences of families of similar proteins. This information is expressed in a position-specific scoring table (profile), which is created from a group of sequences previously aligned by structural or sequence similarity. The similarity of any other sequence (target) to the group of aligned sequences (probe) can be tested by comparing the target to the profile using dynamic programming algorithms. The profile method differs in two major respects from methods of sequence comparison in common use: (i) Any number of known sequences can be used to construct the profile, allowing more information to be used in the testing of the target than is possible with pairwise alignment methods. (ii) The profile includes the penalties for insertion or deletion at each position, which allow one to include the probe secondary structure in the testing scheme. Tests with globin and immunoglobulin sequences show that profile analysis can distinguish all members of these families from all other sequences in a database containing 3800 protein sequences.
Similar articles
- Profile scanning for three-dimensional structural patterns in protein sequences.
Gribskov M, Homyak M, Edenfield J, Eisenberg D. Gribskov M, et al. Comput Appl Biosci. 1988 Mar;4(1):61-6. doi: 10.1093/bioinformatics/4.1.61. Comput Appl Biosci. 1988. PMID: 3383004 - Using CLUSTAL for multiple sequence alignments.
Higgins DG, Thompson JD, Gibson TJ. Higgins DG, et al. Methods Enzymol. 1996;266:383-402. doi: 10.1016/s0076-6879(96)66024-8. Methods Enzymol. 1996. PMID: 8743695 - Finding homologs to nucleic acid or protein sequences using the framesearch program.
Healy M. Healy M. Curr Protoc Bioinformatics. 2002 Aug;Chapter 3:Unit 3.2. doi: 10.1002/0471250953.bi0302s00. Curr Protoc Bioinformatics. 2002. PMID: 18792937 Review. - Nucleic acid and protein sequence databases.
Kneale GG, Bishop MJ. Kneale GG, et al. Comput Appl Biosci. 1985;1(1):11-7. doi: 10.1093/bioinformatics/1.1.11. Comput Appl Biosci. 1985. PMID: 3916889 Review.
Cited by
- A generalized protein identification method for novel and diverse sequencing technologies.
Bhandari BK, Goldman N. Bhandari BK, et al. NAR Genom Bioinform. 2024 Sep 18;6(3):lqae126. doi: 10.1093/nargab/lqae126. eCollection 2024 Sep. NAR Genom Bioinform. 2024. PMID: 39296929 Free PMC article. - nail: software for high-speed, high-sensitivity protein sequence annotation.
Roddy JW, Rich DH, Wheeler TJ. Roddy JW, et al. bioRxiv [Preprint]. 2024 Jan 30:2024.01.27.577580. doi: 10.1101/2024.01.27.577580. bioRxiv. 2024. PMID: 38352323 Free PMC article. Preprint. - In silico protein function prediction: the rise of machine learning-based approaches.
Chen J, Gu Z, Lai L, Pei J. Chen J, et al. Med Rev (2021). 2023 Nov 29;3(6):487-510. doi: 10.1515/mr-2023-0038. eCollection 2023 Dec. Med Rev (2021). 2023. PMID: 38282798 Free PMC article. Review. - Bioinformatic identification of ClpI, a distinct class of Clp unfoldases in Actinomycetota.
Jiang J, Schmitz KR. Jiang J, et al. Front Microbiol. 2023 Apr 17;14:1161764. doi: 10.3389/fmicb.2023.1161764. eCollection 2023. Front Microbiol. 2023. PMID: 37138635 Free PMC article. - Rational Design of Profile HMMs for Sensitive and Specific Sequence Detection with Case Studies Applied to Viruses, Bacteriophages, and Casposons.
Oliveira LS, Reyes A, Dutilh BE, Gruber A. Oliveira LS, et al. Viruses. 2023 Feb 13;15(2):519. doi: 10.3390/v15020519. Viruses. 2023. PMID: 36851733 Free PMC article.
References
- J Mol Biol. 1966 Mar;16(1):9-16 - PubMed
- Nucleic Acids Res. 1986 Aug 26;14(16):6745-63 - PubMed
- Annu Rev Biochem. 1978;47:251-76 - PubMed
- J Mol Biol. 1980 Jan 25;136(3):225-70 - PubMed
- Science. 1981 Oct 9;214(4517):149-59 - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources