Genome analysis: Assigning protein coding regions to three-dimensional structures - PubMed (original) (raw)
Comparative Study
Genome analysis: Assigning protein coding regions to three-dimensional structures
A A Salamov et al. Protein Sci. 1999 Apr.
Abstract
We describe the results of a procedure for maximizing the number of sequences that can be reliably linked to a protein of known three-dimensional structure. Unlike other methods, which try to increase sensitivity through the use of fold recognition software, we only use conventional sequence alignment tools, but apply them in a manner that significantly increases the number of relationships detected. We analyzed 11 genomes and found that, depending on the genome, between 23 and 32% of the ORFs had significant matches to proteins of known structure. In all cases, the aligned region consisted of either >100 residues or >50% of the smaller sequence. Slightly higher percentages could be attained if smaller motifs were also included. This is significantly higher than most previously reported methods, even those that have a fold-recognition component. We survey the biochemical and structural characteristics of the most frequently occurring proteins, and discuss the extent to which alignment methods can realistically assign function to gene products.
Similar articles
- Structural and functional insights into Mimivirus ORFans.
Saini HK, Fischer D. Saini HK, et al. BMC Genomics. 2007 May 9;8:115. doi: 10.1186/1471-2164-8-115. BMC Genomics. 2007. PMID: 17490476 Free PMC article. - [A turning point in the knowledge of the structure-function-activity relations of elastin].
Alix AJ. Alix AJ. J Soc Biol. 2001;195(2):181-93. J Soc Biol. 2001. PMID: 11727705 Review. French. - Assigning amino acid sequences to 3-dimensional protein folds.
Fischer D, Rice D, Bowie JU, Eisenberg D. Fischer D, et al. FASEB J. 1996 Jan;10(1):126-36. doi: 10.1096/fasebj.10.1.8566533. FASEB J. 1996. PMID: 8566533 Review.
Cited by
- Evaluation of PSI-BLAST alignment accuracy in comparison to structural alignments.
Friedberg I, Kaplan T, Margalit H. Friedberg I, et al. Protein Sci. 2000 Nov;9(11):2278-84. doi: 10.1110/ps.9.11.2278. Protein Sci. 2000. PMID: 11152139 Free PMC article. - Pcons: a neural-network-based consensus predictor that improves fold recognition.
Lundström J, Rychlewski L, Bujnicki J, Elofsson A. Lundström J, et al. Protein Sci. 2001 Nov;10(11):2354-62. doi: 10.1110/ps.08501. Protein Sci. 2001. PMID: 11604541 Free PMC article. - Rapid protein domain assignment from amino acid sequence using predicted secondary structure.
Marsden RL, McGuffin LJ, Jones DT. Marsden RL, et al. Protein Sci. 2002 Dec;11(12):2814-24. doi: 10.1110/ps.0209902. Protein Sci. 2002. PMID: 12441380 Free PMC article. - A comparison of position-specific score matrices based on sequence and structure alignments.
Panchenko AR, Bryant SH. Panchenko AR, et al. Protein Sci. 2002 Feb;11(2):361-70. doi: 10.1110/ps.19902. Protein Sci. 2002. PMID: 11790846 Free PMC article. - HUNT: launch of a full-length cDNA database from the Helix Research Institute.
Yudate HT, Suwa M, Irie R, Matsui H, Nishikawa T, Nakamura Y, Yamaguchi D, Peng ZZ, Yamamoto T, Nagai K, Hayashi K, Otsuki T, Sugiyama T, Ota T, Suzuki Y, Sugano S, Isogai T, Masuho Y. Yudate HT, et al. Nucleic Acids Res. 2001 Jan 1;29(1):185-8. doi: 10.1093/nar/29.1.185. Nucleic Acids Res. 2001. PMID: 11125086 Free PMC article.
References
- Proc Natl Acad Sci U S A. 1997 Oct 28;94(22):11929-34 - PubMed
- Structure. 1997 Aug 15;5(8):1093-108 - PubMed
- Proteins. 1998 Feb 15;30(3):275-86 - PubMed
- Protein Sci. 1998 Feb;7(2):233-42 - PubMed
- Proc Natl Acad Sci U S A. 1998 May 26;95(11):6073-8 - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources