Finding protein similarities with nucleotide sequence databases - PubMed (original) (raw)
Comparative Study
Finding protein similarities with nucleotide sequence databases
S Henikoff et al. Methods Enzymol. 1990.
Abstract
In this chapter we describe strategies for the searching of translated nucleotide sequence databases. By applying standard searching techniques developed for protein databases, we have found that previously unrecognized homologies can be detected. In addition, we have shown that extremely high sensitivity can be obtained using the scoring matrix strategy for short regions of similarity. The latter approach is particularly effective for detecting homologs found at the ends of sequences and within data of poor quality. These individual methods are demonstrated for the LysR family of bacterial activator proteins. Successive applications of these methods allow for sensitive detection of complex relationships, as demonstrated for the AraC family and for the complex LuxR-OmpR-NtrC families of bacterial activator proteins. Although our examples are drawn from bacterial sequences, these methods are likewise effective for higher eukaryotic genomic sequences, where protein-coding sequences are usually interrupted by introns. This should be particularly important in the future, since much of the expected increase in nucleotide sequence databases is likely to come from eukaryotic genomic sequencing projects.
Similar articles
- A novel sequence similarity searching and visualization method based on overlappingly translated nucleic acids: the blastNP.
Biro JC, Biro JM. Biro JC, et al. Med Hypotheses. 2004;62(4):568-74. doi: 10.1016/j.mehy.2003.11.020. Med Hypotheses. 2004. PMID: 15050109 - Searching for patterns in protein and nucleic acid sequences.
Staden R. Staden R. Methods Enzymol. 1990;183:193-211. doi: 10.1016/0076-6879(90)83014-z. Methods Enzymol. 1990. PMID: 1690333 No abstract available. - Detection of protein similarities using nucleotide sequence databases.
Henikoff S, Wallace JC. Henikoff S, et al. Nucleic Acids Res. 1988 Jul 11;16(13):6191-204. doi: 10.1093/nar/16.13.6191. Nucleic Acids Res. 1988. PMID: 3135536 Free PMC article. - Finding homologs to nucleic acid or protein sequences using the framesearch program.
Healy M. Healy M. Curr Protoc Bioinformatics. 2002 Aug;Chapter 3:Unit 3.2. doi: 10.1002/0471250953.bi0302s00. Curr Protoc Bioinformatics. 2002. PMID: 18792937 Review. - An overview of computer software developed to search biological sequence databases.
Butler B. Butler B. Antisense Res Dev. 1993 Fall;3(3):243-52. doi: 10.1089/ard.1993.3.243. Antisense Res Dev. 1993. PMID: 8286925 Review.
Cited by
- The rcsA gene of Escherichia coli O9:K30:H12 is involved in the expression of the serotype-specific group I K (capsular) antigen.
Keenleyside WJ, Jayaratne P, MacLachlan PR, Whitfield C. Keenleyside WJ, et al. J Bacteriol. 1992 Jan;174(1):8-16. doi: 10.1128/jb.174.1.8-16.1992. J Bacteriol. 1992. PMID: 1729226 Free PMC article. - Recent enhancements to the Blocks Database servers.
Henikoff JG, Pietrokovski S, Henikoff S. Henikoff JG, et al. Nucleic Acids Res. 1997 Jan 1;25(1):222-5. doi: 10.1093/nar/25.1.222. Nucleic Acids Res. 1997. PMID: 9016540 Free PMC article. - DNA binding and DNA bending by the MelR transcription activator protein from Escherichia coli.
Bourgerie SJ, Michán CM, Thomas MS, Busby SJ, Hyde EI. Bourgerie SJ, et al. Nucleic Acids Res. 1997 May 1;25(9):1685-93. doi: 10.1093/nar/25.9.1685. Nucleic Acids Res. 1997. PMID: 9108148 Free PMC article. - Engineering Bafilomycin High-Producers by Manipulating Regulatory and Biosynthetic Genes in the Marine Bacterium Streptomyces lohii.
Li Z, Li S, Du L, Zhang X, Jiang Y, Liu W, Zhang W, Li S. Li Z, et al. Mar Drugs. 2021 Jan 11;19(1):29. doi: 10.3390/md19010029. Mar Drugs. 2021. PMID: 33440628 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases