Locating protein-coding regions in human DNA sequences by a multiple sensor-neural network approach - PubMed (original) (raw)
Locating protein-coding regions in human DNA sequences by a multiple sensor-neural network approach
E C Uberbacher et al. Proc Natl Acad Sci U S A. 1991.
Abstract
Genes in higher eukaryotes may span tens or hundreds of kilobases with the protein-coding regions accounting for only a few percent of the total sequence. Identifying genes within large regions of uncharacterized DNA is a difficult undertaking and is currently the focus of many research efforts. We describe a reliable computational approach for locating protein-coding portions of genes in anonymous DNA sequence. Using a concept suggested by robotic environmental sensing, our method combines a set of sensor algorithms and a neural network to localize the coding regions. Several algorithms that report local characteristics of the DNA sequence, and therefore act as sensors, are also described. In its current configuration the "coding recognition module" identifies 90% of coding exons of length 100 bases or greater with less than one false positive coding exon indicated per five coding exons indicated. This is a significantly lower false positive rate than any method of which we are aware. This module demonstrates a method with general applicability to sequence-pattern recognition problems and is available for current research efforts.
Similar articles
- Locating protein coding regions in human DNA using a decision tree algorithm.
Salzberg S. Salzberg S. J Comput Biol. 1995 Fall;2(3):473-85. doi: 10.1089/cmb.1995.2.473. J Comput Biol. 1995. PMID: 8521276 - Identification of coding regions in genomic DNA sequences: an application of dynamic programming and neural networks.
Snyder EE, Stormo GD. Snyder EE, et al. Nucleic Acids Res. 1993 Feb 11;21(3):607-13. doi: 10.1093/nar/21.3.607. Nucleic Acids Res. 1993. PMID: 8441672 Free PMC article. - Recognizing exons in genomic sequence using GRAIL II.
Xu Y, Mural R, Shah M, Uberbacher E. Xu Y, et al. Genet Eng (N Y). 1994;16:241-53. Genet Eng (N Y). 1994. PMID: 7765200 - Prediction of function in DNA sequence analysis.
Gelfand MS. Gelfand MS. J Comput Biol. 1995 Spring;2(1):87-115. doi: 10.1089/cmb.1995.2.87. J Comput Biol. 1995. PMID: 7497122 Review. - Engineering Aspects of Olfaction.
Persaud KC. Persaud KC. In: Persaud KC, Marco S, Gutiérrez-Gálvez A, editors. Neuromorphic Olfaction. Boca Raton (FL): CRC Press/Taylor & Francis; 2013. Chapter 1. In: Persaud KC, Marco S, Gutiérrez-Gálvez A, editors. Neuromorphic Olfaction. Boca Raton (FL): CRC Press/Taylor & Francis; 2013. Chapter 1. PMID: 26042329 Free Books & Documents. Review.
Cited by
- The glutathione reductase GSR-1 determines stress tolerance and longevity in Caenorhabditis elegans.
Lüersen K, Stegehake D, Daniel J, Drescher M, Ajonina I, Ajonina C, Hertel P, Woltersdorf C, Liebau E. Lüersen K, et al. PLoS One. 2013 Apr 8;8(4):e60731. doi: 10.1371/journal.pone.0060731. Print 2013. PLoS One. 2013. PMID: 23593298 Free PMC article. - Normal and compound poisson approximations for pattern occurrences in NGS reads.
Zhai Z, Reinert G, Song K, Waterman MS, Luan Y, Sun F. Zhai Z, et al. J Comput Biol. 2012 Jun;19(6):839-54. doi: 10.1089/cmb.2012.0029. J Comput Biol. 2012. PMID: 22697250 Free PMC article. - Molecular analysis of radiation-induced albino (c)-locus mutations that cause death at preimplantation stages of development.
Rinchik EM, Tönjes RR, Paul D, Potter MD. Rinchik EM, et al. Genetics. 1993 Dec;135(4):1107-16. doi: 10.1093/genetics/135.4.1107. Genetics. 1993. PMID: 8307326 Free PMC article. - Construction of a genomic DNA 'feature map' by sequencing from nested deletions: application to the HLA class I region.
Krishnan BR, Jamry I, Berg DE, Berg CM, Chaplin DD. Krishnan BR, et al. Nucleic Acids Res. 1995 Jan 11;23(1):117-22. doi: 10.1093/nar/23.1.117. Nucleic Acids Res. 1995. PMID: 7870576 Free PMC article. - Hex1: a new human Rad2 nuclease family member with homology to yeast exonuclease 1.
Wilson DM 3rd, Carney JP, Coleman MA, Adamson AW, Christensen M, Lamerdin JE. Wilson DM 3rd, et al. Nucleic Acids Res. 1998 Aug 15;26(16):3762-8. doi: 10.1093/nar/26.16.3762. Nucleic Acids Res. 1998. PMID: 9685493 Free PMC article.
References
- Nucleic Acids Res. 1988 Mar 11;16(5):1861-3 - PubMed
- Science. 1989 Sep 29;245(4925):1434-5 - PubMed
- Nucleic Acids Res. 1984 Dec 21;12(24):9567-75 - PubMed
- Nucleic Acids Res. 1984 Jan 11;12(1 Pt 1):387-95 - PubMed
- Proc Natl Acad Sci U S A. 1982 Apr;79(8):2554-8 - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases