Prediction of complete gene structures in human genomic DNA - PubMed (original) (raw)
Comparative Study
. 1997 Apr 25;268(1):78-94.
doi: 10.1006/jmbi.1997.0951.
Affiliations
- PMID: 9149143
- DOI: 10.1006/jmbi.1997.0951
Free article
Comparative Study
Prediction of complete gene structures in human genomic DNA
C Burge et al. J Mol Biol. 1997.
Free article
Abstract
We introduce a general probabilistic model of the gene structure of human genomic sequences which incorporates descriptions of the basic transcriptional, translational and splicing signals, as well as length distributions and compositional features of exons, introns and intergenic regions. Distinct sets of model parameters are derived to account for the many substantial differences in gene density and structure observed in distinct C + G compositional regions of the human genome. In addition, new models of the donor and acceptor splice signals are described which capture potentially important dependencies between signal positions. The model is applied to the problem of gene identification in a computer program, GENSCAN, which identifies complete exon/intron structures of genes in genomic DNA. Novel features of the program include the capacity to predict multiple genes in a sequence, to deal with partial as well as complete genes, and to predict consistent sets of genes occurring on either or both DNA strands. GENSCAN is shown to have substantially higher accuracy than existing methods when tested on standardized sets of human and vertebrate genes, with 75 to 80% of exons identified exactly. The program is also capable of indicating fairly accurately the reliability of each predicted exon. Consistently high levels of accuracy are observed for sequences of differing C + G content and for distinct groups of vertebrates.
Similar articles
- Finding genes in DNA with a Hidden Markov Model.
Henderson J, Salzberg S, Fasman KH. Henderson J, et al. J Comput Biol. 1997 Summer;4(2):127-41. doi: 10.1089/cmb.1997.4.127. J Comput Biol. 1997. PMID: 9228612 - The Gene-Finder computer tools for analysis of human and model organisms genome sequences.
Solovyev V, Salamov A. Solovyev V, et al. Proc Int Conf Intell Syst Mol Biol. 1997;5:294-302. Proc Int Conf Intell Syst Mol Biol. 1997. PMID: 9322052 - Compensatory relationship between splice sites and exonic splicing signals depending on the length of vertebrate introns.
Dewey CN, Rogozin IB, Koonin EV. Dewey CN, et al. BMC Genomics. 2006 Dec 8;7:311. doi: 10.1186/1471-2164-7-311. BMC Genomics. 2006. PMID: 17156453 Free PMC article. - Mutations that alter RNA splicing of the human HPRT gene: a review of the spectrum.
O'Neill JP, Rogan PK, Cariello N, Nicklas JA. O'Neill JP, et al. Mutat Res. 1998 Nov;411(3):179-214. doi: 10.1016/s1383-5742(98)00013-1. Mutat Res. 1998. PMID: 9804951 Review. - Advances in the Exon-Intron Database (EID).
Shepelev V, Fedorov A. Shepelev V, et al. Brief Bioinform. 2006 Jun;7(2):178-85. doi: 10.1093/bib/bbl003. Epub 2006 Mar 9. Brief Bioinform. 2006. PMID: 16772261 Review.
Cited by
- From computational models of the splicing code to regulatory mechanisms and therapeutic implications.
Capitanchik C, Wilkins OG, Wagner N, Gagneur J, Ule J. Capitanchik C, et al. Nat Rev Genet. 2025 Mar;26(3):171-190. doi: 10.1038/s41576-024-00774-2. Epub 2024 Oct 2. Nat Rev Genet. 2025. PMID: 39358547 Review. - Characterisation of bovine leukocyte Ig-like receptors.
Hogan L, Bhuju S, Jones DC, Laing K, Trowsdale J, Butcher P, Singh M, Vordermeier M, Allen RL. Hogan L, et al. PLoS One. 2012;7(4):e34291. doi: 10.1371/journal.pone.0034291. Epub 2012 Apr 2. PLoS One. 2012. PMID: 22485161 Free PMC article. - Two hAT transposon genes were transferred from Brassicaceae to broomrapes and are actively expressed in some recipients.
Sun T, Renner SS, Xu Y, Qin Y, Wu J, Sun G. Sun T, et al. Sci Rep. 2016 Jul 25;6:30192. doi: 10.1038/srep30192. Sci Rep. 2016. PMID: 27452947 Free PMC article. - A Swollenin From Talaromyces leycettanus JCM12802 Enhances Cellulase Hydrolysis Toward Various Substrates.
Zhang H, Wang Y, Brunecky R, Yao B, Xie X, Zheng F, Luo H. Zhang H, et al. Front Microbiol. 2021 Mar 29;12:658096. doi: 10.3389/fmicb.2021.658096. eCollection 2021. Front Microbiol. 2021. PMID: 33854492 Free PMC article. - Two high quality chromosome-scale genome assemblies of female and male silver pomfret (Pampus argenteus).
Hu J, Zhang Y, Li Y, Li Y, Zhang M, Huang W, Xu S, Wang D, Wang X, Liu J, Wang Y, Yan X. Hu J, et al. Sci Data. 2024 Oct 8;11(1):1100. doi: 10.1038/s41597-024-03914-9. Sci Data. 2024. PMID: 39379396 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical
Molecular Biology Databases