Statistical features of human exons and their flanking regions - PubMed (original) (raw)
Statistical features of human exons and their flanking regions
M Q Zhang. Hum Mol Genet. 1998 May.
Abstract
To facilitate gene finding and for the investigation of human molecular genetics on a genome scale, we present a comprehensive survey on various statistical features of human exons. We first show that human exons with flanking genomic DNA sequences can be classified into 12 mutually exclusive categories. This classification could serve as a standard for future studies so that direct comparisons of results can be made. A database for eight categories (related to human genes in which coding regions are split by introns) was built from GenBank release 87.0 and analyzed by a number of methods to characterize statistical features of these sequences that may serve as controls or regulatory signals for gene expression. The statistical information compiled includes profiles of signals for transcription, splicing and translation, various compositional statistics and size distributions. Further analyses reveal novel correlations and constraints among different splicing features across an internal exon that are consistent with the Exon Definition model. This information is fundamental for a quantitative view of human gene organization, and should be invaluable for individual scientists to design human molecular genetics experiments.
Similar articles
- Fission yeast gene structure and recognition.
Zhang MQ, Marr TG. Zhang MQ, et al. Nucleic Acids Res. 1994 May 11;22(9):1750-9. doi: 10.1093/nar/22.9.1750. Nucleic Acids Res. 1994. PMID: 8202381 Free PMC article. - A relationship between GC content and coding-sequence length.
Oliver JL, Marín A. Oliver JL, et al. J Mol Evol. 1996 Sep;43(3):216-23. doi: 10.1007/BF02338829. J Mol Evol. 1996. PMID: 8703087 - The 5' leader of plant PgiC has an intron: the leader shows both the loss and maintenance of constraints compared with introns and exons in the coding region.
Gottlieb LD, Ford VS. Gottlieb LD, et al. Mol Biol Evol. 2002 Sep;19(9):1613-23. doi: 10.1093/oxfordjournals.molbev.a004223. Mol Biol Evol. 2002. PMID: 12200488 - Biased distribution of adenine and thymine in gene nucleotide sequences.
Mrázek J, Kypr J. Mrázek J, et al. J Mol Evol. 1994 Nov;39(5):439-47. doi: 10.1007/BF00173412. J Mol Evol. 1994. PMID: 7528807 - Advances in the Exon-Intron Database (EID).
Shepelev V, Fedorov A. Shepelev V, et al. Brief Bioinform. 2006 Jun;7(2):178-85. doi: 10.1093/bib/bbl003. Epub 2006 Mar 9. Brief Bioinform. 2006. PMID: 16772261 Review.
Cited by
- Genotype Characterization and MiRNA Expression Profiling in Usher Syndrome Cell Lines.
Tom WA, Chandel DS, Jiang C, Krzyzanowski G, Fernando N, Olou A, Fernando MR. Tom WA, et al. Int J Mol Sci. 2024 Sep 17;25(18):9993. doi: 10.3390/ijms25189993. Int J Mol Sci. 2024. PMID: 39337481 Free PMC article. - A hybrid approach of ensemble learning and grey wolf optimizer for DNA splice junction prediction.
Hamouda E, Tarek M. Hamouda E, et al. PLoS One. 2024 Sep 23;19(9):e0310698. doi: 10.1371/journal.pone.0310698. eCollection 2024. PLoS One. 2024. PMID: 39312561 Free PMC article. - Genetic Testing in Patients with Autoimmune Lymphoproliferative Syndrome: Experience of 802 Patients at Cincinnati Children's Hospital Medical Center.
Xu X, Denton J, Wu Y, Liu J, Guan Q, Dawson DB, Bleesing J, Zhang W. Xu X, et al. J Clin Immunol. 2024 Jul 26;44(7):166. doi: 10.1007/s10875-024-01772-z. J Clin Immunol. 2024. PMID: 39060684 Free PMC article. - Co-transcriptional gene regulation in eukaryotes and prokaryotes.
Shine M, Gordon J, Schärfen L, Zigackova D, Herzel L, Neugebauer KM. Shine M, et al. Nat Rev Mol Cell Biol. 2024 Jul;25(7):534-554. doi: 10.1038/s41580-024-00706-2. Epub 2024 Mar 20. Nat Rev Mol Cell Biol. 2024. PMID: 38509203 Review. - CRISPR activation to characterize splice-altering variants in easily accessible cells.
Terkelsen T, Mikkelsen NS, Bak EN, Vad-Nielsen J, Blechingberg J, Weiss S, Drue SO, Andersen H, Andresen BS, Bak RO, Jensen UB. Terkelsen T, et al. Am J Hum Genet. 2024 Feb 1;111(2):309-322. doi: 10.1016/j.ajhg.2023.12.024. Epub 2024 Jan 24. Am J Hum Genet. 2024. PMID: 38272032 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases