Analysis of donor splice sites in different eukaryotic organisms - PubMed (original) (raw)
Analysis of donor splice sites in different eukaryotic organisms
I B Rogozin et al. J Mol Evol. 1997 Jul.
Abstract
We present here a new algorithm for functional site analysis. It is based on four main assumptions: each variation of nucleotide composition makes a different contribution to the overall binding free energy of interaction between a functional site and another molecule; nonfunctioning site-like regions (pseudosites) are absent or rare in genomes; there may be errors in the sample of sites; and nucleotides of different site positions are considered to be mutually dependent. In this algorithm, the site set is divided into subsets, each described by a certain consensus. Donor splice sites of the human protein-coding genes were analyzed. Comparing the results with other methods of donor splice site prediction has demonstrated a more accurate prediction of consensus sequences AG/GU(A,G), G/GUnAG, /GU(A,G)AG, /GU(A,G)nGU, and G/GUA than is achieved by weight matrix and consensus (A,C)AG/GU(A,G)AGU with mismatches. The probability of the first type error, E1, for the obtained consensus set was about 0.05, and the probability of the second type error, E2, was 0.15. The analysis demonstrated that accuracy of the functional site prediction could be improved if one takes into account correlations between the site positions. The accuracy of prediction by using human consensus sequences was tested on sequences from different organisms. Some differences in consensus sequences for the plant Arabidopsis sp., the invertebrate Caenorhabditis sp., and the fungus Aspergillus sp. were revealed. For the yeast Saccharomyces sp. only one conservative consensus, /GUA(U,A,C)G(U,A,C), was revealed (E1 = 0.03, E2 = 0.03). Yeast is a very interesting model to use for analysis of molecular mechanisms of splicing.
Similar articles
- Logitlinear models for the prediction of splice sites in plant pre-mRNA sequences.
Kleffe J, Hermann K, Vahrson W, Wittig B, Brendel V. Kleffe J, et al. Nucleic Acids Res. 1996 Dec 1;24(23):4709-18. doi: 10.1093/nar/24.23.4709. Nucleic Acids Res. 1996. PMID: 8972857 Free PMC article. - Signals for the selection of a splice site in pre-mRNA. Computer analysis of splice junction sequences and like sequences.
Ohshima Y, Gotoh Y. Ohshima Y, et al. J Mol Biol. 1987 May 20;195(2):247-59. doi: 10.1016/0022-2836(87)90647-4. J Mol Biol. 1987. PMID: 3656413 - Analysis of canonical and non-canonical splice sites in mammalian genomes.
Burset M, Seledtsov IA, Solovyev VV. Burset M, et al. Nucleic Acids Res. 2000 Nov 1;28(21):4364-75. doi: 10.1093/nar/28.21.4364. Nucleic Acids Res. 2000. PMID: 11058137 Free PMC article. - The mutational spectrum of single base-pair substitutions in mRNA splice junctions of human genes: causes and consequences.
Krawczak M, Reiss J, Cooper DN. Krawczak M, et al. Hum Genet. 1992 Sep-Oct;90(1-2):41-54. doi: 10.1007/BF00210743. Hum Genet. 1992. PMID: 1427786 - Arabidopsis intron mutations and pre-mRNA splicing.
Brown JW. Brown JW. Plant J. 1996 Nov;10(5):771-80. doi: 10.1046/j.1365-313x.1996.10050771.x. Plant J. 1996. PMID: 8953241 Review.
Cited by
- U6 snRNA m6A modification is required for accurate and efficient splicing of C. elegans and human pre-mRNAs.
Shen A, Hencel K, Parker MT, Scott R, Skukan R, Adesina AS, Metheringham CL, Miska EA, Nam Y, Haerty W, Simpson GG, Akay A. Shen A, et al. Nucleic Acids Res. 2024 Aug 27;52(15):9139-9160. doi: 10.1093/nar/gkae447. Nucleic Acids Res. 2024. PMID: 38808663 Free PMC article. - Combining genetic constraint with predictions of alternative splicing to prioritize deleterious splicing in rare disease studies.
Cormier MJ, Pedersen BS, Bayrak-Toydemir P, Quinlan AR. Cormier MJ, et al. BMC Bioinformatics. 2022 Nov 14;23(1):482. doi: 10.1186/s12859-022-05041-x. BMC Bioinformatics. 2022. PMID: 36376793 Free PMC article. - Risk Association, Linkage Disequilibrium, and Haplotype Analyses of β-Like Globin Gene Polymorphisms with Malaria Risk in the Sabah Population of Malaysian Borneo.
Chong ETJ, Goh LPW, Yap HJ, Yong EWC, Lee PC. Chong ETJ, et al. Genes (Basel). 2022 Jul 11;13(7):1229. doi: 10.3390/genes13071229. Genes (Basel). 2022. PMID: 35886012 Free PMC article. - Computational analysis of missense filamin-A variants, including the novel p.Arg484Gln variant of two brothers with periventricular nodular heterotopia.
Gerlevik U, Saygı C, Cangül H, Kutlu A, Çaralan EF, Topçu Y, Özören N, Sezerman OU. Gerlevik U, et al. PLoS One. 2022 May 25;17(5):e0265400. doi: 10.1371/journal.pone.0265400. eCollection 2022. PLoS One. 2022. PMID: 35613087 Free PMC article. - What's Wrong in a Jump? Prediction and Validation of Splice Site Variants.
Riolo G, Cantara S, Ricci C. Riolo G, et al. Methods Protoc. 2021 Sep 5;4(3):62. doi: 10.3390/mps4030062. Methods Protoc. 2021. PMID: 34564308 Free PMC article. Review.
References
- J Mol Biol. 1992 Dec 20;228(4):1124-36 - PubMed
- Nucleic Acids Res. 1992 Aug 25;20(16):4255-62 - PubMed
- Nat Genet. 1994 Oct;8(2):183-8 - PubMed
- Comput Appl Biosci. 1993 Oct;9(5):499-509 - PubMed
- Nucleic Acids Res. 1994 Dec 11;22(24):5156-63 - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources