Consensus folding of aligned sequences as a new measure for the detection of functional RNAs by comparative genomics
Stefan Washietl et al. J Mol Biol. 2004.
Abstract
Facing the ever-growing list of newly discovered classes of functional RNAs, it can be expected that further types of functional RNAs are still hidden in recently completed genomes. The computational identification of such RNA genes is, therefore, of major importance. While most known functional RNAs have characteristic secondary structures, their free energies are generally not statistically significant enough to distinguish RNA genes from the genomic background. Additional information is required. Considering the wide availability of new genomic data of closely related species, comparative studies seem to be the most promising approach. Here, we show that prediction of consensus structures of aligned sequences can be a significant measure to detect functional RNAs. We report a new method to test multiple sequence alignments for the existence of an unusually structured and conserved fold. We show for alignments of six types of well-known functional RNA that an energy score consisting of free energy and a covariation term significantly improves sensitivity compared to single sequence predictions. We further test our method on a number of non-coding RNAs from Caenorhabditis elegans/Caenorhabditis briggsae and seven Saccharomyces species. Most RNAs can be detected with high significance. We provide a Perl implementation that can be used readily to score single alignments and discuss how the methods described here can be extended to allow for efficient genome-wide screens.
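The abstract does not spell out how "unusually structured" is quantified. One plausible reading, assumed here purely for illustration, is to compare the score of the real alignment against the scores of randomized alignments with shuffled columns, and to report a z-score. The Python sketch below follows that assumption. The scoring function `toy_hairpin_score` is a hypothetical stand-in that only counts columns able to pair in every sequence; it is not the free energy plus covariation score of the paper, and none of the names below come from the authors' Perl implementation.

```python
import random
import statistics

# Watson-Crick pairs plus G-U wobble pairs.
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")}


def toy_hairpin_score(alignment):
    """Hypothetical stand-in for a consensus folding energy (lower is better).

    Counts mirror-image column pairs (i, n-1-i) that can base-pair in every
    sequence and returns the negated count. This is NOT a thermodynamic model.
    """
    n = len(alignment[0])
    paired = 0
    for i in range(n // 2):
        j = n - 1 - i
        if all((seq[i], seq[j]) in PAIRS for seq in alignment):
            paired += 1
    return -float(paired)


def shuffle_columns(alignment, rng):
    """Permute whole alignment columns, so per-column conservation is kept."""
    order = list(range(len(alignment[0])))
    rng.shuffle(order)
    return ["".join(seq[k] for k in order) for seq in alignment]


def z_score(alignment, score_fn=toy_hairpin_score, n_shuffles=200, seed=0):
    """z-score of the observed score against a column-shuffled null (assumed)."""
    rng = random.Random(seed)
    observed = score_fn(alignment)
    null = [score_fn(shuffle_columns(alignment, rng)) for _ in range(n_shuffles)]
    mu = statistics.mean(null)
    sigma = statistics.pstdev(null)
    if sigma == 0:
        return 0.0  # degenerate null distribution: no evidence either way
    return (observed - mu) / sigma


if __name__ == "__main__":
    # Three aligned toy sequences sharing one hairpin with compensatory changes.
    aln = ["GGGGAAAACCCC", "GCGCAAAAGCGC", "AUAUAAAAAUAU"]
    print("observed score:", toy_hairpin_score(aln))
    print("z-score:", z_score(aln))
```

A strongly negative z-score would indicate that the alignment scores better than its shuffled counterparts. In a realistic setting `score_fn` would be replaced by a proper consensus structure prediction, which this sketch deliberately leaves out.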