Fast and reliable prediction of noncoding RNAs - PubMed (original) (raw)
Comparative Study
. 2005 Feb 15;102(7):2454-9.
doi: 10.1073/pnas.0409169102. Epub 2005 Jan 21.
Affiliations
- PMID: 15665081
- PMCID: PMC548974
- DOI: 10.1073/pnas.0409169102
Comparative Study
Fast and reliable prediction of noncoding RNAs
Stefan Washietl et al. Proc Natl Acad Sci U S A. 2005.
Abstract
We report an efficient method for detecting functional RNAs. The approach, which combines comparative sequence analysis and structure prediction, already has yielded excellent results for a small number of aligned sequences and is suitable for large-scale genomic screens. It consists of two basic components: (i) a measure for RNA secondary structure conservation based on computing a consensus secondary structure, and (ii) a measure for thermodynamic stability, which, in the spirit of a z score, is normalized with respect to both sequence length and base composition but can be calculated without sampling from shuffled sequences. Functional RNA secondary structures can be identified in multiple sequence alignments with high sensitivity and high specificity. We demonstrate that this approach is not only much more accurate than previous methods but also significantly faster. The method is implemented in the program rnaz, which can be downloaded from www.tbi.univie.ac.at/\~wash/RNAz. We screened all alignments of length n > or = 50 in the Comparative Regulatory Genomics database, which compiles conserved noncoding elements in upstream regions of orthologous genes from human, mouse, rat, Fugu, and zebrafish. We recovered all of the known noncoding RNAs and cis-acting elements with high significance and found compelling evidence for many other conserved RNA secondary structures not described so far to our knowledge.
Figures
Fig. 1.
z scores calculated by SVM regression in comparison with z scores determined from 1,000 random samples for each data point. As test sequences we chose 100 sequences from random locations in the human genome and 100 known ncRNAs from the Rfam database (31). (Upper) Correlation of z scores from two independent samplings (mean squared error: 0.00990). (Lower) Correlation of calculated z scores and sampled z scores (mean squared error: 0.00998)
Fig. 2.
Classification based on z scores and SCI by using a SVM. Alignments of tRNAs and 5S rRNAs with two to four sequences per alignment and mean pairwise identities between 60% and 90% are shown. Green circles represent native alignments, and red crosses represent shuffled random controls. The background color ranging from red to green indicates the RNA class probability for different regions of the _z_–SCI plane.
Comment in
- Tracking down noncoding RNAs.
Moulton V. Moulton V. Proc Natl Acad Sci U S A. 2005 Feb 15;102(7):2269-70. doi: 10.1073/pnas.0500129102. Epub 2005 Feb 9. Proc Natl Acad Sci U S A. 2005. PMID: 15703286 Free PMC article. No abstract available.
Similar articles
- RNAz 2.0: improved noncoding RNA detection.
Gruber AR, Findeiß S, Washietl S, Hofacker IL, Stadler PF. Gruber AR, et al. Pac Symp Biocomput. 2010:69-79. Pac Symp Biocomput. 2010. PMID: 19908359 - Identifying structural noncoding RNAs using RNAz.
Washietl S, Hofacker IL. Washietl S, et al. Curr Protoc Bioinformatics. 2007 Sep;Chapter 12:Unit 12.7. doi: 10.1002/0471250953.bi1207s19. Curr Protoc Bioinformatics. 2007. PMID: 18428784 - The RNAz web server: prediction of thermodynamically stable and evolutionarily conserved RNA structures.
Gruber AR, Neuböck R, Hofacker IL, Washietl S. Gruber AR, et al. Nucleic Acids Res. 2007 Jul;35(Web Server issue):W335-8. doi: 10.1093/nar/gkm222. Epub 2007 Apr 22. Nucleic Acids Res. 2007. PMID: 17452347 Free PMC article. - Sequence and structure analysis of noncoding RNAs.
Washietl S. Washietl S. Methods Mol Biol. 2010;609:285-306. doi: 10.1007/978-1-60327-241-4_17. Methods Mol Biol. 2010. PMID: 20221926 Review. - Folding and finding RNA secondary structure.
Mathews DH, Moss WN, Turner DH. Mathews DH, et al. Cold Spring Harb Perspect Biol. 2010 Dec;2(12):a003665. doi: 10.1101/cshperspect.a003665. Epub 2010 Aug 4. Cold Spring Harb Perspect Biol. 2010. PMID: 20685845 Free PMC article. Review.
Cited by
- Clusters of mammalian conserved RNA structures in UTRs associate with RBP binding sites.
Gadekar VP, Munk AW, Miladi M, Junge A, Backofen R, Seemann SE, Gorodkin J. Gadekar VP, et al. NAR Genom Bioinform. 2024 Aug 9;6(3):lqae089. doi: 10.1093/nargab/lqae089. eCollection 2024 Sep. NAR Genom Bioinform. 2024. PMID: 39131818 Free PMC article. - Devising Isolation Forest-Based Method to Investigate the sRNAome of Mycobacterium tuberculosis Using sRNA-seq Data.
Maity U, Aggarwal R, Balasubramanian R, Venkatraman DL, R Hegde S. Maity U, et al. Bioinform Biol Insights. 2024 Jul 30;18:11779322241263674. doi: 10.1177/11779322241263674. eCollection 2024. Bioinform Biol Insights. 2024. PMID: 39091283 Free PMC article. - Comparative RNA Genomics.
Backofen R, Gorodkin J, Hofacker IL, Stadler PF. Backofen R, et al. Methods Mol Biol. 2024;2802:347-393. doi: 10.1007/978-1-0716-3838-5_12. Methods Mol Biol. 2024. PMID: 38819565 - Unveiling hidden structural patterns in the SARS-CoV-2 genome: Computational insights and comparative analysis.
Ziesel A, Jabbari H. Ziesel A, et al. PLoS One. 2024 Apr 4;19(4):e0298164. doi: 10.1371/journal.pone.0298164. eCollection 2024. PLoS One. 2024. PMID: 38574063 Free PMC article. - Recent trends in RNA informatics: a review of machine learning and deep learning for RNA secondary structure prediction and RNA drug discovery.
Sato K, Hamada M. Sato K, et al. Brief Bioinform. 2023 Jul 20;24(4):bbad186. doi: 10.1093/bib/bbad186. Brief Bioinform. 2023. PMID: 37232359 Free PMC article. Review.
References
- Eddy, S. R. (2001) Nat. Rev. Genet. 2, 919-929. - PubMed
- Storz, G. (2002) Science 296, 1260-1263. - PubMed
- Mattick, J. S. (2003) BioEssays 25, 930-939. - PubMed
- He, L. & Hannon, G. J. (2004) Nat. Rev. Genet 5, 522-531. - PubMed
- Avner, P. & Heard, E. (2001) Nat. Rev. Genet. 2, 59-67. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources