Pairwise local structural alignment of RNA sequences with sequence similarity less than 40%
Jakob Hull Havgaard et al. Bioinformatics. 2005.
Abstract
Motivation: Searching for non-coding RNA (ncRNA) genes and structural RNA elements (eleRNA) is a major challenge in gene finding today, as these are often conserved in structure rather than in sequence. Even though the number of available methods is growing, it is still of interest to detect, by pairwise comparison, two genes with low sequence similarity where each gene is part of a larger genomic region.
Results: Here we present such an approach for pairwise local alignment, based on foldalign and the Sankoff algorithm for simultaneous structural alignment of multiple sequences. We include the ability to conduct mutual scans of two sequences of arbitrary length while searching for common local structural motifs of some maximum length. This drastically reduces the complexity of the algorithm. The scoring scheme includes structural parameters corresponding to those available for free energy as well as for substitution matrices similar to RIBOSUM. The new foldalign implementation is tested on a dataset where the ncRNAs and eleRNAs have sequence similarity <40% and where the ncRNAs and eleRNAs are energetically indistinguishable from the surrounding genomic sequence context. The method is tested in two ways: (1) its ability to find the common structure between the genes only and (2) its ability to locate ncRNAs and eleRNAs in a genomic context. In case (1), it makes sense to compare with methods like Dynalign; the performances are very similar, but foldalign is substantially faster. The structure prediction performance for a family is typically around 0.7, as measured by the Matthews correlation coefficient. In case (2), the algorithm is successful at locating RNA families, with an average sensitivity of 0.8 and a positive predictive value of 0.9 using a BLAST-like hit selection scheme.
Availability: The program is available online at http://foldalign.kvl.dk/
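The abstract reports performance as Matthews correlation coefficient, sensitivity and positive predictive value. As a minimal sketch of the standard definitions of these measures, the Python below compares a predicted set of base pairs with a reference set. The function name `pair_metrics`, the toy structures, and the choice to count every unordered pair of positions as a candidate base pair (which determines the true negatives) are assumptions made for illustration; the abstract does not state how the authors counted these quantities, and in case (2) the measures apply to located hits rather than to base pairs.

```python
from math import sqrt


def pair_metrics(predicted, reference, seq_len):
    """Return (sensitivity, PPV, MCC) for predicted vs. reference base pairs.

    predicted, reference: sets of (i, j) tuples with i < j.
    seq_len: sequence length, used only to count true negatives.
    """
    tp = len(predicted & reference)   # pairs found in both structures
    fp = len(predicted - reference)   # predicted pairs absent from the reference
    fn = len(reference - predicted)   # reference pairs that were missed
    # Assumption: every unordered pair of positions is a candidate base pair.
    tn = seq_len * (seq_len - 1) // 2 - tp - fp - fn

    sensitivity = tp / (tp + fn) if tp + fn else 0.0
    ppv = tp / (tp + fp) if tp + fp else 0.0
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return sensitivity, ppv, mcc


# Hypothetical toy structures on a 20-nucleotide sequence.
reference = {(1, 20), (2, 19), (3, 18), (4, 17)}
predicted = {(1, 20), (2, 19), (3, 18), (5, 16)}
sens, ppv, mcc = pair_metrics(predicted, reference, seq_len=20)
print(f"sensitivity={sens:.2f} PPV={ppv:.2f} MCC={mcc:.2f}")
```

Because the number of possible pairs grows quadratically with sequence length, true negatives dominate for realistic sequences, and the coefficient then stays close to the geometric mean of sensitivity and positive predictive value.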
Similar articles
- Multiple structural alignment and clustering of RNA sequences.
Torarinsson E, Havgaard JH, Gorodkin J. Bioinformatics. 2007 Apr 15;23(8):926-32. doi: 10.1093/bioinformatics/btm049. Epub 2007 Feb 25. PMID: 17324941
- The FOLDALIGN web server for pairwise structural RNA alignment and mutual motif search.
Havgaard JH, Lyngsø RB, Gorodkin J. Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W650-3. doi: 10.1093/nar/gki473. PMID: 15980555 Free PMC article.
- A local multiple alignment method for detection of non-coding RNA sequences.
Tabei Y, Asai K. Bioinformatics. 2009 Jun 15;25(12):1498-505. doi: 10.1093/bioinformatics/btp261. Epub 2009 Apr 17. PMID: 19376823
- Customized strategies for discovering distant ncRNA homologs.
Mosig A, Zhu L, Stadler PF. Brief Funct Genomic Proteomic. 2009 Nov;8(6):451-60. doi: 10.1093/bfgp/elp035. Epub 2009 Sep 24. PMID: 19779009 Review.
- Sequence and structure analysis of noncoding RNAs.
Washietl S. Methods Mol Biol. 2010;609:285-306. doi: 10.1007/978-1-60327-241-4_17. PMID: 20221926 Review.
Cited by
- Classification and assessment tools for structural motif discovery algorithms.
Badr G, Al-Turaiki I, Mathkour H. BMC Bioinformatics. 2013;14 Suppl 9(Suppl 9):S4. doi: 10.1186/1471-2105-14-S9-S4. Epub 2013 Jun 28. PMID: 23902564 Free PMC article.
- LocARNAscan: Incorporating thermodynamic stability in sequence and structure-based RNA homology search.
Will S, Siebauer MF, Heyne S, Engelhardt J, Stadler PF, Reiche, Backofen R. Algorithms Mol Biol. 2013 Apr 20;8:14. doi: 10.1186/1748-7188-8-14. eCollection 2013. PMID: 23601347 Free PMC article.
- On the importance of cotranscriptional RNA structure formation.
Lai D, Proctor JR, Meyer IM. RNA. 2013 Nov;19(11):1461-73. doi: 10.1261/rna.037390.112. PMID: 24131802 Free PMC article. Review.
- Detecting and comparing non-coding RNAs in the high-throughput era.
Bussotti G, Notredame C, Enright AJ. Int J Mol Sci. 2013 Jul 24;14(8):15423-58. doi: 10.3390/ijms140815423. PMID: 23887659 Free PMC article. Review.
- A Structurally Conserved RNA Element within SARS-CoV-2 ORF1a RNA and S mRNA Regulates Translation in Response to Viral S Protein-Induced Signaling in Human Lung Cells.
Basu A, Penumutchu S, Nguyen K, Mbonye U, Tolbert BS, Karn J, Komar AA, Mazumder B. J Virol. 2022 Jan 26;96(2):e0167821. doi: 10.1128/JVI.01678-21. Epub 2021 Nov 10. PMID: 34757848 Free PMC article.