RNAMotif, an RNA secondary structure definition and search algorithm - PubMed (original) (raw)

RNAMotif, an RNA secondary structure definition and search algorithm

T J Macke et al. Nucleic Acids Res. 2001.

Abstract

RNA molecules fold into characteristic secondary and tertiary structures that account for their diverse functional activities. Many of these RNA structures are assembled from a collection of RNA structural motifs. These basic building blocks are used repeatedly, and in various combinations, to form different RNA types and define their unique structural and functional properties. Identification of recurring RNA structural motifs will therefore enhance our understanding of RNA structure and help associate elements of RNA structure with functional and regulatory elements. Our goal was to develop a computer program that can describe an RNA structural element of any complexity and then search any nucleotide sequence database, including the complete prokaryotic and eukaryotic genomes, for these structural elements. Here we describe in detail a new computational motif search algorithm, RNAMotif, and demonstrate its utility with some motif search examples. RNAMotif differs from other motif search tools in two important aspects: first, the structure definition language is more flexible and can specify any type of base-base interaction; second, RNAMotif provides a user controlled scoring section that can be used to add capabilities that patterns alone cannot provide.

PubMed Disclaimer

Figures

Figure 1

Figure 1

Examples of RNA structure motifs and descriptor constraints with important conserved nucleotides and scoring values. (A) A helical stem closed by a tetraloop. (B) An E-loop motif. (C) The core of the E-loop depicted with the observed non-canonical base pairing interactions. The most significant structural elements within the motif are shown within the dotted box. The structural components (stems or single-stranded regions) that flank this internal loop can vary in length and composition. (D) E-loop descriptor derived from the constraints shown in (B). Additional constraints for the stems flanking the core motif not shown were used in the overall score calculations reported in Table 2.

Figure 1

Figure 1

Examples of RNA structure motifs and descriptor constraints with important conserved nucleotides and scoring values. (A) A helical stem closed by a tetraloop. (B) An E-loop motif. (C) The core of the E-loop depicted with the observed non-canonical base pairing interactions. The most significant structural elements within the motif are shown within the dotted box. The structural components (stems or single-stranded regions) that flank this internal loop can vary in length and composition. (D) E-loop descriptor derived from the constraints shown in (B). Additional constraints for the stems flanking the core motif not shown were used in the overall score calculations reported in Table 2.

Figure 1

Figure 1

Examples of RNA structure motifs and descriptor constraints with important conserved nucleotides and scoring values. (A) A helical stem closed by a tetraloop. (B) An E-loop motif. (C) The core of the E-loop depicted with the observed non-canonical base pairing interactions. The most significant structural elements within the motif are shown within the dotted box. The structural components (stems or single-stranded regions) that flank this internal loop can vary in length and composition. (D) E-loop descriptor derived from the constraints shown in (B). Additional constraints for the stems flanking the core motif not shown were used in the overall score calculations reported in Table 2.

Figure 2

Figure 2

Models for the IRE structure and generic descriptor. (A) IRE bulge model. (B) IRE internal loop model. (C) Descriptor that captures both types of IRE.

Figure 3

Figure 3

SRP-RNA alignment and structure. (A) Structure-based alignment of the domain IV stem–loop region of SRP-RNA based on the full-length alignment from the SRP-RNA website (http://psyche.uthct.edu/dbs/SRPDB/SRPDB.html). Sequences of a few representative organisms from the three major phylogenetic kingdoms (Bacteria, Archaea and Eukarya) are shown here. The top two lines of the alignment show the base pairing schema. ( and ) are used to denote the 5′ and 3′ sides of a helix, respectively, and the numbers indicate the paired segments. (B) A consensus diagram of the SRP-RNA structure derived from analyzing the alignment of over 100 organisms. The biases in nucleotides and range of size variations in loops and helices are shown. An RNAMotif descriptor (not shown) was derived based on the constraints shown here.

Figure 4

Figure 4

Summary of results of a RNAMotif search for SRP-RNA. (A) Score distribution of true and false hits based on RNAMotif scores in a search of the GenBank nucleotide database with an SRP-RNA descriptor. (B) Summary of scores for true and false hits.

Similar articles

Cited by

References

    1. Theil E.C. (1998) The iron responsive element (IRE) family of mRNA regulators. Regulation of iron transport and uptake compared in animals, plants and microorganisms. Met. Ions Biol. Syst., 35, 403–434. - PubMed
    1. Kim H.Y., LaVaute,T., Iwai,K., Klausner,R.D. and Rouault,T.A. (1996) Identification of a conserved and functional iron-responsive element in the 5′-untranslated region of mammalian mitochondrial aconitase. J. Biol. Chem., 271, 24226–24230. - PubMed
    1. Son S.Y. (1993) The structure and regulation of histone genes. Saenghwahak Nyusu, 13, 64–70.
    1. Shepherd R.K., Gabryszuk,J., Al-Ali,M., Allen,C.A., Joyce,I., Holmes,W.M. and Zehner,Z.E. (1997) A dual stem–loop structure in the 3′ untranslated region of vimentin mRNA binds specific protein(s). Nucleic Acids Symp. Ser., 36, 142–145.
    1. Zehner Z.E., Shepherd,R.K., Gabryszuk,J., Fu,T.-F., Al-Ali,M. and Holmes,W.M. (1997) RNA–protein interactions within the 3′ untranslated region of vimentin mRNA. Nucleic Acids Res., 25, 3362–3370. - PMC - PubMed

Publication types

MeSH terms

Substances

LinkOut - more resources