Identification of functional, endogenous programmed −1 ribosomal frameshift signals in the genome of Saccharomyces cerevisiae (original) (raw)

Journal Article

Department of Cell Biology & Molecular Genetics, University of Maryland

2135 Microbiology Building, College Park, MD 20742, USA

Search for other works by this author on:

Department of Cell Biology & Molecular Genetics, University of Maryland

2135 Microbiology Building, College Park, MD 20742, USA

Search for other works by this author on:

Department of Cell Biology & Molecular Genetics, University of Maryland

2135 Microbiology Building, College Park, MD 20742, USA

Search for other works by this author on:

Department of Cell Biology & Molecular Genetics, University of Maryland

2135 Microbiology Building, College Park, MD 20742, USA

* To whom correspondence should be addressed. Tel: +1 301 405 0918; Fax: +1 301 314 9489; Email: dinman@umd.edu

Search for other works by this author on:

Present address: Jonathan L. Jacobs, Laboratory of Receptor Biology and Gene Expression, National Cancer Institute, Advanced Technology Center, Gaithersburg, MD 20877, USA

Author Notes

Revision received:

29 September 2006

Accepted:

06 November 2006

Published:

07 December 2006

Cite

Jonathan L. Jacobs, Ashton T. Belew, Rasa Rakauskaite, Jonathan D. Dinman, Identification of functional, endogenous programmed −1 ribosomal frameshift signals in the genome of Saccharomyces cerevisiae, Nucleic Acids Research, Volume 35, Issue 1, 1 January 2007, Pages 165–174, https://doi.org/10.1093/nar/gkl1033
Close

Navbar Search Filter Mobile Enter search term Search

Abstract

In viruses, programmed −1 ribosomal frameshifting (−1 PRF) signals direct the translation of alternative proteins from a single mRNA. Given that many basic regulatory mechanisms were first discovered in viral systems, the current study endeavored to: (i) identify −1 PRF signals in genomic databases, (ii) apply the protocol to the yeast genome and (iii) test selected candidates at the bench. Computational analyses revealed the presence of 10 340 consensus −1 PRF signals in the yeast genome. Of the 6353 yeast ORFs, 1275 contain at least one strong and statistically significant −1 PRF signal. Eight out of nine selected sequences promoted efficient levels of PRF in vivo . These findings provide a robust platform for high throughput computational and laboratory studies and demonstrate that functional −1 PRF signals are widespread in the genome of Saccharomyces cerevisiae . The data generated by this study have been deposited into a publicly available database called the PRFdb. The presence of stable mRNA pseudoknot structures in these −1 PRF signals, and the observation that the predicted outcomes of nearly all of these genomic frameshift signals would direct ribosomes to premature termination codons, suggest two possible mRNA destabilization pathways through which −1 PRF signals could post-transcriptionally regulate mRNA abundance.

INTRODUCTION

Programmed ribosomal frameshifting (PRF) is a translational recoding phenomenon historically associated with viruses and retrotransposons. A PRF signal stochastically redirects translating ribosomes into a new reading frame (i.e. by +1 or −1 nt) and, in the typical viral context, these signals allow ribosomes to bypass the usual in-frame stop codon and continue synthesis of a C-terminally extended fusion protein. As with most basic molecular mechanisms, although first described in viruses, it is becoming increasingly apparent that PRF is much more widespread and is likely employed by organisms representing every branch in the tree of life [for reviews see ( 1–4 )].

This report focuses on programmed −1 ribosomal frameshifting (−1 PRF) and its use by chromosomally encoded mRNAs of yeast. The most well defined −1 PRF phenomena are directed by an mRNA sequence motif composed of three important elements: a ‘slippery site’ composed of seven nucleotides where the translational shift in reading frame actually takes place; a short spacer sequence of usually <12 nt and a downstream stimulatory structure (usually a pseudoknot). A ‘typical’ −1 PRF signal is shown in Figure 1 . In eukaryotic viruses, the slippery site has the heptameric motif N NNW WWH ( 3 ). Current models posit that aminoacyl- and peptidyl-tRNAs are positioned on this sequence while the ribosome pauses at the downstream secondary structure ( 5–8 ). The nature of the slippery sequence enables re-pairing of the non-wobble bases of both the aminoacyl- and peptidyl-tRNAs with the −1 frame codons. While it is generally accepted that mRNA pseudoknots are the most common type of downstream stimulatory structures, other mRNA structures are capable of filling this role as well ( 9 , 10 ). Nonetheless, it is thought that the essential function of the stimulatory structure is to provide an energetic barrier to a translating ribosome.

Typical −1 PRF signals consist of a heptameric slippery site that fits the motif N NNW WWH (spaces indicate zero frame codons), a short spacer region of 5–12 nt, and an mRNA pseudoknot with two stem and three loop regions (S1, S2 and L1–L3, respectively). See Materials and Methods for the pseudoknot motif criteria used in this study.

Figure 1

A growing number of examples now exist of PRF signals in expressed eukaryotic genes ( 11–16 ). The existence of these PRF signals in a wide variety of viral and prokaryotic genomes suggests an ancient and possibly universal mechanism for controlling the expression of actively translated mRNAs. There have been several published reports aimed at the in silico identification of ‘recoding signals’ using a wide variety of computational approaches ( 4 , 17–21 ). While the methodologies of each study covered a broad range of bioinformatics techniques, the general goal of each of these [with the exception of ( 19 )] was fundamentally the same. Specifically, the strategy has been to first find out-of-frame ORFs followed by the identification of PRF signals in the overlapping region between them that could act to potentially redirect ribosomes from the upstream ORF into the downstream one, thereby resulting in the translation of a fusion protein. The strength of this approach is that it can identify new classes of cis-acting signals capable of directing efficient PRF. However, this computational strategy is based on the assumption that PRF outcomes should mimic those observed in viral genomes, and thus its weakness is that it cannot identify new functional outcomes of frameshifting.

In contrast, while ‘outcome-neutral’ approaches using mRNA motifs known to promote efficient PRF cannot identify new frameshift signals, they can enable an expansion of our understanding of functional uses for PRF. In this vein, the first computational search for eukaryotic −1 PRF signals ( 19 ) did not focus solely on identifying two overlapping out-of-frame ORFs, but instead aimed to find these motifs throughout the entire CDS of the yeast genome. This early study found some 260 putative −1 PRF signals in the annotated portion of the Saccharomyces cerevisiae genome. The shortcomings of the Hammell et al . ( 19 ) study were its limitation by incomplete annotation of the yeast genome, and relatively insufficient computational resources available at the time (ca. 1995–1998). Thus, in order to achieve a more comprehensive approach, a new set of informatics tools were developed and applied using faster and more robust computational platforms. The results of the current study show that: (i) pattern matching approaches coupled with a predictive method for folding RNA sequences provide a dramatic improvement in the results; (ii) −1 PRF motifs are widespread in the genome of S.cerevisiae ; (iii) many of the putative signals identified have predicted secondary structures with statistically significant measures of free energy; (iv) putative −1 PRF signals from a variety of S.cerevisiae genes promote efficient recoding when tested in vivo . Additionally, analysis of the predicted outcomes of −1 PRF events suggests that the vast majority would direct translating ribosomes to premature termination codons, suggesting that PRF could be used to post-transcriptionally regulate gene expression through the nonsense-mediated mRNA decay pathway (NMD).

MATERIALS AND METHODS

Hardware and software

All software was compiled and run on one or more of the following systems: Dell Precision 620, 2× PIII XEON 866 MHz running Mandrake Linux 10.×; Apple Power Macintosh, 2× G4 1.4 GHz PowerPC running OS X Tiger; SGI Cluster 64× MIPS R14K 600 MHz running Irix 6.5; SGI Altix 3000, 64× 1.5 GHz Itanium II running Linux-64. Supercomputing resources were made available courtesy of The National Cancer Institute's Advanced Biomedical Computing Center in Frederick, MD (ABCC, Author Webpage ). Unless otherwise noted, data mining and analysis was carried out using scripts written in PERL ( Author Webpage ), each of which are available on request. In all cases, data is stored in a mySQL 4.× relational database ( Author Webpage ) referred to as the PRFdb ( Author Webpage ).

Pattern matching

RNAMotif ( 22 ) was utilized for finding subsequences in the coding regions of S.cerevisiae that serve as potential translational frameshift signals. The descriptor of the putative programmed frameshift signal motif was created ad hoc from analysis of 56 known viral −1 PRF signals from the RECODE ( 23 ) database. The RNAMotif descriptor had the following requirements: (i) define slippery sites as using the IUPAC nomenclature ‘N NNW WWH’, where: (A) spaces indicate zero-frame codon boundaries, (B) N NN represents any three identical nucleotides, (C) WWW represents AAA or UUU, and (D) H ≠ G; (ii) allow any sequence between 0 and 12 nt in length to serve as the spacer between the slippery site and the pseudoknot; (iii) allow G:U base pairing in pseudoknot stems; (iv) each stem in the pseudoknot must be between 4 and 20 nt in length; (v) the first loop must be between 1 and 3 nt in length; (vi) the second loop is optional; (vii) the third loop must be at least as long as one-half the length of the first stem and no longer than 100 nt.

Genome randomization

The complete coding sequence (CDS) of S.cerevisiae was randomized using seven different methods for sequence randomization. Each method of randomization was conducted such that each genome had the same number of open reading frames (ORFs) and identical ORF lengths of the natural S.cerevisiae genome. In addition, each random genome was generated in such a way such that stop codons were only present in the terminal 3′ position (i.e. no in-frame termination codons). Beyond these similarities, the seven methods for randomization included: NoBias, randomized ORFs with unbiased nucleotide bias; ntShuffle, nucleotides from each natural ORF are shuffled by triplicate mononucleotide permutations; ntBias, randomized ORFs using the CDS single-nucleotide frequency; cdnShuffle, codons from each natural ORF are shuffled by triplicate monocodon permutation; SilentBias, a silent bias where the codons are randomized in place so as to maintain protein coding sequence; cdnBias, randomized ORFs using the observed CDS codon usage bias; diNuc, randomized ORFs generated using the observed CDS dinucleotide frequency. A total of 100 randomized replicate genomes were generated for each of the seven methods (700 random genomes total, ∼6.5 billion nucleotides). As was done for the natural S.cerevisiae genome, RNAMotif was used to search these randomized genomes.

Secondary structure prediction

Pknots ( 24 ) was used to predict the minimum free energy (MFE) ‘fold’ of each motif hit identified by RNAMotif. Each motif hit identified by RNAMotif was folded by pknots and assigned a predicted MFE value (MFE in kcal/mol) and a predicted secondary structure.

Randomization and statistical analysis

Each folded motif hit was randomly shuffled and refolded 100 times using pknots (with pseudoknot folding disabled), producing a distribution of random MFEs specific for each of the motif hits. Distributions of random MFE values using pknots with pseudoknots folding disabled were in general not statistically different from those generated using pknots with this option enabled (data not shown), but had considerably shorter generation time, identical energy parameters, and could be run on the same computing platform. Motif hits were then compared to the resulting distribution and assigned a normalized z -score:

where X is the predicted MFE value for each sequence, x¯ is the estimate of the mean for the distribution of MFE values obtained from 100 randomizations, and σ is the SD of random structure MFE values. The normalized value of z R ( z -random) obtained provides an estimate of the statistical significance and uniqueness of the predicted structure for the natural sequence: i.e. is the sequence more or less stable than expected by chance ( 25–32 ).

Genetic methods and plasmid construction

Escherichia coli strain DH5 α was used to amplify plasmids, and E.coli transformations were performed using the high efficiency method of Inoue et al . ( 33 ). YPAD and synthetic complete medium (H − ) were used as described previously ( 34 ). Isogenic ResGen yeast strains (Invitrogen, Carlsbad, CA) derived from BY4742 (JD1158: MAT α his3 Δ 1 leu2 Δ 0 lys2 Δ 0 ura3 Δ 0 and JD1181: MAT α upf3::Kan R his3 Δ 1 leu2 Δ 0 lys2 Δ 0 ura3 Δ 0 ) were used for in vivo measurement of programmed −1 ribosomal frameshifting. All yeast cells were transformed using the alkali cation method ( 35 ). Dual luciferase plasmids pJD375 and pJD376 have been described previously ( 36 ).

Computationally identified putative −1 PRF signals derived from BUB3 , CTS2 , EST2 , FKS1 , FLR1 , NUP82 , PPR1 , SPR6 and TBF1 were designed with the appropriate restriction sites on the 5′ and 3′ ends (Supplementary Table 1). Furthermore, naturally occurring termination codons were eliminated from the −1 reading frame by shortening the spacer region between slippery site and the putative downstream stimulatory structure by a 1 nt. PAGE purified oligonucleotides (Integrated DNA Technology, Coralville, IA) corresponding to each −1 PRF signal were annealed and gel purified. pJD375, the ‘zero-frame’ control dual-luciferase frameshift reporter (DLR) plasmid, was used as a vector backbone and each putative PRF signal was cloned into unique SalI and BamHI restriction sites located in the multiple cloning site (MCS) between the Renilla and firefly luciferase orfs. The resulting new PRF-reporter vectors were verified by DNA sequencing (Macrogen, Seoul, Korea). In vivo Dual-Luciferase ® Reporter Assays (Promega Corporation, Madison, WI) for programmed −1 ribosomal frameshifting were performed as described previously in yeast strain JD1158 ( 36 ). Luminescence readings were obtained using a Turner Designs TD20/20™ Luminometer (Sunnyvale, CA). A minimum of 12 replicate assays were carried out for each candidate −1 PRF signal. Statistical analyses of each luciferase dataset followed an established protocol aimed at identifying outliers, validating the statistical assumptions of sample size and distribution, and for the accurate comparison of multiple bicistronic reporter assays ( 37 ).

RESULTS

Pattern matching with RNAMotif

The main differences between this study and the previous work by Hammell et al . are (i) the availability of a completely annotated yeast genome, (ii) significantly more powerful computational resources, (iii) application of more sophisticated statistical analyses and (iv) a different parameter was employed for the −1 PRF motif. RNAMotif ( 22 ) was exploited, and an appropriate albeit somewhat relaxed, ‘descriptor’ of known viral −1 PRF signals was developed (see Materials and Methods) by analysis of a database of experimentally confirmed recoding signals ( 23 ) as the first step toward computationally identifying putative −1 PRF signals. The results of this pattern matching approach identified 10 340 slippery sites in the 6353 annotated coding sequences (CDS) of the yeast genome, 6016 of which are followed by at least one pseudoknot motif. In total, RNAMotif identified 173 452 sequence windows that matched the specified parameters (many are partly overlapping).

Whole genome randomization

To determine the statistical significance of these results, they were compared to what would be expected by chance. One method of identifying statistically significant motifs in nucleic acid sequences is to repeat the initial motif search using a large set of randomized sequences. The frequency of finding the motif in randomized sequences can provide some insight into the likelihood that a match in a natural sequence occurs by chance. In this report, a conservative approach was applied by randomizing the whole yeast CDS genome using seven different strategies so as to not introduce bias due to choice of any one method. All of the randomized genomes contained the same number of ORFs (rORF) as the natural yeast genome and the same number of total nucleotides in the CDS sequence space. Furthermore, rORFs with in-frame premature termination codons were discarded and randomly re-generated until full length read-through sequences were obtained. The results ( Table 1 ) show that the actual number of motif hits found is statistically different when compared to any of the seven randomized datasets; suggesting that the prevalence of −1 PRF signals may be under multiple selective pressures.

Table 1

The yeast genome has a significant number of putative programmed −1 ribosomal frameshift signals compared to randomized genomes created using any one of seven different randomization strategies

RNAMotif	SD	P -value
S.cerevisiae	6016	—	—
noBias	3044	64.07	<0.01
nShuffle	4567	70.84	<0.01
nBias	4660	65.89	<0.01
cShuffle	6551	85.13	0.02
sBias	6580	82.13	0.02
cBias	6639	86.52	0.02
dnBias	6774	88.16	0.01

RNAMotif	SD	P -value
S.cerevisiae	6016	—	—
noBias	3044	64.07	<0.01
nShuffle	4567	70.84	<0.01
nBias	4660	65.89	<0.01
cShuffle	6551	85.13	0.02
sBias	6580	82.13	0.02
cBias	6639	86.52	0.02
dnBias	6774	88.16	0.01

RNAMotif, the number of motif hits using our descriptor of functional −1 PRF signals ( 22 ); stdev, standard deviation of for each randomization strategy. Seven methods for randomization were used. These were: NoBias, randomized ORFs with unbiased nucleotide bias; ntShuffle, nucleotides from each natural ORF are shuffled by triplicate mononucleotide permutations; ntBias, randomized ORFs using the CDS single-nucleotide frequency; cdnShuffle, codons from each natural ORF are shuffled by triplicate monocodon permutation; SilentBias, a silent bias where the codons are randomized in place so as to maintain protein coding sequence; cdnBias, randomized ORFs using the observed CDS codon usage bias; diNuc, randomized ORFs generated using the observed CDS dinucleotide frequency.

Table 1

The yeast genome has a significant number of putative programmed −1 ribosomal frameshift signals compared to randomized genomes created using any one of seven different randomization strategies

RNAMotif	SD	P -value
S.cerevisiae	6016	—	—
noBias	3044	64.07	<0.01
nShuffle	4567	70.84	<0.01
nBias	4660	65.89	<0.01
cShuffle	6551	85.13	0.02
sBias	6580	82.13	0.02
cBias	6639	86.52	0.02
dnBias	6774	88.16	0.01

RNAMotif	SD	P -value
S.cerevisiae	6016	—	—
noBias	3044	64.07	<0.01
nShuffle	4567	70.84	<0.01
nBias	4660	65.89	<0.01
cShuffle	6551	85.13	0.02
sBias	6580	82.13	0.02
cBias	6639	86.52	0.02
dnBias	6774	88.16	0.01

Each of the randomization types designed to mimic the natural CDS of yeast (cShuffle, sBias, cBias and dnBias; see Materials and Methods), retained the information content of the yeast genome. These randomization strategies generated genomes that harbored more −1 PRF signals than are actually found in the natural genome. This suggests a selective pressure against the random acquisition of functional −1 PRF signals in yeast; i.e. the yeast genome would be expected to have more −1 PRF signals than were actually observed. This is consistent with the notion that −1 PRF signals can lead to aberrant translation and (most likely) dysfunctional proteins. In contrast, randomization strategies that seek to mimic the overall genome-wide or individual CDS nucleotide bias (nBias and nShuffle, respectively) produce random genomes with significantly fewer −1 PRF signals than are actually observed. If there were strong and genome-wide evolutionary pressures against the presence of any −1 PRF signals, then they would be expected to be significantly underrepresented in the yeast genome and statistically indistinguishable from the nShuffle and nBias randomization datasets. However, this is not the case. This set of comparisons suggests that there may be evolutionary pressure for the maintenance of certain classes of existing frameshift signals. In addition, randomized genomes using an unbiased nucleotide frequency (noBias) were generated as a negative control. These random genomes contained far fewer −1 PRF signals than observed for the actual yeast genome and far less than any of the other randomization strategies. In sum, the number of slippery sites followed by at least one pseudoknot motif (6016) present in the actual yeast genome is highly statistically significant ( P ≤ 0.02) when compared to the number of expected −1 PRF signals for all of the randomization strategies employed ( Table 1 ). Therefore, although unexpectedly large, this analysis suggests that at least some of the predicted −1 PRF signals have been functionally selected for.

Secondary structure prediction

The next step was to assign additional layers of predictive metrics to the dataset so as to enhance the ability to identify functional −1 PRF signals for empirical testing. The first step was to assign a MFE value to each motif hit identified by RNAMotif. This was not a trivial task since nearly all known −1 PRF signals require an mRNA pseudoknot ( 7 ) and pseudoknot prediction represents a well known and computationally difficult problem ( 38 ). However, pknots ( 24 ), an algorithmic extension of mfold ( 39 ) is capable of predicting H-type pseudoknots of the type that are generally found associated with functional −1 PRF signals. Coupled with a set of scripts written in PERL ( 40 ), pknots was able to fold every sequence window identified as a potential motif hit by RNAMotif (173 452 sequence windows) in ∼5000 CPU h. (∼5 months) using the computational resources available at the National Cancer Institute's Advanced Biomedical Computing Center in Frederick, MD. Once the initial folding was completed, the dataset was then reduced to a structurally non-redundant dataset of 66 842 structures. The nearly 3-fold reduction in the data was possible due to the huge number of overlapping motif hits initially made by RNAMotif. These analyses provide each non-redundant RNAMotif match with a predicted RNA secondary structure and MFE value. The overall distribution of all MFE values determined by pknots for the most stable predicted secondary structures (lowest kcal/mol) for each structure 3′ from the slippery motif is shown in Supplementary Figure 1A and fits a normal distribution. The distribution of base pair counts for each structure fits an extreme-value distribution and is shown in Supplementary Figure 1B. The feature correlations and summary statistics of these 10 340 predicted structures are shown in Supplementary Tables 2A and B, respectively.

Statistical significance of predicted MFE values

To identify statistically significant motif hits, z -scores ( z R ) were calculated for each predicted RNA secondary structure folded by pknots (see Materials and Methods). For each candidate signal, the MFE value of the predicted structure was compared to the distribution of MFE values obtained from 100 permutations (mononucleotide shuffles) of the same sequence using an implementation in PERL of a similar algorithm previously described ( 28 ). The randomization approach disrupts the nucleotide base order and any potential secondary structure for each input sequence but preserves the exact mononucleotide count of each base within the shuffling window. Significance scores derived from permutation shuffling approaches such as this have previously been successful in finding biologically meaningful RNA structures from primary sequence data both by ourselves ( 41 ) and several other research groups ( 25 , 26 , 28 , 42 ). Furthermore, it is expected that this measure of significance is sufficient since functional secondary structures in mRNA sequences are considered more stable than random sequence and are under selective pressure ( 29 , 31 , 43 , 44 ). It should be noted, however, that several reports have indicated that this randomization strategy is not accurate for estimating the significance of RNA secondary structures in-general and that a superior method of randomization lies in preserving both mono- and dinucleotide ratios ( 32 , 45–47 ). Nonetheless, for the purposes of this study the randomization strategy employed for the calculation of z R was adequate. For this dataset, the randomization step was limited to 100 permutations per sequence due to the sheer number of input sequences that required z R scores. This reasonably estimated a normal distribution of MFE values for each input sequence and a probability plot correlation coefficient (PPCC) goodness-of-fit test ( 48 ) was carried out for each distribution to statistically verify each estimation of a normal distribution. A PPCC ≥ 0.98 was found for greater than 99% of all the candidate signals in the database indicating that 100 random shuffles was sufficient for good estimates of z R (data not shown).

Any z R ≤ −1.65 indicates a structure that is more stable than expected ( P < 0.05). The distribution of z R scores for all candidate PRF signals fits a normal distribution (Supplementary Figure 1C, PPCC = 0.98). A total of 3228 candidate signals (out of 66 842 non-redundant structures) include putative structures that meet or exceed the criteria for significance, having z R scores in the range of z R = [−7.10,−1.65]. These significant motif hits are distributed among 2025 ORFs. A total of 1203 individual slippery sites in 751 ORFs are found to have more than one significant structure immediately downstream. Each of these statistically significant structures (and the associated 5′ slippery sites) are considered candidate −1 PRF signals (cPRF) open for further investigation.

An interesting finding from this analysis is that statistically significant motif hits do not necessarily have low MFE values; a result that was previously shown to be true for structural RNAs in general ( 25 , 28 , 29 ). We therefore sought to filter the list of putative −1 PRF signals further by comparing z R scores and MFE values, which are only weakly correlated features in the database (Correlation = 0.53, Supplementary Table 3A), similar to an approach previously employed ( 27 ). Energetically strong candidates with statistically significant predicted secondary structures are considered strong-candidate −1 PRF signals ( Figure 2 ). From this analysis, 1706 strong candidate signals were identified with significant z R ≤ −1.65 and whose MFE values are in the lowest 25% (MFE ≤ 17.3 kcal/mol). These strong candidate signals are distributed among 1275 individual ORFs, where 320 ORFs have two or more strong signals.

Figure 2

Scatterplot of MFE values (predicted using pknots, ( 24 ) versus z R scores for 10 340 candidate −1 PRF signals demonstrates the weak correlation between these two feature statistics (see Supplementary Table 2B). The red diamonds and associated labels indicate the location and parental gene of nine sequences empirically tested for frameshifting. The hypothetical distributions were created using summary statistics from Supplementary Table 2B.

Selection of candidate signals for empirical analyses

Computationally generated information is best when empirically tested. To this end, nine candidate signals possessing a wide range of feature statistics were selected for empirical testing. First and foremost, −1 PRF signals were selected from genes having scorable phenotypes when under- or over-expressed. Second, eight of the nine candidate signals chosen are predicted to fold into a pseudoknot, the exception being the signal chosen from FKS1 . Third, not all the selected signals should fully meet the criteria outlined above for strong candidate signals. For example, the two signals from FLR1 and SPR6 met all of the criteria for strong-candidate −1 PRF signals having z R ≤ −1.65 and predicted MFE values in the lowest 25% of all structures in the PRFdb. The signals from CTS2 , EST2 , NUP82 and TBF1 meet less stringent criteria in that, although they are not in the first quartile of the most stable structures, they nonetheless are considered significant with z R ≤ −1.65. Candidate signals from BUB3 and PPR1 were chosen because they specifically do not meet any of the criteria above. The predicted slippery sites and associated secondary structures are shown in Figure 3 . The feature statistics of each candidate signal are summarized in Supplementary Table 3.

Nine examples of candidate −1 PRF signals chosen to generally represent the diversity of features present in the PRFdb. Gene names are shown with RNA sequence and corresponding CDS nucleotide start and stop locations. The predicted structure is shown for each empirically tested candidate signal. See Supplementary Table 3 for statistical data.

Figure 3

Testing for frameshifting

Each of the nine candidate −1 PRF signals were cloned into pJD375, a dual-luciferase frameshift reporter plasmid (DLR), and their abilities to promote −1 PRF was measured after transformation into a wild-type yeast strain (JD1158) as described previously ( 36 ). Briefly, the ratio of firefly to Renilla luciferase expression promoted by −1 PRF signal containing reporters is normalized to a ‘zero-frame’ control reporter (pJD375), and these ratios are statistically tested for normalcy, sample size and significance as described previously ( 37 ). At least ten replicate experiments were carried out for each reporter. The results demonstrate that every signal containing a predicted mRNA pseudoknot promoted −1 PRF at levels that significantly exceeded non-programmed (or background) frameshifting ( Figure 4A and B ). In contrast, the sequence derived from FKS1 , which is not predicted to contain a pseudoknot, did not promote measurable frameshifting. In a broad sense, the experimental data divides the signals into high-, medium- and low-efficiency −1 PRF signals. The signals cloned from CTS2 , EST2 and PPR1 promoted −1 PRF at ∼64, 56 and 43%, respectively ( Figure 4A ). Although these rates of −1 PRF are extremely high, concurrent studies in our laboratory show levels of −1 PRF as high as 91% in a variant of the SARS-CoV −1 PRF signal (E. P. Plant and J. D. Dinman, unpublished data). The signal cloned from TBF1 falls in the middle-range (5.2%), while the three remaining functional signals promoted −1 PRF at levels between 0.4–0.9% ( Figure 4B ). For purposes of comparison, the well-characterized −1 PRF signal from the yeast L-A virus promoted 9.1% frameshifting, while values for out-of-frame controls are <0.02%. It is important to note that in order to measure frameshifting it was necessary to re-engineer the frameshift signals to bypass −1 frame encoded termination codons present downstream of the slippery sites, and that these were all located in the mRNA pseudoknots. Mutating these to sense codons would cause unknown changes to pseudoknot structures, which would uncontrollably affect frameshifting. In contrast, although deleting delete one base in the spacer region should change −1 PRF efficiencies (usually downward, see Ref. ( 49 ), such changes should not completely abrogate frameshifting, nor should they create functional −1 PRF signals de novo . Thus, although the values presented here cannot be taken as absolute, the important issue is qualitative: these mRNA elements can indeed promote efficient frameshifting.

Figure 4

Measurement of −1 PRF efficiency for nine candidate signals. ( A ) High-efficiency frameshifting including the frameshift signal from the endogenous yeast L-A virus. ( B ) Medium- and low-efficiency frameshifting including the sequence from the FKS1 gene that did not promote −1 PRF above background levels. The parental genes of each candidate signal are indicated with the percentage of −1 PRF efficiency as was measured using a dual-luciferase reporter assay system ( 36 , 37 ).

The PRFdb and data availability

The PRFdb ( Author Webpage ) is a publicly available database that stores the results of the bioinformatics data presented in this report. This online resource allows interested readers to search for and analyze candidate −1 PRF signals in the genome of S.cerevisiae . Finally, the PRFdb contains a searchable list of strong candidate −1 PRF signals that may warrant further empirical investigations.

DISCUSSION

Programmed ribosomal frameshifting was first identified as a translational phenomenon in the Rous sarcoma virus over two decades ago ( 50 ). Since then, it has been shown to be a general mechanism of gene regulation utilized by a wide variety of RNA viruses ( 1–4 ). Frameshifting has also been demonstrated to be functionally important for the expression of a growing list of prokaryotic ( 51–53 ), archaeal ( 54 ) and eukaryotic genes ( 13 , 14 ). Thus, it is becoming increasingly apparent that PRF is a fundamental mechanism of post-transcriptional gene regulation and is present in every branch of the tree of life. The need to identify PRF signals in higher organisms has grown in importance as we have become more aware of their prevalence. In response, there have been numerous computational studies aimed at identifying PRF signals ( 17–21 , 55–57 ). Each study has met with varying degrees of success, but empirical testing of predicted PRF signals suggest that there are indeed functional, and previously unannotated, PRF signals in a variety of contexts within the coding regions of genes derived from higher organisms. With the exception of the earliest study by Hammell et al . all of these studies have focused on recoding in the ‘viral-context’: i.e. they were aimed towards finding PRF signals predicted to direct ribosomes into a new reading frame so as to produce functional alternative C-terminal extensions of the native proteins. The study by Hammell et al . was context neutral, focusing instead on searching for mRNA motifs that resembled known viral −1 PRF signals.

The findings presented by this study suggest that PRF signals can function efficiently in a number of different ways. For example, while sequences that are predicted to fold into strong, statistically significant, pseudoknotted mRNA structures serve as efficient stimulators of −1 PRF (e.g. signals from CTS2 and EST2 ), the presence of multiple overlapping slippery sites can also have an equally strong effect, even if the stimulatory structure is not ideal (e.g. signals from BUB3 , PPR1 and NUP82 ). Most importantly, it appears that the presence of a pseudoknot as the ‘most stable’ structure following a slippery site is critical, providing further support for the ‘torsional restraint’ model of −1 PRF ( 8 ). This is further evidenced by the fact that the very energetically favorable and highly significant structure derived from FKS1 failed to promote detectable −1 PRF in vivo . This last point may be because there is no predicted pseudoknot structure with an MFE value lower than that of the predicted non-pseudoknotted structure immediately following the slippery site of interest in FKS1 ( Figure 3 ).

The PRFdb was constructed to serve as a repository for all the predicted structures, slippery sites and statistical data in this study. This database is accessible via the Internet at Author Webpage . Currently, visitors are limited to searching the existing data for putative −1 PRF signals in the yeast genome. It is expected, however, that this service will be expanded in the future to include the genomes of seven additional budding yeast species in addition to S.cerevisiae , the human genome, and several other ‘model system’ genomes.

While the current study revisits the original question posed by Hammell et al . (i.e. how often are functional −1 PRF signals present in the yeast genome?), it also asks an important second question: are genome encoded −1 PRF signals capable of promoting −1 PRF, and if so, how might this affect the expression of the mRNAs encoding them? Analysis of the PRFdb from the perspective of alternative recoding reveals that greater than 99% of the expected outcomes of −1 PRF would result in premature termination ( Figure 5A ). The prevalence of out-of-frame termination signals is not unexpected since the average distance a ribosome can continue elongation in an alternative reading frame is ∼6 codons in either the +1 or −1 frame for all CDS in yeast (data not shown). Only 10 −1 PRF signals out of 10 340 potential slippery sites are predicted to bypass the normal zero-frame termination codon and encode an alternative C-terminal extension (i.e. −1 PRF in the viral context). However, BLAST analyses ( 58 ) revealed that none of these extensions are predicted to encode functional alternative protein domains (data not shown). This suggests that although potential −1 PRF signals are widespread in the yeast genome, they are almost uniformly predicted to direct ribosomes to a premature termination signals. We have previously shown that such PRF events are sufficient to target transcripts to the nonsense-mediated mRNA decay pathway for rapid degradation ( 59 ). Thus, we suggest that −1 PRF may be used to post-transcriptionally regulate gene expression of cellular encoded mRNAs. In support of this notion, natural mRNA substrates for NMD that do not contain in-frame PTCs have been discovered in both the yeast and human transcriptomes ( 60–67 ), suggesting that regulation of mRNA expression by NMD is broadly conserved and is used to regulate a variety of physiological processes. If this is true, then these mRNAs should be well represented in DNA microarray databases identifying mRNAs that are stabilized when this pathway has been abrogated, e.g. upf1 Δ upf2 Δ, or upf3 Δ ( 60 ). Consistent with this hypothesis, the overlap between mRNAs stabilized in these genetic backgrounds and ORFs containing high probability −1 PRF signals is 38.8% (278 of 717 genes) ( Figure 5B ). Also implicit in this model is that modulation of PRF efficiency could be used as the controlling effector of transcript degradation rates. This model could be applied to a variety of biological examples where the stability of individual mRNAs or whole classes of mRNAs require a flexible stability threshold responsive to environmental cues.

Figure 5

( A ) The CDS of S.cerevisiae is not prone to lengthy out-of-frame translation. The relative positions of candidate −1 PRF signals from the start codon of each ORF compared to the expected overall change in peptide length if a frameshifting event were to occur. ( B ) Fraction of ORFs containing high probability −1 PRF signals represented as mRNAs stabilized in strains deficient in NMD ( upf1 Δ, upf2 Δ or upf3 Δ) ( 60 ), No-go decay ( xrn1 Δ or dcp1 Δ) ( 60 ), or having half-lives less than the yeast transcriptome average (t 1/2 ) ( 70 ).

Independent of NMD, It has also been suggested that specific sequences present in coding regions of mRNAs are capable of translationally stalling ribosomes long enough to direct them to be endonucleolytically cleaved and specifically degraded in both prokaryotic and eukaryotic organisms ( 68 , 69 ), a process termed ‘No-go decay’. Many of the predicted candidate −1 PRF signals identified in the current work are predicted to be more stable than the mRNA structure used by Doma & Parker (data not shown) and thus would be expected to be similarly capable to stall translating ribosomes. Thus, it is also possible that mRNAs containing these extremely stable mRNA structures identified in the current study may also be targeted for No-go decay. If this is true, then these mRNAs should be stabilized in cells in which this pathway has been abrogated, e.g. xrn1 Δ or dcp1 Δ ( 60 ). Figure 5B shows the overlap between mRNAs stabilized in these genetic backgrounds and genes containing high probability −1 PRF signals is 50.5% (362 of 717 genes), consistent with this hypothesis. Furthermore, regardless of the mRNA destabilization pathway, −1 PRF signal containing mRNAs should be inherently less stable than those that do not contain this motif. Comparison with the yeast transcriptome half-life microarray database ( 70 ) reveals that 60.8% (327 of 538) of the mRNAs containing high probability −1 PRF signals have half-lives less than the transcriptome average (<26 min) ( Figure 5B ). Conversely, it is possible that the presence of a slippery site just upstream from strong secondary structures may be the specific feature that allows for such mRNAs to evade being subject to this pathway ( 7 ). In conclusion, it is reasonable to envision that a general function of PRF signals in the coding regions of eukaryotic mRNAs is to act as post-transcriptional capacitors of gene expression.

AUTHOR CONTRIBUTIONS

J. L. Jacobs and J. D. Dinman conceived and designed experiments. J. L. Jacobs, A. T. Belew and R. Rakauskaite performed experiments and contributed reagents/materials. J. L. Jacobs and A. T. Belew wrote source code, developed the PRFdb and analyzed the computational and empirical data. J. L. Jacobs and J. D. Dinman wrote the paper.

ACCESSION NUMBERS

The primary SDG accession numbers of genes from which the entire CDS or a subssequence of the CDS was used in this study are: BUB3 (#S000005552), CTS2 (#S000002779), EST2 (#S000004310), FLR1 (#S000000212), FKS1 (#S000004334), NUP82 (#S000003597), PPR1 (#S000004004), SPR6 (#S000000917), TBF1 (#S000006049).

This work was supported by a fellowship to J.L.J. from the National Libraries of Medicine (F37 LM8333), grants to J.D.D. from the National Institutes of Health (R05 GM58859 and R21 GM068123) and the National Science Foundation (MCB-0084559). A.T.B. was partially supported by funds from the National Institutes of Health (T32 AI51967). The authors wish to thank Kristi Muldoon-Jacobs, Ewan Plant and John Russ for advice, suggestions and helpful reviewing of the manuscript. The authors also want to acknowledge the helpful technical support and computing resources from the National Cancer Institute's Advanced Biomedical Computing Center in Frederick, MD. Funding to pay the Open Access publication charges for this article was provided by a grant to J.D.D. from the National Institutes of Health (GM58859).

Conflict of interest statement. None declared.

REFERENCES

Recoding: translational bifurcations in gene expression

Gene

2002

, vol.

286

(pg.

187

201

)

Recoding in archaea

Mol. Microbiol.

2005

, vol.

(pg.

339

348

)

An ‘integrated model’ of programmed ribosomal frameshifting and post-transcriptional surveillance

Trends Biochem. Sci.

2002

, vol.

(pg.

448

454

)

Reprogrammed genetic decoding in cellular gene expression

Mol. Cell

2004

, vol.

(pg.

157

168

)

Ribosomal pausing at a frameshifter RNA pseudoknot is sensitive to reading phase but shows little correlation with frameshift efficiency

Mol. Cell. Biol.

2001

, vol.

(pg.

8657

8670

)

Kinetics of ribosomal pausing during programmed −1 translational frameshifting

Mol. Cell. Biol.

2000

, vol.

(pg.

1095

1103

)

The 9-angstrom solution: How mRNA pseudoknots promote efficient programmed −1 ribosomal frameshifting

RNA

2003

, vol.

(pg.

168

174

)

Torsional restraint: a new twist on frameshifting pseudoknots

Nucleic Acids Res.

2005

, vol.

(pg.

1825

1833

)

The frameshift stimulatory signal of human immunodeficiency virus type 1 group O is a pseudoknot

J. Mol. Biol.

2003

, vol.

331

(pg.

571

583

)

Regulated ribosomal frameshifting by an RNA-protein interaction

RNA

1996

, vol.

(pg.

316

323

)

Characterization of the frameshift signal of Edr, a mammalian example of programmed −1 ribosomal frameshifting

Nucleic Acids Res.

2005

, vol.

(pg.

1553

1563

)

Reading two bases twice: mammalian antizyme frameshifting in yeast

EMBO J.

1996

, vol.

(pg.

1360

1370

)

Identification and characterisation of a developmentally regulated mammalian gene that utilises −1 programmed ribosomal frameshifting

Nucleic Acids Res.

2001

, vol.

(pg.

4079

4088

)

A functional −1 ribosomal frameshift signal in the human paraneoplastic Ma3 gene

J. Biol. Chem.

2006

, vol.

281

(pg.

7082

7088

)

Programmed translational frameshifting in a gene required for yeast telomere replication

Curr. Biol.

1997

, vol.

(pg.

969

976

)

Upstream open reading frames as regulators of mRNA translation

Mol. Cell. Biol.

2000

, vol.

(pg.

8635

8642

)

Translational recoding signals between gag and pol in diverse LTR retrotransposons

RNA

2003

, vol.

(pg.

1422

1430

)

Sequences that direct significant levels of frameshifting are frequent in coding regions of Escherichia coli

EMBO J.

2003

, vol.

(pg.

5941

5950

)

Identification of putative programmed −1 ribosomal frameshift signals in large DNA databases

Genome Res.

1999

, vol.

(pg.

417

427

)

Predicting genes expressed via −1 and +1 frameshifts

Nucleic Acids Res.

2004

, vol.

(pg.

4884

4892

)

Computational identification of putative programmed translational frameshift sites

Bioinformatics

2002

, vol.

(pg.

1046

1053

)

RNAMotif, an RNA secondary structure definition and search algorithm

Nucleic Acids Res.

2001

, vol.

(pg.

4724

4735

)

Recode 2003

Nucleic Acids Res.

2003

, vol.

(pg.

)

A dynamic programming algorithm for RNA structure prediction including pseudoknots

J. Mol. Biol.

1999

, vol.

285

(pg.

2053

2068

)

Thermodynamic stability and statistical significance of potential stem–loop structures situated at the frameshift sites of retroviruses

Nucleic Acids Res.

1989

, vol.

(pg.

6143

6152

)

Local thermodynamic stability scores are well represented by a non-central student's t distribution

J. Theor. Biol.

2001

, vol.

210

(pg.

411

423

)

RNA molecules with structure dependent functions are uniquely folded

Nucleic Acids Res.

2002

, vol.

(pg.

3574

3582

)

mRNAs have greater negative folding free energies than shuffled or codon choice randomized sequences

Nucleic Acids Res.

1999

, vol.

(pg.

1578

1584

)

Estimating the contributions of selection and self-organization in RNA secondary structure

J. Mol. Evol.

1999

, vol.

(pg.

)

Thermodynamic and phylogenetic prediction of RNA secondary structures in the coding region of hepatitis C virus

RNA

2002

, vol.

(pg.

824

841

)

Evidence for selection on synonymous mutations affecting stability of mRNA secondary structure in mammals

Genome Biol.

2005

, vol.

pg.

R75

A comparison of RNA folding measures

BMC Bioinformatics

2005

, vol.

pg.

241

High efficiency transformation of Escherichia coli with plasmids

Gene

1990

, vol.

(pg.

)

A −1 ribosomal frameshift in a double-stranded RNA virus forms a Gag-pol fusion protein

Proc. Natl Acad. Sci. USA

1991

, vol.

(pg.

174

178

)

Transformation of intact yeast cells treated with alkali cations

J. Bacteriol.

1983

, vol.

153

(pg.

163

168

)

An in vivo dual-luciferase assay system for studying translational recoding in the yeast Saccharomyces cerevisiae

RNA

2003

, vol.

(pg.

1019

1024

)

Systematic analysis of bicistronic reporter assay data

Nucleic Acids Res.

2004

, vol.

(pg.

e160

e170

)

RNA pseudoknot prediction in energy-based models

J. Comput. Biol.

2000

, vol.

(pg.

409

427

)

Expanded sequence dependence of thermodynamic parameters improves prediction of RNA secondary structure

J. Mol. Biol.

1999

, vol.

288

(pg.

911

940

)

Programming Perl

2000

3rd edn.

Sebastopol, CA

O'Reilly & Associates

A Three-Stemmed mRNA Pseudoknot in the SARS Coronavirus Frameshift Signal

PLoS. Biol.

2005

, vol.

(pg.

1012

1023

)

Pseudoknots in prion protein mRNAs confirmed by comparative sequence analysis and pattern searching

Nucleic Acids Res.

2001

, vol.

(pg.

753

758

)

Widespread selection for local RNA secondary structure in coding regions of bacterial genes

Genome Res.

2003

, vol.

(pg.

2042

2051

)

Folding free energies of 5′-UTRs impact post-transcriptional regulation on a genomic scale in yeast

PLoS. Comput. Biol.

2005

, vol.

pg.

e72

No evidence that mRNAs have lower folding free energies than random sequences with the same dinucleotide distribution

Nucleic Acids Res.

1999

, vol.

(pg.

4816

4822

)

The language of RNA: a formal grammar that includes pseudoknots

Bioinformatics

2000

, vol.

(pg.

334

340

)

Structural RNA has lower folding energy than random RNA of the same dinucleotide frequency

RNA

2005

, vol.

(pg.

578

591

)

Probability plot correlation coefficient test for normality

Technometrics

1975

, vol.

(pg.

111

117

)

Mutational analysis of the ‘slippery-sequence’ component of a coronavirus ribosomal frameshifting signal

J. Mol. Biol.

1992

, vol.

227

(pg.

463

479

)

Expression of the Rous Sarcoma Virus pol gene by ribosomal frameshifting

Science

1985

, vol.

230

(pg.

1237

1242

)

Translational frameshifting generates the gamma subunit of DNA polymerase III holoenzyme

Proc. Natl Acad. Sci. USA

1990

, vol.

(pg.

2516

2520

)

Programmed ribosomal frameshifting generates the Escherichia coli DNA polymerase III gamma subunit from within the tau subunit reading frame

Nucleic Acids Res.

1990

, vol.

(pg.

1725

1729

)

Frameshifting is required for production of the transposase encoded by insertion sequence 1

Proc. Natl Acad. Sci. USA

1989

, vol.

(pg.

4609

4613

)

Identification of an archaeal alpha-L-fucosidase encoded by an interrupted gene. Production of a functional enzyme by mutations mimicking programmed −1 frameshifting

J. Biol. Chem.

2003

, vol.

278

(pg.

14622

14631

)

Predicting genes expressed via −1 and +1 frameshifts

Nucleic Acids Res.

2004

, vol.

(pg.

4884

4892

)

Identification of stop codon readthrough genes in Saccharomyces cerevisiae

Nucleic Acids Res.

2003

, vol.

(pg.

2289

2296

)

A small reservoir of disabled ORFs in the yeast genome and its implications for the dynamics of proteome evolution

J. Mol. Biol.

2002

, vol.

316

(pg.

409

419

)

Basic local alignment search tool

J. Mol. Biol.

1990

, vol.

215

(pg.

403

410

)

A programmed −1 ribosomal frameshift signal can function as a cis-acting mRNA destabilizing element

Nucleic Acids Res.

2004

, vol.

(pg.

784

790

)

Genome-wide analysis of mRNAs regulated by the nonsense-mediated and 5′ to 3′ mRNA decay pathways in yeast

Mol. Cell

2003

, vol.

(pg.

1439

1452

)

Yeast Upf proteins required for RNA surveillance affect global expression of the yeast transcriptome

Mol. Cell. Biol.

1999

, vol.

(pg.

6710

6719

)

hUPF2 silencing identifies physiologic substrates of mammalian nonsense-mediated mRNA decay

Mol. Cell. Biol.

2006

, vol.

(pg.

1272

1287

)

Nonsense surveillance regulates expression of diverse classes of mammalian transcripts and mutes genomic noise

Nature Genet.

2004

, vol.

(pg.

1073

1078

)

Quantitative microarray profiling provides evidence against widespread coupling of alternative splicing with nonsense-mediated mRNA decay to control gene expression

Genes Dev.

2006

, vol.

(pg.

153

158

)

Gene set coregulated by the Saccharomyces cerevisiae nonsense-mediated mRNA decay pathway

Eukaryot. Cell

2005

, vol.

(pg.

2066

2077

)

Selenium deficiency reduces the abundance of mRNA for Se-dependent glutathione peroxidase 1 by a UGA-dependent mechanism likely to be nonsense codon-mediated decay of cytoplasmic mRNA

Mol. Cell. Biol.

1998

, vol.

(pg.

2932

2939

)

Mammalian Staufen1 recruits Upf1 to specific mRNA 3′UTRs so as to elicit mRNA decay

Cell

2005

, vol.

120

(pg.

195

208

)

Endonucleolytic cleavage of eukaryotic mRNAs with stalls in translation elongation

Nature

2006

, vol.

440

(pg.

561

564

)

Nascent-peptide-mediated ribosome stalling at a stop codon induces mRNA cleavage resulting in nonstop mRNA that is recognized by tmRNA

RNA

2004

, vol.

(pg.

378

386

)

Genome-wide analysis of mRNA translation profiles in Saccharomyces cerevisiae

Proc. Natl Acad. Sci. USA

2003

, vol.

100

(pg.

3889

3894

)

Author notes

Present address: Jonathan L. Jacobs, Laboratory of Receptor Biology and Gene Expression, National Cancer Institute, Advanced Technology Center, Gaithersburg, MD 20877, USA

This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

Supplementary data

I agree to the terms and conditions. You must accept the terms and conditions.

Submit a comment

Name

Affiliations

Comment title

Comment

You have entered an invalid code

Thank you for submitting a comment on this article. Your comment will be reviewed and published at the journal's discretion. Please check for further notifications by email.

Citations

Views

Altmetric

Metrics

Total Views 1,130

739 Pageviews

391 PDF Downloads

Since 1/1/2017

Month:	Total Views:
January 2017	3
February 2017	7
March 2017	3
April 2017	5
May 2017	5
June 2017	4
July 2017	6
October 2017	4
November 2017	2
December 2017	9
January 2018	4
February 2018	7
March 2018	16
April 2018	4
May 2018	5
June 2018	2
July 2018	8
August 2018	9
September 2018	8
October 2018	5
November 2018	11
December 2018	7
January 2019	7
February 2019	3
March 2019	22
April 2019	16
May 2019	12
June 2019	15
July 2019	8
August 2019	10
September 2019	11
October 2019	15
November 2019	15
December 2019	14
January 2020	11
February 2020	11
March 2020	40
April 2020	16
May 2020	24
June 2020	6
July 2020	12
August 2020	14
September 2020	27
October 2020	12
November 2020	6
December 2020	10
January 2021	12
February 2021	20
March 2021	20
April 2021	6
May 2021	12
June 2021	13
July 2021	6
August 2021	9
September 2021	5
October 2021	11
November 2021	14
December 2021	12
January 2022	9
February 2022	37
March 2022	27
April 2022	52
May 2022	31
June 2022	34
July 2022	16
August 2022	28
September 2022	18
October 2022	8
November 2022	10
December 2022	9
January 2023	6
February 2023	13
March 2023	10
April 2023	38
May 2023	13
June 2023	7
July 2023	3
August 2023	3
September 2023	5
October 2023	4
November 2023	5
December 2023	12
January 2024	15
February 2024	15
March 2024	7
April 2024	5
May 2024	12
June 2024	11
July 2024	19
August 2024	14
September 2024	13
October 2024	10

Citations

59 Web of Science

Identification of functional, endogenous programmed −1 ribosomal frameshift signals in the genome of Saccharomyces cerevisiae (original) (raw)

Cite

Abstract

INTRODUCTION

MATERIALS AND METHODS

Hardware and software

Pattern matching

Genome randomization

Secondary structure prediction

Randomization and statistical analysis

Genetic methods and plasmid construction

RESULTS

Pattern matching with RNAMotif

Whole genome randomization

Secondary structure prediction

Statistical significance of predicted MFE values

Selection of candidate signals for empirical analyses

Testing for frameshifting

The PRFdb and data availability

DISCUSSION

AUTHOR CONTRIBUTIONS

ACCESSION NUMBERS

REFERENCES

Author notes

Supplementary data

Citations

Views

Altmetric

Email alerts

Citing articles via