A computational study of off-target effects of RNA interference - PubMed (original) (raw)

A computational study of off-target effects of RNA interference

Shibin Qiu et al. Nucleic Acids Res. 2005.

Abstract

RNA interference (RNAi) is an intracellular mechanism for post-transcriptional gene silencing that is frequently used to study gene function. RNAi is initiated by short interfering RNA (siRNA) of approximately 21 nt in length, either generated from the double-stranded RNA (dsRNA) by using the enzyme Dicer or introduced experimentally. Following association with an RNAi silencing complex, siRNA targets mRNA transcripts that have sequence identity for destruction. A phenotype resulting from this knockdown of expression may inform about the function of the targeted gene. However, 'off-target effects' compromise the specificity of RNAi if sequence identity between siRNA and random mRNA transcripts causes RNAi to knockdown expression of non-targeted genes. The complete off-target effects must be investigated systematically on each gene in a genome by adjusting a group of parameters, which is too expensive to conduct experimentally and motivates a study in silico. This computational study examined the potential for off-target effects of RNAi, employing the genome and transcriptome sequence data of Homo sapiens, Caenorhabditis elegans and Schizosaccharomyces pombe. The chance for RNAi off-target effects proved considerable, ranging from 5 to 80% for each of the organisms, when using as parameter the exact identity between any possible siRNA sequences (arbitrary length ranging from 17 to 28 nt) derived from a dsRNA (range 100-400 nt) representing the coding sequences of target genes and all other siRNAs within the genome. Remarkably, high-sequence specificity and low probability for off-target reactivity were optimally balanced for siRNA of 21 nt, the length observed mostly in vivo. The chance for off-target RNAi increased (although not always significantly) with greater length of the initial dsRNA sequence, inclusion into the analysis of available untranslated region sequences and allowing for mismatches between siRNA and target sequences. siRNA sequences from within 100 nt of the 5' termini of coding sequences had low chances for off-target reactivity. This may be owing to coding constraints for signal peptide-encoding regions of genes relative to regions that encode for mature proteins. Off-target distribution varied along the chromosomes of C.elegans, apparently owing to the use of more unique sequences in gene-dense regions. Finally, biological and thermodynamical descriptors of effective siRNA reduced the number of potential siRNAs compared with those identified by sequence identity alone, but off-target RNAi remained likely, with an off-target error rate of approximately 10%. These results also suggest a direction for future in vivo studies that could both help in calibrating true off-target rates in living organisms and also in contributing evidence toward the debate of whether siRNA efficacy is correlated with, or independent of, the target molecule. In summary, off-target effects present a real but not prohibitive concern that should be considered for RNAi experiments.

PubMed Disclaimer

Figures

Figure 1

Figure 1

Graphic depiction of variables tested computationally to investigate chance of off-target effect in each of the three organisms (H.sapiens, C.elegans and S.pombe). (A) General considerations: a target sequence (representing one particular expressed mRNA) is used as the source of dsRNA of which a pool of all possible siRNA is derived (mimicking the action of Dicer). Each sequence within the siRNA pool was compared for sequence identity (exact: m = 0; with mismatch: m > 0) to all possible siRNA sequences in the transcriptome through the feature map Φ(·) to determine chance of off-target errors. The parameters tested are as follows: (B) length of siRNA (n); (C) length of dsRNA (l); (D) addition of available UTR data in the target sequences (_u_3 and _u_5); and (E) position of the dsRNA along the target sequence (pos).

Figure 2

Figure 2

Effect of siRNA length and dsRNA length on off-target error rates under an exact match (m = 0) siRNA homology function. (A) H.sapiens and (B) C.elegans. (C) S.pombe, l = 100–400, n = 17–28. (D) Comparison of the three organisms for l = 300, n = 17–28 and m = 0. Within the length range tested, siRNA of 21 nt optimally combined target specificity and minimum length. Chance for off-target error increases significantly for smaller siRNAs, yet is not significantly different from longer siRNAs. Increased length of dsRNA yielded more diverse pools of siRNA, resulting in increased chance for off-target errors.

Figure 3

Figure 3

Effect of mismatch on RNAi off-target error rate in (A) C.elegans and (B) H.sapiens. Allowing for mismatches significantly increased the off-target error for siRNA of various lengths derived from CDSs of the transcriptome. Results shown were derived for l = 200, n = 17–35 and m = 0–9. Dashed curves indicate the positions of the planes of n = 21 and solid curves indicate the positions of the planes of m = 3 in the three-dimensional plots.

Figure 4

Figure 4

Effect of mismatch positions in H.sapiens, C.elegans and S.pombe. Mismatches in the region of 2–9 nt at the 5′ end of the guiding strand reduced off-target chances. Results shown were derived for n = 21, l = 100, m = 3 and mpos = 1–19.

Figure 5

Figure 5

Effect of including UTR sequences with the CDS data for RNAi off-target error in (A) H.sapiens and (B) C.elegans. The off-target error for siRNA (various lengths) derived from CDSs of the transcriptome did not change significantly when available UTR sequence data were included among the sequences considered as the transcriptome (average _P_-value = 82%); l = 100–400, n = 17–28 and m = 0. CDS indicates the case where UTR is not included; ‘+UTR’ indicates the case where 3′-UTR is included.

Figure 6

Figure 6

siRNA frequency distribution. The _x_-axis is the popularity rank of the siRNAs (the most frequently occurred siRNA is the most popular one and has the smallest abscissa). The _y_-axis is the corresponding count with which the siRNA occurs in the genome. The curves have very long and flat tails and the _x_-axis is actually only 1000th of the total data.

Figure 7

Figure 7

RNAi off-target error relative to varied position of the dsRNA along the target sequences. The off-target error is significantly lower when targeting the beginning 100–140 nt by dsRNA for generating siRNA. The _x_-axis is the start position of the dsRNA, pos. The _y_-axis is the error rate. (A) For C.elegans, off-target error shown for dsRNA lengths l = 100 and 200 nt, length of siRNA n = 22 and position pos 0–600. (B) Off-target error as a function of dsRNA position effect in H.sapiens (l = 100, 200, n = 25). (C) For S.pombe l = 200, 300, n = 25. When pos ≈ 100, the error reaches a local minimum.

Figure 8

Figure 8

The distribution of RNAi off-target error of each gene plotted against the position of genes on the physical map of chromosomes of C.elegans (A–F represent chromosomes I–V and X, respectively). These plots disclosed that the chromosomal position is predictive of the off-target error rate. The curves were smoothed by averaging the off-target errors between neighboring genes. Dashed bars indicate physical center of the chromosomes (cM = 0), solid bars identify the position where most genes are centered (density center). Parameters are l = 300, n = 21, mpos = 0. Unit on the _x_-axis is million nt.

Figure 9

Figure 9

Effect of rational siRNA design in H.sapiens and C.elegans. (A) H.sapiens and (B) C.elegans. Results shown for l = 300, n = 17–29 (only odd number of lengths are considered since rational design rules need to compute the center of an siRNA), pool size = 5, 10, 20 and rational filter is also considered. The selection of siRNA with properties that were empirically found to be associated with highly functional siRNA sequences reduced off-target error relative to that of non-filtered siRNA. Note that off-target error is never reduced to zero.

Similar articles

Cited by

References

    1. Fire A., Xu S.Q., Montgomery M.K., Kostas S.A., Driver S.E., Mello C.C. Potent and specific genetic interference by double-stranded RNA in Caenorhabditis elegans. Nature. 1998;391:806–811. - PubMed
    1. Fraser A.J.G., Kamath R.S., Zipperten P., Campos M.M., Sohrmann M., Ahringer J. Functional genomic analysis of C.elegans chromosome I by systemic RNA interference. Nature. 2000;408:325–330. - PubMed
    1. Dillin A. The specifics of small interfering RNA specificity. Proc. Natl Acad. Sci. USA. 2003;100:6289–6291. - PMC - PubMed
    1. Agrawal N., Dasaradhi P.V.N., Mohmmed A., Malhotra P., Bhatnagar R.K., Mukherjee S.K. RNA interference: biology, mechanism and applications. Microbiol. Mol. Biol. Rev. 2003;67:657–685. - PMC - PubMed
    1. Jacque J.M., Triques K., Stevenson M. Modulation of HIV-1 replication by RNA interference. Nature. 2002;418:435–438. - PMC - PubMed

Publication types

MeSH terms

Substances

LinkOut - more resources