CRISPRdirect: software for designing CRISPR/Cas guide RNA with reduced off-target sites (original) (raw)

Journal Article

,

1Database Center for Life Science (DBCLS), 2National Institute of Genetics, Research Organization of Information and Systems, 1111 Yata, Mishima, Shizuoka 411-8540, Japan and 3Department of Biological Sciences, Graduate School of Science, University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-0033, Japan

1Database Center for Life Science (DBCLS), 2National Institute of Genetics, Research Organization of Information and Systems, 1111 Yata, Mishima, Shizuoka 411-8540, Japan and 3Department of Biological Sciences, Graduate School of Science, University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-0033, Japan

*To whom correspondence should be addressed.

Search for other works by this author on:

,

1Database Center for Life Science (DBCLS), 2National Institute of Genetics, Research Organization of Information and Systems, 1111 Yata, Mishima, Shizuoka 411-8540, Japan and 3Department of Biological Sciences, Graduate School of Science, University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-0033, Japan

1Database Center for Life Science (DBCLS), 2National Institute of Genetics, Research Organization of Information and Systems, 1111 Yata, Mishima, Shizuoka 411-8540, Japan and 3Department of Biological Sciences, Graduate School of Science, University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-0033, Japan

Search for other works by this author on:

,

1Database Center for Life Science (DBCLS), 2National Institute of Genetics, Research Organization of Information and Systems, 1111 Yata, Mishima, Shizuoka 411-8540, Japan and 3Department of Biological Sciences, Graduate School of Science, University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-0033, Japan

1Database Center for Life Science (DBCLS), 2National Institute of Genetics, Research Organization of Information and Systems, 1111 Yata, Mishima, Shizuoka 411-8540, Japan and 3Department of Biological Sciences, Graduate School of Science, University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-0033, Japan

Search for other works by this author on:

1Database Center for Life Science (DBCLS), 2National Institute of Genetics, Research Organization of Information and Systems, 1111 Yata, Mishima, Shizuoka 411-8540, Japan and 3Department of Biological Sciences, Graduate School of Science, University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-0033, Japan

Search for other works by this author on:

†The authors wish it to be known that, in their opinion, the first two authors should be regarded as Joint First Authors.

Associate Editor: Alfonso Valencia

Author Notes

Revision received:

15 October 2014

Accepted:

05 November 2014

Published:

09 December 2014

Cite

Yuki Naito, Kimihiro Hino, Hidemasa Bono, Kumiko Ui-Tei, CRISPRdirect: software for designing CRISPR/Cas guide RNA with reduced off-target sites, Bioinformatics, Volume 31, Issue 7, April 2015, Pages 1120–1123, https://doi.org/10.1093/bioinformatics/btu743
Close

Navbar Search Filter Mobile Enter search term Search

Abstract

Summary: CRISPRdirect is a simple and functional web server for selecting rational CRISPR/Cas targets from an input sequence. The CRISPR/Cas system is a promising technique for genome engineering which allows target-specific cleavage of genomic DNA guided by Cas9 nuclease in complex with a guide RNA (gRNA), that complementarily binds to a ∼20 nt targeted sequence. The target sequence requirements are twofold. First, the 5′-NGG protospacer adjacent motif (PAM) sequence must be located adjacent to the target sequence. Second, the target sequence should be specific within the entire genome in order to avoid off-target editing. CRISPRdirect enables users to easily select rational target sequences with minimized off-target sites by performing exhaustive searches against genomic sequences. The server currently incorporates the genomic sequences of human, mouse, rat, marmoset, pig, chicken, frog, zebrafish, Ciona, fruit fly, silkworm, Caenorhabditis elegans, Arabidopsis, rice, Sorghum and budding yeast.

Availability: Freely available at http://crispr.dbcls.jp/.

Contact: y-naito@dbcls.rois.ac.jp

Supplementary information: Supplementary data are available at Bioinformatics online.

1 Introduction

Genome engineering is a promising technique to manipulate endogenous chromosomal DNA in a site-specific manner. A novel system that employs the prokaryotic immune defense system based on the clustered regularly interspaced short palindromic repeats (CRISPR) and CRISPR-associated (Cas) protein has been reported as a prominent genome engineering approach (Cho et al., 2013; Cong et al., 2013; Jinek et al., 2013; Mali et al., 2013). Recent studies utilize the RNA-guided endonuclease Cas9 from Streptococcus pyogenes and a guide RNA (gRNA), which acts as a guide to define the target site to introduce DNA double-stranded break. A remarkable advantage of the CRISPR/Cas system is that the target DNA sequence is recognized by simple base-pairing complementarity by the gRNA. Thus, the CRISPR/Cas system can be programmed only by changing the gRNA sequence, and the synthesis of the gRNA for targeting a specific gene is easy at low cost. However, it should be a critical issue to avoid the cleavage of the unintended off-target genes, since double-stranded break results in stable and heritable modification of the genome.

In this study, we present CRISPRdirect (http://crispr.dbcls.jp/), which provides efficient selection of CRISPR/Cas target sites with reduced numbers of potential off-target candidates. CRISPRdirect investigates the entire genome for perfect matches with each candidate target sequence (20 mer) and their seed sequence (12 or 8 mer) flanking the PAM. Users can also browse the detailed list of potential off-target sites that have partial complementarity with the selected sequence. The server incorporates genomic sequences of human, mouse, rat, marmoset, pig, chicken, frog, zebrafish, Ciona, fruit fly, silkworm, Caenorhabditis elegans, Arabidopsis, rice, Sorghum and budding yeast. Currently, several web servers are available for designing CRISPR/Cas gRNAs (Supplementary Table S1). CRISPR Design (Hsu et al., 2013; Ran et al., 2013) performs gRNA selection from an input sequence up to 250 bp, and gRNAs are scored based on predicted off-target interactions. E-CRISP (Heigwer et al., 2014) ranks gRNAs according to on-target specificity and number of off-targets. E-CRISP, ZiFiT (Sander et al., 2010), Cas9 Design (Ma et al., 2013) and CHOPCHOP (Montague et al., 2014) utilize Bowtie (Langmead and Salzberg, 2012) to perform off-target searches allowing mismatches. On the other hand, DNA2.0 gRNA Design Tool (https://www.dna20.com/eCommerce/startCas9) searches for perfect matches with 12 nt seed to identify off-target sites. These servers except CRISPR Design and ZiFiT can process at least 10 kbp of input sequence. Web servers for checking off-target sites for given 20 nt sequences are also available, such as Cas-OFFinder (Bae et al., 2014) and GGGenome (http://GGGenome.dbcls.jp/). These web servers are useful for designing gRNAs for a few input sequences, but processing large number of input sequences requires a laborious process. Even in such cases, CRISPRdirect returns the results quickly and provides a convenient interface for automated gRNA design as described in the Data export and API section, making it a powerful tool for using CRISPR/Cas system on a genome-wide scale.

2 Web server implementation

2.1 Overview

The web server accepts an accession number, a genome coordinate or an arbitrary nucleotide sequence up to 10 kbp as input (Fig. 1A) and returns a list of CRISPR/Cas target candidates. Target sequences of 20 nt adjacent to the PAM sequence (e.g. NGG, NRG) are searched from both strands of the input sequence and listed as shown in Figure 1B. The list contains target position, target sequence, additional information on the sequence and the number of target sites in the genome. The additional information on the sequences such as GC content and calculated melting temperature (Tm) are provided, since previous report suggested that sgRNA sequences with very high or low GC content were less effective against their targets (Wang et al., 2014). The presence or absence of TTTT (four consecutive T’s that cause pol III termination) in the target sequence is also indicated in order to avoid TTTT in gRNA vectors with pol III promoter. A detailed description of the web server is provided in Supplementary Methods.

Screenshot from the CRISPRdirect web server. (A) Top page. The server accepts either an accession number or a nucleotide sequence as input. (B) Typical output of CRISPRdirect. A list of CRISPR/Cas target candidates is displayed. (C) A graphical view of target sites demonstrates the position and orientation of each site. (D) The results can be exported as tab-delimited text or in JSON format. (E) Detailed list of potential off-target sites which visualizes the positions of mismatches and gaps

Fig. 1.

Screenshot from the CRISPRdirect web server. (A) Top page. The server accepts either an accession number or a nucleotide sequence as input. (B) Typical output of CRISPRdirect. A list of CRISPR/Cas target candidates is displayed. (C) A graphical view of target sites demonstrates the position and orientation of each site. (D) The results can be exported as tab-delimited text or in JSON format. (E) Detailed list of potential off-target sites which visualizes the positions of mismatches and gaps

2.2 Off-target evaluation

The number of target sites in the genome (Fig. 1B) is counted using Jellyfish (Marçais and Kingsford, 2011). The column ‘20 mer+PAM’ shows the number of hits with perfect matches for each target sequence (20 mer) adjacent to the PAM. Although the exact length of the completely complementary region necessary for cleavage by CRISPR nucleases is unknown, the mutations within the ‘seed’ sequence at 8–12 nt immediately adjacent to the PAM are known to impair cleavage, suggesting that this region is the most critical determinant of target specificity (Cong et al., 2013; Fu et al., 2013; Hsu et al., 2013; Pattanayak et al., 2013). Therefore, we built up the columns ‘12 mer+PAM’ and ‘8 mer+PAM’ in order to show the number of hits with perfect matches for their seed sequence (12 or 8 mer, respectively) adjacent to the PAM. Note that the numbers of hits displayed here include both on-target and off-target sites. For instance, one (‘1’) in these columns indicates that the sequence has only one perfect match with the intended target site. Any number greater than one indicates that there are some potential off-target sites. Thus, in terms of avoiding off-target editing, the smaller the number (but not zero) is, the better. Zero (‘0’) in these columns means that the sequence has no match in the genomic sequence; such sequences may possibly span over exon–exon junctions, so their use should be avoided. CRISPRdirect highlights the CRISPR/Cas targets that have relatively fewer off-target sites (Fig. 1B and C). A detailed list of off-target candidates can be investigated by clicking the ‘detail’ link (Fig. 1E). The searches allowing mismatches and gaps (insertions and/or deletions) are performed using GGGenome (http://GGGenome.dbcls.jp/) REST API developed by the authors’ group instead of widely used BLAST (Altschul et al., 1990), because BLAST may overlook some potential off-targets as mentioned in our previous work describing siDirect (Naito et al., 2004), a web server for designing functional siRNA with reduced off-target effects. GGGenome quickly searches short nucleotide sequences utilizing suffix arrays and inverse suffix links indexed on solid state drive (SSD). As shown in Supplementary Table S1, off-target searches allowing gaps are not yet available in other existing web tools. However, the most recent report shows that CRISPR/Cas9 system has off-target activity with insertions or deletions between target DNA and gRNA sequences (Lin et al., 2014). Therefore, we consider that off-target searches allowing mismatches and gaps would be a more suitable procedure to list off-target candidates exhaustively. The positions of the mismatches and gaps are visualized in the list (Fig. 1E), which may help predict the potency of off-target editing.

CRISPRdirect incorporates genomic sequences of various organisms to perform off-target searches. Although Xenopus laevis has long been used as a preferred model organism among developmental biologists, we incorporated X . tropicalis genome instead of X. laevis genome, because X. tropicalis is diploid while X. laevis is allotetraploid which makes it difficult to select specific targets.

There are some loci that are difficult to select specific targets. Typical examples are the histone clusters (NM_021059, etc.) and ribosomal proteins (NM_022551, etc.), which are known to form multigene families. When designing CRISPR targets for such genes, users should manually investigate a detailed list of potential off-target sites (Fig. 1E) and select the sequence that has fewer off-target hits on unrelated loci. Alternatively, if site-specific gRNA could not be designed within intended region, multiple gRNA approaches would be considerable (Guilinger et al., 2014; Ran et al., 2013; Tsai et al., 2014). For such strategy, graphical view of CRISPRdirect results which visualizes the position and orientation of target sites (Fig. 1C) would be helpful for selecting paired gRNAs.

2.3 Data export and API

The results can be exported as tab-delimited text or in JSON format from the bottom of the result page (Fig. 1D). Users can copy–paste the text results into a spreadsheet or text editor for downstream analysis. The results can also be downloaded as a separate file by clicking the ‘download’ link. Alternatively, tab-delimited text or JSON output can be obtained via API, which is convenient for users to design a number of CRISPR/Cas targets in an automated manner.

Funding

Life Science Database Integration Project, National Bioscience Database Center (NBDC) of Japan Science and Technology Agency (JST) (to Y.N. and H.B.); Grant-in-Aid for Scientific Research from the Ministry of Education, Culture, Sports, Science and Technology (MEXT) of Japan (to Y.N. and K.U.-T.); Cell Innovation Program of MEXT (to K.U.-T.).

Conflict of interest: none declared.

References

et al. . (

1990

)

Basic local alignment search tool

.

J. Mol. Biol.

,

215

,

403

410

.

et al. . (

2014

)

Cas-OFFinder: a fast and versatile algorithm that searches for potential off-target sites of Cas9 RNA-guided endonucleases

.

Bioinformatics

,

30

,

1473

1475

.

et al. . (

2013

)

Targeted genome engineering in human cells with the Cas9 RNA-guided endonuclease

.

Nat. Biotechnol.

,

31

,

230

232

.

et al. . (

2013

)

Multiplex genome engineering using CRISPR/Cas systems

.

Science

,

339

,

819

823

.

et al. . (

2013

)

High-frequency off-target mutagenesis induced by CRISPR-Cas nucleases in human cells

.

Nat. Biotechnol.

,

31

,

822

826

.

et al. . (

2014

)

Fusion of catalytically inactive Cas9 to FokI nuclease improves the specificity of genome modification

.

Nat. Biotechnol.

,

32

,

577

582

.

et al. . (

2014

)

E-CRISP: fast CRISPR target site identification

.

Nat. Methods

,

11

,

122

123

.

et al. . (

2013

)

DNA targeting specificity of RNA-guided Cas9 nucleases

.

Nat. Biotechnol.

,

31

,

827

832

.

et al. . (

2013

)

RNA-programmed genome editing in human cells

.

eLife

,

2

,

e00471

.

(

2012

)

Fast gapped-read alignment with Bowtie 2

.

Nat. Methods

,

9

,

357

359

.

et al. . (

2014

)

CRISPR/Cas9 systems have off-target activity with insertions or deletions between target DNA and guide RNA sequences

.

Nucleic Acids Res.

,

42

,

7473

7485

.

et al. . (

2013

)

A guide RNA sequence design platform for the CRISPR/Cas9 system for model organism genomes

.

Biomed. Res. Int.

,

2013, 270805

.

et al. . (

2013

)

RNA-guided human genome engineering via Cas9

.

Science

,

339

,

823

826

.

(

2011

)

A fast, lock-free approach for efficient parallel counting of occurrences of _k_-mers

.

Bioinformatics

,

27

,

764

770

.

et al. . (

2014

)

CHOPCHOP: a CRISPR/Cas9 and TALEN web tool for genome editing

.

Nucleic Acids Res.

,

42

,

W401

W407

.

et al. . (

2004

)

siDirect: highly effective, target-specific siRNA design software for mammalian RNA interference

.

Nucleic Acids Res.

,

32

,

W124

W129

.

et al. . (

2013

)

High-throughput profiling of off-target DNA cleavage reveals RNA-programmed Cas9 nuclease specificity

.

Nat. Biotechnol.

,

31

,

839

843

.

et al. . (

2013

)

Double nicking by RNA-guided CRISPR Cas9 for enhanced genome editing specificity

.

Cell

,

154

,

1380

1389

.

et al. . (

2010

)

ZiFiT (Zinc Finger Targeter): an updated zinc finger engineering tool

.

Nucleic Acids Res.

,

38

,

W462

W468

.

et al. . (

2014

)

Dimeric CRISPR RNA-guided FokI nucleases for highly specific genome editing

.

Nat. Biotechnol.

,

32

,

569

576

.

et al. . (

2014

)

Genetic screens in human cells using the CRISPR-Cas9 system

.

Science

,

343

,

80

84

.

Author notes

†The authors wish it to be known that, in their opinion, the first two authors should be regarded as Joint First Authors.

Associate Editor: Alfonso Valencia

© The Author 2014. Published by Oxford University Press.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

Supplementary data

Citations

Views

Altmetric

Metrics

Total Views 18,608

14,327 Pageviews

4,281 PDF Downloads

Since 11/1/2016

Month: Total Views:
November 2016 30
December 2016 22
January 2017 89
February 2017 117
March 2017 120
April 2017 90
May 2017 124
June 2017 125
July 2017 103
August 2017 82
September 2017 115
October 2017 125
November 2017 137
December 2017 199
January 2018 274
February 2018 251
March 2018 272
April 2018 277
May 2018 296
June 2018 225
July 2018 235
August 2018 268
September 2018 231
October 2018 221
November 2018 261
December 2018 217
January 2019 219
February 2019 212
March 2019 257
April 2019 361
May 2019 284
June 2019 216
July 2019 245
August 2019 213
September 2019 245
October 2019 227
November 2019 217
December 2019 225
January 2020 208
February 2020 180
March 2020 157
April 2020 123
May 2020 144
June 2020 135
July 2020 160
August 2020 170
September 2020 151
October 2020 188
November 2020 182
December 2020 216
January 2021 177
February 2021 171
March 2021 233
April 2021 236
May 2021 216
June 2021 170
July 2021 163
August 2021 158
September 2021 212
October 2021 226
November 2021 249
December 2021 213
January 2022 181
February 2022 218
March 2022 330
April 2022 280
May 2022 280
June 2022 249
July 2022 195
August 2022 228
September 2022 229
October 2022 180
November 2022 212
December 2022 156
January 2023 210
February 2023 222
March 2023 236
April 2023 184
May 2023 228
June 2023 197
July 2023 191
August 2023 149
September 2023 175
October 2023 211
November 2023 169
December 2023 171
January 2024 209
February 2024 206
March 2024 274
April 2024 174
May 2024 172
June 2024 183
July 2024 149
August 2024 182
September 2024 113

Citations

798 Web of Science

×

Email alerts

Citing articles via

More from Oxford Academic