Unraveling CRISPR-Cas9 genome engineering parameters via a library-on-library approach - PubMed (original) (raw)

Unraveling CRISPR-Cas9 genome engineering parameters via a library-on-library approach

Raj Chari et al. Nat Methods. 2015 Sep.

Abstract

We developed an in vivo library-on-library methodology to simultaneously assess single guide RNA (sgRNA) activity across ∼1,400 genomic loci. Assaying across multiple human cell types and end-processing enzymes as well as two Cas9 orthologs, we unraveled underlying nucleotide sequence and epigenetic parameters. Our results and software (http://crispr.med.harvard.edu/sgRNAScorer) enable improved design of reagents, shed light on mechanisms of genome targeting, and provide a generalizable framework to study nucleic acid-nucleic acid interactions and biochemistry in high throughput.

PubMed Disclaimer

Figures

Figure 1

Figure 1

Schematic of the library-on-library approach employed in our study. (A) sgRNA sequences corresponding to ~1,400 endogenous target sites were synthesized and cloned to make a sgRNA plasmid library. In parallel, replicas of the ~1,400 target sites were synthesized and cloned to make a lentiviral plasmid library. Subsequently, this lentiviral target library was integrated in our cells. Next, the sgRNA plasmid library was transiently transfected and cells were harvested for DNA 72 h post-transfection. Primer sequences designed against the constant sequences surrounding the target sites were used for PCR and libraries were prepared for Illumina sequencing. (B) In target site naïve cells, the above mentioned sgRNA plasmid library was transiently transfected and DNA was harvested 72 h post-transfection. Agilent SureSelect enrichment was performed on the genomic DNA with a custom set of probes specific to regions encompassing the ~1,400 target sites and libraries were prepared for Illumina sequencing.

Figure 2

Figure 2

Versatility of the library-on-library approach. (A) Scatter plot assessing the reproducibility of observed relative mutation rates across biological replicates. Data compared were from two independent sgRNA library transfections. (B) Comparison of activity between Cas9Sp nuclease vs. Cas9Sp nickase across all of the assessed sites. Box plots depicting the range of activities for (C) Cas9Sp nuclease in K562 cells. In total, 1,206 sites are represented for the control and 1,228 for the Cas9Sp nuclease treated cells. (D) Cas9St1 nuclease in 293T cells. In total, data for 1,172 sites are shown for the control and 1,169 for the nuclease treated cells. (E) Impact of TREX2 on altering patterns of NHEJ. A heavy bias towards deletions and slightly away from insertions is observed upon addition of TREX2. Data shown is for 293T using Cas9Sp nuclease. For all box plots, the size of the boxes represent the interquartile range (IQR) of the data. Whiskers are drawn to 1.5 * IQR. Red horizontal line represents the median. ‘+’ represent outlier data points that are beyond 1.5 * IQR.

Figure 3

Figure 3

(A) Position by position comparison of the base distributions between high and low activity sgRNA sequences. Position 20 exhibited the most striking difference. P values were calculated using a 2×4 Fisher’s exact test. Comparison of (B) DNase I hypersensitivity and (C) H3K4-trimethylation between regions of high and low sgRNA activity. P-values were calculated by employing a t-test comparing the values from each group. (D) Using the top 30 sgRNA sequences corresponding to regions with low activity and high DNase I sensitivity, the sequences were scored by the SVM for predicted activity. Strikingly, the majority of the sequences were predicted to have low activity. (E) Frequency of off-target activity and predicted activity. Using a recently published dataset, we scored all off-target sites with our SVM and compared those scores with the frequency of off-target activity observed. No discernable relationship was observed, suggesting these two aspects are independent.

Similar articles

Cited by

References

MAIN REFERENCES

    1. Mali P, et al. Science. 2013;339:823–826. - PMC - PubMed
    1. Cong L, et al. Science. 2013;339:819–823. - PMC - PubMed
    1. Doench JG, et al. Nat. Biotechnol. 2014;32:1262–1267. - PMC - PubMed
    1. Gagnon JA, et al. PLoS ONE. 2014;9:e98186. - PMC - PubMed
    1. Certo MT, et al. Nat. Methods. 2012;9:973–975. - PMC - PubMed

ONLINE REFERENCES

    1. Futreal PA, et al. Nat. Rev. Cancer. 2004;4:177–183. - PMC - PubMed
    1. Karolchik D, et al. Nucleic Acids Res. 2004;32:D493–D496. - PubMed
    1. Jiang H, Wong WH. Bioinforma. Oxf. Engl. 2008;24:2395–2396. - PMC - PubMed
    1. Xu Q, Schlabach MR, Hannon GJ, Elledge SJ. Proc. Natl. Acad. Sci. U. S. A. 2009;106:2289–2294. - PMC - PubMed
    1. Esvelt KM, et al. Nat. Methods. 2013;10:1116–1121. - PMC - PubMed

MeSH terms

Substances

LinkOut - more resources