Massively parallel kinetic profiling of natural and engineered CRISPR nucleases - PubMed (original) (raw)
Massively parallel kinetic profiling of natural and engineered CRISPR nucleases
Stephen K Jones Jr et al. Nat Biotechnol. 2021 Jan.
Abstract
Engineered SpCas9s and AsCas12a cleave fewer off-target genomic sites than wild-type (wt) Cas9. However, understanding their fidelity, mechanisms and cleavage outcomes requires systematic profiling across mispaired target DNAs. Here we describe NucleaSeq-nuclease digestion and deep sequencing-a massively parallel platform that measures the cleavage kinetics and time-resolved cleavage products for over 10,000 targets containing mismatches, insertions and deletions relative to the guide RNA. Combining cleavage rates and binding specificities on the same target libraries, we benchmarked five SpCas9 variants and AsCas12a. A biophysical model built from these data sets revealed mechanistic insights into off-target cleavage. Engineered Cas9s, especially Cas9-HF1, dramatically increased cleavage specificity but not binding specificity compared to wtCas9. Surprisingly, AsCas12a cleavage specificity differed little from that of wtCas9. Initial DNA cleavage sites and end trimming varied by nuclease, guide RNA and the positions of mispaired nucleotides. More broadly, NucleaSeq enables rapid, quantitative and systematic comparisons of specificity and cleavage outcomes across engineered and natural nucleases.
Conflict of interest statement
Competing interests
The authors declare competing financial interests. The authors have filed patent applications on the CHAMP platform. The Regents of the University of California have patents issued and pending for CRISPR technologies on which J.A.D. is an inventor. J.A.D. is a co-founder of Caribou Biosciences, Editas Medicine, Intellia Therapeutics, Scribe Therapeutics and Mammoth Biosciences. J.A.D. is a scientific advisory board member of Caribou Biosciences, Intellia Therapeutics, eFFECTOR Therapeutics, Scribe Therapeutics, Synthego, Mammoth Biosciences and Inari. J.A.D. is a member of the board of directors at Driver and Johnson & Johnson and has sponsored research projects by Roche Biopharma and Biogen. J.A.C. is a co-founder of Mammoth Biosciences. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. The authors declare no competing non-financial interests.
Figures
Fig. 1 ∣. Overview of the integrated NucleaSeq and CHAMP platform.
a, Crystal structures and domain maps of Cas9 and Cas12a RNP complexes (Protein Data Bank: 5F9R and 5B43). Stars: engineered Cas9 mutation sites. Scissors: cleavage sites. b, For NucleaSeq, a CRISPR–Cas nuclease digests a synthesized library of mispaired target DNAs under single-turnover conditions. DNAs contain unique left and right barcodes. Time point barcodes are added before NGS. NGS chips are recovered to profile DNA binding specificity via CHAMP. c, DNA libraries include targets with randomized PAMs or up to two guide-RNA-relative alterations. Right, read distribution by target type for CHAMP. d, A wtCas9 nuclease reaction time course (sgRNA 1) resolved by capillary electrophoresis. Each sample was run separately—two independent replicates for each. e, Cleavage rates are computed by fitting single exponential functions (lines) to uncut DNA depletions (circles). f, Cleavage rate reproducibility for wtCas9-sgRNA1 experiments. The gray area contains targets with rates beyond the experimental dynamic range. r: Pearson’s correlation coefficient excluding gray area. g, Cut DNA fragments from matched DNAs (black in diagram) report the time-dependent distribution of Cas9-generated cut sites in the TSs and NTSs. wtCas9-sgRNA1 cuts bluntly between the 3rd and 4th nucleotides (left). wtCas9-sgRNA2 produces a one-nucleotide 5′ overhang and then trims it off the NTS (right). Colors: cut positions (triangles in diagram). Error bars: median ± s.e.m. of n = 146 guide-RNA-matched library members. h, Ranked relative cleavage rates of all library members for all five nucleases. Limit: relative cleavage rate beyond detection limit. i, CHAMP reports the apparent binding affinity of nuclease-inactive CRISPR enzymes. Library DNAs on the surface of an NGS chip are incubated with increasing concentrations of a fluorescent dCas9 (cyan puncta). Their sequences are bioinformatically determined by comparison to the NGS output. Scale bar, 50 μm; inset, 5 μm. j, ABAs are computed by fitting Hill functions (lines) to mean fluorescence DNA clusters intensities (circles). AU, arbitrary fluorescence units. Median ± s.d. from bootstrap analysis of n ≥ 5 DNA clusters for each target. k, Correlation of dCas9 ΔABAs measured with CHAMP to dCas9 on-rates from a high-throughput assay. ΔABA, change in apparent binding affinity from the matched target, normalized to that of a scrambled DNA. r: Pearson’s correlation coefficient. x axis: median ± s.d. from bootstrap analysis of n ≥ 5 DNA clusters for each target. y axis: median ± s.e.m. of n ≥ 6 for each target DNA.
Fig. 2 ∣. Comprehensive analysis of off-target wtCas9 DNA binding and cleavage.
a, dCas9 ΔABAs for targets with one sgRNA1-relative mismatch. Dashed line: normalized matched target ΔABA (0); solid line: scrambled DNA ΔABA (negative control, 1). Median ± s.d. from bootstrap analysis of n ≥ 5 DNA clusters for each target. b, Cas9 cleavage rates for the same targets as in a. Dashed line: cleavage rate of the matched target; solid line: limit of detection for the slowest-cleaving targets. Error bars: s.d. from 50 bootstrap analysis measurements. c, ΔABAs (upper, grays) and cleavage rates (lower, blues) for targets containing two sgRNA1-relative mismatches. Black boxes expanded in callouts. d, dCas9 ΔABAs (upper, median ± s.d. from bootstrap analysis of n ≥ 5 DNA clusters for each target) and Cas9 cleavage rates (lower, error bars: s.d. from 50 bootstrap analysis measurements) for targets containing one sgRNA1-relative deletion or (e) insertion. f, Normalized reads for the TS and NTS of DNAs containing either a mismatch at position 3 (C3T or A3T) or a deletion at position 1 compared to sgRNA1 (left) or sgRNA2 (right). Error bars: maximum s.d. for cut products from cleavage of 146 matched DNA controls. g, Average cut site positions for each strand (TS and NTS) from DNAs containing one mismatch relative to sgRNA 1 (upper) or sgRNA 2 (lower). Range: earliest time point with more than 33% cut reads (open diamonds) to final time point (filled diamonds). Dashed and solid horizontal lines: mean cut site positions for 146 matched DNAs (M) at early and late time points.
Fig. 3 ∣. Comparison of engineered Cas9 nucleases.
a,b, Two-dimensional density plots correlate Cas9-HF1 ΔABAs (a) and cleavage rates (b) with those from wtCas9, Cas9-Enh and Cas9-Hypa (sgRNA1). Histograms: all ΔABAs or cleavage rates for the respective nuclease. r: Pearson’s correlation coefficient. c, The ratio of Cas9-HF1 to wtCas9 cleavage specificities for targets with two sgRNA1-relative mismatches. Red: slower cleavage by Cas9-HF1; blue: slower cleavage by wtCas9. Black-outlined range expanded in callout. d, Cas9-HF1 cleavage patterns on the TS and NTS of select target DNAs (sgRNA1). Normalized counts of cut products comprising ≥10% of the total cut reads at any time point. Error bars: maximum s.d. for cut products from cleavage of 146 matched DNA controls. e, Average cut site positions generated by Cas9-HF1 for each strand (TS and NTS) for targets containing one sgRNA-relative mismatch. Range: earliest timepoint with more than 33% cut reads (open diamonds) to final time point (filled diamonds). Dashed and solid horizontal lines: mean cut site positions for 146 matched DNAs (M) at early and late time points.
Fig. 4 ∣. Comprehensive analysis of off-target Cas12a cleavage.
a, Cas12a cleavage rates for DNAs containing one crRNA3-relative mismatch. Dashed line: cleavage rate of the matched target. Solid line: limit of detection for the slowest-cleaving targets. Error bars: s.d. from 50 bootstrap analysis measurements. b, Total cleavage rate compared to reported target strand cleavage (left) and R-loop propagation (right) rates. Total cleavage rates error bars: s.d. from 50 bootstrap analysis measurements. TS cleavage and R-loop propagation rate error bars: rate from a hyperbolic fit ± s.d. from n = 3 independent experiments. r: Pearson’s correlation coefficient. c, Cleavage rates for targets with two crRNA3-relative mismatches. Black box expanded in callout. d, Cleavage rates for DNAs containing one crRNA3-relative deletion (upper) or insertion (lower). Error bars: s.d. from 50 bootstrap analysis measurements. Nucleotides inserted to the left of the given positions. e, Normalized reads for the TSs and NTSs of the indicated targets. Parenthesis: crRNA. Error bars: maximum s.d. for cut products from cleavage of 146 matched DNA controls. f, Average cut site positions for each strand (TS and NTS) from DNAs with one crRNA3-relative mismatch (upper), deletion (middle) or insertion (lower). Range: earliest time point with more than 33% cut reads (open diamonds) to final time point (filled diamonds). Dashed and solid horizontal lines: mean cut site positions for 146 matched DNAs (M) at early and late time points.
Fig. 5 ∣. Statistical modeling of CRISPR–Cas nuclease cleavage.
a, AIC values for five biophysical models relying on the indicated sequence parameters. The most detailed model (V) has the lowest AIC (information loss)—that is, the best goodness of fit. R-loop position-specific parameters reduce the AIC most. b, Correlation between measured and modeled cleavage rates for Cas9-HF1 (left, red) and Cas12a (right, purple) using model V. Histograms: distributions of fit or measured values beyond the upper and lower detection limits. Percentages: quantity of data with one or both values beyond detection limits. r: Pearson’s correlation coefficient. c, Base identity-dependent weights for mismatches and insertions averaged across all Cas9 and Cas12a enzymes. See Supplementary Fig. 8 and text for additional information. d, Modeled specificity penalties for one guide-RNA-relative mismatch (upper), insertion (middle) or deletion (lower). PAMs are oriented left for comparison. Arrows and dashed lines: values below the detection limit. e, The predicted reduction in mismatch-dependent cleavage rates correlates with previous high-throughput measurements of reduced edit efficiencies for wtCas9 (blue) and Cas12a (purple). See Methods for associated data. ρ: Spearman’s correlation coefficient. f, The number of off-target sites in the human genome with a predicted cleavage specificity greater than the indicated specificity threshold. For each nuclease, n = 1,000 targets, selected randomly from exomic DNA. The cleavage specificities of the potential off-target cleavage sites across the genome were calculated using model V. Top whisker (maxima): top of 90% confidence interval; top box: third quartile; center line: median; lower box: second quartile; lower whisker (minima): bottom of 90% confidence interval.
Similar articles
- Editor's cut: DNA cleavage by CRISPR RNA-guided nucleases Cas9 and Cas12a.
Swartjes T, Staals RHJ, van der Oost J. Swartjes T, et al. Biochem Soc Trans. 2020 Feb 28;48(1):207-219. doi: 10.1042/BST20190563. Biochem Soc Trans. 2020. PMID: 31872209 Free PMC article. Review. - Systematic in vitro specificity profiling reveals nicking defects in natural and engineered CRISPR-Cas9 variants.
Murugan K, Suresh SK, Seetharam AS, Severin AJ, Sashital DG. Murugan K, et al. Nucleic Acids Res. 2021 Apr 19;49(7):4037-4053. doi: 10.1093/nar/gkab163. Nucleic Acids Res. 2021. PMID: 33744974 Free PMC article. - Structural basis for self-cleavage prevention by tag:anti-tag pairing complementarity in type VI Cas13 CRISPR systems.
Wang B, Zhang T, Yin J, Yu Y, Xu W, Ding J, Patel DJ, Yang H. Wang B, et al. Mol Cell. 2021 Mar 4;81(5):1100-1115.e5. doi: 10.1016/j.molcel.2020.12.033. Epub 2021 Jan 19. Mol Cell. 2021. PMID: 33472057 Free PMC article. - Coordinated Actions of Cas9 HNH and RuvC Nuclease Domains Are Regulated by the Bridge Helix and the Target DNA Sequence.
Babu K, Kathiresan V, Kumari P, Newsom S, Parameshwaran HP, Chen X, Liu J, Qin PZ, Rajan R. Babu K, et al. Biochemistry. 2021 Dec 14;60(49):3783-3800. doi: 10.1021/acs.biochem.1c00354. Epub 2021 Nov 10. Biochemistry. 2021. PMID: 34757726 Free PMC article. - Application of CHyMErA Cas9-Cas12a combinatorial genome-editing platform for genetic interaction mapping and gene fragment deletion screening.
Aregger M, Xing K, Gonatopoulos-Pournatzis T. Aregger M, et al. Nat Protoc. 2021 Oct;16(10):4722-4765. doi: 10.1038/s41596-021-00595-1. Epub 2021 Sep 10. Nat Protoc. 2021. PMID: 34508260 Review.
Cited by
- The Evolution of Nucleic Acid-Based Diagnosis Methods from the (pre-)CRISPR to CRISPR era and the Associated Machine/Deep Learning Approaches in Relevant RNA Design.
Chakraborty SS, Ray Dutta J, Ganesan R, Minary P. Chakraborty SS, et al. Methods Mol Biol. 2025;2847:241-300. doi: 10.1007/978-1-0716-4079-1_17. Methods Mol Biol. 2025. PMID: 39312149 - Gene editing of angiotensin for blood pressure management.
Masi S, Dalpiaz H, Borghi C. Masi S, et al. Int J Cardiol Cardiovasc Risk Prev. 2024 Aug 20;23:200323. doi: 10.1016/j.ijcrp.2024.200323. eCollection 2024 Dec. Int J Cardiol Cardiovasc Risk Prev. 2024. PMID: 39258007 Free PMC article. - Impact of predictive selection of LbCas12a CRISPR RNAs upon on- and off-target editing rates in soybean.
Rymarquis L, Wu C, Hohorst D, Vega-Sanchez M, Mullen TE, Vemulapalli V, Smith DR. Rymarquis L, et al. Plant Direct. 2024 Aug 16;8(8):e627. doi: 10.1002/pld3.627. eCollection 2024 Sep. Plant Direct. 2024. PMID: 39157758 Free PMC article. - CRISPR technology in human diseases.
Feng Q, Li Q, Zhou H, Wang Z, Lin C, Jiang Z, Liu T, Wang D. Feng Q, et al. MedComm (2020). 2024 Jul 29;5(8):e672. doi: 10.1002/mco2.672. eCollection 2024 Aug. MedComm (2020). 2024. PMID: 39081515 Free PMC article. Review. - CRISPR-Cas12a exhibits metal-dependent specificity switching.
Nguyen GT, Schelling MA, Raju A, Buscher KA, Sritharan A, Sashital DG. Nguyen GT, et al. Nucleic Acids Res. 2024 Sep 9;52(16):9343-9359. doi: 10.1093/nar/gkae613. Nucleic Acids Res. 2024. PMID: 39019776 Free PMC article.
References
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous