Harnessing accurate non-homologous end joining for efficient precise deletion in CRISPR/Cas9-mediated genome editing
Genome Biology volume 19, Article number: 170 (2018)
Many applications of CRISPR/Cas9-mediated genome editing require Cas9-induced non-homologous end joining (NHEJ), which was thought to be error prone. However, with directly ligatable ends, Cas9-induced DNA double strand breaks may be repaired preferentially by accurate NHEJ.
In the repair of two adjacent double strand breaks induced by paired Cas9-gRNAs at 71 genome sites, accurate NHEJ accounts for about 50% of NHEJ events. This paired Cas9-gRNA approach underestimates the level of accurate NHEJ due to frequent + 1 templated insertions, which can be avoided by the predefined Watson/Crick orientation of protospacer adjacent motifs (PAMs). The paired Cas9-gRNA strategy also provides a flexible, reporter-less approach for analyzing both accurate and mutagenic NHEJ in cells and in vivo, and it has been validated in cells deficient for XRCC4 and in mouse liver. Due to high frequencies of precise deletions of defined “3n”-, “3n + 1”-, or “3n + 2”-bp length, accurate NHEJ is used to improve the efficiency and homogeneity of gene knockouts and targeted in-frame deletions. Compared to “3n + 1”-bp, “3n + 2”-bp can overcome + 1 templated insertions to increase the frequency of out-of-frame mutations. By applying paired Cas9-gRNAs to edit MDC1 and key 53BP1 domains, we are able to generate predicted, precise deletions for functional analysis. Lastly, a Plk3 inhibitor promotes NHEJ with bias towards accurate NHEJ, providing a chemical approach to improve genome editing requiring precise deletions.
NHEJ is inherently accurate in repair of Cas9-induced DNA double strand breaks and can be harnessed to improve CRISPR/Cas9 genome editing requiring precise deletion of a defined length.
The CRISPR/Cas9 technology has revolutionized many fields of research and commercial development in basic science, medicine, and agriculture [1, 2]. As one of the most common applications, genome editing mediated by CRISPR/Cas9 usually requires targeted induction of a DNA double strand break (DSB) and its repair at a target genome site [3,4,5]. Manipulation of such DSB induction and repair yields opportunities for optimizing genome editing for a variety of efficient and precise genome modifications [1, 6, 7].
In mammalian cells, Cas9-induced DSBs are repaired largely by non-homologous end joining (NHEJ) and partly by homology-directed repair (HDR) that includes gene conversion, microhomology-mediated end joining (MMEJ), and single strand annealing (SSA) [1, 6]. As an error prone repair pathway, NHEJ generates a high frequency of insertions or deletions (indels) at the repair junctions. Despite potential bias induced by end resection and use of microhomologies during end joining, generation of indels is rather random [7,8,9,10]. Thus, these indels are heterogeneous in size and in DNA context. While Cas9-induced indels could inactivate a gene by frame-shift or disruption of key elements, generating gene knockouts, many remain in-frame. As a result, the knockout efficiency is low and additional effort has to be made in identification of knockout clones.
Previously, we and others have shown that a majority (up to 75%) of I-SceI-induced DSBs are rejoined without error in mammalian cells proficient for the classic NHEJ pathway [9,10,11,12,13,14,15]. It is expected that Cas9-induced blunt ends can also be readily and accurately re-ligated . In fact, repair profiling of two Cas9-induced DSBs that are hundreds or thousands of base pairs (bp) apart at a few loci has revealed that a portion of distal ends of these two DSBs are accurately ligated [4, 17,18,19,20,21,22,23]. This suggests that many Cas9-induced DSBs, if not most, are repaired by accurate NHEJ. Knowledge of accurate NHEJ in repair of Cas9-induced DSBs could help yield a strategy to shift NHEJ away from accurate NHEJ or even to harness accurate NHEJ to generate precise indels of defined length, promoting CRISPR/Cas9-mediated gene knockouts. It is therefore important to systematically examine accurate NHEJ and determine its level in repair of Cas9-induced DSBs.
Genome editing often requires precise deletion of a particular DNA region with defined length. Continuing expansion of CRISPR/Cas9 protospacer adjacent motif (PAM) compatibility will further increase the application spectrum for precise deletion in genome editing [24,25,26]. Currently, precise deletion can be achieved by MMEJ-mediated insertion or HDR-mediated replacement with a donor DNA containing predefined deletion [1, 6, 7]. However, these two approaches are usually inefficient, and MMEJ-mediated insertion is error prone, limiting their application for precise deletion. A dual nuclease was recently constructed to generate precise deletions of 33 to 36 bp at frequencies of up to 40% . In order to expand the range and efficiency of precise deletion and the scope of its application in genome editing, new approaches are still needed.
Here, we used paired gRNAs to guide Streptococcus pyogenes Cas9 to induce two concurrent DSBs that are 23–148 bp apart at 70 endogenous genome sites in mouse and human cells and 1439 bp apart at one site. Repair profiling revealed that NHEJ is inherently accurate in the repair of Cas9-induced DSBs. By identifying and subsequently controlling the factors that influence accurate NHEJ in repair of two close and concurrent DSBs induced by paired Cas9-gRNAs, we were able to increase precise out-of-frame or in-frame deletions of defined length and improve CRISPR/Cas9-mediated genome editing, including gene knockouts and targeted in-frame deletions. In addition, this paired Cas9-gRNA approach was validated as a flexible and reliable reporterless assay for both accurate and mutagenic NHEJ in cells and in vivo.
Repair of CRISPR/Cas9-induced DSBs by NHEJ is inherently accurate
Previously, we developed a reporter to analyze NHEJ in mouse embryonic stem (ES) cells and found that I-SceI-induced NHEJ is mostly accurate [11, 28]. In this reporter, no wild-type GFP can be synthesized in cells due to the upstream, out-of-frame translation start site “Koz-ATG” flanked by two I-SceI sites 34 bp apart (Fig. 1a). Upon I-SceI expression, a DSB can be induced at either or both I-SceI sites. NHEJ repair of I-SceI-induced DSBs generates GFP+ cells because of pop-out of “Koz-ATG” by simultaneous DNA cleavage at two I-SceI sites, disruption of “Koz-ATG” by end resection initiated from either I-SceI-induced DSB, or correction of the GFP reading frame by indel-mediated loss of “3n + 1” bp or gain of “3n + 2” bp introduced at the downstream I-SceI cutting site. We previously analyzed a number of individual I-SceI-induced GFP+ clones by Sanger sequencing and a pool of sorted GFP+ cells by Illumina amplicon deep sequencing and showed that two distal and compatible DNA ends of I-SceI-induced DSBs were rejoined mostly by accurate NHEJ [11, 28].
Given that blunt ends of Cas9-induced DSBs could be directly ligated, we hypothesized that Cas9-induced DSBs also favor accurate NHEJ for their end-joining repair. As accurate NHEJ of a single DSB induced by Cas9 generates a product indistinguishable from uncut DNA, we thus designed paired Cas9-gRNAs, mimicking an I-SceI pair, to induce simultaneous Cas9 cleavage in the vicinity of two I-SceI sites in the reporter. We analyzed NHEJ by Illumina amplicon deep sequencing and compared accurate NHEJ repair of Cas9-induced DSBs with that of I-SceI-induced DSBs (Fig. 1a). The gRNA pair (i.e., gsGEJ2/gsGEJ2a), along with Cas9, induced DSBs at target sites that were 95 bp apart. For unbiased NHEJ analysis, we amplified the target DNA region of unsorted cells by PCR after NHEJ repair and analyzed repair junctions by Illumina deep sequencing. We categorized NHEJ repair of I-SceI- or Cas9-induced DSBs into four groups according to which two ends were ligated together (Fig. 1a): (1) group I for joining the distal ends of two DSBs induced by I-SceI or Cas9; (2) group II for rejoining two ends only at the first target site for I-SceI or Cas9; (3) group III for rejoining two ends only at the second target site for I-SceI or Cas9; (4) group IV for rejoining two ends respectively at each of two I-SceI or Cas9 target sites. In groups II–IV, products of accurate NHEJ were indistinguishable from the uncut templates and could not be counted. In group I, however, if distal ends were compatible or directly ligatable, accurate NHEJ occurred with deletion of the intervening DNA sequence between the two breakage sites and was therefore distinguishable and measurable. After normalization with transfection efficiencies, indicating the percentages of cells expressing I-SceI or Cas9, the level of Cas9-induced NHEJ was clearly higher than that of I-SceI-induced NHEJ (Fig. 1b). This is likely due to more efficient cleavage by Cas9. Group I events representing joining of distal ends of paired DSBs accounted for about 50% of I-SceI-induced NHEJ and over 65% of Cas9-induced NHEJ (Fig. 1c). While over 80% of group I events in I-SceI-induced NHEJ were accurate NHEJ, nearly 70% of Cas9-induced group I events were accurate (Fig. 1d). This suggested that Cas9-induced DSBs are more likely repaired by accurate NHEJ than mutagenic NHEJ.
To further test whether Cas9-induced DSBs are indeed repaired preferentially by accurate NHEJ, we designed paired gRNAs targeting an additional 70 genome sites in mouse and human cells and analyzed NHEJ repair of paired gRNA-guided Cas9-induced DSBs by deep sequencing (Additional file 1: Table S1). As the distance between two DSBs may have a suppressive effect on the accuracy and the efficiency of NHEJ [22, 29, 30], we restricted the distance between paired DSBs induced by Cas9 to less than 150 bp, with one exception of 1469 bp, to assess NHEJ of Cas9-induced DSBs, including accurate and mutagenic NHEJ. Among the 70 sites out of 71 analyzed, editing efficiencies varied significantly between 6.52% and 94.99% (mean ± standard deviation (SD), 35.87% ± 21.96%), group I between 1.82% and 96.01% (mean ± SD, 58.53% ± 20.80%), and accurate NHEJ between 0.00% and 94.05% (mean ± SD, 47.33% ± 27.59%) (Fig.1e). In particular, 49 out of 71 sites had accurate NHEJ over 30% of group I events and these accurate NHEJ products were also the most frequent single events in group I (Fig. 1e and Additional file 1: Table S1). Even among the 22 sites with a frequency lower than 30%, accurate NHEJ remained the most frequent single NHEJ events in four sites (Fig. 1e and Additional file 1: Table S1, S2). The difference was small in the efficiencies of accurate NHEJ between human cells and mouse ES cells (Additional file 2: Figure S1). Taken together, repair of Cas9-induced DSBs is inherently biased towards accurate NHEJ in both mouse and human cells with a frequency at about 50%.
We also performed three independent experiments to analyze NHEJ repair of paired DSBs induced by Cas9-gRNA on 4 genome sites, one at PIF1 exon 1 of mouse ES cells (mPIF1), one at LDHA intron 5 of mouse ES cells (mLDHA), and two on p53 exon 3 of human HEK293 cells (hp53#2 and hp53#3) (Additional file 1: Table S1). The editing efficiencies, i.e. the efficiencies of Cas9-induced NHEJ, were normalized with transfection efficiencies and varied from 14.41% to 81.80% among these 4 sites (Fig. 1f). These variations were likely due to the difference in either induction or repair of paired DSBs at these sites. Consistently, majority (71.70–95.81%) of Cas9-induced NHEJ was Group I events (Fig. 1g), in which accurate NHEJ accounted for 47.66%, 57.62%, 70.38% and 94.09% respectively (Fig. 1h). These data suggested that repair of Cas9-induced DSBs is inherently accurate.
Accurate NHEJ is hindered by frequent + 1 templated insertions
Because the efficiency of accurate NHEJ at 22 out of 71 sites analyzed was less than 30% of group I events (Fig. 1e and Additional file 1: Table S1), we wondered why accurate NHEJ was inefficient in these cases. Examination of repair junctions revealed that + 1 insertions with no additional mutations were frequent and nearly all of them were templated, not random (Additional file 1: Table S3 and Additional file 2: Figure S2), confirming the findings of previous work [31, 32]. In fact, + 1 and + 2 templated insertions with no additional mutations occurred frequently (up to 85.83% of group I events) and were correlated inversely with accurate NHEJ (Fig. 2a). Over 10% of group I events were + 1 templated insertions in 36 out of 71 sites, among which + 2 templated insertions also accounted for over 10% of group I events in eight sites (Fig. 2a and Additional file 1: Table S3). The + 1 or + 2 templated insertions had frequencies over 30% of group I even at 11 sites and were the most frequent at 14 sites (Additional file 1: Tables S2 and S3).
Previous work has proposed that Cas9 cleavage could generate not only blunt ends but also 1- or 2-nucleotide (nt) 5′-overhanging ends because the RuvC nuclease domain of Cas9 could cleave at the fourth or fifth base upstream of a PAM in addition to the third [3, 31,32,33]. These 5′-overhanging ends are filled in by DNA polymerases, thus producing + 1 or + 2 templated insertions . The observed high levels of + 1 or + 2 templated insertions indicate that Cas9 cleavage can generate 1- or 2-nt 5′-overhanging ends frequently. However, at individual breakage sites of paired DSBs, the 1- or 2-nt 5′-overhanging ends are complementary and, like blunt ends, can be rejoined readily and accurately, thus preventing templated insertions. In contrast, distal ends of paired DSBs with 1- or 2-nt 5′-overhangs may be incompatible for accurate end joining, and filling in these ends by DNA polymerases can thus generate templated insertions in group I. Consequently, the frequency of accurate end joining in group I may underestimate the true frequency of accurate NHEJ in repair of Cas9-induced DSBs. To better reflect accurate NHEJ of Cas9-induced DSBs, we combined accurate end joining in group I with templated insertion events that had no additional indels and termed them “Accurate+TI”. Compared to the frequency of accurate NHEJ—49.74% on average—the average frequency of “Accurate+TI” was 66.88%, indicating that accurate NHEJ of Cas9-induced DSBs might be underestimated by 17.14% (Fig. 2b). Because accurate rejoining of blunt ends or complementary 5′-overhanging ends at individual breakage sites provides templates for recleavage by Cas9, we speculated that this accurate repair would increase the simultaneous cleavage by paired gRNA-guided Cas9, promoting group I NHEJ. Indeed, the frequency of group I events was positively correlated with that of Accurate+TI (Fig. 2c).
The stable binding of Cas9 to target DNA is directed by the position of the PAM (i.e., NGG) on either the Watson strand (W) or the Crick strand (C). The orientation of paired PAMs for paired Cas9-gRNAs has four different combinations: W/W, W/C, C/W and C/C (Fig. 2d). We wondered whether the difference in paired PAM orientations would affect the efficiency of accurate NHEJ. As revealed, while the difference was not significant in the efficiency of accurate NHEJ between W/W, C/C, and C/W, accurate NHEJ induced by W/C was more efficient than that by W/W, C/C, and C/W (one-way ANOVA, P < 0.0001; post hoc pairwise comparison, P < 0.05; Fig. 2e). However, when accurate NHEJ was combined with templated insertions (i.e., “Accurate+TI”), the difference induced by W/C was lost (Fig. 2e). This implied that W/C and the other three combinations of PAM orientations might induce different frequencies of templated insertions. In fact, W/C generated few templated insertions whereas W/W, C/W, and C/C gave rise to frequent + 1 or + 2 templated insertions at many sites (one-way ANOVA, P < 0.0001; post hoc pairwise comparison, P < 0.005 for W/C vs others and P > 0.05 for other comparisons; Fig. 2f). This could be predicted according to the position of simultaneous Cas9 cleavage that deletes the intervening sequence and generates 1-nt 5′-overhanging ends (Additional file 2: Figure S3). Nearly all of + 1 templated insertions were as predicted (Additional file 2: Figure S4). This suggested that the W/C orientation, not W/W, C/W, or C/C, could avoid the interference of + 1 and + 2 templated insertions on accurate NHEJ.
To determine whether the distance between paired DSBs induced by Cas9 affects the efficiency of accurate NHEJ, we performed correlation analysis on the data from all 71 sites. The frequencies of accurate NHEJ or Accurate+TI were not correlated with the distance between paired DSBs (Fig. 2g). Given that 70 sites out of 71 were packed within the range 23–148 bp, this indicated that this distance has no effect on accurate NHEJ whether or not accurate NHEJ is combined with templated insertions. To exclude the possible interference of the PAM orientations in the analysis, we assessed the data only with the W/W orientation of the PAMs. Consistently, we did not find any significant effect of the distance on accurate NHEJ or Accurate+TI (Fig. 2h). Therefore, it appears that accurate NHEJ is not affected by the distance between paired DSBs within the range 23–148 bp.
Validation of a reporterless NHEJ assay in cells and in vivo
As described above, in the repair of two adjacent DSBs induced simultaneously by paired Cas9-gRNAs, accurate NHEJ and mutagenic NHEJ could be distinguished and measured. This provides a convenient method to analyze NHEJ, including both accurate and mutagenic NHEJ in cells and in vivo, without using an NHEJ reporter. To further validate this approach, we selected the LDHA locus and the ROSA26 locus (mROSA26#1) that were analyzed before in mouse ES cells (Additional file 1: Table S1) and examined the effect of XRCC4 loss on Cas9-induced NHEJ. As a core NHEJ factor for end ligation during NHEJ, XRCC4 may also control end ligation for Cas9-induced NHEJ. We thus determined whether XRCC4 regulates Cas9-induced NHEJ in the same way as in our previous NHEJ reporter assay with I-SceI . Due to precise deletions of the 57-bp or 33-bp intervening sequence between paired Cas9-gRNA target sites at the LDHA and ROSA26 loci, respectively, two major bands were separated by DNA gel electrophoresis for PCR products of the target DNA after repair (Fig. 3a). The upper band was expected to contain the unedited and edited targets with small indels and the lower band most likely the edited target with deletion of the intervening sequence. The intensity of the lower band relative to the upper band reflected the efficiency of simultaneous cleavage by paired Cas9-gRNAs, providing a rapid and convenient method to identify efficient paired gRNAs (Fig. 3a).
After induction of paired DSBs on the LDHA site or the ROSA26 site in isogenic XRCC4+/+ and XRCC4−/− mouse ES cells and subsequent repair, we performed Illumina amplicon sequencing of repair junctions. Examination of repair junctions revealed that the overall editing efficiency decreased from 47.70 ± 25.10% in XRCC4+/+ cells to 23.57 ± 23.93% in XRCC4−/− cells at the LDHA site and from 29.50 ± 4.15% in XRCC4+/+ cells to 12.56 ± 7.18% in XRCC4−/− cells at the ROSA26 site (Fig. 3b). Given the difference in the expression levels of gRNA and Cas9 and the Cas9 cleavage efficiency between XRCC4+/+ and XRCC4−/− cells and between experiments, the editing efficiencies were normalized with the transfection efficiencies of either cells in each independent experiment as described previously . As the normalized editing efficiency reflects the efficiency of overall NHEJ, its reduction in XRCC4−/− cells suggested that XRCC4 facilitates NHEJ in repair of Cas9-induced DSBs as in repair of I-SceI-induced DSBs. In edited products, group I events were also reduced from 62.06 ± 2.34% in XRCC4+/+ cells to 36.68 ± 11.81% in XRCC4−/− cells at the LDHA site and from 46.70 ± 4.53% in XRCC4+/+ cells to 16.47 ± 3.92% in XRCC4−/− cells at the ROSA26 site (Fig. 3c). This indicated that XRCC4 biases NHEJ towards group I. Accurate NHEJ of Cas9-induced DSBs was also dramatically reduced from 57.62 ± 4.88% in XRCC4+/+ cells to 5.47 ± 4.23% in XRCC4−/− cells at the LDHA site and from 57.12 ± 3.20% in XRCC4+/+ cells to 8.62 ± 9.72% in XRCC4−/− cells at the ROSA26 site (Fig. 3d). This is in line with previous observations that XRCC4 promotes accurate NHEJ of I-SceI-induced DSBs [11, 15].
Furthermore, compared to XRCC4+/+ cells, XRCC4−/− cells were biased towards longer deletion length (Fig. 3e; Mann-Whitney test, P < 0.0001). The median length of group I deletion in XRCC4−/− cells was 64 bp (i.e., 7 bp + 57-bp pop-out) at the LDHA site and 45 bp (i.e., 12 bp + 33-bp pop-out) at the ROSA26 site, longer, respectively, than 57 bp and 33 bp in XRCC4+/+ cells (Fig. 3e). The XRCC4 deficiency caused more frequent mutagenic NHEJ events with deletions of over 63 bp (i.e., 6 bp + 57-bp pop-out) at the LDHA site (93.38% vs 69.95% between XRCC4−/− and XRCC4+/+ cells; χ2 test, P < 0.0001) and over 39 bp (i.e., 6 bp + 33-bp pop-out) at the ROSA26 site (88.69% vs 58.55% between XRCC4−/− and XRCC4+/+ cells; χ2 test, P < 0.0001), but less with deletions of 58–63 bp (i.e., 1–6 bp + 57-bp pop-out) at the LDHA site (6.62% vs 30.05% between XRCC4−/− and XRCC4+/+ cells; χ2 test, P < 0.0001) and 34–39 bp at the ROSA26 site (i.e., 1–6 bp + 33-bp pop-out) (11.31% vs 41.45% between XRCC4−/− and XRCC4+/+ cells; χ2 test, P < 0.0001) (Fig. 3f and inset). These data demonstrate that XRCC4 promotes accurate NHEJ and suppresses mutagenic NHEJ in repair of Cas9-induced DSBs.
Previous studies have shown that loss of XRCC4 promotes increased use of microhomologies (MH) in NHEJ [11, 15, 34]. We thus examined the use of MH in Cas9-induced NHEJ and observed that the frequency of NHEJ with no MH at repair junctions was much higher in XRCC4+/+ cells than in XRCC4−/− cells (Fig. 3g), indicating increased use of MH in Cas9-induced NHEJ in the absence of XRCC4. Especially, the use of 3-bp MH increased from 40.50% and 21.80% in XRCC4+/+ cells to 64.95% and 31.30% in XRCC4−/− cells at the LDHA and ROSA26 sites, respectively (Fig. 3g), whereas the use of 1-bp MH decreased from 31.25% and 31.17% in XRCC4+/+ cells to 16.68% and 16.32% in XRCC4−/− cells. Notably, due to the presence of 3-bp MH (i.e., GTG at the LDHA site and CAC at the ROSA26 site), the frequencies of 64-bp deletions (i.e., 7 bp + 57-bp pop-out) at the LDHA site and 42-bp deletions (i.e., 9 bp + 33-bp pop-out) at the ROSA26 site were quite high in XRCC4+/+ cells and were further increased in XRCC4−/− cells (Fig. 3g).
To test whether this reporterless NHEJ assay can be applied in vivo, we delivered the expression plasmids for paired Cas9-gRNAs targeting the LDHA site into two mice through hydrodynamic tail vein injection, harvested livers 30 days post-injection, and analyzed accurate and mutagenic NHEJ (Fig. 4a). In four pieces of mouse liver tissue, two each mouse, the editing efficiencies ranged from 1 to 4%, much lower than that in cultured cells due to inefficient delivery of plasmids into mouse liver (Fig. 4b). More than 80% of the edited events were group I NHEJ products (Fig. 4c) and approximately a half of group I were accurate NHEJ (Fig. 4d). This indicated that accurate NHEJ is also innately favored in mouse liver in repair of Cas9-induced DSBs. Further analysis of group I events revealed little difference in the deletion pattern (Fig. 4e, f) and in the MH usage in Cas9-induced NHEJ among four liver samples (Fig. 4g). Moreover, the frequency of accurate NHEJ, the deletion pattern, and the MH usage in mouse liver were similar to those in cultured mouse ES cells (Fig. 4d–g). Therefore, the reporterless NHEJ assay described here could be used as a convenient, flexible, and accurate method to analyze both accurate and mutagenic NHEJ in cells and in vivo.
Precise out-of-frame deletion mediated by accurate NHEJ improves gene knockout
Among major applications of CRISPR/Cas9 genome editing, gene knockouts are originated mostly from heterogeneous out-of-frame indels induced during NHEJ repair of Cas9-induced DSBs [1, 2]. Given efficient generation of precise deletions of defined length by accurate NHEJ via paired Cas9-gRNAs, we wondered whether accurate NHEJ could be harnessed to improve the efficiency and homogeneity of gene knockouts. Thus, we first compared the efficiency of out-of-frame indels induced by this paired Cas9-gRNA method involving accurate NHEJ (termed “Paired”) with the common dual gRNA-guided editing approach (termed “Common”), which is represented by the paired Cas9-gRNA method excluding group I NHEJ. Group I NHEJ alone mimicked an ideal paired Cas9-gRNA method (termed “Ideal”), where all of Cas9 cleavage guided by paired gRNAs is concurrent, thereby editing the genome all by group I NHEJ (Fig. 5a). Analysis revealed that the frequency of out-of-frame mutations induced by the Paired approach and by the Ideal approach, but not by the Common approach, was positively correlated with the frequency of accurate NHEJ (Fig. 5b). With accurate NHEJ at a frequency over 29.8%, both the Paired and the Ideal approaches generated out-of-frame mutations more efficiently than the Common method, and the more efficient accurate NHEJ was, the better improvement in the efficiency of out-of-frame mutations (Fig. 5b and Additional file 2: Figure S7). For example, with the frequency at 68.36%, accurate NHEJ increased the knockout efficiency from 65.52% to 88.51% at a site of the mouse Artemis locus (Fig. 5b and Additional file 1: Table S4). By contrast, the out-of-frame editing efficiency tended to be reduced by accurate NHEJ at a level below 30% (Fig. 5b).
Due to the inverse relationship between accurate NHEJ and frequent + 1 templated insertions, we speculated that frequent + 1 templated insertions might reduce the out-of-frame editing frequency. In fact, the frequencies of + 1 templated insertions varied inversely with those of out-of-frame mutations (Additional file 2: Figure S8). Importantly, with deletions of predefined 3n + 1 bp, the frequencies of + 1 templated insertions were strongly and negatively correlated with the frequencies of out-of-frame mutations induced by Ideal and by Paired methods (Additional file 2: Figure S6). When the frequency of + 1 templated insertions was less than 20.5%, accurate NHEJ resulting in precise deletions of defined 3n + 1-bp length was able to improve the out-of-frame frequency, and the less the frequency of + 1 templated insertions, the better the improvement (Additional file 2: Figure S6). In contrast, no correlation was observed between + 1 templated insertions and deletions of predefined 3n + 2 bp or out-of-frame mutations induced by the Common approach (Additional file 2: Figure S6). This indicated that + 1 templated insertions might convert out-of-frame deletions of 3n + 1 bp, but not 3n + 2 bp, into in-frame deletions of 3n bp, thus reducing the out-of-frame efficiency. As a result, the out-of-frame editing efficiency was improved by deletions of predefined 3n + 2 bp via accurate NHEJ, but not by 3n + 1 bp, in both Ideal and Paired approaches when + 1 templated insertions were frequent (Fig. 5c). Among 20 genome sites edited with deletions of predefined 3n + 2 bp, the out-of-frame efficiency was increased, on average, from 67.76% to 86.48% in the Ideal method (Student’s paired t-test, P < 0.0001) and from 67.76% to 79.26% in the Paired approach (Student’s paired t-test, P < 0.0001) (Fig. 5c). This suggested that the distance between paired Cas9-gRNA cleavage sites should be preset preferably at 3n + 2 bp rather than 3n + 1 bp in order to improve the efficiency of out-of-frame mutations, including gene knockouts. Only when the frequency of templated insertions is low could 3n + 1 bp be useful.
To validate the application of accurate NHEJ induced by paired Cas9-gRNAs in out-of-frame mutations, we knocked out the MDC1 gene in mouse ES cells by targeting exon 2 of MDC1 (mMDC1#2) with paired Cas9-gRNA as an example (Fig. 5d and Additional file 1: Table S1). Between two cleavage sites was a 52-bp intervening sequence. We transfected cells with the expression plasmids for single gRNA or paired gRNAs, along with Cas9, isolated genomic DNA at 72 h post-transfection, and amplified the target sequence by PCR. Compared to single gRNA or the empty vector control, transfection with paired gRNAs g2 and g3 generated a weaker 350-bp upper band and a much stronger 300-bp lower band (Fig. 5d), indicating that these two gRNAs were not only individually effective but also highly efficient in inducing concurrent Cas9 cleavage. Further analysis of the repair junctions revealed that the editing efficiencies of single gRNAs and paired gRNAs were about 80% and 96.20%, respectively, after normalization with transfection efficiencies (Fig. 5e). Among these editing efficiencies, the knockout frequency was higher with paired gRNAs at 70.4% than single gRNAs at 50–60% (Fig. 5e). In the edited events, 67.74% were group I events, confirming that concurrent Cas9 cleavage guided by paired gRNAs was highly efficient (Fig. 5f). Notably, accurate NHEJ accounted for 44.83% of group I events (Fig. 5g).
Because paired Cas9-gRNA induced a higher frequency of MDC1 knockouts, we predicted that paired gRNAs should be more effective in disrupting the function of MDC1 than single gRNA. We thus performed homologous recombination (HR) assays with MDC1 gRNA in HR reporter mouse ES cells previously established . In this HR reporter, the frequency of I-SceI- or Cas9-induced GFP+ cells reflects the relative efficiency of HR (Fig. 5h). We found that single gRNA reduced I-SceI- or Cas9-induced HR by about 25% and paired gRNAs by nearly a half compared to the empty vector negative control (Fig. 5i, j). This indicated that paired gRNAs are more effective in inactivating MDC1 than single gRNA.
Next, we applied paired gRNAs to generate MDC1 knockout clones. After transfection of mouse HR reporter ES cells with the expression plasmids for paired gRNAs and Cas9, these cells were sparsely plated and grown without any selection. In 2 weeks, we randomly picked 35 clones and examined by PCR and Sanger sequencing the target sites. MDC1 was edited in all these 35 clones; 18 were heterozygotes and 17 homozygotes (Fig. 5k). Among these homozygotes, eight contained precise 52-bp deletions mediated by accurate NHEJ in both alleles and were therefore knockout clones (Fig. 5k). Of the remaining nine clones, four were knockout clones carrying additional mutations besides deletion of the 52-bp intervening sequence, and five had in-frame mutations (Fig. 5k). It was apparent that the heterogeneity of indels was low in MDC1 editing by paired Cas9-gRNAs. MDC1 knockout was further confirmed by western blotting with anti-MDC1 antibody (Fig. 5l). We performed HR assays in three MDC1 knockout clones that have a precise 52-bp deletion on both alleles and parental MDC1 wild-type cells. These MDC1 knockout clones have a lower level of I-SceI-induced HR, approximately 0.5%, compared to parental cells at 1.8% (Fig. 5m). These results demonstrated that accurate NHEJ in the paired Cas9-gRNA method can improve the efficiency and reduce the heterogeneity of gene knockouts with precise deletions of defined length in CRISPR/Cas9 genome editing.
Targeted in-frame deletion mediated by accurate NHEJ assists functional domain analysis in situ
To evaluate whether paired Cas9-gRNAs can generate efficient precise in-frame deletion mediated by accurate NHEJ, we first examined the repair junction data from 20 genome sites and determined whether accurate NHEJ could indeed increase in-frame mutations induced by the Ideal or the Paired method (Additional file 1: Table S4). The data showed that the frequency of in-frame indels was strongly and positively correlated with the frequency of accurate NHEJ that generated precise deletions of 3n bp (Fig. 6a and Additional file 1: Table S4). Compared to the Common approach, the frequency of in-frame indels was increased from 31.43% to 59.88% on average by the Ideal method (Student’s paired t-test, P < 0.0001) and from 31.43% to 49.29% by the Paired method (Student’s paired t-test, P < 0.001) (Additional file 2: Figure S9). This suggests that the more efficient accurate NHEJ is, the better the improvement in the in-frame editing efficiency.
To further validate the application in targeted in-frame deletions, we designed two sets of paired gRNAs to target exon 18 and exon 21 of the DNA damage response gene 53BP1 for precise in-frame deletions mediated by accurate NHEJ. We generated two 53BP1 mutant clones, which were confirmed by PCR and Sanger sequencing. One had precise biallelic deletion of the predefined 60-bp intervening sequence in the oligomerization domain (ΔOD) and the other precise biallelic deletion of the predefined 54-bp intervening sequence in the Tudor domain (ΔTudor) (Fig. 6b). These two small, in-frame deletions do not affect the size and the steady-state level of truncated 53BP1 proteins (Fig. 6c), suggesting that truncated 53BP1 is fully translated and stable. However, when we treated mouse ES cells with ionized radiation (IR) of 3 Gy, no 53BP1 foci were detected in the 53BP1 ΔTudor and ΔOD clones, while wild-type 53BP1 formed IR-induced nuclear foci (Fig. 6d). This is consistent with previous findings that both OD and Tudor domains are important for IR-induced 53BP1 focus formation [35,36,37]. IR-induced γH2AX foci were normally formed in both the 53BP1 wild-type clone and mutant clones as expected (Fig. 6d).
53BP1 is thought to protect DSBs from end resection in G1 phase of the cell cycle and antagonize BRCA1 in S and G2 [37, 38]. We reasoned that the inability of 53BP1ΔTudor and 53BP1ΔOD to bind the damaged chromatin would limit the end-protection function of 53BP1, thus increasing end resection of DSBs, including Cas9-induced DSBs, and affecting the efficiency and the accuracy of NHEJ. Thus, using our reporterless NHEJ assay described above, we analyzed repair junctions of the LDHA target site in the 53BP1ΔTudor clone and the 53BP1ΔOD clone as well as 53BP1 wild-type mouse ES cells. Disruption of either Tudor or OD had little effect on the normalized editing efficiency (Fig. 6e) and the frequency of group I events (Fig. 6f). There was a small but insignificant difference in the frequency of accurate NHEJ between wild-type cells and the 53BP1ΔTudor clone or the 53BP1ΔOD clone (Fig. 6g). However, deletions were shifted towards modestly longer length in 53BP1ΔTudor cells and in 53BP1 ΔOD cells (Fig. 6h; Mann-Whitney test, P < 0.0001). The median length of group I deletion was 59 bp in 53BP1ΔTudor cells and 58 bp in 53BP1ΔOD cells, 1–2 bp longer than 57 bp in 53BP1+/+ cells (i.e., 57-bp pop-out; Fig. 6h).
Further examination of the combined data from three independent experiments revealed that the level of accurate NHEJ was lower in both 53BP1ΔTudor and 53BP1ΔOD cells than in wild-type parental cells (Fig. 6i and inset; χ2 test, P < 0.0001). Both 53BP1ΔTudor and 53BP1ΔOD cells generated less mutagenic NHEJ events with deletions of 58–63 bp (i.e., 1–6 bp + 57-bp pop-out), 15.69% and 16.06%, respectively, vs 24.24% in 53BP1 wild-type cells (Fig. 6i and inset; χ2 test, P < 0.0001). In contrast, deletions of over 63-bp (i.e., 6 bp + 57-bp pop-out) were more frequent in 53BP1ΔTudor cells at 84.31% and in 53BP1ΔOD cells at 83.94% than 53BP1 wild-type cells at 75.76% (Fig. 6i and inset; χ2 test, P < 0.0001). However, the MH usage in mutagenic NHEJ was not significantly altered between 53BP1 wild-type cells and 53BP1 mutant cells (Additional file 2: Figure S10). This suggested that 53BP1 may suppress end resection at close ends of Cas9-induced DSBs by locating to the damaged chromatin, thus biasing NHEJ away from longer deletions. These results also demonstrate that targeted in-frame deletions mediated by accurate NHEJ can be harnessed to efficiently and accurately disrupt a particular motif or domain of a protein for functional analysis.
The Plk3 inhibitor GW843682X enhances accurate NHEJ in repair of Cas9-induced DSBs
As accurate NHEJ could improve the CRISPR/Cas9 genome editing that requires precise deletions of defined length, it is beneficial to find a method to enhance accurate NHEJ in this application. As CtIP-dependent end resection is a key step in NHEJ and tends to repress accurate end joining [39,40,41], we wondered whether inhibition of CtIP-dependent end resection could elevate accurate NHEJ. We first examined the effect of CtIP depletion on overall NHEJ using our mouse ES cells containing the sGEJ reporter . Depletion of CtIP by siRNA increased both I-SceI- and Cas9-induced NHEJ (Fig. 7a, b). Previous studies have shown that CtIP-dependent end resection was activated via CtIP phosphorylation by polo-like kinase 3 (Plk3) and could be inhibited by the Plk3 inhibitor GW843682X [39, 42]. We thus treated the sGEJ reporter cells with GW843682X at concentrations of 1, 3, and 5 μM in our NHEJ assays and observed that Cas9-induced NHEJ was stimulated compared to the mock treatment with DMSO (Fig. 7c).
To investigate whether the editing efficiency and the frequency of accurate NHEJ were increased by GW843682X, we performed our reporterless NHEJ assay at the LDHA site with paired Cas9-gRNAs. The normalized editing efficiency was elevated by GW843682X (Fig. 7d). In the edited products, group I events were increased from 68.78% ± 12.72% with DMSO to 81.87% ± 8.48% with the inhibitor (Fig. 7e). Importantly, GW843682X promoted accurate NHEJ in group I (Fig. 7f). The enhancement in both overall editing and accurate NHEJ indicated that GW843682X could be used to improve the genome editing requiring precise deletions of defined length mediated by accurate NHEJ.
Detailed analysis of repair junctions further revealed that deletions were shifted towards shorter length in cells treated with GW843682X, although the median length remained at 57 bp in all treatments, including DMSO (Fig. 7g; Mann–Whitney test, P < 0.0001). Furthermore, GW843682X enhanced accurate NHEJ and reduced mutagenic NHEJ. The frequency of accurate NHEJ was elevated to 69.62%, 73.78%, and 74.70%, respectively, by 1, 3, and 5 μM GW843682X from 58.64% with DMSO (Fig. 7h and inset; χ2 test, P < 0.0001). However, treatment with GW843682X at 1, 3, and 5 μM promoted deletions of 58–63 bp (i.e., 1–6 bp + 57-bp pop-out) with the frequencies at 49.06%, 53.78%, and 51.51%, respectively, vs 30.74% with the DMSO treatment (Fig. 7h and inset; χ2 test, P < 0.0001). In contrast, the frequencies of mutagenic NHEJ over 63 bp (i.e., 6 bp + 57-bp pop-out) were reduced to 50.94%, 46.22%, and 48.49% by different concentrations of GW843682X from 66.26% with the DMSO control (Fig. 7h and inset; χ2 test, P < 0.0001). These results suggest that the Plk3 inhibitor GW843682X might inhibit CtIP-dependent end resection, promoting both overall NHEJ and accurate end joining in repair of Cas9-induced DSBs, and could be used to improve genome editing that requires precise deletions of defined length mediated by accurate NHEJ.
The NHEJ process was thought to be error prone partly due to end trimming by active DNA nucleases and polymerases prior to end ligation in mammalian cells [8,9,10]. Evolutionarily, end trimming is essential to deal with modified or incompatible ends of DSBs induced by ionized radiation and radiomimetic chemicals, ensuring ends are ligatable but inducing indels [8,9,10]. In CRISPR/Cas9-mediated genome editing, many applications are dependent upon repair of Cas9-induced DSBs by mutagenic NHEJ, which generates heterogeneous indels at repair junctions [7, 32]. This gave an impression that NHEJ is also error prone in repair of Cas9-induced DSBs. However, for directly ligatable ends, such as those generated by Cas9 as well as I-SceI, end trimming may be unnecessary and even intrinsically suppressed in mammalian cells . As a result, accurate NHEJ may occur at high frequencies. Indeed, using the paired Cas9-gRNA approach that was able to distinguish accurate NHEJ from unedited targets, we detected a high frequency of accurate NHEJ in the repair of Cas9-induced DSBs, indicating that NHEJ is inherently accurate for repairing Cas9-induced DSBs.
In order to analyze NHEJ that includes accurate and mutagenic end joining, a common approach is to use artificial NHEJ reporter systems integrated in the genome [43, 44]. However, this approach requires reporters whose development and integration into the genome are laborious. Moreover, NHEJ analysis is restricted to the site on the reporter. As both accurate and mutagenic NHEJ can be analyzed and measured at intended sites in cells and in vivo by the paired Cas9-gRNA approach presented here, this provides an alternative but flexible and reporterless approach for NHEJ assays. This method was validated in XRCC4−/− mouse ES cells and in mouse liver and could be further expanded to different genomic loci, organs, development stages, and genetic background for analysis of NHEJ, whether accurate or mutagenic.
In addition to frequent use of accurate NHEJ, joining distal ends of adjacently paired Cas9-induced DSBs has several other distinct features. First, + 1 and + 2 templated insertions occur frequently. Previous studies have shown that Cas9 generated mainly blunt ends but sometimes also ends with 5′-overhangs, especially 1- and 2-nt 5′-overhangs [3, 22, 31,32,33]. It was proposed that these 5′-overhanging ends could be filled in by DNA polymerases, resulting in templated insertions . At a single Cas9-induced DSB, these 5′-overhanging ends are complementary and can be accurately rejoined without any fill-in by DNA polymerases. In contrast, distal ends with 5′-overhangs in paired DSBs may not be compatible for accurate NHEJ, leading to more frequent templated insertions and underestimation of accurate NHEJ in repair of single Cas9-induced DSB. However, by placing paired Cas9-gRNAs in the W/C orientation of the PAMs, not in W/W, C/W, and C/C, these frequent + 1 and + 2 templated insertions can be avoided. This provides a strategy to increase accurate NHEJ in genome editing.
Second, the distance between paired DSBs is flexible but has little effect on the occurrence of accurate NHEJ within the range of 23–148 bp. Prior work has indicated that the distance between paired DSBs influences the efficiency of accurate NHEJ when the distance is over 1 kb [22, 29, 30]. In this study, we focused on distances ranging from 23 to 148 bp with the exception of 1469 bp and detected little distance effect on accurate NHEJ. Because adjacently paired Cas9 on a target should not overlap with each other in order to prevent inhibition of simultaneous Cas9 cleavage, the minimum length of precise deletions by the paired Cas9-gRNA method could be 12 bp with the W/C orientation, 20 bp with C/C and W/W, and 34 bp with C/W.
Lastly, the frequencies of + 1 and + 2 templated insertions vary significantly at different genome sites. While this variation can be influenced by the orientations of the paired PAMs and possibly the distance between two joining ends, it can still dramatically differ even with the same PAM orientations and similar distance between paired DSBs. For example, at two sites of human HPRT locus (hHPRT#4 and #9; Additional file 1: Table S3), which carry the same W/W orientation and the same distance of 85 bp, one has a frequency of + 1 templated insertions without any additional mutations of 57.52% and the other 14.34%. This suggests that there exist additional but unknown determinants for highly frequent template insertions. Nevertheless, templated insertions may provide a target for improving the efficiency and the accuracy of CRISPR/Cas9-mediated genome editing.
As precise genomic deletions of defined length at a target are an important application in genome editing, a high level of accurate NHEJ would improve such an application. Previously, targeted genomic deletions via two concurrent DSBs have been used in generating disease models and developing gene therapies for human diseases as well as in studying coding and non-coding elements at individual genome loci or in a high-throughput fashion [4, 23, 45,46,47,48,49,50,51,52,53,54,55,56,57]. However, while these targeted genomic deletions were long and many were precise, accurate NHEJ was not systematically analyzed nor harnessed to improve these applications. The paired Cas9-gRNA method presented here can generate precise genomic deletions of defined length at high frequencies and has the capacity to increase the efficiency of gene knockouts and targeted in-frame deletions. The extent of the increase is usually higher in targeted in-frame deletions than in gene knockouts because the basal level is already high at about 66.7% in out-of-frame mutations even without engaging accurate NHEJ. Nevertheless, the heterogeneity of mutations is significantly reduced in both applications since the majority of deletions in these applications are precise deletions of defined “out-of-frame” or “in-frame” length.
This method can also be used for knock-in via precise blunt end ligation with exogenous DNA fragments in place of the excised sequence as done previously [17, 21, 58]. Similar to improvement of gene knockouts and targeted in-frame deletions, we could further improve the efficiency of precise knock-in by choosing the W/C orientation of paired PAMs to avoid frequent + 1 and + 2 templated insertions and by using the Plk3 inhibitor GW843682X to enhance accurate NHEJ. As the PAM compatibility is expanded, more sites at a target could be selected for the paired Cas9-gRNA approach, providing flexibility in choosing paired gRNAs according to the predefined deletion length, the PAM orientation, the efficiency of concurrent Cas9 cleavage, and even the efficiency of accurate NHEJ directly [24,25,26]. Expanded compatibility of PAM will further broaden applications of this method, which is a valuable addition to the CRISPR/Cas9 toolbox in genome editing.
NHEJ is inherently accurate in the repair of Cas9-induced DSBs, leading to a high frequency of accurate NHEJ. This can be exploited by use of paired Cas9-gRNAs to improve genome editing that requires precise deletions of defined length, such as gene knockouts and targeted in-frame deletions. In this paired Cas9-gRNA method, the frequency of accurate NHEJ is hindered by frequent + 1 and + 2 templated insertions. Based on the findings, several rules have been devised to promote accurate NHEJ for genome editing (Additional file 2: Figure S11). These rules include preferentially selecting the W/C orientation of paired PAMs to prevent templated insertions, choosing the distance of predefined 3n + 2 bp between paired DSBs for efficient gene knockouts, and using the Plk3 inhibitor GW843682X to inhibit end resection by CtIP. With expanded compatibility of PAMs, applications of this strategy can be further broadened in genome editing and synthetic biology that require precise deletions of variable length. In addition, the paired Cas9-gRNA method provides a convenient, flexible, and reporterless approach to analyze both accurate and mutagenic NHEJ in cells and in vivo. By overcoming the limitations present in NHEJ reporters, this method can be used to analyze NHEJ at different loci and development stages and in different organs and genetic backgrounds.
Expression plasmids for gRNAs were constructed from the pU6-gRNA vector as described previously . The Cas9 plasmid pX330 was originally obtained from Addgene (catalog number 42230). The pcDNA3β-based expression vectors for I-SceI and GFP were as described before . The gRNA target sequences are listed in Additional file 1: Tables S5. Newly constructed plasmids were confirmed by Sanger sequencing.
Mouse ES cells containing the sGEJ reporter, the BGN reporter, and the HR reporter were previously established and cultured as described before [11, 13, 28]. Isogenic XRCC4+/+ and XRCC4−/− mouse ES cells containing the sGEJ reporter were also previously generated . Human HEK293 and U2OS cells were cultured in DMEM containing 10% fetal bovine serum, 1% penicillin-streptomycin, and 2 mM L-glutamine. For Cas9-mediated MDC1 knockout and 53BP1 targeted in-frame deletions, 2 × 105 ES cells were transfected with the expression plasmids for paired gRNAs and Cas9 in a 24-well plate using Lipofectamine 2000 (Invitrogen) as previously described , and were seeded on mouse embryonic fibroblast (MEF) feeder cells for single clones without antibiotic selection in a 10-cm plate at 72 h post-transfection. In about 2 weeks, knockout clones and clones with targeted in-frame deletions were picked, grown, and verified by PCR along with Sanger sequencing and western blotting. Primers are listed in Additional file 1: Tables S5.
Genome editing by paired I-SceI and paired Cas9-gRNAs
For genome editing by paired I-SceI, mouse ES cells harboring the sGEJ reporter or not were transfected with pcDNA3β-I-SceI. For genome editing by paired Cas9-gRNA, mouse ES cells and human HEK293 and U2OS cells were transfected with the expression plasmids for paired gRNAs and pX330 for Cas9. These cells were also transfected with pcDNA3β-GFP for transfection efficiencies. Transfection was done with Lipofectamine 2000 (Invitrogen) in 24-well plates as described previously . At 72–96 h post-transfection, cells were harvested and genomic DNA was isolated for analysis of genome editing. For genome editing in vivo, wild-type male C57BL/6 mice, 6–8 weeks old, were purchased from Shanghai SLAC Laboratory Animal Co. Ltd. Hydrodynamic tail vein injection was performed as previously described with modifications . Briefly, the expression plasmids for spCas9 and paired gRNAs (gRNA#1 and gRNA#2) targeting the LDHA sites were suspended in 1.8 mL saline and injected into mice in less than 10 s via the tail vein. The amount of injected DNA was 200 μg for Cas9, 150 μg for gLDHA#1, and 150 μg for gLDHA#2 for each mouse. At 30 days post-injection, liver tissues were harvested and genomic DNA was isolated for analysis of genome editing at the LDHA site.
Genomic DNA extraction, PCR amplification, and Illumina deep sequencing
For analysis of targeted genome editing at endogenous genome loci, cells or liver tissues were collected after NHEJ of paired DSBs induced by paired I-SceI or paired Cas9-gRNAs. Genomic DNA was isolated from these cells or tissues using a genomic DNA purification kit (Axygen). The genomic regions targeted by I-SceI and CRISPR/Cas9 were PCR amplified with respective primers (Additional file 1: Table S5). PCR products were purified using a gel extraction kit (Axygen). PCR products of two to six different genomic target sites were then mixed, end-repaired, adenylated at 3′ ends, ligated with adapters, purified, and amplified by the second round of PCR to incorporate the P7 and P5 Illumina adapters and a unique 8-mer barcode sequence according to the manufacturer’s protocols. Next-generation sequencing was performed on the Illumina Hiseq at Veritas Genetics Asia Inc. (Hangzhou). Sequences were analyzed to identify edited events with different indels at repair junctions using DBS-Aligner as described previously .
HR and NHEJ reporter assays
Mouse ES cells harboring the NHEJ or HR reporter were transfected with pcDNA3β-I-SceI, the pU6-gRNA plasmids and pX330, and/or siRNA as previously described . siRNAs were purchased from RiboBio Co., and siRNA sequences were 5′-CGTACGCGGAATACTTCGA -3′ for the luciferase control and 5′-GGAACTCTGGACAAAACTA- 3′ for mouse CtIP. Both siRNAs were tested before in mouse ES cells . For drug treatment, GW843682X (Sigma-Aldrich) was added at 6 h post-transfection and replaced fresh the next day for continued treatment for the rest of the experiment. Cells transfected and/or treated were analyzed for GFP+ frequencies using the Beckman Coulter CytoFLEX flow cytometer 3 days post-transfection. The NHEJ and HR frequencies were calculated after being corrected with background readings and normalized with transfection efficiencies as described before .
Antibodies, western blotting, and immunofluorescence
Primary antibodies included anti-MDC1 (ab11169; 1:1000) and anti-53BP1 (ab21083; 1:1000) from Abcam, anti-CtIP (sc-271339; 1:1000) from Santa Cruz, anti-β-actin (A5441; 1:10000) from Sigma, and anit-γH2AX (2455636; 1:1000) from Millipore. For western blotting, cells were washed with cold PBS and lysed with RIPA buffer for 30 min. Cell extracts were separated by SDS-PAGE and analyzed by western blotting with corresponding antibodies. For immunofluorescence staining, cells were seeded on glass coverslips in six-well plates overnight and irradiated with 3 Gy X-rays. After 1-h recovery, cells were fixed with 4% paraformaldehyde, stained with primary antibodies and Alexa Fluor 488 or Alexa Fluor 546 secondary antibodies (Invitrogen), and imaged by the Zeiss AXIO Observer A1 microscope.
Data were analyzed by Student’s paired and unpaired t-test, one-way ANOVA with post-hoc least significant difference (LSD) pairwise comparisons, Kruskal–Wallis test, regression analysis, Mann–Whitney test, χ2 test for determination of P value and correlation coefficients as indicated in the figures.
Hsu PD, Lander ES, Zhang F. Development and applications of CRISPR-Cas9 for genome engineering. Cell. 2014;157:1262–78.
Jiang F, Doudna JA. CRISPR–Cas9 structures and mechanisms. Annu Rev Biophys. 2017;46:505–29.
Jinek M, Chylinski K, Fonfara I, Hauer M, Doudna JA, Charpentier E. A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity. Science. 2012;337:816–21.
Cong L, Ran FA, Cox D, Lin S, Barretto R, Habib N, et al. Multiplex genome engineering using CRISPR/Cas systems. Science. 2013;339:819–23.
Mali P, Yang L, Esvelt KM, Aach J, Guell M, DiCarlo JE, et al. RNA-guided human genome engineering via Cas9. Science. 2013;339:823–6.
Gallagher DN, Haber JE. Repair of a site-specific DNA cleavage: old-school lessons for Cas9-mediated gene editing. ACS Chem Biol. 2018;13:397–405.
Pawelczak KS, Gavande NS, VanderVere-Carozza PS, Turchi JJ. Modulating DNA repair pathways to improve precision genome engineering. ACS Chem Biol. 2018;13:389–96.
Menon V, Povirk LF. End-processing nucleases and phosphodiesterases: an elite supporting cast for the non-homologous end joining pathway of DNA double-strand break repair. DNA Repair (Amst). 2016;43:57–68.
Pannunzio NR, Watanabe G, Lieber MR. Nonhomologous DNA end joining for repair of DNA double-strand breaks. J Biol Chem. 2018;293:10512–23.
Symington LS, Gautier J. Double-strand break end resection and repair pathway choice. Annu Rev Genet. 2011;45:247–71.
Xie A, Kwok A, Scully R. Role of mammalian Mre11 in classical and alternative nonhomologous end joining. Nat Struct Mol Biol. 2009;16:814–8.
Xie A, Hartlerode A, Stucki M, Odate S, Puget N, Kwok A, et al. Distinct roles of chromatin-associated proteins MDC1 and 53BP1 in mammalian double-strand break repair. Mol Cell. 2007;28:1045–57.
Rass E, Chandramouly G, Zha S, Alt FW, Xie A. Ataxia telangiectasia mutated (ATM) is dispensable for endonuclease I-SceI-induced homologous recombination in mouse embryonic stem cells. J Biol Chem. 2013;288:7086–95.
Guirouilh-Barbat J, Huck S, Bertrand P, Pirzio L, Desmaze C, Sabatier L, et al. Impact of the KU80 pathway on NHEJ-induced genome rearrangements in mammalian cells. Mol Cell. 2004;14:611–23.
Guirouilh-Barbat J, Rass E, Plo I, Bertrand P, Lopez BS. Defects in XRCC4 and KU80 differentially affect the joining of distal nonhomologous ends. Proc Natl Acad Sci U S A. 2007;104:20902–7.
Bétermier M, Bertrand P, Lopez BS. Is non-homologous end-joining really an inherently error-prone process? PLoS Genet. 2014;10:e1004086.
Geisinger JM, Turan S, Hernandez S, Spector LP, Calos MP. In vivo blunt-end cloning through CRISPR/Cas9-facilitated non-homologous end-joining. Nucleic Acids Res. 2016;44:e76.
Ghezraoui H, Piganeau M, Renouf B, Renaud J-B, Sallmyr A, Ruis B, et al. Chromosomal translocations in human cells are generated by canonical nonhomologous end-joining. Mol Cell. 2014;55:829–42.
Mandal PK, Ferreira LMR, Collins R, Meissner TB, Boutwell CL, Friesen M, et al. Efficient ablation of genes in human hematopoietic stem and effector cells using CRISPR/Cas9. Cell Stem Cell. 2014;15:643–52.
Zheng Q, Cai X, Tan MH, Schaffert S, Arnold CP, Gong X, et al. Precise gene deletion and replacement using the CRISPR/Cas9 system in human cells. BioTechniques. 2014;57:115–24.
Zhou J, Wang J, Shen B, Chen L, Su Y, Yang J, et al. Dual sgRNAs facilitate CRISPR/Cas9-mediated mouse genome targeting. FEBS J. 2014;281:1717–25.
Canver MC, Bauer DE, Dass A, Yien YY, Chung J, Masuda T, et al. Characterization of genomic deletion efficiency mediated by clustered regularly interspaced palindromic repeats (CRISPR)/Cas9 nuclease system in mammalian cells. J Biol Chem. 2014;289:21312–24.
Zhu S, Li W, Liu J, Chen C-H, Liao Q, Xu P, et al. Genome-scale deletion screening of human long non-coding RNAs using a paired-guide RNA CRISPR-Cas9 library. Nat Biotechnol. 2016;34:1279–86.
Hu JH, Miller SM, Geurts MH, Tang W, Chen L, Sun N, et al. Evolved Cas9 variants with broad PAM compatibility and high DNA specificity. Nature. 2018;556:57–63.
Kleinstiver BP, Prew MS, Tsai SQ, Nguyen NT, Topkar VV, Zheng Z, et al. Broadening the targeting range of Staphylococcus aureus CRISPR-Cas9 by modifying PAM recognition. Nat Biotechnol. 2015;33:1293–8.
Kleinstiver BP, Prew MS, Tsai SQ, Topkar VV, Nguyen NT, Zheng Z, et al. Engineered CRISPR-Cas9 nucleases with altered PAM specificities. Nature. 2015;523:481–5.
Wolfs JM, Hamilton TA, Lant JT, Laforet M, Zhang J, Salemi LM, et al. Biasing genome-editing events toward precise length deletions with an RNA-guided TevCas9 dual nuclease. Proc Natl Acad Sci U S A. 2016;113:14988–93.
Feng Y-L, Xiang J-F, Liu S-C, Guo T, Yan G-F, Feng Y, et al. H2AX facilitates classical non-homologous end joining at the expense of limited nucleotide loss at repair junctions. Nucleic Acids Res. 2017;45:10614–33.
Guirouilh-Barbat J, Gelot C, Xie A, Dardillac E, Scully R, Lopez BS. 53BP1 protects against CtIP-dependent capture of ectopic chromosomal sequences at the junction of distant double-strand breaks. PLoS Genet. 2016;12:e1006230.
Boubakour-Azzouz I, Ricchetti M. Low joining efficiency and non-conservative repair of two distant double-strand breaks in mouse embryonic stem cells. DNA Repair (Amst). 2008;7:149–61.
Lemos BR, Kaplan AC, Bae JE, Ferrazzoli AE, Kuo J, Anand RP, et al. CRISPR/Cas9 cleavages in budding yeast reveal templated insertions and strand-specific insertion/deletion profiles. Proc Natl Acad Sci U S A. 2018;115:E2040–7.
van Overbeek M, Capurso D, Carter MM, Thompson MS, Frias E, Russ C, et al. DNA repair profiling reveals nonrandom outcomes at Cas9-mediated breaks. Mol Cell. 2016;63:633–46.
Zuo Z, Liu J. Cas9-catalyzed DNA cleavage generates staggered ends: evidence from molecular dynamics simulations. Sci Rep. 2016;5:37584.
Yan CT, Boboila C, Souza EK, Franco S, Hickernell TR, Murphy M, et al. IgH class switching and translocations use a robust non-classical end-joining pathway. Nature. 2007;449:478–82.
Zgheib O, Pataky K, Brugger J, Halazonetis TD. An oligomerized 53BP1 tudor domain suffices for recognition of DNA double-strand breaks. Mol Cell Biol. 2009;29:1050–8.
Botuyan MV, Lee J, Ward IM, Kim J-E, Thompson JR, Chen J, et al. Structural basis for the methylation state-specific recognition of histone H4-K20 by 53BP1 and Crb2 in DNA repair. Cell. 2006;127:1361–73.
Panier S, Boulton SJ. Double-strand break repair: 53BP1 comes into focus. Nat Rev Mol Cell Biol. 2014;15:7–18.
Bunting SF, Callén E, Wong N, Chen H-T, Polato F, Gunn A, et al. 53BP1 inhibits homologous recombination in Brca1-deficient cells by blocking resection of DNA breaks. Cell. 2010;141:243–54.
Biehs R, Steinlage M, Barton O, Juhász S, Künzel J, Spies J, et al. DNA double-strand break resection occurs during non-homologous end joining in G1 but is distinct from resection during homologous recombination. Mol Cell. 2017;65:671–684.e5.
You Z, Bailis JM. DNA damage and decisions: CtIP coordinates DNA repair and cell cycle checkpoints. Trends Cell Biol. 2010;20:402–9.
Yun MH, Hiom K. CtIP-BRCA1 modulates the choice of DNA double-strand-break repair pathway throughout the cell cycle. Nature. 2009;459:460–3.
Barton O, Naumann SC, Diemer-Biehs R, Künzel J, Steinlage M, Conrad S, et al. Polo-like kinase 3 regulates CtIP during DNA double-strand break repair in G1. J Cell Biol. 2014;206:877–94.
Sugawara N, Haber JE. Characterization of double-strand break-induced recombination: homology requirements and single-stranded DNA formation. Mol Cell Biol. 1992;12:563–75.
Weinstock DM, Nakanishi K, Helgadottir HR, Jasin M. Assaying double-strand break repair pathway choice in mammalian cells using a targeted endonuclease or the RAG recombinase. Meth Enzymol. 2006;409:524–40.
Lee HJ, Kim E, Kim J-S. Targeted chromosomal deletions in human cells using zinc finger nucleases. Genome Res. 2010;20:81–9.
Xiao A, Wang Z, Hu Y, Wu Y, Luo Z, Yang Z, et al. Chromosomal deletions and inversions mediated by TALENs and CRISPR/Cas in zebrafish. Nucleic Acids Res. 2013;41:e141.
Gupta A, Hall VL, Kok FO, Shin M, McNulty JC, Lawson ND, et al. Targeted chromosomal deletions and inversions in zebrafish. Genome Res. 2013;23:1008–17.
Ran FA, Hsu PD, Wright J, Agarwala V, Scott DA, Zhang F. Genome engineering using the CRISPR-Cas9 system. Nat Protoc. 2013;8:2281–308.
Tabebordbar M, Zhu K, Cheng JKW, Chew WL, Widrick JJ, Yan WX, et al. In vivo gene editing in dystrophic mouse muscle and muscle stem cells. Science. 2016;351:407–11.
Nelson CE, Hakim CH, Ousterout DG, Thakore PI, Moreb EA, Castellanos Rivera RM, et al. In vivo genome editing improves muscle function in a mouse model of Duchenne muscular dystrophy. Science. 2016;351:403–7.
Aparicio-Prat E, Arnan C, Sala I, Bosch N, Guigó R, Johnson R. DECKO: Single-oligo, dual-CRISPR deletion of genomic elements including long non-coding RNAs. BMC Genomics. 2015;16:846.
Yang H, Wang H, Shivalila CS, Cheng AW, Shi L, Jaenisch R. One-step generation of mice carrying reporter and conditional alleles by CRISPR/Cas-mediated genome engineering. Cell. 2013;154:1370–9.
Maddalo D, Manchado E, Concepcion CP, Bonetti C, Vidigal JA, Han Y-C, et al. In vivo engineering of oncogenic chromosomal rearrangements with the CRISPR/Cas9 system. Nature. 2014;516:423–7.
Han J, Zhang J, Chen L, Shen B, Zhou J, Hu B, et al. Efficient in vivo deletion of a large imprinted lncRNA by CRISPR/Cas9. RNA Biol. 2014;11:829–35.
Long C, Amoasii L, Mireault AA, McAnally JR, Li H, Sanchez-Ortiz E, et al. Postnatal genome editing partially restores dystrophin expression in a mouse model of muscular dystrophy. Science. 2016;351:400–3.
Pulido-Quetglas C, Aparicio-Prat E, Arnan C, Polidori T, Hermoso T, Palumbo E, et al. Scalable design of paired CRISPR guide RNAs for genomic deletion. PLoS Comput Biol. 2017;13:e1005341.
Wettstein R, Bodak M, Ciaudo C. Generation of a knockout mouse embryonic stem cell line using a paired CRISPR/Cas9 genome engineering tool. Methods Mol Biol. 2016;1341:321–43.
He X, Tan C, Wang F, Wang Y, Zhou R, Cui D, et al. Knock-in of large reporter genes in human cells via CRISPR/Cas9-induced homology-dependent and independent DNA repair. Nucleic Acids Res. 2016;44:e85.
Xue W, Chen S, Yin H, Tammela T, Papagiannakopoulos T, Joshi NS, et al. CRISPR-mediated direct mutation of cancer genes in the mouse liver. Nature. 2014;514:380–4.
Guo T, Feng YL, Xiao JJ, Liu Q, Sun XN, Xiang JF, et.al, Harnessing accurate non-homologous end joining for efficient precise deletion in CRISPR/Cas9-mediated genome editing. [dataset] Sequence Read Archive. 2018. https://www.ncbi.nlm.nih.gov/sra/SRP150149. Accessed 12 June 2018.
The authors are grateful to the other members of the Xie laboratory for helpful discussions. We thank William Folk (University of Missouri at Columbia, USA) for helpful discussions and critically reading the manuscript.
This work was supported by the National Natural Science Foundation of China (number 81472755, number 31671385, and number 81661128008 to A.Y.X.), the Natural Science Foundation of Zhejiang Province (LZ17C060001 to A.Y.X. and LY18C050001 to Y.L.F), and the Department of Science and Technology of Zhejiang Province (2015C03047 to A.Y.X.).
Availability of data and materials
Deep sequencing raw data are available in the Sequence Read Archive (SRA) under accession number SRP150149 .
Ethics approval and consent to participate
All animal procedures were reviewed and revised by the Animal Welfare and Ethical Review Body at Zhejiang University, with approval number ZJU2015–378-01.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Table S1. The frequencies of accurate NHEJ in repair of Cas9-induced DSBs. Table S2. Top five group I NHEJ events in repair of adjacently paired DSBs induced by paired Cas9-gRNA. Table S3. Most of the + 1 insertions at repair junctions are templated insertions. Table S4. The effect of accurate NHEJ on the frequency of out-of-frame and in-fame editing. Table S5. Sequence information on gRNAs, primers, and genome editing target sites. (XLSX 216 kb)
Figure S1. Comparison of accurate NHEJ in repair of Cas9-induced DSBs between human cells (HEK293 or U2OS) and mouse ES cells. Figure S2. The frequency of “Accurate”, “Deletion”, “Insertion” and “IndDel” in group I events from 71 endogenous gene loci. Figure S3. Nearly all of the + 1 insertions with no additional mutations at repair junctions were templated. Figure S4. Generation of + 1 templated insertions (TI) resulted from paired Cas9 cleavage with the W/W, W/C, C/W, or C/C orientation of paired PAMs. Figure S5. Nearly all of the + 1 templated insertions with no additional mutations could be predicted from paired Cas9 cleavage at the third and fourth base upstream of a PAM. Figure S6. Correlation between + 1 template insertions (TI) and out-of-frame mutations derived from either of the Common, Ideal, or Paired methods. Figure S7. Effect of accurate NHEJ with a frequency below or above 30% on the frequency of out-of-frame editing. Figure S8. Correlation between the out-of-frame editing frequency and the frequency of templated insertions (TI) at 50 genome sites. Figure S9. Comparison of the in-frame editing frequency induced by Common, Paired, and Ideal at 20 genome sites. Figure S10. Distributions of microhomology at repair junctions in 53BP1 wild-type, 53BP1ΔTudor, and 53BP1ΔOD mouse ES cells. Figure S11. Flowchart of the paired Cas9-gRNA protocol for genome editing that requires precise deletion of defined length. (PDF 4737 kb)
About this article
Cite this article
Guo, T., Feng, YL., Xiao, JJ. et al. Harnessing accurate non-homologous end joining for efficient precise deletion in CRISPR/Cas9-mediated genome editing. Genome Biol 19, 170 (2018). https://doi.org/10.1186/s13059-018-1518-x
- Paired gRNAs
- Accurate NHEJ
- Templated insertions
- Precise deletion
- Targeted in-frame deletion
- Genome editing