- Open Access
DNA Topoisomerase I differentially modulates R-loops across the human genome
- Stefano G. Manzo†1, 4,
- Stella R. Hartono†2,
- Lionel A. Sanz2,
- Jessica Marinello1,
- Sara De Biasi3,
- Andrea Cossarizza3,
- Giovanni Capranico1Email author and
- Frederic Chedin2Email authorView ORCID ID profile
© The Author(s). 2018
- Received: 26 January 2018
- Accepted: 10 July 2018
- Published: 30 July 2018
Co-transcriptional R-loops are abundant non-B DNA structures in mammalian genomes. DNA Topoisomerase I (Top1) is often thought to regulate R-loop formation owing to its ability to resolve both positive and negative supercoils. How Top1 regulates R-loop structures at a global level is unknown.
Here, we perform high-resolution strand-specific R-loop mapping in human cells depleted for Top1 and find that Top1 depletion results in both R-loop gains and losses at thousands of transcribed loci, delineating two distinct gene classes. R-loop gains are characteristic for long, highly transcribed, genes located in gene-poor regions anchored to Lamin B1 domains and in proximity to H3K9me3-marked heterochromatic patches. R-loop losses, by contrast, occur in gene-rich regions overlapping H3K27me3-marked active replication initiation regions. Interestingly, Top1 depletion coincides with a block of the cell cycle in G0/G1 phase and a trend towards replication delay.
Our findings reveal new properties of Top1 in regulating R-loop homeostasis in a context-dependent manner and suggest a potential role for Top1 in modulating the replication process via R-loop formation.
Biological processes such as transcription and replication generate torsional stress on the DNA double helix that, if not properly dealt with, can lead to genome instability . R-loop structures, a prevalent non-B DNA structure in mammalian genomes, have been particularly linked to genomic instability by causing interference between the replication and transcription machineries [2, 3]. R-loops are formed during transcription upon reannealing of the nascent transcript to the DNA template strand, forming an RNA:DNA hybrid and forcing the non-template strand to loop out. Mapping data indicate that these non-B DNA structures are prevalent in mammalian genomes, where they form dynamically over conserved regions [4, 5]. Negative supercoiling generated behind the elongating RNA polymerase  is thought to facilitate R-loop formation by inducing an underwound DNA state favorable to the re-annealing of the nascent transcript . DNA Topoisomerase I (Top1) is one main cellular factor controlling topological homeostasis [8, 9]. Top1 activity can relax negative supercoils by cutting one of the DNA strands, creating a transient Top1-DNA cleavage complex (Top1cc), and performing a controlled rotation of the cut strand around the uncut strand [10, 11]. The relaxation activity on negative supercoils is thought to reduce co-transcriptional R-loop formation which in turns prevents replication / transcription interference and favors genome stability. Indeed, deletion of the bacterial topA gene, an enzyme that only relaxes negative supercoils, creates R-loop-prone hypernegatively supercoiled DNA and causes a growth defect that can be suppressed by over-expression of Ribonuclease H (RNase H), an enzyme that degrades RNA strands in RNA:DNA hybrids [7, 12, 13]. Furthermore, persistent depletion of Top1 in mammalian cells leads to replicative stress and replication-transcription conflicts that can be rescued by overexpression of RNase H . Finally, stabilization of Top1cc by Top1 inhibitors such as camptothecin and its derivatives  leads to R-loop stabilization in human cells upon short treatment [16, 17] and to transcription-dependent DNA breakage that can be partially suppressed by RNase H expression .
Thus, while it is clear that Top1 regulates R-loops and prevents R-loop-induced genomic instability, the range of loci that are sensitive to R-loop modulation by Top1 is not known. Addressing this gap in knowledge is important given rising evidence that R-loops are abundant in mammalian genomes and also participate in important biological processes [19–21]. For instance, R-loops are involved in regulating chromatin states [4, 5, 22], in mediating transcription termination , and in immunoglobulin class switch recombination . Studies also suggest a role for R-loops in priming DNA replication in prokaryotic systems and yeast [25–28]. How R-loop formation is dynamically regulated to permit the physiological roles of R-loops while minimizing the negative impacts of excessive R-loops on genome stability is not clear. In this study, we used the DRIPc-seq technique  to map R-loop structures genome-wide in human cells experiencing an acute but transient depletion of Top1. Our work reveals that Top1 modulates R-loop structures differently according to genomic context and provide new evidence that R-loops may play a role in the replication process.
Top1 depletion causes subtle R-loop gains and losses
We next quantified R-loop formation genome wide by employing DRIPc-seq, a technique that allows high-resolution, strand-specific genomic mapping of R-loop structures . R-loop structures were observed over 69,066 peaks using a standard peak calling algorithm, (Additional file 2: Table S1), covering ~ 200 megabases (Mb) of genomic space, which is in close agreement with previous data . As expected, R-loop formation was predominantly genic, with promoters and terminators representing hotspots of signal (Additional file 1: Figure S1D). Detection of R-loop signal changes  indicated that only a small subset of R-loop peaks (4.07%) showed significant changes upon Top1 depletion. However, inspection of signal in Top1-depleted samples revealed numerous instances of signal spreading from existing R-loop peaks (Additional file 1: Figure S1E). To properly account for these events, we optimized a high-sensitivity version of our peak-calling algorithm and applied it to all samples (see Methods). This method identified a total of 399,953 peaks and significant, reproducible R-loop signal gains and losses upon Top1 depletion occurred at 15,112 and 12,977 peaks, respectively (7.02% of total peaks) (Additional file 1: Figure S1F, see Additional file 2: Table S1 for a comparison between standard and high-sensitivity methods). These changes were independently validated using DRIP-qPCR at representative test loci (Additional file 1: Figure S1G). Similar results were obtained when we induced Top1 depletion with an additional siRNA and similar trends towards R-loop gains and losses were also detected at earlier Top1 depletion time points (Additional file 1: Figure S1H and I). Finally, similar trends towards R-loop gains and losses were observed with the S9.6-independent method DRIVE-qPCR , in which a catalytically inactive RNASEH1 protein is used to capture R-loops (Additional file 1: Figure S1J).
Quantitative analysis of DRIPc-seq signal over high-sensitivity peaks of R-loop gain (RLG) and R-loop loss (RLL) was consistent with the increased S9.6 signal detected by dot blot: RLG peaks occupied ~ 100 Mb of sequence space whereas RLL peaks occupied 36 Mb (Fig. 1c). Furthermore, the intensity of R-loop signals measured as the total number of reads over all RLG and RLL peaks also showed an overall net increase in Top1-depleted cells (Fig. 1d). Since ribosomal DNA arrays harbor a major source of cellular R-loops  and Top1 depletion in yeast was shown to cause R-loop gains over the 5’ETS region , we reasoned that ribosomal R-loops could also contribute to the increased R-loop signal observed by dot blots. To address this, we visualized R-loop loads over the ribosomal DNA region, which revealed that R-loops increased over the 5’ ETS, but not the transcribed 28S region (Fig. 1e). This was further validated by DRIP-qPCR and similar results were obtained with a second siRNA against Top1 (Additional file 1: Figure S1K and L). Therefore overall, results from genomic profiling and dot blot analysis were consistent and point to the fact that Top1 depletion results in a net increase in cellular R-loop loads, although only a minority of R-loop peaks are directly affected.
R-loop gains and losses in Top1-depleted cells define distinct gene categories
To understand the nature of the differential response to Top1 depletion, we focused on RLG and RLL genes and first investigated their lengths. RLG genes were significantly longer (2.7 fold on average) compared to RLL genes or genes with no or mixed R-loop change (Fig. 2b). By contrast, RLL genes were only slightly longer than control genes with no or mixed changes in R-loops. This suggests that longer genes are more prone to RLG upon Top1 depletion. To confirm this, we clustered all R-loop peaks by their ratio of R-loop signal change from control to Top1-depleted conditions regardless of whether they belonged to RLL or RLG genes. We then measured gene lengths as a function of R-loop signal fold change in each cluster. We observed a strong positive correlation between the relative intensity of R-loop signal changes and each cluster’s length (Fig. 2c). This, together with the marked distribution of RLG peaks to gene bodies, suggests that transcription elongation through long genes is more prone to R-loop stabilization in absence of Top1. To determine if expression levels could also distinguish RLG and RLL genes, we performed total RNA-seq on both control and Top1-depleted cells. This analysis revealed that RLG genes were significantly more expressed (1.5–3.5 fold on average) than RLL genes, which themselves were not significantly different than control genes (Fig. 2d). Thus, RLG genes tend to be long and highly expressed. Since long genes often reside in gene-poor areas of the genome, we measured the distance between RLG and RLL genes and their nearest neighbor. RLG genes were located significantly further away from potential neighbors than control or RLL genes (Fig. 2e). This indicates that RLG genes tend to occupy gene-poor neighborhoods. Consistent with this, there was a strong inverse correlation between chromosomal gene density and R-loop loss/gain ratio: gene-rich chromosomes predominantly showed loss of R-loop signal loss following Top1 knockdown, while gene-poor chromosomes favored R-loop gain events (Fig. 2f).
Finally, given that Top1 can regulate RNAP release from promoter-proximal pausing [32–34], we asked whether RLG and RLL genes differ in how frequently they undergo pause-release. For this, we performed RNA Polymerase II ChIP-seq in control and Top1-depleted cells and determined the pausing index of each gene according to well-defined categories . RLG genes were strongly enriched in genes that undergo promoter-proximal pausing (Fig. 2g). By contrast, RLL genes mostly corresponded to genes that do not undergo pausing and their distribution was not significantly different from that of control genes. More broadly, our analysis confirmed that Top1 depletion causes RNAP accumulation in the vicinity of the TSS, particularly for paused genes  (Additional file 1: Figure S2C). Altogether, this shows that the differential response to Top1 depletion defines two broadly distinct classes of genes.
Top1 depletion favors co-transcriptional R-loop gains
R-loop gains upon Top1 depletion preferentially associate with heterochromatin and nuclear lamina
Since lamin and H3K9me3 association might represent physical constraints to the dissipation of transcription-induced DNA supercoils in the absence of Top1, we therefore investigated the relationship between RLG and these parameters further. We calculated the distance between RLG and RLL peaks and the nearest annotated H3K9me3 or LAD peak. RLG peaks were significantly closer to H3K9me3 peaks compared to matched R-loop forming peaks (Fig. 4b). This proximity was true regardless of the genic location of the RLG peak (promoter, gene body and terminator) and was striking given that the median value for distance was close to zero. By contrast, RLL peaks were located further away from H3K9me3 peaks (median distance of 8 kb) than matched control peaks. RLG peaks therefore tend to reside in immediate proximity to H3K9me3 peaks. We next asked if the intensity of R-loop signal gains was correlated to the distance to H3K9me3 peaks. For this, we clustered all R-loop peaks by their R-loop change ratios and measured the distance between these loci and the nearest H3K9me3 peak in each cluster. A strong correlation between the two parameters was observed such that the R-loop peaks with the strongest gains were located closest to H3K9me3 peaks. Vice-versa, R-loop peaks with the strongest losses were located furthest away from H3K9me3 peaks (Fig. 4c). Furthermore in 70% of cases, the H3K9me3 peaks were located upstream of the RLG peaks relative to gene transcription (not shown). Similar observations were made with LADs: RLG peaks were located significantly closer to LADs than RLL genes or control matched genes (Fig. 4b). In addition, we observed a strong correlation between distance to LADs and strength of R-loop change (Fig. 4c). While the median distance between RLG and annotated LADs was comparatively large (~ 200 kb; Fig. 4b), we note that top-ranked RLG genes in terms of signal gains were often closely juxtaposed to LADs (Additional file 1: Figure S3A). Similarly, promoters of RLG genes were characterized by a strong upstream Lamin B1 signal and marked lamin B1 depletion around the TSS region (Additional file 1: Figure S3B). To further characterize the arrangement of RLG genes relative to LAD boundaries, we calculated the density of promoters around LAD boundaries as a function of genic orientation. RLG genes transcribing away from LADs showed a sharp promoter density peak at or near the LAD border (Fig. 4d). By contrast, no peak was observed for genes transcribing towards the LAD. A corresponding promoter peak was not observed for matched control genes and only weakly for all genes. Altogether, this data shows that long, highly transcribed genes accumulate R-loops likely due to the proximity to H3K9me3 peaks and/or LADs, which may impede the dissipation of DNA supercoils along the chromatin template and therefore allow the local accumulation of R-loop-favorable negative DNA supercoils.
R-loop losses upon Top1 depletion preferentially associate with early, active, replication origins
Since Top1 activity has been mapped to a well-established, conserved human replication origin , we also included replication initiation sites mapped by Short Nascent Strand sequencing (SNS-seq)  to our analysis. To our surprise, RLL peaks showed strong overlap enrichment with such loci compared to matched controls while RLG peaks showed clear depletion (Fig. 5a). To investigate the relationship between RLL peaks and origins further, we measured the distance between RLL and RLG loci to the nearest annotated SNS-seq loci. RLL peaks were located significantly closer to origins than matched controls and dramatically closer than RLG peaks which themselves were located further away from origins than matched controls (Fig. 5b). Furthermore, we observed a strong correlation between the intensity of the R-loop signal gain/loss upon Top1 depletion measured over RLL and RLG peaks and the distance from these peaks to the nearest SNS-seq origin (Fig. 5c). Strikingly, peaks with the strongest loss tended to directly match onto SNS-seq origins (median distance of zero). Increased overlap of RLL peaks over CpG island loci, which are often replication origins , was also observed, while RLG peaks showed the opposite trend (Additional file 1: Figure S4A). Thus, topoisomerase I depletion associates with a loss of R-loop signal at peaks that are proximal to replication origins.
To understand whether the association of RLL peaks with replication origins may have functional significance, we next asked whether RLL peaks also overlapped with SNS-seq signal, which reflects the frequency of replication initiation events. SNS-seq signal was significantly higher for RLL peaks over both promoters and terminators compared to the signal observed over matched controls (Fig. 5d). Similarly, SNS-seq signal over RLG peaks was lower than that of matched controls. Overall, a strong correlation was observed between replication signal and the strength of R-loop gains and losses (Fig. 5e). This indicates that RLL peaks delineate regions with high replication initiation activity while RLG peaks match further away from sites of replication initiation onto regions with poor replication initiation potential. Analysis of replication timing data (Repli-seq, ) confirmed that RLL peaks replicate as early as late G1 (G1b) and S1 phases (Fig. 5f). Compared to matched loci, RLL peaks were significantly more likely to replicate in late G1 and less likely to replicate in later phases. By contrast, RLG peaks replicated predominantly in later phases of the cell cycle (S1 and S2) and showed a significant tendency towards later replication compared to matched loci.
To ensure that the association between RLL peaks and replication origins is robust, we analyzed independent datasets where origins were mapped through Okazaki fragment sequencing (OK-seq) . While SNS-seq and OK-seq datasets produce distinct replication initiation maps, both methods nonetheless show significant overlap in particular around gene bodies and terminal genic regions (data not shown). RLL peaks showed increased overlap with OK-seq-derived initiation peaks (AS peaks, ) while RLG peaks showed decreased overlap (Fig. 5a). Likewise RLG peaks were located further away than expected from matched control genes, while RLL peaks were distributed as expected from, or closer than, control peaks (Additional file 1: Figure S4B). Finally, RLG peaks showed reduced densities of AS peaks compared to matched controls while RLL peaks showed the opposite trend (Additional file 1: Figure S4C).
While the analysis above was restricted to genic regions so we could ensure stringent matching procedures, we also investigated intergenic RLL peaks and found determined that intergenic RLL peaks also showed a 4–5 times higher overlap with replication initiation regions (SNS-seq or OK-seq) than expected at random (data not shown). Thus, altogether the present genomic analyses show a robust association between peaks of R-loop loss in response to Top1 depletion and active, early replication origins, suggesting a role for Top1 in modulating the replication process.
Top1 depletion triggers G0/G1 block and replication timing delays with minimal DNA damage
Given the accumulation of Top1-depleted cells in G0/G1, we wondered if the R-loop gains and losses we observed could be due to preferential R-loop formation by RLG and RLL genes within and outside of G1, respectively. To test this, we synchronized cells in G2, released them, and monitored R-loop formation in G1 and mid-S by DRIP-qPCR at a range of loci. Other than an expected drop of R-loop formation in G2/M, we did not observe any specific trend across RLG and RLL loci analyzed here (Additional file 2: Figure S6A). To further determine if the pattern of R-loop gains and losses could be explained by an accumulation of cells in G0/G1, we took advantage of a recent R-loop dataset obtained from breast cancer cells (MCF7) in the presence or absence of estradiol . Without estradiol, 85% of MCF7 cells accumulate in G0/G1. Upon stimulation with estradiol, cells rapidly re-enter the cell cycle. We reasoned that if R-loop gains and losses are caused by the G0/G1 arrest, then a similar, perhaps even amplified, pattern should be observed in MCF7 cells. To test this, we determined the overlap between MCF7 R-loop peaks in unstimulated or estradiol-stimulated conditions (2 and 24 h) and both RLG genes and RLL genes. The proportion of RLG and RLL genes that intersected with R-loops (i.e. at least on R-loop peak in the gene) was relatively constant through time (Additional file 2: Figure S6B). Thus we did not observe that the proportion of R-loop-positive RLG genes decreased with cell cycle re-entry as would be expected if R-loop formation at RLG genes was a property of the G0/G1 state. Similarly, we did not observe a significant increase in the proportion of R-loop positive RLL genes with cell cycle re-entry, as would be expected if R-loop formation was only allowed outside the G0/G1 phase in RLL genes. Similar results were obtained when we measured the total R-loop loads (i.e. length occupied by R-loop peaks) of RLL and RLG genes (Additional file 2: Figure S6B). Overall, we conclude that the patterns of R-loop losses and gains observed upon Top1 depletion are unlikely to simply reflect shifts in cell cycle patterns and are more likely to result from the response to Top1 depletion.
The observation that Top1-depleted cells undergo a G1/S transition block, combined with the close association of RLL peaks with early, highly active, replication origins, suggested that Top1 depletion may affect the replication program. To test this, we analyzed the replication timing of multiple early-replicating RLL regions using BrdU incorporation to mark newly replicated strands, followed by immunoprecipitation and qPCR after cell sorting into G0/G1, early S, late S and G2/M phases. Top1-depleted cells showed delayed replication timing with a consistent switch from G1 to early S or from early S to late S phase for 5 out of 7 RLL loci (Fig. 6f). It should be noted however, that genes with no or mixed changes in R-loop distribution also showed a similar trend for 7 out of 8 loci tested (Additional file 2: Figure S6B). A minority of RLG genes (3 out of 10) also showed a modest tendency towards replication delay (Additional file 2: Figure S6C). This effect was specific for the nuclear genome, as mitochondrial DNA replication timing was not affected by Top1 knockdown (Fig. 6g). Thus, when cells are able to overcome the G1/G0 block and initiate S phase, they nonetheless show a trend towards replication delay in a way that appears influenced by, but not strictly dictated by, R-loop gains or losses.
Multiple studies have identified Top1 as a factor that prevents R-loop formation since the enzyme relaxes negative supercoils during transcription [44, 45] thereby preventing an R-loop favorable underwound DNA state [7, 12]. To understand how Top1 modulates R-loop formation in vivo, we profiled these structures globally in Top1-depleted human cells. Surprisingly, Top1 depletion caused both increases and decreases of R-loop levels depending on the genomic context (Figs. 1 and 2).
Consistent with the expectation that Top1 prevents R-loops, we identified a clear class of genes that respond to Top1 depletion by gaining R-loops. These genes were long, highly transcribed, and located in gene-poor areas of the genome. These observations are consistent with prior studies and allow us to refine a model for Top1 activity during transcription elongation. Long genes were shown to be more sensitive to Top1 poisoning by Camptothecin  or to Top1 depletion in mouse and human neurons . These studies are in agreement with our observations that RLG peaks arise co-transcriptionally on long and highly expressed genes, where they principally match to gene bodies (Figs. 1, 2 and 3). Our work also shows that RLG genes preferentially undergo promoter stalling (Fig. 2) which is in agreement with a recent study showing that Top1 becomes physically associated with the RNAP complex and catalytically activated upon release of the transcription machinery into elongation from a promoter-proximal paused state . Similarly, these observations are in agreement with prior observations that Top1 inhibition with camptothecin has an immediate effect on the RNAP II pause/release cycle at active promoters including at the long human HIF-1α gene . Our present work supports the view that Top1 facilitates transcription elongation and precisely defines the class of genes that are most dependent on this enzyme: Top1 efficiently prevents co-transcriptional R-loops specifically for long gene units with high transcription levels.
Interestingly, the generation of topological stress during transcription requires that the DNA fiber is placed under some physical constraints so as to prevent spontaneous dissipation of supercoils . Our genome-wide data reveals that in the case of R-loop stabilization through RLG genes, proximity to H3K9me3-marked chromatin and lamin-associated domains may represent the main source of such physical constraints. A subset of RLG genes were located in close proximity to LADs and faced away from the LAD boundary, suggesting that LADs might physically trap supercoils, causing an increase in negative supercoil density behind the transcribing RNAP in the absence of Top1. Indeed, the strength of R-loop gains clearly correlated with the proximity to LADs (Fig. 4). We therefore suggest that the association of genes to the nuclear envelope sensitizes them to topological disruptions. In addition to LADs, we identify heterochromatic H3K9me3-marked patches as a second important distinguishing feature of RLG genes. These patches often were in close proximity to RLG peaks and the strength of RLGs was inversely correlated to their distance from H3K9me3 peaks. We suggest that H3K9me3-marked heterochromatic patches might prevent dissipation of torsional tension because of their closed chromatin nature. Additionally, H3K9me3 was shown to mediate perinuclear anchoring, which could further prevent supercoil dissipation . Altogether, our data reveals that long, highly expressed genes in proximity to LADs or H3K9me3 patches are important reservoirs of R-loops that require proper topological control by Top1. Top1 depletion may be less critical for genes without such topological constraints where activity of Top2 isoforms may be sufficient to substitute for Top1 absence.
Given the association between R-loops and RNAP pausing  as well as DNA breakage , we speculate that R-loop suppression by Top1 plays an important role in ensuring proper gene expression and genome stability. We note that, in contrast to other studies [14, 49], we did not detect telltale signs of genomic instability (Fig. 6). Importantly, these studies used cell lines in which Top1 was stably knocked down and that were forced to undergo replication by passaging and selection. By contrast, our study only involved transient Top1 knockdown and caused a strong G0/G1 cell cycle block (Fig. 6; see below). Given that passage through S phase is required for R-loop-induced DNA breakage and instability phenotypes [2, 14], the reduction of cells in S phase likely counteracted the accumulation of DNA damage in our cell model. We speculate, however, that RLG genes may represent a source of genomic instability once cells are able to replicate. We also note that Top1 depletion in our system did not result in major changes in gene expression (Additional file 1: Figure S2D) or notable accumulation of RNAP at sites of RLG (data not shown). This indicates that, while transient Top1 depletion caused R-loop accumulation in RLG genes, gene expression still proceeded mostly unchanged. It is possible that the loss of Top1 activity, particularly in removing positive supercoils that might hinder RNAP progression, was compensated by the redundant activity of Top2. Recent evidence indeed shows that genes with high transcriptional outputs require Top2 activity to properly handle the resulting torsional stress . Thus, unlike widely held views, Top1 depletion does not result in a global R-loop increase but rather affects a specific subset of genes. This study identifies RLG genes as uniquely Top1-responsive and reveals the molecular features that render these genes dependent on Top1 for R-loop control.
Unexpectedly, Top1 depletion also led to R-loop losses over a class of genes entirely distinct from RLG genes. RLL genes were of average length, resided in gene-rich neighborhoods, and were moderately expressed (Figs. 2, 4). The most striking feature of RLL peaks was their tendency to co-localize with replication initiation regions as defined either by SNS-seq or OK-seq (Fig. 5). This co-localization was underscored by the fact that RLL loci showed higher SNS-seq signal than matched or RLG loci. Initiation regions highlighted by their RLL overlap replicated early (predominantly G1), earlier than other Top1-invariant R-loop forming loci matched for gene expression and gene densities. Studies of replication origins in mammalian systems indicate that early origins are characterized by marks of open chromatin and by transcription [50–52]. Fittingly, co-transcriptional R-loops preferentially associate with increased DNase accessibility, histone H3 acetylation, and histone H3 lysine 4 methylation [4, 22]. Top1-responsive RLL peaks further include a significant association with the H3K27me3 Polycomb mark and components of the PRC complexes (Fig. 5a). Interestingly, a subset of early, highly efficient replication origins was previously associated with a very similar chromatin pattern [40, 51]. Thus RLL peaks correspond to Top1-responsive R-loop forming loci that are enriched over a subset of early active replication origins and preferentially carry chromatin marks previously defined for these loci.
Interestingly, a primary cellular response to Top1 depletion is the accumulation of cells in G1/G0 and a delay in replication timing at certain genomic loci. One possible mechanism to account for this observation is if Top1 and R-loops participate in origin function. Top1 is known to bind to replication origin sequences [39, 53] as part of the replication progression complex (RPC) which comprises the MCM and GINS proteins . In the SV40 system, almost all RPC components are individually dispensable for activation of SV40 origin in crude extracts, except for Top1 and its interaction with the T antigen for the priming of viral replication [55, 56]. Top1 is therefore a part of the basal complex responsible for origin activation and nascent fork formation. Furthermore, Top1 DNA cleavage sites have been mapped at the lamin B2 origin and Top1 inhibition by low camptothecin concentrations abolished origin firing, suggesting that Top1 and DNA topology play a key role in this process . A plausible hypothesis is that the catalytic activity of Top1 is necessary at replication origins to remove the positive, but not negative, supercoils generated by the unwinding of DNA mediated by MCM helicases , leaving the DNA template more negatively supercoiled and thus favoring DNA strand separation. If so, the absence of Top1 will cause the inefficient removal of positive supercoils which in turns will disfavor R-loop formation and cause the appearance of RLL loci. Our work therefore highlights that Top1 may play an important role in modulating replication origin function in human cells at a subset of early origins. It nonetheless remains possible that Top1 depletion may affect replication timing and cell cycle progression in an indirect and more complex manner; further investigations will be necessary to fully define the molecular mechanisms linking Top1 and replication origin activity.
In addition, our work also raises the possibility that R-loop formation may be linked to replication origin function in human cells. The notion that R-loops may contribute to origin function is supported by a wide array of observations. As mentioned above, R-loop forming regions associate with chromatin signatures that are typical of replication origins. R-loops and origins both show hotspots of distribution at gene ends [42, 58]. CpG island promoters in particular, are R-loop and origin hotspots [5, 52, 59–63], and associate with conserved patterns of GC skew [58, 64], a sequence characteristic that intrinsically favors the formation of G-rich signatures often referred to as origin G-rich repeated elements . Such G-rich motifs have the potential to form G quadruplex structures that have been implicated as determinants of origin positioning and efficiency [65, 66]. While it is unclear if G quadruplex can spontaneously nucleate in the context of double-stranded DNA, it is reasonable to propose that R-loop structures can favor G4 formation on the looped out single-strand . Interestingly, the ORC1 subunit of the origin Recognition Complex was shown to bind G4-preferrable ssDNA , thereby suggesting that R-loop formation may favor origin licensing. Several historical precedents further underscore the connections between R-loops and origins. In the T4 bacteriophage and in ColEI-replicons in E. coli, R-loops function as replication origins [25–27, 69]. In E. coli, recombination-mediated R-loops in RNase H-deficient strains support an OriC-independent mode of replication [70, 71]. Increased R-loop formation in RNase H-deficient yeast strains subjected to Top1 inhibition led to origin-independent DNA replication initiation in the rDNA . Finally, the mitochondrial genome is thought to initiate DNA replication priming through R-loop intermediates [72–74] and a recent study showed that replication origins are specified in an R-loop dependent manner at murine class switch immunoglobulin regions . Thus, as judged from location overlaps, chromatin features, and functional associations, our work is consistent with an intimate connection between R-loop formation and replication origin specification . Future work will be necessary to delineate the detailed mechanistic connections that link transcription, R-loop formation, topoisomerase activity, and replication initiation.
Altogether, our work establishes that Top1 regulates R-loop formation in a context-dependent manner. Long, highly transcribed genes for which supercoil dissipation is not possible due to physical anchoring were particularly susceptible to Top1 depletion and responded by gaining R-loops. By contrast, a class of loci overlapping with efficient early replication origins showed an unexpected loss of R-loops upon Top1 depletion. Many genes in addition, showed a mixed response including R-loop gains and losses. This shows that unlike what was previously believed, Top1 exerts subtle effects on genomic R-loop formation, and highlights the importance of using precise R-loop genomic mapping technologies to determine the effect of a given factor on R-loop metabolism.
Cell Lines and Drugs
HEK293 cells (ATCC) were maintained in DMEM (Thermofisher) supplemented with 10% FBS in a humidified incubator at 5% of CO2. Camptothecin (Sigma Aldrich) was dissolved in DMSO at 10 mM concentration, stored in aliquots at − 20°, and used as a 1,000× stock during 1 h treatments.
HEK293 cells were counted and seeded at 150,000 cells per 35 mm dish. 24 h after seeding, cells were reverse transfected using RNAimax transfection reagent (Thermofisher) and with 10 nM of Top1-specific validated siRNA (ThermoFisher) targeting exon 16 (siRNA #1; Cat: S14304) and exon 15 (siRNA #2; Cat: S14306) of the nuclear Top1 transcript, or with a negative control RNA (scramble; Cat: AM4613). 48 h after the first transfection, one fifth of the cells were transfected again in a similar manner. Cells were harvested 72 h after the second transfection for all subsequent analysis. Knockdown was verified by Western blot (Fig. 1) and at the RNA level by RNA-seq. The mitochondrial Top1 enzyme (TOP1MT) was not affected by the knockdown as measured by RNA-seq.
Dot Blot Analysis
Genomic DNA was extracted according to DRIP protocol and digested with restriction enzyme cocktail mix. Two-fold serial dilutions starting from 7.5 micrograms of DNA were spotted on a nitrocellulose membrane and crosslinked with UV light (120 mJ/cm2). Membrane was blocked with PBS-Tween (0.1%) and 3% BSA for 30 min and then incubated with S9.6 antibody diluted to 1 μg/ml in PBS-Tween (0.1%), 3% BSA. After washing, membrane was incubated with HRP-conjugated or Alexa-fluor 488 anti-mouse secondary antibodies, further washed and developed with ECL techniques or directly in fluorescence scanning. In case of treatment with RNase H genomic DNA was pre-incubated with 10 U of enzyme for two hours at 37 °C. To ensure equal loading, we systematically withdrew an aliquot of DNA prior to application on the membrane and loaded it on an agarose gel. Densitometry was used to confirm that all samples were equally digested and of equal intensities (Additional file 1: Figure S1A).
Western blot analysis was performed according to standard procedures. Membranes were incubated with the following antibodies: anti Top1 (c15, sc5342), anti beta-actin (I-19, sc1616), anti p-ATM (10H11.E12, sc47739), anti histone H1 (AE-4, sc8030) from Santa Cruz Biotechnology. Anti Phospho-H2AX antibody (ser139, JBW301) was from Millipore. Anti Phospho-ChK1 (Ser345, 133D3) and anti Phospho-Chk2 (Thr68, C13C1) were from Cell Signaling.
DRIP and DRIPc-seq
DRIPc-seq was performed as previously described . Briefly, DRIP immunoprecipitates obtained from 40 micrograms of digested genomic DNA were collected and treated with DNase I (Fermentas). The resulting RNA strands were purified and reverse-transcribed using the iScript kit (Bio-Rad). Second strand synthesis was performed using dUTP instead of dTTP. Ligation of Illumina Truseq adapters was performed according to manufacturer’s instructions and a UDG glycosylase treatment was introduced before library amplification to permit strand-specific R-loop detection. In case of treatment with RNase H or RNase A, digested genomic DNA was pre-treated with 10 units of RNase H or 10 μg/ml of RNase A for two hours at 37 °C before DRIP.
RNA Pol II ChIP-seq and total RNA-seq
RNA Pol II ChIP was performed as previously described . Immunoprecipitated DNA was purified and used to construct Illumina NGS libraries according to manufacturer procedures. Total RNA-seq was performed after ribosomal RNA depletion using an Illumina Truseq RNA-seq kit according to the manufacturer’s instructions.
DRIPc-seq, RNA-seq, and RNA Pol II ChIP-seq Mapping and Peak Calling
Sequenced single-end reads were subjected to standard quality control pipeline using fastq-mcf software and mapped using Tophat2 for RNA-seq and Bowtie2 for the rest with default parameters. Sequencing read depths were normalized by number of mapped reads between samples, and only uniquely mapped reads were considered. High copy-number or contamination-prone regions such as rDNA, mitochondria, centromere, and ENCODE blacklisted regions were excluded. DRIPc-seq peak calling was performed using a previously developed Hidden Markov Model  modified to enable higher sensitivity in particular when dealing with lower and trailing signal (see https://github.com/srhartono/highsenshmm). This method was about 2.5-fold more sensitive, generating about 200,000 peaks of signal covering about 500 MB of genomic space in each replicate. For analysis, all DRIPc-seq peaks present in at least one sample were considered and regions showing significant differences in signal between Top1-depleted and control cells were identified using DESeq2 using significance thresholds of an adjusted p-value < 0.1 and signal fold-change higher than 1.25× or lower than 0.8× (using a more stringent adjusted p-value < 0.05 did not affect our conclusions; data not shown). Genes shorter than 5 kb were eliminated from this analysis.
Overlap Analysis with Other Datasets
Datasets for lamin, chromatin marks, ChromHMM states, SNS-seq and OK-seq replication origins or zones were downloaded from published sources. The RNAP pausing state of each gene was categorized as in  using RNA Pol II ChIP-seq datasets generated from control HEK293 cells. The enrichment or depletion of RLL and RLG peaks over chromatin features of interest was first measured in terms of peak overlap. For this, we determined the overlap of RLL and RLG peaks over chromatin peaks of interest and then calculated the peak overlap for control peaks. These control peaks were stringently selected following an earlier strategy . In brief, these peaks belonged to expression- and length-matched R-loop forming genes that were not affected by Top1 depletion (no and mixed changes in Fig. 2a). In all cases, these peaks were matched to a similar-sized R-loop peak on the matched gene. In the case of promoters and terminators, the precise position of the initial and shuffled peaks was maintained. Each initial RLL or RLG peak was independently matched multiple times to avoid outliers. We next determined the ratio of overlaps between RLL or RLG peaks and control peaks and expressed this ratio as a heatmap. The absolute overlap of chromatin features with RLL and RLG peaks is indicated by stars, as shown in Figs. 4 and 5. SNS-seq origin peaks were from  for human K562 and HeLa cells. OK-seq data was downloaded as RFD values from  for human HeLa cells. The RFD signal was processed using an HMM model configured as described by Petryk et al.  to call replication initiation zones. Overlap between SNS-seq origins and OK-seq initiation zones was measured relative to stringent matched controls. Given that R-loop mapping was performed in HEK293 cells, it is likely that the overlap with replication initiation regions was under-estimated.
48 h after second round of transfection with siRNA oligonucleotides, cells were detached and seeded at 300,000 cells per 35 mm dish on a glass coverslip pre-treated with gelatin. 24 h after seeding, cells were methanol fixed and treated with acetone. Blocking and ki-67 (Abcam, ab15580) or γH2AX antibody incubation were performed in 4× SSC and 3% BSA at 20 °C for 30 min and 2 h, respectively. Secondary antibody was anti-rabbit or anti-mouse Alexa-fluor 488. Nuclei were counter-stained with DAPI.
Cell Cycle Analysis and Replication Timing
Cell cycle analysis and replication timing were performed as described previously . Briefly, cells were pulse-labeled with BrdU (50 μM) for two hours. Cells were then harvested, fixed in 70% ethanol, and stored at − 20 °C. Before cell cycle analysis and sorting, cells were labeled with Propidium Iodide (50 μg/ml) and treated with RNase A (250 μg/ml). Cells were analyzed and sorted with a Biorad S3e cell sorter. After sorting, cells were lysed and genomic DNA was extracted. DNA was immunoprecipitated with anti BrdU antibody (B44, BD Biosciences, 347,580), purified, and used as template in qPCR. To assess R-loop formation across the cell cycle, cells were first synchronized in G2 after thymidine block (24 h) and released into nocodazole-containing media (12 h). Cells were then allowed to cycle in fresh medium and harvested in G1 and mid-S of the following cycle for DRIP-qPCR analysis. Cytofluorometric analysis after propidium iodide staining confirmed that > 85% of the cells were in the correct cell cycle phases.
All high-throughput sequencing data have been deposited into NCBI GEO entry GSE102474. Publicly available datasets used are as follows: Lamin-associated domain , ChromHMM , ENCODE histones , SNS-seq , and OK-seq  coordinates.
The review history is available as Additional file 3.
This work was supported by a National Institutes of Health R01 grants (GM120607) to F.C. and a grant from Associazione Italiana per la Ricerca sul Cancro (AIRC, IG 15886) to G.C. S.M. was funded through a three-year FIRC (Fondazione Italiana per la Ricerca sul Cancro) fellowship. S.R.H was funded by a Howard Hughes Medical Institute International Student Research fellowship. S.D.B is an International Society for Advancement of Cytometry (ISAC) Marylou Ingram Scholar.
SGM, FC, and GC designed research. SGM and LS performed experiments. SRH performed bioinformatics analysis. SDB and AC performed cell sorting analysis. SGM, SRH, FC, GC analyzed the data and wrote the paper. All authors read and approved the final manuscript.
Ethics approval and consent to participate
These are not applicable.
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
- Bermejo R, Lai MS, Foiani M. Preventing replication stress to maintain genome stability: resolving conflicts between replication and transcription. Mol Cell. 2012;45:710–8.View ArticlePubMedGoogle Scholar
- Aguilera A, Garcia-Muse T. R loops: from transcription byproducts to threats to genome stability. Mol Cell. 2012;46:115–24.View ArticlePubMedGoogle Scholar
- Hamperl S, Cimprich KA. The contribution of co-transcriptional RNA:DNA hybrid structures to DNA damage and genome instability. DNA Repair (Amst). 2014;19:84–94.View ArticleGoogle Scholar
- Sanz LA, Hartono SR, Lim YW, Steyaert S, Rajpurkar A, Ginno PA, Xu X, Chedin F. Prevalent, Dynamic, and Conserved R-Loop Structures Associate with Specific Epigenomic Signatures in Mammals. Mol Cell. 2016;63:167–78.View ArticlePubMedPubMed CentralGoogle Scholar
- Ginno PA, Lott PL, Christensen HC, Korf I, Chedin F. R-loop formation is a distinctive characteristic of unmethylated human CpG island promoters. Mol Cell. 2012;45:814–25.View ArticlePubMedPubMed CentralGoogle Scholar
- Liu LF, Wang JC. Supercoiling of the DNA template during transcription. Proc Natl Acad Sci U S A. 1987;84:7024–7.View ArticlePubMedPubMed CentralGoogle Scholar
- Drolet M, Bi X, Liu LF. Hypernegative supercoiling of the DNA template during transcription elongation in vitro. J Biol Chem. 1994;269:2068–74.PubMedGoogle Scholar
- Wang JC. Cellular roles of DNA topoisomerases: a molecular perspective. Nat Rev Mol Cell Biol. 2002;3:430–40.View ArticlePubMedGoogle Scholar
- Baranello L, Kouzine F, Levens D. DNA topoisomerases beyond the standard role. Transcription. 2013;4:232–7.View ArticlePubMedGoogle Scholar
- Champoux JJ. DNA topoisomerases: structure, function, and mechanism. Annu Rev Biochem. 2001;70:369–413.View ArticlePubMedGoogle Scholar
- Capranico G, Marinello J, Chillemi G. Type I DNA Topoisomerases. J Med Chem. 2017;60:2169–92.View ArticlePubMedGoogle Scholar
- Masse E, Phoenix P, Drolet M. DNA topoisomerases regulate R-loop formation during transcription of the rrnB operon in Escherichia coli. J Biol Chem. 1997;272:12816–23.View ArticlePubMedGoogle Scholar
- Drolet M, Phoenix P, Menzel R, Masse E, Liu LF, Crouch RJ. Overexpression of RNase H partially complements the growth defect of an Escherichia coli delta topA mutant: R-loop formation is a major problem in the absence of DNA topoisomerase I. Proc Natl Acad Sci U S A. 1995;92:3526–30.View ArticlePubMedPubMed CentralGoogle Scholar
- Tuduri S, Crabbe L, Conti C, Tourriere H, Holtgreve-Grez H, Jauch A, Pantesco V, De Vos J, Thomas A, Theillet C, et al. Topoisomerase I suppresses genomic instability by preventing interference between replication and transcription. Nat Cell Biol. 2009;11:1315–24.View ArticlePubMedPubMed CentralGoogle Scholar
- Pommier Y. Topoisomerase I inhibitors: camptothecins and beyond. Nat Rev Cancer. 2006;6:789–802.View ArticlePubMedGoogle Scholar
- Marinello J, Bertoncini S, Aloisi I, Cristini A, Malagoli Tagliazucchi G, Forcato M, Sordet O, Capranico G. Dynamic Effects of Topoisomerase I Inhibition on R-Loops and Short Transcripts at Active Promoters. PLoS One. 2016;11:e0147053.View ArticlePubMedPubMed CentralGoogle Scholar
- Marinello J, Chillemi G, Bueno S, Manzo SG, Capranico G. Antisense transcripts enhanced by camptothecin at divergent CpG-island promoters associated with bursts of topoisomerase I-DNA cleavage complex and R-loop formation. Nucleic Acids Res. 2013;41:10110–23.View ArticlePubMedPubMed CentralGoogle Scholar
- Sordet O, Redon CE, Guirouilh-Barbat J, Smith S, Solier S, Douarre C, Conti C, Nakamura AJ, Das BB, Nicolas E, et al. Ataxia telangiectasia mutated activation by transcription- and topoisomerase I-induced DNA double-strand breaks. EMBO Rep. 2009;10:887–93.View ArticlePubMedPubMed CentralGoogle Scholar
- Santos-Pereira JM, Aguilera A. R loops: new modulators of genome dynamics and function. Nat Rev Genet. 2015;Google Scholar
- Costantino L, Koshland D. The Yin and Yang of R-loop biology. Curr Opin Cell Biol. 2015;34:39–45.View ArticlePubMedPubMed CentralGoogle Scholar
- Skourti-Stathaki K, Proudfoot NJ. A double-edged sword: R loops as threats to genome integrity and powerful regulators of gene expression. Genes Dev. 2014;28:1384–96.View ArticlePubMedPubMed CentralGoogle Scholar
- Chen PB, Chen HV, Acharya D, Rando OJ, Fazzio TG. R loops regulate promoter-proximal chromatin architecture and cellular differentiation. Nat Struct Mol Biol. 2015;22:999–1007.View ArticlePubMedPubMed CentralGoogle Scholar
- Proudfoot NJ. Transcriptional termination in mammals: Stopping the RNA polymerase II juggernaut. Science. 2016;352:aad9926.View ArticlePubMedPubMed CentralGoogle Scholar
- Yu K, Chedin F, Hsieh CL, Wilson TE, Lieber MR. R-loops at immunoglobulin class switch regions in the chromosomes of stimulated B cells. Nat Immunol. 2003;4:442–51.View ArticlePubMedGoogle Scholar
- Carles-Kinch K, Kreuzer KN. RNA-DNA hybrid formation at a bacteriophage T4 replication origin. J Mol Biol. 1997;266:915–26.PubMedGoogle Scholar
- Masukata H, Tomizawa J. A mechanism of formation of a persistent hybrid between elongating RNA and template DNA. Cell. 1990;62:331–8.View ArticlePubMedGoogle Scholar
- Itoh T, Tomizawa J. Formation of an RNA primer for initiation of replication of ColE1 DNA by ribonuclease H. Proc Natl Acad Sci U S A. 1980;77:2450–4.View ArticlePubMedPubMed CentralGoogle Scholar
- Stuckey R, Garcia-Rodriguez N, Aguilera A, Wellinger RE. Role for RNA:DNA hybrids in origin-independent replication priming in a eukaryotic system. Proc Natl Acad Sci U S A. 2015;112:5779–84.View ArticlePubMedPubMed CentralGoogle Scholar
- Boguslawski SJ, Smith DE, Michalak MA, Mickelson KE, Yehle CO, Patterson WL, Carrico RJ. Characterization of monoclonal antibody to DNA.RNA and its application to immunodetection of hybrids. J Immunol Methods. 1986;89:123–30.View ArticlePubMedGoogle Scholar
- Phillips DD, Garboczi DN, Singh K, Hu Z, Leppla SH, Leysath CE. The sub-nanomolar binding of DNA-RNA hybrids by the single-chain Fv fragment of antibody S9.6. J Mol Recognit. 2013;26:376–81.View ArticlePubMedPubMed CentralGoogle Scholar
- El Hage A, French SL, Beyer AL, Tollervey D. Loss of Topoisomerase I leads to R-loop-mediated transcriptional blocks during ribosomal RNA synthesis. Genes Dev. 2010;24:1546–58.View ArticlePubMedPubMed CentralGoogle Scholar
- Baranello L, Wojtowicz D, Cui K, Devaiah BN, Chung HJ, Chan-Salis KY, Guha R, Wilson K, Zhang X, Zhang H, et al. RNA Polymerase II Regulates Topoisomerase 1 Activity to Favor Efficient Transcription. Cell. 2016;165:357–71.View ArticlePubMedPubMed CentralGoogle Scholar
- Khobta A, Ferri F, Lotito L, Montecucco A, Rossi R, Capranico G. Early effects of topoisomerase I inhibition on RNA polymerase II along transcribed genes in human cells. J Mol Biol. 2006;357:127–38.View ArticlePubMedGoogle Scholar
- Baranello L, Bertozzi D, Fogli MV, Pommier Y, Capranico G. DNA topoisomerase I inhibition by camptothecin induces escape of RNA polymerase II from promoter-proximal pause site, antisense transcription and histone acetylation at the human HIF-1alpha gene locus. Nucleic Acids Res. 2010;38:159–71.View ArticlePubMedGoogle Scholar
- Core LJ, Waterfall JJ, Lis JT. Nascent RNA sequencing reveals widespread pausing and divergent initiation at human promoters. Science. 2008;322:1845–8.View ArticlePubMedPubMed CentralGoogle Scholar
- Ernst J, Kellis M. ChromHMM: automating chromatin-state discovery and characterization. Nat Methods. 2012;9:215–6. http://hgdownload.cse.ucsc.edu/goldenPath/hg19/encodeDCC/wgEncodeBroadHmm/wgEncodeBroadHmmK562HMM.bed.gz View ArticlePubMedPubMed CentralGoogle Scholar
- Consortium EP. An integrated encyclopedia of DNA elements in the human genome. Nature. 2012;489:57–74. https://genome.ucsc.edu/cgi-bin/hgFileUi?db=hg19&g=wgEncodeBroadHistone View ArticleGoogle Scholar
- Guelen L, Pagie L, Brasset E, Meuleman W, Faza MB, Talhout W, Eussen BH, de Klein A, Wessels L, de Laat W, van Steensel B. Domain organization of human chromosomes revealed by mapping of nuclear lamina interactions. Nature. 2008;453:948–51. http://hgdownload.cse.ucsc.edu/goldenPath/hg19/database/laminB1Lads.txt.gz View ArticlePubMedGoogle Scholar
- Falaschi A. Binding of DNA topoisomerases I and II to replication origins. Methods Mol Biol. 2009;582:131–43.View ArticlePubMedGoogle Scholar
- Picard F, Cadoret JC, Audit B, Arneodo A, Alberti A, Battail C, Duret L, Prioleau MN. The spatiotemporal program of DNA replication is associated with specific combinations of chromatin marks in human cells. PLoS Genet. 2014;10:e1004282. http://pbil.univ-lyon1.fr/members/fpicard/oriseq/ View ArticlePubMedPubMed CentralGoogle Scholar
- Hansen RS, Thomas S, Sandstrom R, Canfield TK, Thurman RE, Weaver M, Dorschner MO, Gartler SM, Stamatoyannopoulos JA. Sequencing newly replicated DNA reveals widespread plasticity in human replication timing. Proc Natl Acad Sci U S A. 2010;107:139–44.View ArticlePubMedGoogle Scholar
- Petryk N, Kahli M, d'Aubenton-Carafa Y, Jaszczyszyn Y, Shen Y, Silvain M, Thermes C, Chen CL, Hyrien O. Replication landscape of the human genome. Nat Commun. 2016;7:10208. http://18.104.22.168/cgi-bin/gbrowse/gbrowse/okazaki_ref/ View ArticlePubMedPubMed CentralGoogle Scholar
- Stork CT, Bocek M, Crossley MP, Sollier J, Sanz LA, Chedin F, Swigut T, Cimprich KA. Co-transcriptional R-loops are the main cause of estrogen-induced DNA damage. Elife. 2016;5:e17548.Google Scholar
- Kouzine F, Gupta A, Baranello L, Wojtowicz D, Ben-Aissa K, Liu J, Przytycka TM, Levens D. Transcription-dependent dynamic supercoiling is a short-range genomic force. Nat Struct Mol Biol. 2013;20:396–403.View ArticlePubMedPubMed CentralGoogle Scholar
- Naughton C, Avlonitis N, Corless S, Prendergast JG, Mati IK, Eijk PP, Cockroft SL, Bradley M, Ylstra B, Gilbert N. Transcription forms and remodels supercoiling domains unfolding large-scale chromatin structures. Nat Struct Mol Biol. 2013;20:387–95.View ArticlePubMedPubMed CentralGoogle Scholar
- Solier S, Ryan MC, Martin SE, Varma S, Kohn KW, Liu H, Zeeberg BR, Pommier Y. Transcription poisoning by Topoisomerase I is controlled by gene length, splice sites, and miR-142-3p. Cancer Res. 2013;73:4830–9.View ArticlePubMedGoogle Scholar
- King IF, Yandava CN, Mabb AM, Hsiao JS, Huang HS, Pearson BL, Calabrese JM, Starmer J, Parker JS, Magnuson T, et al. Topoisomerases facilitate transcription of long genes linked to autism. Nature. 2013;501:58–62.View ArticlePubMedPubMed CentralGoogle Scholar
- Gonzalez-Sandoval A, Towbin BD, Kalck V, Cabianca DS, Gaidatzis D, Hauer MH, Geng L, Wang L, Yang T, Wang X, et al. Perinuclear Anchoring of H3K9-Methylated Chromatin Stabilizes Induced Cell Fate in C. elegans Embryos. Cell. 2015;163:1333–47.View ArticlePubMedGoogle Scholar
- Miao ZH, Player A, Shankavaram U, Wang YH, Zimonjic DB, Lorenzi PL, Liao ZY, Liu H, Shimura T, Zhang HL, et al. Nonclassic functions of human topoisomerase I: genome-wide and pharmacologic analyses. Cancer Res. 2007;67:8752–61.View ArticlePubMedGoogle Scholar
- Fragkos M, Ganier O, Coulombe P, Mechali M. DNA replication origin activation in space and time. Nat Rev Mol Cell Biol. 2015;16:360–74.View ArticlePubMedGoogle Scholar
- Cayrou C, Ballester B, Peiffer I, Fenouil R, Coulombe P, Andrau JC, van Helden J, Mechali M. The chromatin environment shapes DNA replication origin organization and defines origin classes. Genome Res. 2015;25:1873–85.View ArticlePubMedPubMed CentralGoogle Scholar
- Sequeira-Mendes J, Diaz-Uriarte R, Apedaile A, Huntley D, Brockdorff N, Gomez M. Transcription initiation activity sets replication origin efficiency in mammalian cells. PLoS Genet. 2009;5:e1000446.View ArticlePubMedPubMed CentralGoogle Scholar
- Abdurashidova G, Radulescu S, Sandoval O, Zahariev S, Danailov MB, Demidovich A, Santamaria L, Biamonti G, Riva S, Falaschi A. Functional interactions of DNA topoisomerases with a human replication origin. EMBO J. 2007;26:998–1009.View ArticlePubMedPubMed CentralGoogle Scholar
- Gambus A, Jones RC, Sanchez-Diaz A, Kanemaki M, van Deursen F, Edmondson RD, Labib K. GINS maintains association of Cdc45 with MCM in replisome progression complexes at eukaryotic DNA replication forks. Nat Cell Biol. 2006;8:358–66.View ArticlePubMedGoogle Scholar
- Simmons DT, Melendy T, Usher D, Stillman B. Simian virus 40 large T antigen binds to topoisomerase I. Virology. 1996;222:365–74.View ArticlePubMedGoogle Scholar
- Wold MS, Weinberg DH, Virshup DM, Li JJ, Kelly TJ. Identification of cellular proteins required for simian virus 40 DNA replication. J Biol Chem. 1989;264:2801–9.PubMedGoogle Scholar
- Falaschi A, Abdurashidova G. Molecular mechanics and DNA replication regulation. HFSP J. 2007;1:215–9.View ArticlePubMedPubMed CentralGoogle Scholar
- Ginno PA, Lim YW, Lott PL, Korf I, Chedin F. GC skew at the 5′ and 3′ ends of human genes links R-loop formation to epigenetic regulation and transcription termination. Genome Res. 2013;23:1590–600.View ArticlePubMedPubMed CentralGoogle Scholar
- Cadoret JC, Meisch F, Hassan-Zadeh V, Luyten I, Guillet C, Duret L, Quesneville H, Prioleau MN. Genome-wide studies highlight indirect links between human replication origins and gene regulation. Proc Natl Acad Sci U S A. 2008;105:15837–42.View ArticlePubMedPubMed CentralGoogle Scholar
- Necsulea A, Guillet C, Cadoret JC, Prioleau MN, Duret L. The relationship between DNA replication and human genome organization. Mol Biol Evol. 2009;26:729–41.View ArticlePubMedGoogle Scholar
- Delgado S, Gomez M, Bird A, Antequera F. Initiation of DNA replication at CpG islands in mammalian chromosomes. EMBO J. 1998;17:2426–35.View ArticlePubMedPubMed CentralGoogle Scholar
- Cayrou C, Coulombe P, Vigneron A, Stanojcic S, Ganier O, Peiffer I, Rivals E, Puy A, Laurent-Chabalier S, Desprat R, Mechali M. Genome-scale analysis of metazoan replication origins reveals their organization in specific but flexible sites defined by conserved features. Genome Res. 2011;21:1438–49.View ArticlePubMedPubMed CentralGoogle Scholar
- Karnani N, Taylor CM, Malhotra A, Dutta A. Genomic study of replication initiation in human chromosomes reveals the influence of transcription regulation and chromatin structure on origin selection. Mol Biol Cell. 2010;21:393–404.View ArticlePubMedPubMed CentralGoogle Scholar
- Hartono SR, Korf IF, Chedin F. GC skew is a conserved property of unmethylated CpG island promoters across vertebrates. Nucleic Acids Res. 2015;43(20):9729-41.Google Scholar
- Cayrou C, Coulombe P, Puy A, Rialle S, Kaplan N, Segal E, Mechali M. New insights into replication origin characteristics in metazoans. Cell Cycle. 2012;11:658–67.View ArticlePubMedPubMed CentralGoogle Scholar
- Valton AL, Hassan-Zadeh V, Lema I, Boggetto N, Alberti P, Saintome C, Riou JF, Prioleau MN. G4 motifs affect origin positioning and efficiency in two vertebrate replicators. EMBO J. 2014;33:732–46.View ArticlePubMedPubMed CentralGoogle Scholar
- Duquette ML, Handa P, Vincent JA, Taylor AF, Maizels N. Intracellular transcription of G-rich DNAs induces formation of G-loops, novel structures containing G4 DNA. Genes Dev. 2004;18:1618–29.View ArticlePubMedPubMed CentralGoogle Scholar
- Hoshina S, Yura K, Teranishi H, Kiyasu N, Tominaga A, Kadoma H, Nakatsuka A, Kunichika T, Obuse C, Waga S. Human origin recognition complex binds preferentially to G-quadruplex-preferable RNA and single-stranded DNA. J Biol Chem. 2013;288:30161–71.View ArticlePubMedPubMed CentralGoogle Scholar
- Kreuzer KN, Brister JR. Initiation of bacteriophage T4 DNA replication and replication fork dynamics: a review in the Virology Journal series on bacteriophage T4 and its relatives. Virol J. 2010;7:358.View ArticlePubMedPubMed CentralGoogle Scholar
- Kogoma T. Stable DNA replication: interplay between DNA replication, homologous recombination, and transcription. Microbiol Mol Biol Rev. 1997;61:212–38.PubMedPubMed CentralGoogle Scholar
- Maduike NZ, Tehranchi AK, Wang JD, Kreuzer KN. Replication of the Escherichia coli chromosome in RNase HI-deficient cells: multiple initiation regions and fork dynamics. Mol Microbiol. 2014;91:39–56.View ArticlePubMedGoogle Scholar
- Akman G, Desai R, Bailey LJ, Yasukawa T, Dalla Rosa I, Durigon R, Holmes JB, Moss CF, Mennuni M, Houlden H, et al. Pathological ribonuclease H1 causes R-loop depletion and aberrant DNA segregation in mitochondria. Proc Natl Acad Sci U S A. 2016;113:E4276–85.View ArticlePubMedPubMed CentralGoogle Scholar
- Lee DY, Clayton DA. Initiation of mitochondrial DNA replication by transcription and R-loop processing. J Biol Chem. 1998;273:30614–21.View ArticlePubMedGoogle Scholar
- Xu B, Clayton DA. A persistent RNA-DNA hybrid is formed during transcription at a phylogenetically conserved mitochondrial DNA sequence. Mol Cell Biol. 1995;15:580–9.View ArticlePubMedPubMed CentralGoogle Scholar
- Wiedemann EM, Peycheva M, Pavri R. DNA Replication Origins in Immunoglobulin Switch Regions Regulate Class Switch Recombination in an R-Loop-Dependent Manner. Cell Rep. 2016;17:2927–42.View ArticlePubMedGoogle Scholar
- Lombrana R, Almeida R, Alvarez A, Gomez M. R-loops and initiation of DNA replication in human cells: a missing link? Front Genet. 2015;6:158.View ArticlePubMedPubMed CentralGoogle Scholar
- Manzo SG, Zhou ZL, Wang YQ, Marinello J, He JX, Li YC, Ding J, Capranico G, Miao ZH. Natural product triptolide mediates cancer cell death by triggering CDK7-dependent degradation of RNA polymerase II. Cancer Res. 2012;72:5363–73.View ArticlePubMedGoogle Scholar
- Core LJ, Waterfall JJ, Gilchrist DA, Fargo DC, Kwak H, Adelman K, Lis JT. Defining the status of RNA polymerase at promoters. Cell Rep. 2012;2:1025–35.View ArticlePubMedPubMed CentralGoogle Scholar
- Ryba T, Battaglia D, Pope BD, Hiratani I, Gilbert DM. Genome-scale analysis of replication timing: from bench to bioinformatics. Nat Protoc. 2011;6:870–95.View ArticlePubMedPubMed CentralGoogle Scholar