Tracking replication enzymology in vivo by genome-wide mapping of ribonucleotide incorporation - PubMed (original) (raw)

Tracking replication enzymology in vivo by genome-wide mapping of ribonucleotide incorporation

Anders R Clausen et al. Nat Struct Mol Biol. 2015 Mar.

Abstract

Ribonucleotides are frequently incorporated into DNA during replication in eukaryotes. Here we map genome-wide distribution of these ribonucleotides as markers of replication enzymology in budding yeast, using a new 5' DNA end-mapping method, hydrolytic end sequencing (HydEn-seq). HydEn-seq of DNA from ribonucleotide excision repair-deficient strains reveals replicase- and strand-specific patterns of ribonucleotides in the nuclear genome. These patterns support the roles of DNA polymerases α and δ in lagging-strand replication and of DNA polymerase ɛ in leading-strand replication. They identify replication origins, termination zones and variations in ribonucleotide incorporation frequency across the genome that exceed three orders of magnitude. HydEn-seq also reveals strand-specific 5' DNA ends at mitochondrial replication origins, thus suggesting unidirectional replication of a circular genome. Given the conservation of enzymes that incorporate and process ribonucleotides in DNA, HydEn-seq can be used to track replication enzymology in other organisms.

PubMed Disclaimer

Conflict of interest statement

COMPETING FINANCIAL INTERESTS

The authors declare no competing financial interests. The authors declare no competing financial interests.

Figures

Figure 1

Figure 1. Mapping ribonucleotides by HydEn-Seq

(a) HydEn-Seq protocol. The procedure was performed as described in Methods, using the oligonucleotides listed in Supplementary Table 1. (b) Alkaline agarose gel electrophoresis. The analysis was performed as previously described . Genomic DNA samples from the indicated yeast strains (lanes 1–10) were treated with alkali, separated by 1% alkaline agarose gel electrophoresis, and imaged after staining with SYBR Gold. Migration positions of two DNA size standards are indicated. (c) Densitometry scans of the gel image in (b). The Y-axis is scaled to maximum intensity for each pair of lanes. (d) Mean HydEn-seq end counts per haploid genome (see Nends calculation in Methods; error bars represent ranges of two to four independent measurements).

Figure 2

Figure 2. Strand-specific ribonucleotide mapping of chromosome 10

(a) Top, map of chromosome 10 showing the fraction of end reads mapped to the top strand in bins of 200 base pairs, after background subtraction (see Methods). Origin prediction was complicated at chromosome ends (I) and in other highly repetitive regions (II). Middle, an expanded 200 kbp region of chromosome 10. Excursions (in purple) from the simplest polymerase division of labor (Pol α or δ lagging, Pol ε leading) fall into two classes: unexpected Pol α, δ or ε correspondence (III) and Pol α or δ divergence (IV). Bottom, 100 kbp region of chromosome 10. Inter-origin regions are more (e.g. ARS1012-ARS1014) or less (e.g. ARS1011-ARS1012) symmetrical, depending on fork progression rates and origins firing times. Some origins in the origin database have little effect on ribonucleotide strand bias (e.g. ARS1013), indicating either an incorrect call, minority participation in normally growing cells, unidirectional origin firing, or simply later firing, such that forks proceeding from adjacent origins approach to within current detection thresholds. (b) A stylized chromosome with two replication origins, showing the division of polymerase labor, as predicted from the direction of strand bias transitions at origins (compare with the ARS1012-ARS1014 region above). Roughly three quarters of previously confirmed replication origins (orange diamonds in panel (a); S. cerevisiae OriDB) align with abrupt transitions in strand preference (see Methods for quantitation). This allows algorithmic prediction of origins (black diamonds). ARS1013 was not detected via HydEn-seq. It is indicated (orange diamond) but not labeled.

Figure 3

Figure 3. Genome-wide replication origins located by HydEn-Seq

(a) A 60 kbp segment of chromosome 1 showing the fraction of ends, after background subtraction, that mapped to the top strand from Pol ε data (pol2-M644G rnh201Δ; blue) and the fraction mapped to the bottom strand for Pol α and δ data (pol1-Y869A and pol3-L612G, in red and green, respectively). Grey points are the weighted average of the other three data sets in each bin (see Methods). All curves are trend lines smoothed over 10 bins. (b) As per (a), but for all 16 S. cerevisiae chromosomes. Shown for reference are the locations of the URA3 mutational reporter gene (near ARS306; used in our previous studies of leading and lagging strand replication fidelity , and the rDNA locus in chromosome 12 (not drawn to scale; the highly repetitive sequence precludes read mapping).

Figure 4

Figure 4. Distribution of ribonucleotides near origins in RER-deficient strains

(a) Heat maps for the top and bottom strands of the nuclear genome in five different _rnh201_Δ strains, scaled per million reads and centered across a 4 kbp window of the 394 replication origins reported in the yeast origin database . (b) Meta-analysis of strand-specific ribonucleotides at 214 replication origins analyzed in a previous study , again scaled per million reads, in bins of 50 bp.

Figure 5

Figure 5. Ribonucleotide base identity

The proportion of each ribonucleotide base present in the nuclear genome of the three _rnh201_Δ strains encoding the indicated variant replicases. The base composition of the genome is shown on the left. Ribonucleotide proportions were calculated from the most highly strand-biased 10% of the genome (i.e. windows near replication origins; examples in Fig. 2a).

Figure 6

Figure 6. Meta-analysis of ribonucleotides at the nucleosome dyad

(a) Meta-analysis of strand-specific ribonucleotide mapping at 37,888 nucleosome dyads for the _rnh201_Δ strain, scaled per million reads and centered within a 400 bp window. Each dot indicates the number of 5′-DNA ends reads at one base pair. The vertical dotted line indicates the dyad. (b) As in (a) but for the _pol2-M644G rnh201_Δ strain. (c) As in (a) but for the _pol1-Y869A rnh201_Δ strain. (d) As in (a) but for the _pol3-L612G rnh201_Δ strain. The solid lines are the smoothed averages for a sliding window. (e) The base composition surrounding the nucleosome dyad.

Figure 7

Figure 7. HydEn-Seq maps of mitochondrial DNA

Mitochondrial genomes for six strains are shown to indicate the base pair (bp) locations and proportions of strand-specific 5′-DNA ends detected by HydEn-Seq (blue for plus strand, red for minus). Previously assigned replication origins are shaded in green, coding sequences in grey, tRNA genes in orange and genes for other non-coding RNAs in pink. Total mitochondrial end counts are shown for each strain with the number of replicate HydEn-Seq libraries for each in parentheses.

Similar articles

Cited by

References

    1. Johansson E, Dixon N. Replicative DNA polymerases. Cold Spring Harb Perspect Biol. 2013;5 - PMC - PubMed
    1. Georgescu RE, et al. Mechanism of asymmetric polymerase assembly at the eukaryotic replication fork. Nat Struct Mol Biol. 2014;21:664–70. - PMC - PubMed
    1. Burgers PM. Polymerase dynamics at the eukaryotic DNA replication fork. J Biol Chem. 2009;284:4041–5. - PMC - PubMed
    1. Pursell ZF, Isoz I, Lundstrom EB, Johansson E, Kunkel TA. Yeast DNA polymerase epsilon participates in leading-strand DNA replication. Science. 2007;317:127–30. - PMC - PubMed
    1. Lujan SA, et al. Mismatch repair balances leading and lagging strand DNA replication fidelity. PLoS Genet. 2012;8:e1003016. - PMC - PubMed

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources