Transcription initiation activity sets replication origin efficiency in mammalian cells - PubMed (original) (raw)

Transcription initiation activity sets replication origin efficiency in mammalian cells

Joana Sequeira-Mendes et al. PLoS Genet. 2009 Apr.

Abstract

Genomic mapping of DNA replication origins (ORIs) in mammals provides a powerful means for understanding the regulatory complexity of our genome. Here we combine a genome-wide approach to identify preferential sites of DNA replication initiation at 0.4% of the mouse genome with detailed molecular analysis at distinct classes of ORIs according to their location relative to the genes. Our study reveals that 85% of the replication initiation sites in mouse embryonic stem (ES) cells are associated with transcriptional units. Nearly half of the identified ORIs map at promoter regions and, interestingly, ORI density strongly correlates with promoter density, reflecting the coordinated organisation of replication and transcription in the mouse genome. Detailed analysis of ORI activity showed that CpG island promoter-ORIs are the most efficient ORIs in ES cells and both ORI specification and firing efficiency are maintained across cell types. Remarkably, the distribution of replication initiation sites at promoter-ORIs exactly parallels that of transcription start sites (TSS), suggesting a co-evolution of the regulatory regions driving replication and transcription. Moreover, we found that promoter-ORIs are significantly enriched in CAGE tags derived from early embryos relative to all promoters. This association implies that transcription initiation early in development sets the probability of ORI activation, unveiling a new hallmark in ORI efficiency regulation in mammalian cells.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Figure 1

Figure 1. Genomic distribution of ORIs in embryonic stem cells.

(A) ORI distribution at different genomic regions along 10.1 Mb of the mouse genome detected by 300–800 nt long nascent strand hybridisation (n = 97) or both by 300–800 nt and 100–600 nt long nascent strands hybridisation (n = 38). (B) Inter-origin distances along 10.1 Mb of the mouse genome (average = 103 kb; n = 97).(C–E) Distribution of ORIs and promoters in gene-rich versus gene-poor regions. ORI location (C), inter-origin distances (D) and density plots (E), of promoter-rich (chromosome 3 and zone 2 of chromosome X) and promoter-poor regions (zone 1 of chromosome X). The genomic features covered by the array, ORI distribution and percentages of ORI occurrence relative to the annotated genes along the 10.1 Mb and per region examined are summarised in Table S1.

Figure 2

Figure 2. Sensitivity of the ORI identification method.

Array profiles and nascent strand abundance measurements by Q-PCR of 18 positive regions located at 5′ends of genes (A), at less than 200 bp of exons (B), including one at the 3′ UTR of two genes of convergent transcription (ORI 67065), or at intergenic zones (C). Similar analysis was performed for 3 negative regions (D). The maps above each graph show the annotated genomic features and probe distribution of the regions analysed. Blue and red rectangles indicate exons transcribed from the upper or the lower strand, respectively, and black arrows show the position of the major annotated TSS. Grey rectangles represent array probes. The red dashed line depicts the threshold of the array duplicates. Q-PCR experiments were carried out in duplicate in at least two independent preparations of 300–800 nt long nascent strands and values were normalised to the flanking primer pair detecting the lowest amount of nascent strands at each region. Standard deviation bars are indicated. Primer pairs were designed to amplify across the array probes in all possible cases and their sequences are shown in Table S3. ORI 67276 corresponds to the CpG island region of the Mecp2 gene.

Figure 3

Figure 3. Replication initiation activity at CpG island-ORIs.

(A) Array profiles of 9 CpG island-ORIs. Symbols are like in Figure 2.(B) Q-PCR measurements of nascent strands abundance across the positive probes defining the ORIs shown in A in preparations of replication intermediates of the indicated sizes. Primer pairs span less than 2 kb at each region and values were normalised to the average of those obtained at the three negative regions in each gradient fraction. Numbers below each panel indicate fold enrichment of the ORI peak relative to the averaged negative regions. Primer pair sequences are listed in Table S3.

Figure 4

Figure 4. Replication initiation activity at non-promoter-ORIs.

Same analysis as on Figure 3 for 10 non promoter-ORI regions.

Figure 5

Figure 5. ORI specification and firing efficiency across cell types.

Relative abundance of 9 CpG island-ORI regions and 10 non promoter-ORI regions in 300–800 nt long nascent strands derived from ES cells, MEFs and NIH/3T3 fibroblasts. Averaged values for the non-ORI regions were considered as baseline in each cell type.

Figure 6

Figure 6. Organisation of replication and transcription initiation at promoter-ORIs.

Maps indicate the number and position of the CAGE tags annotated at CpG island promoter-ORIs with unidirectional (A) and with alternative or bidirectional transcriptional activity (B) . Blue and red arrows indicate transcription from the upper or the lower strand, respectively, and brackets show the position of the CpG islands. Graphs show the nascent strand profiles of the arrays hybridised with 300–800 nt preparations along the same regions. Other symbols are like in Figure 2. (C) Analysis of the TSS and nascent strand profiles at the Flna and Tbx15 loci.

Figure 7

Figure 7. Prediction of novel TSS and association with embryonic transcription.

(A) Enrichment for H3K4me3 and H3K9,14ac modifications relative to total H3 detected by ChIP. Values for the regions flanking Mecp2 and Zad20d1 CpG island-ORIs (ORIs 67276 and 105455, respectively) were considered as baseline. Q-PCR reactions were carried out in duplicate in three independent preparations of immunoprecipitated material. Standard deviation bars are indicated. (B) Expression levels relative to empty vector in transient transfection reporter assays. Constructs carrying the Notch2 and Aprt promoters cloned in the sense orientation were used as positive controls and a region at the first intron of the Notch2 gene cloned in both orientations as the negative one. Histograms represent the averaged normalised values of two independent transfections carried out in duplicate. Standard deviation bars are indicated. Primer pair sequences used and the sizes of the cloned fragments are listed in Table S3. (C) Frequency of promoter-ORIs relative to the number of mapped CAGE tags at each TSS . Grey bars represent ORIs identified by the strict algorithm (n = 40) and white bars ORIs identified when applying a less stringent algorithm (n = 75). (D) Frequency of total promoters or promoter-ORIs transcriptionally active in early development . A chi-square test was used to compare the frequency of tagged promoters between promoter-ORIs identified by the algorithms and the rest of promoters.

Similar articles

Cited by

References

    1. Arias EE, Walter JC. Strength in numbers: preventing rereplication via multiple mechanisms in eukaryotic cells. Genes Dev. 2007;21:497–518. - PubMed
    1. Diffley JF. Regulation of early events in chromosome replication. Curr Biol. 2004;14:R778–786. - PubMed
    1. Tabancay AP, Jr, Forsburg SL. Eukaryotic DNA replication in a chromatin context. Curr Top Dev Biol. 2006;76:129–184. - PubMed
    1. Remus D, Beall EL, Botchan MR. DNA topology, not DNA sequence, is a critical determinant for Drosophila ORC-DNA binding. Embo J. 2004;23:897–907. - PMC - PubMed
    1. Vashee S, Cvetic C, Lu W, Simancek P, Kelly TJ, et al. Sequence-independent DNA binding and replication initiation by the human origin recognition complex. Genes Dev. 2003;17:1894–1908. - PMC - PubMed

Publication types

MeSH terms

LinkOut - more resources