Eukaryotic core promoters and the functional basis of transcription initiation

Author manuscript; available in PMC: 2019 Apr 1.

Published in final edited form as: Nat Rev Mol Cell Biol. 2018 Oct;19(10):621–637. doi: 10.1038/s41580-018-0028-8

Abstract

RNA polymerase II (Pol II) core promoters are specialized DNA sequences at transcription start sites of protein-coding and non-coding genes that support the assembly of the transcription machinery and transcription initiation. They enable the highly regulated transcription of genes by selectively receiving and integrating regulatory cues from distal enhancers and associated regulatory proteins. In this Review we discuss the defining properties of gene core promoters, including their sequence features, chromatin architecture, and transcription initiation patterns. We provide an overview of molecular mechanisms underlying the function and regulation of core promoters and their emerging functional diversity, which defines distinct transcription programmes. Based on the established properties of gene core promoters, we discuss transcription start sites within enhancers and integrate recent results obtained from dedicated functional assays to propose a functional model of transcription initiation. This model can explain the nature and function of transcription initiation at gene starts and at enhancers and the different functional roles of core promoters, of RNA polymerase II and its associated factors and of the activating cues provided by enhancers and the transcription factors and cofactors they recruit.

Introduction

The development of complex organisms with many morphologically and functionally diverse cell types from a single cell is largely determined by the genetic information contained within genomic DNA1,2. This genetic information includes both protein-coding sequences of genes and non-coding regulatory elements that govern when, where and to what level each gene will be expressed. Regulated gene expression is essential for the integrity of all eukaryotic cells and organisms3, has a central role in cell differentiation and metabolism, and its disruption leads to disease4.

Gene expression starts with transcription, the copying of a DNA sequence into an RNA transcript by RNA polymerase II (Pol II), which transcribes all protein-coding and many non-coding genes. Transcription typically initiates at a defined position, the transcription start site (TSS), at the 5’ end of a gene, which we refer to as the gene start. The TSS is embedded within a core promoter, which is a short sequence encompassing ~50 base-pairs (bp) upstream and ~50 bp downstream of the TSS (FIG. 1a). The core promoter serves as a binding platform for the transcription machinery, which comprises Pol II and its associated general transcription factors (GTFs)5. Core promoters are sufficient to direct transcription initiation6, but generally have low basal activity, which can be further suppressed by chromatin or activated by often more distally located regulatory elements called enhancers1,7,8. Enhancers bind regulatory proteins known as transcription factors and recruit transcription cofactors (reviewed in REFS 1,9), and can increase transcription from a core promoter independent of their relative distance and orientation1,7,8. More recently, this traditional view of gene expression and the role of enhancers and core promoters have been challenged by the observation that many genomic positions outside annotated gene starts initiate transcription, including positions within enhancers (FIG. 1b).
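The definition of the core promoter as a window of roughly 50 bp on either side of the TSS can be illustrated with a minimal sketch. The function name, the 1-based inclusive coordinate convention and the choice to count the TSS itself as the first downstream position are assumptions made for illustration; they are not taken from the text.

```python
def core_promoter_window(tss, strand, upstream=50, downstream=50):
    """Return (start, end) of the core promoter in 1-based inclusive coordinates.

    tss    : 1-based genomic position of the first transcribed nucleotide (+1)
    strand : '+' or '-'
    The TSS is counted as the first of the `downstream` positions (assumption).
    """
    if strand == "+":
        start, end = tss - upstream, tss + downstream - 1
    elif strand == "-":
        # On the minus strand, "upstream" means higher genomic coordinates.
        start, end = tss - downstream + 1, tss + upstream
    else:
        raise ValueError("strand must be '+' or '-'")
    # Clip at the chromosome start; the chromosome end is not known here.
    return max(1, start), end
```

With these conventions, `core_promoter_window(1000, "+")` returns `(950, 1049)` and `core_promoter_window(1000, "-")` returns `(951, 1050)`, a 100 bp window in both cases.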

Figure 1. Properties and function of core promoters and enhancers.

(a) The traditional view of transcription initiation postulates that transcription initiates at gene core promoters, which recruit the transcription machinery consisting of RNA polymerase II (Pol II) and general transcription factors (GTFs), thereby leading to the formation of the pre-initiation complex (PIC) and transcription initiation. Transcription from core promoters is activated by enhancers, which can be located distally and bind sequence-specific transcription factors (TFs), which recruit cofactors (COFs) that convey the activating cues to the PIC at the core promoter. (b) Active enhancers exhibit divergent transcription of short, unstable enhancer RNAs (eRNAs) from two separate transcription start sites (TSSs) located at the edges of the nucleosome-depleted region where the enhancer resides. (c) Promoters produce long, stable mRNAs from a gene core promoter in the sense direction (orientation of the gene) and short, unstable upstream antisense RNAs (uaRNAs) from the upstream edge of a nucleosome-depleted region that contains the transcription factor-bound proximal promoter. Separate pre-initiation complexes drive unidirectional transcription from each of the two TSSs.

Genome-wide transcription initiation

Sites of transcription initiation can be identified using various methods that capture the 5’ ends of Pol II transcripts by exploiting their characteristic properties. For example, cap analysis of gene expression (CAGE)10 and similar 5’ end-capture approaches11,12 take advantage of the cap structure at the 5’ end of Pol II transcripts to detect the TSS and RNA abundance. Complementary methods use properties of nascent transcripts associated with Pol II to detect their TSSs and assess their transcription rates13–16, thereby distinguishing true initiation events from sites of potential post-transcriptional cleavage and recapping17.

Applying such large-scale approaches to map TSSs genome-wide in different cell types of various model organisms12,18–22 is not only building comprehensive catalogues of gene TSSs and the regulation of transcription initiation, but has revealed the pervasive transcription of eukaryotic genomes23,24. Transcription initiation at many positions distal to annotated gene starts, especially at enhancers, is challenging the traditional model of gene expression, which has implied that transcription is initiated specifically at gene core promoters and regulated by distally located enhancers14,15,25,26 (FIG. 1a).

Transcription initiation at enhancers

Widespread transcription of mammalian enhancers was detected in many cell types14,25–28, and the production of enhancer RNAs (eRNAs) was suggested to be predictive of active enhancers26,29. Indeed, eRNA transcription correlates with target gene transcription in inducible systems30,31 and in different cell types26, and often, though not always, precedes the target-gene activation29,31.

Transcription from enhancers is often bi-directional15,26 and initiates at two distinct sites, which drive divergent transcription from the edges of a nucleosome-depleted region (NDR) that is established at active enhancers (FIG. 1b). However, unlike gene core promoters, which support the production of stable transcripts, enhancers mainly produce short, unstable transcripts in both directions15,32.

Antisense transcription at promoters

Bi-directional transcription was also detected at promoters, where the transcription of protein-coding genes is often coupled with the transcription of short non-coding RNAs in the reverse orientation15,33–36. These antisense transcripts, known as promoter upstream transcripts (PROMPTs) or upstream antisense RNAs (uaRNAs), are transcribed by separate Pol II complexes from divergently oriented TSSs located at the upstream edge of the nucleosome-depleted proximal promoter region that contains transcription-factor binding sites37,38 (FIG. 1c). Similar to eRNAs, these antisense transcripts are typically unstable, though some promoters seem to produce long and polyadenylated divergent transcripts39,40.

The observed divergent transcription at promoter and enhancer regions, together with other similarities, prompted the proposal of a unified architecture of transcription initiation at those elements15,41,42. According to this model, promoters and enhancers both initiate transcription similarly, but only at gene promoters are transcripts stabilized post-initiation by the presence of 5’ splice sites and by the absence of premature polyadenylation signals15,43,44.

In this Review, we first summarize the insights obtained from studying core promoters of annotated genes and then discuss to what extent the properties of these bona fide core promoters can be found at TSSs within other genomic regulatory elements, including enhancers. This order of discussion reflects the notion that gene core promoters have specifically evolved to initiate stable transcripts in a highly regulated manner, whereas the cause and the role of transcription initiation outside gene starts have remained unclear. We further discuss the assembly and activation of the transcription machinery at core promoters and how this machinery is regulated by distal enhancers via transcription factors and cofactors. Finally, we integrate these established promoter properties with recent results from dedicated functional assays to propose a functional model of transcription initiation that can account for transcription from promoters and from enhancers based on these elements’ sequence-encoded activities.

Properties of gene core promoters

Mapping endogenous transcription initiation sites14–16,19–22,45 has characterized different features of core promoters, including their diverse sequence and chromatin properties and the (focused or dispersed) distribution of transcription initiation sites, which together define three different types of core promoters46 (BOX 1).

Box 1. Transcription initiation patterns and core-promoter types.

The comprehensive mapping of gene core promoters has revealed several transcription initiation patterns and sequence and chromatin properties.

Dichotomy of the promoter shape

Mapping endogenous transcription initiation at single-nucleotide resolution revealed striking differences between core promoters45,58, leading to the classification of ‘focused’ or ‘sharp’ core promoters, which have a single, well-defined transcription start site (TSS; see figure, part a) and ‘dispersed’ or ‘broad’ promoters45, which have multiple closely spaced TSSs that are used with similar frequency (see figure, part b). These transcription initiation patterns (or promoter shapes) are found across species, including in fish21 and fly12,19,68, and are associated with distinct gene categories: focused initiation preferentially occurs in core promoters of highly cell-type-specific genes with restricted expression patterns, whereas dispersed initiation is mainly associated with housekeeping genes expressed in many cell types19,22,45,68 and in mammals with CpG-island (CGI)-overlapping promoters of regulators of development.
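One way to make the distinction between focused and dispersed initiation concrete is to measure how tightly the 5’-end tags of a promoter cluster around a single position. The sketch below uses an interquantile width (the span holding the central 10% to 90% of tags) and a cut-off of 10 bp; both the metric and the cut-off are assumptions chosen for illustration and are not the classification criteria of the studies cited above.

```python
def interquantile_width(tag_counts, lower=0.1, upper=0.9):
    """Width (bp) of the region holding the central share of 5'-end tags.

    tag_counts : dict mapping genomic position -> number of 5'-end tags
                 (for example, CAGE tags in one promoter cluster)
    """
    total = sum(tag_counts.values())
    if total <= 0:
        raise ValueError("no tags in this cluster")
    cumulative = 0
    low_pos = high_pos = None
    for pos in sorted(tag_counts):
        cumulative += tag_counts[pos]
        # First position at which the lower quantile of tags is reached.
        if low_pos is None and cumulative >= lower * total:
            low_pos = pos
        # First position at which the upper quantile of tags is reached.
        if cumulative >= upper * total:
            high_pos = pos
            break
    return high_pos - low_pos + 1


def promoter_shape(tag_counts, max_focused_width=10):
    """Label a cluster 'focused' or 'dispersed' (hypothetical 10 bp cut-off)."""
    width = interquantile_width(tag_counts)
    return "focused" if width <= max_focused_width else "dispersed"
```

A cluster in which one position carries almost all tags yields a width of 1 bp and is labelled focused, whereas a cluster with tags spread evenly over several tens of base pairs is labelled dispersed.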

Three types of core promoters

Based on different properties, including initiation pattern, sequence composition and motifs, chromatin configuration and gene function, three main types of core promoters in metazoa have been proposed46: (1) Core promoters with sharp initiation patterns, imprecisely positioned nucleosomes89 and TATA-box and Inr motifs (see figure, part a). These promoters tend to have key regulatory elements near their TSSs235 and are active in terminally differentiated cells in adult tissues, in which case they acquire histone H3 Lys 4 trimethylation (H3K4me3) and H3 Lys 27 acetylation (H3K27ac), which are associated with active transcription. (2) Core promoters of broadly expressed housekeeping genes, which are associated with dispersed transcription initiation19,45 and a well-defined nucleosome-depleted region (NDR) flanked by precisely positioned nucleosomes89 marked by H3K4me3 and H3K27ac (see figure, part b). In mammals, these core promoters overlap individual CGIs45; in flies they are enriched in a specific set of variably positioned motifs including Ohler1, Ohler6 and DNA replication-related element (DRE)68. (3) Core promoters of key developmental transcription factors involved in patterning and morphogenesis. In mammals they resemble housekeeping-gene core promoters, but in embryonic stem cells they are distinctly marked bivalently with both H3K4me3 and the repressive modification H3K27me3 (REF. 236; see figure, part c). This presumably primes them for activation in the correct cell lineage and for silencing in all other cells. In mammals such ‘poised’ promoters are associated with long individual CGIs or multiple CGIs75 and often produce long non-coding divergent transcripts39,40. In flies, promoters of this class tend to contain a downstream promoter element (DPE) and have focused initiation62. Both in mammals and flies, they are often surrounded by arrays of highly conserved non-coding elements, which might act as distal enhancers62,75.

Box 1 figure.

Sequence properties

By definition, the main task of core promoters is to support the assembly of the pre-initiation complex (PIC), which consists of Pol II and GTFs, and to guide transcription initiation from precise positions at defined levels6. The important role of the core promoter sequence in conferring these functions was recently corroborated by analyzing single nucleotide polymorphisms and other genetic variants, which across different fruit fly strains affected both transcription levels and TSS choice within core promoters47. These variations were found to often disrupt crucial sequence features known as core-promoter motifs, many of which are known to recruit GTFs and mediate PIC assembly (Table 1).

Table 1. Known core-promoter motifs and the (general) transcription factors that bind to them.

Core-promoter motifs

Several core-promoter motifs have fixed positioning relative to a single, well-defined TSS. For example, the well-known TATA-box motif48,49 is located ~30 bp upstream of a single dominant TSS50 in ‘focused’ core promoters (BOX 1). Although the TATA-box is conserved from yeast to human, it is found only in a minority of core promoters, for instance ~5% in fly51,52. The TATA-box is recognized and bound by the TATA-box binding protein53 (TBP; Table 1), one of the components of the Transcription Factor IID (TFIID) complex, a GTF that mediates Pol II recruitment and PIC assembly54,55 and thereby might determine TSS choice at a fixed downstream position.

Another core promoter motif with a fixed position relative to transcription initiation is the Initiator (Inr) motif, which directly overlaps the TSS56. The Inr is more abundant than the TATA-box52 but is not universal, and its consensus sequence differs between fly and human. The fly Inr motif is longer, more information-rich and encompasses several nucleotides that are adjacent to the TSS and were shown to serve as a binding site for additional components of TFIID57 (Table 1). By contrast, human Inr was initially defined as pyrimidine (C or T) followed by a purine (A or G), positioned such that the purine is the first transcribed nucleotide45. However, more recently a human Inr motif with higher information content was found in focused core promoters, and several nucleotides outside the dinucleotide core motif were suggested to be important for transcription initiation in vitro58 (Table 1).
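The positional constraints of the TATA-box (roughly 30 bp upstream of the TSS) and of the minimal human Inr (a pyrimidine at -1 followed by a purine at +1) lend themselves to a simple sequence check. The sketch below is illustrative only: the TATA-box pattern `TATAWAWR` is an assumed, simplified consensus rather than the motif definition of Table 1, and the search window of -40 to -20 and all function names are hypothetical.

```python
import re

# Assumed, simplified TATA-box consensus (W = A/T, R = A/G); not from Table 1.
TATA_BOX = re.compile(r"TATA[AT]A[AT][AG]")


def has_human_inr_core(seq, tss_index):
    """True if a pyrimidine (C/T) at -1 is followed by a purine (A/G) at +1.

    seq       : sense-strand DNA sequence
    tss_index : 0-based index in `seq` of the first transcribed nucleotide (+1)
    """
    seq = seq.upper()
    if tss_index < 1 or tss_index >= len(seq):
        return False
    return seq[tss_index - 1] in "CT" and seq[tss_index] in "AG"


def tata_box_offsets(seq, tss_index, window=(-40, -20)):
    """Start positions of TATA-box matches relative to the TSS.

    Offsets are negative upstream (-1 is the base just before the TSS);
    only matches starting inside `window` (hypothetical bounds) are kept.
    """
    seq = seq.upper()
    offsets = [m.start() - tss_index for m in TATA_BOX.finditer(seq)]
    return [o for o in offsets if window[0] <= o <= window[1]]
```

A promoter with a match starting near -30 and a positive Inr check would resemble the focused, TATA-box-containing class described above; the richer fly and human Inr consensus sequences would need position weight matrices rather than this dinucleotide test.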

In promoters that lack a TATA-box, the Inr is often accompanied by another motif, the downstream promoter element (DPE), which is positioned downstream of the TSS59 (Table 1). The DPE motif was initially discovered in fly and, based on the investigation of individual promoters, was suggested to also be present in human60, even though it was never found over-represented in human promoters45,52. Several subunits of TFIID are suggested to bind DPE, and a strict requirement for Inr–DPE spacing is thought to be essential for cooperative binding of TFIID55,60. Since in fly TATA-box and DPE rarely co-occur, they were suggested to be associated with functionally distinct groups of genes51,52,61,62 (BOX 1).

In addition to these three most abundant core-promoter motifs, other motifs with defined positions relative to the TSS include the motif ten element (MTE)63 in fly, TFIIB recognition elements (BREs)64,65 and downstream core elements (DCE)66 in human. These motifs are bound by specific GTFs in vitro64,67 (Table 1), thus potentially mediating PIC recruitment and assembly. Furthermore, analysis of large collections of core promoters allowed the computational definition of over-represented sequences, leading to the discovery of other motifs without apparent spacing requirements relative to the TSS51,52. In flies, these include Ohler motifs 1, 6 and 7, and DNA replication-related element (DRE), which were found mainly in promoters with dispersed initiation patterns associated with housekeeping genes51,68 (BOX 1).

The described core-promoter motifs are over-represented in gene core promoters and are more rarely associated with non-genic initiation sites. Some enhancer TSSs and promoter antisense TSSs contain weak or degenerate forms of TATA-box or Inr motifs15,26,38, and the closer such motifs are to the consensus, the more promoter-like the enhancers are69 (see below).

The discovery of core-promoter motifs and their importance for transcription initiation has motivated the design of synthetic core promoters that efficiently assemble the PIC and support high levels of transcription initiation for transgene expression in both fly and human systems70–72. Such promoters are also often used for biochemical and structural characterization of the PIC.

Characteristic (di)nucleotide composition

Apart from defined sequence motifs, gene core promoters often have distinct nucleotide compositions. For example, in vertebrates many core promoters overlap with CpG islands (CGIs), which are regions with elevated GC content and a high density of CpG dinucleotides73. CGI promoters typically lack defined motifs and are mainly associated with housekeeping genes45,74 or key developmental regulators involved in embryo patterning and morphogenesis75 (BOX 1). The mechanisms by which CGIs confer core promoter function are still unknown.
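The two properties that define a CGI, elevated GC content and a high density of CpG dinucleotides, can be quantified directly from sequence. The sketch below computes GC content and the ratio of observed to expected CpG dinucleotides, where the expectation assumes that C and G occur independently. The thresholds in `looks_like_cgi` (length of at least 200 bp, GC content of at least 0.5, observed/expected ratio of at least 0.6) are commonly used heuristic values adopted here as assumptions; they are not stated in the text.

```python
def cpg_stats(seq):
    """Return (GC content, CpG observed/expected ratio) for a DNA sequence.

    Expected CpG count = (#C * #G) / length, i.e. what independent
    occurrence of C and G would give.
    """
    seq = seq.upper()
    n = len(seq)
    if n == 0:
        raise ValueError("empty sequence")
    c, g = seq.count("C"), seq.count("G")
    cpg = seq.count("CG")  # "CG" cannot overlap itself, so count() is exact
    gc_content = (c + g) / n
    expected = (c * g) / n
    obs_exp = cpg / expected if expected > 0 else 0.0
    return gc_content, obs_exp


def looks_like_cgi(seq, min_len=200, min_gc=0.5, min_obs_exp=0.6):
    """Heuristic CGI test; all three thresholds are assumptions."""
    gc_content, obs_exp = cpg_stats(seq)
    return len(seq) >= min_len and gc_content >= min_gc and obs_exp >= min_obs_exp
```

In practice such a test would be applied in sliding windows along a promoter region rather than to a single sequence.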

Characteristic patterns of dinucleotide composition have also been found downstream of the TSS, where A- or T-containing dinucleotides occur in periodic patterns21,76. The similarity between such patterns and the preferential sequence composition reported to underlie nucleosomal DNA77–79 suggests a close connection between nucleosome positioning and TSS positions, especially at core promoters that lack motifs and have broad initiation patterns21,22,76.

Chromatin configuration

While most genomic DNA shows limited accessibility as it is wrapped around histone octamers to form nucleosomes, active core promoters are devoid of nucleosomes, which makes them accessible and allows PIC assembly and Pol II recruitment. Indeed, NDRs flanked by precisely positioned and phased downstream nucleosomes are hallmarks of active core promoters in all eukaryotic cells80–82. However, recent studies suggested that such NDRs might not be depleted of nucleosomes but rather occupied by highly dynamic nucleosomes containing the histone variants H3.3 and H2A.Z83, and other non-canonical or partial nucleosomal particles84–86. These features were proposed to ensure accessibility of the transcription machinery and associated factors to DNA, suggesting that nucleosome occupancy and accessibility to DNA at core promoters are not necessarily mutually exclusive87,88.

Promoters with different initiation patterns differ in chromatin architecture and nucleosome positioning: dispersed promoters have more clearly defined NDRs and are associated with well-positioned nucleosomes downstream of the TSS89 (BOX 1). Similarly, in yeast two distinct types of promoters can be distinguished by the presence of either fragile nucleosomes or stably positioned nucleosomes, which correlates with distinct underlying sequences90.

Despite the obvious correlation between open, accessible chromatin and active transcription from promoters, the causal relationship between the two is still not clear. There is evidence that some transcription factors, sometimes called pioneer factors91, can bind to closed chromatin and recruit chromatin remodelling factors to open the chromatin, thereby allowing Pol II binding and transcription initiation92,93 (reviewed in REF. 9). Similarly, the presence of H2A.Z in the first downstream (+1) nucleosome is believed to decrease the barrier this nucleosome imposes on transcribing Pol II94. A complementary possibility is that a low level of transcription by Pol II is required to keep the chromatin open and allow transcription factors to bind38,95,96. These mechanisms are not mutually exclusive and they are likely combined, presumably with different contributions at different types of core promoters96. H3.3, for example, appears to be both downstream and upstream of transcription: it is deposited into nucleosomes independently of DNA replication97 preferentially at promoters and enhancers98, where it replaces the canonical H3 histone that is ejected during transcription. Once it accumulates at promoters, it could facilitate subsequent rounds of transcription98.

Post-translational histone modifications

Another prominent feature of promoter-associated chromatin is the presence of specific post-translational modifications of histones99,100. Nucleosomes downstream of active promoters bear tri-methylation of histone H3 Lys 4 (H3K4me3) and acetylation of H3 Lys 27 (H3K27ac)100 (BOX 1). Whether and how these modifications contribute to promoter function is unclear. In budding yeast, for example, H3K4 methylation occurs downstream of transcription and is mediated by the recruitment of histone-lysine N-methyltransferase, H3 lysine-4 specific (SET1) by the transcribing Pol II (REF. 101). H3K4me3 was suggested to provide a memory (‘bookmark’) of recent transcriptional activity, thereby facilitating new rounds of transcription101. However, the rapid and complete loss of H3K4me3 and transcription in the absence of transcription activators suggests that H3K4me3 alone is not sufficient to maintain active transcription102. A bookmarking function was also proposed for H4K5ac, which can recruit the transcriptional cofactor bromodomain-containing protein 4 (BRD4) and facilitate post-mitotic re-activation of a previously active genomic locus103. Histone acetylation might work through decreasing the affinity of DNA to nucleosomes and promoting open chromatin, similar to acetylation of the histone core104–106, or by directly providing binding sites for cofactors that bind acetylated lysine residues, such as BRD4107.

Although H3K4me3 and H3K27ac correlate strongly with transcriptional activity, whether they are causally involved in transcription is not clear. H3K4me3 seems dispensable for transcription in flies, since cells containing non-methylatable forms of both canonical and variant H3 histones show regulated transcription108,109. Similarly, cells with a Lys-to-Arg mutation at position 27 on canonical histone H3 exhibit de-repression of Polycomb silenced genes, implying that transcription does not require Lys 27 acetylation at canonical H3 (REF. 110). This suggests that Lys 27 acetylation of the histone variant H3.3 is important or that histone acetylation is only a by-product of the acetyltransferases P300/CBP, whose relevant targets could include transcription factors111–113 and the Pol II complex itself114. Such data, together with recent studies that found the pervasive enhancer mark H3K4me1 to be dispensable for enhancer activity115,116, caution against attributing functions to histone modifications based purely on correlation and emphasize the need for functional studies to discern causation from correlation117.

A striking example of histone modifications that causally direct transcription was recently found at Piwi-interacting RNA (piRNA) source loci in fly heterochromatin. Transcription of these loci is carried out by an alternative transcription machinery that is specifically recruited to the heterochromatin mark H3K9me3 through the H3K9me3 reader heterochromatin protein 1 (HP1; REF. 118). Although this shows that histone modifications associated with bona fide core promoters are not necessarily required for transcription, it also demonstrates that in principle modified histones are able to modulate transcription.

Transcription initiation at promoters

Transcription from gene core promoters is a step-wise process that results in a defined transcriptional output. Understanding the molecular mechanisms underlying each of the individual steps is essential for understanding their activation by distal cues.

Role of the pre-initiation complex

Assembly of the PIC at core promoters and initiation of transcription involves six GTFs, which recognize and bind core promoter elements, recruit Pol II and activate it for productive transcription119 (FIG. 2a). A sequential model of PIC assembly, proposed based on biochemical and structural studies, includes the recognition of core-promoter elements by TFIID, binding of TFIIA and TFIIB, recruitment of the Pol II–TFIIF complex, and finally the binding of TFIIE followed by TFIIH (reviewed in REFS 120,121). This model was further supported by a recent single-molecule imaging study that provided additional insight into the dynamics of GTF binding122. PIC assembly is followed by DNA-duplex melting and the formation of an open PIC, which supports the synthesis of the first nucleotides of the nascent transcript, after which Pol II is released from the core promoter and the GTFs that bind it (‘promoter escape’; FIG. 2b). High-resolution structures of both closed and open PICs, including double-stranded and melted DNA, respectively, revealed contacts between individual GTFs and core promoter DNA and shed light on the molecular events leading to PIC assembly, promoter opening and transcription initiation at core promoters55,123,124.

Figure 2. Regulation of different steps of transcription from core promoters.

(a) Pre-initiation complex (PIC) assembly and RNA polymerase II (Pol II) recruitment. The first step of transcription initiation is the assembly of the PIC consisting of Pol II and six general transcription factors (GTFs): transcription factor IIA (TFIIA), TFIIB, TFIID, TFIIE, TFIIF and TFIIH (left). Enhancers can promote PIC assembly by recruiting transcription factors (TFs) and cofactors (COFs) that directly interact with GTFs or Pol II (right). (b) Initiation by Pol II and ‘promoter escape’. After PIC assembly, the DNA duplex at core promoters melts (not shown) and allows Pol II to initiate transcription at the transcription start site (TSS). To continue transcribing, Pol II has to dissociate (escape) from the TSS-binding GTFs, which is mediated by phosphorylation of Ser 5 and Ser 7 of the Pol II carboxy-terminal domain (CTD) by TFIIH. Enhancers can aid this process by recruiting cofactors such as the Mediator complex (MED) or the acetyltransferase CBP/P300 (see main text for these and other cofactors’ functions). (c) Pol II promoter-proximal pausing. After escaping from the TSS, Pol II synthesizes a short stretch of nascent RNA (30-50 nucleotides) and then pauses downstream of the TSS. DRB sensitivity inducing factor (DSIF) and negative elongation factor (NELF) bind to Pol II and the nascent RNA and promote Pol II pausing. Pause-release is mediated by cyclin-dependent kinase 9 (CDK9), which is a subunit of the positive transcription elongation factor b (P-TEFb) that phosphorylates DSIF, NELF and Ser 2 of the Pol II CTD. This leads to dissociation of NELF and entry of Pol II into productive elongation. Enhancers promote this process by recruiting cofactors that either recruit and stimulate CDK9 or directly affect pause-release, such as BRD4 and P300. (d) Regulation of transcription bursting. Transcription occurs in short ‘bursts’, which comprise groups of initiation events separated by periods of inactivity. The core promoter sequence determines burst size, that is, the number of transcribing Pol II molecules per burst (left), whereas enhancers increase bursting frequency from their target core promoter (right). ‘+’ denotes target activation and ‘-’ denotes target inhibition.

Both biochemical and structural studies agree that TFIID has a central role in recognizing and binding core-promoter elements and nucleating PIC assembly. In addition, TFIID selectively binds H3K4me3, thereby enabling cross-talk between chromatin and PIC assembly125. Apart from regulating accessibility to DNA (reviewed in REF. 9), TFIID recruitment is therefore the first step at which transcription can be regulated and indeed, some transcription factors can bind and potentially recruit TFIID to core promoters126–128. In addition, TFIID composition might also influence transcription. Canonical TFIID consists of TBP and TBP-associated factors54, which can be replaced by different paralogs to form alternative TFIID complexes (reviewed in REFS 129–132). For example, TBP-related factor 2 (TRF2) substitutes TBP at promoters of many housekeeping genes and is essential for their activation133–135.

As biochemical and structural studies of PIC assembly and function typically consider only a few well-defined or synthetic core promoters that contain canonical core promoter motifs55,70, the mechanism of GTF recruitment and regulation at other types of core promoters is unclear and might differ. Indeed, mapping the binding sites of various PIC components genome-wide in yeast revealed a distinct interplay between the PIC and nucleosomes at promoters containing strong TATA-box motifs versus those with only weak or no TATA-box motifs136. In yeast, the presence of a strong TATA-box has been used to distinguish between SAGA complex-dominated and TFIID-dominated promoters137,138. SAGA-dominated promoters more often contain strong TATA-box motifs and are associated with genes responsive to stress, whereas TFIID-dominated promoters are depleted of such strong TATA-box motifs137,138. However, the two complexes might not be mutually exclusively employed at distinct types of promoters, but regulate different steps that are more or less rate-limiting at the different promoter types138,139. This is consistent with recent observations that the transcription of nearly all yeast genes depends to some extent on TFIID140 and that SAGA is involved in regulating both TATA-containing and TATA-less promoters139.

RNA Polymerase II pausing

At many genes, once Pol II has cleared from the TSS, it transcribes only 30-50 nucleotides downstream of the TSS and then undergoes promoter-proximal pausing141–143 (FIG. 2c). Paused Pol II was initially detected at heat-shock-responsive genes in their inactive state144 and shown to be rapidly released into productive elongation upon heat-shock145, thereby enabling strong and rapid gene activation. Release from promoter-proximal pausing involves phosphorylation by the cyclin-dependent kinase 9 (CDK9) subunit of the positive transcription elongation factor b (P-TEFb) of several components of the paused transcription elongation complex, including negative elongation factor (NELF), DRB sensitivity inducing factor (DSIF) and Pol II itself145,146 (FIG. 2c).
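The extent of promoter-proximal pausing at a gene is often summarized as the ratio of Pol II signal density near the TSS to the density over the gene body. The sketch below shows this arithmetic under stated assumptions: the metric itself (commonly called a pausing index), the window definitions and the function name are not taken from the text, and the read counts are expected to come from a Pol II occupancy or nascent-transcript assay of the reader's choice.

```python
def pausing_index(proximal_reads, proximal_len, body_reads, body_len):
    """Ratio of promoter-proximal to gene-body read density.

    proximal_reads / proximal_len : reads and length (bp) of the window around
                                    the pause site (window choice is an assumption)
    body_reads / body_len         : reads and length (bp) of the gene-body window
    """
    if proximal_len <= 0 or body_len <= 0:
        raise ValueError("window lengths must be positive")
    proximal_density = proximal_reads / proximal_len
    body_density = body_reads / body_len
    if body_density == 0:
        # No gene-body signal: the ratio is unbounded, or undefined if both are zero.
        return float("inf") if proximal_density > 0 else float("nan")
    return proximal_density / body_density
```

For example, 300 reads in a 100 bp proximal window and 1,000 reads across a 4,000 bp gene body give densities of 3.0 and 0.25 reads per bp and hence an index of 12.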

The prevalence and tight regulation of Pol II promoter-proximal pausing demonstrates that PIC recruitment and transcription initiation are not necessarily the rate-limiting steps of transcription at all promoters. Rather, promoter-proximal pausing provides an additional opportunity to regulate transcription by allowing rapid release of already engaged Pol II into productive elongation146, thereby eliminating dependencies on the slower steps of recruitment and initiation. This might be beneficial when rapid or synchronous changes in gene expression are required. For example, in early fly embryos promoters with paused Pol II are activated synchronously across all cells147, which is important for coordinating tissue morphogenesis148. Similarly, genes with paused Pol II in fly embryos were enriched for developmental regulators and it is likely that pausing facilitates rapid changes in spatial and temporal activity of these genes during development141. By contrast, in mouse embryonic stem cells paused Pol II is enriched at genes regulating cell cycle and signal transduction, and is suggested to regulate development through the control of signaling pathways149.

Different genes might, however, differ in their rate-limiting step for productive transcription. Some genes could predominantly be regulated by releasing stably paused Pol II, whereas for other genes regulation might occur mainly at the initiation step. In addition, the stability of paused Pol II differs greatly between promoters: half-lives of paused Pol II, measured by inhibiting both pause-release and de novo initiation, range from several minutes to an hour or more150–152. At promoters that support stable Pol II pausing with low turn-over rates (half-life >30 min), stalled Pol II seems to block new transcription initiation151,153, presumably by steric hindrance as previously predicted154. By contrast, at promoters with high turn-over of paused Pol II (half-life of only minutes) there may be no interference with transcription initiation152, potentially allowing tight regulation at the initiation step followed by non-limiting pause-release. Such an antagonistic relationship between pausing duration and transcription initiation frequency might create a pause–initiation balance153, which could allow one step to be influenced by regulating another, for example increasing initiation frequency by stimulating CDK9-mediated release of paused Pol II (REFS 153,154).
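As a rough illustration of how such half-lives could be derived from a time course, the following sketch fits a first-order decay to the paused Pol II signal remaining after both pause-release and de novo initiation are blocked. The first-order decay model, the function names, the binary 30-minute cut-off and the synthetic time courses are assumptions made for this example; they are not taken from the studies cited above.

```python
import math

def estimate_half_life(times_min, signal):
    """Fit ln(signal) = ln(s0) - k * t by least squares and return ln(2) / k.

    Assumes paused Pol II is lost by first-order decay once new initiation
    and pause-release are both inhibited (a simplifying assumption).
    """
    if len(times_min) != len(signal) or len(times_min) < 2:
        raise ValueError("need at least two paired measurements")
    logs = [math.log(s) for s in signal]  # requires strictly positive signal
    n = len(times_min)
    mean_t = sum(times_min) / n
    mean_y = sum(logs) / n
    sxx = sum((t - mean_t) ** 2 for t in times_min)
    sxy = sum((t - mean_t) * (y - mean_y) for t, y in zip(times_min, logs))
    slope = sxy / sxx
    if slope >= 0:
        raise ValueError("signal does not decay; half-life is undefined")
    return math.log(2) / -slope

def classify_pausing(half_life_min, stable_cutoff_min=30.0):
    """Hypothetical two-way split using the >30 min figure quoted in the text."""
    return "stable" if half_life_min > stable_cutoff_min else "high turn-over"

# Synthetic (made-up) time courses in minutes after inhibitor addition.
times = [0, 5, 10, 20, 40]
stable_signal = [100 * 0.5 ** (t / 45) for t in times]   # true half-life 45 min
unstable_signal = [100 * 0.5 ** (t / 4) for t in times]  # true half-life 4 min

for name, sig in [("stable", stable_signal), ("unstable", unstable_signal)]:
    hl = estimate_half_life(times, sig)
    print(name, round(hl, 1), classify_pausing(hl))
```

Real measurements are noisy and may not follow a single exponential, so a fit of this kind should be read as a summary statistic rather than a mechanistic claim.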

The nature of the trigger of Pol II pausing is not known and it was suggested that the sequence downstream of the TSS might play an important role. Core promoters of the most strongly paused genes often have elevated GC content downstream of the TSS, including the GC-rich DPE or Pause button (PB) motifs155 (Table 1). While these motifs might recruit specific proteins, GC-rich sequences might also simply slow down the Pol II (REF. 156). Similarly, transcription might also be hindered by the topological stress due to supercoiling of DNA downstream of the transcribing Pol II (REFS 157,158). In addition, chromatin has been implicated in Pol II pausing, since the +1 nucleosome could represent a barrier to Pol II at essentially all genes resulting in downstream or distal pausing94. However, the causal relationship between nucleosome positioning and Pol II transcription is not clear and it was also suggested that the paused Pol II is required to keep the promoter region clear of nucleosomes96, rather than the other way around.

Interestingly, most or all genes seem to require CDK9 for productive elongation, including those without GC-rich sequences downstream of the TSSs and those for which no accumulation of paused Pol II is detected146,150,153. The global down-regulation of transcription upon CDK9 inhibition, even at enhancers159,160, indicates that Pol II pausing or a pausing-like checkpoint between initiation and elongation occurs for essentially all Pol II-mediated transcription, irrespective of whether paused Pol II accumulates to detectable levels. Such a checkpoint might be important to ensure RNA 5’ capping, the assembly of a functional elongation complex, including Topoisomerase I recruitment and activation161, and the recruitment of other proteins required for elongation and co-transcriptional processes. This suggests that promoter-proximal pausing is an inherent property of transcription by Pol II and is triggered independently of the core-promoter sequence, potentially via the 5’ end of the nascent RNA, which after transcription of about 18 nucleotides starts protruding from Pol II. Indeed, the two pausing-establishing factors DSIF and NELF require a nascent transcript longer than 18 nucleotides to stably associate with the Pol II elongation complex162 (reviewed in REF. 163). Furthermore, recent biochemical and structural studies of a complex containing Pol II and DSIF revealed that DSIF contacts nascent RNA exiting from Pol II, suggesting a role of this interaction in establishing Pol II pausing164–166. According to this model, pausing is triggered independently of the sequence and chromatin properties at the pause-site, which nevertheless might influence the stability of the interactions between DNA, nascent RNA and paused Pol II. 
Strengthening these interactions could increase the duration of pausing and might explain the elevated GC content at sites that accumulate high levels of paused Pol II: a GC-rich pause site would form more stable RNA-DNA hybrids, which in turn might prolong pausing.

Regulation by enhancers and cofactors

Active promoters are often in spatial proximity to enhancers167–170 and the establishment of such contacts between promoters and distal enhancers is related to the three-dimensional organisation of chromatin in the nucleus9,171–173. Promoter activation might occur upon establishing contacts with an enhancer, or by the recruitment of transcription factors to pre-formed enhancer–core promoter interactions. The latter was found to be prevalent in fly development, where enhancer–core promoter interactions are established prior to gene activation and appear stable during development174. In either case, promoters need to be sufficiently close to their enhancers to be activated.

Modes of core-promoter activation

The different steps required for productive transcription by Pol II all provide opportunity for regulation: PIC assembly, Pol II activation and transcription initiation, Pol II pausing and release into productive elongation (see above and REF. 175). Core promoters receive regulatory input from enhancers and this is mediated by transcription factors that directly bind short transcription-factor binding sites within enhancers, and by transcriptional cofactors, which are recruited by transcription factors through protein–protein interactions. Cofactors often have enzymatic activities and can post-translationally modify components of the transcription machinery and the surrounding nucleosomes, thereby affecting the different processes taking place at target core promoters.

Promoting pre-initiation complex assembly and RNA polymerase II activation

The most straightforward way to increase transcription from a core promoter is to increase the rate of transcription initiation by promoting PIC assembly and Pol II recruitment and activation. Several transcription factors or cofactors recruited by enhancers directly interact with components of the transcription machinery leading to stabilization of PIC at core promoters and increased initiation (FIG. 2a). For example, the Mediator complex is recruited to enhancers, interacts with the PIC at core promoters and transduces activating cues to increase Pol II recruitment and PIC assembly176. In yeast, Mediator seems to directly contact TFIIH and stimulate phosphorylation of the Ser 5 residues in the carboxy-terminal domain (CTD) of Pol II by the TFIIH subunit cyclin-dependent kinase 7 (CDK7) (REF. 177; Supplementary information S1 (box)). Ser 5 phosphorylation is considered important for Pol II to escape from the core promoter-bound GTFs and to initiate transcription (FIG. 2b). Similarly, the acetyltransferase p300, which is a cofactor widely associated with many active enhancers178, can acetylate GTFs or Pol II at target core promoters112,179 and this is required for the induction of growth-factor response genes114.

Promoting Pol II pause–release

Many core promoters support the recruitment of high levels of Pol II and are instead regulated at the level of pause-release142,145,180. Transition into productive elongation is coupled to phosphorylation of the Pol II CTD at Ser 2 residues (Supplementary information S1 (box)) and of DSIF and NELF by CDK9, which is the kinase subunit of P-TEFb (FIG. 2c). P-TEFb can be recruited to core promoters by the transcriptional cofactor BRD4181,182, which is bound to many enhancers and is involved in regulating specific subsets of genes183,184. Thus, enhancers that recruit high levels of BRD4, such as those involved in oncogene activation185,186, may preferentially function by releasing paused Pol II through CDK9. However, BRD proteins also regulate the transition to productive transcription elongation independently of CDK9 recruitment, since BRD protein degradation globally impairs transcription elongation but does not impact CDK9 recruitment to target genes187,188. p300 and Pol II-associated factor 1 (PAF1) have also been reported to be involved in pause release179,189. PAF1 seems to be required for pausing at enhancers and promoters and the loss of PAF1 leads to increased promoter activity, potentially through enhancer activation160.

Modulating transcription bursts

Transcription occurs in short but intense ‘bursts’, which comprise groups of initiation events separated by periods of inactivity190,191, as if promoters stochastically transition between inactive and active or permissive states192,193. This stochastic nature of transcription means that transcription activation could be achieved in one of two ways: by increasing the amplitude (size) of bursts, that is, the number of transcribing Pol II molecules per burst, or by increasing the frequency of bursts. The latter was shown to be the case both in regulation of developmental genes in fly embryos193 and in activation of the β-globin promoter by its locus-control region194. In contrast, burst size is a fixed property of the core promoter that is determined by the core promoter sequence, which mediates GTF binding192,195,196 (FIG. 2d). Indeed, the presence of the TATA-box motif supports larger burst size in yeast195, which might enable rapid transcriptional responses to stress196, yet appears to disproportionally contribute to transcriptional noise and increased cell-to-cell transcript variability197. Activation of core promoters that support large burst size by an enhancer that increases the frequency of bursting will lead to high transcriptional output. This might explain the observation made in reporter assays that enhancers most highly activate TATA-box-containing core promoters198.
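The relationship between burst frequency, burst size and transcriptional output described above can be sketched with a toy simulation. This is a minimal illustration under stated assumptions, not a model from the cited studies: bursts are assumed to arrive as a Poisson process, burst sizes are assumed to be geometrically distributed, and all parameter values and function names are hypothetical.

```python
import random

def expected_output(burst_frequency, mean_burst_size, duration):
    """Mean number of initiation events: bursts per unit time x events per burst x time."""
    return burst_frequency * mean_burst_size * duration

def simulate_initiations(burst_frequency, mean_burst_size, duration, seed=0):
    """Count initiation events in one simulated cell.

    Assumptions (for illustration only): exponential waiting times between
    bursts, and a geometric number (>= 1) of initiation events per burst.
    """
    rng = random.Random(seed)
    p = 1.0 / mean_burst_size  # per-event probability that the burst ends
    total = 0
    t = rng.expovariate(burst_frequency)
    while t < duration:
        size = 1
        while rng.random() > p:
            size += 1
        total += size
        t += rng.expovariate(burst_frequency)
    return total

# Hypothetical core promoters: burst size is treated as a promoter property,
# while an enhancer is modelled as raising burst frequency only.
promoters = {"large burst size": 10.0, "small burst size": 2.0}
for name, size in promoters.items():
    basal = expected_output(0.5, size, duration=100.0)
    activated = expected_output(2.0, size, duration=100.0)
    print(name, "basal:", basal, "activated:", activated, "gain:", activated - basal)
```

In this toy setting the fold change upon enhancer activation is the same for both promoters, but the absolute gain in output is larger for the promoter with the larger burst size, which is one way to read the observation that enhancers drive the highest output from TATA-box-containing core promoters.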

Specificity and responsiveness

Although forced interaction of an enhancer with a core promoter can be sufficient to activate transcription199, this is not the case for all promoters, suggesting that enhancers have preferences or specificities towards some promoters and, vice versa, that promoters can be activated only by certain enhancers.

Sequence-encoded enhancer–core-promoter specificity

For example, reporter genes with TATA-box-containing or with DPE-containing promoters integrated at identical genomic positions were differentially expressed in fly embryos200, suggesting that they differentially responded to genomic enhancers. Similarly, core promoters derived from fly housekeeping genes or from developmental genes were differentially activated by distinct sets of enhancers in an otherwise constant plasmid environment201. This is indicative of a sequence-encoded enhancer–core-promoter specificity that separates developmental and housekeeping transcription programs201, a notion that was corroborated by a complementary approach that showed that different promoters respond specifically either to developmental enhancers or to housekeeping enhancers198.

The specificity of core promoters towards regulatory input is not necessarily confined to different sets of genes. For example, in zebrafish a global switch in initiation pattern from focused to dispersed occurs at many genes during embryonic development21, suggesting that they use two different, overlapping core promoter sequences that respond differentially to enhancers active during either maternal or zygotic transcription.

Enhancer-binding regulatory proteins mediate core-promoter specificities

Activation of core promoters by enhancers is mediated by transcription factors and cofactors that have a central role in conveying regulatory cues from enhancers to core promoters and presumably mediate the enhancers’ specificities. Some transcription factors and cofactors can activate transcription on their own when tethered to core promoters202–206. Furthermore, when tested with different core promoters in a constant reporter setup, some factors displayed preferences towards certain core promoters206,207. An intriguing hypothesis that could explain such observations is that different types of core promoter support the assembly of structurally or compositionally distinct PICs that are biochemically compatible with different types of transcription factors and cofactors. One such example is TRF2 replacing TBP in PICs assembled at housekeeping gene promoters133–135 (FIG. 3; reviewed in REFS 9,208).

Figure 3. Sequence-encoded specificity of core promoters towards enhancers and activation by specific transcription (co)factors.

Different types of core promoters respond differentially to distal enhancers, that is an enhancer can activate them (solid arrows) or not (dashed arrows). This selectivity or specificity is mediated by different transcription factors (TF) and cofactors (COF), which display core promoter preferences likely based on biochemical compatibilities between the cofactors and core promoter-bound general transcription factors (GTFs). Mapping and understanding preferences and compatibilities between cofactors and core promoters is an important goal for future research. Pol II, RNA polymerase II; TBP, TATA-box binding protein; TRF2, TBP-related factor 2.

The suggested specificity between core promoters and activating factors was further corroborated by loss-of-function studies that either specifically inhibited cofactor function179,183,184 or depleted cofactors139,140,209 and showed preferential downregulation of certain genes but not others. For example, in yeast, the depletion of different Mediator subunits leads to differential gene downregulation and seems to preferentially affect SAGA-regulated genes209. In mammals, inhibition of BRD4 leads to preferential downregulation of Myc183,185 — a property that is exploited for therapeutic purposes. Similarly, inhibition of p300 seems to most strongly affect core promoters of highly paused genes characterized by distinct chromatin configuration and binding of specific factors, and appears to differentially affect Pol II recruitment and initiation versus Pol II pause-release, depending on the core promoter type179. These observations suggest that transcription of different genes might depend on different cofactors.

A functional model of transcription

The properties of core promoters establish them as specialized sequences that support transcription initiation and Pol II pause-release in response to activating cues from distal enhancers. Enhancers have been regarded as amplifiers of transcription from proximal or distal core promoters7,8, a function mediated by transcription factors and cofactors8. The term ‘promoters’ refers to sequences at gene starts, which can autonomously drive high levels of productive transcription. Promoters comprise in close proximity core promoters and supporting activating sequences, which are called proximal promoters or proximal enhancers (discussed in REFS 198,210). Enhancers therefore share several characteristics with promoters, such as the binding of transcription factors and cofactors26, but also – more unexpectedly – the binding of GTFs and Pol II (REFS 28,211,212) and the ability to initiate transcription15,26,29,42,213 (FIG. 1b).

To understand the similarities and differences between core promoters, enhancers and promoters, it is instructive to establish activity-based definitions of these elements using dedicated assays designed to specifically probe the defining function of each of these elements (BOX 2). Such assays specifically assess enhancer activity as the ability to activate transcription at a distal core promoter214,215; core promoter function as the ability to initiate transcription in response to distal regulatory cues198; and promoter activity as the ability to autonomously drive transcription198,216. One recently-developed assay simultaneously measures both enhancer and promoter activity69.

Box 2. Measuring core promoter and enhancer activities.

Dedicated activity-based assays that specifically measure enhancer, core promoter or promoter activity allow function-based definition of regulatory elements (see fig.).

Enhancers activate transcription at distal core promoters

Enhancer activity is measured in reporter assays that test the ability to activate transcription at a distal core promoter and drive the expression of a reporter gene (fig. a). Enhancer activity has been reported for intergenic and intronic sequences, but also for some sequences that overlap gene promoters201,214,215,217,218,237,238. Such promoters support both core-promoter activity and enhancer activity through the respective sequence elements (fig. d). The many promoter regions that do not show enhancer activity likely support only core promoter functionality and therefore cannot activate transcription at a distal core promoter.

Core promoters initiate transcription in response to regulatory input

Analogously to directly measuring distal enhancer activity, core promoter activity can be specifically assessed in dedicated reporter assays that measure the ability to initiate transcription in response to activating input from an enhancer, that is, measure enhancer responsiveness (fig. b). Candidates with high enhancer responsiveness mainly coincide with gene transcription start sites (TSSs) and contain core-promoter motifs such as TATA-boxes and Inr motifs198,239,240. Unlike gene core promoters, TSSs within enhancers show very low or no responsiveness198, suggesting that enhancers in general have a very weak or no sequence-based propensity to respond to distal activating cues and act as core promoters (fig. d).

Autonomous promoter function is conferred by sequences supporting both core promoter and enhancer activities

Although the above methods assess core-promoter activity as the responsiveness to a defined regulatory input, promoter activity is typically defined as the ability to drive transcription autonomously69,216 (fig. c). Such autonomously functioning promoters typically contain both core promoter activity and enhancer activity; an enhancer in this context is also called a proximal promoter or an upstream-activating sequence (fig. d).

Box 2 figure.

Such dedicated functional assays demonstrated, for example, that promoter regions can activate transcription from distal core promoters, meaning they can function as enhancers198,201,214,217,218, and that enhancer regions can autonomously give rise to productive transcription and function as promoters69,216,217. However, these approaches also found that enhancer function and core-promoter function frequently do not co-occur69,198,201,214,216–218, indicating that the two functions can be carried out by the same genomic region, but are not strictly coupled or interdependent219.

Fortuitous initiation at enhancers

Enhancer activity is mediated through the binding of transcription factors and the recruitment of cofactors, which not only mediate activation of target core promoters but create high transcription activation potential at the enhancers themselves. Enhancers should therefore naturally have the tendency to activate transcription close to or within the enhancer, presumably at sites that most closely resemble bona fide core promoters. Given the low sequence stringency (that is ‘information content’) of many core promoter motifs (Table 1), many sequences at either side of an enhancer resemble degenerate core-promoter motifs. Transcription initiation is in fact expected at any (random) sequence that is in the vicinity of strongly activating factors, because achieving perfect activation specificity towards core promoters or entirely preventing background initiation at accessible DNA would be energetically costly and could only evolve under strong selective pressure.
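To make the notion of low information content concrete, the following sketch computes the information content of a degenerate consensus pattern and the number of matches expected by chance in random DNA. It assumes a uniform base composition and independent positions, and the two IUPAC patterns are illustrative choices for this example rather than entries taken from Table 1.

```python
import math

# Number of bases allowed by each IUPAC nucleotide code.
IUPAC_ALLOWED = {
    "A": 1, "C": 1, "G": 1, "T": 1,
    "R": 2, "Y": 2, "S": 2, "W": 2, "K": 2, "M": 2,
    "B": 3, "D": 3, "H": 3, "V": 3,
    "N": 4,
}

def information_bits(pattern):
    """Information content in bits, assuming equal base frequencies."""
    return sum(math.log2(4 / IUPAC_ALLOWED[c]) for c in pattern)

def match_probability(pattern):
    """Probability that a random position matches the pattern on one strand."""
    prob = 1.0
    for c in pattern:
        prob *= IUPAC_ALLOWED[c] / 4
    return prob

def expected_matches(pattern, window_bp):
    """Expected number of chance matches on one strand of a random window."""
    positions = max(window_bp - len(pattern) + 1, 0)
    return positions * match_probability(pattern)

# Illustrative patterns (assumptions for this sketch): a degenerate
# initiator-like pattern and a stricter TATA-like pattern.
for pattern in ["YYANWYY", "TATAWAAR"]:
    print(pattern,
          information_bits(pattern), "bits,",
          round(expected_matches(pattern, 1000), 2), "chance matches per kb per strand")
```

Under these assumptions a 7-bit pattern is expected to occur by chance several times per kilobase, whereas a 14-bit pattern is expected far less than once, which illustrates why sequences resembling degenerate core-promoter motifs are readily available next to almost any enhancer.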

Fortuitous transcription initiation resulting from high activator concentrations can explain several observations related to transcription initiation at enhancers, including the presence of degenerate Inr and TATA-box motifs at TSSs within enhancers15,26,38, the bidirectional initiation pattern at enhancers15,32 or at open chromatin in general220 (FIG. 4a), and the observations that eRNAs are inducible25 and cell-type specific26 — in both cases, eRNA transcription follows the activity of the enhancer, that is, the recruitment of strong transcription activators. It is also consistent with TSSs within enhancers generally showing very low enhancer responsiveness and thus having little or no capacity to support distally regulated transcription initiation as bona fide core promoters do198 (BOX 2). Moreover, the more similar the TSSs within enhancers are to bona fide core promoters, the higher the level of productive transcription from the enhancer69. Therefore, although some enhancers can function as promoters, enhancers generally do not do so and the difference stems from the presence or absence of sequence-encoded core-promoter functionality.

Figure 4. Functional model of transcription initiation at genomic promoters and enhancers.

a) Model of transcription initiation at enhancers (left) and promoters (right) arising from their distinct sequence-encoded activities. Enhancers bind transcription factors (TF) and recruit cofactors (COF), thereby creating a high local concentration of transcription activators. This should lead to fortuitous transcription initiation at proximal sites that resemble bona fide core promoters (“best-of-random sites”), resulting in divergent transcription of short unstable enhancer RNAs (eRNAs). Promoters transcribe stable mRNAs from a dedicated gene core promoter and – due to high activator concentration – will also show fortuitous transcription initiation in the antisense direction. b) Model of evolution of a functional core promoter or an enhancer. Newly emerging transcription-factor binding sites (blue) create enhancer-like activity and exhibit low levels of bidirectional transcription at best-of-random sites. If such transcription is harmful, it might be actively suppressed by DNA methylation (pins)222, repressive factors223 or repressive chromatin and the transcription factor binding sites will degenerate over time. If by contrast the transcription in one or both directions is beneficial, the respective transcription start site will be positively selected and evolve to a fully functional core promoter (red) with strong core promoter motifs, able to support high levels of regulated and productive transcription. Transcription in the non-beneficial direction will remain low and yield non-stable upstream-antisense RNAs (uaRNAs). The activator binding sites near core promoters are often referred to as ‘proximal promoter’. Finally, if the transcription from a putative regulatory sequence is neutral and its enhancer activity is beneficial, the enhancer function should be strengthened and the enhancer will transcribe low levels of bidirectional eRNAs from best-of-random sites.

Evolution of enhancers and promoters

The model of fortuitous initiation at enhancers is consistent with the finding that bidirectional transcription is the ground state of evolutionarily new promoter regions and that uni-directionality is an acquired trait of gene core promoters221. Newly emerged transcription-factor binding sites confer enhancer-like activity, which initially leads to low levels of bidirectional transcription initiation221 (FIG. 4b). If this transcription is harmful, it might become silenced, for example by repressive chromatin222,223, and the binding site might eventually decay. If by contrast transcription in one or both directions is beneficial, the respective TSS sequence could be positively selected and evolve into a fully functional core promoter with strong core promoter motifs, able to support regulated and productive transcription. Similarly, a core promoter that is regulated exclusively by distal enhancers could acquire proximal activator binding sites and thus promoter activity.

The functions of enhancer RNAs

In the model of fortuitous initiation at enhancers, eRNAs are unavoidable by-products of transcription activators, yet this does not exclude the possibility that eRNA transcription or eRNAs themselves are functional. It is possible that evolution took advantage of their correlation with transcriptional activity to modulate enhancer activity (reviewed in REF. 224). For example, eRNA transcription might ensure accessibility to DNA95,96 and eRNAs might be involved in the formation of activating micro-environments in the form of non-membrane bound compartments with high concentrations of transcription activators225–227, which is similar to what has been reported for germline P granules228, RNA granules229,230 and the formation of condensed heterochromatin231,232. Such hypotheses that consider conceptually novel ways to understand the regulatory environment at enhancers should motivate future studies of eRNA function.

Perspective and future directions

Pol II core promoters are genomic elements that support PIC assembly and transcription initiation, and function as specialized sequences that have evolved to enable highly regulated gene transcription. We propose a functional model that defines regulatory elements by their function rather than by their genomic position; we argue that core promoters and enhancers are the two principal gene regulatory elements, that they have distinct functionalities and that they have evolved for distinct purposes: initiating productive transcription locally (core promoters) versus boosting transcription locally or distally (enhancers).

We are intrigued by the widespread occurrence of Pol II pausing at most promoters and enhancers153,159,160, which might indicate that a pausing-like checkpoint between transcription initiation and elongation is an intrinsic property of all Pol II-mediated transcription. As such, it might be triggered not by the DNA sequence at the downstream pause site but, for example, by the 5’ end of the nascent RNA as it protrudes from Pol II. It will be interesting to see if the successful resolution of this checkpoint is necessary for productive elongation and whether this could be the main difference between transcription at promoters and enhancers.

We would like to highlight the existence of different types of core promoters with distinct properties, especially preferences towards different enhancers and cofactors that are presumably based on biochemical compatibilities (FIG. 3). Elucidating such preferences and compatibilities and determining the differences between various core-promoter types is crucial at a time when we have an increasingly complete understanding of the mechanisms that determine genome structure and spatial contacts of enhancers and their target core promoters (reviewed in REFS 233,234), and when transcription regulation is becoming the focus of targeted intervention and novel therapeutic strategies.

Supplementary Material

Supplementary information S1

Glossary.

Core promoter

Short sequence flanking the transcription start site (typically ~50 base pairs upstream and ~50 base pairs downstream) that is sufficient to assemble the RNA polymerase II transcription machinery and initiate transcription.

General transcription factors

(GTF) Proteins that together with RNA polymerase II constitute the transcription machinery at the core promoter.

Transcription factors

Proteins that directly bind a specific DNA sequence through their DNA-binding domain and regulate the level of transcription by recruiting Pol II or transcriptional cofactors through their trans-activation domain.

Transcriptional cofactors

Proteins that do not directly bind DNA, but are recruited by DNA-binding transcription factors to regulate transcription of target genes.

Enhancer RNAs

(eRNAs) Short unstable non-coding RNAs (<2kb), usually not spliced or polyadenylated, which are transcribed from enhancers and rapidly degraded by the exosome.

Nucleosome-depleted region

(NDR) Genomic region depleted of canonical nucleosomes; usually associated with active regulatory elements such as promoters and enhancers.

Promoters

Genomic regions encompassing a gene core promoter and an upstream proximal promoter, which together autonomously drive transcription.

Proximal promoter

Transcription-activating sequence immediately upstream of the core promoter (typically up to 250bp upstream of the transcription start site), which contains binding sites for sequence-specific transcription factors and functions like an enhancer.

Pre-initiation complex

(PIC) A large complex of proteins, including RNA polymerase II and its general transcription factors, that assembles at core promoters and is required for transcription initiation.

CpG islands

(CGI) GC-rich genomic sequences with the frequency of CpG dinucleotides higher than in the rest of the genome (which is generally depleted of CpG dinucleotides in mammals).

Piwi-interacting RNA

(piRNA) Small non-coding RNA (26-31 nucleotides) that interacts with Argonaute proteins from the Piwi family and mediates transcriptional and post-transcriptional gene silencing of transposable elements.

Promoter-proximal pausing

Pausing of RNA polymerase II downstream of the transcription start site; controls the transition into productive transcription elongation.

Enhancer responsiveness

The extent to which transcription from a core promoter is induced by a distal enhancer.

SAGA complex

Spt–Ada–Gcn5-acetyltransferase (SAGA) is a coactivator complex with different chromatin-modifying modules, including for example the Gcn5 histone acetyltransferase.

Acknowledgements

The authors thank F. Mürdter, M.A. Zabidi, P.R. Andersen, C. Plaschka and C. Bernecky for helpful comments on the manuscript. V.H. is supported by a long-term postdoctoral fellowship from the Human Frontier Science Program (HFSP, grant number LT000324/2016-L). Research in the Stark group is supported by the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement no. 647320) and by the Austrian Science Fund (FWF, F4303-B09). Basic research at the IMP is supported by Boehringer Ingelheim GmbH and the Austrian Research Promotion Agency (FFG).

Footnotes

Competing interests

The authors declare no competing interests.

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Author contributions

All authors contributed equally to all aspects of the article.

References

1.Spitz F, Furlong EEM. Transcription factors: from enhancer binding to developmental control. Nat Rev Genet. 2012;13:613–626. doi: 10.1038/nrg3207. [DOI] [PubMed] [Google Scholar]
2.Levine M, Tjian R. Transcription regulation and animal diversity. Nature. 2003;424:147–151. doi: 10.1038/nature01763. [DOI] [PubMed] [Google Scholar]
3.Levine M, Cattoglio C, Tjian R. Looping back to leap forward: transcription enters a new era. Cell. 2014;157:13–25. doi: 10.1016/j.cell.2014.02.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Herz H-M, Hu D, Shilatifard A. Enhancer malfunction in cancer. Mol Cell. 2014;53:859–866. doi: 10.1016/j.molcel.2014.02.033. [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Hampsey M. Molecular genetics of the RNA polymerase II general transcriptional machinery. Microbiol Mol Biol Rev. 1998;62:465–503. doi: 10.1128/mmbr.62.2.465-503.1998. [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Kadonaga JT. Perspectives on the RNA polymerase II core promoter. Wiley Interdiscip Rev Dev Biol. 2012;1:40–51. doi: 10.1002/wdev.21. [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Banerji J, Rusconi S, Schaffner W. Expression of a beta-globin gene is enhanced by remote SV40 DNA sequences. Cell. 1981;27:299–308. doi: 10.1016/0092-8674(81)90413-x. [DOI] [PubMed] [Google Scholar]
8.Shlyueva D, Stampfel G, Stark A. Transcriptional enhancers: from properties to genome-wide predictions. Nat Rev Genet. 2014;15:272–286. doi: 10.1038/nrg3682. [DOI] [PubMed] [Google Scholar]
9.Zabidi MA, Stark A. Regulatory Enhancer–Core-Promoter Communication via Transcription Factors and Cofactors. Trends Genet. 2016;32:801–814. doi: 10.1016/j.tig.2016.10.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Shiraki T, et al. Cap analysis gene expression for high-throughput analysis of transcriptional starting point and identification of promoter usage. Proc Natl Acad Sci USA. 2003;100:15776–15781. doi: 10.1073/pnas.2136655100. [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Gu W, et al. CapSeq and CIP-TAP identify Pol II start sites and reveal capped small RNAs as C. elegans piRNA precursors. Cell. 2012;151:1488–1500. doi: 10.1016/j.cell.2012.11.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Ni T, et al. A paired-end sequencing strategy to map the complex landscape of transcription initiation. Nat Methods. 2010;7:521–527. doi: 10.1038/nmeth.1464. [DOI] [PMC free article] [PubMed] [Google Scholar]
13.Nechaev S, et al. Global analysis of short RNAs reveals widespread promoter-proximal stalling and arrest of Pol II in Drosophila. Science. 2010;327:335–338. doi: 10.1126/science.1181421. [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Lam MTY, et al. Rev-Erbs repress macrophage gene expression by inhibiting enhancer-directed transcription. Nature. 2013;498:511–515. doi: 10.1038/nature12209. [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Core LJ, et al. Analysis of nascent RNA identifies a unified architecture of initiation regions at mammalian promoters and enhancers. Nat Genet. 2014;46:1311–1320. doi: 10.1038/ng.3142. [This work proposes a unified model of transcription initiation at promoters and enhancers and emphasizes that post-initiation transcript stability is the main distinction between the two elements.] [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Kwak H, Fuda NJ, Core LJ, Lis JT. Precise maps of RNA polymerase reveal how promoters direct initiation and pausing. Science. 2013;339:950–953. doi: 10.1126/science.1229386. [DOI] [PMC free article] [PubMed] [Google Scholar]
17.Affymetrix/Cold Spring Harbor Laboratory ENCODE Transcriptome Project. Post-transcriptional processing generates a diversity of 5′-modified long and short RNAs. Nature. 2009;457:1028–1032. doi: 10.1038/nature07759. [DOI] [PMC free article] [PubMed] [Google Scholar]
18.The FANTOM Consortium and RIKEN Genome Exploration Research Group and Genome Science Group (Genome Network Project Core Group). The Transcriptional Landscape of the Mammalian Genome. Science. 2005;309:1559–1563. doi: 10.1126/science.1112014. [DOI] [PubMed] [Google Scholar]
19.Hoskins RA, et al. Genome-wide analysis of promoter architecture in Drosophila melanogaster. Genome Res. 2011;21:182–192. doi: 10.1101/gr.112466.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Chen RA-J, et al. The landscape of RNA polymerase II transcription initiation in C. elegans reveals promoter and enhancer architectures. Genome Res. 2013;23:1339–1347. doi: 10.1101/gr.153668.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Haberle V, et al. Two independent transcription initiation codes overlap on vertebrate core promoters. Nature. 2014;507:381–385. doi: 10.1038/nature12974. [This study reveals a widespread switch in TSS usage associated with distinct sequence properties during early embryonic development of zebrafish.] [DOI] [PMC free article] [PubMed] [Google Scholar]
22.The FANTOM Consortium & the RIKEN PMI and CLST (DGT). A promoter-level mammalian expression atlas. Nature. 2014;507:462–470. doi: 10.1038/nature13182. [DOI] [PMC free article] [PubMed] [Google Scholar]
23.The ENCODE Project Consortium et al. Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature. 2007;447:799–816. doi: 10.1038/nature05874. [DOI] [PMC free article] [PubMed] [Google Scholar]
24.Kapranov P, et al. RNA maps reveal new RNA classes and a possible function for pervasive transcription. Science. 2007;316:1484–1488. doi: 10.1126/science.1138341. [DOI] [PubMed] [Google Scholar]
25.Kim T-K, et al. Widespread transcription at neuronal activity-regulated enhancers. Nature. 2010;465:182–187. doi: 10.1038/nature09033. [This is the first report of widespread bidirectional transcription from enhancers giving rise to eRNAs.] [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Andersson R, et al. An atlas of active enhancers across human cell types and tissues. Nature. 2014;507:455–461. doi: 10.1038/nature12787. [This study uses bidirectional transcription initiation to predict enhancers and their activity across numerous human cell types.] [DOI] [PMC free article] [PubMed] [Google Scholar]
27.De Santa F, et al. A large fraction of extragenic RNA pol II transcription sites overlap enhancers. PLoS Biol. 2010;8:e1000384. doi: 10.1371/journal.pbio.1000384. [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Koch F, et al. Transcription initiation platforms and GTF recruitment at tissue-specific enhancers and promoters. Nat Struct Mol Biol. 2011;18:956–963. doi: 10.1038/nsmb.2085. [DOI] [PubMed] [Google Scholar]
29.Arner E, et al. Transcribed enhancers lead waves of coordinated transcription in transitioning mammalian cells. Science. 2015;347:1010–1014. doi: 10.1126/science.1259418. [DOI] [PMC free article] [PubMed] [Google Scholar]
30.Li W, et al. Functional roles of enhancer RNAs for oestrogen-dependent transcriptional activation. Nature. 2013;498:516–520. doi: 10.1038/nature12210. [DOI] [PMC free article] [PubMed] [Google Scholar]
31.Schaukowitch K, et al. Enhancer RNA facilitates NELF release from immediate early genes. Mol Cell. 2014;56:29–42. doi: 10.1016/j.molcel.2014.08.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
32.Andersson R, et al. Nuclear stability and transcriptional directionality separate functionally distinct RNA species. Nat Commun. 2014;5:5336. doi: 10.1038/ncomms6336. [DOI] [PubMed] [Google Scholar]
33.Seila AC, et al. Divergent transcription from active promoters. Science. 2008;322:1849–1851. doi: 10.1126/science.1162253. [DOI] [PMC free article] [PubMed] [Google Scholar]
34.Preker P, et al. RNA exosome depletion reveals transcription upstream of active human promoters. Science. 2008;322:1851–1854. doi: 10.1126/science.1164096. [DOI] [PubMed] [Google Scholar]
35.Core LJ, Waterfall JJ, Lis JT. Nascent RNA sequencing reveals widespread pausing and divergent initiation at human promoters. Science. 2008;322:1845–1848. doi: 10.1126/science.1162228. [References 33–35 were the first to report widespread antisense transcription from gene promoters giving rise to short unstable upstream antisense RNAs.] [DOI] [PMC free article] [PubMed] [Google Scholar]
36.Andersson R, et al. Human Gene Promoters Are Intrinsically Bidirectional. Mol Cell. 2015;60:346–347. doi: 10.1016/j.molcel.2015.10.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
37.Duttke SHC, et al. Human promoters are intrinsically directional. Mol Cell. 2015;57:674–684. doi: 10.1016/j.molcel.2014.12.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
38.Scruggs BS, et al. Bidirectional Transcription Arises from Two Distinct Hubs of Transcription Factor Binding and Active Chromatin. Mol Cell. 2015;58:1101–1112. doi: 10.1016/j.molcel.2015.04.006. [References 37 and 38 demonstrate that bidirectional transcription from promoters arises from two separate and intrinsically directional transcription complexes.] [DOI] [PMC free article] [PubMed] [Google Scholar]
39.Sigova AA, et al. Divergent transcription of long noncoding RNA/mRNA gene pairs in embryonic stem cells. Proc Natl Acad Sci USA. 2013;110:2876–2881. doi: 10.1073/pnas.1221904110. [DOI] [PMC free article] [PubMed] [Google Scholar]
40.Lepoivre C, et al. Divergent transcription is associated with promoters of transcriptional regulators. BMC Genomics. 2013;14:914. doi: 10.1186/1471-2164-14-914. [DOI] [PMC free article] [PubMed] [Google Scholar]
41.Andersson R. Promoter or enhancer, what's the difference? Deconstruction of established distinctions and presentation of a unifying model. BioEssays. 2015;37:314–323. doi: 10.1002/bies.201400162. [DOI] [PubMed] [Google Scholar]
42.Kim T-K, Shiekhattar R. Architectural and Functional Commonalities between Enhancers and Promoters. Cell. 2015;162:948–959. doi: 10.1016/j.cell.2015.08.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
43.Almada AE, Wu X, Kriz AJ, Burge CB, Sharp PA. Promoter directionality is controlled by U1 snRNP and polyadenylation signals. Nature. 2013;499:360–363. doi: 10.1038/nature12349. [DOI] [PMC free article] [PubMed] [Google Scholar]
44.Ntini E, et al. Polyadenylation site-induced decay of upstream transcripts enforces promoter directionality. Nat Struct Mol Biol. 2013;20:923–928. doi: 10.1038/nsmb.2640. [DOI] [PubMed] [Google Scholar]
45.Carninci P, et al. Genome-wide analysis of mammalian promoter architecture and evolution. Nat Genet. 2006;38:626–635. doi: 10.1038/ng1789. [This work uses genome-wide maps of human and mouse TSSs to describe two classes of promoters that differ in initiation pattern: focused TATA-box-enriched promoters and broad CpG-rich promoters.] [DOI] [PubMed] [Google Scholar]
46.Lenhard B, Sandelin A, Carninci P. Metazoan promoters: emerging characteristics and insights into transcriptional regulation. Nat Rev Genet. 2012;13:233–245. doi: 10.1038/nrg3163. [DOI] [PubMed] [Google Scholar]
47.Schor IE, et al. Promoter shape varies across populations and affects promoter evolution and expression noise. Nat Genet. 2017;49:550–558. doi: 10.1038/ng.3791. [This work identifies natural genetic variants that affect TSS distributions within core promoters in flies.] [DOI] [PubMed] [Google Scholar]
48.Lifton RP, Goldberg ML, Karp RW, Hogness DS. The organization of the histone genes in Drosophila melanogaster: functional and evolutionary implications. Cold Spring Harb Symp Quant Biol. 1978;42:1047–1051. doi: 10.1101/sqb.1978.042.01.105. [DOI] [PubMed] [Google Scholar]
49.Goldberg ML. Sequence analysis of Drosophila histone genes. PhD Dissertation; Stanford University: 1979. [Google Scholar]
50.Ponjavic J, et al. Transcriptional and structural impact of TATA-initiation site spacing in mammalian core promoters. Genome Biol. 2006;7:R78. doi: 10.1186/gb-2006-7-8-r78. [DOI] [PMC free article] [PubMed] [Google Scholar]
51.Ohler U, Liao G-C, Niemann H, Rubin GM. Computational analysis of core promoters in the Drosophila genome. Genome Biol. 2002;3:RESEARCH0087. doi: 10.1186/gb-2002-3-12-research0087. [DOI] [PMC free article] [PubMed] [Google Scholar]
52.FitzGerald PC, Sturgill D, Shyakhtenko A, Oliver B, Vinson C. Comparative genomics of Drosophila and human core promoters. Genome Biol. 2006;7:R53. doi: 10.1186/gb-2006-7-7-r53. [References 51 and 52 provide comprehensive computational analyses of over-represented sequence motifs in fly and human core promoters.] [DOI] [PMC free article] [PubMed] [Google Scholar]
53.Patikoglou GA, et al. TATA element recognition by the TATA box-binding protein has been conserved throughout evolution. Genes Dev. 1999;13:3217–3230. doi: 10.1101/gad.13.24.3217. [DOI] [PMC free article] [PubMed] [Google Scholar]
54.Burley SK, Roeder RG. Biochemistry and structural biology of transcription factor IID (TFIID). Annu Rev Biochem. 1996;65:769–799. doi: 10.1146/annurev.bi.65.070196.004005. [DOI] [PubMed] [Google Scholar]
55.Louder RK, et al. Structure of promoter-bound TFIID and model of human pre-initiation complex assembly. Nature. 2016;531:604–609. doi: 10.1038/nature17394. [This work reports a structure of the human promoter-bound TFIID, revealing contacts between TFIID subunits and specific core promoter motifs.] [DOI] [PMC free article] [PubMed] [Google Scholar]
56.Smale ST, Baltimore D. The ‘initiator’ as a transcription control element. Cell. 1989;57:103–113. doi: 10.1016/0092-8674(89)90176-1. [DOI] [PubMed] [Google Scholar]
57.Chalkley GE, Verrijzer CP. DNA binding site selection by RNA polymerase II TAFs: a TAF(II)250-TAF(II)150 complex recognizes the initiator. EMBO J. 1999;18:4835–4845. doi: 10.1093/emboj/18.17.4835. [DOI] [PMC free article] [PubMed] [Google Scholar]
58.Vo Ngoc L, Cassidy CJ, Huang CY, Duttke SHC, Kadonaga JT. The human initiator is a distinct and abundant element that is precisely positioned in focused core promoters. Genes Dev. 2017;31:6–11. doi: 10.1101/gad.293837.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
59.Burke TW, Kadonaga JT. Drosophila TFIID binds to a conserved downstream basal promoter element that is present in many TATA-box-deficient promoters. Genes Dev. 1996;10:711–724. doi: 10.1101/gad.10.6.711. [DOI] [PubMed] [Google Scholar]
60.Burke TW, Kadonaga JT. The downstream core promoter element, DPE, is conserved from Drosophila to humans and is recognized by TAFII60 of Drosophila. Genes Dev. 1997;11:3020–3031. doi: 10.1101/gad.11.22.3020. [DOI] [PMC free article] [PubMed] [Google Scholar]
61.Kutach AK, Kadonaga JT. The downstream promoter element DPE appears to be as widely used as the TATA box in Drosophila core promoters. Mol Cell Biol. 2000;20:4754–4764. doi: 10.1128/mcb.20.13.4754-4764.2000. [DOI] [PMC free article] [PubMed] [Google Scholar]
62.Engstrom PG, Ho Sui SJ, Drivenes O, Becker TS, Lenhard B. Genomic regulatory blocks underlie extensive microsynteny conservation in insects. Genome Res. 2007;17:1898–1908. doi: 10.1101/gr.6669607. [DOI] [PMC free article] [PubMed] [Google Scholar]
63.Lim CY, et al. The MTE, a new core promoter element for transcription by RNA polymerase II. Genes Dev. 2004;18:1606–1617. doi: 10.1101/gad.1193404. [DOI] [PMC free article] [PubMed] [Google Scholar]
64.Lagrange T, Kapanidis AN, Tang H, Reinberg D, Ebright RH. New core promoter element in RNA polymerase II-dependent transcription: sequence-specific DNA binding by transcription factor IIB. Genes Dev. 1998;12:34–44. doi: 10.1101/gad.12.1.34. [DOI] [PMC free article] [PubMed] [Google Scholar]
65.Deng W, Roberts SGE. A core promoter element downstream of the TATA box that is recognized by TFIIB. Genes Dev. 2005;19:2418–2423. doi: 10.1101/gad.342405. [DOI] [PMC free article] [PubMed] [Google Scholar]
66.Lewis BA, Kim TK, Orkin SH. A downstream element in the human beta-globin promoter: evidence of extended sequence-specific transcription factor IID contacts. Proc Natl Acad Sci USA. 2000;97:7172–7177. doi: 10.1073/pnas.120181197. [DOI] [PMC free article] [PubMed] [Google Scholar]
67.Lee D-H, et al. Functional characterization of core promoter elements: the downstream core element is recognized by TAF1. Mol Cell Biol. 2005;25:9674–9686. doi: 10.1128/MCB.25.21.9674-9686.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
68.Rach EA, Yuan H-Y, Majoros WH, Tomancak P, Ohler U. Motif composition, conservation and condition-specificity of single and alternative transcription start sites in the Drosophila genome. Genome Biol. 2009;10:R73. doi: 10.1186/gb-2009-10-7-r73. [This work shows differential motif enrichment and spatio-temporal utilization of core promoters with focused and dispersed initiation patterns in flies.] [DOI] [PMC free article] [PubMed] [Google Scholar]
69.Mikhaylichenko O, et al. The degree of enhancer or promoter activity is reflected by the levels and directionality of eRNA transcription. Genes Dev. 2018;32:42–57. doi: 10.1101/gad.308619.117. [This work introduces a functional assay to simultaneously measure enhancer and promoter activity of candidate fragments and shows that core-promoter motifs confer promoter-functionality to enhancers.] [DOI] [PMC free article] [PubMed] [Google Scholar]
70.Juven-Gershon T, Cheng S, Kadonaga JT. Rational design of a super core promoter that enhances gene expression. Nat Methods. 2006;3:917–922. doi: 10.1038/nmeth937. [DOI] [PubMed] [Google Scholar]
71.Pfeiffer BD, et al. Tools for neuroanatomy and neurogenetics in Drosophila. Proc Natl Acad Sci USA. 2008;105:9715–9720. doi: 10.1073/pnas.0803697105. [DOI] [PMC free article] [PubMed] [Google Scholar]
72.Even DY, et al. Engineered Promoters for Potent Transient Overexpression. PLoS ONE. 2016;11:e0148918. doi: 10.1371/journal.pone.0148918. [DOI] [PMC free article] [PubMed] [Google Scholar]
73.Gardiner-Garden M, Frommer M. CpG islands in vertebrate genomes. J Mol Biol. 1987;196:261–282. doi: 10.1016/0022-2836(87)90689-9. [DOI] [PubMed] [Google Scholar]
74.Saxonov S, Berg P, Brutlag DL. A genome-wide analysis of CpG dinucleotides in the human genome distinguishes two distinct classes of promoters. Proc Natl Acad Sci USA. 2006;103:1412–1417. doi: 10.1073/pnas.0510310103. [DOI] [PMC free article] [PubMed] [Google Scholar]
75.Akalin A, et al. Transcriptional features of genomic regulatory blocks. Genome Biol. 2009;10:R38. doi: 10.1186/gb-2009-10-4-r38. [DOI] [PMC free article] [PubMed] [Google Scholar]
76.Dreos R, Ambrosini G, Bucher P. Influence of Rotational Nucleosome Positioning on Transcription Start Site Selection in Animal Promoters. PLoS Comput Biol. 2016;12:e1005144. doi: 10.1371/journal.pcbi.1005144. [DOI] [PMC free article] [PubMed] [Google Scholar]
77.Satchwell SC, Drew HR, Travers AA. Sequence periodicities in chicken nucleosome core DNA. J Mol Biol. 1986;191:659–675. doi: 10.1016/0022-2836(86)90452-3. [DOI] [PubMed] [Google Scholar]
78.Widom J. Role of DNA sequence in nucleosome stability and dynamics. Q Rev Biophys. 2001;34:269–324. doi: 10.1017/s0033583501003699. [DOI] [PubMed] [Google Scholar]
79.Segal E, et al. A genomic code for nucleosome positioning. Nature. 2006;442:772–778. doi: 10.1038/nature04979. [DOI] [PMC free article] [PubMed] [Google Scholar]
80.Yuan G-C, et al. Genome-scale identification of nucleosome positions in S. cerevisiae. Science. 2005;309:626–630. doi: 10.1126/science.1112178. [DOI] [PubMed] [Google Scholar]
81.Mavrich TN, et al. Nucleosome organization in the Drosophila genome. Nature. 2008;453:358–362. doi: 10.1038/nature06929. [This work was the first to map nucleosome positions across a metazoan genome and reveal the organization of nucleosomes around active promoters.] [DOI] [PMC free article] [PubMed] [Google Scholar]
82.Jiang C, Pugh BF. Nucleosome positioning and gene regulation: advances through genomics. Nat Rev Genet. 2009;10:161–172. doi: 10.1038/nrg2522. [DOI] [PMC free article] [PubMed] [Google Scholar]
83.Jin C, et al. H3.3/H2A.Z double variant-containing nucleosomes mark ‘nucleosome-free regions’ of active promoters and other regulatory regions. Nat Genet. 2009;41:941–945. doi: 10.1038/ng.409. [DOI] [PMC free article] [PubMed] [Google Scholar]
84.Fei J, et al. The prenucleosome, a stable conformational isomer of the nucleosome. Genes Dev. 2015;29:2563–2575. doi: 10.1101/gad.272633.115. [DOI] [PMC free article] [PubMed] [Google Scholar]
85.Henikoff JG, Belsky JA, Krassovsky K, MacAlpine DM, Henikoff S. Epigenome characterization at single base-pair resolution. Proc Natl Acad Sci USA. 2011;108:18318–18323. doi: 10.1073/pnas.1110731108. [DOI] [PMC free article] [PubMed] [Google Scholar]
86.Rhee HS, Bataille AR, Zhang L, Pugh BF. Subnucleosomal structures and nucleosome asymmetry across a genome. Cell. 2014;159:1377–1388. doi: 10.1016/j.cell.2014.10.054. [DOI] [PMC free article] [PubMed] [Google Scholar]
87.Mueller B, et al. Widespread changes in nucleosome accessibility without changes in nucleosome occupancy during a rapid transcriptional induction. Genes Dev. 2017;31:451–462. doi: 10.1101/gad.293118.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
88.Mieczkowski J, et al. MNase titration reveals differences between nucleosome occupancy and chromatin accessibility. Nat Commun. 2016;7:11485. doi: 10.1038/ncomms11485. [DOI] [PMC free article] [PubMed] [Google Scholar]
89.Rach EA, et al. Transcription initiation patterns indicate divergent strategies for gene regulation at the chromatin level. PLoS Genet. 2011;7:e1001274. doi: 10.1371/journal.pgen.1001274. [DOI] [PMC free article] [PubMed] [Google Scholar]
90.Kubik S, et al. Nucleosome Stability Distinguishes Two Different Promoter Types at All Protein-Coding Genes in Yeast. Mol Cell. 2015;60:422–434. doi: 10.1016/j.molcel.2015.10.002. [DOI] [PubMed] [Google Scholar]
91.Zaret KS, Carroll JS. Pioneer transcription factors: establishing competence for gene expression. Genes Dev. 2011;25:2227–2241. doi: 10.1101/gad.176826.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
92.Shimojima T, et al. Drosophila FACT contributes to Hox gene expression through physical and functional interactions with GAGA factor. Genes Dev. 2003;17:1605–1616. doi: 10.1101/gad.1086803. [DOI] [PMC free article] [PubMed] [Google Scholar]
93.Fuda NJ, et al. GAGA factor maintains nucleosome-free regions and has a role in RNA polymerase II recruitment to promoters. PLoS Genet. 2015;11:e1005108. doi: 10.1371/journal.pgen.1005108. [DOI] [PMC free article] [PubMed] [Google Scholar]
94.Weber CM, Ramachandran S, Henikoff S. Nucleosomes are context-specific, H2A.Z-modulated barriers to RNA polymerase. Mol Cell. 2014;53:819–830. doi: 10.1016/j.molcel.2014.02.014. [DOI] [PubMed] [Google Scholar]
95.Mousavi K, et al. eRNAs promote transcription by establishing chromatin accessibility at defined genomic loci. Mol Cell. 2013;51:606–617. doi: 10.1016/j.molcel.2013.07.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
96.Gilchrist DA, et al. Pausing of RNA polymerase II disrupts DNA-specified nucleosome organization to enable precise gene regulation. Cell. 2010;143:540–551. doi: 10.1016/j.cell.2010.10.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
97.Ahmad K, Henikoff S. The histone variant H3.3 marks active chromatin by replication-independent nucleosome assembly. Mol Cell. 2002;9:1191–1200. doi: 10.1016/s1097-2765(02)00542-7. [DOI] [PubMed] [Google Scholar]
98.Pradhan SK, et al. EP400 Deposits H3.3 into Promoters and Enhancers during Gene Activation. Mol Cell. 2016;61:27–38. doi: 10.1016/j.molcel.2015.10.039. [DOI] [PMC free article] [PubMed] [Google Scholar]
99.Raisner RM, et al. Histone variant H2A.Z marks the 5' ends of both active and inactive genes in euchromatin. Cell. 2005;123:233–248. doi: 10.1016/j.cell.2005.10.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
100.Barski A, et al. High-Resolution Profiling of Histone Methylations in the Human Genome. Cell. 2007;129:823–837. doi: 10.1016/j.cell.2007.05.009. [This study mapped different histone modifications genome-wide and identified those associated with active or repressed promoters.] [DOI] [PubMed] [Google Scholar]
101.Ng H-H, Robert F, Young RA, Struhl K. Targeted recruitment of Set1 histone methylase by elongating Pol II provides a localized mark and memory of recent transcriptional activity. Mol Cell. 2003;11:709–719. doi: 10.1016/s1097-2765(03)00092-3. [DOI] [PubMed] [Google Scholar]
102.Hathaway NA, et al. Dynamics and memory of heterochromatin in living cells. Cell. 2012;149:1447–1460. doi: 10.1016/j.cell.2012.03.052. [DOI] [PMC free article] [PubMed] [Google Scholar]
103.Zhao R, Nakamura T, Fu Y, Lazar Z, Spector DL. Gene bookmarking accelerates the kinetics of post-mitotic transcriptional re-activation. Nat Cell Biol. 2011;13:1295–1304. doi: 10.1038/ncb2341. [DOI] [PMC free article] [PubMed] [Google Scholar]
104.Tropberger P, et al. Regulation of transcription through acetylation of H3K122 on the lateral surface of the histone octamer. Cell. 2013;152:859–872. doi: 10.1016/j.cell.2013.01.032. [DOI] [PubMed] [Google Scholar]
105.Neumann H, et al. A method for genetically installing site-specific acetylation in recombinant histones defines the effects of H3 K56 acetylation. Mol Cell. 2009;36:153–163. doi: 10.1016/j.molcel.2009.07.027. [DOI] [PMC free article] [PubMed] [Google Scholar]
106.Tessarz P, Kouzarides T. Histone core modifications regulating nucleosome structure and dynamics. Nat Rev Mol Cell Biol. 2014;15:703–708. doi: 10.1038/nrm3890. [DOI] [PubMed] [Google Scholar]
107.Dey A, Chitsaz F, Abbasi A, Misteli T, Ozato K. The double bromodomain protein Brd4 binds to acetylated chromatin during interphase and mitosis. Proc Natl Acad Sci USA. 2003;100:8758–8763. doi: 10.1073/pnas.1433065100. [DOI] [PMC free article] [PubMed] [Google Scholar]
108.Hödl M, Basler K. Transcription in the absence of histone H3.2 and H3K4 methylation. Curr Biol. 2012;22:2253–2257. doi: 10.1016/j.cub.2012.10.008. [DOI] [PubMed] [Google Scholar]
109.Hödl M, Basler K. Transcription in the absence of histone H3.3. Curr Biol. 2009;19:1221–1226. doi: 10.1016/j.cub.2009.05.048. [DOI] [PubMed] [Google Scholar]
110.Pengelly AR, Copur Ö, Jäckle H, Herzig A, Müller J. A histone mutant reproduces the phenotype caused by loss of histone-modifying factor Polycomb. Science. 2013;339:698–699. doi: 10.1126/science.1231382. [DOI] [PubMed] [Google Scholar]
111.Sterner DE, Berger SL. Acetylation of histones and transcription-related factors. Microbiol Mol Biol Rev. 2000;64:435–459. doi: 10.1128/mmbr.64.2.435-459.2000. [DOI] [PMC free article] [PubMed] [Google Scholar]
112.Imhof A, et al. Acetylation of general transcription factors by histone acetyltransferases. Curr Biol. 1997;7:689–692. doi: 10.1016/s0960-9822(06)00296-x. [DOI] [PubMed] [Google Scholar]
113.Roe J-S, Mercan F, Rivera K, Pappin DJ, Vakoc CR. BET Bromodomain Inhibition Suppresses the Function of Hematopoietic Transcription Factors in Acute Myeloid Leukemia. Mol Cell. 2015;58:1028–1039. doi: 10.1016/j.molcel.2015.04.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
114.Schröder S, et al. Acetylation of RNA Polymerase II Regulates Growth-Factor-Induced Gene Transcription in Mammalian Cells. Mol Cell. 2013;52:314–324. doi: 10.1016/j.molcel.2013.10.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
115.Rickels R, et al. Histone H3K4 monomethylation catalyzed by Trr and mammalian COMPASS-like proteins at enhancers is dispensable for development and viability. Nat Genet. 2017;49:1647–1653. doi: 10.1038/ng.3965. [DOI] [PMC free article] [PubMed] [Google Scholar]
116.Dorighi KM, et al. Mll3 and Mll4 Facilitate Enhancer RNA Synthesis and Transcription from Promoters Independently of H3K4 Monomethylation. Mol Cell. 2017;66:568–576.e4. doi: 10.1016/j.molcel.2017.04.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
117.Pollex T, Furlong EEM. Correlation Does Not Imply Causation: Histone Methyltransferases, but Not Histone Methylation, SET the Stage for Enhancer Activation. Mol Cell. 2017;66:439–441. doi: 10.1016/j.molcel.2017.05.005. [DOI] [PubMed] [Google Scholar]
118.Andersen PR, Tirian L, Vunjak M, Brennecke J. A heterochromatin-dependent transcription machinery drives piRNA expression. Nature. 2017;549:54–59. doi: 10.1038/nature23482. [This study shows that histone modifications recruit the transcription machinery to transcribe heterochromatic loci that are a source of small RNAs.] [DOI] [PMC free article] [PubMed] [Google Scholar]
119.Thomas MC, Chiang C-M. The general transcription machinery and general cofactors. Crit Rev Biochem Mol Biol. 2006;41:105–178. doi: 10.1080/10409230600648736. [DOI] [PubMed] [Google Scholar]
120.Orphanides G, Lagrange T, Reinberg D. The general transcription factors of RNA polymerase II. Genes Dev. 1996;10:2657–2683. doi: 10.1101/gad.10.21.2657. [DOI] [PubMed] [Google Scholar]
121.Sainsbury S, Bernecky C, Cramer P. Structural basis of transcription initiation by RNA polymerase II. Nat Rev Mol Cell Biol. 2015;16:129–143. doi: 10.1038/nrm3952. [DOI] [PubMed] [Google Scholar]
122.Zhang Z, et al. Rapid dynamics of general transcription factor TFIIB binding during preinitiation complex assembly revealed by single-molecule analysis. Genes Dev. 2016;30:2106–2118. doi: 10.1101/gad.285395.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
123.He Y, et al. Near-atomic resolution visualization of human transcription promoter opening. Nature. 2016;533:359–365. doi: 10.1038/nature17970. [DOI] [PMC free article] [PubMed] [Google Scholar]
124.Plaschka C, et al. Transcription initiation complex structures elucidate DNA opening. Nature. 2016;533:353–358. doi: 10.1038/nature17990. [This study reports structures of open and closed yeast PIC complexes and proposes a mechanism of DNA duplex opening.] [DOI] [PubMed] [Google Scholar]
125.Vermeulen M, et al. Selective Anchoring of TFIID to Nucleosomes by Trimethylation of Histone H3 Lysine 4. Cell. 2007;131:58–69. doi: 10.1016/j.cell.2007.08.016. [DOI] [PubMed] [Google Scholar]
126.Papai G, et al. TFIIA and the transactivator Rap1 cooperate to commit TFIID for transcription initiation. Nature. 2010;465:956–960. doi: 10.1038/nature09080. [DOI] [PMC free article] [PubMed] [Google Scholar]
127.Liu W-L, et al. Structures of three distinct activator-TFIID complexes. Genes Dev. 2009;23:1510–1521. doi: 10.1101/gad.1790709. [DOI] [PMC free article] [PubMed] [Google Scholar]
128.Chopra VS, et al. Transcriptional activation by GAGA factor is through its direct interaction with dmTAF3. Dev Biol. 2008;317:660–670. doi: 10.1016/j.ydbio.2008.02.008. [DOI] [PubMed] [Google Scholar]
129.Hochheimer A, Tjian R. Diversified transcription initiation complexes expand promoter selectivity and tissue-specific gene expression. Genes Dev. 2003;17:1309–1320. doi: 10.1101/gad.1099903. [DOI] [PubMed] [Google Scholar]
130.Taatjes DJ, Marr MT, Tjian R. Regulatory diversity among metazoan co-activator complexes. Nat Rev Mol Cell Biol. 2004;5:403–410. doi: 10.1038/nrm1369. [DOI] [PubMed] [Google Scholar]
131.Goodrich JA, Tjian R. Unexpected roles for core promoter recognition factors in cell-type-specific transcription and gene regulation. Nat Rev Genet. 2010;11:549–558. doi: 10.1038/nrg2847. [DOI] [PMC free article] [PubMed] [Google Scholar]
132.Jones KA. Changing the core of transcription. eLife. 2014;3:e03575. doi: 10.7554/eLife.03575. [DOI] [PMC free article] [PubMed] [Google Scholar]
133.Hochheimer A, Zhou S, Zheng S, Holmes MC, Tjian R. TRF2 associates with DREF and directs promoter-selective gene expression in Drosophila. Nature. 2002;420:439–445. doi: 10.1038/nature01167. [DOI] [PubMed] [Google Scholar]
134.Wang Y-L, et al. TRF2, but not TBP, mediates the transcription of ribosomal protein genes. Genes Dev. 2014;28:1550–1555. doi: 10.1101/gad.245662.114. [References 133 and 134 reveal that TRF2 replaces TBP within the PIC to drive transcription of a specific subset of genes.] [DOI] [PMC free article] [PubMed] [Google Scholar]
135.Isogai Y, Keles S, Prestel M, Hochheimer A, Tjian R. Transcription of histone gene cluster by differential core-promoter factors. Genes Dev. 2007;21:2936–2949. doi: 10.1101/gad.1608807. [DOI] [PMC free article] [PubMed] [Google Scholar]
136.Rhee HS, Pugh BF. Genome-wide structure and organization of eukaryotic pre-initiation complexes. Nature. 2012;483:295–301. doi: 10.1038/nature10799. [This study was the first to map positions of PIC components at high resolution across the entire yeast genome.] [DOI] [PMC free article] [PubMed] [Google Scholar]
137.Basehoar AD, Zanton SJ, Pugh BF. Identification and distinct regulation of yeast TATA box-containing genes. Cell. 2004;116:699–709. doi: 10.1016/s0092-8674(04)00205-3. [DOI] [PubMed] [Google Scholar]
138.Struhl K. Constitutive and inducible Saccharomyces cerevisiae promoters: evidence for two distinct molecular mechanisms. Mol Cell Biol. 1986;6:3847–3853. doi: 10.1128/mcb.6.11.3847. [DOI] [PMC free article] [PubMed] [Google Scholar]
139.Baptista T, et al. SAGA Is a General Cofactor for RNA Polymerase II Transcription. Mol Cell. 2017;68:130–143.e5. doi: 10.1016/j.molcel.2017.08.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
140.Warfield L, et al. Transcription of Nearly All Yeast RNA Polymerase II-Transcribed Genes Is Dependent on Transcription Factor TFIID. Mol Cell. 2017;68:118–129.e5. doi: 10.1016/j.molcel.2017.08.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
141.Zeitlinger J, et al. RNA polymerase stalling at developmental control genes in the Drosophila melanogaster embryo. Nat Genet. 2007;39:1512–1516. doi: 10.1038/ng.2007.26. [DOI] [PMC free article] [PubMed] [Google Scholar]
142.Muse GW, et al. RNA polymerase is poised for activation across the genome. Nat Genet. 2007;39:1507–1511. doi: 10.1038/ng.2007.21. [References 141 and 142 report widespread Pol II pausing at developmentally regulated genes in fly.] [DOI] [PMC free article] [PubMed] [Google Scholar]
143.Guenther MG, Levine SS, Boyer LA, Jaenisch R, Young RA. A chromatin landmark and transcription initiation at most promoters in human cells. Cell. 2007;130:77–88. doi: 10.1016/j.cell.2007.05.042. [DOI] [PMC free article] [PubMed] [Google Scholar]
144.Rougvie AE, Lis JT. The RNA polymerase II molecule at the 5' end of the uninduced hsp70 gene of D. melanogaster is transcriptionally engaged. Cell. 1988;54:795–804. doi: 10.1016/s0092-8674(88)91087-2. [This study was the first to show that Pol II pauses downstream of the TSS.] [DOI] [PubMed] [Google Scholar]
145.Lis JT, Mason P, Peng J, Price DH, Werner J. P-TEFb kinase recruitment and function at heat shock loci. Genes Dev. 2000;14:792–803. [PMC free article] [PubMed] [Google Scholar]
146.Henriques T, et al. Stable pausing by RNA polymerase II provides an opportunity to target and integrate regulatory signals. Mol Cell. 2013;52:517–528. doi: 10.1016/j.molcel.2013.10.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
147.Boettiger AN, Levine M. Synchronous and stochastic patterns of gene activation in the Drosophila embryo. Science. 2009;325:471–473. doi: 10.1126/science.1173976. [DOI] [PMC free article] [PubMed] [Google Scholar]
148.Lagha M, et al. Paused Pol II coordinates tissue morphogenesis in the Drosophila embryo. Cell. 2013;153:976–987. doi: 10.1016/j.cell.2013.04.045. [DOI] [PMC free article] [PubMed] [Google Scholar]
149.Williams LH, et al. Pausing of RNA polymerase II regulates mammalian developmental potential through control of signaling networks. Mol Cell. 2015;58:311–322. doi: 10.1016/j.molcel.2015.02.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
150.Jonkers I, Kwak H, Lis JT. Genome-wide dynamics of Pol II elongation and its interplay with promoter proximal pausing, chromatin, and exons. eLife. 2014;3:e02407. doi: 10.7554/eLife.02407. [DOI] [PMC free article] [PubMed] [Google Scholar]
151.Shao W, Zeitlinger J. Paused RNA polymerase II inhibits new transcriptional initiation. Nat Genet. 2017;49:1045–1051. doi: 10.1038/ng.3867. [DOI] [PubMed] [Google Scholar]
152.Krebs AR, et al. Genome-wide Single-Molecule Footprinting Reveals High RNA Polymerase II Turnover at Paused Promoters. Mol Cell. 2017;67:411–422.e4. doi: 10.1016/j.molcel.2017.06.027. [DOI] [PMC free article] [PubMed] [Google Scholar]
153.Gressel S, et al. CDK9-dependent RNA polymerase II pausing controls transcription initiation. eLife. 2017;6:e29736. doi: 10.7554/eLife.29736. [References 151–153 report a wide range of paused Pol II half-lives at promoters genome-wide.] [DOI] [PMC free article] [PubMed] [Google Scholar]
154.Ehrensberger AH, Kelly GP, Svejstrup JQ. Mechanistic interpretation of promoter-proximal peaks and RNAPII density maps. Cell. 2013;154:713–715. doi: 10.1016/j.cell.2013.07.032. [DOI] [PubMed] [Google Scholar]
155.Hendrix DA, Hong J-W, Zeitlinger J, Rokhsar DS, Levine MS. Promoter elements associated with RNA Pol II stalling in the Drosophila embryo. Proc Natl Acad Sci USA. 2008;105:7762–7767. doi: 10.1073/pnas.0802406105. [DOI] [PMC free article] [PubMed] [Google Scholar]
156.Veloso A, et al. Rate of elongation by RNA polymerase II is associated with specific gene features and epigenetic modifications. Genome Res. 2014;24:896–905. doi: 10.1101/gr.171405.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
157.Gartenberg MR, Wang JC. Positive supercoiling of DNA greatly diminishes mRNA synthesis in yeast. Proc Natl Acad Sci USA. 1992;89:11461–11465. doi: 10.1073/pnas.89.23.11461. [DOI] [PMC free article] [PubMed] [Google Scholar]
158.Joshi RS, Piña B, Roca J. Positional dependence of transcriptional inhibition by DNA torsional stress in yeast chromosomes. EMBO J. 2010;29:740–748. doi: 10.1038/emboj.2009.391. [DOI] [PMC free article] [PubMed] [Google Scholar]
159.Henriques T, et al. Widespread transcriptional pausing and elongation control at enhancers. Genes Dev. 2018;32:26–41. doi: 10.1101/gad.309351.117. [DOI] [PMC free article] [PubMed] [Google Scholar]
160.Chen FX, et al. PAF1 regulation of promoter-proximal pause release via enhancer activation. Science. 2017;357:1294–1298. doi: 10.1126/science.aan3269. [References 159 and 160 report widespread pausing of Pol II at enhancers.] [DOI] [PMC free article] [PubMed] [Google Scholar]
161.Baranello L, et al. RNA Polymerase II Regulates Topoisomerase 1 Activity to Favor Efficient Transcription. Cell. 2016;165:357–371. doi: 10.1016/j.cell.2016.02.036. [DOI] [PMC free article] [PubMed] [Google Scholar]
162.Missra A, Gilmour DS. Interactions between DSIF (DRB sensitivity inducing factor), NELF (negative elongation factor), and the Drosophila RNA polymerase II transcription elongation complex. Proc Natl Acad Sci USA. 2010;107:11301–11306. doi: 10.1073/pnas.1000681107. [DOI] [PMC free article] [PubMed] [Google Scholar]
163.Yamaguchi Y, Shibata H, Handa H. Transcription elongation factors DSIF and NELF: promoter-proximal pausing and beyond. Biochim Biophys Acta. 2013;1829:98–104. doi: 10.1016/j.bbagrm.2012.11.007. [DOI] [PubMed] [Google Scholar]
164.Bernecky C, Plitzko JM, Cramer P. Structure of a transcribing RNA polymerase II-DSIF complex reveals a multidentate DNA-RNA clamp. Nat Struct Mol Biol. 2017;24:809–815. doi: 10.1038/nsmb.3465. [DOI] [PubMed] [Google Scholar]
165.Qiu Y, Gilmour DS. Identification of Regions in the Spt5 Subunit of DRB Sensitivity-inducing Factor (DSIF) That Are Involved in Promoter-proximal Pausing. J Biol Chem. 2017;292:5555–5570. doi: 10.1074/jbc.M116.760751. [References 164 and 165 provide structural and biochemical evidence that DSIF contacts nascent RNA protruding from Pol II, ascribing it a role in triggering Pol II pausing.] [DOI] [PMC free article] [PubMed] [Google Scholar]
166.Ehara H, et al. Structure of the complete elongation complex of RNA polymerase II with basal factors. Science. 2017;357:921–924. doi: 10.1126/science.aan8552. [DOI] [PubMed] [Google Scholar]
167.Li G, et al. Extensive Promoter-Centered Chromatin Interactions Provide a Topological Basis for Transcription Regulation. Cell. 2012;148:84–98. doi: 10.1016/j.cell.2011.12.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
168.Rao SSP, et al. A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell. 2014;159:1665–1680. doi: 10.1016/j.cell.2014.11.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
169.Sanyal A, Lajoie BR, Jain G, Dekker J. The long-range interaction landscape of gene promoters. Nature. 2012;489:109–113. doi: 10.1038/nature11279. [DOI] [PMC free article] [PubMed] [Google Scholar]
170.Beagrie RA, et al. Complex multi-enhancer contacts captured by genome architecture mapping. Nature. 2017;543:519–524. doi: 10.1038/nature21411. [DOI] [PMC free article] [PubMed] [Google Scholar]
171.Symmons O, et al. Functional and topological characteristics of mammalian regulatory domains. Genome Res. 2014;24:390–400. doi: 10.1101/gr.163519.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
172.Merkenschlager M, Nora EP. CTCF and Cohesin in Genome Folding and Transcriptional Gene Regulation. Annu Rev Genom Human Genet. 2016;17:17–43. doi: 10.1146/annurev-genom-083115-022339. [DOI] [PubMed] [Google Scholar]
173.Spitz F. Gene regulation at a distance: From remote enhancers to 3D regulatory ensembles. Semin Cell Dev Biol. 2016;57:57–67. doi: 10.1016/j.semcdb.2016.06.017. [DOI] [PubMed] [Google Scholar]
174.Ghavi-Helm Y, et al. Enhancer loops appear stable during development and are associated with paused polymerase. Nature. 2014;512:96–100. doi: 10.1038/nature13417. [DOI] [PubMed] [Google Scholar]
175.Michel M, Cramer P. Transitions for regulating early transcription. Cell. 2013;153:943–944. doi: 10.1016/j.cell.2013.04.050. [DOI] [PubMed] [Google Scholar]
176.Eychenne T, et al. Functional interplay between Mediator and TFIIB in preinitiation complex assembly in relation to promoter architecture. Genes Dev. 2016;30:2119–2132. doi: 10.1101/gad.285775.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
177.Esnault C, et al. Mediator-dependent recruitment of TFIIH modules in preinitiation complex. Mol Cell. 2008;31:337–346. doi: 10.1016/j.molcel.2008.06.021. [DOI] [PubMed] [Google Scholar]
178.Visel A, et al. ChIP-seq accurately predicts tissue-specific activity of enhancers. Nature. 2009;457:854–858. doi: 10.1038/nature07730. [DOI] [PMC free article] [PubMed] [Google Scholar]
179.Boija A, et al. CBP Regulates Recruitment and Release of Promoter-Proximal RNA Polymerase II. Mol Cell. 2017;68:491–503.e5. doi: 10.1016/j.molcel.2017.09.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
180.Sawado T, Halow J, Bender MA, Groudine M. The beta-globin locus control region (LCR) functions primarily by enhancing the transition from transcription initiation to elongation. Genes Dev. 2003;17:1009–1018. doi: 10.1101/gad.1072303. [DOI] [PMC free article] [PubMed] [Google Scholar]
181.Yang Z, et al. Recruitment of P-TEFb for stimulation of transcriptional elongation by the bromodomain protein Brd4. Mol Cell. 2005;19:535–545. doi: 10.1016/j.molcel.2005.06.029. [DOI] [PubMed] [Google Scholar]
182.Jang MK, et al. The bromodomain protein Brd4 is a positive regulatory component of P-TEFb and stimulates RNA polymerase II-dependent transcription. Mol Cell. 2005;19:523–534. doi: 10.1016/j.molcel.2005.06.027. [DOI] [PubMed] [Google Scholar]
183.Zuber J, et al. RNAi screen identifies Brd4 as a therapeutic target in acute myeloid leukaemia. Nature. 2011;478:524–528. doi: 10.1038/nature10334. [DOI] [PMC free article] [PubMed] [Google Scholar]
184.Delmore JE, et al. BET bromodomain inhibition as a therapeutic strategy to target c-Myc. Cell. 2011;146:904–917. doi: 10.1016/j.cell.2011.08.017. [References 183 and 184 identify BRD4 as a gene-specific regulator whose depletion affects only a subset of genes.] [DOI] [PMC free article] [PubMed] [Google Scholar]
185.Lovén J, et al. Selective inhibition of tumor oncogenes by disruption of super-enhancers. Cell. 2013;153:320–334. doi: 10.1016/j.cell.2013.03.036. [DOI] [PMC free article] [PubMed] [Google Scholar]
186.Rathert P, et al. Transcriptional plasticity promotes primary and acquired resistance to BET inhibition. Nature. 2015;525:543–547. doi: 10.1038/nature14898. [DOI] [PMC free article] [PubMed] [Google Scholar]
187.Winter GE, et al. BET Bromodomain Proteins Function as Master Transcription Elongation Factors Independent of CDK9 Recruitment. Mol Cell. 2017;67:5–18. doi: 10.1016/j.molcel.2017.06.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
188.Muhar M, et al. SLAM-seq defines direct gene-regulatory functions of the BRD4-MYC axis. Science. 2018. doi: 10.1126/science.aao2793. [References 187 and 188 demonstrate that BRD proteins, in particular Brd4, are globally required for transition into productive elongation in a manner that is independent of CDK9 recruitment.] [DOI] [PMC free article] [PubMed] [Google Scholar]
189.Chen FX, et al. PAF1, a Molecular Regulator of Promoter-Proximal Pausing by RNA Polymerase II. Cell. 2015;162:1003–1015. doi: 10.1016/j.cell.2015.07.042. [DOI] [PMC free article] [PubMed] [Google Scholar]
190.Chubb JR, Trcek T, Shenoy SM, Singer RH. Transcriptional pulsing of a developmental gene. Curr Biol. 2006;16:1018–1025. doi: 10.1016/j.cub.2006.03.092. [DOI] [PMC free article] [PubMed] [Google Scholar]
191.Raj A, Peskin CS, Tranchina D, Vargas DY, Tyagi S. Stochastic mRNA synthesis in mammalian cells. PLoS Biol. 2006;4:e309. doi: 10.1371/journal.pbio.0040309. [DOI] [PMC free article] [PubMed] [Google Scholar]
192.Tantale K, et al. A single-molecule view of transcription reveals convoys of RNA polymerases and multi-scale bursting. Nat Commun. 2016;7:12248. doi: 10.1038/ncomms12248. [DOI] [PMC free article] [PubMed] [Google Scholar]
193.Fukaya T, Lim B, Levine M. Enhancer Control of Transcriptional Bursting. Cell. 2016;166:358–368. doi: 10.1016/j.cell.2016.05.025. [This study provides evidence that enhancers regulate the frequency of transcription bursts synchronously from multiple promoters in their vicinity.] [DOI] [PMC free article] [PubMed] [Google Scholar]
194.Bartman CR, Hsu SC, Hsiung CC-S, Raj A, Blobel GA. Enhancer Regulation of Transcriptional Bursting Parameters Revealed by Forced Chromatin Looping. Mol Cell. 2016;62:237–247. doi: 10.1016/j.molcel.2016.03.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
195.Hornung G, et al. Noise-mean relationship in mutated promoters. Genome Res. 2012;22:2409–2417. doi: 10.1101/gr.139378.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
196.Blake WJ, et al. Phenotypic consequences of promoter-mediated transcriptional noise. Mol Cell. 2006;24:853–865. doi: 10.1016/j.molcel.2006.11.003. [DOI] [PubMed] [Google Scholar]
197.Tirosh I, Weinberger A, Carmi M, Barkai N. A genetic signature of interspecies variations in gene expression. Nat Genet. 2006;38:830–834. doi: 10.1038/ng1819. [DOI] [PubMed] [Google Scholar]
198.Arnold CD, et al. Genome-wide assessment of sequence-intrinsic enhancer responsiveness at single-base-pair resolution. Nat Biotechnol. 2016;35:136–144. doi: 10.1038/nbt.3739. [This study measures enhancer responsiveness for all core promoters across the fly genome and demonstrates that core promoters show differential responses to different enhancers.] [DOI] [PMC free article] [PubMed] [Google Scholar]
199.Deng W, et al. Controlling long-range genomic interactions at a native locus by targeted tethering of a looping factor. Cell. 2012;149:1233–1244. doi: 10.1016/j.cell.2012.03.051. [DOI] [PMC free article] [PubMed] [Google Scholar]
200.Butler JE, Kadonaga JT. Enhancer-promoter specificity mediated by DPE or TATA core promoter motifs. Genes Dev. 2001;15:2515–2519. doi: 10.1101/gad.924301. [This work demonstrates that TATA-box- and DPE-containing core promoters can be differentially activated when integrated in the same genomic locus.] [DOI] [PMC free article] [PubMed] [Google Scholar]
201.Zabidi MA, et al. Enhancer-core-promoter specificity separates developmental and housekeeping gene regulation. Nature. 2015;518:556–559. doi: 10.1038/nature13994. [This work provides evidence for sequence-encoded enhancer–core promoter specificity that distinguishes between housekeeping and developmental transcription programmes in the fly genome.] [DOI] [PMC free article] [PubMed] [Google Scholar]
202.Ptashne M, Gann A. Transcriptional activation by recruitment. Nature. 1997;386:569–577. doi: 10.1038/386569a0. [DOI] [PubMed] [Google Scholar]
203.Brent R, Ptashne M. A eukaryotic transcriptional activator bearing the DNA specificity of a prokaryotic repressor. Cell. 1985;43:729–736. doi: 10.1016/0092-8674(85)90246-6. [DOI] [PubMed] [Google Scholar]
204.Hope IA, Struhl K. Functional dissection of a eukaryotic transcriptional activator protein, GCN4 of yeast. Cell. 1986;46:885–894. doi: 10.1016/0092-8674(86)90070-x. [DOI] [PubMed] [Google Scholar]
205.Keung AJ, Bashor CJ, Kiriakov S, Collins JJ, Khalil AS. Using Targeted Chromatin Regulators to Engineer Combinatorial and Spatial Transcriptional Regulation. Cell. 2014;158:110–120. doi: 10.1016/j.cell.2014.04.047. [DOI] [PMC free article] [PubMed] [Google Scholar]
206.Stampfel G, et al. Transcriptional regulators form diverse groups with context-dependent regulatory functions. Nature. 2015;528:147–151. doi: 10.1038/nature15545. [References 202–206 show that artificial recruitment of transcription factors and cofactors can be sufficient to drive transcription and that their activity is often context-dependent.] [DOI] [PubMed] [Google Scholar]
207.Juven-Gershon T, Hsu J-Y, Kadonaga JT. Caudal, a key developmental regulator, is a DPE-specific transcriptional factor. Genes Dev. 2008;22:2823–2830. doi: 10.1101/gad.1698108. [DOI] [PMC free article] [PubMed] [Google Scholar]
208.van Arensbergen J, van Steensel B, Bussemaker HJ. In search of the determinants of enhancer-promoter interaction specificity. Trends Cell Biol. 2014;24:695–702. doi: 10.1016/j.tcb.2014.07.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
209.Petrenko N, Jin Y, Wong KH, Struhl K. Evidence that Mediator is essential for Pol II transcription, but is not a required component of the preinitiation complex in vivo. eLife. 2017;6:e28447. doi: 10.7554/eLife.28447. [This work demonstrates that the depletion of different Mediator subunits affects transcription of a specific subset of genes more strongly than others.] [DOI] [PMC free article] [PubMed] [Google Scholar]
210.Huminiecki Ł, Horbańczuk J. Can We Predict Gene Expression by Understanding Proximal Promoter Architecture? Trends Biotechnol. 2017;35:530–546. doi: 10.1016/j.tibtech.2017.03.007. [DOI] [PubMed] [Google Scholar]
211.Bonn S, et al. Cell type-specific chromatin immunoprecipitation from multicellular complex samples using BiTS-ChIP. Nature Protocols. 2012;7:978–994. doi: 10.1038/nprot.2012.049. [DOI] [PubMed] [Google Scholar]
212.Lai WKM, Pugh BF. Genome-wide uniformity of human ‘open’ pre-initiation complexes. Genome Res. 2017;27:15–26. doi: 10.1101/gr.210955.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
213.Andersson R, Sandelin A, Danko CG. A unified architecture of transcriptional regulatory elements. Trends Genet. 2015;31:426–433. doi: 10.1016/j.tig.2015.05.007. [DOI] [PubMed] [Google Scholar]
214.Arnold CD, et al. Genome-wide quantitative enhancer activity maps identified by STARR-seq. Science. 2013;339:1074–1077. doi: 10.1126/science.1232542. [This work was the first to functionally map enhancer activity across an entire genome.] [DOI] [PubMed] [Google Scholar]
215.Muerdter F, et al. Resolving systematic errors in widely used enhancer activity assays in human cells. Nat Methods. 2018;15:141–149. doi: 10.1038/nmeth.4534. [DOI] [PMC free article] [PubMed] [Google Scholar]
216.van Arensbergen J, et al. Genome-wide mapping of autonomous promoter activity in human cells. Nat Biotechnol. 2016;35:145–153. doi: 10.1038/nbt.3754. [This study reports autonomous promoter activity across the human genome.] [DOI] [PMC free article] [PubMed] [Google Scholar]
217.Nguyen TA, et al. High-throughput functional comparison of promoter and enhancer activities. Genome Res. 2016;26:1023–1033. doi: 10.1101/gr.204834.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
218.Dao LTM, et al. Genome-wide characterization of mammalian promoters with distal enhancer functions. Nat Genet. 2017;49:1073–1081. doi: 10.1038/ng.3884. [DOI] [PubMed] [Google Scholar]
219.Catarino RR, Neumayr C, Stark A. Promoting transcription over long distances. Nat Genet. 2017;49:972–973. doi: 10.1038/ng.3904. [DOI] [PubMed] [Google Scholar]
220.Young RS, Kumar Y, Bickmore WA, Taylor MS. Bidirectional transcription initiation marks accessible chromatin and is not specific to enhancers. Genome Biol. 2017;18:242. doi: 10.1186/s13059-017-1379-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
221.Jin Y, Eser U, Struhl K, Churchman LS. The Ground State and Evolution of Promoter Region Directionality. Cell. 2017;170:1–21. doi: 10.1016/j.cell.2017.07.006. [This work shows that transcription from newly emerged promoter regions in yeast is bidirectional and that transcription directionality is an evolutionarily selected trait.] [DOI] [PMC free article] [PubMed] [Google Scholar]
222.Neri F, et al. Intragenic DNA methylation prevents spurious transcription initiation. Nature. 2017;543:72–77. doi: 10.1038/nature21373. [DOI] [PubMed] [Google Scholar]
223.Kim J, et al. Blocking promiscuous activation at cryptic promoters directs cell type-specific gene expression. Science. 2017;356:717–721. doi: 10.1126/science.aal3096. [DOI] [PMC free article] [PubMed] [Google Scholar]
224.Lam MTY, Li W, Rosenfeld MG, Glass CK. Enhancer RNAs and regulated transcriptional programs. Trends Biochem Sci. 2014;39:170–182. doi: 10.1016/j.tibs.2014.02.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
225.Tsai A, et al. Nuclear microenvironments modulate transcription from low-affinity enhancers. eLife. 2017;6:e28975. doi: 10.7554/eLife.28975. [DOI] [PMC free article] [PubMed] [Google Scholar]
226.Hnisz D, Shrinivas K, Young RA, Chakraborty AK, Sharp PA. A Phase Separation Model for Transcriptional Control. Cell. 2017;169:13–23. doi: 10.1016/j.cell.2017.02.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
227.Muerdter F, Stark A. Gene Regulation: Activation through Space. Curr Biol. 2016;26:R895–R898. doi: 10.1016/j.cub.2016.08.031. [DOI] [PubMed] [Google Scholar]
228.Brangwynne CP, et al. Germline P granules are liquid droplets that localize by controlled dissolution/condensation. Science. 2009;324:1729–1732. doi: 10.1126/science.1172046. [DOI] [PubMed] [Google Scholar]
229.Kato M, et al. Cell-free formation of RNA granules: low complexity sequence domains form dynamic fibers within hydrogels. Cell. 2012;149:753–767. doi: 10.1016/j.cell.2012.04.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
230.Han TW, et al. Cell-free formation of RNA granules: bound RNAs identify features and components of cellular assemblies. Cell. 2012;149:768–779. doi: 10.1016/j.cell.2012.04.016. [DOI] [PubMed] [Google Scholar]
231.Larson AG, et al. Liquid droplet formation by HP1α suggests a role for phase separation in heterochromatin. Nature. 2017;547:236–240. doi: 10.1038/nature22822. [DOI] [PMC free article] [PubMed] [Google Scholar]
232.Strom AR, et al. Phase separation drives heterochromatin domain formation. Nature. 2017;547:241–245. doi: 10.1038/nature22989. [DOI] [PMC free article] [PubMed] [Google Scholar]
233.Dekker J, Mirny L. The 3D Genome as Moderator of Chromosomal Communication. Cell. 2016;164:1110–1121. doi: 10.1016/j.cell.2016.02.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
234.Dixon JR, Gorkin DU, Ren B. Chromatin Domains: The Unit of Chromosome Organization. Mol Cell. 2016;62:668–680. doi: 10.1016/j.molcel.2016.05.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
235.Roider HG, Lenhard B, Kanhere A, Haas SA, Vingron M. CpG-depleted promoters harbor tissue-specific transcription factor binding signals--implications for motif overrepresentation analyses. Nucleic Acids Res. 2009;37:6305–6315. doi: 10.1093/nar/gkp682. [DOI] [PMC free article] [PubMed] [Google Scholar]
236.Bernstein BE, et al. A bivalent chromatin structure marks key developmental genes in embryonic stem cells. Cell. 2006;125:315–326. doi: 10.1016/j.cell.2006.02.041. [DOI] [PubMed] [Google Scholar]
237.Diao Y, et al. A tiling-deletion-based genetic screen for cis-regulatory element identification in mammalian cells. Nat Methods. 2017;14:629–635. doi: 10.1038/nmeth.4264. [DOI] [PMC free article] [PubMed] [Google Scholar]
238.Rajagopal N, et al. High-throughput mapping of regulatory DNA. Nat Biotechnol. 2016;34:167–174. doi: 10.1038/nbt.3468. [DOI] [PMC free article] [PubMed] [Google Scholar]
239.Lubliner S, et al. Core promoter sequence in yeast is a major determinant of expression level. Genome Res. 2015;25:1008–1017. doi: 10.1101/gr.188193.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
240.Patwardhan RP, et al. High-resolution analysis of DNA regulatory elements by synthetic saturation mutagenesis. Nat Biotechnol. 2009;27:1173–1175. doi: 10.1038/nbt.1589. [DOI] [PMC free article] [PubMed] [Google Scholar]
241.Bucher P. Weight matrix descriptions of four eukaryotic RNA polymerase II promoter elements derived from 502 unrelated promoter sequences. J Mol Biol. 1990;212:563–578. doi: 10.1016/0022-2836(90)90223-9. [DOI] [PubMed] [Google Scholar]
242.Hahn S, Buratowski S, Sharp PA, Guarente L. Yeast TATA-binding protein TFIID binds to TATA elements with both consensus and nonconsensus DNA sequences. Proc Natl Acad Sci USA. 1989;86:5718–5722. doi: 10.1073/pnas.86.15.5718. [DOI] [PMC free article] [PubMed] [Google Scholar]
243.Arkhipova IR, et al. The steps of reverse transcription of Drosophila mobile dispersed genetic elements and U3-R-U5 structure of their LTRs. Cell. 1986;44:555–563. doi: 10.1016/0092-8674(86)90265-5. [DOI] [PubMed] [Google Scholar]
244.Li J, Gilmour DS. Distinct mechanisms of transcriptional pausing orchestrated by GAGA factor and M1BP, a novel transcription factor. EMBO J. 2013;32:1829–1841. doi: 10.1038/emboj.2013.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
245.Hirose F, Yamaguchi M, Handa H, Inomata Y, Matsukage A. Novel 8-base pair sequence (Drosophila DNA replication-related element) and specific binding factor involved in the expression of Drosophila genes for DNA polymerase alpha and proliferating cell nuclear antigen. J Biol Chem. 1993;268:2092–2099. [PubMed] [Google Scholar]
246.Parry TJ, et al. The TCT motif, a key component of an RNA polymerase II transcription system for the translational machinery. Genes Dev. 2010;24:2013–2018. doi: 10.1101/gad.1951110. [DOI] [PMC free article] [PubMed] [Google Scholar]
247.Tokusumi Y, Ma Y, Song X, Jacobson RH, Takada S. The new core promoter element XCPE1 (X Core Promoter Element 1) directs activator-, mediator-, and TATA-binding protein-dependent but TFIID-independent RNA polymerase II transcription from TATA-less promoters. Mol Cell Biol. 2007;27:1844–1858. doi: 10.1128/MCB.01363-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
248.Anish R, Hossain MB, Jacobson RH, Takada S. Characterization of transcription from TATA-less promoters: identification of a new core promoter element XCPE2 and analysis of factor requirements. PLoS ONE. 2009;4:e5103. doi: 10.1371/journal.pone.0005103. [DOI] [PMC free article] [PubMed] [Google Scholar]
