A high-resolution map of active promoters in the human genome - PubMed (original) (raw)

. 2005 Aug 11;436(7052):876-80.

doi: 10.1038/nature03877. Epub 2005 Jun 29.

Affiliations

A high-resolution map of active promoters in the human genome

Tae Hoon Kim et al. Nature. 2005.

Abstract

In eukaryotic cells, transcription of every protein-coding gene begins with the assembly of an RNA polymerase II preinitiation complex (PIC) on the promoter. The promoters, in conjunction with enhancers, silencers and insulators, define the combinatorial codes that specify gene expression patterns. Our ability to analyse the control logic encoded in the human genome is currently limited by a lack of accurate information regarding the promoters for most genes. Here we describe a genome-wide map of active promoters in human fibroblast cells, determined by experimentally locating the sites of PIC binding throughout the human genome. This map defines 10,567 active promoters corresponding to 6,763 known genes and at least 1,196 un-annotated transcriptional units. Features of the map suggest extensive use of multiple promoters by the human genes and widespread clustering of active promoters in the genome. In addition, examination of the genome-wide expression profile reveals four general classes of promoters that define the transcriptome of the cell. These results provide a global view of the functional relationships among transcriptional machinery, chromatin structure and gene expression in human cells.

PubMed Disclaimer

Figures

Figure 1

Figure 1

Identification and characterization of active promoters in the human genome. (a) Outline of the strategy employed to map TFIID-binding sites in the genome. (b) A representative view of the results from TFIID ChIP-on-chip analysis. The logarithmic ratio (log2 R) of hybridization intensities between TFIID ChIP DNA and a control DNA, and RefSeq gene annotation is shown in the top and middle panels, respectively. A close-up view of two replicate sets of TFIID ChIP-on-chip hybridization signals around the 5′ end of the TCFL1 gene is shown in the bottom panel. Arrows indicate the position of TFIID-binding site determined by a peak-finding algorithm. (c) Distribution of TFIID-binding sites relative to the 5′ end of the matched transcripts. (d & e) Venn diagrams showing number of identified promoters that matched EnsEMBL genes (d) or promoters annotated in DBTSS (e). (f) Chart showing the percentage of IMR90 or DBTSS promoters overlapping with CpG islands, or containing conserved TATA box, INR or DPE elements (see supplemental information for details).

Figure 2

Figure 2

Chromatin modification features of the active promoters. Logarithmic ratios of the ChIP-on-chip hybridization intensities (log2 R) of probes from 0.5 Kbp upstream to 0.5 Kbp downstream of the identified TFIID-binding sites for TFIID, RNAP, AcH3, and MeH3K4 are plotted in a yellow-blue colored scale for 9,330 transcript-matched promoters. The bottom panel shows a yellow-blue colored scale used to color each cell with corresponding log2 R values. (b) A detailed view of TFIID, RNAP, AcH3, and MeH3K4 profiles on the promoter of RPS24 gene.

Figure 3

Figure 3

Utilization of multiple promoters by human genes. (a) Annotation of the WEE1 gene locus and the corresponding TFIID-binding profile. Black bars over the first and second exons in transcripts indicate the positions of the primers used for real-time quantitative RT-PCR analysis of each transcript. (b) RT-PCR analysis of NM_003390 and AK122837 transcripts in asynchronous population of IMR90 cells. (c) Real-time quantitative RT-PCR analysis of NM_003390 and AK122837 transcripts in cell cycle synchronized population of IMR90 cells. Transcript levels observed for each cell cycle phase were normalized to the level observed in the asynchronous population.

Figure 4

Figure 4

Four distinct classes of promoters define the transcriptome of IMR90 cells. (a) A 2x2 matrix describes the distribution of genes defined by expression and PIC occupancy on the promoter. (b & c) Matrices showing the percentages of genes associated with the AcH3 (b) or MeH3K4 (c) modification for each of the four classes of genes. Italicized numbers in some boxes represent extrapolation from the 29 ENCODE regions.

Similar articles

Cited by

References

    1. Smale ST, Kadonaga JT. The RNA polymerase II core promoter. Annu Rev Biochem. 2003;72:449–79. - PubMed
    1. Tjian R, Maniatis T. Transcriptional activation: a complex puzzle with few easy pieces. Cell. 1994;77:5–8. - PubMed
    1. Trinklein ND, Aldred SJ, Saldanha AJ, Myers RM. Identification and functional analysis of human transcriptional promoters. Genome Res. 2003;13:308–12. - PMC - PubMed
    1. Reinberg D, et al. The RNA polymerase II general transcription factors: past, present, and future. Cold Spring Harb Symp Quant Biol. 1998;63:83–103. - PubMed
    1. Ren B, et al. Genome-wide location and function of DNA binding proteins. Science. 2000;290:2306–9. - PubMed

Publication types

MeSH terms

Substances

LinkOut - more resources