Global mapping of protein-DNA interactions in vivo by digital genomic footprinting - PubMed (original) (raw)

doi: 10.1038/nmeth.1313. Epub 2009 Mar 22.

Xiaoyu Chen, Zhihong Zhang, Peter J Sabo, Richard Sandstrom, Alex P Reynolds, Robert E Thurman, Shane Neph, Michael S Kuehn, William S Noble, Stanley Fields, John A Stamatoyannopoulos

Affiliations

Global mapping of protein-DNA interactions in vivo by digital genomic footprinting

Jay R Hesselberth et al. Nat Methods. 2009 Apr.

Abstract

The orchestrated binding of transcriptional activators and repressors to specific DNA sequences in the context of chromatin defines the regulatory program of eukaryotic genomes. We developed a digital approach to assay regulatory protein occupancy on genomic DNA in vivo by dense mapping of individual DNase I cleavages from intact nuclei using massively parallel DNA sequencing. Analysis of >23 million cleavages across the Saccharomyces cerevisiae genome revealed thousands of protected regulatory protein footprints, enabling de novo derivation of factor binding motifs and the identification of hundreds of new binding sites for major regulators. We observed striking correspondence between single-nucleotide resolution DNase I cleavage patterns and protein-DNA interactions determined by crystallography. The data also yielded a detailed view of larger chromatin features including positioned nucleosomes flanking factor binding regions. Digital genomic footprinting should be a powerful approach to delineate the cis-regulatory framework of any organism with an available genome sequence.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing financial interests.

Figures

Fig. 1

Fig. 1. Digital DNase I analysis of yeast chromatin structure from chromosomal to nucleotide resolution

(a) Per-nucleotide DNase I cleavage density across an exemplary 100-kilobase region of chromosome 10 (chr10:625,000-725,000) containing ∼50 open reading frames (ORFs). (b) Magnification of exemplary ∼2.5 kb regions containing RSM17/YJR115W and ECM17/IML1 intergenic intervals. (c) Further magnification showing positions of individual DNase I cleavage events (stacked vertical black tick marks), revealing short regions protected from DNase I cleavage (DNase I “footprints”). (d) Resolution of individual DNase I footprints (red shading) with known motifs for yeast regulatory factors Rap1, Reb1, Abf1 and Cbf1. The dashed black line indicates the average level of DNase I cleavage throughout the genome (avg. ∼2 cleavages per bp).

Fig. 2

Fig. 2. Detection of footprints and corresponding sequence motifs

(a) Visualization of DNase I protection (footprinting) around 907 computationally-predicted Reb1 sites in a heat map. Rows show levels of DNase I cleavage 25 bp up- and downstream of each motif instance and are sorted by the ratio of mean cleavage over flanking regions to that within the motif itself. Red ticks (at left) indicate motif instances (n = 580) that coincide with footprints (FDR = 0.05) containing _de novo_-derived Reb1 motifs. Blue ticks (right) indicate motif instances (n = 151) coinciding with those identified by ChIP. All motif instances are uniquely mappable within the yeast genome. (b) Mean per nucleotide DNase I cleavage (red) and evolutionary conservation (Phastcons?; blue) calculated for footprints that match the Reb1, Abf1, Rap1 and Hsf1 motifs (subpanel vertical axes). Significance of observed conservation patterns (blue text) (Supplementary Methods), extent of consensus motifs derived from the footprinted region (green shading), motifs derived from ChIP and footprinting below. Venn diagrams depict the overlap of motifs derived from and mapping to footprints (red) vs. ChIP (blue).

Fig. 3

Fig. 3. Mean nucleotide-level accessibility parallels protein:DNA interactions

(a) Structure of Mcm1 (green) bound to a single DNA recognition site (adjacent Matα2 was removed for clarity). Colored DNA bases correspond to positions within the footprint-derived Mcm1 motif (below), and red DNA backbone coloration reflects the extent of observed DNase I cleavage across 88 Mcm1 sites (red trace in subpanel). Mean nucleotide-level conservation by PhastCons is shown in parallel (blue trace in subpanel; P < 10-5). (b) Structure of the human homolog of CBF1 is shown relative to the mean nucleotide level cleavage and conservation (P < 10-3) across 243 Cbf1 sites.

Fig. 4

Fig. 4. Individual yeast regulatory regions and factor binding sites

(a) Rap1 binds to two adjacent sites also predicted from ChIP experiments upstream of RPS6A (chr16:378,775-378,874). (b) Reb1 binds to a canonical site upstream of TUF1 (chr15:683,707-683,806) but a non-canonical site upstream is only inferred from ChIP data (c) Mcm1 site upstream of MFA1 (chr4:1,384,893-1,384,993) exhibits hypersensitive nucleotides illustrated in Fig. 3a. (d) Hsf1 site identified by ChIP in BTN2 promoter (chr7:772,068-772,167) is identified as a footprint. (e) Two Reb1 binding sites in the REB1 promoter (chr2:336,885-337,084) are identified as footprints; a Cbf1 site predicted by ChIP shows a footprint, but a Rpn4 site defined by ChIP does not. (f) Two Pdr3 sites in the PDR5 promoter (chr15:619,227-619,476) are identified as footprints, in addition to an evolutionarily conserved region further upstream. Each panel shows per nucleotide DNase I cleavage, detected footprints (red boxes), assigned motifs (pink boxes), binding sites inferred from ChIP experiments (blue boxes), and evolutionary conservation (dark blue, Phastcons, bottom).

Fig. 5

Fig. 5. Higher-order patterns of DNA accessibility

(a) Mapped DNase I cleavages relative to 5,006 TSSs. Four major clusters are exposed by _k_-means analysis (red, blue, green and purple bars, respectively). In the red cluster, maximal DNase I cleavage occurs in a stereotypic ∼50 bp band ∼100 bp upstream of the TSS (grey arrowhead, top). In the blue, green and purple clusters, the extent and intensity of DNase I cleavage upstream of the TSS widens to the -1, -2, and -3 nucleosomes (respectively). (b) Spatial restriction of footprints near TSSs. Distribution of footprints matching Reb1, Abf1, Rap1, Mcm1, Pdr3, Cbf1 and Hsf1 relative to the TSSs (dashed black lines) and start codons of 1,260 genes sorted by the length of the 5′UTR. Enrichment within a ∼50 bp region centered ∼100 bp upstream of the TSS (dashed red lines). (c) DNase I cleavage profiles aligned relative to Reb1, Abf1, Rap1 and Mcm1 footprints. (d) mRNA abundance for genes found in each of the four clusters correlates with the accessibility of the promoters of those genes (colors as in a; median expression denoted by a black bar).

Similar articles

Cited by

References

    1. Maniatis T, Ptashne M. Structure of the lambda operators. Nature. 1973;246:133–6. - PubMed
    1. Gilbert W. Polymerization in Biological Systems. Elsevier; North-Holland, Amsterdam: 1972. pp. 245–259.
    1. Galas DJ, Schmitz A. DNAse footprinting: a simple method for the detection of protein-DNA binding specificity. Nucleic Acids Res. 1978;5:3157–70. - PMC - PubMed
    1. Ren B, et al. Genome-wide location and function of DNA binding proteins. Science. 2000;290:2306–9. - PubMed
    1. Johnson DS, Mortazavi A, Myers RM, Wold B. Genome-wide mapping of in vivo protein-DNA interactions. Science. 2007;316:1497–502. - PubMed

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources