Rapid analysis of the DNA-binding specificities of transcription factors with DNA microarrays - PubMed (original) (raw)

Comparative Study

. 2004 Dec;36(12):1331-9.

doi: 10.1038/ng1473. Epub 2004 Nov 14.

Affiliations

Comparative Study

Rapid analysis of the DNA-binding specificities of transcription factors with DNA microarrays

Sonali Mukherjee et al. Nat Genet. 2004 Dec.

Abstract

We developed a new DNA microarray-based technology, called protein binding microarrays (PBMs), that allows rapid, high-throughput characterization of the in vitro DNA binding-site sequence specificities of transcription factors in a single day. Using PBMs, we identified the DNA binding-site sequence specificities of the yeast transcription factors Abf1, Rap1 and Mig1. Comparison of these proteins' in vitro binding sites with their in vivo binding sites indicates that PBM-derived sequence specificities can accurately reflect in vivo DNA sequence specificities. In addition to previously identified targets, Abf1, Rap1 and Mig1 bound to 107, 90 and 75 putative new target intergenic regions, respectively, many of which were upstream of previously uncharacterized open reading frames. Comparative sequence analysis indicated that many of these newly identified sites are highly conserved across five sequenced sensu stricto yeast species and, therefore, are probably functional in vivo binding sites that may be used in a condition-specific manner. Similar PBM experiments should be useful in identifying new cis regulatory elements and transcriptional regulatory networks in various genomes.

PubMed Disclaimer

Figures

Figure 1

Figure 1. Protein binding microarray (PBM) schematic

(a) Overview of PBM experiments. (b) Whole-genome yeast intergenic microarray bound by Rap1. The fluorescence intensities of the spots are shown in false-color, with white indicating saturated signal intensity, red indicating high signal intensity, green indicating moderate signal intensity, and blue indicating low signal intensity. (c) Zoom-in on a portion of the whole-genome yeast intergenic microarray bound by Rap1.

Figure 1

Figure 1. Protein binding microarray (PBM) schematic

(a) Overview of PBM experiments. (b) Whole-genome yeast intergenic microarray bound by Rap1. The fluorescence intensities of the spots are shown in false-color, with white indicating saturated signal intensity, red indicating high signal intensity, green indicating moderate signal intensity, and blue indicating low signal intensity. (c) Zoom-in on a portion of the whole-genome yeast intergenic microarray bound by Rap1.

Figure 2

Figure 2. Identifying the specifically bound spots

(a) Distribution of Rap1 PBM ratio (PBM / SybrGreen I) data. The arrow indicates those spots passing a P value threshold of 0.001 after correction for multiple hypothesis testing. Indicated in red are spots harboring an exact match to a sequence belonging to our discovered Rap1 binding site motif. (b) Zoom-in on intergenic regions, from both PBMs (left) and SybrGreen I stained microarrays (right), upstream of RPL14A, RPL8A, and OPI3, which are known direct targets of Rap1. The fluorescence intensities of the spots are shown in false-color, color-coded as described previously (Fig. 1 legend). PBM P values are corrected for multiple hypotheses. Determination of binding in ChIP-chip experiments (“YES” or “NO”) is described in Methods. All regions shown have an exact match to a sequence belonging to the discovered Rap1 motif (“YES”). For each region, the binding site is conserved across five sensu stricto yeast strains either to within two standard deviations (“2 SD”) or 100% identical at each position (“Exact”) as described in Methods. “*” indicates Rap1 ChIP-chip data from Lee et al., and “#” indicates Rap1 ChIP-chip data from Lieb et al..

Figure 2

Figure 2. Identifying the specifically bound spots

(a) Distribution of Rap1 PBM ratio (PBM / SybrGreen I) data. The arrow indicates those spots passing a P value threshold of 0.001 after correction for multiple hypothesis testing. Indicated in red are spots harboring an exact match to a sequence belonging to our discovered Rap1 binding site motif. (b) Zoom-in on intergenic regions, from both PBMs (left) and SybrGreen I stained microarrays (right), upstream of RPL14A, RPL8A, and OPI3, which are known direct targets of Rap1. The fluorescence intensities of the spots are shown in false-color, color-coded as described previously (Fig. 1 legend). PBM P values are corrected for multiple hypotheses. Determination of binding in ChIP-chip experiments (“YES” or “NO”) is described in Methods. All regions shown have an exact match to a sequence belonging to the discovered Rap1 motif (“YES”). For each region, the binding site is conserved across five sensu stricto yeast strains either to within two standard deviations (“2 SD”) or 100% identical at each position (“Exact”) as described in Methods. “*” indicates Rap1 ChIP-chip data from Lee et al., and “#” indicates Rap1 ChIP-chip data from Lieb et al..

Figure 3

Figure 3. DNA binding site motifs as determined by PBMs compared to motifs derived from ChIP-chip data and from TRANSFAC

Sequence logos were generated essentially as described previously. Group specificity scores were calculated as described in Methods. “*” indicates Rap1, Abf1, and Mig1 ChIP-chip data from Lee et al., and “#” indicates Rap1 ChIP-chip data from Lieb et al.. Although the Mig1 binding site motif derived from the ChIP-chip data has a statistically significant group specificity score, it is not a match to either the TRANSFAC or PBM Mig1 motif. The Pearson correlation coefficients comparing the PBM versus ChIP-chip motifs, as well as those comparing each of these motifs versus the motifs present in the TRANSFAC database, were as follows: Rap1 PBM versus Lee et al. ChIP-chip: 0.992; Rap1 PBM versus Lieb et al. ChIP-chip: 0.995; Rap1 PBM versus TRANSFAC: 0.953; Rap1 Lee et al. versus Lieb et al. ChIP-chip: 0.985; Rap1 Lee et al. ChIP-chip versus TRANSFAC: 0.921; Rap1 Lieb et al. ChIP-chip versus TRANSFAC: 0.950; Abf1 PBM versus ChIP-chip: 0.989; Abf1 PBM versus TRANSFAC: 0.978; Abf1 ChIP-chip versus TRANSFAC: 0.986; Mig1 PBM versus ChIP-chip: 0.453; Mig1 PBM versus TRANSFAC: 0.938; Mig1 ChIP-chip versus TRANSFAC: 0.406.

Figure 4

Figure 4. EMSAs of PBM-derived Rap1 binding site sequences

(a) Rap1 binding site sequences present within the DNA probes corresponding to portions of the intergenic regions iYLL051C (P = 3.20 × 10-16) and iYPL221W (P = 3.91 x 10-21), aligned against the TRANSFAC and PBM-derived Rap1 binding site sequence logos. (b) Lanes 1 and 2, positive control DNA probe; lanes 3 and 4, negative control DNA probe; lanes 5 and 6, DNA probe corresponding to the best Rap1 binding site sequence that could be identified within the iYLL051C intergenic region; lanes 7 and 8, DNA probe corresponding to the PBM-derived Rap1 binding site sequence within the iYPL221W intergenic region. “-” indicates Rap1 protein was not present in the binding reaction; “+” indicates Rap1 protein was present the binding reaction.

Figure 4

Figure 4. EMSAs of PBM-derived Rap1 binding site sequences

(a) Rap1 binding site sequences present within the DNA probes corresponding to portions of the intergenic regions iYLL051C (P = 3.20 × 10-16) and iYPL221W (P = 3.91 x 10-21), aligned against the TRANSFAC and PBM-derived Rap1 binding site sequence logos. (b) Lanes 1 and 2, positive control DNA probe; lanes 3 and 4, negative control DNA probe; lanes 5 and 6, DNA probe corresponding to the best Rap1 binding site sequence that could be identified within the iYLL051C intergenic region; lanes 7 and 8, DNA probe corresponding to the PBM-derived Rap1 binding site sequence within the iYPL221W intergenic region. “-” indicates Rap1 protein was not present in the binding reaction; “+” indicates Rap1 protein was present the binding reaction.

Figure 5

Figure 5. Comparison of bound intergenic regions derived from PBM data as compared to those derived from ChIP-chip,

Venn diagrams depicting the results of the comparison for (a) Rap1, (b) Abf1, and (c) Mig1. The Venn diagrams depict data only for those intergenic regions for which data were available for both ChIP-chip and PBMs. “*” indicates Rap1, Abf1, and Mig1 ChIP-chip data from Lee et al., and “#” indicates Rap1 ChIP-chip data from Lieb et al..

Figure 5

Figure 5. Comparison of bound intergenic regions derived from PBM data as compared to those derived from ChIP-chip,

Venn diagrams depicting the results of the comparison for (a) Rap1, (b) Abf1, and (c) Mig1. The Venn diagrams depict data only for those intergenic regions for which data were available for both ChIP-chip and PBMs. “*” indicates Rap1, Abf1, and Mig1 ChIP-chip data from Lee et al., and “#” indicates Rap1 ChIP-chip data from Lieb et al..

Figure 5

Figure 5. Comparison of bound intergenic regions derived from PBM data as compared to those derived from ChIP-chip,

Venn diagrams depicting the results of the comparison for (a) Rap1, (b) Abf1, and (c) Mig1. The Venn diagrams depict data only for those intergenic regions for which data were available for both ChIP-chip and PBMs. “*” indicates Rap1, Abf1, and Mig1 ChIP-chip data from Lee et al., and “#” indicates Rap1 ChIP-chip data from Lieb et al..

Figure 6

Figure 6. Cross-species sequence conservation of binding sites identified from PBM data as compared to those identified from ChIP-chip data

From left to right for a single TF, bars represent all spots bound in PBMs, all spots bound in ChIP-chip, and spots bound in PBMs and not ChIP-chip. Spots were called `bound' as described in Methods. Shown in dark gray is the subset of bound spots with S. cerevisiae binding sites conserved to within two standard deviations of the motif average across all five sensu stricto species. Shown in black is the subset of S. cerevisiae bound spots with conserved sites 100% identical across all five species. The remaining bound spots are in light gray.

Similar articles

Cited by

References

    1. Schena M, Shalon D, Davis RW, Brown PO. Quantitative monitoring of gene expression patterns with a complementary DNA microarray. Science. 1995;270:467–470. - PubMed
    1. Wodicka L, Dong H, Mittmann M, Ho MH, Lockhart DJ. Genome-wide expression monitoring in Saccharomyces cerevisiae. Nat. Biotechnol. 1997;15:1359–1367. - PubMed
    1. Ren B, et al. Genome-wide location and function of DNA binding proteins. Science. 2000;290:2306–2309. - PubMed
    1. Iyer VR, et al. Genomic binding sites of the yeast cell-cycle transcription factors SBF and MBF. Nature. 2001;409:533–538. - PubMed
    1. Lieb JD, Liu X, Botstein D, Brown PO. Promoter-specific binding of Rap1 revealed by genome-wide maps of protein-DNA association. Nat. Genet. 2001;28:327–334. - PubMed

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources