Compact, universal DNA microarrays to comprehensively determine transcription-factor binding site specificities (original) (raw)
- Letter
- Published: 24 September 2006
- Anthony A Philippakis1,2,3 na1,
- Aaron M Qureshi1,4,
- Fangxue S He1,3,
- Preston W Estep III5 &
- …
- Martha L Bulyk1,2,3,6,1
Nature Biotechnology volume 24, pages 1429–1435 (2006)Cite this article
- 6133 Accesses
- 516 Citations
- 22 Altmetric
- Metrics details
Abstract
Transcription factors (TFs) interact with specific DNA regulatory sequences to control gene expression throughout myriad cellular processes. However, the DNA binding specificities of only a small fraction of TFs are sufficiently characterized to predict the sequences that they can and cannot bind. We present a maximally compact, synthetic DNA sequence design for protein binding microarray (PBM) experiments1 that represents all possible DNA sequence variants of a given length k (that is, all '_k_-mers') on a single, universal microarray. We constructed such all _k_-mer microarrays covering all 10–base pair (bp) binding sites by converting high-density single-stranded oligonucleotide arrays to double-stranded (ds) DNA arrays. Using these microarrays we comprehensively determined the binding specificities over a full range of affinities for five TFs of different structural classes from yeast, worm, mouse and human. The unbiased coverage of all _k_-mers permits high-throughput interrogation of binding site preferences, including nucleotide interdependencies, at unprecedented resolution.
This is a preview of subscription content, access via your institution
Access options
Subscribe to this journal
Receive 12 print issues and online access
$209.00 per year
only $17.42 per issue
Buy this article
- Purchase on SpringerLink
- Instant access to full article PDF
Prices may be subject to local taxes which are calculated during checkout
Additional access options:
Similar content being viewed by others
References
- Mukherjee, S. et al. Rapid analysis of the DNA-binding specificities of transcription factors with DNA microarrays. Nat. Genet. 36, 1331–1339 (2004).
Article CAS Google Scholar - Bulyk, M.L., Huang, X., Choo, Y. & Church, G.M. Exploring the DNA-binding specificities of zinc fingers with DNA microarrays. Proc. Natl. Acad. Sci. USA 98, 7158–7163 (2001).
Article CAS Google Scholar - Berger, M.F. & Bulyk, M.L. Protein binding microarrays (PBMs) for rapid, high-throughput characterization of the sequence specificities of DNA-binding proteins. Methods Mol. Biol. 338, 245–260 (2006).
CAS Google Scholar - Golomb, S. . Shift Register Sequences (Aegean Park Press, Laguna Hills, California, USA, 1967).
- Kwan, A.H., Czolij, R., Mackay, J.P. & Crossley, M. Pentaprobe: a comprehensive sequence for the one-step detection of DNA-binding activities. Nucleic Acids Res. 31, e124 (2003).
Article Google Scholar - Linnell, J. et al. Quantitative high-throughput analysis of transcription factor binding specificities. Nucleic Acids Res. 32, e44 (2004).
Article Google Scholar - Oliphant, A.R., Brandl, C.J. & Struhl, K. Defining the sequence specificity of DNA-binding proteins by selecting binding sites from random-sequence oligonucleotides: analysis of yeast GCN4 protein. Mol. Cell. Biol. 9, 2944–2949 (1989).
Article CAS Google Scholar - Sandelin, A., Alkema, W., Engstrom, P., Wasserman, W.W. & Lenhard, B. JASPAR: an open-access database for eukaryotic transcription factor binding profiles. Nucleic Acids Res. 32, D91–D94 (2004).
Article CAS Google Scholar - Liu, J. & Stormo, G.D. Quantitative analysis of EGR proteins binding to DNA: assessing additivity in both the binding site and the protein. BMC Bioinformatics 6, 176 (2005).
Article Google Scholar - Miller, J.C. & Pabo, C.O. Rearrangement of side-chains in a Zif268 mutant highlights the complexities of zinc finger–DNA recognition. J. Mol. Biol. 313, 309–315 (2001).
Article CAS Google Scholar - Bjerve, S. Error bounds for linear combinations of order statistics. Ann. Stat. 5, 357–369 (1977).
Article Google Scholar - Lieb, J.D., Liu, X., Botstein, D. & Brown, P.O. Promoter-specific binding of Rap1 revealed by genome-wide maps of protein-DNA association. Nat. Genet. 28, 327–334 (2001).
Article CAS Google Scholar - Harbison, C.T. et al. Transcriptional regulatory code of a eukaryotic genome. Nature 431, 99–104 (2004).
Article CAS Google Scholar - Berg, O.G. & von Hippel, P.H. Selection of DNA binding sites by regulatory proteins. Statistical-mechanical theory and application to operators and promoters. J. Mol. Biol. 193, 723–750 (1987).
Article CAS Google Scholar - Bulyk, M.L., Johnson, P.L. & Church, G.M. Nucleotides of transcription factor binding sites exert interdependent effects on the binding affinities of transcription factors. Nucleic Acids Res. 30, 1255–1261 (2002).
Article CAS Google Scholar - Jiang, J. & Levine, M. Binding affinities and cooperative interactions with bHLH activators delimit threshold responses to the dorsal gradient morphogen. Cell 72, 741–752 (1993).
Article CAS Google Scholar - Gaudet, J. & Mango, S.E. Regulation of organogenesis by the Caenorhabditis elegans FoxA protein PHA-4. Science 295, 821–825 (2002).
Article CAS Google Scholar - Tanay, A. Extensive low-affinity transcriptional interactions in the yeast genome. Genome Res. 16, 962–972 (2006).
Article CAS Google Scholar - Warren, C.L. et al. Defining the sequence-recognition profile of DNA-binding molecules. Proc. Natl. Acad. Sci. USA 103, 867–872 (2006).
Article CAS Google Scholar - Singh-Gasson, S. et al. Maskless fabrication of light-directed oligonucleotide microarrays using a digital micromirror array. Nat. Biotechnol. 17, 974–978 (1999).
Article CAS Google Scholar - Blancafort, P., Segal, D.J. & Barbas, C.F., III. Designing transcription factor architectures for drug discovery. Mol. Pharmacol. 66, 1361–1371 (2004).
Article CAS Google Scholar - Philippakis, A.A. et al. Expression-guided in silico evaluation of candidate cis regulatory codes for Drosophila muscle founder cells. PLoS Comput. Biol. 2, e53 (2006).
Article Google Scholar - Braun, P. et al. Proteome-scale purification of human proteins from bacteria. Proc. Natl. Acad. Sci. USA 99, 2654–2659 (2002).
Article CAS Google Scholar - Li, M.Z. & Elledge, S.J. MAGIC, an in vivo genetic method for the rapid construction of recombinant DNA molecules. Nat. Genet. 37, 311–319 (2005).
Article CAS Google Scholar - Dudley, A.M., Aach, J., Steffen, M.A. & Church, G.M. Measuring absolute expression with microarrays with a calibrated reference sample and an extended signal intensity range. Proc. Natl. Acad. Sci. USA 99, 7554–7559 (2002).
Article CAS Google Scholar - Morton, T.A. & Myszka, D.G. Kinetic analysis of macromolecular interactions using surface plasmon resonance biosensors. Methods Enzymol. 295, 268–294 (1998).
Article CAS Google Scholar - Wilmen, A., Pick, H., Niedenthal, R.K., Sen-Gupta, M. & Hegemann, J.H. The yeast centromere CDEI/Cpf1 complex: differences between in vitro binding and in vivo function. Nucleic Acids Res. 22, 2791–2800 (1994).
Article CAS Google Scholar - Christy, B. & Nathans, D. DNA binding site of the growth factor–inducible protein Zif268. Proc. Natl. Acad. Sci. USA 86, 8737–8741 (1989).
Article CAS Google Scholar - Okkema, P.G. & Fire, A. The Caenorhabditis elegans NK-2 class homeoprotein CEH-22 is involved in combinatorial activation of gene expression in pharyngeal muscle. Development 120, 2175–2186 (1994).
CAS Google Scholar - Klemm, J.D., Rould, M.A., Aurora, R., Herr, W. & Pabo, C.O. Crystal structure of the Oct-1 POU domain bound to an octamer site: DNA recognition with tethered DNA-binding modules. Cell 77, 21–32 (1994).
Article CAS Google Scholar
Acknowledgements
We thank T.V.S. Murthy, Leo Brizuela and Josh LaBaer for providing the Cbf1 and Rap1 clones, Gwenael Badis-Breard and Tim Hughes for providing the Zif268 DNA-binding domain clone and Shufen Meng for assistance with the Biacore technology. We also thank Stephen Gisselbrecht, Amy Donner and Rachel McCord for critical reading of the manuscript. This work was funded in part by grants R01 HG003985 and R01 HG003420 from National Institutes of Health/National Human Genome Research Institute to M.L.B. M.F.B. was supported in part by a National Science Foundation Graduate Research Fellowship. A.A.P. was supported in part by a National Defense Science and Engineering Graduate Fellowship, a National Science Foundation Graduate Research Fellowship and an Athinoula Martinos Fellowship.
Author information
Author notes
- Michael F Berger and Anthony A Philippakis: These authors contributed equally to this work.
Authors and Affiliations
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, 02115, Massachusetts, USA
Michael F Berger, Anthony A Philippakis, Aaron M Qureshi, Fangxue S He, Martha L Bulyk & Martha L Bulyk - Harvard University Graduate Biophysics Program, Cambridge, 02138, Massachusetts, USA
Michael F Berger, Anthony A Philippakis & Martha L Bulyk - Harvard/MIT Division of Health Sciences and Technology (HST), Harvard Medical School, Boston, 02115, Massachusetts, USA
Anthony A Philippakis, Fangxue S He & Martha L Bulyk - Department of Mathematics, University of Maryland, College Park, Maryland, 20742, USA
Aaron M Qureshi - Longenity, Inc., Waltham, 02451, Massachusetts, USA
Preston W Estep III - Department of Pathology, Brigham and Women's Hospital and Harvard Medical School, Boston, 02115, Massachusetts, USA
Martha L Bulyk
Authors
- Michael F Berger
You can also search for this author inPubMed Google Scholar - Anthony A Philippakis
You can also search for this author inPubMed Google Scholar - Aaron M Qureshi
You can also search for this author inPubMed Google Scholar - Fangxue S He
You can also search for this author inPubMed Google Scholar - Preston W Estep III
You can also search for this author inPubMed Google Scholar - Martha L Bulyk
You can also search for this author inPubMed Google Scholar
Contributions
M.F.B. participated in the array design, experimental design, analysis of results and drafting of the manuscript, and performed the experiments; A.A.P. conceived the idea of using de Bruijn sequences and participated in the binding site survey, array design, experimental design, analysis of results and drafting of the manuscript; A.M.Q. provided linear feedback shift register expertise and assisted in the array design; F.S.H. participated in the binding site survey; P.W.E. III conceived the concept of the compact universal array; M.L.B. conceived the concept of the compact universal array and participated in the array design, experimental design, analysis of the results and drafting of the manuscript.
Corresponding author
Correspondence toMartha L Bulyk.
Ethics declarations
Competing interests
A subset of the authors—M.L.B., A.A.P and P.W.E. 3rd—have filed a patent application, through Brigham and Women's Hospital, covering the sequence design described in this paper.
Supplementary information
Rights and permissions
About this article
Cite this article
Berger, M., Philippakis, A., Qureshi, A. et al. Compact, universal DNA microarrays to comprehensively determine transcription-factor binding site specificities.Nat Biotechnol 24, 1429–1435 (2006). https://doi.org/10.1038/nbt1246
- Received: 06 June 2006
- Accepted: 28 July 2006
- Published: 24 September 2006
- Issue Date: 01 November 2006
- DOI: https://doi.org/10.1038/nbt1246