A bacterial one-hybrid system for determining the DNA-binding specificity of transcription factors - PubMed (original) (raw)

A bacterial one-hybrid system for determining the DNA-binding specificity of transcription factors

Xiangdong Meng et al. Nat Biotechnol. 2005 Aug.

Abstract

The DNA-binding specificities of transcription factors can be used to computationally predict cis-regulatory modules (CRMs) that regulate gene expression. However, the absence of specificity data for the majority of transcription factors limits the widespread implementation of this approach. We have developed a bacterial one-hybrid system that provides a simple and rapid method to determine the DNA-binding specificity of a transcription factor. Using this technology, we successfully determined the DNA-binding specificity of seven previously characterized transcription factors and one novel transcription factor, the Drosophila melanogaster factor Odd-skipped. Regulatory targets of Odd-skipped were successfully predicted using this information, demonstrating that the data produced by the bacterial one-hybrid system are relevant to in vivo function.

PubMed Disclaimer

Figures

Figure 1

Figure 1

Overview of the bacterial one-hybrid system. a) Schematic representation of the HIS3/URA3 cistron in the pH3U3 prey vector. If the DNA-binding domain (DBD) of the bait recognizes a sequence in the randomized region, the fusion to the alpha subunit will recruit RNA polymerase, to the weak lac promoter and activate transcription of HIS3 and URA3. b) Schematic outline of the B1H selection procedure. The prey library and bait are introduced into the bacterial selection strain. These cells are plated on NM selective media containing the desired concentration of 3-AT to select for bait-prey combinations that activate the reporter. Subsequent steps depend on the prey library (either the original or purified version) that was used for the selection. If the original library is used then prey from the colonies that grow under the selective conditions are isolated and reintroduced into the selection strain in the absence of the bait. These cells are challenged to survive stringent counterselection (5-FOA) to remove any self-activating clones that represent false positives. DNA is isolated from individual colonies that grow under these conditions and the randomized region of each prey is sequenced. If the purified library was used for the selection, then DNA can be isolated and sequenced from colonies on the positive selection plate since the majority of self-activating prey have already been eliminated. The unique DNA sequences recovered from the selection are analyzed by MEME to identify any overrepresented sequence motifs, which should represent the recognition sequence of the bait. c) A purified library can be constructed to simplify the selection of bait-prey combinations that activate the reporter genes. The original prey library is introduced into the selection strain and cells are challenged to survive stringent counterselection conditions (5-FOA) to remove any self-activating clones that represent false positives. Prey vectors from the surviving colonies are isolated as a pool to generate the purified prey library for use with any desired bait.

Figure 1

Figure 1

Overview of the bacterial one-hybrid system. a) Schematic representation of the HIS3/URA3 cistron in the pH3U3 prey vector. If the DNA-binding domain (DBD) of the bait recognizes a sequence in the randomized region, the fusion to the alpha subunit will recruit RNA polymerase, to the weak lac promoter and activate transcription of HIS3 and URA3. b) Schematic outline of the B1H selection procedure. The prey library and bait are introduced into the bacterial selection strain. These cells are plated on NM selective media containing the desired concentration of 3-AT to select for bait-prey combinations that activate the reporter. Subsequent steps depend on the prey library (either the original or purified version) that was used for the selection. If the original library is used then prey from the colonies that grow under the selective conditions are isolated and reintroduced into the selection strain in the absence of the bait. These cells are challenged to survive stringent counterselection (5-FOA) to remove any self-activating clones that represent false positives. DNA is isolated from individual colonies that grow under these conditions and the randomized region of each prey is sequenced. If the purified library was used for the selection, then DNA can be isolated and sequenced from colonies on the positive selection plate since the majority of self-activating prey have already been eliminated. The unique DNA sequences recovered from the selection are analyzed by MEME to identify any overrepresented sequence motifs, which should represent the recognition sequence of the bait. c) A purified library can be constructed to simplify the selection of bait-prey combinations that activate the reporter genes. The original prey library is introduced into the selection strain and cells are challenged to survive stringent counterselection conditions (5-FOA) to remove any self-activating clones that represent false positives. Prey vectors from the surviving colonies are isolated as a pool to generate the purified prey library for use with any desired bait.

Figure 1

Figure 1

Overview of the bacterial one-hybrid system. a) Schematic representation of the HIS3/URA3 cistron in the pH3U3 prey vector. If the DNA-binding domain (DBD) of the bait recognizes a sequence in the randomized region, the fusion to the alpha subunit will recruit RNA polymerase, to the weak lac promoter and activate transcription of HIS3 and URA3. b) Schematic outline of the B1H selection procedure. The prey library and bait are introduced into the bacterial selection strain. These cells are plated on NM selective media containing the desired concentration of 3-AT to select for bait-prey combinations that activate the reporter. Subsequent steps depend on the prey library (either the original or purified version) that was used for the selection. If the original library is used then prey from the colonies that grow under the selective conditions are isolated and reintroduced into the selection strain in the absence of the bait. These cells are challenged to survive stringent counterselection (5-FOA) to remove any self-activating clones that represent false positives. DNA is isolated from individual colonies that grow under these conditions and the randomized region of each prey is sequenced. If the purified library was used for the selection, then DNA can be isolated and sequenced from colonies on the positive selection plate since the majority of self-activating prey have already been eliminated. The unique DNA sequences recovered from the selection are analyzed by MEME to identify any overrepresented sequence motifs, which should represent the recognition sequence of the bait. c) A purified library can be constructed to simplify the selection of bait-prey combinations that activate the reporter genes. The original prey library is introduced into the selection strain and cells are challenged to survive stringent counterselection conditions (5-FOA) to remove any self-activating clones that represent false positives. Prey vectors from the surviving colonies are isolated as a pool to generate the purified prey library for use with any desired bait.

Figure 2

Figure 2

Binding site motifs for seven proteins determined using the B1H system. The binding site motif determined using each bait in the B1H system is displayed as a Sequence logo. The maximum possible information content at each position is 2 bits. The Zif268, PLAG-1 and Runt/Bgb binding motifs were generated using the two-step selection method, whereas motifs for the other factors were generated using the purified library in a single selection step. The previously reported consensus sequence for each factor as determined by SELEX is displayed or listed below each Logo. a) Comparison of the binding site motifs produced for Zif268 by the B1H system and from a previously reported in vitro SELEX experiment. b) PLAG1 can tolerate a 0 to 4 base pair gap between the two recognition motifs based on analysis of the raw sequences by Bioprospector. A one base pair gap (position 8) is shown in its motif, but a 2 base pair gap was most prevalent in the isolated sequences (7 of 18). The PLAG1 motif is consistent with the previously described PLAG1 consensus sequence (GG(g/a)GGCCNNNNNN(g/a)GG(g/t)) determined by SELEX. The most 5' base identified in the SELEX analysis could not be conclusively defined in the B1H data because of overlap of the majority of sequences with the edge of the constant region abutting the library. c) The ZnFp53 motif is consistent with the previously described consensus sequence ((g/a)GACACGT) determined by SELEX for a nearly identical clone. d) The Dorsal motif is consistent with the previously described consensus sequence (GGG(a/t)(a/t)(t/a)(t/c/a)C(c/t)) determined by SELEX. e) The LAG-1 motif is consistent with the previously described consensus sequence (A(c/g)CGTGGGAA(a/c)) for the mouse homolog of LAG-1 (RBP-Jκ) determined by SELEX. f) The Paired motif contains at its core a sequence similar to the consensus sequence ((t/c)CGTCACG(g/c)TT(g/c)) determined by SELEX for the paired domain in the absence of the homeodomain. SELEX selections reported for both DNA-binding domains of Paired resulted a complex mixture of recognition sequences, a subset contained a core homeodomain binding site abutting the 5' end of the paired domain binding site (

AATTA

GTCACGC; where the homeodomain element is underlined), which is similar to the 5' end of our motif. g) The Runt/Bgb motif is consistent with the previously described consensus sequence ((t/c)G(t/c)GGT(t/c)) for CBFα/β determined by SELEX. Raw sequences of the prey from each selection are listed in Supplementary Table 2.

Figure 3

Figure 3

Analysis of the DNA-binding specificity of Odd determined using the B1H system. a) The binding site motifs obtained at 1.5, 2.5 and 5 mM 3-AT were each compiled from more than 20 sequences that contained an overrepresented sequence motif identified by MEME. The tolerance of Odd for both A and T at position 4 (boxed) in its binding site becomes apparent at lower selection stringencies. Raw sequences of the prey from each selection are listed in Supplementary Table 2. b) Effect of point mutations on the recognition by Odd of its consensus sequence. i) Gel shift assay examining the effect of different DNA competitors on formation of a complex between Odd and its labeled consensus binding site. The presence of Odd and the type of cold competitor in each binding reaction is indicated above each lane of the gel. wt = contains the consensus Odd sequence GCTACTGTA. The other competitors have mutations at each of the first five positions of the consensus sequence where the number represents the position and the letter represents the substitution. For example, 1c = cCTACTGTA. ii) Growth rates for bacteria assayed at 2 mM 3-AT containing the Odd bait and either the wild type or a mutant prey corresponding to DNA competitors used in the gel shift analysis. Growth rates were defined for each bait-prey combination based on serial dilutions of cells harboring these vectors on plates containing 3-AT (Supplementary Fig. 3). The growth rates for cells containing the various bait-prey combinations are qualitatively similar to the degree of competition observed in the gel shift experiments.

Figure 3

Figure 3

Analysis of the DNA-binding specificity of Odd determined using the B1H system. a) The binding site motifs obtained at 1.5, 2.5 and 5 mM 3-AT were each compiled from more than 20 sequences that contained an overrepresented sequence motif identified by MEME. The tolerance of Odd for both A and T at position 4 (boxed) in its binding site becomes apparent at lower selection stringencies. Raw sequences of the prey from each selection are listed in Supplementary Table 2. b) Effect of point mutations on the recognition by Odd of its consensus sequence. i) Gel shift assay examining the effect of different DNA competitors on formation of a complex between Odd and its labeled consensus binding site. The presence of Odd and the type of cold competitor in each binding reaction is indicated above each lane of the gel. wt = contains the consensus Odd sequence GCTACTGTA. The other competitors have mutations at each of the first five positions of the consensus sequence where the number represents the position and the letter represents the substitution. For example, 1c = cCTACTGTA. ii) Growth rates for bacteria assayed at 2 mM 3-AT containing the Odd bait and either the wild type or a mutant prey corresponding to DNA competitors used in the gel shift analysis. Growth rates were defined for each bait-prey combination based on serial dilutions of cells harboring these vectors on plates containing 3-AT (Supplementary Fig. 3). The growth rates for cells containing the various bait-prey combinations are qualitatively similar to the degree of competition observed in the gel shift experiments.

Figure 4

Figure 4

Altered gene expression following ectopic expression of Drosophila Odd. Probes used for RNA in situ hybridization are indicated to the left of each pair of panels. (upper panels) Wild type embryos showing the head and trunk stripes of odd expression at the pair rule and segment polarity stages. (lower panels) Expression of putative targets of the Odd repressor was examined in wild type (left) or heat-shock odd embryos (right) that were fixed 19 minutes following a six minute heat shock. In heat-shock odd embryos: stripe 1 of h disappears (arrow); the even stripes of prd are missing (bracket); the segment polarity stripes of gsb are missing or reduced (bracket); and the head stripe of Gsc is fainter (arrow). For analysis of Gsc, the embryos were co-stained with a probe to Ubx (central stripe), which did not significantly change following ectopic expression of odd. The changes observed in h and prd expression following ectopic expression of odd have been previously described.

Similar articles

Cited by

References

    1. Berman BP, et al. Exploiting transcription factor binding site clustering to identify cis-regulatory modules involved in pattern formation in the Drosophila genome. Proc Natl Acad Sci U S A. 2002;99:757–762. - PMC - PubMed
    1. Roulet E, et al. High-throughput SELEX SAGE method for quantitative modeling of transcription-factor binding sites. Nat Biotechnol. 2002;20:831–835. - PubMed
    1. Bulyk ML, Huang X, Choo Y, Church GM. Exploring the DNA-binding specificities of zinc fingers with DNA microarrays. Proc Natl Acad Sci U S A. 2001;98:7158–7163. - PMC - PubMed
    1. Mukherjee S, et al. Rapid analysis of the DNA-binding specificities of transcription factors with DNA microarrays. Nat Genet. 2004;36:1331–1339. - PMC - PubMed
    1. Lee TI, et al. Transcriptional regulatory networks in Saccharomyces cerevisiae. Science. 2002;298:799–804. - PubMed

Publication types

MeSH terms

Substances

LinkOut - more resources