Deciphering a transcriptional regulatory code: modeling short-range repression in the Drosophila embryo - PubMed (original) (raw)
Deciphering a transcriptional regulatory code: modeling short-range repression in the Drosophila embryo
Walid D Fakhouri et al. Mol Syst Biol. 2010.
Abstract
Systems biology seeks a genomic-level interpretation of transcriptional regulatory information represented by patterns of protein-binding sites. Obtaining this information without direct experimentation is challenging; minor alterations in binding sites can have profound effects on gene expression, and underlie important aspects of disease and evolution. Quantitative modeling offers an alternative path to develop a global understanding of the transcriptional regulatory code. Recent studies have focused on endogenous regulatory sequences; however, distinct enhancers differ in many features, making it difficult to generalize to other cis-regulatory elements. We applied a systematic approach to simpler elements and present here the first quantitative analysis of short-range transcriptional repressors, which have central functions in metazoan development. Our fractional occupancy-based modeling uncovered unexpected features of these proteins' activity that allow accurate predictions of regulation by the Giant, Knirps, Krüppel, and Snail repressors, including modeling of an endogenous enhancer. This study provides essential elements of a transcriptional regulatory code that will allow extensive analysis of genomic information in Drosophila melanogaster and related organisms.
Conflict of interest statement
The authors declare that they have no conflict of interest.
Figures
Figure 1
Transformation of DNA sequence and protein information by gene modeling. (A) An enhancer with three repressors (red squares) and four activators (green circles) is modeled, to generate the gene expression surface shown in (B). The axes represent normalized activator, repressor, and gene activity levels. (C) A Drosophila embryo with Giant repressor (red stripes) and Dorsal activator (green) staining is shown. Each embryo provides a diversity of potential inputs to the regulatory element: the white arrow points to a region in which activator levels are high and repressor levels are low. The black arrow points to a region in which both activator and repressor levels are low. The white triangle points to a region in which activator and repressor levels are both high, and the black triangle points to a region in which repressor levels are high and activator levels are low. (D) Output of regulatory element shown in (A), which mirrors values from (C) being mapped through surface shown in (B). (E) Formal scheme of data collection, analysis, and modeling.
Figure 2
Structures of genes assayed to determine context dependence of short-range repressor activity, and representative in situ images showing lacZ activity. Mid blastoderm embryos are oriented dorsal up, anterior to the left. Genes 1–8 test activator–repressor spacing, 9–10 and 16–18 activator–repressor stoichiometry and spacing, 12–15 and 19 arrangement and promoter proximity, 11 and 20 activator number and affinity, and 21–27 alternative short-range repressors.
Figure 3
Representative [_lacZ_] versus [Gt] plots. (A) Structures of three genes assayed (1, 9, and 10). (B, C) Representative embryos imaged for Giant protein and lacZ reporter gene activity. (D) The data from multiple confocal embryo images was processed and compiled to provide normalized reporter gene [_lacZ_] versus normalized repressor [Gt].
Figure 4
Parameters found by the ES parameter estimation technique for scheme 2 of the model. (A) Root mean square error, E, is shown on the left, with corresponding scale shown on the left axis. Repressor-scaling factor R (referred to as _S_R in fractional occupancy model in Materials and methods) and cooperativity C are shown in the central and right portions, respectively, with scale shown on the right axis. (B) Quenching efficiency parameters are shown for increasing distances of repressors located 5′ of the activators on the left. Quenching efficiency levels relative to Twist proximal (T) sites and Dorsal proximal (D) sites are shown in the right panel. A non-monotonic decrease in quenching efficiency for increasing distances is observed.
Figure 5
Parameters for scheme 2 with the constraint that quenching efficiency parameters decrease monotonically. (A) Root mean square error E, repressor-scaling factor R, and cooperativity C labeled as in Figure 4. (B) Quenching efficiency parameters and relative quenching of Dorsal and Twist sites. Under this constraint, the level of quenching efficiency changes very little from 28 to 66 bp, in contrast to observed trends (Figure 2).
Figure 6
Parameters for scheme 2 with cooperativity parameters set to different levels. (A, B) Parameters found in our study (circles) and parameters found by constraint of cooperativity parameters to those from Segal et al (2008) (diamonds). The increased cooperativity value is compensated by a decreased repressor-scaling factor R. (C) Root mean square errors (RMSE) for cooperativity parameters (constrained to values between 0 and 30). Estimated cooperativity values from our model lie near the lowest point in this curve.
Figure 7
Validation of modeling by prediction of subsets of the data from parameters derived from the remainder of the data. (A) Leave-one-out analysis. Root mean square errors are calculated using parameters found by 11 genes excepting the genes indicated, and all the genes. Relative RMSE ratios, indicating greater errors for prediction of genes 2, 9 and, 16, indicating their greater contribution to the parameter constraints. (B) Leave-sets-out analysis for nine distinct sets of genes defined by their shared properties (Table II). Root mean square errors are calculated using parameters found from the reduced set and the entire set. Relative RMSE ratios, indicating greater errors for prediction of sets 1, 2, and 4, indicating their greater contribution to the parameter constraints. (C) Predictions for leaving out set 8. Genes 1, 10, and 12 are predicted by using parameters found from other 9 genes. Points represent average values for [_lacZ_] versus [Gt] data, which was divided into 20 bins. (D) Parameter estimation results are shown for different amounts of data 50, 75, and 100%. The data is cut randomly from each gene at the same percentage.
Figure 8
Extension of the model to endogenous regulatory elements. (A) The rhomboid gene is expressed in the blastoderm embryo in two lateral stripes (one shown in focal plane), under control of the Dorsal and Twist activators. Ventral expression is inhibited by the Snail short-range repressor, which is expressed in the presumptive mesoderm. The _cis_-regulatory modules used for analysis are shown. Different forms of rhomboid NEE enhancer are depicted, with varying number and arrangements of Snail short-range repressor-binding sites. Dorsal and Twist activators are shown by large and small green circles, respectively, and Snail repressors are shown by red squares. On the right are the predicted repression levels caused by Snail-binding sites shown in each module based on parameter estimation using this group of enhancers. (B) Predicted parameters for scaling factors for each transcription factor and cooperativity. Average and standard deviation for 20 estimation runs are shown.
Similar articles
- Transcriptional repression by the Drosophila giant protein: cis element positioning provides an alternative means of interpreting an effector gradient.
Hewitt GF, Strunk BS, Margulies C, Priputin T, Wang XD, Amey R, Pabst BA, Kosman D, Reinitz J, Arnosti DN. Hewitt GF, et al. Development. 1999 Mar;126(6):1201-10. doi: 10.1242/dev.126.6.1201. Development. 1999. PMID: 10021339 - Short-range transcriptional repressors mediate both quenching and direct repression within complex loci in Drosophila.
Gray S, Levine M. Gray S, et al. Genes Dev. 1996 Mar 15;10(6):700-10. doi: 10.1101/gad.10.6.700. Genes Dev. 1996. PMID: 8598297 - Groucho corepressor functions as a cofactor for the Knirps short-range transcriptional repressor.
Payankaulam S, Arnosti DN. Payankaulam S, et al. Proc Natl Acad Sci U S A. 2009 Oct 13;106(41):17314-9. doi: 10.1073/pnas.0904507106. Epub 2009 Sep 28. Proc Natl Acad Sci U S A. 2009. PMID: 19805071 Free PMC article. - Logic functions of the genomic cis-regulatory code.
Istrail S, Davidson EH. Istrail S, et al. Proc Natl Acad Sci U S A. 2005 Apr 5;102(14):4954-9. doi: 10.1073/pnas.0409624102. Epub 2005 Mar 23. Proc Natl Acad Sci U S A. 2005. PMID: 15788531 Free PMC article. Review. - One thousand and one ways of making functionally similar transcriptional enhancers.
Veitia RA. Veitia RA. Bioessays. 2008 Nov;30(11-12):1052-7. doi: 10.1002/bies.20849. Bioessays. 2008. PMID: 18937349 Review.
Cited by
- Shadow enhancers modulate distinct transcriptional parameters that differentially effect downstream patterning events.
Whitney PH, Shrestha B, Xiong J, Zhang T, Rushlow CA. Whitney PH, et al. Development. 2022 Nov 1;149(21):dev200940. doi: 10.1242/dev.200940. Epub 2022 Nov 1. Development. 2022. PMID: 36264246 Free PMC article. - ChIP-nexus enables improved detection of in vivo transcription factor binding footprints.
He Q, Johnston J, Zeitlinger J. He Q, et al. Nat Biotechnol. 2015 Apr;33(4):395-401. doi: 10.1038/nbt.3121. Epub 2015 Mar 9. Nat Biotechnol. 2015. PMID: 25751057 Free PMC article. - Quantitative modeling of gene expression using DNA shape features of binding sites.
Peng PC, Sinha S. Peng PC, et al. Nucleic Acids Res. 2016 Jul 27;44(13):e120. doi: 10.1093/nar/gkw446. Epub 2016 Jun 1. Nucleic Acids Res. 2016. PMID: 27257066 Free PMC article. - Versatility of cooperative transcriptional activation: a thermodynamical modeling analysis for greater-than-additive and less-than-additive effects.
Frank TD, Carmody AM, Kholodenko BN. Frank TD, et al. PLoS One. 2012;7(4):e34439. doi: 10.1371/journal.pone.0034439. Epub 2012 Apr 10. PLoS One. 2012. PMID: 22506020 Free PMC article. - Synthetic Developmental Biology: Understanding Through Reconstitution.
Schlissel G, Li P. Schlissel G, et al. Annu Rev Cell Dev Biol. 2020 Oct 6;36:339-357. doi: 10.1146/annurev-cellbio-020620-090650. Annu Rev Cell Dev Biol. 2020. PMID: 33021822 Free PMC article. Review.
References
- Arnosti DN, Barolo S, Levine M, Small S (1996b) The eve stripe 2 enhancer employs multiple modes of transcriptional synergy. Development 122: 205–214 - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases
Research Materials