Harm van Bakel | Union Graduate College-Icahn School of Medicine at Mount Sinai Bioethics Program (original) (raw)

Papers by Harm van Bakel

Research paper thumbnail of Cooperative action of NC2 and Mot1p to regulate TATA-binding protein function across the genome

Promoter recognition by TATA-binding protein (TBP) is an essential step in the initiation of RNA ... more Promoter recognition by TATA-binding protein (TBP) is an essential step in the initiation of RNA polymerase II (pol II) mediated transcription. Genetic and biochemical studies in yeast have shown that Mot1p and NC2 play important roles in inhibiting TBP activity. To understand how TBP activity is regulated in a genome-wide manner, we profiled the binding of TBP, NC2, Mot1p, TFIID,

Research paper thumbnail of BCL11B expression in intramembranous osteogenesis during murine craniofacial suture development

Gene expression patterns : GEP, 2015

Sutures, where neighboring craniofacial bones are separated by undifferentiated mesenchyme, are m... more Sutures, where neighboring craniofacial bones are separated by undifferentiated mesenchyme, are major growth sites during craniofacial development. Pathologic fusion of bones within sutures occurs in a wide variety of craniosynostosis conditions and can result in dysmorphic craniofacial growth and secondary neurologic deficits. Our knowledge of the genes involved in suture formation is poor. Here we describe the novel expression pattern of the BCL11B transcription factor protein during murine embryonic craniofacial bone formation. We examined BCL11B protein expression at E14.5, E16.5, and E18.5 in 14 major craniofacial sutures of C57BL/6J mice. We found BCL11B expression to be associated with all intramembranous craniofacial bones examined. The most striking aspects of BCL11B expression were its high levels in suture mesenchyme and increasingly complementary expression with RUNX2 in differentiating osteoblasts during development. BCL11B was also expressed in mesenchyme at the non-su...

Research paper thumbnail of Senataxin suppresses the antiviral transcriptional response and controls viral biogenesis

Nature Immunology, 2015

The human helicase senataxin (SETX) has been linked to the neurodegenerative diseases amyotrophic... more The human helicase senataxin (SETX) has been linked to the neurodegenerative diseases amyotrophic lateral sclerosis (ALS4) and ataxia with oculomotor apraxia (AOA2). Here we identified a role for SETX in controlling the antiviral response. Cells that had undergone depletion of SETX and SETX-deficient cells derived from patients with AOA2 had higher expression of antiviral mediators in response to infection than did wild-type cells. Mechanistically, we propose a model whereby SETX attenuates the activity of RNA polymerase II (RNAPII) at genes stimulated after a virus is sensed and thus controls the magnitude of the host response to pathogens and the biogenesis of various RNA viruses (e.g., influenza A virus and West Nile virus). Our data indicate a potentially causal link among inborn errors in SETX, susceptibility to infection and the development of neurologic disorders.

Research paper thumbnail of Monitoring global messenger RNA changes in externally controlled microarray experiments

Embo Reports, 2003

Expression profiling is a universal tool, with a range of applications that benefit from the accu... more Expression profiling is a universal tool, with a range of applications that benefit from the accurate determination of differential gene expression. To allow normalization using endogenous transcript levels, current microarray analyses assume that relatively few transcripts vary, or that any changes that occur are balanced. When normalization using endogenous genes is carried out, changes in expression levels are calculated relative

Research paper thumbnail of A novel germline mutation of PTEN associated with brain tumours of multiple lineages

We have identified a novel germline mutation in the PTEN tumour suppressor gene. The mutation was... more We have identified a novel germline mutation in the PTEN tumour suppressor gene. The mutation was identified in a patient with a glioma, and turned out to be a heterozygous germline mutation of PTEN (Arg234Gln), without loss of heterozygosity in tumour DNA. The biological consequences of this germline mutation were investigated by means of transfection studies of the mutant PTEN molecule compared to wild-type PTEN. In contrast to the wild-type molecule, the mutant PTEN protein is not capable of inducing apoptosis, induces increased cell proliferation and leads to high constitutive PKB/Akt activation, which cannot be increased anymore by stimulation with insulin. The reported patient, in addition to glioma, had suffered from benign meningioma in the past but did not show any clinical signs of Cowden disease or other hereditary diseases typically associated with PTEN germline mutations. The functional consequences of the mutation in transfection studies are consistent with high proliferative activity. Together, these findings suggest that the Arg234Gln missense mutation in PTEN has oncogenic properties and predisposes to brain tumours of multiple lineages.

Research paper thumbnail of A systematic characterization of Cwc21, the yeast ortholog of the human spliceosomal protein SRm300

Cwc21 (complexed with Cef1 protein 21) is a 135 amino acid yeast protein that shares homology wit... more Cwc21 (complexed with Cef1 protein 21) is a 135 amino acid yeast protein that shares homology with the N-terminal domain of human SRm300/SRRM2, a large serine/arginine-repeat protein shown previously to associate with the splicing coactivator and 39-end processing stimulatory factor, SRm160. Proteomic analysis of spliceosomal complexes has suggested a role for Cwc21 and SRm300 at the core of the spliceosome. However, specific functions for these proteins have remained elusive. In this report, we employ quantitative genetic interaction mapping, mass spectrometry of tandem affinity-purified complexes, and microarray profiling to investigate genetic, physical, and functional interactions involving Cwc21. Combined data from these assays support multiple roles for Cwc21 in the formation and function of splicing complexes. Consistent with a role for Cwc21 at the core of the spliceosome, we observe strong genetic, physical, and functional interactions with Isy1, a protein previously implicated in the first catalytic step of splicing and splicing fidelity. Together, the results suggest multiple functions for Cwc21/SRm300 in the splicing process, including an important role in the activation of splicing in association with Isy1.

Research paper thumbnail of Response to "The reality of pervasive transcription

Clark et al. criticize several aspects of our study , and specifically challenge our assertion th... more Clark et al. criticize several aspects of our study , and specifically challenge our assertion that the degree of pervasive transcription has previously been overstated. We disagree with much of their reasoning and their interpretation of our work. For example, many of our conclusions are based on overall sequence read distributions, while Clark et al. focus on transcript units and seqfrags (sets of overlapping reads). A key point is that one can derive a robust estimate of the relative amounts of different transcript types without having a complete reconstruction of every single transcript.

Research paper thumbnail of A Compendium of Nucleosome and Transcript Profiles Reveals Determinants of Chromatin Architecture and Transcription

Nucleosomes in all eukaryotes examined to date adopt a characteristic architecture within genes a... more Nucleosomes in all eukaryotes examined to date adopt a characteristic architecture within genes and play fundamental roles in regulating transcription, yet the identity and precise roles of many of the trans-acting factors responsible for the establishment and maintenance of this organization remain to be identified. We profiled a compendium of 50 yeast strains carrying conditional alleles or complete deletions of genes involved in transcriptional regulation, histone biology, and chromatin remodeling, as well as compounds that target transcription and histone deacetylases, to assess their respective roles in nucleosome positioning and transcription. We find that nucleosome patterning in genes is affected by many factors, including the CAF-1 complex, Spt10, and Spt21, in addition to previously reported remodeler ATPases and histone chaperones. Disruption of these factors or reductions in histone levels led genic nucleosomes to assume positions more consistent with their intrinsic sequence preferences, with pronounced and specific shifts of the +1 nucleosome relative to the transcription start site. These shifts of +1 nucleosomes appear to have functional consequences, as several affected genes in Ino80 mutants exhibited altered expression responses. Our parallel expression profiling compendium revealed extensive transcription changes in intergenic and antisense regions, most of which occur in regions with altered nucleosome occupancy and positioning. We show that the nucleosome-excluding transcription factors Reb1, Abf1, Tbf1, and Rsc3 suppress cryptic transcripts at their target promoters, while a combined analysis of nucleosome and expression profiles identified 36 novel transcripts that are normally repressed by Tup1/Cyc8. Our data confirm and extend the roles of chromatin remodelers and chaperones as major determinants of genic nucleosome positioning, and these data provide a valuable resource for future studies.

Research paper thumbnail of Global transcriptional responses of fission and budding yeast to changes in copper and iron levels: a comparative study

Research paper thumbnail of EXPRESSION PROFILES OF COELIAC DISEASE BIOPSIES DURING MUCOSA RECOVERY: SY2-02

Journal of Pediatric Gastroenterology and Nutrition, 2005

Research paper thumbnail of A Library of Yeast Transcription Factor Motifs Reveals a Widespread Function for Rsc3 in Targeting Nucleosome Exclusion at Promoters

Molecular Cell, 2008

The sequence specificity of DNA-binding proteins is the primary mechanism by which the cell recog... more The sequence specificity of DNA-binding proteins is the primary mechanism by which the cell recognizes genomic features. Here, we describe systematic determination of yeast transcription factor DNA-binding specificities. We obtained binding specificities for 112 DNA-binding proteins representing 19 distinct structural classes, one-third of which have not been previously reported. Several newly discovered binding sequences have striking genomic distributions relative to transcription start sites, supporting their biological relevance and suggesting a role in promoter architecture. Among these are Rsc3 binding sequences, containing the core CGCG, which are found preferentially ~100 bp upstream of transcription start sites. Mutation of RSC3 results in a dramatic increase in nucleosome occupancy in hundreds of proximal promoters containing a Rsc3 binding element, but has little impact on promoters lacking Rsc3 binding sequences, indicating that Rsc3 plays a broad role in targeting nucleosome exclusion at yeast promoters. 8Corresponding author: t.hughes@utoronto.ca; Tel. 416-946-8260; FAX 416-978-8258. Supplementary material and URLs. Supplementary data files including clone sequences and 8-mer scores and motifs for all TFs are posted at

Research paper thumbnail of Two-Color Cell Array Screen Reveals Interdependent Roles for Histone Chaperones and a Chromatin Boundary Regulator in Histone Gene Repression

Research paper thumbnail of Determination and Inference of Eukaryotic Transcription Factor Sequence Specificity

Cell, 2014

These data present a powerful resource for mapping transcriptional networks across eukaryotes.

Research paper thumbnail of The draft genome and transcriptome of Cannabis sativa

Genome Biology, 2011

Background: Cannabis sativa has been cultivated throughout human history as a source of fiber, oi... more Background: Cannabis sativa has been cultivated throughout human history as a source of fiber, oil and food, and for its medicinal and intoxicating properties. Selective breeding has produced cannabis plants for specific uses, including high-potency marijuana strains and hemp cultivars for fiber and seed production. The molecular biology underlying cannabinoid biosynthesis and other traits of interest is largely unexplored.

Research paper thumbnail of Lsr2 is a nucleoid-associated protein that targets AT-rich sequences and virulence genes in Mycobacterium tuberculosis

Proceedings of the National Academy of Sciences, 2010

Bacterial nucleoid-associated proteins play important roles in chromosome organization and global... more Bacterial nucleoid-associated proteins play important roles in chromosome organization and global gene regulation. We find that Lsr2 of Mycobacterium tuberculosis is a unique nucleoid-associated protein that binds AT-rich regions of the genome, including genomic islands acquired by horizontal gene transfer and regions encoding major virulence factors, such as the ESX secretion systems, the lipid virulence factors PDIM and PGL, and the PE/PPE families of antigenic proteins. Comparison of genome-wide binding data with expression data indicates that Lsr2 binding results in transcriptional repression. Domain-swapping experiments demonstrate that Lsr2 has an N-terminal dimerization domain and a C-terminal DNA-binding domain. Nuclear magnetic resonance analysis of the DNA-binding domain of Lsr2 and its interaction with DNA reveals a unique structure and a unique mechanism that enables Lsr2 to discriminately target AT-rich sequences through interactions with the minor groove of DNA. Taken together, we provide evidence that mycobacteria have employed a structurally distinct molecule with an apparently different DNA recognition mechanism to achieve a function similar to the Enterobacteriaceae H-NS, likely coordinating global gene regulation and virulence in this group of medically important bacteria.

Research paper thumbnail of Most “Dark Matter” Transcripts Are Associated With Known Genes

PLoS Biology, 2010

A series of reports over the last few years have indicated that a much larger portion of the mamm... more A series of reports over the last few years have indicated that a much larger portion of the mammalian genome is transcribed than can be accounted for by currently annotated genes, but the quantity and nature of these additional transcripts remains unclear. Here, we have used data from single-and paired-end RNA-Seq and tiling arrays to assess the quantity and composition of transcripts in PolyA+ RNA from human and mouse tissues. Relative to tiling arrays, RNA-Seq identifies many fewer transcribed regions (''seqfrags'') outside known exons and ncRNAs. Most nonexonic seqfrags are in introns, raising the possibility that they are fragments of pre-mRNAs. The chromosomal locations of the majority of intergenic seqfrags in RNA-Seq data are near known genes, consistent with alternative cleavage and polyadenylation site usage, promoter-and terminator-associated transcripts, or new alternative exons; indeed, reads that bridge splice sites identified 4,544 new exons, affecting 3,554 genes. Most of the remaining seqfrags correspond to either single reads that display characteristics of random sampling from a low-level background or several thousand small transcripts (median length = 111 bp) present at higher levels, which also tend to display sequence conservation and originate from regions with open chromatin. We conclude that, while there are bona fide new intergenic transcripts, their number and abundance is generally low in comparison to known exons, and the genome is not as pervasively transcribed as previously reported.

Research paper thumbnail of Gene expression profiling and phenotype analyses of S. cerevisiae in response to changing copper reveals six genes with new roles in copper and iron metabolism

Physiological Genomics, 2005

Exhaustive microarray time-course analyses of S.cerevisiae during copper starvation and copper ex... more Exhaustive microarray time-course analyses of S.cerevisiae during copper starvation and copper excess reveal new aspects of metal-induced gene regulation. Besides identifying targets of established copper-and iron-responsive transcription factors, we find that genes encoding mitochondrial proteins are down-regulated and that copper-independent iron transport genes are preferentially upregulated, both during prolonged copper deprivation. The experiments also suggest the presence of a small regulatory iron pool that links copper and iron responses. 128 genes with putative roles in metal metabolism were further investigated by several systematic phenotype screens. Of the novel phenotypes uncovered, hsp12-∆ and arn1-∆ display increased sensitivity to copper, cyc1-∆ and crr1-∆ show resistance to high copper, vma13-∆ exhibits increased sensitivity to iron deprivation, and pep12-∆ results in reduced growth in high copper and low iron. Besides revealing new components of eukaryotic metal trafficking pathways, the results underscore the previously determined intimate links between iron and copper metabolism, mitochondrial and vacuolar function in metal trafficking. The analyses further suggest that copper starvation can specifically lead to downregulation of respiratory function to preserve iron and copper for other cellular processes.

Research paper thumbnail of Improved genome-wide localization by ChIP-chip using double-round T7 RNA polymerase-based amplification

Nucleic Acids Research, 2007

Chromatin immunoprecipitation combined with DNA microarrays (ChIP-chip) is a powerful technique t... more Chromatin immunoprecipitation combined with DNA microarrays (ChIP-chip) is a powerful technique to detect in vivo protein-DNA interactions. Due to low yields, ChIP assays of transcription factors generally require amplification of immunoprecipitated genomic DNA. Here, we present an adapted linear amplification method that involves two rounds of T7 RNA polymerase amplification (double-T7). Using this we could successfully amplify as little as 0.4 ng of ChIP DNA to sufficient amounts for microarray analysis. In addition, we compared the double-T7 method to the ligation-mediated polymerase chain reaction (LM-PCR) method in a ChIP-chip of the yeast transcription factor Gsm1p. The double-T7 protocol showed lower noise levels and stronger binding signals compared to LM-PCR. Both LM-PCR and double-T7 identified strongly bound genomic regions, but the double-T7 method increased sensitivity and specificity to allow detection of weaker binding sites.

Research paper thumbnail of Mediator Expression Profiling Epistasis Reveals a Signal Transduction Pathway with Antagonistic Submodules and Highly Specific Downstream Targets

Molecular Cell, 2005

Mediator is an evolutionarily conserved coregulator of RNA polymerase II transcription. Microarra... more Mediator is an evolutionarily conserved coregulator of RNA polymerase II transcription. Microarray structure-function analysis of S.cerevisiae Mediator reveals functional antagonism between the cyclin dependent kinase (Cdk) submodule and components from the Tail (Med15, Med2, Med3), Head (Med20, Med18) and Middle (Med31). Certain genes exhibit increased or decreased expression, depending on which subunit is deleted. Epistasis analysis with expression-profile phenotypes, shows that MED2 and MED18 are downstream of CDK8. Strikingly, Cdk8mediated modification of a single amino acid within Mediator represses the regulon of a single transcription factor, Rcs1/Aft1. Highly specific gene regulation is thought to be determined by activators and combinatorial use of cofactors. Here, subtle modification of the general transcription machinery through one of its own components is shown to determine highly specific expression patterns. Expression-profiling can therefore precisely map regulatory cascades and our findings support a role for Mediator as a direct processor of signaling pathways for determining specificity.

Research paper thumbnail of Interplay of host microbiota, genetic perturbations, and inflammation promotes local development of intestinal neoplasms in mice

Journal of Experimental Medicine, 2014

Abbreviations used: FDR, false discovery rate; GI, gastrointesti nal; OTU, operational taxo nomic... more Abbreviations used: FDR, false discovery rate; GI, gastrointesti nal; OTU, operational taxo nomic unit; qPCR, quantitative PCR; rDNA, ribosomal DNA; SP, serrated polyp.

Research paper thumbnail of Cooperative action of NC2 and Mot1p to regulate TATA-binding protein function across the genome

Promoter recognition by TATA-binding protein (TBP) is an essential step in the initiation of RNA ... more Promoter recognition by TATA-binding protein (TBP) is an essential step in the initiation of RNA polymerase II (pol II) mediated transcription. Genetic and biochemical studies in yeast have shown that Mot1p and NC2 play important roles in inhibiting TBP activity. To understand how TBP activity is regulated in a genome-wide manner, we profiled the binding of TBP, NC2, Mot1p, TFIID,

Research paper thumbnail of BCL11B expression in intramembranous osteogenesis during murine craniofacial suture development

Gene expression patterns : GEP, 2015

Sutures, where neighboring craniofacial bones are separated by undifferentiated mesenchyme, are m... more Sutures, where neighboring craniofacial bones are separated by undifferentiated mesenchyme, are major growth sites during craniofacial development. Pathologic fusion of bones within sutures occurs in a wide variety of craniosynostosis conditions and can result in dysmorphic craniofacial growth and secondary neurologic deficits. Our knowledge of the genes involved in suture formation is poor. Here we describe the novel expression pattern of the BCL11B transcription factor protein during murine embryonic craniofacial bone formation. We examined BCL11B protein expression at E14.5, E16.5, and E18.5 in 14 major craniofacial sutures of C57BL/6J mice. We found BCL11B expression to be associated with all intramembranous craniofacial bones examined. The most striking aspects of BCL11B expression were its high levels in suture mesenchyme and increasingly complementary expression with RUNX2 in differentiating osteoblasts during development. BCL11B was also expressed in mesenchyme at the non-su...

Research paper thumbnail of Senataxin suppresses the antiviral transcriptional response and controls viral biogenesis

Nature Immunology, 2015

The human helicase senataxin (SETX) has been linked to the neurodegenerative diseases amyotrophic... more The human helicase senataxin (SETX) has been linked to the neurodegenerative diseases amyotrophic lateral sclerosis (ALS4) and ataxia with oculomotor apraxia (AOA2). Here we identified a role for SETX in controlling the antiviral response. Cells that had undergone depletion of SETX and SETX-deficient cells derived from patients with AOA2 had higher expression of antiviral mediators in response to infection than did wild-type cells. Mechanistically, we propose a model whereby SETX attenuates the activity of RNA polymerase II (RNAPII) at genes stimulated after a virus is sensed and thus controls the magnitude of the host response to pathogens and the biogenesis of various RNA viruses (e.g., influenza A virus and West Nile virus). Our data indicate a potentially causal link among inborn errors in SETX, susceptibility to infection and the development of neurologic disorders.

Research paper thumbnail of Monitoring global messenger RNA changes in externally controlled microarray experiments

Embo Reports, 2003

Expression profiling is a universal tool, with a range of applications that benefit from the accu... more Expression profiling is a universal tool, with a range of applications that benefit from the accurate determination of differential gene expression. To allow normalization using endogenous transcript levels, current microarray analyses assume that relatively few transcripts vary, or that any changes that occur are balanced. When normalization using endogenous genes is carried out, changes in expression levels are calculated relative

Research paper thumbnail of A novel germline mutation of PTEN associated with brain tumours of multiple lineages

We have identified a novel germline mutation in the PTEN tumour suppressor gene. The mutation was... more We have identified a novel germline mutation in the PTEN tumour suppressor gene. The mutation was identified in a patient with a glioma, and turned out to be a heterozygous germline mutation of PTEN (Arg234Gln), without loss of heterozygosity in tumour DNA. The biological consequences of this germline mutation were investigated by means of transfection studies of the mutant PTEN molecule compared to wild-type PTEN. In contrast to the wild-type molecule, the mutant PTEN protein is not capable of inducing apoptosis, induces increased cell proliferation and leads to high constitutive PKB/Akt activation, which cannot be increased anymore by stimulation with insulin. The reported patient, in addition to glioma, had suffered from benign meningioma in the past but did not show any clinical signs of Cowden disease or other hereditary diseases typically associated with PTEN germline mutations. The functional consequences of the mutation in transfection studies are consistent with high proliferative activity. Together, these findings suggest that the Arg234Gln missense mutation in PTEN has oncogenic properties and predisposes to brain tumours of multiple lineages.

Research paper thumbnail of A systematic characterization of Cwc21, the yeast ortholog of the human spliceosomal protein SRm300

Cwc21 (complexed with Cef1 protein 21) is a 135 amino acid yeast protein that shares homology wit... more Cwc21 (complexed with Cef1 protein 21) is a 135 amino acid yeast protein that shares homology with the N-terminal domain of human SRm300/SRRM2, a large serine/arginine-repeat protein shown previously to associate with the splicing coactivator and 39-end processing stimulatory factor, SRm160. Proteomic analysis of spliceosomal complexes has suggested a role for Cwc21 and SRm300 at the core of the spliceosome. However, specific functions for these proteins have remained elusive. In this report, we employ quantitative genetic interaction mapping, mass spectrometry of tandem affinity-purified complexes, and microarray profiling to investigate genetic, physical, and functional interactions involving Cwc21. Combined data from these assays support multiple roles for Cwc21 in the formation and function of splicing complexes. Consistent with a role for Cwc21 at the core of the spliceosome, we observe strong genetic, physical, and functional interactions with Isy1, a protein previously implicated in the first catalytic step of splicing and splicing fidelity. Together, the results suggest multiple functions for Cwc21/SRm300 in the splicing process, including an important role in the activation of splicing in association with Isy1.

Research paper thumbnail of Response to "The reality of pervasive transcription

Clark et al. criticize several aspects of our study , and specifically challenge our assertion th... more Clark et al. criticize several aspects of our study , and specifically challenge our assertion that the degree of pervasive transcription has previously been overstated. We disagree with much of their reasoning and their interpretation of our work. For example, many of our conclusions are based on overall sequence read distributions, while Clark et al. focus on transcript units and seqfrags (sets of overlapping reads). A key point is that one can derive a robust estimate of the relative amounts of different transcript types without having a complete reconstruction of every single transcript.

Research paper thumbnail of A Compendium of Nucleosome and Transcript Profiles Reveals Determinants of Chromatin Architecture and Transcription

Nucleosomes in all eukaryotes examined to date adopt a characteristic architecture within genes a... more Nucleosomes in all eukaryotes examined to date adopt a characteristic architecture within genes and play fundamental roles in regulating transcription, yet the identity and precise roles of many of the trans-acting factors responsible for the establishment and maintenance of this organization remain to be identified. We profiled a compendium of 50 yeast strains carrying conditional alleles or complete deletions of genes involved in transcriptional regulation, histone biology, and chromatin remodeling, as well as compounds that target transcription and histone deacetylases, to assess their respective roles in nucleosome positioning and transcription. We find that nucleosome patterning in genes is affected by many factors, including the CAF-1 complex, Spt10, and Spt21, in addition to previously reported remodeler ATPases and histone chaperones. Disruption of these factors or reductions in histone levels led genic nucleosomes to assume positions more consistent with their intrinsic sequence preferences, with pronounced and specific shifts of the +1 nucleosome relative to the transcription start site. These shifts of +1 nucleosomes appear to have functional consequences, as several affected genes in Ino80 mutants exhibited altered expression responses. Our parallel expression profiling compendium revealed extensive transcription changes in intergenic and antisense regions, most of which occur in regions with altered nucleosome occupancy and positioning. We show that the nucleosome-excluding transcription factors Reb1, Abf1, Tbf1, and Rsc3 suppress cryptic transcripts at their target promoters, while a combined analysis of nucleosome and expression profiles identified 36 novel transcripts that are normally repressed by Tup1/Cyc8. Our data confirm and extend the roles of chromatin remodelers and chaperones as major determinants of genic nucleosome positioning, and these data provide a valuable resource for future studies.

Research paper thumbnail of Global transcriptional responses of fission and budding yeast to changes in copper and iron levels: a comparative study

Research paper thumbnail of EXPRESSION PROFILES OF COELIAC DISEASE BIOPSIES DURING MUCOSA RECOVERY: SY2-02

Journal of Pediatric Gastroenterology and Nutrition, 2005

Research paper thumbnail of A Library of Yeast Transcription Factor Motifs Reveals a Widespread Function for Rsc3 in Targeting Nucleosome Exclusion at Promoters

Molecular Cell, 2008

The sequence specificity of DNA-binding proteins is the primary mechanism by which the cell recog... more The sequence specificity of DNA-binding proteins is the primary mechanism by which the cell recognizes genomic features. Here, we describe systematic determination of yeast transcription factor DNA-binding specificities. We obtained binding specificities for 112 DNA-binding proteins representing 19 distinct structural classes, one-third of which have not been previously reported. Several newly discovered binding sequences have striking genomic distributions relative to transcription start sites, supporting their biological relevance and suggesting a role in promoter architecture. Among these are Rsc3 binding sequences, containing the core CGCG, which are found preferentially ~100 bp upstream of transcription start sites. Mutation of RSC3 results in a dramatic increase in nucleosome occupancy in hundreds of proximal promoters containing a Rsc3 binding element, but has little impact on promoters lacking Rsc3 binding sequences, indicating that Rsc3 plays a broad role in targeting nucleosome exclusion at yeast promoters. 8Corresponding author: t.hughes@utoronto.ca; Tel. 416-946-8260; FAX 416-978-8258. Supplementary material and URLs. Supplementary data files including clone sequences and 8-mer scores and motifs for all TFs are posted at

Research paper thumbnail of Two-Color Cell Array Screen Reveals Interdependent Roles for Histone Chaperones and a Chromatin Boundary Regulator in Histone Gene Repression

Research paper thumbnail of Determination and Inference of Eukaryotic Transcription Factor Sequence Specificity

Cell, 2014

These data present a powerful resource for mapping transcriptional networks across eukaryotes.

Research paper thumbnail of The draft genome and transcriptome of Cannabis sativa

Genome Biology, 2011

Background: Cannabis sativa has been cultivated throughout human history as a source of fiber, oi... more Background: Cannabis sativa has been cultivated throughout human history as a source of fiber, oil and food, and for its medicinal and intoxicating properties. Selective breeding has produced cannabis plants for specific uses, including high-potency marijuana strains and hemp cultivars for fiber and seed production. The molecular biology underlying cannabinoid biosynthesis and other traits of interest is largely unexplored.

Research paper thumbnail of Lsr2 is a nucleoid-associated protein that targets AT-rich sequences and virulence genes in Mycobacterium tuberculosis

Proceedings of the National Academy of Sciences, 2010

Bacterial nucleoid-associated proteins play important roles in chromosome organization and global... more Bacterial nucleoid-associated proteins play important roles in chromosome organization and global gene regulation. We find that Lsr2 of Mycobacterium tuberculosis is a unique nucleoid-associated protein that binds AT-rich regions of the genome, including genomic islands acquired by horizontal gene transfer and regions encoding major virulence factors, such as the ESX secretion systems, the lipid virulence factors PDIM and PGL, and the PE/PPE families of antigenic proteins. Comparison of genome-wide binding data with expression data indicates that Lsr2 binding results in transcriptional repression. Domain-swapping experiments demonstrate that Lsr2 has an N-terminal dimerization domain and a C-terminal DNA-binding domain. Nuclear magnetic resonance analysis of the DNA-binding domain of Lsr2 and its interaction with DNA reveals a unique structure and a unique mechanism that enables Lsr2 to discriminately target AT-rich sequences through interactions with the minor groove of DNA. Taken together, we provide evidence that mycobacteria have employed a structurally distinct molecule with an apparently different DNA recognition mechanism to achieve a function similar to the Enterobacteriaceae H-NS, likely coordinating global gene regulation and virulence in this group of medically important bacteria.

Research paper thumbnail of Most “Dark Matter” Transcripts Are Associated With Known Genes

PLoS Biology, 2010

A series of reports over the last few years have indicated that a much larger portion of the mamm... more A series of reports over the last few years have indicated that a much larger portion of the mammalian genome is transcribed than can be accounted for by currently annotated genes, but the quantity and nature of these additional transcripts remains unclear. Here, we have used data from single-and paired-end RNA-Seq and tiling arrays to assess the quantity and composition of transcripts in PolyA+ RNA from human and mouse tissues. Relative to tiling arrays, RNA-Seq identifies many fewer transcribed regions (''seqfrags'') outside known exons and ncRNAs. Most nonexonic seqfrags are in introns, raising the possibility that they are fragments of pre-mRNAs. The chromosomal locations of the majority of intergenic seqfrags in RNA-Seq data are near known genes, consistent with alternative cleavage and polyadenylation site usage, promoter-and terminator-associated transcripts, or new alternative exons; indeed, reads that bridge splice sites identified 4,544 new exons, affecting 3,554 genes. Most of the remaining seqfrags correspond to either single reads that display characteristics of random sampling from a low-level background or several thousand small transcripts (median length = 111 bp) present at higher levels, which also tend to display sequence conservation and originate from regions with open chromatin. We conclude that, while there are bona fide new intergenic transcripts, their number and abundance is generally low in comparison to known exons, and the genome is not as pervasively transcribed as previously reported.

Research paper thumbnail of Gene expression profiling and phenotype analyses of S. cerevisiae in response to changing copper reveals six genes with new roles in copper and iron metabolism

Physiological Genomics, 2005

Exhaustive microarray time-course analyses of S.cerevisiae during copper starvation and copper ex... more Exhaustive microarray time-course analyses of S.cerevisiae during copper starvation and copper excess reveal new aspects of metal-induced gene regulation. Besides identifying targets of established copper-and iron-responsive transcription factors, we find that genes encoding mitochondrial proteins are down-regulated and that copper-independent iron transport genes are preferentially upregulated, both during prolonged copper deprivation. The experiments also suggest the presence of a small regulatory iron pool that links copper and iron responses. 128 genes with putative roles in metal metabolism were further investigated by several systematic phenotype screens. Of the novel phenotypes uncovered, hsp12-∆ and arn1-∆ display increased sensitivity to copper, cyc1-∆ and crr1-∆ show resistance to high copper, vma13-∆ exhibits increased sensitivity to iron deprivation, and pep12-∆ results in reduced growth in high copper and low iron. Besides revealing new components of eukaryotic metal trafficking pathways, the results underscore the previously determined intimate links between iron and copper metabolism, mitochondrial and vacuolar function in metal trafficking. The analyses further suggest that copper starvation can specifically lead to downregulation of respiratory function to preserve iron and copper for other cellular processes.

Research paper thumbnail of Improved genome-wide localization by ChIP-chip using double-round T7 RNA polymerase-based amplification

Nucleic Acids Research, 2007

Chromatin immunoprecipitation combined with DNA microarrays (ChIP-chip) is a powerful technique t... more Chromatin immunoprecipitation combined with DNA microarrays (ChIP-chip) is a powerful technique to detect in vivo protein-DNA interactions. Due to low yields, ChIP assays of transcription factors generally require amplification of immunoprecipitated genomic DNA. Here, we present an adapted linear amplification method that involves two rounds of T7 RNA polymerase amplification (double-T7). Using this we could successfully amplify as little as 0.4 ng of ChIP DNA to sufficient amounts for microarray analysis. In addition, we compared the double-T7 method to the ligation-mediated polymerase chain reaction (LM-PCR) method in a ChIP-chip of the yeast transcription factor Gsm1p. The double-T7 protocol showed lower noise levels and stronger binding signals compared to LM-PCR. Both LM-PCR and double-T7 identified strongly bound genomic regions, but the double-T7 method increased sensitivity and specificity to allow detection of weaker binding sites.

Research paper thumbnail of Mediator Expression Profiling Epistasis Reveals a Signal Transduction Pathway with Antagonistic Submodules and Highly Specific Downstream Targets

Molecular Cell, 2005

Mediator is an evolutionarily conserved coregulator of RNA polymerase II transcription. Microarra... more Mediator is an evolutionarily conserved coregulator of RNA polymerase II transcription. Microarray structure-function analysis of S.cerevisiae Mediator reveals functional antagonism between the cyclin dependent kinase (Cdk) submodule and components from the Tail (Med15, Med2, Med3), Head (Med20, Med18) and Middle (Med31). Certain genes exhibit increased or decreased expression, depending on which subunit is deleted. Epistasis analysis with expression-profile phenotypes, shows that MED2 and MED18 are downstream of CDK8. Strikingly, Cdk8mediated modification of a single amino acid within Mediator represses the regulon of a single transcription factor, Rcs1/Aft1. Highly specific gene regulation is thought to be determined by activators and combinatorial use of cofactors. Here, subtle modification of the general transcription machinery through one of its own components is shown to determine highly specific expression patterns. Expression-profiling can therefore precisely map regulatory cascades and our findings support a role for Mediator as a direct processor of signaling pathways for determining specificity.

Research paper thumbnail of Interplay of host microbiota, genetic perturbations, and inflammation promotes local development of intestinal neoplasms in mice

Journal of Experimental Medicine, 2014

Abbreviations used: FDR, false discovery rate; GI, gastrointesti nal; OTU, operational taxo nomic... more Abbreviations used: FDR, false discovery rate; GI, gastrointesti nal; OTU, operational taxo nomic unit; qPCR, quantitative PCR; rDNA, ribosomal DNA; SP, serrated polyp.