A map of the cis-regulatory sequences in the mouse genome (original) (raw)
- Letter
- Published: 01 July 2012
- Feng Yue1 na1,
- David F. McCleary1,
- Zhen Ye1,
- Lee Edsall1,
- Samantha Kuan1,
- Ulrich Wagner1,
- Jesse Dixon1,2,3,
- Leonard Lee1,
- Victor V. Lobanenkov4 &
- …
- Bing Ren1,5
Nature volume 488, pages 116–120 (2012)Cite this article
- 37k Accesses
- 1053 Citations
- 44 Altmetric
- Metrics details
Subjects
Abstract
The laboratory mouse is the most widely used mammalian model organism in biomedical research. The 2.6 × 109 bases of the mouse genome possess a high degree of conservation with the human genome1, so a thorough annotation of the mouse genome will be of significant value to understanding the function of the human genome. So far, most of the functional sequences in the mouse genome have yet to be found, and the _cis_-regulatory sequences in particular are still poorly annotated. Comparative genomics has been a powerful tool for the discovery of these sequences2, but on its own it cannot resolve their temporal and spatial functions. Recently, ChIP-Seq has been developed to identify _cis_-regulatory elements in the genomes of several organisms including humans, Drosophila melanogaster and Caenorhabditis elegans3,4,5. Here we apply the same experimental approach to a diverse set of 19 tissues and cell types in the mouse to produce a map of nearly 300,000 murine _cis_-regulatory sequences. The annotated sequences add up to 11% of the mouse genome, and include more than 70% of conserved non-coding sequences. We define tissue-specific enhancers and identify potential transcription factors regulating gene expression in each tissue or cell type. Finally, we show that much of the mouse genome is organized into domains of coordinately regulated enhancers and promoters. Our results provide a resource for the annotation of functional elements in the mammalian genome and for the study of mechanisms regulating tissue-specific gene expression.
This is a preview of subscription content, access via your institution
Access options
Subscribe to this journal
Receive 51 print issues and online access
$199.00 per year
only $3.90 per issue
Buy this article
- Purchase on SpringerLink
- Instant access to full article PDF
Prices may be subject to local taxes which are calculated during checkout
Additional access options:
Similar content being viewed by others
Accession codes
Primary accessions
Gene Expression Omnibus
References
- Waterston, R. H. et al. Initial sequencing and comparative analysis of the mouse genome. Nature 420, 520–562 (2002)
Article ADS CAS Google Scholar - Visel, A., Rubin, E. M. & Pennacchio, L. A. Genomic views of distant-acting enhancers. Nature 461, 199–205 (2009)
Article ADS CAS Google Scholar - The ENCODE Project Consortium. A user’s guide to the encyclopedia of DNA elements (ENCODE). PLoS Biol. 9. e1001046 (2011)
- Gerstein, M. B. et al. Integrative analysis of the Caenorhabditis elegans genome by the modENCODE project. Science 330, 1775–1787 (2010)
Article ADS CAS Google Scholar - Roy, S. et al. Identification of functional elements and regulatory circuits by Drosophila modENCODE. Science 330, 1787–1797 (2010)
Article ADS CAS Google Scholar - Barski, A. et al. High-resolution profiling of histone methylations in the human genome. Cell 129, 823–837 (2007)
Article CAS Google Scholar - Creyghton, M. P. et al. Histone H3K27ac separates active from poised enhancers and predicts developmental state. Proc. Natl Acad. Sci. USA 107, 21931–21936 (2010)
Article ADS CAS Google Scholar - Kim, T. H. et al. A high-resolution map of active promoters in the human genome. Nature 436, 876–880 (2005)
Article ADS CAS Google Scholar - Rada-Iglesias, A. et al. A unique chromatin signature uncovers early developmental enhancers in humans. Nature 470, 279–283 (2011)
Article ADS CAS Google Scholar - Heintzman, N. D. et al. Histone modifications at human enhancers reflect global cell-type-specific gene expression. Nature 459, 108–112 (2009)
Article ADS CAS Google Scholar - Heintzman, N. D. et al. Distinct and predictive chromatin signatures of transcriptional promoters and enhancers in the human genome. Nature Genet. 39, 311–318 (2007)
Article CAS Google Scholar - Kim, T. H. et al. Analysis of the vertebrate insulator protein CTCF-binding sites in the human genome. Cell 128, 1231–1245 (2007)
Article CAS Google Scholar - Visel, A. et al. ChIP-seq accurately predicts tissue-specific activity of enhancers. Nature 457, 854–858 (2009)
Article ADS CAS Google Scholar - Parkhomchuk, D. et al. Transcriptome analysis by strand-specific sequencing of complementary DNA. Nucleic Acids Res. 37, e123 (2009)
Article Google Scholar - Pruitt, K. D., Tatusova, T. & Maglott, D. R. NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res. 35, D61–D65 (2007)
Article CAS Google Scholar - Visel, A., Minovitsky, S., Dubchak, I. & Pennacchio, L. A. VISTA Enhancer Browser—a database of tissue-specific human enhancers. Nucleic Acids Res. 35, D88–D92 (2007)
Article CAS Google Scholar - Chen, X. et al. Integration of external signaling pathways with the core transcriptional network in embryonic stem cells. Cell 133, 1106–1117 (2008)
Article CAS Google Scholar - Schmidt, D. et al. Five-vertebrate ChIP-seq reveals the evolutionary dynamics of transcription factor binding. Science 328, 1036–1040 (2010)
Article ADS CAS Google Scholar - Birney, E. et al. Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature 447, 799–816 (2007)
Article ADS CAS Google Scholar - Siepel, A. et al. Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. Genome Res. 15, 1034–1050 (2005)
Article CAS Google Scholar - Ernst, J. et al. Mapping and analysis of chromatin state dynamics in nine human cell types. Nature 473, 43–49 (2011)
Article ADS CAS Google Scholar - Li, G. et al. Extensive promoter-centered chromatin interactions provide a topological basis for transcription regulation. Cell 148, 84–98 (2012)
Article CAS Google Scholar - Ong, C. T. & Corces, V. G. Enhancer function: new insights into the regulation of tissue-specific gene expression. Nature Rev. Genet. 12, 283–293 (2011)
Article CAS Google Scholar - Kagey, M. H. et al. Mediator and cohesin connect gene expression and chromatin architecture. Nature 467, 430–435 (2010)
Article ADS CAS Google Scholar - Splinter, E. et al. CTCF mediates long-range chromatin looping and local histone modification in the β-globin locus. Genes Dev. 20, 2349–2354 (2006)
Article CAS Google Scholar - Nora, E. P. et al. Spatial partitioning of the regulatory landscape of the X-inactivation centre. Nature 485, 381–385 (2012)
Article ADS CAS Google Scholar - Dixon, J. R. et al. Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature 485, 376–380 (2012)
Article ADS CAS Google Scholar - Lieberman-Aiden, E. et al. Comprehensive mapping of long-range interactions reveals folding principles of the human genome. Science 326, 289–293 (2009)
Article ADS CAS Google Scholar - Chepelev, I., Wei, G., Wangsa, D., Tang, Q. & Zhao, K. Characterization of genome-wide enhancer–promoter interactions reveals co-expression of interacting genes and modes of higher order chromatin organization. Cell Res. 22, 490–503 (2012)
Article CAS Google Scholar - Hawkins, R. D. et al. Distinct epigenomic landscapes of pluripotent and lineage-committed human cells. Cell Stem Cell 6, 479–491 (2010)
Article CAS Google Scholar
Acknowledgements
We thank F. Jin, Y. Luu, S. Klugman, A. Y.-J. Kim, Q.-M. Ngo, B. A. Gomez and S. Selvaraj for consultation. The mESC line Bruce4 was a gift from UCSD Transgenic Core. Research funding was provided by the National Human Genome Research Institute (R01HG003991) and the Ludwig Institute for Cancer Research to B.R. Y.S. is supported by a postdoctoral fellowship from the International Rett Syndrome Foundation. J.D. is supported by a pre-doctoral fellowship from the California Institute for Regenerative Medicine.
Author information
Author notes
- Yin Shen and Feng Yue: These authors contributed equally to this work.
Authors and Affiliations
- Ludwig Institute for Cancer Research, 9500 Gilman Drive, La Jolla, California 92093-0653, USA ,
Yin Shen, Feng Yue, David F. McCleary, Zhen Ye, Lee Edsall, Samantha Kuan, Ulrich Wagner, Jesse Dixon, Leonard Lee & Bing Ren - Medical Scientist Training Program, University of California, San Diego School of Medicine, 9500 Gilman Drive, La Jolla, California 92093-0653, USA ,
Jesse Dixon - Biomedical Sciences Graduate Program, University of California, San Diego School of Medicine, 9500 Gilman Drive, La Jolla, California 92093-0653, USA ,
Jesse Dixon - Laboratory of Immunogenetics, National Institute of Allergy and Infectious Diseases, Twinbrook I NIAID Facility, Room 1417, 5640 Fishers Lane, Rockville, Maryland 20852, USA ,
Victor V. Lobanenkov - Department of Cellular and Molecular Medicine, Institute of Genomic Medicine, Moores Cancer Center, University of California, San Diego School of Medicine, 9500 Gilman Drive, La Jolla, California 92093-0653, USA,
Bing Ren
Authors
- Yin Shen
You can also search for this author inPubMed Google Scholar - Feng Yue
You can also search for this author inPubMed Google Scholar - David F. McCleary
You can also search for this author inPubMed Google Scholar - Zhen Ye
You can also search for this author inPubMed Google Scholar - Lee Edsall
You can also search for this author inPubMed Google Scholar - Samantha Kuan
You can also search for this author inPubMed Google Scholar - Ulrich Wagner
You can also search for this author inPubMed Google Scholar - Jesse Dixon
You can also search for this author inPubMed Google Scholar - Leonard Lee
You can also search for this author inPubMed Google Scholar - Victor V. Lobanenkov
You can also search for this author inPubMed Google Scholar - Bing Ren
You can also search for this author inPubMed Google Scholar
Contributions
Y.S., F.Y. and B.R. designed the experiments. Y.S., D.M., Z.Y. and L.L. conducted experiments. F.Y. performed computational analysis. U.W. contributed to RNA-Seq data analysis. J.D. contributed to Hi-C data analysis. S.K. and L.E. performed DNA sequencing and initial data processing. V.L. provided CTCF monoclonal antibodies. Y.S., F.Y. and B.R. prepared the manuscript.
Corresponding author
Correspondence toBing Ren.
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Additional information
Data sets are available from the ENCODE website (http://genome.ucsc.edu/ENCODE), the supporting website for this paper (http://chromosome.sdsc.edu/mouse/index.html) and the Gene Expression Omnibus (GSE29184).
Supplementary information
Supplementary Information
This file contains Supplementary Text, Supplementary References, Supplementary Figures 1-16 and Supplementary Tables 1-3, 6, 8 and 11-16 - see separate zipped file for Supplementary Tables 4, 5, 7 and 9-10. (PDF 5454 kb)
Supplementary Data
This zipped file contains Supplementary Tables 4, 5, 7 and 9-10. (ZIP 23308 kb)
PowerPoint slides
Rights and permissions
About this article
Cite this article
Shen, Y., Yue, F., McCleary, D. et al. A map of the _cis_-regulatory sequences in the mouse genome.Nature 488, 116–120 (2012). https://doi.org/10.1038/nature11243
- Received: 12 May 2011
- Accepted: 18 May 2012
- Published: 01 July 2012
- Issue Date: 02 August 2012
- DOI: https://doi.org/10.1038/nature11243
This article is cited by
Editorial Summary
Further annotation of the mouse genome
The identification of _cis_-regulatory sequences in the mouse genome has lagged behind that of other model organisms. Here, a genomic map of nearly 300,000 potential _cis_-regulatory sequences has been experimentally determined from diverse mouse tissues and cell types. The map reveals active promoters, enhancers and CTCF (CCCTC-binding factor) sites in nearly 11% of the mouse genome and significantly expands the annotation of mammalian regulatory sequences.