ENCODE data in the UCSC Genome Browser: year 5 update - PubMed (original) (raw)
. 2013 Jan;41(Database issue):D56-63.
doi: 10.1093/nar/gks1172. Epub 2012 Nov 27.
Cricket A Sloan, Venkat S Malladi, Timothy R Dreszer, Katrina Learned, Vanessa M Kirkup, Matthew C Wong, Morgan Maddren, Ruihua Fang, Steven G Heitner, Brian T Lee, Galt P Barber, Rachel A Harte, Mark Diekhans, Jeffrey C Long, Steven P Wilder, Ann S Zweig, Donna Karolchik, Robert M Kuhn, David Haussler, W James Kent
Affiliations
- PMID: 23193274
- PMCID: PMC3531152
- DOI: 10.1093/nar/gks1172
ENCODE data in the UCSC Genome Browser: year 5 update
Kate R Rosenbloom et al. Nucleic Acids Res. 2013 Jan.
Abstract
The Encyclopedia of DNA Elements (ENCODE), http://encodeproject.org, has completed its fifth year of scientific collaboration to create a comprehensive catalog of functional elements in the human genome, and its third year of investigations in the mouse genome. Since the last report in this journal, the ENCODE human data repertoire has grown by 898 new experiments (totaling 2886), accompanied by a major integrative analysis. In the mouse genome, results from 404 new experiments became available this year, increasing the total to 583, collected during the course of the project. The University of California, Santa Cruz, makes this data available on the public Genome Browser http://genome.ucsc.edu for visual browsing and data mining. Download of raw and processed data files are all supported. The ENCODE portal provides specialized tools and information about the ENCODE data sets.
Figures
Figure 1.
ENCODE data displayed in the UCSC Genome Browser together with annotations from the ENCODE Analysis Hub in the region of the nucleoporin gene NUP133 demonstrate the power this diversity of data provides for visual interpretation. The GENCODE Basic gene set shows this gene having four protein-coding splice variants and three smaller non-coding transcripts nearby. The proteogenomics track shows support for many of the coding exons, with protein localized in the nucleus, but not in plasma membrane or mitochondria. The long polyA RNA signal shows strong peaks over the exons and low intron signal in the cytosol, with greater signal in the nucleus. This is expected because nuclear mRNAs are not all completely spliced. The Combined Genome Segmentation integrates signal from many histones and classifies regions into those with characteristics of promoters (red), enhancers (yellow), insulators (blue), transcribed regions (green) and repressed (gray). Below are signal tracks from four of the eight histone modifications used as input to the segmentation. The promoter and transcribed regions agree with the RNA evidence, and like the RNA evidence show no evidence of transcription of the non-coding gene to the right of NUP133. Underneath the GM12878 histone signals is a track that overlays one of the histone signals, H3K27Ac, in seven different cell lines (with GM12878 shown in red). A peak in H3K27Ac appears at the enhancer, but as is often the case with enhancers, this appears to be relatively cell specific in contrast to the larger peak near the promoter, where the black coloration indicates the peak is shared by many cell types. The DNAse hypersensitivity and transcription factor tracks also provide evidence for both promoter and enhancer. Finally the mappability track indicates regions where short reads are not uniquely mappable, indicating the data are incomplete and therefore harder to interpret. Although most of this region is mappable, there are many small regions throughout and one larger region on the right where mapping is problematic. Overall, the ENCODE data in this region show strong evidence that this is a nuclear-localized protein-coding gene with a promoter that is used in a wide variety of cell types, and is likely to be regulated by tissue-specific enhancers as well.
Figure 2.
The ENCODE Analysis Hub at the EBI hosts over 2800 ENCODE data sets, organized in six tracks controlled via the track menu shown here.
Figure 3.
All three screens of the Experiment Matrix for mouse are shown overlaid. The Data Summary screen lists experiments by data type, and provides launching to the two matrix screens that organize the data by assay and cell type. Clicking the appropriate table row or matrix cell launches a Track or File search tool (based on the Track/File selector control) that allows further refinement of the selection for browsing or download.
Similar articles
- ENCODE whole-genome data in the UCSC Genome Browser.
Rosenbloom KR, Dreszer TR, Pheasant M, Barber GP, Meyer LR, Pohl A, Raney BJ, Wang T, Hinrichs AS, Zweig AS, Fujita PA, Learned K, Rhead B, Smith KE, Kuhn RM, Karolchik D, Haussler D, Kent WJ. Rosenbloom KR, et al. Nucleic Acids Res. 2010 Jan;38(Database issue):D620-5. doi: 10.1093/nar/gkp961. Epub 2009 Nov 17. Nucleic Acids Res. 2010. PMID: 19920125 Free PMC article. - ENCODE whole-genome data in the UCSC Genome Browser: update 2012.
Rosenbloom KR, Dreszer TR, Long JC, Malladi VS, Sloan CA, Raney BJ, Cline MS, Karolchik D, Barber GP, Clawson H, Diekhans M, Fujita PA, Goldman M, Gravell RC, Harte RA, Hinrichs AS, Kirkup VM, Kuhn RM, Learned K, Maddren M, Meyer LR, Pohl A, Rhead B, Wong MC, Zweig AS, Haussler D, Kent WJ. Rosenbloom KR, et al. Nucleic Acids Res. 2012 Jan;40(Database issue):D912-7. doi: 10.1093/nar/gkr1012. Epub 2011 Nov 9. Nucleic Acids Res. 2012. PMID: 22075998 Free PMC article. - ENCODE whole-genome data in the UCSC genome browser (2011 update).
Raney BJ, Cline MS, Rosenbloom KR, Dreszer TR, Learned K, Barber GP, Meyer LR, Sloan CA, Malladi VS, Roskin KM, Suh BB, Hinrichs AS, Clawson H, Zweig AS, Kirkup V, Fujita PA, Rhead B, Smith KE, Pohl A, Kuhn RM, Karolchik D, Haussler D, Kent WJ. Raney BJ, et al. Nucleic Acids Res. 2011 Jan;39(Database issue):D871-5. doi: 10.1093/nar/gkq1017. Epub 2010 Oct 30. Nucleic Acids Res. 2011. PMID: 21037257 Free PMC article. - UCSC genome browser tutorial.
Zweig AS, Karolchik D, Kuhn RM, Haussler D, Kent WJ. Zweig AS, et al. Genomics. 2008 Aug;92(2):75-84. doi: 10.1016/j.ygeno.2008.02.003. Epub 2008 Jun 2. Genomics. 2008. PMID: 18514479 Review. - The UCSC Genome Browser: What Every Molecular Biologist Should Know.
Mangan ME, Williams JM, Kuhn RM, Lathe WC 3rd. Mangan ME, et al. Curr Protoc Mol Biol. 2014 Jul 1;107:19.9.1-19.9.36. doi: 10.1002/0471142727.mb1909s107. Curr Protoc Mol Biol. 2014. PMID: 24984850 Free PMC article. Review.
Cited by
- Genomic Validation in the UK Biobank Cohort Suggests a Role of C8B and MFG-E8 in the Pathogenesis of Trigeminal Neuralgia.
Lafta MS, Rukh G, Hamdeh SA, Molero Y, Sokolov AV, Rostami E, Schiöth HB. Lafta MS, et al. J Mol Neurosci. 2024 Oct 3;74(4):91. doi: 10.1007/s12031-024-02263-x. J Mol Neurosci. 2024. PMID: 39361088 Free PMC article. - Dichotomous intronic polyadenylation profiles reveal multifaceted gene functions in the pan-cancer transcriptome.
Sun J, Kim JY, Jun S, Park M, de Jong E, Chang JW, Cheng S, Fan D, Chen Y, Griffin TJ, Lee JH, You HJ, Zhang W, Yong J. Sun J, et al. Exp Mol Med. 2024 Oct 1. doi: 10.1038/s12276-024-01289-w. Online ahead of print. Exp Mol Med. 2024. PMID: 39349823 - Multiplex, single-cell CRISPRa screening for cell type specific regulatory elements.
Chardon FM, McDiarmid TA, Page NF, Daza RM, Martin BK, Domcke S, Regalado SG, Lalanne JB, Calderon D, Li X, Starita LM, Sanders SJ, Ahituv N, Shendure J. Chardon FM, et al. Nat Commun. 2024 Sep 18;15(1):8209. doi: 10.1038/s41467-024-52490-4. Nat Commun. 2024. PMID: 39294132 Free PMC article. - Schizophrenia is associated with altered DNA methylation variance.
Kiltschewskij DJ, Reay WR, Cairns MJ. Kiltschewskij DJ, et al. Mol Psychiatry. 2024 Sep 13. doi: 10.1038/s41380-024-02749-5. Online ahead of print. Mol Psychiatry. 2024. PMID: 39271751 - Haplotype analysis identifies functional elements in monoclonal gammopathy of unknown significance.
Thomsen H, Chattopadhyay S, Weinhold N, Vodicka P, Vodickova L, Hoffmann P, Nöthen MM, Jöckel KH, Schmidt B, Hajek R, Hallmans G, Pettersson-Kymmer U, Späth F, Goldschmidt H, Hemminki K, Försti A. Thomsen H, et al. Blood Cancer J. 2024 Aug 20;14(1):140. doi: 10.1038/s41408-024-01121-8. Blood Cancer J. 2024. PMID: 39164264 Free PMC article.
References
- ENCODE Consortium. The ENCODE (ENCyclopedia Of DNA Elements) Project. Science. 2004;306:636–640. - PubMed
Publication types
MeSH terms
Grants and funding
- 3P41HG002371-12S1/HG/NHGRI NIH HHS/United States
- HHMI/Howard Hughes Medical Institute/United States
- 5U41HG004568-04/HG/NHGRI NIH HHS/United States
- 095908/Wellcome Trust/United Kingdom
- 3U41HG004568-03S1/HG/NHGRI NIH HHS/United States
- 5P41HG002371-12/HG/NHGRI NIH HHS/United States
LinkOut - more resources
Full Text Sources
Other Literature Sources