BigWig and BigBed: enabling browsing of large distributed datasets - PubMed (original) (raw)
BigWig and BigBed: enabling browsing of large distributed datasets
W J Kent et al. Bioinformatics. 2010.
Abstract
Summary: BigWig and BigBed files are compressed binary indexed files containing data at several resolutions that allow the high-performance display of next-generation sequencing experiment results in the UCSC Genome Browser. The visualization is implemented using a multi-layered software approach that takes advantage of specific capabilities of web-based protocols and Linux and UNIX operating systems files, R trees and various indexing and compression tricks. As a result, only the data needed to support the current browser view is transmitted rather than the entire file, enabling fast remote access to large distributed data sets.
Availability and implementation: Binaries for the BigWig and BigBed creation and parsing utilities may be downloaded at http://hgdownload.cse.ucsc.edu/admin/exe/linux.x86\_64/. Source code for the creation and visualization software is freely available for non-commercial use at http://hgdownload.cse.ucsc.edu/admin/jksrc.zip, implemented in C and supported on Linux. The UCSC Genome Browser is available at http://genome.ucsc.edu.
Figures
Fig. 1.
Genome Browser image of BigWig annotation tracks. The top track is displayed as a bar graph, the bottom track as a point graph. Shading is used to distinguish the mean (dark), one standard deviation above the mean (medium) and the maximum (light). Peaks with clipped tops are colored magenta.
Similar articles
- Track data hubs enable visualization of user-defined genome-wide annotations on the UCSC Genome Browser.
Raney BJ, Dreszer TR, Barber GP, Clawson H, Fujita PA, Wang T, Nguyen N, Paten B, Zweig AS, Karolchik D, Kent WJ. Raney BJ, et al. Bioinformatics. 2014 Apr 1;30(7):1003-5. doi: 10.1093/bioinformatics/btt637. Epub 2013 Nov 13. Bioinformatics. 2014. PMID: 24227676 Free PMC article. - The UCSC genome browser and associated tools.
Kuhn RM, Haussler D, Kent WJ. Kuhn RM, et al. Brief Bioinform. 2013 Mar;14(2):144-61. doi: 10.1093/bib/bbs038. Epub 2012 Aug 20. Brief Bioinform. 2013. PMID: 22908213 Free PMC article. - Navigating protected genomics data with UCSC Genome Browser in a Box.
Haeussler M, Raney BJ, Hinrichs AS, Clawson H, Zweig AS, Karolchik D, Casper J, Speir ML, Haussler D, Kent WJ. Haeussler M, et al. Bioinformatics. 2015 Mar 1;31(5):764-6. doi: 10.1093/bioinformatics/btu712. Epub 2014 Oct 27. Bioinformatics. 2015. PMID: 25348212 Free PMC article. - UCSC genome browser tutorial.
Zweig AS, Karolchik D, Kuhn RM, Haussler D, Kent WJ. Zweig AS, et al. Genomics. 2008 Aug;92(2):75-84. doi: 10.1016/j.ygeno.2008.02.003. Epub 2008 Jun 2. Genomics. 2008. PMID: 18514479 Review. - The UCSC genome browser: what every molecular biologist should know.
Mangan ME, Williams JM, Kuhn RM, Lathe WC 3rd. Mangan ME, et al. Curr Protoc Mol Biol. 2009 Oct;Chapter 19:Unit19.9. doi: 10.1002/0471142727.mb1909s88. Curr Protoc Mol Biol. 2009. PMID: 19816931 Free PMC article. Review.
Cited by
- Analysis of H3K4me3 and H3K27me3 bivalent promotors in HER2+ breast cancer cell lines reveals variations depending on estrogen receptor status and significantly correlates with gene expression.
Kaukonen D, Kaukonen R, Polit L, Hennessy BT, Lund R, Madden SF. Kaukonen D, et al. BMC Med Genomics. 2020 Jul 3;13(1):92. doi: 10.1186/s12920-020-00749-2. BMC Med Genomics. 2020. PMID: 32620123 Free PMC article. - Functional mapping of androgen receptor enhancer activity.
Huang CF, Lingadahalli S, Morova T, Ozturan D, Hu E, Yu IPL, Linder S, Hoogstraat M, Stelloo S, Sar F, van der Poel H, Altintas UB, Saffarzadeh M, Le Bihan S, McConeghy B, Gokbayrak B, Feng FY, Gleave ME, Bergman AM, Collins C, Hach F, Zwart W, Emberly E, Lack NA. Huang CF, et al. Genome Biol. 2021 May 11;22(1):149. doi: 10.1186/s13059-021-02339-6. Genome Biol. 2021. PMID: 33975627 Free PMC article. - Expression and methylation data from SLE patient and healthy control blood samples subdivided with respect to ARID3a levels.
Ward JM, Ratliff ML, Dozmorov MG, Wiley G, Guthridge JM, Gaffney PM, James JA, Webb CF. Ward JM, et al. Data Brief. 2016 Aug 31;9:213-9. doi: 10.1016/j.dib.2016.08.049. eCollection 2016 Dec. Data Brief. 2016. PMID: 27656675 Free PMC article. - The Ruby UCSC API: accessing the UCSC genome database using Ruby.
Mishima H, Aerts J, Katayama T, Bonnal RJ, Yoshiura K. Mishima H, et al. BMC Bioinformatics. 2012 Sep 21;13:240. doi: 10.1186/1471-2105-13-240. BMC Bioinformatics. 2012. PMID: 22994508 Free PMC article. - Considerations for creating and annotating the budding yeast Genome Map at SGD: a progress report.
Chan ET, Cherry JM. Chan ET, et al. Database (Oxford). 2012 Mar 20;2012:bar057. doi: 10.1093/database/bar057. Print 2012. Database (Oxford). 2012. PMID: 22434826 Free PMC article.
References
- Alekseyenko AV, Lee CJ. Nested containment list (NCList): a new algorithm for accelerating interval query of genome alignment and interval databases. Bioinformatics. 2007;23:1386–1393. - PubMed
- Guttman A. R-Trees: a dynamic index structure for spatial searching. Proceedings of 1984 ACM SIGMOD International Conference on Management of Data. 1984:47–57.
- Kent WJ, Brumbaugh H. autoSql and autoXml: code generators from the Genome Project. Linux J. 2002;99:68–77.
Publication types
MeSH terms
Grants and funding
- 5P41HG002371-09/HG/NHGRI NIH HHS/United States
- HHMI/Howard Hughes Medical Institute/United States
- P41 HG002371/HG/NHGRI NIH HHS/United States
- U41 HG004568/HG/NHGRI NIH HHS/United States
- 5U41HG004568-02/HG/NHGRI NIH HHS/United States
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials