BigWig and BigBed: enabling browsing of large distributed datasets - PubMed (original) (raw)

BigWig and BigBed: enabling browsing of large distributed datasets

W J Kent et al. Bioinformatics. 2010.

Abstract

Summary: BigWig and BigBed files are compressed binary indexed files containing data at several resolutions that allow the high-performance display of next-generation sequencing experiment results in the UCSC Genome Browser. The visualization is implemented using a multi-layered software approach that takes advantage of specific capabilities of web-based protocols and Linux and UNIX operating systems files, R trees and various indexing and compression tricks. As a result, only the data needed to support the current browser view is transmitted rather than the entire file, enabling fast remote access to large distributed data sets.

Availability and implementation: Binaries for the BigWig and BigBed creation and parsing utilities may be downloaded at http://hgdownload.cse.ucsc.edu/admin/exe/linux.x86\_64/. Source code for the creation and visualization software is freely available for non-commercial use at http://hgdownload.cse.ucsc.edu/admin/jksrc.zip, implemented in C and supported on Linux. The UCSC Genome Browser is available at http://genome.ucsc.edu.

PubMed Disclaimer

Figures

Fig. 1.

Fig. 1.

Genome Browser image of BigWig annotation tracks. The top track is displayed as a bar graph, the bottom track as a point graph. Shading is used to distinguish the mean (dark), one standard deviation above the mean (medium) and the maximum (light). Peaks with clipped tops are colored magenta.

Similar articles

Cited by

References

    1. Alekseyenko AV, Lee CJ. Nested containment list (NCList): a new algorithm for accelerating interval query of genome alignment and interval databases. Bioinformatics. 2007;23:1386–1393. - PubMed
    1. Guttman A. R-Trees: a dynamic index structure for spatial searching. Proceedings of 1984 ACM SIGMOD International Conference on Management of Data. 1984:47–57.
    1. Kent WJ, Brumbaugh H. autoSql and autoXml: code generators from the Genome Project. Linux J. 2002;99:68–77.
    1. Kent WJ, et al. The human genome browser at UCSC. Genome Res. 2002;12:996–1006. - PMC - PubMed
    1. Li H, et al. 1000 Genome Project Data Processing Subgroup The Sequence Alignment/Map (SAM) Format and SAMtools. Bioinformatics. 2009;25:2078–2079. - PMC - PubMed

Publication types

MeSH terms

Grants and funding

LinkOut - more resources