The UCSC Genome Browser database: update 2011 (original) (raw)
Journal Article
,
1 Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz (UCSC), Santa Cruz, CA 95064, 2 Center for Comparative Genomics and Bioinformatics, Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, PA 16802, USA, 3 Centre for Genomic Regulation (CRG), Barcelona, Spain and 4 Howard Hughes Medical Institute, UCSC, Santa Cruz, CA 95064, USA
*To whom correspondence should be addressed. Tel: +1 831 459 1477 ; Fax:
+1 831 459 1809
; Email: pauline@soe.ucsc.edu
Search for other works by this author on:
,
1 Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz (UCSC), Santa Cruz, CA 95064, 2 Center for Comparative Genomics and Bioinformatics, Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, PA 16802, USA, 3 Centre for Genomic Regulation (CRG), Barcelona, Spain and 4 Howard Hughes Medical Institute, UCSC, Santa Cruz, CA 95064, USA
Search for other works by this author on:
,
1 Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz (UCSC), Santa Cruz, CA 95064, 2 Center for Comparative Genomics and Bioinformatics, Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, PA 16802, USA, 3 Centre for Genomic Regulation (CRG), Barcelona, Spain and 4 Howard Hughes Medical Institute, UCSC, Santa Cruz, CA 95064, USA
Search for other works by this author on:
,
1 Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz (UCSC), Santa Cruz, CA 95064, 2 Center for Comparative Genomics and Bioinformatics, Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, PA 16802, USA, 3 Centre for Genomic Regulation (CRG), Barcelona, Spain and 4 Howard Hughes Medical Institute, UCSC, Santa Cruz, CA 95064, USA
Search for other works by this author on:
,
1 Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz (UCSC), Santa Cruz, CA 95064, 2 Center for Comparative Genomics and Bioinformatics, Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, PA 16802, USA, 3 Centre for Genomic Regulation (CRG), Barcelona, Spain and 4 Howard Hughes Medical Institute, UCSC, Santa Cruz, CA 95064, USA
Search for other works by this author on:
,
1 Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz (UCSC), Santa Cruz, CA 95064, 2 Center for Comparative Genomics and Bioinformatics, Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, PA 16802, USA, 3 Centre for Genomic Regulation (CRG), Barcelona, Spain and 4 Howard Hughes Medical Institute, UCSC, Santa Cruz, CA 95064, USA
Search for other works by this author on:
,
1 Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz (UCSC), Santa Cruz, CA 95064, 2 Center for Comparative Genomics and Bioinformatics, Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, PA 16802, USA, 3 Centre for Genomic Regulation (CRG), Barcelona, Spain and 4 Howard Hughes Medical Institute, UCSC, Santa Cruz, CA 95064, USA
Search for other works by this author on:
,
1 Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz (UCSC), Santa Cruz, CA 95064, 2 Center for Comparative Genomics and Bioinformatics, Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, PA 16802, USA, 3 Centre for Genomic Regulation (CRG), Barcelona, Spain and 4 Howard Hughes Medical Institute, UCSC, Santa Cruz, CA 95064, USA
Search for other works by this author on:
,
1 Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz (UCSC), Santa Cruz, CA 95064, 2 Center for Comparative Genomics and Bioinformatics, Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, PA 16802, USA, 3 Centre for Genomic Regulation (CRG), Barcelona, Spain and 4 Howard Hughes Medical Institute, UCSC, Santa Cruz, CA 95064, USA
Search for other works by this author on:
,
1 Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz (UCSC), Santa Cruz, CA 95064, 2 Center for Comparative Genomics and Bioinformatics, Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, PA 16802, USA, 3 Centre for Genomic Regulation (CRG), Barcelona, Spain and 4 Howard Hughes Medical Institute, UCSC, Santa Cruz, CA 95064, USA
Search for other works by this author on:
Received:
15 September 2010
Accepted:
30 September 2010
Published:
18 October 2010
Cite
Pauline A. Fujita, Brooke Rhead, Ann S. Zweig, Angie S. Hinrichs, Donna Karolchik, Melissa S. Cline, Mary Goldman, Galt P. Barber, Hiram Clawson, Antonio Coelho, Mark Diekhans, Timothy R. Dreszer, Belinda M. Giardine, Rachel A. Harte, Jennifer Hillman-Jackson, Fan Hsu, Vanessa Kirkup, Robert M. Kuhn, Katrina Learned, Chin H. Li, Laurence R. Meyer, Andy Pohl, Brian J. Raney, Kate R. Rosenbloom, Kayla E. Smith, David Haussler, W. James Kent, The UCSC Genome Browser database: update 2011, Nucleic Acids Research, Volume 39, Issue suppl_1, 1 January 2011, Pages D876–D882, https://doi.org/10.1093/nar/gkq963
Close
Navbar Search Filter Mobile Enter search term Search
Abstract
The University of California, Santa Cruz Genome Browser ( http://genome.ucsc.edu ) offers online access to a database of genomic sequence and annotation data for a wide variety of organisms. The Browser also has many tools for visualizing, comparing and analyzing both publicly available and user-generated genomic data sets, aligning sequences and uploading user data. Among the features released this year are a gene search tool and annotation track drag-reorder functionality as well as support for BAM and BigWig/BigBed file formats. New display enhancements include overlay of multiple wiggle tracks through use of transparent coloring, options for displaying transformed wiggle data, a ‘mean+whiskers’ windowing function for display of wiggle data at high zoom levels, and more color schemes for microarray data. New data highlights include seven new genome assemblies, a Neandertal genome data portal, phenotype and disease association data, a human RNA editing track, and a zebrafish Conservation track. We also describe updates to existing tracks.
INTRODUCTION
The University of California, Santa Cruz (UCSC) Genome Browser provides online access to sequence and annotation data for the human genome and those of several other species ( 1 , 2 ). The level of annotation differs among species, with recent assemblies of the human genome being the most richly annotated. The Genome Browser contains mapping and sequencing annotation tracks describing assembly, gap and GC percent details for all assemblies. Most organisms also have tracks containing alignments of RefSeq genes ( 3 , 4 ), mRNAs and ESTs from GenBank ( 5 ) as well as gene and gene prediction tracks such as Ensembl Genes ( 6 ). UCSC Genes, a gene prediction track generated at UCSC that is based on data from RefSeq, GenBank, CCDS and UniProt ( 2 , 7 , 8 ), is present for the most recent human and mouse assemblies. Most organisms also have comparative genomic tracks showing pairwise genomic alignments between assemblies. Roughly half the organisms hosted in the browser have a multiple sequence alignment track (multiz) ( 9 ). Expression, regulation, variation and phenotype tracks are available for many organisms. Track descriptions can be accessed by clicking on a track item, the track title or the vertical bar to the left of the track in the image. Links to the corresponding locations on the NCBI Map Viewer ( 10 ) and Ensembl ( 6 ) genome browsers are also provided.
UCSC hosts the Data Coordination Center for the Encyclopedia of DNA Elements (ENCODE) project, using the Genome Browser website as its primary data portal ( 11–13 ). Genome-wide production phase data were initially published on the hg18 (NCBI build 36) human assembly and are currently being migrated to the hg19 (Genome Reference Consortium GRCh37) browser. (For more detail see Raney et al. in this issue.)
The Genome Browser includes many tools for visualizing and analyzing genomic data. Sequence data can be retrieved via the ‘Get DNA’ utility, the Table Browser ( 14 ) or direct download (see below). The Table Browser also serves as a tool for retrieving and exploring Genome Browser data through filtering, intersecting and correlating the underlying database tables. Output from the Table Browser can also be sent to other tools such as Galaxy ( 15 ) or GREAT (see ‘New features’ section) for subsequent analysis. The BLAT ( 16 ) and in silico PCR tools align sequences to genomes available in the browser. The LiftOver utility, available as both a web interface (at http://genome.ucsc.edu/cgi-bin/hgLiftOver ) and a command-line executable program (from http://hgdownload.cse.ucsc.ed/admin/exe/ ) translates genomic coordinates between assemblies. The Gene Sorter ( 17 ) allows users to explore the relationships between genes by comparing expression profiles, protein homology and other useful metrics of similarity. The Proteome Browser displays protein properties and sequence data as tracks and histograms ( 18 ). VisiGene ( 19 ) is a searchable database of Xenopus and mouse in situ images showing cytology and expression patterns. Users can upload and view their data in the context of the other browser tracks using the custom tracks tool ( 8 ). Once uploaded, custom track data can be manipulated using any of the standard Genome Browser functionalities including the Table Browser. Track display configurations can also be saved and shared using the Sessions tool ( 8 ). Finally, Genome Graphs displays hosted tracks and user-generated custom tracks in the context of a genome-wide view.
Bulk downloads of sequence and annotation data and Genome Browser source code can be found at http://hgdownload.cse.ucsc.edu/ . The source includes the browser bioinformatic command-line utilities ( http://genomewiki.ucsc.edu/index.php/Kent_source_utilities ). Instructions for setting up a mirror are available at http://genome.ucsc.edu/admin/mirror.html . Assemblies and data of interest can also be mirrored selectively (see http://genomewiki.ucsc.edu/index.php/Minimal_Browser_Installation ).
NEW FEATURES
Gene search
The gene search box takes a user directly to the UCSC/Known Genes or RefSeq record associated with a gene of interest, bypassing the default search of the entire database (see Figure 1 ). After two or more characters are entered, the search box suggests gene names, and upon selection of a particular gene, the gene’s coordinates appear in the position/search bar. In cases where the gene has several isoforms, the gene region is immediately displayed rather than requiring the user to first select a particular isoform, thus eliminating an extra navigation step.
Figure 1.
Genome Browser display on the hg19 human assembly showing the gene search box in use. After two or more characters are typed, the software suggests possible matching gene names. Tracks shown in this image, from top to bottom: base position, GNF Atlas 2 (with the red/blue on yellow background microarray color option enabled), SNPs (131) and Genome Variants (which includes five new datasets).
Drag-reorder
Tracks within the browser image can now be reordered more easily by clicking on the label or vertical bar to the left of the track and dragging it to a new position within the image. If the track is a member of a composite track, hovering over the bar will cause the bars of all related subtracks to turn blue, making it easier to distinguish the reordered subtracks that belong to a single composite track. Tracks can be restored to their default order via a button below the track image (see Figure 2 ).
Figure 2.
Genome Browser image on the hg18 human assembly showing the UCSC Genes, Conservation and Neandertal tracks (Human-Chimp coding differences, regions with the 5% lowest S, SNPs used to calculate S and alignments of Neandertal sequence reads). The Vi33.6 sequence read subtrack highlighted in green is being vertically dragged to a new position. Note that hovering the mouse over any component subtrack causes the vertical bars to the left of all related subtracks to turn blue.
BigBed/BigWig and BAM file support
In late 2009 we introduced two new file formats for very large data sets, BigBed and BigWig ( 20 ), and have continued to add support for the display of these files as built-in tracks and custom tracks. BigWig and BigBed files are compressed binary indexed files containing data at several resolutions that allow the high-performance display of next-generation sequencing experiment results in the Genome Browser. A big advantage of these file formats is that only the portions of the files needed to display a particular region are transferred to UCSC, enabling fast remote access to large distributed data sets.
We have also introduced support for the Binary Alignment/Map (BAM) file format in custom tracks and in multi-view composite tracks. BAM is the compressed binary version of the Sequence Alignment/Map (SAM) format ( 21 ), a compact and indexable representation of nucleotide sequence alignments. BAM file format employs an architecture similar to BigWig/BigBed files and thus segments of the BAM file are transmitted as needed to display the current browser view, unlike PSL and other human-readable alignments formats. This makes it possible to load very large BAM files as custom tracks in situations where the file size would preclude upload in other file formats. BAM custom tracks enable the display of high-coverage sequencing read alignments from the 1000 Genomes Project ( http://www.1000genomes.org/ ), other sequencing projects, and the underlying data from which SNPs and CNVs were called. (See the Neandertal Sequence Reads track in Figure 2 for an example of the BAM track display.)
Display enhancements
We have augmented the Genome Browser’s wiggle and microarray track display functionalities. Log-transformed wiggle data values may now be viewed in the browser, and we have also added a new windowing function for viewing wiggle data at zoomed-out levels. When a zoom-level is too large to show individual data values, the values must be combined to produce a plot point. With the ‘mean + whiskers’ function, it is possible to simultaneously view the mean data value overlaid with measures of its central tendency. The mean appears in a dark shade, 1 standard deviation around the mean in a medium shade, and the maximum/minimum in a light shade. Another new display feature is the transparent, multicolored overlay of multiple wiggles for some tracks (for an example, see Raney et al. in this issue). Standard and custom microarray tracks can be viewed in one of five combinations of red, green, blue, and yellow by selecting a scheme on the track details page ( Figure 1 ) or by specifying an ‘expColor’ value in the custom track’s settings (see http://genomewiki.cse.ucsc.edu/index.php/Microarray_track#Microarray_Custom_Tracks ).
Table Browser support for external software
This year the Table Browser benefited from an addition that allows users to send genomic region data to the Genomic Regions Enrichment of Annotations Tool (GREAT) ( 22 ). Given a set of genomic regions, such as segments of DNA selected through ChIP-Seq experiments, GREAT analyzes the cis -regulatory patterns in these regions and assesses their functional significance. GREAT users can also create UCSC custom tracks from these term-enriched subsets of genomic regions.
NEW DATA
We constantly add new annotation tracks and update existing tracks in the Genome Browser. Tracks that were released this year as a part of the ENCODE project are described in a separate publication. (See Raney et al. in this issue for a more information.)
Neandertal data portal
In May 2010 we released a group of tracks on the hg18 human browser and the panTro2 chimpanzee browser to accompany the initial publication of the Neandertal genome ( 23 ) (see Figure 2 ). Both the human and chimpanzee browsers display alignments of Neandertal sequence reads and assembled contig sequences, and the human browser also offers human-chimp coding differences, a selective sweep scan (S) score, regions with the 5% lowest S score, SNPs used to calculate S score, and Neandertal mitochondrial sequence from a prior publication ( 24 ). These tracks can be viewed in the human and chimpanzee browsers or accessed through the Neandertal portal page ( http://genome.ucsc.edu/Neandertal/ ), which also provides links to download the associated tables and data files.
Phenotype and disease association data
In the past year we released two new human phenotype and disease association tracks. The first is based on DECIPHER, a database of submicroscopic chromosomal imbalances based on clinical information about chromosomal microdeletions/duplications/insertions, translocations and inversions ( 25 ). This track shows genomic regions of reported cases and their associated phenotype information. The second track displays SNPs from the Catalog of Published Genome-Wide Association Studies ( http://www.genome.gov/gwastudies ), a curated and regularly updated collection of SNPs identified by published studies attempting to assay at least 100 000 SNPs ( 26 ).
Human RNA editing
We have added an RNA editing track on the human (hg18) assembly based on the DARNED database ( 27 ), a catalog of RNA sequences that are edited after transcription, along with their corresponding genomic coordinates. Only post-transcriptional editing that results in small changes to the identity of a nucleic acid are included in this track; it does not include other RNA processing such as splicing or methylation. The data were obtained from several research papers on RNA editing and were mapped to the human reference genome.
New assemblies
Since September 2009 we have updated the genome assemblies for marmoset, tetraodon, zebrafish and cat. We have also added new browsers for pig, European rabbit, giant panda, African savannah elephant and California sea hare. Each browser contains the baseline set of tracks and an additional complement of comparative genomic and other annotation tracks.
New tracks in other organisms
The chicken browser now features a track displaying the alignment of California condor ( Gymnogyps californianus ) transcripts (sequenced using 454 high-throughput DNA sequencing) to the galGal3 chicken genome. The condor read sequences were obtained from the NCBI Trace Archives ( 28 ). We have also released a Conservation track for the danRer6 zebrafish assembly showing multiple alignments of six vertebrate species and measurements of evolutionary conservation using phastCons from the PHAST package. Conserved elements identified by phastCons are displayed in the companion ‘Most Conserved’ track.
Updates to existing tracks and assemblies
Many Genome Browser tracks are updated regularly. The Database of Genomic Variants (DGV) ( 29 , 30 ) tracks, which detail genomic variants found among healthy human individuals, were updated to version 9 in the hg17 and hg18 assemblies and added to the hg19 assembly browser. The ORFeome Clones tracks, which show alignments of clones from the ORFeome Collaboration ( 31 ), were also updated for human assemblies. The human Genome Variants tracks were augmented to include Korean (SJK) ( 32 ) and 1000 Genomes high-coverage pilot individuals (NA12878, NA12891, NA12892 and NA19240) ( Figure 1 ).
A number of annotations present on the human assembly hg18 were added to the new hg19 assembly, most notably tracks showing UCSC Genes and conservation. New in hg19 is the SNP track based on dbSNP build 131 ( 33 ) ( Figure 1 ). The UCSC Genes track is a moderately conservative set of gene predictions based on data from RefSeq, Genbank, CCDS and UniProt. The Conservation track shows multiple alignments of 46 vertebrate species and measurements of evolutionary conservation using two methods (phastCons and phyloP) from the PHAST package for all vertebrate species, as well as primate and placental mammal subsets. The SNP track contains over 26 million mappings of more than 23 million reference SNPs that have been mapped to the reference genome by dbSNP. This represents a significant increase from the provisional hg19 mappings of build 130 ( 33 ). As we continue to migrate the bulk of our hg18 annotation tracks to the hg19 assembly, we encourage our contributors to submit hg19-based data sets for inclusion in this effort.
Tracks that are regularly updated on the mouse browsers include the International Gene Trap Consortium (IGTC) tracks ( 34 ) (updated monthly), the Mouse Genome Informatics MGI tracks ( 35 ), which show quantitative trait loci, phenotypes and alleles, and the IKMC Genes tracks ( 36 ), which show the genes targeted by the International Knockout Mouse Consortium for generating mouse embryonic stem cells containing a null mutation in every gene in the mouse genome.
Some of our regularly updated tracks appear on multiple browsers. These include the Consensus Coding Sequence (CCDS) ( 37 ) tracks, which were updated on the human and mouse genomes, the Mammalian Gene Collection (MGC) tracks ( 38 ) on the human, mouse, rat, cow and frog genomes, and the Ensembl Genes tracks ( 6 ), available on approximately 25 different organisms. RefSeq and mRNA tracks, which display aligned sequences from all organisms in GenBank ( 5 ), are updated nightly, and EST tracks are updated weekly.
FUTURE DIRECTIONS
We plan to incorporate several new features as well as exciting new variation and medical genomics data over the next year. We will also continue to add new and updated vertebrate and other selected model organism assemblies that have been deposited into GenBank. (Only assemblies registered and deposited at NCBI will be considered for hosting at UCSC, as stipulated in the Browser Genome Release Agreement instituted by NCBI, Ensembl, and UCSC.)
By late 2010 we plan to release a utility that enables users to quickly search track names and descriptions. This tool will provide both simple and advanced search interfaces, with the advanced interface allowing users to further refine their search criteria and search the metadata associated with ENCODE tracks (e.g. cell line, transcription factor, stage, etc.). Also by the end of 2010, users will be able to quickly access configuration and navigation shortcuts on the Genome Browser image by right-clicking on the vertical bar to the left of a track.
We are developing data hub support that will make it possible to view user-supplied data (such as BigWig, BigBed and BAM files) with the more sophisticated track display options currently used on other UCSC tracks such as composite tracks. We are also working on several improvements to the display of BAM files such as filtering by flag and density-wiggle view. We plan to enable data extraction from BAM file-based tracks via the Table Browser.
We anticipate adding a number of new variation tracks, including data from the 1000 Genomes project as well as from dbVar, a new structural variation database at NCBI. We are currently working on browser display support for data stored in Variant Call Format (VCF; http://1000genomes.org/wiki/doku.php?id=1000_genomes:analysis:vcf4.0 ), a format developed by the 1000 Genomes Project to represent variant data. Additionally we are discussing strategies for distinguishing SNPs annotated as ‘clinically associated’ by dbSNP in our SNP annotations, and looking into other stratifications of these data such as singly mapped versus multiply mapped.
We plan to import additional personal genome variant tracks from the Pennsylvania State University Bioinformatics Genome Browser ( http://main.genome-browser.bx.psu.edu/ ), including updated 1000 Genomes high-coverage trio variants as well as variants from five Khoisan and Bantu genomes ( 39 ) and a Paleo-Eskimo Saqqaq genome ( 40 ).
We intend to add medical genomics data from the International Standards for Cytogenomic Arrays (ISCA) consortium ( 41 ). These data should help clinicians interpret array CGH results by aggregating the results of potentially thousands of cases from clinics all over the world in one place. The data will be released to dbGaP and dbVar at NCBI and then integrated into the browser for display in the context of our other content.
Finally, we plan to offer cloud support for mirrors, providing a Genome Browser image that will enable labs to instantiate a browser for private use without the overhead of a local server ( 42 ) and supporting the simple construction of new Genome Browsers on novel genome sequences.
CONTACTING US
We have two public, moderated mailing lists for user support: genome@soe.ucsc.edu for general questions about the Genome Browser, and genome-mirror@soe.ucsc.edu for questions specific to the setup and maintenance of Genome Browser mirrors. Archives of both lists are searchable from our contacts page at http://genome.ucsc.edu/contacts.html . You may also reach us at genome- www@soe.ucsc.edu , the preferred address for inquiring about mirror site licenses and reporting server errors. Messages sent to this address are not archived in a publicly searchable location.
FUNDING
Grants from the NHGRI (P41HG002371 to G.B., H.C., M.D., P.F., A.H., F.H., D.K., V.K., W.J.K., R.K., C.L., L.M, B.R. and A.Z.; U41HG004568 to M.C., T.D., M.G., F.H., W.J.K., K.L., K.R. and B.R.); NCI (U24CA143858 to F.H. and W.J.K); subcontracts from the NIDCR (U01DE20057 to G.B. and R.K.), NHGRI (P01HG5062 to G.B., W.J.K. and B.R; U54HG004555 to M.D. and R.H.; U41HG004269 to A.H. and W.J.K.; U01HG004695 to W.J.K.); NICHD (RC2HD064525 to H.C., A.H. and R.K.); NIEHS (U01ES017154 to W.J.K). Support from HHMI to D.H. Funding for open access charges: HHMI.
Conflict of interest statement. P.A.F., B.R., A.S.Z., A.S.H., D.K., G.P.B., H.C., M.D., T.R.D., B.M.G., R.A.H., F.H., V.K., R.M.K., K.L., C.H.L., L.R.M., A.P., B.J.R., K.R.R., K.E.S., D.H. and W.J.K. receive royalties from the sale of UCSC Genome Browser source code licenses to commercial entities.
ACKNOWLEDGEMENTS
The authors would like to thank the many data contributors whose work makes the Genome Browser possible, our Scientific Advisory Board for steering our efforts, our users for their consistent support and valuable feedback, and our outstanding team of system administrators: Jorge Garcia, Erich Weiler, Victoria Lin and Alex Wolfe.
REFERENCES
1
et al.
The UCSC Genome Browser Database
,
Nucleic Acids Res.
,
2003
, vol.
31
(pg.
51
-
54
)
2
et al.
The UCSC Genome Browser database: update 2010
,
Nucleic Acids Res.
,
2010
, vol.
38
(pg.
D613
-
D619
)
3
NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins
,
Nucleic Acids Res.
,
2007
, vol.
35
(pg.
D61
-
D65
)
4
NCBI Reference Sequences: current status, policy and new initiatives
,
Nucleic Acids Res.
,
2009
, vol.
37
(pg.
D32
-
D36
)
5
GenBank
,
Nucleic Acids Res.
,
2010
, vol.
38
(pg.
D46
-
D51
)
6
et al.
Ensembl's 10th year
,
Nucleic Acids Res.
,
2010
, vol.
38
(pg.
D557
-
D562
)
7
The UCSC Known Genes
,
Bioinformatics
,
2006
, vol.
22
(pg.
1036
-
1046
)
8
et al.
The UCSC Genome Browser Database: 2008 update
,
Nucleic Acids Res.
,
2008
, vol.
36
(pg.
D773
-
D779
)
9
et al.
Aligning multiple genomic sequences with the threaded blocks et al. gner
,
Genome Res.
,
2004
, vol.
14
(pg.
708
-
715
)
10
et al.
Database resources of the National Center for Biotechnology Information
,
Nucleic Acids Res.
,
2009
, vol.
37
(pg.
D5
-
D15
)
11
The ENCODE Project Consortium
The ENCODE (ENCyclopedia Of DNA Elements) Project
,
Science
,
2004
, vol.
306
(pg.
636
-
640
)
12
et al.
The ENCODE Project at UC Santa Cruz
,
Nucleic Acids Res.
,
2007
, vol.
35
(pg.
D663
-
D667
)
13
et al.
ENCODE whole-genome data in the UCSC Genome Browser
,
Nucleic Acids Res.
,
2010
, vol.
38
(pg.
D620
-
D625
)
14
The UCSC Table Browser data retrieval tool
,
Nucleic Acids Res.
,
2004
, vol.
32
(pg.
D493
-
D496
)
15
et al.
Galaxy: a platform for interactive large-scale genome analysis
,
Genome Res.
,
2005
, vol.
15
(pg.
1451
-
1455
)
16
BLAT–the BLAST-like alignment tool
,
Genome Res.
,
2002
, vol.
12
(pg.
656
-
664
)
17
Exploring relationships and mining data with the UCSC Gene Sorter
,
Genome Res.
,
2005
, vol.
15
(pg.
737
-
741
)
18
The UCSC Proteome Browser
,
Nucleic Acids Res.
,
2005
, vol.
33
(pg.
D454
-
D458
)
19
et al.
The UCSC Genome Browser Database: update 2007
,
Nucleic Acids Res.
,
2007
, vol.
35
(pg.
D668
-
D673
)
20
BigWig and BigBed: enabling browsing of large distributed datasets
,
Bioinformatics
,
2010
, vol.
26
(pg.
2204
-
2207
)
21
and 1000_Genome_Project_Data_Processing_Subgroup (2009) The Sequence alignment/map (SAM) format and SAMtools
,
Bioinformatics
, vol.
25
(pg.
2078
-
2079
)
22
GREAT improves functional interpretation of cis-regulatory regions
,
Nat. Biotechnol.
,
2010
, vol.
28
(pg.
495
-
501
)
23
et al.
A draft sequence of the Neandertal genome
,
Science
,
2010
, vol.
328
(pg.
710
-
722
)
24
et al.
A complete Neandertal mitochondrial genome sequence determined by high-throughput sequencing
,
Cell
,
2008
, vol.
134
(pg.
416
-
426
)
25
DECIPHER: Database of Chromosomal Imbalance and Phenotype in Humans Using Ensembl Resources
,
Am. J. Hum. Genet.
,
2009
, vol.
84
(pg.
524
-
533
)
26
Potential etiologic and functional implications of genome-wide association loci for human diseases and traits
,
Proc. Natl Acad. Sci. USA
,
2009
, vol.
106
(pg.
9362
-
9367
)
27
DARNED: a DAtabase of RNa EDiting in humans
,
Bioinformatics
,
2010
, vol.
26
(pg.
1772
-
1776
)
28
et al.
The value of avian genomics to the conservation of wildlife
,
BMC Genomics
,
2009
, vol.
10
Suppl. 2
pg.
S10
29
Detection of large-scale variation in the human genome
,
Nat. Genet.
,
2004
, vol.
36
(pg.
949
-
951
)
30
Structural variation in the human genome
,
Nat. Rev. Genet.
,
2006
, vol.
7
(pg.
85
-
97
)
31
et al.
hORFeome v3.1: a resource of human open reading frames representing over 10,000 human genes
,
Genomics
,
2007
, vol.
89
(pg.
307
-
315
)
32
et al.
The first Korean genome sequence and analysis: full genome sequencing for a socio-ethnic group
,
Genome Res.
,
2009
, vol.
19
(pg.
1622
-
1629
)
33
dbSNP: the NCBI database of genetic variation
,
Nucleic Acids Res.
,
2001
, vol.
29
(pg.
308
-
311
)
34
et al.
The International Gene Trap Consortium Website: a portal to all publicly available gene trap cell lines in mouse
,
Nucleic Acids Res.
,
2006
, vol.
34
(pg.
D642
-
D648
)
35
and the Mouse Genome Database Group (2008) The Mouse Genome Database (MGD): mouse biology and model systems
,
Nucleic Acids Res.
, vol.
36
(pg.
D724
-
D728
)
36
The Comprehensive Knockout Mouse Project Consortium
, et al.
The Knockout Mouse Project
,
Nat. Genet.
,
2004
, vol.
36
(pg.
921
-
924
)
37
et al.
The consensus coding sequence (CCDS) project: Identifying a common protein-coding gene set for the human and mouse genomes
,
Genome Res.
,
2009
, vol.
18
(pg.
1316
-
1323
)
38
et al.
The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)
,
Genome Res.
,
2004
, vol.
14
(pg.
2121
-
2127
)
39
et al.
Complete Khoisan and Bantu genomes from southern Africa
,
Nature
,
2010
, vol.
463
(pg.
943
-
947
)
40
et al.
Ancient human genome sequence of an extinct Palaeo-Eskimo
,
Nature
,
2010
, vol.
463
(pg.
757
-
762
)
41
et al.
Consensus statement: chromosomal microarray is a first-tier clinical diagnostic test for individuals with developmental disabilities or congenital anomalies
,
Am. J. Hum. Genet.
,
2010
, vol.
86
(pg.
749
-
764
)
42
The case for cloud computing in genome informatics
,
Genome Biol.
,
2010
, vol.
11
pg.
207
© The Author(s) 2010. Published by Oxford University Press.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
I agree to the terms and conditions. You must accept the terms and conditions.
Submit a comment
Name
Affiliations
Comment title
Comment
You have entered an invalid code
Thank you for submitting a comment on this article. Your comment will be reviewed and published at the journal's discretion. Please check for further notifications by email.
Citations
Views
Altmetric
Metrics
Total Views 4,654
3,709 Pageviews
945 PDF Downloads
Since 12/1/2016
Month: | Total Views: |
---|---|
December 2016 | 3 |
January 2017 | 4 |
February 2017 | 23 |
March 2017 | 22 |
April 2017 | 21 |
May 2017 | 18 |
June 2017 | 13 |
July 2017 | 18 |
August 2017 | 23 |
September 2017 | 34 |
October 2017 | 15 |
November 2017 | 11 |
December 2017 | 50 |
January 2018 | 53 |
February 2018 | 52 |
March 2018 | 57 |
April 2018 | 37 |
May 2018 | 57 |
June 2018 | 50 |
July 2018 | 41 |
August 2018 | 56 |
September 2018 | 48 |
October 2018 | 51 |
November 2018 | 44 |
December 2018 | 46 |
January 2019 | 60 |
February 2019 | 49 |
March 2019 | 45 |
April 2019 | 70 |
May 2019 | 63 |
June 2019 | 63 |
July 2019 | 70 |
August 2019 | 69 |
September 2019 | 72 |
October 2019 | 50 |
November 2019 | 53 |
December 2019 | 35 |
January 2020 | 38 |
February 2020 | 35 |
March 2020 | 45 |
April 2020 | 23 |
May 2020 | 19 |
June 2020 | 51 |
July 2020 | 67 |
August 2020 | 47 |
September 2020 | 29 |
October 2020 | 46 |
November 2020 | 57 |
December 2020 | 62 |
January 2021 | 57 |
February 2021 | 61 |
March 2021 | 93 |
April 2021 | 62 |
May 2021 | 75 |
June 2021 | 67 |
July 2021 | 58 |
August 2021 | 49 |
September 2021 | 64 |
October 2021 | 60 |
November 2021 | 40 |
December 2021 | 36 |
January 2022 | 54 |
February 2022 | 42 |
March 2022 | 47 |
April 2022 | 48 |
May 2022 | 65 |
June 2022 | 48 |
July 2022 | 44 |
August 2022 | 43 |
September 2022 | 63 |
October 2022 | 54 |
November 2022 | 55 |
December 2022 | 54 |
January 2023 | 32 |
February 2023 | 42 |
March 2023 | 59 |
April 2023 | 48 |
May 2023 | 67 |
June 2023 | 29 |
July 2023 | 61 |
August 2023 | 20 |
September 2023 | 46 |
October 2023 | 40 |
November 2023 | 33 |
December 2023 | 93 |
January 2024 | 80 |
February 2024 | 109 |
March 2024 | 135 |
April 2024 | 57 |
May 2024 | 45 |
June 2024 | 26 |
July 2024 | 49 |
August 2024 | 60 |
September 2024 | 75 |
October 2024 | 44 |
Citations
771 Web of Science
×
Email alerts
Citing articles via
More from Oxford Academic