CORG: a database for COmparative Regulatory Genomics - PubMed (original) (raw)

CORG: a database for COmparative Regulatory Genomics

C Dieterich et al. Nucleic Acids Res. 2003.

Abstract

Sequence conservation in non-coding, upstream regions of orthologous genes from man and mouse is likely to reflect common regulatory DNA sites. Motivated by this assumption we have delineated a catalogue of conserved non-coding sequence blocks and provide the CORG-'COmparative Regulatory Genomics'-database. The data were computed based on statistically significant local suboptimal alignments of 15 kb regions upstream of the translation start sites of, currently, 10 793 pairs of orthologous genes. The resulting conserved non-coding blocks were annotated with EST matches for easier detection of non-coding mRNA and with hits to known transcription factor binding sites. CORG data are accessible from the ENSEMBL web site via a DAS service as well as a specially developed web service (http://corg.molgen.mpg.de) for query and interactive visualization of the conserved blocks and their annotation.

PubMed Disclaimer

Figures

Figure 1

Figure 1

Schematic depiction of linking DAS tracks and comparative display in CORG. The upper half of the figure shows a cutout from the ENSEMBL ContigView that is centered on the upstream region of the human IL6 gene. Two DAS tracks showing positions of human EPD promoters and CORG conserved non-coding blocks are highlighted in this context. The user can easily jump to the comparative display that is shown in the lower half of the figure by clicking on any CNB feature. The left bar of the applet window is the legend or key to what is displayed in the window center. Human CNBs are shown in light gray, whereas mouse CNBs are displayed in orange. Putative transcription factor binding sites appear either in blue (human genome) or yellow (mouse genome). Each individual track can be activated or deactivated by clicking on the corresponding legend symbol. Each legend symbol has a lever assigned to it. This lever can be used to adjust the proportion of sequence features displayed for a particular track. Less important sequence features (e.g. low-scoring alignments, low-quality binding sites) will start to disappear first as the lever is moved to lower percentage values. The area below the central display lets you control the zoom level of the display and the exact clipping down to the nucleotide level. Floating menus (not shown) providing links to additional information appear as the mouse is clicked over any feature. Selecting the option ‘Alignment view’ from a floating menu opened the little window on the right. In this window, information on different alignment properties like number, ranking according to score, overall proportion of matches and so on is displayed.

References

    1. Morgenstern B., Rinner,O., Abdeddaim,S., Haase,D., Mayer,K.F., Dress,A.W. and Mewes,H.W. (2002) Exon discovery by genomic sequence alignment. Bioinformatics, 18, 777–787. - PubMed
    1. Wiehe T., Gebauer-Jung,S., Mitchell-Olds,T. and Guigo,R. (2001) SGP-1: prediction and validation of homologous genes based on sequence alignments. Genome Res., 11, 1574–1583. - PMC - PubMed
    1. Loots G.G., Ovcharenko,I., Pachter,L., Dubchak,I. and Rubin,E.M. (2002) rVista for comparative sequence-based discovery of functional transcription factor binding. Genome Res., 12, 832–839. - PMC - PubMed
    1. Gottgens B., Barton,L.M., Chapman,M.A., Sinclair,A.M., Knudsen,B., Grafham,D., Gilbert,J.G., Rogers,J., Bentley,D.R. and Green,A.R. (2002) Transcriptional regulation of the stem cell leukemia gene (SCL)-comparative analysis of five vertebrate SCL loci. Genome Res., 12, 749–759. - PMC - PubMed
    1. Hardison R.C. Conserved noncoding sequences are reliable guides to regulatory elements. (2000) Trends Genet., 16, 369–372. - PubMed

MeSH terms

LinkOut - more resources