dictyBase, the model organism database for Dictyostelium discoideum (original) (raw)

Journal Article

,

dictyBase, Center for Genetic Medicine, Feinberg School of Medicine, Northwestern University Chicago, IL 60611, USA

Search for other works by this author on:

,

dictyBase, Center for Genetic Medicine, Feinberg School of Medicine, Northwestern University Chicago, IL 60611, USA

Search for other works by this author on:

,

dictyBase, Center for Genetic Medicine, Feinberg School of Medicine, Northwestern University Chicago, IL 60611, USA

Search for other works by this author on:

,

dictyBase, Center for Genetic Medicine, Feinberg School of Medicine, Northwestern University Chicago, IL 60611, USA

Search for other works by this author on:

,

dictyBase, Center for Genetic Medicine, Feinberg School of Medicine, Northwestern University Chicago, IL 60611, USA

Search for other works by this author on:

,

dictyBase, Center for Genetic Medicine, Feinberg School of Medicine, Northwestern University Chicago, IL 60611, USA

Search for other works by this author on:

dictyBase, Center for Genetic Medicine, Feinberg School of Medicine, Northwestern University Chicago, IL 60611, USA

Search for other works by this author on:

Received:

14 September 2005

Revision received:

13 October 2005

Accepted:

13 October 2005

Published:

01 January 2006

Cite

Rex L. Chisholm, Pascale Gaudet, Eric M. Just, Karen E. Pilcher, Petra Fey, Sohel N. Merchant, Warren A. Kibbe, dictyBase, the model organism database for Dictyostelium discoideum, Nucleic Acids Research, Volume 34, Issue suppl_1, 1 January 2006, Pages D423–D427, https://doi.org/10.1093/nar/gkj090
Close

Navbar Search Filter Mobile Enter search term Search

Abstract

dictyBase (http://dictybase.org) is the model organism database (MOD) for the social amoeba Dictyostelium discoideum. The unique biology and phylogenetic position of Dictyostelium offer a great opportunity to gain knowledge of processes not characterized in other organisms. The recent completion of the 34 MB genome sequence, together with the sizable scientific literature using Dictyostelium as a research organism, provided the necessary tools to create a well-annotated genome. dictyBase has leveraged software developed by the Saccharomyces Genome Database and the Generic Model Organism Database project. This has reduced the time required to develop a full-featured MOD and greatly facilitated our ability to focus on annotation and providing new functionality. We hope that manual curation of the Dictyostelium genome will facilitate the annotation of other genomes.

INTRODUCTION

dictyBase (http://dictybase.org) is the model organism database (MOD) for the social amoeba Dictyostelium discoideum. Like other MODs such as FlyBase (1), WormBase (2,3), Mouse Genome Informatics (4) and the Saccharomyces Genome Database (SGD) (5), dictyBase (6) uses the organism's genome sequence to organize the biological knowledge resulting from experimental studies using Dictyostelium. Dictyostelium is a model organism widely used for biomedical research. An amoeba during most of its life, starvation induces a very interesting developmental program during which individual cells stream together by chemotaxis to form a multicellular tissue (7). A morphogenetic process involving cell migration and cellular morphogenesis transforms a simple mound of cells into a slug or pseudoplasmodium establishing a relatively simple developmental pattern. This structure then develops into a fruiting body consisting of multiple cell types including spores and terminally differentiated stalk cells. These features together with the efficient genetic manipulation by gene targeting and replacement as well as insertional mutagenesis and suppressor screens have made Dictyostelium a popular experimental system. Research using Dictyostelium has been critical in understanding fundamental processes such as cell migration (8), cell signaling (9,10), phagocytosis and morphogenesis. Recently Dictyostelium has also been used to help establish the mechanisms of action of medically important drugs such as cisplatin (1114), used to treat various cancers, and lithium and valproic acid (15), used to treat depressive disorders. The recently completed Dictyostelium genome sequence is housed at dictyBase and has been integrated with other experimental data to provide investigators with a rich new resource that will facilitate future studies using Dictyostelium experimentally and for comparative genomics.

GENOME SEQUENCE COMPLETED

In May 2005 the sequence of the complete genome of Dictyostelium was reported (16). The A + T rich genome (77.6%) consists of 34 million bases of DNA sequence that are predicted to encode 13 573 genes—a number of genes comparable with Drosophila. dictyBase houses and maintains the genome sequence produced by the International Sequencing Consortium. As the first free living protozoan to be completely sequenced, the predicted proteome supports the hypothesis that Dictyostelium represents an early branch in the Eukaryotic Tree of Life that diverged after the split between animals, plants and fungi, with Dictyostelium and other amoebae more closely related to animals. Many Dictyostelium proteins are more similar to human orthologs than are those of Saccharomyces cerevisiae. The genome has several notable features. The sequence is very gene dense with an average gene spacing of 2.5 kb. The abundance of short sequence repeats produces unusual runs of amino acids, such as polyasparagine and polyglutamine tracts of 20 residues or more. These amino acid repeats occur in >2000 of the predicted proteins. The genome also contains a surprisingly large number of polyketide synthases, which may produce a previously unanticipated large complement of natural products. Consistent with this apparently large secondary metabolism, the genome contains many ABC transporters that may be involved in export of these natural products. Also encoded are a large number of proteins containing multiple EGF repeats that have been postulated to play roles in adhesion or cell recognition.

These fascinating genomic features together with the unique biology of Dictyostelium and the rich body of Dictyostelium literature (nearly 8000 publications) provide a valuable opportunity to annotate many of the genes with functional information that will widen the spectrum of characterized proteins in public databases. This emphasizes the need for a well-curated database for _Dictyostelium_—a need .that dictyBase strives to fulfill.

As the MOD for D.discoideum, dictyBase provides highly curated information that facilitates research by providing a searchable, structured repository of Dictyostelium experimental results. In addition to the complete genome sequence, groups performing high-throughput experiments such as large-scale mutagenesis and microarray-based gene expression studies are depositing their data in dictyBase for integration and distribution to the research community. dictyBase uses the genomic DNA sequence as a scaffold on which to organize and display the biological knowledge and experimental evidence derived from Dictyostelium research. dictyBase is the most comprehensive source of information regarding Dictyostelium, and the database presents the highest quality annotations of Dictyostelium genes. Table 1 presents the data content of dictyBase as of September 2005. Nearly all of the data in dictyBase is available for bulk download through our download center (http://dictybase.org/downloads/).

The heart of dictyBase is the Gene Page. A Gene Page is available for each gene that has been shown or predicted to exist in the genome. The Gene Page collects and organizes all of the available information related to that gene into several sections. Figure 1 shows an example of a typical Gene Page. A General Information section provides the official gene name together with any other names that have been used to refer to that gene. This assures that a search of the database will allow retrieval of a gene using any name that has been used to refer to that gene. Scientific curators verify published and proposed gene names to encourage use of the Dictyostelium Nomenclature Guidelines (http://dictybase.org/nomenclatureguidelines.htm) and to ensure that names are not duplicated. When appropriate, curators communicate with authors to discuss and modify gene names. Also in this section, gene product names and a brief description of the gene product are presented, as is the unique dictyBaseID. Next, the Chromosomal Coordinates section presents a graphical view of the gene and associated sequences such as expressed sequence tags (ESTs), cDNAs and other GenBank entries. An Associated Sequences section provides links to sequences corresponding to each of the sequence features known for that gene, including gene predictions as well as ESTs, cDNAs and genomic sequences submitted by individual researchers to other public databases. The Protein Information section contains the size, molecular weight and protein domains of the predicted protein.

Functional information is presented in sections for Gene Ontology (GO) annotations and phenotypes whenever this information is available. Phenotypic data is represented using a restricted vocabulary that describes the consequences of mutations reported for that gene. The Gene Ontology section lists terms from each of the function, process and component ontologies associated with the gene (17,18). Following links in each of these sections display additional details for these annotations. Clicking on a GO term or a phenotype will list all of the other genes annotated with the same term. The Expression section leads to developmental profiles for the gene that have been generated by large-scale microarray based studies performed at the Baylor College of Medicine and the University of California, San Diego. The Links section on the Gene Page provides links to GenBank, UniProt, GeneDB at the Sanger Institute and, where appropriate, to signaling pathways at the Signal Transduction Knowledge Environment at Science (19,20). This section also links to researchers in our colleague database who are investigating that gene or gene product. A Literature Section lists the most recent publications referring to the gene and a link to a Literature Guide that contains a complete listing of papers relevant to that gene. As of the fall of 2005, >500 Gene Pages had a Summary paragraph written by a dictyBase curator that contains the current knowledge regarding that gene. dictyBase curators are in the process of manually reviewing all of the automated gene predictions in light of available data such as ESTs and cDNAs. Manually reviewed genes are indicated by the presence of a ‘Curated Model’ on the Gene Page. As of September 2005 dictyBase curators have reviewed nearly 2500 genes, which corresponds to ∼20% of the total number of genes.

GENERAL RESOURCES FOR DICTYOSTELIUM

dictyBase also serves as a clearinghouse for resource materials for students, researchers and educators using Dictyostelium in a wide range of classroom and laboratory settings. We maintain archives of the dictyNews, a weekly electronic newsletter that presents abstracts of papers available upon acceptance for publication, as well as a growing collection of protocols for working with Dictyostelium.

DICTY STOCK CENTER (DSC)

dictyBase also provides a direct portal to the DSC maintained at Columbia University. This resource provides access to Dictyostelium strains, including natural isolates, targeted mutants and GFP labeled strains. The collection also contains plasmids used to manipulate gene expression or create targeted gene disruptions. Currently the DSC has >700 strains and >150 plasmids all available to researchers. A shopping cart system allows Stock Center users to add strains and proceed to a checkout system familiar to any user of online ordering systems. For users who are also registered as colleagues in the dictyBase colleagues database, addresses for delivery and contact information are populated directly from the database. dictyBase holds all of the Stock Center data and provides the informatics support for the DSC. This will enable increased integration between gene information and strains available through the DSC.

DICTYBASE INCORPORATES SOFTWARE DEVELOPED BY OTHER MODS AND THE GENERIC MODEL ORGANISM DATABASE (GMOD) PROJECT

dictyBase was initially established as a clone of the software developed and generously provided by SGD. dictyBase developers have subsequently integrated several software packages from the GMOD project (http://gmod.org). These include the GBrowse tool for chromosome and map displays and, most recently, the Chado database schema for storing sequence and sequence feature information. This entailed porting of the Chado schema to Oracle since both dictyBase and SGD are implemented in Oracle. dictyBase has developed new code that separates communication with the database from code that generates the HTML interfaces. This object layer facilitated adoption of the Chado schema without significant impact on the presentation interface and enables good software practices such as unit testing.

CONCLUSION AND FUTURE DIRECTIONS

dictyBase strives to present a comprehensive, reliable, carefully curated dataset that integrates sequences, functional annotation, phenotypes, expression data and the research literature, while retaining the flexibility to integrate new data types as they become available. Our goal is to assure that the information in dictyBase is accessible in an effective and intuitive interface useful for biologists, while also maximizing its utility for the bioinformatics community to use computationally. Our operating principles are to respond to users' needs, to capitalize on the vast efforts of the Dictyostelium research community, and to provide an important link in the network of MODs. As additional amoebae genome sequences become available dictyBase hopes to implement, and where necessary, develop interfaces and tools that facilitate comparative genomics of this diverse group of organisms.

Table 1

Data and annotations in dictyBase (September 2005)

• 13 573 Automated Gene Predictions
• 1387 GenBank records
• 155 032 ESTs
• 6011 PubMed references
• 1131 Colleagues
• 2445 Curated Models
• Nine alternative transcripts
• Gene products for 5228 genes
• Brief descriptions for 1657 genes
• Mutant phenotypes for 322 genes
• GO annotations for 5697 genes
• Summary paragraphs for 502 genes
• External data:
○ 6801 Microarray expression profiles (BCM and UCSD)
In situ hybridization: 150 images (Tsukuba Atlas)
○ Insertional mutants: 817 links (BCM)
• DSC:
○ 728 strains
○ 153 plasmids
• 13 573 Automated Gene Predictions
• 1387 GenBank records
• 155 032 ESTs
• 6011 PubMed references
• 1131 Colleagues
• 2445 Curated Models
• Nine alternative transcripts
• Gene products for 5228 genes
• Brief descriptions for 1657 genes
• Mutant phenotypes for 322 genes
• GO annotations for 5697 genes
• Summary paragraphs for 502 genes
• External data:
○ 6801 Microarray expression profiles (BCM and UCSD)
In situ hybridization: 150 images (Tsukuba Atlas)
○ Insertional mutants: 817 links (BCM)
• DSC:
○ 728 strains
○ 153 plasmids

Table 1

Data and annotations in dictyBase (September 2005)

• 13 573 Automated Gene Predictions
• 1387 GenBank records
• 155 032 ESTs
• 6011 PubMed references
• 1131 Colleagues
• 2445 Curated Models
• Nine alternative transcripts
• Gene products for 5228 genes
• Brief descriptions for 1657 genes
• Mutant phenotypes for 322 genes
• GO annotations for 5697 genes
• Summary paragraphs for 502 genes
• External data:
○ 6801 Microarray expression profiles (BCM and UCSD)
In situ hybridization: 150 images (Tsukuba Atlas)
○ Insertional mutants: 817 links (BCM)
• DSC:
○ 728 strains
○ 153 plasmids
• 13 573 Automated Gene Predictions
• 1387 GenBank records
• 155 032 ESTs
• 6011 PubMed references
• 1131 Colleagues
• 2445 Curated Models
• Nine alternative transcripts
• Gene products for 5228 genes
• Brief descriptions for 1657 genes
• Mutant phenotypes for 322 genes
• GO annotations for 5697 genes
• Summary paragraphs for 502 genes
• External data:
○ 6801 Microarray expression profiles (BCM and UCSD)
In situ hybridization: 150 images (Tsukuba Atlas)
○ Insertional mutants: 817 links (BCM)
• DSC:
○ 728 strains
○ 153 plasmids

A typical dictyBase Gene Page. This sample Gene Page shows the types of information that are displayed on a dictyBase Gene Page.

Figure 1

A typical dictyBase Gene Page. This sample Gene Page shows the types of information that are displayed on a dictyBase Gene Page.

dictyBase is supported by NIH Grants GM64426 and HG0022. Funding to pay the Open Access publication charges for this article was provided by the National Institutes of Health (GM64426).

Conflict of interest statement. None declared.

REFERENCES

1

Drysdale, R.A. and Crosby, M.A.

2005

FlyBase: genes and gene models

Nucleic Acids Res

.

33

D390

–D395

2

Chen, N., Lawson, D., Bradnam, K., Harris, T.W., Stein, L.D.

2004

WormBase as an integrated platform for the C.elegans ORFeome

Genome Res

.

14

2155

–2161

3

Harris, T.W., Chen, N., Cunningham, F., Tello-Ruiz, M., Antoshechkin, I., Bastiani, C., Bieri, T., Blasiar, D., Bradnam, K., Chan, J., et al.

2004

WormBase: a multi-species resource for nematode biology and genomics

Nucleic Acids Res

.

32

D411

–D417

4

Eppig, J.T., Bult, C.J., Kadin, J.A., Richardson, J.E., Blake, J.A., Anagnostopoulos, A., Baldarelli, R.M., Baya, M., Beal, J.S., Bello, S.M., et al.

2005

The Mouse Genome Database (MGD): from genes to mice—a community resource for mouse biology

Nucleic Acids Res

.

33

D471

–D475

5

Christie, K.R., Weng, S., Balakrishnan, R., Costanzo, M.C., Dolinski, K., Dwight, S.S., Engel, S.R., Feierbach, B., Fisk, D.G., Hirschman, J.E., et al.

2004

Saccharomyces Genome Database (SGD) provides tools to identify and analyze sequences from Saccharomyces cerevisiae and related sequences from other organisms

Nucleic Acids Res

.

32

D311

–D314

6

Kreppel, L., Fey, P., Gaudet, P., Just, E., Kibbe, W.A., Chisholm, R.L., Kimmel, A.R.

2004

dictyBase: a new Dictyostelium discoideum genome database

Nucleic Acids Res

.

32

D332

–D333

7

Chisholm, R.L. and Firtel, R.A.

2004

Insights into morphogenesis from a simple developmental system

Nature Rev. Mol. Cell Biol

.

5

531

–541

8

Parent, C.A.

2004

Making all the right moves: chemotaxis in neutrophils and Dictyostelium

Curr. Opin. Cell Biol

.

16

4

–13

9

Van Haastert, P.J. and Devreotes, P.N.

2004

Chemotaxis: signalling the way forward

Nature Rev. Mol. Cell Biol

.

5

626

–634

10

Weeks, G., Gaudet, P., Insall, R.

2005

The Small GTPase Superfamily In Loomis, W.F. and Kuspa, A. (Eds.).

The Dictyostelium Genome

Norfolk, UK Horizon Bioscience pp.

173

–210

11

Li, G., Alexander, H., Schneider, N., Alexander, S.

2000

Molecular basis for resistance to the anticancer drug cisplatin in Dictyostelium

Microbiology

146

2219

–2227

12

Li, Z., Solomon, J.M., Isberg, R.R.

2005

Dictyostelium discoideum strains lacking the RtoA protein are defective for maturation of the Legionella pneumophila replication vacuole

Cell. Microbiol

.

7

431

–442

13

Min, J., Traynor, D., Stegner, A.L., Zhang, L., Hanigan, M.H., Alexander, H., Alexander, S.

2005

Sphingosine kinase regulates the sensitivity of Dictyostelium discoideum cells to the anticancer drug cisplatin

Eukaryotic Cell

4

178

–189

14

Min, J., Van Veldhoven, P.P., Zhang, L., Hanigan, M.H., Alexander, H., Alexander, S.

2005

Sphingosine-l-phosphate lyase regulates sensitivity of human cells to select chemotherapy drugs in a p38-dependent manner

Mol. Cancer Res

.

3

287

–296

15

Eickholt, B.J., Towers, G.J., Ryves, W.J., Eikel, D., Adley, K., Ylinen, L.M., Chadborn, N.H., Harwood, A.J., Nau, H., Williams, R.S.

2005

Effects of valproic acid derivatives on inositol trisphosphate depletion, teratogenicity, glycogen synthase kinase-3beta inhibition, and viral replication: a screening approach for new bipolar disorder drugs derived from the valproic acid core structure

Mol. Pharmacol

.

67

1426

–1433

16

Eichinger, L., Pachebat, J.A., Glockner, G., Rajandream, M.A., Sucgang, R., Berriman, M., Song, J., Olsen, R., Szafranski, K., Xu, Q., et al.

2005

The genome of the social amoeba Dictyostelium discoideum

Nature

435

43

–57

17

Ashburner, M., Ball, C.A., Blake, J.A., Botstein, D., Butler, H., Cherry, J.M., Davis, A.P., Dolinski, K., Dwight, S.S., Eppig, J.T., et al.

2000

Gene ontology: tool for the unification of biology. The Gene Ontology Consortium

Nature Genet

.

25

25

–29

18

Harris, M.A., Clark, J., Ireland, A., Lomax, J., Ashburner, M., Foulger, R., Eilbeck, K., Lewis, S., Marshall, B., Mungall, C., et al.

2004

The Gene Ontology (GO) database and informatics resource

Nucleic Acids Res

.

32

D258

–D261

19

Kimmel, A.R. and Parent, C.A.

2003

The signal to move: D.discoideum go orienteering

Science

300

1525

–1527

20

Gough, N.R.

2002

Science's signal transduction knowledge environment: the connections maps database

Ann. N Y Acad. Sci

.

971

585

–587

© The Author 2006. Published by Oxford University Press. All rights reserved The online version of this article has been published under an open access model. Users are entitled to use, reproduce, disseminate, or display the open access version of this article for non-commercial purposes provided that: the original authorship is properly and fully attributed; the Journal and Oxford University Press are attributed as the original place of publication with the correct citation details given; if an article is subsequently reproduced or disseminated not in its entirety but only in part or as a derivative work this must be clearly indicated. For commercial re-use, please contact journals.permissions@oxfordjournals.org

I agree to the terms and conditions. You must accept the terms and conditions.

Submit a comment

Name

Affiliations

Comment title

Comment

You have entered an invalid code

Thank you for submitting a comment on this article. Your comment will be reviewed and published at the journal's discretion. Please check for further notifications by email.

Citations

Views

Altmetric

Metrics

Total Views 3,695

3,380 Pageviews

315 PDF Downloads

Since 12/1/2016

Month: Total Views:
December 2016 2
January 2017 1
February 2017 10
March 2017 2
April 2017 7
May 2017 7
June 2017 3
July 2017 3
August 2017 4
September 2017 5
October 2017 1
November 2017 7
December 2017 88
January 2018 63
February 2018 88
March 2018 122
April 2018 47
May 2018 49
June 2018 60
July 2018 33
August 2018 39
September 2018 56
October 2018 46
November 2018 55
December 2018 19
January 2019 51
February 2019 104
March 2019 216
April 2019 220
May 2019 162
June 2019 180
July 2019 150
August 2019 164
September 2019 182
October 2019 216
November 2019 180
December 2019 144
January 2020 92
February 2020 17
March 2020 8
April 2020 1
May 2020 9
June 2020 10
July 2020 15
August 2020 20
September 2020 13
October 2020 14
November 2020 9
December 2020 8
January 2021 10
February 2021 17
March 2021 28
April 2021 14
May 2021 16
June 2021 6
July 2021 15
August 2021 21
September 2021 12
October 2021 24
November 2021 32
December 2021 14
January 2022 15
February 2022 9
March 2022 15
April 2022 24
May 2022 12
June 2022 8
July 2022 12
August 2022 19
September 2022 15
October 2022 24
November 2022 5
December 2022 9
January 2023 18
February 2023 16
March 2023 16
April 2023 22
May 2023 11
June 2023 12
July 2023 5
August 2023 10
September 2023 22
October 2023 16
November 2023 6
December 2023 21
January 2024 17
February 2024 15
March 2024 20
April 2024 15
May 2024 11
June 2024 8
July 2024 20
August 2024 9
September 2024 16
October 2024 11

Citations

89 Web of Science

×

Email alerts

Citing articles via

More from Oxford Academic