Survey-based naming conventions for use in OBO Foundry ontology development (original) (raw)

BMC Bioinformatics volume 10, Article number: 125 (2009)Cite this article

Abstract

Background

A wide variety of ontologies relevant to the biological and medical domains are available through the OBO Foundry portal, and their number is growing rapidly. Integration of these ontologies, while requiring considerable effort, is extremely desirable. However, heterogeneities in format and style pose serious obstacles to such integration. In particular, inconsistencies in naming conventions can impair the readability and navigability of ontology class hierarchies, and hinder their alignment and integration. While other sources of diversity are tremendously complex and challenging, agreeing a set of common naming conventions is an achievable goal, particularly if those conventions are based on lessons drawn from pooled practical experience and surveys of community opinion.

Results

We summarize a review of existing naming conventions and highlight certain disadvantages with respect to general applicability in the biological domain. We also present the results of a survey carried out to establish which naming conventions are currently employed by OBO Foundry ontologies and to determine what their special requirements regarding the naming of entities might be. Lastly, we propose an initial set of typographic, syntactic and semantic conventions for labelling classes in OBO Foundry ontologies.

Conclusion

Adherence to common naming conventions is more than just a matter of aesthetics. Such conventions provide guidance to ontology creators, help developers avoid flaws and inaccuracies when editing, and especially when interlinking, ontologies. Common naming conventions will also assist consumers of ontologies to more readily understand what meanings were intended by the authors of ontologies used in annotating bodies of data.

Background

A wide variety of ontologies, controlled vocabularies, and other terminological artifacts relevant to the biological or medical domains are available through open access portals such as the Ontology Lookup Service (OLS) [1], and the number of such artifacts is growing rapidly. One of the goals of the Open Biomedical Ontologies (OBO) Foundry initiative [2] is to facilitate integration among these diverse ontologies. However, such integration demands considerable effort and differences in format and style can only add obstacles to the execution of this task [3]. The heterogeneity within the set of existing ontologies derives from the use of diverse ontology engineering methodologies and is manifest in the adoption by different communities of Description Logic, Common Logic, or other formalisms. The spectrum of syntaxes used to express these formalisms, such as the Web Ontology Language (OWL) or the OBO format, and the commitment of individual communities to conceptualist or realism-based philosophical approaches are also contributing factors.

Here we focus on issues of nomenclature [4], and specifically on the naming conventions used for labeling classes in ontologies, which are an additional contributing factor to the problem of heterogeneity. Even in this relatively straightforward area, no conventions have achieved broad acceptance (see survey section below).

The lack of naming conventions or their inconsistent usage can impair readability and navigation when viewing ontology class hierarchies. We believe that clear and explicit naming becomes of even greater importance when interlinking ontologies (for example via owl:import, obo dbxref and other referencing and mapping statements [[5](/articles/10.1186/1471-2105-10-125#ref-CR5 "Exploiting patterns in Ontology Mapping[ http://iswc2007.semanticweb.org/papers/950.pdf

            ]")\], or when ontology engineers need to collaborate with external groups to align their ontologies and to ensure effective maintenance of modularity).

While other sources of diversity are tremendously complex and challenging, it is our belief that establishing a set of naming conventions for the OBO Foundry is a tractable goal, particularly if those conventions are based on lessons drawn from pooled practical experience and targeted surveying.

There is of course no shortage of initiatives for the development of specifications and standards tackling naming [[6](/articles/10.1186/1471-2105-10-125#ref-CR6 "ISO/IEC 11179–5, Information technology – Metadata registries (MDR) – Part 5:Naming and identification principles[ http://www.iso.org/iso/iso_catalogue/catalogue_tc/catalogue_detail.htm?csnumber=35347

            ]")–[9](/articles/10.1186/1471-2105-10-125#ref-CR9 "IUBMB-IUPAC Joint Commission on Biochemical Nomenclature (JCBN)[
              http://www.iupac.org/divisions/VIII/jcbn/index.html
              
            ]")\]. However, where naming conventions have been developed, widespread application has been hampered by several factors, most notably domain specificity, document inaccessibility and format dependency. A comprehensive survey of existing naming convention documents can be found at the dedicated OBO Foundry naming conventions website \[[10](/articles/10.1186/1471-2105-10-125#ref-CR10 "Naming Conventions for OBO Foundry Ontology engineering[
              http://obofoundry.org/wiki/index.php/Naming
              
            ]")\].

Domain specificity

One significant obstacle to common adoption is that many of the proposed conventions are domain-specific and not generally extendible to other fields; for example, the Human Genome Organization (HUGO) nomenclature [11] is restricted to gene names. Other conventions refer only to entities occurring within programming languages [[12](/articles/10.1186/1471-2105-10-125#ref-CR12 "The New C Standard, An Economic and Cultural Commentary[ http://citeseer.ist.psu.edu/jones02new.html

            ]")\] or to the naming of natural language documents \[[13](/articles/10.1186/1471-2105-10-125#ref-CR13 "Brown SH, Lincoln M, Hardenbrook S, Petukhova ON, Rosenbloom ST, Carpenter P, Elkin P: Derivation and evaluation of a document-naming nomenclature. J Am Med Inform Assoc 2001, 8: 379–390.")\].

Document inaccessibility

A second obstacle relates to poor documentation. A naming convention whose documentation is unclear, or is dispersed in multiple documents or document sections, artificially constrains its own chances of acceptance. This is the case with the BioPAX manual [[14](/articles/10.1186/1471-2105-10-125#ref-CR14 "BioPAX – biological pathways exchange language, Documentation[ http://www.biopax.org/release/biopax-level2-documentation.pdf

            ]")\], which is in addition overly tool-centric in that it addresses only Protégé-OWL issues. Another deficiency is the commercial or semi-proprietary nature of conventions such as the International Organization for Standardization (ISO) standards \[[15](/articles/10.1186/1471-2105-10-125#ref-CR15 "ISO, International Organization for Standardization[
              http://www.iso.org/
              
            ]")\]. Many of these proposed conventions also impair access through information overload, there being around forty ISO documents addressing naming issues alone. Other naming conventions are described only implicitly and via unintuitive search attributes, or are not available on-line, making access difficult.

Format and implementation dependency

Sometimes only certain naming issues are tackled by a naming convention – usually those most germane to a particular format. The Gene Ontology (GO) Editorial Style Guide [[16](/articles/10.1186/1471-2105-10-125#ref-CR16 "The Gene Ontology Editorial Style Guide[ http://www.geneontology.org/GO.usage.shtml#conventions

            ]")\] for example, is of limited coverage and applicability, as it is embedded in an OBO-format specific document. The ANSI/ISO Z39.19-2005 Standard \[[8](/articles/10.1186/1471-2105-10-125#ref-CR8 "NISO (Ed): ANSI/NISO Z39.19–2005, Guidelines for the Construction, Format, and Management of Monolingual Controlled Vocabularies. Bethesda, Maryland, U.S.A: National Information Standards Organization, NISO Press; 2005.")\] is applicable only to terms organized in an _is-a_ hierarchy without relations and therefore lacks proper conventions for representing ontological classes and properties in semantically complex ontologies.

In the case of the Ontology Engineering and Patterns Task Force of the Semantic Web Best Practices and Deployment working group [[17](/articles/10.1186/1471-2105-10-125#ref-CR17 "Semantic web best practices and deployment group, Ontology Engineering and Patterns Task Force[ http://www.w3.org/2001/sw/BestPractices/OEP/

            ]")\], the guidelines are restricted to the OWL format and are dispersed throughout many documents and document sections.

To overcome this diversity and fragmentation members of the OBO Foundry and of the Metabolomics Standards Initiative (MSI) ontology working group [18] have set up an infrastructure group that is attempting to:

In this communication we present the preliminary results of a survey of the naming conventions applied by ontology groups listed under the OBO Foundry, together with an initial set of what we believe are robust conventions for formulation of terms in ontologies and a list of open issues that need to be resolved in the future.

Results

Survey

To determine the sources of heterogeneity in naming and to initiate a discussion among the ontology groups associated with the OBO Foundry, we carried out a survey. The goal was to allow us to:

The survey was conducted by contacting the custodians of the 66 OBO ontologies (as of November 2007) either by email or telephone. Each respondent then received a questionnaire that was divided into four parts, covering:

    1. Ontology engineering process and level of awareness of the OBO Foundry
    1. Current practice in naming entities and documentation thereof
    1. Implementation of different name categories
    1. Questions on particular naming conventions

The full questionnaire, the complete set of answers and the consolidated results are available from the OBO Foundry wiki [[10](/articles/10.1186/1471-2105-10-125#ref-CR10 "Naming Conventions for OBO Foundry Ontology engineering[ http://obofoundry.org/wiki/index.php/Naming

            ]")\]. For more information on the survey results and list of participants see the Additional file [1](/articles/10.1186/1471-2105-10-125#MOESM1): SurveyResults.zip.

Naming Conventions

Our proposed set of naming conventions, founded on the survey results, is summarized in Table 1. In further discussions, we refer to the entities of which an ontology consists (in some circles these are called classes and relations) as its representational units [[19](/articles/10.1186/1471-2105-10-125#ref-CR19 "Smith B, Kusnierczyk W, Schober D, Ceusters W: Towards a Reference Terminology for Ontology Research and Development in the Biomedical Domain. KR-MED 2006 2006. [ http://ontology.buffalo.edu/bfo/Terminology_for_Ontologies.pdf

            ]")\]. A representational unit can be accompanied by one or more synonymous names of different categories. Any type of name that is chosen to be displayed in the hierarchy is called 'display name' (called 'browser key' in Protégé). Where the form of that name is controlled by a set of explicit rules we refer to it as a 'formal name'. To ensure that the conventions proposed here are expressed unambiguously we employ the following additional name categories, which we hope will also have general utility:

Table 1 The initial set of OBO Foundry naming conventions

Full size table

Further types of names can be distinguished, such as 'lexical variant' (including abbreviations and acronyms), 'phonetic variant' and 'foreign language translation'. The one rule that governs all these name categories is that they all must be exact synonyms. Since Protégé and OBO Edit do not deal with external lexical formats in an integrated way, we recommend storing lexical variants in the ontology itself to make them immediately accessible e.g. when mapping ontologies and identifying homonyms.

The lack of defined name categories in the available representation languages has been recognized by the Ontology Task Force of the W3C Semantic Web Health Care and Life Sciences Interest Group [[7](/articles/10.1186/1471-2105-10-125#ref-CR7 "The HCLS Ontology Task Force[ http://esw.w3.org/topic/HCLS/Labels_and_Definitions

            ]")\] and the lack of clear guidance on which kind of name the representation language idioms rdfs:label (OWL) and term name (OBO) should contain, has contributed significantly to the current heterogeneity in naming between ontologies. Our minimum recommendation is to assign an editor-preferred name, to which all of the naming conventions described in Table [1](/articles/10.1186/1471-2105-10-125#Tab1) should be applied, and one or more user-preferred names, which are less controlled and chosen to match end user expectations and usage frequency. The utility of having separate editor- and user-preferred names is exemplified by the response to question 4.1.2 in our survey by the developers of the Drosophila development ontology where they describe the balance they attempt to strike between making names explicit, keeping them concise and avoiding straying too far from community usage.

Discussion

Naming conventions for ontology engineering do not necessarily apply to other domains. For example, our recommendation "1.2 Use context independent names" (see Table 1) will not make sense in the domain of database schemata or object-oriented programming. Terms from ontologies can be used in annotations outside the ontological context, whereas a java class is always situated in a class library hierarchy and embedded in code, providing its full context and therefore its name does not need to be fully explicit. However, general naming conventions such as "1. Be clear and unambiguous" and "2. Be univocal" can be applied in database schema generation, class naming in object oriented programming, natural language generation, even Wikipedia article naming. Formulation of universally applicable naming conventions in the bio-ontology space is no easy task due to the multidimensional complexity of the area, deriving not least from its intrinsically interdisciplinary character. Therefore, although we have carried out a comprehensive survey of existing naming convention documents in different domains [[10](/articles/10.1186/1471-2105-10-125#ref-CR10 "Naming Conventions for OBO Foundry Ontology engineering[ http://obofoundry.org/wiki/index.php/Naming

            ]")\], we have deliberately confined ourselves here to considering the needs of the OBO Foundry community.

Exceptions

When conventions have been established their application may be non-trivial, not least because of the exceptions which different groups will want to make to given rules. In cases where the conventions cannot be strictly applied, common sense should be used. Here we describe some situations of this sort highlighted by our survey.

Positive names (see 2.4 in Table 1)

The responses to question 4.8.1 showed that most groups already try to avoid negative names and names containing expressions such as 'without' or 'excluding'; yet nearly half of the survey respondents still found examples of negative names in their ontologies. It seems it can be difficult to decide when a term is negative; e.g., 'unhealthy', 'immaterial anatomical entity', 'nonlinear transformation', 'inorganic' and 'rotenone-insensitive'. The difficulty in defining the criteria for 'negative' indicates that the convention cannot be enforced strictly, but we hold that it is nonetheless a valuable guideline. Further, we recommend that explicit exclusions should not be made within names; e.g., as in 'hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds, in cyclic amides' (GO:0016812).

Word separator (see 3.3 in Table 1)

We recommend the use of white space as separator in editor-preferred names. A consequence of the default behaviour of the Protégé 3.x Editor is that it encourages the use of the rdf:ID field to capture class names. Since this field can't contain spaces, developers using Protégé often use the underscore as a word separator. This can be cured by avoiding use of the rdf:ID field to record editor-preferred names and to use instead the rdfs:label field.

Special character formatting and symbols (see 3.5 in Table 1)

The survey revealed that ontologies dealing with chemicals and using the IUPAC nomenclature need to apply character formatting to their names for purposes of semantic disambiguation. In ChEBI for example the full chemical name is represented with unrestricted character formatting, for example: CHEBI 30666: bis [tricarbonyl(η5-cyclopentadienyl)molybdenum](Mo-Mo). Since character formatting is not supported by most ontology editors and languages, the groups involved often develop specific tools to meet their requirements. For this reason ChEBI and the Systems Biology ontology have developed front ends built on top of relational databases to manage their ontologies. Defined character transformation rules can be used to encode special formatting for example as has been done by the Biological Imaging Methods Ontology, which uses [] for superscripts and [[]] for subscripts. In general these should be avoided.

Benefits and applications

The application of common naming guidelines brings the following benefits:

By increasing the robustness of ontology class names, a standard naming convention will:

The proposed set of conventions is currently being applied by the Ontology for Biomedical Investigation (OBI) project [[20](/articles/10.1186/1471-2105-10-125#ref-CR20 "Ontology for Biomedical Investigations (OBI)[ http://obi.sourceforge.net/

            ]")\] and by the Proteomics Standards Initiative (PSI) \[[21](/articles/10.1186/1471-2105-10-125#ref-CR21 "Hermjakob H: The HUPO Proteomics Standards Initiative – Overcoming the Fragmentation of Proteomics Data. Proteomics 2006, 6: 34–38. 10.1002/pmic.200600537")\] and MSI ontology working groups. An example that illustrates how syntactic normalization enhances readability and navigability of the OBI ontology class hierarchy can be found on the OBO Foundry wiki \[[10](/articles/10.1186/1471-2105-10-125#ref-CR10 "Naming Conventions for OBO Foundry Ontology engineering[
              http://obofoundry.org/wiki/index.php/Naming
              
            ]")\].

The usefulness of design principles in general and naming conventions in particular increases considerably when they are supported by ontology editing tools [[22](/articles/10.1186/1471-2105-10-125#ref-CR22 "Kismeta Validator v1.1b, Enterprise Data Standards Validation and Enforcement[ http://www.kismeta.com/Validtr.html

            ]")\]. In particular, tools should check for compliance to such conventions and provide the functionality not only to enforce, but also to exploit, convention-based naming patterns. We are pleased to observe that implementations of such functionality have already begun to appear. For example, in the OBO Edit 2 tool \[[23](/articles/10.1186/1471-2105-10-125#ref-CR23 "Day-Richter J, Harris MA, Haendel M, Lewis S: OBO-Edit – an ontology editor for biologists. Bioinformatics 2007, 23: 2198–2200. 10.1093/bioinformatics/btm112")\] redundant class names are indicated and users can also define their own verification checks by specifying filters and error messages that will be displayed for each name that matches (or fails to match) the conventions defined. This verification system can serve as a framework upon which to build robust checks for conformity to naming conventions, either as a built-in OBO Edit module or as externally provided plug-ins (John Day-Richter personal communication). Also tools such as OBOL that use the lexical information in class names are already being applied to find inconsistencies within and between labels, and to aid ontology integration and ontology engineering in general through the methodology of cross-products \[[24](/articles/10.1186/1471-2105-10-125#ref-CR24 "Mungall CM: Obol: Integrating Language and Meaning in Bio-Ontologies. Comparative and Functional Genomics 2004, 5: 509–520. 10.1002/cfg.435")\].

Some aspects of what we propose here mirror features of so-called Constrained Natural Languages, CNL [[25](/articles/10.1186/1471-2105-10-125#ref-CR25 "Controlled Languages: An Introduction[ http://www.shlrc.mq.edu.au/masters/students/raltwarg/clgrammar.htm

            ]")\]. In particular, defined restrictions on the use of grammar and terminology can be found in CNL, and exploiting developments in this field could prove fruitful. However we must be careful not to be seen to be trying to impose too great a burden on ontology editors by attempting to require them to learn another full representation language. It is important to stress that having conventions for default names (using the editor-preferred name as display name) does not place restrictions on the use of less formal or colloquial names, which can and should still be captured as synonyms.

Impact on GO

As the longest established ontology in the OBO Foundry, GO has already invested effort in establishing its own naming conventions, having formerly suffered under many of the common pitfalls in naming described in this paper, for example, the use of catch-all terms such as "unlocalized" and "molecular function unknown" [[26](/articles/10.1186/1471-2105-10-125#ref-CR26 "Smith B, Köhler J, Kumar A: On the Application of Formal Principles to Life Science Data: a Case Study in the Gene Ontology. DILS 2004, 79–94. [ http://ontology.buffalo.edu/medo/Database_Integration.pdf

            ]")\]. Some of the recommendations outlined here have been inherited from the GO community, which in turn will move to include this whole set of naming conventions into the GO style guide. The impact on GO will certainly be positive, especially where it is used in combination with other OBO Foundry ontologies. For example, GO is considering changing to the context-independent name "cell nucleus" (as already used in FMA), instead of "nucleus" to distinguish it from "atomic nuclei" in ChEBI. The avoidance of conjunctions in term names will decompose terms like "actin polymerization and/or depolymerization", and the restriction to positive names will prevent or lead to the refactoring of terms like 'non-eye photoreceptor cell development' in GO.

Open Issues

The surveying process reported in this paper has been informative, and has provided evidence to support the various conventions presented herein. Furthermore, several responders explicitly stated that the questionnaire made them aware of issues which they had not thought of previously; and in some cases went on to indicate other areas where they considered that conventions would be helpful, such as:

Although work on some of the above issues has already started, these open issues are of importance and will be tackled in a next round of guideline development by the OBO Foundry coordinators, in collaboration with the OBO Foundry ontology developers.

Conclusion

The effective and efficient description of scientific information is the ultimate goal of this work. Mature, consensus-based conventions to guide ontology development are a crucial requisite for the achievement of this goal. We have presented an initial set of naming conventions primarily (but certainly not exclusively) for use in OBO Foundry ontologies. The justifications for the conventions presented were founded on answers from ontology editor practitioners gathered by means of a survey carried out within the OBO Foundry community.

The resulting set of conventions should be viewed as a primer, to be expanded and refined on the basis of input from practitioners. These conventions were discussed and approved by representatives of the OBO Foundry ontologies at the first OBO Foundry Summit meeting in July 2008 at the European Bioinformatics Institute (EBI), Cambridge, UK, funded by the UK's Biotechnology and Biological Sciences Research Council (BB/E025080/1) and the Elixir project http://www.elixir-europe.org. Further feedback will allow us to continue refining and ultimately to finalize this proposal at the second OBO Foundry Summit meeting in June 2009 at the EBI. As part of this iterative development process we will continue to engage with other efforts, particular those outside the OBO Foundry community such as the W3C Semantic Web Health Care and Life Sciences Interest Group and the Ontology Engineering and Patterns Task Force of the W3C Semantic Web Best Practices and Deployment working group.

Abbreviations

When an abbreviation or acronym becomes more commonly used in everyday language than its full name:

for example 'LASER', then it should be used as the name, with its expanded name captured as a synonym. In other words, usage frequency can take precedence over the rule of acronym avoidance.

References

  1. Cote RG, Jones P, Apweiler R, Hermjakob H: The Ontology Lookup Service, a lightweight cross-platform tool for controlled vocabulary queries. BMC Bioinformatics 2006, 7: 97. 10.1186/1471-2105-7-97
    Article PubMed Central PubMed Google Scholar
  2. Smith B, Ashburner M, Rosse C, Bard J, Bug W, Ceusters W, Goldberg LJ, Eilbeck K, Ireland A, Mungall CJ, et al.: The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration. Nat Biotechnol 2007, 25: 1251–1255. 10.1038/nbt1346
    Article PubMed Central CAS PubMed Google Scholar
  3. Bodenreider O, Stevens R: Bio-ontologies: current trends and future directions. Brief Bioinform 2006, 7: 256–274. 10.1093/bib/bbl027
    Article PubMed Central CAS PubMed Google Scholar
  4. Tuason O, Chen L, Liu H, Blake JA, Friedman C: Biological nomenclatures: a source of lexical knowledge and ambiguity. Pac Symp Biocomput 2004, 238–249.
    Google Scholar
  5. Exploiting patterns in Ontology Mapping[http://iswc2007.semanticweb.org/papers/950.pdf]
  6. ISO/IEC 11179–5, Information technology – Metadata registries (MDR) – Part 5:Naming and identification principles[http://www.iso.org/iso/iso_catalogue/catalogue_tc/catalogue_detail.htm?csnumber=35347]
  7. The HCLS Ontology Task Force[http://esw.w3.org/topic/HCLS/Labels_and_Definitions]
  8. NISO (Ed): ANSI/NISO Z39.19–2005, Guidelines for the Construction, Format, and Management of Monolingual Controlled Vocabularies. Bethesda, Maryland, U.S.A: National Information Standards Organization, NISO Press; 2005.
    Google Scholar
  9. IUBMB-IUPAC Joint Commission on Biochemical Nomenclature (JCBN)[http://www.iupac.org/divisions/VIII/jcbn/index.html]
  10. Naming Conventions for OBO Foundry Ontology engineering[http://obofoundry.org/wiki/index.php/Naming]
  11. Wright MW, Bruford EA: Human and orthologous gene nomenclature. Gene 2006, 369: 1–6. 10.1016/j.gene.2005.10.029
    Article CAS PubMed Google Scholar
  12. The New C Standard, An Economic and Cultural Commentary[http://citeseer.ist.psu.edu/jones02new.html]
  13. Brown SH, Lincoln M, Hardenbrook S, Petukhova ON, Rosenbloom ST, Carpenter P, Elkin P: Derivation and evaluation of a document-naming nomenclature. J Am Med Inform Assoc 2001, 8: 379–390.
    Article PubMed Central CAS PubMed Google Scholar
  14. BioPAX – biological pathways exchange language, Documentation[http://www.biopax.org/release/biopax-level2-documentation.pdf]
  15. ISO, International Organization for Standardization[http://www.iso.org/]
  16. The Gene Ontology Editorial Style Guide[http://www.geneontology.org/GO.usage.shtml#conventions]
  17. Semantic web best practices and deployment group, Ontology Engineering and Patterns Task Force[http://www.w3.org/2001/sw/BestPractices/OEP/]
  18. Sansone SA, Fan T, Goodacre R, Griffin JL, Hardy NW, Kaddurah-Daouk R, Kristal BS, Lindon J, Mendes P, Morrison N, et al.: The metabolomics standards initiative. Nat Biotechnol 2007, 25: 846–848. 10.1038/nbt0807-846b
    Article CAS PubMed Google Scholar
  19. Smith B, Kusnierczyk W, Schober D, Ceusters W: Towards a Reference Terminology for Ontology Research and Development in the Biomedical Domain. KR-MED 2006 2006. [http://ontology.buffalo.edu/bfo/Terminology_for_Ontologies.pdf]
    Google Scholar
  20. Ontology for Biomedical Investigations (OBI)[http://obi.sourceforge.net/]
  21. Hermjakob H: The HUPO Proteomics Standards Initiative – Overcoming the Fragmentation of Proteomics Data. Proteomics 2006, 6: 34–38. 10.1002/pmic.200600537
    Article PubMed Google Scholar
  22. Kismeta Validator v1.1b, Enterprise Data Standards Validation and Enforcement[http://www.kismeta.com/Validtr.html]
  23. Day-Richter J, Harris MA, Haendel M, Lewis S: OBO-Edit – an ontology editor for biologists. Bioinformatics 2007, 23: 2198–2200. 10.1093/bioinformatics/btm112
    Article CAS PubMed Google Scholar
  24. Mungall CM: Obol: Integrating Language and Meaning in Bio-Ontologies. Comparative and Functional Genomics 2004, 5: 509–520. 10.1002/cfg.435
    Article PubMed Central CAS PubMed Google Scholar
  25. Controlled Languages: An Introduction[http://www.shlrc.mq.edu.au/masters/students/raltwarg/clgrammar.htm]
  26. Smith B, Köhler J, Kumar A: On the Application of Formal Principles to Life Science Data: a Case Study in the Gene Ontology. DILS 2004, 79–94. [http://ontology.buffalo.edu/medo/Database_Integration.pdf]
    Google Scholar

Download references

Acknowledgements

We kindly acknowledge the members of the OBO Foundry ontologies for their valuable contribution to the survey. In particular we thank Robert Stevens, Luisa Montecchi-Palazzi, Judith Blake and the members of the OBI working group for their comments and contributions in fruitful discussions. We also gratefully thank the ontology communities under OBO Foundry for contributing to the survey and the BBSRC (BB/D524283/1, BB/E025080/1), the EU Network of Excellence NuGO (NoE 503630), the EU Carcinogenomics (PL037712) to SAS and PRS for funding the activities of DS. BS's contribution to this work was supported by the NIH Roadmap for Medical Research, Grant 1 U 54 HG004028 (National Center for Biomedical Ontology).

Author information

Authors and Affiliations

  1. EMBL-EBI, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
    Daniel Schober, Jane Lomax, Chris F Taylor, Philippe Rocca-Serra & Susanna-Assunta Sansone
  2. Institute of Medical Biometry and Medical Informatics (IMBI), University Medical Center, 79104, Freiburg, Germany
    Daniel Schober
  3. Center of Excellence in Bioinformatics and Life Sciences, and Department of Philosophy, University at Buffalo, NY, USA
    Barry Smith
  4. Berkeley Bioinformatics and Ontologies Project, Lawrence Berkeley National Labs, Berkeley, CA, 94720, USA
    Suzanna E Lewis & Chris Mungall
  5. Department of Information and Computer Science, Norwegian University of Science and Technology (NTNU), Trondheim, Norway
    Waclaw Kusnierczyk
  6. NERC Environmental Bioinformatics Centre (NEBC), Mansfield Road, Oxford, OX1 3SR, UK
    Chris F Taylor

Authors

  1. Daniel Schober
    You can also search for this author inPubMed Google Scholar
  2. Barry Smith
    You can also search for this author inPubMed Google Scholar
  3. Suzanna E Lewis
    You can also search for this author inPubMed Google Scholar
  4. Waclaw Kusnierczyk
    You can also search for this author inPubMed Google Scholar
  5. Jane Lomax
    You can also search for this author inPubMed Google Scholar
  6. Chris Mungall
    You can also search for this author inPubMed Google Scholar
  7. Chris F Taylor
    You can also search for this author inPubMed Google Scholar
  8. Philippe Rocca-Serra
    You can also search for this author inPubMed Google Scholar
  9. Susanna-Assunta Sansone
    You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence toSusanna-Assunta Sansone.

Additional information

Authors' contributions

This work was largely informed by the requirements of the annotation projects lead by SAS and PRS, who coordinated this work. DS was the knowledge engineer who reviewed the existing conventions and with SAS, PRS, BS, SL, CM and JL designed the survey. WK, BS and PRS worked with DS in defining the appropriate terminology for describing the naming conventions. Contributions and critical reviews by all the authors, in particular PRS, CT, SL, BS and SAS, delivered the final manuscript. Authors read and approved the final manuscript

Electronic supplementary material

12859_2008_2855_MOESM1_ESM.zip

Additional file 1: Surveying naming conventions within OBO Foundry ontologies. This SurveyResults.zip is a webpage presenting the results of the naming conventions survey that was carried out within the OBO Foundry ontologies. It contains diagrams and tables illustrating the answers to the survey's questions, as well as the discussion of these results. (ZIP 244 KB)

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Schober, D., Smith, B., Lewis, S.E. et al. Survey-based naming conventions for use in OBO Foundry ontology development.BMC Bioinformatics 10, 125 (2009). https://doi.org/10.1186/1471-2105-10-125

Download citation

Keywords