Expansion of the Human Phenotype Ontology (HPO) knowledge base and resources

Journal Article

Charité Centrum für Therapieforschung, Charité—Universitätsmedizin Berlin Corporate Member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health, Berlin 10117, Germany

Einstein Center Digital Future, Berlin 10117, Germany

Monarch Initiative, monarchinitiative.org

Monarch Initiative, monarchinitiative.org

The Jackson Laboratory for Genomic Medicine, Farmington, CT 06032, USA

Monarch Initiative, monarchinitiative.org

Oregon Health & Science University, Portland, OR 97217, USA

Monarch Initiative, monarchinitiative.org

Genomics England, Queen Mary University of London, Dawson Hall, Charterhouse Square, London EC1M 6BQ, UK

Monarch Initiative, monarchinitiative.org

The Jackson Laboratory for Genomic Medicine, Farmington, CT 06032, USA

Monarch Initiative, monarchinitiative.org

Oregon Health & Science University, Portland, OR 97217, USA

Monarch Initiative, monarchinitiative.org

The Jackson Laboratory for Genomic Medicine, Farmington, CT 06032, USA

Monarch Initiative, monarchinitiative.org

Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA

Monarch Initiative, monarchinitiative.org

European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Cambridge, UK

Monarch Initiative, monarchinitiative.org

Linus Pauling Institute, Oregon State University, Corvallis, OR, USA

Received: 17 September 2018
Revision received: 18 October 2018
Accepted: 24 October 2018
Published: 22 November 2018

Sebastian Köhler, Leigh Carmody, Nicole Vasilevsky, Julius O B Jacobsen, Daniel Danis, Jean-Philippe Gourdine, Michael Gargano, Nomi L Harris, Nicolas Matentzoglu, Julie A McMurry, David Osumi-Sutherland, Valentina Cipriani, James P Balhoff, Tom Conlin, Hannah Blau, Gareth Baynam, Richard Palmer, Dylan Gratian, Hugh Dawkins, Michael Segal, Anna C Jansen, Ahmed Muaz, Willie H Chang, Jenna Bergerson, Stanley J F Laulederkind, Zafer Yüksel, Sergi Beltran, Alexandra F Freeman, Panagiotis I Sergouniotis, Daniel Durkin, Andrea L Storm, Marc Hanauer, Michael Brudno, Susan M Bello, Murat Sincan, Kayli Rageth, Matthew T Wheeler, Renske Oegema, Halima Lourghi, Maria G Della Rocca, Rachel Thompson, Francisco Castellanos, James Priest, Charlotte Cunningham-Rundles, Ayushi Hegde, Ruth C Lovering, Catherine Hajek, Annie Olry, Luigi Notarangelo, Morgan Similuk, Xingmin A Zhang, David Gómez-Andrés, Hanns Lochmüller, Hélène Dollfus, Sergio Rosenzweig, Shruti Marwaha, Ana Rath, Kathleen Sullivan, Cynthia Smith, Joshua D Milner, Dorothée Leroux, Cornelius F Boerkoel, Amy Klion, Melody C Carter, Tudor Groza, Damian Smedley, Melissa A Haendel, Chris Mungall, Peter N Robinson, Expansion of the Human Phenotype Ontology (HPO) knowledge base and resources, Nucleic Acids Research, Volume 47, Issue D1, 08 January 2019, Pages D1018–D1027, https://doi.org/10.1093/nar/gky1105

Abstract

The Human Phenotype Ontology (HPO)—a standardized vocabulary of phenotypic abnormalities associated with 7000+ diseases—is used by thousands of researchers, clinicians, informaticians and electronic health record systems around the world. Its detailed descriptions of clinical abnormalities and computable disease definitions have made HPO the de facto standard for deep phenotyping in the field of rare disease. The HPO’s interoperability with other ontologies has enabled it to be used to improve diagnostic accuracy by incorporating model organism data. It also plays a key role in the popular Exomiser tool, which identifies potential disease-causing variants from whole-exome or whole-genome sequencing data. Since the HPO was first introduced in 2008, its users have become both more numerous and more diverse. To meet these emerging needs, the project has added new content, language translations, mappings and computational tooling, as well as integrations with external community data. The HPO continues to collaborate with clinical adopters to improve specific areas of the ontology and extend standardized disease descriptions. The newly redesigned HPO website (www.human-phenotype-ontology.org) simplifies browsing terms and exploring clinical features, diseases, and human genes.

INTRODUCTION

A cornerstone of differential diagnostics and translational research is deep phenotyping: the computational analysis of detailed, individual clinical abnormalities (1,2). The Human Phenotype Ontology (HPO) provides the most comprehensive resource for computational deep phenotyping and has become the de facto standard for deep phenotyping in the field of rare disease—whether for computable disease definitions, description of clinical abnormalities or to aid genomic diagnostics. A foundational and integrative component of the Monarch Initiative (3,4), the HPO has been adopted internationally by numerous organizations, both academic and commercial; these include the 100,000 Genomes Project, the NIH Undiagnosed Disease Program and Network (UDP and UDN), the Undiagnosed Diseases Network International (UDNI), RD-CONNECT, SOLVE-RD and many others (5–9). The HPO recently achieved status as an International Rare Disease Research Consortium (IRDiRC) recognized resource and is in use by the Global Alliance for Genomics and Health (10) and the associated Matchmaker Exchange (3,11). Here we describe integrated HPO resources which we have revised, expanded, or invented since the previous articles in this series (12,13).

Previously, we reported on a range of algorithms that had been developed by our group and others to support phenotype-driven genomic diagnostics (12). Since then, the HPO has been applied to an increasing range of use cases. Usage of HPO is now commonplace for the analysis of clinical whole-exome and genome sequencing (WES/WGS) data (14–25) as well as for data integration in translational research and bioinformatics (16,26–39). A phenotype risk score based on a mapping of electronic health-record (EHR)-derived billing codes to HPO terms allowed high-throughput ascertainment of EHR phenotypes such that cases and controls of Mendelian diseases could be distinguished and the pathogenicity of variants associated with Mendelian diseases was characterized (40). In another setting, EHR narratives were explored to extract HPO terms by natural language processing and the resulting terms were successfully used to prioritize causal genes for Mendelian diseases in pediatric patients (41). Additionally, an increasing number of commercial applications are using HPO terms. For instance, the SimulConsult Genome-Phenome Analyzer uses HPO terms to tag findings. This is currently being used to document findings entered by the users with codes in exported reports, and the codes will also be used to identify findings in the electronic health record as inputs to be considered in diagnosis (42). A key feature of the HPO is its logical interoperability with basic research ontologies such as the Mammalian Phenotype Ontology (MP) (43), Uberon (44) and the Cell Ontology (45). This interoperability is leveraged within the Exomiser tool (described below). 
The International Mouse Phenotyping Consortium (IMPC) recently identified 360 new candidate molecular causes of human Mendelian diseases (46); these included ‘Arrhythmogenic Right Ventricular Dysplasia’, an inherited disease that affects the heart muscle, and ‘Charcot-Marie-Tooth disease’, which is characterized by nerve damage leading to muscle weakness and an awkward gait. This discovery was made possible because (i) the human diseases had been defined in terms of their component HPO phenotypes; (ii) the mouse phenotypes were mapped to the MP; and (iii) Monarch’s phenotype comparison algorithm (47) is designed to traverse HP and MP with ease. Similarly, the Rat Genome Database (RGD) annotates genes, QTLs and strains with phenotype terms from the Mammalian Phenotype (MP) Ontology (43); more recently, RGD has converted its annotations of human phenotypes from MP to HPO (48).

HPO has been adopted as the phenotypic annotation ontology of choice for many large-scale rare disease genome-phenome databases and analysis tools including the RD-Connect Genome-Phenome Analysis Platform (GPAP) (49), the Broad Center for Mendelian Genomics and its SEQR platform, the rare disease arm of the UK 100,000 Genomes Project, the NIH Undiagnosed Diseases Program and the Undiagnosed Diseases Network International (UDNI). This is creating a vast body of clinically validated, linked genome-phenome data that not only assists in the diagnosis of the subjects themselves but can be exploited for further developments of the ontology and associated diagnostic algorithms. For example, the RD-Connect GPAP mandates submission of HPO-coded phenotypic data through the PhenoTips tool, using custom-designed disease-specific data collection forms on top of the ‘enter-what-you-see’ HPO entry box. The average number of phenotypic annotations per index case is eight (with an average of six observed and two excluded features) and the GPAP now contains linked genome-phenome datasets on 5000 individuals. Through data submission from European Reference Networks in the Horizon 2020-funded Solve-RD project this number will increase to >20 000 datasets in the coming 2–3 years. The GPAP allows the user to filter variants using predefined gene panels for specific groups of pathologies or alternatively gene lists created ‘on the fly’ based on the HPO terms provided with the individual case. These major databases are not only contributing to gene discovery and diagnosis of the unsolved patients included in the platforms (10) but also providing source data for many computational developments. 
Within the Solve-RD project (https://solve-rd.eu), RD-Connect worked with Orphanet and HPO to implement the first version of the Phenopackets standard (https://github.com/phenopackets) and export ∼600 cases in Phenopacket format, including clinical phenotype (HPO annotation), clinical diagnosis (ORDO), molecular diagnosis (OMIM) and gene name of genes identified as causal or candidate. The export included both solved cases and unsolved cases that contain sufficient information for phenotypic algorithm evaluation. In addition, work is ongoing that will enable assessment of the correlation between the level, detail and quantity of phenotypic annotation and the solve rate, which will provide clinicians with better advice on the level of detail to provide in their annotations and feed back into improvements to algorithms such as those implemented in Exomiser.

Ontologies should be responsive to the community (43). In the past 2 years we have made improvements to the ontology based on input from clinicians and researchers, as is evidenced by term requests that have been submitted via our GitHub tracker (12). There, we provide a template that guides users through the process of providing information including the suggested term label, definitions and comments, synonyms, references and diseases that should be annotated to the new term. Periodically we also organize collaborative workshops with clinical groups that would like to revise and extend entire areas of the HPO. Five such workshops have been conducted since the 2017 HPO update (Table 1).

Table 1. Community workshops and collaborations aimed at HPO content expansion and refinement

Organization | Location | Focus
Undiagnosed Diseases Network (UDN); Stanford Center for Inherited Cardiovascular Diseases (SCICD) | Stanford University, CA, USA (March 2017) | Cardiology
European Reference Network for Rare Eye Disease (ERN-EYE) | Mont Sainte-Odile, France (October 2017) | Ophthalmology
National Institute of Allergy and Infectious Diseases (NIAID) | National Institutes of Health, Bethesda, MD, USA (May and July 2018) | Allergy and immunology
Neuro-MIG European network for brain malformations (www.neuro-mig.org) | St Julians, Malta; Lisbon, Portugal (February 2018; September 2018) | Malformations of cortical development (MCD)
European Society for Immunodeficiencies (ESID) and the European Reference Network on rare primary immunodeficiency, autoinflammatory and autoimmune diseases (ERN-RITA) | Vienna, Austria (September 2018) | Inborn errors of immunity

The HPO project additionally has a long-term collaboration with Orphanet in the framework of HIPBI-RD (harmonizing phenomics information for a better interoperability in the rare disease field), a project that was funded by the E-Rare 3 ERA-NET program (50) and will be continued in the framework of the SOLVE-RD project, as well as in the European Joint Co-fund Programme for Rare Diseases (EJP-RD). This project has resulted in more than 60 000 HPO annotations for diseases in the Orphanet database and over one thousand new term requests and other improvements of existing HPO terms. Phenotype-disease annotations include the frequency of occurrence of a phenotype in a disease (see Table 2), and they record whether a phenotype is part of established diagnostic criteria or is a pathognomonic sign. These annotations are available for download and can be consulted on the Orphanet website. Furthermore, this collaboration has produced the HPO-ORDO Ontological Module (HOOM), in which the HPO and the Orphanet Rare Disease Ontology (ORDO) can be used together.

Table 2. The HPO records the frequencies of phenotypic features in three different ways

Frequency categories
Term | ID | Definition
Obligate | HP:0040280 | Always present, i.e. in 100% of the cases.
Very frequent | HP:0040281 | Present in 80–99% of the cases.
Frequent | HP:0040282 | Present in 30–79% of the cases.
Occasional | HP:0040283 | Present in 5–29% of the cases.
Very rare | HP:0040284 | Present in 1–4% of the cases.
Excluded | HP:0040285 | Present in 0% of the cases.

Percentage of persons in which a phenotypic feature is observed
Percentage | x% | This is used to record frequency of a feature in a disease if the number of probands is not available, e.g. 42%.

Number of persons in a cohort in whom a phenotypic feature was observed
N of M notation | n/m | This is used to record how many persons with a certain disease were observed to have a given phenotypic feature represented by an HPO term, e.g. 5/13. This should be used only if the feature was ruled out in the remaining m − n individuals.

Frequency information can be used by differential diagnostic algorithms such as BOQA (62). If possible, HPO annotations are made with the precise counts, but percentages or overall frequency categories are used if that is all that is available. The frequency categories are aligned with those of Orphanet.
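The three frequency notations in Table 2 can be told apart mechanically. The following is a minimal Python sketch, not part of the HPO tooling: the function name `parse_frequency` and the choice to return a lower and upper bound are assumptions made for illustration, while the term IDs and percentage ranges are taken from Table 2.

```python
import re

# Frequency categories from Table 2, as (lower, upper) fractions of cases.
FREQUENCY_TERMS = {
    "HP:0040280": (1.00, 1.00),  # Obligate
    "HP:0040281": (0.80, 0.99),  # Very frequent
    "HP:0040282": (0.30, 0.79),  # Frequent
    "HP:0040283": (0.05, 0.29),  # Occasional
    "HP:0040284": (0.01, 0.04),  # Very rare
    "HP:0040285": (0.00, 0.00),  # Excluded
}


def parse_frequency(value):
    """Return (kind, lower, upper) for a frequency string (hypothetical helper)."""
    value = value.strip()
    if value in FREQUENCY_TERMS:
        lower, upper = FREQUENCY_TERMS[value]
        return ("category", lower, upper)
    match = re.fullmatch(r"(\d+)/(\d+)", value)
    if match:
        n, m = int(match.group(1)), int(match.group(2))
        if m == 0 or n > m:
            raise ValueError("invalid n/m frequency: %r" % value)
        return ("count", n / m, n / m)
    match = re.fullmatch(r"(\d+(?:\.\d+)?)%", value)
    if match:
        fraction = float(match.group(1)) / 100
        return ("percentage", fraction, fraction)
    raise ValueError("unrecognized frequency: %r" % value)


for example in ("HP:0040283", "42%", "5/13"):
    print(example, parse_frequency(example))
```

Returning a range keeps the category terms comparable with exact counts and percentages; an algorithm that needs a single number could, for example, take the midpoint, but that choice is outside what the article specifies.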

LOGICAL ENHANCEMENTS AND INTEROPERABILITY

The HPO provides textual definitions for ease of use, but it also has a robust logical representation with OWL-based logical definitions based on species-neutral ontologies such as Uberon, the Gene Ontology, the Cell Ontology and others. For instance, Delayed patellar ossification (HP:0006454) is defined with reference to the PATO term delayed (PATO:0000502), the Gene Ontology term ossification (GO:0001503) and the Uberon term for patella (UBERON:0002446). The OBO version of the ontology is a simplified version of the full OWL version that contains all of the terms as well as their subclass (is-a) relations, but does not contain the computational logical definitions.

‘has part’ some
    (delayed
        and (‘inheres in’ some
            (ossification
                and (‘occurs in’ some patella)))
        and (‘has modifier’ some abnormal))

These logical definitions can be used for quality control (51), to infer new classifications (is_a/subclass relationships) that were not explicitly asserted and for cross-species phenotype analysis (46). However, this can only work if compatible sets of definitions are used.

Manually maintaining compatible logical definitions across large ontologies such as the HPO is error-prone and may lead to inconsistent descriptions within a single ontology and, especially, across different phenotype ontologies. Even specialized branches of the ontology, such as the ones addressing morphological abnormalities, can have divergent logical definitions. Pattern-based ontology development practices (52,53) are increasingly used to manage the generation of logical definitions. Rather than encoding logical definitions manually in OWL using an ontology editor, pattern-based development separates the blueprint of the logical definition (essentially the definition with placeholder variables) from the actual definition of the term, which is usually encoded in the form of a spreadsheet record. Members of the Monarch Initiative are contributing to community tools for pattern-based development using Dead Simple Ontology Design Patterns (DOSDP) (52) and the Ontology Development Kit (ODK).

To support the use of model organisms to further human health research, developers of the Mammalian Phenotype (MP) ontology (54) have collaborated with the HPO team to develop compatible logical definitions, but these efforts were restricted to comparison of individual definitions and resulted in manual changes to the respective ontologies. Pattern-based development offers a more accurate and scalable alternative by developing common patterns that all phenotype ontologies (i.e. all organisms) can refer to and that can be applied to a whole branch of an ontology at once. For example, the ‘increasedSize’ pattern defines a blueprint for a logical definition as follows: ‘‘has_part’ some (‘increased size’ and (‘inheres_in’ some %s) and (‘qualifier’ some ‘abnormal’))’. Using DOSDP in conjunction with the ODK, any phenotype ontology developer who needs to define a phenotype describing the increased size of something (such as an anatomical entity) can now simply commit to the increasedSize pattern. More than 40 patterns specifically for phenotype ontology development are currently available in the Uber-Phenotype (UPheno) repository.
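The separation of blueprint and per-term record can be illustrated with a few lines of Python. This is only a sketch of the idea: the pattern string is copied from the paragraph above (with straight quotes), whereas the function `instantiate`, the `EX:` identifiers and the two records are hypothetical, and the sketch does not reproduce the file formats or behaviour of the actual DOSDP and ODK tooling.

```python
# Blueprint for the increasedSize pattern; %s is the placeholder variable.
INCREASED_SIZE = (
    "'has_part' some ('increased size' and ('inheres_in' some %s) "
    "and ('qualifier' some 'abnormal'))"
)


def instantiate(pattern, entity_label):
    """Fill the single placeholder with a quoted entity label (hypothetical helper)."""
    return pattern % ("'" + entity_label + "'")


# Hypothetical spreadsheet-style records: one row per term to be defined.
rows = [
    {"defined_class": "EX:0000001", "entity": "liver"},
    {"defined_class": "EX:0000002", "entity": "spleen"},
]

definitions = {
    row["defined_class"]: instantiate(INCREASED_SIZE, row["entity"]) for row in rows
}

for term_id, expression in definitions.items():
    print(term_id, "EquivalentTo:", expression)
```

Because every row passes through the same blueprint, a change to the pattern propagates to all terms that use it, which is the property that makes the approach attractive for aligning several phenotype ontologies.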

The clinical features represented in HPO are connected via subclass relations. Other relationships between those classes hold as well, but have not previously been encoded computationally. For example, phenotype ontologies may have two separate classes to represent the increase and decrease in size of an anatomical entity such as the liver. To represent such relations, we have added opposite relations to all terms in HPO using a text and logic-based approach (see phenopposites GitHub repository under ‘Availability’).
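As a rough illustration of the text-based half of such an approach, the Python sketch below pairs terms whose labels differ only by an antonym. It is an assumption-laden simplification and not the method used in the phenopposites repository: the antonym list, the function names and the `EX:` terms are all hypothetical, and no logical definitions are consulted.

```python
# Hypothetical antonym table; a real approach would need a far richer list.
ANTONYMS = {"increased": "decreased", "decreased": "increased"}


def opposite_label(label):
    """Swap the first antonym word in a label; return None if none is found."""
    words = label.split()
    for index, word in enumerate(words):
        swap = ANTONYMS.get(word.lower())
        if swap is not None:
            if word[0].isupper():
                swap = swap.capitalize()
            return " ".join(words[:index] + [swap] + words[index + 1:])
    return None


def find_opposites(terms):
    """Return sorted (id_a, id_b) pairs whose labels differ only by an antonym."""
    by_label = {label.lower(): term_id for term_id, label in terms.items()}
    pairs = set()
    for term_id, label in terms.items():
        candidate = opposite_label(label)
        if candidate is None:
            continue
        other = by_label.get(candidate.lower())
        if other is not None and other != term_id:
            pairs.add(tuple(sorted((term_id, other))))
    return sorted(pairs)


# Made-up terms for illustration only; these are not HPO identifiers.
terms = {
    "EX:0000001": "Increased liver size",
    "EX:0000002": "Decreased liver size",
    "EX:0000003": "Increased spleen size",
}
print(find_opposites(terms))
```

A purely lexical match like this misses opposites that are named differently, which is one reason a logic-based check on the definitions is a useful complement.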

The Monarch Initiative has been a key organizer of a community effort to use pattern-based ontology development to reconcile logical definitions on a large scale across well-established and emerging phenotype ontologies including HPO, MP, and phenotype ontologies for Caenorhabditis elegans, Xenopus and Drosophila. To that end, we recently organized a Phenotype Ontology development and reconciliation workshop (Phenotype Ontologies Traversing All The Organisms: POTATO). At this workshop, more than 40 ontology curators, developers and biomedical experts came together to learn about our updated tool-chain for pattern-based development and to discuss discrepancies between the logical definitions across various phenotype ontologies. As a result of the meeting, representatives of all the phenotype ontologies have committed to an ongoing collaboration to align their respective ontologies by developing sets of common design patterns and using these to define terms in their ontologies. The outcome of these community efforts will be an integrated ecosystem of phenotype ontologies that can be leveraged in HPO-based clinical diagnostics and disease mechanism discovery.

DISEASE ANNOTATIONS

The HPO project provides a comprehensive set of computable definitions of rare diseases in the form of annotations which describe the clinical features (HPO terms) that characterize each disease. Each annotated feature can have metadata including its typical age of onset and the frequency (for instance, the HPO lists the frequency of Protrusio acetabuli [HP:0003179] in persons with Marfan syndrome as 113/146 based on a published clinical study (55)). Such annotation metadata can be used to improve the accuracy of the HPO-based matching algorithms (56).

Recent updates to our corpus of disease annotations include a new file format with robust representation of clinical modifiers, as well as migration to the Monarch Merged Disease Ontology (MONDO), which provides a unified set of disease terms and definitions with computationally declared equivalencies to resources such as OMIM and Orphanet. The annotation data is readily available for computational use via Monarch’s Biolink API (see resources below). We have also produced a new stand-alone tool to aid curation of the disease annotations.

Thirty-six new molecular phenotypes have been added to the HPO. These new terms were identified from metabolomics data provided by the Metabolomics Core from the Undiagnosed Disease Network, the Human Metabolome Database (HMDB) and articles related to inborn errors of metabolism. The new terms were curated in a spreadsheet that captured information about metabolite name, corresponding chemicals and their identifiers (ChEBI and HMDB), direction of change (increase/decrease), location of the abnormal metabolite concentration (blood, urine, cerebrospinal fluid), synonyms, gene/locus association, disease identifiers for associated diseases (OMIM or MONDO IDs) and key publication (PubMed IDs). For instance, an increased level of galactonate in red blood cell (HP:0410063) is associated with patients with galactosemia (MONDO:0018116; gene: GALT).

The new Clinical modifier subontology allows more expressive and precise disease definitions and can also be used to annotate individual patients. This subontology contains terms to describe severity, positionality and external factors that tend to trigger or ameliorate the features of a disease. The previous Onset subontology has been expanded to a Clinical course subontology, which additionally contains terms to describe mortality, progression of disease and the temporal pattern of features of disease (Figure 1). The frequency of features can be described in one of three ways (Table 2).

Figure 1. Overview of the clinical modifier (A, left) and clinical course (B, right) subontologies. These subontology terms can be used in combination with existing HPO terms to qualify and enrich their meaning. (C) A schematic presentation of one HPO annotation for the disease familial cold autoinflammatory syndrome 2 (FCAS2). In a publication on this disease, three of three reported patients were found to have episodic fever with infantile (or earlier) onset that was triggered by exposure to cold (63).

Figure 2. Screenshot of the new HPO Website application. Users can search for HPO terms, annotated diseases, or disease-associated genes using an autocomplete widget. The hierarchical structure of the ontology is shown in an abbreviated fashion for clarity’s sake. Only the direct parent and child terms of the currently displayed term are shown in the hierarchy. The total number of descendant terms is shown for each term in the hierarchy to help users decide which parts of the ontology to explore.

The HPO annotation file format had remained unchanged since the first publication of the HPO in 2008 (57); to accommodate the aforementioned new annotation resources, we have updated the annotation file format. This format has slots to capture clinical modifiers, sex-specific features of disease and to track the history of biocuration of terms (Table 3).

Table 3. New HPO annotation file format

Field | Item | Required | Example
1 | Database ID | Yes | MIM:154700, ORPHA:558 or MONDO:0007947
2 | DB_Name | Yes | Achondrogenesis, type IB
3 | Qualifier | No | NOT or empty
4 | HPO_ID | Yes | HP:0002487
5 | DB_Reference | Yes | OMIM:154700 or PMID:15517394
6 | Evidence | Yes | IEA
7 | Onset | No | HP:0003577
8 | Frequency | No | HP:0003577 or 12/45 or 22%
9 | Sex | No | MALE or FEMALE
10 | Modifier | No | HP:0025257
11 | Aspect | Yes | ‘P’ or ‘C’ or ‘I’ or ‘M’
12 | BiocurationBy | Yes | HPO:skoehler[YYYY-MM-DD]

The file contains 12 tab-separated fields, some of which can be left empty. The ‘Modifier’ and ‘BiocurationBy’ fields can contain multiple items separated by semicolons. For instance, to indicate that a disease is characterized by a skin rash (HP:0000988) that is Recurrent (HP:0031796) and Triggered by cold (HP:0025206) one would annotate HP:0031796;HP:0025206 in the Modifier column. Many annotations go through multiple stages of biocuration. In this case, the individual biocuration events are also added as a semicolon-separated list.
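A reader for one line of this format might look like the following Python sketch. The field order, the required fields and the semicolon-separated ‘Modifier’ and ‘BiocurationBy’ columns follow Table 3 and the note above; the function name, the snake_case keys and the example record (including the `EX:0001` disease ID, the placeholder PMID and the curator name) are invented for illustration and do not come from the HPO distribution.

```python
FIELD_NAMES = [
    "database_id", "db_name", "qualifier", "hpo_id", "db_reference", "evidence",
    "onset", "frequency", "sex", "modifier", "aspect", "biocuration_by",
]
# Fields marked "Yes" in the Required column of Table 3.
REQUIRED = {
    "database_id", "db_name", "hpo_id", "db_reference",
    "evidence", "aspect", "biocuration_by",
}
# Fields that may hold several items separated by semicolons.
MULTI_VALUED = {"modifier", "biocuration_by"}


def parse_annotation_line(line):
    """Split one tab-separated annotation line into a dict (hypothetical helper)."""
    columns = line.rstrip("\r\n").split("\t")
    if len(columns) != len(FIELD_NAMES):
        raise ValueError("expected 12 tab-separated fields, got %d" % len(columns))
    record = {}
    for name, raw in zip(FIELD_NAMES, columns):
        value = raw.strip()
        if name in REQUIRED and not value:
            raise ValueError("required field %r is empty" % name)
        if name in MULTI_VALUED:
            record[name] = [item for item in value.split(";") if item]
        else:
            record[name] = value
    return record


# Made-up record: a recurrent skin rash triggered by cold, seen in 3 of 3 patients.
example = "\t".join([
    "EX:0001", "Example disease", "", "HP:0000988", "PMID:0000000", "IEA",
    "", "3/3", "", "HP:0031796;HP:0025206", "P", "HPO:curator[2018-01-01]",
])
record = parse_annotation_line(example)
print(record["hpo_id"], record["modifier"])
```

Optional fields that are left empty come back as empty strings (or empty lists for the multi-valued columns), so downstream code can distinguish "not annotated" from a real value without special cases.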

A new tool called HPOWorkbench has been developed to enable browsing through HPO terms and annotations. It can generate GitHub issues directly and can be used by collaborators to provide feedback or new suggestions.

EXOMISER UPDATE

Exomiser uses the HPO to find potential disease-causing variants in whole-exome or whole-genome sequencing data. The last two major updates to the Exomiser software focused on decoupling data updates from the software release cycle and on enabling analysis of either GRCh37 or GRCh38 genomic samples. We updated the variant data sources to include allele frequency data from the gnomAD, TOPMed and UK10K datasets and added annotations for variant pathogenicity from ClinVar. We also added the ability for users to specify fine-grained maximum allele frequencies to be used for prioritizing alleles under different inheritance models and assigning these to likely syndromes based on the phenotype matches. Moreover, the Exomiser variant data sources have been decoupled not only from the software release cycle but also from the phenotype ontologies and disease annotations. This ensures that we can release Exomiser with the very latest disease and model organism annotations and that they can be updated on demand. These user-facing updates have happened against a background of continued engineering and performance improvements. As a result of this continued development and usage, the Exomiser also recently received the approval of the International Rare Diseases Research Consortium (IRDiRC) as a recognized resource. We have also been able to build on the choice of the HPO as the terminology for clinical phenotype data collection by the UK National Health Service (NHS) by introducing Exomiser as a key variant prioritization service for the 100 000 Genomes Project and the future NHS-commissioned service for rare disease genetic testing. Benchmarking on the solved cases to date shows that Exomiser can identify over 80% of the diagnoses in the top five candidates (unpublished communication from the 100 000 Genomes Project).
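A figure such as ‘diagnoses in the top five candidates’ is a top-k recall over solved cases. The sketch below shows how such a number could be computed from ranked candidate lists. It is not Exomiser code and does not use 100 000 Genomes Project data: the function name, the case identifiers and the gene placeholders are hypothetical.

```python
def top_k_recall(ranked, truth, k=5):
    """Fraction of solved cases whose diagnosed gene is among the first k candidates.

    ranked: dict mapping a case ID to its candidate genes, best first.
    truth:  dict mapping a case ID to the gene of the confirmed diagnosis.
    """
    if not truth:
        raise ValueError("no solved cases supplied")
    hits = sum(
        1 for case, gene in truth.items()
        if gene in ranked.get(case, [])[:k]
    )
    return hits / len(truth)


# Toy data with placeholder gene names; not real benchmark results.
TOY_RANKED = {
    "case1": ["GENE_A", "GENE_B", "GENE_C"],
    "case2": ["GENE_D", "GENE_E", "GENE_F", "GENE_G", "GENE_H", "GENE_I"],
    "case3": ["GENE_J"],
    "case4": ["GENE_K", "GENE_L"],
}
TOY_TRUTH = {
    "case1": "GENE_B",   # rank 2: counted
    "case2": "GENE_I",   # rank 6: outside the top five
    "case3": "GENE_J",   # rank 1: counted
    "case4": "GENE_L",   # rank 2: counted
}

if __name__ == "__main__":
    print(top_k_recall(TOY_RANKED, TOY_TRUTH, k=5))  # 0.75
```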

SYNONYMS AND TRANSLATIONS

One of the key advantages of ontologies is that semantic meaning is attached to concepts, rather than to their names. This enables each entity to have one or more synonyms, as well as translations into other languages. Multiple groups have taken advantage of this ability to create synonyms for HPO concepts for diverse settings, including enabling self-phenotyping by patients without medical expertise and enabling capture of data in diverse languages, with subsequent international sharing and analysis.

Patients themselves are an eager and untapped source of information about symptoms and phenotypes; however, medical terminology is often perplexing to them, making it difficult to use resources like the HPO. Further, some phenotypes go unnoticed by the clinician (such as those only seen at home). To enable patients to use the HPO directly and to improve collaboration and communication between patients and their physicians, we have recently added ‘layperson’ synonyms across the HPO (58). Approximately 36% of the HPO terms have at least one layperson synonym, 89% of the MONDO diseases annotated to HPO have at least one HPO annotation with a layperson synonym and 60% of all disease annotations refer to HPO terms with lay translations. These figures suggest that the layperson HPO would be useful in a diagnostic setting despite incomplete coverage. Efforts are currently underway to evaluate the diagnostic utility of the layHPO, both synthetically and in cohorts of previously diagnosed rare disease patients.
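The three percentages quoted above use three different denominators (terms, diseases and individual annotations), which is why term-level coverage can be low while disease-level coverage is high. The following sketch makes the distinction explicit on toy data. The function name and all identifiers are placeholders chosen for illustration, and the numbers it produces are unrelated to the real HPO statistics.

```python
def lay_coverage(all_terms, lay_terms, annotations):
    """Return the three coverage ratios for layperson synonyms.

    all_terms:   set of all term IDs in the ontology.
    lay_terms:   set of term IDs that have at least one layperson synonym.
    annotations: dict mapping a disease ID to the list of term IDs annotated to it.
    """
    flat = [term for terms in annotations.values() for term in terms]
    diseases_with_lay = sum(
        1 for terms in annotations.values()
        if any(term in lay_terms for term in terms)
    )
    return {
        # share of ontology terms that have a layperson synonym
        "terms": len(lay_terms & all_terms) / len(all_terms),
        # share of diseases with at least one annotation to such a term
        "diseases": diseases_with_lay / len(annotations),
        # share of individual annotations that point to such a term
        "annotations": sum(1 for term in flat if term in lay_terms) / len(flat),
    }


# Placeholder identifiers, not real HPO terms or diseases.
TOY_TERMS = {"T1", "T2", "T3", "T4"}
TOY_LAY = {"T1"}
TOY_ANNOTATIONS = {
    "D1": ["T1", "T2"],
    "D2": ["T1"],
    "D3": ["T3", "T4"],
}

if __name__ == "__main__":
    print(lay_coverage(TOY_TERMS, TOY_LAY, TOY_ANNOTATIONS))
```

In the toy data a single frequently annotated term gives 25% term coverage but covers two of the three diseases, the same pattern as in the figures reported above.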

The Sanford Health Imagenetics program has deployed an online screening tool for patients to self-report traits, signs, and symptoms in a questionnaire format that is mapped to HPO and leverages the layperson synonyms. This is integrated with the Sanford Imagenetics population-based genotyping initiative. The Genetic and Rare Diseases Information Center (GARD), a program of the National Center for Advancing Translational Sciences Office of Rare Diseases Research (NCATS-ORDR), provides reliable, public-friendly information for over 7000 genetic and/or rare diseases (59). GARD recently incorporated tables on the disease webpages that display information from the HPO including the medical terms for associated symptoms and phenotypic abnormalities, the related layperson synonyms, the frequency of the phenotypic features and the link to the HPO webpage for the specific term. By displaying the plain-language vocabulary along with the medical terminology, patients and families become familiar with the language they are commonly exposed to in the literature and clinical settings. The public utilizes the HPO medical terms and layperson synonyms to better understand the broad spectrum of clinical findings associated with a specific disease and to search and navigate the GARD website and other resources to retrieve information about multiple diseases associated with a given phenotype. Inclusion of the HPO data on the GARD website makes the disease webpages more robust, educates the rare disease community and empowers them to become partners in their medical care.

The labels, synonyms and textual definitions of the HPO are also being translated into several languages including French, Spanish, Italian, German, Dutch, Portuguese, Turkish, Japanese, Russian and Chinese; this is critical to ensure equitable health care and precision public health (see project homepage below). Tools such as PhenoTips (60) already make use of the existing Spanish and French translations, together with a user interface in those languages, to enable HPO-based phenotyping for clinicians who are not fluent in English. In the Spanish Undiagnosed Disease Network, clinicians phenotype patients in Spanish and then share the data with the Matchmaker Exchange (13). One further example is the Life Languages project in Western Australia (WA), which is using the HPO to translate medical and biological terms into partner Aboriginal Australian languages. This is being integrated with HPO term extraction from 3D facial images as part of the Pilbara Faces program in remote WA.

NEW HPO WEBSITE

The HPO website application has been redesigned and rebuilt from the ground up to be both more responsive and more intuitive (Figure 2). Thanks to a single-page app approach and lightweight microservices, the new application loads faster and supports intuitive search capabilities, such as auto-complete and term highlighting, that allow the user to browse efficiently through the ontology data and the corresponding hierarchy. The HPO website uses the ProtVista tool to display genes and genetic variants associated with Mendelian diseases (61). The redesign also sets the stage for better integration with monarchinitiative.org to facilitate exploration of similar genes and phenotypes across species.
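The abbreviated hierarchy view (direct parents, the current term, and direct children with their descendant counts) can be sketched in a few lines. This is an illustration under stated assumptions, not the website's implementation: the function names and the toy ontology are hypothetical, and the ontology is represented simply as a mapping from each term to its direct children. Because the HPO allows a term to have more than one parent, descendants are collected in a set so that a term reachable by two paths is counted once.

```python
def descendants(term, children):
    """Return every term below `term` in a DAG given as {parent: [children]}."""
    seen = set()
    stack = list(children.get(term, []))
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(children.get(current, []))
    return seen


def abbreviated_view(term, children):
    """Direct parents, the term itself, and direct children with descendant counts."""
    parents = sorted(p for p, kids in children.items() if term in kids)
    return {
        "parents": parents,
        "term": term,
        "children": {
            child: len(descendants(child, children))
            for child in sorted(children.get(term, []))
        },
    }


# Toy ontology with placeholder labels; D has two parents (A and B).
TOY_CHILDREN = {
    "root": ["A", "B"],
    "A": ["C", "D"],
    "B": ["D"],
    "D": ["E"],
}

if __name__ == "__main__":
    print(abbreviated_view("root", TOY_CHILDREN))
    # {'parents': [], 'term': 'root', 'children': {'A': 3, 'B': 2}}
```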

HPO FOR MEDICAL EDUCATION

Clinical features in HPO are also connected to disease nosologies (medical classification schemes) such as ORDO, OMIM, and MONDO. These relationships are typically curated from literature; however, they can also be crowd-sourced. Phenotate (http://phenotate.org), which was developed in the framework of the HIPBI-RD project, is a web-based tool that allows undergraduate or medical students, as well as medical residents, to annotate OMIM and ORDO diseases with HPO phenotypes by completing classroom exercises. Students are encouraged to refer to the literature to select the correct symptoms and enter the references used into their annotations. In a second-year undergraduate molecular genetics class (MGY200) at the University of Toronto, 78 students used Phenotate to annotate three genetic diseases: Marfan syndrome (MFS), Friedreich’s ataxia (FRDA) and congenital myasthenic syndrome. Overall, students collectively provided more comprehensive annotations than clinicians who also submitted annotations. Phenotate is an open platform, available for use by anyone teaching genetics. By crowdsourcing annotations, Phenotate hopes to improve the HPO and related nosologies, while also offering students an educational tool that supplements their coursework.

CONCLUSION

In the 2 years since the previous Nucleic Acids Research database article (12), the HPO has continued to grow in both reach and scope. The HPO has put a strong emphasis on working with interested members of the community to revise and extend individual areas of the HPO, and we welcome interactions with more groups in any area of medicine. The HPO project has begun to develop resources for laypersons to interact with the HPO and software designed for patients. Annotations and the representation of phenotypes in the HPO have been greatly improved for several areas of medicine thanks to community interactions.

DATA AVAILABILITY

FUNDING

National Institutes of Health (NIH), Monarch Initiative [OD #5R24OD011883]; Forums for Integrative Phenomics [U13 CA221044-01]; NCATS Data Translator [1OT3TR002019]; NCATS National Center for Digital Health Informatics Innovation [U24 TR002306]; NIH Data Commons [1 OT3 OD02464-01 UNCCH]; Cost Action CA 16118 Neuro-MIG; British Heart Foundation Programme Grant [RG/13/5/30112]; Division of Intramural Research; NIAID; NIH; E-RARE project Hipbi-RD [01GM1608]; European Union’s Horizon 2020 Research and Innovation Programme [779257]. Funding for open access charge: NIH; Donald A. Roux Family Fund (to P.N.R.).

Conflict of interest statement. None declared.

REFERENCES

1. Delude C.M. Deep phenotyping: the details of disease. Nature. 2015; 527:S14–S15.

2. Robinson P.N. Deep phenotyping for precision medicine. Hum. Mutat. 2012; 33:777–780.

3. Mungall C.J., Washington N.L., Nguyen-Xuan J., Condit C., Smedley D., Köhler S., Groza T., Shefchek K., Hochheiser H., Robinson P.N. et al. Use of model organism and disease databases to support matchmaking for human disease gene discovery. Hum. Mutat. 2015; 36:979–984.

4. Mungall C.J., McMurry J.A., Köhler S., Balhoff J.P., Borromeo C., Brush M., Carbon S., Conlin T., Dunn N., Engelstad M. et al. The Monarch Initiative: an integrative data and analytic platform connecting phenotypes to genotypes across species. Nucleic Acids Res. 2017; 45:D712–D722.

5. Ramoni R.B., Mulvihill J.J., Adams D.R., Allard P., Ashley E.A., Bernstein J.A., Gahl W.A., Hamid R., Loscalzo J., McCray A.T. et al. The undiagnosed diseases network: Accelerating discovery about health and disease. Am. J. Hum. Genet. 2017; 100:185–192.

6. Taruscio D., Groft S.C., Cederroth H., Melegh B., Lasko P., Kosaki K., Baynam G., McCray A., Gahl W.A. Undiagnosed Diseases Network International (UDNI): white paper for global actions to meet patient needs. Mol. Genet. Metab. 2015; 116:223–225.

7. Gahl W.A., Mulvihill J.J., Toro C., Markello T.C., Wise A.L., Ramoni R.B., Adams D.R., Tifft C.J., UDN. The NIH Undiagnosed Diseases Program and Network: applications to modern medicine. Mol. Genet. Metab. 2016; 117:393–400.

8. Gall T., Valkanas E., Bello C., Markello T., Adams C., Bone W.P., Brandt A.J., Brazill J.M., Carmichael L., Davids M. et al. Defining disease, diagnosis, and translational medicine within a homeostatic perturbation paradigm: The national institutes of health undiagnosed diseases program experience. Front. Med. 2017; 4:62.

9. Thompson R., Johnston L., Taruscio D., Monaco L., Béroud C., Gut I.G., Hansson M.G., 't Hoen P.B., Patrinos G.P., Dawkins H. et al. RD-Connect: an integrated platform connecting databases, registries, biobanks and clinical bioinformatics for rare disease research. J. Gen. Intern. Med. 2014; 29:S780–S787.

10. Boycott K.M., Rath A., Chong J.X., Hartley T., Alkuraya F.S., Baynam G., Brookes A.J., Brudno M., Carracedo A., den Dunnen J.T. et al. International cooperation to enable the diagnosis of all rare genetic diseases. Am. J. Hum. Genet. 2017; 100:695–705.

11. Philippakis A.A., Azzariti D.R., Beltran S., Brookes A.J., Brownstein C.A., Brudno M., Brunner H.G., Buske O.J., Carey K., Doll C. et al. The Matchmaker Exchange: a platform for rare disease gene discovery. Hum. Mutat. 2015; 36:915–921.

12. Köhler S., Vasilevsky N.A., Engelstad M., Foster E., McMurry J., Aymé S., Baynam G., Bello S.M., Boerkoel C.F., Boycott K.M. et al. The human phenotype ontology in 2017. Nucleic Acids Res. 2017; 45:D865–D876.

13. Köhler S., Doelken S.C., Mungall C.J., Bauer S., Firth H.V., Bailleul-Forestier I., Black G.C., Brown D.L., Brudno M., Campbell J. et al. The Human Phenotype Ontology project: linking molecular biology and disease through phenotype data. Nucleic Acids Res. 2014; 42:D966–D974.

14. Taylor R.L., Parry N.R.A., Barton S.J., Campbell C., Delaney C.M., Ellingford J.M., Hall G., Hardcastle C., Morarji J., Nichol E.J. et al. Panel-Based clinical genetic testing in 85 children with inherited retinal disease. Ophthalmology. 2017; 124:985–991.

15. Fang H., Wu Y., Yang H., Yoon M., Jiménez-Barrón L.T., Mittelman D., Robison R., Wang K., Lyon G.J. Whole genome sequencing of one complex pedigree illustrates challenges with genomic medicine. BMC Med. Genomics. 2017; 10:10.

16. Posey J.E., Rosenfeld J.A., James R.A., Bainbridge M., Niu Z., Wang X., Dhar S., Wiszniewski W., Akdemir Z.H., Gambin T. et al. Molecular diagnostic experience of whole-exome sequencing in adult patients. Genet. Med. 2016; 18:678–685.

17. Retterer K., Juusola J., Cho M.T., Vitazka P., Millan F., Gibellini F., Vertino-Bell A., Smaoui N., Neidich J., Monaghan K.G. et al. Clinical application of whole-exome sequencing across clinical indications. Genet. Med. 2016; 18:696–704.

18. Zhu Q., Liu H., Chute C.G., Ferber M. EHR based genetic testing knowledge base (iGTKB) development. BMC Med. Inform. Decis. Mak. 2015; 15:S3.

19. Fujiwara T., Yamamoto Y., Kim J.-D., Buske O., Takagi T. PubCaseFinder: A case-report-based, phenotype-driven differential-diagnosis system for rare diseases. Am. J. Hum. Genet. 2018; 103:389–399.

20. Baker K., Gordon S.L., Melland H., Bumbak F., Scott D.J., Jiang T.J., Owen D., Turner B.J., Boyd S.G., Rossi M. et al. SYT1-associated neurodevelopmental disorder: a case series. Brain. 2018; 141:2576–2591.

21. Thiffault I., Farrow E., Zellmer L., Berrios C., Miller N., Gibson M., Caylor R., Jenkins J., Faller D., Soden S. et al. Clinical genome sequencing in an unbiased pediatric cohort. Genet. Med. 2018; doi:10.1038/s41436-018-0075-8.

22. Stokman M.F., van der Zwaag B., van de Kar N.C.A.J., van Haelst M.M., van Eerde A.M., van der Heijden J.W., Kroes H.Y., Ippel E., Schulp A.J.A., van Gassen K.L. et al. Clinical and genetic analyses of a Dutch cohort of 40 patients with a nephronophthisis-related ciliopathy. Pediatr. Nephrol. 2018; 33:1701–1712.

23. Short P.J., McRae J.F., Gallone G., Sifrim A., Won H., Geschwind D.H., Wright C.F., Firth H.V., FitzPatrick D.R., Barrett J.C. et al. De novo mutations in regulatory elements in neurodevelopmental disorders. Nature. 2018; 555:611–616.

24. Tumienė B., Maver A., Writzl K., Hodžić A., Čuturilo G., Kuzmanić-Šamija R., Čulić V., Peterlin B. Diagnostic exome sequencing of syndromic epilepsy patients in clinical practice. Clin. Genet. 2018; 93:1057–1062.

25. Trujillano D., Bertoli-Avella A.M., Kumar Kandaswamy K., Weiss M.E., Köster J., Marais A., Paknia O., Schröder R., Garcia-Aznar J.M., Werber M. et al. Clinical exome sequencing: results from 2819 samples reflecting 1000 families. Eur. J. Hum. Genet. 2017; 25:176–182.

26. Meyer K., Kirchner M., Uyar B., Cheng J.-Y., Russo G., Hernandez-Miranda L.R., Szymborska A., Zauber H., Rudolph I.M., Willnow T.E. et al. Mutations in disordered regions can cause disease by creating dileucine motifs. Cell. 2018; 175:239–253.

27. Chen C., Chen D., Xue H., Liu X., Zhang T., Tang S., Li W., Xu X. IDGenetics: a comprehensive database for genes and mutations of intellectual disability related disorders. Neurosci. Lett. 2018; 685:96–101.

28. Haghighi A., Krier J.B., Toth-Petroczy A., Cassa C.A., Frank N.Y., Carmichael N., Fieg E., Bjonnes A., Mohanty A., Briere L.C. et al. An integrated clinical program and crowdsourcing strategy for genomic sequencing and Mendelian disease gene discovery. NPJ Genome Med. 2018; 3:21.

29. Doğan T. HPO2GO: prediction of human phenotype ontology term associations for proteins using cross ontology annotation co-occurrences. PeerJ. 2018; 6:e5298.

30. Rao A., Vg S., Joseph T., Kotte S., Sivadasan N., Srinivasan R. Phenotype-driven gene prioritization for rare diseases using graph convolution on heterogeneous networks. BMC Med. Genomics. 2018; 11:57.

31. MacLennan A.H., Kruer M.C., Baynam G., Moreno-De-Luca A., Wilson Y.A., Zhu C., Wintle R.F., Gecz J., members of the International Cerebral Palsy Genomics Consortium. Cerebral palsy and genomics: an international consortium. Dev. Med. Child Neurol. 2018; 60:209–210.

32. Saklatvala J.R., Dand N., Simpson M.A. Text-mined phenotype annotation and vector-based similarity to improve identification of similar phenotypes and causative genes in monogenic disease patients. Hum. Mutat. 2018; 39:643–652.

33. Adler A., Kirchmeier P., Reinhard J., Brauner B., Dunger I., Fobo G., Frishman G., Montrone C., Mewes H.W., Arnold M. et al. PhenoDis: a comprehensive database for phenotypic characterization of rare cardiac diseases. Orphanet. J. Rare Dis. 2018; 13:22.

34. Cornish A.J., David A., Sternberg M.J.E. PhenoRank: reducing study bias in gene prioritization through simulation. Bioinformatics. 2018; 34:2087–2095.

35. Singh T., Kurki M.I., Curtis D., Purcell S.M., Crooks L., McRae J., Suvisaari J., Chheda H., Blackwood D., Breen G. et al. Rare loss-of-function variants in SETD1A are associated with schizophrenia and developmental disorders. Nat. Neurosci. 2016; 19:571–577.

36. Posey J.E., Harel T., Liu P., Rosenfeld J.A., James R.A., Coban Akdemir Z.H., Walkiewicz M., Bi W., Xiao R., Ding Y. et al. Resolution of disease phenotypes resulting from multilocus genomic variation. N. Engl. J. Med. 2017; 376:21–31.

37. Beck T., Hastings R.K., Gollapudi S., Free R.C., Brookes A.J. GWAS Central: a comprehensive resource for the comparison and interrogation of genome-wide association studies. Eur. J. Hum. Genet. 2014; 22:949–952.

38. Li M.J., Liu Z., Wang P., Wong M.P., Nelson M.R., Kocher J.-P.A., Yeager M., Sham P.C., Chanock S.J., Xia Z. et al. GWASdb v2: an update database for human genetic variants identified by genome-wide association studies. Nucleic Acids Res. 2016; 44:D869–D876.

39. Sveinbjornsson G., Albrechtsen A., Zink F., Gudjonsson S.A., Oddson A., Másson G., Holm H., Kong A., Thorsteinsdottir U., Sulem P. et al. Weighting sequence variants based on their annotation increases power of whole-genome association studies. Nat. Genet. 2016; 48:314–317.

40. Bastarache L., Hughey J.J., Hebbring S., Marlo J., Zhao W., Ho W.T., Van Driest S.L., McGregor T.L., Mosley J.D., Wells Q.S. et al. Phenotype risk scores identify patients with unrecognized Mendelian disease patterns. Science. 2018; 359:1233–1239.

41. Son J.H., Xie G., Yuan C., Ena L., Li Z., Goldstein A., Huang L., Wang L., Shen F., Liu H. et al. Deep phenotyping on electronic health records facilitates genetic diagnosis by clinical exomes. Am. J. Hum. Genet. 2018; 103:58–73.

42. Segal M.M., Rahm A.K., Hulse N.C., Wood G., Williams J.L., Feldman L., Moore G.J., Gehrum D., Yefko M., Mayernick S. et al. Experience with integrating diagnostic decision support software with electronic health records: Benefits versus risks of information sharing. EGEMS. 2017; 5:23.

43. Smith C.L., Goldsmith C.-A.W., Eppig J.T. The Mammalian Phenotype Ontology as a tool for annotating, analyzing and comparing phenotypic information. Genome Biol. 2005; 6:R7.

44. Haendel M.A., Balhoff J.P., Bastian F.B., Blackburn D.C., Blake J.A., Bradford Y., Comte A., Dahdul W.M., Dececchi T.A., Druzinsky R.E. et al. Unification of multi-species vertebrate anatomy ontologies for comparative biology in Uberon. J. Biomed. Semantics. 2014; 5:21.

45. Bard J., Rhee S.Y., Ashburner M. An ontology for cell types. Genome Biol. 2005; 6:R21.

46. Meehan T.F., Conte N., West D.B., Jacobsen J.O., Mason J., Warren J., Chen C.K., Tudose I., Relac M., Matthews P. et al. Disease model discovery from 3,328 gene knockouts by The International Mouse Phenotyping Consortium. Nat. Genet. 2017; 49:1231–1238.

47. Robinson P.N., Köhler S., Oellrich A., Sanger Mouse Genetics Project, Wang K., Mungall C.J., Lewis S.E., Washington N., Bauer S., Seelow D.S. et al. Improved exome prioritization of disease genes through cross-species phenotype comparison. Genome Res. 2014; 24:340–348.

48. Shimoyama M., De Pons J., Hayman G.T., Laulederkind S.J.F., Liu W., Nigam R., Petri V., Smith J.R., Tutaj M., Wang S.J. et al. The Rat Genome Database 2015: genomic, phenotypic and environmental variations and disease. Nucleic Acids Res. 2015; 43:D743–D750.

49. Lochmüller H., Badowska D.M., Thompson R., Knoers N.V., Aartsma-Rus A., Gut I., Wood L., Harmuth T., Durudas A., Graessner H. et al. RD-Connect, NeurOmics and EURenOmics: collaborative European initiative for rare diseases. Eur. J. Hum. Genet. 2018; 26:778–785.

50. Maiella S., Olry A., Hanauer M., Lanneau V., Lourghi H., Donadille B., Rodwell C., Köhler S., Seelow D., Jupp S. et al. Harmonising phenomics information for a better interoperability in the rare disease field. Eur. J. Med. Genet. 2018; doi:10.1016/j.ejmg.2018.01.013.

51. Köhler S., Bauer S., Mungall C.J., Carletti G., Smith C.L., Schofield P., Gkoutos G.V., Robinson P.N. Improving ontologies by automatic reasoning and evaluation of logical definitions. BMC Bioinformatics. 2011; 12:418.

52. Osumi-Sutherland D., Courtot M., Balhoff J.P., Mungall C. Dead simple OWL design patterns. J. Biomed. Semantics. 2017; 8:18.

53. Xiang Z., Zheng J., Lin Y., He Y. Ontorat: automatic generation of new ontology terms, annotations, and axioms based on ontology design patterns. J. Biomed. Semantics. 2015; 6:4.

54. Smith C.L., Eppig J.T. The Mammalian Phenotype Ontology as a unifying standard for experimental and high-throughput phenotyping data. Mamm. Genome. 2012; 23:653–668.

55. Chun K.J., Yang J.H., Jang S.Y., Lee S.H., Gwag H.B., Chung T.-Y., Huh J., Ki C.S., Sung K., Choi S.H. et al. Analysis of protrusio acetabuli using a CT-based diagnostic method in korean patients with marfan syndrome: Prevalence and association with other manifestations. J. Korean Med. Sci. 2015; 30:1260–1265.

56. Köhler S. Improved ontology-based similarity calculations using a study-wise annotation model. Database. 2018; 2018:doi:10.1093/database/bay026.

57. Robinson P.N., Köhler S., Bauer S., Seelow D., Horn D., Mundlos S. The Human Phenotype Ontology: a tool for annotating and analyzing human hereditary disease. Am. J. Hum. Genet. 2008; 83:610–615.

58. Vasilevsky N.A., Foster E.D., Engelstad M.E., Carmody L., Might M., Chambers C., Dawkins H.J.S., Lewis J., Della Rocca M.G., Snyder M. et al. Plain-language medical vocabulary for precision diagnosis. Nat. Genet. 2018; 50:474–476.

59. Lewis J., Snyder M., Hyatt-Knorr H. Marking 15 years of the genetic and rare diseases information center. Transl. Sci. Rare Dis. 2017; 2:77–88.

60. Girdea M., Dumitriu S., Fiume M., Bowdin S., Boycott K.M., Chénier S., Chitayat D., Faghfoury H., Meyn M.S., Ray P.N. et al. PhenoTips: Patient phenotyping software for clinical and research use. Hum. Mutat. 2013; 34:1057–1065.

61. Watkins X., Garcia L.J., Pundir S., Martin M.J., UniProt Consortium. ProtVista: visualization of protein sequence annotations. Bioinformatics. 2017; 33:2040–2041.

62. Bauer S., Köhler S., Schulz M.H., Robinson P.N. Bayesian ontology querying for accurate and noise-tolerant semantic searches. Bioinformatics. 2012; 28:2502–2508.

63. Jéru I., Duquesnoy P., Fernandes-Alnemri T., Cochet E., Yu J.W., Lackmy-Port-Lis M., Grimprel E., Landman-Parker J., Hentgen V., Marlin S. et al. Mutations in NALP12 cause hereditary periodic fever syndromes. Proc. Natl. Acad. Sci. U.S.A. 2008; 105:1614–1619.

© The Author(s) 2018. Published by Oxford University Press on behalf of Nucleic Acids Research.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
