Biomedical word sense disambiguation with ontologies and metadata: automation meets accuracy
Related papers
Using Ontology Fingerprints to disambiguate gene name entities in the biomedical literature
Database: the journal of biological databases and curation, 2015
Ambiguous gene names in the biomedical literature are a barrier to accurate information extraction. To overcome this hurdle, we generated Ontology Fingerprints for selected genes that are relevant for personalized cancer therapy. These Ontology Fingerprints were used to evaluate the association between genes and biomedical literature and thereby to disambiguate gene names. We obtained 93.6% precision for the test gene set and an area under the receiver operating characteristic curve of 80.4% for gene and article association. The core algorithm was implemented using a graphics processing unit-based MapReduce framework to handle big data and to improve performance. We conclude that Ontology Fingerprints can help disambiguate gene names mentioned in text and analyse the association between genes and articles. Database URL: http://www.ontologyfingerprint.org.
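The idea of scoring an article against per-gene sets of ontology terms can be illustrated with a minimal sketch. This is not the paper's algorithm: the fingerprint contents, the weights, the substring matching, and the names FINGERPRINTS, score_article, and disambiguate are all assumptions made for illustration, and the real system may weight and match terms quite differently.

```python
from typing import Dict, Tuple

# Hypothetical fingerprints: candidate sense -> {ontology term: weight}.
# Terms and weights are invented for illustration; they are not taken
# from the Ontology Fingerprint database.
FINGERPRINTS: Dict[str, Dict[str, float]] = {
    "CAT (catalase gene)": {
        "hydrogen peroxide": 3.0,
        "oxidative stress": 2.0,
        "peroxisome": 2.5,
    },
    "cat (animal)": {
        "feline": 3.0,
        "veterinary": 2.0,
        "allergen": 1.5,
    },
}


def score_article(text: str, fingerprint: Dict[str, float]) -> float:
    """Sum the weights of fingerprint terms that occur in the text.

    Assumption: plain case-insensitive substring matching stands in for
    proper ontology term recognition.
    """
    lowered = text.lower()
    return sum(weight for term, weight in fingerprint.items() if term in lowered)


def disambiguate(
    text: str, fingerprints: Dict[str, Dict[str, float]]
) -> Tuple[str, Dict[str, float]]:
    """Return the best-scoring sense together with all per-sense scores."""
    scores = {sense: score_article(text, fp) for sense, fp in fingerprints.items()}
    best = max(scores, key=scores.get)
    return best, scores


if __name__ == "__main__":
    abstract = (
        "Catalase (CAT) activity in the peroxisome protects cells "
        "from hydrogen peroxide."
    )
    sense, all_scores = disambiguate(abstract, FINGERPRINTS)
    print(sense, all_scores)
```

In this toy setting, an ambiguous mention is assigned to whichever candidate's term set overlaps most strongly with the article. A practical system would also need a score threshold so that articles matching no fingerprint are left unassigned.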
Assigning Gene Ontology terms to biotext by classification methods
Biomedical literature databases constitute valuable repositories of up-to-date scientific knowledge. The development of efficient classification methods to facilitate the organization of these databases and the extraction of novel biomedical knowledge is becoming increasingly important. Several of these methods use bio-ontologies, such as the Gene Ontology, to concisely describe and classify biological documents.