Constructing biological knowledge bases by extracting information from text sources - PubMed (original) (raw)

Affiliations

PMID: 10786289

Constructing biological knowledge bases by extracting information from text sources

M Craven et al. Proc Int Conf Intell Syst Mol Biol. 1999.

Abstract

Recently, there has been much effort in making databases for molecular biology more accessible and interoperable. However, information in text form, such as MEDLINE records, remains a greatly underutilized source of biological information. We have begun a research effort aimed at automatically mapping information from text sources into structured representations, such as knowledge bases. Our approach to this task is to use machine-learning methods to induce routines for extracting facts from text. We describe two learning methods that we have applied to this task--a statistical text classification method, and a relational learning method--and our initial experiments in learning such information-extraction routines. We also present an approach to decreasing the cost of learning information-extraction routines by learning from "weakly" labeled training data.

PubMed Disclaimer

Cited by

Biomedical literature mining: graph kernel-based learning for gene-gene interaction extraction.
Hsieh AR, Tsai CY. Hsieh AR, et al. Eur J Med Res. 2024 Aug 2;29(1):404. doi: 10.1186/s40001-024-01983-5. Eur J Med Res. 2024. PMID: 39095899 Free PMC article.
Chemical-protein relation extraction with ensembles of carefully tuned pretrained language models.
Weber L, Sänger M, Garda S, Barth F, Alt C, Leser U. Weber L, et al. Database (Oxford). 2022 Nov 18;2022:baac098. doi: 10.1093/database/baac098. Database (Oxford). 2022. PMID: 36399413 Free PMC article.
Auto-CORPus: A Natural Language Processing Tool for Standardizing and Reusing Biomedical Literature.
Beck T, Shorter T, Hu Y, Li Z, Sun S, Popovici CM, McQuibban NAR, Makraduli F, Yeung CS, Rowlands T, Posma JM. Beck T, et al. Front Digit Health. 2022 Feb 15;4:788124. doi: 10.3389/fdgth.2022.788124. eCollection 2022. Front Digit Health. 2022. PMID: 35243479 Free PMC article.
Automated extraction of genes associated with antibiotic resistance from the biomedical literature.
Brincat A, Hofmann M. Brincat A, et al. Database (Oxford). 2022 Jan 29;2022(2022):baab077. doi: 10.1093/database/baab077. Database (Oxford). 2022. PMID: 35134132 Free PMC article.
Large-scale protein-protein post-translational modification extraction with distant supervision and confidence calibrated BioBERT.
Elangovan A, Li Y, Pires DEV, Davis MJ, Verspoor K. Elangovan A, et al. BMC Bioinformatics. 2022 Jan 4;23(1):4. doi: 10.1186/s12859-021-04504-x. BMC Bioinformatics. 2022. PMID: 34983371 Free PMC article.

MeSH terms

LinkOut - more resources

Other Literature Sources
- The Lens - Patent Citations

Constructing biological knowledge bases by extracting information from text sources - PubMed (original) (raw)