Development of a benchmark corpus to support the automatic extraction of drug-related adverse effects from medical case reports - PubMed (original) (raw)
Development of a benchmark corpus to support the automatic extraction of drug-related adverse effects from medical case reports
Harsha Gurulingappa et al. J Biomed Inform. 2012 Oct.
Free article
Abstract
A significant amount of information about drug-related safety issues such as adverse effects are published in medical case reports that can only be explored by human readers due to their unstructured nature. The work presented here aims at generating a systematically annotated corpus that can support the development and validation of methods for the automatic extraction of drug-related adverse effects from medical case reports. The documents are systematically double annotated in various rounds to ensure consistent annotations. The annotated documents are finally harmonized to generate representative consensus annotations. In order to demonstrate an example use case scenario, the corpus was employed to train and validate models for the classification of informative against the non-informative sentences. A Maximum Entropy classifier trained with simple features and evaluated by 10-fold cross-validation resulted in the F₁ score of 0.70 indicating a potential useful application of the corpus.
Copyright © 2012 Elsevier Inc. All rights reserved.
Similar articles
- Portable automatic text classification for adverse drug reaction detection via multi-corpus training.
Sarker A, Gonzalez G. Sarker A, et al. J Biomed Inform. 2015 Feb;53:196-207. doi: 10.1016/j.jbi.2014.11.002. Epub 2014 Nov 8. J Biomed Inform. 2015. PMID: 25451103 Free PMC article. - The EU-ADR corpus: annotated drugs, diseases, targets, and their relationships.
van Mulligen EM, Fourrier-Reglat A, Gurwitz D, Molokhia M, Nieto A, Trifiro G, Kors JA, Furlong LI. van Mulligen EM, et al. J Biomed Inform. 2012 Oct;45(5):879-84. doi: 10.1016/j.jbi.2012.04.004. Epub 2012 Apr 25. J Biomed Inform. 2012. PMID: 22554700 - Linguistic analysis of large-scale medical incident reports for patient safety.
Fujita K, Akiyama M, Park K, Yamaguchi EN, Furukawa H. Fujita K, et al. Stud Health Technol Inform. 2012;180:250-4. Stud Health Technol Inform. 2012. PMID: 22874190 - Documentation in pharmacovigilance: using an ontology to extend and normalize Pubmed queries.
Delamarre D, Lillo-Le Louët A, Guillot L, Jamet A, Sadou E, Ouazine T, Burgun A, Jaulent MC. Delamarre D, et al. Stud Health Technol Inform. 2010;160(Pt 1):518-22. Stud Health Technol Inform. 2010. PMID: 20841741 - Application of the intelligent techniques in transplantation databases: a review of articles published in 2009 and 2010.
Sousa FS, Hummel AD, Maciel RF, Cohrs FM, Falcão AE, Teixeira F, Baptista R, Mancini F, da Costa TM, Alves D, Pisa IT. Sousa FS, et al. Transplant Proc. 2011 May;43(4):1340-2. doi: 10.1016/j.transproceed.2011.02.028. Transplant Proc. 2011. PMID: 21620124 Review.
Cited by
- EnzChemRED, a rich enzyme chemistry relation extraction dataset.
Lai PT, Coudert E, Aimo L, Axelsen K, Breuza L, de Castro E, Feuermann M, Morgat A, Pourcel L, Pedruzzi I, Poux S, Redaschi N, Rivoire C, Sveshnikova A, Wei CH, Leaman R, Luo L, Lu Z, Bridge A. Lai PT, et al. Sci Data. 2024 Sep 9;11(1):982. doi: 10.1038/s41597-024-03835-7. Sci Data. 2024. PMID: 39251610 Free PMC article. - BERT-based language model for accurate drug adverse event extraction from social media: implementation, evaluation, and contributions to pharmacovigilance practices.
Dong F, Guo W, Liu J, Patterson TA, Hong H. Dong F, et al. Front Public Health. 2024 Apr 23;12:1392180. doi: 10.3389/fpubh.2024.1392180. eCollection 2024. Front Public Health. 2024. PMID: 38716250 Free PMC article. - Surveying biomedical relation extraction: a critical examination of current datasets and the proposal of a new resource.
Huang MS, Han JC, Lin PY, You YT, Tsai RT, Hsu WL. Huang MS, et al. Brief Bioinform. 2024 Mar 27;25(3):bbae132. doi: 10.1093/bib/bbae132. Brief Bioinform. 2024. PMID: 38609331 Free PMC article. Review. - Using transfer learning-based causality extraction to mine latent factors for Sjögren's syndrome from biomedical literature.
VanSchaik JT, Jain P, Rajapuri A, Cheriyan B, Thyvalikakath TP, Chakraborty S. VanSchaik JT, et al. Heliyon. 2023 Aug 22;9(9):e19265. doi: 10.1016/j.heliyon.2023.e19265. eCollection 2023 Sep. Heliyon. 2023. PMID: 37809371 Free PMC article. - Revisiting Relation Extraction in the era of Large Language Models.
Wadhwa S, Amir S, Wallace BC. Wadhwa S, et al. Proc Conf Assoc Comput Linguist Meet. 2023 Jul;2023:15566-15589. doi: 10.18653/v1/2023.acl-long.868. Proc Conf Assoc Comput Linguist Meet. 2023. PMID: 37674787 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical