IOS Press Ebooks - Why Discourse Structures in Medical Reports Matter for the Validity of Automatically Generated Text Knowledge Bases (original) (raw)
Why Discourse Structures in Medical Reports Matter for the Validity of Automatically Generated Text Knowledge Bases
Authors
Udo Hahn, Martin Romacker, Stefan Schulz
Pages
633 - 638
DOI
10.3233/978-1-60750-896-0-633
Series
Ebook
Abstract
The automatic analysis of medical full-texts currently suffers from neglecting text coherence phenomena such as reference relations between discourse units. This has unwarranted effects on the description adequacy of medical knowledge bases automatically generated from texts. The resulting representation bias can be characterized in terms of artificially fragmented, incomplete and invalid knowledge structures. We discuss three types of textual phenomena (pronominal and nominal anaphora, as well as textual ellipsis) and outline basic methodologies how to deal with them.