IOS Press Ebooks - Why Discourse Structures in Medical Reports Matter for the Validity of Automatically Generated Text Knowledge Bases (original) (raw)

Why Discourse Structures in Medical Reports Matter for the Validity of Automatically Generated Text Knowledge Bases

Authors

Udo Hahn, Martin Romacker, Stefan Schulz

Pages

633 - 638

DOI

10.3233/978-1-60750-896-0-633

Series

Ebook

Abstract

The automatic analysis of medical full-texts currently suffers from neglecting text coherence phenomena such as reference relations between discourse units. This has unwarranted effects on the description adequacy of medical knowledge bases automatically generated from texts. The resulting representation bias can be characterized in terms of artificially fragmented, incomplete and invalid knowledge structures. We discuss three types of textual phenomena (pronominal and nominal anaphora, as well as textual ellipsis) and outline basic methodologies how to deal with them.