Characterizing the Response Space of Questions: a Corpus Study for English and Polish (original) (raw)

Characterizing the Response Space of Questions: data and theory

Dialogue & Discourse

The main aim of this paper is to provide a characterization of the response space for questions using a taxonomy grounded in a dialogical formal semantics. As a starting point we take the typology for responses in the form of questions provided in \cite{lupginz-jlm}. This work develops a wide coverage taxonomy for question/question sequences observable in corpora including the BNC, CHILDES, and BEE, as well as formal modeling of all the postulated classes. Our aim is to extend this work to cover \emph{all} responses to questions. We present the extended typology of responses to questions based on a corpus studies of BNC, BEE, Maptask and CornellMovie with include 506, 262, 467, and 678 question/response pairs respectively. We compare the data for English with data from Polish using the Spokes corpus (694 question/response pairs). We discuss annotation reliability and disagreement analysis. We sketch how each class can be formalized using a dialogical semantics appropriate for dialog...

A corpus-based taxonomy of question responses

In this paper we consider the issue of answering a query with a query. Although these are common, with the exception of Clarification Requests, they have not been studied empirically. After briefly reviewing different theoretical approaches on this subject, we present a corpus study of query responses in the British National Corpus and develop a taxonomy for query responses. We sketch a formal analysis of the response categories in the framework of KoS.

A Taxonomy of Real-Life Questions and Answers in Dialogue

2019

We present a taxonomy of questions and answers based on real-life data extracted from spontaneous dialogue corpora. This classification allowed us to build a fine-grained annotation schema, which we applied to several languages: English, French, Italian and Chinese.

Toward Dialogue Modeling: A Semantic Annotation Scheme for Questions and Answers

Proceedings of the 13th Linguistic Annotation Workshop

The present study proposes an annotation scheme for classifying the content and discourse contribution of question-answer pairs. We propose detailed guidelines for using the scheme and apply them to dialogues in English, Spanish, and Dutch. Finally, we report on initial machine learning experiments for automatic annotation.

A linguistic analysis of question taxonomies

Journal of The American Society for Information Science and Technology, 2005

Recent work in automatic question answering has called for question taxonomies as a critical component of the process of machine understanding of questions. There is a long tradition of classifying questions in library reference services, and digital reference services have a strong need for automation to support scalability. Digital reference and question answering systems have the potential to arrive at a highly fruitful symbiosis. To move towards this goal, an extensive review was conducted of bodies of literature from several fields that deal with questions, to identify question taxonomies that exist in these bodies of literature. In the course of this review, five question taxonomies were identified, at four levels of linguistic analysis.

Generic Question Classification for Dialogue Systems

Machine Learning Techniques and Data Science Trends

We present in this paper a new classification approach for identifying questions during human-machine interactions and more specifically in dialogue systems. The difficulty in this task is first to be domainindependent, reusable whatever the dialogue application and second to be capable of a real time processing, in order to fit with the needs of reactivity in dialogue systems. The task is then different than that of question classification usually addressed in question-answering systems. We propose in this paper a hierarchical classifier in two steps, filtering first question/no-question utterances and second the type of the question. Our method reaches a f-score of 98% for the first step and 97% for the second one, representing the state of the art for this task.

Toward a formalisation of speech-act functions of questions in conversation

2000

In this paper we address the relationship between questions as grammatical and semantic entities and questions as pragmatic entities arguing that a contextually-conditioned associ- ation holds between the former, as interrogative formulae, and the latter, as particular types of speech acts (namely offers and requests). This argument is supported by evidence from a corpus of spontaneous Cypriot Greek exchanges. This

The DialogBank

This paper presents the DialogBank, a new language resource consisting of dialogues with gold standard annotations according to the ISO 24617-2 standard. Some of these dialogues have been taken from existing corpora and have been re-annotated according to the ISO standard; others have been annotated directly according to the standard. The ISO 24617-2 annotations have been designed according to the ISO principles for semantic annotation, as formulated in ISO 24617-6. The DialogBank makes use of three alternative representation formats, which are shown to be interoperable.

Characterizing the Response Space of Questions: a Corpus Study for English and Polish (original) (raw)

Related papers