Is Graph Structure Necessary for Multi-hop Question Answering?
Abstract: Recently, modeling text as a graph structure and applying graph neural networks to it has become a trend in many NLP research areas. In this paper, we investigate whether graph structure is necessary for multi-hop question answering. Our analysis is centered on HotpotQA. We construct a strong baseline model to establish that, with the proper use of pre-trained models, graph structure may not be necessary for multi-hop question answering. We point out that both the graph structure and the adjacency matrix are task-related prior knowledge, and that graph-attention can be considered a special case of self-attention. Experiments and visualized analysis demonstrate that graph-attention or the entire graph structure can be replaced by self-attention or Transformers.
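The claim that graph-attention is a special case of self-attention can be illustrated with a small sketch. The code below is not the paper's model: it assumes plain scaled dot-product scores with no learned projections, and the names `masked_attention`, `full`, and `chain` are hypothetical, chosen for illustration only. It treats the adjacency matrix as a mask on the attention scores, so that a fully connected adjacency matrix yields ordinary self-attention and a sparser one restricts each node to its neighbours.

```python
import numpy as np


def softmax(x, axis=-1):
    # Numerically stable softmax; entries equal to -inf receive weight 0.
    x = x - np.max(x, axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def masked_attention(h, adj=None):
    """Scaled dot-product attention over features h of shape (n, d).

    adj is an optional (n, n) 0/1 matrix: position i may attend to
    position j only where adj[i, j] == 1. Assumption: every row of adj
    contains at least one 1 (for example a self-loop), otherwise the
    softmax for that row is undefined.
    """
    n, d = h.shape
    scores = h @ h.T / np.sqrt(d)
    if adj is not None:
        # The adjacency matrix acts as prior knowledge about which
        # pairs are allowed to interact.
        scores = np.where(adj > 0, scores, -np.inf)
    weights = softmax(scores, axis=-1)
    return weights @ h, weights


rng = np.random.default_rng(0)
h = rng.normal(size=(4, 8))

# Fully connected graph: the mask removes nothing.
full = np.ones((4, 4))
# Chain graph with self-loops: each node sees itself and its neighbours.
chain = np.eye(4) + np.eye(4, k=1) + np.eye(4, k=-1)

out_self, w_self = masked_attention(h)          # unmasked self-attention
out_full, w_full = masked_attention(h, full)    # graph attention, full graph
out_chain, w_chain = masked_attention(h, chain) # graph attention, sparse graph
```

Under these assumptions, `out_full` matches `out_self`, while `w_chain` is exactly zero wherever `chain` has no edge. The sparse case differs from self-attention only in the mask, which is the sense in which the adjacency matrix encodes task-related prior knowledge.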
Submission history
From: Yiming Cui
[v1] Tue, 7 Apr 2020 02:59:42 UTC (187 KB)
[v2] Thu, 29 Oct 2020 09:29:19 UTC (468 KB)