MIss RoBERTa WiLDe: Metaphor Identification Using Masked Language Model with Wiktionary Lexical Definitions (original) (raw)
Related papers
Metaphor Detection Using Contextual Word Embeddings From Transformers
Proceedings of the Second Workshop on Figurative Language Processing, 2020
The detection of metaphors can provide valuable information about a given text and is crucial to sentiment analysis and machine translation. In this paper, we outline the techniques for word-level metaphor detection used in our submission to the Second Shared Task on Metaphor Detection. We propose using both BERT and XLNet language models to create contextualized embeddings and a bidirectional LSTM to identify whether a given word is a metaphor. Our best model achieved F1-scores of 68.0% on VUA AllPOS, 73.0% on VUA Verbs, 66.9% on TOEFL AllPOS, and 69.7% on TOEFL Verbs, placing 7th, 6th, 5th, and 5th respectively. In addition, we outline another potential approach with a KNN-LSTM ensemble model that we did not have enough time to implement given the deadline for the competition. We show that a KNN classifier provides a similar F1-score on a validation set as the LSTM and yields different information on metaphors.
Cracking the Figurative Code: A Survey of Metaphor Detection Techniques
Soft Computing Research Society eBooks, 2022
Metaphor Detection is a crucial area of study in computational linguistics and natural language processing, as it enables the understanding and communication of abstract ideas through the use of concrete imagery. This survey paper aims to provide an overview of the current state-of-the-art approaches that tackle this issue and analyze trends in the domain across the years. The survey recapitulates the existing methodologies for metaphor detection, highlighting their key contributions and limitations. The methods are assigned three broad categories: feature-engineering-based, traditional deep learning-based, and transformerbased approaches. An analysis of the strengths and weaknesses of each category is showcased. Furthermore, the paper explores the annotated corpora that have been developed to facilitate the development and evaluation of metaphor detection models. By providing a comprehensive overview of the work already done and the research gaps present in preexisting literature, this survey paper hopes to help future research endeavors, and thus contribute to the advancement of metaphor detection methodologies.
Construction Artifacts in Metaphor Identification Datasets
2022
Metaphor identification aims at understanding whether a given expression is used figuratively in context. However, in this paper we show how existing metaphor identification datasets can be gamed by fully ignoring the potential metaphorical expression or the context in which it occurs. We test this hypothesis in a variety of datasets and settings, and show that metaphor identification systems based on language models without complete information can be competitive with those using the full context. This is due to the construction procedures to build such datasets, which introduce unwanted biases for positive and negative classes. Finally, we test the same hypothesis on datasets that are carefully sampled from natural corpora and where this bias is not present, making these datasets more challenging and reliable.
2013 19th International Conference on Control Systems and Computer Science, 2013
ABSTRACT Because of the ubiquity of metaphors in language, metaphor processing is a very important task in the field of natural language processing. The first step towards metaphor processing, and probably the most difficult one, is metaphor detection. In the first part of this paper, we review the theoretical background for metaphors and the models and implementations that have been proposed for their detection. We then build corpora for detecting three types of metaphors: IS-A metaphors, metaphors formed with the preposition 'of' and metaphors formed with a verb. For the first two tasks, we train supervised classifiers using semantic features. For the third task, we use features commonly used in text categorization.
SupervisedWord-Level Metaphor Detection: Experiments with Concreteness and Reweighting of Examples
Proceedings of the Third Workshop on Metaphor in NLP (at NAACL 2015), 2015
We present a supervised machine learning system for word-level classification of all content words in a running text as being metaphorical or non-metaphorical. The system provides a substantial improvement upon a previously published baseline, using re-weighting of the training examples and using features derived from a concreteness database. We observe that while the first manipulation was very effective, the second was only slightly so. Possible reasons for these observations are discussed.
Testing the role of metadata in metaphor identification
Proceedings of the Second Workshop on Figurative Language Processing
This paper describes the adaptation and application of a neural network system for the automatic detection of metaphors. The LSTM BiRNN system participated in the shared task of metaphor identification that was part of the Second Workshop of Figurative Language Processing (FigLang2020) held at the Annual Conference of the Association for Computational Linguistics (ACL2020). The particular focus of our approach is on the potential influence that the metadata given in the ETS Corpus of Non-Native Written English might have on the automatic detection of metaphors in this dataset. The article first discusses the annotated ETS learner data, highlighting some of its peculiarities and inherent biases of metaphor use. A series of evaluations follow in order to test whether specific metadata influence the system performance in the task of automatic metaphor identification. The system is available under the APLv2 open-source license.
Linguistic Analysis Improves Neural Metaphor Detection
2019
In the field of metaphor detection, deep learning systems are the ubiquitous and achieve strong performance on many tasks. However, due to the complicated procedures for manually identifying metaphors, the datasets available are relatively small and fraught with complications. We show that using syntactic features and lexical resources can automatically provide additional high-quality training data for metaphoric language, and this data can cover gaps and inconsistencies in metaphor annotation, improving state-of-the-art word-level metaphor identification. This novel application of automatically improving training data improves classification across numerous tasks, and reconfirms the necessity of high-quality data for deep learning frameworks.
Proceedings of the Second Workshop on Figurative Language Processing, 2020
This paper describes the ETS entry to the 2020 Metaphor Detection shared task. Our contribution consists of a sequence of experiments using BERT, starting with a baseline, strengthening it by spell-correcting the TOEFL corpus , followed by a multi-task learning setting , where one of the tasks is the token-level metaphor classification as per the shared task, while the other is meant to provide additional training that we hypothesized to be relevant to the main task. In one case, out-of-domain data manually annotated for metaphor is used for the auxiliary task; in the other case, in-domain data automatically annotated for idioms is used for the auxiliary task. Both multi-task experiments yield promising results.
Automated Identification of Metaphors in Annotated Corpus(Based on Substance Terms)
2021
The automatic or automated metaphor identification remains a challenging problem. The methods proposed so far have been mostly developed for the English language and can be roughly divided into two groups: intended for annotated and non-annotated corpora. In addition, neural networks are used. It should also be noted that the application of recently developed methods for measuring the degree of semantic association of collocation components (T-score, MI, logDice, etc.) fails to detect metaphorical expressions. Previously, we presented a method of automated identification of metaphorical expressions (adjective + noun) for non-annotated corpora of Ukrainian prose texts, based on the analysis of dictionary definitions. This paper describes a method of automated identification of metaphors in the semantically annotated corpus of texts. This algorithm is based on the theoretical propositions and readings of metaphor within the framework of Conceptual Metaphor Theory. The methodology cont...
MetaNet: Deep semantic automatic metaphor analysis
Proceedings of the 3rd Workshop on Metaphor in NLP, NAACL HLT 2015
This paper describes a system that makes use of a repository of formalized frames and metaphors to automatically detect, categorize, and analyze expressions of metaphor in corpora. The output of this system can be used as a basis for making further refinements to the system , as well as supporting deep semantic analysis of metaphor expressions in corpora. This in turn provides a way to ground and test empirical conceptual metaphor theory, as well as serving as a means to gain insights into the ways conceptual metaphors are expressed in language.