Farjana Sultana Mim - Academia.edu (original) (raw)

Uploads

Papers by Farjana Sultana Mim

Cornell University - arXiv, Oct 26, 2021

Most of the existing work that focus on the identification of implicit knowledge in arguments gen... more Most of the existing work that focus on the identification of implicit knowledge in arguments generally represent implicit knowledge in the form of commonsense or factual knowledge. However, such knowledge is not sufficient to understand the implicit reasoning link between individual argumentative components (i.e., claim and premise). In this work, we focus on identifying the implicit knowledge in the form of argumentation knowledge which can help in understanding the reasoning link in arguments. Being inspired by the Argument from Consequences scheme, we propose a semi-structured template to represent such argumentation knowledge that explicates the implicit reasoning in arguments via causality. We create a novel two-phase annotation process with simplified guidelines and show how to collect and filter high quality implicit reasonings via crowdsourcing. We find substantial inter-annotator agreement for quality evaluation between experts, but find evidence that casts a few questions on the feasibility of collecting high quality semistructured implicit reasoning through our crowdsourcing process. We release our materials (i.e., crowdsourcing guidelines and collected implicit reasonings) to facilitate further research towards the structured representation of argumentation knowledge.

IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2021

Existing approaches for automated essay scoring and document representation learning typically re... more Existing approaches for automated essay scoring and document representation learning typically rely on discourse parsers to incorporate discourse structure into text representation. However, the performance of parsers is not always adequate, especially when they are used on noisy texts, such as student essays. In this paper, we propose an unsupervised pre-training approach to capture discourse structure of essays in terms of coherence and cohesion that does not require any discourse parser or annotation. We introduce several types of token, sentence and paragraph-level corruption techniques for our proposed pre-training approach and augment masked language modeling pre-training with our pre-training method to leverage both contextualized and discourse information. Our proposed unsupervised approach achieves a new state-of-the-art result on the task of essay Organization scoring.

Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, 2019

Existing document embedding approaches mainly focus on capturing sequences of words in documents.... more Existing document embedding approaches mainly focus on capturing sequences of words in documents. However, some document classification and regression tasks such as essay scoring need to consider discourse structure of documents. Although some prior approaches consider this issue and utilize discourse structure of text for document classification, these approaches are dependent on computationally expensive parsers. In this paper, we propose an unsupervised approach to capture discourse structure in terms of coherence and cohesion for document embedding that does not require any expensive parser or annotation. Extrinsic evaluation results show that the document representation obtained from our approach improves the performance of essay Organization scoring and Argument Strength scoring.

Scientia Horticulturae, 2018

Maturity is the most important factor to determine the storage-life and quality of fruits like ma... more Maturity is the most important factor to determine the storage-life and quality of fruits like mangoes. Fruit maturity can be recognized by different attributes and among them skin color is the most significant criteria for judging maturity. Typically, human experts visually detect the fruit color to identify the maturity stages which is very prone to error. In this paper, a method of digital image processing has been proposed to classify mangoes into six maturity stages according to the United States department of agriculture (USDA) standard classification. The experimentation considers sample images of more than 100 mangoes of different stages. A total of 24 image features are extracted and then correlation based and information gain based evaluation has been performed in order to select the most informative feature sets. Categorization is done using the decision tree which provides up to 96% classification accuracy.

Cornell University - arXiv, Apr 4, 2022

In argumentative discourse, persuasion is often achieved by refuting or attacking others argument... more In argumentative discourse, persuasion is often achieved by refuting or attacking others arguments. Attacking is not always straightforward and often comprise complex rhetorical moves such that arguers might agree with a logic of an argument while attacking another logic. Moreover, arguer might neither deny nor agree with any logics of an argument, instead ignore them and attack the main stance of the argument by providing new logics and presupposing that the new logics have more value or importance than the logics present in the attacked argument. However, no existing studies in the computational argumentation capture such complex rhetorical moves in attacks or the presuppositions or value judgements in them. In order to address this gap, we introduce LPAttack, a novel annotation scheme that captures the common modes and complex rhetorical moves in attacks along with the implicit presuppositions and value judgements in them. Our annotation study shows moderate inter-annotator agreement, indicating that human annotation for the proposed scheme is feasible. We publicly release our annotated corpus and the annotation guidelines.

Proceedings of the 8th Workshop on Argument Mining, 2021

Annotation of implicit reasoning (i.e., warrant) in arguments is a critical resource to train mod... more Annotation of implicit reasoning (i.e., warrant) in arguments is a critical resource to train models in gaining deeper understanding and correct interpretation of arguments. However, warrants are usually annotated in unstructured form, having no restriction on their lexical structure which sometimes makes it difficult to interpret how warrants relate to any of the information given in claim and premise. Moreover, assessing and determining better warrants from the large variety of reasoning patterns of unstructured warrants becomes a formidable task. Therefore, in order to annotate warrants in a more interpretative and restrictive way, we propose two methodologies to annotate warrants in a semi-structured form. To the best of our knowledge, we are the first to show how such semi-structured warrants can be annotated on a large scale via crowdsourcing. We demonstrate through extensive quality evaluation that our methodologies enable collecting better quality warrants in comparison to unstructured annotations. To further facilitate research towards the task of explicating warrants in arguments, we release our materials publicly (i.e., crowdsourcing guidelines and collected warrants).

Providing feedback on the argumentation of learner is essential for development of critical think... more Providing feedback on the argumentation of learner is essential for development of critical thinking skills, but it takes a lot of time and effort. To reduce the burden on teachers, we aim to automate a process of giving feedback, especially giving diagnostic comments which point out the weaknesses inherent in the argumentation. It is advisable to give specific diagnostic comments so that learners can recognize the diagnosis without misunderstanding. However, it is not obvious how the task of providing specific diagnostic comments should be formulated. We present a formulation of the task as template selection and slot filling to make an automatic evaluation easier and the behavior of the model more tractable. The key to the formulation is the possibility of creating a template set that is sufficient for practical use. In this paper, we define three criteria that a template set should satisfy: expressiveness, informativeness, and uniqueness, and verify the feasibility to create a te...

Cornell University - arXiv, Oct 26, 2021

IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2021

Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, 2019

Scientia Horticulturae, 2018

Cornell University - arXiv, Apr 4, 2022

Proceedings of the 8th Workshop on Argument Mining, 2021