Data Set For Sentiment Analysis On Bengali News Comments And Its Baseline Evaluation (original) (raw)

2019 International Conference on Bangla Speech and Language Processing (ICBSLP), 2019

Abstract

The biggest challenge of Bengali language processing is creating a strong data set to do research on. The main focus of this paper is to introduce an authentic and credible data set and this dataset is open for all to be used for educational purposes1 for Bengali sentiment analysis where the data was extracted from a well known online news portal’s user comments. Here comments on various news were scraped, and for detecting the true sentiments of the sentences, five labels of sentiments were used. An online crowd sourcing platform was used for data annotation. To ensure the credibility and validity of the data set, every entry of the data set was tagged three times. Three models of text classification were used for baseline evaluation to check the validity of the data set. This data set might be of valuable help for future works and researches on Bengali sentiment analysis.

Summit Haque hasn't uploaded this paper.

Let Summit know you want this paper to be uploaded.

Ask for this paper to be uploaded.