Munirul Mansur - Academia.edu
Papers by Munirul Mansur
In this paper, we study the outcome of using an n-gram based algorithm for Bangla text categorization. To analyze the efficiency of this methodology, we used a one-year Prothom-Alo news corpus. Our results show that n-grams of length 2 or 3 are the most useful for categorization. Using n-gram lengths greater than 3 reduces categorization performance.
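The abstract does not say which n-gram algorithm was used, so the following is only a minimal sketch of one common approach: character n-gram frequency profiles compared with a rank-based "out-of-place" distance. The choice of that distance, the function names (`char_ngrams`, `build_profile`, `out_of_place`, `categorize`), the `top_k=300` cutoff, and the toy training strings are all assumptions made for illustration and are not taken from the paper.

```python
from collections import Counter


def char_ngrams(text, n):
    """Return all overlapping character n-grams of the text."""
    return [text[i:i + n] for i in range(len(text) - n + 1)]


def build_profile(texts, n, top_k=300):
    """Map the top_k most frequent n-grams to their rank (0 = most frequent).

    top_k is an illustrative cutoff, not a value from the paper.
    """
    counts = Counter()
    for t in texts:
        counts.update(char_ngrams(t, n))
    # Sort by descending count, then alphabetically so ties are deterministic.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))[:top_k]
    return {gram: rank for rank, (gram, _) in enumerate(ranked)}


def out_of_place(doc_profile, cat_profile, max_penalty):
    """Sum of rank differences; n-grams missing from the category cost max_penalty."""
    total = 0
    for gram, rank in doc_profile.items():
        if gram in cat_profile:
            total += abs(rank - cat_profile[gram])
        else:
            total += max_penalty
    return total


def categorize(text, category_profiles, n, top_k=300):
    """Return the category whose profile is closest to the document's profile."""
    doc_profile = build_profile([text], n, top_k)
    return min(
        category_profiles,
        key=lambda c: out_of_place(doc_profile, category_profiles[c], top_k),
    )


if __name__ == "__main__":
    # Hypothetical toy data; a real experiment would use labeled news articles.
    training = {
        "sports": ["the team won the match", "the player scored a goal"],
        "business": ["the bank raised interest rates", "share prices fell"],
    }
    n = 3  # the abstract reports lengths 2 or 3 as most useful
    profiles = {cat: build_profile(docs, n) for cat, docs in training.items()}
    print(categorize("the striker scored in the match", profiles, n))
```

Because Python strings are sequences of Unicode code points, the same functions accept Bangla text without modification, although a careful implementation might first normalize the text (for example with `unicodedata.normalize`) so that visually identical characters produce identical n-grams. Varying `n` in such a setup is how one would reproduce the kind of comparison the abstract describes.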