Analysis of N-Gram based text categorization for Bangla in a newspaper

Includes bibliographical references (page 7).

Podrobná bibliografie
Hlavní autoři: Mansur, Munirul, UzZaman, Naushad, Khan, Mumit
Další autoři: Center for Research on Bangla Language Processing (CRBLP), BRAC University
Médium: Článek
Jazyk:English
Vydáno: BRAC University 2010
On-line přístup:http://hdl.handle.net/10361/623
id 10361-623
record_format dspace
spelling 10361-6232019-09-29T05:27:34Z Analysis of N-Gram based text categorization for Bangla in a newspaper Mansur, Munirul UzZaman, Naushad Khan, Mumit Center for Research on Bangla Language Processing (CRBLP), BRAC University Includes bibliographical references (page 7). In this paper, we study the outcome of using ngram based algorithm for Bangla text categorization. To analyze the efficiency of this methodology we used one year Prothom-Alo news corpus. Our results show that n-grams of length 2 or 3 are the most useful for categorization. Using gram lengths more than 3reduces the performance of categorization. Munirul Mansur Naushad UzZaman Mumit Khan 2010-10-21T09:14:58Z 2010-10-21T09:14:58Z 2006 2006 Article http://hdl.handle.net/10361/623 en 7 pages application/pdf BRAC University
institution Brac University
collection Institutional Repository
language English
description Includes bibliographical references (page 7).
author2 Center for Research on Bangla Language Processing (CRBLP), BRAC University
author_facet Center for Research on Bangla Language Processing (CRBLP), BRAC University
Mansur, Munirul
UzZaman, Naushad
Khan, Mumit
format Article
author Mansur, Munirul
UzZaman, Naushad
Khan, Mumit
spellingShingle Mansur, Munirul
UzZaman, Naushad
Khan, Mumit
Analysis of N-Gram based text categorization for Bangla in a newspaper
author_sort Mansur, Munirul
title Analysis of N-Gram based text categorization for Bangla in a newspaper
title_short Analysis of N-Gram based text categorization for Bangla in a newspaper
title_full Analysis of N-Gram based text categorization for Bangla in a newspaper
title_fullStr Analysis of N-Gram based text categorization for Bangla in a newspaper
title_full_unstemmed Analysis of N-Gram based text categorization for Bangla in a newspaper
title_sort analysis of n-gram based text categorization for bangla in a newspaper
publisher BRAC University
publishDate 2010
url http://hdl.handle.net/10361/623
work_keys_str_mv AT mansurmunirul analysisofngrambasedtextcategorizationforbanglainanewspaper
AT uzzamannaushad analysisofngrambasedtextcategorizationforbanglainanewspaper
AT khanmumit analysisofngrambasedtextcategorizationforbanglainanewspaper
_version_ 1814306969501040640