Analysis of N-Gram based text categorization for Bangla in a newspaper
Includes bibliographical references (page 7).
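Main authors: | Mansur, Munirul; UzZaman, Naushad; Khan, Mumit |
---|---|
Other authors: | Center for Research on Bangla Language Processing (CRBLP), BRAC University |
Format: | Article |
Language: | English |
Published: | BRAC University, 2010 |
Online access: | http://hdl.handle.net/10361/623 |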
id |
10361-623 |
---|---|
record_format |
dspace |
spelling |
10361-623 2019-09-29T05:27:34Z Analysis of N-Gram based text categorization for Bangla in a newspaper Mansur, Munirul UzZaman, Naushad Khan, Mumit Center for Research on Bangla Language Processing (CRBLP), BRAC University Includes bibliographical references (page 7). In this paper, we study the outcome of using an n-gram based algorithm for Bangla text categorization. To analyze the efficiency of this methodology, we used a one-year Prothom-Alo news corpus. Our results show that n-grams of length 2 or 3 are the most useful for categorization. Using gram lengths of more than 3 reduces the performance of categorization. Munirul Mansur Naushad UzZaman Mumit Khan 2010-10-21T09:14:58Z 2010-10-21T09:14:58Z 2006 2006 Article http://hdl.handle.net/10361/623 en 7 pages application/pdf BRAC University |
institution |
BRAC University |
collection |
Institutional Repository |
language |
English |
description |
Includes bibliographical references (page 7). |
author2 |
Center for Research on Bangla Language Processing (CRBLP), BRAC University |
author_facet |
Center for Research on Bangla Language Processing (CRBLP), BRAC University; Mansur, Munirul; UzZaman, Naushad; Khan, Mumit |
format |
Article |
author |
Mansur, Munirul; UzZaman, Naushad; Khan, Mumit |
author_sort |
Mansur, Munirul |
title |
Analysis of N-Gram based text categorization for Bangla in a newspaper |
publisher |
BRAC University |
publishDate |
2010 |
url |
http://hdl.handle.net/10361/623 |