Analysis of N-Gram based text categorization for Bangla in a newspaper corpus
This thesis report is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2006.
Main Author: | Mansur, Munirul |
---|---|
Other Authors: | Khan, Mumit |
Format: | Thesis |
Published: | BRAC University 2010 |
Subjects: | Computer science and engineering |
Online Access: | http://hdl.handle.net/10361/61 |
id |
10361-61 |
---|---|
record_format |
dspace |
spelling |
10361-61 2022-01-26T10:23:16Z Analysis of N-Gram based text categorization for Bangla in a newspaper corpus Mansur, Munirul; Khan, Mumit Department of Computer Science and Engineering, BRAC University Computer science and engineering This thesis report is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2006. Cataloged from PDF version of thesis report. Includes bibliographical references (page 30). The goal of any classification is to build a set of models that can correctly predict the class of different objects. Text categorization is one such application and can be used in many classification tasks, e.g. news categorization, language identification, authorship attribution, text genre categorization, and recommendation systems. In this paper we analyze the performance of n-gram based text categorization for Bangla on a corpus from the Bangladeshi newspaper Prothom-Alo. Munirul Mansur B. Computer Science and Engineering 2010-09-06T06:29:41Z 2010-09-06T06:29:41Z 2006 2006-08 Thesis ID 02101043 http://hdl.handle.net/10361/61 BRAC University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. application/pdf BRAC University |
institution |
Brac University |
collection |
Institutional Repository |
topic |
Computer science and engineering |
author2 |
Khan, Mumit |
author_facet |
Khan, Mumit; Mansur, Munirul |
format |
Thesis |
author |
Mansur, Munirul |
author_sort |
Mansur, Munirul |
title |
Analysis of N-Gram based text categorization for Bangla in a newspaper corpus |
publisher |
BRAC University |
publishDate |
2010 |
url |
http://hdl.handle.net/10361/61 |