Analysis of N-Gram based text categorization for Bangla in a newspaper corpus

This thesis report is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2006.

מידע ביבליוגרפי
מחבר ראשי: Mansur, Munirul
מחברים אחרים: Khan, Mumit
פורמט: Thesis
יצא לאור: BRAC University 2010
נושאים:
גישה מקוונת:http://hdl.handle.net/10361/61
id 10361-61
record_format dspace
spelling 10361-612022-01-26T10:23:16Z Analysis of N-Gram based text categorization for Bangla in a newspaper corpus Mansur, Munirul Khan, Mumit Department of Computer Science and Engineering, BRAC University Computer science and engineering This thesis report is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2006. Cataloged from PDF version of thesis report. Includes bibliographical references (page 30). The goal of any classification is to build a set of models that can correctly predict the class of different objects. Text categorization is one such application and can be used in many classification task, e.g. news categorization, language identification, authorship attribution, text genre categorization, recommendation systems etc. In this paper we analyze the performance of n-gram based text categorization for Bangla in a Bangladeshi newspaper, Prothom-Alo corpus. Munirul Mansur B. Computer Science and Engineering 2010-09-06T06:29:41Z 2010-09-06T06:29:41Z 2006 2006-08 Thesis ID 02101043 http://hdl.handle.net/10361/61 BRAC University thesis are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. application/pdf BRAC University
institution Brac University
collection Institutional Repository
topic Computer science and engineering
spellingShingle Computer science and engineering
Mansur, Munirul
Analysis of N-Gram based text categorization for Bangla in a newspaper corpus
description This thesis report is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2006.
author2 Khan, Mumit
author_facet Khan, Mumit
Mansur, Munirul
format Thesis
author Mansur, Munirul
author_sort Mansur, Munirul
title Analysis of N-Gram based text categorization for Bangla in a newspaper corpus
title_short Analysis of N-Gram based text categorization for Bangla in a newspaper corpus
title_full Analysis of N-Gram based text categorization for Bangla in a newspaper corpus
title_fullStr Analysis of N-Gram based text categorization for Bangla in a newspaper corpus
title_full_unstemmed Analysis of N-Gram based text categorization for Bangla in a newspaper corpus
title_sort analysis of n-gram based text categorization for bangla in a newspaper corpus
publisher BRAC University
publishDate 2010
url http://hdl.handle.net/10361/61
work_keys_str_mv AT mansurmunirul analysisofngrambasedtextcategorizationforbanglainanewspapercorpus
_version_ 1814309837246300160