Syntactic part of speech tagging guidelines for Bangla text

Includes bibliographical references (page 68).

Bibliographic Details
Main Authors: Mahmud, Altaf; Khan, Mumit
Other Authors: Center for Research on Bangla Language Processing (CRBLP), BRAC University
Format: Technical report
Language: English
Published: BRAC University 2010
Subjects: Bangla language processing
Online Access: http://hdl.handle.net/10361/640
Summary: Several techniques have recently been tested for automatically assigning parts of speech to Bangla text, each using a different tagset. There remains a need for a formally published standard tagset for Bangla suited to syntactic bracketing, together with a detailed POS tagging guideline that shows annotators how a word should be tagged in a particular context. This document presents a guideline for annotating Bangla text with parts of speech in support of the syntactic bracketing task. The tagset consists of 55 tags, designed to cover all the required syntactic categories precisely and to encode the syntactic information needed for advanced linguistic analysis of a morphologically rich language with flexible word order. A simple Brill tagger trained on a manually tagged corpus of around 25,000 words achieved an overall accuracy of 70.6%, which is comparable to the minimum standard set by other experimental results using simple supervised learning methods on Bangla text.
Physical Description: 73 pages (application/pdf)
Dates: 2009 (document); 2010-10-27 (repository record)