Comparison of Unigram, Bigram, HMM and Brill's POS tagging approaches for some South Asian languages

Includes bibliographical references (page 6-8).

Bibliografski detalji
Glavni autori: Hasan, Muhammad Fahim, Naushad UzZaman, Khan, Mumit
Daljnji autori: Center for Research on Bangla language Processing (CRBLP), BRAC University
Format: Članak
Jezik:English
Izdano: BRAC University 2010
Teme:
Online pristup:http://hdl.handle.net/10361/330
id 10361-330
record_format dspace
spelling 10361-3302019-09-29T05:27:14Z Comparison of Unigram, Bigram, HMM and Brill's POS tagging approaches for some South Asian languages Hasan, Muhammad Fahim Naushad UzZaman Khan, Mumit Center for Research on Bangla language Processing (CRBLP), BRAC University Part-of-speech tagging Language processing Includes bibliographical references (page 6-8). Part-of-Speech (POS) Tagging is a process that attaches each word in a sentence with a suitable tag from a given set of tags. POS Tagging is important in various areas of Natural Language Processing. Different methods of automating the process have been developed and employed for English and other Western languages. Some similar work, most of which utilize the stochastic approaches for POS Tagging has also been done in the same area for South Asian languages. We experimented with some of the widely-used approaches for POS Tagging on three South Asian languages, Bangla, Hindi and Telegu, using corpora of different sizes. We observed the performance of the approaches and found the Brill’s transformation based tagger’s performance to be superior to the other approaches in all of our experiments, though the use of this approach has been very limited until recently. Fahim Muhammad Hasan Naushad UzZaman Mumit Khan 2010-10-05T05:03:09Z 2010-10-05T05:03:09Z 2007 2007 Article http://hdl.handle.net/10361/330 en 8 pages application/pdf BRAC University
institution Brac University
collection Institutional Repository
language English
topic Part-of-speech tagging
Language processing
spellingShingle Part-of-speech tagging
Language processing
Hasan, Muhammad Fahim
Naushad UzZaman
Khan, Mumit
Comparison of Unigram, Bigram, HMM and Brill's POS tagging approaches for some South Asian languages
description Includes bibliographical references (page 6-8).
author2 Center for Research on Bangla language Processing (CRBLP), BRAC University
author_facet Center for Research on Bangla language Processing (CRBLP), BRAC University
Hasan, Muhammad Fahim
Naushad UzZaman
Khan, Mumit
format Article
author Hasan, Muhammad Fahim
Naushad UzZaman
Khan, Mumit
author_sort Hasan, Muhammad Fahim
title Comparison of Unigram, Bigram, HMM and Brill's POS tagging approaches for some South Asian languages
title_short Comparison of Unigram, Bigram, HMM and Brill's POS tagging approaches for some South Asian languages
title_full Comparison of Unigram, Bigram, HMM and Brill's POS tagging approaches for some South Asian languages
title_fullStr Comparison of Unigram, Bigram, HMM and Brill's POS tagging approaches for some South Asian languages
title_full_unstemmed Comparison of Unigram, Bigram, HMM and Brill's POS tagging approaches for some South Asian languages
title_sort comparison of unigram, bigram, hmm and brill's pos tagging approaches for some south asian languages
publisher BRAC University
publishDate 2010
url http://hdl.handle.net/10361/330
work_keys_str_mv AT hasanmuhammadfahim comparisonofunigrambigramhmmandbrillspostaggingapproachesforsomesouthasianlanguages
AT naushaduzzaman comparisonofunigrambigramhmmandbrillspostaggingapproachesforsomesouthasianlanguages
AT khanmumit comparisonofunigrambigramhmmandbrillspostaggingapproachesforsomesouthasianlanguages
_version_ 1814309321791504384