Comparison of Unigram, Bigram, HMM and Brill's POS tagging approaches for some South Asian languages
Includes bibliographical references (pages 6-8).
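Main authors: | Hasan, Muhammad Fahim; Naushad UzZaman; Khan, Mumit |
---|---|
Other authors: | Center for Research on Bangla language Processing (CRBLP), BRAC University |
Format: | Article |
Language: | English |
Published: | BRAC University 2010 |
Subjects: | Part-of-speech tagging; Language processing |
Online access: | http://hdl.handle.net/10361/330 |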
id |
10361-330 |
---|---|
record_format |
dspace |
spelling |
10361-330 2019-09-29T05:27:14Z Comparison of Unigram, Bigram, HMM and Brill's POS tagging approaches for some South Asian languages Hasan, Muhammad Fahim Naushad UzZaman Khan, Mumit Center for Research on Bangla language Processing (CRBLP), BRAC University Part-of-speech tagging Language processing Includes bibliographical references (pages 6-8). Part-of-Speech (POS) tagging is the process of assigning each word in a sentence a suitable tag from a given tag set. POS tagging is important in various areas of Natural Language Processing. Different methods of automating the process have been developed and employed for English and other Western languages. Some similar work, most of which uses stochastic approaches to POS tagging, has also been done for South Asian languages. We experimented with some of the widely used approaches to POS tagging on three South Asian languages, Bangla, Hindi and Telugu, using corpora of different sizes. We observed the performance of the approaches and found the performance of Brill’s transformation-based tagger to be superior to that of the other approaches in all of our experiments, though the use of this approach has been very limited until recently. Fahim Muhammad Hasan Naushad UzZaman Mumit Khan 2010-10-05T05:03:09Z 2010-10-05T05:03:09Z 2007 2007 Article http://hdl.handle.net/10361/330 en 8 pages application/pdf BRAC University |
institution |
BRAC University |
collection |
Institutional Repository |
language |
English |
topic |
Part-of-speech tagging Language processing |
spellingShingle |
Part-of-speech tagging Language processing Hasan, Muhammad Fahim Naushad UzZaman Khan, Mumit Comparison of Unigram, Bigram, HMM and Brill's POS tagging approaches for some South Asian languages |
description |
Includes bibliographical references (pages 6-8). |
author2 |
Center for Research on Bangla language Processing (CRBLP), BRAC University |
author_facet |
Center for Research on Bangla language Processing (CRBLP), BRAC University Hasan, Muhammad Fahim Naushad UzZaman Khan, Mumit |
format |
Article |
author |
Hasan, Muhammad Fahim Naushad UzZaman Khan, Mumit |
author_sort |
Hasan, Muhammad Fahim |
title |
Comparison of Unigram, Bigram, HMM and Brill's POS tagging approaches for some South Asian languages |
title_short |
Comparison of Unigram, Bigram, HMM and Brill's POS tagging approaches for some South Asian languages |
title_full |
Comparison of Unigram, Bigram, HMM and Brill's POS tagging approaches for some South Asian languages |
title_fullStr |
Comparison of Unigram, Bigram, HMM and Brill's POS tagging approaches for some South Asian languages |
title_full_unstemmed |
Comparison of Unigram, Bigram, HMM and Brill's POS tagging approaches for some South Asian languages |
title_sort |
comparison of unigram, bigram, hmm and brill's pos tagging approaches for some south asian languages |
publisher |
BRAC University |
publishDate |
2010 |
url |
http://hdl.handle.net/10361/330 |
work_keys_str_mv |
AT hasanmuhammadfahim comparisonofunigrambigramhmmandbrillspostaggingapproachesforsomesouthasianlanguages AT naushaduzzaman comparisonofunigrambigramhmmandbrillspostaggingapproachesforsomesouthasianlanguages AT khanmumit comparisonofunigrambigramhmmandbrillspostaggingapproachesforsomesouthasianlanguages |
_version_ |
1814309321791504384 |