Research report on Bengla OCR training and testing methods

Includes bibliographical references (page 6-7).

Détails bibliographiques
Auteur principal: Hasnat, Md. Abul
Autres auteurs: Center for Research on Bangla Language Processing (CRBLP), BRAC University
Format: Technical report
Langue:English
Publié: BRAC University 2010
Sujets:
Accès en ligne:http://hdl.handle.net/10361/657
id 10361-657
record_format dspace
spelling 10361-6572019-09-29T05:39:21Z Research report on Bengla OCR training and testing methods Hasnat, Md. Abul Center for Research on Bangla Language Processing (CRBLP), BRAC University Bangla language processing Bangla OCR Includes bibliographical references (page 6-7). In this paper we present the training and recognition mechanism of a Hidden Markov Model (HMM) based multi-font Optical Character Recognition (OCR) system for Bengali character. In our approach, the central idea is to separate the HMM model for each segmented character or word. The system uses HTK toolkit for data preparation, model training and recognition. The Features of each trained character are calculated by applying the Discrete Cosine Transform (DCT) to each pixel value of the character image where the image is divided into several frames according to its size. The extracted features of each frame are used as discrete probability distributions which will be given as input parameters to each HMM model. In the case of recognition, a model for each separated character or word is built up using the same approach. This model is given to the HTK toolkit to perform the recognition using the Viterbi Decoding method. The experimental results show significant performance over models using neural network based training and recognition systems. Md. Abul Hasnat 2010-10-28T04:03:53Z 2010-10-28T04:03:53Z 2007 2007 Technical report http://hdl.handle.net/10361/657 en 7 pages application/pdf BRAC University
institution Brac University
collection Institutional Repository
language English
topic Bangla language processing
Bangla OCR
spellingShingle Bangla language processing
Bangla OCR
Hasnat, Md. Abul
Research report on Bengla OCR training and testing methods
description Includes bibliographical references (page 6-7).
author2 Center for Research on Bangla Language Processing (CRBLP), BRAC University
author_facet Center for Research on Bangla Language Processing (CRBLP), BRAC University
Hasnat, Md. Abul
format Technical report
author Hasnat, Md. Abul
author_sort Hasnat, Md. Abul
title Research report on Bengla OCR training and testing methods
title_short Research report on Bengla OCR training and testing methods
title_full Research report on Bengla OCR training and testing methods
title_fullStr Research report on Bengla OCR training and testing methods
title_full_unstemmed Research report on Bengla OCR training and testing methods
title_sort research report on bengla ocr training and testing methods
publisher BRAC University
publishDate 2010
url http://hdl.handle.net/10361/657
work_keys_str_mv AT hasnatmdabul researchreportonbenglaocrtrainingandtestingmethods
_version_ 1814307848377597952