Integrating Bangla script recognition support in tesseract OCR

Includes bibliographical references (page 5).

Manylion Llyfryddiaeth
Prif Awduron: Hasnat, Md. Abul, Chowdhury, Muttakinur Rahman, Khan, Mumit
Awduron Eraill: Center for Research on Bangla Language Processing (CRBLP), BRAC University
Fformat: Erthygl
Iaith:English
Cyhoeddwyd: BRAC University 2010
Pynciau:
Mynediad Ar-lein:http://hdl.handle.net/10361/635
id 10361-635
record_format dspace
spelling 10361-6352019-09-29T05:27:38Z Integrating Bangla script recognition support in tesseract OCR Hasnat, Md. Abul Chowdhury, Muttakinur Rahman Khan, Mumit Center for Research on Bangla Language Processing (CRBLP), BRAC University Optical character reader (OCR) Bangla language processing Includes bibliographical references (page 5). Tesseract is considered one of the most accurate free software OCR engines currently available. It was originally developed by Hewlett-Packard from 1985 until 1995, and is currently maintained by Google. At present, Tesseract is capable of only recognizing English, French, Italian, German, Spanish and Dutch. However, it is possible to make Tesseract recognize other scripts if the engine is trained with the requisite data. In this paper, we present a complete methodology to integrate Bangla script recognition support in Tesseract. Md. Abul Hasnat Muttakinur Rahman Chowdhury Mumit Khan 2010-10-25T06:03:34Z 2010-10-25T06:03:34Z 2009 2009 Article http://hdl.handle.net/10361/635 en 5 pages application/pdf BRAC University
institution Brac University
collection Institutional Repository
language English
topic Optical character reader (OCR)
Bangla language processing
spellingShingle Optical character reader (OCR)
Bangla language processing
Hasnat, Md. Abul
Chowdhury, Muttakinur Rahman
Khan, Mumit
Integrating Bangla script recognition support in tesseract OCR
description Includes bibliographical references (page 5).
author2 Center for Research on Bangla Language Processing (CRBLP), BRAC University
author_facet Center for Research on Bangla Language Processing (CRBLP), BRAC University
Hasnat, Md. Abul
Chowdhury, Muttakinur Rahman
Khan, Mumit
format Article
author Hasnat, Md. Abul
Chowdhury, Muttakinur Rahman
Khan, Mumit
author_sort Hasnat, Md. Abul
title Integrating Bangla script recognition support in tesseract OCR
title_short Integrating Bangla script recognition support in tesseract OCR
title_full Integrating Bangla script recognition support in tesseract OCR
title_fullStr Integrating Bangla script recognition support in tesseract OCR
title_full_unstemmed Integrating Bangla script recognition support in tesseract OCR
title_sort integrating bangla script recognition support in tesseract ocr
publisher BRAC University
publishDate 2010
url http://hdl.handle.net/10361/635
work_keys_str_mv AT hasnatmdabul integratingbanglascriptrecognitionsupportintesseractocr
AT chowdhurymuttakinurrahman integratingbanglascriptrecognitionsupportintesseractocr
AT khanmumit integratingbanglascriptrecognitionsupportintesseractocr
_version_ 1814307105365032960