Integrating Bangla script recognition support in tesseract OCR
Includes bibliographical references (page 5).
Prif Awduron: | , , |
---|---|
Awduron Eraill: | |
Fformat: | Erthygl |
Iaith: | English |
Cyhoeddwyd: |
BRAC University
2010
|
Pynciau: | |
Mynediad Ar-lein: | http://hdl.handle.net/10361/635 |
id |
10361-635 |
---|---|
record_format |
dspace |
spelling |
10361-6352019-09-29T05:27:38Z Integrating Bangla script recognition support in tesseract OCR Hasnat, Md. Abul Chowdhury, Muttakinur Rahman Khan, Mumit Center for Research on Bangla Language Processing (CRBLP), BRAC University Optical character reader (OCR) Bangla language processing Includes bibliographical references (page 5). Tesseract is considered one of the most accurate free software OCR engines currently available. It was originally developed by Hewlett-Packard from 1985 until 1995, and is currently maintained by Google. At present, Tesseract is capable of only recognizing English, French, Italian, German, Spanish and Dutch. However, it is possible to make Tesseract recognize other scripts if the engine is trained with the requisite data. In this paper, we present a complete methodology to integrate Bangla script recognition support in Tesseract. Md. Abul Hasnat Muttakinur Rahman Chowdhury Mumit Khan 2010-10-25T06:03:34Z 2010-10-25T06:03:34Z 2009 2009 Article http://hdl.handle.net/10361/635 en 5 pages application/pdf BRAC University |
institution |
Brac University |
collection |
Institutional Repository |
language |
English |
topic |
Optical character reader (OCR) Bangla language processing |
spellingShingle |
Optical character reader (OCR) Bangla language processing Hasnat, Md. Abul Chowdhury, Muttakinur Rahman Khan, Mumit Integrating Bangla script recognition support in tesseract OCR |
description |
Includes bibliographical references (page 5). |
author2 |
Center for Research on Bangla Language Processing (CRBLP), BRAC University |
author_facet |
Center for Research on Bangla Language Processing (CRBLP), BRAC University Hasnat, Md. Abul Chowdhury, Muttakinur Rahman Khan, Mumit |
format |
Article |
author |
Hasnat, Md. Abul Chowdhury, Muttakinur Rahman Khan, Mumit |
author_sort |
Hasnat, Md. Abul |
title |
Integrating Bangla script recognition support in tesseract OCR |
title_short |
Integrating Bangla script recognition support in tesseract OCR |
title_full |
Integrating Bangla script recognition support in tesseract OCR |
title_fullStr |
Integrating Bangla script recognition support in tesseract OCR |
title_full_unstemmed |
Integrating Bangla script recognition support in tesseract OCR |
title_sort |
integrating bangla script recognition support in tesseract ocr |
publisher |
BRAC University |
publishDate |
2010 |
url |
http://hdl.handle.net/10361/635 |
work_keys_str_mv |
AT hasnatmdabul integratingbanglascriptrecognitionsupportintesseractocr AT chowdhurymuttakinurrahman integratingbanglascriptrecognitionsupportintesseractocr AT khanmumit integratingbanglascriptrecognitionsupportintesseractocr |
_version_ |
1814307105365032960 |