Bangla optical character recognition from printed text using Tesseract Engine

Cataloged from PDF version of thesis.

Detalles Bibliográficos
Main Authors:	Faruque, MD Yamin, Adeeb, MD Zahin, Kamal, Muhammad Maswood, Ahmed, Redwan
Outros autores:	Arif, Hossain
Formato:	Thesis
Idioma:	en_US
Publicado:	Brac University 2021
Subjects:	Optical Character Recognition Bangla Language Bangla OCR Tesseract RNN LSTM Open CV Otsu’s Thresholding Algorithm Python jTessEditorFX Image Processing Custom Dictionary
Acceso en liña:	http://hdl.handle.net/10361/15541

id	10361-15541
record_format	dspace
spelling	10361-155412022-01-26T10:15:58Z Bangla optical character recognition from printed text using Tesseract Engine Faruque, MD Yamin Adeeb, MD Zahin Kamal, Muhammad Maswood Ahmed, Redwan Arif, Hossain Department of Computer Science and Engineering, Brac University Optical Character Recognition Bangla Language Bangla OCR Tesseract RNN LSTM Open CV Otsu’s Thresholding Algorithm Python jTessEditorFX Image Processing Custom Dictionary Cataloged from PDF version of thesis. Includes bibliographical references (pages 44-45). This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2021. Optical Character Recognition or OCR is a technology that enables us to detect and extract text from images. In our project, we are designing our OCR system around the Bangla language. This is primarily because, there are many models of text recognition of the English language in the market but there are very few on Bangla. Ourproposedsystemcomprisesofacquiringtheinputimage,pre-processing it,passingitintotheTesseractOCRengine(thebackboneofoursystem)andfinally getting digital output of the text. We have used the latest version of Tesseract, that is, version 5 and even though this is in its alpha stage, it is still stable for endusers. Next, to improve accuracy, we have focused on pre-processing the image as thoroughly as possible and laid out our chosen algorithm in each step. For example, forbinarization,wehaveusedOtsu’sThresholdingalgorithmasthisgaveusthebest results. For segmentation, we have used the Fully Automatic Page Segmentation from Tesseracts own repertoire of segmentation modes. Then we have done our training through Tesseract’s new LSTM engine and improved upon their existing trainedfilewithourfonts. Wehaveselectedthesefontsbasedontheirpopularityof use. Wecalculatedouraccuracyatthewordlevelandourmodelgaveusanaverage accuracy of 95.9% on multiple fonts and on multiple real life scenarios. At best case scenariowehaveevenmanagedtosecure100%accuracy. Finally, wehavediscussed future improvements like the addition of a custom dictionary in our model and how it would increase the overall accuracy in all cases. MD Yamin Faruque MD Zahin Adeeb Muhammad Maswood Kamal Redwan Ahmed B. Computer Science 2021-10-26T05:02:39Z 2021-10-26T05:02:39Z 2021 2021-01 Thesis ID: 16201059 ID: 19241013 ID: 16201038 ID: 16241005 http://hdl.handle.net/10361/15541 en_US Brac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. 45 Pages application/pdf Brac University
institution	Brac University
collection	Institutional Repository
language	en_US
topic	Optical Character Recognition Bangla Language Bangla OCR Tesseract RNN LSTM Open CV Otsu’s Thresholding Algorithm Python jTessEditorFX Image Processing Custom Dictionary
spellingShingle	Optical Character Recognition Bangla Language Bangla OCR Tesseract RNN LSTM Open CV Otsu’s Thresholding Algorithm Python jTessEditorFX Image Processing Custom Dictionary Faruque, MD Yamin Adeeb, MD Zahin Kamal, Muhammad Maswood Ahmed, Redwan Bangla optical character recognition from printed text using Tesseract Engine
description	Cataloged from PDF version of thesis.
author2	Arif, Hossain
author_facet	Arif, Hossain Faruque, MD Yamin Adeeb, MD Zahin Kamal, Muhammad Maswood Ahmed, Redwan
format	Thesis
author	Faruque, MD Yamin Adeeb, MD Zahin Kamal, Muhammad Maswood Ahmed, Redwan
author_sort	Faruque, MD Yamin
title	Bangla optical character recognition from printed text using Tesseract Engine
title_short	Bangla optical character recognition from printed text using Tesseract Engine
title_full	Bangla optical character recognition from printed text using Tesseract Engine
title_fullStr	Bangla optical character recognition from printed text using Tesseract Engine
title_full_unstemmed	Bangla optical character recognition from printed text using Tesseract Engine
title_sort	bangla optical character recognition from printed text using tesseract engine
publisher	Brac University
publishDate	2021
url	http://hdl.handle.net/10361/15541
work_keys_str_mv	AT faruquemdyamin banglaopticalcharacterrecognitionfromprintedtextusingtesseractengine AT adeebmdzahin banglaopticalcharacterrecognitionfromprintedtextusingtesseractengine AT kamalmuhammadmaswood banglaopticalcharacterrecognitionfromprintedtextusingtesseractengine AT ahmedredwan banglaopticalcharacterrecognitionfromprintedtextusingtesseractengine
_version_	1814308551744552960

Bangla optical character recognition from printed text using Tesseract Engine

Títulos similares