Segmentation of Bangla compound characters: underlying simple character detection from handwritten compound characters

This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2023.

Bibliographic Details
Main Authors: Bhuiyan, Md Raihanul Islam; Efaz, Mahin Shahriar; Reza, Tanjim; Ria, Aditi Saha
Other Authors: Hossain, Muhammad Iqbal
Format: Thesis
Language: English
Published: Brac University 2024
Subjects: Bangla text recognition; Compound characters; BanglaBorno; VGG16 architecture; Machine learning
Online Access: http://hdl.handle.net/10361/23540
id 10361-23540
record_format dspace
spelling 10361-23540 2024-06-24T21:04:06Z
Segmentation of Bangla compound characters: underlying simple character detection from handwritten compound characters
Bhuiyan, Md Raihanul Islam; Efaz, Mahin Shahriar; Reza, Tanjim; Ria, Aditi Saha
Hossain, Muhammad Iqbal; Reza, Md. Tanzim
Department of Computer Science and Engineering, Brac University
Bangla text recognition; Compound characters; BanglaBorno; VGG16 architecture; Machine learning
This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2023.
Cataloged from PDF version of thesis.
Includes bibliographical references (pages 39-41).
Bangla is one of the most widely spoken languages in the world; more than 210 million people use it as their first or second language. Bangla literature has a rich history that dates back thousands of years. However, some Bangla characters have a compound structure: more than one simple character combines to form a single compound character. There is a lot of work on character recognition, but the structure of compound characters makes detecting them a difficult task. The existing method for Bangla compound characters uses a list of compound characters as the dataset, trains models on whole images, and detects the characters. When this method is applied to handwritten characters, accuracy decreases if the characters differ slightly from the training images or consist of two simple characters in a combination that does not appear in the training images. To overcome this problem, our research focuses on detecting the character type, i.e. simple or compound, using the VGG16 architecture and YOLO; if the character is compound, the underlying simple characters inside it are then detected.
To conduct our research, we created a new Bengali handwritten character dataset called “BanglaBorno”, as the existing datasets had limitations in the quantity of compound characters or the quality of the images.
Md Raihanul Islam Bhuiyan; Mahin Shahriar Efaz; Tanjim Reza; Aditi Saha Ria
B.Sc in Computer Science
2024-06-24T04:54:41Z 2024-06-24T04:54:41Z
©2023
2023-09
Thesis
ID 23341083; ID 23341084; ID 20101065; ID 23341085
http://hdl.handle.net/10361/23540
en
Brac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission.
53 pages
application/pdf
Brac University
institution Brac University
collection Institutional Repository
language English
topic Bangla text recognition
Compound characters
BanglaBorno
VGG16 architecture
Machine learning
description This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2023.
author2 Hossain, Muhammad Iqbal
format Thesis
author Bhuiyan, Md Raihanul Islam
Efaz, Mahin Shahriar
Reza, Tanjim
Ria, Aditi Saha
title Segmentation of Bangla compound characters: underlying simple character detection from handwritten compound characters
publisher Brac University
publishDate 2024
url http://hdl.handle.net/10361/23540