Cyberbullying Detection using Machine Learning from Social Media comments in Bangla Language

This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2022.

Bibliographic Details
Main Authors: Tuhin, Saikat Halder, Islam, MD Touhidul, Islam, MD. Tauhidul
Other Authors: Rahman, Mr. Tanvir
Format: Thesis
Language: en_US
Published: Brac University 2023
Subjects:
Online Access: http://hdl.handle.net/10361/17732
id 10361-17732
record_format dspace
spelling 10361-177322023-01-16T21:01:45Z Cyberbullying Detection using Machine Learning from Social Media comments in Bangla Language Tuhin, Saikat Halder Islam, MD Touhidul Islam, MD. Tauhidul Rahman, Mr. Tanvir Bin Ahsraf, Mr. Faisal Department of Computer Science and Engineering, Brac University Cyberbullying Social Media Suicide Bangla Language Word Embedding Machine Learning Random Forest Bullying Cyberbullying. Social media--Moral and ethical aspects. This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2022. Cataloged from PDF version of thesis. Includes bibliographical references (pages 42-44). Cyberbullying, defined as bullying perpetrated through the use of information and communication technology, is a serious problem nowadays. With the invention of social networks, friendships, relationships, and social communication through different social media have all reached a new level with new definitions. In fact, people become friends with others they have never met face to face. With such a huge number of users on the internet, cyberbullying has become a widespread global phenomenon. It not only lowers a person's mental well-being but has also become one of the most important reasons for suicide. With Bangla being the seventh most spoken language in the world and the use of online platforms increasing, Bangla-speaking people badly need an effective cyberbullying detection method to handle this issue. In this thesis paper, we explore the spread of cyberbullying influence through the pairwise interactions between users. For cyberbullying through language, we will collect users’ unique comments from social media and check them with the help of psychological references. After that, those comments will be categorized using word embedding, a tool used here to categorize text, so that the dataset will be shortened and ready for classification.
Lastly, the dataset will be fed to a machine learning classifier named Random Forest to detect the cyberbullying comments. The performance and accuracy of numerous frequently used machine learning approaches on Bangla text are investigated in this study. In addition, the influence of user-specific information, such as location, age, gender, number of likes, number of comments, and so on, is examined for the identification of Bangla cyberbullying. According to the experimental data, Random Forest is the most effective algorithm for Bangla cyberbullying identification when only posts or comments are used, with 95.78% accuracy. Therefore, Random Forest is used to apply the approach to social media, since it performs better. Saikat Halder Tuhin MD Touhidul Islam MD. Tauhidul Islam B. Computer Science 2023-01-16T08:31:51Z 2023-01-16T08:31:51Z 2022 2022-05 Thesis ID: 18301063 ID: 18301106 ID: 19101276 http://hdl.handle.net/10361/17732 en_US Brac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. 44 Pages application/pdf Brac University
institution Brac University
collection Institutional Repository
language en_US
topic Cyberbullying
Social Media
Suicide
Bangla Language
Word Embedding
Machine Learning
Random Forest
Bullying
Cyberbullying.
Social media--Moral and ethical aspects.
spellingShingle Cyberbullying
Social Media
Suicide
Bangla Language
Word Embedding
Machine Learning
Random Forest
Bullying
Cyberbullying.
Social media--Moral and ethical aspects.
Tuhin, Saikat Halder
Islam, MD Touhidul
Islam, MD. Tauhidul
Cyberbullying Detection using Machine Learning from Social Media comments in Bangla Language
description This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2022.
author2 Rahman, Mr. Tanvir
author_facet Rahman, Mr. Tanvir
Tuhin, Saikat Halder
Islam, MD Touhidul
Islam, MD. Tauhidul
format Thesis
author Tuhin, Saikat Halder
Islam, MD Touhidul
Islam, MD. Tauhidul
author_sort Tuhin, Saikat Halder
title Cyberbullying Detection using Machine Learning from Social Media comments in Bangla Language
title_short Cyberbullying Detection using Machine Learning from Social Media comments in Bangla Language
title_full Cyberbullying Detection using Machine Learning from Social Media comments in Bangla Language
title_fullStr Cyberbullying Detection using Machine Learning from Social Media comments in Bangla Language
title_full_unstemmed Cyberbullying Detection using Machine Learning from Social Media comments in Bangla Language
title_sort cyberbullying detection using machine learning from social media comments in bangla language
publisher Brac University
publishDate 2023
url http://hdl.handle.net/10361/17732
work_keys_str_mv AT tuhinsaikathalder cyberbullyingdetectionusingmachinelearningfromsocialmediacommentsinbanglalanguage
AT islammdtouhidul cyberbullyingdetectionusingmachinelearningfromsocialmediacommentsinbanglalanguage
AT islammdtauhidul cyberbullyingdetectionusingmachinelearningfromsocialmediacommentsinbanglalanguage
_version_ 1814307000678350848