Cyberbullying Detection using Machine Learning from Social Media comments in Bangla Language
This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2022.
Main Authors: | Tuhin, Saikat Halder; Islam, MD Touhidul; Islam, MD. Tauhidul |
---|---|
Other Authors: | Rahman, Mr. Tanvir |
Format: | Thesis |
Language: | en_US |
Published: | Brac University 2023 |
Subjects: | |
Online Access: | http://hdl.handle.net/10361/17732 |
id |
10361-17732 |
---|---|
record_format |
dspace |
spelling |
10361-17732 2023-01-16T21:01:45Z Cyberbullying Detection using Machine Learning from Social Media comments in Bangla Language Tuhin, Saikat Halder Islam, MD Touhidul Islam, MD. Tauhidul Rahman, Mr. Tanvir Bin Ahsraf, Mr.Faisal Department of Computer Science and Engineering, Brac University Cyberbullying Social Media Suicide Bangla Language Word Embedding Machine Learning Random Forest Bullying Cyberbullying. Social media--Moral and ethical aspects. This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2022. Cataloged from PDF version of thesis. Includes bibliographical references (pages 42-44). Cyberbullying, defined as bullying perpetrated through information and communication technology, is a serious problem today. With the advent of social networks, friendships, relationships, and social communication have all moved to a new level and taken on new definitions. People now become friends with others they have never met face to face. With such a large number of users on the internet, cyberbullying has become a widespread global phenomenon. It not only harms a person's mental health but has also become one of the most significant reasons for suicide. Because Bangla is the seventh most spoken language in the world and its speakers' use of online platforms is growing, Bangla-speaking people badly need an effective cyberbullying detection method to handle this issue. In this thesis, we explore the spread of cyberbullying influence through pairwise interactions between users. For cyberbullying through language, we collect users' unique comments from social media and check them with the help of psychological references. Those comments are then categorized using word embedding, used here as a tool to categorize text, so that the dataset is shortened and ready for classification.
Lastly, the dataset is passed to a machine learning classifier, Random Forest, to detect cyberbullying comments. This study investigates the performance and accuracy of several frequently used machine learning approaches on Bangla text. In addition, it examines the influence of user-specific information, such as location, age, gender, number of likes, and number of comments, on the identification of Bangla cyberbullying. According to the experimental data, Random Forest is the most effective algorithm for Bangla cyberbullying identification when only posts or comments are used, with 95.78% accuracy. Random Forest is therefore used to apply the approach to social media. Saikat Halder Tuhin MD Touhidul Islam MD. Tauhidul Islam B. Computer Science 2023-01-16T08:31:51Z 2023-01-16T08:31:51Z 2022 2022-05 Thesis ID: 18301063 ID: 18301106 ID: 19101276 http://hdl.handle.net/10361/17732 en_US Brac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. 44 Pages application/pdf Brac University |
institution |
Brac University |
collection |
Institutional Repository |
language |
en_US |
topic |
Cyberbullying Social Media Suicide Bangla Language Word Embedding Machine Learning Random Forest Bullying Cyberbullying. Social media--Moral and ethical aspects. |
spellingShingle |
Cyberbullying Social Media Suicide Bangla Language Word Embedding Machine Learning Random Forest Bullying Cyberbullying. Social media--Moral and ethical aspects. Tuhin, Saikat Halder Islam, MD Touhidul Islam, MD. Tauhidul Cyberbullying Detection using Machine Learning from Social Media comments in Bangla Language |
description |
This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2022. |
author2 |
Rahman, Mr. Tanvir |
author_facet |
Rahman, Mr. Tanvir Tuhin, Saikat Halder Islam, MD Touhidul Islam, MD. Tauhidul |
format |
Thesis |
author |
Tuhin, Saikat Halder Islam, MD Touhidul Islam, MD. Tauhidul |
author_sort |
Tuhin, Saikat Halder |
title |
Cyberbullying Detection using Machine Learning from Social Media comments in Bangla Language |
title_short |
Cyberbullying Detection using Machine Learning from Social Media comments in Bangla Language |
title_full |
Cyberbullying Detection using Machine Learning from Social Media comments in Bangla Language |
title_fullStr |
Cyberbullying Detection using Machine Learning from Social Media comments in Bangla Language |
title_full_unstemmed |
Cyberbullying Detection using Machine Learning from Social Media comments in Bangla Language |
title_sort |
cyberbullying detection using machine learning from social media comments in bangla language |
publisher |
Brac University |
publishDate |
2023 |
url |
http://hdl.handle.net/10361/17732 |
work_keys_str_mv |
AT tuhinsaikathalder cyberbullyingdetectionusingmachinelearningfromsocialmediacommentsinbanglalanguage AT islammdtouhidul cyberbullyingdetectionusingmachinelearningfromsocialmediacommentsinbanglalanguage AT islammdtauhidul cyberbullyingdetectionusingmachinelearningfromsocialmediacommentsinbanglalanguage |
_version_ |
1814307000678350848 |
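
The abstract in this record describes representing comments with word embeddings before handing them to a Random Forest classifier. As a rough illustration of the feature step only, the sketch below averages per-word vectors into one fixed-length vector per comment. The record does not say how the thesis combines word vectors, so averaging is an assumption, and the romanized tokens, the 3-dimensional toy values in `EMBEDDINGS`, and the function name `comment_vector` are all hypothetical.

```python
from typing import Dict, List

# Toy embedding table. Tokens are romanized Bangla and the values are
# made up for illustration; a real table would come from a trained model.
EMBEDDINGS: Dict[str, List[float]] = {
    "tumi": [0.0, 1.0, 0.0],     # "you"
    "bhalo": [1.0, 0.0, 0.0],    # "good"
    "kharap": [-1.0, 0.0, 2.0],  # "bad"
}
DIM = 3


def comment_vector(comment: str) -> List[float]:
    """Average the vectors of known tokens; unknown tokens are skipped."""
    vectors = [EMBEDDINGS[tok] for tok in comment.split() if tok in EMBEDDINGS]
    if not vectors:
        # No known token: fall back to an all-zero vector.
        return [0.0] * DIM
    return [sum(vec[i] for vec in vectors) / len(vectors) for i in range(DIM)]


# One fixed-length feature row per comment, ready for a classifier.
features = [comment_vector(c) for c in ["tumi bhalo", "tumi kharap"]]
```

Each row of `features` has the same length regardless of comment length, which is what a tree-based classifier such as Random Forest needs as input. The classification step itself is not shown here.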