Detecting sarcasm in Bengali comments using NLP
This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2023.
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis |
Language: | English |
Published: |
Brac University
2024
|
Subjects: | |
Online Access: | http://hdl.handle.net/10361/22082 |
id |
10361-22082 |
---|---|
record_format |
dspace |
spelling |
10361-220822024-01-09T21:02:58Z Detecting sarcasm in Bengali comments using NLP Chowdhury, Md. Jamiur Rahman Shakil, Mr. Arif Department of Computer Science and Engineering, Brac University Natural language processing Sentiment analysis Machine learning Support vector machines Word2vec Bert Computational linguistics. Natural language processing (Computer science) This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2023. Cataloged from PDF version of thesis. Includes bibliographical references (pages 31-33). Natural Language Processing (NLP) is a subset of Machine Learning which resides at the intersection of Linguistics and Computer Science. It deals with the capability of computers to learn and work with human languages. With the emergence of social media platforms, modern-day communication is being digitalized more than ever. To keep up with this rapid flow of development, the advancement of automated text processing and artificial language interpretation has become necessary. These concerns have given birth to a domain called Sentiment Analysis where blocks of text are processed to extract prominent sentiments that are prevalent within them. These sentiments can be happiness, sadness, anger, disgust, etc. Over the past few years, similar studies have garnered the attention of a vast number of computer scientists and linguists but as the study progresses and expands in the form of lan guages, concentrations, and contexts more and more challenges have started to show up. One of these challenges is the interpretation of figurative language. Figurative language refers to the structure of speech where the actual meaning defers from the literal meaning. The best example of this is Sarcasm which is a sort of figu rative language used with an intention of mockery or humor. Detecting sarcasm is considered to be one of the most challenging tasks in the domain of NLP due to the figurative structure and creative nature of sarcastic texts and the lack of relevant data on the internet. Determining sarcasm can often be difficult for even human beings as one has to have a strong understanding of the context to detect sarcasm. However, many studies have achieved respectable results by following the context unaware unimodal methods using classical Machine Learning, Deep and Hy brid Neural Networks. Motivated by such research, the objective of this paper is to take a step toward detecting sarcasm in the Bengali Language domain using Sup port Vector Machine (SVM), Cogniinsight(Word2Vec), and Bidirectional Encoder Representations from Transformers (BERT) on a novel dataset. To the best of my knowledge, this will be the first-ever initiative taken toward detecting sarcasm in Bengali Language using BERT. Md. Jamiur Rahman Chowdhury B.Sc. in Computer Science and Engineering 2024-01-09T05:26:55Z 2024-01-09T05:26:55Z 2023 2023-01 Thesis ID: 18101448 http://hdl.handle.net/10361/22082 en Brac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. 33 pages application/pdf Brac University |
institution |
Brac University |
collection |
Institutional Repository |
language |
English |
topic |
Natural language processing Sentiment analysis Machine learning Support vector machines Word2vec Bert Computational linguistics. Natural language processing (Computer science) |
spellingShingle |
Natural language processing Sentiment analysis Machine learning Support vector machines Word2vec Bert Computational linguistics. Natural language processing (Computer science) Chowdhury, Md. Jamiur Rahman Detecting sarcasm in Bengali comments using NLP |
description |
This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2023. |
author2 |
Shakil, Mr. Arif |
author_facet |
Shakil, Mr. Arif Chowdhury, Md. Jamiur Rahman |
format |
Thesis |
author |
Chowdhury, Md. Jamiur Rahman |
author_sort |
Chowdhury, Md. Jamiur Rahman |
title |
Detecting sarcasm in Bengali comments using NLP |
title_short |
Detecting sarcasm in Bengali comments using NLP |
title_full |
Detecting sarcasm in Bengali comments using NLP |
title_fullStr |
Detecting sarcasm in Bengali comments using NLP |
title_full_unstemmed |
Detecting sarcasm in Bengali comments using NLP |
title_sort |
detecting sarcasm in bengali comments using nlp |
publisher |
Brac University |
publishDate |
2024 |
url |
http://hdl.handle.net/10361/22082 |
work_keys_str_mv |
AT chowdhurymdjamiurrahman detectingsarcasminbengalicommentsusingnlp |
_version_ |
1814307951609905152 |