A framework for sentiment analysis: a data-driven approach

This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2021.

Chi tiết về thư mục
Những tác giả chính: Islam, Md. Jahedul, Sarker, Tonmoy, Shuvo, Md. Shubiour, Hossen, Md. Robin, Ahmedh, Minhaz Uddin
Tác giả khác: Parvez, Mohammad Zavid
Định dạng: Luận văn
Ngôn ngữ:English
Được phát hành: Brac University 2021
Những chủ đề:
Truy cập trực tuyến:http://hdl.handle.net/10361/15156
id 10361-15156
record_format dspace
spelling 10361-151562022-01-26T10:19:57Z A framework for sentiment analysis: a data-driven approach Islam, Md. Jahedul Sarker, Tonmoy Shuvo, Md. Shubiour Hossen, Md. Robin Ahmedh, Minhaz Uddin Parvez, Mohammad Zavid Rahman, Md. Anisur Alam, Md. Golam Rabiul Department of Computer Science and Engineering, Brac University Sentiment Analysis Attribute selection Pattern Extraction Classification Accuracy Application of Machine Learning Classification Machine learning This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2021. Cataloged from PDF version of thesis. Includes bibliographical references (page 28-30). Internet is free and straightforward access to an immense measure of crude content information that can be mined for sentiment analysis. For a long time, this is being used for market research, user opinion mining, recommendation systems, analyze people’s views on a topic, etc. Many different techniques have been developed, yet a lot of complication remains. Selecting and understanding attribute patterns in a text dataset is important to build a good model and know where this model can be used. Different text datasets have different relations between their attributes and classes. For example, let’s take a dataset with totally random English texts labelled as positive or negative. We expect to see that extracted attributes for the positive or negative class are very heavy with general words that we consider positive or negative in everyday English use. However, if the dataset is created on a niche topic, such as an economic, pandemic, etc, we would probably see that positive and negative classes are now heavy with words specific to these topics, or they may not be considered important at all by the classifier. However, we might want to give importance to those niche-specific attributes specifically. In this paper, we take five different datasets of different instance lengths. We use Weka as a tool and go through some attribute selection techniques, do sentence-level sentiment analysis, and finally extract patterns from the datasets to analyze them. There are few related works on these datasets and our technique performed better than the existing works.We have been successful to beat Fuzzy method in terms of accuracy and better extraction of polarity in texts. Our approach have been proven to better work with the datasets than many former methods.In thispaper, we aim to present a method that can easily be fruitful to any dataset for textmining and can have a decent accuracy In this paper, we aim to present a method that can easily be fruitful to any dataset for text mining and can have a decent accuracy. Md. Jahedul Islam Tonmoy Sarker Md. Shubiour Shuvo Md. Robin Hossen Minhaz Uddin Ahmed B. Computer Science 2021-10-06T07:30:22Z 2021-10-06T07:30:22Z 2021 2021-06 Thesis ID: 17101430 ID: 17301052 ID: 17301132 ID: 17301110 ID: 17301087 http://hdl.handle.net/10361/15156 en Brac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. 30 pages application/pdf Brac University
institution Brac University
collection Institutional Repository
language English
topic Sentiment Analysis
Attribute selection
Pattern Extraction
Classification
Accuracy
Application of Machine Learning
Classification
Machine learning
spellingShingle Sentiment Analysis
Attribute selection
Pattern Extraction
Classification
Accuracy
Application of Machine Learning
Classification
Machine learning
Islam, Md. Jahedul
Sarker, Tonmoy
Shuvo, Md. Shubiour
Hossen, Md. Robin
Ahmedh, Minhaz Uddin
A framework for sentiment analysis: a data-driven approach
description This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2021.
author2 Parvez, Mohammad Zavid
author_facet Parvez, Mohammad Zavid
Islam, Md. Jahedul
Sarker, Tonmoy
Shuvo, Md. Shubiour
Hossen, Md. Robin
Ahmedh, Minhaz Uddin
format Thesis
author Islam, Md. Jahedul
Sarker, Tonmoy
Shuvo, Md. Shubiour
Hossen, Md. Robin
Ahmedh, Minhaz Uddin
author_sort Islam, Md. Jahedul
title A framework for sentiment analysis: a data-driven approach
title_short A framework for sentiment analysis: a data-driven approach
title_full A framework for sentiment analysis: a data-driven approach
title_fullStr A framework for sentiment analysis: a data-driven approach
title_full_unstemmed A framework for sentiment analysis: a data-driven approach
title_sort framework for sentiment analysis: a data-driven approach
publisher Brac University
publishDate 2021
url http://hdl.handle.net/10361/15156
work_keys_str_mv AT islammdjahedul aframeworkforsentimentanalysisadatadrivenapproach
AT sarkertonmoy aframeworkforsentimentanalysisadatadrivenapproach
AT shuvomdshubiour aframeworkforsentimentanalysisadatadrivenapproach
AT hossenmdrobin aframeworkforsentimentanalysisadatadrivenapproach
AT ahmedhminhazuddin aframeworkforsentimentanalysisadatadrivenapproach
AT islammdjahedul frameworkforsentimentanalysisadatadrivenapproach
AT sarkertonmoy frameworkforsentimentanalysisadatadrivenapproach
AT shuvomdshubiour frameworkforsentimentanalysisadatadrivenapproach
AT hossenmdrobin frameworkforsentimentanalysisadatadrivenapproach
AT ahmedhminhazuddin frameworkforsentimentanalysisadatadrivenapproach
_version_ 1814309049087295488