Important keywords extraction from documents using semantic analysis

This thesis report is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2017.

Détails bibliographiques
Auteurs principaux: Hasan, H. M. Mahedi, Sanyal, Falguni
Autres auteurs: Ali, Md. Haider
Format: Thèse
Langue:English
Publié: 2017
Sujets:
Accès en ligne:http://hdl.handle.net/10361/8245
id 10361-8245
record_format dspace
spelling 10361-82452022-01-26T10:08:21Z Important keywords extraction from documents using semantic analysis Hasan, H. M. Mahedi Sanyal, Falguni Ali, Md. Haider Chaki, Dipankar Department of Computer Science and Engineering, BRAC University Natural Language Processing (NLP) Semantic analysis TextBlob POS-tagging N-grams Keyword This thesis report is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2017. Cataloged from PDF version of thesis report. Includes bibliographical references (page 50 - 51). Keyword extraction is an automatic selection of terms which describes the content of a document. Keywords define the terms that represent the core information from the documents. In order to go through massive amount of documents to find out the relevant information, keyword extraction will be the key approach. This approach will help us to understand the depth of a document even before we read it. In this research, we have found out different approaches and algorithms that have been used in keyword extraction technique. Conditional random fields (CRF), Support vector machine (SVM), NP-chunk, N-grams, Multiple linear regression, Logistic regression, and semantic analysis has been used to find out important keywords from a document. Immense research shows us that SVM and CRF gives better results where CRF accuracy is greater than SVM based on F1 score (The balance between precision and recall). According to precision, SVM shows better result than CRF. But, in case of recall, logit shows the greater result. Semantic relation between words is also another key feature in keyword extraction techniques. Semantic analysis is very effective field in natural language processing and using semantic relation, it is possible to find out the relation between words as well as between the lines. In this thesis paper, we have used semantic analysis and processing the documents to find out the important keywords from documents. B. Computer Science and Engineering 2017-06-15T05:16:02Z 2017-06-15T05:16:02Z 2017 2017-04 Thesis ID 13101270 ID 13301058 http://hdl.handle.net/10361/8245 en BRAC University thesis are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. 51 pages application/pdf
institution Brac University
collection Institutional Repository
language English
topic Natural Language Processing (NLP)
Semantic analysis
TextBlob
POS-tagging
N-grams
Keyword
spellingShingle Natural Language Processing (NLP)
Semantic analysis
TextBlob
POS-tagging
N-grams
Keyword
Hasan, H. M. Mahedi
Sanyal, Falguni
Important keywords extraction from documents using semantic analysis
description This thesis report is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2017.
author2 Ali, Md. Haider
author_facet Ali, Md. Haider
Hasan, H. M. Mahedi
Sanyal, Falguni
format Thesis
author Hasan, H. M. Mahedi
Sanyal, Falguni
author_sort Hasan, H. M. Mahedi
title Important keywords extraction from documents using semantic analysis
title_short Important keywords extraction from documents using semantic analysis
title_full Important keywords extraction from documents using semantic analysis
title_fullStr Important keywords extraction from documents using semantic analysis
title_full_unstemmed Important keywords extraction from documents using semantic analysis
title_sort important keywords extraction from documents using semantic analysis
publishDate 2017
url http://hdl.handle.net/10361/8245
work_keys_str_mv AT hasanhmmahedi importantkeywordsextractionfromdocumentsusingsemanticanalysis
AT sanyalfalguni importantkeywordsextractionfromdocumentsusingsemanticanalysis
_version_ 1814307468112560128