Important keywords extraction from documents using semantic analysis
This thesis report is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2017.
Auteurs principaux: | , |
---|---|
Autres auteurs: | |
Format: | Thèse |
Langue: | English |
Publié: |
2017
|
Sujets: | |
Accès en ligne: | http://hdl.handle.net/10361/8245 |
id |
10361-8245 |
---|---|
record_format |
dspace |
spelling |
10361-82452022-01-26T10:08:21Z Important keywords extraction from documents using semantic analysis Hasan, H. M. Mahedi Sanyal, Falguni Ali, Md. Haider Chaki, Dipankar Department of Computer Science and Engineering, BRAC University Natural Language Processing (NLP) Semantic analysis TextBlob POS-tagging N-grams Keyword This thesis report is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2017. Cataloged from PDF version of thesis report. Includes bibliographical references (page 50 - 51). Keyword extraction is an automatic selection of terms which describes the content of a document. Keywords define the terms that represent the core information from the documents. In order to go through massive amount of documents to find out the relevant information, keyword extraction will be the key approach. This approach will help us to understand the depth of a document even before we read it. In this research, we have found out different approaches and algorithms that have been used in keyword extraction technique. Conditional random fields (CRF), Support vector machine (SVM), NP-chunk, N-grams, Multiple linear regression, Logistic regression, and semantic analysis has been used to find out important keywords from a document. Immense research shows us that SVM and CRF gives better results where CRF accuracy is greater than SVM based on F1 score (The balance between precision and recall). According to precision, SVM shows better result than CRF. But, in case of recall, logit shows the greater result. Semantic relation between words is also another key feature in keyword extraction techniques. Semantic analysis is very effective field in natural language processing and using semantic relation, it is possible to find out the relation between words as well as between the lines. In this thesis paper, we have used semantic analysis and processing the documents to find out the important keywords from documents. B. Computer Science and Engineering 2017-06-15T05:16:02Z 2017-06-15T05:16:02Z 2017 2017-04 Thesis ID 13101270 ID 13301058 http://hdl.handle.net/10361/8245 en BRAC University thesis are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. 51 pages application/pdf |
institution |
Brac University |
collection |
Institutional Repository |
language |
English |
topic |
Natural Language Processing (NLP) Semantic analysis TextBlob POS-tagging N-grams Keyword |
spellingShingle |
Natural Language Processing (NLP) Semantic analysis TextBlob POS-tagging N-grams Keyword Hasan, H. M. Mahedi Sanyal, Falguni Important keywords extraction from documents using semantic analysis |
description |
This thesis report is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2017. |
author2 |
Ali, Md. Haider |
author_facet |
Ali, Md. Haider Hasan, H. M. Mahedi Sanyal, Falguni |
format |
Thesis |
author |
Hasan, H. M. Mahedi Sanyal, Falguni |
author_sort |
Hasan, H. M. Mahedi |
title |
Important keywords extraction from documents using semantic analysis |
title_short |
Important keywords extraction from documents using semantic analysis |
title_full |
Important keywords extraction from documents using semantic analysis |
title_fullStr |
Important keywords extraction from documents using semantic analysis |
title_full_unstemmed |
Important keywords extraction from documents using semantic analysis |
title_sort |
important keywords extraction from documents using semantic analysis |
publishDate |
2017 |
url |
http://hdl.handle.net/10361/8245 |
work_keys_str_mv |
AT hasanhmmahedi importantkeywordsextractionfromdocumentsusingsemanticanalysis AT sanyalfalguni importantkeywordsextractionfromdocumentsusingsemanticanalysis |
_version_ |
1814307468112560128 |