Automatic text summarization using fuzzy c–means clustering
This thesis is submitted in partial fulfilment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2018.
Hoofdauteurs: | , , |
---|---|
Andere auteurs: | |
Formaat: | Thesis |
Taal: | English |
Gepubliceerd in: |
BRAC University
2018
|
Online toegang: | http://hdl.handle.net/10361/10154 |
id |
10361-10154 |
---|---|
record_format |
dspace |
spelling |
10361-101542022-01-26T10:23:17Z Automatic text summarization using fuzzy c–means clustering Rahman, A. M. Muntasir Saleheen, Nasif Noor Anam, Shakil Ashraful Arif, Hossain Department of Computer Science and Engineering, BRAC University This thesis is submitted in partial fulfilment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2018. Cataloged from PDF version of thesis. Includes bibliographical references. Automatic text summarization process has been significantly explored throughout the years to cope with the staggering increase of virtual data. Text summarization process is commonly divided into two areas-Extractive and Abstractive. Abstractive summarization processes generate unique sentences that are different from the sentences in original document keeping the same theme, whereas Extractive summarization processes largely depend on sentence extraction techniques- implementing graph models or sentence-based models. In this paper, a sentence-based model has been proposed where the sentence ranking procedure adopts fuzzy C-Means (FCM) clustering, an unsupervised classification method, for sentence extraction purpose. The sentence scoring task relies on five key features, including Topic Sentence which is the first novelty of the proposed model. Furthermore, C-Means clustering is a soft-computing technique that is usually used for pattern recognition tasks but can be improved significantly by hard clustering the membership of the elements which has not been regarded in similar processes in any of the previous works, adding to the novelty of the presented model. Standard summary evaluation techniques have been used to gauge the precision, recall and f-measure of the proposed FCM model and have been compared with different summarizers from different perspectives. Summarizers having different dataset and approaches such as, bushy path, GSM, baseline, TextRank have been compared to the proposed model using ROUGE method. The outcome shows that the FCM model surpasses the previous approaches by a significant margin. A. M. Muntasir Rahman Nasif Noor Saleheen Shakil Ashraful Anam B. Computer Science and Engineering 2018-05-15T08:22:35Z 2018-05-15T08:22:35Z 2018 2018-04 Thesis ID 14101139 ID 14301003 ID 14301088 http://hdl.handle.net/10361/10154 en BRAC University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. 31 pages application/pdf BRAC University |
institution |
Brac University |
collection |
Institutional Repository |
language |
English |
description |
This thesis is submitted in partial fulfilment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2018. |
author2 |
Arif, Hossain |
author_facet |
Arif, Hossain Rahman, A. M. Muntasir Saleheen, Nasif Noor Anam, Shakil Ashraful |
format |
Thesis |
author |
Rahman, A. M. Muntasir Saleheen, Nasif Noor Anam, Shakil Ashraful |
spellingShingle |
Rahman, A. M. Muntasir Saleheen, Nasif Noor Anam, Shakil Ashraful Automatic text summarization using fuzzy c–means clustering |
author_sort |
Rahman, A. M. Muntasir |
title |
Automatic text summarization using fuzzy c–means clustering |
title_short |
Automatic text summarization using fuzzy c–means clustering |
title_full |
Automatic text summarization using fuzzy c–means clustering |
title_fullStr |
Automatic text summarization using fuzzy c–means clustering |
title_full_unstemmed |
Automatic text summarization using fuzzy c–means clustering |
title_sort |
automatic text summarization using fuzzy c–means clustering |
publisher |
BRAC University |
publishDate |
2018 |
url |
http://hdl.handle.net/10361/10154 |
work_keys_str_mv |
AT rahmanammuntasir automatictextsummarizationusingfuzzycmeansclustering AT saleheennasifnoor automatictextsummarizationusingfuzzycmeansclustering AT anamshakilashraful automatictextsummarizationusingfuzzycmeansclustering |
_version_ |
1814309765633802240 |