Automatic text summarization using fuzzy c–means clustering

This thesis is submitted in partial fulfilment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2018.

Bibliografische gegevens
Hoofdauteurs: Rahman, A. M. Muntasir, Saleheen, Nasif Noor, Anam, Shakil Ashraful
Andere auteurs: Arif, Hossain
Formaat: Thesis
Taal:English
Gepubliceerd in: BRAC University 2018
Online toegang:http://hdl.handle.net/10361/10154
id 10361-10154
record_format dspace
spelling 10361-101542022-01-26T10:23:17Z Automatic text summarization using fuzzy c–means clustering Rahman, A. M. Muntasir Saleheen, Nasif Noor Anam, Shakil Ashraful Arif, Hossain Department of Computer Science and Engineering, BRAC University This thesis is submitted in partial fulfilment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2018. Cataloged from PDF version of thesis. Includes bibliographical references. Automatic text summarization process has been significantly explored throughout the years to cope with the staggering increase of virtual data. Text summarization process is commonly divided into two areas-Extractive and Abstractive. Abstractive summarization processes generate unique sentences that are different from the sentences in original document keeping the same theme, whereas Extractive summarization processes largely depend on sentence extraction techniques- implementing graph models or sentence-based models. In this paper, a sentence-based model has been proposed where the sentence ranking procedure adopts fuzzy C-Means (FCM) clustering, an unsupervised classification method, for sentence extraction purpose. The sentence scoring task relies on five key features, including Topic Sentence which is the first novelty of the proposed model. Furthermore, C-Means clustering is a soft-computing technique that is usually used for pattern recognition tasks but can be improved significantly by hard clustering the membership of the elements which has not been regarded in similar processes in any of the previous works, adding to the novelty of the presented model. Standard summary evaluation techniques have been used to gauge the precision, recall and f-measure of the proposed FCM model and have been compared with different summarizers from different perspectives. Summarizers having different dataset and approaches such as, bushy path, GSM, baseline, TextRank have been compared to the proposed model using ROUGE method. The outcome shows that the FCM model surpasses the previous approaches by a significant margin. A. M. Muntasir Rahman Nasif Noor Saleheen Shakil Ashraful Anam B. Computer Science and Engineering 2018-05-15T08:22:35Z 2018-05-15T08:22:35Z 2018 2018-04 Thesis ID 14101139 ID 14301003 ID 14301088 http://hdl.handle.net/10361/10154 en BRAC University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. 31 pages application/pdf BRAC University
institution Brac University
collection Institutional Repository
language English
description This thesis is submitted in partial fulfilment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2018.
author2 Arif, Hossain
author_facet Arif, Hossain
Rahman, A. M. Muntasir
Saleheen, Nasif Noor
Anam, Shakil Ashraful
format Thesis
author Rahman, A. M. Muntasir
Saleheen, Nasif Noor
Anam, Shakil Ashraful
spellingShingle Rahman, A. M. Muntasir
Saleheen, Nasif Noor
Anam, Shakil Ashraful
Automatic text summarization using fuzzy c–means clustering
author_sort Rahman, A. M. Muntasir
title Automatic text summarization using fuzzy c–means clustering
title_short Automatic text summarization using fuzzy c–means clustering
title_full Automatic text summarization using fuzzy c–means clustering
title_fullStr Automatic text summarization using fuzzy c–means clustering
title_full_unstemmed Automatic text summarization using fuzzy c–means clustering
title_sort automatic text summarization using fuzzy c–means clustering
publisher BRAC University
publishDate 2018
url http://hdl.handle.net/10361/10154
work_keys_str_mv AT rahmanammuntasir automatictextsummarizationusingfuzzycmeansclustering
AT saleheennasifnoor automatictextsummarizationusingfuzzycmeansclustering
AT anamshakilashraful automatictextsummarizationusingfuzzycmeansclustering
_version_ 1814309765633802240