Identification of fake news using machine learning in distributed system

This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2021.

Detalhes bibliográficos
Main Authors: Saif, Mehruz, Kanon, MD. Kamal Haque, Hasan, Nazmul, Hossen, MD. Shamim, Anannya, Fatema Zohra
Outros Autores: Akhond, Mostafijur Rahman
Formato: Thesis
Idioma:English
Publicado em: Brac University 2021
Assuntos:
Acesso em linha:http://hdl.handle.net/10361/15199
id 10361-15199
record_format dspace
spelling 10361-151992022-01-26T10:21:44Z Identification of fake news using machine learning in distributed system Saif, Mehruz Kanon, MD. Kamal Haque Hasan, Nazmul Hossen, MD. Shamim Anannya, Fatema Zohra Akhond, Mostafijur Rahman Department of Computer Science and Engineering, Brac University PySpark ML RDD(Resilient Distributed Dataset) Random Forest Factorization Machine Classifier Linear SVC Logistic Regression Machine Learning This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2021. Cataloged from PDF version of thesis. Includes bibliographical references (page 41-42). The World Wide Web’s launch and the rapid adoption of social media platforms (such as Facebook and Twitter) paved the way for unparalleled levels of information diffusion in human history. Consumers are creating and sharing more information on social media platforms than ever before, some of it is erroneous, deceptive, or has no influence on reality. Access to news information has become considerably simpler and more comfortable thanks to the Internet and social media. Online users may often follow events of interest, and the widespread usage of mobile devices makes this process easier. However, with great potential comes enormous responsibility. There are also a number of websites dedicated nearly entirely to the dissemination of fake news. Since it’s a serious issue with a large-scale dataset, identification of fake news is very vital in this era, as social media and online newspapers are in large numbers in the web arena. That’s why it is easy to spread rumors and create chaos. Also, the size of data sets is increasing day by day. Data is expanding at a quicker rate than processing rates. As a result, algorithms that need a huge quantity of data and processing are frequently conducted on a distributed computing system that separates multiple nodes on several machines which have concurrency of components and lack of a global clock. Also, nobody has used a distributed system to detect fake news before. In our paper, we tried to run 4 PySpark algorithms based on SPARK-Context which provides massive storage for big data processing and analysis and also has been found to be 100 times quicker in-memory, while disk performance was shown to be 10 times quicker on several devices at the same time. So that we can control and real-time monitoring over the news and data before it goes viral in the media. Mehruz Saif MD. Kamal Haque Kanon Nazmul Hasan MD. Shamim Hossen Fatema Zohra Anannya B. Computer Science 2021-10-10T09:28:29Z 2021-10-10T09:28:29Z 2021 2021-06 Thesis ID 19101665 ID 19201139 ID 19301277 ID 15301101 ID 17101176 http://hdl.handle.net/10361/15199 en Brac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. 42 pages application/pdf Brac University
institution Brac University
collection Institutional Repository
language English
topic PySpark ML
RDD(Resilient Distributed Dataset)
Random Forest
Factorization Machine Classifier
Linear SVC
Logistic Regression
Machine Learning
spellingShingle PySpark ML
RDD(Resilient Distributed Dataset)
Random Forest
Factorization Machine Classifier
Linear SVC
Logistic Regression
Machine Learning
Saif, Mehruz
Kanon, MD. Kamal Haque
Hasan, Nazmul
Hossen, MD. Shamim
Anannya, Fatema Zohra
Identification of fake news using machine learning in distributed system
description This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2021.
author2 Akhond, Mostafijur Rahman
author_facet Akhond, Mostafijur Rahman
Saif, Mehruz
Kanon, MD. Kamal Haque
Hasan, Nazmul
Hossen, MD. Shamim
Anannya, Fatema Zohra
format Thesis
author Saif, Mehruz
Kanon, MD. Kamal Haque
Hasan, Nazmul
Hossen, MD. Shamim
Anannya, Fatema Zohra
author_sort Saif, Mehruz
title Identification of fake news using machine learning in distributed system
title_short Identification of fake news using machine learning in distributed system
title_full Identification of fake news using machine learning in distributed system
title_fullStr Identification of fake news using machine learning in distributed system
title_full_unstemmed Identification of fake news using machine learning in distributed system
title_sort identification of fake news using machine learning in distributed system
publisher Brac University
publishDate 2021
url http://hdl.handle.net/10361/15199
work_keys_str_mv AT saifmehruz identificationoffakenewsusingmachinelearningindistributedsystem
AT kanonmdkamalhaque identificationoffakenewsusingmachinelearningindistributedsystem
AT hasannazmul identificationoffakenewsusingmachinelearningindistributedsystem
AT hossenmdshamim identificationoffakenewsusingmachinelearningindistributedsystem
AT anannyafatemazohra identificationoffakenewsusingmachinelearningindistributedsystem
_version_ 1814309363764953088