Efficient network traffic management and intelligent decision-making through machine learning and DNS log analysis

This thesis is submitted in partial fulfillment of the requirements for the degree of Master of Science in Computer Science and Engineering, 2023.

Detalles Bibliográficos
Autor principal: Hossain, Syed Abed
Otros Autores: Rahman, Mohammad Zahidur
Formato: Tesis
Lenguaje:English
Publicado: Brac University 2024
Materias:
Acceso en línea:http://hdl.handle.net/10361/23002
id 10361-23002
record_format dspace
spelling 10361-230022024-06-03T05:52:47Z Efficient network traffic management and intelligent decision-making through machine learning and DNS log analysis Hossain, Syed Abed Rahman, Mohammad Zahidur Department of Computer Science and Engineering, Brac University DNS traffic management Machine learning Virtual machine User behavior analysis Computer communication systems Machine learning Human-computer interaction. This thesis is submitted in partial fulfillment of the requirements for the degree of Master of Science in Computer Science and Engineering, 2023. Cataloged from PDF version of thesis. Includes bibliographical references (pages 81-85). This research presents a comprehensive approach to network traffic management and analysis by leveraging DNS log analysis, machine learning techniques, and Software-Defined Networking (SDN) integration. In an office environment, a DNS server was set up to collect DNS logs from nearly 200 users over a month. The collected data was subjected to data cleaning and additional information extraction in Google BigQuery. Demographic analysis was conducted using Google LookerStudio, providing valuable insights into user behavior patterns during different office hours. Subsequently, various supervised and unsupervised machine learning models were employed to predict browsing categories based on the DNS log analysis. Among the models evaluated, the Random Forest Classifier (RFC) demonstrated exceptional performance, achieving high accuracy, precision, recall, and F1 Score during training, with values of 82.54%, 82.79%, 82.54%, and 81.81%, respectively. The trained RFC model showcased its robustness in minimizing the discrepancy between predicted probabilities and actual class values. The trained model was then exported and integrated into a virtual Linux machine to simulate an SDN environment. The experimental results showcased the system’s high accuracy in categorizing DNS queries during real-time testing, with 100% accuracy achieved for categories like Ads and Entertainment, and impressive accuracy rates of 98.57%, 87.5%, and 87.21% for Search Engines, Social Networks, and CDNs, respectively. The system’s reliability and effectiveness in intelligently managing network traffic were further demonstrated with slightly lower but still respectable accuracies of 81.82% and 80.95% for Computer/Technology and Learning categories, respectively. The predictive capabilities of the system have practical applications for office network management, including website blocking, traffic rerouting based on predictions, and bandwidth management, all facilitated through the SDN controller. The findings of this study highlight the efficacy of combining DNS log analysis, machine learning, and SDN integration for enhancing network security, optimizing resource allocation, and delivering an enhanced user experience in a standard office environment. The presented approach can serve as a blueprint for efficient network traffic management and intelligent decision-making in similar settings. Syed Abed Hossain M.Sc. in Computer Science 2024-05-29T09:15:39Z 2024-05-29T09:15:39Z ©2023 2023-08 Thesis ID 21266019 http://hdl.handle.net/10361/23002 en Brac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. 97 pages application/pdf Brac University
institution Brac University
collection Institutional Repository
language English
topic DNS traffic management
Machine learning
Virtual machine
User behavior analysis
Computer communication systems
Machine learning
Human-computer interaction.
spellingShingle DNS traffic management
Machine learning
Virtual machine
User behavior analysis
Computer communication systems
Machine learning
Human-computer interaction.
Hossain, Syed Abed
Efficient network traffic management and intelligent decision-making through machine learning and DNS log analysis
description This thesis is submitted in partial fulfillment of the requirements for the degree of Master of Science in Computer Science and Engineering, 2023.
author2 Rahman, Mohammad Zahidur
author_facet Rahman, Mohammad Zahidur
Hossain, Syed Abed
format Thesis
author Hossain, Syed Abed
author_sort Hossain, Syed Abed
title Efficient network traffic management and intelligent decision-making through machine learning and DNS log analysis
title_short Efficient network traffic management and intelligent decision-making through machine learning and DNS log analysis
title_full Efficient network traffic management and intelligent decision-making through machine learning and DNS log analysis
title_fullStr Efficient network traffic management and intelligent decision-making through machine learning and DNS log analysis
title_full_unstemmed Efficient network traffic management and intelligent decision-making through machine learning and DNS log analysis
title_sort efficient network traffic management and intelligent decision-making through machine learning and dns log analysis
publisher Brac University
publishDate 2024
url http://hdl.handle.net/10361/23002
work_keys_str_mv AT hossainsyedabed efficientnetworktrafficmanagementandintelligentdecisionmakingthroughmachinelearninganddnsloganalysis
_version_ 1814308414364319744