Prediction of diabetes induced complications using different machine learning algorithms

This thesis is submitted in partial fulfilment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2018.

书目详细资料
Main Authors: Rahman, Tahsinur, Farzana, Sheikh Mastura, Khanom, Aniqa Zaida
其他作者: Alam, Md. Ashraful
格式: Thesis
语言:English
出版: BRAC University 2018
主题:
在线阅读:http://hdl.handle.net/10361/10945
id 10361-10945
record_format dspace
spelling 10361-109452022-01-26T10:05:01Z Prediction of diabetes induced complications using different machine learning algorithms Rahman, Tahsinur Farzana, Sheikh Mastura Khanom, Aniqa Zaida Alam, Md. Ashraful Department of Computer Science and Engineering, BRAC University PCA Diabetes complications missForest SVM Naïve bayes Logistic regression Computers -- Intelligence (AI) & Semantics Computers -- Machine Theory Mathematical theory of computation This thesis is submitted in partial fulfilment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2018. Cataloged from PDF version of thesis. Includes bibliographical references (pages 58-61). Machine Learning is an ever expanding field of Artificial Intelligence which uses huge amount of data to develop algorithms that can detect patterns and systems. One such application of Machine Learning is developing predictive models for disease prediction. On the other hand, in spite of huge advancements in Medical Science and discovery of complex diseases making everyone more health conscious, there is no way in Medical Science to predict prevalence of diseases. However, upon having relevant data Machine Learning methods can predict onset of many diseases. This paper presents the comparative analysis of different Machine Learning algorithms and their results in predicting the health complications related to Diabetes Mellitus. Diabetes Mellitus is a medical condition of the Pancreas in which the body‘s ability to produce or respond to the hormone, Insulin, diminishes. As a result, over time it damages other organs in the body- primarily Kidney, Liver, Eyes, Heart and Brain. Since in most cases the threats posed by Diabetes are not known before it is too late, hence it requires a great amount of consciousness in order to prevent onset of other related diseases. To this day, there is no prevention of Diabetes, since it is largely dependent on the genetics of a person. However, if a person is monitored closely it is possible to indicate Diabetes related complications. This proposed model uses time series data of a year that contains 164 features including results of different pathological tests. Methods such as Logistic Regression, SVM, Naïve Bayes, Decision Tree and Random Forest have been used in a supervised environment to predict the probability of Diabetes induced Nephropathy and Cardiovascular disease. PCA was applied beforehand to reduce the dimensionality of the dataset. Decision Tree without PCA produced the best results for Nephropathy with an AUC score of 0.87. While Naïve Bayes without PCA produced the best results for Cardiovascular disease, with an AUC score of 0.74. In summary, the model proposed in this paper predicts the risk of Nephropathy better than the risk of Cardiovascular disease. Tahsinur Rahman Sheikh Mastura Farzana Aniqa Zaida Khanom B. Computer Science and Engineering 2018-12-03T06:53:41Z 2018-12-03T06:53:41Z 2018 2018-08 Thesis ID 15101128 ID 15101077 ID 15101106 http://hdl.handle.net/10361/10945 en BRAC University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. 61 pages application/pdf BRAC University
institution Brac University
collection Institutional Repository
language English
topic PCA
Diabetes complications
missForest
SVM
Naïve bayes
Logistic regression
Computers -- Intelligence (AI) & Semantics
Computers -- Machine Theory
Mathematical theory of computation
spellingShingle PCA
Diabetes complications
missForest
SVM
Naïve bayes
Logistic regression
Computers -- Intelligence (AI) & Semantics
Computers -- Machine Theory
Mathematical theory of computation
Rahman, Tahsinur
Farzana, Sheikh Mastura
Khanom, Aniqa Zaida
Prediction of diabetes induced complications using different machine learning algorithms
description This thesis is submitted in partial fulfilment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2018.
author2 Alam, Md. Ashraful
author_facet Alam, Md. Ashraful
Rahman, Tahsinur
Farzana, Sheikh Mastura
Khanom, Aniqa Zaida
format Thesis
author Rahman, Tahsinur
Farzana, Sheikh Mastura
Khanom, Aniqa Zaida
author_sort Rahman, Tahsinur
title Prediction of diabetes induced complications using different machine learning algorithms
title_short Prediction of diabetes induced complications using different machine learning algorithms
title_full Prediction of diabetes induced complications using different machine learning algorithms
title_fullStr Prediction of diabetes induced complications using different machine learning algorithms
title_full_unstemmed Prediction of diabetes induced complications using different machine learning algorithms
title_sort prediction of diabetes induced complications using different machine learning algorithms
publisher BRAC University
publishDate 2018
url http://hdl.handle.net/10361/10945
work_keys_str_mv AT rahmantahsinur predictionofdiabetesinducedcomplicationsusingdifferentmachinelearningalgorithms
AT farzanasheikhmastura predictionofdiabetesinducedcomplicationsusingdifferentmachinelearningalgorithms
AT khanomaniqazaida predictionofdiabetesinducedcomplicationsusingdifferentmachinelearningalgorithms
_version_ 1814307186273157120