Prediction of diabetes induced complications using different machine learning algorithms
This thesis is submitted in partial fulfilment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2018.
Main Authors: | , , |
---|---|
其他作者: | |
格式: | Thesis |
语言: | English |
出版: |
BRAC University
2018
|
主题: | |
在线阅读: | http://hdl.handle.net/10361/10945 |
id |
10361-10945 |
---|---|
record_format |
dspace |
spelling |
10361-109452022-01-26T10:05:01Z Prediction of diabetes induced complications using different machine learning algorithms Rahman, Tahsinur Farzana, Sheikh Mastura Khanom, Aniqa Zaida Alam, Md. Ashraful Department of Computer Science and Engineering, BRAC University PCA Diabetes complications missForest SVM Naïve bayes Logistic regression Computers -- Intelligence (AI) & Semantics Computers -- Machine Theory Mathematical theory of computation This thesis is submitted in partial fulfilment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2018. Cataloged from PDF version of thesis. Includes bibliographical references (pages 58-61). Machine Learning is an ever expanding field of Artificial Intelligence which uses huge amount of data to develop algorithms that can detect patterns and systems. One such application of Machine Learning is developing predictive models for disease prediction. On the other hand, in spite of huge advancements in Medical Science and discovery of complex diseases making everyone more health conscious, there is no way in Medical Science to predict prevalence of diseases. However, upon having relevant data Machine Learning methods can predict onset of many diseases. This paper presents the comparative analysis of different Machine Learning algorithms and their results in predicting the health complications related to Diabetes Mellitus. Diabetes Mellitus is a medical condition of the Pancreas in which the body‘s ability to produce or respond to the hormone, Insulin, diminishes. As a result, over time it damages other organs in the body- primarily Kidney, Liver, Eyes, Heart and Brain. Since in most cases the threats posed by Diabetes are not known before it is too late, hence it requires a great amount of consciousness in order to prevent onset of other related diseases. To this day, there is no prevention of Diabetes, since it is largely dependent on the genetics of a person. However, if a person is monitored closely it is possible to indicate Diabetes related complications. This proposed model uses time series data of a year that contains 164 features including results of different pathological tests. Methods such as Logistic Regression, SVM, Naïve Bayes, Decision Tree and Random Forest have been used in a supervised environment to predict the probability of Diabetes induced Nephropathy and Cardiovascular disease. PCA was applied beforehand to reduce the dimensionality of the dataset. Decision Tree without PCA produced the best results for Nephropathy with an AUC score of 0.87. While Naïve Bayes without PCA produced the best results for Cardiovascular disease, with an AUC score of 0.74. In summary, the model proposed in this paper predicts the risk of Nephropathy better than the risk of Cardiovascular disease. Tahsinur Rahman Sheikh Mastura Farzana Aniqa Zaida Khanom B. Computer Science and Engineering 2018-12-03T06:53:41Z 2018-12-03T06:53:41Z 2018 2018-08 Thesis ID 15101128 ID 15101077 ID 15101106 http://hdl.handle.net/10361/10945 en BRAC University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. 61 pages application/pdf BRAC University |
institution |
Brac University |
collection |
Institutional Repository |
language |
English |
topic |
PCA Diabetes complications missForest SVM Naïve bayes Logistic regression Computers -- Intelligence (AI) & Semantics Computers -- Machine Theory Mathematical theory of computation |
spellingShingle |
PCA Diabetes complications missForest SVM Naïve bayes Logistic regression Computers -- Intelligence (AI) & Semantics Computers -- Machine Theory Mathematical theory of computation Rahman, Tahsinur Farzana, Sheikh Mastura Khanom, Aniqa Zaida Prediction of diabetes induced complications using different machine learning algorithms |
description |
This thesis is submitted in partial fulfilment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2018. |
author2 |
Alam, Md. Ashraful |
author_facet |
Alam, Md. Ashraful Rahman, Tahsinur Farzana, Sheikh Mastura Khanom, Aniqa Zaida |
format |
Thesis |
author |
Rahman, Tahsinur Farzana, Sheikh Mastura Khanom, Aniqa Zaida |
author_sort |
Rahman, Tahsinur |
title |
Prediction of diabetes induced complications using different machine learning algorithms |
title_short |
Prediction of diabetes induced complications using different machine learning algorithms |
title_full |
Prediction of diabetes induced complications using different machine learning algorithms |
title_fullStr |
Prediction of diabetes induced complications using different machine learning algorithms |
title_full_unstemmed |
Prediction of diabetes induced complications using different machine learning algorithms |
title_sort |
prediction of diabetes induced complications using different machine learning algorithms |
publisher |
BRAC University |
publishDate |
2018 |
url |
http://hdl.handle.net/10361/10945 |
work_keys_str_mv |
AT rahmantahsinur predictionofdiabetesinducedcomplicationsusingdifferentmachinelearningalgorithms AT farzanasheikhmastura predictionofdiabetesinducedcomplicationsusingdifferentmachinelearningalgorithms AT khanomaniqazaida predictionofdiabetesinducedcomplicationsusingdifferentmachinelearningalgorithms |
_version_ |
1814307186273157120 |