A feature selection approach to determine obesity using machine learning method

This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2021.

Bibliografiset tiedot
Päätekijät: Antor, Shahidul Alam, Ahmed, Jawad, Nayen, Zulker, Tabassum, Farisha, Mahbub, Rasheda
Muut tekijät: Ajwad, Rasif
Aineistotyyppi: Opinnäyte
Kieli:English
Julkaistu: Brac University 2021
Aiheet:
Linkit:http://hdl.handle.net/10361/14972
id 10361-14972
record_format dspace
spelling 10361-149722022-01-26T10:13:18Z A feature selection approach to determine obesity using machine learning method Antor, Shahidul Alam Ahmed, Jawad Nayen, Zulker Tabassum, Farisha Mahbub, Rasheda Ajwad, Rasif Ashraf, Faisal Bin Department of Computer Science and Engineering, Brac University Obesity BMI Machine Learning Naive Bayes Random Forest Decision tree K-Nearest Neighbors (KNN) Logistic Regression Obesity. This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2021. Cataloged from PDF version of thesis. Includes bibliographical references (pages 51-53). The new health concern that is proliferating in developing and impoverished countries is obesity. It is recognized as a complex health issue caused by various factors such as genetics, behaviour, and other issues. Obesity is not just about physique or look; it is a persistent medical illness that opens the body to many diseases and shortens life.Obesity frequently results in a wide variety of other disorders, including cardiovascular disease, hypertension, diabetes, numerous malignancies, and more. The developed countries have already undertaken a few measures and are deeply concerned about their health issues. Thus, the people of low or mid-income countries are still unaware of this fact and will face significant health challenges in the future. Specifically, in Bangladesh, many people have diabetes, and recently, many people died due to heart disease and cancer, which could be prevented if they were health concerns. Recent studies say that the young generation is more prone to obesity as they are more influenced by western lifestyles, eating many junk foods, and spending the maximum of their time on the internet. Our research has collected more than 500 people’s data from different groups of people around Bangladesh. We aim to predict the future outcome at which BMI value range people are more prone to diseases. To predict the outcome, we have analyzed our sample dataset using machine learning approaches such as Naive Bayes, Random Forest, decision tree, The k-nearest neighbours (KNN), Logistic Regression. Among these algorithms, Decision Tree has given the best accuracy of 96.67%. For selecting essential variables from the dataset, we used the BorutaShap wrapper feature selection method. This algorithm delivers a better subset of attributes from a high volume of data and trains the model faster. As the Boruta algorithm selects the best feature, reduces the model size, and identifies the key features, it became easy to train our data set, so we got a better accuracy level using this algorithm in our reach. This researcher will help the people of Bangladesh to understand obesity and its detrimental aspects. Moreover, it will assist them in being more conscious of their health conditions and predicting which BMI level is a risk for them. Shahidul Alam Antor Jawad Ahmed Zulker Nayen Farisha Tabassum Rasheda Mahbub B. Computer Science 2021-09-04T13:27:50Z 2021-09-04T13:27:50Z 2021 2021-06 Thesis ID 17101242 ID 17101336 ID 17101273 ID 17101154 ID 17301077 http://hdl.handle.net/10361/14972 en Brac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. 54 pages application/pdf Brac University
institution Brac University
collection Institutional Repository
language English
topic Obesity
BMI
Machine Learning
Naive Bayes
Random Forest
Decision tree
K-Nearest Neighbors (KNN)
Logistic Regression
Obesity.
spellingShingle Obesity
BMI
Machine Learning
Naive Bayes
Random Forest
Decision tree
K-Nearest Neighbors (KNN)
Logistic Regression
Obesity.
Antor, Shahidul Alam
Ahmed, Jawad
Nayen, Zulker
Tabassum, Farisha
Mahbub, Rasheda
A feature selection approach to determine obesity using machine learning method
description This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2021.
author2 Ajwad, Rasif
author_facet Ajwad, Rasif
Antor, Shahidul Alam
Ahmed, Jawad
Nayen, Zulker
Tabassum, Farisha
Mahbub, Rasheda
format Thesis
author Antor, Shahidul Alam
Ahmed, Jawad
Nayen, Zulker
Tabassum, Farisha
Mahbub, Rasheda
author_sort Antor, Shahidul Alam
title A feature selection approach to determine obesity using machine learning method
title_short A feature selection approach to determine obesity using machine learning method
title_full A feature selection approach to determine obesity using machine learning method
title_fullStr A feature selection approach to determine obesity using machine learning method
title_full_unstemmed A feature selection approach to determine obesity using machine learning method
title_sort feature selection approach to determine obesity using machine learning method
publisher Brac University
publishDate 2021
url http://hdl.handle.net/10361/14972
work_keys_str_mv AT antorshahidulalam afeatureselectionapproachtodetermineobesityusingmachinelearningmethod
AT ahmedjawad afeatureselectionapproachtodetermineobesityusingmachinelearningmethod
AT nayenzulker afeatureselectionapproachtodetermineobesityusingmachinelearningmethod
AT tabassumfarisha afeatureselectionapproachtodetermineobesityusingmachinelearningmethod
AT mahbubrasheda afeatureselectionapproachtodetermineobesityusingmachinelearningmethod
AT antorshahidulalam featureselectionapproachtodetermineobesityusingmachinelearningmethod
AT ahmedjawad featureselectionapproachtodetermineobesityusingmachinelearningmethod
AT nayenzulker featureselectionapproachtodetermineobesityusingmachinelearningmethod
AT tabassumfarisha featureselectionapproachtodetermineobesityusingmachinelearningmethod
AT mahbubrasheda featureselectionapproachtodetermineobesityusingmachinelearningmethod
_version_ 1814308140726878208