Prediction of genetic mutation from clinical data of sickle cell disease using few-shot siamese bidirectional LSTM and federated learning

This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2023.

ग्रंथसूची विवरण
मुख्य लेखकों: Alam, Salman, Oni, Atquiya Labiba, Samir, Jubair, Hossain, Asif Mosharrof
अन्य लेखक: Alam, Md.Golam Rabiul
स्वरूप: थीसिस
भाषा:English
प्रकाशित: Brac University 2023
विषय:
ऑनलाइन पहुंच:http://hdl.handle.net/10361/21953
id 10361-21953
record_format dspace
spelling 10361-219532023-12-11T21:02:34Z Prediction of genetic mutation from clinical data of sickle cell disease using few-shot siamese bidirectional LSTM and federated learning Alam, Salman Oni, Atquiya Labiba Samir, Jubair Hossain, Asif Mosharrof Alam, Md.Golam Rabiul Reza, Md.Tanzim Department of Computer Science and Engineering, Brac University Sickle cell Clinical data Genotype Federated learning Few-shot siamese Federated siamese bidirectional LSTM Machine learning Computer algorithms Sickle cell anemia This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2023. Cataloged from PDF version of thesis. Includes bibliographical references (pages 42-44). Sickle Cell Disease is a monogenic genetic disorder which often leads to various repercussions affecting multiple vital organs simultaneously. However, the treat- ment for Sickle Cell is diverse and often varies from patient to patient, but several background studies revealed the progression and symptoms of Sickle Cell can be predicted to a great extent based on a patient’s genetic mutation type in the HBB gene. Moreover, such research regarding genetic mutation prediction can be seen in other fields of medicine such as cancer, but in the case of Sickle Cell it is scarce. Fur- thermore, other limitations include complexity and unavailability of genetic testing, limited clinical data available and privacy concerns regarding medical information of patients. Hence, our study aimed to build a Federated Siamese Bidirectional LSTM to predict the Sickle Cell genotype from clinical data, in case of sparse and decentralized data. Consequently, a Sickle Cell clinical dataset with 216 instances and 4 different genotype class labels was pre-processed accordingly to train and evaluate the model performance. The dataset was then used to create pairs with corresponding similarity scores and the Siamese Bi-LSTM was trained for several epochs to compute similarity between two instances. The data was divided among client devices in case of federated, while the Siamese Bi-LSTM trained locally to update the global model and the test data was then used to assess their perfor- mance. Thus, based on the performance analysis the Siamese Bi-LSTM achieved accuracy of 90.45% with f1 score of 90.66% and the Federated Siamese Bi-LSTM model (FFSB-LSTM) achieved accuracy of 88.25% and f1 score of 88.57% show- ing significant improvement compared to the baseline KNN and Logistic Regression models. Salman Alam Atquiya Labiba Oni Jubair Samir Asif Mosharrof Hossain B.Sc. in Computer Science and Engineering 2023-12-11T07:29:01Z 2023-12-11T07:29:01Z 2023 2023-05 Thesis ID 19301037 ID 19301039 ID 22241149 ID 19201006 http://hdl.handle.net/10361/21953 en Brac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. 55 pages application/pdf Brac University
institution Brac University
collection Institutional Repository
language English
topic Sickle cell
Clinical data
Genotype
Federated learning
Few-shot siamese
Federated siamese bidirectional LSTM
Machine learning
Computer algorithms
Sickle cell anemia
spellingShingle Sickle cell
Clinical data
Genotype
Federated learning
Few-shot siamese
Federated siamese bidirectional LSTM
Machine learning
Computer algorithms
Sickle cell anemia
Alam, Salman
Oni, Atquiya Labiba
Samir, Jubair
Hossain, Asif Mosharrof
Prediction of genetic mutation from clinical data of sickle cell disease using few-shot siamese bidirectional LSTM and federated learning
description This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2023.
author2 Alam, Md.Golam Rabiul
author_facet Alam, Md.Golam Rabiul
Alam, Salman
Oni, Atquiya Labiba
Samir, Jubair
Hossain, Asif Mosharrof
format Thesis
author Alam, Salman
Oni, Atquiya Labiba
Samir, Jubair
Hossain, Asif Mosharrof
author_sort Alam, Salman
title Prediction of genetic mutation from clinical data of sickle cell disease using few-shot siamese bidirectional LSTM and federated learning
title_short Prediction of genetic mutation from clinical data of sickle cell disease using few-shot siamese bidirectional LSTM and federated learning
title_full Prediction of genetic mutation from clinical data of sickle cell disease using few-shot siamese bidirectional LSTM and federated learning
title_fullStr Prediction of genetic mutation from clinical data of sickle cell disease using few-shot siamese bidirectional LSTM and federated learning
title_full_unstemmed Prediction of genetic mutation from clinical data of sickle cell disease using few-shot siamese bidirectional LSTM and federated learning
title_sort prediction of genetic mutation from clinical data of sickle cell disease using few-shot siamese bidirectional lstm and federated learning
publisher Brac University
publishDate 2023
url http://hdl.handle.net/10361/21953
work_keys_str_mv AT alamsalman predictionofgeneticmutationfromclinicaldataofsicklecelldiseaseusingfewshotsiamesebidirectionallstmandfederatedlearning
AT oniatquiyalabiba predictionofgeneticmutationfromclinicaldataofsicklecelldiseaseusingfewshotsiamesebidirectionallstmandfederatedlearning
AT samirjubair predictionofgeneticmutationfromclinicaldataofsicklecelldiseaseusingfewshotsiamesebidirectionallstmandfederatedlearning
AT hossainasifmosharrof predictionofgeneticmutationfromclinicaldataofsicklecelldiseaseusingfewshotsiamesebidirectionallstmandfederatedlearning
_version_ 1814309071668379648