Show simple item record

dc.contributor.advisorAlam, Md.Golam Rabiul
dc.contributor.advisorReza, Md.Tanzim
dc.contributor.authorAlam, Salman
dc.contributor.authorOni, Atquiya Labiba
dc.contributor.authorSamir, Jubair
dc.contributor.authorHossain, Asif Mosharrof
dc.date.accessioned2023-12-11T07:29:01Z
dc.date.available2023-12-11T07:29:01Z
dc.date.copyright2023
dc.date.issued2023-05
dc.identifier.otherID 19301037
dc.identifier.otherID 19301039
dc.identifier.otherID 22241149
dc.identifier.otherID 19201006
dc.identifier.urihttp://hdl.handle.net/10361/21953
dc.descriptionThis thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2023.en_US
dc.descriptionCataloged from PDF version of thesis.
dc.descriptionIncludes bibliographical references (pages 42-44).
dc.description.abstractSickle Cell Disease is a monogenic genetic disorder which often leads to various repercussions affecting multiple vital organs simultaneously. However, the treat- ment for Sickle Cell is diverse and often varies from patient to patient, but several background studies revealed the progression and symptoms of Sickle Cell can be predicted to a great extent based on a patient’s genetic mutation type in the HBB gene. Moreover, such research regarding genetic mutation prediction can be seen in other fields of medicine such as cancer, but in the case of Sickle Cell it is scarce. Fur- thermore, other limitations include complexity and unavailability of genetic testing, limited clinical data available and privacy concerns regarding medical information of patients. Hence, our study aimed to build a Federated Siamese Bidirectional LSTM to predict the Sickle Cell genotype from clinical data, in case of sparse and decentralized data. Consequently, a Sickle Cell clinical dataset with 216 instances and 4 different genotype class labels was pre-processed accordingly to train and evaluate the model performance. The dataset was then used to create pairs with corresponding similarity scores and the Siamese Bi-LSTM was trained for several epochs to compute similarity between two instances. The data was divided among client devices in case of federated, while the Siamese Bi-LSTM trained locally to update the global model and the test data was then used to assess their perfor- mance. Thus, based on the performance analysis the Siamese Bi-LSTM achieved accuracy of 90.45% with f1 score of 90.66% and the Federated Siamese Bi-LSTM model (FFSB-LSTM) achieved accuracy of 88.25% and f1 score of 88.57% show- ing significant improvement compared to the baseline KNN and Logistic Regression models.en_US
dc.description.statementofresponsibilitySalman Alam
dc.description.statementofresponsibilityAtquiya Labiba Oni
dc.description.statementofresponsibilityJubair Samir
dc.description.statementofresponsibilityAsif Mosharrof Hossain
dc.format.extent55 pages
dc.language.isoenen_US
dc.publisherBrac Universityen_US
dc.rightsBrac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission.
dc.subjectSickle cellen_US
dc.subjectClinical dataen_US
dc.subjectGenotypeen_US
dc.subjectFederated learningen_US
dc.subjectFew-shot siameseen_US
dc.subjectFederated siamese bidirectional LSTMen_US
dc.subject.lcshMachine learning
dc.subject.lcshComputer algorithms
dc.subject.lcshSickle cell anemia
dc.titlePrediction of genetic mutation from clinical data of sickle cell disease using few-shot siamese bidirectional LSTM and federated learningen_US
dc.typeThesisen_US
dc.contributor.departmentDepartment of Computer Science and Engineering, Brac University
dc.description.degreeB.Sc. in Computer Science and Engineering


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record