Show simple item record

dc.contributor.advisorMottalib, Md. Abdul
dc.contributor.advisorAjwad, Ajwad
dc.contributor.authorChoudhury, Joydhriti
dc.contributor.authorRoshni, Tanzima Rahman
dc.contributor.authorChowdhury, Md. Tawhidul Islam
dc.contributor.authorRayon, Raihanoor Reza
dc.date.accessioned2019-10-28T04:04:38Z
dc.date.available2019-10-28T04:04:38Z
dc.date.copyright2019
dc.date.issued2019-04
dc.identifier.otherID 15301125
dc.identifier.otherID 15301125
dc.identifier.otherID 16101321
dc.identifier.otherID 18141021
dc.identifier.urihttp://hdl.handle.net/10361/12810
dc.descriptionThis thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2019.en_US
dc.descriptionCataloged from PDF version of thesis.
dc.descriptionIncludes bibliographical references (pages 38-40).
dc.description.abstractMicroarray data is used to create groups of similar genes based on their phenotypic attributes. Information extracted from these groups of gene can be applied to path- way analysis, disease predictions, target identification in drug design and many other important applications and functionalities in biology. However, how to determine a distance metric to measure the similarities among genes has always been a great chal- lenge. In our work, we have studied sixteen combination of distance-linkage combina- tional metrics and tried to and the groups of similar genes based on their expression level by building phylogenetic tree. Furthermore, to validate our endings we have evaluate the output of the same trails on three different datasets. Our work suggests that, Maximum distance metric with the combination of Average linkage metrics gives the optimal quality while grouping similar genes together by building a phylogenetic tree.en_US
dc.description.statementofresponsibilityJoydhriti Choudhury
dc.description.statementofresponsibilityTanzima Rahman Roshni
dc.description.statementofresponsibilityMd. Tawhidul Islam Chowdhury
dc.description.statementofresponsibilityRaihanoor Reza Rayon
dc.format.extent65 pages
dc.language.isoenen_US
dc.publisherBrac Universityen_US
dc.rightsBrac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission.
dc.subjectBioinformaticsen_US
dc.subjectMicroarrayen_US
dc.subjectGene expressionen_US
dc.subjectPhylogenetic treeen_US
dc.subjectHierarchical clusteringen_US
dc.subjectDistance metricen_US
dc.subjectLinkage methoden_US
dc.subject.lcshCluster analysis.
dc.titleIdentifying the best metrics to find the best quality clusters of genes from gene expression dataen_US
dc.typeThesisen_US
dc.contributor.departmentDepartment of Computer Science and Engineering, Brac University
dc.description.degreeB. Computer Science


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record