dc.contributor.advisor | Mottalib, Md. Abdul | |
dc.contributor.advisor | Ajwad, Ajwad | |
dc.contributor.author | Choudhury, Joydhriti | |
dc.contributor.author | Roshni, Tanzima Rahman | |
dc.contributor.author | Chowdhury, Md. Tawhidul Islam | |
dc.contributor.author | Rayon, Raihanoor Reza | |
dc.date.accessioned | 2019-10-28T04:04:38Z | |
dc.date.available | 2019-10-28T04:04:38Z | |
dc.date.copyright | 2019 | |
dc.date.issued | 2019-04 | |
dc.identifier.other | ID 15301125 | |
dc.identifier.other | ID 15301125 | |
dc.identifier.other | ID 16101321 | |
dc.identifier.other | ID 18141021 | |
dc.identifier.uri | http://hdl.handle.net/10361/12810 | |
dc.description | This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2019. | en_US |
dc.description | Cataloged from PDF version of thesis. | |
dc.description | Includes bibliographical references (pages 38-40). | |
dc.description.abstract | Microarray data is used to create groups of similar genes based on their phenotypic
attributes. Information extracted from these groups of gene can be applied to path-
way analysis, disease predictions, target identification in drug design and many other
important applications and functionalities in biology. However, how to determine a
distance metric to measure the similarities among genes has always been a great chal-
lenge. In our work, we have studied sixteen combination of distance-linkage combina-
tional metrics and tried to and the groups of similar genes based on their expression
level by building phylogenetic tree. Furthermore, to validate our endings we have
evaluate the output of the same trails on three different datasets. Our work suggests
that, Maximum distance metric with the combination of Average linkage metrics gives
the optimal quality while grouping similar genes together by building a phylogenetic
tree. | en_US |
dc.description.statementofresponsibility | Joydhriti Choudhury | |
dc.description.statementofresponsibility | Tanzima Rahman Roshni | |
dc.description.statementofresponsibility | Md. Tawhidul Islam Chowdhury | |
dc.description.statementofresponsibility | Raihanoor Reza Rayon | |
dc.format.extent | 65 pages | |
dc.language.iso | en | en_US |
dc.publisher | Brac University | en_US |
dc.rights | Brac University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. | |
dc.subject | Bioinformatics | en_US |
dc.subject | Microarray | en_US |
dc.subject | Gene expression | en_US |
dc.subject | Phylogenetic tree | en_US |
dc.subject | Hierarchical clustering | en_US |
dc.subject | Distance metric | en_US |
dc.subject | Linkage method | en_US |
dc.subject.lcsh | Cluster analysis. | |
dc.title | Identifying the best metrics to find the best quality clusters of genes from gene expression data | en_US |
dc.type | Thesis | en_US |
dc.contributor.department | Department of Computer Science and Engineering, Brac University | |
dc.description.degree | B. Computer Science | |