dc.contributor.advisor | Ali, Md. Haider | |
dc.contributor.author | Galib, Asadullah Al | |
dc.contributor.author | Rahman, Mohammad Mohaimanur | |
dc.date.accessioned | 2017-05-31T05:14:35Z | |
dc.date.available | 2017-05-31T05:14:35Z | |
dc.date.copyright | 2017 | |
dc.date.issued | 2017-04 | |
dc.identifier.other | ID 12201098 | |
dc.identifier.other | ID 13301088 | |
dc.identifier.uri | http://hdl.handle.net/10361/8215 | |
dc.description | This thesis report is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2017. | en_US |
dc.description | Cataloged from PDF version of thesis report. | |
dc.description | Includes bibliographical references (page 56-57). | |
dc.description.abstract | Inferring regulatory network from gene expression data only is considered a challenging task in systems biology, and the introduction of various high-throughput DNA microarray technologies in the collection of expression data has significantly increased the amount of data to be analyzed by existing algorithms. All of these algorithms focus on different issues regarding the inference of gene regulatory network (GRN) and their methodologies work better only for certain types of datasets and/or regulatory networks. As a result, they have inherent limitations in dealing with different types of datasets. In this paper, we propose a novel method to infer gene regulatory network from expression data which utilizes K-means Clustering along with some properties of entropy from information theory. The proposed method has two main components, first grouping the genes of a dataset into given number of clusters and then finding statistically significant interactions among genes of each individual cluster and selected nearby clusters. To achieve this, an information theoretic approach based on Entropy Reduction is used to finally generate a regulatory interaction matrix consisting of all genes. The purpose of grouping genes in clusters based on the similarity of expression level is to minimize the search space of regulatory interactions among genes. The Entropy Reduction Technique (ERT) finds regulatory interactions with reduced number of genes. To assess the performance of our algorithm, we used datasets from DREAM5 – Network Inference challenge [6], DREAM4 – In Silico Network challenge [7] and one in silico dataset generated by GeneNetWeaver [8]. The performance of our algorithm was compared with the result of ARACNE, a popular information theoretic approach to reverse engineer gene regulatory network from expression dataset. We used precision and recall as performance measures. Our algorithm showed significant improvement in the precision and recall percentage over the network generated by ARACNE. We also compared our results among different threshold values and different numbers of clusters with three versions of our algorithm -No Clustering, Unmerged Clustering and Selected Merged Clustering. | en_US |
dc.description.statementofresponsibility | Asadullah Al Galib | |
dc.description.statementofresponsibility | Mohammad Mohaimanur Rahman | |
dc.format.extent | 57 pages. | |
dc.language.iso | en | en_US |
dc.rights | BRAC University thesis are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. | |
dc.subject | Gene regulatory network | en_US |
dc.subject | Entropy reduction | en_US |
dc.title | Inference of gene regulatory metwork (GRN) rrom gene expression data using k-means clustering and entropy based selection of interactions | en_US |
dc.type | Thesis | en_US |
dc.contributor.department | Department of Computer Science and Engineering, BRAC University | |
dc.description.degree | B. Computer Science and Engineering | |