Inference of gene regulatory metwork (GRN) rrom gene expression data using k-means clustering and entropy based selection of interactions

Galib, Asadullah Al; Rahman, Mohammad Mohaimanur

dc.contributor.advisor	Ali, Md. Haider
dc.contributor.author	Galib, Asadullah Al
dc.contributor.author	Rahman, Mohammad Mohaimanur
dc.date.accessioned	2017-05-31T05:14:35Z
dc.date.available	2017-05-31T05:14:35Z
dc.date.copyright	2017
dc.date.issued	2017-04
dc.identifier.other	ID 12201098
dc.identifier.other	ID 13301088
dc.identifier.uri	http://hdl.handle.net/10361/8215
dc.description	This thesis report is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2017.	en_US
dc.description	Cataloged from PDF version of thesis report.
dc.description	Includes bibliographical references (page 56-57).
dc.description.abstract	Inferring regulatory network from gene expression data only is considered a challenging task in systems biology, and the introduction of various high-throughput DNA microarray technologies in the collection of expression data has significantly increased the amount of data to be analyzed by existing algorithms. All of these algorithms focus on different issues regarding the inference of gene regulatory network (GRN) and their methodologies work better only for certain types of datasets and/or regulatory networks. As a result, they have inherent limitations in dealing with different types of datasets. In this paper, we propose a novel method to infer gene regulatory network from expression data which utilizes K-means Clustering along with some properties of entropy from information theory. The proposed method has two main components, first grouping the genes of a dataset into given number of clusters and then finding statistically significant interactions among genes of each individual cluster and selected nearby clusters. To achieve this, an information theoretic approach based on Entropy Reduction is used to finally generate a regulatory interaction matrix consisting of all genes. The purpose of grouping genes in clusters based on the similarity of expression level is to minimize the search space of regulatory interactions among genes. The Entropy Reduction Technique (ERT) finds regulatory interactions with reduced number of genes. To assess the performance of our algorithm, we used datasets from DREAM5 – Network Inference challenge [6], DREAM4 – In Silico Network challenge [7] and one in silico dataset generated by GeneNetWeaver [8]. The performance of our algorithm was compared with the result of ARACNE, a popular information theoretic approach to reverse engineer gene regulatory network from expression dataset. We used precision and recall as performance measures. Our algorithm showed significant improvement in the precision and recall percentage over the network generated by ARACNE. We also compared our results among different threshold values and different numbers of clusters with three versions of our algorithm -No Clustering, Unmerged Clustering and Selected Merged Clustering.	en_US
dc.description.statementofresponsibility	Asadullah Al Galib
dc.description.statementofresponsibility	Mohammad Mohaimanur Rahman
dc.format.extent	57 pages.
dc.language.iso	en	en_US
dc.rights	BRAC University thesis are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission.
dc.subject	Gene regulatory network	en_US
dc.subject	Entropy reduction	en_US
dc.title	Inference of gene regulatory metwork (GRN) rrom gene expression data using k-means clustering and entropy based selection of interactions	en_US
dc.type	Thesis	en_US
dc.contributor.department	Department of Computer Science and Engineering, BRAC University
dc.description.degree	B. Computer Science and Engineering

Files in this item

Name:: 12201098, 13301088_CSE.pdf
Size:: 2.028Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

Thesis & Report, BSc (Computer Science and Engineering) [1586]

Show simple item record