Investigation cloud data storage

Sumaiya Binte Mostafa; Tabassum, Firoza

dc.contributor.advisor	Khan, Mumit
dc.contributor.author	Sumaiya Binte Mostafa
dc.contributor.author	Tabassum, Firoza
dc.date.accessioned	2013-04-16T05:16:55Z
dc.date.available	2013-04-16T05:16:55Z
dc.date.copyright	2012
dc.date.issued	12/12/2012
dc.identifier.other	ID 08301001
dc.identifier.other	ID 09101028
dc.identifier.uri	http://hdl.handle.net/10361/2303
dc.description	Cataloged from PDF version of thesis report.
dc.description	Includes bibliographical references (page 93).
dc.description	This thesis report is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2012.	en_US
dc.description.abstract	A cloud database is a database that typically runs on a cloud computing platform. Of the databases available on the cloud, traditional data model is SQL-based. The recent trend is to move on to NOSQL data model. Now, the question is which database approach is better to choose in this era of ‘Big Data’? SQL databases are difficult to scale, meaning they are not natively suited to a cloud environment, although cloud database services based on SQL are attempting to address this challenge. On the other hand, NOSQL databases are built to service heavy read/write loads and are able scale up and down easily, and therefore they are more natively suited to running on the cloud. Our aim for thesis is to investigate suitable data storage for cloud. Considering the ‘Big Data’ scenario of today’s world, we set forth to choose the NOSQL database model as the preferred solution for cloud computing. This paper aims to show two investigations on different branches of cloud data storage. The first analysis is based on the case study of performance benchmarking on 3 popular NOSQL databases - MongoDB, Cassandra, and HBase. The next part of investigation includes an experiment on the most popular ‘Big Data’ management framework – namely, Hadoop. Hadoop uses MapReduce for parallel computation, but writing MapReduce function is hard for programmers. So, our experiment is to configure HIVE data warehousing system on the top of Hadoop as a wrapper, so that end users gets benefit of using a SQL-like language, which is known as ‘HiveQL’ and provided by HIVE even if with the environment of complex MapReduce function.	en_US
dc.description.statementofresponsibility	Sumaiya Binte Mostafa
dc.description.statementofresponsibility	Firoza Tabassum
dc.format.extent	107 pages
dc.language.iso	en	en_US
dc.publisher	BRAC University	en_US
dc.rights	BRAC University thesis are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission.
dc.subject	Computer science and engineering
dc.subject	Cloud Data	en_US
dc.title	Investigation cloud data storage	en_US
dc.type	Thesis	en_US
dc.contributor.department	Department of Computer Science and Engineering, BRAC University
dc.description.degree	B. Computer Science and Engineering

Files in this item

Name:: Thesis_Paper.pdf
Size:: 3.793Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

Thesis & Report, BSc (Computer Science and Engineering) [1480]

Show simple item record