• Login
    • Library Home
    View Item 
    •   BracU IR
    • School of Data and Sciences (SDS)
    • Department of Computer Science and Engineering (CSE)
    • Thesis & Report, BSc (Computer Science and Engineering)
    • View Item
    •   BracU IR
    • School of Data and Sciences (SDS)
    • Department of Computer Science and Engineering (CSE)
    • Thesis & Report, BSc (Computer Science and Engineering)
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Investigation cloud data storage

    Thumbnail
    View/Open
    Thesis_Paper.pdf (3.793Mb)
    Date
    12/12/2012
    Publisher
    BRAC University
    Author
    Sumaiya Binte Mostafa
    Tabassum, Firoza
    Metadata
    Show full item record
    URI
    http://hdl.handle.net/10361/2303
    Abstract
    A cloud database is a database that typically runs on a cloud computing platform. Of the databases available on the cloud, traditional data model is SQL-based. The recent trend is to move on to NOSQL data model. Now, the question is which database approach is better to choose in this era of ‘Big Data’? SQL databases are difficult to scale, meaning they are not natively suited to a cloud environment, although cloud database services based on SQL are attempting to address this challenge. On the other hand, NOSQL databases are built to service heavy read/write loads and are able scale up and down easily, and therefore they are more natively suited to running on the cloud. Our aim for thesis is to investigate suitable data storage for cloud. Considering the ‘Big Data’ scenario of today’s world, we set forth to choose the NOSQL database model as the preferred solution for cloud computing. This paper aims to show two investigations on different branches of cloud data storage. The first analysis is based on the case study of performance benchmarking on 3 popular NOSQL databases - MongoDB, Cassandra, and HBase. The next part of investigation includes an experiment on the most popular ‘Big Data’ management framework – namely, Hadoop. Hadoop uses MapReduce for parallel computation, but writing MapReduce function is hard for programmers. So, our experiment is to configure HIVE data warehousing system on the top of Hadoop as a wrapper, so that end users gets benefit of using a SQL-like language, which is known as ‘HiveQL’ and provided by HIVE even if with the environment of complex MapReduce function.
    Keywords
    Computer science and engineering; Cloud Data
     
    Description
    Cataloged from PDF version of thesis report.
     
    Includes bibliographical references (page 93).
     
    This thesis report is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2012.
    Department
    Department of Computer Science and Engineering, BRAC University
    Collections
    • Thesis & Report, BSc (Computer Science and Engineering)

    Copyright © 2008-2019 Ayesha Abed Library, Brac University 
    Contact Us | Send Feedback
     

     

    Policy Guidelines

    • BracU Policy
    • Publisher Policy

    Browse

    All of BracU Institutional RepositoryCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

    My Account

    LoginRegister

    Statistics

    View Usage Statistics

    Copyright © 2008-2019 Ayesha Abed Library, Brac University 
    Contact Us | Send Feedback