Please use this identifier to cite or link to this item: https://scholarbank.nus.edu.sg/handle/10635/118221
Title: Supporting Efficient Database Processing in Mapreduce
Authors: LU PENG
Keywords: Database, Index, Query Processing, Cloud Computing, MapReduce
Issue Date: 6-Aug-2014
Citation: LU PENG (2014-08-06). Supporting Efficient Database Processing in Mapreduce. ScholarBank@NUS Repository.
Abstract: Cloud computing has emerged as a multi-billion dollar industry and as a successful paradigm for web-scale application deployment. However, due to the heterogeneity and massiveness nature of data in the Cloud, current Cloud systems trade rigorous data management functionalities for better versatility and scalability. The overarching goal of this dissertation is to exploit the opportunity for a better marriage of RDBMS technologies and Cloud Computing systems. This dissertation advances the research in this topic by improving two critical facets of large scale data processing systems. First, we propose an architecture to support the usage of DBMS-like indexes in MapReduce systems to facilitate the storage and processing of structured data. We start with devising a bitmap-based indexing scheme that provides superior space ef?ciency, and improves the performance of MapReduce programs on a speci?c category of data. We then generalize the index application, and propose a generalized index framework for MapReduce systems to handle large data and applications. Second, we propose models and techniques to incorporate the power of MapReduce with parallel database system technologies in query processing.
URI: http://scholarbank.nus.edu.sg/handle/10635/118221
Appears in Collections:Ph.D Theses (Open)

Show full item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
LuP.pdf1.1 MBAdobe PDF

OPEN

NoneView/Download

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.