Please use this identifier to cite or link to this item: https://scholarbank.nus.edu.sg/handle/10635/29537
DC FieldValue
dc.titleLlama: Leveraging columnar storage for scalable join processing in mapreduce.
dc.contributor.authorLIN YUTING
dc.date.accessioned2011-11-30T18:00:31Z
dc.date.available2011-11-30T18:00:31Z
dc.date.issued2011-08-25
dc.identifier.citationLIN YUTING (2011-08-25). Llama: Leveraging columnar storage for scalable join processing in mapreduce.. ScholarBank@NUS Repository.
dc.identifier.urihttp://scholarbank.nus.edu.sg/handle/10635/29537
dc.description.abstractTo achieve high reliability and scalability, most large-scale data warehouse systems have adopted the cluster-based architecture. In this paper, we propose the design of a new cluster-based data warehouse system, Llama, a hybrid data management system which combines the features of row-wise and column-wise database systems. In Llama, columns are formed into correlation groups to provide the basis for the vertical partitioning of tables. Llama employs a distributed file system (DFS) to disseminate data among cluster nodes. Above the DFS, a MapReduce-based query engine is supported. We design a new join algorithm to facilitate fast join processing. We present a performance study on TPC-H dataset and compare Llama with Hive, a data warehouse infrastructure built on top of Hadoop. The experiment is conducted on EC2. The results show that Llama has an efficient load performance and its query performance is significantly better than the traditional MapReduce framework based on row-wise storage.
dc.language.isoen
dc.subjectcolumn store, MapReduce, join
dc.typeThesis
dc.contributor.departmentCOMPUTER SCIENCE
dc.contributor.supervisorOOI BENG CHIN
dc.description.degreeMaster's
dc.description.degreeconferredMASTER OF SCIENCE
dc.identifier.isiutNOT_IN_WOS
Appears in Collections:Master's Theses (Open)

Show simple item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
LinYT.pdf485.03 kBAdobe PDF

OPEN

NoneView/Download

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.