Please use this identifier to cite or link to this item: https://scholarbank.nus.edu.sg/handle/10635/27922
Title: Systems and Networking
Authors: HOU SONG
Keywords: Performance analysis, Hadoop, distributed computing, MapReduce, queuing model, parameter optimization
Issue Date: 22-Jun-2011
Citation: HOU SONG (2011-06-22). Systems and Networking. ScholarBank@NUS Repository.
Abstract: MapReduce is a more and more popular distributed computing framework, especially in largescale data analysis. Although it has been adopted in many places, a theoretical analysis of itsbehavior is lacking. This thesis introduces an analytical model for MapReduce with three parts,the average task performance, the random behavior and the waiting time. This model is thenvalidated using measured data from three categories of workloads. The model's usefulness isdemonstrated by three optimization processes, which give reasonable conclusions yet differentfrom current understandings.
URI: http://scholarbank.nus.edu.sg/handle/10635/27922
Appears in Collections:Master's Theses (Open)

Show full item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
HouS.pdf2.38 MBAdobe PDF

OPEN

NoneView/Download

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.