Please use this identifier to cite or link to this item:
Title: Systems and Networking
Authors: HOU SONG
Keywords: Performance analysis, Hadoop, distributed computing, MapReduce, queuing model, parameter optimization
Issue Date: 22-Jun-2011
Citation: HOU SONG (2011-06-22). Systems and Networking. ScholarBank@NUS Repository.
Abstract: MapReduce is a more and more popular distributed computing framework, especially in largescale data analysis. Although it has been adopted in many places, a theoretical analysis of itsbehavior is lacking. This thesis introduces an analytical model for MapReduce with three parts,the average task performance, the random behavior and the waiting time. This model is thenvalidated using measured data from three categories of workloads. The model's usefulness isdemonstrated by three optimization processes, which give reasonable conclusions yet differentfrom current understandings.
Appears in Collections:Master's Theses (Open)

Show full item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
HouS.pdf2.38 MBAdobe PDF



Page view(s)

checked on May 23, 2019


checked on May 23, 2019

Google ScholarTM


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.