Please use this identifier to cite or link to this item: https://scholarbank.nus.edu.sg/handle/10635/48345
DC FieldValue
dc.titleScalable Data Analysis on MapReduce-based Systems
dc.contributor.authorWANG ZHENGKUI
dc.date.accessioned2013-11-30T18:12:13Z
dc.date.available2013-11-30T18:12:13Z
dc.date.issued2013-06-19
dc.identifier.citationWANG ZHENGKUI (2013-06-19). Scalable Data Analysis on MapReduce-based Systems. ScholarBank@NUS Repository.
dc.identifier.urihttp://scholarbank.nus.edu.sg/handle/10635/48345
dc.description.abstractMany of today's applications, such as scientific, financial and social networking applications, are generating and collecting data at an alarming rate. As the size of data grows, it becomes increasingly challenging to analyze these datasets. The high computation and I/O cost of processing large amount of data make it difficult for these applications to meet the performance demands of end-users. Meanwhile, the MapReduce framework has emerged as a powerful parallel computation paradigm for data processing on large-scale clusters. As such, there has been much effort in developing MapReduce-based algorithms to improve performance. However, there remain many challenges in exploiting MapReduce for efficient data analysis. Thus, in this thesis, we develop new scalable, efficient and practical parallel data processing algorithms, frameworks and systems for computation intensive analysis and data intensive analysis on MapReduce-based systems. Specially, we explore two extremely important and challenging analyses: combinatorial statistical analysis and online analytical processing (OLAP) cube analysis. The experimental results demonstrated the efficiency, effectiveness and scalability of our techniques. We believe that our research in this thesis brings us one step closer towards developing scalable and efficient big data analysis systems.
dc.language.isoen
dc.subjectScalable data analysis, MapReduce, OLAP and Data Warehousing, parallel statistics test, Graph OLAP, data cubes
dc.typeThesis
dc.contributor.departmentNUS GRAD SCH FOR INTEGRATIVE SCI & ENGG
dc.contributor.supervisorTAN KIAN LEE
dc.description.degreePh.D
dc.description.degreeconferredDOCTOR OF PHILOSOPHY
dc.identifier.isiutNOT_IN_WOS
Appears in Collections:Ph.D Theses (Open)

Show simple item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
wangzk.pdf16.2 MBAdobe PDF

OPEN

NoneView/Download

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.