Please use this identifier to cite or link to this item:
https://scholarbank.nus.edu.sg/handle/10635/48345
DC Field | Value | |
---|---|---|
dc.title | Scalable Data Analysis on MapReduce-based Systems | |
dc.contributor.author | WANG ZHENGKUI | |
dc.date.accessioned | 2013-11-30T18:12:13Z | |
dc.date.available | 2013-11-30T18:12:13Z | |
dc.date.issued | 2013-06-19 | |
dc.identifier.citation | WANG ZHENGKUI (2013-06-19). Scalable Data Analysis on MapReduce-based Systems. ScholarBank@NUS Repository. | |
dc.identifier.uri | http://scholarbank.nus.edu.sg/handle/10635/48345 | |
dc.description.abstract | Many of today's applications, such as scientific, financial and social networking applications, are generating and collecting data at an alarming rate. As the size of data grows, it becomes increasingly challenging to analyze these datasets. The high computation and I/O cost of processing large amount of data make it difficult for these applications to meet the performance demands of end-users. Meanwhile, the MapReduce framework has emerged as a powerful parallel computation paradigm for data processing on large-scale clusters. As such, there has been much effort in developing MapReduce-based algorithms to improve performance. However, there remain many challenges in exploiting MapReduce for efficient data analysis. Thus, in this thesis, we develop new scalable, efficient and practical parallel data processing algorithms, frameworks and systems for computation intensive analysis and data intensive analysis on MapReduce-based systems. Specially, we explore two extremely important and challenging analyses: combinatorial statistical analysis and online analytical processing (OLAP) cube analysis. The experimental results demonstrated the efficiency, effectiveness and scalability of our techniques. We believe that our research in this thesis brings us one step closer towards developing scalable and efficient big data analysis systems. | |
dc.language.iso | en | |
dc.subject | Scalable data analysis, MapReduce, OLAP and Data Warehousing, parallel statistics test, Graph OLAP, data cubes | |
dc.type | Thesis | |
dc.contributor.department | NUS GRAD SCH FOR INTEGRATIVE SCI & ENGG | |
dc.contributor.supervisor | TAN KIAN LEE | |
dc.description.degree | Ph.D | |
dc.description.degreeconferred | DOCTOR OF PHILOSOPHY | |
dc.identifier.isiut | NOT_IN_WOS | |
Appears in Collections: | Ph.D Theses (Open) |
Show simple item record
Files in This Item:
File | Description | Size | Format | Access Settings | Version | |
---|---|---|---|---|---|---|
wangzk.pdf | 16.2 MB | Adobe PDF | OPEN | None | View/Download |
Google ScholarTM
Check
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.