Please use this identifier to cite or link to this item:
https://scholarbank.nus.edu.sg/handle/10635/129135
Title: | WRITE-INTENSIVE DATA MANAGEMENT IN LOG-STRUCTURED STORAGE | Authors: | WANG SHENG | Keywords: | Storage, Indexing, Log-Structured, Write-Intensive, Observational Data, Intrinsic Clustering | Issue Date: | 19-Apr-2016 | Citation: | WANG SHENG (2016-04-19). WRITE-INTENSIVE DATA MANAGEMENT IN LOG-STRUCTURED STORAGE. ScholarBank@NUS Repository. | Abstract: | Due to the rapid development of information technologies, real-world workloads are becoming write-intensive and large-scale. In this thesis, we work towards designing solutions for managing write-intensive workloads with the adoption of log-structured techniques. We first propose a distributed log-structured storage, providing high write-throughput. It removes the write bottleneck by unifying data and log repositories, and supports fast failure recovery. Second, we design a novel indexing method on top of log-storage to support efficient range queries. It works well for observational data, which is write-intensive in nature. It utilizes intrinsic clustering property in data source, and gracefully reduces indexing overhead. Lastly, we provide an extended solution for indexing multi-dimensional observational data. It overcomes the data sparsity in multi-dimensional spaces, and minimizes space over-coverage introduced by conventional methods. We evaluate proposed approaches using real workloads, and observe that though our approaches are optimized for write throughput, they still preserve good read efficiency. | URI: | http://scholarbank.nus.edu.sg/handle/10635/129135 |
Appears in Collections: | Ph.D Theses (Open) |
Show full item record
Files in This Item:
File | Description | Size | Format | Access Settings | Version | |
---|---|---|---|---|---|---|
WangS.pdf | 2.33 MB | Adobe PDF | OPEN | None | View/Download |
Google ScholarTM
Check
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.