Please use this identifier to cite or link to this item: http://scholarbank.nus.edu.sg/handle/10635/129135
Title: WRITE-INTENSIVE DATA MANAGEMENT IN LOG-STRUCTURED STORAGE
Authors: WANG SHENG
Keywords: Storage, Indexing, Log-Structured, Write-Intensive, Observational Data, Intrinsic Clustering
Issue Date: 19-Apr-2016
Citation: WANG SHENG (2016-04-19). WRITE-INTENSIVE DATA MANAGEMENT IN LOG-STRUCTURED STORAGE. ScholarBank@NUS Repository.
Abstract: Due to the rapid development of information technologies, real-world workloads are becoming write-intensive and large-scale. In this thesis, we work towards designing solutions for managing write-intensive workloads with the adoption of log-structured techniques. We first propose a distributed log-structured storage, providing high write-throughput. It removes the write bottleneck by unifying data and log repositories, and supports fast failure recovery. Second, we design a novel indexing method on top of log-storage to support efficient range queries. It works well for observational data, which is write-intensive in nature. It utilizes intrinsic clustering property in data source, and gracefully reduces indexing overhead. Lastly, we provide an extended solution for indexing multi-dimensional observational data. It overcomes the data sparsity in multi-dimensional spaces, and minimizes space over-coverage introduced by conventional methods. We evaluate proposed approaches using real workloads, and observe that though our approaches are optimized for write throughput, they still preserve good read efficiency.
URI: http://scholarbank.nus.edu.sg/handle/10635/129135
Appears in Collections:Ph.D Theses (Open)

Show full item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
WangS.pdf2.33 MBAdobe PDF

OPEN

NoneView/Download

Page view(s)

118
checked on Nov 16, 2018

Download(s)

103
checked on Nov 16, 2018

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.