Please use this identifier to cite or link to this item: https://doi.org/10.1016/S0169-023X(99)00031-2
Title: Efficient indexing of high-dimensional data through dimensionality reduction
Authors: Goh, C.H. 
Lim, A.
Ooi, B.C. 
Tan, K.-L. 
Issue Date: 2000
Source: Goh, C.H.,Lim, A.,Ooi, B.C.,Tan, K.-L. (2000). Efficient indexing of high-dimensional data through dimensionality reduction. Data and Knowledge Engineering 32 (2) : 115-130. ScholarBank@NUS Repository. https://doi.org/10.1016/S0169-023X(99)00031-2
Abstract: The performance of the R-tree indexing method is known to deteriorate rapidly when the dimensionality of data increases. In this paper, we present a technique for dimensionality reduction by grouping d distinct attributes into k disjoint clusters and mapping each cluster to a linear space. The resulting k-dimensional space (which may be much smaller than d) can then be indexed using an R-tree efficiently. We present algorithms for decomposing a query region on the native d-dimensional space to corresponding query regions in the k-dimensional space, as well as search and update operations for the `dimensionally-reduced' R-tree. Experiments using real data sets for point, region, and OLAP queries were conducted. The results indicate that there is potential for significant performance gains over a naive strategy in which an R-tree index is created on the native d-dimensional space.
Source Title: Data and Knowledge Engineering
URI: http://scholarbank.nus.edu.sg/handle/10635/39124
ISSN: 0169023X
DOI: 10.1016/S0169-023X(99)00031-2
Appears in Collections:Staff Publications

Show full item record
Files in This Item:
There are no files associated with this item.

SCOPUSTM   
Citations

11
checked on Dec 13, 2017

WEB OF SCIENCETM
Citations

9
checked on Nov 3, 2017

Page view(s)

36
checked on Dec 16, 2017

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.