Please use this identifier to cite or link to this item:
https://doi.org/10.1016/S0169-023X(99)00031-2
Title: | Efficient indexing of high-dimensional data through dimensionality reduction | Authors: | Goh, C.H. Lim, A. Ooi, B.C. Tan, K.-L. |
Issue Date: | 2000 | Citation: | Goh, C.H., Lim, A., Ooi, B.C., Tan, K.-L. (2000). Efficient indexing of high-dimensional data through dimensionality reduction. Data and Knowledge Engineering 32 (2) : 115-130. ScholarBank@NUS Repository. https://doi.org/10.1016/S0169-023X(99)00031-2 | Abstract: | The performance of the R-tree indexing method is known to deteriorate rapidly when the dimensionality of data increases. In this paper, we present a technique for dimensionality reduction by grouping d distinct attributes into k disjoint clusters and mapping each cluster to a linear space. The resulting k-dimensional space (which may be much smaller than d) can then be indexed using an R-tree efficiently. We present algorithms for decomposing a query region on the native d-dimensional space to corresponding query regions in the k-dimensional space, as well as search and update operations for the `dimensionally-reduced' R-tree. Experiments using real data sets for point, region, and OLAP queries were conducted. The results indicate that there is potential for significant performance gains over a naive strategy in which an R-tree index is created on the native d-dimensional space. | Source Title: | Data and Knowledge Engineering | URI: | http://scholarbank.nus.edu.sg/handle/10635/39124 | ISSN: | 0169023X | DOI: | 10.1016/S0169-023X(99)00031-2 |
Appears in Collections: | Staff Publications |
Show full item record
Files in This Item:
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.