Toward efficient multifeature query processing

Please use this identifier to cite or link to this item: https://doi.org/10.1109/TKDE.2006.51

DC Field	Value
dc.title	Toward efficient multifeature query processing
dc.contributor.author	Jagadish, H.V.
dc.contributor.author	Ooi, B.C.
dc.contributor.author	Shen, H.T.
dc.contributor.author	Tan, K.-L.
dc.date.accessioned	2013-07-04T07:31:15Z
dc.date.available	2013-07-04T07:31:15Z
dc.date.issued	2006
dc.identifier.citation	Jagadish, H.V., Ooi, B.C., Shen, H.T., Tan, K.-L. (2006). Toward efficient multifeature query processing. IEEE Transactions on Knowledge and Data Engineering 18 (3) : 350-361. ScholarBank@NUS Repository. https://doi.org/10.1109/TKDE.2006.51
dc.identifier.issn	10414347
dc.identifier.uri	http://scholarbank.nus.edu.sg/handle/10635/38981
dc.description.abstract	In many advanced applications, data are described by multiple high-dimensional features. Moreover, different queries may weight these features differently; some may not even specify all the features. In this paper, we propose our solution to support efficient query processing in these applications. We devise a novel representation that compactly captures f features into two components: The first component is a 2D vector that reflects a distance range (minimum and maximum values) of the f features with respect to a reference point (the center of the space) in a metric space and the second component is a bit signature, with two bits per dimension, obtained by analyzing each feature's descending energy histogram. This representation enables two levels of filtering: The first component prunes away points that do not share similar distance ranges, while the bit signature filters away points based on the dimensions of the relevant features. Moreover, the representation facilitates the use of a single index structure to further speed up processing. We employ the classical B+-tree for this purpose. We also propose a KNN search algorithm that exploits the access orders of critical dimensions of highly selective features and partial distances to prune the search space more effectively. Our extensive experiments on both real-life and synthetic data sets show that the proposed solution offers significant performance advantages over sequential scan and retrieval methods using single and multiple VA-files. © 2006 IEEE.
dc.description.uri	http://libproxy1.nus.edu.sg/login?url=http://dx.doi.org/10.1109/TKDE.2006.51
dc.source	Scopus
dc.subject	High-dimensional
dc.subject	Indexing
dc.subject	Multifeature
dc.subject	Query processing
dc.subject	Weighted query
dc.type	Article
dc.contributor.department	COMPUTER SCIENCE
dc.description.doi	10.1109/TKDE.2006.51
dc.description.sourcetitle	IEEE Transactions on Knowledge and Data Engineering
dc.description.volume	18
dc.description.issue	3
dc.description.page	350-361
dc.description.coden	ITKEE
dc.identifier.isiut	000234675800005
Appears in Collections:	Staff Publications

Files in This Item:

There are no files associated with this item.