Please use this identifier to cite or link to this item: https://scholarbank.nus.edu.sg/handle/10635/38920
DC FieldValue
dc.titleEfficient processing of distributed iceberg semi-joins
dc.contributor.authorImthiyaz, M.K.
dc.contributor.authorXiaoan, D.
dc.contributor.authorKalnis, P.
dc.date.accessioned2013-07-04T07:29:54Z
dc.date.available2013-07-04T07:29:54Z
dc.date.issued2004
dc.identifier.citationImthiyaz, M.K.,Xiaoan, D.,Kalnis, P. (2004). Efficient processing of distributed iceberg semi-joins. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 3180 : 634-643. ScholarBank@NUS Repository.
dc.identifier.issn03029743
dc.identifier.urihttp://scholarbank.nus.edu.sg/handle/10635/38920
dc.description.abstractThe Iceberg SemiJoin (ISJ) of two datasets R and S returns the tuples in R which join with at least k tuples of S. The ISJ operator is essential in many practical applications including OLAP, Data Mining and Information Retrieval. In this paper we consider the distributed evaluation of Iceberg SemiJoins, where R and S reside on remote servers. We developed an efficient algorithm which employs Bloom filters. The novelty of our approach is that we interleave the evaluation of the Iceberg set in server S with the pruning of unmatched tuples in server R. Therefore, we are able to (i) eliminate unnecessary tuples early, and (ii) extract accurate Bloom filters from the intermediate hash tables which are constructed during the generation of the Iceberg set. Compared to conventional two-phase approaches, our experiments demonstrate that our method transmits up to 80% less data through the network, while reducing the disk I/O cost. © Springer-Verlag Berlin Heidelberg 2004.
dc.sourceScopus
dc.typeArticle
dc.contributor.departmentCOMPUTER SCIENCE
dc.description.sourcetitleLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
dc.description.volume3180
dc.description.page634-643
dc.identifier.isiutNOT_IN_WOS
Appears in Collections:Staff Publications

Show simple item record
Files in This Item:
There are no files associated with this item.

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.