Please use this identifier to cite or link to this item: https://doi.org/10.1016/S0169-023X(01)00055-6
DC FieldValue
dc.titleJoin and multi-join processing in data integration systems
dc.contributor.authorTan, K.-L.
dc.contributor.authorKwang Eng, P.
dc.contributor.authorChin Ooi, B.
dc.contributor.authorZhang, M.
dc.date.accessioned2013-07-04T07:34:01Z
dc.date.available2013-07-04T07:34:01Z
dc.date.issued2002
dc.identifier.citationTan, K.-L., Kwang Eng, P., Chin Ooi, B., Zhang, M. (2002). Join and multi-join processing in data integration systems. Data and Knowledge Engineering 40 (2) : 217-239. ScholarBank@NUS Repository. https://doi.org/10.1016/S0169-023X(01)00055-6
dc.identifier.issn0169023X
dc.identifier.urihttp://scholarbank.nus.edu.sg/handle/10635/39103
dc.description.abstractQuery processing in a data integration system is complicated by a lack of quality statistics about the data, unpredictable and bursty data transfer rates, and slow or unavailable data sources. Conventional query processing algorithms, which are based on a blocking execution model, are no longer attractive because of their long initial response time. Moreover, the execution engine may be stalled by slow data delivery rates or unavailable data sources. In this paper, we adopt a non-blocking execution model for evaluating queries. We propose a symmetric partition-based join algorithm, called AJoin, that can operate with small memory requirement, produce first few answer tuples quickly, and blocks only when all available data have been examined. We also examine heuristics to manage the partitions and address the memory management issues of AJoin. To evaluate multi-join query plans, we also proposed two new strategies, m-AJoin and Pm-AJoin. Both strategies evaluate each join operation using AJoin. While m-AJoin accesses data from remote sources in its entirety, Pm-AJoin accesses remote data in chunks of smaller partitions. Our performance study shows the effectiveness of the proposed approaches for join and multi-join processing in a multi-user data integration system. © 2002 Elsevier Science B.V. All rights reserved.
dc.description.urihttp://libproxy1.nus.edu.sg/login?url=http://dx.doi.org/10.1016/S0169-023X(01)00055-6
dc.sourceScopus
dc.subjectBlocking execution model
dc.subjectData integration
dc.subjectInitial response time
dc.subjectMulti-join
dc.subjectQuery processing
dc.subjectSymmetric hash join
dc.typeArticle
dc.contributor.departmentCOMPUTER SCIENCE
dc.description.doi10.1016/S0169-023X(01)00055-6
dc.description.sourcetitleData and Knowledge Engineering
dc.description.volume40
dc.description.issue2
dc.description.page217-239
dc.description.codenDKENE
dc.identifier.isiut000173163900006
Appears in Collections:Staff Publications

Show simple item record
Files in This Item:
There are no files associated with this item.

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.