Please use this identifier to cite or link to this item: https://doi.org/10.1007/s10115-007-0111-5
DC FieldValue
dc.titleA new concise representation of frequent itemsets using generators and a positive border
dc.contributor.authorLiu, G.
dc.contributor.authorLi, J.
dc.contributor.authorWong, L.
dc.date.accessioned2013-07-04T07:46:54Z
dc.date.available2013-07-04T07:46:54Z
dc.date.issued2008
dc.identifier.citationLiu, G., Li, J., Wong, L. (2008). A new concise representation of frequent itemsets using generators and a positive border. Knowledge and Information Systems 17 (1) : 35-56. ScholarBank@NUS Repository. https://doi.org/10.1007/s10115-007-0111-5
dc.identifier.issn02191377
dc.identifier.urihttp://scholarbank.nus.edu.sg/handle/10635/39671
dc.description.abstractA complete set of frequent itemsets can get undesirably large due to redundancy when the minimum support threshold is low or when the database is dense. Several concise representations have been previously proposed to eliminate the redundancy. Generator based representations rely on a negative border to make the representation lossless. However, the number of itemsets on a negative border sometimes even exceeds the total number of frequent itemsets. In this paper, we propose to use a positive border together with frequent generators to form a lossless representation. A positive border is usually orders of magnitude smaller than its corresponding negative border. A set of frequent generators plus its positive border is always no larger than the corresponding complete set of frequent itemsets, thus it is a true concise representation. The generalized form of this representation is also proposed. We develop an efficient algorithm, called GrGrowth, to mine generators and positive borders as well as their generalizations. The GrGrowth algorithm uses the depth-first-search strategy to explore the search space, which is much more efficient than the breadth-first-search strategy adopted by most of the existing generator mining algorithms. Our experiment results show that the GrGrowth algorithm is significantly faster than level-wise algorithms for mining generator based representations, and is comparable to the state-of-the-art algorithms for mining frequent closed itemsets. © Springer-Verlag London Limited 2007.
dc.description.urihttp://libproxy1.nus.edu.sg/login?url=http://dx.doi.org/10.1007/s10115-007-0111-5
dc.sourceScopus
dc.subjectConcise representation
dc.subjectDatamining
dc.subjectFrequent itemset mining
dc.subjectGenerator
dc.subjectPositive border
dc.typeArticle
dc.contributor.departmentCOMPUTER SCIENCE
dc.description.doi10.1007/s10115-007-0111-5
dc.description.sourcetitleKnowledge and Information Systems
dc.description.volume17
dc.description.issue1
dc.description.page35-56
dc.identifier.isiut000259960200003
Appears in Collections:Staff Publications

Show simple item record
Files in This Item:
There are no files associated with this item.

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.