Scoring the data using association rules

Please use this identifier to cite or link to this item: https://doi.org/10.1023/A:1021931008240

DC Field	Value
dc.title	Scoring the data using association rules
dc.contributor.author	Liu, B.
dc.contributor.author	Ma, Y.
dc.contributor.author	Wong, C.K.
dc.contributor.author	Yu, P.S.
dc.date.accessioned	2013-07-04T07:34:36Z
dc.date.available	2013-07-04T07:34:36Z
dc.date.issued	2003
dc.identifier.citation	Liu, B., Ma, Y., Wong, C.K., Yu, P.S. (2003). Scoring the data using association rules. Applied Intelligence 18 (2) : 119-135. ScholarBank@NUS Repository. https://doi.org/10.1023/A:1021931008240
dc.identifier.issn	0924669X
dc.identifier.uri	http://scholarbank.nus.edu.sg/handle/10635/39129
dc.description.abstract	In many data mining applications, the objective is to select data cases of a target class. For example, in direct marketing, marketers want to select likely buyers of a particular product for promotion. In such applications, it is often too difficult to predict who will definitely be in the target class (e.g., the buyer class) because the data used for modeling is often very noisy and has a highly imbalanced class distribution. Traditionally, classification systems are used to solve this problem. Instead of classifying each data case to a definite class (e.g., buyer or non-buyer), a classification system is modified to produce a class probability estimate (or a score) for the data case to indicate the likelihood that the data case belongs to the target class (e.g., the buyer class). However, existing classification systems only aim to find a subset of the regularities or rules that exist in data. This subset of rules only gives a partial picture of the domain. In this paper, we show that the target selection problem can be mapped to association rule mining to provide a more powerful solution to the problem. Since association rule mining aims to find all rules in data, it is thus able to give a complete picture of the underlying relationships in the domain. The complete set of rules enables us to assign a more accurate class probability estimate to each data case. This paper proposes an effective and efficient technique to compute class probability estimates using association rules. Experiment results using public domain data and real-life application data show that in general the new technique performs markedly better than the state-of-the-art classification system C4.5, boosted C4.5, and the Naive Bayesian system.
dc.description.uri	http://libproxy1.nus.edu.sg/login?url=http://dx.doi.org/10.1023/A:1021931008240
dc.source	Scopus
dc.subject	Association rules
dc.subject	Classifications
dc.subject	Data mining
dc.subject	Scoring
dc.subject	Target selection
dc.type	Article
dc.contributor.department	COMPUTER SCIENCE
dc.description.doi	10.1023/A:1021931008240
dc.description.sourcetitle	Applied Intelligence
dc.description.volume	18
dc.description.issue	2
dc.description.page	119-135
dc.description.coden	APITE
dc.identifier.isiut	000180391800001
Appears in Collections:	Staff Publications

Show simple item record

Files in This Item:

There are no files associated with this item.

Google Scholar^TM

Check

Files in This Item:

Google ScholarTM

Altmetric

Google Scholar^TM