Please use this identifier to cite or link to this item: https://scholarbank.nus.edu.sg/handle/10635/39962
DC FieldValue
dc.titleIntelliClean: A knowledge-based intelligent data cleaner
dc.contributor.authorLee, M.L.
dc.contributor.authorLing, T.W.
dc.contributor.authorLow, W.L.
dc.date.accessioned2013-07-04T07:53:33Z
dc.date.available2013-07-04T07:53:33Z
dc.date.issued2000
dc.identifier.citationLee, M.L.,Ling, T.W.,Low, W.L. (2000). IntelliClean: A knowledge-based intelligent data cleaner. Proceeding of the Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining : 290-294. ScholarBank@NUS Repository.
dc.identifier.isbn1581132336
dc.identifier.urihttp://scholarbank.nus.edu.sg/handle/10635/39962
dc.description.abstractExisting data cleaning methods work on the basis of computing the degree of similarity between nearby records in a sorted database. High recall is achieved by accepting records with low degrees of similarity as duplicates, at the cost of lower precision. High precision is achieved analogously at the cost of lower recall. This is the recall-precision dilemma. In this paper, we propose a generic knowledge-based framework for effective data cleaning that implements existing cleaning strategies and more. We develop a new method to compute transitive closure under uncertainty which handles the merging of groups of inexact duplicate records. Experimental results show that this framework can identify duplicates and anomalies with high recall and precision.
dc.sourceScopus
dc.typeConference Paper
dc.contributor.departmentCOMPUTER SCIENCE
dc.description.sourcetitleProceeding of the Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
dc.description.page290-294
dc.identifier.isiutNOT_IN_WOS
Appears in Collections:Staff Publications

Show simple item record
Files in This Item:
There are no files associated with this item.

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.