Please use this identifier to cite or link to this item: https://doi.org/10.1145/1255175.1255213
DC FieldValue
dc.titleAdaptive sorted neighborhood methods for efficient record linkage
dc.contributor.authorYan, S.
dc.contributor.authorLee, D.
dc.contributor.authorKan, M.-Y.
dc.contributor.authorGiles, L.C.
dc.date.accessioned2013-07-04T08:05:50Z
dc.date.available2013-07-04T08:05:50Z
dc.date.issued2007
dc.identifier.citationYan, S., Lee, D., Kan, M.-Y., Giles, L.C. (2007). Adaptive sorted neighborhood methods for efficient record linkage. Proceedings of the ACM International Conference on Digital Libraries : 185-194. ScholarBank@NUS Repository. https://doi.org/10.1145/1255175.1255213
dc.identifier.isbn1595936440
dc.identifier.urihttp://scholarbank.nus.edu.sg/handle/10635/40504
dc.description.abstractTraditionally, record linkage algorithms have played an important role in maintaining digital libraries - i.e., identifying matching citations or authors for consolidation in updating or integrating digital libraries. As such, a variety of record linkage algorithms have been developed and deployed successfully. Often, however, existing solutions have a set of parameters whose values are set by human experts off-lineand are fixed during the execution. Since finding the ideal values of such parameters is not straightforward, or no such single ideal value even exists, the applicability of existing solutions to new scenarios or domains is greatly hampered. To remedy this problem, we argue that one can achieve significant improvement by adaptively and dynamically changing such parameters of record linkage algorithms. To validate our hypothesis, we take a classical record linkage algorithm, the sorted neighborhood method (SNM), and demonstrate how we can achieve improved accuracy and performance by adaptively changing its fixed sliding window size. Our claim is analytically and empirically validated using both real and synthetic data sets of digital libraries and other domains. Copyright 2007 ACM.
dc.description.urihttp://libproxy1.nus.edu.sg/login?url=http://dx.doi.org/10.1145/1255175.1255213
dc.sourceScopus
dc.subjectCitation matching
dc.subjectEntity resolution
dc.subjectRecord linkage
dc.subjectSorted neighborhood
dc.typeConference Paper
dc.contributor.departmentCOMPUTER SCIENCE
dc.description.doi10.1145/1255175.1255213
dc.description.sourcetitleProceedings of the ACM International Conference on Digital Libraries
dc.description.page185-194
dc.identifier.isiut000266062800029
Appears in Collections:Staff Publications

Show simple item record
Files in This Item:
There are no files associated with this item.

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.