Please use this identifier to cite or link to this item: https://scholarbank.nus.edu.sg/handle/10635/99592
DC Field: Value
dc.title: Similarity detection among data files - a machine learning approach
dc.contributor.author: Dash, M.
dc.contributor.author: Liu, H.
dc.date.accessioned: 2014-10-27T06:05:39Z
dc.date.available: 2014-10-27T06:05:39Z
dc.date.issued: 1997
dc.identifier.citation: Dash, M., Liu, H. (1997). Similarity detection among data files - a machine learning approach. Proceedings of the IEEE Knowledge & Data Engineering Exchange Workshop, KDEX: 172-179. ScholarBank@NUS Repository.
dc.identifier.uri: http://scholarbank.nus.edu.sg/handle/10635/99592
dc.description.abstract: In any database, description files are essential for understanding the data files in it. However, it is not uncommon to be left with data files that have no description file. One example is the aftermath of a system crash; other examples are related to security problems. Manually determining the subject of a data file can be a difficult and tedious task, particularly if the files look alike. An example is a big survey database in which data files that look alike are actually related to different subjects. Two data files on the same subject will probably have similar semantic structures of attributes. We first detect the similarity between two attributes. We then create clusters of attributes to compare the similarity of the subjects of two data files. Finally, a machine learning technique is used to predict the subject of unseen data files.
dc.source: Scopus
dc.type: Conference Paper
dc.contributor.department: INFORMATION SYSTEMS & COMPUTER SCIENCE
dc.description.sourcetitle: Proceedings of the IEEE Knowledge & Data Engineering Exchange Workshop, KDEX
dc.description.page: 172-179
dc.description.coden: 275
dc.identifier.isiut: NOT_IN_WOS
Appears in Collections: Staff Publications

Files in This Item:
There are no files associated with this item.

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.