Title: Image-based document vectors for text retrieval
Authors: Yu, Z.
Tan, C.L.
Keywords: Document image
N-gram algorithm
Similarity measure
Text retrieval
Issue Date: 2000
Citation: Yu, Z., Tan, C.L. (2000). Image-based document vectors for text retrieval. Proceedings - International Conference on Pattern Recognition 15 (4): 393-396. ScholarBank@NUS Repository.
Abstract: We propose a method for constructing a vector that represents the content of a document image in order to facilitate text retrieval. The method builds on an N-gram algorithm that measures text similarity from the frequency of occurrence of n-character strings in the electronic text. Instead of ASCII values, the present study uses character images to obtain the document vector, and it reports promising results for use in our news article retrieval project. © 2000 IEEE.
Source Title: Proceedings - International Conference on Pattern Recognition
ISSN: 1051-4651
Appears in Collections: Staff Publications
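
Illustration: The abstract refers to an N-gram similarity measure built on the frequency of n-character strings in electronic text. The sketch below shows only that text-based baseline, not the image-based vectors the paper proposes. The function names, the default n = 3, the whitespace and case normalization, and the choice of cosine similarity are all assumptions made for illustration; the record does not state which measure the authors use.

```python
from collections import Counter
from math import sqrt


def ngram_vector(text: str, n: int = 3) -> Counter:
    """Count every run of n consecutive characters in the text.

    Lowercasing and whitespace collapsing are assumed preprocessing
    steps, not something the record specifies.
    """
    normalized = " ".join(text.lower().split())
    return Counter(normalized[i:i + n] for i in range(len(normalized) - n + 1))


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine of the angle between two n-gram frequency vectors.

    Cosine is an assumed choice of similarity measure. Returns 0.0
    when either vector is empty.
    """
    dot = sum(count * b[gram] for gram, count in a.items())
    norm_a = sqrt(sum(count * count for count in a.values()))
    norm_b = sqrt(sum(count * count for count in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


if __name__ == "__main__":
    query = ngram_vector("document image retrieval")
    article = ngram_vector("retrieval of document images")
    print(f"similarity: {cosine_similarity(query, article):.3f}")
```

Under these assumptions, two texts that share many character trigrams score close to 1, and texts with no trigram in common score 0.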
