Please use this identifier to cite or link to this item:
https://doi.org/10.1145/1390334.1390496
Title: | A word shape coding method for camera-based document images | Authors: | Li, N. Tan, C.L. |
Keywords: | Document image retrieval Perspective distortion |
Issue Date: | 2008 | Citation: | Li, N.,Tan, C.L. (2008). A word shape coding method for camera-based document images. ACM SIGIR 2008 - 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Proceedings : 771-772. ScholarBank@NUS Repository. https://doi.org/10.1145/1390334.1390496 | Abstract: | This paper reports a word shape coding method to facilitate retrieval of camera-based document images without OCR. Due to perspective distortion, many reported word shape coding methods fail on camera-based images. In this paper, the problem is addressed by approximating the perspective transformation with an affine transformation, and employing an affine invariant, namely length ratio, to represent the connected components. Components in a document image are classified into a few clusters, each of which is assigned with a representative symbol. Retrieval are based on "words" comprising of symbols. The experiment results showed that the proposed method achieved an average retrieval precision of 93.43% and recall of 94.22%. | Source Title: | ACM SIGIR 2008 - 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Proceedings | URI: | http://scholarbank.nus.edu.sg/handle/10635/41622 | ISBN: | 9781605581644 | DOI: | 10.1145/1390334.1390496 |
Appears in Collections: | Staff Publications |
Show full item record
Files in This Item:
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.