Please use this identifier to cite or link to this item:
https://doi.org/10.1109/TPAMI.2008.89
Title: Document image retrieval through word shape coding
Authors: Lu, S.; Li, L.; Tan, C.L.
Keywords: Document image analysis; Document image retrieval; Word shape coding
Issue Date: 2008
Citation: Lu, S., Li, L., Tan, C.L. (2008). Document image retrieval through word shape coding. IEEE Transactions on Pattern Analysis and Machine Intelligence 30 (11): 1913-1918. ScholarBank@NUS Repository. https://doi.org/10.1109/TPAMI.2008.89
Abstract: This paper presents a document retrieval technique that is capable of searching document images without OCR (optical character recognition). The proposed technique retrieves document images by a new word shape coding scheme, which captures the document content through annotating each word image by a word shape code. In particular, we annotate word images by using a set of topological shape features including character ascenders/descenders, character holes, and character water reservoirs. With the annotated word shape codes, document images can be retrieved by either query keywords or a query document image. Experimental results show that the proposed document image retrieval technique is fast, efficient, and tolerant to various types of document degradation. © 2008 IEEE.
Source Title: IEEE Transactions on Pattern Analysis and Machine Intelligence
URI: http://scholarbank.nus.edu.sg/handle/10635/39670
ISSN: 0162-8828
DOI: 10.1109/TPAMI.2008.89
Appears in Collections: Staff Publications
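Illustrative sketch (not part of the record): the abstract's idea of annotating each word with a shape code and retrieving by keyword can be sketched roughly as below. This is a toy under stated assumptions, not the paper's coding scheme. The letter sets `ASCENDERS`, `DESCENDERS`, and `HOLES` are guesses for a typical lowercase Latin typeface. Codes are derived from plain text rather than from word images. Water reservoirs are omitted. All function names are hypothetical.

```python
# Toy illustration only: the feature sets below are assumptions for a typical
# lowercase Latin typeface, not the coding table used in the paper.
ASCENDERS = set("bdfhklt")
DESCENDERS = set("gjpqy")
HOLES = set("abdegopq")


def char_code(ch: str) -> str:
    """Encode one lowercase letter as an extent symbol plus an optional hole flag."""
    if ch in ASCENDERS:
        extent = "A"  # rises above the x-height
    elif ch in DESCENDERS:
        extent = "D"  # drops below the baseline
    else:
        extent = "x"  # stays within the x-height
    return extent + ("o" if ch in HOLES else "")


def word_shape_code(word: str) -> str:
    """Join per-character codes with '-', skipping non-letter characters."""
    return "-".join(char_code(c) for c in word.lower() if c.isalpha())


def build_index(docs: dict[str, str]) -> dict[str, set[str]]:
    """Map each word shape code to the ids of the documents that contain it."""
    index: dict[str, set[str]] = {}
    for doc_id, text in docs.items():
        for word in text.split():
            code = word_shape_code(word)
            if code:
                index.setdefault(code, set()).add(doc_id)
    return index


def search(index: dict[str, set[str]], keyword: str) -> set[str]:
    """Return ids of documents containing a word whose code matches the keyword's."""
    return index.get(word_shape_code(keyword), set())


docs = {"d1": "word shape coding", "d2": "a big bag"}
index = build_index(docs)
print(word_shape_code("shape"))  # x-A-xo-Do-xo
print(search(index, "shape"))    # {'d1'}
print(search(index, "dog"))      # {'d2'}: "dog" and "bag" share a code
```

In this toy, different words can collapse to the same code ("dog" and "bag"), so a match on the code is not proof of a match on the word.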