Please use this identifier to cite or link to this item:
Title: Document image retrieval through word shape coding
Authors: Lu, S.
Li, L. 
Tan, C.L. 
Keywords: Document image analysis
Document image retrieval
Word shape coding
Issue Date: 2008
Citation: Lu, S., Li, L., Tan, C.L. (2008). Document image retrieval through word shape coding. IEEE Transactions on Pattern Analysis and Machine Intelligence 30 (11) : 1913-1918. ScholarBank@NUS Repository.
Abstract: This paper presents a document retrieval technique that is capable of searching document images without OCR (optical character recognition). The proposed technique retrieves document images by a new word shape coding scheme, which captures the document content through annotating each word image by a word shape code. In particular, we annotate word images by using a set of topological shape features including character ascenders/descenders, character holes, and character water reservoirs. With the annotated word shape codes, document images can be retrieved by either query keywords or a query document image. Experimental results show that the proposed document image retrieval technique is fast, efficient, and tolerant to various types of document degradation. © 2008 IEEE.
Source Title: IEEE Transactions on Pattern Analysis and Machine Intelligence
ISSN: 01628828
DOI: 10.1109/TPAMI.2008.89
Appears in Collections:Staff Publications

Show full item record
Files in This Item:
There are no files associated with this item.

Google ScholarTM



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.