Please use this identifier to cite or link to this item:
Title: Document image retrieval through word shape coding
Authors: Lu, S.
Li, L. 
Tan, C.L. 
Keywords: Document image analysis
Document image retrieval
Word shape coding
Issue Date: 2008
Citation: Lu, S., Li, L., Tan, C.L. (2008). Document image retrieval through word shape coding. IEEE Transactions on Pattern Analysis and Machine Intelligence 30 (11) : 1913-1918. ScholarBank@NUS Repository.
Abstract: This paper presents a document retrieval technique that is capable of searching document images without OCR (optical character recognition). The proposed technique retrieves document images by a new word shape coding scheme, which captures the document content through annotating each word image by a word shape code. In particular, we annotate word images by using a set of topological shape features including character ascenders/descenders, character holes, and character water reservoirs. With the annotated word shape codes, document images can be retrieved by either query keywords or a query document image. Experimental results show that the proposed document image retrieval technique is fast, efficient, and tolerant to various types of document degradation. © 2008 IEEE.
Source Title: IEEE Transactions on Pattern Analysis and Machine Intelligence
ISSN: 01628828
DOI: 10.1109/TPAMI.2008.89
Appears in Collections:Staff Publications

Show full item record
Files in This Item:
There are no files associated with this item.


checked on Feb 21, 2019


checked on Feb 13, 2019

Page view(s)

checked on Feb 9, 2019

Google ScholarTM



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.