Please use this identifier to cite or link to this item:
Title: Text block segmentation using pyramid structure
Authors: Tan, C.L. 
Zhang, Z. 
Keywords: Document image analysis
Text segmentation
Issue Date: 2001
Citation: Tan, C.L.,Zhang, Z. (2001). Text block segmentation using pyramid structure. Proceedings of SPIE - The International Society for Optical Engineering 4307 : 297-306. ScholarBank@NUS Repository.
Abstract: Text block segmentation is necessary in document layout analysis. An algorithm and its implementation that segregates text block by block (a block is either a title or a paragraph) from the provided document, e.g. newspaper image, based on pyramid structure is described in this paper. The pyramid structure, which is amenable for parallel processing on output, is a multi-resolution image representation. The pyramid structure also simulates what the human eyes see the document from afar visualizing the block structure of the document. The block segmentation can identify the titles, and distinguish different paragraphs based on the indentation between them. Our implementation will be used in a news articles retrieval project.
Source Title: Proceedings of SPIE - The International Society for Optical Engineering
ISSN: 0277786X
DOI: 10.1117/12.410849
Appears in Collections:Staff Publications

Show full item record
Files in This Item:
There are no files associated with this item.


checked on Jul 9, 2019

Page view(s)

checked on Jul 5, 2019

Google ScholarTM



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.