Title: Page segmentation and text extraction from gray scale image in microfilm format
Authors: Yuan, Q.
Tan, C.L. 
Keywords: Edge detection
Gray scale image
Microfilm format
Page segmentation
Text extraction
Issue Date: 2001
Citation: Yuan, Q., Tan, C.L. (2001). Page segmentation and text extraction from gray scale image in microfilm format. Proceedings of SPIE - The International Society for Optical Engineering 4307: 323-332. ScholarBank@NUS Repository.
Abstract: The paper presents a system for separating textual regions from graphics regions and for locating text against a textured background. We present an edge-detection-based method for automatically locating text in noisy grayscale newspaper images in microfilm format. The algorithm first finds the edges of textual regions using the Canny edge detector. It then merges edges and uses edge features to perform block segmentation and classification. Finally, feature-aided connected component analysis groups homogeneous textual regions together within the scope of their bounding boxes. Introducing the TLC yields an efficient block segmentation with reduced memory size. The proposed method has been used to locate text in a group of newspaper images with multiple page layouts. Initial results are encouraging. We plan to expand the experimental data to over 300 microfilm images with different layout structures, and we anticipate promising results once the prototype algorithm is modified to make it more robust and suitable for different cases.
Source Title: Proceedings of SPIE - The International Society for Optical Engineering
ISSN: 0277-786X
DOI: 10.1117/12.410852
Appears in Collections: Staff Publications
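
The abstract outlines a pipeline of edge detection, edge merging, and connected component grouping within bounding boxes. The following is a minimal sketch of the last two ideas only, not the authors' implementation. It assumes a binary edge map is already available (for example, the output of a Canny detector), and the names `connected_components`, `boxes_close`, `merge_boxes`, and the `gap` threshold are hypothetical choices made for illustration. The text/graphics classification step, the edge features, and the TLC mentioned in the abstract are not modeled here.

```python
from collections import deque


def connected_components(edge_map):
    """Return bounding boxes (top, left, bottom, right) of the
    8-connected groups of nonzero pixels in a binary edge map."""
    rows, cols = len(edge_map), len(edge_map[0])
    seen = [[False] * cols for _ in range(rows)]
    boxes = []
    for r in range(rows):
        for c in range(cols):
            if not edge_map[r][c] or seen[r][c]:
                continue
            # Breadth-first flood fill from this unvisited edge pixel.
            seen[r][c] = True
            queue = deque([(r, c)])
            top, left, bottom, right = r, c, r, c
            while queue:
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < rows and 0 <= nx < cols
                                and edge_map[ny][nx] and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes


def boxes_close(a, b, gap):
    """True if boxes a and b overlap, or their nearest edges differ
    by at most `gap` in coordinates, on both axes."""
    return (a[0] - gap <= b[2] and b[0] - gap <= a[2]
            and a[1] - gap <= b[3] and b[1] - gap <= a[3])


def merge_boxes(boxes, gap):
    """Merge nearby boxes repeatedly until no close pair remains.
    `gap` is an assumed tuning parameter, not a value from the paper."""
    boxes = list(boxes)
    merged = True
    while merged:
        merged = False
        for i in range(len(boxes)):
            for j in range(i + 1, len(boxes)):
                if boxes_close(boxes[i], boxes[j], gap):
                    a, b = boxes[i], boxes[j]
                    boxes[i] = (min(a[0], b[0]), min(a[1], b[1]),
                                max(a[2], b[2]), max(a[3], b[3]))
                    del boxes[j]
                    merged = True
                    break
            if merged:
                break
    return boxes


# Toy edge map: two nearby "character" blobs and one distant blob.
edge_map = [
    [1, 1, 0, 1, 1, 0, 0, 0, 0, 0],
    [1, 1, 0, 1, 1, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0, 1, 1],
    [0, 0, 0, 0, 0, 0, 0, 0, 1, 1],
]

components = connected_components(edge_map)
blocks = merge_boxes(components, gap=2)
print(blocks)  # [(0, 0, 1, 4), (4, 8, 5, 9)]
```

In this toy input the two blobs on the top rows are close enough to merge into one block, while the blob at the lower right stays separate. In practice the edge map would come from a real edge detector and the merge rule would likely depend on additional features, which this sketch leaves out.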

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.