Please use this identifier to cite or link to this item:
https://doi.org/10.1117/12.410852
Title: Page segmentation and text extraction from gray scale image in microfilm format
Authors: Yuan, Q.; Tan, C.L.
Keywords: Edge detection; Gray scale image; Microfilm format; Page segmentation; Text extraction
Issue Date: 2001
Citation: Yuan, Q., Tan, C.L. (2001). Page segmentation and text extraction from gray scale image in microfilm format. Proceedings of SPIE - The International Society for Optical Engineering 4307: 323-332. ScholarBank@NUS Repository. https://doi.org/10.1117/12.410852
Abstract: This paper describes a system that separates textual regions from graphics regions and locates text against textured backgrounds. We present an edge-detection-based method to automatically locate text in noisy grayscale newspaper images in microfilm format. The algorithm first finds the edges of textual regions using the Canny edge detector. It then merges edges and uses edge features to perform block segmentation and classification. Finally, feature-aided connected component analysis groups homogeneous textual regions together within the scope of their bounding boxes. Introducing the TLC yields an efficient block segmentation with reduced memory size. The proposed method has been used to locate text in a group of newspaper images with multiple page layouts. Initial results are encouraging. We plan to expand the experimental data to over 300 microfilm images with different layout structures, and we anticipate promising results after corresponding modifications to the prototype algorithm to make it more robust and suitable for different cases.
Source Title: Proceedings of SPIE - The International Society for Optical Engineering
URI: http://scholarbank.nus.edu.sg/handle/10635/41068
ISSN: 0277-786X
DOI: 10.1117/12.410852
Appears in Collections: Staff Publications
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.