Fiducial line based skew estimation

Please use this identifier to cite or link to this item: https://doi.org/10.1016/j.patcog.2005.03.023

DC Field	Value
dc.title	Fiducial line based skew estimation
dc.contributor.author	Yuan, B.
dc.contributor.author	Tan, C.L.
dc.date.accessioned	2013-07-04T07:51:10Z
dc.date.available	2013-07-04T07:51:10Z
dc.date.issued	2005
dc.identifier.citation	Yuan, B., Tan, C.L. (2005). Fiducial line based skew estimation. Pattern Recognition 38 (12) : 2333-2350. ScholarBank@NUS Repository. https://doi.org/10.1016/j.patcog.2005.03.023
dc.identifier.issn	00313203
dc.identifier.uri	http://scholarbank.nus.edu.sg/handle/10635/39856
dc.description.abstract	Skew estimation for textual document images is a well-researched topic and numerals of methods have been reported in the literature. One of the major challenges is the presence of interfering non-textual objects of various types and quantities in the document images. Many existing methods require proper separation of the textual objects which are well aligned from the non-textual objects which are mostly nonaligned. Some comparative evaluation work on the existing methods chooses only the text zones of the test image database. Therefore, the object filtering or zoning stage is crucial to the skew detection stage. However, it is difficult if not impossible to design general-purpose filters that are able to discriminate noises from textual components. This paper presents a robust, general-purpose skew estimation method that does not need any filtering or zoning preprocessing. In fact, this method does apply filtering, but not on the input components at the beginning of the detection process, rather on the output spectrum at the end of the detection process. Therefore, the problem of finding a textual component filter has been transformed into finding a convolution filter on the output accumulator array. This method consists of three steps: (1) the calculation of the slopes of the virtual lines that pass through the centroids of all the unique pairs of the connected components in an image, and quantizes the arctangents of the slopes into a 1-D accumulator array that covers the range from -90° to +90°; (2) a special convolution on the resultant histogram, after which there remain only the prominent peaks that possibly correspond to the skew angles of the image; (3) the verification of the detection result. Its computational complexity and detection precision are uncoupled, unlike those projection-profile-based or Hough-transform-based methods whose speeds drop when higher precision is in demand. Speedup measures on the baseline implementation are also presented. The University of Washington English Document Image Database I (UWDB-I) contains a large number of scanned document images with significant amount of non-textual objects. Therefore, it is a good image database for evaluating the proposed method. © 2005 Pattern Recognition Society. Published by Elsevier Ltd. All rights reserved.
dc.description.uri	http://libproxy1.nus.edu.sg/login?url=http://dx.doi.org/10.1016/j.patcog.2005.03.023
dc.source	Scopus
dc.subject	Centroids
dc.subject	Component pairs
dc.subject	Fiducial lines
dc.subject	Noise immunity
dc.subject	Skew estimation
dc.subject	UWDB-I
dc.type	Article
dc.contributor.department	COMPUTER SCIENCE
dc.description.doi	10.1016/j.patcog.2005.03.023
dc.description.sourcetitle	Pattern Recognition
dc.description.volume	38
dc.description.issue	12
dc.description.page	2333-2350
dc.description.coden	PTNRA
dc.identifier.isiut	000232703000010
Appears in Collections:	Staff Publications

Show simple item record

Files in This Item:

There are no files associated with this item.

Google Scholar^TM

Check

Files in This Item:

Google ScholarTM

Altmetric

Google Scholar^TM