Publication

Automatic detection of document script and orientation

Citations
Altmetric:
Alternative Title
Abstract
This paper presents an identification technique that automatically detects the underlying script and orientation of scanned document images. In the proposed technique, document script and orientation are identified by using the stroke density and distribution, which convert each document image into a document vector. For each script at each orientation, a number of reference document vectors are first constructed. Script and orientation of the query document are then determined according to the similarity between the query document vector and multiple preconstructed reference document vectors by using the Knearest neighbor algorithm. Experiments show that the proposed technique is tolerant to the document skew and able to detect orientations of documents of different scripts. © 2007 IEEE.
Keywords
Source Title
Proceedings of the International Conference on Document Analysis and Recognition, ICDAR
Publisher
Series/Report No.
Organizational Units
Organizational Unit
Rights
Date
2007
DOI
10.1109/ICDAR.2007.4378711
Type
Conference Paper
Related Datasets
Related Publications