Please use this identifier to cite or link to this item:
https://doi.org/10.1109/CSNT.2013.129
Title: | Skewness and nearest neighbour based approach for historical document classification | Authors: | Kavitha, S. Shivakumara, P. Hemantha Kumar, G. |
Keywords: | Cursiveness Indus document Modifiers Nearest neighbour Skewness Straightness |
Issue Date: | 2013 | Citation: | Kavitha, S., Shivakumara, P., Hemantha Kumar, G. (2013). Skewness and nearest neighbour based approach for historical document classification. Proceedings - 2013 International Conference on Communication Systems and Network Technologies, CSNT 2013 : 602-606. ScholarBank@NUS Repository. https://doi.org/10.1109/CSNT.2013.129 | Abstract: | Classification of document is essential before feeding to OCR as there is no universal OCR which recognizes multiple scripts. Besides, classification of ancient historical documents such as Indus script is more challenging due to seal form inscribed on durable surfaces (stones) that does not have definite writing style. This result in characters may look different in different seals and nonuniform spacing between text lines. Therefore, in this paper, we propose two approaches, namely, Skewness based Approach (SA) for Indus document classification from English and South Indian scripts and Nearest Neighbour based Approach (NNA) for classification of English from South Indian scripts. The SA explores the fact that skewness between the components in the Indus document image with respect to x-axis is higher than skewness between the components in English and South Indian documents. The NNA identifies the presence or absence of modifiers which are common in South Indian document images and are not present in English document images to study the straightness and cursiveness of the components for classification. The method is evaluated on 600 different document images, which include 100 documents of each type. The comparative study with existing methods shows that the proposed method is superior to existing methods in terms of classification rate. © 2013 IEEE. | Source Title: | Proceedings - 2013 International Conference on Communication Systems and Network Technologies, CSNT 2013 | URI: | http://scholarbank.nus.edu.sg/handle/10635/78352 | DOI: | 10.1109/CSNT.2013.129 |
Appears in Collections: | Staff Publications |
Show full item record
Files in This Item:
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.