Please use this identifier to cite or link to this item:
|Title:||Self learning classification for degraded document images by sparse representation|
|Keywords:||Document Image Binarization|
Self Learning Classification
|Citation:||Su, B., Tian, S., Lu, S., Dinh, T.A., Tan, C.L. (2013). Self learning classification for degraded document images by sparse representation. Proceedings of the International Conference on Document Analysis and Recognition, ICDAR : 155-159. ScholarBank@NUS Repository. https://doi.org/10.1109/ICDAR.2013.38|
|Abstract:||Document Image Binarization is a technique to segment text out from the background region of a document image, which is a challenging task due to high intensity variations of the document foreground and background. Recently, a series of document image binarization contests (DIBCOs) had been held that have drawn great research interest in this area. Several document binarization techniques have been proposed and achieve great performance on the contest datasets. However, those proposed techniques may not perform well on all kinds of degraded document images because it is difficult to design a classification method that correctly models the non-uniform degraded document background and text foreground simultaneously. In this paper, we propose a self learning classification framework that combines binary outputs of different binarization methods. The proposed framework makes used of the sparse representation to re-classify the document pixels and produces a better binary results. The experimental results on the recent DIBCO contests show the great performance and robustness of our proposed framework on different kinds of degraded document images. © 2013 IEEE.|
|Source Title:||Proceedings of the International Conference on Document Analysis and Recognition, ICDAR|
|Appears in Collections:||Staff Publications|
Show full item record
Files in This Item:
There are no files associated with this item.
checked on Mar 20, 2019
WEB OF SCIENCETM
checked on Mar 5, 2019
checked on Feb 2, 2019
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.