Efficient mobile phone Chinese optical character recognition systems by use of heuristic fuzzy rules and bigram Markov language models | ScholarBank@NUS

Please use this identifier to cite or link to this item: https://doi.org/10.1016/j.asoc.2007.02.013

Title:	Efficient mobile phone Chinese optical character recognition systems by use of heuristic fuzzy rules and bigram Markov language models
Authors:	Cheok, A.D.; Jian, Z.; Chng, E.S.
Keywords:	Fuzzy logic; Heuristic; Markov model; Statistical language model
Issue Date:	Mar-2008
Citation:	Cheok, A.D., Jian, Z., Chng, E.S. (2008-03). Efficient mobile phone Chinese optical character recognition systems by use of heuristic fuzzy rules and bigram Markov language models. Applied Soft Computing Journal 8 (2) : 1005-1017. ScholarBank@NUS Repository. https://doi.org/10.1016/j.asoc.2007.02.013
Abstract:	Statistical language models are useful tools for improving the recognition accuracy of optical character recognition (OCR) systems. Previous systems have used segmentation by maximum word matching, semantic class segmentation, or trigram language models. However, these methods have disadvantages: inaccuracies due to a preference for longer words (which may be erroneous), failure to recognize word dependencies, complex semantic segmentation of training data, and high memory requirements. To overcome these problems, we propose a novel bigram Markov language model. This type of model has no preference for longer words and does not require semantically segmented training data. Furthermore, unlike trigram models, its memory requirement is small. The scheme is therefore suitable for handheld and pocket computers, which are expected to be a major future application of text recognition systems. However, because the language model is simple, the bigram Markov model alone can introduce more errors. Hence, this paper describes a novel algorithm that combines bigram Markov language models with heuristic fuzzy rules. The algorithm improves recognition accuracy and is well suited to mobile and pocket computer applications; as the experimental results show, it can run on mobile phones. The main contribution of this paper is to show how fuzzy techniques, in the form of linguistic rules, can enhance the accuracy of a crisp recognition system while keeping computational complexity low. © 2007 Elsevier B.V. All rights reserved.
Source Title:	Applied Soft Computing Journal
URI:	http://scholarbank.nus.edu.sg/handle/10635/55799
ISSN:	1568-4946
DOI:	10.1016/j.asoc.2007.02.013
Appears in Collections:	Staff Publications
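
Illustrative note: the abstract describes using a bigram Markov language model to choose among OCR character candidates. The sketch below shows that general idea only and is not the authors' algorithm. It assumes character-level bigrams, add-one smoothing, and a Viterbi search over per-position candidate lists. The function names (`train_bigram`, `bigram_logprob`, `decode`), the toy corpus, and the OCR scores are all hypothetical. The heuristic fuzzy rules are not sketched, because the abstract does not specify them.

```python
import math
from collections import Counter

START = "<s>"  # hypothetical sentence-start marker


def train_bigram(corpus):
    """Count character bigrams and their left contexts in an iterable of strings."""
    contexts, bigrams, vocab = Counter(), Counter(), set()
    for sentence in corpus:
        chars = [START] + list(sentence)
        vocab.update(chars[1:])
        contexts.update(chars[:-1])            # every character that has a successor
        bigrams.update(zip(chars, chars[1:]))  # adjacent character pairs
    return contexts, bigrams, len(vocab)


def bigram_logprob(prev, cur, contexts, bigrams, vocab_size):
    """Add-one smoothed log P(cur | prev); smoothing is an assumption of this sketch."""
    return math.log((bigrams[(prev, cur)] + 1) / (contexts[prev] + vocab_size))


def decode(candidates, contexts, bigrams, vocab_size):
    """Viterbi search over OCR candidates.

    candidates: one list per character position, each holding
    (character, ocr_score) pairs with ocr_score in (0, 1].
    Returns the string maximizing the sum of log OCR scores
    and log bigram probabilities.
    """
    # best[ch] = (log score of the best path ending in ch, that path)
    best = {START: (0.0, [])}
    for position in candidates:
        nxt = {}
        for ch, ocr_score in position:
            options = [
                (
                    score
                    + bigram_logprob(prev, ch, contexts, bigrams, vocab_size)
                    + math.log(ocr_score),
                    path + [ch],
                )
                for prev, (score, path) in best.items()
            ]
            nxt[ch] = max(options, key=lambda item: item[0])
        best = nxt
    return "".join(max(best.values(), key=lambda item: item[0])[1])


# Toy data: the OCR engine slightly prefers the wrong first character.
corpus = ["我们是学生", "我们去学校"]
contexts, bigrams, vocab_size = train_bigram(corpus)
candidates = [
    [("找", 0.6), ("我", 0.4)],
    [("们", 0.9), ("门", 0.1)],
]
result = decode(candidates, contexts, bigrams, vocab_size)
ocr_only = "".join(max(pos, key=lambda c: c[1])[0] for pos in candidates)
print(ocr_only, result)
```

In this toy case the per-character OCR scores alone favor a visually similar but wrong first character, while the bigram counts pull the decoded string toward the sequence seen in the training text. A bigram table needs one entry per observed character pair, which is the memory advantage over trigram models that the abstract mentions.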

