Please use this identifier to cite or link to this item: http://scholarbank.nus.edu.sg/handle/10635/99152
Title: A data compression scheme for Chinese text files using Huffman coding and a two-level dictionary
Authors: Ong, G.H. 
Huang, S.Y.
Issue Date: May-1995
Source: Ong, G.H.,Huang, S.Y. (1995-05). A data compression scheme for Chinese text files using Huffman coding and a two-level dictionary. Information Sciences 84 (1-2) : 85-99. ScholarBank@NUS Repository.
Abstract: This paper presents a data compression scheme for Chinese text files. Due to the skewness of the distribution of Chinese ideograms, the Huffman coding method is adopted. By storing the frequencies of the encoding symbols rather than their Huffman codes in a dictionary, applying differential coding where it saves space, and structuring the dictionary in the Huffman coding scheme into a two-level dictionary structure, the algorithm produces significant improvement on the compression results. The proposed method is evaluated by comparing its performance with three well-known compression algorithms. This algorithm should also be applicable to other ideogram-based or oriental-language texts. Also, it has the potential to reduce the dictionary size in a bigram- or trigram-based semi-adaptive compression scheme for English texts. © 1995.
Source Title: Information Sciences
URI: http://scholarbank.nus.edu.sg/handle/10635/99152
ISSN: 00200255
Appears in Collections:Staff Publications

Show full item record
Files in This Item:
There are no files associated with this item.

Page view(s)

38
checked on Feb 16, 2018

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.