Please use this identifier to cite or link to this item: https://scholarbank.nus.edu.sg/handle/10635/132711
Title: Linearization of zipfian distribution for chinese characters
Authors: Teng Lua, Kim 
Issue Date: 1992
Citation: Teng Lua, Kim (1992). Linearization of zipfian distribution for chinese characters. Journal of information processing 15 (1) : 10-16. ScholarBank@NUS Repository.
Abstract: In this paper, we report our results of least-square fittings to 4 sets of data derived from Chinese characters, namely, character strokes, radicals, characters and words. We have found that fitting using a power series, ie ft versus Rt (f is the frequency of occurrence, R the rank and t is constant) is better than the use of a logarithm series derived from the original simple Zipf's law, ei f R=constant, or log f=c-log R. The dependency of f versus R is found to be of order 5 as we have found that t=0.2. We have also discovered a secondary dependency of f on R of lower order. This secondary dependency can be modeled using a cosine function.
Source Title: Journal of information processing
URI: http://scholarbank.nus.edu.sg/handle/10635/132711
ISSN: 03876101
Appears in Collections:Staff Publications

Show full item record
Files in This Item:
There are no files associated with this item.

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.