|Title:||Linearization of Zipfian distribution for Chinese characters|
|Authors:||Teng Lua, Kim|
|Citation:||Teng Lua, Kim (1992). Linearization of Zipfian distribution for Chinese characters. Journal of Information Processing 15 (1) : 10-16. ScholarBank@NUS Repository.|
|Abstract:||In this paper, we report the results of least-squares fits to four sets of data derived from Chinese characters, namely character strokes, radicals, characters and words. We find that fitting with a power series, i.e. f^t versus R^t (where f is the frequency of occurrence, R is the rank and t is a constant), is better than fitting with a logarithmic series derived from the original simple Zipf's law, i.e. fR = constant, or log f = c - log R. The dependency of f on R is found to be of order 5, since we find that t = 0.2. We have also discovered a secondary dependency of f on R of lower order, which can be modeled using a cosine function.|
|Source Title:||Journal of Information Processing|
|Appears in Collections:||Staff Publications|
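
The comparison described in the abstract can be illustrated with a minimal sketch. It assumes the power-series fit is an ordinary least-squares line in the transformed coordinates, f^t = a + b R^t with t = 0.2, and that the logarithmic fit is a line in log f versus log R; the abstract does not spell out either form in detail. The function names are hypothetical and the data are synthetic, generated only to exercise the code. They are not the paper's data or results.

```python
import math


def linear_fit(xs, ys):
    """Ordinary least-squares line y = a + b*x; returns (a, b, r_squared)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    b = sxy / sxx
    a = mean_y - b * mean_x
    ss_res = sum((y - (a + b * x)) ** 2 for x, y in zip(xs, ys))
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    return a, b, 1.0 - ss_res / ss_tot


def power_fit(ranks, freqs, t=0.2):
    """Fit f^t = a + b * R^t (assumed form of the power-series fit)."""
    return linear_fit([r ** t for r in ranks], [f ** t for f in freqs])


def log_fit(ranks, freqs):
    """Fit log f = c + m * log R (simple Zipf's law corresponds to m = -1)."""
    return linear_fit([math.log(r) for r in ranks],
                      [math.log(f) for f in freqs])


# Synthetic rank-frequency data (an assumption for illustration only):
# constructed so that f^0.2 is exactly linear in R^0.2.
ranks = list(range(1, 201))
freqs = [(10.0 - 2.0 * r ** 0.2) ** 5 for r in ranks]

a, b, r2_power = power_fit(ranks, freqs, t=0.2)
c, m, r2_log = log_fit(ranks, freqs)

print(f"power fit: a={a:.4f}, b={b:.4f}, R^2={r2_power:.6f}")
print(f"log fit:   c={c:.4f}, m={m:.4f}, R^2={r2_log:.6f}")
```

Because the synthetic data are built from the power form, the power fit is favored here by construction; on real frequency counts the two coefficients of determination would have to be compared empirically.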