Please use this identifier to cite or link to this item:
|Title:||Linearization of zipfian distribution for chinese characters||Authors:||Teng Lua, Kim||Issue Date:||1992||Citation:||Teng Lua, Kim (1992). Linearization of zipfian distribution for chinese characters. Journal of information processing 15 (1) : 10-16. ScholarBank@NUS Repository.||Abstract:||In this paper, we report our results of least-square fittings to 4 sets of data derived from Chinese characters, namely, character strokes, radicals, characters and words. We have found that fitting using a power series, ie ft versus Rt (f is the frequency of occurrence, R the rank and t is constant) is better than the use of a logarithm series derived from the original simple Zipf's law, ei f R=constant, or log f=c-log R. The dependency of f versus R is found to be of order 5 as we have found that t=0.2. We have also discovered a secondary dependency of f on R of lower order. This secondary dependency can be modeled using a cosine function.||Source Title:||Journal of information processing||URI:||http://scholarbank.nus.edu.sg/handle/10635/132711||ISSN:||03876101|
|Appears in Collections:||Staff Publications|
Show full item record
Files in This Item:
There are no files associated with this item.
checked on Nov 8, 2019
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.