Please use this identifier to cite or link to this item:
https://scholarbank.nus.edu.sg/handle/10635/132711
Title: | Linearization of Zipfian distribution for Chinese characters |
Authors: | Teng Lua, Kim |
Issue Date: | 1992 |
Citation: | Teng Lua, Kim (1992). Linearization of Zipfian distribution for Chinese characters. Journal of information processing 15 (1) : 10-16. ScholarBank@NUS Repository. |
Abstract: | In this paper, we report the results of least-squares fits to four sets of data derived from Chinese characters, namely character strokes, radicals, characters and words. We have found that fitting with a power series, i.e. f^t versus R^t (f is the frequency of occurrence, R the rank and t a constant), is better than fitting with a logarithmic series derived from the original simple Zipf's law, i.e. fR = constant, or log f = c - log R. The dependency of f on R is found to be of order 5, as we have found that t = 0.2. We have also discovered a secondary dependency of f on R of lower order. This secondary dependency can be modeled using a cosine function. |
Source Title: | Journal of information processing |
URI: | http://scholarbank.nus.edu.sg/handle/10635/132711 |
ISSN: | 03876101 |
Appears in Collections: | Staff Publications |
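The two fits compared in the abstract can be illustrated with a minimal sketch. It rests on several assumptions that are not from the paper: the power form is read as a straight-line fit of f^t against R^t, the data are synthetic and follow simple Zipf's law exactly (fR = 1000), and the helper names `least_squares_line` and `r_squared` are hypothetical. It shows only the mechanics of the two linearizations, not the paper's results.

```python
import math


def least_squares_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def r_squared(xs, ys, slope, intercept):
    """Coefficient of determination of the fitted line."""
    mean_y = sum(ys) / len(ys)
    ss_res = sum((y - (slope * x + intercept)) ** 2 for x, y in zip(xs, ys))
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    return 1.0 - ss_res / ss_tot


# Synthetic data (assumption): exact simple Zipf's law, f * R = 1000.
ranks = list(range(1, 51))
freqs = [1000.0 / r for r in ranks]

# Logarithmic form from the abstract: log f = c - log R.
log_x = [math.log(r) for r in ranks]
log_y = [math.log(f) for f in freqs]
log_slope, log_c = least_squares_line(log_x, log_y)
r2_log = r_squared(log_x, log_y, log_slope, log_c)

# Power form (assumed reading): f^t against R^t with t = 0.2.
t = 0.2
pow_x = [r ** t for r in ranks]
pow_y = [f ** t for f in freqs]
pow_slope, pow_c = least_squares_line(pow_x, pow_y)
r2_pow = r_squared(pow_x, pow_y, pow_slope, pow_c)

print(f"log fit:   slope={log_slope:.4f}, c={log_c:.4f}, R^2={r2_log:.6f}")
print(f"power fit: slope={pow_slope:.4f}, intercept={pow_c:.4f}, R^2={r2_pow:.6f}")
```

On real frequency counts, the two R^2 values give one rough way to compare the linearizations; which form fits better depends on the data.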
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.