Please use this identifier to cite or link to this item: https://scholarbank.nus.edu.sg/handle/10635/129092
DC FieldValue
dc.titleBuilding class-based language models with contextual statistics
dc.contributor.authorBai, Shuanghu
dc.contributor.authorLi, Haizhou
dc.contributor.authorLin, Zhiwei
dc.contributor.authorYuan, Baosheng
dc.date.accessioned2016-10-26T11:01:56Z
dc.date.available2016-10-26T11:01:56Z
dc.date.issued1998
dc.identifier.citationBai, Shuanghu, Li, Haizhou, Lin, Zhiwei, Yuan, Baosheng (1998). Building class-based language models with contextual statistics. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings 1 : 173-176. ScholarBank@NUS Repository.
dc.identifier.issn07367791
dc.identifier.urihttp://scholarbank.nus.edu.sg/handle/10635/129092
dc.description.abstractIn this paper, novel clustering algorithms are proposed by using the contextual statistics of words for class-based language models. The Minimum Discriminative Information (MDI) is used as a distance measure. Three algorithms are implemented to build bigram language models for a vocabulary of 50,000 words over a corpus of over 200 million words. The computational cost of algorithms and resulting LM perplexity are studied. The comparisons between the MDI algorithm and the Maximum Mutual Information algorithm are also given to demonstrate the effectiveness and the efficiency of the new algorithms. It is shown that the MDI approaches make the tree-building clustering[2] possible with large vocabulary.
dc.sourceScopus
dc.typeConference Paper
dc.contributor.departmentELECTRICAL AND COMPUTER ENGINEERING
dc.contributor.departmentINSTITUTE OF SYSTEMS SCIENCE
dc.description.sourcetitleICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
dc.description.volume1
dc.description.page173-176
dc.description.codenIPROD
dc.identifier.isiutNOT_IN_WOS
Appears in Collections:Staff Publications

Show simple item record
Files in This Item:
There are no files associated with this item.

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.