Please use this identifier to cite or link to this item:
https://scholarbank.nus.edu.sg/handle/10635/78183
Title: | Improved combinatory categorial grammar induction with boundary words and bayesian inference | Authors: | Huang, Y. Zhang, M. Tan, C.L. |
Keywords: | Bayesian model Boundary words Combinatory categorial grammar Grammar induction |
Issue Date: | 2012 | Citation: | Huang, Y.,Zhang, M.,Tan, C.L. (2012). Improved combinatory categorial grammar induction with boundary words and bayesian inference. 24th International Conference on Computational Linguistics - Proceedings of COLING 2012: Technical Papers : 1257-1274. ScholarBank@NUS Repository. | Abstract: | Combinatory Categorial Grammar (CCG) is an expressive grammar formalism which is able to capture long-range dependencies. However, building large and wide-coverage treebanks for CCG is expensive and time-consuming. In this paper, we focus on the problem of unsupervised CCG induction from plain texts. Based on the baseline model in (Bisk and Hockenmaier, 2012), we propose following two improvements: (1) we utilize boundary part-of-speech (POS) tags to capture lexical information; (2) we perform nonparametric Bayesian inference based on the Pitman-Yor process to learn compact grammars. Experiments on English Penn treebank demonstrate the effectiveness of our boundary model and Bayesian learning. © 2012 The COLING. | Source Title: | 24th International Conference on Computational Linguistics - Proceedings of COLING 2012: Technical Papers | URI: | http://scholarbank.nus.edu.sg/handle/10635/78183 |
Appears in Collections: | Staff Publications |
Show full item record
Files in This Item:
There are no files associated with this item.
Google ScholarTM
Check
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.