Discriminative induction of sub-tree alignment using limited labeled data | ScholarBank@NUS

Please use this identifier to cite or link to this item: https://scholarbank.nus.edu.sg/handle/10635/42137

Title:	Discriminative induction of sub-tree alignment using limited labeled data
Authors:	Sun, J.; Zhang, M.; Tan, C.L.
Issue Date:	2010
Citation:	Sun, J., Zhang, M., Tan, C.L. (2010). Discriminative induction of sub-tree alignment using limited labeled data. Coling 2010 - 23rd International Conference on Computational Linguistics, Proceedings of the Conference 2: 1047-1055. ScholarBank@NUS Repository.
Abstract:	We employ a Maximum Entropy model to conduct sub-tree alignment between bilingual phrasal structure trees. Various kinds of lexical and structural knowledge are explored to measure the syntactic similarity across Chinese-English bilingual tree pairs. In the experiments, we evaluate the sub-tree alignment using both a gold-standard treebank and an automatically parsed corpus with manually annotated sub-tree alignment. Compared with a heuristic similarity-based method, the proposed method significantly improves performance with only limited sub-tree-aligned data. To examine its effectiveness for multilingual applications, we further try different approaches to applying the sub-tree alignment in both phrase-based and syntax-based SMT systems. We then compare the performance with that of the widely used word alignment. Experimental results on benchmark data show that sub-tree alignment benefits both systems by relaxing the constraint of the word alignment.
Source Title:	Coling 2010 - 23rd International Conference on Computational Linguistics, Proceedings of the Conference
URI:	http://scholarbank.nus.edu.sg/handle/10635/42137
Appears in Collections:	Staff Publications
