Please use this identifier to cite or link to this item:
https://scholarbank.nus.edu.sg/handle/10635/39953
Title: Improved statistical machine translation for resource-poor languages using related resource-rich languages
Authors: Nakov, P.; Ng, H.T.
Issue Date: 2009
Citation: Nakov, P., Ng, H.T. (2009). Improved statistical machine translation for resource-poor languages using related resource-rich languages. EMNLP 2009 - Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: A Meeting of SIGDAT, a Special Interest Group of ACL, Held in Conjunction with ACL-IJCNLP 2009: 1358-1367. ScholarBank@NUS Repository.
Abstract: We propose a novel language-independent approach for improving statistical machine translation for resource-poor languages by exploiting their similarity to resource-rich ones. More precisely, we improve the translation from a resource-poor source language X1 into a resource-rich language Y given a bi-text containing a limited number of parallel sentences for X1-Y and a larger bi-text for X2-Y for some resource-rich language X2 that is closely related to X1. The evaluation for Indonesian→English (using Malay) and Spanish→English (using Portuguese and pretending Spanish is resource-poor) shows an absolute gain of up to 1.35 and 3.37 Bleu points, respectively, which is an improvement over the rivaling approaches, while using much less additional data. © 2009 ACL and AFNLP.
Source Title: EMNLP 2009 - Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: A Meeting of SIGDAT, a Special Interest Group of ACL, Held in Conjunction with ACL-IJCNLP 2009
URI: http://scholarbank.nus.edu.sg/handle/10635/39953
Appears in Collections: Staff Publications
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.