Please use this identifier to cite or link to this item: http://scholarbank.nus.edu.sg/handle/10635/39953
Title: Improved statistical machine translation for resource-poor languages using related resource-rich languages
Authors: Nakov, P. 
Ng, H.T. 
Issue Date: 2009
Source: Nakov, P.,Ng, H.T. (2009). Improved statistical machine translation for resource-poor languages using related resource-rich languages. EMNLP 2009 - Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: A Meeting of SIGDAT, a Special Interest Group of ACL, Held in Conjunction with ACL-IJCNLP 2009 : 1358-1367. ScholarBank@NUS Repository.
Abstract: We propose a novel language-independent approach for improving statistical machine translation for resource-poor languages by exploiting their similarity to resource-rich ones. More precisely, we improve the translation from a resource-poor source language X 1 into a resource-rich language Y given a bi-text containing a limited number of parallel sentences for X 1-Y and a larger bi-text for X 2-Y for some resource-rich language X 2 that is closely related to X1. The evaluation for Indonesian→English (using Malay) and Spanish→English (using Portuguese and pretending Spanish is resource-poor) shows an absolute gain of up to 1.35 and 3.37 Bleu points, respectively, which is an improvement over the rivaling approaches, while using much less additional data. © 2009 ACL and AFNLP.
Source Title: EMNLP 2009 - Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: A Meeting of SIGDAT, a Special Interest Group of ACL, Held in Conjunction with ACL-IJCNLP 2009
URI: http://scholarbank.nus.edu.sg/handle/10635/39953
Appears in Collections:Staff Publications

Show full item record
Files in This Item:
There are no files associated with this item.

Page view(s)

59
checked on Dec 9, 2017

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.