Please use this identifier to cite or link to this item:
https://scholarbank.nus.edu.sg/handle/10635/78356
Title: | Source language adaptation for resource-poor machine translation | Authors: | Wang, P. Nakov, P. Ng, H.T. |
Issue Date: | 2012 | Citation: | Wang, P.,Nakov, P.,Ng, H.T. (2012). Source language adaptation for resource-poor machine translation. EMNLP-CoNLL 2012 - 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the Conference : 286-296. ScholarBank@NUS Repository. | Abstract: | We propose a novel, language-independent approach for improving machine translation from a resource-poor language to X by adapting a large bi-text for a related resource-rich language and X (the same target language). We assume a small bi-text for the resource-poor language to X pair, which we use to learn word-level and phrase-level paraphrases and cross-lingual morphological variants between the resource-rich and the resource-poor language; we then adapt the former to get closer to the latter. Our experiments for Indonesian/Malay-English translation show that using the large adapted resource-rich bi-text yields 6.7 BLEU points of improvement over the unadapted one and 2.6 BLEU points over the original small bi-text. Moreover, combining the small bi-text with the adapted bi-text outperforms the corresponding combinations with the unadapted bi-text by 1.5-3 BLEU points. We also demonstrate applicability to other languages and domains. © 2012 Association for Computational Linguistics. | Source Title: | EMNLP-CoNLL 2012 - 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the Conference | URI: | http://scholarbank.nus.edu.sg/handle/10635/78356 | ISBN: | 9781937284435 |
Appears in Collections: | Staff Publications |
Show full item record
Files in This Item:
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.