Please use this identifier to cite or link to this item: http://scholarbank.nus.edu.sg/handle/10635/40582
Title: Decomposability of translation metrics for improved evaluation and efficient algorithms
Authors: Chiang, D.
DeNeefe, S.
Chan, Y.S. 
Ng, H.T. 
Issue Date: 2008
Source: Chiang, D., DeNeefe, S., Chan, Y.S., Ng, H.T. (2008). Decomposability of translation metrics for improved evaluation and efficient algorithms. EMNLP 2008 - 2008 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference: A Meeting of SIGDAT, a Special Interest Group of the ACL: 610-619. ScholarBank@NUS Repository.
Abstract: BLEU is the de facto standard for evaluation and development of statistical machine translation systems. We describe three real-world situations involving comparisons between different versions of the same systems where one can obtain improvements in BLEU scores that are questionable or even absurd. These situations arise because BLEU lacks the property of decomposability, a property which is also computationally convenient for various applications. We propose a very conservative modification to BLEU and a cross between BLEU and word error rate that address these issues while improving correlation with human judgments. © 2008 Association for Computational Linguistics.
Source Title: EMNLP 2008 - 2008 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference: A Meeting of SIGDAT, a Special Interest Group of the ACL
URI: http://scholarbank.nus.edu.sg/handle/10635/40582
Appears in Collections: Staff Publications


