Please use this identifier to cite or link to this item: https://scholarbank.nus.edu.sg/handle/10635/138677
Title: LEXICAL CHAIN BASED DOCUMENT-LEVEL MACHINE TRANSLATION
Authors: DING YANG
Keywords: statistical machine translation, SMT, lexical chain, cohesion, document-level, larger context
Issue Date: 14-Jul-2017
Citation: DING YANG (2017-07-14). LEXICAL CHAIN BASED DOCUMENT-LEVEL MACHINE TRANSLATION. ScholarBank@NUS Repository.
Abstract: Most statistical machine translation (SMT) systems translate each source sentence independently without considering document-level context. As a result, it often leads to inconsistent or inappropriate word selections throughout the translation of the entire document. In the meanwhile, considering larger context often suffers from the high computational complexity and noisy information. We argue that an efficient and wise extraction of related information can help determine the choice of words in the target language. Inspired by previous approaches in document summarization, we propose to leverage lexical chains as skeletons of a document to identify related words and perform translation under the constraint of generated target lexical chains. Experimental results have shown that the proposed models based on lexical chains can substantially improve the translation quality in terms of BLEU on large-scale datasets.
URI: http://scholarbank.nus.edu.sg/handle/10635/138677
Appears in Collections:Master's Theses (Open)

Show full item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
ding.pdf1.08 MBAdobe PDF

OPEN

NoneView/Download

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.