Acta Scientiarum Naturalium Universitatis Pekinensis ›› 2016, Vol. 52 ›› Issue (1): 127-133.DOI: 10.13209/j.0479-8023.2016.013
Previous Articles Next Articles
TANG Haiqing, XIONG Deyi
Received:
Online:
Published:
Contact:
唐海庆, 熊德意
通讯作者:
基金资助:
Abstract:
The limited semantic knowledge is used in the phrase-based statistical machine translation (SMT), which causes that the translation quality of long-distance verb and its object is low. A selectional preference based translation model is proposed, which inducts the semantic constraints that a verb imposes on its object to select the proper argument-head word for the predicate with long distance. The authors train the corpus to obtain the conditional probability based selectional preferences for verb, and integrate the selectional preferences into a phrase-based translation system and evaluate on a Chinese-to-English translation task with large-scale training data. Experiment results show that the integration of selectional preference into SMT can effectively capture the long-distance semantic dependencies and improve the translation quality.
Key words: semantic knowledge, selectional preference, semantic constraints, semantic dependencies
摘要:
针对基于短语的统计机器翻译使用有限的语义知识, 导致长距离的动宾短语对翻译质量不高的问题, 提出基于动词选择偏向性的翻译模型, 引入动词对宾语的语义约束信息, 为动词找到合适的宾语翻译。首先使用条件概率方法, 训练动词对宾语的选择偏向性, 然后将选择偏向性作为一个新特征, 集成到基于短语的翻译系统中。在大规模测试数据集上完成汉语到英语的翻译, 实验结果表明, 基于选择偏向性的翻译模型能够很好地捕获长距离的语义依赖关系, 从而提高译文质量。
关键词: 语义知识, 选择偏向性, 语义约束, 语义依赖
CLC Number:
TP391
TANG Haiqing, XIONG Deyi. A Selectional Preference Based Translation Model for SMT[J]. Acta Scientiarum Naturalium Universitatis Pekinensis, 2016, 52(1): 127-133.
唐海庆, 熊德意. 基于选择偏向性的统计机器翻译模型[J]. 北京大学学报(自然科学版), 2016, 52(1): 127-133.
Add to citation manager EndNote|Ris|BibTeX
URL: https://xbna.pku.edu.cn/EN/10.13209/j.0479-8023.2016.013
https://xbna.pku.edu.cn/EN/Y2016/V52/I1/127