Chinese Academy of Sciences Institutional Repositories Grid
Research for Uyghur-Chinese neural machine translation

Document type: Journal article

Authors: Kong, Jinying; Yang, Yating; Zhou, Xi; Wang, Lei; Li, Xiao
Journal: Lecture Notes in Computer Science
Publication date: 2016
Volume: 10102; Issue: 12; Pages: 141-152
ISSN: 0302-9743
DOI: 10.1007/978-3-319-50496-4_12
Abstract

The problem of rare and unknown words is an important issue in Uyghur-Chinese machine translation, especially when using a neural machine translation model. We propose a novel way to deal with rare and unknown words. Building on neural machine translation that uses pointers over the input sequence, our approach consists of a pre-processing step and a post-processing step and can be used with any neural machine translation model. The pre-processing step modifies the Uyghur-Chinese corpus to extend the ability of the pointer network, and the post-processing step retranslates the raw translation using a phrase-based machine translation model or a word list. Experiments show that a neural machine translation model using the proposed approach achieves a higher BLEU score than the phrase-based model in Uyghur-Chinese MT.

Source URL: [http://ir.xjipc.cas.cn/handle/365002/5784]
Collection: Xinjiang Technical Institute of Physics and Chemistry_Multilingual Information Technology Research Laboratory
Author affiliations: 1.Institute of Acoustics, Chinese Academy of Sciences, Beijing, China
2.100049, China
3.University of Chinese Academy of Sciences, Beijing
4.Xinjiang Laboratory of Minority Speech and Language Information Processing, Urumqi
5.830011, China
6.Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Urumqi
Recommended citation
GB/T 7714
Kong, Jinying, Yang, Yating, Zhou, Xi, et al. Research for Uyghur-Chinese neural machine translation[J]. Lecture Notes in Computer Science, 2016, 10102(12): 141-152.
APA Kong, Jinying, Yang, Yating, Zhou, Xi, Wang, Lei, & Li, Xiao. (2016). Research for Uyghur-Chinese neural machine translation. Lecture Notes in Computer Science, 10102(12), 141-152.
MLA Kong, Jinying, et al. "Research for Uyghur-Chinese neural machine translation". Lecture Notes in Computer Science 10102.12 (2016): 141-152.

Ingest method: OAI harvesting

Source: Xinjiang Technical Institute of Physics and Chemistry
