Chinese Academy of Sciences Institutional Repositories Grid: Learning Confidence for Transformer-based Neural Machine Translation

Chinese Academy of Sciences Institutional Repositories Grid

Learning Confidence for Transformer-based Neural Machine Translation

Document type: Conference paper


Authors	Yu, Lu 1,3; Jiali, Zeng 2; Jiajun, Zhang 1,3; Shuangzhi, Wu 2; Mu, Li 2
Publication date	2022-05
Conference date	2022-05
Conference location	Online
Keywords	Neural machine translation
Pages	2353-2364
Abstract	Confidence estimation aims to quantify the confidence of the model prediction, providing an expectation of success. A well-calibrated confidence estimate enables accurate failure prediction and proper risk measurement when given noisy samples and out-of-distribution data in real-world settings. However, this task remains a severe challenge for neural machine translation (NMT), where probabilities from softmax distribution fail to describe when the model is probably mistaken. To address this problem, we propose an unsupervised confidence estimate learning jointly with the training of the NMT model. We explain confidence as how many hints the NMT model needs to make a correct prediction, and more hints indicate low confidence. Specifically, the NMT model is given the option to ask for hints to improve translation accuracy at the cost of some slight penalty. Then, we approximate its level of confidence by counting the number of hints the model uses. We demonstrate that our learned confidence estimate achieves high accuracy on extensive sentence/word-level quality estimation tasks. Analytical results verify that our confidence estimate can correctly assess underlying risk in two real-world scenarios: (1) discovering noisy samples and (2) detecting out-of-domain data. We further propose a novel confidence-based instance-specific label smoothing approach based on our learned confidence estimate, which outperforms standard label smoothing.
Language	English
Source URL	[http://ir.ia.ac.cn/handle/173211/51845]
Collection	National Laboratory of Pattern Recognition_Natural Language Processing
Corresponding author	Jiajun, Zhang
Author affiliations	1.National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences 2.Tencent Cloud Xiaowei 3.School of Artificial Intelligence, University of Chinese Academy of Sciences
Recommended citation (GB/T 7714)	Yu, Lu,Jiali, Zeng,Jiajun, Zhang,et al. Learning Confidence for Transformer-based Neural Machine Translation[C]. In:. Online. 2022-5.
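
The hint mechanism described in the abstract can be illustrated with a minimal sketch. This record does not give the paper's formulas, so everything below is an assumption made for illustration: the hint is modeled as blending the model's softmax output with the one-hot reference token, the penalty is taken to be the negative log of the confidence, and the names `hinted_loss`, `sentence_confidence`, and `penalty_weight` are hypothetical. The confidence value is assumed to be supplied by the model and to lie in (0, 1].

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def hinted_loss(logits, confidence, target, penalty_weight=0.1):
    """Token-level loss with hints (assumed formulation, not the paper's).

    confidence must be in (0, 1]; a lower value means more hint is used.
    """
    probs = softmax(logits)
    # Hint: mix the prediction with the one-hot reference distribution.
    blended = [
        confidence * p + (1.0 - confidence) * (1.0 if i == target else 0.0)
        for i, p in enumerate(probs)
    ]
    nll = -math.log(blended[target])   # translation loss after hinting
    penalty = -math.log(confidence)    # cost paid for asking for hints
    return nll + penalty_weight * penalty


def sentence_confidence(token_confidences):
    """Average token confidences into a sentence-level score (assumed)."""
    return sum(token_confidences) / len(token_confidences)


# The model prefers token 0, but the reference is token 1.
logits = [2.0, 0.0, 0.0]
no_hint = hinted_loss(logits, confidence=1.0, target=1)
with_hint = hinted_loss(logits, confidence=0.5, target=1)
print(with_hint < no_hint)
```

Under these assumptions, asking for hints lowers the total loss when the prediction is wrong, while a larger `penalty_weight` makes hints unattractive when the prediction is already correct. That trade-off is what lets the amount of hint used serve as a proxy for confidence.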

Ingest method: OAI harvesting

Source: Institute of Automation
