中国科学院机构知识库网格系统: Non-Autoregressive Image Captioning with Counterfactuals-Critical Multi-Agent Learning

中国科学院机构知识库网格

Chinese Academy of Sciences Institutional Repositories Grid

Non-Autoregressive Image Captioning with Counterfactuals-Critical Multi-Agent Learning

文献类型：会议论文


作者	Guo LT(郭龙腾)1,2 ; Liu J(刘静)2 ; Zhu XX(朱欣鑫)2 ; He XJ(何兴建)1,2 ; Jiang J(江洁)1,2 ; Lu HQ(卢汉清)2
出版日期	2020
会议日期	2021.01.07
会议地点	日本横滨
英文摘要	Most image captioning models are autoregressive, i.e. they generate each word by conditioning on previously generated words, which leads to heavy latency during inference. Recently, non-autoregressive decoding has been proposed in machine translation to speed up the inference time by generating all words in parallel. Typically, these models use the word-level cross-entropy loss to optimize each word independently. However, such a learning process fails to consider the sentence-level consistency, thus resulting in inferior generation quality of these non-autoregressive models. In this paper, we propose a Non-Autoregressive Image Captioning (NAIC) model with a novel training paradigm: Counterfactuals-critical Multi-Agent Learning (CMAL). CMAL formulates NAIC as a multi-agent reinforcement learning system where positions in the target sequence are viewed as agents that learn to cooperatively maximize a sentence-level reward. Besides, we propose to utilize massive unlabeled images to boost captioning performance. Extensive experiments on MSCOCO image captioning benchmark show that our NAIC model achieves a performance comparable to state-of-the-art autoregressive models, while brings 13.9x decoding speedup.
源URL	[http://ir.ia.ac.cn/handle/173211/44986]
专题	自动化研究所_模式识别国家重点实验室_图像与视频分析团队中国科学院自动化研究所
通讯作者	Liu J(刘静)
作者单位	1.School of Artificial Intelligence, University of Chinese Academy of Sciences 2.National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences
推荐引用方式 GB/T 7714	Guo LT,Liu J,Zhu XX,et al. Non-Autoregressive Image Captioning with Counterfactuals-Critical Multi-Agent Learning[C]. 见:. 日本横滨. 2021.01.07.

入库方式： OAI收割

来源：自动化研究所

浏览0

下载0

收藏0

其他版本

除非特别说明，本系统中所有内容都受版权保护，并保留所有权利。