中国科学院机构知识库网格系统: Image Caption Generation with Part of Speech Guidance

中国科学院机构知识库网格

Chinese Academy of Sciences Institutional Repositories Grid

Image Caption Generation with Part of Speech Guidance

文献类型：期刊论文


作者	Xinwei He; Baoguang Shi; Xiang Bai; Gui-Song Xia; Zhaoxiang Zhang; Weisheng Dong
刊名	Pattern Recognition Letters
出版日期	2017
期号	1 页码:1-9
关键词	Image Caption Generation Part-of-speech Tags Long Short-term Memory Visual Attributes
英文摘要	As a fundamental problem in image understanding, image caption generation has attracted much attention from both computer vision and natural language processing communities. In this paper, we focus on how to exploit the structure information of a natural sentence, which is used to describe the content of an image. We discover that the Part of Speech (PoS) tags of a sentence, are very effective cues for guiding the Long Short-Term Memory (LSTM) based word generator. More specifically, given a sentence, the PoS tag of each word is utilized to determine whether it is essential to input image representation into the word generator. Benefiting from such a strategy, our model can closely connect the visual attributes of an image to the word concepts in the natural language space. Experimental results on the most popular benchmark datasets, e.g., Flickr30k and MS COCO, consistently demonstrate that our method can significantly enhance the performance of a standard image caption generation model, and achieve the conpetitive results.
WOS记录号	WOS:000458876700028
源URL	[http://ir.ia.ac.cn/handle/173211/21589]
专题	自动化研究所_智能感知与计算研究中心
推荐引用方式 GB/T 7714	Xinwei He,Baoguang Shi,Xiang Bai,et al. Image Caption Generation with Part of Speech Guidance[J]. Pattern Recognition Letters,2017(1):1-9.
APA	Xinwei He,Baoguang Shi,Xiang Bai,Gui-Song Xia,Zhaoxiang Zhang,&Weisheng Dong.(2017).Image Caption Generation with Part of Speech Guidance.Pattern Recognition Letters(1),1-9.
MLA	Xinwei He,et al."Image Caption Generation with Part of Speech Guidance".Pattern Recognition Letters .1(2017):1-9.

入库方式： OAI收割

来源：自动化研究所

浏览0

下载0

收藏0

其他版本

除非特别说明，本系统中所有内容都受版权保护，并保留所有权利。