中国科学院机构知识库网格系统: Inductive Zero-Shot Image Annotation via Embedding Graph

Inductive Zero-Shot Image Annotation via Embedding Graph

文献类型：期刊论文


作者	Wang, Fangxin1,2 ; Liu, Jie1 ; Zhang, Shuwu1,3 ; Zhang, Guixuan1 ; Li, Yuejun1,2 ; Yuan, Fei1
刊名	IEEE ACCESS
出版日期	2019
卷号	7 页码:107816-107830
关键词	Contextualized word embeddings graph convolutional network image annotation Node2Vec zero-shot
ISSN号	2169-3536
DOI	10.1109/ACCESS.2019.2925383
通讯作者	Liu, Jie(jie.liu@ia.ac.cn)
英文摘要	Conventional image annotation systems can only handle those images having labels within the exist library, but cannot recognize those novel labels. In order to learn new concepts, one has to gather large amount of labeled images and train the model from scratch. More importantly, it can come with a high price to collect those labeled images. For these reasons, we put forward a zero-shot image annotation model, to reduce the demand for the images with novel labels. In this paper, we focus on the two big challenges of zero-shot image annotation: polysemous words and a strong bias in the generalized zero-shot setting. For the first problem, instead of training on large corpus datasets as previous methods, we propose to adopt Node2Vec to obtain contextualized word embeddings, which can easily produce word vectors of the polysemous words. For the second problem, we alleviate the strong bias in two ways: on one hand, we utilize a model based on graph convolutional network (GCN) to make target images involved in the training process; on the other hand, we put forward a novel semantic coherent (SC) loss to capture the semantic relations of the source and target labels. The extensive experiments on NUSWIDE, COCO, IAPR TC-12, and Core15k datasets show the superiority of the proposed model and the annotation performance get improved by 4%-6% comparing with state-of-the-art methods.
资助项目	National Key R&D Program of China[2018YFC0809200] ; National Key Research and Development Plan[61602480]
WOS研究方向	Computer Science ; Engineering ; Telecommunications
语种	英语
WOS记录号	WOS:000481980800008
出版者	IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
资助机构	National Key R&D Program of China ; National Key Research and Development Plan
源URL	[http://ir.ia.ac.cn/handle/173211/26113]
专题	数字内容技术与服务研究中心_新媒体服务与管理技术
通讯作者	Liu, Jie
作者单位	1.Chinese Acad Sci, Inst Automat, Beijing, Peoples R China 2.Univ Chinese Acad Sci, Beijing, Peoples R China 3.Beijing Film Acad, AICFVE, Beijing, Peoples R China
推荐引用方式 GB/T 7714	Wang, Fangxin,Liu, Jie,Zhang, Shuwu,et al. Inductive Zero-Shot Image Annotation via Embedding Graph[J]. IEEE ACCESS,2019,7:107816-107830.
APA	Wang, Fangxin,Liu, Jie,Zhang, Shuwu,Zhang, Guixuan,Li, Yuejun,&Yuan, Fei.(2019).Inductive Zero-Shot Image Annotation via Embedding Graph.IEEE ACCESS,7,107816-107830.
MLA	Wang, Fangxin,et al."Inductive Zero-Shot Image Annotation via Embedding Graph".IEEE ACCESS 7(2019):107816-107830.

入库方式： OAI收割

来源：自动化研究所

下载0

Inductive Zero-Shot Image Annotation via Embedding Graph

其他版本