中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
CLOSE: Coupled content-semantic embedding

文献类型:期刊论文

作者Ren, Junhong; Zhang, Wensheng
刊名SIGNAL IMAGE AND VIDEO PROCESSING
出版日期2019-09-01
卷号13期号:6页码:1087-1095
关键词Video captioning Coupled content-semantic embedding Multi-content embedding
ISSN号1863-1703
DOI10.1007/s11760-019-01449-w
通讯作者Ren, Junhong(junhong.ren@ia.ac.cn)
英文摘要This paper proposes a novel coupled content semantic embedding (CLOSE) method with its application to video captioning. The motivation behind this design is to seek a consistent latent space between the content-semantic pair, in which the pair with same attribute is close to each other. Under the framework constructed on content-semantic embedding, CLOSE first learns two independent and reversible content-content and semantic-semantic embeddings, respectively, and then aggregates the two items via a coupled content-semantic embedding. Benefitting from the reversible property, our CLOSE can be pretrained with quantities of unlabeled data. In addition, casting on the work setting of feature embedding, a paradigm named multi-content embedding (MCE) is developed to describe the multi-focus information. Typically, MCE is capable of learning a feature embedding that can capture multiple discriminative contents. Extensive experiments compared with state-of-the-art methods on benchmark datasets, i.e., MSVD and MSR-VTT, demonstrate the effectiveness and superiority of the proposed CLOSE.
资助项目National Natural Science Foundation of China[61403376]
WOS研究方向Engineering ; Imaging Science & Photographic Technology
语种英语
WOS记录号WOS:000481886600006
出版者SPRINGER LONDON LTD
资助机构National Natural Science Foundation of China
源URL[http://ir.ia.ac.cn/handle/173211/27526]  
专题精密感知与控制研究中心_人工智能与机器学习
通讯作者Ren, Junhong
作者单位Chinese Acad Sci, Inst Automat, Beijing, Peoples R China
推荐引用方式
GB/T 7714
Ren, Junhong,Zhang, Wensheng. CLOSE: Coupled content-semantic embedding[J]. SIGNAL IMAGE AND VIDEO PROCESSING,2019,13(6):1087-1095.
APA Ren, Junhong,&Zhang, Wensheng.(2019).CLOSE: Coupled content-semantic embedding.SIGNAL IMAGE AND VIDEO PROCESSING,13(6),1087-1095.
MLA Ren, Junhong,et al."CLOSE: Coupled content-semantic embedding".SIGNAL IMAGE AND VIDEO PROCESSING 13.6(2019):1087-1095.

入库方式: OAI收割

来源:自动化研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。