中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
TwinNet: Twin Structured Knowledge Transfer Network for Weakly Supervised Action Localization

文献类型:期刊论文

作者Xiao-Yu Zhang3; Hai-Chao Shi3; Chang-Sheng Li1; Li-Xin Duan2
刊名Machine Intelligence Research
出版日期2022
卷号19期号:3页码:227-246
关键词Knowledge transfer weakly supervised learning self-attention mechanism representation learning action localization
ISSN号2731-538X
DOI10.1007/s11633-022-1333-4
英文摘要Action recognition and localization in untrimmed videos is important for many applications and have attracted a lot of attention. Since full supervision with frame-level annotation places an overwhelming burden on manual labeling effort, learning with weak video-level supervision becomes a potential solution. In this paper, we propose a novel weakly supervised framework to recognize actions and locate the corresponding frames in untrimmed videos simultaneously. Considering that there are abundant trimmed videos publicly available and well-segmented with semantic descriptions, the instructive knowledge learned on trimmed videos can be fully leveraged to analyze untrimmed videos. We present an effective knowledge transfer strategy based on inter-class semantic relevance. We also take advantage of the self-attention mechanism to obtain a compact video representation, such that the influence of background frames can be effectively eliminated. A learning architecture is designed with twin networks for trimmed and untrimmed videos, to facilitate transferable self-attentive representation learning. Extensive experiments are conducted on three untrimmed benchmark datasets (i.e., THUMOS14, ActivityNet1.3, and MEXaction2), and the experimental results clearly corroborate the efficacy of our method. It is especially encouraging to see that the proposed weakly supervised method even achieves comparable results to some fully supervised methods.
源URL[http://ir.ia.ac.cn/handle/173211/55943]  
专题自动化研究所_学术期刊_International Journal of Automation and Computing
作者单位1.School of Computer Science, Beijing Institute of Technology, Beijing 100081, China
2.School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China
3.Institute of Information Engineering, Chinese Academy of Sciences, Beijing 100093, China
推荐引用方式
GB/T 7714
Xiao-Yu Zhang,Hai-Chao Shi,Chang-Sheng Li,et al. TwinNet: Twin Structured Knowledge Transfer Network for Weakly Supervised Action Localization[J]. Machine Intelligence Research,2022,19(3):227-246.
APA Xiao-Yu Zhang,Hai-Chao Shi,Chang-Sheng Li,&Li-Xin Duan.(2022).TwinNet: Twin Structured Knowledge Transfer Network for Weakly Supervised Action Localization.Machine Intelligence Research,19(3),227-246.
MLA Xiao-Yu Zhang,et al."TwinNet: Twin Structured Knowledge Transfer Network for Weakly Supervised Action Localization".Machine Intelligence Research 19.3(2022):227-246.

入库方式: OAI收割

来源:自动化研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。