中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Deep Video Decaptioning

文献类型:会议论文

作者Chu, Pengpeng3; Quan, Weize2,4; Wang, Tong3; Wang, Pan1; Ren, Peiran1; Yan, Dong-Ming2,4
出版日期2021
会议日期2021.11.22-2021.11.25
会议地点Virtual Conference
英文摘要

Video decaptioning aims to remove subtitles from and repair occluded areas in videos. However, recent deep-learning-based inpainting methods mostly require the masks indicating the corrupted parts, and these masks are unavailable for the input subtitled videos. Moreover, useful information hidden in the background of subtitles might be lost when these masked areas are directly regarded as invalid as the common setting of inpainting methods. In addition, existing blind video decaptioning methods often suffer from incomplete subtitles removal. In this paper, we propose a generic framework for video decaptioning, which consists of a caption mask extraction network and a frame-attentionbased decaptioning network. The former is trained with supervision information using our proposed automatic annotation method, and predicts the position of the subtitle and background. The latter adopts an encoder-decoder architecture with the skip connection. The encoder extracts the features of all input frames. Then, multiple frame attention modules are used to aggregate these features from the spatial and temporal dimensions. Finally, the fused features are reconstructed into a target frame using the decoder. Extensive experiments demonstrate that our proposed method can accurately remove subtitles from videos in real time (60+ FPS), and outperforms the state-of-the-art approaches. Code is available at https://github.com/Linya-lab/Video_Decaptioning.

语种英语
源URL[http://ir.ia.ac.cn/handle/173211/51502]  
专题多模态人工智能系统全国重点实验室
通讯作者Wang, Tong
作者单位1.Alibaba Group, Hangzhou
2.University of Chinese Academy of Sciences
3.Donghua University
4.NLPR, Institute of Automation, Chinese Academy of Sciences
推荐引用方式
GB/T 7714
Chu, Pengpeng,Quan, Weize,Wang, Tong,et al. Deep Video Decaptioning[C]. 见:. Virtual Conference. 2021.11.22-2021.11.25.

入库方式: OAI收割

来源:自动化研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。