中国科学院机构知识库网格系统: Directional Deep Embedding and Appearance Learning for Fast Video Object Segmentation

Directional Deep Embedding and Appearance Learning for Fast Video Object Segmentation

文献类型：期刊论文


作者	Yin, Yingjie1,2,3 ; Xu, De1,2 ; Wang, Xingang1,2 ; Zhang, Lei 3
刊名	IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS
出版日期	2021-02-12
页码	11
关键词	Feature extraction Kernel Object segmentation Faces Probabilistic logic Learning systems Image segmentation Deep appearance learning directional deep embedding learning directional statistics-based learning video object segmentation (VOS)
ISSN号	2162-237X
DOI	10.1109/TNNLS.2021.3054769
通讯作者	Yin, Yingjie(haidaying.jie@163.com)
英文摘要	Most recent semisupervised video object segmentation (VOS) methods rely on fine-tuning deep convolutional neural networks online using the given mask of the first frame or predicted masks of subsequent frames. However, the online fine-tuning process is usually time-consuming, limiting the practical use of such methods. We propose a directional deep embedding and appearance learning (DDEAL) method, which is free of the online fine-tuning process, for fast VOS. First, a global directional matching module (GDMM), which can be efficiently implemented by parallel convolutional operations, is proposed to learn a semantic pixel-wise embedding as an internal guidance. Second, an effective directional appearance model-based statistics is proposed to represent the target and background on a spherical embedding space for VOS. Equipped with the GDMM and the directional appearance model learning module, DDEAL learns static cues from the labeled first frame and dynamically updates cues of the subsequent frames for object segmentation. Our method exhibits the state-of-the-art VOS performance without using online fine-tuning. Specifically, it achieves a J & F mean score of 74.8% on DAVIS 2017 data set and an overall score G of 71.3% on the large-scale YouTube-VOS data set, while retaining a speed of 25 fps with a single NVIDIA TITAN Xp GPU. Furthermore, our faster version runs 31 fps with only a little accuracy loss.
资助项目	National Natural Science Foundation of China[61703398] ; Science and Technology Program of Beijing Municipal Science and Technology Commission[Z191100008019004] ; Hong Kong Research Grants Council (RGC) General Research Fund (GRF)[PolyU 152135/16E] ; Hong Kong Scholars Program[XJ2017031]
WOS研究方向	Computer Science ; Engineering
语种	英语
WOS记录号	WOS:000733511700001
出版者	IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
资助机构	National Natural Science Foundation of China ; Science and Technology Program of Beijing Municipal Science and Technology Commission ; Hong Kong Research Grants Council (RGC) General Research Fund (GRF) ; Hong Kong Scholars Program
源URL	[http://ir.ia.ac.cn/handle/173211/47109]
专题	精密感知与控制研究中心_精密感知与控制
通讯作者	Yin, Yingjie
作者单位	1.Chinese Acad Sci, Res Ctr Precis Sensing & Control, Inst Automat, Beijing 100190, Peoples R China 2.Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China 3.Hong Kong Polytech Univ, Dept Comp, Hong Kong, Peoples R China
推荐引用方式 GB/T 7714	Yin, Yingjie,Xu, De,Wang, Xingang,et al. Directional Deep Embedding and Appearance Learning for Fast Video Object Segmentation[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS,2021:11.
APA	Yin, Yingjie,Xu, De,Wang, Xingang,&Zhang, Lei.(2021).Directional Deep Embedding and Appearance Learning for Fast Video Object Segmentation.IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS,11.
MLA	Yin, Yingjie,et al."Directional Deep Embedding and Appearance Learning for Fast Video Object Segmentation".IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2021):11.

入库方式： OAI收割

来源：自动化研究所

下载0

Directional Deep Embedding and Appearance Learning for Fast Video Object Segmentation

其他版本