Video parsing via spatiotemporally analysis with images
文献类型:期刊论文
作者 | Li, Xuelong![]() ![]() |
刊名 | multimedia tools and applications
![]() |
出版日期 | 2016-10-01 |
卷号 | 75期号:19页码:11961-11976 |
关键词 | Semantic video parsing Transfer learning Maximum a posterior (MAP) inference Markov Random Felds (MRF) Prior contextual constraint |
ISSN号 | 1380-7501 |
产权排序 | 1 |
通讯作者 | lu, xiaoqiang (luxq666666@gmail.com) |
英文摘要 | effective parsing of video through the spatial and temporal domains is vital to many computer vision problems because it is helpful to automatically label objects in video instead of manual fashion, which is tedious. some literatures propose to parse the semantic information on individual 2d images or individual video frames, however, these approaches only take use of the spatial information, ignore the temporal continuity information and fail to consider the relevance of frames. on the other hand, some approaches which only consider the spatial information attempt to propagate labels in the temporal domain for parsing the semantic information of the whole video, yet the non-injective and non-surjective natures can cause the black hole effect. in this paper, inspirited by some annotated image datasets (e.g., stanford background dataset, labelme, and sift-flow), we propose to transfer or propagate such labels from images to videos. the proposed approach consists of three main stages: i) the posterior category probability density function (pdf) is learned by an algorithm which combines frame relevance and label propagation from images. ii) the prior contextual constraint pdf on the map of pixel categories through whole video is learned by the markov random fields (mrf). iii) finally, based on both learned pdfs, the final parsing results are yielded up to the maximum a posterior (map) process which is computed via a very efficient graph-cut based integer optimization algorithm. the experiments show that the black hole effect can be effectively handled by the proposed approach. |
WOS标题词 | science & technology ; technology |
类目[WOS] | computer science, information systems ; computer science, software engineering ; computer science, theory & methods ; engineering, electrical & electronic |
研究领域[WOS] | computer science ; engineering |
关键词[WOS] | energy minimization ; graph cuts ; segmentation ; algorithms |
收录类别 | SCI ; EI |
语种 | 英语 |
WOS记录号 | WOS:000382678200021 |
源URL | [http://ir.opt.ac.cn/handle/181661/28254] ![]() |
专题 | 西安光学精密机械研究所_光学影像学习与分析中心 |
作者单位 | Chinese Acad Sci, Xian Inst Opt & Precis Mech, Ctr OPT IMagery Anal & Learning OPTIMAL, State Key Lab Transient Opt & Photon, Xian 710119, Shaanxi, Peoples R China |
推荐引用方式 GB/T 7714 | Li, Xuelong,Mou, Lichao,Lu, Xiaoqiang. Video parsing via spatiotemporally analysis with images[J]. multimedia tools and applications,2016,75(19):11961-11976. |
APA | Li, Xuelong,Mou, Lichao,&Lu, Xiaoqiang.(2016).Video parsing via spatiotemporally analysis with images.multimedia tools and applications,75(19),11961-11976. |
MLA | Li, Xuelong,et al."Video parsing via spatiotemporally analysis with images".multimedia tools and applications 75.19(2016):11961-11976. |
入库方式: OAI收割
来源:西安光学精密机械研究所
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。