中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Recognize Complex Events From Static Images by Fusing Deep Channels

文献类型:会议论文

作者Yuanjun Xiong; Kai Zhu; Dahua Lin; Xiaoou Tang
出版日期2015
会议名称IEEE Conference on Computer Vision and Pattern Recognition
会议地点美国波士顿
英文摘要A considerable portion of web images capture events that occur in our personal lives or social activities. In this paper, we aim to develop an effective method for recognizing events from such images. Despite the sheer amount of study on event recognition, most existing methods rely on videos and are not directly applicable to this task. Generally, events are complex phenomena that involve interactions among people and objects, and therefore analysis of event photos requires techniques that can go beyond recognizing individual objects and carry out joint reasoning based on evidences of multiple aspects. Inspired by the recent success of deep learning, we formulate a multi-layer framework to tackle this problem, which takes into account both visual appearance and the interactions among humans and objects, and combines them via semantic fusion. An important issue arising here is that humans and objects discovered by detectors are in the form of bounding boxes, and there is no straightforward way to represent their interactions and incorporate them with a deep network. We address this using a novel strategy that projects the detected instances onto multi-scale spatial maps. On a large dataset with $60,000$ images, the proposed method achieved substantial improvement over the state-of-the-art, raising the accuracy of event recognition by over $10\%$.
收录类别EI
语种英语
源URL[http://ir.siat.ac.cn:8080/handle/172644/6696]  
专题深圳先进技术研究院_集成所
作者单位2015
推荐引用方式
GB/T 7714
Yuanjun Xiong,Kai Zhu,Dahua Lin,et al. Recognize Complex Events From Static Images by Fusing Deep Channels[C]. 见:IEEE Conference on Computer Vision and Pattern Recognition. 美国波士顿.

入库方式: OAI收割

来源:深圳先进技术研究院

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。