Recognize Complex Events From Static Images by Fusing Deep Channels
Document type: Conference paper
Authors | Yuanjun Xiong; Kai Zhu; Dahua Lin; Xiaoou Tang |
Publication date | 2015 |
Conference name | IEEE Conference on Computer Vision and Pattern Recognition |
Conference location | Boston, USA |
Abstract | A considerable portion of web images capture events that occur in our personal lives or social activities. In this paper, we aim to develop an effective method for recognizing events from such images. Despite the sheer amount of study on event recognition, most existing methods rely on videos and are not directly applicable to this task. Generally, events are complex phenomena that involve interactions among people and objects, and therefore analysis of event photos requires techniques that can go beyond recognizing individual objects and carry out joint reasoning based on evidences of multiple aspects. Inspired by the recent success of deep learning, we formulate a multi-layer framework to tackle this problem, which takes into account both visual appearance and the interactions among humans and objects, and combines them via semantic fusion. An important issue arising here is that humans and objects discovered by detectors are in the form of bounding boxes, and there is no straightforward way to represent their interactions and incorporate them with a deep network. We address this using a novel strategy that projects the detected instances onto multi-scale spatial maps. On a large dataset with $60,000$ images, the proposed method achieved substantial improvement over the state-of-the-art, raising the accuracy of event recognition by over $10\%$. |
Indexed by | EI |
Language | English |
Source URL | [http://ir.siat.ac.cn:8080/handle/172644/6696] |
Collection | Shenzhen Institutes of Advanced Technology_Institute of Advanced Integration Technology |
Author affiliation | 2015 |
Recommended citation (GB/T 7714) | Yuanjun Xiong, Kai Zhu, Dahua Lin, et al. Recognize Complex Events From Static Images by Fusing Deep Channels[C]. In: IEEE Conference on Computer Vision and Pattern Recognition. Boston, USA. |
Ingest method: OAI harvesting
Source: Shenzhen Institutes of Advanced Technology
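
The abstract mentions projecting detected humans and objects, given as bounding boxes, onto multi-scale spatial maps so that a deep network can consume them. The abstract does not say how the projection is done, so the following is only a rough sketch under stated assumptions: boxes use coordinates normalized to [0, 1], there is one map per class and per grid size, and each cell stores the highest detection score among the boxes overlapping it. The names `project_to_map`, `multi_scale_maps`, and the grid sizes `(1, 2, 4)` are hypothetical and are not taken from the paper.

```python
import math
from typing import Dict, List, Sequence, Tuple

# (class_name, score, x_min, y_min, x_max, y_max); coordinates normalized to [0, 1].
Detection = Tuple[str, float, float, float, float, float]
ClassMaps = Dict[str, List[List[float]]]


def project_to_map(detections: Sequence[Detection],
                   classes: Sequence[str],
                   grid: int) -> ClassMaps:
    """Rasterize boxes onto one grid x grid map per class.

    Assumption (not from the paper): a cell holds the highest score among
    the boxes of that class that overlap it, and 0.0 if none do.
    """
    maps: ClassMaps = {c: [[0.0] * grid for _ in range(grid)] for c in classes}
    last = grid - 1
    for name, score, x0, y0, x1, y1 in detections:
        if name not in maps or x1 <= x0 or y1 <= y0:
            continue  # skip unknown classes and empty boxes
        # Index range of the cells that the box overlaps, clamped to the grid.
        col_lo = min(last, max(0, int(x0 * grid)))
        row_lo = min(last, max(0, int(y0 * grid)))
        col_hi = min(last, max(0, math.ceil(x1 * grid) - 1))
        row_hi = min(last, max(0, math.ceil(y1 * grid) - 1))
        for r in range(row_lo, row_hi + 1):
            for c in range(col_lo, col_hi + 1):
                maps[name][r][c] = max(maps[name][r][c], score)
    return maps


def multi_scale_maps(detections: Sequence[Detection],
                     classes: Sequence[str],
                     grids: Sequence[int] = (1, 2, 4)) -> Dict[int, ClassMaps]:
    """Build the per-class maps at every grid size in `grids`."""
    return {g: project_to_map(detections, classes, g) for g in grids}


# Usage: a person in the top-left quadrant and a cake in the bottom-right one.
example = [
    ("person", 0.9, 0.0, 0.0, 0.5, 0.5),
    ("cake", 0.6, 0.5, 0.5, 1.0, 1.0),
]
maps = multi_scale_maps(example, ["person", "cake"])
print(maps[2]["person"])
print(maps[2]["cake"])
```

Coarse grids record only whether a class is present, while finer grids retain where each instance sits relative to the others, which is one plausible way to expose human-object layout to a network. Stacking the maps as input channels is likewise an assumption here rather than a detail stated in the abstract.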