Human behaviour recognition with mid-level representations for crowd understanding and analysis
Document type: Journal article
Authors | Sun, Bangyong2,3; Yuan, Nianzeng2; Li, Shuying1; Wu, Siyuan3; Wang, Nan3,4 |
Journal | IET Image Processing |
ISSN | 1751-9659 |
DOI | 10.1049/ipr2.12147 |
Ownership ranking | 1 |
Abstract (English) | Crowd understanding and analysis have received increasing attention for a couple of decades, and the development of human behaviour recognition strongly supports the application of crowd understanding and analysis. Human behaviour recognition usually seeks to automatically analyse ongoing movements and actions in different camera views by applying various machine learning methodologies to unknown video clips or image sequences. Compared with other data modalities such as documents and images, video data demands much higher computational and storage resources to process. The idea of using mid-level semantic concepts to represent human actions from videos is explored, and it is argued that these semantic attributes enable the construction of more descriptive methods for human action recognition. The mid-level attributes, initialized by clustering, are built upon low-level features and fully utilize the discrepancies between different action classes, which can capture the importance of each attribute for each action class. In this way, the representation is constructed to be semantically rich and capable of highly discriminative performance even when paired with simple linear classifiers. The method is verified on three challenging datasets (KTH, UCF50 and HMDB51), and the experimental results demonstrate that our method achieves better results than the baseline methods on human action recognition. © 2021 The Authors. IET Image Processing published by John Wiley & Sons Ltd on behalf of The Institution of Engineering and Technology |
Language | English |
Publisher | John Wiley and Sons Inc |
WOS accession number | WOS:000624128900001 |
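Source URL | [http://ir.opt.ac.cn/handle/181661/94526] |
Collection | Xi'an Institute of Optics and Precision Mechanics_Center for Optical Imagery Analysis and Learning |
Corresponding authors | Li, Shuying; Wu, Siyuan |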
Author affiliations | 1. School of Automation, Xi'an University of Posts & Telecommunications, Xi'an; Shaanxi; 710121, China; 2. College of Printing, Packaging Engineering and Digital Media, Xi'an University of Technology, Xi'an; Shaanxi; 710048, China; 3. The Key Laboratory of Spectral Imaging Technology CAS, Xi'an Institute of Optics and Precision Mechanics, Chinese Academy of Sciences, Xi'an; Shaanxi; 710119, China; 4. University of Chinese Academy of Sciences, 19A Yuquanlu, Beijing; 100049, China |
Recommended citation: GB/T 7714 | Sun, Bangyong,Yuan, Nianzeng,Li, Shuying,et al. Human behaviour recognition with mid-level representations for crowd understanding and analysis[J]. IET Image Processing. |
APA | Sun, Bangyong,Yuan, Nianzeng,Li, Shuying,Wu, Siyuan,&Wang, Nan. |
MLA | Sun, Bangyong,et al."Human behaviour recognition with mid-level representations for crowd understanding and analysis".IET Image Processing |
Ingest method: OAI harvesting
Source: Xi'an Institute of Optics and Precision Mechanics