Pose estimation at night in infrared images using a lightweight multi-stage attention network
文献类型:期刊论文
作者 | Zang, Ying; Fan, Chunpeng; Zheng ZY(郑泽宇)![]() |
刊名 | SIGNAL IMAGE AND VIDEO PROCESSING
![]() |
出版日期 | 2021 |
卷号 | 15期号:8页码:1757-1765 |
关键词 | Pose estimation Far-infrared image LMANet Spatial attention Channel attention |
ISSN号 | 1863-1703 |
产权排序 | 3 |
英文摘要 | Human Keypoints Detection is a relatively basic task in computer vision; it is the pre-task of human action recognition, behavior analysis and human-computer interaction. Since most abnormal actions occur at night, how to effectively extract skeleton sequence data in a low-light or completely dark environment poses a huge challenge for its identification. This paper proposes to use far infrared images to detection key points of the human body, which can solve the problem of human pose estimation under challenging weather conditions such as total darkness, smoke, inclement weather and glare. However, far-infrared images have some shortcomings, such as low resolution, noise and thermal characteristics; the skeleton data need to be provided in real time for the next stage of task. Based on the above reasons, this paper proposes a lightweight multi-stage attention network (LMANet) to detect the key points of human at night. This new network structure adds context information through the large receptive field, which helps to assist the detection of neighboring key points through this information, but for the sake of lightweight consideration, this article only extends the network to two stages. In addition, this article uses the attention module to effectively select channels with a large amount of information and highlight the features of key points, while eliminating background interference. In order to detect key points of the human in various complex environments, we use techniques such as difficult sample mining which improves the accuracy of key points with low confidence. Our network has been verified on two visible light datasets, fully demonstrating excellent performance. This paper successfully introduces far-infrared images into the field of pose estimation, because there is no public dataset for far-infrared pose estimation. In this paper, 700 images are selected for annotation from multiple public far-infrared object detection, segmentation and action recognition datasets; our algorithm is verified on this dataset; the effect is very good. After the paper is published, we will publish our key points of the human body annotated documents. |
WOS研究方向 | Engineering ; Imaging Science & Photographic Technology |
语种 | 英语 |
WOS记录号 | WOS:000646502800002 |
源URL | [http://ir.sia.cn/handle/173321/28858] ![]() |
专题 | 沈阳自动化研究所_数字工厂研究室 |
通讯作者 | Zang, Ying |
作者单位 | 1.Shenyang Institute of Computing Technology, Chinese Academy of Sciences, Shenyang 110168, China 2.University of Chinese Academy of Sciences, Beijing 100049, China 3.Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang 110168, China 4.Hangzhou PingxingShijie Co., Ltd., Hangzhou 311203, China 5.School of Information Engineering, Huzhou University, Huzhou 313000, China |
推荐引用方式 GB/T 7714 | Zang, Ying,Fan, Chunpeng,Zheng ZY,et al. Pose estimation at night in infrared images using a lightweight multi-stage attention network[J]. SIGNAL IMAGE AND VIDEO PROCESSING,2021,15(8):1757-1765. |
APA | Zang, Ying,Fan, Chunpeng,Zheng ZY,&Yang, Dongsheng.(2021).Pose estimation at night in infrared images using a lightweight multi-stage attention network.SIGNAL IMAGE AND VIDEO PROCESSING,15(8),1757-1765. |
MLA | Zang, Ying,et al."Pose estimation at night in infrared images using a lightweight multi-stage attention network".SIGNAL IMAGE AND VIDEO PROCESSING 15.8(2021):1757-1765. |
入库方式: OAI收割
来源:沈阳自动化研究所
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。