Self-attention Guidance Based Crowd Localization and Counting
文献类型:期刊论文
作者 | Zhouzhou Ma1,2; Guanghua Gu1,2; Wenrui Zhao1,2 |
刊名 | Machine Intelligence Research
![]() |
出版日期 | 2024 |
卷号 | 21期号:5页码:966-982 |
关键词 | Crowd localization crowd counting transformer point supervision object detection |
ISSN号 | 2731-538X |
DOI | 10.1007/s11633-023-1428-6 |
英文摘要 | Most existing studies on crowd analysis are limited to the level of counting, which cannot provide the exact location of individuals. This paper proposes a self-attention guidance based crowd localization and counting network (SA-CLCN), which can simultaneously locate and count crowds. We take the form of object detection, using the original point annotations of crowd datasets as supervision to train the network. Ultimately, the center point coordinate of each head as well as the number of crowds are predicted. Specifically, to cope with the spatial and positional variations of the crowd, the proposed method introduces transformer to construct a global local feature extractor (GLFE) together with the convolutional structure. It establishes the near-to-far dependency between elements so that the global context and local detail features of the crowd image can be extracted simultaneously. Then, this paper designs a pyramid feature fusion module (PFFM) to fuse the global and local information from high level to low level to obtain a multiscale feature representation. In downstream tasks, this paper predicts candidate point offsets and confidence scores by a simple regression header and classification header. In addition, the Hungarian algorithm is used to match the predicted point set and the labelled point set to facilitate the calculation of losses. The proposed network avoids the errors or higher costs associated with using traditional density maps or bounding box annotations. Importantly, we have conducted extensive experiments on several crowd datasets, and the proposed method has produced competitive results in both counting and localization. |
源URL | [http://ir.ia.ac.cn/handle/173211/59425] ![]() |
专题 | 自动化研究所_学术期刊_International Journal of Automation and Computing |
作者单位 | 1.School of Information Science and Engineering, Yanshan University, Qinhuangdao 066000, China 2.Hebei Key Laboratory of Information Transmission and Signal Processing, Qinhuangdao 066000, China |
推荐引用方式 GB/T 7714 | Zhouzhou Ma,Guanghua Gu, Wenrui Zhao. Self-attention Guidance Based Crowd Localization and Counting[J]. Machine Intelligence Research,2024,21(5):966-982. |
APA | Zhouzhou Ma,Guanghua Gu,& Wenrui Zhao.(2024).Self-attention Guidance Based Crowd Localization and Counting.Machine Intelligence Research,21(5),966-982. |
MLA | Zhouzhou Ma,et al."Self-attention Guidance Based Crowd Localization and Counting".Machine Intelligence Research 21.5(2024):966-982. |
入库方式: OAI收割
来源:自动化研究所
浏览0
下载0
收藏0
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。