中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Pose-Guided Multi-Granularity Attention Network for Text-Based Person Search

文献类型:会议论文

作者Jing Y(荆雅)2,3; Si CY(司晨阳)2,3; Wang JB(王君波)2,3; Wang W(王威)2,3; Wang L(王亮)1,2,3; Tan TN(谭铁牛)1,2,3
出版日期2020-02
会议日期2020-2
会议地点New York, USA
英文摘要

Text-based person search aims to retrieve the corresponding person images in an image database by virtue of a describing sentence about the person, which poses great potential for various applications such as video surveillance. Extracting visual contents corresponding to the human description is the key to this cross-modal matching problem. Moreover, correlated images and descriptions involve different granularities of semantic relevance, which is usually ignored in previous methods. To exploit the multilevel corresponding visual contents, we propose a pose-guided multi-granularity attention network (PMA). Firstly, we propose a coarse alignment network (CA) to select the related image regions to the global description by a similarity-based attention. To further capture the phrase-related visual body part, a fine-grained alignment network (FA) is proposed, which employs pose information to learn latent semantic alignment between visual body part and textual noun phrase. To verify the effectiveness of our model, we perform extensive experiments on the CUHK Person Description Dataset (CUHK-PEDES) which is currently the only available dataset for text-based person search. Experimental results show that our approach outperforms the state-of-the-art methods by 15 % in terms of the top-1 metric.
 

源URL[http://ir.ia.ac.cn/handle/173211/44445]  
专题自动化研究所_智能感知与计算研究中心
通讯作者Wang W(王威)
作者单位1.Center for Excellence in Brain Science and Intelligence Technology (CEBSIT), Institute of Automation, Chinese Academy of Sciences (CASIA)
2.University of Chinese Academy of Sciences (UCAS)
3.Center for Research on Intelligent Perception and Computing (CRIPAC), National Laboratory of Pattern Recognition (NLPR)
推荐引用方式
GB/T 7714
Jing Y,Si CY,Wang JB,et al. Pose-Guided Multi-Granularity Attention Network for Text-Based Person Search[C]. 见:. New York, USA. 2020-2.

入库方式: OAI收割

来源:自动化研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。