Cross-Modal Cross-Domain Moment Alignment Network for Person Search
文献类型:会议论文
作者 | Jing Y(荆雅)1,3![]() ![]() ![]() ![]() |
出版日期 | 2020-06 |
会议日期 | 2020-6 |
会议地点 | virtual |
英文摘要 | Text-based person search has drawn increasing attention due to its wide applications in video surveillance. However, most of the existing models depend heavily on paired image-text data, which is very expensive to acquire. Moreover, they always face huge performance drop when directly exploiting them to new domains. To overcome this problem, we make the first attempt to adapt the model to new target domains in the absence of pairwise labels, which combines the challenges from both cross-modal (text-based) person search and cross-domain person search. Specially, we propose a moment alignment network (MAN) to solve the cross-modal cross-domain person search task in this paper. The idea is to learn three effective moment alignments including domain alignment (DA), cross-modal alignment (CA) and exemplar alignment (EA), which together can learn domain-invariant and semantic aligned cross-modal representations to improve model generalization. Extensive experiments are conducted on CUHK Person Description dataset (CUHK-PEDES) and Richly Annotated Pedestrian dataset (RAP). Experimental results show that our proposed model achieves the state-of-the-art performances on five transfer tasks. |
源URL | [http://ir.ia.ac.cn/handle/173211/44446] ![]() |
专题 | 自动化研究所_智能感知与计算研究中心 |
通讯作者 | Wang W(王威) |
作者单位 | 1.University of Chinese Academy of Sciences (UCAS) 2.Center for Excellence in Brain Science and Intelligence Technology (CEBSIT), Institute of Automation, Chinese Academy of Sciences (CASIA) 3.Center for Research on Intelligent Perception and Computing (CRIPAC), National Laboratory of Pattern Recognition (NLPR) |
推荐引用方式 GB/T 7714 | Jing Y,Wang W,Wang L,et al. Cross-Modal Cross-Domain Moment Alignment Network for Person Search[C]. 见:. virtual. 2020-6. |
入库方式: OAI收割
来源:自动化研究所
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。