中国科学院机构知识库网格系统: Cross-Modal Knowledge Adaptation for Language-Based Person Search

Cross-Modal Knowledge Adaptation for Language-Based Person Search

文献类型：期刊论文


作者	Chen, Yucheng 2,3,4; Huang, Rui 5; Chang, Hong 2,3,4; Tan, Chuanqi 1; Xue, Tao 1; Ma, Bingpeng 4
刊名	IEEE TRANSACTIONS ON IMAGE PROCESSING
出版日期	2021
卷号	30 页码:4057-4069
关键词	Feature extraction Task analysis Lighting Learning systems Logic gates Knowledge engineering Training Language-based person search cross-modal knowledge adaptation image-specific information
ISSN号	1057-7149
DOI	10.1109/TIP.2021.3068825
英文摘要	In this paper, we present a method named Cross-Modal Knowledge Adaptation (CMKA) for language-based person search. We argue that the image and text information are not equally important in determining a person's identity. In other words, image carries image-specific information such as lighting condition and background, while text contains more modal agnostic information that is more beneficial to cross-modal matching. Based on this consideration, we propose CMKA to adapt the knowledge of image to the knowledge of text. Specially, text-to-image guidance is obtained at different levels: individuals, lists, and classes. By combining these levels of knowledge adaptation, the image-specific information is suppressed, and the common space of image and text is better constructed. We conduct experiments on the CUHK-PEDES dataset. The experimental results show that the proposed CMKA outperforms the state-of-the-art methods.
资助项目	Natural Science Foundation of China (NSFC)[61876171] ; Natural Science Foundation of China (NSFC)[61976203] ; Open Project Fund from Shenzhen Institute of Artificial Intelligence and Robotics for Society[AC01202005015] ; Open Project Fund from Shenzhen Institute of Artificial Intelligence and Robotics for Society[2019-INT006]
WOS研究方向	Computer Science ; Engineering
语种	英语
WOS记录号	WOS:000638400000007
出版者	IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
源URL	[http://119.78.100.204/handle/2XEOYT63/16635]
专题	中国科学院计算技术研究所期刊论文_英文
通讯作者	Ma, Bingpeng
作者单位	1.Tencent, Beijing 100193, Peoples R China 2.Chinese Acad Sci, Key Lab Intelligent Informat Proc, Beijing, Peoples R China 3.Chinese Acad Sci, Inst Comp Technol, Beijing 100190, Peoples R China 4.Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 100049, Peoples R China 5.Chinese Univ Hong Kong, Shenzhen Inst Artificial Intelligence & Robot Soc, Shenzhen 518172, Peoples R China
推荐引用方式 GB/T 7714	Chen, Yucheng,Huang, Rui,Chang, Hong,et al. Cross-Modal Knowledge Adaptation for Language-Based Person Search[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING,2021,30:4057-4069.
APA	Chen, Yucheng,Huang, Rui,Chang, Hong,Tan, Chuanqi,Xue, Tao,&Ma, Bingpeng.(2021).Cross-Modal Knowledge Adaptation for Language-Based Person Search.IEEE TRANSACTIONS ON IMAGE PROCESSING,30,4057-4069.
MLA	Chen, Yucheng,et al."Cross-Modal Knowledge Adaptation for Language-Based Person Search".IEEE TRANSACTIONS ON IMAGE PROCESSING 30(2021):4057-4069.

入库方式： OAI收割

来源：计算技术研究所

下载0

Cross-Modal Knowledge Adaptation for Language-Based Person Search

其他版本