Cross-Modal Knowledge Adaptation for Language-Based Person Search
Document type: Journal article
Authors | Chen, Yucheng [2,3,4]; Huang, Rui [5]; Chang, Hong [2,3,4]; Tan, Chuanqi [1]; Xue, Tao [1]; Ma, Bingpeng [4] |
Journal | IEEE TRANSACTIONS ON IMAGE PROCESSING |
Publication date | 2021 |
Volume | 30; Pages: 4057-4069 |
ISSN | 1057-7149 |
Keywords | Feature extraction; Task analysis; Lighting; Learning systems; Logic gates; Knowledge engineering; Training; Language-based person search; cross-modal knowledge adaptation; image-specific information |
DOI | 10.1109/TIP.2021.3068825 |
Abstract | In this paper, we present a method named Cross-Modal Knowledge Adaptation (CMKA) for language-based person search. We argue that image and text information are not equally important in determining a person's identity. In other words, images carry image-specific information such as lighting conditions and background, while text contains more modality-agnostic information that is more beneficial to cross-modal matching. Based on this consideration, we propose CMKA to adapt the knowledge of images to the knowledge of text. Specifically, text-to-image guidance is obtained at different levels: individuals, lists, and classes. By combining these levels of knowledge adaptation, the image-specific information is suppressed, and the common space of image and text is better constructed. We conduct experiments on the CUHK-PEDES dataset. The experimental results show that the proposed CMKA outperforms the state-of-the-art methods. |
Funding | Natural Science Foundation of China (NSFC) [61876171]; Natural Science Foundation of China (NSFC) [61976203]; Open Project Fund from Shenzhen Institute of Artificial Intelligence and Robotics for Society [AC01202005015]; Open Project Fund from Shenzhen Institute of Artificial Intelligence and Robotics for Society [2019-INT006] |
WOS research areas | Computer Science; Engineering |
Language | English |
Publisher | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC |
WOS accession number | WOS:000638400000007 |
Source URL | [http://119.78.100.204/handle/2XEOYT63/16635] |
Collection | Institute of Computing Technology, Chinese Academy of Sciences: Journal Articles (English) |
Corresponding author | Ma, Bingpeng |
Author affiliations | 1. Tencent, Beijing 100193, Peoples R China; 2. Chinese Acad Sci, Key Lab Intelligent Informat Proc, Beijing, Peoples R China; 3. Chinese Acad Sci, Inst Comp Technol, Beijing 100190, Peoples R China; 4. Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 100049, Peoples R China; 5. Chinese Univ Hong Kong, Shenzhen Inst Artificial Intelligence & Robot Soc, Shenzhen 518172, Peoples R China |
Recommended citation (GB/T 7714) | Chen, Yucheng,Huang, Rui,Chang, Hong,et al. Cross-Modal Knowledge Adaptation for Language-Based Person Search[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING,2021,30:4057-4069. |
APA | Chen, Yucheng,Huang, Rui,Chang, Hong,Tan, Chuanqi,Xue, Tao,&Ma, Bingpeng.(2021).Cross-Modal Knowledge Adaptation for Language-Based Person Search.IEEE TRANSACTIONS ON IMAGE PROCESSING,30,4057-4069. |
MLA | Chen, Yucheng,et al."Cross-Modal Knowledge Adaptation for Language-Based Person Search".IEEE TRANSACTIONS ON IMAGE PROCESSING 30(2021):4057-4069. |
Ingestion method: OAI harvesting
Source: Institute of Computing Technology