Chinese Academy of Sciences Institutional Repositories Grid
Cross-Modal Knowledge Adaptation for Language-Based Person Search

Document type: Journal article

Authors: Chen, Yucheng [2,3,4]; Huang, Rui [5]; Chang, Hong [2,3,4]; Tan, Chuanqi [1]; Xue, Tao [1]; Ma, Bingpeng [4]
Journal: IEEE TRANSACTIONS ON IMAGE PROCESSING
Publication date: 2021
Volume: 30
Pages: 4057-4069
ISSN: 1057-7149
Keywords: Feature extraction; Task analysis; Lighting; Learning systems; Logic gates; Knowledge engineering; Training; Language-based person search; cross-modal knowledge adaptation; image-specific information
DOI: 10.1109/TIP.2021.3068825
Abstract: In this paper, we present a method named Cross-Modal Knowledge Adaptation (CMKA) for language-based person search. We argue that the image and text information are not equally important in determining a person's identity. In other words, image carries image-specific information such as lighting condition and background, while text contains more modal agnostic information that is more beneficial to cross-modal matching. Based on this consideration, we propose CMKA to adapt the knowledge of image to the knowledge of text. Specially, text-to-image guidance is obtained at different levels: individuals, lists, and classes. By combining these levels of knowledge adaptation, the image-specific information is suppressed, and the common space of image and text is better constructed. We conduct experiments on the CUHK-PEDES dataset. The experimental results show that the proposed CMKA outperforms the state-of-the-art methods.
Funding: Natural Science Foundation of China (NSFC) [61876171]; Natural Science Foundation of China (NSFC) [61976203]; Open Project Fund from Shenzhen Institute of Artificial Intelligence and Robotics for Society [AC01202005015]; Open Project Fund from Shenzhen Institute of Artificial Intelligence and Robotics for Society [2019-INT006]
WOS research areas: Computer Science; Engineering
Language: English
Publisher: IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
WOS accession number: WOS:000638400000007
Source URL: [http://119.78.100.204/handle/2XEOYT63/16635]
Collection: Institute of Computing Technology, Chinese Academy of Sciences: Journal Articles (English)
Corresponding author: Ma, Bingpeng
Author affiliations:
1. Tencent, Beijing 100193, Peoples R China
2. Chinese Acad Sci, Key Lab Intelligent Informat Proc, Beijing, Peoples R China
3. Chinese Acad Sci, Inst Comp Technol, Beijing 100190, Peoples R China
4. Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 100049, Peoples R China
5. Chinese Univ Hong Kong, Shenzhen Inst Artificial Intelligence & Robot Soc, Shenzhen 518172, Peoples R China
Recommended citation:
GB/T 7714: Chen, Yucheng, Huang, Rui, Chang, Hong, et al. Cross-Modal Knowledge Adaptation for Language-Based Person Search[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30: 4057-4069.
APA: Chen, Yucheng, Huang, Rui, Chang, Hong, Tan, Chuanqi, Xue, Tao, & Ma, Bingpeng. (2021). Cross-Modal Knowledge Adaptation for Language-Based Person Search. IEEE TRANSACTIONS ON IMAGE PROCESSING, 30, 4057-4069.
MLA: Chen, Yucheng, et al. "Cross-Modal Knowledge Adaptation for Language-Based Person Search". IEEE TRANSACTIONS ON IMAGE PROCESSING 30 (2021): 4057-4069.

Ingest method: OAI harvesting

Source: Institute of Computing Technology


Unless otherwise stated, all content in this system is protected by copyright, and all rights are reserved.