中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
PAN: Prototype-based Adaptive Network for Robust Cross-Modal Retrieval

文献类型:会议论文

作者Zhixiong Zeng1,2; Shuai Wang1,2; Nan Xu1,2; Wenji Mao1,2
出版日期2021-07
会议日期2021.7.11
会议地点Virtual Event
英文摘要

In practical applications of cross-modal retrieval, test queries of the retrieval system may vary greatly and come from unknown category. Meanwhile, due to the cost and difficulty of data collection as well as other issues, the available data for cross-modal retrieval are often imbalanced over different modalities. In this paper, we address two important issues to increase the robustness of cross-modal retrieval system for real-world applications: handling test queries from unknown category and modality-imbalanced training data. The first issue has not been addressed by existing methods and the second issue was not well addressed in the related research. To tackle the above issues, we take the advantage of prototype learning, and propose a prototype-based adaptive network (PAN) for robust cross-modal retrieval. Our method leverages a unified prototype to represent each semantic category across modalities, which provides discriminative information of different categories and takes unified prototypes as anchors to learn cross-modal representations adaptively. Moreover, we propose a novel prototype propagation strategy to reconstruct balanced representations which preserves the semantic consistency and modality heterogeneity. Experimental results on the benchmark datasets demonstrate the effectiveness of our method compared to the SOTA methods, and further robustness tests show the superiority of our method in solving the above issues.

源URL[http://ir.ia.ac.cn/handle/173211/52011]  
专题自动化研究所_复杂系统管理与控制国家重点实验室_互联网大数据与安全信息学研究中心
通讯作者Wenji Mao
作者单位1.Institute of Automation, Chinese Academy of Sciences
2.School of Artifcial Intelligence, University of Chinese Academy of Sciences
推荐引用方式
GB/T 7714
Zhixiong Zeng,Shuai Wang,Nan Xu,et al. PAN: Prototype-based Adaptive Network for Robust Cross-Modal Retrieval[C]. 见:. Virtual Event. 2021.7.11.

入库方式: OAI收割

来源:自动化研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。