A semi-supervised cross-modal memory bank for cross-modal retrieval
文献类型:期刊论文
作者 | Huang, Yingying1,2,3; Hu, Bingliang1![]() ![]() |
刊名 | NEUROCOMPUTING
![]() |
出版日期 | 2024-04-28 |
卷号 | 579 |
关键词 | Common space Cross-modal memory bank Pseudo-labels Class probability |
ISSN号 | 0925-2312;1872-8286 |
DOI | 10.1016/j.neucom.2024.127430 |
产权排序 | 1 |
英文摘要 | The core of semi -supervised cross -modal retrieval tasks lies in leveraging limited supervised information to measure the similarity between cross -modal data. Current approaches assume an association between unlabelled data and pre -defined k -nearest neighbour data, relying on classifier performance for this selection. With diminishing labelled data, classifier performance weakens, resulting in erroneous associations among unlabelled instances. Moreover, the lack of interpretability in class probabilities of unlabelled data hinders classifier learning. Thus, this paper focuses on learning pseudo -labels for unlabelled data, providing pseudosupervision to aid classifier learning. Specifically, a cross -modal memory bank is proposed, dynamically storing feature representations in a common space and class probability representations in a label space for each cross -modal data. Pseudo -labels are derived by computing feature representation similarity and adjusting class probabilities. During this process, imposing constraints on the classification loss between labelled data and contrastive losses between paired cross -modal data is a prerequisite for the successful learning of pseudolabels. This procedure significantly contributes to enhancing the credibility of these pseudo -labels. Empirical findings demonstrate that using only 10% labelled data, compared to prevailing semi -supervised techniques, this method achieves improvements of 2.6%, 1.8%, and 4.9% in MAP@50 on the Wikipedia, NUS -WIDE, and MS-COCO datasets, respectively. |
语种 | 英语 |
WOS记录号 | WOS:001198409500001 |
出版者 | ELSEVIER |
源URL | [http://ir.opt.ac.cn/handle/181661/97392] ![]() |
专题 | 西安光学精密机械研究所_光学影像学习与分析中心 |
通讯作者 | Wang, Quan |
作者单位 | 1.Key Lab Biomed Spect, Xian 710119, Shaanxi, Peoples R China 2.Univ Chinese Acad Sci, Beijing 100049, Peoples R China 3.Chinese Acad Sci, Key Lab Spectral Imaging Technol, Xian Inst Opt & Precis Mech, Xian 710119, Shaanxi, Peoples R China |
推荐引用方式 GB/T 7714 | Huang, Yingying,Hu, Bingliang,Zhang, Yipeng,et al. A semi-supervised cross-modal memory bank for cross-modal retrieval[J]. NEUROCOMPUTING,2024,579. |
APA | Huang, Yingying,Hu, Bingliang,Zhang, Yipeng,Gao, Chi,&Wang, Quan.(2024).A semi-supervised cross-modal memory bank for cross-modal retrieval.NEUROCOMPUTING,579. |
MLA | Huang, Yingying,et al."A semi-supervised cross-modal memory bank for cross-modal retrieval".NEUROCOMPUTING 579(2024). |
入库方式: OAI收割
来源:西安光学精密机械研究所
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。