中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Deep Modality Assistance Co-Training Network for Semi-Supervised Multi-Label Semantic Decoding

文献类型:期刊论文

作者Dan Li; Changde Du; Haibao Wang; Qiongyi Zhou; Huiguang He
刊名IEEE Transactions on Multimedia
出版日期2022-07
卷号24页码:3287-3299
DOI10.1109/TMM.2021.3104980
英文摘要

Multi-label semantic decoding is a challenging task with great scientific significance and application value. The existing methods mainly focus on label learning and ignore the amount of information contained in the sample itself,especially non-image sample,which may limit their performance. To address these issues,we propose a novel semi-supervised modality assistance co-training network,which utilizes image modality to assist non-image modality for multi-label learning. In real application,there are two thorny issues: (i) non-image modality tends to be missing owing to the difficulty in obtaining them; (ii) although the image modality is easy to obtain from the Internet,image label annotation is still time-consuming and expensive. Therefore,the proposed method utilizes a small number of paired & labeled images and non-image modalities,and a large number of unpaired & unlabeled images from web sources to improve results. It consists of the modality-specific feature generators,the feature translators and the label relationship network. Specifically,the modality-specific feature generators are used to generate different features (views) for each modality. Semantic translators are employed to capture the relationship between the paired modalities and impute the missing modality feature by using unpaired & unlabeled images. Label relation network is a graph convolution network (GCN) aiming to capture the correlation between labels. To mine the information in unlabeled features,the co-training mechanism is considered. With this mechanism,we introduce a multi-view orthogonality constraint and a multi-label co-regularization constraint. Extensive experiments on three computer vision and neuroscience datasets demonstrate the effectiveness of the proposed method.

源URL[http://ir.ia.ac.cn/handle/173211/50896]  
专题类脑智能研究中心_神经计算及脑机交互
通讯作者Huiguang He
推荐引用方式
GB/T 7714
Dan Li,Changde Du,Haibao Wang,et al. Deep Modality Assistance Co-Training Network for Semi-Supervised Multi-Label Semantic Decoding[J]. IEEE Transactions on Multimedia,2022,24:3287-3299.
APA Dan Li,Changde Du,Haibao Wang,Qiongyi Zhou,&Huiguang He.(2022).Deep Modality Assistance Co-Training Network for Semi-Supervised Multi-Label Semantic Decoding.IEEE Transactions on Multimedia,24,3287-3299.
MLA Dan Li,et al."Deep Modality Assistance Co-Training Network for Semi-Supervised Multi-Label Semantic Decoding".IEEE Transactions on Multimedia 24(2022):3287-3299.

入库方式: OAI收割

来源:自动化研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。