Deep Modality Assistance Co-Training Network for Semi-Supervised Multi-Label Semantic Decoding
文献类型:期刊论文
作者 | Dan Li; Changde Du; Haibao Wang; Qiongyi Zhou; Huiguang He |
刊名 | IEEE Transactions on Multimedia |
出版日期 | 2022-07 |
卷号 | 24页码:3287-3299 |
DOI | 10.1109/TMM.2021.3104980 |
英文摘要 | Multi-label semantic decoding is a challenging task with great scientific significance and application value. The existing methods mainly focus on label learning and ignore the amount of information contained in the sample itself,especially non-image sample,which may limit their performance. To address these issues,we propose a novel semi-supervised modality assistance co-training network,which utilizes image modality to assist non-image modality for multi-label learning. In real application,there are two thorny issues: (i) non-image modality tends to be missing owing to the difficulty in obtaining them; (ii) although the image modality is easy to obtain from the Internet,image label annotation is still time-consuming and expensive. Therefore,the proposed method utilizes a small number of paired & labeled images and non-image modalities,and a large number of unpaired & unlabeled images from web sources to improve results. It consists of the modality-specific feature generators,the feature translators and the label relationship network. Specifically,the modality-specific feature generators are used to generate different features (views) for each modality. Semantic translators are employed to capture the relationship between the paired modalities and impute the missing modality feature by using unpaired & unlabeled images. Label relation network is a graph convolution network (GCN) aiming to capture the correlation between labels. To mine the information in unlabeled features,the co-training mechanism is considered. With this mechanism,we introduce a multi-view orthogonality constraint and a multi-label co-regularization constraint. Extensive experiments on three computer vision and neuroscience datasets demonstrate the effectiveness of the proposed method. |
源URL | [http://ir.ia.ac.cn/handle/173211/50896] |
专题 | 类脑智能研究中心_神经计算及脑机交互 |
通讯作者 | Huiguang He |
推荐引用方式 GB/T 7714 | Dan Li,Changde Du,Haibao Wang,et al. Deep Modality Assistance Co-Training Network for Semi-Supervised Multi-Label Semantic Decoding[J]. IEEE Transactions on Multimedia,2022,24:3287-3299. |
APA | Dan Li,Changde Du,Haibao Wang,Qiongyi Zhou,&Huiguang He.(2022).Deep Modality Assistance Co-Training Network for Semi-Supervised Multi-Label Semantic Decoding.IEEE Transactions on Multimedia,24,3287-3299. |
MLA | Dan Li,et al."Deep Modality Assistance Co-Training Network for Semi-Supervised Multi-Label Semantic Decoding".IEEE Transactions on Multimedia 24(2022):3287-3299. |
入库方式: OAI收割
来源:自动化研究所
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。