Multimodal graph convolutional networks for high quality content recognition
文献类型:期刊论文
作者 | Wang, Jinguang1; Hu, Jun1; Qian, Shengsheng2![]() ![]() ![]() |
刊名 | NEUROCOMPUTING
![]() |
出版日期 | 2020-10-28 |
卷号 | 412页码:42-51 |
关键词 | High quality content recognition Graph convolutional networks Positive unlabeled learning |
ISSN号 | 0925-2312 |
DOI | 10.1016/j.neucom.2020.04.145 |
通讯作者 | Qian, Shengsheng(shengsheng.qian@nlpr.ia.ac.cn) |
英文摘要 | With the development of the Internet, more and more creators publish articles on social media. How to automatically filter high quality content from a large number of multimedia articles is one of the core functions of information recommendation, search engine, and other systems. However, existing approaches typically suffer from two limitations: (1) They usually model content as word sequences, which ignores the semantics provided by non-consecutive phrases, long-distance word dependency, and visual information. (2) They rely on a large amount of manually annotated data to train a quality assessment model while users may only provide labels of interest in a single class for a small number of samples in reality. To address these limitations, we propose a Multimodal Graph Convolutional Networks (MGCN) to model the semantic representations in a unified framework for High Quality Content Recognition. Instead of viewing text content as word sequences, we convert them into graphs, which can model non-consecutive phrases and long-distance word dependency for better obtaining the composition of semantics. Besides, visual content is also modeled into the graphs to provide complementary semantics. A well-designed graph convolutional network is proposed to capture the semantic representations based on these graphs. Furthermore, we employ a non-negative risk estimator for high quality content recognition and the loss is back-propagated for model learning. Experiments on real data sets validate the effectiveness of our approach. (c) 2020 Elsevier B.V. All rights reserved. |
资助项目 | National Key Research and Development Program of China[2017YFB1002804] ; National Natural Science Foundation of China[61720106006] ; National Natural Science Foundation of China[61802405] ; National Natural Science Foundation of China[61702509] ; National Natural Science Foundation of China[61832002] ; National Natural Science Foundation of China[61936005] ; National Natural Science Foundation of China[61872424] ; Key Research Program of Frontier Sciences, CAS[QYZDJ-SSW-JSC039] ; K.C.Wong Education Foundation |
WOS研究方向 | Computer Science |
语种 | 英语 |
WOS记录号 | WOS:000571637700005 |
出版者 | ELSEVIER |
资助机构 | National Key Research and Development Program of China ; National Natural Science Foundation of China ; Key Research Program of Frontier Sciences, CAS ; K.C.Wong Education Foundation |
源URL | [http://ir.ia.ac.cn/handle/173211/42009] ![]() |
专题 | 自动化研究所_模式识别国家重点实验室_多媒体计算与图形学团队 |
通讯作者 | Qian, Shengsheng |
作者单位 | 1.Hefei Univ Technol, Hefei, Peoples R China 2.Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing, Peoples R China |
推荐引用方式 GB/T 7714 | Wang, Jinguang,Hu, Jun,Qian, Shengsheng,et al. Multimodal graph convolutional networks for high quality content recognition[J]. NEUROCOMPUTING,2020,412:42-51. |
APA | Wang, Jinguang,Hu, Jun,Qian, Shengsheng,Fang, Quan,&Xu, Changsheng.(2020).Multimodal graph convolutional networks for high quality content recognition.NEUROCOMPUTING,412,42-51. |
MLA | Wang, Jinguang,et al."Multimodal graph convolutional networks for high quality content recognition".NEUROCOMPUTING 412(2020):42-51. |
入库方式: OAI收割
来源:自动化研究所
浏览0
下载0
收藏0
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。