中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Improving Distant Supervised Relation Extraction with Noise Detection Strategy

文献类型:期刊论文

作者Meng, XY (Meng, Xiaoyan)[ 1,2,3,4 ]; Jiang, TH (Jiang, Tonghai)[ 1,3 ]; Zhou, X (Zhou, Xi)[ 1,2,3 ]; Ma, B (Ma, Bo)[ 1,2,3 ]; Wang, Y (Wang, Yi)[ 1,2,3 ]; Zhao, F (Zhao, Fan)[ 1,2,3 ]
刊名APPLIED SCIENCES-BASEL
出版日期2021
卷号11期号:5页码:1-14
关键词relation extraction distant supervision noise detection
ISSN号2076-3417
DOI10.3390/app11052046
英文摘要

Distant supervised relation extraction (DSRE) is widely used to extract novel relational facts from plain text, so as to improve the knowledge graph. However, distant supervision inevitably suffers from the noisy labeling problem that will severely damage the performance of relation extraction. Currently, most DSRE methods are mainly focused on reducing the weights of noisy sentences, ignoring the bag-level noise where all sentences in a bag are wrongly labeled. In this paper, we present a novel noise detection-based relation extraction approach (NDRE) to automatically detect noisy labels with entity information and dynamically correct them, which can alleviate both instance-level and bag-level noisy problems. By this means, we can extend the dataset from the Web tables without introducing more noise. In this approach, to embed the semantics of sentences from corpus and web tables, we firstly propose a powerful sentence coder that employs an internal multi-head self-attention mechanism between the piecewise max-pooling convolutional neural network. Second, we adopt a noise detection strategy, which is expected to dynamically detect and correct the original noisy label according to the similarity between sentence representation and entity-aware embeddings. Then, we aggregate the information from corpus and web tables to make the final relation prediction. Experimental results on a public benchmark dataset demonstrate that our proposed approach achieves significant improvements over the state-of-the-art baselines and can effectively reduce the noisy labeling problem.

WOS记录号WOS:000627953500001
源URL[http://ir.xjipc.cas.cn/handle/365002/7821]  
专题新疆理化技术研究所_多语种信息技术研究室
通讯作者Jiang, TH (Jiang, Tonghai)[ 1,3 ]
作者单位1.Xinjiang Agr Univ, Coll Comp & Informat Engn, Urumqi 830052, Peoples R China
2.Chinese Acad Sci, Xinjiang Lab Minor Speech & Language Informat Pro, Urumqi 830011, Peoples R China
3.Univ Chinese Acad Sci, Beijing 100049, Peoples R China
4.Chinese Acad Sci, Xinjiang Tech Inst Phys & Chem, Urumqi 830011, Peoples R China
推荐引用方式
GB/T 7714
Meng, XY ,Jiang, TH ,Zhou, X ,et al. Improving Distant Supervised Relation Extraction with Noise Detection Strategy[J]. APPLIED SCIENCES-BASEL,2021,11(5):1-14.
APA Meng, XY ,Jiang, TH ,Zhou, X ,Ma, B ,Wang, Y ,&Zhao, F .(2021).Improving Distant Supervised Relation Extraction with Noise Detection Strategy.APPLIED SCIENCES-BASEL,11(5),1-14.
MLA Meng, XY ,et al."Improving Distant Supervised Relation Extraction with Noise Detection Strategy".APPLIED SCIENCES-BASEL 11.5(2021):1-14.

入库方式: OAI收割

来源:新疆理化技术研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。