中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Set Generation Networks for End-to-End Knowledge Base Population

文献类型:会议论文

作者Sui DB(隋典伯)2,3; Chenhao Wang2,3; Yubo Chen2,3; Kang Liu2,3; Jun Zhao2,3; Wei Bi1
出版日期2021-11
会议日期2021-11
会议地点Online and Punta Cana, Dominican Republic
英文摘要

The task of knowledge base population (KBP) aims to discover facts about entities from texts and expand a knowledge base with these facts. Previous studies shape end-to-end KBP as a machine translation task, which is required to convert unordered fact into a sequence according to a pre-specified order. However, the facts stated in a sentence are unordered in essence. In this paper, we formulate end-to-end KBP as a direct set generation problem, avoiding considering the order of multiple facts. To solve the set generation problem, we propose networks featured by transformers with non-autoregressive parallel decoding. Unlike previous approaches that use an autoregressive decoder to generate facts one by one, the proposed networks can directly output the final set of facts in one shot. Furthermore, to train the networks, we also design a set-based loss that forces unique predictions via bipartite matching. Compared with cross-entropy loss that highly penalizes small shifts in fact order, the proposed bipartite matching loss is invariant to any permutation of predictions. Benefiting from getting rid of the burden of predicting the order of multiple facts, our proposed networks achieve state-of-the-art (SoTA) performance on two benchmark datasets.

源URL[http://ir.ia.ac.cn/handle/173211/48934]  
专题模式识别国家重点实验室_自然语言处理
作者单位1.Tencent AI Lab
2.School of Artificial Intelligence, University of Chinese Academy of Sciences
3.National Laboratory of Pattern Recognition, Institute of Automation, CAS
推荐引用方式
GB/T 7714
Sui DB,Chenhao Wang,Yubo Chen,et al. Set Generation Networks for End-to-End Knowledge Base Population[C]. 见:. Online and Punta Cana, Dominican Republic. 2021-11.

入库方式: OAI收割

来源:自动化研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。