中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
An integrated document retrieval method combining entity annotation and keyword index: A KIM platform implementation

文献类型:期刊论文

作者XU Xin ; GUO Jinlong ; HONG Yunjia ; JIN Biyi
刊名chinese journal of library and information science
出版日期2013-03-25
卷号6期号:1页码:64-77
关键词Ontology Semantic annotation Semantic retrieval Entity retrieval|KIM
ISSN号1674-3393
通讯作者xu xin (e-mail:xxu@infor.ecnu.edu.cn)
中文摘要

purpose: the objective of this paper is to testify the effect of ontology-based semantic annotation on the performance of document retrieval.

design/methodology/approach: an integrated document retrieval method is put forward in this paper, in which the entities of documents are annotated by the upper ontology and domain ontology, then the documents are further indexed by the entity annotation as well as traditional keywords.

findings: the research result shows that the structured entity retrieval and relation retrieval can be realized by the ontology-based entity index, which is beyond the ability of the tradition keyword-based retrieval. meanwhile, the experiment shows that the recall and precision of document retrieval are improved effectively.

research limitations: due to the small amount of our current tourism domain ontology, the document retrieval with the ontology-based semantic index is limited by the size of ontology and the precision of semantic annotation. meanwhile, the semantic annotation algorithm mainly relies on the current information extraction strategy of kim platform. therefore, the performance of disambiguation and relation extraction algorithm need to be further improved.

practical implications: our method can improve the efficiency of document retrieval system, which facilitates the knowledge and document management in corporations, governments and other organizations.

originality/value: the integrated document retrieval method proposed in the paper can combine the entity index based on the general ontology with domain ontology and the keyword index. our result verified the effectiveness of the combined index strategy.

英文摘要

purpose: the objective of this paper is to testify the effect of ontology-based semantic annotation on the performance of document retrieval.

design/methodology/approach: an integrated document retrieval method is put forward in this paper, in which the entities of documents are annotated by the upper ontology and domain ontology, then the documents are further indexed by the entity annotation as well as traditional keywords.

findings: the research result shows that the structured entity retrieval and relation retrieval can be realized by the ontology-based entity index, which is beyond the ability of the tradition keyword-based retrieval. meanwhile, the experiment shows that the recall and precision of document retrieval are improved effectively.

research limitations: due to the small amount of our current tourism domain ontology, the document retrieval with the ontology-based semantic index is limited by the size of ontology and the precision of semantic annotation. meanwhile, the semantic annotation algorithm mainly relies on the current information extraction strategy of kim platform. therefore, the performance of disambiguation and relation extraction algorithm need to be further improved.

practical implications: our method can improve the efficiency of document retrieval system, which facilitates the knowledge and document management in corporations, governments and other organizations.

originality/value: the integrated document retrieval method proposed in the paper can combine the entity index based on the general ontology with domain ontology and the keyword index. our result verified the effectiveness of the combined index strategy.

学科主题编辑出版
资助信息this work is supported by the national social science foundation of china (grant no. 11ctq003).
原文出处http://www.chinalibraries.net
公开日期2013-04-27
源URL[http://ir.las.ac.cn/handle/12502/6151]  
专题文献情报中心_Journal of Data and Information Science_Chinese Journal of Library and Information Science-2013
推荐引用方式
GB/T 7714
XU Xin,GUO Jinlong,HONG Yunjia,et al. An integrated document retrieval method combining entity annotation and keyword index: A KIM platform implementation[J]. chinese journal of library and information science,2013,6(1):64-77.
APA XU Xin,GUO Jinlong,HONG Yunjia,&JIN Biyi.(2013).An integrated document retrieval method combining entity annotation and keyword index: A KIM platform implementation.chinese journal of library and information science,6(1),64-77.
MLA XU Xin,et al."An integrated document retrieval method combining entity annotation and keyword index: A KIM platform implementation".chinese journal of library and information science 6.1(2013):64-77.

入库方式: OAI收割

来源:文献情报中心

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。