An integrated document retrieval method combining entity annotation and keyword index: A KIM platform implementation
XU Xin; GUO Jinlong; HONG Yunjia; JIN Biyi; Xu Xin (E-mail:xxu@infor.ecnu.edu.cn)
2013-03-25
Journal: Chinese Journal of Library and Information Science
ISSN: 1674-3393
Volume: 6, Issue: 1, Pages: 64-77
Abstract

Purpose: This paper tests the effect of ontology-based semantic annotation on document retrieval performance.

Design/methodology/approach: This paper proposes an integrated document retrieval method in which the entities in documents are annotated using an upper ontology and a domain ontology, and the documents are then indexed by both the entity annotations and traditional keywords.
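
The combined indexing idea can be illustrated with a minimal Python sketch. It is not the KIM Platform API and not the authors' implementation: the toy gazetteer `ENTITY_CLASSES`, its class labels, the sample documents, and the helper names `annotate`, `build_indexes` and `search` are all hypothetical, and ontology-based annotation is reduced here to a simple phrase lookup.

```python
import re
from collections import defaultdict

# Hypothetical gazetteer standing in for upper- and domain-ontology lookups.
ENTITY_CLASSES = {
    "shanghai": "City",
    "hangzhou": "City",
    "west lake": "ScenicSpot",
}

# Made-up sample documents, used only for illustration.
SAMPLE_DOCS = {
    "d1": "A weekend trip to West Lake in Hangzhou",
    "d2": "Lake fishing tips for beginners",
    "d3": "Shanghai hotels near the Bund",
}


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def annotate(text):
    """Return the set of (surface form, class) pairs found in the text."""
    padded = " " + " ".join(tokenize(text)) + " "
    return {(name, cls) for name, cls in ENTITY_CLASSES.items()
            if f" {name} " in padded}


def build_indexes(docs):
    """Build a keyword index (token -> doc ids) and an entity index (class -> doc ids)."""
    keyword_index = defaultdict(set)
    entity_index = defaultdict(set)
    for doc_id, text in docs.items():
        for token in tokenize(text):
            keyword_index[token].add(doc_id)
        for _name, cls in annotate(text):
            entity_index[cls].add(doc_id)
    return keyword_index, entity_index


def search(keyword_index, entity_index, keywords=(), entity_classes=()):
    """Return documents matching every keyword AND every entity class."""
    result = None
    for keyword in keywords:
        for token in tokenize(keyword):
            hits = set(keyword_index.get(token, set()))
            result = hits if result is None else result & hits
    for cls in entity_classes:
        hits = set(entity_index.get(cls, set()))
        result = hits if result is None else result & hits
    return result if result is not None else set()
```

In this sketch, a keyword-only query for "lake" returns both d1 and d2, while adding the entity class constraint `ScenicSpot` narrows the result to d1. That is the kind of filtering a keyword index alone cannot express.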

Findings: The results show that an ontology-based entity index enables structured entity retrieval and relation retrieval, which are beyond the ability of traditional keyword-based retrieval. The experiment also shows that the recall and precision of document retrieval are improved.
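
For reference, recall and precision for a single query can be computed as in the short Python sketch below. The function name `precision_recall` and the document ids in the usage note are hypothetical; they do not reproduce the paper's experiment or its results.

```python
def precision_recall(retrieved, relevant):
    """Return (precision, recall) for one query.

    precision = |retrieved AND relevant| / |retrieved|
    recall    = |retrieved AND relevant| / |relevant|
    An empty denominator yields 0.0 by convention in this sketch.
    """
    retrieved, relevant = set(retrieved), set(relevant)
    hits = len(retrieved & relevant)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall
```

For example, if a query returns the made-up ids d1 and d2 and only d1 is relevant, precision is 0.5 and recall is 1.0.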

Research limitations: Because our current tourism domain ontology is small, document retrieval with the ontology-based semantic index is limited by the size of the ontology and the precision of semantic annotation. In addition, the semantic annotation algorithm relies mainly on the current information extraction strategy of the KIM Platform. The performance of the disambiguation and relation extraction algorithms therefore needs further improvement.

Practical implications: Our method can improve the efficiency of document retrieval systems, which facilitates knowledge and document management in corporations, governments and other organizations.

Originality/value: The integrated document retrieval method proposed in this paper combines an entity index based on a general ontology and a domain ontology with a keyword index. Our results verify the effectiveness of the combined index strategy.

Keywords: Ontology; Semantic Annotation; Semantic Retrieval; Entity Retrieval; KIM
Subject area: Editing and Publishing
Funding: This work is supported by the National Social Science Foundation of China (Grant No. 11CTQ003).
Document type: Journal article
Item identifier: http://ir.las.ac.cn/handle/12502/6151
Collection: Journal of Data and Information Science_Chinese Journal of Library and Information Science-2013
Corresponding author: Xu Xin (E-mail: xxu@infor.ecnu.edu.cn)
Recommended citation
GB/T 7714: XU Xin, GUO Jinlong, HONG Yunjia, et al. An integrated document retrieval method combining entity annotation and keyword index: A KIM platform implementation[J]. Chinese Journal of Library and Information Science, 2013, 6(1): 64-77.
APA: XU Xin, GUO Jinlong, HONG Yunjia, JIN Biyi, & Xu Xin. (2013). An integrated document retrieval method combining entity annotation and keyword index: A KIM platform implementation. Chinese Journal of Library and Information Science, 6(1), 64-77.
MLA: XU Xin, et al. "An integrated document retrieval method combining entity annotation and keyword index: A KIM platform implementation". Chinese Journal of Library and Information Science 6.1 (2013): 64-77.
Files in this item
File name/size: XU Xin.pdf (9785KB); Access type: Open access