An integrated document retrieval method combining entity annotation and keyword index: A KIM platform implementation
XU Xin; GUO Jinlong; HONG Yunjia; JIN Biyi; Xu Xin (E-mail:xxu@infor.ecnu.edu.cn)
2013-03-25
Source PublicationChinese Journal of Library and Information Science
ISSN1674-3393
Volume6Issue:1Pages:64-77
Abstract

Purpose: The objective of this paper is to testify the effect of ontology-based semantic annotation on the performance of document retrieval.

Design/methodology/approach: An integrated document retrieval method is put forward in this paper, in which the entities of documents are annotated by the upper ontology and domain ontology, then the documents are further indexed by the entity annotation as well as traditional keywords.

Findings: The research result shows that the structured entity retrieval and relation retrieval can be realized by the ontology-based entity index, which is beyond the ability of the tradition keyword-based retrieval. Meanwhile, the experiment shows that the recall and precision of document retrieval are improved effectively.

Research limitations: Due to the small amount of our current tourism domain ontology, the document retrieval with the ontology-based semantic index is limited by the size of ontology and the precision of semantic annotation. Meanwhile, the semantic annotation algorithm mainly relies on the current information extraction strategy of KIM Platform. Therefore, the performance of disambiguation and relation extraction algorithm need to be further improved.

Practical implications: Our method can improve the efficiency of document retrieval system, which facilitates the knowledge and document management in corporations, governments and other organizations.

Originality/value: The integrated document retrieval method proposed in the paper can combine the entity index based on the general ontology with domain ontology and the keyword index. Our result verified the effectiveness of the combined index strategy.

;

Purpose: The objective of this paper is to testify the effect of ontology-based semantic annotation on the performance of document retrieval.

Design/methodology/approach: An integrated document retrieval method is put forward in this paper, in which the entities of documents are annotated by the upper ontology and domain ontology, then the documents are further indexed by the entity annotation as well as traditional keywords.

Findings: The research result shows that the structured entity retrieval and relation retrieval can be realized by the ontology-based entity index, which is beyond the ability of the tradition keyword-based retrieval. Meanwhile, the experiment shows that the recall and precision of document retrieval are improved effectively.

Research limitations: Due to the small amount of our current tourism domain ontology, the document retrieval with the ontology-based semantic index is limited by the size of ontology and the precision of semantic annotation. Meanwhile, the semantic annotation algorithm mainly relies on the current information extraction strategy of KIM Platform. Therefore, the performance of disambiguation and relation extraction algorithm need to be further improved.

Practical implications: Our method can improve the efficiency of document retrieval system, which facilitates the knowledge and document management in corporations, governments and other organizations.

Originality/value: The integrated document retrieval method proposed in the paper can combine the entity index based on the general ontology with domain ontology and the keyword index. Our result verified the effectiveness of the combined index strategy.

KeywordOntology Semantic Annotation Semantic Retrieval Entity Retrieval|kim|Retrieval|kim
Subject Area编辑出版
URL查看原文
Funding OrganizationThis work is supported by the National Social Science Foundation of China (Grant No. 11CTQ003).
Document Type期刊论文
Identifierhttp://ir.las.ac.cn/handle/12502/6151
CollectionJournal of Data and Information Science_Chinese Journal of Library and Information Science-2013
Corresponding AuthorXu Xin (E-mail:xxu@infor.ecnu.edu.cn)
Recommended Citation
GB/T 7714
XU Xin,GUO Jinlong,HONG Yunjia,et al. An integrated document retrieval method combining entity annotation and keyword index: A KIM platform implementation[J]. Chinese Journal of Library and Information Science,2013,6(1):64-77.
APA XU Xin,GUO Jinlong,HONG Yunjia,JIN Biyi,&Xu Xin .(2013).An integrated document retrieval method combining entity annotation and keyword index: A KIM platform implementation.Chinese Journal of Library and Information Science,6(1),64-77.
MLA XU Xin,et al."An integrated document retrieval method combining entity annotation and keyword index: A KIM platform implementation".Chinese Journal of Library and Information Science 6.1(2013):64-77.
Files in This Item: Download All
File Name/Size DocType Version Access License
XU Xin.pdf(9785KB) 开放获取LicenseView Download
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[XU Xin]'s Articles
[GUO Jinlong]'s Articles
[HONG Yunjia]'s Articles
Baidu academic
Similar articles in Baidu academic
[XU Xin]'s Articles
[GUO Jinlong]'s Articles
[HONG Yunjia]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[XU Xin]'s Articles
[GUO Jinlong]'s Articles
[HONG Yunjia]'s Articles
Terms of Use
No data!
Social Bookmark/Share
File name: XU Xin.pdf
Format: Adobe PDF
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.