Heuristics based semantic annotation of biodiversity documents in Chinese
DUAN Yufeng; HEI Zhenzhen; JU Fei; CUI Hong; Duan Yufeng (E-mail:yfduan@infor.ecnu.edu.cn)
2013-06-25
Source PublicationChinese Journal of Library and Information Science
ISSN1674-3393
Volume6Issue:2Pages:33-46
AbstractPurpose: To design an efficient high-performance algorithm for semantic annotation of biodiversity documents in Chinese.

Design/methodology/approach: Data set consists of 1,000 randomly selected documents from Flora of China. Comparative evaluation of the proposed approach with the Na ve Bayes algorithm have been developed before for the same purpose.

Findings: Experimental results show that the heuristics based algorithm outperformed the Na ve Bayes algorithm. The use of leading words helped improving the annotation performance while prioritizing rule application based on their weights had no significant impact on algorithm performance.

Research limitations: The ICTCLAS was used to identify word boundaries off-shelf without optimatization for biodiversity domain. This may have not made the best use of the tool.

Practical implications & Originality/value: The performance of heuristics based approach, enhanced by leading words analysis, reached an F value of 0.9216, which is sufficiently accurate for practical use.; Purpose: To design an efficient high-performance algorithm for semantic annotation of biodiversity documents in Chinese.

Design/methodology/approach: Data set consists of 1,000 randomly selected documents from Flora of China. Comparative evaluation of the proposed approach with the Na ve Bayes algorithm have been developed before for the same purpose.

Findings: Experimental results show that the heuristics based algorithm outperformed the Na ve Bayes algorithm. The use of leading words helped improving the annotation performance while prioritizing rule application based on their weights had no significant impact on algorithm performance.

Research limitations: The ICTCLAS was used to identify word boundaries off-shelf without optimatization for biodiversity domain. This may have not made the best use of the tool.

Practical implications & Originality/value: The performance of heuristics based approach, enhanced by leading words analysis, reached an F value of 0.9216, which is sufficiently accurate for practical use.
KeywordHeuritistics Based Method Leading Word Analysis Taxonomic Descriptions Semantic Annotation
Subject Area编辑出版
URL查看原文
Funding OrganizationThis work is jointly supported by the National Social Science Foundation of China (Grant No:11BTQ024) and the Foundation for Humanities and Social Sciences of the Chinese Ministry of Education (Grant No:10YJC87004)
Document Type期刊论文
Identifierhttp://ir.las.ac.cn/handle/12502/6238
CollectionJournal of Data and Information Science_Chinese Journal of Library and Information Science-2013
Corresponding AuthorDuan Yufeng (E-mail:yfduan@infor.ecnu.edu.cn)
Recommended Citation
GB/T 7714
DUAN Yufeng,HEI Zhenzhen,JU Fei,et al. Heuristics based semantic annotation of biodiversity documents in Chinese[J]. Chinese Journal of Library and Information Science,2013,6(2):33-46.
APA DUAN Yufeng,HEI Zhenzhen,JU Fei,CUI Hong,&Duan Yufeng .(2013).Heuristics based semantic annotation of biodiversity documents in Chinese.Chinese Journal of Library and Information Science,6(2),33-46.
MLA DUAN Yufeng,et al."Heuristics based semantic annotation of biodiversity documents in Chinese".Chinese Journal of Library and Information Science 6.2(2013):33-46.
Files in This Item: Download All
File Name/Size DocType Version Access License
DUAN Yufeng.pdf(1903KB) 开放获取--View Download
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[DUAN Yufeng]'s Articles
[HEI Zhenzhen]'s Articles
[JU Fei]'s Articles
Baidu academic
Similar articles in Baidu academic
[DUAN Yufeng]'s Articles
[HEI Zhenzhen]'s Articles
[JU Fei]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[DUAN Yufeng]'s Articles
[HEI Zhenzhen]'s Articles
[JU Fei]'s Articles
Terms of Use
No data!
Social Bookmark/Share
File name: DUAN Yufeng.pdf
Format: Adobe PDF
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.