Heuristics based semantic annotation of biodiversity documents in Chinese
DUAN Yufeng; HEI Zhenzhen; JU Fei; CUI Hong; Duan Yufeng (E-mail:yfduan@infor.ecnu.edu.cn)
2013-06-25
Journal: Chinese Journal of Library and Information Science
ISSN: 1674-3393
Volume: 6; Issue: 2; Pages: 33-46
Abstract
Purpose: To design an efficient, high-performance algorithm for the semantic annotation of biodiversity documents in Chinese.

Design/methodology/approach: The data set consists of 1,000 randomly selected documents from Flora of China. The proposed approach was evaluated against the Naïve Bayes algorithm, which had been developed earlier for the same purpose.

Findings: Experimental results show that the heuristics-based algorithm outperformed the Naïve Bayes algorithm. The use of leading words helped improve annotation performance, while prioritizing rule application by rule weight had no significant impact on algorithm performance.

Research limitations: ICTCLAS was used off the shelf to identify word boundaries, without optimization for the biodiversity domain, so the tool may not have been used to its full potential.

Practical implications & Originality/value: The heuristics-based approach, enhanced by leading word analysis, reached an F value of 0.9216, which is sufficiently accurate for practical use.
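
As an illustration only, the idea of annotating clauses by their leading words and scoring the result with an F value can be sketched as follows. This record does not describe the actual rules, so the leading words, labels, the inheritance fallback, and all function names below are hypothetical, and the F value is assumed to be the balanced F measure (the harmonic mean of precision and recall).

```python
from typing import Dict, List, Optional

# Hypothetical leading words mapped to hypothetical semantic labels.
# These are illustrative assumptions, not the rules used in the paper.
LEADING_WORDS: Dict[str, str] = {
    "叶": "leaf",
    "花": "flower",
    "果": "fruit",
    "茎": "stem",
}


def annotate_clause(clause: str, previous: Optional[str] = None) -> Optional[str]:
    """Label a clause by its leading word.

    If no leading word matches, the clause inherits the previous label
    (an assumed fallback, chosen here only to keep the sketch simple).
    """
    text = clause.strip()
    for word, label in LEADING_WORDS.items():
        if text.startswith(word):
            return label
    return previous


def annotate(clauses: List[str]) -> List[Optional[str]]:
    """Annotate clauses in order, carrying the last label forward."""
    labels: List[Optional[str]] = []
    previous: Optional[str] = None
    for clause in clauses:
        previous = annotate_clause(clause, previous)
        labels.append(previous)
    return labels


def f_value(predicted: List[Optional[str]], gold: List[Optional[str]]) -> float:
    """Balanced F measure over labelled clauses (assumed definition)."""
    correct = sum(1 for p, g in zip(predicted, gold) if p is not None and p == g)
    n_predicted = sum(1 for p in predicted if p is not None)
    n_gold = sum(1 for g in gold if g is not None)
    if correct == 0:
        return 0.0
    precision = correct / n_predicted
    recall = correct / n_gold
    return 2 * precision * recall / (precision + recall)
```

Calling `annotate` on a list of clauses and passing the result to `f_value` together with hand-labelled gold annotations gives a single score comparable in kind, though not in value, to the one reported above.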
Keywords: Heuristics-based method; Leading word analysis; Taxonomic descriptions; Semantic annotation
Subject area: Editing and publishing
Funding: This work is jointly supported by the National Social Science Foundation of China (Grant No.: 11BTQ024) and the Foundation for Humanities and Social Sciences of the Chinese Ministry of Education (Grant No.: 10YJC87004).
Document type: Journal article
Item identifier: http://ir.las.ac.cn/handle/12502/6238
Collection: Journal of Data and Information Science_Chinese Journal of Library and Information Science-2013
Corresponding author: Duan Yufeng (E-mail: yfduan@infor.ecnu.edu.cn)
Recommended citation
GB/T 7714
DUAN Yufeng,HEI Zhenzhen,JU Fei,et al. Heuristics based semantic annotation of biodiversity documents in Chinese[J]. Chinese Journal of Library and Information Science,2013,6(2):33-46.
APA: DUAN Yufeng, HEI Zhenzhen, JU Fei, CUI Hong, & Duan Yufeng. (2013). Heuristics based semantic annotation of biodiversity documents in Chinese. Chinese Journal of Library and Information Science, 6(2), 33-46.
MLA: DUAN Yufeng, et al. "Heuristics based semantic annotation of biodiversity documents in Chinese". Chinese Journal of Library and Information Science 6.2 (2013): 33-46.
Files in this item
File name/size  Document type  Version type  Access type  License
DUAN Yufeng.pdf (1903KB)  Open access  --