National Science Library, Chinese Academy of Sciences: Institutional Repository
Title: Heuristics based semantic annotation of biodiversity documents in Chinese
Author: DUAN Yufeng ; HEI Zhenzhen ; JU Fei ; CUI Hong
Source: Chinese Journal of Library and Information Science
Issued Date: 2013-06-25
Volume: 6, Issue:2, Pages:33-46
Keyword: Heuristics based method ; Leading word analysis ; Taxonomic descriptions ; Semantic annotation
Subject: Editing and Publishing
Corresponding Author: Duan Yufeng (E-mail:yfduan@infor.ecnu.edu.cn)
Abstract: Purpose: To design an efficient high-performance algorithm for semantic annotation of biodiversity documents in Chinese.

Design/methodology/approach: The data set consists of 1,000 randomly selected documents from Flora of China. The proposed approach was evaluated against a Naïve Bayes algorithm developed earlier for the same purpose.

Findings: Experimental results show that the heuristics-based algorithm outperformed the Naïve Bayes algorithm. The use of leading words helped improve annotation performance, while prioritizing rule application by rule weight had no significant impact on algorithm performance.

Research limitations: ICTCLAS was used off the shelf to identify word boundaries, without optimization for the biodiversity domain, so the tool may not have been used to best effect.

Practical implications & Originality/value: The heuristics-based approach, enhanced by leading word analysis, reached an F value of 0.9216, which is sufficiently accurate for practical use.
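
Note on the metric: the F value cited in the abstract is conventionally the weighted harmonic mean of precision and recall. The sketch below assumes the balanced (F1) form by default; this record does not state which variant the authors used or what precision and recall they obtained, so the function name `f_measure` and the sample inputs are illustrative assumptions, not figures from the paper.

```python
def f_measure(precision: float, recall: float, beta: float = 1.0) -> float:
    """Weighted harmonic mean of precision and recall (F1 when beta == 1)."""
    if precision < 0 or recall < 0:
        raise ValueError("precision and recall must be non-negative")
    denominator = beta ** 2 * precision + recall
    if denominator == 0:
        # Both precision and recall are zero: define F as 0 to avoid dividing by zero.
        return 0.0
    return (1 + beta ** 2) * precision * recall / denominator


# Hypothetical inputs chosen only to show the call; not results from the paper.
print(round(f_measure(precision=0.9, recall=0.8), 4))
```

Because the harmonic mean is pulled toward the smaller of its two inputs, a high F value requires both precision and recall to be high.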
Related URLs: View original text
Content Type: Journal article
URI: http://ir.las.ac.cn/handle/12502/6238
Appears in Collections: Chinese Journal of Library and Information Science-2013_Journal articles

Files in This Item:
File Name: DUAN Yufeng.pdf
File Size: 1903KB
Access: Open access

Recommended Citation:
DUAN Yufeng,HEI Zhenzhen,JU Fei,et al. Heuristics based semantic annotation of biodiversity documents in Chinese[J]. Chinese Journal of Library and Information Science,2013,6(2):33-46.
File name: DUAN Yufeng.pdf
Format: Adobe PDF

Items in IR are protected by copyright, with all rights reserved, unless otherwise indicated.

 

 

Copyright © 2007-2017 National Science Library, Chinese Academy of Sciences