中国科学院文献情报中心机构知识库
Advanced  
NSL OpenIR  > 中国科学院成都文献情报中心  > 信息技术部  > 期刊论文
Title: Empirical study of constructing a knowledge organization system of patent documents using topic modeling
Author: Hu ZY(胡正银) ; Fang S(方曙) ; Liang T(梁田)
Source: Scientometrics
Issued Date: 2014-06-04
Volume: vol.100, Issue:3, Pages:787-799. DOI 10.1007/s11192-014-1328-1
Keyword: Topic model ; Term clumping ; Knowledge organization system ; Text clustering ; Principal Component Analysis
Subject: 信息组织与服务 ; 信息技术
Indexed Type: SCI
Corresponding Author: Hu ZY(胡正银)
English Abstract: A knowledge organization system (KOS) can help easily indicate the deep knowledge structure of a patent document set. Compared to classification code systems, a personalized KOS made up of topics can represent the technology information in a more agile, detailed manner. This paper presents an approach to automatically construct a KOS of patent documents based on term clumping, Latent Dirichlet Allocation (LDA) model, K-Means clustering and Principal Components Analysis (PCA). Term clumping is adopted to generate a better bag-of-words for topic modeling and LDA model is applied to generate raw topics. Then by iteratively using K-Means clustering and PCA on the document set and topics matrix, we generated new upper topics and computed the relationships between topics to construct a KOS. Finally, documents are mapped to the KOS. The nodes of the KOS are topics which are represented by terms and their weights and the leaves are patent documents. We evaluated the approach with a set of Large Aperture Optical Elements (LAOE) patent documents as an empirical study and constructed the LAOE KOS. The method used discovered the deep semantic relationships between the topics and helped better describe the technology themes of LAOE. Based on the KOS, two types of applications were implemented: the automatic classification of patents documents and the categorical refinements above search results.
Language: 英语
Content Type: 期刊论文
URI: http://ir.las.ac.cn/handle/12502/7119
Appears in Collections:中国科学院成都文献情报中心_信息技术部_期刊论文

Files in This Item: Download All
File Name/ File Size Content Type Version Access License
Empirical study of constructing a knowledge organization system of patent documents using topic modeling.pdf(327KB)----开放获取
View Download

Recommended Citation:
Hu ZY,Fang S,Liang T. Empirical study of constructing a knowledge organization system of patent documents using topic modeling[J]. Scientometrics,2014,vol.100(3):787-799. DOI 10.1007/s11192-014-1328-1.
Service
Recommend this item
Sava as my favorate item
Show this item's statistics
Export Endnote File
Google Scholar
Similar articles in Google Scholar
[Hu ZY(胡正银)]'s Articles
[Fang S(方曙)]'s Articles
[Liang T(梁田)]'s Articles
CSDL cross search
Similar articles in CSDL Cross Search
[Hu ZY(胡正银)]‘s Articles
[Fang S(方曙)]‘s Articles
[Liang T(梁田)]‘s Articles
Related Copyright Policies
Null
Social Bookmarking
Add to CiteULike Add to Connotea Add to Del.icio.us Add to Digg Add to Reddit
文件名: Empirical study of constructing a knowledge organization system of patent documents using topic modeling.pdf
格式: Adobe PDF
所有评论 (0)
暂无评论
 
评注功能仅针对注册用户开放,请您登录
您对该条目有什么异议,请填写以下表单,管理员会尽快联系您。
内 容:
Email:  *
单位:
验证码:   刷新
您在IR的使用过程中有什么好的想法或者建议可以反馈给我们。
标 题:
 *
内 容:
Email:  *
验证码:   刷新

Items in IR are protected by copyright, with all rights reserved, unless otherwise indicated.

 

 

Valid XHTML 1.0!
Copyright © 2007-2017  中国科学院文献情报中心 - Feedback
Powered by CSpace