NSL OpenIR  > 中国科学院成都文献情报中心  > 信息技术部
Empirical study of constructing a knowledge organization system of patent documents using topic modeling
Hu ZY(胡正银); Fang S(方曙); Liang T(梁田)
2014-06-04
发表期刊Scientometrics
ISSN1588-2861
卷号vol.100期号:3页码:787-799. DOI 10.1007/s11192-014-1328-1
摘要A knowledge organization system (KOS) can help easily indicate the deep knowledge structure of a patent document set. Compared to classification code systems, a personalized KOS made up of topics can represent the technology information in a more agile, detailed manner. This paper presents an approach to automatically construct a KOS of patent documents based on term clumping, Latent Dirichlet Allocation (LDA) model, K-Means clustering and Principal Components Analysis (PCA). Term clumping is adopted to generate a better bag-of-words for topic modeling and LDA model is applied to generate raw topics. Then by iteratively using K-Means clustering and PCA on the document set and topics matrix, we generated new upper topics and computed the relationships between topics to construct a KOS. Finally, documents are mapped to the KOS. The nodes of the KOS are topics which are represented by terms and their weights and the leaves are patent documents. We evaluated the approach with a set of Large Aperture Optical Elements (LAOE) patent documents as an empirical study and constructed the LAOE KOS. The method used discovered the deep semantic relationships between the topics and helped better describe the technology themes of LAOE. Based on the KOS, two types of applications were implemented: the automatic classification of patents documents and the categorical refinements above search results.
关键词Topic Model Term Clumping Knowledge Organization System Text Clustering Principal Component Analysis
学科领域信息组织与服务 ; 信息技术
收录类别SCI
语种英语
文献类型期刊论文
条目标识符http://ir.las.ac.cn/handle/12502/7119
专题中国科学院成都文献情报中心_信息技术部
通讯作者Hu ZY(胡正银)
推荐引用方式
GB/T 7714
Hu ZY,Fang S,Liang T. Empirical study of constructing a knowledge organization system of patent documents using topic modeling[J]. Scientometrics,2014,vol.100(3):787-799. DOI 10.1007/s11192-014-1328-1.
APA Hu ZY,Fang S,&Liang T.(2014).Empirical study of constructing a knowledge organization system of patent documents using topic modeling.Scientometrics,vol.100(3),787-799. DOI 10.1007/s11192-014-1328-1.
MLA Hu ZY,et al."Empirical study of constructing a knowledge organization system of patent documents using topic modeling".Scientometrics vol.100.3(2014):787-799. DOI 10.1007/s11192-014-1328-1.
条目包含的文件
文件名称/大小 文献类型 版本类型 开放类型 使用许可
Empirical study of c(327KB) 开放获取请求全文
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Hu ZY(胡正银)]的文章
[Fang S(方曙)]的文章
[Liang T(梁田)]的文章
百度学术
百度学术中相似的文章
[Hu ZY(胡正银)]的文章
[Fang S(方曙)]的文章
[Liang T(梁田)]的文章
必应学术
必应学术中相似的文章
[Hu ZY(胡正银)]的文章
[Fang S(方曙)]的文章
[Liang T(梁田)]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。