NSL OpenIR  > 中国科学院成都文献情报中心  > 信息技术部
Empirical study of constructing a knowledge organization system of patent documents using topic modeling
Hu ZY(胡正银); Fang S(方曙); Liang T(梁田)
2014-06-04
Source PublicationScientometrics
ISSN1588-2861
Volumevol.100Issue:3Pages:787-799. DOI 10.1007/s11192-014-1328-1
AbstractA knowledge organization system (KOS) can help easily indicate the deep knowledge structure of a patent document set. Compared to classification code systems, a personalized KOS made up of topics can represent the technology information in a more agile, detailed manner. This paper presents an approach to automatically construct a KOS of patent documents based on term clumping, Latent Dirichlet Allocation (LDA) model, K-Means clustering and Principal Components Analysis (PCA). Term clumping is adopted to generate a better bag-of-words for topic modeling and LDA model is applied to generate raw topics. Then by iteratively using K-Means clustering and PCA on the document set and topics matrix, we generated new upper topics and computed the relationships between topics to construct a KOS. Finally, documents are mapped to the KOS. The nodes of the KOS are topics which are represented by terms and their weights and the leaves are patent documents. We evaluated the approach with a set of Large Aperture Optical Elements (LAOE) patent documents as an empirical study and constructed the LAOE KOS. The method used discovered the deep semantic relationships between the topics and helped better describe the technology themes of LAOE. Based on the KOS, two types of applications were implemented: the automatic classification of patents documents and the categorical refinements above search results.
KeywordTopic Model Term Clumping Knowledge Organization System Text Clustering Principal Component Analysis
Subject Area信息组织与服务 ; 信息技术
Indexed BySCI
Language英语
Document Type期刊论文
Identifierhttp://ir.las.ac.cn/handle/12502/7119
Collection中国科学院成都文献情报中心_信息技术部
Corresponding AuthorHu ZY(胡正银)
Recommended Citation
GB/T 7714
Hu ZY,Fang S,Liang T. Empirical study of constructing a knowledge organization system of patent documents using topic modeling[J]. Scientometrics,2014,vol.100(3):787-799. DOI 10.1007/s11192-014-1328-1.
APA Hu ZY,Fang S,&Liang T.(2014).Empirical study of constructing a knowledge organization system of patent documents using topic modeling.Scientometrics,vol.100(3),787-799. DOI 10.1007/s11192-014-1328-1.
MLA Hu ZY,et al."Empirical study of constructing a knowledge organization system of patent documents using topic modeling".Scientometrics vol.100.3(2014):787-799. DOI 10.1007/s11192-014-1328-1.
Files in This Item: Download All
File Name/Size DocType Version Access License
Empirical study of c(327KB) 开放获取View Download
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Hu ZY(胡正银)]'s Articles
[Fang S(方曙)]'s Articles
[Liang T(梁田)]'s Articles
Baidu academic
Similar articles in Baidu academic
[Hu ZY(胡正银)]'s Articles
[Fang S(方曙)]'s Articles
[Liang T(梁田)]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Hu ZY(胡正银)]'s Articles
[Fang S(方曙)]'s Articles
[Liang T(梁田)]'s Articles
Terms of Use
No data!
Social Bookmark/Share
File name: Empirical study of constructing a knowledge organization system of patent documents using topic modeling.pdf
Format: Adobe PDF
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.