NSL OpenIR  > 中国科学院文献情报中心(北京)  > 信息系统部
科技大数据增值丰富化方法研究与工具研发
孔贝贝1; 谢靖1,2; 钱力1,2; 常志军1,2; 吴振新1,2
2019
Source Publication数据分析与知识发现
Volume3Issue:7Pages:113-122
Abstract

目的】解决科技大数据数据源分散、质量不高、内容单薄等问题。【方法】采用数据清洗、实体对齐、实体字段融合、冲突检测等增值计算方法, 设计开发一套科技大数据增值丰富化的工具。【结果】通过本文研发的丰富化工具, 在人员、机构、会议、期刊实体及实体关系层面实现实体数据对齐, 实体字段内容增加5-10倍, 实体分析维度提升2-3倍。【局限】增值数据的及时性、规范性需要结合服务需求在实际应用中不断优化提升。【结论】研究成果提升了科技大数据知识发现平台以及相关情报智能分析系统的数据服务维度及深度。

Other Abstract

[Objective] This paper tries to address the issues facing sci-tech big data, such as source dispersal, low quality, and poor content. [Methods] We used value-added computing methods, such as data cleansing, entity alignment, entity field fusion, conflict detection, etc., to develop tools for the enrichment of sci-tech big data. [Results] The developed tools achieved entity data alignment at the levels of personnel, organization, conference, journal and relationship among them. The contents of the entity fields were increased by 5 to 10 times, and the entity analysis dimension was increased by 2 to 3 times. [Limitations] The timeliness and standardization of value-added data need to be optimized and improved based on service needs. [Conclusions] The proposed methods and tools enhance the knowledge discovery of the sci-tech big data and intelligent information analysis systems.

Keyword科技大数据 数据增值 丰富化方法
DOI10.11925/infotech.2096-3467.2018.1355.
URL查看原文
Indexed ByCSCD ; CSSCI ; 中文核心期刊要目总览
Language中文
Citation statistics
Document Type期刊论文
Identifierhttp://ir.las.ac.cn/handle/12502/10539
Collection中国科学院文献情报中心(北京)_信息系统部
Corresponding Author谢靖
Affiliation1.中国科学院文献情报中心
2.中国科学院大学
First Author Affilication中国科学院文献情报中心
Corresponding Author Affilication中国科学院文献情报中心
Recommended Citation
GB/T 7714
孔贝贝,谢靖,钱力,等. 科技大数据增值丰富化方法研究与工具研发[J]. 数据分析与知识发现,2019,3(7):113-122.
APA 孔贝贝,谢靖,钱力,常志军,&吴振新.(2019).科技大数据增值丰富化方法研究与工具研发.数据分析与知识发现,3(7),113-122.
MLA 孔贝贝,et al."科技大数据增值丰富化方法研究与工具研发".数据分析与知识发现 3.7(2019):113-122.
Files in This Item: Download All
File Name/Size DocType Version Access License
科技大数据增值丰富化方法研究与工具研发_(1138KB)期刊论文作者接受稿开放获取CC BY-NC-SAView Download
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[孔贝贝]'s Articles
[谢靖]'s Articles
[钱力]'s Articles
Baidu academic
Similar articles in Baidu academic
[孔贝贝]'s Articles
[谢靖]'s Articles
[钱力]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[孔贝贝]'s Articles
[谢靖]'s Articles
[钱力]'s Articles
Terms of Use
No data!
Social Bookmark/Share
File name: 科技大数据增值丰富化方法研究与工具研发_孔贝贝.pdf
Format: Adobe PDF
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.