科技大数据增值丰富化方法研究与工具研发 | |
孔贝贝1![]() ![]() ![]() ![]() | |
2019 | |
Source Publication | 数据分析与知识发现
![]() |
Volume | 3Issue:7Pages:113-122 |
Abstract | 【目的】解决科技大数据数据源分散、质量不高、内容单薄等问题。【方法】采用数据清洗、实体对齐、实体字段融合、冲突检测等增值计算方法, 设计开发一套科技大数据增值丰富化的工具。【结果】通过本文研发的丰富化工具, 在人员、机构、会议、期刊实体及实体关系层面实现实体数据对齐, 实体字段内容增加5-10倍, 实体分析维度提升2-3倍。【局限】增值数据的及时性、规范性需要结合服务需求在实际应用中不断优化提升。【结论】研究成果提升了科技大数据知识发现平台以及相关情报智能分析系统的数据服务维度及深度。 |
Other Abstract | [Objective] This paper tries to address the issues facing sci-tech big data, such as source dispersal, low quality, and poor content. [Methods] We used value-added computing methods, such as data cleansing, entity alignment, entity field fusion, conflict detection, etc., to develop tools for the enrichment of sci-tech big data. [Results] The developed tools achieved entity data alignment at the levels of personnel, organization, conference, journal and relationship among them. The contents of the entity fields were increased by 5 to 10 times, and the entity analysis dimension was increased by 2 to 3 times. [Limitations] The timeliness and standardization of value-added data need to be optimized and improved based on service needs. [Conclusions] The proposed methods and tools enhance the knowledge discovery of the sci-tech big data and intelligent information analysis systems. |
Keyword | 科技大数据 数据增值 丰富化方法 |
DOI | 10.11925/infotech.2096-3467.2018.1355. |
URL | 查看原文 |
Indexed By | CSCD ; CSSCI ; 中文核心期刊要目总览 |
Language | 中文 |
CSCD ID | CSCD:6698711 |
Citation statistics |
Cited Times:1[CSCD]
[CSCD Record]
|
Document Type | 期刊论文 |
Identifier | http://ir.las.ac.cn/handle/12502/10539 |
Collection | 中国科学院文献情报中心(北京)_信息系统部 |
Corresponding Author | 谢靖 |
Affiliation | 1.中国科学院文献情报中心 2.中国科学院大学 |
First Author Affilication | 中国科学院文献情报中心 |
Corresponding Author Affilication | 中国科学院文献情报中心 |
Recommended Citation GB/T 7714 | 孔贝贝,谢靖,钱力,等. 科技大数据增值丰富化方法研究与工具研发[J]. 数据分析与知识发现,2019,3(7):113-122. |
APA | 孔贝贝,谢靖,钱力,常志军,&吴振新.(2019).科技大数据增值丰富化方法研究与工具研发.数据分析与知识发现,3(7),113-122. |
MLA | 孔贝贝,et al."科技大数据增值丰富化方法研究与工具研发".数据分析与知识发现 3.7(2019):113-122. |
Files in This Item: | Download All | |||||
File Name/Size | DocType | Version | Access | License | ||
科技大数据增值丰富化方法研究与工具研发_(1138KB) | 期刊论文 | 作者接受稿 | 开放获取 | CC BY-NC-SA | View Download |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment