A view on "big data" and its relation to Informetrics
ROUSSEAU, Ronald; Ronald ROUSSEAU (E-mail:ronald.rousseau@khbo.be)
2012-11-20
发表期刊Chinese Journal of Library and Information Science
ISSN1674-3393
卷号5期号:3页码:12-26
摘要

Purpose: big data offer a huge challenge. Their very existence leads to the contradiction that the more data we have the less accessible they become, as the particular piece of information one is searching for may be buried among terabytes of other data. In this contribution we discuss the origin of big data and point to three challenges when big data arise: Data storage, data processing and generating insights.

Design/methodology/approach: Computer-related challenges can be expressed by the CAP theorem which states that it is only possible to simultaneously provide any two of the three following properties in distributed applications: Consistency (C), availability (A) and partition tolerance (P). As an aside we mention Amdahl’s law and its application for scientific collaboration. We further discuss data mining in large databases and knowledge representation for handling the results of data mining exercises. We further offer a short informetric study of the field of big data, and point to the ethical dimension of the big data phenomenon.

Findings: There still are serious problems to overcome before the field of big data can deliver on its promises.

Implications and limitations: This contribution offers a personal view, focusing on the information science aspects, but much more can be said about software aspects.

Originality/value: We express the hope that the information scientists, including librarians, will be able to play their full role within the knowledge discovery, data mining and big data communities, leading to exciting developments, the reduction of scientific bottlenecks and really innovative applications.

关键词Big Data Cap Theorem Knowledge Representation Data Mining Ethical Concerns
学科领域编辑出版
URL查看原文
文献类型期刊论文
条目标识符http://ir.las.ac.cn/handle/12502/5605
专题Journal of Data and Information Science_Chinese Journal of Library and Information Science-2012
通讯作者Ronald ROUSSEAU (E-mail:ronald.rousseau@khbo.be)
推荐引用方式
GB/T 7714
ROUSSEAU, Ronald,Ronald ROUSSEAU . A view on "big data" and its relation to Informetrics[J]. Chinese Journal of Library and Information Science,2012,5(3):12-26.
APA ROUSSEAU, Ronald,&Ronald ROUSSEAU .(2012).A view on "big data" and its relation to Informetrics.Chinese Journal of Library and Information Science,5(3),12-26.
MLA ROUSSEAU, Ronald,et al."A view on "big data" and its relation to Informetrics".Chinese Journal of Library and Information Science 5.3(2012):12-26.
条目包含的文件
文件名称/大小 文献类型 版本类型 开放类型 使用许可
12-26-Ronald Roussea(931KB) 开放获取使用许可请求全文
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[ROUSSEAU, Ronald]的文章
[Ronald ROUSSEAU (E-mail:ronald.rousseau@khbo.be)]的文章
百度学术
百度学术中相似的文章
[ROUSSEAU, Ronald]的文章
[Ronald ROUSSEAU (E-mail:ronald.rousseau@khbo.be)]的文章
必应学术
必应学术中相似的文章
[ROUSSEAU, Ronald]的文章
[Ronald ROUSSEAU (E-mail:ronald.rousseau@khbo.be)]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。