A view on "big data" and its relation to Informetrics
ROUSSEAU, Ronald; Ronald ROUSSEAU (E-mail:ronald.rousseau@khbo.be)
Source PublicationChinese Journal of Library and Information Science

Purpose: big data offer a huge challenge. Their very existence leads to the contradiction that the more data we have the less accessible they become, as the particular piece of information one is searching for may be buried among terabytes of other data. In this contribution we discuss the origin of big data and point to three challenges when big data arise: Data storage, data processing and generating insights.

Design/methodology/approach: Computer-related challenges can be expressed by the CAP theorem which states that it is only possible to simultaneously provide any two of the three following properties in distributed applications: Consistency (C), availability (A) and partition tolerance (P). As an aside we mention Amdahl’s law and its application for scientific collaboration. We further discuss data mining in large databases and knowledge representation for handling the results of data mining exercises. We further offer a short informetric study of the field of big data, and point to the ethical dimension of the big data phenomenon.

Findings: There still are serious problems to overcome before the field of big data can deliver on its promises.

Implications and limitations: This contribution offers a personal view, focusing on the information science aspects, but much more can be said about software aspects.

Originality/value: We express the hope that the information scientists, including librarians, will be able to play their full role within the knowledge discovery, data mining and big data communities, leading to exciting developments, the reduction of scientific bottlenecks and really innovative applications.

KeywordBig Data Cap Theorem Knowledge Representation Data Mining Ethical Concerns
Subject Area编辑出版
Document Type期刊论文
CollectionJournal of Data and Information Science_Chinese Journal of Library and Information Science-2012
Corresponding AuthorRonald ROUSSEAU (E-mail:ronald.rousseau@khbo.be)
Recommended Citation
GB/T 7714
ROUSSEAU, Ronald,Ronald ROUSSEAU . A view on "big data" and its relation to Informetrics[J]. Chinese Journal of Library and Information Science,2012,5(3):12-26.
APA ROUSSEAU, Ronald,&Ronald ROUSSEAU .(2012).A view on "big data" and its relation to Informetrics.Chinese Journal of Library and Information Science,5(3),12-26.
MLA ROUSSEAU, Ronald,et al."A view on "big data" and its relation to Informetrics".Chinese Journal of Library and Information Science 5.3(2012):12-26.
Files in This Item: Download All
File Name/Size DocType Version Access License
12-26-Ronald Roussea(931KB) 开放获取LicenseView Download
Related Services
Recommend this item
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[ROUSSEAU, Ronald]'s Articles
[Ronald ROUSSEAU (E-mail:ronald.rousseau@khbo.be)]'s Articles
Baidu academic
Similar articles in Baidu academic
[ROUSSEAU, Ronald]'s Articles
[Ronald ROUSSEAU (E-mail:ronald.rousseau@khbo.be)]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[ROUSSEAU, Ronald]'s Articles
[Ronald ROUSSEAU (E-mail:ronald.rousseau@khbo.be)]'s Articles
Terms of Use
No data!
Social Bookmark/Share
File name: 12-26-Ronald Rousseau[3].pdf
Format: Adobe PDF
All comments (0)
No comment.

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.