Ontology modeling of semantics in social media: Public issue knowledge base (PIKB) of the Weibo
ZHOU Yan; LI Wei; YUAN Xingfu; ZHANG Pengyi; Zhang Pengyi (E-mail: pengyi@pku.edu.cn)
2014-03-25
发表期刊Chinese Journal of Library and Information Science
ISSN1674-3393
卷号7期号:1页码:16-30
摘要
Purpose: This study aims to construct an ontology to model the semantics of social media streams, in particular, trending topics and public issues.

Design/methodology/approach: Our knowledge base included 10 public events and topics from Weibo respectively, which were collected through keyword search and a crawler program. We used a semi-automatic approach to model and annotate the semantics in social media, and adapted the multi-layered ontology to refine the design based on previous researches, then we used named entity recognition (NER) to extract entities to instantiate the ontology. Relationships were extracted based on co-occurrence measures. Finally, we manually conducted post-filtering evaluation and edited the extracted entities and relationships.

Findings: An initial assessment demonstrated that our multi-layered ontology supports various types of queries and analyses in the public issue knowledge base (PIKB), which can serve as an effective tool to query, understand and trace public issues.

Research limitations: Manual involvement cannot meet the requirements for challenges of sustainable developments. Since the relationships extracted are fully based on the co-occurrence of entities, rich semantic relationships, such as how much the key players have been involved, could not be fully reflected. Besides, the user evaluation is necessary for further ontology assessment.

Practical implications: The PIKB can be used by regular Web users and policy makers to query, understand, and make sense of public events and topics. The methodology and reusable ontology model are useful for institutions that are interested in making use of the social media data.

Originality/value: In this study, a multi-layered ontology is applied to model the evolving semantics of public events and trending topics in social media, and the semi-automatic approach could make it possible to extract entities and relationships from large amount of unstructured short texts of user generated content (UGC) from social media.
;
Purpose: This study aims to construct an ontology to model the semantics of social media streams, in particular, trending topics and public issues.

Design/methodology/approach: Our knowledge base included 10 public events and topics from Weibo respectively, which were collected through keyword search and a crawler program. We used a semi-automatic approach to model and annotate the semantics in social media, and adapted the multi-layered ontology to refine the design based on previous researches, then we used named entity recognition (NER) to extract entities to instantiate the ontology. Relationships were extracted based on co-occurrence measures. Finally, we manually conducted post-filtering evaluation and edited the extracted entities and relationships.

Findings: An initial assessment demonstrated that our multi-layered ontology supports various types of queries and analyses in the public issue knowledge base (PIKB), which can serve as an effective tool to query, understand and trace public issues.

Research limitations: Manual involvement cannot meet the requirements for challenges of sustainable developments. Since the relationships extracted are fully based on the co-occurrence of entities, rich semantic relationships, such as how much the key players have been involved, could not be fully reflected. Besides, the user evaluation is necessary for further ontology assessment.

Practical implications: The PIKB can be used by regular Web users and policy makers to query, understand, and make sense of public events and topics. The methodology and reusable ontology model are useful for institutions that are interested in making use of the social media data.

Originality/value: In this study, a multi-layered ontology is applied to model the evolving semantics of public events and trending topics in social media, and the semi-automatic approach could make it possible to extract entities and relationships from large amount of unstructured short texts of user generated content (UGC) from social media.
关键词Ontology Knowledge Organization Public Issue Knowledge Base (Pikb) Public Issues Social Media
学科领域编辑出版
URL查看原文
项目资助者This work is supported by Beijing Thinker Workshop (Grant No. XK201211001).
文献类型期刊论文
条目标识符http://ir.las.ac.cn/handle/12502/6818
专题Journal of Data and Information Science_Chinese Journal of Library and Information Science-2014
通讯作者Zhang Pengyi (E-mail: pengyi@pku.edu.cn)
推荐引用方式
GB/T 7714
ZHOU Yan,LI Wei,YUAN Xingfu,et al. Ontology modeling of semantics in social media: Public issue knowledge base (PIKB) of the Weibo[J]. Chinese Journal of Library and Information Science,2014,7(1):16-30.
APA ZHOU Yan,LI Wei,YUAN Xingfu,ZHANG Pengyi,&Zhang Pengyi .(2014).Ontology modeling of semantics in social media: Public issue knowledge base (PIKB) of the Weibo.Chinese Journal of Library and Information Science,7(1),16-30.
MLA ZHOU Yan,et al."Ontology modeling of semantics in social media: Public issue knowledge base (PIKB) of the Weibo".Chinese Journal of Library and Information Science 7.1(2014):16-30.
条目包含的文件
文件名称/大小 文献类型 版本类型 开放类型 使用许可
2.Yan ZHOU.pdf(3377KB) 开放获取请求全文
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[ZHOU Yan]的文章
[LI Wei]的文章
[YUAN Xingfu]的文章
百度学术
百度学术中相似的文章
[ZHOU Yan]的文章
[LI Wei]的文章
[YUAN Xingfu]的文章
必应学术
必应学术中相似的文章
[ZHOU Yan]的文章
[LI Wei]的文章
[YUAN Xingfu]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。