Title
A Comparative Study on Two Ground Truth Inference Algorithms based on Manually Labeled Social Media Data
Abstract
In the booming information era, smart devices such as smart phones accompany peoples' lives all the time. Social media platforms provide users with uninterrupted communication and information acquisition including posting users' feelings and sharing ideas. This study focuses on short texts posted by users. Their true meaning is defined as ground truth. However, acquiring it from the users directly is extremely difficult and time-consuming. In other words, in many cases, short texts do not have their ground truth. Thus, we deal with a no ground truth problem. In this work, we ask for labelers to label short texts completely based on their own judgment of these texts. Two ground truth inference approaches, majority voting (MV) and positive label frequency threshold (PLAT), integrate the labels from different labelers and deduce the ground truth. We then analyze which one better suits for labeling unlabeled short texts. The work is of great significance in helping us obtain useful knowledge from massive social media data.
Year
DOI
Venue
2019
10.1109/ICNSC.2019.8743287
2019 IEEE 16th International Conference on Networking, Sensing and Control (ICNSC)
Keywords
Field
DocType
Social media data,short text classification,ground truth inference algorithms
Social media,Ask price,Information retrieval,Inference,Computer science,Information acquisition,Control engineering,Ground truth,Majority rule,Feeling
Conference
ISSN
ISBN
Citations 
1810-7869
978-1-7281-0085-2
0
PageRank 
References 
Authors
0.34
15
4
Name
Order
Citations
PageRank
Xiaoyu Lu1105.31
MengChu Zhou28989534.94
Haoyue Liu300.34
Liang Qi415627.14