Abstract | ||
---|---|---|
With the rapid development of social media, online users are allowed to share their opinions conveniently. However, the ground truth for sentiments and emotions in social media is often constructed through surveys, hashtags or emoticons, where the labels may contain many errors. There are also amateurs and malicious users expressing offensive opinions or spreading fraudulent reviews, which has been identified as a growing threat to the trustworthiness of online comments. Thus, it is valuable for us to reconcile this noise in the ground truth when training sentiment and emotion classifiers. In this paper, we propose a hidden de-noising classification model (HDCM) that does not need any outsourcing systems or lexicons to estimate the actual sentimental or emotional category of each instance from corpora with noisy labels. The simplicity of assigning the category to a document by users under any contexts, and the authority of a user in assigning categories to documents with various domains are modeled as the unobserved hidden constraints in HDCM. Extensive evaluations using datasets with different scales of noisy labels validate the effectiveness of the proposed model for both sentiment and emotion classification tasks. |
Year | DOI | Venue |
---|---|---|
2016 | 10.1016/j.knosys.2016.08.012 | Knowl.-Based Syst. |
Keywords | Field | DocType |
Hidden de-noising classification model,Sentiment analysis,Emotion detection,Noisy label | Data mining,Social media,Computer science,Sentiment analysis,Trustworthiness,Emotion classification,Outsourcing,Emotion detection,Ground truth,Artificial intelligence,Machine learning,Offensive | Journal |
Volume | Issue | ISSN |
111 | C | 0950-7051 |
Citations | PageRank | References |
5 | 0.41 | 44 |
Authors | ||
6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Yaowei Wang | 1 | 134 | 29.62 |
Yanghui Rao | 2 | 256 | 23.32 |
Xueying Zhan | 3 | 8 | 1.12 |
Huijun Chen | 4 | 15 | 1.26 |
Maoquan Luo | 5 | 5 | 0.41 |
Jian Yin | 6 | 861 | 97.01 |