Title
Using multiple sources to construct a sentiment sensitive thesaurus for cross-domain sentiment classification
Abstract
We describe a sentiment classification method that is applicable when we do not have any labeled data for a target domain but have some labeled data for multiple other domains, designated as the source domains. We automatically create a sentiment sensitive thesaurus using both labeled and unlabeled data from multiple source domains to find the association between words that express similar sentiments in different domains. The created thesaurus is then used to expand feature vectors to train a binary classifier. Unlike previous cross-domain sentiment classification methods, our method can efficiently learn from multiple source domains. Our method significantly outperforms numerous baselines and returns results that are better than or comparable to previous cross-domain sentiment classification methods on a benchmark dataset containing Amazon user reviews for different types of products.
Year
Venue
Keywords
2011
ACL
amazon user review,similar sentiment,multiple source domain,sentiment sensitive thesaurus,previous cross-domain sentiment classification,source domain,sentiment classification method,different type,unlabeled data,different domain
Field
DocType
Volume
Feature vector,Information retrieval,Binary classification,Computer science,Artificial intelligence,Natural language processing,Labeled data
Conference
P11-1
Citations 
PageRank 
References 
23
0.83
15
Authors
3
Name
Order
Citations
PageRank
danushka bollegala169266.77
David J. Weir284083.84
John Carroll31971222.19