Title
Semantic Twitter sentiment analysis based on a fuzzy thesaurus.
Abstract
We define a new, fully automated and domain-independent method for building feature vectors from Twitter text corpus for machine learning sentiment analysis based on a fuzzy thesaurus and sentiment replacement. The proposed method measures the semantic similarity of Tweets with features in the feature space instead of using terms’ presence or frequency feature vectors. Thus, we account for the sentiment of the context instead of just counting sentiment words. We use sentiment replacement to reduce the dimensionality of the feature space and a fuzzy thesaurus to incorporate semantics. Experimental results show that sentiment replacement yields up to 35% reduction in the dimensionality of the feature space. Moreover, feature vectors developed based on a fuzzy thesaurus show improvement of sentiment classification performance with multinomial naïve Bayes and support vector machine classifiers with accuracies of 83 and 85%, respectively, on the Stanford testing dataset. Incorporating the fuzzy thesaurus resulted in the best accuracy compared to the baselines with an increase greater than 3%. Comparable results were obtained with a larger dataset, the STS-Gold, indicating the robustness of the proposed method. Furthermore, comparison of results with previous work shows that the proposed method outperforms other methods reported in the literature using the same benchmark data.
Year
DOI
Venue
2018
10.1007/s00500-017-2994-8
Soft Comput.
Keywords
Field
DocType
Text mining, Fuzzy thesaurus, Semantic analysis, Text context, Twitter sentiment analysis
Semantic similarity,Feature vector,Computer science,Sentiment analysis,Support vector machine,Fuzzy logic,Text corpus,Curse of dimensionality,Robustness (computer science),Artificial intelligence,Natural language processing,Machine learning
Journal
Volume
Issue
ISSN
22
18
1432-7643
Citations 
PageRank 
References 
1
0.35
17
Authors
3
Name
Order
Citations
PageRank
Heba Ismail143.45
B. Belkhouche2464.20
Nazar Zaki313914.31