Title
Fuzzy C-means for english sentiment classification in a distributed system.
Abstract
Sentiment classification plays a significant role in everyday life, in political activities, in activities relating to commodity production, and commercial activities. Finding a solution for the accurate and timely classification of emotions is a challenging task. In this research, we propose a new model for big data sentiment classification in the parallel network environment. Our proposed model uses the Fuzzy C-Means (FCM) method for English sentiment classification with Hadoop MAP (M) /REDUCE (R) in Cloudera. Cloudera is a parallel network environment. Our proposed model can classify the sentiments of millions of English documents in the parallel network environment. We tested our model using the testing data set (which comprised 25,000 English reviews, 12,500 being positive and 12,500 negative) and achieved 60.2 % accuracy. Our English training data set has 60,000 English sentences, comprising 30,000 positive English sentences and 30,000 negative English sentences.
Year
DOI
Venue
2017
10.1007/s10489-016-0858-z
Appl. Intell.
Keywords
Field
DocType
Sentiment classification,English sentiment classification,Opinion mining,English document opinion mining,Fuzzy C-Means,FCM,Cloudera,Parallel environment,Parallel network,Parallel network environment,Distributed system
Training set,Everyday life,Commodity production,Computer science,Sentiment analysis,Fuzzy logic,Emotion classification,Artificial intelligence,Test data,Natural language processing,Big data,Machine learning
Journal
Volume
Issue
ISSN
46
3
0924-669X
Citations 
PageRank 
References 
11
0.49
34
Authors
5
Name
Order
Citations
PageRank
Vo Ngoc Phu1494.43
Nguyen Duy Dat2252.06
Vo Thi Ngoc Tran3140.89
Thi Ngoc Chau Vo4498.68
Tuan V. Nguyen5425.75