Title
Sentiment analysis for Chinese reviews of movies in multi-genre based on morpheme-based features and collocations
Abstract
Sentiment analysis, also known as opinion mining, is more difficult to apply to Chinese than to Indo-European languages, owing to the compounding nature of Chinese words and phrases and the relative lack of reliable Chinese resources. This study used Chinese morphemes, mono-syllabic characters that function as individual words or combine to form Chinese words and phrases, as seed words to classify movie reviews found on Yahoo! Taiwan. We built classifiers from collocations with high Pointwise Mutual Information (PMI), which consist of selected morpheme-level compound features. The contributions of this study include the following: (1) proposing a method of generating domain-dependent Chinese morphemes directly from a large data set without any predefined sentiment resources; (2) building morpheme-based classifiers that are applicable to various movie genres and that were shown to produce better results than classifiers based on keywords (NTUSD and HowNet) or feature selection (TFIDF); (3) identifying compounds whose semantic polarity differs depending on context.
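The PMI-based collocation step mentioned in the abstract can be illustrated with a minimal sketch. It assumes that collocation candidates are adjacent character pairs and that probabilities are plain relative frequencies; the function name `pmi_scores`, the `min_count` parameter, and the toy reviews are hypothetical and are not taken from the paper, whose actual selection procedure may differ.

```python
import math
from collections import Counter


def pmi_scores(docs, min_count=1):
    """Return {(x, y): PMI} for adjacent character pairs found in `docs`.

    PMI(x, y) = log2( p(x, y) / (p(x) * p(y)) ), with probabilities
    estimated as relative frequencies (an assumption of this sketch).
    """
    unigrams = Counter()
    bigrams = Counter()
    for doc in docs:
        # Treat each non-whitespace character as one morpheme candidate.
        chars = [c for c in doc if not c.isspace()]
        unigrams.update(chars)
        bigrams.update(zip(chars, chars[1:]))

    n_uni = sum(unigrams.values())
    n_bi = sum(bigrams.values())
    scores = {}
    for (x, y), count in bigrams.items():
        if count < min_count:
            continue  # skip rare pairs, whose PMI estimates are unreliable
        p_xy = count / n_bi
        p_x = unigrams[x] / n_uni
        p_y = unigrams[y] / n_uni
        scores[(x, y)] = math.log2(p_xy / (p_x * p_y))
    return scores


# Hypothetical toy reviews, for illustration only.
toy_reviews = ["好看", "不好看", "很好看", "難看"]
toy_scores = pmi_scores(toy_reviews)
# Pairs sorted from highest to lowest PMI.
ranked_pairs = sorted(toy_scores, key=toy_scores.get, reverse=True)
```

Raising `min_count` is a common way to keep very rare pairs from dominating the ranking, since PMI tends to overrate low-frequency events.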
Year: 2015
DOI: 10.1007/s10796-014-9498-1
Venue: Information Systems Frontiers
Keywords: Sentiment analysis, Opinion mining, Morpheme, Feature, Collocation, Chinese movie
Field: Morpheme, Feature selection, tf–idf, Computer science, Sentiment analysis, Film genre, Computational linguistics, Natural language processing, Artificial intelligence, Pointwise mutual information, Collocation
DocType: Journal
Volume: 17
Issue: 6
ISSN: 1387-3326
Citations: 9
PageRank: 0.43
References: 29
Authors: 2
Name                Order  Citations  PageRank
Hengli Yang         1      344        27.53
August F. Y. Chao   2      11         1.46