Title
Cross-domain comparison of algorithm performance in extracting aspect-based opinions from Chinese online reviews.
Abstract
Extracting aspects and opinions is the basis of sentiment analysis in fine-grained manner. It is often conducted in one of the following two ways: rule-based approaches and machine learning approaches. However, no conclusion has been drawn yet on the matter of multi-domains applicability in Chinese, so robustness and reliability across different fields are being of concern to these algorithms. We compare ten approaches of aspect-opinion extraction on Chinese corpora from seven domains. The compared methods include TF-based model plus POS, CRFs-based opinion mining, SVM-based opinion mining, MNB-based opinion mining, HMM-based opinion mining, RFM-based opinion mining, RNN-based opinion mining, KNN-based opinion mining, CART-based opinion mining and LPM-based opinion mining. We collect 3146 Chinese reviews as corpora including digital camera, cosmetics, book, hotel, movie, cellphone and restaurant. Experiments reveal the following results: (1) no algorithm dominates over all domains, (2) machine learning algorithms outperform rule-based approaches, (3) the length of text affects the accuracy of opinion mining negatively for rule-based approaches, while some machine learning methods are good at extracting long reviews, (4) for HMM-based model, RFM-based model, RNN-based model, KNN-based model, CART-based model and LPM-based model, the performances are similar in terms of precision and recall, (5) overall, SVM-based approach performs best among almost all the domains for opinion mining.
Year
DOI
Venue
2017
10.1007/s13042-016-0596-x
Int. J. Machine Learning & Cybernetics
Keywords
Field
DocType
Online review, Product aspect, Opinion extraction, Sentiment analysis, Chinese review
Computer science,Sentiment analysis,Precision and recall,Support vector machine,Algorithm,Robustness (computer science),Artificial intelligence,Hidden Markov model,Machine learning,Opinion extraction,CRFS
Journal
Volume
Issue
ISSN
8
3
1868-808X
Citations 
PageRank 
References 
6
0.39
53
Authors
3
Name
Order
Citations
PageRank
Wei Wang160.73
Guanyin Tan260.39
hongwei wang3368.68