Title
Discovering the Language of Wine Reviews: A Text Mining Account.
Abstract
It is widely held that smells and flavors are impossible to put into words. In this paper we test this claim by seeking predictive patterns in wine reviews, which ostensibly aim to provide guides to perceptual content. Wine reviews have previously been critiqued as random and meaningless. We collected an English corpus of wine reviews with their structured metadata, and applied machine learning techniques to automatically predict the wineu0027s color, grape variety, and country of origin. To train the three supervised classifiers, three different information sources were incorporated: lexical bag-of-words features, domain-specific terminology features, and semantic word embedding features. In addition, using regression analysis we investigated basic review properties, i.e., review length, average word length, and their relationship to the scalar values of price and review score. Our results show that wine experts do share a common vocabulary to describe wines and they use this in a consistent way, which makes it possible to automatically predict wine characteristics based on the review text alone. This means that odors and flavors may be more expressible in language than typically acknowledged.
Year
Venue
Field
2018
LREC
Metadata,Terminology,Computer science,Supervised learning,Natural language processing,Artificial intelligence,Word embedding,Perception,Vocabulary,Wine,Terminology extraction
DocType
Citations 
PageRank 
Conference
0
0.34
References 
Authors
0
5
Name
Order
Citations
PageRank
Els Lefever116623.26
Iris Hendrickx228530.91
Ilja Croijmans301.35
Antal Van Den Bosch41038132.37
Asifa Majid51410.85