Classification Of Wine Quality With Imbalanced Data - Citegraph

Paper Info

Title
Classification Of Wine Quality With Imbalanced Data

Abstract
We propose a data analysis approach to classify wine into different quality categories. A data set of white wines of 4898 observations obtained from the Minho region in Portugal was used in our analysis. As the occurrence of events in the data set was imbalanced with about 93% of the observations are from one category, we applied the Synthetic Minority Over-Sampling Technique (SMOTE) to over sample the minority class. The balanced data was used to model a classifier that categorizes a wine into three categories as high quality, normal quality, and poor quality. Three different classification techniques were used: decision tree, adaptive boosting (AdaBoost), and random forest. Our experiments show that the random forest technique seems to produce the desired results with the least percentage of error.

Year	DOI	Venue
2016	10.1109/ICIT.2016.7475021	PROCEEDINGS 2016 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY (ICIT)
Keywords	Field	DocType
classification, imbalanced data, SMOTE, wine quality	Data modeling,Decision tree,AdaBoost,Pattern recognition,Prediction algorithms,Boosting (machine learning),Artificial intelligence,Engineering,Random forest,Classifier (linguistics),Wine	Conference
Citations	PageRank	References
0	0.34	0
Authors
4

Authors (4 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Gongzhu Hu	1	351	56.01
Tan Xi	2	0	0.34
Faraz Mohammed	3	0	0.34
Huaikou Miao	4	451	68.03

1