Title
An online and incremental GRLVQ algorithm for prototype generation based on granular computing.
Abstract
In supervised classification, learning vector quantization (LVQ) methods are commonly used due to their intuitive structure based on prototypical instances that reduce considerably the computations in the classification process. Several improvements of LVQ have been proposed based on heuristics including LVQ3, and GLVQ. All these methods use the Euclidean distance to evaluate the similarity between prototypes and objects, which may be inappropriate if features are not equally scaled. Metric adaption techniques try to alleviate this problem by learning discriminative distance measures from the training data. Generalized relevance learning vector quantization is one of such improvements. However, in big data problems LVQ algorithms require incremental learning mechanisms. This paper introduces an LVQ-algorithm based on granular computing for prototye-based classification equipped with incremental learning mechanisms. The proposed algorithm is able to group entities with similar features, and at the same time proposes new prototypes to better cover the class distribution with prototyping elements. Two steps for the automatic control of prototypes are proposed: the first one controls the number of prototypes by a usage-frequency indicator; whereas the second one, is designed to learn the relevance of data dimensions, producing an automatic pruning of useless dimensions, avoiding a high computational load and increasing the interpretability of the resulting model. The proposed method is evaluated in benchmark data and obtains competitive performance with state-of-the-art solutions. In the case of big data sets, we obtained the best accuracy rate of about 72 % with a good compression rate of around 94 %.
Year
DOI
Venue
2017
10.1007/s00500-016-2042-0
Soft Comput.
Keywords
Field
DocType
GRLVQ, Online, Incremental, Big data
Data mining,Computer science,Heuristics,Vector quantization,Artificial intelligence,Discriminative model,Interpretability,Mathematical optimization,Euclidean distance,Learning vector quantization,Algorithm,Granular computing,Machine learning,Distance measures
Journal
Volume
Issue
ISSN
21
14
1433-7479
Citations 
PageRank 
References 
3
0.37
21
Authors
2
Name
Order
Citations
PageRank
Israel Cruz-Vega1276.05
Hugo Jair Escalante293973.89