Title
Weighting and Sampling Data for Individual Classifiers and Bagging with Genetic Algorithms.
Abstract
An imbalanced or inappropriate dataset can have a negative influence in classification model training. In this paper we present an evolutionary method that effectively weights or samples the tuples from the training dataset and tries to minimize the negative effects from innaprotirate datasets. The genetic algorithm with genotype of real numbers is used to evolve the weights or occurrence number for each learning tuple in the dataset. This technique is used with individual classifiers and in combination with the ensemble technique of bagging, where multiple classification models work together in a classification process. We present two variations – weighting the tuples and sampling the classification tuples. Both variations are experimentally tested in combination with individual classifiers (C4.5 and Naive Bayes methods) and in combination with bagging ensemble. Results show that both variations are promising techniques, as they produced better classification models than methods without weighting or sampling, which is also supported with statistical analysis.
Year
DOI
Venue
2015
10.5220/0005592201800187
IJCCI (ECTA)
Keywords
Field
DocType
Classification,Genetic Algorithm,Instance Selection,Weighting,Bagging
Weighting,Pattern recognition,Naive Bayes classifier,Computer science,Tuple,Bootstrap aggregating,Artificial intelligence,Sampling (statistics),Artificial neural network,Real number,Machine learning,Genetic algorithm
Conference
Volume
ISBN
Citations 
1
978-1-5090-1968-7
0
PageRank 
References 
Authors
0.34
14
3
Name
Order
Citations
PageRank
Saso Karakatic1447.02
Marjan Hericko230544.16
Vili Podgorelec319933.00