Title | ||
---|---|---|
Weighting and Sampling Data for Individual Classifiers and Bagging with Genetic Algorithms. |
Abstract | ||
---|---|---|
An imbalanced or inappropriate dataset can have a negative influence in classification model training. In this paper we present an evolutionary method that effectively weights or samples the tuples from the training dataset and tries to minimize the negative effects from innaprotirate datasets. The genetic algorithm with genotype of real numbers is used to evolve the weights or occurrence number for each learning tuple in the dataset. This technique is used with individual classifiers and in combination with the ensemble technique of bagging, where multiple classification models work together in a classification process. We present two variations â weighting the tuples and sampling the classification tuples. Both variations are experimentally tested in combination with individual classifiers (C4.5 and Naive Bayes methods) and in combination with bagging ensemble. Results show that both variations are promising techniques, as they produced better classification models than methods without weighting or sampling, which is also supported with statistical analysis. |
Year | DOI | Venue |
---|---|---|
2015 | 10.5220/0005592201800187 | IJCCI (ECTA) |
Keywords | Field | DocType |
Classification,Genetic Algorithm,Instance Selection,Weighting,Bagging | Weighting,Pattern recognition,Naive Bayes classifier,Computer science,Tuple,Bootstrap aggregating,Artificial intelligence,Sampling (statistics),Artificial neural network,Real number,Machine learning,Genetic algorithm | Conference |
Volume | ISBN | Citations |
1 | 978-1-5090-1968-7 | 0 |
PageRank | References | Authors |
0.34 | 14 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Saso Karakatic | 1 | 44 | 7.02 |
Marjan Hericko | 2 | 305 | 44.16 |
Vili Podgorelec | 3 | 199 | 33.00 |