Title
Instance selection by genetic-based biological algorithm
Abstract
Instance selection is an important research problem of data pre-processing in the data mining field. The aim of instance selection is to reduce the data size by filtering out noisy data, which may degrade the mining performance, from a given dataset. Genetic algorithms have presented an effective instance selection approach to improve the performance of data mining algorithms. However, current approaches only pursue the simplest evolutionary process based on the most reasonable and simplest rules. In this paper, we introduce a novel instance selection algorithm, namely a genetic-based biological algorithm (GBA). GBA fits a "biological evolution" into the evolutionary process, where the most streamlined process also complies with the reasonable rules. In other words, after long-term evolution, organisms find the most efficient way to allocate resources and evolve. Consequently, we can closely simulate the natural evolution of an algorithm, such that the algorithm will be both efficient and effective. Our experiments are based on comparing GBA with five state-of-the-art algorithms over 50 different domain datasets from the UCI Machine Learning Repository. The experimental results demonstrate that GBA outperforms these baselines, providing the lowest classification error rate and the least storage requirement. Moreover, GBA is very computational efficient, which only requires slightly larger computational cost than GA.
Year
DOI
Venue
2015
10.1007/s00500-014-1339-0
Soft Computing - A Fusion of Foundations, Methodologies and Applications
Keywords
Field
DocType
data reduction,genetic algorithms,machine learning,data mining
Data mining,Biological evolution,Computer science,Baseline (configuration management),Instance selection,Artificial intelligence,Data mining algorithm,Genetic algorithm,Word error rate,Algorithm,Filter (signal processing),Machine learning,Data reduction
Journal
Volume
Issue
ISSN
19
5
1433-7479
Citations 
PageRank 
References 
6
0.45
17
Authors
5
Name
Order
Citations
PageRank
Zong-Yao Chen1393.71
Chih-fong Tsai2125554.93
William Eberle322821.51
Wei-Chao Lin41117.55
Shih-Wen Ke512910.72