Title
Maximizing Pattern Separation In Discretizing Continuous Features For Classification Purposes
Abstract
Discretization is a fundamental phase for many classification algorithms: it aims at finding a proper set of cutoffs that subdivide a continuous domain into homogeneous intervals; the points in each interval should have a high probability of belonging to the same class. This paper proposes two different approaches for discretization: the first one consists in retrieving the optimal set of separation points through the solution of a proper linear programming problem. Since the optimal solution may require an excessive computational burden, an alternative technique, based on the iterative addition of separation points, is described. The greedy algorithm is evaluated on some artificial datasets and compared with other well-known discretization techniques such as EntMDL. The results of the simulations show the good performances of the novel algorithm in terms both of accuracy of the solution and of computational effort required for its generation.
Year
DOI
Venue
2010
10.1109/IJCNN.2010.5596838
2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010
Keywords
Field
DocType
propulsion,greedy algorithm,greedy algorithms,classification algorithms,linear program,linear programming,learning artificial intelligence
Discretization,Mathematical optimization,Pattern recognition,Homogeneous,Computer science,Greedy algorithm,Artificial intelligence,Linear programming,Statistical classification,Greedy randomized adaptive search procedure,Machine learning
Conference
ISSN
Citations 
PageRank 
1098-7576
1
0.36
References 
Authors
5
2
Name
Order
Citations
PageRank
Enrico Ferrari1163.55
Marco Muselli222024.97