Title
Learning data structure from classes: A case study applied to population genetics
Abstract
In most cases, the main goal of machine learning and data mining applications is to obtain good classifiers. However, final users, for instance researchers in other fields, sometimes prefer to infer new knowledge about their domain that may be useful to confirm or reject their hypotheses. This paper presents a learning method that works along these lines, in addition to reporting three interesting applications in the field of population genetics in which the aim is to discover relationships between species or breeds according to their genotypes. The proposed method has two steps: first it builds a hierarchical clustering of the set of classes and then a hierarchical classifier is learned. Both models can be analyzed by experts to extract useful information about their domain. In addition, we propose a new method for learning the hierarchical classifier. By means of a voting scheme employing pairwise binary models constrained by the hierarchical structure, the proposed classifier is computationally more efficient than previous approaches while improving on their performance.
Year
DOI
Venue
2012
10.1016/j.ins.2011.12.022
Inf. Sci.
Keywords
Field
DocType
good classifier,population genetics,hierarchical structure,new knowledge,hierarchical clustering,hierarchical classifier,new method,case study,data structure,useful information,proposed classifier,data mining application,clustering
Hierarchical clustering,Pairwise comparison,Data mining,Data structure,Voting,Computer science,Artificial intelligence,Hierarchical classifier,Classifier (linguistics),Cluster analysis,Machine learning,Binary number
Journal
Volume
ISSN
Citations 
193,
0020-0255
0
PageRank 
References 
Authors
0.34
25
4
Name
Order
Citations
PageRank
J. J. del Coz150.79
Jorge Díez225020.46
Antonio Bahamonde333531.96
Félix Goyache4333.33