Title
A numerical refinement operator based on multi-instance learning
Abstract
We present a numerical refinement operator based on multiinstance learning. In the approach, the task of handling numerical variables in a clause is delegated to statistical multi-instance learning schemes. To each clause, there is an associated multi-instance classification model with the numerical variables of the clause as input. Clauses are built in a greedy manner, where each refinement adds new numerical variables which are used additionally to the numerical variables already known to the multi-instance model. In our experiments, we tested this approach with multi-instance learners available in the Weka workbench (like MISVMs). These clauses are used in a boosting approach that can take advantage of the margin information, going beyond standard covering procedures or the discrete boosting of rules, like in SLIPPER. The approach is evaluated on the problem of hexose binding site prediction, a pharmacological application and mutagenicity prediction. In two of the three applications, the task is to find configurations of points with certain properties in 3D space that characterize either a binding site or drug activity: the logical part of the clause constitutes the points with their properties, whereas the multi-instance model constrains the distances among the points. In summary, the new numerical refinement operator is interesting both theoretically as a new synthesis of logical and statistical learning and practically as a new method for characterizing binding sites and pharmacophores in biochemical applications.
Year
DOI
Venue
2010
10.1007/978-3-642-21295-6_5
ILP
Keywords
Field
DocType
binding site,multi-instance model,statistical multi-instance,multi-instance learner,multi-instance learning,new numerical variable,new numerical refinement operator,associated multi-instance classification model,new method,numerical refinement operator,numerical variable
Inductive logic programming,Workbench,Logical conjunction,Computer science,Algorithm,Theoretical computer science,Artificial intelligence,Operator (computer programming),Statistical learning,Boosting (machine learning),Logic programming,Machine learning
Conference
Volume
ISSN
Citations 
6489
0302-9743
3
PageRank 
References 
Authors
0.40
9
4
Name
Order
Citations
PageRank
Erick Alphonse1494.47
Tobias Girschick2565.83
Fabian Buchwald3474.65
Stefan Kramer41313141.90