Title
Adapted transfer of distance measures for quantitative structure-activity relationships
Abstract
Quantitative structure-activity relationships (QSARs) are regression models relating chemical structure to biological activity. Such models allow to make predictions for toxicologically or pharmacologically relevant endpoints, which constitute the target outcomes of trials or experiments. The task is often tackled by instance-based methods (like k-nearest neighbors), which are all based on the notion of chemical (dis) similarity. Our starting point is the observation by Raymond and Willett that the two big families of chemical distance measures, fingerprint-based and maximum common subgaph based measures, provide orthogonal information about chemical similarity. The paper presents a novel method for finding suitable combinations of them, called adapted transfer, which adapts a distance measure learned on another, related dataset to a given dataset. Adapted transfer thus combines distance learning and transfer learning in a novel manner. In a set of experiments, we compare adapted transfer with distance learning on the target dataset itself and inductive transfer without adaptations. In our experiments, we visualize the performance of the methods by learning curves (i.e., depending on training set size) and present a quantitative comparison for 10% and 100% of the maximum training set size.
Year
DOI
Venue
2010
10.1007/978-3-642-16184-1_24
Discovery Science
Keywords
Field
DocType
chemical similarity,quantitative structure-activity relationship,inductive transfer,training set size,chemical distance measure,maximum common subgaph,related dataset,distance measure,chemical structure,maximum training set size,target dataset,learning curve,transfer learning,biological activity,regression model,quantitative structure activity relationship,distance learning,k nearest neighbor
Data mining,Inductive transfer,Regression analysis,Computer science,Transfer of learning,Artificial intelligence,Pattern recognition,Multiple kernel learning,Fingerprint,Chemical similarity,Learning curve,Machine learning,Distance measures
Conference
Volume
ISSN
ISBN
6332
0302-9743
3-642-16183-9
Citations 
PageRank 
References 
2
0.38
13
Authors
4
Name
Order
Citations
PageRank
U. Rückert1755103.61
Tobias Girschick2565.83
Fabian Buchwald3474.65
Stefan Kramer41313141.90