Title
A Distance-Based Boolean Applicability Domain for Classification of High Throughput Screening Data.
Abstract
In Quantitative Structure-Activity Relationship (QSAR) modeling, one must come up with an activity model but also with an applicability domain for that model. Some existing methods to create an applicability domain are complex, hard to implement, and/or difficult to interpret. Also, they often require the user to select a threshold value, or they embed an empirical constant. In this work, we propose a trivial to interpret and fully automatic Distance-Based Boolean Applicability Domain (DBBAD) algorithm for category QSAR. In retrospective experiments on High Throughput Screening data sets, this applicability domain improves the classification performance and early retrieval of support vector machine and random forest applicability domain improves the based classifiers, while improving thescaffold diversity among top-ranked active molecules.
Year
DOI
Venue
2019
10.1021/acs.jcim.8b00499
JOURNAL OF CHEMICAL INFORMATION AND MODELING
DocType
Volume
Issue
Journal
59
1
ISSN
Citations 
PageRank 
1549-9596
0
0.34
References 
Authors
0
2
Name
Order
Citations
PageRank
Francois Berenger1234.11
Yoshihiro Yamanishi2126883.44