Title
A note on utilising binary features as ligand descriptors.
Abstract
It is common in cheminformatics to represent the properties of a ligand as a string of 1's and 0's, with the intention of elucidating, inter alia, the relationship between the chemical structure of a ligand and its bioactivity. In this commentary we note that, where relevant but non-redundant features are binary, they inevitably lead to a classifier capable of capturing only a linear relationship between structural features and activity. If, instead, we were to use relevant but non-redundant real-valued features, the resulting predictive model would be capable of describing a non-linear structure-activity relationship. Hence, we suggest that real-valued features, where available, are to be preferred in this scenario.
Year
DOI
Venue
2015
10.1186/s13321-015-0105-3
Journal of Cheminformatics
Keywords
Field
DocType
Binary descriptors, Ligand chemical structure, Linear relationship, Bernoulli distribution
Data mining,Feature selection,Computer science,Ligand,Mutual information,Bioinformatics,Classifier (linguistics),Cheminformatics,Binary descriptor,Binary number
Journal
Volume
Issue
ISSN
7
1
1758-2946
Citations 
PageRank 
References 
0
0.34
5
Authors
3
Name
Order
Citations
PageRank
Hamse Y Mussa119213.13
John B O Mitchell238432.48
Robert C Glen358961.78