Title
Using Tversky similarity searches for core hopping: finding the needles in the haystack.
Abstract
The combination of Daylight fingerprints and the Tversky coefficient is a powerful method for performing core hopping, that is, scaffold (or lead) hopping where the main structural difference between the query and bioactive target molecule is located in the central core of the molecular structure. However, a major disadvantage of this approach is the fact that a large number of false positives (in the context of core hopping) are retrieved. The tool we have developed and which is described here can be used to postprocess the hits from Daylight Tversky similarity searches by fragmenting the molecules and subsequently annotating them in a way that assists the users in removing false positives and enables them to better focus on molecules of interest. To validate our approach, we have selected four biological targets for which scaffold hopping examples have been reported. We present results from searches in databases containing published activity data and the subsequent analysis of the hits aimed at establishing the potential of our approach.
Year
DOI
Venue
2009
10.1021/ci900092y
JOURNAL OF CHEMICAL INFORMATION AND MODELING
Keywords
Field
DocType
similarity search
Data mining,Haystack,Chemistry,Bioinformatics,False positive paradox
Journal
Volume
Issue
ISSN
49
6
1549-9596
Citations 
PageRank 
References 
5
0.47
0
Authors
1
Name
Order
Citations
PageRank
Stefan Senger1132.71