Title
A scalable approach to combinatorial library design for drug discovery.
Abstract
In this paper, we propose an algorithm for the design of lead generation libraries required in combinatorial drug discovery. This algorithm addresses simultaneously the two key criteria of diversity and representativeness Of Compounds in the resulting library and is computationally efficient when applied to a large class of lead generation design problems. At the same time, additional constraints on experimental resources are also incorporated in the framework presented in this paper. A computationally efficient scalable algorithm is developed, where the ability of the deterministic annealing algorithm to identify clusters is exploited to truncate computations over the entire data set to computations over individual clusters. An analysis of this algorithm quantifies the tradeoff between the error due to truncation and computational effort. Results applied on test data sets corroborate the analysis and show improvement by factors as large as 10 or more, depending on the data sets.
Year
DOI
Venue
2008
10.1021/ci700023y
JOURNAL OF CHEMICAL INFORMATION AND MODELING
Keywords
Field
DocType
drug discovery
Truncation,Cluster (physics),Data mining,Data set,Drug discovery,Computer science,Combinatorial chemistry,Deterministic annealing,Test data,Bioinformatics,Scalability,Computation
Journal
Volume
Issue
ISSN
48
1
1549-9596
Citations 
PageRank 
References 
5
0.49
0
Authors
3
Name
Order
Citations
PageRank
Puneet Sharma127138.61
Srinivasa Salapaka260.86
Carolyn L. Beck340160.19