Title
Benchmarking Platform For Ligand-Based Virtual Screening
Abstract
Virtual screening (VS) of databases of chemical compounds has become a common step in the drug discovery process. Ligand-based virtual screening is a variant of VS where similarity to known active compounds is utilized in the discovery of new bioactive molecules. The cornerstone, which determines success of virtual screening, is the used molecular similarity measure. Currently, there is no superior approach to modeling molecular similarity and design of new similarity approaches is an active research field in cheminformatics. Therefore, proper benchmarking is of utter importance. In this paper, we describe common pitfalls of current approach to benchmarking of new methods. We focus on the importance of reproducibility and design of benchmarking datasets. Moreover, we identify the dataset difficulty as an important, yet not wildly utilized, property of the benchmarking data. To solve the identified issues we present a new benchmarking platform. The platform implements most commonly used molecular representations and includes datasets of varying difficulty levels as well as scripts which make the platform easy to use and extend. The existing representations are benchmarked using the proposed platform and results are presented. The benchmarking platform is available at https://github.com/skodapetr/lbvs-environment.
Year
Venue
Keywords
2016
2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM)
Platform,Virtual screening,Benchmarking data
Field
DocType
ISSN
Data mining,Drug discovery,Similarity measure,Computer science,Artificial intelligence,Bioinformatics,Virtual screening,Benchmarking,Cheminformatics,Machine learning,Benchmark (computing),Scripting language
Conference
2156-1125
Citations 
PageRank 
References 
0
0.34
11
Authors
2
Name
Order
Citations
PageRank
Petr Skoda1399.56
David Hoksza29021.53