Title
Grid heterogeneity in in-silico experiments: an exploration of drug screening using DOCK on cloud environments.
Abstract
Large-scale in-silico screening is a necessary part of drug discovery and Grid computing is one answer to this demand. A disadvantage of using Grid computing is the heterogeneous computational environments characteristic of a Grid. In our study, we have found that for the molecular docking simulation program DOCK, different clusters within a Grid organization can yield inconsistent results. Because DOCK in-silico virtual screening (VS) is currently used to help select chemical compounds to test with in-vitro experiments, such differences have little effect on the validity of using virtual screening before subsequent steps in the drug discovery process. However, it is difficult to predict whether the accumulation of these discrepancies over sequentially repeated VS experiments will significantly alter the results if VS is used as the primary means for identifying potential drugs. Moreover, such discrepancies may be unacceptable for other applications requiring more stringent thresholds. This highlights the need for establishing a more complete solution to provide the best scientific accuracy when executing an application across Grids. One possible solution to platform heterogeneity in DOCK performance explored in our study involved the use of virtual machines as a layer of abstraction. This study investigated the feasibility and practicality of using virtual machine and recent cloud computing technologies in a biological research application. We examined the differences and variations of DOCK VS variables, across a Grid environment composed of different clusters, with and without virtualization. The uniform computer environment provided by virtual machines eliminated inconsistent DOCK VS results caused by heterogeneous clusters, however, the execution time for the DOCK VS increased. In our particular experiments, overhead costs were found to be an average of 41% and 2% in execution time for two different clusters, while the actual magnitudes of the execution time costs were minimal. Despite the increase in overhead, virtual clusters are an ideal solution for Grid heterogeneity. With greater development of virtual cluster technology in Grid environments, the problem of platform heterogeneity may be eliminated through virtualization, allowing greater usage of VS, and will benefit all Grid applications in general.
Year
DOI
Venue
2010
10.3233/978-1-60750-583-9-181
Studies in Health Technology and Informatics
Keywords
Field
DocType
Virtual Screening,DOCK,Platform Heterogeneity,Virtualization
DOCK,World Wide Web,Medicine,Grid,In silico,Cloud computing
Conference
Volume
ISSN
Citations 
159
0926-9630
3
PageRank 
References 
Authors
0.41
0
5
Name
Order
Citations
PageRank
Wen-Wai Yim130.41
Shu Chien250.84
Yasuyuki Kusumoto330.41
Susumu Date413328.14
Jason H. Haga5172.94