Title
AClAP, Autonomous hierarchical agglomerative Cluster Analysis based protocol to partition conformational datasets.
Abstract
Sampling the conformational space is a fundamental step in both ligand- and structure-based drug design. However, the rational organization of different molecular conformations remains a challenge. In drug design applications, the sampling process yields a redundant set of conformations whose thorough analysis can be intensive, or even prohibitive. We propose a statistical approach based on cluster analysis aimed at rationalizing the output of methods such as Monte Carlo, genetic, and reconstruction algorithms. Although some software already implements clustering procedures, a universally accepted protocol is still missing. We integrated hierarchical agglomerative cluster analysis with a clusterability assessment method and a user-independent cutting rule to form a global protocol, which we implemented in a MATLAB metalanguage program (AClAP). We tested it on the conformational space of a quite diverse set of drugs generated via Metropolis Monte Carlo simulation, and on the poses obtained from reiterated docking runs performed with four widespread programs. In our tests, AClAP remarkably reduced the dimensionality of the original datasets at a negligible computational cost. Moreover, when applied to the combined outcomes of several docking programs, it was able to point to the crystallographic pose. AClAP is available in the "AClAP" section of the website http://www.scfarm.unibo.it.
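The general idea of partitioning conformations by hierarchical agglomerative clustering with an automatic cut can be illustrated with a minimal Python sketch. This is not the AClAP implementation: the abstract does not specify the linkage criterion, the clusterability test, or the cutting rule, so the average linkage, the "largest gap in merge heights" cut, the unsuperposed RMSD, and all function names below are assumptions chosen for illustration. The clusterability assessment step is not sketched.

```python
import math

def rmsd(a, b):
    """RMSD between two conformations given as equal-length lists of (x, y, z)
    atoms. Assumption: structures are already superposed."""
    sq = sum((p - q) ** 2 for u, v in zip(a, b) for p, q in zip(u, v))
    return math.sqrt(sq / len(a))

def average_linkage(dist):
    """Naive average-linkage agglomerative clustering on a full distance matrix.
    Returns a list of (merge height, partition just before that merge)."""
    clusters = [[i] for i in range(len(dist))]
    history = []
    while len(clusters) > 1:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                # Average of all pairwise distances between the two clusters.
                d = sum(dist[a][b] for a in clusters[i] for b in clusters[j])
                d /= len(clusters[i]) * len(clusters[j])
                if best is None or d < best[0]:
                    best = (d, i, j)
        d, i, j = best
        history.append((d, [list(c) for c in clusters]))
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return history

def cut_at_largest_gap(history):
    """Stand-in cutting rule (an assumption, not AClAP's rule): stop merging
    just before the largest jump between consecutive merge heights."""
    if len(history) < 2:
        return history[0][1] if history else []
    heights = [h for h, _ in history]
    gaps = [heights[k + 1] - heights[k] for k in range(len(heights) - 1)]
    k = max(range(len(gaps)), key=gaps.__getitem__)
    return history[k + 1][1]

# Toy data: six two-atom "conformations" forming two well-separated groups.
confs = [
    [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)],
    [(0.1, 0.0, 0.0), (1.1, 0.0, 0.0)],
    [(0.0, 0.1, 0.0), (1.0, 0.1, 0.0)],
    [(5.0, 0.0, 0.0), (6.0, 0.0, 0.0)],
    [(5.1, 0.0, 0.0), (6.1, 0.0, 0.0)],
    [(5.0, 0.1, 0.0), (6.0, 0.1, 0.0)],
]
dist = [[rmsd(a, b) for b in confs] for a in confs]
history = average_linkage(dist)
clusters = sorted(sorted(c) for c in cut_at_largest_gap(history))
print(clusters)  # [[0, 1, 2], [3, 4, 5]]
```

On real conformational ensembles, one representative per cluster (for example, the member closest to all others) would typically be kept, which is how a redundant dataset shrinks to a handful of structures.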
Year: 2006
DOI: 10.1093/bioinformatics/btl212
Venue: ISMB (Supplement of Bioinformatics)
Keywords: thorough analysis, autonomous hierarchical agglomerative, drug design application, accepted protocol, aclap result, cluster analysis, diverse set, docking program, conformational datasets, conformational space, monte carlo, metropolis monte carlo simulation, genetics, drug design, monte carlo simulation
Field: Hierarchical clustering, Data mining, Monte Carlo method, MATLAB, Computer science, Curse of dimensionality, Software, Sampling (statistics), Metalanguage, Bioinformatics, Cluster analysis
DocType: Conference
Volume: 22
Issue: 14
ISSN: 1367-4811
Citations: 2
PageRank: 0.56
References: 6
Authors: 4
Name | Order | Citations | PageRank
Giovanni Bottegoni | 1 | 86 | 7.15
Walter Rocchia | 2 | 105 | 14.23
Maurizio Recanatini | 3 | 18 | 6.57
Andrea Cavalli | 4 | 17 | 3.15