Title
Using Data Mining To Improve Mutation In A Tool For Molecular Evolution
Abstract
We have developed an evolutionary algorithm-based program for drug design, the Molecule Evoluator. This program transforms known molecules into new molecules which may have improved properties relative to the parent molecule. Transforming the parent molecule into a derivative by mutation is necessary to find molecules with increased fitness. However, mutations that just randomly add and substitute atoms often result in molecules that contain undesirable chemical substructures, and can therefore not be used as drugs. We therefore want to add knowledge to the program about which mutations result in proper chemical structures and which ones do not. In this research we have mined a large chemical database, the World Drug Index, to obtain the frequencies of small substructures in drug-like molecules. Some of our mutation operators were subsequently modified to use these frequencies. Testing the new mutation frequencies on another large database of molecules, the NCI database, we found that the knowledge-based mutations more often produced existing molecules than the original mutations. This suggests that the modified mutations produce molecules that are easier to synthesize and more drug-like compared to the molecules generated using the original uninformed mutation operators.
Year
DOI
Venue
2005
10.1109/CEC.2005.1554700
2005 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-3, PROCEEDINGS
Keywords
Field
DocType
data mining,biochemistry,genetics,chemical database,evolutionary algorithm,indexation,knowledge base,evolutionary computation,molecular evolution,drug design,molecular biophysics,chemical structure
Data mining,Evolutionary algorithm,Computer science,Molecule,Molecular evolution,Evolutionary computation,Molecular biophysics,Chemical database,Mutation operator,Mutation
Conference
Citations 
PageRank 
References 
0
0.34
6
Authors
3
Name
Order
Citations
PageRank
Eric-Wubbo Lameijer1375.84
Adriaan P IJzerman219018.68
Joost N. Kok31429121.49