Title
ANI-1: A data set of 20M off-equilibrium DFT calculations for organic molecules.
Abstract
One of the grand challenges in modern theoretical chemistry is designing and implementing approximations that expedite ab initio methods without loss of accuracy. Machine learning (ML), in particular neural networks, are emerging as a powerful approach to constructing various forms of transferable atomistic potentials. They have been successfully applied in a variety of applications in chemistry, biology, catalysis, and solid-state physics. However, these models are heavily dependent on the quality and quantity of data used in their fitting. Fitting highly flexible ML potentials comes at a cost: a vast amount of reference data is required to properly train these models. We address this need by providing access to a large computational DFT database, which consists of 20M conformations for 57,454 small organic molecules. We believe it will become a new standard benchmark for comparison of current and future methods in the ML potential community.
Year
Venue
Field
2017
arXiv: Chemical Physics
Reference data (financial markets),Computational chemistry,Grand Challenges,Ab initio,Artificial neural network,Organic molecules,Physics
DocType
Volume
Citations 
Journal
abs/1708.04987
0
PageRank 
References 
Authors
0.34
2
3
Name
Order
Citations
PageRank
Justin Smith19711.74
Olexandr Isayev2194.26
Adrian E Roitberg3233.56