Title
Characterizing In Situ and In Transit Analytics of Molecular Dynamics Simulations for Next-Generation Supercomputers
Abstract
Molecular Dynamics (MD) simulations executed on state-of-the-art supercomputers produce data faster than it can be written to disk. In situ and in transit analysis of the data generated by MD simulations reduces the original volume of information by several orders of magnitude, thereby alleviating the negative impact of I/O bottlenecks. This work characterizes the impact of in situ and in transit analytics on overall MD workflow performance and on the ability to capture rapid, rare events in the simulated molecular system. The MD simulation and analysis processes share data via remote direct memory access (RDMA) using DataSpaces. Our metrics of interest are the time the MD simulation spends waiting on I/O, the number of MD simulation frames lost, and the idle time of the analysis. We measure these metrics for a diverse set of molecular systems and characterize their trends for in situ and in transit configurations. We then model which frames are dropped and which are analyzed for a real use case. The insights gained from this study are generally applicable to in situ and in transit workflows that require parameter optimization to minimize loss in workflow performance and analytic accuracy.
Year
2019
DOI
10.1109/eScience.2019.00027
Venue
2019 15th International Conference on eScience (eScience)
Keywords
Scientific workflows, data analytics, performance, workload modeling, remote direct memory access
DocType
Conference
ISBN
978-1-7281-2452-0
Citations
1
PageRank
0.35
References
16
Authors (10)
Name                       Order  Citations  PageRank
Michela Taufer             1      352        53.04
Ewa Deelman                2      5948       420.48
Stephen Thomas             3      1          0.35
Michael R. Wyatt II        4      1          0.35
Tu Mai Anh Do              5      6          2.47
Loic Pottier               6      1          1.03
Rafael Ferreira da Silva   7      450        35.94
Harel Weinstein            8      74         10.97
Michel A. Cuendet          9      1          0.35
Trilce Estrada             10     120        18.27