Title
Scaling embedded in-situ indexing with deltaFS.
Abstract
Analysis of large-scale simulation output is a core element of scientific inquiry, but analysis queries may experience significant I/O overhead when the data is not structured for efficient retrieval. While in-situ processing allows for improved time-to-insight for many applications, scaling in-situ frameworks to hundreds of thousands of cores can be difficult in practice. The DeltaFS in-situ indexing is a new approach for in-situ processing of massive amounts of data to achieve efficient point and small-range queries. This paper describes the challenges and lessons learned when scaling this in-situ processing function to hundreds of thousands of cores. We propose techniques for scalable all-to-all communication that is memory and bandwidth efficient, concurrent indexing, and specialized LSM-Tree formats. Combining these techniques allows DeltaFS to control the cost of in-situ processing while maintaining 3 orders of magnitude query speedup when scaling alongside the popular VPIC particle-in-cell code to 131,072 cores.
Year
DOI
Venue
2018
10.1109/SC.2018.00006
SC
Keywords
Field
DocType
Indexing,Data models,Computational modeling,Trajectory,Analytical models,Libraries
Bandwidth efficient,Data modeling,Orders of magnitude (numbers),Computer science,Parallel computing,Search engine indexing,Scaling,Trajectory,Scalability,Speedup
Conference
ISBN
Citations 
PageRank 
978-1-5386-8384-2
4
0.44
References 
Authors
42
9
Name
Order
Citations
PageRank
Qing Zheng1915.40
Charles D. Cranor258252.19
Danhao Guo340.44
Gregory R. Ganger44560383.16
George Amvrosiadis511110.40
Garth A. Gibson684961.69
Bradley W. Settlemyer712013.00
Gary Grider825316.11
Fan Guo943818.96