Title
I/O streaming evaluation of batch queries for data-intensive computational turbulence
Abstract
We describe a method for evaluating computational turbulence queries, including Lagrange Polynomial interpolation, based on partial sums that allows the underlying data to be accessed in any order and in parts. We exploit these properties to stream data from disk in a single pass and concurrently evaluate batch queries. The combination of sequential I/O and data sharing improves performance by an order of magnitude when compared with direct evaluation of each query. The technique also supports distributed evaluation of queries in a database cluster, assembling the partial sums from each node at the query mediator. Interpolation is fundamental to computational turbulence, over 95% of queries use these routines, and the partial sums method allows the JHU Turbulence Database Cluster to realize scale and throughput for our scientists' data-intensive workloads.
Year
DOI
Venue
2011
10.1145/2063384.2063423
SC
Keywords
Field
DocType
data sharing,query mediator,jhu turbulence database cluster,partial sum,stream data,partial sums method,computational turbulence query,data-intensive computational turbulence,underlying data,batch query,direct evaluation,query optimization,polynomial interpolation,data intensive computing,vectors,sorting,interpolation,distributed databases,kernel,partial sums,database management systems,polynomials
Query optimization,Lagrange polynomial,Data-intensive computing,Polynomial,Computer science,Parallel computing,Interpolation,Sorting,Input/output,Distributed database,Distributed computing
Conference
Citations 
PageRank 
References 
6
0.55
21
Authors
5
Name
Order
Citations
PageRank
Kalin Kanov1113.06
Eric Perlman2545.00
Randal Burns31955115.15
Yanif Ahmad491345.62
Alexander Szalay512410.16