Title
MTC envelope: defining the capability of large scale computers in the context of parallel scripting applications
Abstract
Many scientific applications can be efficiently expressed with the parallel scripting (many-task computing, MTC) paradigm. These applications are typically composed of several stages of computation, with tasks in different stages coupled by a shared file system abstraction. However, we often see poor performance when running these applications on large scale computers due to the applications' frequency and volume of filesystem I/O and the absence of appropriate optimizations in the context of parallel scripting applications. In this paper, we show the capability of existing large scale computers to run parallel scripting applications by first defining the MTC envelope and then evaluating the envelope by benchmarking a suite of shared filesystem performance metrics. We also seek to determine the origin of the performance bottleneck by profiling the parallel scripting applications' I/O behavior and mapping the I/O operations to the MTC envelope. We show an example shared filesystem envelope and present a method to predict the I/O performance given the applications' level of I/O concurrency and I/O amount. This work is instrumental in guiding the development of parallel scripting applications to make efficient use of existing large scale computers, and to evaluate performance improvements in the hardware/software stack that will better facilitate parallel scripting applications.
Year
DOI
Venue
2013
10.1145/2462902.2462913
HPDC
Keywords
Field
DocType
o concurrency,o performance,mtc envelope,performance bottleneck,o operation,o amount,o behavior,parallel scripting,parallel scripting application,large scale computer,distributed file system
Distributed File System,Bottleneck,File system,Computer science,Concurrency,Profiling (computer programming),Software,Operating system,Computation,Distributed computing,Scripting language
Conference
ISBN
Citations 
PageRank 
978-1-4503-1910-2
11
0.56
References 
Authors
21
5
Name
Order
Citations
PageRank
Zhao Zhang1110.56
Daniel S. Katz21496121.04
Michael Wilde3110.56
Justin M. Wozniak446435.32
Foster Ian5229382663.24