Title
Automating Data-Throttling Analysis for Data-Intensive Workflows
Abstract
Data movement between tasks in scientific workflows has received limited attention compared to task execution. Often the staging of data between tasks is either assumed or the time delay in data transfer is considered to be negligible (compared to task execution). Where data consists of files, such file transfers are accomplished as fast as the network links allow, and once transferred, the files are buffered/stored at their destination. Where a task requires multiple files to execute (from different tasks), it must, however, remain idle until all files are available. Hence, network bandwidth and buffer/storage within a workflow are often not used effectively. We propose an automated workflow structural analysis method for Directed Acyclic Graphs (DAGs) which utilises information from previous workflow executions. The method obtains data-throttling values for the data transfer to enable network bandwidth and buffer/storage capacity to be managed more efficiently. We convert a DAG representation into a Petri net model and analyse the resulting graph using an iterative method to compute data-throttling values. Our approach is demonstrated using the Montage workflow.
Year
DOI
Venue
2012
10.1109/CCGrid.2012.27
CCGrid
Keywords
Field
DocType
previous workflow execution,data-throttling analysis,task execution,data-intensive workflows,montage workflow,network link,network bandwidth,automated workflow structural analysis,different task,data movement,iterative method,data transfer,petri net,scientific computing,data analysis,data models,iteration method,network topology,directed graphs,data consistency,petri nets,structure analysis,bandwidth,computational modeling,directed acyclic graph,system performance
Workflow technology,Petri net,Data transmission,Computer science,Directed graph,Directed acyclic graph,Real-time computing,Workflow engine,Workflow,Workflow management system,Distributed computing
Conference
Citations 
PageRank 
References 
1
0.36
13
Authors
3
Name
Order
Citations
PageRank
Ricardo J. Rodriguez131.07
Rafael Tolosana-Calasanz216618.52
Omer F. Rana32181229.52