Title
Collaborative Scientific Workflow Composition as a Service: An Infrastructure Supporting Collaborative Data Analytics Workflow Design and Management
Abstract
The need for collaborative data analytics grows significantly in the face of big data challenges. Although workflow tools offer a formal way to define, automate, and repeat multi-step computational procedures, designing a complex data-processing workflow requires collaboration among multiple people with complementary expertise. Existing tools are ill-suited to the collaborative design of comprehensive workflows. To address this challenge, this paper reports the design and development of a software infrastructure that supports collaborative data-oriented workflow composition and management, adding a key component to existing cyberinfrastructure that will support big data collaboration through the Internet. A collaborative provenance query model (CPM) is presented together with graph-based patterns and algebra. A hypergraph theory-based provenance mining technique is reported. The research extends an existing open-source workflow tool by adding system-level facilities to support the human interaction and cooperation that are essential for effective and efficient scientific collaboration.
Year
2016
DOI
10.1109/CIC.2016.039
Venue
2016 IEEE 2nd International Conference on Collaboration and Internet Computing (CIC)
Keywords
collaborative workflow design, scientific workflow, big data analytics, collaborative provenance
Field
Data science, Computer science, Cyberinfrastructure, Artificial intelligence, Workflow engine, Workflow, The Internet, Computer vision, Workflow technology, World Wide Web, XPDL, Big data, Workflow management system
DocType
Conference
ISBN
978-1-5090-4608-9
Citations
0
PageRank
0.34
References
24
Authors
7
Name | Order | Citations | PageRank
Jia Zhang | 1 | 116 | 24.54
Qihao Bao | 2 | 17 | 2.41
Xiaoyi Duan | 3 | 2 | 1.07
Lu, Shiyong | 4 | 2022 | 126.17
Lijun Xue | 5 | 0 | 0.34
Runyu Shi | 6 | 6 | 0.79
Pingbo Tang | 7 | 28 | 8.53