Title
Data Decomposition in Biomedical e-Science Applications
Abstract
As the focus of e-Science moves toward the fourth paradigm and data-intensive science, data access remains dependent on the architecture of the e-Science infrastructure in use. Such an architecture is in general job-driven, i.e., a (grid) job is a sequence of commands that run on the same worker node. Making use of the infrastructure requires a parallelized application, and parallelization is achieved primarily by data decomposition. In the general practice of parallel programming, data decomposition depends on the programmer's experience and knowledge of the data and of the algorithm or application. Data mining scientists, on the other hand, have an established foundation for data decomposition: automatic decomposition methods are already in use, and methodologies and patterns are defined. Our experience in porting biomedical applications to the Dutch e-Science infrastructure shows that the data decomposition used to gain parallelism fits, to some degree, a subgroup of the data mining decomposition patterns, i.e., object set decomposition. In this paper we discuss porting three biomedical packages to a grid computing environment, two for medical imaging and one for DNA sequencing. We show how the data access of the applications was reengineered around the executables to make use of the parallel capacity of the e-Science infrastructure.
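Object set decomposition, as named in the abstract, can be illustrated with a minimal sketch. It assumes the input is a list of independent objects (for example, image files or sequence reads) that can be split into disjoint subsets, one per job. The names `decompose`, `run_job`, and `run_all` are hypothetical and do not come from the paper, and a local thread pool stands in for grid job submission.

```python
from concurrent.futures import ThreadPoolExecutor


def decompose(objects, n_jobs):
    """Split a list of independent objects into at most n_jobs contiguous chunks."""
    if n_jobs < 1:
        raise ValueError("n_jobs must be at least 1")
    # Never create more chunks than objects; keep one (empty) chunk for empty input.
    n_jobs = min(n_jobs, len(objects)) or 1
    base, extra = divmod(len(objects), n_jobs)
    chunks, start = [], 0
    for i in range(n_jobs):
        # The first `extra` chunks take one additional object each.
        size = base + (1 if i < extra else 0)
        chunks.append(objects[start:start + size])
        start += size
    return chunks


def run_job(chunk):
    """Hypothetical stand-in for one job: process every object in its chunk.

    In a real port this would wrap the unmodified executable; here it
    only upper-cases each object name so the sketch stays self-contained.
    """
    return [name.upper() for name in chunk]


def run_all(objects, n_jobs):
    """Run one job per chunk and merge the partial results in input order."""
    chunks = decompose(objects, n_jobs)
    with ThreadPoolExecutor(max_workers=len(chunks)) as pool:
        results = pool.map(run_job, chunks)
    return [item for part in results for item in part]
```

Because each chunk is processed without reference to the others, the executable itself needs no changes; only the way its input is selected and its output is merged is wrapped around it.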
Year
2011
DOI
10.1109/eScienceW.2011.7
Venue
eScience Workshops
Keywords
data mining scientist,biomedical e-science applications,data decomposition,data intensive science,e-science infrastructure,biomedical application,data mining decomposition pattern,dutch e-science infrastructure,automatic decomposition method,data access,used data,workflows,parallel programming,information retrieval,dna sequence,data mining,decomposition method,grid computing,bioinformatics
Field
Data mining,Grid computing,Programmer,Computer science,e-Science,Porting,Workflow,Data access,Grid,Distributed computing,Executable
DocType
Conference
Citations
0
PageRank
0.34
References
13
Authors
9