Title
High performance workflow implementation for protein surface characterization using grid technology.
Abstract
BACKGROUND: This study concerns the development of a high performance workflow that, using grid technology, correlates different kinds of Bioinformatics data, starting from the base pairs of the nucleotide sequence to the exposed residues of the protein surface. The implementation of this workflow is based on the Italian Grid.it project infrastructure, that is a network of several computational resources and storage facilities distributed at different grid sites. METHODS: Workflows are very common in Bioinformatics because they allow to process large quantities of data by delegating the management of resources to the information streaming. Grid technology optimizes the computational load during the different workflow steps, dividing the more expensive tasks into a set of small jobs. RESULTS: Grid technology allows efficient database management, a crucial problem for obtaining good results in Bioinformatics applications. The proposed workflow is implemented to integrate huge amounts of data and the results themselves must be stored into a relational database, which results as the added value to the global knowledge. CONCLUSION: A web interface has been developed to make this technology accessible to grid users. Once the workflow has started, by means of the simplified interface, it is possible to follow all the different steps throughout the data processing. Eventually, when the workflow has been terminated, the different features of the protein, like the amino acids exposed on the protein surface, can be compared with the data present in the output database.
Year
DOI
Venue
2005
10.1186/1471-2105-6-S4-S19
BMC Bioinformatics
Keywords
Field
DocType
web interface,software design,automation,computational biology,proteins,database management,microarrays,database management systems,internet,bioinformatics,amino acid,algorithms,computer graphics,data processing,base pair,nucleotide sequence,systems integration,relational database
Software design,Grid computing,Computer science,Automation,Software,Bioinformatics,Workflow management system,Workflow,Grid,System integration
Journal
Volume
Issue
ISSN
6
S-4
1471-2105
Citations 
PageRank 
References 
25
0.53
6
Authors
5
Name
Order
Citations
PageRank
Ivan Merelli129435.36
Giulia Morra2564.20
Daniele D'Agostino313023.39
Andrea Clematis422338.08
Luciano Milanesi582387.40