Title
BioMAJ: a flexible framework for databanks synchronization and processing.
Abstract
Large- and medium-scale computational molecular biology projects require accurate bioinformatics software and numerous heterogeneous biological databanks, which are distributed around the world. BioMAJ provides a flexible, robust, fully automated environment for managing such massive amounts of data. The Java application automates the data update cycle and supervises the locally mirrored data repository. We have developed workflows that handle some of the most commonly used bioinformatics databases. A set of scripts is also available for post-synchronization data treatment, consisting of indexing or format conversion (for NCBI BLAST, SRS, EMBOSS, GCG, etc.). BioMAJ can easily be extended with users' own processing scripts. Source history can be kept via HTML reports containing statements of locally managed databanks.
Year
2008
DOI
10.1093/bioinformatics/btn325
Venue
BIOINFORMATICS
Keywords
computational biology, programming languages, algorithms, database management systems
Field
Data mining, Synchronization, Computer science, Automation, Software, Information repository, Bioinformatics, Java, Workflow, Database, Scripting language, License
DocType
Journal
Volume
24
Issue
16
ISSN
1367-4803
Citations
4
PageRank
0.56
References
2
Authors
10