Title
An XML standard for the dissemination of annotated 2D gel electrophoresis data complemented with mass spectrometry results.
Abstract
Many proteomics initiatives require seamless bioinformatics integration of a range of analytical steps between sample collection and systems modeling, immediately accessible to the participants involved in the process. Proteomic profiling, from 2D gel electrophoresis to the putative identification of differentially expressed proteins by comparison of mass spectrometry results with reference databases, includes many components of sample processing, not just analysis and interpretation, that are regularly revisited and updated. Supporting such updates and the dissemination of data requires a suitable data structure. However, no data structure is currently available for storing, in a single XML file, the data for multiple gels generated in a single proteomic experiment. This paper proposes a data structure based on XML standards to fill the gap between the data generated by proteomics experiments and their storage. To address the resulting procedural fluidity, we have adopted and implemented a data model centered on the concept of the annotated gel (AG) as the format for delivery and management of 2D gel electrophoresis results. An eXtensible Markup Language (XML) schema is proposed to manage, analyze and disseminate annotated 2D gel electrophoresis results. The structure of AG objects is formally represented using XML, resulting in the definition of the AGML syntax presented here. The proposed schema accommodates data on the electrophoresis results as well as the mass spectrometry analysis of selected gel spots. A web-based software library is being developed to handle data storage, analysis and graphic representation. The computational tools described will be made available at http://bioinformatics.musc.edu/agml. Our development of AGML provides a simple data structure for storing 2D gel electrophoresis data.
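As a rough illustration of the idea in the abstract (several annotated gels from one experiment stored in a single XML file, with mass spectrometry results attached to selected spots), the following Python sketch builds and reads back such a document. The element and attribute names (`experiment`, `gel`, `spot`, `massSpectrometry`, `peak`, `mz`, `intensity`) and the sample values are hypothetical and chosen only for illustration; they are not taken from the published AGML schema.

```python
import xml.etree.ElementTree as ET


def build_experiment(gels):
    """Serialize several gels into one XML tree.

    All tag and attribute names are illustrative assumptions,
    not the actual AGML vocabulary.
    """
    root = ET.Element("experiment")
    for gel in gels:
        gel_el = ET.SubElement(root, "gel", id=gel["id"])
        for spot in gel["spots"]:
            spot_el = ET.SubElement(
                gel_el, "spot",
                id=spot["id"], x=str(spot["x"]), y=str(spot["y"]),
            )
            # Only spots selected for mass spectrometry carry peak data.
            if spot.get("peaks"):
                ms_el = ET.SubElement(spot_el, "massSpectrometry")
                for mz, intensity in spot["peaks"]:
                    ET.SubElement(ms_el, "peak", mz=str(mz), intensity=str(intensity))
    return root


def spots_with_ms(root):
    """Return (gel id, spot id) pairs for spots that have MS results."""
    return [
        (gel.get("id"), spot.get("id"))
        for gel in root.iter("gel")
        for spot in gel.iter("spot")
        if spot.find("massSpectrometry") is not None
    ]


# Made-up sample data: two gels from one experiment.
gels = [
    {"id": "gel1", "spots": [
        {"id": "s1", "x": 120, "y": 340},
        {"id": "s2", "x": 410, "y": 95, "peaks": [(842.5, 1200.0), (1045.6, 860.0)]},
    ]},
    {"id": "gel2", "spots": [
        {"id": "s1", "x": 130, "y": 338, "peaks": [(927.4, 540.0)]},
    ]},
]

root = build_experiment(gels)
xml_text = ET.tostring(root, encoding="unicode")  # one XML document for all gels
parsed = ET.fromstring(xml_text)                  # round trip through text
```

Keeping every gel under one root element means a single file can be exchanged, validated, and updated as a unit, which is the gap the abstract says existing formats leave open.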
Year: 2004
DOI: 10.1186/1471-2105-5-9
Venue: BMC Bioinformatics
Keywords: life and medical sciences, data (general), data structures, data/0401001, programming languages, microarrays, algorithms, data model, 2d gel electrophoresis, system modeling, bioinformatics, xml schema, computational biology, proteomics, data structure, computer graphics, mass spectrometry, data storage, extensible markup language
Field: Data structure, Data mining, Two-dimensional gel electrophoresis, Mass spectrometry data format, XML, Proteomics, Profiling (computer programming), Computer science, Software, Bioinformatics, RDF
DocType: Journal
Volume: 5
Issue: 1
ISSN: 1471-2105
Citations: 7
PageRank: 1.16
References: 5
Authors (5)
Name | Order | Citations | PageRank
Romesh Stanislaus | 1 | 88 | 3.56
Liu Hong Jiang | 2 | 7 | 1.16
Martha Swartz | 3 | 7 | 1.16
John Arthur | 4 | 43 | 2.11
Jonas S Almeida | 5 | 731 | 42.25