Title
An analysis of extensible modelling for functional genomics data.
Abstract
Background: Several data formats have been developed for large scale biological experiments, using a variety of methodologies. Most data formats contain a mechanism for allowing extensions to encode unanticipated data types. Extensions to data formats are important because the experimental methodologies tend to be fairly diverse and rapidly evolving, which hinders the creation of formats that will be stable over time. Results: In this paper we review the data formats that exist in functional genomics, some of which have become de facto or de jure standards, with a particular focus on how each domain has been modelled, and how each format allows extensions. We describe the tasks that are frequently performed over data formats and analyse how well each task is supported by a particular modelling structure. Conclusion: From our analysis, we make recommendations as to the types of modelling structure that are most suitable for particular types of experimental annotation. There are several standards currently under development that we believe could benefit from systematically following a set of guidelines.
Year
DOI
Venue
2005
10.1186/1471-2105-6-235
BMC Bioinformatics
Keywords
Field
DocType
functional genomics,data collection,microarray analysis,genomics,algorithms,bioinformatics,computer simulation,data type,mass spectrometry,microarrays
Data science,ENCODE,Data collection,Computer science,Functional genomics,Data type,Geography Markup Language,Software,Bioinformatics,Proprietary format,Extensibility
Journal
Volume
Issue
ISSN
6
1
1471-2105
Citations 
PageRank 
References 
5
0.57
3
Authors
2
Name
Order
Citations
PageRank
Andrew R. Jones1757.65
Norman W. Paton23059359.26