Title
Data product configuration management and versioning in large-scale production of satellite scientific data
Abstract
This paper describes a formal structure for keeping track of files, source code, scripts, and related material for large-scale Earth science data production. We first describe the environment and processes that govern this configuration management problem. Then, we show that a graph with typed nodes and arcs can describe the derivation of production design and of the produced files and their metadata. The graph provides three useful by-products: • a hierarchical data file inventory structure that can help system users find particular files, • methods for creating production graphs that govern job scheduling and provenance graphs that track all of the sources and transformations between raw data input and a particular output file, •a systematic relationship between different elements of the structure and development documentation.
Year
DOI
Venue
2003
10.1007/3-540-39195-9_9
SCM
Keywords
Field
DocType
production graph,large-scale earth science data,satellite scientific data,inventory structure,data product configuration management,provenance graph,production design,formal structure,raw data input,large-scale production,particular output file,particular file,hierarchical data,source code,configuration management,product design,scientific data,job scheduling
Metadata,Source code,Computer science,Raw data,Job scheduler,Configuration management,Data file,Hierarchical database model,Database,Software versioning,Distributed computing
Conference
Volume
ISSN
ISBN
2649
0302-9743
3-540-14036-0
Citations 
PageRank 
References 
6
0.93
5
Authors
1
Name
Order
Citations
PageRank
Bruce R. Barkstrom17317.54