Title | ||
---|---|---|
Data product configuration management and versioning in large-scale production of satellite scientific data |
Abstract | ||
---|---|---|
This paper describes a formal structure for keeping track of files, source code, scripts, and related material for large-scale Earth science data production. We first describe the environment and processes that govern this configuration management problem. Then, we show that a graph with typed nodes and arcs can describe the derivation of production design and of the produced files and their metadata. The graph provides three useful by-products: • a hierarchical data file inventory structure that can help system users find particular files, • methods for creating production graphs that govern job scheduling and provenance graphs that track all of the sources and transformations between raw data input and a particular output file, •a systematic relationship between different elements of the structure and development documentation. |
Year | DOI | Venue |
---|---|---|
2003 | 10.1007/3-540-39195-9_9 | SCM |
Keywords | Field | DocType |
production graph,large-scale earth science data,satellite scientific data,inventory structure,data product configuration management,provenance graph,production design,formal structure,raw data input,large-scale production,particular output file,particular file,hierarchical data,source code,configuration management,product design,scientific data,job scheduling | Metadata,Source code,Computer science,Raw data,Job scheduler,Configuration management,Data file,Hierarchical database model,Database,Software versioning,Distributed computing | Conference |
Volume | ISSN | ISBN |
2649 | 0302-9743 | 3-540-14036-0 |
Citations | PageRank | References |
6 | 0.93 | 5 |
Authors | ||
1 |
Name | Order | Citations | PageRank |
---|---|---|---|
Bruce R. Barkstrom | 1 | 73 | 17.54 |