Title
Advancing distributed data management for the HydroShare hydrologic information system.
Abstract
HydroShare (https://www.hydroshare.org) is an online collaborative system to support the open sharing of hydrologic data, analytical tools, and computer models. Hydrologic data and models are often large, extending to multi-gigabyte or terabyte scale, and as a result, the scalability of centralized data management poses challenges for a system such as HydroShare. A distributed data management framework that enables distributed physical data storage and management in multiple locations thus becomes a necessity. We use the iRODS (Integrated Rule-Oriented Data System) data grid middleware as the distributed data storage and management back end in HydroShare. iRODS provides a unified virtual file system for distributed physical storages in multiple locations and enables data federation across geographically dispersed institutions around the world. In this paper, we describe the iRODS-based distributed data management approaches implemented in HydroShare to provide a practical demonstration of a production system for supporting big data in the environmental sciences.
Year
DOI
Venue
2018
10.1016/j.envsoft.2017.12.008
Environmental Modelling & Software
Keywords
Field
DocType
Distributed data management,Big data,Data sharing,Hydrologic information systems,Collaborative environment,iRODS
Information system,Middleware,Computer science,Data sharing,Data grid,Distributed data store,Data management,Big data,Management science,Scalability,Distributed computing
Journal
Volume
Issue
ISSN
102
C
1364-8152
Citations 
PageRank 
References 
0
0.34
6
Authors
6
Name
Order
Citations
PageRank
Hong Yi113.41
Ray Idaszak242.77
Michael J. Stealey331.23
Chris Calloway400.68
Alva L. Couch519729.24
David G. Tarboton614316.55