Title
The practical obstacles of data transfer: why researchers still love scp
Abstract
The importance of computing facilities is heralded every six months with the announcement of the new Top500 list, showcasing the world's fastest supercomputers. Unfortunately, with great computing capability does not come great long-term data storage capacity, which often means users must move their data to their local site archive, to remote sites where they may be doing future computation or analysis, or back to their home institution, else face the dreaded data purge that most HPC centers employ to keep utilization of large parallel filesystems low to manage performance and capacity. At HPC centers, data transfer is crucial to the scientific workflow and will increase in importance as computing systems grow in size. The Energy Sciences Network (ESnet) recently launched its fifth generation network, a 100 Gbps high-performance, unclassified national network connecting more than 40 DOE research sites to support scientific research and collaboration. Despite the tenfold increase in bandwidth to DOE research sites amenable to multiple data transfer streams and high throughput, in practice, researchers often under-utilize the network and resort to painfully-slow single stream transfer methods such as scp to avoid the complexity of using multiple stream tools such as GridFTP and bbcp, and contend with frustration from the lack of consistency of available tools between sites. In this study we survey and assess the data transfer methods provided at several DOE supported computing facilities, including both leadership-computing facilities, connected through ESnet. We present observed transfer rates, suggested optimizations, and discuss the obstacles the tools must overcome to receive wide-spread adoption over scp.
Year
DOI
Venue
2013
10.1145/2534695.2534703
NDM@SC
Field
DocType
Citations 
Data transmission,Supercomputer,TOP500,Computer science,Computer data storage,Computer network,Throughput,GridFTP,Workflow,Distributed computing,Scientific method
Conference
1
PageRank 
References 
Authors
0.43
0
3
Name
Order
Citations
PageRank
Hai Ah Nam1161.74
Jason Hill2211.64
Suzanne Parete-Koon3201.73