Title
Data I/O Management Approach For The Post-Hoc Visualization Of Big Simulation Data Results
Abstract
Leading-edge supercomputers, such as the K computer, have generated a vast amount of simulation results, and most of these datasets were stored on the file system for the post-hoc analysis such as visualization. In this work, we first investigated the data generation trends of the K computer by analyzing some operational log data files. We verified a tendency of generating large amounts of distributed files as simulation outputs, and in most cases, the number of files has been proportional to the number of utilized computational nodes, that is, each computational node producing one or more files. Considering that the computational cost of visualization tasks is usually much smaller than that required for large-scale numerical simulations, a flexible data input/output (I/O) management mechanism becomes highly useful for the post-hoc visualization and analysis. In this work, we focused on the xDMlib data management library, and its flexible data I/O mechanism in order to enable flexible data loading of big computational climate simulation results. In the proposed approach, a pre-processing is executed on the target distributed files for generating a light-weight metadata necessary for the elaboration of the data assignment mapping used in the subsequent data loading process. We evaluated the proposed approach by using a 32-node visualization cluster, and the K computer. Besides the inevitable performance penalty associated with longer data loading time, when using smaller number of processes, there is a benefit for avoiding any data replication via copy, conversion, or extraction. In addition, users will be able to freely select any number of nodes, without caring about the number of distributed files, for the post-hoc visualization and analysis purposes.
Year
DOI
Venue
2018
10.1142/S1793962318400068
INTERNATIONAL JOURNAL OF MODELING SIMULATION AND SCIENTIFIC COMPUTING
Keywords
Field
DocType
Scientific visualization, distributed data, data management, K computer
Data mining,Mathematical optimization,File system,Visualization,Computer science,Input/output,Approaches of management,Data management,Scientific visualization
Journal
Volume
Issue
ISSN
9
3
1793-9623
Citations 
PageRank 
References 
0
0.34
6
Authors
7
Name
Order
Citations
PageRank
Jorji Nonaka13012.97
Eduardo Camilo Inacio2175.28
Kenji Ono3219.22
Mario A. R. Dantas46925.60
Yasuhiro Kawashima500.34
Tomohiro Kawanabe611.71
Fumiyoshi Shoji7527.36