Title
Workload characteristics of DNA sequence analysis: from storage systems' perspective
Abstract
The recent development of NGS (Next Generation Sequencing) methods has greatly increased the amount of genome data and created the need for high-performance computing and high-performance storage systems. The key issue in developing high-performance storage systems is building a storage system that is optimized for NGS analysis pipeline. In this paper, we implemented a tool to collect and analyze I/O workload in NGS analysis pipeline. Using this tool, we executed NGS analysis pipeline and analyzed the characteristics of I/Os collected in the experiment.
Year
DOI
Venue
2014
10.1145/2555486.2555490
RAPIDO
Keywords
Field
DocType
key issue,high-performance computing,dna sequence analysis,workload characteristic,next generation sequencing,genome data,high-performance storage system,storage system,recent development,o workload,ngs analysis pipeline,bioinformatics
Genome,Data mining,Workload,Computer data storage,Computer science,DNA sequencing,Sequence analysis
Conference
Citations 
PageRank 
References 
0
0.34
6
Authors
6
Name
Order
Citations
PageRank
Kyeongyeol Lim100.34
Geehan Park200.34
Minsuk Choi300.68
Youjip Won455854.71
Dong-Oh Kim5156.72
Hongyeon Kim644.76