Title
A Framework of Write Optimization on Read-Optimized Out-of-Core Column-Store Databases
Abstract
The column-store database features a faster data reading speed and higher data compression efficiency compared with traditional row-based databases. However, optimizing write operations in the column-store database is one of the well-known challenges. Most existing works on write performance optimization focus on main-memory column-store databases. In this work, we investigate optimizing write operation update and deletion on out-of-core OOC, or external memory column-store databases. We propose a general framework to work for both normal OOC storage or big data storage, such as Hadoop Distributed File System HDFS. On normal OOC storage, we propose an innovative data storage format called Timestamped Binary Association Table or TBAT. Based on TBAT, a new update method, called Asynchronous Out-of-Core Update or AOC Update, is designed to replace the traditional update. On big data storage, we further extend TBAT onto HDFS and propose the Asynchronous Map-Only Update or AMO Update to replace the traditional update. Fast selection methods are developed in both contexts to improve data retrieving speed. A significant improvement in speed performance is shown in the extensive experiments when performing write operations on TBAT in normal and Map-Reduce environment.
Year
DOI
Venue
2015
10.1007/978-3-319-22849-5_12
DEXA
Field
DocType
Citations 
Distributed File System,Asynchronous communication,Data mining,Computer science,Computer data storage,Out-of-core algorithm,External storage,Data compression,Big data,Database,Auxiliary memory
Conference
1
PageRank 
References 
Authors
0.36
15
2
Name
Order
Citations
PageRank
Feng Yu1204.40
Wen-Chi Hou2387274.15