Title
A Versatile Event-Driven Data Model in HBase Database for Multi-source Data of Power Grid
Abstract
As a column-oriented, distributed and high fault-tolerant database, HBase is gradually adopted by manufactory industry due to its huge capacity, fast random read and bulk processing performance. Power grid usually generates huge amount of data from multi-sources with hundreds of complicated data types. However, JOIN operation performance is low in HBase since it was not designed to serve relational queries as classic relational database. How to store the data to ensure a sufficient performance of JOIN operation and random read is the critical problem that must to be solved in HBase applications. In this paper, we proposed an event-driven HBase data model to resolve this problem. In our data model, each record of data is defined as a unique event that happened in power grid. Any type of data from any data sources can be distinguished in database. Therefore our data model can store multi-source data generated by power grid devices. In addition, JOIN operation is integrated in our data model, which improves the performance of reading data from multiple data sources. This is realized by designing a specific RowKey in tables. We also proposed a schema including a novel Virtual ColumnFamily, which resolves the compatibility problem of storing data from multi-sources. Virtual ColumnFamily is realized by designing specific qualifiers. To verify the effectiveness of our data model, we conducted empirical experiments on a Hadoop platform to compare our optimized schema and original schema. Experimental results showed that our data model ensures that our optimized schema performs better than the original schema based on original data model.
Year
DOI
Venue
2016
10.1109/SmartCloud.2016.28
2016 IEEE International Conference on Smart Cloud (SmartCloud)
Keywords
Field
DocType
HBase,Data model,Event-driven,JOIN operation,Virtual ColumnFamily
Data warehouse,Data modeling,Data mining,Relational database,Database model,Semi-structured model,Computer science,Database design,Data model,Elasticity (data store),Database
Conference
ISBN
Citations 
PageRank 
978-1-5090-5264-6
1
0.43
References 
Authors
0
8
Name
Order
Citations
PageRank
Bin Liu120.81
Yongxin Zhu246658.07
chang wang33312.55
Yufeng Chen461.30
Tian Huang5537.40
WeiWei Shi65112.26
Mengjun Li710.43
Yishu Mao863.28