Title
Change data capture in NoSQL databases: A functional and performance comparison.
Abstract
Requirements for data storage and processing have reached new levels, with applications relying on the analysis of large amounts of data in order to support everyday life services to end users. Since the costs of maintaining and managing databases are significant, change data capture (CDC) techniques can be used to determine which parts of a data source have changed, and thus assist in the management of large volumes of data in data warehouses. In this paper we investigate a number of CDC techniques suitable for NoSQL databases. CDC techniques can be used to track modifications in a source database, which later can be made available to a target database. Our base system and testbed are based on Apache Cassandra, which is a NoSQL database that offers high performance and scalability. Cassandra is combined with a MapReduce framework, which is used to implement the logic of each CDC technique and is suitable for highly distributed and parallel computing. This paper also presents both a functional comparison of the different CDC techniques, as well as a performance evaluation in a real testbed.
Year
DOI
Venue
2015
10.1109/ISCC.2015.7405574
ISCC
Keywords
Field
DocType
performance evaluation,parallel computing,distributed computing,MapReduce framework,Apache Cassandra,data warehouses,CDC,data analysis,data processing,data storage,performance comparison,functional comparison,NoSQL databases,change data capture
Data warehouse,Relational database,Computer science,Testbed,XML database,NoSQL,Elasticity (data store),Database,Change data capture,Scalability,Distributed computing
Conference
Citations 
PageRank 
References 
0
0.34
7
Authors
5
Name
Order
Citations
PageRank
Felipe Mathias Schmidt100.34
Cláudio F. R. Geyer211423.10
Alberto E. Schaeffer Filho312220.30
Stefan Deßloch427091.40
Yong Hu541.50