Title
CODD: A Dataless Approach to Big Data Testing.
Abstract
The construction and development of the so-called Big Data systems has occupied centerstage in the data management community in recent years. However, there has been comparatively little attention paid to the testing of such systems, an essential pre-requisite for successful deployment. This is surprising given that traditional testing techniques, which typically involve construction of representative databases and regression query suites, are completely impractical at Big Data scale -- simply due to the time and space overheads involved in their execution. For instance, consider the situation where a database engineer wishes to evaluate the query optimizer's behavior on a futuristic Big Data setup featuring \"yottabyte\" (1024 bytes) sized relational tables. Obviously, just generating this data, let alone storing it, is practically infeasible even on the best of systems.
Year
DOI
Venue
2015
10.14778/2824032.2824123
PVLDB
Field
DocType
Volume
Query optimization,Data mining,Byte,Software deployment,Computer science,Yottabyte,Data management,Big data,Database,Overhead (business)
Journal
8
Issue
ISSN
Citations 
12
2150-8097
0
PageRank 
References 
Authors
0.34
1
2
Name
Order
Citations
PageRank
Ashoke S.100.34
Jayant R. Haritsa22004228.38