Title
Rex: Extrapolating Relational Data In A Representative Way
Abstract
Generating synthetic data is useful in multiple application areas (e.g., database testing, software testing). Nevertheless, existing synthetic data generators generally lack the necessary mechanism to produce realistic data, unless a complex set of inputs are given from the user, such as the characteristics of the desired data. An automated and efficient technique is needed for generating realistic data. In this paper, we propose ReX, a novel extrapolation system targeting relational data-bases that aims to produce a representative extrapolated database given an original one and a natural scaling rate. Furthermore, we evaluate our system in comparison with an existing realistic scaling method, UpSizeR, by measuring the representativeness of the extrapolated database to the original one, the accuracy for approximate query answering, the database size, and their performance. Results show that our solution significantly outperforms the compared method for all considered dimensions.
Year
DOI
Venue
2015
10.1007/978-3-319-20424-6_10
DATA SCIENCE
Keywords
Field
DocType
Representative extrapolation, Scaling problem, Synthetic data generation, Relational database
Data mining,Relational database,Computer science,Representativeness heuristic,Database design,Database testing,Synthetic data,Extrapolation,Scaling,Software testing
Conference
Volume
ISSN
Citations 
9147
0302-9743
1
PageRank 
References 
Authors
0.35
17
4
Name
Order
Citations
PageRank
Teodora Sandra Buda1267.50
Thomas Cerqueus24510.23
John Murphy359752.43
Morten Kristiansen4112.67