Title
Executable schema mappings for statistical data processing.
Abstract
Data processing is the core of any statistical information system. Statisticians are interested in specifying transformations and manipulations of data at a high level, in terms of entities of statistical models. We illustrate here a proposal where a high-level language, EXL, is used for the declarative specification of statistical programs, and a translation into executable form in various target systems is available. The language is based on the theory of schema mappings, in particular those defined by a specific class of tgds, which we actually use to optimize user programs and facilitate the translation towards several target systems. The characteristics of such class guarantee good tractability properties and the applicability in Big Data settings. A concrete implementation, EXLEngine, has been carried out and is currently used at the Bank of Italy.
Year
DOI
Venue
2018
https://doi.org/10.1007/s10619-017-7212-2
Distributed and Parallel Databases
Keywords
Field
DocType
Schema mappings,Statistical data,Scalable data processing,ETL
Information system,Data processing,Programming language,Computer science,Theoretical computer science,Software,Statistical model,Big data,Schema (psychology),Executable,Distributed computing
Journal
Volume
Issue
ISSN
36
2
0926-8782
Citations 
PageRank 
References 
0
0.34
23
Authors
4
Name
Order
Citations
PageRank
P. Atzeni17760.41
Luigi Bellomarini24613.11
Francesca Bugiotti312913.03
Marco De Leonardis400.34