Title
Embedded Functional Dependencies and Data-completeness Tailored Database Design
Abstract
AbstractWe establish a principled schema design framework for data with missing values. The framework is based on the new notion of an embedded functional dependency, which is independent of the interpretation of missing values, able to express completeness and integrity requirements on application data, and capable of capturing redundant data value occurrences that may cause problems with processing data that meets the requirements. We establish axiomatic, algorithmic, and logical foundations for reasoning about embedded functional dependencies. These foundations enable us to introduce generalizations of Boyce-Codd and Third normal forms that avoid processing difficulties of any application data, or minimize these difficulties across dependency-preserving decompositions, respectively. We show how to transform any given schema into application schemata that meet given completeness and integrity requirements, and the conditions of the generalized normal forms. Data over those application schemata are therefore fit for purpose by design. Extensive experiments with benchmark schemata and data illustrate the effectiveness of our framework for the acquisition of the constraints, the schema design process, and the performance of the schema designs in terms of updates and join queries.
Year
DOI
Venue
2021
10.1145/3450518
ACM Transactions on Database Systems
Keywords
DocType
Volume
Boyce-Codd normal form, database design, decomposition, functional dependency, key, missing value, normal form, redundancy, synthesis, third normal form, updates
Journal
46
Issue
ISSN
Citations 
2
0362-5915
1
PageRank 
References 
Authors
0.35
0
2
Name
Order
Citations
PageRank
Ziheng Wei186.92
Sebastian Link246239.59