Title
On the efficient construction of multislices from recurrences
Abstract
Recurrences are defined as sets of time instants associated with events and they are present in many application domains, including public transport schedules and personal calendars. Because of their large size, recurrences are rarely stored explicitly, but some form of compact representation is used. Multislices are a compact representation that is well suited for storage in relational databases. A multislice is a set of time slices where each slice employs a hierarchy of time granularities to compactly represent multiple recurrences. In this paper we investigate the construction of multislices from recurrences. We define the compression ratio of a multislice, show that different construction strategies produce multislices with different compression ratios, and prove that the construction of minimal multislices, i.e., multislices with a maximal compression ratio, is an NP-hard problem. We propose a scalable algorithm, termed LMerge, for the construction of multislices from recurrences. Experiments with real-world recurrences from public transport schedules confirm the scalability and usefulness of LMerge: the generated multislices are very close to minimal multislices, achieving an average compression ratio of approx. 99%. A comparison with a baseline algorithm that iteratively merges pairs of mergeable slices shows significant improvements of LMerge over the baseline approach.
Year
DOI
Venue
2010
10.1007/978-3-642-13818-8_5
SSDBM
Keywords
Field
DocType
compression ratio,public transport schedule,maximal compression ratio,average compression ratio,compact representation,different compression ratio,time slice,efficient construction,minimal multislices,time granularity,different construction strategy,relational database,public transport,np hard problem
Data mining,Approx,Relational database,Computer science,Bipartite graph,Algorithm,Schedule,Compression ratio,Scalable algorithms,Database,Scalability
Conference
Volume
ISSN
ISBN
6187
0302-9743
3-642-13817-9
Citations 
PageRank 
References 
0
0.34
12
Authors
3
Name
Order
Citations
PageRank
Romans Kasperovics192.59
Michael H. Böhlen21173179.40
Johann Gamper346554.06