Title
Distribution-aware compressed full-text indexes
Abstract
In this paper we address the problem of building a compressed self-index that, given a distribution for the pattern queries and a bound on the space occupancy, minimizes the expected query-time within that index-space bound. We solve this problem by exploiting a reduction to the problem of finding a minimum weight K-link path in a particular Directed Acyclic Graph. Interestingly enough, our solution is independent of the underlying compressed index in use. Our experiments compare this optimal strategy with several other standard approaches, showing its effectiveness in practice.
Year
DOI
Venue
2013
10.1007/s00453-013-9782-3
Algorithmica
Keywords
DocType
Volume
Full-text indexing,Compressed full-text indexes,Succinct data structures,Dynamic programming
Journal
67
Issue
ISSN
Citations 
4
0178-4617
1
PageRank 
References 
Authors
0.35
20
3
Name
Order
Citations
PageRank
Paolo Ferragina12220130.64
Jouni Sirén222214.85
Rossano Venturini343531.07