Title
Efficient Algorithms for Two Extensions of LPF Table: The Power of Suffix Arrays
Abstract
Suffix arrays provide a powerful data structure to solve several questions related to the structure of all the factors of a string. We show how they can be used to compute efficiently two new tables storing different types of previous factors (past segments) of a string. The concept of a longest previous factor is inherent to Ziv-Lempel factorization of strings in text compression, as well as in statistics of repetitions and symmetries. The longest previous reverse factor for a given position i is the longest factor starting at i, such that its reverse copy occurs before, while the longest previous non-overlapping factor is the longest factor v starting at i which has an exact copy occurring before. The previous copies of the factors are required to occur in the prefix ending at position i 驴 1. We design algorithms computing the table of longest previous reverse factors (LPrF table) and the table of longest previous non-overlapping factors (LPnF table). The latter table is useful to compute repetitions while the former is a useful tool for extracting symmetries. These tables are computed, using two previously computed read-only arrays (SUF and LCP) composing the suffix array, in linear time on any integer alphabet. The tables have not been explicitly considered before, but they have several applications and they are natural extensions of the LPF table which has been studied thoroughly before. Our results improve on the previous ones in several ways. The running time of the computation no longer depends on the size of the alphabet, which drops a log factor. Moreover the newly introduced tables store additional information on the structure of the string, helpful to improve, for example, gapped palindrome detection and text compression using reverse factors.
Year
DOI
Venue
2010
10.1007/978-3-642-11266-9_25
conference on current trends in theory and practice of informatics
Keywords
Field
DocType
suffix arrays,longest previous factor,lpf table,previous factor,longest factor v,longest previous reverse factor,suffix array,longest previous non-overlapping factor,previous copy,longest factor,text compression,efficient algorithms,linear time,data structure
Integer,Data structure,Suffix,Computer science,Algorithm,Arithmetic,Suffix array,Factorization,Longest repeated substring problem,Time complexity,Longest common substring problem,Distributed computing
Conference
Volume
ISSN
Citations 
5901
0302-9743
8
PageRank 
References 
Authors
0.50
13
5
Name
Order
Citations
PageRank
Maxime Crochemore12655281.75
Costas S. Iliopoulos21534167.43
Marcin Kubica362929.26
Wojciech Rytter42290181.52
Tomasz Waleń570639.62