Title
On the Longest Common Factor Problem
Abstract
The Longest Common Factor (LCF) of a set of strings is a well studied problem having a wide range of applications in Bioinformatics: from microarrays to DNA sequences analysis. This problem has been solved by Hui (2000) who uses a famous constant-time solution to the Lowest Common Ancestor (LCA) problem in trees coupled with use of suffix trees. A data structure for the LCA problem, although linear in space and construction time, introduces a multiplicative constant in both space and time that reduces the range of applications in many biological applications. In this article we present a new method for solving the LCF problem using the suffix tree structure with an auxiliary array that take space O(n). Our algorithm works in time O(n log a), where n is the total input size and a is the size of the alphabet. We also consider a different version of our algorithm that applies to DAWGs. In this case, we prove that the algorithm works in both time and space proportional to data DAWG's size.
Year
DOI
Venue
2008
10.1007/978-0-387-09680-3_10
International Federation for Information Processing
Keywords
Field
DocType
lowest common ancestor,dna sequence analysis,data structure
Data structure,Discrete mathematics,Combinatorics,Lowest common ancestor,Multiplicative function,Suffix,Spacetime,Suffix tree,Generalized suffix tree,Mathematics,Longest common substring problem
Conference
Volume
ISSN
Citations 
273
1571-5736
1
PageRank 
References 
Authors
0.36
12
4
Name
Order
Citations
PageRank
Maxime Crochemore12655281.75
A. Gabriele2243.37
Filippo Mignosi356999.71
Mauriana Pesaresi410.36