Title
Characterization and extraction of irredundant tandem motifs
Abstract
We address the problem of extracting pairs of subwords (m1,m2) from a text string s of length n, such that, given also an integer constant d in input, m1 and m2 occur in tandem within a maximum distance of d symbols in s. The main effort of this work is to eliminate the possible redundancy from the candidate set of the so found tandem motifs. To this aim, we first introduce the concept of maximality, characterized by four specific conditions, that we show to be not deducible by the corresponding notion of maximality already defined for "simple" (i.e., non tandem) motifs. Then, we further eliminate the remaining redundancy by defining the concept of irredundancy for tandem motifs. We prove that the number of non-overlapping irredundant tandems is O(d2n) which, considering d as a constant, leads to a linear number of tandems in the length of the input string. This is an order of magnitude less than previously developed compact indexes for tandem extraction. As a further contribution we show an algorithm to extract this compact irredundant index.
Year
DOI
Venue
2012
10.1007/978-3-642-34109-0_41
SPIRE
Keywords
Field
DocType
non tandem,irredundant tandem,irredundant tandem motif,length n,tandem motif,input string,possible redundancy,compact irredundant index,compact index,tandem extraction,linear number
Integer,Discrete mathematics,Tandem,Combinatorics,Redundancy (engineering),Order of magnitude,Mathematics
Conference
Volume
ISSN
Citations 
7608
0302-9743
5
PageRank 
References 
Authors
0.42
18
3
Name
Order
Citations
PageRank
Laxmi Parida177377.21
Cinzia Pizzi213915.73
Simona E. Rombo319222.21