Title
The subsequence composition of a string
Abstract
Words that appear as constrained subsequences in a text-string are considered as possible indicators of the host string structure, hence also as a possible means of sequence comparison and classification. The constraint consists of imposing a bound on the number @w of positions in the text that may intervene between any two consecutive characters of a subsequence. A subset of such @w-sequences is then characterized that consists, in intuitive terms, of sequences that could not be enriched with more characters without losing some occurrence in the text. A compact spatial representation is then proposed for these representative sequences, within which a number of parameters can be defined and measured. In the final part of the paper, such parameters are empirically analyzed on a small collection of text-strings endowed with various degrees of structure.
Year
DOI
Venue
2009
10.1016/j.tcs.2009.07.017
Theor. Comput. Sci.
Keywords
DocType
Volume
intuitive term,Suffix graph,subsequence composition,small collection,String complexity measures,sequence comparison,possible indicator,consecutive character,final part,representative sequence,Core equivalence classes,Special subsequences,compact spatial representation,possible mean,host string structure,Constrained subsequences
Journal
410
Issue
ISSN
Citations 
43
Theoretical Computer Science
1
PageRank 
References 
Authors
0.35
8
3
Name
Order
Citations
PageRank
Alberto Apostolico11441182.20
Fabio Cunial2729.68
MA HA310.35