Title
Constrained pattern matching
Abstract
Constrained sequences are strings satisfying certain additional structural restrictions (e.g., some patterns are forbidden). They find applications in communication, digital recording, and biology. In this article, we restrict our attention to the so-called (d,k) constrained binary sequences in which any run of zeros must be of length at least d and at most k, where 0≤ dk. In many applications, one needs to know the number of occurrences of a given pattern w in such sequences, for which we coin the term constrained pattern matching. For a given word w, we first estimate the mean and the variance of the number of occurrences of w in a (d,k) sequence generated by a memoryless source. Then we present the central limit theorem and large deviations results. As a by-product, we enumerate asymptotically the number of (d,k) sequences with exactly r occurrences of w, and compute Shannon entropy of (d,k) sequences with a given number of occurrences of w. We also apply our results to detect under- and overrepresented patterns in neuronal data (spike trains), which satisfy structural constraints that match the framework of (d,k) binary sequences. Throughout this article we use techniques of analytic combinatorics such as combinatorial calculus, generating functions, and complex asymptotics.
Year
DOI
Venue
2011
10.1145/1921659.1921671
ACM Transactions on Algorithms
Keywords
Field
DocType
neuronal spike trains,overrepresented pattern,central limit theorem,shannon entropy,autocorrelation polynomials,pattern w,binary sequence,languages,structural constraint,complex asymptotics,certain additional structural restriction,analytic combinatorics,pattern matching,word w,constrained sequences,satisfiability,generating function
Discrete mathematics,Combinatorics,Digital recording,Algorithm,Pattern matching,Mathematics,restrict,Binary number
Journal
Volume
Issue
ISSN
7
2
1549-6325
Citations 
PageRank 
References 
0
0.34
21
Authors
2
Name
Order
Citations
PageRank
Yongwook Choi1455.23
Wojciech Szpankowski21557192.33