Title
MDL estimation for small sample sizes and its application to segmenting binary strings
Abstract
Minimum Description Length (MDL) estimation has proven itself of major importance in a large number of applications many of which are in the fields of computer vision and pattern recognition. A problem is encountered in applying the associated formulas, however, especially those associated with model cost. This is because most of these are asymptotic forms appropriate only for large sample sizes. J. Rissanen has recently derived sharper code-length formulas valid for much smaller sample sizes. Because of the importance of these results, it is our intent here to present a tutorial description of them. In keeping with this goal we have chosen a simple application whose relative tractability allows it to be explored more deeply than most problems: the segmentation of binary strings based on a piecewise Bernoulli assumption. By that we mean that the strings are assumed to be divided into substrings, the bits of which are assumed to have been generated by a single (within a substring) Bernoulli source.
Year
DOI
Venue
1997
10.1109/CVPR.1997.609336
CVPR
Keywords
Field
DocType
minimum description length,large sample size,smaller sample size,mdl estimation,binary string,piecewise bernoulli assumption,associated formula,large number,bernoulli source,asymptotic form,small sample size,j. rissanen,major importance,pattern recognition,sample size,computational complexity,image segmentation,stochastic processes,application software,computer vision,encoding,unsupervised learning
Substring,Pattern recognition,Computer science,Segmentation,Minimum description length,Image segmentation,Artificial intelligence,Sample size determination,Piecewise,Computational complexity theory,Bernoulli's principle
Conference
Volume
Issue
ISSN
1997
1
1063-6919
ISBN
Citations 
PageRank 
0-8186-7822-4
3
2.32
References 
Authors
0
1
Name
Order
Citations
PageRank
Byron Dom12600825.93