Title
On the Randomness of Compressed Data
Abstract
It seems reasonable to expect from a good compression method that its output should not be further compressible, because it should behave essentially like random data. We investigate this premise for a variety of known compression techniques, and find that, surprisingly, there is much variability in the randomness, depending on the chosen method. Arithmetic coding seems to produce perfectly random output, whereas that of Huffman or Ziv-Lempel coding still contains many dependencies. In particular, the output of Huffman coding has already been proven to be random under certain conditions, and we show here that arithmetic coding may produce an output that is identical to that of Huffman.
Year
DOI
Venue
2019
10.1109/DCC.2019.00093
2019 Data Compression Conference (DCC)
Keywords
Field
DocType
Randomness,Kullback Leibler,Lossless Compression
Computer science,Theoretical computer science,Randomness
Conference
ISSN
ISBN
Citations 
1068-0314
978-1-7281-0658-8
0
PageRank 
References 
Authors
0.34
0
2
Name
Order
Citations
PageRank
Shmuel T. Klein143477.80
Dana Shapira214432.15