Title | ||
---|---|---|
Estimating small frequency moments of data stream: a characteristic function approach |
Abstract | ||
---|---|---|
A data stream is viewed as a sequence of $M$ updates of the form
$(\text{index},i,v)$ to an $n$-dimensional integer frequency vector $f$, where
the update changes $f_i$ to $f_i + v$, and $v$ is an integer and assumed to be
in $\{-m, ..., m\}$. The $p$th frequency moment $F_p$ is defined as
$\sum_{i=1}^n \abs{f_i}^p$. We consider the problem of estimating $F_p$ to
within a multiplicative approximation factor of $1\pm \epsilon$, for $p \in
[0,2]$. Several estimators have been proposed for this problem, including
Indyk's median estimator \cite{indy:focs00}, Li's geometric means estimator
\cite{pinglib:2006}, an \Hss-based estimator \cite{gc:random07}. The first two
estimators require space $\tilde{O}(\epsilon^{-2})$, where the $\tilde{O}$
notation hides polylogarithmic factors in $\epsilon^{-1}, m, n$ and $M$.
Recently, Kane, Nelson and Woodruff in \cite{knw:soda10} present a
space-optimal and novel estimator, called the log-cosine estimator. In this
paper, we present an elementary analysis of the log-cosine estimator in a
stand-alone setting. The analysis in \cite{knw:soda10} is more complicated. |
Year | Venue | Keywords |
---|---|---|
2010 | Clinical Orthopaedics and Related Research | characteristic function,data structure,geometric mean |
Field | DocType | Volume |
Integer,Discrete mathematics,Frequency moments,Combinatorics,Multiplicative function,Characteristic function (probability theory),Data stream,Geometric mean,Mathematics,Estimator | Journal | abs/1005.1 |
Citations | PageRank | References |
0 | 0.34 | 7 |
Authors | ||
2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Sumit Ganguly | 1 | 813 | 236.01 |
Purushottam Kar | 2 | 379 | 22.55 |