Realized Kernel Estimator Under Microstructure Noise

Time Series · Hard · Free problem

Observed tick prices satisfy $Y_{t_i} = X_{t_i} + \epsilon_{t_i}$, where $X$ is an Ito semimartingale (the efficient price) and $\epsilon_{t_i}$ is i.i.d. mean-zero noise with variance $\omega^2$ (market microstructure noise from bid-ask bounce, rounding, etc.).

You have $n+1$ equally spaced observations $Y_{t_0}, Y_{t_1}, \ldots, Y_{t_n}$ over $[0, T]$.

Define a realized-kernel (RK) variance estimator using a symmetric, non-negative-definite kernel function $k(\cdot)$ and bandwidth $H$.

State the conditions under which this estimator is consistent for the integrated variance $IV = \int_0^T \sigma_s^2 \, ds$ as $n \to \infty$, in terms of $H$ and $H/n$.

For the Parzen kernel specifically, give the asymptotically optimal order of $H$ in $n$, and explain why the realized kernel is robust to noise while naive realized variance is not.

Hints

Think about what happens to the sum of squared returns when prices contain i.i.d. noise -- how does the expected value of $RV_n$ depend on $n$ and $\omega^2$?
The noise is i.i.d., so its autocovariance structure is very simple: nonzero only at lags 0 and $\pm 1$. A kernel that includes these lags with appropriate weights can cancel the noise bias.
For optimal bandwidth, balance the squared bias (which grows with $H/n$) against the variance (also of order $H/n$). The Parzen kernel's smoothness gives a specific rate -- set $H \propto n^{3/5}$ to minimize MSE.

Worked Solution

How to Think About It: The core tension is this: you want to estimate the integrated variance of the efficient price $X$, but you only observe noisy prices $Y$. Naive realized variance (sum of squared returns) does not converge to $IV$ -- instead it blows up, because the noise contributes a term proportional to

Intuition

The realized kernel is one of the most important tools in high-frequency econometrics, and the intuition behind it is surprisingly simple. Naive realized variance fails at high frequency because market microstructure noise (bid-ask bounce, price discreteness, latency) adds spurious variation that grows linearly with the number of observations. The genius of the realized kernel is recognizing that the noise is essentially uncorrelated across time, so its fingerprint shows up only at lag 0 and lag 1 in the autocovariance function. By constructing a weighted sum of autocovariances -- giving full weight at lag 0 and smoothly decaying weights at higher lags -- you can subtract off exactly the noise contribution while retaining the signal.

The bandwidth $H$ controls this trade-off: too small and you do not include enough lags to cancel the noise; too large and each autocovariance estimate becomes noisy itself (and you introduce bias from the signal's own autocovariance structure). The optimal $H \sim n^{3/5}$ is a bias-variance sweet spot. In practice, this is why quant desks do not just "sample at 5-minute bars" to avoid noise -- they use kernel or pre-averaging estimators on tick data and extract far more information. The $n^{-1/5}$ convergence rate, while slower than the classical $n^{-1/2}$, is the best you can do without modeling the noise parametrically, and it is the theoretical foundation behind production volatility estimation at firms that trade at high frequency.