Jump-Robust Volatility Estimation with Bipower Variation

Stochastic Processes · Hard · Free problem

Suppose the efficient log-price $X_t$ follows a semimartingale of the form

$dX_t = \mu_t \, dt + \sigma_t \, dW_t + J_t \, dN_t$

where $\sigma_t$ is a stochastic volatility process, $W_t$ is a Brownian motion, and $N_t$ is a finite-activity jump process (Poisson with intensity $\lambda$) with jump sizes $J_t$. You observe the price at $n$ equally spaced times over $[0, 1]$ (one trading day), so the sampling interval is $\Delta = 1/n$. There is no microstructure noise.

Define the log-returns $r_i = X_{i\Delta} - X_{(i-1)\Delta}$ for $i = 1, \ldots, n$.

Define realized variance $RV_n$ and bipower variation $BV_n$. Show that $BV_n$ converges in probability to integrated variance $IV = \int_0^1 \sigma_t^2 \, dt$ even when jumps are present, while $RV_n$ does not.

Using $RV_n$ and $BV_n$, construct a test statistic for the null hypothesis $H_0$: no jumps occurred during the day. State the asymptotic distribution of your test under $H_0$ and explain how you would set the rejection threshold for a given significance level $\alpha$.

Describe a practical procedure for estimating daily integrated variance $IV$ and jump variation $JV$ from one day of tick data, including how you handle the test outcome from part (2).

Hints

Think about what happens when you multiply two adjacent absolute returns: if a jump hits one of them, what is the order of magnitude of that product as sampling frequency increases?
The key identity is $E[|Z|] = \sqrt{2/\pi}$ for $Z \sim N(0,1)$. Use this to find the right scaling constant so that the sum of products of adjacent absolute returns targets $\int_0^1 \sigma_t^2 \, dt$.
For the jump test, consider the difference $RV_n - BV_n$: it converges to zero under no jumps and to $\sum (\Delta X_s)^2 > 0$ under jumps. Standardize this difference using a jump-robust estimator of integrated quarticity to get an asymptotically standard normal test statistic.

Worked Solution

How to Think About It: The core issue is that realized variance -- the sum of squared returns -- captures both continuous volatility and jumps. If you are a trader or risk manager, you need to separate the two: integrated variance tells you about diffusive risk (which you can hedge), while jump variation tells you about tail risk (which you mostly cannot). Bipower variation is the clever workaround: by multiplying adjacent absolute returns instead of squaring each return, you exploit the fact that jumps are rare (finite activity). A jump can hit one return, but it is extremely unlikely to hit two consecutive returns. So the product of neighbors is robust to jumps.

Part (i): Definitions and Consistency

Realized variance is the sum of squared returns:

$RV_n = \sum_{i=1}^{n} r_i^2$

By quadratic variation theory, as $n \to \infty$:

$RV_n \xrightarrow{p} \int_0^1 \sigma_t^2 \, dt + \sum_{s \leq 1} (\Delta X_s)^2 = IV + JV$

So $RV_n$ is contaminated by jumps -- it converges to integrated variance *plus* the sum of squared jump sizes.

Bipower variation replaces squaring with the product of adjacent absolute returns:

$BV_n = \frac{\pi}{2} \sum_{i=2}^{n} |r_i| \cdot |r_{i-1}|$

The scaling factor $\pi/2$ comes from the fact that for a standard normal $Z$, we have $E[|Z|] = \sqrt{2/\pi}$, so $E[|Z|]^2 = 2/\pi$, and multiplying by $\pi/2$ corrects the bias.

To see why $BV_n$ is jump-robust, consider what happens at high frequency. Each return $r_i \approx \sigma_{i\Delta} \sqrt{\Delta} \, Z_i + J_i \mathbf{1}_{\{\text{jump at } i\}}$ where $Z_i$ are approximately i.i.d. standard normals. Since $N_t$ has finite activity, the probability that two consecutive returns both contain jumps is $O(\Delta^2) \to 0$. So in the product $|r_i| \cdot |r_{i-1}|$, at most one factor can be inflated by a jump. The inflated factor is $O(1)$ (jump size), but the non-jump factor is $O(\sqrt{\Delta})$, so the product involving a jump is $O(\sqrt{\Delta})$. There are at most finitely many such terms (finite activity), so the total contribution of jump-contaminated terms is $O(\sqrt{\Delta}) \to 0$.

For the non-jump terms, by the law of large numbers:

$BV_n \approx \frac{\pi}{2} \sum_{i=2}^{n} \sigma_{i\Delta} \sqrt{\Delta} |Z_i| \cdot \sigma_{(i-1)\Delta} \sqrt{\Delta} |Z_{i-1}| = \frac{\pi}{2} \Delta \sum_{i=2}^{n} \sigma_{i\Delta} \sigma_{(i-1)\Delta} |Z_i| |Z_{i-1}|$

Since $Z_i$ and $Z_{i-1}$ are independent and $E[|Z|] = \sqrt{2/\pi}$, each $|Z_i| |Z_{i-1}|$ has expectation

Intuition

The fundamental insight behind bipower variation is an elegant exploitation of rarity. Finite-activity jumps are, by definition, rare -- they happen at isolated time points. At high frequency, consecutive returns almost never both contain jumps. So when you multiply adjacent absolute returns, any jump inflates one factor but the other factor remains $O(\sqrt{\Delta})$, making the product negligible. Squaring a single return, by contrast, amplifies the jump to $O(1)$. This is why realized variance sees jumps and bipower variation does not.

This separation of continuous and discontinuous risk is fundamental in modern market microstructure and risk management. Diffusive volatility is the risk you can delta-hedge; jump risk is what blows up your book. A trader who confuses the two will systematically underestimate tail risk on quiet days (attributing all variation to diffusion) and overestimate hedgeable risk on jumpy days. The Barndorff-Nielsen and Shephard framework gives you a principled, testable way to decompose total variation into its hedgeable and unhedgeable components -- and the entire machinery runs off a single day of tick data without any parametric model assumptions.