Daily Return Confidence Interval from Annual Data

Statistics · Hard · Free problem

You have $N$ years of simple annual returns $\{R_1, R_2, \ldots, R_N\}$ with sample mean $\bar{R}_Y$ and sample variance $s_Y^2$. You want to build a 95% confidence interval for the true daily mean return $\mu_d$.

Derive the confidence interval under three increasingly realistic assumptions:

I.I.D. daily returns: Daily log returns are independent and identically distributed with constant variance.
Serial correlation with known long-run variance: Daily returns are autocorrelated, but you know the long-run variance $\Omega$ (or have an estimate of it, e.g., from a Newey-West estimator).
Stochastic volatility: Daily variance is time-varying.

In each case, state your assumptions explicitly and show how to map annual statistics to daily statistics using aggregation identities.

Hints

The answer is always in $[1, n+1]$; you only care about elements in $[1, n]$. This observation is the key to reducing the problem.
Under i.i.d. daily returns, variances aggregate linearly: $\sigma_Y^2 = 252 \sigma_d^2$. Invert this to convert annual statistics to daily.
Under autocorrelation, replace $\sigma_d^2$ with the long-run variance $\Omega = \sigma_d^2(1 + 2\sum_k \rho_k)$. Positive autocorrelation widens the CI; mean reversion narrows it.

Worked Solution

How to Think About It: The challenge is unit conversion: you observed returns at annual frequency but want to make inference about a daily parameter. The conversion depends critically on how daily returns aggregate to annual returns -- and that depends on the dependence structure. Under i.i.d., variances add linearly and the conversion is clean. Under autocorrelation, variances add with cross-terms (the long-run variance is larger or smaller than the simple sum). Under stochastic volatility, even the variance of annual variance is uncertain, widening the CI further.

A useful sanity check: with $N = 20$ years of annual data and roughly 252 trading days per year, you have an effective sample of

Intuition

The depressing practical takeaway from this problem is how little statistical power annual return data provides. Even with 20 years of data, the 95% CI for the daily mean return under i.i.d. assumptions spans roughly $\pm 0.025\%$ per day -- which translates to $\pm 6\%$ annualized. Almost no trading strategy can be confidently distinguished from a zero-mean process using return data alone. This is why practitioners obsess over Sharpe ratios and use much higher-frequency data when available.

The three cases also teach a progressive lesson about model risk. Case (i) is the textbook answer. Case (ii) is what any experienced time-series econometrician would do. Case (iii) is the honest answer for equities, where volatility clustering is pervasive. Each step widens the CI -- and the actual confidence interval for a daily mean return estimate is usually much wider than naive calculations suggest. In practice, the standard error of the mean is dominated by volatility, not sample size, which is why mean estimation is so much harder than volatility estimation.