Defining an Alpha Signal
You observe daily close-to-close returns $r_i(t)$ for assets $i \in \{1, \ldots, m\}$ at times $t \in \mathbb{Z}$. Define what you mean by an "alpha" signal $s_i(t)$ intended to predict future returns.
Your definition must be precise and complete. Specifically, address all three of the following:
- Prediction horizon: Over what future window does $s_i(t)$ predict? How do you aggregate returns over that window?
- Portfolio construction rule: How do you translate signal values $s_i(t)$ into portfolio weights $w_i(t)$? What constraints do you impose?
- Excess/benchmark return: What does "predicting returns" mean -- raw returns, market-adjusted, factor-adjusted? How do you measure whether $s_i(t)$ is actually working?
Hints
- An alpha signal definition is incomplete unless it specifies what return it is predicting -- raw or benchmark-adjusted, over what horizon. Start there.
- The standard formalism is a cross-sectional regression of $h$-day excess returns on the signal at each time $t$: $\tilde{R}_i(t,h) = \alpha + \gamma s_i(t) + \varepsilon_i(t)$. Alpha exists if $\hat{\gamma}$ is consistently positive.
- For the portfolio rule, normalize the signal cross-sectionally to zero mean and unit variance, then set weights proportional to the normalized signal -- this ensures dollar neutrality and makes the IC a natural performance measure.
Worked Solution
How to Think About It: The word "alpha" gets used loosely in practice -- a discretionary PM means something different from a stat arb quant. The question is asking you to nail down the formal definition so that you could actually test whether a signal has alpha. That requires pinning down three things before you write a single regression: what you are predicting (the target return, including horizon and benchmark), how the signal maps to positions, and how you evaluate predictive power. If you leave any of these vague, your backtest is not well-defined. This is a design question, not a computation -- the right answer is a clean, testable definition.
Key Insight: Alpha is not a property of a signal alone -- it is a property of a (signal, target, portfolio rule) triple. Two traders can use the same raw signal and reach opposite conclusions about whether it has alpha, simply because they defined the prediction target differently.
The Method:
1. Fix the prediction horizon $h$. Define the $h$-day forward return for asset $i$ at time $t$: $R_i(t, h) = \sum_{\tau=1}^{h} r_i(t + \tau)$ or equivalently in log-return form $R_i(t,h) = \log(P_i(t+h)/P_i(t))$. For daily signals, $h = 1$ is common in high-frequency stat arb; $h = 5$ or $h = 21$ are typical for medium-frequency strategies. The choice of $h$ defines the signal's "shelf life" and determines turnover.
2. Define the benchmark/excess return. Raw returns contain systematic risk (market beta, sector tilts, factor exposures) that has nothing to do with your signal's predictive content. Strip these out by defining the excess return: $\tilde{R}_i(t, h) = R_i(t, h) - \beta_i^\top f(t, h)$ where $f(t,h)$ is a vector of factor returns (e.g., Fama-French, Barra) over the same window and $\beta_i$ is asset $i