Expected Value of an Optimal Stopping Card Game

Expectation · Hard · Free problem

You have a deck of $n$ cards. Each card's value is drawn independently and uniformly from the set $\{-1, +1, +2\}$. You turn the cards over one at a time. After seeing a card you may keep it (adding its value to your running gain and then deciding again on the next card) or stop and walk away with nothing more from this point. Because you can always walk away, a run of bad cards never drags your gain below $0$: this is a keep-or-discard game whose value is floored at $0$. You play to maximize expected payoff. Write $V_n$ for the value of the game with $n$ cards remaining, measured as the expected gain from this point starting from $0$.

Solve the problem for a deck of $n = 1$ card.
Solve the problem for a deck of $n = 2$ cards.
Generalize: write a recursive formula relating $V_n$ to $V_{n-1}$, the value with one fewer card remaining.

Hints

Treat each card as a decision: keep it and play on, or discard and walk away with nothing more from here. Since you can always walk away, the gain from any point on is worth at least $0$. Work backwards from the last card.
Let $V_n$ be the value with $n$ cards left, measured from a fresh $0$. After a draw $x$ you take $\max(x + V_{n-1},\, 0)$ -- continue or stop -- then average over the three card values.
Start from $V_0 = 0$, so $V_1 = \frac{1}{3}\sum_x \max(x, 0) = 1$. Then $V_2 = \frac{1}{3}\sum_x \max(x + 1, 0) = 5/3$, and in general $V_n = \frac{1}{3}\sum_{x \in \{-1,1,2\}} \max(x + V_{n-1}, 0)$.

Worked Solution

How to Think About It: This is an optimal-stopping problem dressed up as a card game. After each card you face one decision: keep it and carry on, or stop and take nothing more from this point. Because stopping is always available, the gain from any point forward is worth at least $0$ -- you never have to absorb a bad run. The right object is the value of "$n$ cards left, measured from a fresh $0

Loading problems...

Intuition

This is a miniature optimal-stopping problem, the structure behind every position with embedded optionality: exercise now (take the payoff) or wait for something better. Backward induction -- start at the end, compute the continuation value, and roll backwards -- is the universal tool. The interview signal is recognizing that the option to stop (the floor at $0$) gives the game positive value even though some draws are negative, and that the continuation value $V_{n-1}$ acts as a dynamic keep-or-discard threshold on the current draw.

The deeper lesson is option value. Even though a card averages /3$, the game is worth MORE than $n \cdot 2/3$ for small $n$ because discarding the worst draws removes downside at no cost -- the same reason an American option dominates its European twin, since early exercise (here, stopping) is an extra source of value. As $n$ grows, the positive drift dominates and $V_n$ approaches $n \cdot 2/3$ from above.