The Kelly criterion is really about the Central Limit Theorem


Kelly’s criterion: placing optimal bets

The Kelly criterion is meant to tell you how to make best use of useful-but-imperfect information. For example, suppose you’re able to place even-odds bets on the outcome of a coin flip. If you’re right, you double the money you bet, and if you’re wrong, you lose what you bet. To make things interesting, you know that it’s not a fair coin; it comes up heads with probability p=0.6p = 0.6. You’re going to get to play 100100 times. How should you place bets?

Well, suppose you start with bankroll BB and you bet some fraction ff each round. The fBfB you bet doubles with probability pp and disappears with probability 1p1-p, and the (1f)B(1-f)B you didn’t bet just sits there. So you end up with (1+f)B(1 + f)B with probability pp and (1f)B(1 - f)B with probability (p1)(p-1). The expected value is

p(1+f)B+(1p)(1f)B=(1+2pff)Bp(1+f)B + (1-p)(1-f)B = (1 + 2pf - f)B

Since 2p1=0.2>02p - 1 = 0.2 > 0, the bigger you can make ff, the larger this expected value. So you should bet as much as you possibly can! Let’s say you’re not allowed to take out a loan, so the most you can bet is everything, f=1.0f=1.0.

That’s not right …

Yes, it is. After 100100 rounds, you’ll almost certainly have lost all your money, but in the rare case that you haven’t (p1001022p^{100} \approx 10^{-22}), you’ll have multiplied your starting bankroll by a factor of (2p)100108(2p)^{100}\approx 10^8, an astronomical return! Deviating from this strategy really does reduce your expected return.

But if you don’t want to lose all your money, here’s an alternative perspective. Each round your bankroll either gets multiplied by 1+f1+f with probability pp or gets multiplied by 1f1-f with probability 1p1-p, so it’s natual to think about the log of our bankroll. If we bet a fraction ff as before, the expected log outcome is

logB+plog(1+f)+(1p)log(1f)\log B + p\log(1+f) + (1-p)\log(1-f)

To find the value of ff that maximizes this expected value, we find where the derivative with repsect to ff is zero: p1+f1p1f=0\frac{p}{1+f} - \frac{1-p}{1-f} = 0, or f=2p1\boxed{f = 2p - 1}. That’s the Kelly criterion value. In our example, you should bet 2p1=0.22p-1 = 0.2, i.e. 20%20\% of your bankroll each time.

Hey man, don’t tell me what my utility function is

When I first learned Kelly’s criterion, it bugged me that the argument is basically “haha, I guess maximizing expected result gives a wild result here, so let’s maximize expected log of the result this time.” The choice of a logarithmic utility function just seems arbitrary. The marginal value of money generally decreases as you have more of it, but there are lots of functions other than log you could use with that property. If you use a square root utility function, you get some other answer, and yet this answer is held up as objectively correct. What gives?

The log\log is about determining the distribution of outcomes

Using log of your bankroll as your utility function is a red herring. The log is doing something much more important: it allows you to apply the central limit theorem to determine the whole distribution of outcomes for a given betting fraction ff. In log space, when you bet a fraction ff of your bankroll, log(B)\log(B) increases by a random variable XX which does not depend on BB. Since each win or loss is independent, this means that the log of your bankroll after nn rounds will increase by Xtotal=X1+X2+XnX_\text{total} = X_1 + X_2 + \cdots X_n, where the XiX_i are independent and identically distributed.

Ordinarily, determining the distribution of a big sum of random variables like this is hard, but when they’re independent and identically distributed, the central limit theorem tells us that the distribution of XtotalX_\text{total} will be approximately normal with mean nμXn\mu_X and variance nσX2n\sigma^2_X. The real point of taking logs was to determine this distribution. If you bet a fraction ff of your bankroll, the mean and variance of XX are

μX=plog(1+f)+(1p)log(1f)σX2=p(1p)(log(1+f)log(1f))2\boxed{\begin{align*} \mu_X &= p\log(1+f) + (1-p)\log(1-f) \\ \sigma^2_X &= p(1-p)\bigl(\log(1+f) - \log(1-f)\bigr)^2 \end{align*}}

What good is the distribution of outcomes?

The distribution of outcomes does not make value judgments for you. If your utility function is logarithmic, go ahead and choose ff to maximize nμXn\mu_X, recovering the original Kelly criterion. If you have a different utility funciton, compute its expected value and choose ff to maximize that.

Knowing the full distribution of outcomes allows you to answer questions that don’t have to do with maximizing any utility function. For example, if you decide to go with the Kelly recommendation of betting f=0.2f=0.2, you may want to know “what’s the probability I’ll lose money after 100100 rounds?”

We can compute (using log base 1010 here) that μXtotal=100μX0.874\mu_{X_\text{total}} = 100\mu_X \approx 0.874 and σXtotal2=100σX20.744\sigma^2_{X_\text{total}} = 100\sigma^2_X \approx 0.744, so σXtotal0.863\sigma_{X_\text{total}} \approx 0.863. That means that by playing the Kelly strategy for 100100 rounds, the median outcome is that your bankroll will be multiplied by 100.8747.4910^{0.874} \approx 7.49, but losing money is only about one standard deviation away. You have a erfc(μXtotal/σXtotal)15%(\mu_{X_\text{total}}/\sigma_{X_\text{total}})\approx 15\% chance of losing money.

So … is the Kelly criterion right?

Strictly speaking, no, unless your utility function happens to be logarithmic. We can also quibble about what it means to be “right”. For example, the Kelly criterion will maximize your median result, and in the long run will in fact maximize any quantile, which looks pretty darn right. What more can you ask for? Well, if you really want to maximize your expected result, it’s wrong. If your utility function steps and saturates (e.g. your life depends on winning at least a certain amount, but the amount of the excess or deficit doesn’t matter), it’s wrong. But if you stop the analysis short of making a specific recommendation, you get something more valuable than a description of optimal play for the fellow with a logarithmic utility function: the distribution of results for any betting strategy. With that in hand, you can bring your own questions and values to compute what interests you.