The Kelly criterion is really about the Central Limit Theorem
Kelly’s criterion: placing optimal bets
The Kelly criterion is meant to tell you how to make best use of useful-but-imperfect information. For example, suppose you’re able to place even-odds bets on the outcome of a coin flip. If you’re right, you double the money you bet, and if you’re wrong, you lose what you bet. To make things interesting, you know that it’s not a fair coin; it comes up heads with probability . You’re going to get to play times. How should you place bets?
Well, suppose you start with bankroll and you bet some fraction each round. The you bet doubles with probability and disappears with probability , and the you didn’t bet just sits there. So you end up with with probability and with probability . The expected value is
Since , the bigger you can make , the larger this expected value. So you should bet as much as you possibly can! Let’s say you’re not allowed to take out a loan, so the most you can bet is everything, .
That’s not right …
Yes, it is. After rounds, you’ll almost certainly have lost all your money, but in the rare case that you haven’t (), you’ll have multiplied your starting bankroll by a factor of , an astronomical return! Deviating from this strategy really does reduce your expected return.
But if you don’t want to lose all your money, here’s an alternative perspective. Each round your bankroll either gets multiplied by with probability or gets multiplied by with probability , so it’s natual to think about the log of our bankroll. If we bet a fraction as before, the expected log outcome is
To find the value of that maximizes this expected value, we find where the derivative with repsect to is zero: , or . That’s the Kelly criterion value. In our example, you should bet , i.e. of your bankroll each time.
Hey man, don’t tell me what my utility function is
When I first learned Kelly’s criterion, it bugged me that the argument is basically “haha, I guess maximizing expected result gives a wild result here, so let’s maximize expected log of the result this time.” The choice of a logarithmic utility function just seems arbitrary. The marginal value of money generally decreases as you have more of it, but there are lots of functions other than log you could use with that property. If you use a square root utility function, you get some other answer, and yet this answer is held up as objectively correct. What gives?
The is about determining the distribution of outcomes
Using log of your bankroll as your utility function is a red herring. The log is doing something much more important: it allows you to apply the central limit theorem to determine the whole distribution of outcomes for a given betting fraction . In log space, when you bet a fraction of your bankroll, increases by a random variable which does not depend on . Since each win or loss is independent, this means that the log of your bankroll after rounds will increase by , where the are independent and identically distributed.
Ordinarily, determining the distribution of a big sum of random variables like this is hard, but when they’re independent and identically distributed, the central limit theorem tells us that the distribution of will be approximately normal with mean and variance . The real point of taking logs was to determine this distribution. If you bet a fraction of your bankroll, the mean and variance of are
What good is the distribution of outcomes?
The distribution of outcomes does not make value judgments for you. If your utility function is logarithmic, go ahead and choose to maximize , recovering the original Kelly criterion. If you have a different utility funciton, compute its expected value and choose to maximize that.
Knowing the full distribution of outcomes allows you to answer questions that don’t have to do with maximizing any utility function. For example, if you decide to go with the Kelly recommendation of betting , you may want to know “what’s the probability I’ll lose money after rounds?”
We can compute (using log base here) that and , so . That means that by playing the Kelly strategy for rounds, the median outcome is that your bankroll will be multiplied by , but losing money is only about one standard deviation away. You have a erfc chance of losing money.
So … is the Kelly criterion right?
Strictly speaking, no, unless your utility function happens to be logarithmic. We can also quibble about what it means to be “right”. For example, the Kelly criterion will maximize your median result, and in the long run will in fact maximize any quantile, which looks pretty darn right. What more can you ask for? Well, if you really want to maximize your expected result, it’s wrong. If your utility function steps and saturates (e.g. your life depends on winning at least a certain amount, but the amount of the excess or deficit doesn’t matter), it’s wrong. But if you stop the analysis short of making a specific recommendation, you get something more valuable than a description of optimal play for the fellow with a logarithmic utility function: the distribution of results for any betting strategy. With that in hand, you can bring your own questions and values to compute what interests you.