Overround: The Bookmaker's Bank Fee
/I was updating the MAFL Fund Performance page last evening and found myself pondering the topic of bookmaker overround.
Read MoreI was updating the MAFL Fund Performance page last evening and found myself pondering the topic of bookmaker overround.
Read MoreIn a recent blog I developed an empirical model of AFL scoring in which I assumed that the Scoring Shots generated by Home and Away teams could be modelled by a bivariate Negative Binomial and that the conversion of these shots into Goals could be modelled by Beta Binomials.
Read MoreIn the previous blog on this topic I posited that the Scoring Shot production of a team could be modelled as a Poisson random variable with some predetermined mean, and that the conversion of these Scoring Shots into Goals could be modelled as a BetaBinomial with fixed conversion probability and theta (a spread parameter).
Read MoreIn this blog I'm seeking to answer a single question: how are a team's subsequent head-to-head bookmaker prices affected by the returns they've provided to head-to-head wagering on them in recent weeks? More succinctly, how much less can you expect to make wagering on recent winners and how much more on recent losers?
Read MoreEinstein once said that "No problem can be solved from the same level of consciousness that created it". In a similar spirit - but with, regrettably and demonstrably, a mere fraction of the intellect - I find that there's something deeply satisfying about discovering that an approach to a problem you've been using can be characterised as a special case of a more general approach.
Read MoreIf you're making probability assessments one of the things you almost certainly want them to be is well-calibrated, and we know both from first-hand experience and a variety of analyses here on MatterOfStats over the years that the TAB Bookmaker is all of that.
Well he is, at least, well-calibrated as far as I can tell. His actual probability assessments aren't directly available but must, instead, be inferred from his head-to-head prices and I've come up with three ways of making this inference, using an Overround-Equalising, Risk-Equalising or an LPSO-Optimising approach.
Read MoreThe last few months have been a generally reflective time for me, and with my decision to leave unchanged the core of MAFL algorithms for 2014 I've been focussing some of that reflection on the eight full seasons I've now spent analysing and predicting AFL results.
Read MoreIn the previous blog I described a general framework for thinking about Bookmaker overround.
There I discussed, in the context of the two-outcome case, the choice of a functional form to describe a team's overround in terms of its true probability as assessed by the Bookmaker. As one very simple example I suggested oi = (1-pi), which we could use to model a Bookmaker who embeds overround in the price of any team by setting it to 1 minus the team's assessed probability of victory.
Whilst we could choose just about any function, including the one I've just described, for the purpose of modelling Bookmaker overround, choices that fit empirical reality are actually, I now realise, quite circumscribed. This is because of the observable fact that the total overround in any head-to-head market, T, appears to be constant, or close to it, in every game regardless of the market prices, and hence the underlying true probability assessments, of the teams involved. In other words, the total overround in the head-to-head market when 1st plays last is about the same as when 1st plays 2nd.
So, how does this constrain our choice of functional form? Well we know that T is defined as 1/m1 + 1/m2 - 1, where mi is that market price for team i, and that mi = 1/(pi(1+oi)), from which we can determine that:
If T is to remain constant across the full range of values of p1 then, we need the derivative with respect to p1 of the RHS of this equation to be zero for all values of p1. This implies that the functions chosen for o1 and o2 must satisfy the following equality:
I doubt that many functional forms o1 and o2 (both of which we're assuming are functions of p1, by the way) exist that will satisfy this equation for all values of p1, especially if we also impose the seemingly reasonable constraint that o1 and o2 be of equivalent form, albeit it that o1 might be expressed in terms of p1 and o2 in terms of (1-p1), which we can think of as p2.
Two forms that do satisfy the equation, the proof of which I'll leave as an exercise for any interested reader to check, are:
There may be another functional form that satisfies the equality above, but I can't find it. (There's a rigorous non-existence proof for you.) Certainly oi = 1 - pi, which was put forward earlier, doesn't satisfy it, and I can postulate a bunch of other plausible functional forms that similarly fail. What you find when you use these forms is that total overround changes with the value of p1.
So, if we want to choose functions for o1 and o2 that produce results consistent with the observed reality that total overround remains constant across all values of the assessed true probability of the two teams it seems that we've only three options (maybe four):
A fourth, largely unmodellable option would be that he simultaneously sets the market prices of both teams so that they together produce a market with the desired total overround while accounting for his assessment of the two team's victory probabilities so that a wager on either team has negative expectation. He does this, we'd assume, without employing a pre-determined functional form for the relationship between overround and probability for either team.
If these truly are the only logical options available to the Bookmaker then MAFL, it turns out, is already covering the complete range since we track the performance of a Predictor that models its probability assessments by following an Overround-Equalising approach, of another Predictor that does the same using a Risk-Equalising approach, and of a third (Bookie_LPSO) that pursues a strategy consistent with the third option above. That's serendipitously neat and tidy.
The only area for future investigation would be then to seek a solution superior to LPSO for the third approach described above. Here we could use any of the functional forms I listed in the previously blog, but could only apply them to the determination of the overround for one of the teams - say the home team or the favourite - with the price and hence overround for the remaining team determined by the need to produce a market with some pre-specified total overround.
That's enough for today though ...
MAFL is a website for ...