Measuring the Surprise in a Season's Results

In the previous blog we looked at the average level of surprisals generated by teams and by team pairings across all of VFL/AFL history and during the most-recent seasons. Today, as promised in that blog, I'm going to analyse surprisals using the same general methodology, but by season.
Read More

The Value of MARS Ratings Points Across Time

As we look at MARS Ratings across the expanse of VFL/AFL history, one thing we might want to consider is whether the value of a Rating Point has remained stable across those 13 decades. I've assumed, implicitly, that this is the case in the interpretations I've made of recent analyses, for example about the game's greatest upsets, but there's no a priori reason for this to be the case.
Read More

Which Teams Fare Better as Favourites?

In this blog, the next is a series in which I've been exploring the all-time MARS Ratings I created for every game from the start of 1897 to the end of the 2012 season, I'll be looking at how well each team has performed depending on the relative strength of its opponent, as measured by their MARS Rating. So, for example, we'll consider how well Collingwood tends to do when playing a team it is assessed as being much stronger than, a little stronger than, about as capable as, and so on.
Read More

The Greatest Upset Victories in VFL/AFL History (1897-2012)

The Suns' victory over the Saints in the first round of the 2013 season was heralded as an "upset win" for the Suns and one of the greatest in their short history. Undoubtedly their win was unexpected, but even the bookmakers rated the Suns as only $3.75 chances, so it was hardly a Halley's comet-like occurrence.
Read More

Team MARS Ratings Performance By Decade and Overall: 1897 to 2012

In the previous blog on the topic of all-time MARS Ratings I explained the process I used to derive team Ratings across history and then identified those teams that had achieved the highest (Essendon) and lowest (Fitzroy) MARS Ratings ever. We know then which teams have burned brightest - and which flickered dimmest - across VFL/AFL history. In this blog I want to explore more extended bursts of talent or apparent lack of it.
Read More

Are the Victory Margins for Some Games Harder to Predict than for Others?

It's unarguable that the winner of some games will be harder to predict than the winner of others. When genuine equal-favourites meet, for example, you've only a 50:50 chance of picking the winner, but you can give yourself a 90% chances of being right when a team with a 90% probability of victory meets a team with only a 10% chance. The nearer to equal-favouritism the two teams are, the more difficult the winner is to predict, and the further away we are from this situation the easier the game is to predict.
Read More

Clustering Your Way To Line Betting Success : Building a Predictive Model

In the previous blog I used a clustering algorithm - Partitioning Around Medoids (PAM) as it happens - to group games that were similar in terms of pre-game TAB Bookmaker odds, the teams' MARS Ratings, and whether or not the game was an Interstate clash. There it turned out that, even though I'd clustered using only pre-game data, the resulting clusters were highly differentiated with respect to the line betting success rates of the Home teams in each cluster.
Read More

Clustering Your Way To Line Betting Success

For today's blog I'll be creating a game clustering that uses as input only the information that we might reasonably know pre-game - for example, the pre-game team MARS Ratings, Bookmaker prices (or some metric derived from them), and information about the game venue.
Read More

Incorporating the Draw in Overround Calculations

At university, studying undergraduate Economics - which, granted, was a while ago - I particularly disliked theories premised on simplifying assumptions, which were introduced with an implicit promise, rarely honoured, to relax these assumptions later and nudge the theory a little closer to observed reality.
Read More

Building Simple Margin Predictors

Having a new - and, it seems, generally superior - way to calculate Bookmaker Implicit Probabilities is like having a new toy to play with. Most recently I've been using it to create a family of simple Margin Predictors, each optimised in a different way.
Read More

Using Risk-Equalising Probabilities for the Margin Predictors

With the exception of Combo_NN_2, all of the Margin Predictors rely on an algorithm that takes Bookmaker Implicit Probabilities as an input in some form: 

  • Bookie_3 and Bookie_9 use Bookmaker Implicit Probabilities directly
  • ProPred_3 and ProPred_7 use the outputs of the ProPred algorithm, which uses a log transform of Bookmaker Implicit Probabilities as one input
  • WinPred_3 and WinPred_7 use the outputs of the WinPred algorithm, which also uses a log transform of Bookmaker Implicit Probabilities as one input
  • H2H_U3, H2H_U10, H2H_A3 and H2H_A7 use the outputs of the Head-to-Head algorithm, which uses Bookmaker Implicit Probabilities as one input
  • Combo_7 uses Bookmaker Implicit Probabilities directly as well as via its use of the outputs of the Head-to-Head Algorithm
  • Combo_NN_2 uses Bookmaker Implicit Probabilities directly as well as via its use of the outputs of the ProPred, WinPred and H2H algorithms

For this short blog I've switched, in all of the underlying algorithms, the Implicit Probabilities calculated using the Risk-Equalising Approach as replacements for those calculated using the Overround-Equalising Approach and then compared the resulting MAPEs for seasons 2007 to 2012 for all the Margin Predictors.

Overall, all Margin Predictors except Bookie_3 benefit from the switch, however modestly. Bookie_9, which now will serve as a co-predictor in the MAFL Margin Fund, benefits most, knocking over one quarter of a point per game off its MAPE.

The uniformity of these improvements is made slightly more remarkable by the realisation that the Margin Predictors, built using Eureqa, were optimised for the probability outputs of the underlying algorithms when those algorithms were using Overround-Equalising Implicit Probabilities. So, for example, the equation for Bookie_9, which is:

Predicted Home Team Margin = 2.2205129 + 17.729506 * ln(Home Team Bookmaker Probability/(1-Home Team Bookmaker Probability)) + 2*Home Team Bookmaker Probability

was created by Eureqa to minimise the historical MAPE of this equation when the Home Team Bookmaker Probabilities being used were those calculated assuming Overround-Equalisation. The 0.26 points per game reduction in the MAPE is being achieved without re-optimising this equation but, instead, simply by replacing the Home Team Probabilities with those calculated using a Risk-Equalising Approach.

Bookie_3 is the one Margin Predictor that responds poorly to the switch of probabilities without an accompanying re-optimisation in Eureqa. When I performed such a re-optimisation, Eureqa came up with this remarkably simple equation:

Predicted Home Team Margin = 21 * ln(Home Team Bookmaker Probability/(1-Home Team Bookmaker Probability))

This predictor has an MAPE of 29.22 points per game, which is extraordinarily low for such an easy-to-use predictor.

CONCLUSION

Virtually every algorithm used in MAFL has now been shown to benefit, however slightly, from using Implicit Probabilities calculated using the Risk-Equalising instead of the Overround-Equalising Approach. Naturallly, this makes me wonder if there's an even better way ...

Maybe next year I'll look for it.

Bookmaker Implicit Probabilities: Empirical Value of the Risk-Equalising Approach

A few blogs back I developed the idea that bookmakers might embed overround in each team's price not equally but instead such that the resulting head-to-head market prices provide insurance for a fixed (in percentage point terms) calibration error of equivalent size for both teams. Since then I've made only passing comment about the empirical superiority of this approach (which I've called the Risk-Equalising Approach) relative to the previous approach (which I've called the Overround-Equalising Approach).
Read More

Determining Bookmaker Implicit Probabilities: The Risk-Equalising Approach

In the previous blog I developed a new way of divining a bookmaker's probability assessments of the two teams by assuming that he believes his maximum calibration error - the (negative) difference between his probability assessment for a team and its true probability of victory - is the same for each team in percentage point terms, and that he levies overround on each team's price so as to ensure that it will still deliver an expected profit even if his probability assessment is maximally in error.
Read More

Measuring Bookmaker Calibration Errors

We've found ample evidence in the past to assert that the TAB Bookmaker is well-calibrated, by which I mean that teams he rates as 40% chances tend to win about 40% of the time, teams he rates as 90% chances tend to win about 90% of the time and, more generally, that teams he rates as X% chances tend to win about X% of the time.
Read More