May 28, 2013

Really Simple Proves Remarkably Effective

May 28, 2013/ Tony Corke

The Really Simple Margin Predictors (RSMPs), which were purpose-built for season 2013, have shown themselves to be particularly accurate at forecasting game margins. So much so, in fact, that they're currently atop the MAFL Leaderboard, ahead of the more directly Bookmaker-derived Predictors like Bookie_3 that have excelled in previous years.

May 11, 2013

The Value of MARS Ratings Points Across Time Revisited

May 11, 2013/ Tony Corke

When last I considered the issue of the value of MARS Ratings across time I assessed their value in terms of a team's victory probability. Perhaps a more intuitive approach would be to, instead, value them in terms of a team's victory margin.

May 04, 2013

On The Randomness of Final AFL Scores

May 04, 2013/ Tony Corke

You'd be forgiven for assuming that the last digit of a team's final score in an AFL contest was essentially random - in other words that it was as likely to be a 0 as a 1, a 4 or a 9.

April 25, 2013

Measuring the Surprise in a Season's Results

April 25, 2013/ Tony Corke

In the previous blog we looked at the average level of surprisals generated by teams and by team pairings across all of VFL/AFL history and during the most-recent seasons. Today, as promised in that blog, I'm going to analyse surprisals using the same general methodology, but by season.

April 23, 2013

Which Teams Are The Surprisal Packets?

April 23, 2013/ Tony Corke

Anyone who's ever tipped sporting outcomes knows the frustration of the untippable team, the team that win when they're supposed to lose, and lose when they're supposed to win. With that idea in mind, which team's results, I wondered, have been the most difficult to predict across the history of the VFL/AFL.

April 19, 2013

The Value of MARS Ratings Points Across Time

April 19, 2013/ Tony Corke

As we look at MARS Ratings across the expanse of VFL/AFL history, one thing we might want to consider is whether the value of a Rating Point has remained stable across those 13 decades. I've assumed, implicitly, that this is the case in the interpretations I've made of recent analyses, for example about the game's greatest upsets, but there's no a priori reason for this to be the case.

April 11, 2013

Which Teams Fare Better as Favourites?

April 11, 2013/ Tony Corke

In this blog, the next is a series in which I've been exploring the all-time MARS Ratings I created for every game from the start of 1897 to the end of the 2012 season, I'll be looking at how well each team has performed depending on the relative strength of its opponent, as measured by their MARS Rating. So, for example, we'll consider how well Collingwood tends to do when playing a team it is assessed as being much stronger than, a little stronger than, about as capable as, and so on.

April 08, 2013

The Greatest Upset Victories in VFL/AFL History (1897-2012)

April 08, 2013/ Tony Corke

The Suns' victory over the Saints in the first round of the 2013 season was heralded as an "upset win" for the Suns and one of the greatest in their short history. Undoubtedly their win was unexpected, but even the bookmakers rated the Suns as only $3.75 chances, so it was hardly a Halley's comet-like occurrence.

April 06, 2013

Team MARS Ratings Performance By Decade and Overall: 1897 to 2012

April 06, 2013/ Tony Corke

In the previous blog on the topic of all-time MARS Ratings I explained the process I used to derive team Ratings across history and then identified those teams that had achieved the highest (Essendon) and lowest (Fitzroy) MARS Ratings ever. We know then which teams have burned brightest - and which flickered dimmest - across VFL/AFL history. In this blog I want to explore more extended bursts of talent or apparent lack of it.

April 02, 2013

And the Best and Worst VFL/AFL Teams of All-Time Are ...

April 02, 2013/ Tony Corke

I've promised for a while now to create MARS Ratings for the entire history of the VFL/AFL and with some time away from home over Easter, I'm now able to deliver on that promise.

March 01, 2013

Really Simple Margin Predictors

March 01, 2013/ Tony Corke

Sometimes it's interesting to see where an idea takes you. This blog started out as a wet weekend's musing: how predictive might a margin predictor be if it was based on a single variable?

February 24, 2013

Are the Victory Margins for Some Games Harder to Predict than for Others?

February 24, 2013/ Tony Corke

It's unarguable that the winner of some games will be harder to predict than the winner of others. When genuine equal-favourites meet, for example, you've only a 50:50 chance of picking the winner, but you can give yourself a 90% chances of being right when a team with a 90% probability of victory meets a team with only a 10% chance. The nearer to equal-favouritism the two teams are, the more difficult the winner is to predict, and the further away we are from this situation the easier the game is to predict.

February 21, 2013

Clustering Your Way To Line Betting Success : Building a Predictive Model

February 21, 2013/ Tony Corke

In the previous blog I used a clustering algorithm - Partitioning Around Medoids (PAM) as it happens - to group games that were similar in terms of pre-game TAB Bookmaker odds, the teams' MARS Ratings, and whether or not the game was an Interstate clash. There it turned out that, even though I'd clustered using only pre-game data, the resulting clusters were highly differentiated with respect to the line betting success rates of the Home teams in each cluster.

February 17, 2013

Clustering Your Way To Line Betting Success

February 17, 2013/ Tony Corke

For today's blog I'll be creating a game clustering that uses as input only the information that we might reasonably know pre-game - for example, the pre-game team MARS Ratings, Bookmaker prices (or some metric derived from them), and information about the game venue.

February 16, 2013

Incorporating the Draw in Overround Calculations

February 16, 2013/ Tony Corke

At university, studying undergraduate Economics - which, granted, was a while ago - I particularly disliked theories premised on simplifying assumptions, which were introduced with an implicit promise, rarely honoured, to relax these assumptions later and nudge the theory a little closer to observed reality.

February 14, 2013

The Probability of a Draw

February 14, 2013/ Tony Corke

Lately it seems I've been specialising in blogs on topics that I've covered before, and tonight's blog is no exception. It's on estimating the probability of a draw.

February 10, 2013

One Margin Predictor To Rule Them All

February 10, 2013/ Tony Corke

In the previous blog I investigated a number of additional approaches to determining the Bookmaker's Implicit Probability - and, by mathematical implication, his embedded overround - for each team based on observed head-to-head market prices.

February 06, 2013

Yet Another Look at Bookmaker Overround

February 06, 2013/ Tony Corke

Lately I've been pondering the challenge of determining how much overround the TAB Bookmaker has embedded in the head-to-head prices of each team in an AFL contest.

February 02, 2013

Building Simple Margin Predictors

February 02, 2013/ Tony Corke

Having a new - and, it seems, generally superior - way to calculate Bookmaker Implicit Probabilities is like having a new toy to play with. Most recently I've been using it to create a family of simple Margin Predictors, each optimised in a different way.

January 27, 2013

Using Risk-Equalising Probabilities for the Margin Predictors

January 27, 2013/ Tony Corke

With the exception of Combo_NN_2, all of the Margin Predictors rely on an algorithm that takes Bookmaker Implicit Probabilities as an input in some form:

Bookie_3 and Bookie_9 use Bookmaker Implicit Probabilities directly
ProPred_3 and ProPred_7 use the outputs of the ProPred algorithm, which uses a log transform of Bookmaker Implicit Probabilities as one input
WinPred_3 and WinPred_7 use the outputs of the WinPred algorithm, which also uses a log transform of Bookmaker Implicit Probabilities as one input
H2H_U3, H2H_U10, H2H_A3 and H2H_A7 use the outputs of the Head-to-Head algorithm, which uses Bookmaker Implicit Probabilities as one input
Combo_7 uses Bookmaker Implicit Probabilities directly as well as via its use of the outputs of the Head-to-Head Algorithm
Combo_NN_2 uses Bookmaker Implicit Probabilities directly as well as via its use of the outputs of the ProPred, WinPred and H2H algorithms

For this short blog I've switched, in all of the underlying algorithms, the Implicit Probabilities calculated using the Risk-Equalising Approach as replacements for those calculated using the Overround-Equalising Approach and then compared the resulting MAPEs for seasons 2007 to 2012 for all the Margin Predictors.

Overall, all Margin Predictors except Bookie_3 benefit from the switch, however modestly. Bookie_9, which now will serve as a co-predictor in the MAFL Margin Fund, benefits most, knocking over one quarter of a point per game off its MAPE.

The uniformity of these improvements is made slightly more remarkable by the realisation that the Margin Predictors, built using Eureqa, were optimised for the probability outputs of the underlying algorithms when those algorithms were using Overround-Equalising Implicit Probabilities. So, for example, the equation for Bookie_9, which is:

Predicted Home Team Margin = 2.2205129 + 17.729506 * ln(Home Team Bookmaker Probability/(1-Home Team Bookmaker Probability)) + 2*Home Team Bookmaker Probability

was created by Eureqa to minimise the historical MAPE of this equation when the Home Team Bookmaker Probabilities being used were those calculated assuming Overround-Equalisation. The 0.26 points per game reduction in the MAPE is being achieved without re-optimising this equation but, instead, simply by replacing the Home Team Probabilities with those calculated using a Risk-Equalising Approach.

Bookie_3 is the one Margin Predictor that responds poorly to the switch of probabilities without an accompanying re-optimisation in Eureqa. When I performed such a re-optimisation, Eureqa came up with this remarkably simple equation:

Predicted Home Team Margin = 21 * ln(Home Team Bookmaker Probability/(1-Home Team Bookmaker Probability))

This predictor has an MAPE of 29.22 points per game, which is extraordinarily low for such an easy-to-use predictor.

CONCLUSION

Virtually every algorithm used in MAFL has now been shown to benefit, however slightly, from using Implicit Probabilities calculated using the Risk-Equalising instead of the Overround-Equalising Approach. Naturallly, this makes me wonder if there's an even better way ...

Maybe next year I'll look for it.

Statistical Analyses