Friday, 26 August 2016

48 Games into the Championship.

The Championship may be only four match days old, but granular data on the state of the teams is beginning to pile up.

Over 1,000 goal attempts have been made, 300 plus of which required the keeper to try to at least make a save and Shane Duffy has already scored three league goals, although none for his actual employers.

Prediction is a constant balancing act between using recent data and larger samples that inevitably contain information from previous seasons, when a side may have had a very different lineup.

Huddersfield currently sit top of the Championship, while Newcastle, the short priced preseason favourites are closer to the relegation zone, from a points perspective than they are to the top of the pile.

The betting markets do not expect this situation to remain and Newcastle still head the market and the current leaders are given around a 4% chance of remaining in their current elevated position.

Fans of Huddersfield will no doubt relish their current position and perhaps dream that they are deserved pacesetters at this early stage, much as Crystal Palace. Swansea, Leicester supporters did in the early 2015/16 Premier league.

So is there any useful information to be gained from a sample size of just four games?

Many will be familiar with the idea that individual matches are rife with luck and looking at the process of chance creation, rather than just the relatively infrequent outcomes can be more predictive.

Huddersfield currently has a goal difference of +3, the smallest possible differential when acquiring 10 points from four matches and they have won each of their three victories by the margin of a single goal.

They've taken just slightly more attempts than they've faced and expected goals, based on shot type and position suggest that they might score, on average 4.5 goals and allow 5.5 from such chances.

They have a negative expected goal difference after four matches, that is only the 16th best record this season.

Small numbers of matches can also have very different strengths of schedules for different sides and Huddersfield has played a reasonably taxing first four games against relegated teams, Villa and Newcastle, along with Barnsley and Brentford.

Using interlocking collateral form of all 24 sides and their expected goal differential from Opta sourced data, the solutions that describe the events of the 48 games to date, place Huddersfield as the 12th best team in terms of strength of schedule corrected expected goals.

Newcastle are second under this approach, behind only Brighton.

All three promoted teams are comfortable inside the top 10, along with the likes of Wolves, Fulham, Derby, QPR and perhaps surprisingly, Reading.

Blackburn prop up whichever approach you use, with Nottingham Forest and Birmingham enjoying more elevated league positions than their shooting and schedule perhaps merits.

It's early days for the 24 team league and Huddersfield fans should perhaps screen capture for posterity this early incarnation.

Wednesday, 10 August 2016

The Premier League Goalkeeping Class of 2016/17.

Goal scorers, followed by goal keepers have been the most widely analysed positions in football.

The reasons are obvious, their main duties are closely connected. Strikers try the get the ball on target and into the net, whilst keepers do their best to prevent the latter.

In short, there is a readily identifiable series of actions and possible outcomes that can be used to attempt to define the abilities of the two positions.

Initially keepers were ranked simply by save percentage, the proportion of on target attempts that they prevented from turning into goals.

We should expect some variation in save percentages, even if every attempt carried exactly the same difficulty tariff and each keeper had the same talent for making saves.

If you toss a series of fair coins in a varied number of trials, the success rates of heads will largely fall in and around 50%, but some coins will appear more talented than others, just through chance.

Once you allow for this natural variation and  unequal number of attempts faced by the individual keepers, the save percentages of Premier League keepers is still more widely dispersed than though mere chance.

We can conclude that either, some keepers are better at saving shots than others, not all shots are savable to the same degree or much more likely a combination of at least these two factors.

Expected goals models, which use shot location, type, style and placement may be used in an attempt to quantify the task faced by keepers for each individual on target attempt.

A weakly struck chip that drifts gently towards the centre of the goal at midriff height is eminently more savable than a powerfully hit shot that deflects towards the top corner and historical precedence can be used to assign such efforts differing likelihoods of being saved.

This type of analysis quickly yields keepers who have allowed fewer goals than the average keeper described by such models would conceded from the attempts faced.

For example, in 2015/16 Lukasz Fabianski allowed 44 non penalty and non own goals from 157 goal bound attempts compared to an expected number conceded of nearly 50. Similarly, Kasper Schmeichel allowed 32 from 128 attempts against a par score of just over 35.

Both are over performers, but if we look more deeply at each attempt by simulating the range of outcomes using an average stand in keeper, the Swansea stopper appears to have put in a more solidly impressive performance.

An average keeper would emulate or better Fabianski's 2015/16 shot stopping performance around 9% of the time, whereas he would replicate or better Schmeichel's above par season in over a quarter of the simulated trials.

Over performing shot stopping, therefore is an encouraging sign, but by no means a clear indication of consistent, above average talent that may persist. It may just be par ability boosted by luck.

The table above shows the keepers who played in both of the last two seasons and how many times they saved more shots than predicted by an average ability expected goals model.

Ten keepers were above average in both seasons, whereas eight were below par in consecutive campaigns.

Stoke and England's Jack Butland combines youth with two season's of over performance in terms of goals allowed compared to the goal bound efforts he has been required to save.

However, just as Schmeichel's 2015/16 season has a 25% chance of being replicated by a lucky, average keeper, the same is true, only more so for both of Butland's seasons.

His impressive over-performance in 2014/15 from relatively few attempts faced and his smaller over achieving 2015/16 season from a much larger sample size was in both cases replicated or bettered in just under 50% of trials.

Depending upon where we draw this probabilistic line, many of the keepers who have had two most recent above par performing seasons against an expected goals model begin to fall away.

de Gea's most recent season is reproduced in nearly 30% of average trials. Similar reservations apply to Pantilimon, Robles, Forster, Ospina and more marginally Cech.

Only three keepers have over performed in the previous two seasons, with location/style/placement corrected expected goals campaigns that each have a likelihood of 10% or less of being replicated by our average keeper.  

Fabianski has the most impressive combination of dual over performance that is least likely to be emulated by an average performer, followed by Lloris and Adrian.

Conclusions should always be couched in probabilistic terms.

Butland and Forster, the two pretenders to Hart's England shirt may well be above average shot stoppers, but current evidence allows for the not insignificant possibility that both are reasonably capable keepers, who have enjoyed a run of good fortune.

And both may be a step or two behind possibly the best combination of likely longevity and current ability shown in the Premier League by Spurs' Hugo Lloris.

Friday, 5 August 2016

Ross McCormack, a £12 Million Gamble.

Aston Villa's descent into the Championship was one of the few certainties of the 2015/16 Premier League season.

Four points from a final possible 45, each gained against fellow relegated teams, mirrored Derby's meek, second half of the season collapse nine years earlier.

Villa's problems were extensive and wide ranging, but you had to return to Derby's debacle before you came across a relegated side who scored fewer goals than Villa did last term.

Relegated teams on average improve their goal scoring in the Championship, but even the most optimistic of projections would still likely leave Villa as one of the weakest Premiership attacks undertaking Championship duties.

Therefore, on a superficial level their acquisition of Ross McCormack from Fulham who scored 19 non penalty goals in just over 4,000 minutes of play, appears a sensible move.

However, McCormack turns 30 two weeks into the new season, typically an age when outfield attacking players have begun an aged related decline in output. A £12 million price tag also appears excessive.

Unless Villa are rewarded with an immediate or near immediate return to the top flight, they will be left with a rapidly depreciating asset.

McCormack has spent the majority of his time in England playing at Championship level, initially with Cardiff, then Leeds and latterly Fulham, without tempting a Premier league suitor, even in his prime.

Further alarm bells may ring when we look at the expected goals, based on shot location and type.

During McCormack's two most recent seasons at Fulham he has maintained an impressive volume of goal attempts.

He took slightly fewer shots per 90 in 2015/16, but from slightly better positions and once time played was factored in he was involved in trying to finish chances that were worth 0.25 expected goals per 90 in 2014/15 and 0.28 as season later.

However, his actual total non penalty goals scored rose from 13 to 19 a year later.

An over performance whereby 13 goals are scored from a cumulative expected goals total of 9.9 shouldn't surprise, an average player would achieve this through random chance around 20% of the time.

But 2015/16's efforts where 19 goals are scored compared to an expected 12, which no doubt contributed greatly to his price tag and sparked a bidding war between the relegated Premier League sides, is more difficult to dismiss as mere random fluctuation.

An average player, given McCormack's 2015/16 opportunities would score 19 or more goals just 3% of the time. So have Villa bought that rare commodity, a lethal finisher?

If we first imagine each of the 24 Championship teams has a striker who could have a small chance of over performing to the levels seen in McCormack's figures during a season.

Such an event that may have just a 3% chance of occurring for an individual will be more likely if we examine a larger group of players.

In short, if you had 24 players attempting McCormack's chances each season, you would expect at least one to produce his inflated return of 19 compared to the likely average of 12 goals around every other season, simply through chance.

Villa may hope they have bought a player who was capable of scoring 19 non penalty goals for Fulham last season, but erring on the side of caution, it may be better they assume they have bough a striker who is more likely a 12 NP goal a season purchase.

The good news for Villa fans is that McCormack was also a frequently involved creative influence in supplying chances for his teammates, He setup nine such goals in each of his seasons at Fulham.

On these occasions there were no major disconnects between expectation and reality. In both seasons you would expect an average player to score around 11 goals from the chances McCormack created.

Notwithstanding the possible difference in the quality of teammates in 2016/17, Villa's new buy will be unlikely to over perform to such heights in converting his West Midlands chances, but fans will hope he provides an all round contribution that goes some way to justifying the risk/reward from a £12 million outlay.