Wednesday, 15 May 2019

Non Shot Passing Profile for Liverpool 2018/19

Over the season, we've slowly introduced a non shot xG model in this blog.

We assign the likelihood that a goal will be scored (or conceded) by a team in possession at any location on the field.

Successfully advancing or turning the ball over at another position on the pitch changes the non shot xG for the possession and the difference between the two points can be used to quantify the on field action.

This framework can be used however the ball is moved, but an obvious single application is to evaluate passing and the resulting risk reward.

The approach sidesteps the need for a shot to be attempted to assign a value to an action, differentiates between safe passing with little purpose and includes a huge chunk of data that was previously ignored.

You can generally differentiate between two types of passing actions, one that advances the ball into a more dangerous position and one that moves the ball backwards to recycle a move.

These can obviously be further divided into successful and unsuccessful actions.

Therefore, at its broadest we can identify a player's non shot passing contribution into value added and lost by successful or unsuccessful attempts to progressively move the ball into a more dangerous area.And similarly, NS xG "lost" by a successful backward pass, where possession is maintained and potentially more harmfully, NS xG actually lost when unsuccessfully passing the ball towards one's own goal.

If we incorporate minutes played and overall team style, we may begin to identify important contributors and ways that a side attempts to move the ball around the field.

Here's Liverpool's Premier League season from 2018/19.

I've highlighted NSxG gained & lost from forward passes & that "lost" by successfully recycling the ball away from the opponent's goal.

The passing performance of the player's broadly splits into 4 separate categories.

Keita & Henderson take a back seat to the players in groups 2 & 4 when creating dangerous completed passes, but do frequently recycle the ball backwards.

Henderson has contributed 5% of the NS xG gained by Liverpool from a forward pass & accounted for 8% of the recycled, backward NS xG.

Group 2 are most active creatively, but do turn the ball over a lot. Although, that inevitably comes with the territory in which they operate and so you assume the two columns are an acceptable trade off.

Someone has to be entrusted with turning a good situation into a great one, even at the cost of losing the ball to an opponent.

Group 3 accumulate the lowest amount of improvement in NS xG, presumably by beginning moves from relatively deep areas and VvD aside, being relatively unadventurous.

The final group 4 are also fairly creative, operating in areas where even a short, completed pass can have a relatively large effect on NS xG and again the trade off is that often a large chunk of NS xG with which they have been entrusted can be quickly lost.

This group also retains possession, but cedes NS xG through laying the ball back from advanced areas of the field.

We might assume that these figures are the benchmark requirement for each position or group in the current Klopp side.

Wednesday, 6 March 2019

Title Winners Aren't Becoming More Dominant Over Time.

Are the title winning teams in the Premier League getting more dominant because they're getting so much richer?

It seems a logical conclusion to draw given that Manchester City won the league with an unprecedented 100 points in 2017/18.

That obviously makes them the highest points per game team in 20 team Premier League history, but without context, such figures are largely meaningless.

Taking the points per game high point as a selective cutoff point is invariably going to furnish any number of apparently positive trendlines, but without taking a deeper look at how the league as a whole has evolved over a period of time, they too are context-less trivia.

The first 20 team Premier League season in 1995/96 had 98 draws, by 2017/18 the number had 99. But singular seasons may hide an upward or downward trend and this appears to be the case with drawn matches and by extension the total points that were won in a whole season.

The 1990's averaged 104 draws per season compared to just 92 for the comparable number of most recent Premier League campaigns.

Here's what this means for the average number of points won by sides in each Premier League season since 1995/96.

There has been a steady upward trend for the average number of points won by all Premier League teams since the beginning of the 20 team era, as draws have tended to decrease, therefore reducing the number of matches where just two points are won compared to those where three are gained.

So are the top teams taking a bigger share of this expanded points pot, which may indicate that they are being more dominant that their predecessors were.

One way to look at this context corrected view is to see how remote the representative of each finishing position has become from the average points won by a side in a particular season.

Manchester City in 2017/18 were 2.5 standard deviations above the league average points won that season. But it's a level of dominance that was very similar to that attained by Chelsea in 2004/05, Arsenal in 2003/04 and Manchester United in 1999/2000.

Here's the plot of how far from the average points all 20 finishing positions have been since 1995/96.

OK, it's messy. But it's fairly easy to see that the title winners aren't powering upwards in a ever improving arc. In fact it pretty much flatline's and might even be encouraged to dip downwards if we wanted to be "creative".

Here's an easier on the eye trendline for each final position.

Once you add the context of the points gathering environment over time, Man City 2017/18 are just a bump in the road and not part of a general trend. None of the top three finishing positions have shown to have improved their dominance over the rest of the league.

There's been a slight uptick for 4th to 7th placed sides, a down tick for 7th to 12th. Then everyone holds station, until the two worst teams become slightly more competitive over time, but still go down.

Thursday, 21 February 2019

The Name Game.

Sports analytics, not just football (or soccer) has always had a problem when naming their metrics (see what I mean).

Corsi, TSR, Pythagorean and expected goals may work fine in a closed environment, but try sticking those terms into the mainstream and you're immediately on the back foot.

Jeff Stelling's rant wouldn't have been half as effective if he'd had to say "Chance quality, what's that!"

Anyway, we've already embarked on a second phase of attaching names to a brand new raft of models and performance indicators, except this time everyone's going to be scratching their heads about what it is that we're actually talking about.

Anyone who's ever posted an xG figure will be familiar with the "X get Y for their xG, why the difference" but the rise of the NS xG model will take that to new heights.

Shot based xG models (actually shots, headers and other body parts) all share a core set of inputs (location, type) and any additions simply move the dial slightly, but the steady onset of so call "Non Shot xG" models may lead to comparisons between models that bear very little relationship to one another.

538 has a NS xG model, defined thus,.

Non-shot expected goals is an estimate of how many goals a team could have scored given their nonshooting actions in and around their opponent’s penalty area.

Infogol has a NS xG model, but ours is based on the expected outcome of possession chains.

They currently share a name, but nothing else.

In an increasingly monetized situation it is understandable that some are reluctant or unable to share detailed descriptions of each model's makeup.

But, even if we can't avoid falling into the trap of using less than intuitive language to name commonly used metrics (as happened with xG), we perhaps should steer clear of using catch all terms, such as NSxG to describe future modelling efforts.

538's model appears to be event based, ours is possession based, so it's probably best to include this additional piece of information when presenting any NSxG models in the future.