Scoring Chances: Part I of Many
For those that don't know, Dennis King tracked scoring chances for the Oilers this past season and posted his results at mc79hockey.com after the games. This was outstanding work, and myself and many others will surely be using this information for weeks to come.
Scott at Gospel of Hockey compiled this data on a few occasions, I'm using his data from mid season and the final sums in this post. This link will generate the NHL.com player stats for the games in question, automatically skipping over the handful of games that Dennis missed. As usual, change the values in the URL to look at different segments of the season.
So to pick a big apple from the bottom of the tree, I'm taking a look at the effect of players on the quality of the scoring chance on the Oiler goal. Dennis used a binary system, meaning he either marked down an opportunity as a scoring chance or not at all, their was no grading of the quality of the scoring chance. This is the way that every team tracks them, as far as I know, and I think that this is wise.
So if presume that Dennis was fair and consistent, and the players on the ice were not having any impact at all on the quality of scoring chance against, then we can model what we expect to see, which is:
For the season as a whole the results are shown below, click to enlarge:
By way of example, for every 100 EV scoring chances that the opposition had when Smid was on the ice, 12 goals were scored. He was actually fantastically lucky in the first half of the season with GA rate of only 4 per 100 chances, and hit a stretch of snakeyes in the back half of the season resulting in that rate quadrupling. Get enough buggers rolling the dice and that sort of thing is bound to happen to one of them. If we divided up the games by odd and even game numbers, the similar thing will be there for someone else, though it's impossible to say who it will be, because randomness is random.
As you can see, the players are grouped tightly together. And the variation between them is small and accounted for entirely by expected chance variation.
I could have posted 9 more charts as generated by the random model, and they look identical in form, though obvious some players have better luck some simulated seasons than others. And if I had done that, I expect that about 1 in 10 people would have been able to pick out the real chart from the random clones.
And this statistic doesn't repeat at all, because it is likely both honestly measured and it is almost entirely luck, or near enough pure luck that it would be extremely difficult to hear the tiny skill component squeaking through the noise.
Now if I was playing defense for the Oilers last season this wouldn't be the case, the quality of the scoring chances would be through the roof, and it would skew this whole picture. But I didn't play for the Oilers, that was done by practised and trained professional hockey players.
As another check for homerism and bias, we expect the EV scoring chances per 100 shots-direct-at-net (Corsi+) to be similar both for and against. And it is:
35.6 For.
35.7 Against.
Damn.
So we should be good to go. This seems like a reasonable starting point, a check to make sure that the world is still round. Though it may well seem obvious to some and unbelievable to others.
And a couple of random thoughts to tag onto the end of this rambling post, based on my sense of it after kicking at this stuff a bit over the past couple of days:
Scott at Gospel of Hockey compiled this data on a few occasions, I'm using his data from mid season and the final sums in this post. This link will generate the NHL.com player stats for the games in question, automatically skipping over the handful of games that Dennis missed. As usual, change the values in the URL to look at different segments of the season.
So to pick a big apple from the bottom of the tree, I'm taking a look at the effect of players on the quality of the scoring chance on the Oiler goal. Dennis used a binary system, meaning he either marked down an opportunity as a scoring chance or not at all, their was no grading of the quality of the scoring chance. This is the way that every team tracks them, as far as I know, and I think that this is wise.
So if presume that Dennis was fair and consistent, and the players on the ice were not having any impact at all on the quality of scoring chance against, then we can model what we expect to see, which is:
- Over the first half of the season, using the top 20 players by ice time (Stortini being the cutoff point), the average will be 11.9 goals against per 100 scoring chances against. And we should see a dispersion of results (measured here with sample standard deviation) of about 3.1
- We in fact see a a standard deviation of 2.9. So, check.
- Over the second half of the season, the average will be 12.2 goals against per 100 scoring chances against. And we should see a dispersion of results (measured here with sample standard deviation) of about 2.8
- We in fact see a a standard deviation of 3.2. Check.
- Over the season as a whole, the average will be 12.1 goals against per 100 scoring chances against. And we should see a dispersion of results (measured here with sample standard deviation) of about 2.0
- We in fact see a a standard deviation of 1.5. Check.
- The rates of 'goals against per scoring chance against' for the players, this should not repeat from the first half of the season to the next. As a convenient measure of this I used Pearson Correlation.
- We in fact see small negative relationship for the results from the front half to the back half. So no repeatability at all. Check.
For the season as a whole the results are shown below, click to enlarge:
By way of example, for every 100 EV scoring chances that the opposition had when Smid was on the ice, 12 goals were scored. He was actually fantastically lucky in the first half of the season with GA rate of only 4 per 100 chances, and hit a stretch of snakeyes in the back half of the season resulting in that rate quadrupling. Get enough buggers rolling the dice and that sort of thing is bound to happen to one of them. If we divided up the games by odd and even game numbers, the similar thing will be there for someone else, though it's impossible to say who it will be, because randomness is random.As you can see, the players are grouped tightly together. And the variation between them is small and accounted for entirely by expected chance variation.
I could have posted 9 more charts as generated by the random model, and they look identical in form, though obvious some players have better luck some simulated seasons than others. And if I had done that, I expect that about 1 in 10 people would have been able to pick out the real chart from the random clones.
And this statistic doesn't repeat at all, because it is likely both honestly measured and it is almost entirely luck, or near enough pure luck that it would be extremely difficult to hear the tiny skill component squeaking through the noise.
Now if I was playing defense for the Oilers last season this wouldn't be the case, the quality of the scoring chances would be through the roof, and it would skew this whole picture. But I didn't play for the Oilers, that was done by practised and trained professional hockey players.
As another check for homerism and bias, we expect the EV scoring chances per 100 shots-direct-at-net (Corsi+) to be similar both for and against. And it is:
35.6 For.
35.7 Against.
Damn.
So we should be good to go. This seems like a reasonable starting point, a check to make sure that the world is still round. Though it may well seem obvious to some and unbelievable to others.
And a couple of random thoughts to tag onto the end of this rambling post, based on my sense of it after kicking at this stuff a bit over the past couple of days:
- Mike Babcock was right, possession is everything.
- It's probably fairer to use 'shots direct at net while on the ice', instead of time, as a leveling tool, especially when comparing players on different teams. This for any even strength statistic.


