StefanW
Registered User
As you all know, Eugene Melnyk has been making the claim that you do not have to spend cash like a drnuken sailor in order to win. The counter argument is that teams at the top of the spending scale are the ones who win more often. So who is right?
I'm not one to just take someone's word for it on important questions like this, so I decided to statistically test the relationship between spending in relation to other teams and winning. To do this, I compiled imformation about spending and success for each of the 30 teams over a 6 season period (07-08 to 12-13), which gave me 180 team cases (6 seasons X 30 teams) to work with. If you are not into stats you may want to stop reading now and just skip to the tables. For those who stick it out, I will try to keep things at a basic level so those who are only casually interested, and how little or no background in numbers, can still easily follow.
The variables I used were:
Season (08-09, 09-10, etc)
Team (Sens, Pens, Sharks, etc)
Spending (in relation to other teams, so the top spending team is coded 1, second is coded 2, etc)
Spending category (rank order spending divided into three groups: top third, middle third, and bottom third teams)
Result (0=missed playoffs, 1=out in first round, 2=out in second round, 3=out in third round, 4=lost cup final, 5=won the cup)
After checking to make sure the data is clean and good to go, the first thing I check the correlation between spending and result. Correlation basically means that when one of these variables changes, the other variable changes as well. The result was a Pearson correlation of .37, which means that when spending goes up the result moderately goes up. From a stats point of view this corrrelation is ok but nothing to write home about. It is important to note that correlaiton is not the same thing as causation. To use a non-hockey example, people getting into car accidents is strongly correlated with people shovelling snow. This does not mean shovelling snow causes car accidents.
The next thing I did was check how spending category relates to result. The results can be seen in the following table, which shows how often top third (first through 10th in spending), middle third (11th through 20th), and bottom third (21st through 30th) spending teams have had different level of success:
Result | Top Third | Middle Third | Bottom third
Missed Playoffs | 13 | 34 | 37
Out 1st Rnd | 20 | 13 | 15
Out 2nd Rnd | 12 | 8 | 4
Out 3rd Rnd | 8 | 2 | 2
Lost in Final | 4 | 1 | 1
Won the Cup | 3 | 2 | 1
When you eyeball these numbers it appears that teams at the top of spending are less likely to miss the playoffs and more likely to make it deep into the playoffs. However, in stats eyeballing something is not enough. What we do is figure out how likely the set of results charted in the above table is to occur completely by chance. To do this I ran a test called an analysis of variance (ANOVA for short). This set of results is highly unlikely to occur completely by chance, and are thus what we refer to as "statistically significant." For people who are into stats the results are F(29,150)=1.90, p<0.001. For those that are not, sorry if I made your eyes burn there.
The trouble with ANOVA with three groups is that the test only tells you if the overall model is significant. In other words, there is no real detail. E.g. top spending teams may be more likely to succeed than bottom spending teams, but there may be no statistical difference between top and middle spending teams. To drill down and compare results of the the three categories of teams I used what is referred to as a post hoc analysis. People who are not into stats can skip the rest of this paragraph because it is not important to you. For those who are into stats, I selected a Games-Howell post hoc test due to the unequal variances in the three groups.
The following table summarizes the comparison between the three groups. If the "significance" value is below 0.01 the difference between the two groups is unlikely to be a fluke.
Cap Category| Compared with | Standard Error | Significance
Top Third | Middle Third | 0.240 | 0.002
- | Bottom Third | 0.226 | <0.001
Middle Third | Top Third | 0.240 | 0.002
- | Bottom Third | 0.209 | 0.656
Bottom Third | Top Third | 0.226 | <0.001
- | Middle Third | 0.209 | 0.656
Based on this table, the top third of spending teams have significant greater success than middle or bottom spending teams, but there is no statisically significant difference between results for middle third and bottom third teams. So Mr Melnyk is right, sort of. If the goal is to do well, you can be frugal and still meet the objective. If the goal is to go into the third round of playoff and beyond, and to eventually win the cup, you are significantly more likely to succeed if your spending is in the top third of the league.
Limitations:
1) I was only able to find good cap data for a 6 years stretch. I would like to expand this to include the entire cap era.
2) The limited sample size lead to a pretty high standard error. I'm not happy with that.
3) Limited sample size meant I had to collapse teams into three categories. I would have preferred six groupings for a more refined analysis.
4) Rank ordering teams in this way is not ideal because the gap, for example, between first and second place spending teams may be greater than the gap between the 29th and 30 th place spending teams. This leads to pretty wonky variances.
Future Possible Analysis:
1) If I input the actual team spendings I can test Melynk's "what is important is dollar per point" theory that he trumpets in interviews. I'd like to do that when I have some more time.
2) Once I collect more seasons of data I would like to look at trends, and see if the gap between rich and poor teams is widening, closing, or staying the same over time.
3) I'd love to do more of a team by team analysis if I can get enough seasons of data.
Thanks for having a look, and feel free to make comments, criticisms, and suggestions. I slapped this together fairly quickly, and accept (and even expect) that I may have made embarassing mistakes along the way.
I'm not one to just take someone's word for it on important questions like this, so I decided to statistically test the relationship between spending in relation to other teams and winning. To do this, I compiled imformation about spending and success for each of the 30 teams over a 6 season period (07-08 to 12-13), which gave me 180 team cases (6 seasons X 30 teams) to work with. If you are not into stats you may want to stop reading now and just skip to the tables. For those who stick it out, I will try to keep things at a basic level so those who are only casually interested, and how little or no background in numbers, can still easily follow.
The variables I used were:
Season (08-09, 09-10, etc)
Team (Sens, Pens, Sharks, etc)
Spending (in relation to other teams, so the top spending team is coded 1, second is coded 2, etc)
Spending category (rank order spending divided into three groups: top third, middle third, and bottom third teams)
Result (0=missed playoffs, 1=out in first round, 2=out in second round, 3=out in third round, 4=lost cup final, 5=won the cup)
After checking to make sure the data is clean and good to go, the first thing I check the correlation between spending and result. Correlation basically means that when one of these variables changes, the other variable changes as well. The result was a Pearson correlation of .37, which means that when spending goes up the result moderately goes up. From a stats point of view this corrrelation is ok but nothing to write home about. It is important to note that correlaiton is not the same thing as causation. To use a non-hockey example, people getting into car accidents is strongly correlated with people shovelling snow. This does not mean shovelling snow causes car accidents.
The next thing I did was check how spending category relates to result. The results can be seen in the following table, which shows how often top third (first through 10th in spending), middle third (11th through 20th), and bottom third (21st through 30th) spending teams have had different level of success:
Missed Playoffs | 13 | 34 | 37
Out 1st Rnd | 20 | 13 | 15
Out 2nd Rnd | 12 | 8 | 4
Out 3rd Rnd | 8 | 2 | 2
Lost in Final | 4 | 1 | 1
Won the Cup | 3 | 2 | 1
When you eyeball these numbers it appears that teams at the top of spending are less likely to miss the playoffs and more likely to make it deep into the playoffs. However, in stats eyeballing something is not enough. What we do is figure out how likely the set of results charted in the above table is to occur completely by chance. To do this I ran a test called an analysis of variance (ANOVA for short). This set of results is highly unlikely to occur completely by chance, and are thus what we refer to as "statistically significant." For people who are into stats the results are F(29,150)=1.90, p<0.001. For those that are not, sorry if I made your eyes burn there.
The trouble with ANOVA with three groups is that the test only tells you if the overall model is significant. In other words, there is no real detail. E.g. top spending teams may be more likely to succeed than bottom spending teams, but there may be no statistical difference between top and middle spending teams. To drill down and compare results of the the three categories of teams I used what is referred to as a post hoc analysis. People who are not into stats can skip the rest of this paragraph because it is not important to you. For those who are into stats, I selected a Games-Howell post hoc test due to the unequal variances in the three groups.
The following table summarizes the comparison between the three groups. If the "significance" value is below 0.01 the difference between the two groups is unlikely to be a fluke.
Top Third | Middle Third | 0.240 | 0.002
- | Bottom Third | 0.226 | <0.001
Middle Third | Top Third | 0.240 | 0.002
- | Bottom Third | 0.209 | 0.656
Bottom Third | Top Third | 0.226 | <0.001
- | Middle Third | 0.209 | 0.656
Based on this table, the top third of spending teams have significant greater success than middle or bottom spending teams, but there is no statisically significant difference between results for middle third and bottom third teams. So Mr Melnyk is right, sort of. If the goal is to do well, you can be frugal and still meet the objective. If the goal is to go into the third round of playoff and beyond, and to eventually win the cup, you are significantly more likely to succeed if your spending is in the top third of the league.
Limitations:
1) I was only able to find good cap data for a 6 years stretch. I would like to expand this to include the entire cap era.
2) The limited sample size lead to a pretty high standard error. I'm not happy with that.
3) Limited sample size meant I had to collapse teams into three categories. I would have preferred six groupings for a more refined analysis.
4) Rank ordering teams in this way is not ideal because the gap, for example, between first and second place spending teams may be greater than the gap between the 29th and 30 th place spending teams. This leads to pretty wonky variances.
Future Possible Analysis:
1) If I input the actual team spendings I can test Melynk's "what is important is dollar per point" theory that he trumpets in interviews. I'd like to do that when I have some more time.
2) Once I collect more seasons of data I would like to look at trends, and see if the gap between rich and poor teams is widening, closing, or staying the same over time.
3) I'd love to do more of a team by team analysis if I can get enough seasons of data.
Thanks for having a look, and feel free to make comments, criticisms, and suggestions. I slapped this together fairly quickly, and accept (and even expect) that I may have made embarassing mistakes along the way.
Last edited: