Whileee
Registered User
- May 29, 2010
- 46,075
- 33,132
What? No one is doing what you are saying they are doing though. I am not suggesting what you think I'm suggesting.
This isn't something knowledge in advanced statistical analysis is needed.
Some players will peak prior to the average peak. Some will peak after. Most will peak closer than further.... sorta like a skewed normal curve.
No one knows for certain where either of these players will peak. No one can assume where either of these players peak. One can point out where it is historically most likely.
This is less a stats thing than a logic thing.
What we know:
Barkov's results at the same age has been better for every age we can compare.
Scheifele's results heavily accelerated this last season, which could be growth, luck/variance, or combination thereof.
Most players peak around 24-26, but not all do.
That's it.
It's not an "advanced" statistical principle, but there are statistical principles involved, because a fitted curve is a statistical representation of a set of data.
The main problem with the logic is the assumption that individuals all belong to the same sample set, and individual variations simply represent random fluctuations within that distribution.
Let me illustrate. Suppose you have two large bags, each with 1000 marbles that are black or white. In the first bag, 70% of the marbles are black. In the second bag, 30% of the marbles are black. If I take a large number of random samples (say 100 marbles per sample) from each of the bags, on average I would end up with 70 black marbles (out of 100) in samples from the first bag and 30 black marbles in the samples from the second bag. The sample mean from the first bag would be 70/100 (70%), and the sample mean from the second bag would be 30/100 (30%).
Now, if I mixed all of the marbles from the two bags together (i.e. 2000 marbles), and took a bunch of samples of 100 I would end up with a mean of 50 black marbles (i.e. 50%).
So, if I just focus on the combined sample and didn't know the proportions in the original sample, I could make the simplifying assumption that both bags started with 50% black marbles, though we know that neither bag had that proportion of black marbles.
How does this relate to hockey players and this topic? Well, if you assume that all players follow the same age / development trajectory, then you could suggest that the mean age for peak performance should be same for everyone, apart from some random statistical fluctuations. If, however, different groups of players follow different age / development trajectories, then lumping them together to predict when they would peak would ignore the different base trajectories. This is not simply a statistical nicety, it's also a logical concept. What statistical analysts should do is assemble more variables and outcomes to determine whether there are key factors that are associated with age / development trajectories, and age at peak performance. This could then be applied to develop a more robust model to predict when groups of players are likely to peak.
Analysts do this all the time to develop models such as the PCS, etc. Obviously, using a single variable (e.g. height) will result in a model that is not very reliable. The more influential variables that are included in the model, the more robust the model.