So, while doing a bit of catch-up on some threads and remembered this one, and some Twitter drama a couple of weeks ago that clarified some information on what we were just talking about. Some of this might be a bit garbled in the retelling, but I don't really care to go back to it, and want to focus on the conclusion.
My understanding is that the drama began when Dom Luczyczyn called out Andy and Rono, a pair of Czech brothers who also make analytics cards, for not crediting the source for their microstat data. I get the impression that Andy and Rono are not taken particularly seriously in that community, and the commentary they offer in tandem with their numbers often comes off as betraying a shallow understanding of the topic. This may be due to their limited English making them appear less articulate, and that's not their fault, but you still get the kind of "so and so is showing he's a true number one defenseman" kind of broad strokes stuff that doesn't strike me as all that informative.
Anyway, they responded by claiming that their source did not in fact want to be named, and alluded to them being an upstart in the field who perhaps weren't ready for prime time. People got into talking about the price for the data, and it turns out this data is something like seven times cheaper than what the Athletic is paying their supplier for Luczyczyn to work with. This made certain people suspicious, as something as high-effort as data collection for the entire NHL seems like something you wouldn't just do secretly and on the cheap. Rachel Doerrie, a former NHL employee, alluded to the existence of cheaper sources that "weren't as good" and sort of left it at that.
Jack Fraser, for his part, sources the microstat data from Corey Sznajder, who is just one guy who started tracking everything on his own volition - I personally think that's really cool and
the world really needs these guys. Sznajder apparently used to volunteer his data to Andy and Rono, but does not anymore for reasons I don't think ever came up. Speculation arose that the mystery source of data was possibly plagiarizing Sznajder, which I don't think anyone directly blamed on Andy and Rono. Things died down from there, with the usual punditry about whether Dom was 1) a defender of ethics in hockey analytics or 2) an abrasive jerk or 3) yes.
So, my takeaways on this, are:
1) the phrase "the analytics say" is near nonsense, as not only are there many sources for information of this kind, but that it goes through multiple steps, including data collection, number crunching, and graphic presentation.
2) some of the stuff you see is better than others.
3) not everything is JFresh, and he's not the Kleenex of analytics. That's one guy, and his output includes the work of peers including Sznajder and Bacon,
who's method is not as black-boxed as I implied upthread, and also sounds delicious.
4) this is exhausting, no wonder so many of you guys hate this.