Larry Hoover
Registered User
- Sep 16, 2012
- 1,009
- 1
So the creator of CHLStats took down the website because he was hired. Is there anyone out there who can data scrape?
Its really nice these people are creating sites as a springboard to something but its also a little self serving to just say too damn bad to the people who use it once their name gets called.
Having seen a few contracts of this nature, I wouldn't assume that it's self-serving - I would assume that it's contractually obligated.
I would imagine this to be the case as well.
Now, is anyone able to data scrape?
I can. Fairly easy to do. Seriously, if all it takes to get a job in the NHL is to get some data from websites and make some easy computations to estimate TOI and so on, I guess I'll be next in line
Its really nice these people are creating sites as a springboard to something but its also a little self serving to just say too damn bad to the people who use it once their name gets called.
I can. Fairly easy to do. Seriously, if all it takes to get a job in the NHL is to get some data from websites and make some easy computations to estimate TOI and so on, I guess I'll be next in line
So Mathletic, JetsFan815, others willing and able, you would have the gratitude of many if you could lead the way to building a replacement.
I have an important suggestion though, make it a community/open source/creative commons thing. That way it has a life beyond the interests of any given person or prospective employer.
I can and will help, I'm just not a programmer. Send me a message and lets get it done.
yeah Larry Hoover sent me a PM and asked me if I could build one. I'm working on it at the moment. Don't have much experience in HTML/CSS/JS and so on but I started learning it last weekend as my exams are now over (planned to learn it anyways).
I may share a dropbox folder or something of the sort before the website is actually done so people could benefit from the data.
I thought about making a more open website dedicated to scouting as you pointed out rather than just sharing stats. I also plan on sharing stats for the USHL, NCAA, European leagues and so on.
Let me know if you have any ideas you'd want me to implement.
What are the statistics you need? I can make this for you.
Mathletic, I'd look up Bootstrap if I were you. It's a neat framework for making websites. I'm eventually going to redo my website's layout with it.
Also use Github instead of Dropbox to share code as it is the more conventional way to do so.
This is great Mathletic, many thanks!
I agree with Kane One that Github better than Dropbox for hosting the project.
I have some ideas that build on the former CHLStats site but they can wait until you are ready.
So where is the data coming from that you will be scraping?
I already have all the data on my computer. I take them from gamesheets like this one:
http://www.whl.ca/schedule/scoresheet/game/40306
though I don't know if it's a good idea to say it publicly
How do you calculate advanced stats with that, such as Fenwick?
I already have all the data on my computer. I take them from gamesheets like this one:
http://www.whl.ca/schedule/scoresheet/game/40306
though I don't know if it's a good idea to say it publicly
I'd be happy to assist in a community driven project for CHL stats. I'm a full stack developer that has experience in HTML, CSS, JavaScript (Front end) and Node.JS, PHP, Python etc.. (Back end). I'm quite experienced with data scraping (posted the NHL data sources thread), so I'll look into possible reliable sources for the data. If you'd like to put the project on Github (I have a private account) or something, that's perfectly fine.
Anyhow, if you'd like to work together or make this open source, just let me know.
The problem is you can't derive advanced stats off of that data. To calculate advanced stats like Corsi you need Play By Play data similar to http://www.nhl.com/scores/htmlreports/20152016/PL020007.HTM published by the NHL