All Projects → andrewgiessel → Basketballcrawler

andrewgiessel / Basketballcrawler

This is a python module to scrape basketball-reference.com and convert various stats into usable data structures for analysis

Projects that are alternatives of or similar to Basketballcrawler

Abstractive Text Summarization
PyTorch implementation/experiments on Abstractive Text Summarization using Sequence-to-sequence RNNs and Beyond paper.
Stars: ✭ 119 (+0.85%)
Mutual labels:  jupyter-notebook
Mnet deepcdr
Code for TMI 2018 "Joint Optic Disc and Cup Segmentation Based on Multi-label Deep Network and Polar Transformation"
Stars: ✭ 119 (+0.85%)
Mutual labels:  jupyter-notebook
How To Use Tensorflow For Time Series Live
Stars: ✭ 119 (+0.85%)
Mutual labels:  jupyter-notebook
Bayes By Backprop
PyTorch implementation of "Weight Uncertainty in Neural Networks"
Stars: ✭ 119 (+0.85%)
Mutual labels:  jupyter-notebook
Trimap generator
Generating automatic trimap through pixel dilation and strongly-connected-component algorithms
Stars: ✭ 119 (+0.85%)
Mutual labels:  jupyter-notebook
Topic Model Tutorial
Tutorial on topic models in Python with scikit-learn
Stars: ✭ 119 (+0.85%)
Mutual labels:  jupyter-notebook
Pydatadc 2018 Tidy
PyData 2018 tutorial for tidying data
Stars: ✭ 119 (+0.85%)
Mutual labels:  jupyter-notebook
Kernel gateway demos
Demos associated with the kernel gateway incubator project
Stars: ✭ 119 (+0.85%)
Mutual labels:  jupyter-notebook
2018 19 Classes
https://cc-mnnit.github.io/2018-19-Classes/ - 🎒 💻 Material for Computer Club Classes
Stars: ✭ 119 (+0.85%)
Mutual labels:  jupyter-notebook
Network Intrusion Detection
Machine Learning with the NSL-KDD dataset for Network Intrusion Detection
Stars: ✭ 119 (+0.85%)
Mutual labels:  jupyter-notebook
Adversarial examples
对抗样本
Stars: ✭ 118 (+0%)
Mutual labels:  jupyter-notebook
Voice activity detector
A statistical model-based Voice Activity Detection
Stars: ✭ 119 (+0.85%)
Mutual labels:  jupyter-notebook
Label smoothing
Corrupted labels and label smoothing
Stars: ✭ 119 (+0.85%)
Mutual labels:  jupyter-notebook
Kaggle challenge
This is the code for "Kaggle Challenge LIVE" By Siraj Raval on Youtube
Stars: ✭ 119 (+0.85%)
Mutual labels:  jupyter-notebook
Emobank
This repository contains EmoBank, a large-scale text corpus manually annotated with emotion according to the psychological Valence-Arousal-Dominance scheme.
Stars: ✭ 118 (+0%)
Mutual labels:  jupyter-notebook
Automunge
Artificial Learning, Intelligent Machines
Stars: ✭ 119 (+0.85%)
Mutual labels:  jupyter-notebook
Defaultcreds Cheat Sheet
One place for all the default credentials to assist the Blue/Red teamers activities on finding devices with default password 🛡️
Stars: ✭ 1,949 (+1551.69%)
Mutual labels:  jupyter-notebook
Essential Sqlalchemy 2e
Stars: ✭ 119 (+0.85%)
Mutual labels:  jupyter-notebook
Ml Dl Scripts
The repository provides usefull python scripts for ML and data analysis
Stars: ✭ 119 (+0.85%)
Mutual labels:  jupyter-notebook
Code Challenges
Built for //BUILD 2017; this repo contains 15 minutes code challenges across data platform and analytics. Inclusive of; SQL Server on Linux, Azure SQL Database, Azure DocumentDB, Azure Search, HDInsight, MySQL as a Service, PostgreSQL as a Service, Bot Framework, Python Tools for Visual Studio, R Tools for Visual Studio. Get challenged, learn new stuff, fork the repo, contribute to the repo, and get a sneak peak the exciting world of Microsoft products and services.
Stars: ✭ 119 (+0.85%)
Mutual labels:  jupyter-notebook

basketballcrawler

Join the chat at https://gitter.im/andrewgiessel/basketballcrawler

This is a python module to scrape basketball-reference.com and convert various stats into usable data structures for analysis.

Here is a link to a sample IPython Notebook file demonstrating the library.

Requirements

Usage

Still developing the API. Right now you can get a list of all player overview urls, generate a list of game log urls for a given player, and convert that list into pandas dataframe.

Notes

players.json was generated on 02/11/2016 by buildPlayerDictionary() and savePlayerDictionary(). I'd recommend building your own, fresh copy. It takes about 10 minutes to scrape from the site. To create the most recent players.json, you can use as follows.

import basketballCrawler as bc
players = bc.buildPlayerDictionary()
bc.savePlayerDictionary(players, '/path/to/file')

You can also download generated players.json. However, note that it's a pretty large (13M) file.

players = bc.loadPlayerDictionary('/path/to/players.json')

In order to search player name, use searchForName function, for example,

searched_player = bc.searchForName(players, 'Murphey') # players is player dictionary

TODO

  • Local Database construction.
  • League-wide statistics.
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].