/usau-py

Python utilities for scraping ultimate tournament data from USAU

Primary language: Jupyter Notebook. License: MIT.

This repository contains simple Python modules for scraping tournament results from the USAU website, along with utility functions for cleaning and manipulating the scraped data. It also includes sample data saved as CSV files, and Jupyter notebooks with visualizations of that data.

The top_n_players.py CLI script provides a quick way to list the top player contributions at tournaments such as Nationals, where player statistics are tracked with some reliability.
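To give a sense of the kind of ranking described above, here is a minimal Python 3 sketch. It is not the code in top_n_players.py: the function name, the CSV column names (`name`, `team`, `goals`, `assists`), and the sample rows are all hypothetical, and it uses only the standard library rather than pandas so it runs without any installation.

```python
import csv
import io
from collections import Counter

# Hypothetical roster export; the column names and rows are made up for illustration.
SAMPLE_CSV = """name,team,goals,assists
Player A,Team X,5,2
Player B,Team X,1,7
Player C,Team Y,3,3
Player D,Team Y,0,1
"""


def top_n_players(csv_text, n=3):
    """Return the n players with the highest goals + assists total."""
    totals = Counter()
    for row in csv.DictReader(io.StringIO(csv_text)):
        # Key on (name, team) so players who share a name stay separate.
        totals[(row["name"], row["team"])] += int(row["goals"]) + int(row["assists"])
    return totals.most_common(n)


for (name, team), points in top_n_players(SAMPLE_CSV, n=2):
    print("{} ({}): {}".format(name, team, points))
```

Summing goals and assists is only one possible definition of a "contribution"; the actual script may weight or filter statistics differently.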

Notebooks (via nbviewer)

Installation

To install with setuptools:

# Check if https_proxy environment variable needs to be set
git clone https://github.com/azjps/usau-py.git
cd usau-py
./setup.py install

or alternatively with pip:

pip install git+https://github.com/azjps/usau-py.git

Dependencies

The code should be compatible with both Python 2.7 and Python 3.

See requirements.txt. pandas 0.13+ is the only core requirement. pandas 0.17.1+, jupyter, and seaborn are recommended for running and viewing the notebooks, and beautifulsoup4 and lxml are recommended for scraping data on the fly from play.usaultimate.org.
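A quick way to see which of these packages are available in the current environment is sketched below. This helper is not part of the repository; the function name and the mapping are assumptions made for illustration. It requires Python 3.4+ and only checks that each module can be found, not which version is installed. jupyter is left out because it is mainly used as a command-line tool.

```python
import importlib.util

# pip package name -> import name (beautifulsoup4 is imported as bs4).
PACKAGES = {
    "pandas": "pandas",
    "seaborn": "seaborn",
    "beautifulsoup4": "bs4",
    "lxml": "lxml",
}


def missing_packages(packages):
    """Return the pip names of packages whose import name cannot be found."""
    return [
        pip_name
        for pip_name, import_name in packages.items()
        if importlib.util.find_spec(import_name) is None
    ]


missing = missing_packages(PACKAGES)
if missing:
    print("Not installed: " + ", ".join(missing))
else:
    print("All listed packages are importable.")
```

Anything reported as missing can be installed with pip under the name shown.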