/vagrant-pydata

A ansible-based vagrant setup for doing data analysis with Pandas et al.

Primary LanguageShell

Installation

  • Install virtual box
  • Install vagrant
  • Clone this repo: git clone https://github.com/mattbowen/vagrant-pydata.git
  • Install ansible (preferrably in a virtual environment): pip install ansible
  • Run vagrant up in the directory where you cloned the repository
  • Find something else to keep you busy for the next 30-45 minutes
  • Run vagrant ssh when the box is finished setting itself up

Usage

Out of the box, you'll have what should be all the packages you need for doing data analysis in python, including pandas, ipython notebook, BeautifulSoup, and scikit-learn. See the base requirements file for details.

To start doing data analysis, you can start a new project by typing mkproject <project_name> at the shell. This will create a new environment with all your dependencies in it and create a folder for your project's files. Then, to startup ipython notebook for a given project, run notebook <project_name>. This will startup ipython notebook on http://localhost:8888. It will also create a "notebooks" directory for your project.