/stats_py_vm

Ubuntu 64 bit Vagrant VM for statistics and numerical libraries in Python

Primary LanguageJupyter Notebook

stats_py_vm

About

This project provides a Ubuntu (16.04) Vagrant Virtual Machine (VM) with numerical and scientific libraries for Python. It includes the following libraries. It also contains Jupyter (formerly known as IPython) notebooks.

Visualization libraries

There are Ansible scripts that automatically install the software when the VM is started.

Running

  1. To start the virtual machine(VM) type

    vagrant up
    
  2. Connect to the VM

    vagrant ssh
    
  3. Install Jupyter notebook extensions (run only once)

    jupyter contrib nbextension install --user
    
  4. Install vim extension (optional)

    cd $(jupyter --data-dir)/nbextensions
    git clone https://github.com/lambdalisue/jupyter-vim-binding vim_binding
    
  5. Start the notebook

    /vagrant/vm/jupyter_notebook.sh
    
  6. Open the notebook in the browser at the URL.

    http://localhost:8888/
    

Learning scientific programming with Jupyter notebooks

Robert Johansson has multiple notebooks organized as a series of lectures.

  1. Change to the notebooks directory

    cd /vagrant/notebooks
    
  2. Get the notebooks

    git clone https://github.com/jrjohansson/scientific-python-lectures
    

Machine learning notebooks

Titanic notebooks

  1. Change to the notebooks directory

    cd /vagrant/notebooks/titanic
    
  2. Get the notebooks

    curl -O https://gist.githubusercontent.com/mwaskom/8224591/raw/41ef1127b03c62798890d8a362d3c4dd7e2a5ac8/titanic_seaborn.ipynb
    
  3. Install Slideshow

    sudo pip install RISE
    sudo jupyter-nbextension install rise --py --sys-prefix
    sudo jupyter-nbextension enable rise --py --sys-prefix
    
  4. Titanic prediction

    http://ahmedbesbes.com/how-to-score-08134-in-titanic-kaggle-challenge.html
    https://blog.socialcops.com/engineering/machine-learning-python/
    
  5. Titanic data description

    https://www.kaggle.com/c/titanic/data
    

Scikit-learn notebooks

To get the Scikit learn notebooks from PyCon 2015. The video for this conference is on Youtube

  1. Change to the notebooks directory

    cd /vagrant/notebooks
    
  2. Get the notebooks into the directory pycon2015

    svn export http://github.com/jakevdp/sklearn_pycon2015/trunk/notebooks pycon2015
    
  3. In your Jupyter notebook list at http://localhost:8888/ the notebooks will be in the pycon2015 directory.

Other Scikit resources

https://github.com/amueller/scipy_2015_sklearn_tutorial/tree/master/notebooks https://github.com/ogrisel/parallel_ml_tutorial

Ploting notebooks

Bokeh notebooks

  1. Change to the notebooks directory

    cd /vagrant/notebooks
    
  2. Get the notebooks

    git clone https://github.com/bokeh/bokeh-notebooks
    

Bqplot notebooks

  1. Change to the notebooks directory

    cd /vagrant/notebooks
    
  2. Get the notebooks

    git clone https://github.com/bloomberg/bqplot.git
    

Holoviews notebooks

  1. Change to the notebooks directory

    cd /vagrant/notebooks
    
  2. Get the notebooks

    git clone https://github.com/ioam/scipy-2017-holoviews-tutorial.git
    
  3. Get the data

    curl http://s3.amazonaws.com/datashader-data/nyc_taxi.zip
    

Requirements

The following software is needed to get the software from github and run Vagrant to set up the Python development environment. The Git environment also provides an SSH client for Windows.