/robustus

Tool to make and configure python virtualenv, setup necessary packages and cache them if necessary.

Primary LanguagePythonMIT LicenseMIT

robustus

Tool to make and configure python virtualenv, setup necessary packages and cache them if necessary.

When amount of 3rd-party libraries used by project grows it requires much effort and time to setup working environment with all the libraries and tools installed and paths configured. Robustus tries to minimize this effort by providing "one command" install scripts and ability to cache binary versions of packages to reuse them in future.

Prerequesties

  • python 2.7
  • virtualenv (to create new robustus environment)
  • boto (to download/upload cache to amazon s3 servers)
  • rsync (to download/upload cache to samba/ftp/http servers)
  • pytest (for testing)

Usage

First you need to create virtual environment.

robustus env <dir> --cache <binary package cache dir> <other virtualenv options>

Or you can convert existing virtualenv into robustus environment (it will install specific pip version and required packages). You can convert even virtualenv where you installed robustus.

robustus env <existing virtualenv> --cache <binary package cache dir>

Afterwards you can go to env directory and install packages using usual pip syntax.

robustus install numpy==1.7.2
robustus install -r <requirements file>
robustus install <other pip options>

Robustus will store binary packages in the cache directory specified by --cache option during creation of virtualenv ('wheelhouse' by default). or you can specify binary package cache where to install package.

robustus --cache ~/wheelhouse install numpy==1.7.2

You may also install non pip packages, e.g. opencv or cudamat. Robustus has platform specific scripts to setup them.

In order to list binary packages cached in robustus cache you can use freeze command.

robustus freeze

Remote caching

If package is not found in local cache, robustus by default will try to find a compiled package in a remote location. Remote cache website should have a page python-wheels/index.html that provides links to cached packages. Filename of the cached file corresponds to a wheel file created by pip with a mentioning of architecture and platform. Examples: scipy-0.13.3-cp27-none-linux_armv7l.whl pytest-2.3.5-py27-none-any.whl

Custom installable packages (e.g. opencv) may also be stored on a remote cache server with a custom filename that should be managed inside a package install script.

Remote cache is by default set to http://thirdparty-packages.braincorporation.net.

To change remote cache location use --find-links flag:

robustus install tornado==3.2.1 --find-links http://my_custom_remote_cache.net

To ignore remote cache use --no-remote-cache flag:

robustus install tornado==3.2.1 --no-remote-cache

Trouble shooting

Pip 1.5 requires --allow-external, --allow-unverified to be specified for some packages. If you see error like this:

Downloading/unpacking PyAudio
  Could not find any downloads that satisfy the requirement PyAudio
  Some externally hosted files were ignored (use --allow-external PyAudio to allow).

You should pass these options to robustus, so it'll bypass it to pip:

robustus install PyAudio --allow-all-external --allow-unverified PyAudio

Sometimes package version are not available in the remote wheelhouse, or fail to install correctly in ~/.robustus_rc. If you are having issues installing pip / python packages, try the following:

  • Open a new shell, outside of the virtual environment
  • $ rm ~/.robustus_rc/{package}.rob (Look around the directory, name will not match exactly)
  • $ pip install package==version (e.g. pep8==1.4.6)
  • Navigate to this repo and re-run source use_repo.sh

Cache Format

Robustus cache is a directory with binary package archives. Packages under distutils are compressed using wheel tool. These are archives created using bdist with *.whl extension.

Packages which doesn't support distutils doesn't have any specific format. Usually they are stored just as a directory with set of headers/libraries/executables obtained using library build tool installation utilities (e.g. cmake install).

Additionally robustus cache has .robustus directory where robustus specific information is stored. For each cached package there is a file <package_name>__<package_version>.rob (dots replaced with underscores). It is needed for robustus to know that package of specific version is stored in cache, so during install it won't build it again. By default this file is empty, but installation scripts can use it to store information required to install package (i.e. location of specific library within the cache).

As you can see you can freely move cache and merge them by just copying files. Though it is dangerous to remove files from the cache as well as move separate files from one cache to another.

Tagging

Robustus supports tagging as a way of recording the state of the multi-repo project. Git tags are created for all editable packages and pushed to the origin repo.

 robustus tag tag-name

Later, this state can be retrieved by cloning the base repo and

git checkout tag-name
robustus install -r requirements.txt --tag tag-name

Alternatively, you can also checkout a tag after cloning (this will not work if the requirements for the tag version are different) using:

robustus checkout tag-name

Underneath this functionality is implemented by a useful utility command

robustus perrepo anything after here

which will run whatever is specified once at each editable repo (including the base repo). This can be used to update all dependencies (e.g. git pull ...).

Misc

It it sometimes helpful to perform operations across all the editable repos in an environment (such as git updates etc.).

robustus perrepo [--not-base] operation

supports this.

A shortcut

robustus reset

will reset all the editable repos (including the base repo) to master, discarding any changes. By default this will warn you first (unless overridden with -f).

///Deprecated: You may also want to reuse existing binary package cache. You can do that by downloading cache directory before installing packages. Robustus has convenience command to do that. Cache can be stored as a directory or *.tar.gz, *.tar.bz or *.zip archive.

robustus download-cache <cache url>

In the same manner you can upload cache.

robustus upload-cache <cache url>
robustus --cache ~/wheelhouse upload-cache <cache url>

Make sure that binary package cache is suitable for your platform. It is highly recommended to use cached packages only on the machine there they have been compiled. If robustus cache is not empty external packages will be added to robustus cache.

To upload/download from amazon S3 cloud you should also specify bucket name, key and secret key.

robustus upload-cache cache.tar.bz -b <bucket_name> -k <key> -s <secret_key> --public
robustus download-cache cache.tar.bz -b <bucket_name> -k <key> -s <secret_key>
robustus download-cache https://s3.amazonaws.com/<bucket_name>/cache.tar.bz