/vm

A data journalism virtual machine image for VirtualBox

Primary LanguageShellMIT LicenseMIT

vm

A Linux-powered data journalism virtual machine image for VirtualBox. Meant for beginners and/or people interested in learning the data journalism stack. Direct link to the image: Data Journalism VM .ova (3GB)

Quickstart

With VirtualBox installed, kick open Terminal, paste the below command and press enter:

$ wget https://raw.githubusercontent.com/cirlabs/vm/master/install-vm.sh && bash install-vm.sh

Features

The Xubuntu 13.10 operating system and a bunch of libraries comingly used by data journalist. This includes (but is not limited to):

  • IPython
  • Django
  • SQLite, MySQL, PostgreSQL/PostGIS
  • PANDAS
  • QGIS
  • csvkit

See PACKAGE for full list

Requirements

  • VirtualBox (Choose your operating system)
  • About 10 GB of disk space locally or on an external HDD/thumbdrive
  • Time

Installing the Virtual Machine

Mac OS X / Linux

  1. Download the script: VM-1.0.zip
  2. Extract the .zip file. You should see a file called install-vm.sh inside the folder.
  3. Open Terminal. For Mac OS X, go to Applications/Utilities/Terminal.app. For Linux, press CTRL+ALT+T to open it.
  4. In Terminal, type bash and the path to install-vm.sh. The easiest way to do this is to type bash and then drag-and-drop install-vm.sh into the terminal prompt. You should have something like bash /Users/username/Downloads/VM-1.0/install-vm-sh.
  5. Press [ENTER] and follow the instructions

install-vm.sh does the following:

  1. Installs the VirtualBox extension pack
  2. Downloads the 3 GB virtual machine image from Amazon S3
  3. Imports it into VirtualBox and configures it
  4. (Optional) Prompts user to select path to install virtual machine, e.g., /Volumes/my-external-harddive/vms/
  5. Starts the virtual machine

Windows

Windows isn't UNIX based so this script won't work. You can install Cygwin and try it there but I have not tested it. You're probably better off downloading the VirtualBox image directly from our AWS S3 bucket and importing it that way.

Before you download and install the VM, make sure you have the VirtualBox extension pack installed for your version of VirtualBox (currently 4.3.10).

Do the following:

  1. Download and install VirtualBox Extension Pack 4.3.10 (10.4 MB) | Instructions
  2. Download and install the Data Journalism VM file (3GB) | Instructions
  3. During installation, click the Reinitialize MAC Address checkbox
  4. Start the virtual machine | Instructions

Roll your own

Not enterested in using VirtualBox but want to bootstrap a Linux box for data journalism? Checkout bootstrap.sh.

Cleanup

The 3 GB .ova file should be saved to ~/.vms. You can remove it, as well as any other files, to regain that disk space back. You can also remove the install-vm.sh file.

Limitations / Notes

  • The VM will have a 800x600 screen resolution on startup. There are ways to make your VM fit the size of your host display but that goes on beyond the scope of this project. See VirtualBox Manual 1.8.5. Resizing the machine's window for more details. Do note that installing VirtualBox Guest Additions only works for Windows and Linux users. Sorry Mac folks.
  • This VM is meant as a place to practice and learn about data journalism and software development. It's running a lightweight Linux distribtion and uses just enough RAM to not be a headache. While it is my hope you use this VM to learn and create some awesome projects, my greater hope would be that you abandon the VM altogether and configure your laptop directly with either Linux or whatever OS you prefer. Again, more than anything, this is more of a teaching tool than a fully supported out-the-box rig. That could change in the future, but that is the focus of this project as of now.

Help

Open an issue

Thanks

Thanks to xdissent for his work on ievms. I borrowed a lot of the VirtualBox fetch and check code from there.

License

MIT. See LICENSE for more information