A Linux-powered data journalism virtual machine image for VirtualBox. Meant for beginners and/or people interested in learning the data journalism stack. Direct link to the image: Data Journalism VM .ova (3GB)
With VirtualBox installed, kick open Terminal, paste the command below and press Enter:
$ wget https://raw.githubusercontent.com/cirlabs/vm/master/install-vm.sh && bash install-vm.sh
The VM includes the Xubuntu 13.10 operating system and a bunch of libraries commonly used by data journalists. This includes (but is not limited to):
- IPython
- Django
- SQLite, MySQL, PostgreSQL/PostGIS
- pandas
- QGIS
- csvkit
See PACKAGE for full list
- VirtualBox (Choose your operating system)
- About 10 GB of disk space locally or on an external HDD/thumbdrive
- Time
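The first two requirements can be checked from the command line before you start. The sketch below is not part of install-vm.sh; the function names free_gb and have_virtualbox and the REQUIRED_GB variable are made up for illustration, and it assumes the VirtualBox installer put the VBoxManage tool on your PATH.

```shell
#!/usr/bin/env bash
# Hypothetical pre-flight check; not part of install-vm.sh.

REQUIRED_GB=10   # "about 10 GB" from the requirements list

# Print the free space, in whole gigabytes, on the filesystem holding $1.
free_gb() {
    # df -Pk: POSIX output format, sizes in 1024-byte blocks; column 4 is "Available".
    df -Pk "$1" | awk 'NR==2 { print int($4 / 1024 / 1024) }'
}

# Succeed if the VBoxManage command-line tool is on the PATH.
have_virtualbox() {
    command -v VBoxManage >/dev/null 2>&1
}

# Directory to check; defaults to your home folder.
target_dir="${1:-$HOME}"

if have_virtualbox; then
    echo "VirtualBox found: $(VBoxManage --version)"
else
    echo "VirtualBox not found; install it first."
fi

available="$(free_gb "$target_dir")"
if [ "$available" -ge "$REQUIRED_GB" ]; then
    echo "Disk space OK: ${available} GB free in $target_dir"
else
    echo "Only ${available} GB free in $target_dir; about ${REQUIRED_GB} GB is needed."
fi
```

To check an external drive instead, pass its mount point as the first argument when running the script.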
- Download the script: VM-1.0.zip
- Extract the .zip file. You should see a file called install-vm.sh inside the folder.
- Open Terminal. On Mac OS X, go to Applications/Utilities/Terminal.app. On Linux, press CTRL+ALT+T to open it.
- In Terminal, type bash and the path to install-vm.sh. The easiest way to do this is to type bash and then drag-and-drop install-vm.sh into the terminal prompt. You should have something like bash /Users/username/Downloads/VM-1.0/install-vm.sh.
- Press [ENTER] and follow the instructions.
install-vm.sh does the following:
- Installs the VirtualBox extension pack
- Downloads the 3 GB virtual machine image from Amazon S3
- Imports it into VirtualBox and configures it
- (Optional) Prompts user to select path to install virtual machine, e.g., /Volumes/my-external-harddive/vms/
- Starts the virtual machine
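For readers curious what those steps look like as commands, here is a rough outline. It is an illustration, not the contents of install-vm.sh, and the real script may do things differently. The download URL, the file names, the VM name and the run and install_vm helpers are all placeholders invented for this sketch, and the optional install-path prompt is left out. By default it only prints the commands; set DRY_RUN=0 to execute them.

```shell
#!/usr/bin/env bash
# Illustrative outline only; the real install-vm.sh may differ.

DRY_RUN="${DRY_RUN:-1}"                     # 1 = print commands, 0 = run them
VM_DIR="${VM_DIR:-$HOME/.vms}"              # download folder named in this README
OVA_URL="${OVA_URL:-https://example.com/data-journalism-vm.ova}"   # placeholder URL
OVA_FILE="$VM_DIR/data-journalism-vm.ova"   # hypothetical file name
EXTPACK="${EXTPACK:-$HOME/Downloads/extension-pack.vbox-extpack}"  # hypothetical path
VM_NAME="${VM_NAME:-data-journalism-vm}"    # hypothetical VM name

# Print the command in dry-run mode, otherwise execute it.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "+ $*"
    else
        "$@"
    fi
}

# Each step runs only if the previous one succeeded.
install_vm() {
    run VBoxManage extpack install --replace "$EXTPACK" &&          # extension pack
    run mkdir -p "$VM_DIR" &&
    run curl -L -o "$OVA_FILE" "$OVA_URL" &&                        # download the image
    run VBoxManage import "$OVA_FILE" --vsys 0 --vmname "$VM_NAME" &&  # import it
    run VBoxManage startvm "$VM_NAME"                               # start the VM
}

install_vm
```

Naming the VM at import time with --vmname is a choice made here so that the startvm step has a known name to refer to.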
Windows isn't UNIX-based, so this script won't work there. You can install Cygwin and try it, but I have not tested that. You're probably better off downloading the VirtualBox image directly from our AWS S3 bucket and importing it that way.
Before you download and install the VM, make sure you have the VirtualBox extension pack installed for your version of VirtualBox (currently 4.3.10).
Do the following:
- Download and install VirtualBox Extension Pack 4.3.10 (10.4 MB) | Instructions
- Download and install the Data Journalism VM file (3GB) | Instructions
- During installation, click the Reinitialize MAC Address checkbox
- Start the virtual machine | Instructions
Not interested in using VirtualBox but want to bootstrap a Linux box for data journalism? Check out bootstrap.sh.
The 3 GB .ova file should be saved to ~/.vms. You can remove it, as well as any other files, to reclaim that disk space. You can also remove the install-vm.sh file.
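A cautious way to do that cleanup from Terminal is sketched below. The helper names list_ova_files and remove_ova_files are made up for this example, and it assumes the download landed in ~/.vms as described above. It only touches .ova files, so anything else in that folder is left alone, and the delete step is commented out so you can review the list first.

```shell
#!/usr/bin/env bash
# Hypothetical cleanup helper; not part of install-vm.sh.

VM_DIR="${VM_DIR:-$HOME/.vms}"

# List the .ova files directly inside the given directory.
list_ova_files() {
    find "$1" -maxdepth 1 -type f -name '*.ova' 2>/dev/null
}

# Delete the .ova files in the given directory and report each one.
remove_ova_files() {
    list_ova_files "$1" | while IFS= read -r f; do
        rm -- "$f" && echo "removed $f"
    done
}

if [ -d "$VM_DIR" ]; then
    du -sh "$VM_DIR"           # how much space the folder takes up
    list_ova_files "$VM_DIR"   # review this list before deleting anything
fi

# Uncomment to actually delete the downloaded image(s):
# remove_ova_files "$VM_DIR"
```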
- The VM will have an 800x600 screen resolution on startup. There are ways to make your VM fit the size of your host display, but that goes beyond the scope of this project. See VirtualBox Manual 1.8.5. Resizing the machine's window for more details. Do note that installing VirtualBox Guest Additions only works for Windows and Linux users. Sorry Mac folks.
- This VM is meant as a place to practice and learn about data journalism and software development. It's running a lightweight Linux distribution and uses just enough RAM to not be a headache. While it is my hope you use this VM to learn and create some awesome projects, my greater hope is that you abandon the VM altogether and configure your laptop directly with either Linux or whatever OS you prefer. Again, this is more of a teaching tool than a fully supported out-of-the-box rig. That could change in the future, but that is the focus of this project as of now.
Thanks to xdissent for his work on ievms. I borrowed a lot of the VirtualBox fetch and check code from there.
MIT. See LICENSE for more information